LLMOps · Eval — the hottest AI repositories on heatdrop

Newcomers Heavyweights

Hottest Accelerating

LLMOps · Eval

newcomers · gaining speed

codalab/codalab-competitions

+0.1 ★/day→steady

A decade-old open-source platform for running ML competitions, now gently nudging users toward its own successor.

★ 537 Python LLMOps · Eval · explained

polyaxon/traceml

+0.1 ★/day→steady

A Python library for logging metrics, artifacts, and dataframes without needing a live Polyaxon server.

★ 533 Python LLMOps · Eval · explained

irockel/tda

+0.1 ★/day→steady

TDA is a Swing GUI and MCP server that parses, categorizes, and diagnoses thread dumps so you don't have to read raw stack traces at 2 AM.

★ 543 Java Coding Assistants · explained

mlcommons/ck

+0.2 ★/day→steady

A community-built automation framework trying to make ML benchmarking reproducible across the chaos of GPUs, containers, and constantly shifting software stacks.

★ 647 Python LLMOps · Eval · explained

theopenconversationkit/tock

+0.2 ★/day→steady

Tock is an open-source conversational AI platform for teams who want to build bots without surrendering their data to a SaaS black box.

★ 606 Kotlin Chat Assistants · explained

kubeflow/mpi-operator

+0.2 ★/day→steady

A Kubernetes operator that turns allreduce-style distributed training into a declarative YAML file, handling the messy pod orchestration so you don't have to.

★ 528 Go ML Frameworks · explained

wcipriano/pretty-print-confusion-matrix

+0.2 ★/day→steady

A thin wrapper around seaborn and matplotlib that makes confusion matrices actually readable, with string labels and decent colormaps.

★ 538 Python LLMOps · Eval · explained

hyperactive-project/Hyperactive

+0.2 ★/day→steady

Hyperactive abstracts away the mess of hyperparameter tuning by separating what you optimize from how you optimize it.

★ 550 Python ML Frameworks · explained

torrvision/crayon

+0.2 ★/day→steady

A Docker-wrapped REST API that lets any language log to TensorBoard, not just TensorFlow.

★ 781 Python LLMOps · Eval · explained

jind11/TextFooler

+0.2 ★/day→steady

A 2019 adversarial attack that fools text classifiers by replacing words with semantically similar synonyms—no model retraining required.

★ 529 Python LLMOps · Eval · explained

IBM/FfDL

+0.2 ★/day→steady

FfDL was IBM's attempt to run TensorFlow and PyTorch as a service on Kubernetes—now frozen in read-only mode.

★ 690 Go ML Frameworks · explained

infocusp/tf_cnnvis

+0.2 ★/day→steady

Reconstruct what your convolutional network sees, layer by layer, then pipe it straight into TensorBoard.

★ 779 Python Computer Vision · explained

williamFalcon/test-tube

+0.2 ★/day→steady

A Python library that extends argparse to log experiments and parallelize hyperparameter search across GPUs or SLURM clusters without rewriting your training scripts.

★ 735 JavaScript LLMOps · Eval · explained

neptune-ai/neptune-client

+0.2 ★/day→steady

The 2.x experiment tracker has been superseded by a new client built for foundation-model scale.

★ 622 Python LLMOps · Eval · explained

ing-bank/popmon

+0.2 ★/day→steady

A Python library that watches your pandas or Spark data for distribution drift, then emails you when things go sideways.

★ 512 Python LLMOps · Eval · explained

HunterMcGushion/hyperparameter_hunter

+0.2 ★/day→steady

A hyperparameter optimizer that treats your entire project history as its starting point, not a blank slate.

★ 707 Python LLMOps · Eval · explained

Picovoice/speech-to-text-benchmark

+0.2 ★/day→steady

Picovoice built a benchmarking framework that pits cloud APIs, open-source models, and its own engines against the same audio datasets.

★ 693 Python Language Models · explained

triton-inference-server/model_analyzer

+0.2 ★/day→steady

A CLI tool that brute-forces or heuristically searches the configuration space for NVIDIA's Triton Inference Server, then hands you a report on the trade-offs.

★ 512 Python Inference · Serving · explained

kubeflow/kale

+0.3 ★/day→steady

Kale turns a tagged notebook into a Kubeflow Pipeline without rewriting a line of Python.

★ 691 Python LLMOps · Eval · explained

JasperSnoek/spearmint

+0.3 ★/day→steady

A legacy Bayesian optimization package that still works if you can stomach Python 2.7 and Protocol Buffers.

★ 1.4k Python ML Frameworks · explained

loading more…