← all repositories

ScalingIntelligence/KernelBench

A benchmark suite evaluating LLMs' ability to generate efficient CUDA and DSL GPU kernels from PyTorch operator specifications.

1k stars Jupyter Notebook LLMOps · EvalCoding Assistants
KernelBench
Velocity · 7d
+1.8
★ / day
Trend
steady
star history

KernelBench tasks LLMs with transpiling PyTorch operators to GPU kernels and provides an evaluation toolkit to measure correctness and performance. It contains 250 problems across three difficulty levels: single-kernel operators like convolutions and matrix multiplies, fused kernel patterns combining multiple operations, and full model architecture optimizations. The repository supports automated benchmarking with evaluation scripts and is published as an ICML 2025 paper.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.