← all repositories

RahulSChand/gpu_poor

Web-based calculator that estimates GPU memory requirements, token/s throughput, and fine-tuning time for any LLM.

1.4k stars JavaScript Inference · ServingLLMOps · Eval
gpu_poor
Velocity · 7d
+1.4
★ / day
Trend
steady
star history

A JavaScript tool that helps users determine if their GPU can run a specific LLM and what performance to expect. It calculates vRAM usage broken down by model size, KV cache, activation memory, and quantization overhead for GGML, bitsandbytes, and QLoRA methods. It also estimates token generation speed and fine-tuning iteration times, supporting inference frameworks like vLLM, llama.cpp, and HuggingFace.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.