Computer Vision

big names on the move

+770 ★/day↗accelerating

Because commodity WiFi already bounces off your body, RuView uses cheap ESP32 nodes to detect presence, vital signs, and even body pose without cameras or wearables.

★ 86.7k Rust Domain Apps · explained Feature

PaddlePaddle/PaddleOCR

+68 ★/day↘cooling

It turns images and PDFs into structured JSON and Markdown so your RAG pipeline doesn't have to squint.

★ 86.3k Python Computer Vision · explained Feature

ultralytics/ultralytics

+37 ★/day↗accelerating

Ultralytics wants to stop you from stitching together separate repos for every computer vision task by bundling detection, segmentation, tracking, and pose estimation into one YOLO-backed package.

★ 59.9k Python Computer Vision · explained

roboflow/supervision

+34 ★/day↘cooling

It exists to handle the tedious wiring—annotations, dataset formats, tracking—that sits between a trained model and a useful application.

★ 48.4k Python Computer Vision · explained Feature

microsoft/OmniParser

+29 ★/day↗accelerating

OmniParser turns raw screenshots into structured, labeled UI elements so vision-language models can finally click what they mean to click.

★ 25.2k Jupyter Notebook Agents · explained

upscayl/upscayl

+26 ★/day↘cooling

Upscayl gives desktop users a free way to enlarge and enhance low-resolution images using local Real-ESRGAN models and a Vulkan-compatible GPU.

★ 47.6k TypeScript Computer Vision · explained

ashishpatel26/500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code

+24 ★/day↘cooling

It rounds up hundreds of AI project links so you don't have to hunt them down yourself.

★ 35.7k Learning · explained

google-ai-edge/mediapipe

+22 ★/day↗accelerating

It exists to let developers run customized vision, text, and audio machine learning across mobile, web, and edge hardware without cloud round-trips.

★ 36.3k C++ Computer Vision · explained

opencv/opencv

+20 ★/day→steady

OpenCV is an open-source C++ computer vision library whose own README acts as a portal rather than a product page.

★ 90.2k C++ Computer Vision · explained

blakeblackshear/frigate

+19 ★/day↘cooling

Frigate performs real-time, local object detection on IP camera streams using OpenCV and TensorFlow, designed to integrate tightly with Home Assistant.

★ 34.6k TypeScript Computer Vision · explained

tesseract-ocr/tesseract

+18 ★/day↘cooling

It turns images of text into searchable documents across more than 100 languages, offering both a command-line tool and a C++ library for builders.

★ 75.6k C++ Computer Vision · explained

hiroi-sora/Umi-OCR

+15 ★/day↘cooling

It exists so you can extract text from screenshots, PDFs, and barcodes without a network connection or a cloud bill.

★ 46.2k Python Computer Vision · explained

graphdeco-inria/gaussian-splatting

+13 ★/day↗accelerating

It renders high-quality novel views of real-world scenes at 30 fps by replacing costly neural radiance fields with optimized 3D Gaussians.

★ 22.8k Python Computer Vision · explained

deepseek-ai/DeepSeek-OCR

+11 ★/day↗accelerating

An OCR model that asks how few vision tokens an LLM needs before it can no longer read the page.

★ 23.7k Python Inference · Serving · explained

xinntao/Real-ESRGAN

+11 ★/day↘cooling

Real-ESRGAN turns the ESRGAN research model into a practical tool for upscaling and restoring real-world images and videos using only synthetic training data.

★ 36.3k Python Image · Video · Audio · explained

CMU-Perceptual-Computing-Lab/openpose

+11 ★/day↗accelerating

CMU's real-time multi-person pose estimator detects body, face, hands, and feet simultaneously—body runtime stays flat even as the crowd grows.

★ 34.3k C++ Computer Vision · explained

facefusion/facefusion

+10 ★/day→steady

It turns face swaps and lip-syncs into queued, retryable batch jobs instead of one-off scripts.

★ 29.4k Python Image · Video · Audio · explained

ocrmypdf/OCRmyPDF

+8.9 ★/day↘cooling

OCRmyPDF exists because most free OCR tools botch text placement, bloat file sizes, or mangle image resolution when trying to make scanned documents searchable.

★ 34.3k Python Computer Vision · explained

deepinsight/insightface

+7.6 ★/day→steady

It bundles detection, recognition, alignment, and reconstruction into a single research-grade toolbox.

★ 29.3k Python Computer Vision · explained

ultralytics/yolov5

+6.3 ★/day↗accelerating

YOLOv5 made real-time object detection as easy as `torch.hub.load`, then exported to everything from iOS to edge chips.

★ 57.7k Python Computer Vision · explained

loading more…