Computer Vision

Computer Vision

newcomers · gaining speed
01
roboflow/supervision
+530 ★/dayaccelerating

A model-agnostic Python toolkit that handles the boring parts of computer vision: annotations, dataset juggling, and tracking.

43.6k Python Computer Vision · explained
02
PaddlePaddle/PaddleOCR
+332 ★/dayaccelerating

PaddleOCR turns scans and PDFs into structured Markdown or JSON using a tiny vision-language model that punches above its weight class.

81.8k Python Computer Vision · explained
03
opencv/opencv
+159 ★/dayaccelerating

OpenCV is the de facto standard for computer vision, and its README is almost aggressively humble about it.

88.9k C++ Computer Vision · explained
04
NVlabs/Eagle
+61 ★/dayaccelerating

Eagle is less a single model than NVIDIA's internal R&D pipeline for multimodal AI, now open-sourced with three generations of VLMs and a grounding specialist.

2.4k Python Language Models · explained
05
ruvnet/RuView
+363 ★/dayaccelerating

A $9 ESP32 board turns radio reflections into room-scale presence detection, vital signs, and pose estimation — no lenses, no wearables, no cloud.

72.9k Rust Domain Apps · explained
06
facebookresearch/sam-3d-body
+32 ★/dayaccelerating

A foundation model that turns one image into a full 3D body mesh, optionally guided by keypoints or masks like the original SAM.

3.2k Python Computer Vision · explained
07
liwenxi/SWIFT-AI
+14 ★/dayaccelerating

A speed-focused deep learning system for analyzing massive scientific images, from crowds to cancer slides to galaxies.

1.4k Jupyter Notebook Computer Vision · explained
08
ZhengPeng7/BiRefNet
+13 ★/dayaccelerating

BiRefNet splits images into layers using bilateral references, then offers a whole zoo of task-specific weights for everything from background removal to camouflaged-object detection.

3.7k Python Computer Vision · explained
09
lightly-ai/lightly-train
+13 ★/dayaccelerating

A single Python framework that pretrains DINOv2/v3 on unlabeled data, then fine-tunes and distills for detection and segmentation tasks.

1.6k Python Computer Vision · explained
10
hustvl/4DGaussians
+12 ★/dayaccelerating

Extends 3D Gaussian Splatting to time-varying scenes without sacrificing the real-time rendering speed that made the original technique appealing.

3.7k Jupyter Notebook Computer Vision · explained
12
RapidAI/RapidOCR
+10 ★/dayaccelerating

An ONNX-exported, multi-engine OCR toolkit that runs offline on basically anything.

6.8k Python Computer Vision · explained
13
blakeblackshear/frigate
+25 ★/dayaccelerating

Frigate is a local NVR that runs AI object detection on IP cameras without phoning home to the cloud.

33.7k TypeScript Computer Vision · explained
15
colmap/colmap
+7.1 ★/dayaccelerating

A battle-tested pipeline that reconstructs 3D scenes from unordered image collections using structure-from-motion and multi-view stereo.

11.9k C++ Computer Vision · explained
16
yossTheDev/removerized
+5.3 ★/dayaccelerating

A browser-native image toolkit that removes backgrounds and upscales photos using local AI models—no server, no subscription, no data leaving your laptop.

1.5k TypeScript Computer Vision · explained
17
MrNeRF/LichtFeld-Studio
+5.9 ★/dayaccelerating

LichtFeld Studio wraps the entire 3D Gaussian Splatting pipeline—training, editing, exporting, automating—into a single C++ desktop app instead of a chain of Python scripts.

3.2k C++ Computer Vision · explained
18
ubicomplab/rPPG-Toolbox
+1.4 ★/dayaccelerating

A unified training and evaluation framework for remote photoplethysmography—turning ordinary camera video into physiological signals without contact.

1.1k Python Domain Apps · explained
19
mittagessen/kraken
+0.7 ★/daysteady

A turn-key OCR engine built for historical manuscripts, non-Latin scripts, and the messy reality of digitization.

1k Python Computer Vision · explained
20
stereolabs/zed-sdk
+0.9 ★/dayaccelerating

Official samples and tutorials for Stereolabs' ZED depth cameras, bundling SLAM, object detection, body tracking, and spatial mapping behind a single C++/Python API.

1.2k C++ Computer Vision · explained
loading more…

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.