Image · Video · Audio

newcomers · gaining speed
06
Zardinality/WGAN-tensorflow
+0.2 ★/day · steady

A straightforward notebook implementation of Wasserstein GAN that lets you flip the loss signs and still trains, because duality is weird like that.

579 Jupyter Notebook Image · Video · Audio · explained
07
zsdonghao/text-to-image
+0.2 ★/day · steady

A 2016 paper implementation that generates flower images from text descriptions, built when TensorFlow 1.x was fresh and "skip thought vectors" sounded futuristic.

599 Python Image · Video · Audio · explained
08
HRLTY/TP-GAN
+0.2 ★/day · steady

Code for a 2017 ICCV paper that synthesizes frontal faces from extreme side angles by running a global pathway and a local pathway at once.

510 Python Image · Video · Audio · explained
09
stanfordnlp/mac-network
+0.2 ★/day · steady

Stanford's MAC cell breaks visual reasoning into explicit, inspectable computation steps—rare honesty in a field that usually hides its work.

513 Python Computer Vision · explained
10
evancohen/sonus
+0.2 ★/day · steady

Sonus gives Node.js projects offline hotword detection, then streams speech to cloud STT only after you get its attention.

638 JavaScript Image · Video · Audio · explained
11
pathak22/pyflow
+0.2 ★/day · steady

A Python shim around Ce Liu's venerable C++ Coarse2Fine optical flow, minus the OpenCV dependency headache.

661 C++ Computer Vision · explained
13

A TensorFlow implementation of extreme learned image compression that trades exact reconstruction for tiny file sizes by letting a generator dream up the textures.

533 Python Computer Vision · explained
15
sergeytulyakov/mocogan
+0.2 ★/day · steady

MoCoGAN disentangles motion and content in video generation, letting you swap faces while keeping the expression—or vice versa.

602 Python Image · Video · Audio · explained
16
jsn5/dancenet
+0.2 ★/day · steady

A Keras project that generates new dance sequences by compressing video frames into a latent space, then predicting the next pose with an LSTM and Mixture Density Network.

519 Python Image · Video · Audio · explained
18
rui1996/DeRaindrop
+0.2 ★/day · steady

Code for a 2018 CVPR spotlight paper that uses attention maps to stop generative networks from inventing plausible-looking but wrong background details behind raindrops.

548 Python Computer Vision · explained
19
cvondrick/videogan
+0.2 ★/day · steady

A Torch7 implementation that generates short, plausible video clips by separating foreground motion from static backgrounds using adversarial training.

706 Lua Image · Video · Audio · explained
20
alphacep/vosk
+0.2 ★/day · steady

VOSK skips neural network training in favor of storing every audio chunk it has ever seen, then fingerprint-matches new input against the hoard.

500 C Image · Video · Audio · explained