qiuqiangkong/audioset_tagging_cnn
Pretrained audio neural networks (PANNs) for audio tagging and sound event detection trained on AudioSet.

Velocity · 7d
+0.7
★ / day
Trend
→steady
star history
PANNs provides large-scale pretrained CNN models for audio pattern recognition, including audio tagging and sound event detection. Models are trained on 5000 hours of AudioSet data with 527 sound classes. The repository includes pre-trained models like Cnn14 that can be used directly for inference on audio files, outputting class probabilities and embeddings for the detected sound events.