
xinjli/allosaurus

A pretrained universal phone recognizer, built with PyTorch, that identifies phones in audio from more than 2000 languages.

Star velocity (7 days): +0.3 ★ / day. Trend: steady.

Allosaurus is a deep learning-based speech recognition system that converts audio input into sequences of phones. It is trained on multilingual allophone data and can run inference on thousands of languages without language-specific training. The pretrained model is installed via pip and offers both a Python API and a command-line interface for audio-to-phone transcription.
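
A minimal sketch of the Python API, assuming the `read_recognizer` entry point and the `recognize(path, lang_id)` method as documented in the project's README, and assuming the result is a space-separated string of phones. The `transcribe` helper and its `recognizer` parameter are hypothetical names introduced here for illustration; they let the logic run with a stand-in object when the model is not installed.

```python
from typing import List, Optional


def transcribe(path: str, lang_id: str = "ipa", recognizer: Optional[object] = None) -> List[str]:
    """Return the phones recognized in an audio file as a list of strings.

    `transcribe` is a hypothetical helper, not part of allosaurus itself.
    """
    if recognizer is None:
        # Assumption: allosaurus is installed (pip install allosaurus) and
        # exposes read_recognizer() as described in its README.
        from allosaurus.app import read_recognizer

        recognizer = read_recognizer()

    # Assumption: recognize() returns phones separated by whitespace.
    phones = recognizer.recognize(path, lang_id)
    return phones.split()


if __name__ == "__main__":
    # "sample.wav" is a placeholder path; replace it with a real audio file.
    print(transcribe("sample.wav"))
```

If the README's command-line form applies to your installed version, the equivalent invocation would be `python -m allosaurus.run -i sample.wav`.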
