xinjli/allosaurus
A pretrained universal phone recognizer built with PyTorch that identifies phonetic units in audio in more than 2000 languages.

Velocity (7d): +0.3 ★ / day · Trend: steady
Allosaurus is a deep learning-based speech recognition system that converts audio into sequences of phones. It is trained on multilingual allophone data and can run inference on thousands of languages without language-specific training. The pretrained model is installed via pip and exposes both a Python API and a command-line interface for audio-to-phone transcription.
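A minimal sketch of the Python API, assuming the package has been installed with `pip install allosaurus` and that `read_recognizer` and `recognize` behave as described in the project's README (the recognizer returns a single space-separated string of phones). The helper names `transcribe` and `split_phones` and the file name `sample.wav` are hypothetical and only serve the illustration.

```python
from typing import List, Optional


def transcribe(wav_path: str, lang_id: Optional[str] = None) -> str:
    """Return the recognizer's output for one audio file.

    Assumption: allosaurus exposes read_recognizer() in allosaurus.app and
    the returned model has recognize(path) / recognize(path, lang_id).
    The import is done lazily so this module loads without the package.
    """
    from allosaurus.app import read_recognizer

    model = read_recognizer()  # loads the pretrained model
    if lang_id is None:
        return model.recognize(wav_path)
    # Optionally restrict the phone inventory to one language (e.g. "eng").
    return model.recognize(wav_path, lang_id)


def split_phones(output: str) -> List[str]:
    """Split a space-separated phone string into a list of phones."""
    return output.split()


# Example usage (requires allosaurus and a real audio file):
#   phones = split_phones(transcribe("sample.wav"))
#   print(phones)
```

The command-line interface should be roughly equivalent: `python -m allosaurus.run -i sample.wav`, where `sample.wav` again stands in for your own audio file.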