AutoArk/GPA
A unified audio model performing automatic speech recognition, text-to-speech synthesis, and voice conversion.

Velocity · 7d
+3.1
★ / day
Trend
→steady
star history
GPA (General Purpose Audio) is a compact transformer-based model that handles three speech tasks—ASR, TTS, and voice conversion—using shared model weights. The project includes standalone TTS runtimes, ONNX runtime support for cross-platform inference, and supports multiple precision modes (INT8, FP16, FP32) for deployment flexibility.