AutoArk/GPA

A unified audio model performing automatic speech recognition, text-to-speech synthesis, and voice conversion.

537 stars Python Image · Video · Audio
Velocity (7d): +3.1 ★ / day · Trend: steady

GPA (General Purpose Audio) is a compact transformer-based model that handles three speech tasks (ASR, TTS, and voice conversion) with a single set of shared weights. The project includes standalone TTS runtimes and ONNX Runtime support for cross-platform inference, and it offers three precision modes (INT8, FP16, FP32) so that deployments can trade accuracy against memory and speed.
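As a rough illustration of why the choice of precision mode matters for deployment, the sketch below estimates weight storage at each precision. The parameter count is a hypothetical placeholder, not a figure from the GPA project, and the estimate ignores activations, quantization scales, and runtime overhead.

```python
# Bytes needed to store one weight at each precision mode named above.
BYTES_PER_PARAM = {"FP32": 4, "FP16": 2, "INT8": 1}


def weight_size_mib(num_params: int, precision: str) -> float:
    """Approximate size of the model weights alone, in MiB."""
    return num_params * BYTES_PER_PARAM[precision] / (1024 ** 2)


# Hypothetical parameter count, chosen only for illustration;
# the source does not state GPA's actual size.
HYPOTHETICAL_PARAMS = 300_000_000

for precision in ("FP32", "FP16", "INT8"):
    size = weight_size_mib(HYPOTHETICAL_PARAMS, precision)
    print(f"{precision}: ~{size:.0f} MiB")
```

Under these assumptions, FP16 halves the FP32 footprint and INT8 quarters it, which is the usual motivation for shipping lower-precision variants to memory-constrained targets.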