voxos-ai/bolna
Open-source framework for building voice-first conversational AI agents that can make phone calls, transcribe speech, process conversations with LLMs, and synthesize responses.

Bolna provides an end-to-end platform for creating voice-driven conversational applications. The framework orchestrates multiple AI components: speech-to-text transcription via providers like Deepgram and Whisper, LLM-based conversation handling using OpenAI, Llama, Mistral and others, and text-to-speech synthesis through ElevenLabs, AWS Polly, or XTTS. It integrates with telephony providers like Twilio to initiate and manage phone calls, allowing agents to conduct real-time conversations and perform tasks like sending emails or scheduling after calls complete.