Vocode-core is a framework for building real-time conversational AI voice agents. It acts as an orchestration pipeline that connects speech-to-text, large language model, and text-to-speech services for low-latency voice interactions.
The project exposes a provider-agnostic interface, so speech and language model providers can be swapped, whether they are cloud APIs or local binaries. It is distinguished by a telephony integration layer that lets agents be deployed over phone lines, WebRTC, and virtual meeting platforms.
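To make the provider-agnostic idea concrete, here is a minimal sketch of one conversational turn passing through three swappable stages. It is not Vocode's actual API: the names `Transcriber`, `Agent`, `Synthesizer`, `run_turn`, and the three stand-in classes are hypothetical and chosen only for illustration.

```python
from typing import Protocol


# Hypothetical interfaces (assumptions, not Vocode's real classes).
class Transcriber(Protocol):
    def transcribe(self, audio: bytes) -> str: ...


class Agent(Protocol):
    def respond(self, text: str) -> str: ...


class Synthesizer(Protocol):
    def synthesize(self, text: str) -> bytes: ...


class EchoTranscriber:
    """Stand-in speech-to-text stage: treats the audio bytes as UTF-8 text."""

    def transcribe(self, audio: bytes) -> str:
        return audio.decode("utf-8")


class UppercaseAgent:
    """Stand-in language model stage: replies with the input in upper case."""

    def respond(self, text: str) -> str:
        return text.upper()


class BytesSynthesizer:
    """Stand-in text-to-speech stage: encodes the reply back to bytes."""

    def synthesize(self, text: str) -> bytes:
        return text.encode("utf-8")


def run_turn(
    transcriber: Transcriber,
    agent: Agent,
    synthesizer: Synthesizer,
    audio: bytes,
) -> bytes:
    """Run one turn: audio in, transcript, agent reply, audio out."""
    text = transcriber.transcribe(audio)
    reply = agent.respond(text)
    return synthesizer.synthesize(reply)


reply_audio = run_turn(EchoTranscriber(), UppercaseAgent(), BytesSynthesizer(), b"hello")
```

Because `run_turn` depends only on the three method signatures, any stage can be replaced by a cloud-backed or local implementation without touching the others.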
The framework covers agent orchestration with custom personas and tool assignments, real-time audio streaming with interruption handling, and telephony management across the inbound and outbound call lifecycle. It also includes speech processing tools for multi-language transcription and synthetic voice cloning, as well as event-driven webhooks for monitoring call milestones.
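Interruption handling can be illustrated with a small asyncio sketch in which agent playback is cancelled as soon as the caller starts speaking. This is a generic pattern under stated assumptions, not the framework's implementation: `speak` and `converse_with_interruption` are hypothetical names, and the `asyncio.sleep` calls stand in for real audio output and speech detection.

```python
import asyncio


async def speak(chunks, played):
    """Play audio chunks one at a time; cancellation stops playback mid-utterance."""
    for chunk in chunks:
        await asyncio.sleep(0.01)  # stand-in for sending one chunk to the output device
        played.append(chunk)


async def converse_with_interruption(chunks, interrupt_after):
    """Start playback, then cancel it once the (simulated) caller interrupts."""
    played = []
    task = asyncio.create_task(speak(chunks, played))
    await asyncio.sleep(interrupt_after)  # stand-in for detecting caller speech
    task.cancel()
    try:
        await task
    except asyncio.CancelledError:
        pass  # expected when playback was cut short
    return played


played = asyncio.run(converse_with_interruption(list(range(100)), 0.05))
```

Only the chunks delivered before the interruption end up in `played`, so the agent stops talking instead of finishing its full reply.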