1 repo
Systems that support interaction through multiple modalities, such as text, speech, and other audio input.
Distinguishing note: Focuses on the integration of speech and audio processing into conversational AI, distinct from text-only chatbot implementations.
Explore 1 GitHub repository matching Artificial Intelligence & ML · Multimodal Conversational Interfaces.
Quivr is a retrieval-augmented generation platform designed to transform raw documents into searchable knowledge bases. It functions as a centralized environment where users can ingest files, index them into vector databases, and interact with language models to receive contextually relevant, data-backed responses. The platform distinguishes itself through an agentic workflow orchestrator that sequences retrieval tasks, tool execution, and model interactions to resolve complex, multi-step queries. This engine is entirely configuration-driven, allowing users to define document ingestion, chunk
Develop chatbots that process text files and answer user queries through both text and audio by integrating speech-to-text and streaming response capabilities.
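The flow described above (ingest text, accept a question as text or audio, stream back an answer) can be illustrated with a minimal sketch. This is not Quivr's API: every name here (`chunk_text`, `retrieve`, `answer_query`, the `transcribe` callback) is hypothetical, word overlap stands in for vector similarity, and the "answer" is simply the best-matching chunk streamed word by word rather than a language model's output.

```python
from collections import Counter
from typing import Callable, Iterator, List, Optional, Union


def chunk_text(text: str, size: int = 40) -> List[str]:
    """Split a document into chunks of at most `size` words."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]


def score(query: str, chunk: str) -> int:
    """Count shared lowercase words (a stand-in for vector similarity)."""
    q = Counter(query.lower().split())
    c = Counter(chunk.lower().split())
    return sum((q & c).values())


def retrieve(query: str, chunks: List[str]) -> str:
    """Return the chunk with the highest overlap score."""
    return max(chunks, key=lambda ch: score(query, ch))


def stream_answer(query: str, chunks: List[str]) -> Iterator[str]:
    """Yield the retrieved chunk word by word, imitating a streamed reply."""
    for word in retrieve(query, chunks).split():
        yield word


def answer_query(
    user_input: Union[str, bytes],
    chunks: List[str],
    transcribe: Optional[Callable[[bytes], str]] = None,
) -> Iterator[str]:
    """Accept text (str) or audio (bytes); audio is passed through `transcribe`."""
    if isinstance(user_input, bytes):
        if transcribe is None:
            raise ValueError("audio input needs a transcribe function")
        # Hypothetical speech-to-text hook; a real system would call an STT model.
        user_input = transcribe(user_input)
    return stream_answer(user_input, chunks)


if __name__ == "__main__":
    chunks = chunk_text("Invoices are due on Friday. Refunds take five days.", size=5)
    for word in answer_query("when are invoices due", chunks):
        print(word, end=" ", flush=True)
    print()
```

Returning an iterator from `answer_query` lets a caller display or synthesize each word as it arrives instead of waiting for the full response, which is the property the streaming description refers to.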