1 repo
Converts spoken audio input into text using speech recognition models.
Distinguishing note: Focuses on conversational input interfaces rather than general audio processing.
Explore 1 awesome GitHub repository matching graphics & multimedia · Voice-to-Text Transcription. Refine with filters or upvote what's useful.
Dify is an open-source platform for building, orchestrating, and deploying generative AI applications and autonomous agents. It provides a visual development environment that allows users to design complex, multi-step logic chains and conversational flows, which can then be published as APIs, web interfaces, or embedded widgets. The platform acts as a centralized infrastructure layer, managing model connections, prompt templates, and knowledge retrieval to support scalable AI-powered services. What distinguishes the platform is its focus on stateful application design and workflow orchestrati
Captures spoken messages and converts them into text using speech recognition models.