VoiceInk is a system-wide speech-to-text dictation tool that converts spoken audio into text using local or cloud AI models. It functions as a local AI transcription engine and a context-aware voice assistant, allowing users to insert transcribed text directly into any active application on the operating system.
The project distinguishes itself through the use of custom vocabulary management, which trains transcription engines to recognize industry-specific technical terms, professional terminology, and personal names. It further enhances output by using large language models to refine raw transcriptions into polished text, leveraging context injected from the system clipboard and active screen content.
The software includes a hybrid-mode speech recognition system that can operate entirely offline for privacy or utilize remote servers for expanded language support. It features application-specific automation that switches transcription models and dictation profiles based on the active window, alongside configurable keyboard shortcuts for recording control.
The application is written in Swift.