# cjpais/Handy

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/cjpais-handy).**

15,515 stars · 1,071 forks · Rust · mit

## Links

- GitHub: https://github.com/cjpais/Handy
- Homepage: https://handy.computer
- awesome-repositories: https://awesome-repositories.com/repository/cjpais-handy.md

## Topics

`accessibility` `cross-platform` `speech-to-text` `tauri-v2`

## Description

Handy is a local speech-to-text automation tool designed to convert spoken audio into text and inject it directly into active desktop applications. By running machine learning models entirely on the host hardware, it provides a private, offline-first environment for dictation and command execution. The system functions as a background service that manages microphone input, transcription state, and text output, enabling hands-free typing across various software environments.

The project distinguishes itself through a modular pipeline that integrates local language models for post-transcription refinement. Users can configure custom prompts to automatically format, translate, or correct raw speech output before it is inserted into the target application. This workflow is further enhanced by event-driven automation hooks, which allow the system to trigger custom scripts, keyboard shortcuts, or command sequences in response to transcription events.

Beyond core dictation, the software offers extensive control over the transcription environment, including hardware-aware audio management and real-time translation capabilities. It supports fine-grained adjustments to transcription accuracy, such as vocabulary correction for technical terminology and configurable input latency. The system also maintains a history of past sessions and provides tools for managing clipboard states and system memory usage.

## Tags

### Artificial Intelligence & ML

- [Transcription Model Selectors](https://awesome-repositories.com/f/artificial-intelligence-ml/audio-transcription/multilingual-transcription/transcription-model-selectors.md) — Provides configurable selection of local speech-to-text models to balance transcription accuracy and hardware performance. ([source](https://handy.computer/docs))
- [Local AI Inference](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-deployment-and-serving/local-and-on-device-inference/local-ai-inference.md) — Executes speech-to-text models locally on host hardware to maintain data privacy and offline functionality.
- [Speech-to-Text Engines](https://awesome-repositories.com/f/artificial-intelligence-ml/speech-to-text-engines.md) — Converts spoken audio into written text locally and injects it into active applications.
- [Transcription Refinement Pipelines](https://awesome-repositories.com/f/artificial-intelligence-ml/speech-to-text-integrations/transcription-refinement-pipelines.md) — The application applies automated AI-driven corrections, formatting, or translation to raw speech-to-text output before pasting it into active applications for polished results. ([source](https://handy.computer/docs))
- [AI Writing Assistants](https://awesome-repositories.com/f/artificial-intelligence-ml/ai-writing-assistants.md) — Processes transcribed speech through local language models to format, translate, or refine text.
- [Audio Transcription](https://awesome-repositories.com/f/artificial-intelligence-ml/audio-transcription.md) — Converts spoken audio into text locally using selectable machine learning models. ([source](https://handy.computer/docs))
- [Local Model Integrations](https://awesome-repositories.com/f/artificial-intelligence-ml/local-model-integrations.md) — Connects to local language model services to ensure private, offline-first text processing. ([source](https://handy.computer/docs/post-processing))
- [Multilingual Speech Translation](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/speech-processing/multilingual-speech-translation.md) — Converts foreign speech into target language text in real-time during the transcription process. ([source](https://handy.computer/docs/models))
- [Vocabulary Extension Sets](https://awesome-repositories.com/f/artificial-intelligence-ml/vocabulary-management/vocabulary-extension-sets.md) — Allows users to define custom vocabulary lists to improve transcription accuracy for technical terminology and jargon. ([source](https://handy.computer/docs/advanced))
- [Audio Hardware Profiles](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-optimization-and-inference/hardware-and-acceleration/hardware-abstraction-layers/audio-hardware-interfaces/audio-hardware-profiles.md) — Configures microphone inputs and audio behaviors to maintain consistent transcription quality.
- [Transcription Correction Sensitivity](https://awesome-repositories.com/f/artificial-intelligence-ml/audio-transcription/transcription-exporters/transcription-correction-sensitivity.md) — Adjusts the sensitivity of custom word corrections to balance between strict matching and aggressive replacement. ([source](https://handy.computer/docs/debug))

### Development Tools & Productivity

- [Keyboard Input Automation](https://awesome-repositories.com/f/development-tools-productivity/input-pointer-automation/keyboard-automation/keyboard-input-automation.md) — Automates the insertion of transcribed text into active windows and legacy software. ([source](https://handy.computer/docs/paste-methods))
- [Transcription Input Orchestrators](https://awesome-repositories.com/f/development-tools-productivity/cross-platform-input-multiplexers/transcription-input-orchestrators.md) — Manages microphone input, transcription state, and clipboard history to enable hands-free text entry.
- [Global Hotkey Managers](https://awesome-repositories.com/f/development-tools-productivity/global-hotkey-managers.md) — Registers global hotkeys to trigger text transformation and editing independently of standard dictation. ([source](https://handy.computer/docs/post-processing))
- [Transcription Event Hooks](https://awesome-repositories.com/f/development-tools-productivity/workflow-automation-tools/system-integration-interfaces/automation-integration/custom-scripting-engines/transcription-event-hooks.md) — Triggers custom scripts and keyboard shortcuts in response to transcription events.
- [Clipboard Management](https://awesome-repositories.com/f/development-tools-productivity/clipboard-management.md) — Manages clipboard state by optionally restoring previous content after text insertion. ([source](https://handy.computer/docs/advanced))
- [Command Automation](https://awesome-repositories.com/f/development-tools-productivity/command-automation.md) — Automates command submission by simulating keyboard shortcuts after text insertion. ([source](https://handy.computer/docs/advanced))
- [Task Automation Scripts](https://awesome-repositories.com/f/development-tools-productivity/task-automation-scripts.md) — Pipes transcribed text directly into shell scripts for programmatic input processing. ([source](https://handy.computer/docs/cli))

### Operating Systems & Systems Programming

- [Desktop Automation](https://awesome-repositories.com/f/operating-systems-systems-programming/desktop-environment-frameworks/desktop-environment-components/desktop-automation.md) — Triggers custom scripts and keyboard shortcuts to streamline text entry and command execution across desktop environments.
- [Cross-Application Input Simulators](https://awesome-repositories.com/f/operating-systems-systems-programming/desktop-environment-frameworks/desktop-environment-components/desktop-applications/cross-application-input-simulators.md) — Simulates keyboard shortcuts and clipboard operations to reliably insert text into active applications.
- [Desktop Applications](https://awesome-repositories.com/f/operating-systems-systems-programming/desktop-environment-frameworks/desktop-environment-components/desktop-applications.md) — Provides CLI-based control for managing transcription states and desktop environment integration. ([source](https://handy.computer/docs/cli))
- [Audio Input Selectors](https://awesome-repositories.com/f/operating-systems-systems-programming/audio-input-selectors.md) — Allows users to select specific microphone hardware for consistent audio capture. ([source](https://handy.computer/docs/general))

### DevOps & Infrastructure

- [System Input Handling](https://awesome-repositories.com/f/devops-infrastructure/infrastructure/operational-observability-access/system-input-handling.md) — Injects text and simulates keyboard shortcuts into active applications via low-level OS APIs.
- [AI System Instructions](https://awesome-repositories.com/f/devops-infrastructure/configuration-management/application-settings-management/application-behavior-configurations/ai-system-instructions.md) — Defines custom system instructions to guide how language models process and format transcribed text. ([source](https://handy.computer/docs/post-processing))

### Data & Databases

- [AI Text Refinement Pipelines](https://awesome-repositories.com/f/data-databases/text-processing-utilities/text-processing-tools/ai-text-refinement-pipelines.md) — Refines and transforms raw transcribed text using local language models to improve formatting and grammar.
- [Modular Pipeline Orchestration](https://awesome-repositories.com/f/data-databases/data-processing-pipelines/processing-pipelines/modular-pipeline-orchestration.md) — Orchestrates a modular pipeline that chains transcription with secondary text transformation steps.

### Software Engineering & Architecture

- [Hardware Abstraction Layers](https://awesome-repositories.com/f/software-engineering-architecture/hardware-abstraction-layers.md) — Provides a consistent interface for managing microphone selection and audio hardware profiles.
- [Event-Driven Hooks](https://awesome-repositories.com/f/software-engineering-architecture/event-driven-hooks.md) — Triggers custom scripts and workflows in response to transcription events or hotkey signals.
- [Speech-to-Text Normalizers](https://awesome-repositories.com/f/software-engineering-architecture/string-validation-and-normalization/speech-to-text-normalizers.md) — Formats output by converting spoken numbers into digits and applying punctuation and capitalization. ([source](https://handy.computer/docs/models))

### User Interface & Experience

- [Recording Controls](https://awesome-repositories.com/f/user-interface-experience/component-utilities/keyboard-shortcuts/recording-controls.md) — Assigns custom keyboard triggers to interrupt or cancel active recording sessions. ([source](https://handy.computer/docs/debug))

### Networking & Communication

- [Audio Modality Controls](https://awesome-repositories.com/f/networking-communication/audio-management-systems/audio-stream-property-controls/audio-input-cleaning/audio-modality-controls.md) — Controls audio input modalities by muting system audio and providing cues during recording sessions. ([source](https://handy.computer/docs/general))
