# pot-app/pot-desktop

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/pot-app-pot-desktop).**

17,110 stars · 823 forks · JavaScript · gpl-3.0

## Links

- GitHub: https://github.com/pot-app/pot-desktop
- Homepage: https://pot-app.com
- awesome-repositories: https://awesome-repositories.com/repository/pot-app-pot-desktop.md

## Topics

`linux` `macos` `ocr` `pot` `pot-app` `recognize` `tauri` `translate` `translation` `tts` `windows`

## Description

This application is a cross-platform desktop utility designed for automated translation, optical character recognition, and speech synthesis. It functions as a modular client that integrates various local and remote language services, allowing users to process text through hotkeys, clipboard monitoring, or direct input.

The software distinguishes itself through a plugin-based architecture and a built-in automation framework. By exposing a local network interface, it enables external applications and scripts to programmatically trigger its translation and recognition workflows. Users can further customize their experience by configuring proxy-based traffic routing to bypass regional restrictions and managing window positioning to ensure context-aware display across the desktop.

The application supports a wide range of language processing tasks, including automated language detection, text formatting, and the synchronization of vocabulary data with external study tools. It provides flexible input methods, such as screen capture and text selection integration, while offering silent background processing options to streamline multilingual workflows.

## Tags

### Artificial Intelligence & ML

- [Automated Translation Workflows](https://awesome-repositories.com/f/artificial-intelligence-ml/artificial-intelligence-tooling/language-model-integrations/translation-services/automated-translation-workflows.md) — Translates text across multiple services using hotkeys, clipboard monitoring, or direct input to streamline multilingual communication workflows.
- [Optical Character Recognition](https://awesome-repositories.com/f/artificial-intelligence-ml/optical-character-recognition.md) — Extracts and converts text from screen captures or image files into editable digital content using local or cloud-based engines.
- [Translation Services](https://awesome-repositories.com/f/artificial-intelligence-ml/artificial-intelligence-tooling/language-model-integrations/translation-services.md) — Integrates third-party translation providers via API keys to perform automated text processing. ([source](https://pot-app.com/docs/api/translate/openai.html))
- [Screen Text Extractors](https://awesome-repositories.com/f/artificial-intelligence-ml/optical-character-recognition/screen-text-extractors.md) — Identifies and converts visual text from screen captures into editable digital content. ([source](https://pot-app.com/docs/))
- [Language Detection Tools](https://awesome-repositories.com/f/artificial-intelligence-ml/language-detection-tools.md) — Identifies the language of input text using local or remote processing services. ([source](https://pot-app.com/docs/config/recognize.html))
- [Speech Synthesis](https://awesome-repositories.com/f/artificial-intelligence-ml/speech-synthesis.md) — Converts written text into natural sounding speech using integrated synthesis services. ([source](https://pot-app.com/docs/))

### Development Tools & Productivity

- [Translation Utilities](https://awesome-repositories.com/f/development-tools-productivity/cross-platform-desktop-utilities/translation-utilities.md) — Acts as a cross-platform desktop utility for automated translation, OCR, and speech synthesis.
- [Translation Tools](https://awesome-repositories.com/f/development-tools-productivity/translation-tools.md) — Extracts text from screen captures and translates content using configurable local or cloud-based engines.
- [System Clipboard Access](https://awesome-repositories.com/f/development-tools-productivity/clipboard-management/system-clipboard-access.md) — Observes the operating system clipboard to automatically detect and process text selections for immediate translation or recognition.
- [Dictionary and Translation Tools](https://awesome-repositories.com/f/development-tools-productivity/dictionary-and-translation-tools.md) — Features a plugin-based architecture for integrating dictionary services and vocabulary tools.
- [Global Hotkey Managers](https://awesome-repositories.com/f/development-tools-productivity/global-hotkey-managers.md) — Hooks into system-level keyboard events to trigger application functions and translation workflows from any active desktop window.
- [Third-Party Service Integrations](https://awesome-repositories.com/f/development-tools-productivity/third-party-service-integrations.md) — Integrates third-party optical character recognition providers by supplying authentication credentials to enable text extraction from images and screen captures. ([source](https://pot-app.com/docs/invoke.html))
- [Workflow Automation APIs](https://awesome-repositories.com/f/development-tools-productivity/workflow-automation-apis.md) — Exposes a network interface to allow external software and custom plugins to trigger translation and recognition tasks programmatically.
- [Vocabulary Acquisition Tools](https://awesome-repositories.com/f/development-tools-productivity/help-support-systems/vocabulary-acquisition-tools.md) — Exports translated terms and phrases to external flashcard or dictionary applications to facilitate long-term language learning and retention.
- [Screen Capture Tools](https://awesome-repositories.com/f/development-tools-productivity/screen-capture-tools.md) — Configures system-level screen capture utilities to ensure compatibility when native capture methods are incompatible with the current display server. ([source](https://pot-app.com/docs/wayland.html))

### Data & Databases

- [Screen Capture Extraction](https://awesome-repositories.com/f/data-databases/text-processing-utilities/text-extraction/screen-capture-extraction.md) — Captures screen regions and copies recognized text to the clipboard to streamline translation. ([source](https://pot-app.com/docs/config/recognize.html))
- [Selected Text Retrieval](https://awesome-repositories.com/f/data-databases/text-processing-utilities/text-extraction/selected-text-retrieval.md) — Provides one-click translation of highlighted text by capturing selections from the system clipboard or active windows. ([source](https://pot-app.com/docs/invoke.html))

### Networking & Communication

- [Local HTTP Servers](https://awesome-repositories.com/f/networking-communication/local-http-servers.md) — Exposes a network port to allow external software and scripts to trigger translation and recognition services through programmatic commands.
- [Network Proxy Configurations](https://awesome-repositories.com/f/networking-communication/network-infrastructure-routing/network-infrastructure-configuration/network-configuration/network-proxy-configurations.md) — Routes network traffic through configurable proxy servers to bypass regional restrictions. ([source](https://pot-app.com/docs/config/general.html))
- [Traffic Routing Proxies](https://awesome-repositories.com/f/networking-communication/network-infrastructure-routing/network-infrastructure-configuration/network-infrastructure/traffic-routing-proxies.md) — Directs network requests through configurable servers to bypass regional restrictions and ensure reliable connectivity for external translation services.

### Operating Systems & Systems Programming

- [Desktop Automation](https://awesome-repositories.com/f/operating-systems-systems-programming/desktop-environment-frameworks/desktop-environment-components/desktop-automation.md) — Exposes a local network API allowing external applications to trigger translation and recognition workflows.

### Software Engineering & Architecture

- [Modular Plugin Architectures](https://awesome-repositories.com/f/software-engineering-architecture/modular-plugin-architectures.md) — Extends core functionality by loading external service modules that integrate directly into the application's translation and recognition workflows.
- [Modular Plugin Systems](https://awesome-repositories.com/f/software-engineering-architecture/modular-plugin-systems.md) — Features a modular plugin architecture that allows users to create and install custom service modules to extend core translation and recognition capabilities. ([source](https://pot-app.com/docs/plugin.html))
- [Data Processing Pipelines](https://awesome-repositories.com/f/software-engineering-architecture/data-processing-pipelines.md) — Coordinates multiple concurrent translation and recognition providers to process input data and format results for user display or external export.
- [External Application Integrations](https://awesome-repositories.com/f/software-engineering-architecture/integration-extensibility/workflow-automation-integrations/external-application-integrations.md) — Sends translated text and vocabulary data to external flashcard and dictionary applications to facilitate long-term language study and vocabulary retention. ([source](https://pot-app.com/docs/api/collection/anki.html))

### Web Development

- [Local API Servers](https://awesome-repositories.com/f/web-development/local-api-servers.md) — Exposes a local network port to allow external software and plugins to trigger translation and recognition services programmatically. ([source](https://pot-app.com/docs/invoke.html))
- [Third-Party API Integrations](https://awesome-repositories.com/f/web-development/third-party-api-integrations.md) — Connects third-party translation, recognition, and voice synthesis providers by configuring authentication credentials to extend core application capabilities.
- [Background Translation Services](https://awesome-repositories.com/f/web-development/translation-tools/background-translation-services.md) — Performs translations in the background and copies results to the clipboard without opening a window. ([source](https://pot-app.com/docs/config/translate.html))

### User Interface & Experience

- [Overlay](https://awesome-repositories.com/f/user-interface-experience/window-managers/overlay.md) — Positions interface elements relative to the cursor or active window coordinates to provide context-aware information display across the desktop.
- [Window Management](https://awesome-repositories.com/f/user-interface-experience/window-management.md) — Provides window positioning policies to ensure translation overlays appear at the cursor location when native coordinate tracking is unavailable. ([source](https://pot-app.com/docs/wayland.html))

### Content Management & Publishing

- [Translation History Interfaces](https://awesome-repositories.com/f/content-management-publishing/content-management-systems/translation-management/translation-history-interfaces.md) — Maintains an open window to accumulate multiple text selections for easier processing. ([source](https://pot-app.com/docs/config/translate.html))
