ESearch | Awesome Repository

eSearch is a desktop tool that combines screen capture, image annotation, screen recording, optical character recognition (OCR), and text search and translation into a single application. It is built around a modular architecture that coordinates these tasks through an event-driven capture pipeline, allowing users to capture screen regions, annotate them with drawing and shape tools, and then extract text using a local-first OCR engine or optional cloud services.

The project distinguishes itself by integrating a command-line interface for triggering capture and recognition tasks, enabling scripted or automated workflows without a graphical interface. It also includes a configurable media recording stack for capturing screen activity, audio, or camera feeds with adjustable bitrate and resolution, and a multi-engine search router that routes recognized text to user-configured search or translation engines through a pluggable adapter system.

Beyond its core capture and recognition capabilities, eSearch provides a full image annotation editor for drawing, shaping, filtering, and adjusting captured content, as well as a text search and translation engine for quick information retrieval. The project's plugin-based service architecture allows for extending functionality by loading independent service modules for OCR, search, and translation through a unified interface.

Features

Desktop Applications - A desktop application that captures screen regions and extracts text using local or online OCR services.
Local-First Engines - Performs text recognition directly on the device using bundled models, with optional cloud fallback.
Optical Character Recognition - Extracts text from captured images using local or online OCR services.
Desktop OCR - Searches or translates recognized text from screen captures using custom or built-in engines.

Features

Desktop Applications - A desktop application that captures screen regions and extracts text using local or online OCR services.
Local-First Engines - Performs text recognition directly on the device using bundled models, with optional cloud fallback.
Optical Character Recognition - Extracts text from captured images using local or online OCR services.
Desktop OCR - Searches or translates recognized text from screen captures using custom or built-in engines.