eSearch is a desktop tool that combines screen capture, image annotation, screen recording, optical character recognition (OCR), and text search and translation into a single application. It is built around a modular architecture that coordinates these tasks through an event-driven capture pipeline, allowing users to capture screen regions, annotate them with drawing and shape tools, and then extract text using a local-first OCR engine or optional cloud services.
The project distinguishes itself with a command-line interface for triggering capture and recognition tasks, which enables scripted or automated workflows without opening the graphical interface. It also includes a configurable media recording stack that captures screen activity, audio, or camera feeds at an adjustable bitrate and resolution, and a multi-engine search router that sends recognized text to user-configured search or translation engines through a pluggable adapter system.
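The search router described above can be pictured with a small sketch. Everything here is an assumption made for illustration: the `EngineAdapter` interface, the `templateAdapter` helper, the `SearchRouter` class, the `%s` placeholder convention, and the example.com URLs are hypothetical and are not taken from eSearch's source.

```typescript
// Hypothetical adapter shape: each configured engine turns text into a URL.
type EngineKind = "search" | "translate";

interface EngineAdapter {
  readonly name: string;
  readonly kind: EngineKind;
  buildUrl(text: string): string;
}

// Builds an adapter from a URL template in which "%s" marks where the
// query goes (the placeholder convention is an assumption of this sketch).
function templateAdapter(name: string, kind: EngineKind, template: string): EngineAdapter {
  return {
    name,
    kind,
    // split/join replaces every "%s" without regex escaping concerns.
    buildUrl: (text: string) => template.split("%s").join(encodeURIComponent(text)),
  };
}

class SearchRouter {
  private readonly adapters = new Map<string, EngineAdapter>();

  // Registers an adapter, replacing any earlier adapter with the same name.
  register(adapter: EngineAdapter): void {
    this.adapters.set(adapter.name, adapter);
  }

  // Resolves the named engine and returns the URL for the trimmed text.
  route(engine: string, text: string): string {
    const adapter = this.adapters.get(engine);
    if (!adapter) {
      throw new Error(`unknown engine: ${engine}`);
    }
    return adapter.buildUrl(text.trim());
  }

  // Lists the names of registered engines of one kind, in registration order.
  list(kind: EngineKind): string[] {
    return Array.from(this.adapters.values())
      .filter((a) => a.kind === kind)
      .map((a) => a.name);
  }
}

// Usage with placeholder engines (example.com stands in for a real service).
const router = new SearchRouter();
router.register(templateAdapter("example-search", "search", "https://example.com/search?q=%s"));
router.register(templateAdapter("example-translate", "translate", "https://example.com/translate?text=%s"));

const url = router.route("example-search", " hello world ");
// url is "https://example.com/search?q=hello%20world"
```

Keeping each engine behind a one-method adapter means that adding an engine is a configuration change, not a change to the router itself.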
Beyond its core capture and recognition capabilities, eSearch provides a full image annotation editor for drawing, adding shapes, applying filters, and adjusting captured content, as well as a text search and translation engine for quick information retrieval. The project's plugin-based service architecture lets functionality be extended by loading independent service modules for OCR, search, and translation through a unified interface.
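One way to read "independent service modules behind a unified interface" is a registry keyed by service kind. The sketch below is a guess at the general shape only: `ServiceModule`, `ServiceRegistry`, the first-loaded-is-default rule, and the stub modules are all hypothetical names and choices, not eSearch's actual plugin API.

```typescript
type ServiceKind = "ocr" | "search" | "translate";

// Hypothetical unified interface: every module exposes the same run() call.
// OCR modules would receive image bytes; the others would receive text.
interface ServiceModule {
  readonly id: string;
  readonly kind: ServiceKind;
  run(input: Uint8Array | string): Promise<string>;
}

class ServiceRegistry {
  private readonly modules = new Map<string, ServiceModule>();
  private readonly defaults = new Map<ServiceKind, string>();

  // Loads a module; the first module loaded for a kind becomes its default.
  load(svc: ServiceModule): void {
    if (this.modules.has(svc.id)) {
      throw new Error(`duplicate service id: ${svc.id}`);
    }
    this.modules.set(svc.id, svc);
    if (!this.defaults.has(svc.kind)) {
      this.defaults.set(svc.kind, svc.id);
    }
  }

  // Lists the ids of loaded modules of one kind, in load order.
  list(kind: ServiceKind): string[] {
    return Array.from(this.modules.values())
      .filter((m) => m.kind === kind)
      .map((m) => m.id);
  }

  // Runs the named module, or the default for the kind when no id is given.
  async run(kind: ServiceKind, input: Uint8Array | string, id?: string): Promise<string> {
    const chosen = id !== undefined ? id : this.defaults.get(kind);
    const svc = chosen === undefined ? undefined : this.modules.get(chosen);
    if (!svc || svc.kind !== kind) {
      throw new Error(`no ${kind} service available`);
    }
    return svc.run(input);
  }
}

// Stub modules standing in for real engines; they do no real OCR or translation.
const registry = new ServiceRegistry();
registry.load({
  id: "local-ocr-stub",
  kind: "ocr",
  run: async (input) => `recognized ${input.length} bytes`,
});
registry.load({
  id: "upper-translate-stub",
  kind: "translate",
  run: async (input) => String(input).toUpperCase(),
});

registry.run("translate", "hello").then((text) => console.log(text));
```

Because callers only name a kind and an optional module id, a local OCR module and a cloud OCR module could sit side by side under this scheme, with the local one loaded first so that it is the default.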