# thejoefin/text-grab

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/thejoefin-text-grab).**

4,610 stars · 299 forks · C# · MIT

## Links

- GitHub: https://github.com/TheJoeFin/Text-Grab
- Homepage: https://www.microsoft.com/en-us/p/text-grab/9mznkqj7sl0b?cid=TextGrabGitHub
- awesome-repositories: https://awesome-repositories.com/repository/thejoefin-text-grab.md

## Topics

`dotnet` `local-ai` `msix` `ocr` `window-10` `windows` `windows-11` `wpf`

## Description

Text-Grab is a desktop utility that captures text from screen regions, images, PDFs, and native user interface elements using on-device optical character recognition (OCR) and Windows UI Automation. It processes text entirely locally without sending data to external services. For native UI controls it reads the accessibility tree directly, so the result is the exact string the control exposes rather than an OCR approximation. The application also includes a persistent snippet dictionary for instant retrieval of frequently used text via a configurable system-wide hotkey.

The tool supports reusable extraction workflows: capture regions are saved alongside pattern-based transformation rules that apply regex cleaning and structuring to OCR results. It can batch-process entire folders of images or PDFs through a single-threaded pipeline, applying the same saved configuration to each file. An integrated editor cleans and restructures captured text using line removal, pattern extraction, and a spreadsheet mode for tabular data.
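The idea of pairing regex rules with a sequential folder pass can be illustrated with a minimal sketch. This is not Text-Grab's code (the project is written in C#), and every name here (`PatternRule`, `apply_rules`, `process_folder`, the `recognize` callback) is hypothetical. The OCR step is left as a caller-supplied function, since the sketch only shows how saved rules might be applied to each file in turn.

```python
import re
from dataclasses import dataclass
from pathlib import Path
from typing import Callable, Dict, Sequence


@dataclass(frozen=True)
class PatternRule:
    """Hypothetical saved rule: a regex and the text that replaces each match."""
    pattern: str
    replacement: str


def apply_rules(text: str, rules: Sequence[PatternRule]) -> str:
    """Apply each rule in order; later rules see the output of earlier ones."""
    for rule in rules:
        text = re.sub(rule.pattern, rule.replacement, text, flags=re.MULTILINE)
    return text


def process_folder(
    folder: Path,
    recognize: Callable[[Path], str],  # assumed stand-in for the OCR engine
    rules: Sequence[PatternRule],
    suffixes: Sequence[str] = (".png", ".jpg", ".pdf"),
) -> Dict[str, str]:
    """Process matching files one at a time, in name order, on a single thread."""
    results: Dict[str, str] = {}
    for path in sorted(folder.iterdir()):
        if path.is_file() and path.suffix.lower() in suffixes:
            results[path.name] = apply_rules(recognize(path), rules)
    return results
```

A caller would build a rule list once, for example one rule that strips trailing whitespace and another that collapses runs of blank lines, then pass the same list to `process_folder` for every folder. Keeping the rules as plain data is what makes the configuration reusable across files.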

Text-Grab runs as a Windows-native application written in C#, with no additional services or internet connection required for its core text capture and extraction features.

## Tags

### Artificial Intelligence & ML

- [OCR](https://awesome-repositories.com/f/artificial-intelligence-ml/on-device-models/vision-language-models/ocr.md) — Captures text from screen regions, images, and PDFs using local optical character recognition without internet.
- [Local-First Engines](https://awesome-repositories.com/f/artificial-intelligence-ml/ocr-engines/local-first-engines.md) — Performs on-device OCR with bundled local models, never sending data to external services.
- [Screen Text Extractors](https://awesome-repositories.com/f/artificial-intelligence-ml/optical-character-recognition/screen-text-extractors.md) — Captures text from any screen area using on-device OCR and copies it to the clipboard.
- [UI Element Text Extractors](https://awesome-repositories.com/f/artificial-intelligence-ml/optical-character-recognition/screen-text-extractors/ui-element-text-extractors.md) — Extracts text directly from native UI controls via the accessibility tree, bypassing OCR for perfect accuracy.
- [Batch Folder Processors](https://awesome-repositories.com/f/artificial-intelligence-ml/structured-data-extraction/asynchronous-extraction-engines/document-extraction-engines/batch-folder-processors.md) — Applies saved capture regions and pattern rules to extract text from entire folders of images or PDFs.
- [Batch File Processors](https://awesome-repositories.com/f/artificial-intelligence-ml/agent-skill-management/file-management/agentic-run-file-exposers/batch-file-processors.md) — Walks folder trees and dispatches each file through OCR in a single-threaded pipeline.
- [Interactive Cleaners](https://awesome-repositories.com/f/artificial-intelligence-ml/on-device-models/vision-language-models/ocr/structured-ocr-outputs/interactive-cleaners.md) — Provides an integrated editor to clean OCR results using line removal, pattern extraction, and spreadsheet mode.
- [Interactive OCR Editors](https://awesome-repositories.com/f/artificial-intelligence-ml/on-device-models/vision-language-models/ocr/structured-ocr-outputs/interactive-ocr-editors.md) — Provides an integrated editor to clean and structure OCR results with line and table tools. ([source](https://github.com/TheJoeFin/Text-Grab))
- [Post-Processing Editors](https://awesome-repositories.com/f/artificial-intelligence-ml/on-device-models/vision-language-models/ocr/structured-ocr-outputs/post-processing-editors.md) — Provides an integrated editor to clean OCR output using line tools, pattern extraction, and spreadsheet mode.

### Part of an Awesome List

- [Text Extraction and OCR](https://awesome-repositories.com/f/awesome-lists/more/text-extraction-and-ocr.md) — Extracts text from images, videos, and PDFs using on-device OCR without external data transfer. ([source](https://github.com/TheJoeFin/Text-Grab/blob/main/.gitattributes))
- [Structural Text Transformations](https://awesome-repositories.com/f/awesome-lists/devtools/structural-text-transformations.md) — Applies formatting, deduplication, pattern extraction, and table editing to transform OCR results. ([source](https://github.com/TheJoeFin/Text-Grab/blob/main/README.md))

### Data & Databases

- [Text Extraction](https://awesome-repositories.com/f/data-databases/text-processing-utilities/text-extraction.md) — Extracts text from native UI controls without OCR for perfect accuracy. ([source](https://github.com/TheJoeFin/Text-Grab/blob/main/README.md))
- [Screen Capture Extraction](https://awesome-repositories.com/f/data-databases/text-processing-utilities/text-extraction/screen-capture-extraction.md) — Captures text from screen regions or image files using on-device OCR without internet. ([source](https://github.com/TheJoeFin/Text-Grab))
- [Batch File Text Extractors](https://awesome-repositories.com/f/data-databases/text-processing-utilities/text-extraction/batch-file-text-extractors.md) — Extracts text from all images and PDFs in a folder using a sequential batch pipeline. ([source](https://github.com/TheJoeFin/Text-Grab/blob/main/README.md))

### Development Tools & Productivity

- [OCR Batch Processors](https://awesome-repositories.com/f/development-tools-productivity/batch-image-processors/ocr-batch-processors.md) — Applies saved capture regions and pattern rules to automatically extract text from entire folders of images or PDFs.
- [Text Snippet Managers](https://awesome-repositories.com/f/development-tools-productivity/text-snippet-managers.md) — Ships a persistent snippet dictionary for instant retrieval of frequently used text via a global hotkey.

### User Interface & Experience

- [Native UI Tree Traversers](https://awesome-repositories.com/f/user-interface-experience/component-hierarchies/hierarchy-traversers/native-ui-tree-traversers.md) — Reads text from native UI elements by traversing the Windows accessibility tree.
