# lukas-blecher/LaTeX-OCR

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/lukas-blecher-latex-ocr).**

16,190 stars · 1,280 forks · Python · mit

## Links

- GitHub: https://github.com/lukas-blecher/LaTeX-OCR
- Homepage: https://lukas-blecher.github.io/LaTeX-OCR/
- awesome-repositories: https://awesome-repositories.com/repository/lukas-blecher-latex-ocr.md

## Topics

`dataset` `deep-learning` `im2latex` `im2markup` `im2text` `image-processing` `image2text` `latex` `latex-ocr` `machine-learning` `math-ocr` `ocr` `python` `pytorch` `transformer` `vision-transformer` `vit`

## Description

LaTeX-OCR is a specialized optical character recognition system designed to identify and transcribe complex mathematical symbols and their spatial relationships from images. It functions as a machine learning engine that converts visual representations of equations into structured LaTeX code for use in technical documentation and academic typesetting.

The project utilizes a hierarchical vision-based encoding and autoregressive sequence decoding architecture to process input images and generate mathematical notation token by token. Beyond its core recognition capabilities, the system provides an interactive interface for capturing formulas directly from screenshots and exposes a network service that allows external applications to integrate automated transcription into their own workflows.

The software includes a framework for training and fine-tuning models, enabling users to prepare specialized datasets and adjust parameters to improve recognition accuracy for unique symbols or specific handwriting styles. The project is distributed as a Python-based library and includes tools for both command-line interaction and programmatic integration.

## Tags

### Scientific & Mathematical Computing

- [Image-to-LaTeX Converters](https://awesome-repositories.com/f/scientific-mathematical-computing/numerical-mathematical-foundations/mathematical-typesetting-engines/mathematical-typesetting/latex-math-rendering/image-to-latex-converters.md) — The project provides image-to-LaTeX conversion by applying machine learning models to identify mathematical symbols and their spatial relationships within visual representations of formulas. ([source](https://cdn.jsdelivr.net/gh/lukas-blecher/LaTeX-OCR@main/README.md))
- [Formula Recognition Engines](https://awesome-repositories.com/f/scientific-mathematical-computing/numerical-mathematical-foundations/mathematical-typesetting-engines/mathematical-typesetting/latex-math-rendering/formula-recognition-engines.md) — A machine learning tool that converts images of mathematical equations into structured LaTeX code for document typesetting.
- [Mathematical Digitization Engines](https://awesome-repositories.com/f/scientific-mathematical-computing/numerical-mathematical-foundations/mathematical-typesetting-engines/mathematical-typesetting/latex-math-rendering/mathematical-digitization-engines.md) — Converting images or screenshots of complex equations into structured LaTeX code for use in academic papers and technical documentation.
- [Automated Transcription Services](https://awesome-repositories.com/f/scientific-mathematical-computing/numerical-mathematical-foundations/mathematical-typesetting-engines/mathematical-typesetting/automated-transcription-services.md) — Integrating machine learning models into external applications to automatically convert visual mathematical content into machine-readable typesetting formats.
- [Scientific Document Processing](https://awesome-repositories.com/f/scientific-mathematical-computing/research-analysis-workflows/scientific-document-processing.md) — Streamlining the creation of technical documents by automating the translation of visual mathematical notation into standard code formats.

### Artificial Intelligence & ML

- [Transcription APIs](https://awesome-repositories.com/f/artificial-intelligence-ml/audio-transcription/transcription-apis.md) — A network service that provides automated formula recognition capabilities for integration into external applications and research workflows.
- [Transcription API Services](https://awesome-repositories.com/f/artificial-intelligence-ml/generative-ai-resources/workflow-execution-backends/api-integration-services/transcription-api-services.md) — The project exposes recognition services through a network interface, allowing external applications to integrate automated mathematical transcription capabilities into their own workflows. ([source](https://cdn.jsdelivr.net/gh/lukas-blecher/LaTeX-OCR@main/README.md))
- [Custom Model Training](https://awesome-repositories.com/f/artificial-intelligence-ml/custom-model-training.md) — Preparing specialized datasets and adjusting model parameters to improve recognition accuracy for unique mathematical symbols or specific handwriting styles.
- [Custom Vision Training](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-training-and-tuning/computer-vision-and-recognition/custom-vision-training.md) — The project supports custom model training by allowing users to prepare specialized datasets and adjust learning parameters to improve recognition accuracy for unique symbols. ([source](https://cdn.jsdelivr.net/gh/lukas-blecher/LaTeX-OCR@main/README.md))
- [Optical Character Recognition](https://awesome-repositories.com/f/artificial-intelligence-ml/optical-character-recognition.md) — A specialized computer vision system designed to identify and transcribe complex mathematical symbols and spatial relationships from images.
- [Vision Transformers](https://awesome-repositories.com/f/artificial-intelligence-ml/vision-transformers.md) — Processes input images through a hierarchical attention mechanism to map visual features into a sequence of latent mathematical tokens.
- [Deep Learning Frameworks](https://awesome-repositories.com/f/artificial-intelligence-ml/deep-learning-frameworks.md) — A framework for preparing datasets and fine-tuning recognition engines to improve accuracy for unique symbols or specific handwriting styles.
- [Autoregressive Decoding Strategies](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/architectures/transformer/autoregressive-decoding-strategies.md) — Generates LaTeX strings token by token by predicting the next character based on previously generated symbols and visual context.

### Testing & Quality Assurance

- [Formula Capture Tools](https://awesome-repositories.com/f/testing-quality-assurance/automation-interaction-tools/screenshot-capture/formula-capture-tools.md) — The project provides an interactive interface for capturing mathematical formulas from screenshots, converting visual equations into structured notation for standard typesetting. ([source](https://cdn.jsdelivr.net/gh/lukas-blecher/LaTeX-OCR@main/README.md))
