This project is an automated image translation pipeline optimized for manga and comics. It chains text detection, machine translation, and typesetting, and is available as an image translation API, a command-line tool for batch processing, and an LLM-powered translator.
The system uses OCR to detect text regions and an inpainter to remove the original text by synthesizing background pixels in its place. An automated typesetter then overlays the translated text, managing font size, color, and reading direction based on the coordinates of the original text.
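To illustrate how a typesetter might pick a font size from a detected region, here is a minimal sketch, not the project's actual implementation. The function name `fit_font_size` and its parameters are hypothetical, and it assumes a fixed glyph width and line height relative to the font size; a real typesetter would measure glyphs with the font in use.

```python
import math


def fit_font_size(text: str, box_width: int, box_height: int,
                  min_size: int = 8, max_size: int = 72,
                  char_aspect: float = 0.6, line_spacing: float = 1.2) -> int:
    """Return the largest font size (in pixels) at which `text` fits the box.

    Assumption: every glyph is `char_aspect * size` wide and every line is
    `line_spacing * size` tall. Text is wrapped by character count only.
    """
    # Try sizes from largest to smallest and return the first one that fits.
    for size in range(max_size, min_size - 1, -1):
        chars_per_line = int(box_width // (char_aspect * size))
        if chars_per_line < 1:
            continue  # not even one glyph fits on a line at this size
        lines_needed = math.ceil(len(text) / chars_per_line)
        if lines_needed * line_spacing * size <= box_height:
            return size
    # Nothing fits: fall back to the minimum size and accept overflow.
    return min_size
```

Searching downward from the largest size keeps the sketch simple; with a real font the same loop would call the font's text-measurement routine instead of the fixed-ratio estimate.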
The workflow includes image upscaling to improve detection accuracy and a terminology management system using regular expression dictionaries and glossaries for text correction. Final outputs can be generated as multi-layered files that separate the original artwork, inpainted backgrounds, and translated text boxes.
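The regular expression dictionary idea can be sketched as an ordered list of pattern and replacement pairs applied to the translated text. This is an illustrative assumption about how such a dictionary could work: the rule list, its entries, and the `apply_dictionary` helper are hypothetical and do not reflect the project's actual file format.

```python
import re

# Hypothetical dictionary: each entry is (regex pattern, replacement).
# Rules run in order, so later rules see the output of earlier ones.
POST_TRANSLATION_RULES = [
    (r"\bsempai\b", "senpai"),   # normalize a romanization variant
    (r"\s+([!?.,])", r"\1"),     # drop whitespace before punctuation
    (r"\.{4,}", "..."),          # collapse long runs of dots to an ellipsis
]


def apply_dictionary(text: str, rules: list[tuple[str, str]]) -> str:
    """Apply each (pattern, replacement) rule to `text` in sequence."""
    for pattern, replacement in rules:
        text = re.sub(pattern, replacement, text)
    return text


corrected = apply_dictionary("sempai , wait ....", POST_TRANSLATION_RULES)
print(corrected)  # senpai, wait...
```

Keeping the rules as ordered data rather than code means a glossary can be edited without touching the pipeline itself.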