awesome-repositories.com
Vision-Based UI Parsing Libraries · Awesome GitHub Repositories

1 repo


Libraries for transforming interface screenshots into structured data for AI interaction.

Distinguishing note: Focuses on the transformation of static screenshots into machine-readable formats.

1 GitHub repository is listed under Artificial Intelligence & ML · Vision-Based UI Parsing Libraries.


Awesome Vision-Based UI Parsing Libraries GitHub Repositories

  • microsoft/OmniParser


    24,377 · View on GitHub

    OmniParser is a multimodal interaction engine designed to function as a desktop automation agent. It interprets visual screen information to execute complex, multi-step tasks across operating system environments by bridging visual interface perception with language models. Through a continuous cycle of observation and command execution, the system grounds high-level natural language instructions into precise, coordinate-based actions. The project distinguishes itself by utilizing vision-based parsing to interact with software interfaces without requiring access to underlying application progr

    Transforms static screenshots of software interfaces into structured data formats for artificial intelligence models.

    Jupyter Notebook
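
To make "structured data" and "coordinate-based actions" concrete, here is a minimal sketch of what a parsed screenshot and a grounded click might look like. It does not use OmniParser or reflect its actual output format: the `UIElement` schema, the `find_element` and `ground_click` helpers, and the sample elements are all hypothetical, chosen only for illustration.

```python
import json
from dataclasses import dataclass, asdict
from typing import List, Optional, Tuple


@dataclass
class UIElement:
    """One detected interface element (hypothetical schema, not OmniParser's)."""
    label: str                        # visible text or a short caption
    kind: str                         # e.g. "button", "text_field", "menu"
    bbox: Tuple[int, int, int, int]   # (left, top, right, bottom) in pixels

    def center(self) -> Tuple[int, int]:
        """Return the pixel at the middle of the bounding box."""
        left, top, right, bottom = self.bbox
        return ((left + right) // 2, (top + bottom) // 2)


def find_element(elements: List[UIElement], query: str) -> Optional[UIElement]:
    """Return the first element whose label contains the query (case-insensitive)."""
    needle = query.lower()
    for element in elements:
        if needle in element.label.lower():
            return element
    return None


def ground_click(elements: List[UIElement], query: str) -> Optional[dict]:
    """Turn a label query into a coordinate-based click action, or None if no match."""
    element = find_element(elements, query)
    if element is None:
        return None
    x, y = element.center()
    return {"action": "click", "x": x, "y": y}


# Hand-written stand-in for the result of parsing one screenshot.
parsed_screen = [
    UIElement("File", "menu", (0, 0, 40, 20)),
    UIElement("Save", "button", (100, 50, 160, 80)),
    UIElement("Search", "text_field", (200, 50, 400, 80)),
]

# The structured form a language model could read instead of raw pixels.
print(json.dumps([asdict(e) for e in parsed_screen], indent=2))

# Grounding a simple instruction target into screen coordinates.
print(ground_click(parsed_screen, "save"))
```

In a real pipeline a detection model would produce the element list and a language model would choose the target; the substring match here is only a placeholder for that step.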