awesome-repositories.comBlog
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPBlogSitemapPrivacyTerms
Vision-Based UI Parsers · Awesome GitHub Repositories

1 repo

Awesome GitHub RepositoriesVision-Based UI Parsers

Tools that convert visual interface screenshots into structured data for element identification.

Distinguishing note: Focuses on the parser tool specifically for element identification in automated workflows.

Explore 1 awesome GitHub repository matching artificial intelligence & ml · Vision-Based UI Parsers. Refine with filters or upvote what's useful.

  1. Home
  2. Artificial Intelligence & ML
  3. Vision-Based UI Parsers

Awesome Vision-Based UI Parsers GitHub Repositories

Describe the repository you're looking for…
Find the best repos with AI.We'll search the best matching repositories with AI.
  • microsoft/OmniParser

    microsoft/OmniParser

    24,377View on GitHub↗

    OmniParser is a multimodal interaction engine designed to function as a desktop automation agent. It interprets visual screen information to execute complex, multi-step tasks across operating system environments by bridging visual interface perception with language models. Through a continuous cycle of observation and command execution, the system grounds high-level natural language instructions into precise, coordinate-based actions. The project distinguishes itself by utilizing vision-based parsing to interact with software interfaces without requiring access to underlying application progr

    Converts visual interface screenshots into structured data representations to enable accurate element identification.

    Jupyter Notebook
    24,377View on GitHub↗