2 repos
Models designed to interpret and analyze visual data, charts, or cross-modal inputs alongside text.
Explore 2 awesome GitHub repositories matching artificial intelligence & ml · Multimodal Perception Models. Refine with filters or upvote what's useful.
This project is an artificial intelligence-powered frontend generator that translates visual design inputs into functional source code. It functions as a workflow engine that interprets graphical user interfaces, mapping layout structures and styling rules to structured markup and programming language syntax. The tool
This project is a comprehensive educational resource and knowledge base dedicated to the development and application of large language models and autonomous agentic systems. It provides a structured framework for understanding prompt engineering, context management, and the architectural patterns required to build task