1 repo
Analyzes screen captures to identify visual elements for interaction.
Explore 1 GitHub repository matching testing & quality assurance · Vision-Enabled.
Open Interpreter is an autonomous agent runtime that translates natural language instructions into executable code to interact with local software and operating systems. It functions as an orchestration framework that connects language models to a secure execution environment, enabling the development of agents capable
Identifies visual elements via screen capture to programmatically simulate mouse and keyboard interactions.