# bytebot-ai/bytebot

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/bytebot-ai-bytebot).**

10,413 stars · 1,354 forks · TypeScript · apache-2.0

## Links

- GitHub: https://github.com/bytebot-ai/bytebot
- Homepage: https://www.bytebot.ai/
- awesome-repositories: https://awesome-repositories.com/repository/bytebot-ai-bytebot.md

## Topics

`agent` `agentic-ai` `agents` `ai` `ai-agents` `ai-tools` `anthropic` `automation` `bytebot` `computer-use` `computer-use-agent` `cua` `desktop` `desktop-automation` `docker` `gemini` `llm` `mcp` `openai`

## Description

Bytebot is an LLM desktop automation framework and virtual Linux desktop environment. It enables AI agents to plan and execute mouse and keyboard actions on a virtual computer using natural language, allowing for autonomous desktop automation and the integration of legacy systems that lack native APIs.

The system operates as an LLM API gateway and a Model Context Protocol server, routing requests across multiple language model providers with integrated load balancing and rate limiting. It provides isolated, containerized environments where agents use visual reasoning to interpret screenshots and translate goals into precise UI actions.

The platform includes a comprehensive suite of orchestration tools for managing asynchronous task lifecycles, programmatic desktop control via REST, and real-time state streaming via WebSockets. It supports hybrid control modes, allowing users to monitor agent execution through a browser-based viewer and intervene manually when necessary.

Deployment is supported through Docker Compose, Helm charts for Kubernetes orchestration, and one-click cloud templates for private infrastructure hosting.

## Tags

### Artificial Intelligence & ML

- [Desktop Automation Frameworks](https://awesome-repositories.com/f/artificial-intelligence-ml/desktop-automation-frameworks.md) — Implements a comprehensive framework for planning and executing autonomous mouse and keyboard actions on a virtual desktop via LLMs.
- [Visual Reasoning](https://awesome-repositories.com/f/artificial-intelligence-ml/agent-architectures/orchestration-engines/ai-agent/reasoning-action-loops/visual-reasoning.md) — Implements visual reasoning to interpret screenshots and translate natural language goals into precise UI coordinates.
- [Human-in-the-loop Workflows](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-systems-frameworks/agent-orchestration-multi-agent/control-flow-and-workflows/human-in-the-loop-workflows.md) — Provides mechanisms for pausing agent execution to allow humans to resolve errors or demonstrate workflows via recorded actions. ([source](https://docs.bytebot.ai/guides/takeover-mode.md))
- [AI Agent Orchestrators](https://awesome-repositories.com/f/artificial-intelligence-ml/ai-agent-orchestrators.md) — Plans and executes sequences of complex workflows using LLMs with real-time visual feedback.
- [Autonomous Agent Loops](https://awesome-repositories.com/f/artificial-intelligence-ml/autonomous-agent-loops.md) — Implements an autonomous loop of planning, acting, and observing to adapt to visual feedback and recover from errors.
- [Autonomous Task Execution](https://awesome-repositories.com/f/artificial-intelligence-ml/autonomous-task-execution.md) — Executes autonomous, multi-step intelligence tasks programmatically and tracks progress via real-time updates. ([source](https://docs.bytebot.ai/api-reference/introduction.md))
- [LLM Gateways](https://awesome-repositories.com/f/artificial-intelligence-ml/llm-gateways.md) — Routes requests across multiple LLM providers with integrated load balancing, rate limiting, and failover. ([source](https://docs.bytebot.ai/deployment/litellm.md))
- [Model Context Protocol Servers](https://awesome-repositories.com/f/artificial-intelligence-ml/model-context-protocol-servers.md) — Implements a Model Context Protocol server that exposes virtual desktop control tools to external AI clients.
- [Multi-Agent Task Orchestrators](https://awesome-repositories.com/f/artificial-intelligence-ml/multi-agent-task-orchestrators.md) — Coordinates complex, multi-step workflows involving virtual desktop operation and web portal navigation via natural language. ([source](https://cdn.jsdelivr.net/gh/bytebot-ai/bytebot@main/README.md))
- [Dynamic Plan Refinement](https://awesome-repositories.com/f/artificial-intelligence-ml/task-planning-systems/dynamic-plan-refinement.md) — Adjusts task plans and action sequences in real-time based on visual feedback from the virtual desktop. ([source](https://docs.bytebot.ai/core-concepts/agent-system.md))
- [Visual Interface Parsers](https://awesome-repositories.com/f/artificial-intelligence-ml/visual-interface-parsers.md) — Interacts with user interface elements using visual intelligence to decompose graphical interfaces for machine reasoning. ([source](https://docs.bytebot.ai/core-concepts/rpa-comparison.md))
- [Visual State Capture](https://awesome-repositories.com/f/artificial-intelligence-ml/visual-state-capture.md) — Captures screenshots of the current environment and retrieves the precise cursor position for AI analysis. ([source](https://docs.bytebot.ai/api-reference/computer-use/unified-endpoint.md))
- [Agent Deployment](https://awesome-repositories.com/f/artificial-intelligence-ml/agent-deployment.md) — Launches full virtual computer environments and agent stacks via one-click templates. ([source](https://docs.bytebot.ai/deployment/railway.md))
- [Autonomous Flow Overrides](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-systems-frameworks/development-runtime-environments/manual-agent-loaders/autonomous-flow-overrides.md) — Allows users to override autonomous control and manually guide the agent through specific actions via takeover mode. ([source](https://docs.bytebot.ai/quickstart))
- [Agent Streaming Interfaces](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-systems-frameworks/integration-deployment/agent-frameworks/agent-runtimes/agent-streaming-interfaces.md) — Provides a real-time stream of agent actions and tool results through a web-based interface. ([source](https://docs.bytebot.ai/api-reference/agent/ui.md))
- [Human-in-the-Loop Steering](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-systems-frameworks/integration-deployment/agent-frameworks/agent-runtimes/steering-and-streaming-controls/human-in-the-loop-steering.md) — Toggles between fully autonomous task completion and manual human steering of the virtual desktop. ([source](https://docs.bytebot.ai/introduction.md))
- [Model Context Protocol](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-systems-frameworks/model-integration-serving/model-integration-interfaces/model-context-protocol.md) — Exposes desktop control tools via a standardized Model Context Protocol endpoint for external AI clients.
- [Model Context Protocol Clients](https://awesome-repositories.com/f/artificial-intelligence-ml/model-context-protocol-clients.md) — Integrates external clients via the Model Context Protocol to extend the agent's available tools and capabilities. ([source](https://docs.bytebot.ai/quickstart.md))
- [Model Context Protocol Implementations](https://awesome-repositories.com/f/artificial-intelligence-ml/model-context-protocol-implementations.md) — Implements a Model Context Protocol server to share desktop control tools with external clients over SSE. ([source](https://docs.bytebot.ai/api-reference/introduction.md))
- [Model Provider Proxies](https://awesome-repositories.com/f/artificial-intelligence-ml/model-provider-proxies.md) — Centralizes API key management and implements load balancing across multiple LLM providers through a proxy layer.

### Operating Systems & Systems Programming

- [Desktop Automation](https://awesome-repositories.com/f/operating-systems-systems-programming/desktop-environment-frameworks/desktop-environment-components/desktop-automation.md) — Uses natural language to control a virtual computer and its applications to perform multi-step tasks.
- [Virtualized Desktop Environments](https://awesome-repositories.com/f/operating-systems-systems-programming/virtualization-emulation/virtualization-platforms/virtualization-platforms/virtualized-desktop-environments.md) — Provides isolated Linux desktop environments running in Docker containers for secure and reproducible AI agent execution.

### Part of an Awesome List

- [Natural Language Automation](https://awesome-repositories.com/f/awesome-lists/productivity/task-automation/natural-language-automation.md) — Translates descriptive natural language goals into executable computer actions that adapt to interface changes. ([source](https://docs.bytebot.ai/core-concepts/rpa-comparison.md))

### Development Tools & Productivity

- [Keyboard Input Automation](https://awesome-repositories.com/f/development-tools-productivity/input-pointer-automation/keyboard-automation/keyboard-input-automation.md) — Simulates keyboard input by typing strings and pressing specific keys within the virtual environment. ([source](https://docs.bytebot.ai/api-reference/computer-use/examples.md))
- [Mouse Control Automation](https://awesome-repositories.com/f/development-tools-productivity/input-pointer-automation/mouse-automation/mouse-control-automation.md) — Simulates mouse interactions including moving the cursor, clicking, and dragging based on coordinates. ([source](https://docs.bytebot.ai/api-reference/computer-use/examples.md))
- [RESTful Desktop Control](https://awesome-repositories.com/f/development-tools-productivity/input-pointer-automation/mouse-automation/restful-desktop-control.md) — Provides a REST interface to programmatically execute desktop actions such as capturing screenshots and moving the mouse. ([source](https://docs.bytebot.ai/core-concepts/desktop-environment.md))
- [Sequential Task Dependencies](https://awesome-repositories.com/f/development-tools-productivity/task-dependency-management/agent-task-dependency-resolvers/sequential-task-dependencies.md) — Executes sequences of tasks where subsequent actions rely directly on the output of previous steps. ([source](https://docs.bytebot.ai/guides/task-creation.md))

### DevOps & Infrastructure

- [Container Isolation Technologies](https://awesome-repositories.com/f/devops-infrastructure/container-isolation-technologies.md) — Runs each virtual desktop in a dedicated container to ensure secure isolation from the host system. ([source](https://docs.bytebot.ai/))
- [Containerized Environments](https://awesome-repositories.com/f/devops-infrastructure/containerized-environments.md) — Runs isolated Linux environments in Docker containers to provide a secure workspace for AI agents. ([source](https://docs.bytebot.ai/core-concepts/agent-system.md))
- [Legacy Interface Automators](https://awesome-repositories.com/f/devops-infrastructure/execution-environments/legacy-runtime-simulators/legacy-interface-automators.md) — Bridges legacy software lacking APIs by automating user interface interactions via a virtual desktop. ([source](https://docs.bytebot.ai/introduction.md))
- [Virtual Input Simulation](https://awesome-repositories.com/f/devops-infrastructure/virtual-hardware-interfaces/virtual-input-simulation.md) — Manipulates the virtual mouse and keyboard and captures screen images to automate computer interactions. ([source](https://docs.bytebot.ai/api-reference/introduction.md))
- [Task Lifecycle Management](https://awesome-repositories.com/f/devops-infrastructure/automation-orchestration/task-execution-frameworks/task-job-management/task-lifecycle-management.md) — Tracks the full lifecycle of a job from creation and queuing to final completion or failure. ([source](https://docs.bytebot.ai/guides/task-creation.md))
- [Storage and Data Persistence](https://awesome-repositories.com/f/devops-infrastructure/cloud-infrastructure/storage-data-persistence.md) — Configures storage classes and disk sizes to ensure data persistence across environment restarts. ([source](https://docs.bytebot.ai/deployment/helm.md))
- [Application Environment Managers](https://awesome-repositories.com/f/devops-infrastructure/configuration-management/environment-management/application-environment-managers.md) — Manages the runtime configuration, installation state, and session persistence of desktop applications. ([source](https://cdn.jsdelivr.net/gh/bytebot-ai/bytebot@main/README.md))
- [Container Deployment](https://awesome-repositories.com/f/devops-infrastructure/container-deployment.md) — Provides environment installation using Docker Compose and one-click cloud deployment services. ([source](https://docs.bytebot.ai/introduction.md))
- [Custom Desktop Image Extensions](https://awesome-repositories.com/f/devops-infrastructure/custom-desktop-image-extensions.md) — Allows the installation of additional applications and tools via custom Dockerfiles for the virtual environment. ([source](https://docs.bytebot.ai/core-concepts/desktop-environment.md))
- [API-Driven Task Triggers](https://awesome-repositories.com/f/devops-infrastructure/http-api-interfaces/api-driven-task-triggers.md) — Allows external applications to trigger new agent tasks and retrieve execution history via HTTP requests. ([source](https://docs.bytebot.ai/core-concepts/agent-system.md))
- [Asynchronous Task Queuing](https://awesome-repositories.com/f/devops-infrastructure/infrastructure/networking/messaging-infrastructure-integrations/asynchronous-task-queuing.md) — Organizes and prioritizes asynchronous jobs with automatic recovery for failed actions. ([source](https://docs.bytebot.ai/core-concepts/agent-system.md))
- [Kubernetes Orchestration](https://awesome-repositories.com/f/devops-infrastructure/kubernetes-orchestration.md) — Installs the environment on a cluster using Helm charts to manage agent lifecycles. ([source](https://docs.bytebot.ai/deployment/helm.md))
- [AI Agent Instance Lifecycle Management](https://awesome-repositories.com/f/devops-infrastructure/kubernetes-orchestration/ai-agent-instance-lifecycle-management.md) — Provides a deployment framework using Helm charts to manage the lifecycle of isolated AI agent instances on a cluster.

### Security & Cryptography

- [Execution Isolation](https://awesome-repositories.com/f/security-cryptography/execution-isolation.md) — Runs virtual desktops in isolated containers with restricted network access to protect the host system. ([source](https://docs.bytebot.ai/core-concepts/architecture.md))
- [Automated Login Frameworks](https://awesome-repositories.com/f/security-cryptography/authentication-services/automated-login-frameworks.md) — Automates multi-step login flows, including passwords and 2FA, using browser extensions within the virtual environment. ([source](https://docs.bytebot.ai/guides/password-management.md))

### Business & Productivity Software

- [Interaction History Inspectors](https://awesome-repositories.com/f/business-productivity-software/interaction-history-tracking/interaction-history-inspectors.md) — Displays a formatted history of AI interactions, including captured screenshots and technical tool outputs. ([source](https://docs.bytebot.ai/api-reference/agent/ui.md))
- [Agent Task Assignment](https://awesome-repositories.com/f/business-productivity-software/task-workflow-automation/productivity-task-management/task-management/tasks-frameworks/agent-task-assignment.md) — Allows the creation of new tasks with specific descriptions, priorities, and associated files for the agent to process. ([source](https://docs.bytebot.ai/api-reference/agent/tasks.md))

### Data & Databases

- [Persistent Conversation Stores](https://awesome-repositories.com/f/data-databases/persistent-conversation-stores.md) — Stores task metadata and AI dialogue in a database to maintain state across sessions. ([source](https://docs.bytebot.ai/core-concepts/architecture.md))
- [Virtual Environment File I/O](https://awesome-repositories.com/f/data-databases/virtual-file-systems/virtual-environment-file-i-o.md) — Reads and writes files directly within the virtual desktop environment to handle data persistence. ([source](https://docs.bytebot.ai/api-reference/computer-use/examples.md))

### Networking & Communication

- [Real-time Communication](https://awesome-repositories.com/f/networking-communication/communication-platforms-services/real-time-communication.md) — Enables instantaneous communication between users and agents via live chat and visual monitoring tools. ([source](https://docs.bytebot.ai/core-concepts/agent-system.md))

### Software Engineering & Architecture

- [Asynchronous Task Orchestrators](https://awesome-repositories.com/f/software-engineering-architecture/asynchronous-task-orchestrators.md) — Manages the lifecycle of asynchronous jobs through a programmatic REST API for queuing and tracking.

### System Administration & Monitoring

- [Remote Desktop Environments](https://awesome-repositories.com/f/system-administration-monitoring/remote-desktop-environments.md) — Enables interaction with the virtual environment through a browser-based interface without local clients. ([source](https://docs.bytebot.ai/core-concepts/desktop-environment.md))
- [Task Monitoring](https://awesome-repositories.com/f/system-administration-monitoring/task-monitoring.md) — Retrieves the current state and execution progress of specific tasks using their identifiers. ([source](https://docs.bytebot.ai/api-reference/agent/tasks.md))

### User Interface & Experience

- [Virtual Desktop Monitoring](https://awesome-repositories.com/f/user-interface-experience/live-stream-dashboards/virtual-desktop-monitoring.md) — Provides a live stream of the virtual desktop with the ability to take manual control via a viewer. ([source](https://docs.bytebot.ai/core-concepts/architecture.md))

### Web Development

- [UI State Streaming](https://awesome-repositories.com/f/web-development/real-time-data-streaming/ui-state-streaming.md) — Streams real-time desktop views and agent status updates to a web interface via WebSockets.
- [Real-Time UI Synchronization](https://awesome-repositories.com/f/web-development/real-time-ui-synchronization.md) — Uses WebSockets to stream real-time UI updates and agent status to custom user interfaces. ([source](https://docs.bytebot.ai/api-reference/introduction))
