# openinterpreter/01

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/openinterpreter-01).**

5,129 stars · 530 forks · Python · AGPL-3.0

## Links

- GitHub: https://github.com/openinterpreter/01
- Homepage: https://01.openinterpreter.com/
- awesome-repositories: https://awesome-repositories.com/repository/openinterpreter-01.md

## Description

01 is a voice-to-code agent and language model voice interface framework that enables natural language control of computers and devices. It functions as a real-time audio streaming server and a cross-platform voice client, translating spoken instructions into executable code to automate software, manage files, and browse the web.

The system supports both local and cloud-based language models, alongside local or hosted speech-to-text and text-to-speech engines. It is designed for custom hardware integration, providing the means to build embedded AI voice controllers using microcontrollers like the ESP32, including 3D-printable files for case fabrication and hardware assembly guidance.

The project covers a broad range of capabilities, including audio processing via WebSockets, agent behavior configuration through profile management, and remote access via server tunneling. It also includes security features such as execution environment isolation and system change auditing to manage the risks of autonomous code execution.

The system can be deployed across various platforms, from low-power microcontrollers to full desktop operating systems, using a unified server and client execution model.

## Tags

### Artificial Intelligence & ML

- [Voice Assistant Hardware](https://awesome-repositories.com/f/artificial-intelligence-ml/voice-assistants/voice-assistant-hardware.md) — Provides instructions and materials for building physical voice interface devices using ESP32 chips and microcontrollers.
- [Voice Controlled Computing](https://awesome-repositories.com/f/artificial-intelligence-ml/voice-controlled-computing.md) — Enables the execution of system-level operations and complex computer tasks via spoken natural language commands. ([source](https://cdn.jsdelivr.net/gh/openinterpreter/01@main/README.md))
- [Language Model Integrations](https://awesome-repositories.com/f/artificial-intelligence-ml/artificial-intelligence-tooling/language-model-integrations.md) — Provides adapters and streaming interfaces to connect the system to various hosted or local language model providers. ([source](https://01.openinterpreter.com/server/configure.md))
- [Natural Language Command Translation](https://awesome-repositories.com/f/artificial-intelligence-ml/llm-translation-integrations/natural-language-command-translation.md) — Translates natural language voice input into executable system commands for desktop and file management automation.
- [Local Model Integrations](https://awesome-repositories.com/f/artificial-intelligence-ml/local-model-integrations.md) — Integrates on-device inference providers to enable intelligence layers without reliance on external cloud APIs. ([source](https://01.openinterpreter.com/server/configure))
- [Cross-Platform Deployments](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-deployment-and-serving/local-and-on-device-inference/edge-ai-model-deployment/cross-platform-deployments.md) — Enables deployment across a diverse hardware spectrum, from ESP32 microcontrollers to full desktop operating systems. ([source](https://cdn.jsdelivr.net/gh/openinterpreter/01@main/README.md))
- [Speech-to-Text and Text-to-Speech Integrations](https://awesome-repositories.com/f/artificial-intelligence-ml/speech-to-text-and-text-to-speech-integrations.md) — Integrates both speech-to-text and text-to-speech engines to process voice data without external cloud APIs. ([source](https://01.openinterpreter.com/server/light))
- [Assistant Personalization](https://awesome-repositories.com/f/artificial-intelligence-ml/voice-assistants/assistant-personalization.md) — Supports the definition of custom system messages to establish the identity and behavioral constraints of the voice assistant. ([source](https://01.openinterpreter.com/server/configure.md))
- [Voice-to-Code Translation](https://awesome-repositories.com/f/artificial-intelligence-ml/voice-to-code-translation.md) — Converts spoken natural language requests into executable code to automate file management and software operations.
- [Multimodal Context Providers](https://awesome-repositories.com/f/artificial-intelligence-ml/agent-context-providers/multimodal-context-providers.md) — Monitors ambient audio and surroundings to provide multimodal situational awareness to the agent when not actively prompted. ([source](https://01.openinterpreter.com/client/android-ios.md))
- [Voice-Activated Triggers](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-systems-frameworks/conversational-voice-interaction/voice-agents/voice-activity-detection/voice-activated-triggers.md) — Captures audio input via manual buttons or voice activity detection to trigger server requests.
- [Agent Behavioral Configuration](https://awesome-repositories.com/f/artificial-intelligence-ml/custom-ai-models/agent-behavioral-configuration.md) — Allows customizing AI agent responses and behaviors through specialized instructions and system messages. ([source](https://cdn.jsdelivr.net/gh/openinterpreter/01@main/README.md))

### Part of an Awesome List

- [Real-Time Audio WebSockets](https://awesome-repositories.com/f/awesome-lists/devtools/websockets-and-real-time/real-time-audio-websockets.md) — Transmits real-time audio byte streams between clients and servers using WebSockets for low-latency communication.

### Development Tools & Productivity

- [Natural Language Automation](https://awesome-repositories.com/f/development-tools-productivity/cross-platform-automation-frameworks/natural-language-automation.md) — Translates natural language instructions into cross-platform operating system tasks, web browsing, and software operations. ([source](https://01.openinterpreter.com/))
- [Agent Model Profiles](https://awesome-repositories.com/f/development-tools-productivity/cli-profiling-tools/profile-management/agent-model-profiles.md) — Defines language models and context windows through profiles to customize agent intelligence and voice output. ([source](https://01.openinterpreter.com/server/configure))
- [Isolated Execution Environments](https://awesome-repositories.com/f/development-tools-productivity/isolated-execution-environments.md) — Runs autonomous code in virtual machines or restricted accounts to prevent unintended changes to the host system. ([source](https://01.openinterpreter.com/safety/measures))
- [Cross-Platform Compatibility](https://awesome-repositories.com/f/development-tools-productivity/platforms-runtimes-language-services/cross-platform-compatibility.md) — Supports a flexible server design that operates across diverse environments from ESP32 microcontrollers to desktop OSs.

### Graphics & Multimedia

- [Hardware Audio Streaming](https://awesome-repositories.com/f/graphics-multimedia/audio-music/audio-streaming-engines/audio-playback-engines/chunked-audio-streaming/hardware-audio-streaming.md) — Streams audio byte streams directly to physical hardware devices for real-time voice interaction and playback. ([source](https://01.openinterpreter.com/server/light.md))
- [Audio Input Capture](https://awesome-repositories.com/f/graphics-multimedia/audio-input-capture.md) — Captures raw audio data from microphones using push-to-talk or voice activity detection for system interaction. ([source](https://01.openinterpreter.com/client/android-ios.md))

### Hardware & IoT

- [IoT Device Server Implementation](https://awesome-repositories.com/f/hardware-iot/iot-device-server-implementation.md) — Implements a server backend optimized for managing communication and logic for IoT hardware. ([source](https://01.openinterpreter.com/server/introduction.md))
- [Hardware Assembly Guides](https://awesome-repositories.com/f/hardware-iot/hardware-assembly-guides.md) — Enables the assembly of portable or desktop intelligent devices using microcontrollers and custom physical configurations. ([source](https://01.openinterpreter.com/setup/introduction.md))
- [Physical Construction Instructions](https://awesome-repositories.com/f/hardware-iot/physical-construction-instructions.md) — Provides comprehensive visual and written instructions for physically constructing the voice interface hardware. ([source](https://01.openinterpreter.com/hardware/01-light/assembly.md))

### Networking & Communication

- [Voice Clients](https://awesome-repositories.com/f/networking-communication/cross-platform-chat-clients/voice-clients.md) — Ships a flexible voice client for capturing audio and playing responses across desktop, mobile, and ESP32 platforms.
- [Real-Time Voice Backend Hosting](https://awesome-repositories.com/f/networking-communication/real-time-voice-backend-hosting.md) — Runs a real-time server facilitating bidirectional audio streams via WebSockets for AI agents. ([source](https://01.openinterpreter.com/server/livekit.md))
- [Audio Streaming Servers](https://awesome-repositories.com/f/networking-communication/socket-networking/audio-streaming-servers.md) — Implements a dedicated server for low-latency audio streaming between hardware clients and an intelligence backend.
- [Voice and Vision Processing](https://awesome-repositories.com/f/networking-communication/voice-and-vision-processing.md) — Processes natural language audio input to trigger system actions, file management, and software control. ([source](https://01.openinterpreter.com/setup/introduction.md))
- [Natural Language Interfaces](https://awesome-repositories.com/f/networking-communication/voice-and-vision-processing/embedded-voice-processing/natural-language-interfaces.md) — Provides a hardware-integrated system for deploying natural language interfaces on low-power chips and custom electronics.
- [WebSocket Connection Management](https://awesome-repositories.com/f/networking-communication/websocket-connection-management.md) — Configures WebSocket addresses to establish live, low-latency communication between a mobile device and the computer. ([source](https://01.openinterpreter.com/client/native-ios.md))
- [WebSocket Clients and Servers](https://awesome-repositories.com/f/networking-communication/distributed-systems-p2p/distributed-computing/communication-protocols/websocket-implementations/websocket-clients-and-servers.md) — Implements both client and server components for bidirectional communication via WebSockets to link interfaces and backends. ([source](https://01.openinterpreter.com/server/flags.md))
- [Local Server Tunnels](https://awesome-repositories.com/f/networking-communication/local-server-tunnels.md) — Tunnels a local server to a public URL using a proxy to facilitate remote access from external networks. ([source](https://01.openinterpreter.com/server/flags.md))
- [Remote Access & Control](https://awesome-repositories.com/f/networking-communication/remote-access-control.md) — Provides interfaces and protocols for remotely interacting with a home server's files and applications via a mobile device. ([source](https://01.openinterpreter.com/client/android-ios))
- [Remote Device Connectivity](https://awesome-repositories.com/f/networking-communication/remote-device-connectivity.md) — Sets network and server credentials via firmware or captive portals to establish connectivity between hardware and servers. ([source](https://01.openinterpreter.com/client/esp32.md))
- [Voice Server Client Libraries](https://awesome-repositories.com/f/networking-communication/voice-platform-integrations/voice-server-client-libraries.md) — Provides dedicated client libraries to connect various programming languages and platforms to the voice server. ([source](https://01.openinterpreter.com/server/livekit.md))

### User Interface & Experience

- [Microcontroller Audio Interfaces](https://awesome-repositories.com/f/user-interface-experience/voice-input-integration/microcontroller-audio-interfaces.md) — Integrates microcontrollers with audio and wireless capabilities to provide a physical interface for voice interactions. ([source](https://01.openinterpreter.com/hardware/01-light/chip.md))
- [Voice Interaction Interfaces](https://awesome-repositories.com/f/user-interface-experience/voice-interaction-interfaces.md) — Provides the user interface layer for speech-based input and synthesized audio output to control a computer. ([source](https://01.openinterpreter.com/))
- [Voice Interfaces](https://awesome-repositories.com/f/user-interface-experience/voice-interfaces.md) — Installs the necessary runtime dependencies and environments to enable natural language computer control across various operating systems. ([source](https://01.openinterpreter.com/setup/installation))

### Web Development

- [Headless Server Hosting](https://awesome-repositories.com/f/web-development/web-infrastructure-deployment/web-infrastructure-servers/web-server-hosting/headless-server-hosting.md) — Hosts non-graphical backend logic to provide API access and a communication link between computers and hardware. ([source](https://01.openinterpreter.com/hardware/01-light/connect.md))

### DevOps & Infrastructure

- [Assistant Behavioral Profiles](https://awesome-repositories.com/f/devops-infrastructure/configuration-management/file-based-configuration/configuration-profiles/assistant-behavioral-profiles.md) — Allows users to create and switch between named configuration profiles to adjust model settings and agent behaviors. ([source](https://01.openinterpreter.com/server/configure.md))

### Operating Systems & Systems Programming

- [Client-Server Hardware Architectures](https://awesome-repositories.com/f/operating-systems-systems-programming/client-server-hardware-architectures.md) — Separates the high-level audio capture and playback interface from the heavy computational logic on the server.

### Security & Cryptography

- [Change Auditing](https://awesome-repositories.com/f/security-cryptography/application-and-system-security/sandbox-and-isolation/isolated-execution-sandboxes/wasm-sandboxes/sandbox-verification/change-auditing.md) — Compares system files and settings before and after sessions to identify and revert unexpected modifications made by the agent. ([source](https://01.openinterpreter.com/safety/measures))
- [Network Access Restrictions](https://awesome-repositories.com/f/security-cryptography/network-access-restrictions.md) — Uses firewalls and VPNs to implement policies that limit the agent's ability to interact with the broader network. ([source](https://01.openinterpreter.com/safety/measures))

### Software Engineering & Architecture

- [Modular Provider Interfaces](https://awesome-repositories.com/f/software-engineering-architecture/modular-provider-interfaces.md) — Implements architectural patterns that allow swapping between local on-device inference and cloud-based APIs for processing.

### System Administration & Monitoring

- [Remote Agent Hardware Linking](https://awesome-repositories.com/f/system-administration-monitoring/remote-agent-hardware-linking.md) — Establishes architectural links between mobile devices and home machines to allow agent-driven control of files and IoT devices. ([source](https://01.openinterpreter.com/client/android-ios.md))
