# 78/xiaozhi-esp32

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/78-xiaozhi-esp32).**

24,092 stars · 5,143 forks · C++ · mit

## Links

- GitHub: https://github.com/78/xiaozhi-esp32
- Homepage: https://xiaozhi.me
- awesome-repositories: https://awesome-repositories.com/repository/78-xiaozhi-esp32.md

## Topics

`chatbot` `esp32` `mcp`

## Description

Xiaozhi-esp32 is an open-source firmware platform designed for building voice-interactive embedded systems on resource-constrained microcontrollers. It functions as an IoT conversational device platform that manages live audio input, speech synthesis, and conversational state transitions to facilitate real-time natural language interaction.

The system distinguishes itself by bridging language models with physical hardware through standardized protocols, allowing for the execution of commands on local peripherals or remote smart home services. It utilizes a specialized architecture to coordinate audio buffering, task scheduling, and network connectivity, ensuring that voice interactions remain coherent and responsive on low-power hardware.

Beyond core voice processing, the platform supports the configuration of custom assets such as wake words and visual themes, while providing integrated monitoring for battery levels and display feedback. The firmware is structured to handle complex interactions by mapping processed voice commands to specific hardware signals and external service requests.

## Tags

### Artificial Intelligence & ML

- [Voice Assistants](https://awesome-repositories.com/f/artificial-intelligence-ml/voice-assistants.md) — Open-source firmware platform for building voice-interactive embedded systems.
- [Voice Interaction Systems](https://awesome-repositories.com/f/artificial-intelligence-ml/voice-interaction-systems.md) — Builds conversational hardware devices that process live audio for natural language interaction.
- [Conversational Audio Streams](https://awesome-repositories.com/f/artificial-intelligence-ml/text-to-speech/conversational-audio-streams.md) — Facilitates real-time voice interaction through integrated speech recognition and synthesis. ([source](https://cdn.jsdelivr.net/gh/78/xiaozhi-esp32@main/README.md))
- [Personalized Assistants](https://awesome-repositories.com/f/artificial-intelligence-ml/voice-assistants/personalized-assistants.md) — Supports user identification and tailored interaction styles for voice-enabled hardware.

### Hardware & IoT

- [IoT Platforms](https://awesome-repositories.com/f/hardware-iot/connectivity-iot/internet-of-things/iot-platforms.md) — Framework for connecting embedded hardware to cloud-based voice services and smart home ecosystems.
- [Firmware Management](https://awesome-repositories.com/f/hardware-iot/firmware-management.md) — Provides a platform for developing custom firmware for voice-interactive embedded systems.
- [Home Automation](https://awesome-repositories.com/f/hardware-iot/connectivity-iot/home-automation.md) — Connects voice-controlled hardware to home automation platforms for physical device control.
- [Remote Hardware Controllers](https://awesome-repositories.com/f/hardware-iot/remote-hardware-controllers.md) — Executes commands on local peripherals using standardized protocols for automated task completion. ([source](https://cdn.jsdelivr.net/gh/78/xiaozhi-esp32@main/README.md))

### Graphics & Multimedia

- [Audio Processing Systems](https://awesome-repositories.com/f/graphics-multimedia/media-processing-analysis/audio-processing-systems.md) — Specialized architecture for managing live voice input and speech synthesis on embedded hardware.

### Operating Systems & Systems Programming

- [Interrupt-Driven Buffers](https://awesome-repositories.com/f/operating-systems-systems-programming/kernel-core-internals/process-and-memory-management/memory-management/buffer-and-cache-management/binary-buffer-managers/trace-buffer-managers/audio-buffers/interrupt-driven-buffers.md) — Uses hardware interrupts to manage real-time audio streams on resource-constrained microcontrollers.

### Software Engineering & Architecture

- [State Machine Logic](https://awesome-repositories.com/f/software-engineering-architecture/state-machine-logic.md) — Coordinates conversational flow transitions between listening, processing, and speaking modes.

### System Administration & Monitoring

- [Firmware Power Optimization](https://awesome-repositories.com/f/system-administration-monitoring/power-optimization-utilities/firmware-power-optimization.md) — Optimizes firmware builds to enable complex voice interactions on low-power hardware.

### Web Development

- [API Bridges](https://awesome-repositories.com/f/web-development/api-bridges.md) — Maps natural language commands to remote API endpoints for smart home and service interaction.
