# idootop/mi-gpt

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/idootop-mi-gpt).**

12,458 stars · 1,736 forks · TypeScript · MIT · archived

## Links

- GitHub: https://github.com/idootop/mi-gpt
- awesome-repositories: https://awesome-repositories.com/repository/idootop-mi-gpt.md

## Description

mi-gpt is a voice assistant bridge and agent orchestrator that connects smart speakers to large language models. It functions as an integration layer that routes audio requests from hardware speakers to AI providers and converts generated text back into speech via a customizable synthesis system.
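
The request path described above can be pictured as a small loop: read an utterance from the speaker, pass it to a model, and play the answer back. The following is a minimal sketch under stated assumptions, not mi-gpt's actual code. The `Speaker` and `ChatModel` interfaces, `bridgeOnce`, `FakeSpeaker`, and `echoModel` are all hypothetical names introduced for illustration.

```typescript
// Hypothetical interfaces: none of these names come from mi-gpt itself.
interface Speaker {
  // Returns the next unhandled user utterance, or null when there is none.
  pollUtterance(): Promise<string | null>;
  // Plays the given text on the speaker through some TTS provider.
  say(text: string): Promise<void>;
}

interface ChatModel {
  reply(prompt: string): Promise<string>;
}

// One pass of the bridge: fetch an utterance, ask the model, speak the answer.
// Resolves to true when an utterance was handled, false when the queue was empty.
async function bridgeOnce(speaker: Speaker, model: ChatModel): Promise<boolean> {
  const utterance = await speaker.pollUtterance();
  if (utterance === null) return false;
  const answer = await model.reply(utterance);
  await speaker.say(answer);
  return true;
}

// In-memory stand-ins so the sketch runs without hardware or network access.
class FakeSpeaker implements Speaker {
  readonly spoken: string[] = [];
  private readonly queue: string[];

  constructor(queue: string[]) {
    this.queue = [...queue];
  }

  async pollUtterance(): Promise<string | null> {
    return this.queue.shift() ?? null;
  }

  async say(text: string): Promise<void> {
    this.spoken.push(text);
  }
}

const echoModel: ChatModel = {
  reply: async (prompt) => `You said: ${prompt}`,
};
```

A real deployment would call `bridgeOnce` repeatedly on a timer and swap the fakes for adapters that talk to the speaker and to an LLM provider.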

The project features a retrieval-augmented generation knowledge base that uses embeddings and external documents to provide context-aware responses. It includes a persona definition system for configuring behavioral rules, system prompts, and roleplay characteristics, alongside a plugin architecture for extending specialized behaviors and tool execution.
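
To make the retrieval step concrete, here is a minimal sketch of embedding-based lookup feeding a persona prompt. It assumes documents already carry precomputed embedding vectors and uses cosine similarity for ranking; both are illustrative choices, and the names `Doc`, `cosineSimilarity`, `retrieveTopK`, and `buildPrompt` are hypothetical rather than taken from the project.

```typescript
// Hypothetical document shape: text plus a precomputed embedding vector.
interface Doc {
  text: string;
  embedding: number[];
}

// Cosine similarity between two vectors of equal length.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("embedding dimensions differ");
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Returns the k documents whose embeddings are closest to the query embedding.
function retrieveTopK(query: number[], docs: Doc[], k: number): Doc[] {
  return docs
    .map((doc) => ({ doc, score: cosineSimilarity(query, doc.embedding) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k)
    .map((entry) => entry.doc);
}

// Combines a persona description, retrieved facts, and the user's question.
function buildPrompt(persona: string, context: Doc[], question: string): string {
  const facts = context.map((d) => `- ${d.text}`).join("\n");
  return `${persona}\n\nKnown facts:\n${facts}\n\nUser: ${question}`;
}
```

In practice the query embedding would come from the same embedding model used to index the documents, and the resulting prompt would be sent to the configured LLM provider.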

It also supports agent-based home automation driven by voice commands, continuous dialogue that keeps session context across turns, and streaming text-to-speech that reduces audio latency. Additional utilities handle conversation history backups and network routing through SOCKS proxies.
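
Streaming text-to-speech lowers latency because speech can start before the model has finished its answer. One common way to do this is to cut the incoming text stream at sentence boundaries and hand each finished sentence to the speaker right away. The sketch below shows that idea only; `SentenceChunker` is a hypothetical helper, and the punctuation-based split is an assumption, not the project's documented behavior.

```typescript
// Hypothetical helper: buffers streamed text and emits complete sentences.
class SentenceChunker {
  private buffer = "";

  // Appends a streamed fragment and returns any sentences it completed.
  push(fragment: string): string[] {
    this.buffer += fragment;
    const sentences: string[] = [];
    // Lazily match from the start of the buffer up to the first run of
    // sentence-ending punctuation (Latin or CJK), plus trailing whitespace.
    const boundary = /^[\s\S]*?[.!?。！？]+\s*/;
    let match = boundary.exec(this.buffer);
    while (match !== null) {
      const sentence = match[0].trim();
      if (sentence.length > 0) sentences.push(sentence);
      this.buffer = this.buffer.slice(match[0].length);
      match = boundary.exec(this.buffer);
    }
    return sentences;
  }

  // Returns whatever text is left once the stream has ended.
  flush(): string[] {
    const rest = this.buffer.trim();
    this.buffer = "";
    return rest.length > 0 ? [rest] : [];
  }
}
```

Each sentence returned by `push` can be sent to the speaker immediately, with `flush` called once the model signals the end of its response. A punctuation-only split is deliberately naive: it would also break on decimal points, so a production splitter would need extra rules.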

The application is distributed as container images compatible with multiple CPU architectures.

## Tags

### Artificial Intelligence & ML

- [Text-to-Speech](https://awesome-repositories.com/f/artificial-intelligence-ml/text-to-speech.md) — Synthesizes generated text into spoken audio via a voice provider for playback on smart speakers. ([source](https://github.com/idootop/mi-gpt/blob/main/docs/how-it-works.md))
- [Hybrid Short-and-Long Term Memory](https://awesome-repositories.com/f/artificial-intelligence-ml/agent-architectures/memory-management-systems/long-term-memory-stores/hybrid-short-and-long-term-memory.md) — Implements a memory architecture that integrates immediate session context with persistent long-term storage for coherent dialogues.
- [Vector Knowledge Bases](https://awesome-repositories.com/f/artificial-intelligence-ml/agent-memory-stores/weaviate-knowledge-stores/vector-knowledge-bases.md) — Utilizes vector embeddings and external documents to inject factual data into the prompt context.
- [Knowledge Base Retrieval](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-rag-development/knowledge-base-retrieval.md) — Uses embeddings and external documents to provide responses based on wikis and historical data. ([source](https://github.com/idootop/mi-gpt/blob/main/docs/roadmap.md))
- [Voice Interaction Management](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-systems-frameworks/conversational-voice-interaction/voice-agents/voice-activity-detection/wake-word-detection/voice-interaction-management.md) — Coordinates the loop of polling device conversations and forwarding them to a language model for response generation. ([source](https://github.com/idootop/mi-gpt/blob/main/docs/how-it-works.md))
- [Conversation Memory Stores](https://awesome-repositories.com/f/artificial-intelligence-ml/conversation-memory-stores.md) — Stores previous interactions to provide context-aware responses and recall details from earlier conversations. ([source](https://cdn.jsdelivr.net/gh/idootop/mi-gpt@main/README.md))
- [Persona Configurations](https://awesome-repositories.com/f/artificial-intelligence-ml/custom-ai-assistant-development/persona-configurations.md) — Provides a system for defining AI personalities, behavioral rules, and tone through specific persona configurations. ([source](https://cdn.jsdelivr.net/gh/idootop/mi-gpt@main/README.md))
- [Dialogue Context Management](https://awesome-repositories.com/f/artificial-intelligence-ml/dialogue-context-management.md) — Tracks interaction history and session modes to maintain context across multiple conversational turns.
- [LLM Provider Integrations](https://awesome-repositories.com/f/artificial-intelligence-ml/llm-provider-integrations.md) — Implements connectivity and authentication adapters to link the system to external large language model providers. ([source](https://github.com/idootop/mi-gpt/blob/main/docs/settings.md))
- [LLM Voice Bridges](https://awesome-repositories.com/f/artificial-intelligence-ml/llm-voice-bridges.md) — Bridges smart speakers to LLM APIs to enable advanced conversational intelligence and voice control.
- [Persona Definitions](https://awesome-repositories.com/f/artificial-intelligence-ml/model-configuration/role-based-model-assignment/chat-role-assignment/persona-definitions.md) — Defines identity, behavior rules, and prompt templates to personalize interaction dynamics. ([source](https://github.com/idootop/mi-gpt/blob/main/docs/settings.md))
- [System Prompt Configurations](https://awesome-repositories.com/f/artificial-intelligence-ml/prompt-engineering/system-configuration-layers/system-prompt-configurations.md) — Allows the definition of behavioral instructions and roles via system prompt configurations to guide AI responses. ([source](https://github.com/idootop/mi-gpt/blob/main/docs/changelog.md))
- [RAG Context Retrieval](https://awesome-repositories.com/f/artificial-intelligence-ml/rag-context-retrieval.md) — Implements a retrieval-augmented generation system using embeddings and documents to provide context-aware responses.
- [Text-to-Speech Integrations](https://awesome-repositories.com/f/artificial-intelligence-ml/speech-to-text-integrations/text-to-speech-integrations.md) — Sends generated text incrementally to speaker hardware to reduce audio latency.
- [Assistant Personalization](https://awesome-repositories.com/f/artificial-intelligence-ml/voice-assistants/assistant-personalization.md) — Enables the creation of personalized voice assistants with custom personas, system prompts, and behavioral rules.
- [Conversational Flow Controllers](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-systems-frameworks/conversational-voice-interaction/conversational-ai-agents/conversational-turn-detection/conversational-flow-controllers.md) — Manages how the assistant is triggered via keywords to handle single requests or continuous modes. ([source](https://github.com/idootop/mi-gpt/blob/main/docs/faq.md))
- [Voice-Activated Triggers](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-systems-frameworks/conversational-voice-interaction/voice-agents/voice-activity-detection/voice-activated-triggers.md) — Allows switching the active speech voice profile using specific keywords detected in the audio stream. ([source](https://github.com/idootop/mi-gpt/blob/main/docs/tts.md))
- [Persona Assignments](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-systems-frameworks/integration-deployment/agent-frameworks/configuration-and-specifications/agent-persona-definitions/persona-assignments.md) — Updates the agent's identity and behavioral characteristics using natural language commands during active sessions. ([source](https://github.com/idootop/mi-gpt/blob/main/docs/faq.md))
- [Conversation State Managers](https://awesome-repositories.com/f/artificial-intelligence-ml/conversation-state-managers.md) — Allows controlling dialogue modes and clearing conversation context through voice commands. ([source](https://github.com/idootop/mi-gpt/blob/main/docs/roadmap.md))
- [Model Response Streaming](https://awesome-repositories.com/f/artificial-intelligence-ml/model-response-parsers/model-response-aggregation/model-response-streaming.md) — Sends text to the speaker incrementally so audio playback begins immediately during response generation. ([source](https://cdn.jsdelivr.net/gh/idootop/mi-gpt@main/README.md))
- [Conversational Dialogue Systems](https://awesome-repositories.com/f/artificial-intelligence-ml/natural-language-interfaces/conversational-dialogue-systems.md) — Manages speaker states and response streaming to keep the microphone active between conversation turns. ([source](https://github.com/idootop/mi-gpt/blob/main/docs/compatibility.md))
- [Speech Synthesis Customizations](https://awesome-repositories.com/f/artificial-intelligence-ml/speech-synthesis-customizations.md) — Allows replacing default system voices with high-quality models to customize the assistant's auditory profile. ([source](https://cdn.jsdelivr.net/gh/idootop/mi-gpt@main/README.md))
- [Interaction Triggers](https://awesome-repositories.com/f/artificial-intelligence-ml/voice-assistants/voice-personalization/character-appearance-and-voice-customizers/interaction-triggers.md) — Enables the customization of trigger keywords and audio prompts to manage assistant activation and exit sequences. ([source](https://github.com/idootop/mi-gpt/blob/main/docs/settings.md))
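
Several of the tags above describe keyword-driven control: a keyword routes a single request to the model, other keywords enter or leave a continuous mode. A minimal sketch of that flow as a two-state controller follows. The `TriggerConfig` field names, the `Decision` type, and `SessionController` are hypothetical and chosen for illustration; they are not the project's configuration keys.

```typescript
type Mode = "idle" | "continuous";

// Hypothetical configuration: keyword prefixes that drive the controller.
interface TriggerConfig {
  askKeywords: string[]; // route one request to the model
  enterKeywords: string[]; // start continuous mode
  exitKeywords: string[]; // leave continuous mode
}

type Decision =
  | { action: "ignore" }
  | { action: "enter" }
  | { action: "exit" }
  | { action: "ask"; prompt: string };

function startsWithAny(text: string, keywords: string[]): boolean {
  return keywords.some((keyword) => text.startsWith(keyword));
}

class SessionController {
  private mode: Mode = "idle";
  private readonly config: TriggerConfig;

  constructor(config: TriggerConfig) {
    this.config = config;
  }

  // Decides what to do with one utterance based on the current mode.
  handle(utterance: string): Decision {
    const text = utterance.trim();
    if (this.mode === "continuous") {
      if (startsWithAny(text, this.config.exitKeywords)) {
        this.mode = "idle";
        return { action: "exit" };
      }
      // In continuous mode every utterance goes to the model.
      return { action: "ask", prompt: text };
    }
    if (startsWithAny(text, this.config.enterKeywords)) {
      this.mode = "continuous";
      return { action: "enter" };
    }
    if (startsWithAny(text, this.config.askKeywords)) {
      return { action: "ask", prompt: text };
    }
    // Anything else is left to the speaker's default assistant.
    return { action: "ignore" };
  }
}
```

The caller would forward `ask` decisions to the language model and play a confirmation sound or phrase on `enter` and `exit`.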

### Hardware & IoT

- [Smart Speaker AI Integrations](https://awesome-repositories.com/f/hardware-iot/smart-speaker-ai-integrations.md) — Connects smart speakers to large language models to replace default voice assistants with custom AI intelligence.
- [Voice Request Routers](https://awesome-repositories.com/f/hardware-iot/xiaomi-speaker-integrations/voice-request-routers.md) — Routes voice requests from smart speakers to large language models to enable advanced conversational intelligence. ([source](https://github.com/idootop/mi-gpt/blob/main/docs/changelog.md))
- [Smart Speaker Integration Layers](https://awesome-repositories.com/f/hardware-iot/smart-speaker-integration-layers.md) — Routes audio requests from hardware speakers to large language models and text-to-speech engines.
- [Home Automation](https://awesome-repositories.com/f/hardware-iot/connectivity-iot/home-automation.md) — Integrates large language models with smart home devices to execute complex automation tasks via voice commands.
- [Embedded Wake-Word Detections](https://awesome-repositories.com/f/hardware-iot/on-device-applications/embedded-wake-word-detections.md) — Supports the configuration of custom wake words and audio alerts to trigger assistant interactions on hardware. ([source](https://github.com/idootop/mi-gpt/blob/main/docs/changelog.md))

### Part of an Awesome List

- [AI Agent Orchestration](https://awesome-repositories.com/f/awesome-lists/devtools/ai-agent-orchestration.md) — Provides a mechanism to coordinate AI agents and tools for executing complex home automation tasks. ([source](https://github.com/idootop/mi-gpt/blob/main/docs/roadmap.md))
- [ChatGPT 项目](https://awesome-repositories.com/f/awesome-lists/more/chatgpt.md) — Listed in the “ChatGPT 项目” (ChatGPT Projects) section of the Great Open Source Project awesome list.

### Development Tools & Productivity

- [AI Agent Orchestrators](https://awesome-repositories.com/f/development-tools-productivity/backend-orchestrators/ai-agent-orchestrators.md) — Coordinates AI agents and smart home devices to execute tasks based on natural language intent.
- [Plugin Systems](https://awesome-repositories.com/f/development-tools-productivity/plugin-systems.md) — Provides a plugin system to extend the assistant's capabilities and specialized behaviors through modular extensions. ([source](https://github.com/idootop/mi-gpt/blob/main/docs/roadmap.md))

### Software Engineering & Architecture

- [Intent-to-Skill Mappings](https://awesome-repositories.com/f/software-engineering-architecture/intent-based-coordination/intent-to-skill-mappings.md) — Maps natural language user intent to specific smart home devices or internet tools for task execution.
- [AI Provider Gateways](https://awesome-repositories.com/f/software-engineering-architecture/api-gateways/ai-provider-gateways.md) — Routes conversational requests and speech synthesis tasks to third-party providers via configurable endpoints.
- [Modular Plugin Extensions](https://awesome-repositories.com/f/software-engineering-architecture/modular-plugin-extensions.md) — Integrates specialized behaviors and tool execution capabilities through a modular plugin system.

### Data & Databases

- [Web Data Retrievers](https://awesome-repositories.com/f/data-databases/real-time-data-integration-platforms/web-data-retrievers.md) — Retrieves real-time information from the internet to provide current data within AI-generated responses. ([source](https://github.com/idootop/mi-gpt/blob/main/docs/roadmap.md))

### Graphics & Multimedia

- [Voice Customization](https://awesome-repositories.com/f/graphics-multimedia/media-processing-analysis/audio-processing-systems/audio-processing/text-to-speech-engines/text-to-speech-engines/voice-customization.md) — Routes text-to-speech processing through external providers to customize the auditory profile of the voice assistant.

### Web Development

- [Speech Delivery Optimization](https://awesome-repositories.com/f/web-development/backend-development/request-response-handling/http-response-handling/response-streaming/speech-delivery-optimization.md) — Controls whether output is sent incrementally or as a complete sentence to improve speaker delivery speed. ([source](https://github.com/idootop/mi-gpt/blob/main/docs/changelog.md))
