We curate 22 open-source GitHub repositories matching "open source chatgpt alternatives". Results are ranked by relevance to your query — pick filters below to narrow, or refine with AI.
This is a full-featured chatbot framework and Next.js web application designed for integrating various large language model providers into a web interface. It serves as a template for building AI chatbots that can generate text and structured data through a unified interface. The project functions as an authenticated AI application, incorporating built-in user identity verification and session management. It includes a suite for AI tool integration, allowing language models to execute tool calls and generate structured objects by connecting to external data and functions. The framework provi
Vercel AI Chatbot is a self-hostable Next.js web application and chatbot framework that integrates multiple LLM providers, includes authentication, conversation memory, and tool support, making it a comprehensive open-source alternative to ChatGPT that fits your requirements for a local or self-hosted conversational AI assistant with a chat UI and streaming responses.
Open WebUI is a self-hosted, web-based platform designed for interacting with local and remote artificial intelligence models. It functions as a unified interface and orchestration suite, enabling users to build, deploy, and manage specialized AI agents equipped with custom instructions, external tool access, and private knowledge bases. The platform distinguishes itself through a modular architecture that supports complex AI workflows. It features a plugin-based framework for custom logic and pipeline-based request processing, allowing developers to filter or transform data streams before th
Open WebUI is a self-hosted web interface that lets you chat with local or remote LLMs, supports model switching, custom prompts, streaming, and conversation memory, making it a comprehensive ChatGPT alternative for your own infrastructure.
NextChat is a self-hosted web application that provides a unified interface for interacting with multiple large language models. It functions as a conversational platform where users can manage and switch between diverse AI providers through configurable API backends, maintaining full control over their data and infrastructure. The platform features a persistent session layer designed to handle long-running dialogues by managing message history and context. It distinguishes itself through a structured prompt engineering environment that allows for the development and application of templates
NextChat is a self-hosted web UI that unifies multiple LLM backends (including Ollama for local models) with streaming, custom prompts, and conversation memory, making it a direct open-source alternative to ChatGPT.
SillyTavern is a comprehensive interface and orchestration platform designed for immersive AI roleplay and interactive chat experiences. It functions as a unified gateway that connects users to a wide array of local and cloud-based large language models, providing a centralized environment to manage complex character personas, narrative context, and model-driven interactions. The platform distinguishes itself through its advanced prompt engineering and automation capabilities. It utilizes a sophisticated macro-based templating engine and vector-database retrieval to dynamically inject lore, c
SillyTavern is a self-hostable LLM chat interface that connects to multiple local and cloud backends with streaming, custom prompts, and conversation memory via vector-database retrieval—directly matching your need for a locally-run ChatGPT alternative.
Danswer is an LLM application framework and RAG engine that provides a self-hosted interface for connecting large language models to private data. It serves as an enterprise AI chat interface and agent orchestrator, enabling the creation of specialized assistants with custom instructions and knowledge bases. The platform differentiates itself through an observability dashboard for tracking query history and token consumption, as well as a white-labeled interface for customized branding. It includes a multi-step research workflow for producing long-form reports and a sandboxed environment for
Danswer offers a self-hostable, LLM-powered chat interface with custom instructions, conversation memory, and support for multiple LLM backends, making it a comprehensive conversational AI assistant alternative to ChatGPT that meets all your listed features.
Chatbox is a cross-platform desktop application that provides a unified interface for interacting with a wide range of artificial intelligence models. It functions as a model-agnostic client, allowing users to connect to various third-party AI providers or execute open-source models directly on their own hardware. By centralizing these diverse services into a single workspace, the application enables users to manage multiple chat sessions, adjust model parameters, and switch between different AI backends with ease. The project distinguishes itself through a local-first architecture that prior
Chatbox is a cross-platform desktop client that connects to multiple LLM backends (including local models via Ollama) and offers a unified chat interface with session management, making it a self-hostable alternative to ChatGPT that covers most of your desired features like streaming and multiple model support.
Wenda is an LLM orchestration platform and custom workflow engine designed to manage multiple language model backends through a unified interface. It functions as a self-hosted AI gateway that enables the execution of complex task sequences and automated conversation flows. The system utilizes JavaScript plugins to orchestrate workflows and trigger external API calls. It supports retrieval augmented generation by injecting relevant data from vector stores and offline files into prompts to increase response accuracy. The platform is built for private network deployments, featuring multi-user
Wenda is a self-hosted LLM orchestration platform with a multi-user chat interface, supporting multiple model backends, streaming, custom prompts, and conversation memory — fitting the search for an open-source, locally-run ChatGPT alternative.
GPT4All is a cross-platform runtime environment designed to execute large language models directly on local consumer hardware. By leveraging an optimized C++ inference backend, it enables private, offline AI interactions without requiring an internet connection or external cloud services. The project provides a comprehensive ecosystem for managing the entire model lifecycle, including discovery, downloading, and configuration of local weights. What distinguishes the platform is its integrated retrieval-augmented generation engine, which allows users to index local documents into semantic vect
gpt4all is a self-hostable, offline-capable runtime that lets you run large language models locally with a built-in chat interface, document indexing, and conversation history, making it a comprehensive open-source alternative to ChatGPT that meets your feature needs.
This project is a self-hosted large language model chat interface and AI model aggregator. It provides a unified web environment for interacting with multiple AI providers and local models, acting as a provider-agnostic API gateway to standardize requests across different endpoints. The platform functions as an agentic AI framework and generative UI workspace, enabling the construction of specialized assistants with custom instructions and subagents. It features a sandboxed code interpreter for secure execution of multiple programming languages and a generative UI system that renders interact
This repository is a self-hosted large language model chat interface and AI model aggregator that provides a unified web environment with support for multiple AI providers and local models, streaming, custom instructions, and conversation memory — exactly the comprehensive ChatGPT alternative this search describes.
This project is a comprehensive platform for hosting and interacting with large language models directly on local hardware. It provides a web-based graphical interface that allows users to manage model loading, configure generation parameters, and execute text or chat interactions entirely offline. By running models locally, the software ensures complete data privacy and eliminates reliance on external cloud services for generative tasks. Beyond basic inference, the platform functions as a versatile workbench for generative AI development. It includes an integrated pipeline for fine-tuning mo
Text Generation WebUI is a comprehensive self-hosted platform for running and interacting with large language models via a chat web interface, supporting multiple model backends, streaming, custom prompts, and conversation memory — exactly the open-source ChatGPT alternative you're looking for.
Dify is an open-source platform for building, orchestrating, and deploying generative AI applications and autonomous agents. It provides a visual development environment that allows users to design complex, multi-step logic chains and conversational flows, which can then be published as APIs, web interfaces, or embedded widgets. The platform acts as a centralized infrastructure layer, managing model connections, prompt templates, and knowledge retrieval to support scalable AI-powered services. What distinguishes the platform is its focus on stateful application design and workflow orchestrati
Dify is a complete open-source platform for building and deploying generative AI applications, including conversational assistants, with support for multiple LLM backends, self-hosting, a chat web UI, streaming responses, custom prompts, and conversation memory — directly fitting your need for a self-hosted ChatGPT alternative.
Chatbot-ui is a self-hosted AI dashboard and LLM chat interface that serves as a centralized hub for interacting with multiple artificial intelligence providers. It functions as a multi-provider AI client and model orchestrator, allowing users to send prompts and receive responses from various large language models through a unified conversational interface. The project enables multi-model AI chat within a single workspace, allowing for the comparison of outputs and capabilities across different backends. It provides a private frontend for AI workspace management, where users can organize cha
Chatbot-ui is a self-hosted AI dashboard that provides a unified chat interface for multiple LLM providers, supporting streaming responses, conversation memory, and custom prompts, making it a comprehensive alternative to ChatGPT for local deployment.
This project is a terminal-based command line interface client and agent orchestrator for interacting with multiple large language model providers. It functions as an OpenAI API client and a local API gateway that exposes chat completions and embeddings through an HTTP server. The system distinguishes itself by providing a retrieval-augmented generation tool for indexing local files and URLs into a vector database to provide custom document context. It allows for the creation of specialized AI agents that combine custom system prompts with tool calling and external function execution. The to
aichat is a local command-line client and API gateway that works with multiple LLM providers and supports custom agents, streaming, and document context via RAG, making it a solid self-hosted conversational AI assistant — but it lacks a built-in web chat UI, relying on a terminal interface instead.
Jan is a desktop application that functions as a local artificial intelligence model runtime and an open-standard API server. It enables the execution of large language models directly on local hardware, ensuring that data remains private and accessible offline while providing a unified interface for managing model weights and inference runtimes. The platform distinguishes itself by offering a modular inference backend that allows users to swap execution engines based on hardware compatibility and performance needs. It acts as a cross-platform orchestrator, providing the ability to switch bet
Jan is a desktop application that runs local LLMs privately offline, supports multiple inference backends (like llama.cpp), and provides a chat interface for conversational AI, fitting your need for a self-hosted alternative to ChatGPT—though its primary interface is a desktop app rather than a web UI.
This project is a Bing AI proxy gateway and web-based chat interface. It functions as a Go reverse proxy that routes HTTP traffic to external AI endpoints, allowing users to bypass regional network restrictions and login requirements. The system utilizes SOCKS5 proxy tunneling and environment-variable configuration to circumvent IP-based blocking and regional firewalls. It manages authentication by injecting predefined session cookies into outgoing requests to maintain private AI sessions and unlock personalized features. The application integrates a Vue-based single page application for the
This is a self-hosted web chat interface that proxies to Microsoft's Bing AI, giving you a conversational assistant with streaming responses and conversation memory, but it only supports a single external backend rather than multiple LLM options.
Cherry Studio is a cross-platform desktop application that serves as a centralized workspace for managing and interacting with multiple artificial intelligence models. It functions as a local-first orchestrator, prioritizing user privacy by storing all conversation history and knowledge bases directly on your device. By providing a unified interface for both cloud-based and local AI services, the platform simplifies API key management and allows for consistent model interaction across different operating systems. The application distinguishes itself through a robust retrieval-augmented genera
Cherry Studio is a cross-platform desktop client for managing and chatting with multiple cloud and local LLMs, storing conversations locally for privacy—making it a self-hostable ChatGPT alternative, though its interface is a desktop app rather than a web UI.
DeepReasoning is a self-hosted AI gateway and chat interface that provides an LLM inference API. It functions as a bridge that merges reasoning traces from DeepSeek R1 with the generative capabilities of Claude models to facilitate complex problem solving. The system is delivered as a dockerized application, allowing for deployment on private infrastructure. This architecture enables private LLM inference and secure local management of API keys and authentication tokens on user-controlled hardware. The project covers multi-model orchestration by combining chain-of-thought reasoning and gener
DeepReasoning is a self-hosted AI gateway and chat interface that integrates multiple LLMs (DeepSeek and Claude) with streaming and reasoning capabilities, making it a fitting conversational AI assistant for private deployment, though its multi-model support is limited to those two models.
Serge is a self-hosted web chat interface for running large language models locally using the llama.cpp inference engine. It loads GGUF-format model files directly on your own machine, removing the need for internet connectivity or external API keys, and streams responses to the browser in real time via WebSocket connections. The project is packaged for containerized deployment using Docker and Docker Compose, with a Traefik reverse proxy that handles HTTP and WebSocket routing along with automatic TLS certificate management. Ready-made Kubernetes manifests are also provided, enabling deploym
Serge is a self-hosted web chat interface for running LLMs locally with llama.cpp, offering streaming responses and Docker deployment — it fits your need for a local AI assistant, though it is limited to GGUF models from a single inference engine rather than supporting multiple backends.
Khoj is a self-hosted artificial intelligence platform designed for personal knowledge management and semantic information retrieval. It functions as a private assistant that indexes your local documents, notes, and external workspaces, allowing you to interact with your data through natural language queries and conversational chat. By maintaining a local-first architecture, the system ensures that your information remains under your control while providing context-aware responses grounded in your personal knowledge base. The platform distinguishes itself through a modular, cross-platform int
Khoj is a self-hosted AI assistant that indexes your personal documents and enables conversational chat with LLMs, which aligns closely with your need for a local ChatGPT alternative, though its focus on personal knowledge management may be narrower than a general-purpose chat interface.
Flowise is a low-code platform designed for building and deploying complex language model workflows through a visual, node-based interface. It functions as an orchestrator for autonomous multi-agent systems, allowing users to construct conversational pipelines by connecting language models, memory stores, and external tools on a drag-and-drop canvas. The platform distinguishes itself through its support for sophisticated agentic patterns, including supervisor-worker delegation and iterative reasoning strategies. Users can design directed acyclic graphs to manage conditional branching, state p
Flowise is a low-code platform for building conversational AI workflows with a node-based editor, supporting multiple LLMs, self-hosting, and a chat web UI, making it a flexible alternative to ChatGPT that covers the key required features.
Chatbox is a desktop client and multi-provider chat interface for interacting with large language model APIs across various service providers and local installations. It functions as a local-first AI conversation manager that stores chat history and user settings directly on the device. The application provides a unified interface to connect multiple AI backends for text generation and image creation. It includes a specialized rendering system for AI responses that supports technical documentation through syntax highlighting, Markdown, and Latex mathematical notation. The platform manages pr
Chatbox is a desktop chat client that connects to multiple LLM backends (including local ones) and stores conversations locally—exactly the kind of local-only or self-hosted ChatGPT alternative this search wants, though its interface is a desktop app rather than a web UI.
This is a browser-based LLM chat interface that runs models locally via WebGPU, providing a self-contained alternative to ChatGPT with support for multiple open-source models and streaming responses.