# khoj-ai/khoj

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/khoj-ai-khoj).**

32,535 stars · 1,965 forks · Python · agpl-3.0

## Links

- GitHub: https://github.com/khoj-ai/khoj
- Homepage: https://khoj.dev
- awesome-repositories: https://awesome-repositories.com/repository/khoj-ai-khoj.md

## Topics

`agent` `ai` `assistant` `chat` `chatgpt` `emacs` `image-generation` `llama3` `llamacpp` `llm` `obsidian` `obsidian-md` `offline-llm` `productivity` `rag` `research` `self-hosted` `semantic-search` `stt` `whatsapp-ai`

## Description

Khoj is a self-hosted artificial intelligence platform designed for personal knowledge management and semantic information retrieval. It functions as a private assistant that indexes your local documents, notes, and external workspaces, allowing you to interact with your data through natural language queries and conversational chat. By maintaining a local-first architecture, the system ensures that your information remains under your control while providing context-aware responses grounded in your personal knowledge base.

The platform distinguishes itself through a modular, cross-platform integration layer that embeds intelligent search and chat capabilities directly into your existing workflows. Whether you are working within text editors, web browsers, or mobile messaging applications, Khoj provides a unified interface to your data. It supports advanced retrieval strategies, such as dual-model architectures for semantic mapping and real-time internet grounding, which allow the assistant to synthesize private notes with external information while providing clear source citations.

Beyond its core retrieval capabilities, the system offers a comprehensive suite of tools for data orchestration and research automation. It includes a pluggable ingestion pipeline for diverse file formats, automated query scheduling, and the ability to execute code or generate visual content directly within the chat interface. Users can configure custom agents, manage model routing, and secure their deployments with multi-user authentication, making it suitable for both individual use and enterprise-grade environments.

## Tags

### Artificial Intelligence & ML

- [Personal AI Assistants](https://awesome-repositories.com/f/artificial-intelligence-ml/personal-ai-assistants.md) — Enables the creation of a personal AI assistant that indexes local documents and web content to automate research. ([source](https://github.com/khoj-ai/khoj))
- [Semantic Search Engines](https://awesome-repositories.com/f/artificial-intelligence-ml/semantic-search-engines.md) — Performs context-aware searches across diverse file formats using vector embeddings.
- [Retrieval Architectures](https://awesome-repositories.com/f/artificial-intelligence-ml/retrieval-architectures.md) — Implements dual-model architectures to map queries and documents into a shared vector space for context-aware retrieval.
- [Retrieval-Augmented Generation](https://awesome-repositories.com/f/artificial-intelligence-ml/retrieval-augmented-generation.md) — Grounds responses in real-time internet data by automatically performing web searches when fresh information is required. ([source](https://docs.khoj.dev/category/features))
- [Self-Hosted AI Models](https://awesome-repositories.com/f/artificial-intelligence-ml/self-hosted-ai-models.md) — Khoj allows deployment of the AI assistant on private infrastructure to maintain control over data, configuration, and model selection. ([source](https://github.com/khoj-ai/khoj))
- [Agent Configurations](https://awesome-repositories.com/f/artificial-intelligence-ml/agent-configurations.md) — Allows users to define custom system prompts to create specialized agents that perform specific roles. ([source](https://docs.khoj.dev/category/features))
- [Inference Orchestrators](https://awesome-repositories.com/f/artificial-intelligence-ml/inference-orchestrators.md) — Manages model execution across local hardware and remote gateways to balance privacy and performance.
- [Local Model Runners](https://awesome-repositories.com/f/artificial-intelligence-ml/local-model-runners.md) — Connects to a local model runner to execute large language models directly on self-hosted hardware. ([source](https://docs.khoj.dev/category/advanced-self-hosting))
- [AI Code Interpreters](https://awesome-repositories.com/f/artificial-intelligence-ml/ai-code-interpreters.md) — Generates and runs Python code to perform data analysis, create visualizations, or handle complex quantitative tasks. ([source](https://docs.khoj.dev/category/features))
- [Chat Interaction Controls](https://awesome-repositories.com/f/artificial-intelligence-ml/chat-interaction-controls.md) — Supports commands within chat queries to specify information sources, generate images, or run code snippets for precise responses. ([source](https://docs.khoj.dev/features/chat))

### Business & Productivity Software

- [Personal Knowledge Management Systems](https://awesome-repositories.com/f/business-productivity-software/personal-knowledge-management-systems.md) — Organizes and retrieves information from private notes and documents using natural language search and AI-assisted discovery.
- [Note Taking Search](https://awesome-repositories.com/f/business-productivity-software/note-taking-search.md) — Searches through personal notes using natural language queries powered by machine learning models. ([source](https://docs.khoj.dev/category/features))
- [Workspace Integrations](https://awesome-repositories.com/f/business-productivity-software/workspace-integrations.md) — Connects Notion workspaces by configuring API keys and integration settings to enable searching and chatting with external data. ([source](https://docs.khoj.dev/category/data-sources))

### Data & Databases

- [Semantic Search Engines](https://awesome-repositories.com/f/data-databases/semantic-search-engines.md) — A retrieval tool that uses vector embeddings to perform conceptual searches across diverse personal data sources and local file systems. ([source](https://docs.khoj.dev/features/search))
- [Vector Indexing Engines](https://awesome-repositories.com/f/data-databases/vector-indexing-engines.md) — Converts unstructured text into high-dimensional numerical embeddings to enable fast similarity searches.
- [Data Indexing Tools](https://awesome-repositories.com/f/data-databases/data-indexing-tools.md) — Processes diverse data sources including documents, code repositories, and web pages using modular natural language models. ([source](https://docs.khoj.dev/features/all-features))
- [Data Ingestion Pipelines](https://awesome-repositories.com/f/data-databases/data-ingestion-pipelines.md) — Uses pluggable connectors to normalize and index diverse file formats into a unified searchable knowledge base.
- [Data Ingestion Tools](https://awesome-repositories.com/f/data-databases/data-ingestion-tools.md) — Enables users to upload personal data files to the platform for searching, chatting, and interacting with their own documents. ([source](https://docs.khoj.dev/category/data-sources))
- [Data Synchronization Tools](https://awesome-repositories.com/f/data-databases/data-synchronization-tools.md) — Combines personal notes, documents, and folders from various sources for use in unified chat interactions. ([source](https://docs.khoj.dev/get-started/setup))
- [Incremental Indexing Mechanisms](https://awesome-repositories.com/f/data-databases/incremental-indexing-mechanisms.md) — Monitors file systems and data sources to incrementally update the search index upon content changes.
- [Query Filtering Systems](https://awesome-repositories.com/f/data-databases/query-filtering-systems.md) — Narrows down search results and chat responses by applying structured query syntax to target specific entries. ([source](https://docs.khoj.dev/category/miscellaneous))

### DevOps & Infrastructure

- [Self-Hosted AI Infrastructure](https://awesome-repositories.com/f/devops-infrastructure/self-hosted-ai-infrastructure.md) — Deploys and manages private artificial intelligence services on local hardware to maintain full control over data and processing.
- [Enterprise Deployment Solutions](https://awesome-repositories.com/f/devops-infrastructure/enterprise-deployment-solutions.md) — Khoj provides enterprise-grade deployment options, offering secure access and custom integration capabilities for teams and businesses. ([source](https://github.com/khoj-ai/khoj))
- [Self-Hosted AI Platforms](https://awesome-repositories.com/f/devops-infrastructure/self-hosted-ai-platforms.md) — Manages private data indexing and model orchestration while maintaining full control over user information.
- [Self-Hosted Applications](https://awesome-repositories.com/f/devops-infrastructure/self-hosted-applications.md) — Khoj supports deployment on private hardware to maintain full control over data processing, storage, and personal information indexing. ([source](https://docs.khoj.dev/category/get-started))

### Security & Cryptography

- [Private Data Privacy Tools](https://awesome-repositories.com/f/security-cryptography/private-data-privacy-tools.md) — Enables conversational interaction with personal data while ensuring privacy through local-first processing and secure remote access.
- [Authentication Strategies](https://awesome-repositories.com/f/security-cryptography/authentication-strategies.md) — Secures multi-user access for self-hosted instances using authentication methods like Magic Links or OAuth. ([source](https://docs.khoj.dev/category/advanced-self-hosting))

### Development Tools & Productivity

- [Editor AI Assistants](https://awesome-repositories.com/f/development-tools-productivity/editor-ai-assistants.md) — Enables iterative AI-assisted conversations for content generation and discovery within the text editor. ([source](https://docs.khoj.dev/clients/emacs))
- [Note-Taking App AI Assistants](https://awesome-repositories.com/f/development-tools-productivity/note-taking-app-ai-assistants.md) — Facilitates AI-assisted information retrieval and content generation within the note-taking application. ([source](https://docs.khoj.dev/clients/obsidian))
- [Editor Search Integrations](https://awesome-repositories.com/f/development-tools-productivity/editor-search-integrations.md) — Performs natural language searches across private notes or the public internet directly from the editor. ([source](https://docs.khoj.dev/clients/emacs))
- [Local File Syncing](https://awesome-repositories.com/f/development-tools-productivity/local-file-syncing.md) — Connects local folders to the application to provide context for search results and conversational responses. ([source](https://docs.khoj.dev/clients/desktop))
- [Cross-Platform Tooling](https://awesome-repositories.com/f/development-tools-productivity/cross-platform-tooling.md) — Connects AI-powered search and chat capabilities across multiple editors, browsers, and mobile devices.
- [Editor Integrations](https://awesome-repositories.com/f/development-tools-productivity/editor-integrations.md) — Khoj allows users to interact with their personal knowledge base and second brain directly from within the text editor environment. ([source](https://docs.khoj.dev/category/clients))
- [Editor Plugins](https://awesome-repositories.com/f/development-tools-productivity/editor-plugins.md) — Embeds intelligent search and chat capabilities directly into existing text editors and messaging applications.
- [Note-Taking App Knowledge Interfaces](https://awesome-repositories.com/f/development-tools-productivity/note-taking-app-knowledge-interfaces.md) — Allows interaction with personal knowledge bases and second brains directly from within the note-taking application. ([source](https://docs.khoj.dev/category/clients))

### Web Development

- [Web Search Interfaces](https://awesome-repositories.com/f/web-development/web-search-interfaces.md) — Performs fast, incremental, and local-first searches across personal data for a responsive search experience. ([source](https://docs.khoj.dev/clients/web))
- [Web Chat Interfaces](https://awesome-repositories.com/f/web-development/web-chat-interfaces.md) — Offers a web-based AI assistant for retrieving information from private notes or the internet. ([source](https://docs.khoj.dev/clients/web))
- [Web Interfaces](https://awesome-repositories.com/f/web-development/web-interfaces.md) — Provides a standard web browser interface to interact with personal knowledge bases and retrieve information from any device. ([source](https://docs.khoj.dev/category/clients))

### User Interface & Experience

- [Desktop Chat Interfaces](https://awesome-repositories.com/f/user-interface-experience/desktop-chat-interfaces.md) — Provides a dedicated desktop application for natural language conversations to retrieve information from personal collections. ([source](https://docs.khoj.dev/category/clients))
