# tobi/qmd

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/tobi-qmd).**

9,498 stars · 517 forks · TypeScript · mit

## Links

- GitHub: https://github.com/tobi/qmd
- awesome-repositories: https://awesome-repositories.com/repository/tobi-qmd.md

## Description

qmd is a local semantic search engine and RAG knowledge base indexer that functions as a Model Context Protocol server. It converts local documents, markdown files, and codebases into a searchable database to provide retrieval augmented generation capabilities for AI agents.

The system exposes its search and retrieval tools via stdio or HTTP. It utilizes local model files for embeddings and reranking, supporting query expansion across multiple languages.

The project employs abstract syntax tree based chunking to split source code at function and class boundaries. It implements hybrid vector-keyword indexing and metadata-driven context assignment to improve retrieval accuracy, while operating as a background daemon to maintain model residency in memory.

## Tags

### Artificial Intelligence & ML

- [Semantic Search Engines](https://awesome-repositories.com/f/artificial-intelligence-ml/semantic-search-engines.md) — Functions as a local semantic search engine indexing code and markdown for meaning-based retrieval.
- [Local Document Indexing](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-rag-development/knowledge-base-retrieval/local-document-indexing.md) — Indexes local documents into vector stores using semantic chunking to support RAG pipelines.
- [Model Context Protocol](https://awesome-repositories.com/f/artificial-intelligence-ml/agentic-systems-frameworks/model-integration-serving/model-integration-interfaces/model-context-protocol.md) — Uses the Model Context Protocol to connect local document search tools to AI agents.
- [RAG Document Retrieval](https://awesome-repositories.com/f/artificial-intelligence-ml/documentation-retrieval-engines/rag-document-retrieval.md) — Provides retrieval of specific document snippets and line ranges to serve as grounded context for LLM responses. ([source](https://cdn.jsdelivr.net/gh/tobi/qmd@main/README.md))
- [MCP Servers](https://awesome-repositories.com/f/artificial-intelligence-ml/mcp-servers.md) — Provides a Model Context Protocol server implementation to expose search and retrieval tools to AI agents. ([source](https://cdn.jsdelivr.net/gh/tobi/qmd@main/README.md))
- [Model Context Protocol Servers](https://awesome-repositories.com/f/artificial-intelligence-ml/model-context-protocol-servers.md) — Acts as an MCP server that exposes local search and retrieval tools to large language models.
- [Code-Aware Chunking](https://awesome-repositories.com/f/artificial-intelligence-ml/semantic-chunking/code-aware-chunking.md) — Implements AST-based chunking to ensure source code is split at logical function and class boundaries.
- [Semantic Search](https://awesome-repositories.com/f/artificial-intelligence-ml/semantic-search.md) — Implements semantic search that understands query intent across local files using hybrid retrieval and reranking.
- [RAG Pipelines](https://awesome-repositories.com/f/artificial-intelligence-ml/language-model-orchestration/retrieval-augmented-generation/rag-pipelines.md) — Provides a RAG pipeline using local embeddings and query expansion to augment AI model outputs.
- [Local Embedding Generators](https://awesome-repositories.com/f/artificial-intelligence-ml/local-embedding-generators.md) — Generates vector embeddings and performs reranking locally using on-device model files.
- [Local Model Integrations](https://awesome-repositories.com/f/artificial-intelligence-ml/local-model-integrations.md) — Integrates local model files to perform multi-language embeddings and reranking without external APIs. ([source](https://cdn.jsdelivr.net/gh/tobi/qmd@main/README.md))
- [In-Memory Model Sessions](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/frameworks/training-systems/model-persistence-systems/in-memory-model-sessions.md) — Keeps embedding and reranking models loaded in memory as a background process to reduce inference latency.

### Data & Databases

- [Local Knowledge Base Indexers](https://awesome-repositories.com/f/data-databases/search-indexing-technologies/search-indexing/search-and-indexing/local-knowledge-base-indexers.md) — Converts local markdown files and codebases into a searchable database for rapid semantic retrieval. ([source](https://cdn.jsdelivr.net/gh/tobi/qmd@main/README.md))
- [Hybrid Search](https://awesome-repositories.com/f/data-databases/hybrid-search.md) — Combines vector similarity and keyword search with reranking to find highly relevant content in local documents. ([source](https://cdn.jsdelivr.net/gh/tobi/qmd@main/README.md))
- [Hybrid Vector-Keyword Indexing](https://awesome-repositories.com/f/data-databases/hybrid-vector-keyword-indexing.md) — Implements a hybrid indexing system combining dense vector embeddings with inverted keyword indices.
- [Local Knowledge Bases](https://awesome-repositories.com/f/data-databases/local-knowledge-bases.md) — Converts local markdown files and codebases into a searchable database for private information retrieval.
- [Search & Information Retrieval](https://awesome-repositories.com/f/data-databases/search-indexing-technologies/search-indexing/search-information-retrieval.md) — Implements capabilities to query indexed local documents and retrieve relevant information based on user requests. ([source](https://github.com/tobi/qmd/tree/main/docs/))
- [Syntax-Aware Chunking](https://awesome-repositories.com/f/data-databases/semantic-code-indexing/syntax-aware-chunking.md) — Uses abstract syntax tree analysis to split source code at logical boundaries for precise AI context. ([source](https://cdn.jsdelivr.net/gh/tobi/qmd@main/README.md))
- [Retrieval Context Metadata](https://awesome-repositories.com/f/data-databases/search-indexing-technologies/search-indexing/search-information-retrieval/code-context-search/retrieval-context-metadata.md) — Attaches descriptive metadata to folders and collections to improve the relevance of retrieved content for language models. ([source](https://cdn.jsdelivr.net/gh/tobi/qmd@main/README.md))

### Development Tools & Productivity

- [AI-Workflow Code Index Builders](https://awesome-repositories.com/f/development-tools-productivity/search-indexing-tools/local-file-indexers/ai-workflow-code-index-builders.md) — Builds structured, searchable indexes of local codebases split by function and class for AI agents.
- [Markdown Indexers](https://awesome-repositories.com/f/development-tools-productivity/search-indexing-tools/local-file-indexers/markdown-indexers.md) — Scans local directories for markdown documents to build a searchable index for fast content retrieval. ([source](https://github.com/tobi/qmd/tree/main/docs/))

### Operating Systems & Systems Programming

- [Background Daemons](https://awesome-repositories.com/f/operating-systems-systems-programming/system-administration-maintenance/system-services/background-daemons.md) — Operates as a long-lived background daemon to maintain model residency in memory. ([source](https://cdn.jsdelivr.net/gh/tobi/qmd@main/README.md))
