Deep Searcher is an open-source retrieval-augmented generation engine that indexes private documents into a vector database and uses large language models to answer complex questions with cited reasoning. It functions as both a command-line interface and a web API research tool, enabling users to load data and generate comprehensive reports by combining indexed private information with LLM-powered analysis.
The system distinguishes itself through a plugin-based provider architecture that supports multiple embedding models, LLM providers, vector databases, and file loaders as interchangeable components. It offers multi-LLM orchestration, coordinating several large language model services to answer queries by routing requests and aggregating responses, while also providing configurable embedding pipelines and vector database retrieval for similarity search.
The project includes CLI-driven data ingestion for local documents and web content, with support for PDFs and text files, alongside web crawling capabilities. Configuration options allow users to select and authenticate with various embedding, LLM, vector database, file loader, and web crawler providers, while the web API service layer exposes query and data loading functions as HTTP endpoints for programmatic access.