ParadeDB is a PostgreSQL extension that brings full-text search, vector search, and real-time analytics directly into the relational engine. It runs as a plugin, adding new storage and query-execution capabilities to an existing PostgreSQL installation.
The project stands out for its hybrid search support: a single query can combine lexical keyword matching with dense and sparse vector similarity. It uses reciprocal rank fusion to merge the resulting ranked lists, and it relies on logical replication to synchronize data from external instances, which removes the need for manual ETL pipelines.
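To make the fusion step concrete, here is a minimal sketch of reciprocal rank fusion in Python. It is an illustration of the general technique, not ParadeDB's own code: the function name `reciprocal_rank_fusion`, the document IDs, and the smoothing constant `k = 60` (a commonly used default) are all assumptions made for this example. Each document scores the sum of 1 / (k + rank) over every list in which it appears, so documents ranked well by both the keyword and the vector search rise to the top.

```python
from collections import defaultdict
from typing import Dict, Hashable, List, Sequence, Tuple


def reciprocal_rank_fusion(
    rankings: Sequence[Sequence[Hashable]],
    k: int = 60,  # assumed smoothing constant; a common default, not a ParadeDB setting
) -> List[Tuple[Hashable, float]]:
    """Fuse several ranked lists of document IDs into one ranked list.

    Each input list is ordered best-first. A document's fused score is the
    sum of 1 / (k + rank) over the lists that contain it (rank starts at 1).
    """
    scores: Dict[Hashable, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest score first; ties are broken by the ID's string form for determinism.
    return sorted(scores.items(), key=lambda item: (-item[1], str(item[0])))


# Hypothetical result lists from a keyword search and a vector search.
lexical_hits = ["a", "b", "c"]
vector_hits = ["c", "a", "d"]

fused = reciprocal_rank_fusion([lexical_hits, vector_hits])
for doc_id, score in fused:
    print(doc_id, round(score, 5))
```

Because the method only looks at ranks, it does not need the keyword scores and the vector distances to be on comparable scales, which is one reason it is a popular choice for hybrid search.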
Other capabilities include columnar indexing for high-performance aggregations and faceted search, search-result highlighting, match offset location, and transactional consistency through multi-version concurrency control (MVCC).
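As a rough illustration of what highlighting and match offset location mean, the following Python sketch finds the character offsets of a search term in a piece of text and wraps each match in tags. It is not how ParadeDB implements these features: the helper names `match_offsets` and `highlight`, the `<b>` tags, and the simple whole-word, case-insensitive matching are assumptions chosen for the example, and a real search engine would work from tokenized index data rather than a regular expression.

```python
import re
from typing import List, Tuple


def match_offsets(text: str, term: str) -> List[Tuple[int, int]]:
    """Return (start, end) character offsets of whole-word, case-insensitive matches.

    Assumes `term` starts and ends with a word character, so that the
    word-boundary anchors behave as expected.
    """
    pattern = re.compile(r"\b" + re.escape(term) + r"\b", re.IGNORECASE)
    return [(m.start(), m.end()) for m in pattern.finditer(text)]


def highlight(text: str, term: str, start_tag: str = "<b>", end_tag: str = "</b>") -> str:
    """Wrap every match of `term` in the given tags, leaving other text unchanged."""
    pieces: List[str] = []
    cursor = 0
    for start, end in match_offsets(text, term):
        pieces.append(text[cursor:start])  # text before the match
        pieces.append(start_tag + text[start:end] + end_tag)
        cursor = end
    pieces.append(text[cursor:])  # remaining text after the last match
    return "".join(pieces)


snippet = "Running shoes for running"
print(match_offsets(snippet, "running"))
print(highlight(snippet, "running"))
```

The offsets are what a client application would use to apply its own styling, while the tagged string is a ready-made snippet for display.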
The software can be deployed using Docker containers or through cloud platforms such as Railway.