3 dépôts
Storage solutions that manage unstructured documents alongside structured metadata for filtering.
Distinguishing note: Focuses on the combination of document storage and metadata filtering.
Explore 3 awesome GitHub repositories matching data & databases · Metadata-Aware Document Stores. Refine with filters or upvote what's useful.
Chroma is a specialized vector database designed to index and retrieve high-dimensional data representations for semantic similarity search. It functions as a comprehensive platform for information retrieval, enabling the storage and management of unstructured documents alongside structured metadata. By mapping data into numerical representations, the system facilitates rapid similarity lookups across large datasets. The platform distinguishes itself through a hybrid search infrastructure that combines dense vector embeddings with sparse keyword and regular expression matching to balance sema
Manages unstructured documents alongside structured metadata to enable precise filtering and retrieval operations.
Airweave is a unified AI knowledge base platform that syncs data from external APIs into a searchable layer for retrieval-augmented generation. It provides a pre-built data connector library and a framework for building custom connectors, enabling the extraction, transformation, and synchronization of structured and unstructured data from SaaS applications. The platform includes a hybrid vector retrieval system that combines semantic, neural, and keyword search strategies to deliver grounded context for AI agents. The platform distinguishes itself through an agentic search engine that iterati
Preserves document metadata like ownership and timestamps alongside synced content for search filtering.
iBook is a self-hosted e-book library management system that allows individuals or organizations to set up and maintain their own digital collection. The platform provides durable storage for e-book files and their associated metadata, a RESTful API for library operations, static file serving for direct browser downloads, and a self-contained deployment model that runs on a user's own server. Beyond the core library functionality, iBook includes a freely accessible digital archive of technical and general e-books in EPUB and PDF formats, role-based access control for catalog and administrativ
Provides a durable database schema that maps each e-book to its metadata and file location.