25 repositories
Database systems that store, retrieve, and manage information as semi-structured document objects.
Distinguishing note: Focuses on document-oriented persistence where relationships and validation are managed by a centralized controller.
Explore 25 awesome GitHub repositories matching data & databases · Document Stores. Refine with filters or upvote what's useful.
ERPNext is a comprehensive enterprise resource planning suite designed to integrate core organizational functions, including accounting, inventory, human resources, and project management, into a single unified platform. It operates as a metadata-driven business application, where data structures and application logic are defined through configuration rather than hard-coded programming to facilitate rapid customization. The system distinguishes itself through a robust security and governance framework that enforces granular, role-based access control across all document operations. It feature
Records are stored as structured objects where the system manages relationships and validation logic through a centralized controller.
This project is a distributed, document-oriented database system designed to store information in flexible, hierarchical structures. It supports horizontal scaling through automated sharding and maintains high availability across global clusters using a multi-node replication protocol. By executing multi-document operations as atomic units, the system ensures data integrity and consistency across distributed environments. The platform distinguishes itself by integrating advanced vector-based indexing, which enables semantic similarity searches alongside traditional geospatial and lexical quer
Stores data in flexible, hierarchical structures that allow for dynamic schemas without requiring rigid table definitions.
Chroma is a specialized vector database designed to index and retrieve high-dimensional data representations for semantic similarity search. It functions as a comprehensive platform for information retrieval, enabling the storage and management of unstructured documents alongside structured metadata. By mapping data into numerical representations, the system facilitates rapid similarity lookups across large datasets. The platform distinguishes itself through a hybrid search infrastructure that combines dense vector embeddings with sparse keyword and regular expression matching to balance sema
Saves documents and associated metadata in a database to enable efficient retrieval and management of unstructured data.
Kotaemon is an orchestration framework designed for building modular, agentic workflows that integrate document processing, retrieval-augmented generation, and multi-step reasoning. It provides a comprehensive platform for developing document-based question answering systems, allowing users to chain language models, prompt templates, and external tools into complex, automated pipelines. The system distinguishes itself through a highly modular architecture that emphasizes component-based composition and schema-driven data exchange. It supports autonomous agents capable of decomposing complex q
Configures storage backends for managing full-text and vector-based document indices.
This project is a reactive, offline-first NoSQL database engine designed for JavaScript applications. It provides a robust framework for managing application state by synchronizing data across browsers, mobile devices, and server-side runtimes. By treating local storage as the primary source of truth, it enables applications to remain functional without network connectivity, automatically reconciling changes with remote backends once a connection is restored. The database distinguishes itself through a modular architecture that supports cross-environment synchronization and high-performance d
Maintains a local JSON document store with schema validation and indexing as the primary source of truth.
This project serves as a comprehensive technical reference for the architecture and design of data-intensive applications. It provides a structured analysis of the fundamental principles required to build reliable, scalable, and maintainable software systems, covering the core trade-offs inherent in modern data infrastructure. The repository explores the mechanics of distributed data management, including strategies for replication, partitioning, and achieving consensus across multiple nodes. It details the design of storage engines, indexing techniques, and transaction management models, whi
Stores information as flexible JSON documents to accommodate semi-structured data.
The Gemini Cookbook is a comprehensive collection of implementation patterns, code samples, and development guides designed for building applications with Google Gemini models. It serves as a central resource for developers to integrate multimodal generative artificial intelligence into their software, providing the necessary frameworks to manage model interactions, stateful workflows, and structured data extraction. The repository distinguishes itself by offering specialized toolkits for autonomous agent orchestration, enabling the construction of agents that can execute code, browse the web
Provides persistent storage containers for document embeddings to maintain data availability.
node-lessons is a comprehensive Node.js programming course and instructional guide. It provides a collection of guided lessons and code examples designed to teach the fundamentals of the Node.js runtime and server-side JavaScript development. The project serves as a practical guide for building web servers and backend applications, specifically covering the implementation of HTTP servers, request routing, and middleware chains. It includes specialized instructional material on managing asynchronous JavaScript workflows through promises and flow control, as well as guides for integrating NoSQL
Implements the storage and retrieval of semi-structured information as JSON document objects.
NeDB is a JavaScript embedded NoSQL document store designed for Node.js and the browser. It functions as an in-memory data store with the option to persist documents to a local file system, ensuring data survives application restarts. The project utilizes a MongoDB-compatible API to perform data operations, allowing it to serve as a lightweight document indexing system and a persistent file database without requiring a separate database server. Capabilities include querying, inserting, updating, and deleting documents, as well as the ability to create indexes on specific fields to accelerate
Provides a NoSQL document store for managing semi-structured data in memory or on disk.
llm-universe is a structured learning resource and technical guide focused on the development of large language model applications. It serves as a curriculum for mastering model orchestration, the creation of autonomous conversational agents, and the implementation of retrieval-augmented generation systems. The project provides detailed instructions on connecting model APIs with memory and tools to create execution chains. It specifically covers the construction of retrieval pipelines, including the process of cleaning raw documents, generating embeddings, and integrating vector databases to
Provides detailed instructions on cleaning and slicing diverse document types before storing them in vector databases.
This project is a unified, cloud-native policy engine designed to decouple authorization and security logic from application codebases. It functions as a centralized authorization service that evaluates structured input data against declarative rules, enabling consistent policy enforcement across microservices, infrastructure, and continuous integration pipelines. The engine utilizes a specialized logic programming language to express complex constraints, which are compiled into an optimized intermediate representation for high-performance evaluation. By supporting both sidecar-based deployme
Provides capabilities to retrieve, create, and modify structured data documents to support complex policy evaluation.
Leanote is a collaborative Markdown editor, hierarchical note manager, and self-hosted blogging platform. It functions as a knowledge base that uses a document store to organize structured notebooks and rich-text documents. The system enables real-time co-authoring, allowing multiple users to simultaneously edit documents and brainstorm ideas. It also includes a publishing engine that transforms private notes into public-facing blogs using customizable themes and multi-contributor management. The platform provides tools for knowledge management through notebooks and tags, supporting both ric
Employs a NoSQL document store for flexible, schema-less storage of notes and notebooks.
This project is a comprehensive learning resource and reference guide for software architecture and distributed systems design. It serves as a structured curriculum for engineers to study fundamental architectural patterns, scalability strategies, and distributed computing theory, specifically tailored to prepare for technical interviews and professional engineering roles. The repository distinguishes itself by providing a curated collection of industry-standard infrastructure tools and methodologies. It covers the selection and implementation of technologies for data storage, message brokeri
Explains database systems that store, retrieve, and manage information as semi-structured document objects.
FerretDB is an open-source database emulator and protocol translator that mimics a MongoDB environment to support existing drivers and client tools on a relational backend. It functions as a stateless database proxy that converts binary wire protocol messages into SQL statements, allowing a relational engine to handle document-oriented requests. The project serves as a migration tool for moving applications from MongoDB to PostgreSQL without rewriting queries or changing client drivers. It achieves this by using PostgreSQL as a document store, storing and querying BSON documents through a tra
Uses PostgreSQL as a document store to store and query BSON documents through a translation layer.
LiteDB is a serverless, embedded NoSQL document database for .NET applications. It persists data into a single portable file, functioning as a BSON data store that resides within the application process rather than running as a separate server. The system is ACID compliant, utilizing write-ahead logging to ensure atomic, consistent, isolated, and durable transactions. It includes built-in encryption to provide secure local data storage and protect files on disk from unauthorized access. The project covers object-document mapping to convert classes into document formats, indexed search capabi
Functions as a serverless embedded NoSQL document store for .NET applications.
LiteDB is a serverless NoSQL document store and embedded database engine for .NET applications. It persists unstructured documents and binary data into a single standalone disk file, allowing the database to run within the application process rather than as a separate server. The system supports strongly typed queries through Language Integrated Query and allows the execution of standard SQL commands for data retrieval and transformation. It provides native mapping of plain classes into document formats and secures stored information via symmetric-key file encryption. The engine includes cap
Provides a NoSQL document store for .NET applications that manages semi-structured data objects.
TinyDB is a lightweight, document-oriented database and embedded NoSQL engine. It stores data as documents in local files, providing a persistence layer that operates without a separate server process. The system is an extensible document store featuring a middleware architecture. This allows for the customization of storage backends and the interception of data operations to transform how information is stored and retrieved. The database manages unstructured data using JSON-based serialization and supports pluggable storage backends for local file persistence.
Implements a document-oriented data model for storing semi-structured information as flexible documents.
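The pluggable-storage and middleware idea described for TinyDB can be sketched in plain Python. This is a minimal illustration under stated assumptions, not TinyDB's actual API: the names `MemoryStorage`, `LowercaseKeysMiddleware`, and `DocumentStore` are hypothetical, and the storage keeps its JSON text in memory rather than in a file so the example stays self-contained.

```python
import json
from typing import Any, Callable, Dict, List

Document = Dict[str, Any]


class MemoryStorage:
    """Hypothetical storage backend: holds the serialized JSON text in memory."""

    def __init__(self) -> None:
        self._raw = "[]"

    def read(self) -> List[Document]:
        return json.loads(self._raw)

    def write(self, docs: List[Document]) -> None:
        self._raw = json.dumps(docs)


class LowercaseKeysMiddleware:
    """Hypothetical middleware: wraps a storage and lowercases keys on write."""

    def __init__(self, inner: Any) -> None:
        self._inner = inner

    def read(self) -> List[Document]:
        return self._inner.read()

    def write(self, docs: List[Document]) -> None:
        normalized = [{k.lower(): v for k, v in d.items()} for d in docs]
        self._inner.write(normalized)


class DocumentStore:
    """Schema-less store: any JSON-serializable dict is accepted as a document."""

    def __init__(self, storage: Any) -> None:
        self._storage = storage

    def insert(self, doc: Document) -> None:
        docs = self._storage.read()
        docs.append(doc)
        self._storage.write(docs)

    def search(self, predicate: Callable[[Document], bool]) -> List[Document]:
        return [d for d in self._storage.read() if predicate(d)]


# The middleware intercepts every write before it reaches the backend.
store = DocumentStore(LowercaseKeysMiddleware(MemoryStorage()))
store.insert({"Name": "Ada", "Role": "admin"})
store.insert({"name": "Lin", "tags": ["guest"]})
admins = store.search(lambda d: d.get("role") == "admin")
```

Because the store only talks to `read` and `write`, swapping the in-memory backend for a file-based one, or stacking a second middleware, would not require changes to `DocumentStore` itself.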
The Frappe HR Management System is a human resources platform built on the Frappe framework for managing employee lifecycles, payroll, and attendance. The system provides tools for payroll and tax automation, including the generation of salary structures, tax slab calculations, and automated payslips. It includes an attendance and leave manager with geolocation-based check-ins and configurable holiday calendars, as well as a performance appraisal framework for goal alignment and structured review cycles. Additional capabilities cover the full employee lifecycle from onboarding to exit, along
Implements a document-based data model that stores business entities as versioned records with attached files.
This project is a MongoDB Eloquent ORM and NoSQL query builder for the Laravel framework. It provides an active record implementation that maps MongoDB collections and documents to programmable models for data manipulation. The system enables schemaless data management, allowing applications to handle dynamic data structures without the need for rigid database migrations or predefined tables. It integrates MongoDB into Laravel applications to store and retrieve flexible document data using standard PHP patterns. The library covers document store querying and Eloquent model mapping, utilizing
Enables the execution of complex queries against a NoSQL document store using a fluent interface.
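This project is a collection of reference implementations, sample code, and starter kits for integrating Firebase backend services into web applications with the JavaScript SDK. It serves as a practical guide for bootstrapping projects with cloud-hosted authentication, databases, and serverless logic. The repository provides concrete examples of real-time data synchronization, user identity management, and event-driven cloud functions. It also includes reference code for using local service emulators to test cloud functionality on a local machine before production deployment. The codebase covers a broad range of features, including NoSQL and relational data storage, static asset hosting on a global CDN, and the enforcement of declarative security rules. It also demonstrates how to integrate authentication and run server-side logic in a managed environment.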
Stores and synchronizes semi-structured document objects in the cloud for real-time access.