Why is frappe/erpnext a recommended Document Stores repository?

Records are stored as structured objects where the system manages relationships and validation logic through a centralized controller.

Why is mongodb/mongo a recommended Document Stores repository?

Stores data in flexible, hierarchical structures that allow for dynamic schemas without requiring rigid table definitions.

Why is chroma-core/chroma a recommended Document Stores repository?

Saves documents and associated metadata in a database to enable efficient retrieval and management of unstructured data.

Why is Cinnamon/kotaemon a recommended Document Stores repository?

Configures storage backends for managing full-text and vector-based document indices.

Why is pubkey/rxdb a recommended Document Stores repository?

Maintains a local JSON document store with schema validation and indexing as the primary source of truth.

Why is Vonng/ddia a recommended Document Stores repository?

Stores information as flexible JSON documents to accommodate semi-structured data.

Why is google-gemini/cookbook a recommended Document Stores repository?

Provides persistent storage containers for document embeddings to maintain data availability.

Why is alsotang/node-lessons a recommended Document Stores repository?

Implements the storage and retrieval of semi-structured information as JSON document objects.

Why is louischatriot/nedb a recommended Document Stores repository?

Provides a NoSQL document store for managing semi-structured data in memory or on disk.

Why is datawhalechina/llm-universe a recommended Document Stores repository?

Provides detailed instructions on cleaning and slicing diverse document types before storing them in vector databases.

25 repositories

Awesome GitHub Repositories: Document Stores

Database systems that store, retrieve, and manage information as semi-structured document objects.

Distinguishing note: Focuses on document-oriented persistence where relationships and validation are managed by a centralized controller.
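As a rough illustration of the idea above, here is a minimal sketch in Python, using only the standard library. It is not taken from any repository listed here: the `DocumentStore` class, the `validate_invoice` function, and the `customers` and `invoices` collection names are all hypothetical. It shows schema-less documents stored as plain dictionaries, with every write passing through one controller that validates the document and checks its relationship to another collection.

```python
import copy
import itertools


class DocumentStore:
    """Hypothetical in-memory document store with a single write path."""

    def __init__(self):
        self._collections = {}          # collection name -> {doc_id: document}
        self._validators = {}           # collection name -> validation function
        self._ids = itertools.count(1)  # simple sequential id generator

    def register_validator(self, collection, validator):
        # The validator is called as validator(store, document) before each insert.
        self._validators[collection] = validator

    def insert(self, collection, document):
        validator = self._validators.get(collection)
        if validator is not None:
            validator(self, document)   # raises ValueError to reject the write
        doc_id = next(self._ids)
        stored = copy.deepcopy(document)  # isolate stored data from the caller
        stored["_id"] = doc_id
        self._collections.setdefault(collection, {})[doc_id] = stored
        return doc_id

    def get(self, collection, doc_id):
        doc = self._collections.get(collection, {}).get(doc_id)
        return copy.deepcopy(doc) if doc is not None else None

    def find(self, collection, **criteria):
        # Match documents whose top-level fields equal every given criterion.
        docs = self._collections.get(collection, {}).values()
        return [copy.deepcopy(d) for d in docs
                if all(d.get(k) == v for k, v in criteria.items())]


def validate_invoice(store, doc):
    # Relationship check: the referenced customer document must exist.
    if store.get("customers", doc.get("customer_id")) is None:
        raise ValueError("invoice must reference an existing customer")
    # Field check: an invoice needs at least one line item.
    if not doc.get("items"):
        raise ValueError("invoice needs at least one item")


store = DocumentStore()
store.register_validator("invoices", validate_invoice)

# Documents in the same store can have different shapes (no fixed schema).
customer_id = store.insert("customers", {"name": "Acme", "tags": ["wholesale"]})
invoice_id = store.insert(
    "invoices",
    {"customer_id": customer_id, "items": [{"sku": "A1", "qty": 2}]},
)
```

Because validation lives in the store rather than in each caller, an invoice pointing at a missing customer is rejected before anything is written. Real systems in this category add persistence, indexing, and richer query languages on top of this basic shape.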

Explore 25 awesome GitHub repositories matching data & databases · Document Stores. Refine with filters or upvote what's useful.


frappe/erpnext (35,726)
ERPNext is a comprehensive enterprise resource planning suite designed to integrate core organizational functions, including accounting, inventory, human resources, and project management, into a single unified platform. It operates as a metadata-driven business application, where data structures and application logic are defined through configuration rather than hard-coded programming to facilitate rapid customization. The system distinguishes itself through a robust security and governance framework that enforces granular, role-based access control across all document operations. It feature
Records are stored as structured objects where the system manages relationships and validation logic through a centralized controller.
Python · accounting · asset-management · crm

mongodb/mongo (28,158)
This project is a distributed, document-oriented database system designed to store information in flexible, hierarchical structures. It supports horizontal scaling through automated sharding and maintains high availability across global clusters using a multi-node replication protocol. By executing multi-document operations as atomic units, the system ensures data integrity and consistency across distributed environments. The platform distinguishes itself by integrating advanced vector-based indexing, which enables semantic similarity searches alongside traditional geospatial and lexical quer
Stores data in flexible, hierarchical structures that allow for dynamic schemas without requiring rigid table definitions.
C++ · c-plus-plus · database · mongodb

chroma-core/chroma (26,198)
Chroma is a specialized vector database designed to index and retrieve high-dimensional data representations for semantic similarity search. It functions as a comprehensive platform for information retrieval, enabling the storage and management of unstructured documents alongside structured metadata. By mapping data into numerical representations, the system facilitates rapid similarity lookups across large datasets. The platform distinguishes itself through a hybrid search infrastructure that combines dense vector embeddings with sparse keyword and regular expression matching to balance sema
Saves documents and associated metadata in a database to enable efficient retrieval and management of unstructured data.
Rust · ai · database · document-retrieval

Cinnamon/kotaemon (25,139)
Kotaemon is an orchestration framework designed for building modular, agentic workflows that integrate document processing, retrieval-augmented generation, and multi-step reasoning. It provides a comprehensive platform for developing document-based question answering systems, allowing users to chain language models, prompt templates, and external tools into complex, automated pipelines. The system distinguishes itself through a highly modular architecture that emphasizes component-based composition and schema-driven data exchange. It supports autonomous agents capable of decomposing complex q
Configures storage backends for managing full-text and vector-based document indices.
Python · chatbot · llms · open-source

pubkey/rxdb (23,048)
This project is a reactive, offline-first NoSQL database engine designed for JavaScript applications. It provides a robust framework for managing application state by synchronizing data across browsers, mobile devices, and server-side runtimes. By treating local storage as the primary source of truth, it enables applications to remain functional without network connectivity, automatically reconciling changes with remote backends once a connection is restored. The database distinguishes itself through a modular architecture that supports cross-environment synchronization and high-performance d
Maintains a local JSON document store with schema validation and indexing as the primary source of truth.
TypeScript · angular · browser-database · couchdb

Vonng/ddia (22,648)
This project serves as a comprehensive technical reference for the architecture and design of data-intensive applications. It provides a structured analysis of the fundamental principles required to build reliable, scalable, and maintainable software systems, covering the core trade-offs inherent in modern data infrastructure. The repository explores the mechanics of distributed data management, including strategies for replication, partitioning, and achieving consensus across multiple nodes. It details the design of storage engines, indexing techniques, and transaction management models, whi
Stores information as flexible JSON documents to accommodate semi-structured data.
Python · book · database · ddia

google-gemini/cookbook (17,418)
The Gemini Cookbook is a comprehensive collection of implementation patterns, code samples, and development guides designed for building applications with Google Gemini models. It serves as a central resource for developers to integrate multimodal generative artificial intelligence into their software, providing the necessary frameworks to manage model interactions, stateful workflows, and structured data extraction. The repository distinguishes itself by offering specialized toolkits for autonomous agent orchestration, enabling the construction of agents that can execute code, browse the web
Provides persistent storage containers for document embeddings to maintain data availability.
Jupyter Notebook · gemini · gemini-api

alsotang/node-lessons (16,450)
node-lessons is a comprehensive Node.js programming course and instructional guide. It provides a collection of guided lessons and code examples designed to teach the fundamentals of the Node.js runtime and server-side JavaScript development. The project serves as a practical guide for building web servers and backend applications, specifically covering the implementation of HTTP servers, request routing, and middleware chains. It includes specialized instructional material on managing asynchronous JavaScript workflows through promises and flow control, as well as guides for integrating NoSQL
Implements the storage and retrieval of semi-structured information as JSON document objects.
JavaScript · javascript · nodejs

louischatriot/nedb (13,540)
NeDB is a JavaScript embedded NoSQL document store designed for Node.js and the browser. It functions as an in-memory data store with the option to persist documents to a local file system, ensuring data survives application restarts. The project utilizes a MongoDB-compatible API to perform data operations, allowing it to serve as a lightweight document indexing system and a persistent file database without requiring a separate database server. Capabilities include querying, inserting, updating, and deleting documents, as well as the ability to create indexes on specific fields to accelerate
Provides a NoSQL document store for managing semi-structured data in memory or on disk.
JavaScript

datawhalechina/llm-universe (13,269)
llm-universe is a structured learning resource and technical guide focused on the development of large language model applications. It serves as a curriculum for mastering model orchestration, the creation of autonomous conversational agents, and the implementation of retrieval-augmented generation systems. The project provides detailed instructions on connecting model APIs with memory and tools to create execution chains. It specifically covers the construction of retrieval pipelines, including the process of cleaning raw documents, generating embeddings, and integrating vector databases to
Provides detailed instructions on cleaning and slicing diverse document types before storing them in vector databases.
Jupyter Notebook · langchain · rag

open-policy-agent/opa (11,860)
This project is a unified, cloud-native policy engine designed to decouple authorization and security logic from application codebases. It functions as a centralized authorization service that evaluates structured input data against declarative rules, enabling consistent policy enforcement across microservices, infrastructure, and continuous integration pipelines. The engine utilizes a specialized logic programming language to express complex constraints, which are compiled into an optimized intermediate representation for high-performance evaluation. By supporting both sidecar-based deployme
Provides capabilities to retrieve, create, and modify structured data documents to support complex policy evaluation.
Go · authorization · cloud-native · compliance

leanote/leanote (11,695)
Leanote is a collaborative Markdown editor, hierarchical note manager, and self-hosted blogging platform. It functions as a knowledge base that uses a document store to organize structured notebooks and rich-text documents. The system enables real-time co-authoring, allowing multiple users to simultaneously edit documents and brainstorm ideas. It also includes a publishing engine that transforms private notes into public-facing blogs using customizable themes and multi-contributor management. The platform provides tools for knowledge management through notebooks and tags, supporting both ric
Employs a NoSQL document store for flexible, schema-less storage of notes and notebooks.
JavaScript · evernote · leanote

madd86/awesome-system-design (11,695)
This project is a comprehensive learning resource and reference guide for software architecture and distributed systems design. It serves as a structured curriculum for engineers to study fundamental architectural patterns, scalability strategies, and distributed computing theory, specifically tailored to prepare for technical interviews and professional engineering roles. The repository distinguishes itself by providing a curated collection of industry-standard infrastructure tools and methodologies. It covers the selection and implementation of technologies for data storage, message brokeri
Explains database systems that store, retrieve, and manage information as semi-structured document objects.
distributed-systems · hadoop-ecosystem · interview

FerretDB/FerretDB (10,976)
FerretDB is an open-source database emulator and protocol translator that mimics a MongoDB environment to support existing drivers and client tools on a relational backend. It functions as a stateless database proxy that converts binary wire protocol messages into SQL statements, allowing a relational engine to handle document-oriented requests. The project serves as a migration tool for moving applications from MongoDB to PostgreSQL without rewriting queries or changing client drivers. It achieves this by using PostgreSQL as a document store, storing and querying BSON documents through a tra
Uses PostgreSQL as a document store to store and query BSON documents through a translation layer.
Go

mbdavid/LiteDB (9,410)
LiteDB is a serverless, embedded NoSQL document database for .NET applications. It persists data into a single portable file, functioning as a BSON data store that resides within the application process rather than running as a separate server. The system is ACID compliant, utilizing write-ahead logging to ensure atomic, consistent, isolated, and durable transactions. It includes built-in encryption to provide secure local data storage and protect files on disk from unauthorized access. The project covers object-document mapping to convert classes into document formats, indexed search capabi
Functions as a serverless embedded NoSQL document store for .NET applications.
C#

litedb-org/LiteDB (9,409)
LiteDB is a serverless NoSQL document store and embedded database engine for .NET applications. It persists unstructured documents and binary data into a single standalone disk file, allowing the database to run within the application process rather than as a separate server. The system supports strongly typed queries through Language Integrated Query and allows the execution of standard SQL commands for data retrieval and transformation. It provides native mapping of plain classes into document formats and secures stored information via symmetric-key file encryption. The engine includes cap
Provides a NoSQL document store for .NET applications that manages semi-structured data objects.
C# · database · dotnet · hacktoberfest

msiemens/tinydb (7,529)
TinyDB is a lightweight, document-oriented database and embedded NoSQL engine. It stores data as documents in local files, providing a persistence layer that operates without a separate server process. The system is an extensible document store featuring a middleware architecture. This allows for the customization of storage backends and the interception of data operations to transform how information is stored and retrieved. The database manages unstructured data using JSON-based serialization and supports pluggable storage backends for local file persistence.
Implements a document-oriented data model for storing semi-structured information as flexible documents.
Python · database · documentdb · json

frappe/hrms (7,530)
The Frappe HR Management System is a human resources platform built on the Frappe framework for managing employee lifecycles, payroll, and attendance. The system provides tools for payroll and tax automation, including the generation of salary structures, tax slab calculations, and automated payslips. It includes an attendance and leave manager with geolocation-based check-ins and configurable holiday calendars, as well as a performance appraisal framework for goal alignment and structured review cycles. Additional capabilities cover the full employee lifecycle from onboarding to exit, along
Implements a document-based data model that stores business entities as versioned records with attached files.
Python · attendance · employee · erpnext

jenssegers/laravel-mongodb (7,075)
This project is a MongoDB Eloquent ORM and NoSQL query builder for the Laravel framework. It provides an active record implementation that maps MongoDB collections and documents to programmable models for data manipulation. The system enables schemaless data management, allowing applications to handle dynamic data structures without the need for rigid database migrations or predefined tables. It integrates MongoDB into Laravel applications to store and retrieve flexible document data using standard PHP patterns. The library covers document store querying and Eloquent model mapping, utilizing
Enables the execution of complex queries against a NoSQL document store using a fluent interface.
PHP

firebase/quickstart-js (5,367)
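This project is a collection of reference implementations, sample code, and starter kits for integrating Firebase backend services into web applications using the JavaScript SDK. It serves as a practical guide for bootstrapping projects with cloud-hosted authentication, databases, and serverless logic. The repository provides concrete examples of real-time data synchronization, user identity management, and event-driven cloud functions. It also includes reference code for using local service emulators to test cloud functionality on a local machine before production deployment. The codebase covers a broad range of features, including NoSQL and relational data storage, static asset hosting on a global CDN, and enforcement of declarative security rules. It also demonstrates how to integrate authentication and run server-side logic in a hosted environment.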
Stores and synchronizes semi-structured document objects in the cloud for real-time access.
TypeScript

Awesome Document Stores GitHub Repositories

frappe/erpnext

mongodb/mongo

chroma-core/chroma

Cinnamon/kotaemon

pubkey/rxdb

Vonng/ddia

google-gemini/cookbook

alsotang/node-lessons

louischatriot/nedb

datawhalechina/llm-universe

open-policy-agent/opa

leanote/leanote

madd86/awesome-system-design

FerretDB/FerretDB

mbdavid/LiteDB

litedb-org/LiteDB

msiemens/tinydb

frappe/hrms

jenssegers/laravel-mongodb

firebase/quickstart-js
