Why is twentyhq/twenty a recommended Data Schema Management GitHub Repositories repository?

Enables the definition and modification of custom data models, objects, and relationships through a declarative schema-first approach.

Why is begriffs/postgrest a recommended Data Schema Management GitHub Repositories repository?

Handles schema versioning to allow the data interface to evolve without impacting existing clients.

Why is pubkey/rxdb a recommended Data Schema Management GitHub Repositories repository?

Provides programmatic definition and lifecycle management for data schemas to ensure consistent document structures.

Why is vonng/ddia a recommended Data Schema Management GitHub Repositories repository?

Uses interface definition languages to specify data structures for consistent encoding across systems.

Why is datalab-to/surya a recommended Data Schema Management GitHub Repositories repository?

Defines and stores data structures centrally to reference them by identifier across multiple extraction requests.

Why is mementum/backtrader a recommended Data Schema Management GitHub Repositories repository?

Allows adding custom data fields to existing market data sources by mapping new lines to input columns.

Why is milanm/devops-roadmap a recommended Data Schema Management GitHub Repositories repository?

Implements runtime schema version detection to ensure compatibility during application updates.

Why is elysiajs/elysia a recommended Data Schema Management GitHub Repositories repository?

Applies custom encoding or sanitization logic to incoming and outgoing data to ensure consistency with defined schemas.

Why is apache/doris a recommended Data Schema Management GitHub Repositories repository?

Manages dynamic data schemas by supporting semi-structured data and rapid modifications.

Why is ajv-validator/ajv a recommended Data Schema Management GitHub Repositories repository?

Organizes complex, recursive, or cross-referenced data definitions into maintainable and reusable components across large software projects.

64 مستودعات

Awesome GitHub RepositoriesData Schema Management

Tools for defining, versioning, and modifying data models, object structures, and relational schemas through code or declarative definitions.

Distinguishing note: Focuses on the programmatic definition and lifecycle management of data schemas rather than generic database storage or query execution.

Explore 64 awesome GitHub repositories matching data & databases · Data Schema Management. Refine with filters or upvote what's useful.

اعثر على أفضل المستودعات باستخدام الذكاء الاصطناعي.سنبحث عن أفضل المستودعات المطابقة باستخدام الذكاء الاصطناعي.

twentyhq/twenty
twentyhq/twenty
50,113عرض على GitHub
Twenty is a headless customer relationship management framework that enables developers to build, version, and deploy custom business applications using code. By utilizing a declarative approach to data modeling, the platform allows for the definition of custom objects, fields, and complex relationships directly within the source code. This schema-driven architecture automatically generates corresponding REST and GraphQL APIs, ensuring that data structures and interface components remain synchronized across development and production environments. The platform distinguishes itself through a m
Enables the definition and modification of custom data models, objects, and relationships through a declarative schema-first approach.
TypeScriptcrmcrm-systemcustomer
عرض على GitHub50,113
begriffs/postgrest
begriffs/postgrest
27,234عرض على GitHub
PostgREST is a standalone web server that automatically transforms a PostgreSQL database into a RESTful API. It serves as an API gateway that translates HTTP requests into SQL queries, mapping the database schema directly to endpoints without the need for manual route definitions. The system utilizes a JWT authentication layer to validate user identities and map incoming web requests to specific database roles. This allows the server to delegate authorization and permission enforcement to the internal PostgreSQL role system. It includes a generator for OpenAPI specifications to provide stand
Handles schema versioning to allow the data interface to evolve without impacting existing clients.
Haskell
عرض على GitHub27,234
pubkey/rxdb
pubkey/rxdb
23,048عرض على GitHub
This project is a reactive, offline-first NoSQL database engine designed for JavaScript applications. It provides a robust framework for managing application state by synchronizing data across browsers, mobile devices, and server-side runtimes. By treating local storage as the primary source of truth, it enables applications to remain functional without network connectivity, automatically reconciling changes with remote backends once a connection is restored. The database distinguishes itself through a modular architecture that supports cross-environment synchronization and high-performance d
Provides programmatic definition and lifecycle management for data schemas to ensure consistent document structures.
TypeScriptangularbrowser-databasecouchdb
عرض على GitHub23,048
vonng/ddia
Vonng/ddia
22,648عرض على GitHub
This project serves as a comprehensive technical reference for the architecture and design of data-intensive applications. It provides a structured analysis of the fundamental principles required to build reliable, scalable, and maintainable software systems, covering the core trade-offs inherent in modern data infrastructure. The repository explores the mechanics of distributed data management, including strategies for replication, partitioning, and achieving consensus across multiple nodes. It details the design of storage engines, indexing techniques, and transaction management models, whi
Uses interface definition languages to specify data structures for consistent encoding across systems.
Pythonbookdatabaseddia
عرض على GitHub22,648
datalab-to/surya
datalab-to/surya
20,889عرض على GitHub
Surya is a document processing platform designed to transform unstructured files into structured, machine-readable data. It provides a comprehensive suite of tools for text recognition, layout analysis, and reading order detection, enabling the conversion of PDFs and images into formats such as JSON, HTML, or markdown. The platform is built to handle complex document workflows, offering capabilities for data extraction, document segmentation, and automated form completion. The platform distinguishes itself through a robust pipeline-based architecture that allows users to chain analysis tasks
Defines and stores data structures centrally to reference them by identifier across multiple extraction requests.
Python
عرض على GitHub20,889
mementum/backtrader
mementum/backtrader
20,462عرض على GitHub
Backtrader is a Python framework designed for the development, backtesting, and live execution of algorithmic trading strategies. It provides a comprehensive environment for quantitative finance, allowing users to simulate trading logic against historical market data or connect directly to brokerage platforms for automated real-time trading. The project distinguishes itself through a unified event-driven architecture that treats backtesting and live trading with the same API. This consistency is supported by a flexible data-feed abstraction layer that normalizes diverse financial sources, ena
Allows adding custom data fields to existing market data sources by mapping new lines to input columns.
Pythonbacktestingmetaclasspython
عرض على GitHub20,462
milanm/devops-roadmap
milanm/DevOps-Roadmap
18,752عرض على GitHub
DevOps-Roadmap is a comprehensive educational repository and knowledge base designed to guide technical professionals through the complexities of modern software engineering. It functions as a structured curriculum and reference library, covering the full spectrum of skills required to master system architecture, infrastructure management, and cloud operations. The project distinguishes itself by bridging the gap between high-level architectural design and the practical realities of engineering leadership. It provides curated insights into distributed systems, data consistency, and scalable d
Implements runtime schema version detection to ensure compatibility during application updates.
awsazurecomputer-science
عرض على GitHub18,752
elysiajs/elysia
elysiajs/elysia
18,531عرض على GitHub
Elysia is a high-performance TypeScript web framework designed for building type-safe backend services. It provides a modular, plugin-based architecture that allows developers to compose server logic, middleware, and validation schemas into scalable application instances. By leveraging native web standards, the framework ensures portability across diverse JavaScript runtimes, including Node.js, Deno, and various edge computing environments. The framework distinguishes itself through its focus on end-to-end type safety, automatically synchronizing request and response definitions between the s
Applies custom encoding or sanitization logic to incoming and outgoing data to ensure consistency with defined schemas.
TypeScriptbunframeworkhttp
عرض على GitHub18,531
apache/doris
apache/doris
15,526عرض على GitHub
Doris is a distributed SQL data warehouse designed for high-performance analytical workloads and real-time data processing. It functions as a unified platform that integrates traditional relational warehousing with lakehouse query capabilities, allowing users to execute analytical operations directly against external data lakes without requiring data migration. The system distinguishes itself through a shared-nothing, massively parallel processing architecture that utilizes vectorized query execution and columnar storage to maintain sub-second latency. It supports dynamic schema evolution, en
Manages dynamic data schemas by supporting semi-structured data and rapid modifications.
Javaagentaibigquery
عرض على GitHub15,526
ajv-validator/ajv
ajv-validator/ajv
14,733عرض على GitHub
Ajv is a high-performance data validation framework that compiles JSON schemas into optimized, standalone JavaScript functions. By transforming declarative schema definitions into executable code, it eliminates runtime interpretation overhead and provides a secure, efficient way to enforce data integrity across both browser and server environments. The library distinguishes itself through its focus on performance and type safety. It employs advanced compilation techniques, including abstract syntax tree optimization and function caching, to ensure rapid validation. Beyond standard checks, it
Organizes complex, recursive, or cross-referenced data definitions into maintainable and reusable components across large software projects.
TypeScriptajvjson-schemavalidator
عرض على GitHub14,733
data-centric-ai-community/ydata-profiling
Data-Centric-AI-Community/ydata-profiling
13,618عرض على GitHub
This library provides a diagnostic toolkit for automated data profiling and exploratory analysis. It generates comprehensive statistical summaries and visual reports for tabular datasets, enabling users to identify distribution patterns, missing values, and quality anomalies through a unified interface. The project distinguishes itself by offering differential analysis, which allows for the comparison of two dataset versions to track structural and statistical changes over time. It supports large-scale data processing through lazy evaluation and provides interactive widgets that embed directl
Provides differential analysis to track statistical and structural changes between two dataset versions.
Python
عرض على GitHub13,618
dbt-labs/dbt-core
dbt-labs/dbt-core
13,051عرض على GitHub
dbt-core is a command-line framework for transforming data within a warehouse using modular SQL and version control. It functions as a data transformation engine that enables users to define data structures and business logic through declarative configuration files, which the system then compiles into executable code. By managing complex data dependencies through a directed acyclic graph, it ensures that transformation tasks execute in the correct order while maintaining a manifest-driven state to track lineage and execution history. The project distinguishes itself through an adapter-based d
Maintains multiple concurrent versions of a data model to allow for safe transitions between schema or logic updates.
Rustanalyticsbusiness-intelligencedata-modeling
عرض على GitHub13,051
provectus/kafka-ui
provectus/kafka-ui
12,158عرض على GitHub
kafka-ui is a web interface and centralized control plane for administering Apache Kafka clusters, topics, and brokers. It functions as a distributed message queue dashboard and orchestrator, allowing for the oversight of multiple distributed Kafka environments from a single management interface. The project provides dedicated tools for producing and inspecting messages within topics using various serialization and encoding formats. It includes a schema registry client for defining and versioning data schemas and a consumer monitoring dashboard to track offsets and calculate partition lag. T
Includes a schema registry client for defining and versioning data formats to ensure consistent encoding.
Javaapache-kafkabig-datacluster-management
عرض على GitHub12,158
linnovate/mean
linnovate/mean
12,061عرض على GitHub
This project is a full stack project generator and boilerplate for the MEAN stack, combining MongoDB, Express, Angular, and Node.js. It provides a pre-configured architecture and scaffolding tools to bootstrap JavaScript applications with a database, backend server, and frontend framework. The project includes a Dockerized application template to ensure consistent deployment and local development across different hardware configurations. It features a Node.js API scaffold that integrates token-based security, request validation, and interactive API documentation. The codebase covers broader
Enables programmatic definition and mapping of application objects to database collections.
TypeScriptangularexpressjavascript
عرض على GitHub12,061
matrix-org/synapse
matrix-org/synapse
12,013عرض على GitHub
Synapse is a decentralized communication server implementation that enables real-time messaging and data exchange across the global Matrix federation. It functions as a homeserver, allowing operators to host their own nodes while maintaining control over personal data and user identity within a distributed network. The server utilizes a federated messaging protocol to exchange messages and user data with independent servers, ensuring consistent state across the network. To support high-traffic environments, it employs a distributed service architecture that offloads tasks to independent backg
Manages the evolution of data storage structures to support new features while maintaining cross-version compatibility.
Pythonmatrix-orgpython
عرض على GitHub12,013
keplergl/kepler.gl
keplergl/kepler.gl
11,871عرض على GitHub
Kepler.gl is a web-based geospatial visualization framework designed for rendering large-scale location datasets. It functions as a modular React mapping component that enables developers to embed interactive, high-performance geographic visualizations into web applications, serving as a comprehensive engine for building browser-based GIS dashboards. The library distinguishes itself through a highly extensible architecture that centers on centralized state management. By utilizing a predictable state-driven model, it allows for the programmatic control of map layers, filters, and viewport set
Automatically detects field types and indices in raw datasets for structured processing.
TypeScriptdata-visualizationgeospatialkepler
عرض على GitHub11,871
realm/realm-java
realm/realm-java
11,464عرض على GitHub
Realm Java is a NoSQL mobile object database and reactive database engine. It provides a persistent local data store that saves native objects directly to disk, replacing traditional SQL storage and object-relational mapping layers. The system functions as a real-time data synchronizer, coordinating local database changes with a cloud backend across multiple devices. It integrates a reactive engine that uses change listeners and asynchronous event streams to automatically update user interfaces when underlying data changes. The project covers object-oriented data modeling, CRUD operations, a
Manages database structure updates through a migration system that preserves data during application upgrades.
Java
عرض على GitHub11,464
mantle/mantle
Mantle/Mantle
11,255عرض على GitHub
Mantle is a framework for mapping raw data structures and JSON into typed model objects for Cocoa and Cocoa Touch applications. It serves as a data serialization engine and JSON data mapper that transforms dictionaries and arrays into structured model objects. The framework distinguishes itself through an Objective-C persistence layer that manages model disk archiving via keyed archivers. It includes specialized logic for model version management, allowing outdated archived data structures to be upgraded to match current schemas during deserialization. The project covers a broad range of dat
Manages schema changes and upgrades archived data to ensure compatibility between model versions.
Objective-C
عرض على GitHub11,255
yugabyte/yugabyte-db
yugabyte/yugabyte-db
10,349عرض على GitHub
YugabyteDB is a distributed SQL database and relational data store designed for horizontal scalability and high availability across multiple nodes or regions. It functions as a cloud-native system that ensures continuous availability and supports PostgreSQL compatible query languages and drivers. The system includes specialized capabilities as a vector database for AI, utilizing high-dimensional indexing to perform similarity searches. It is engineered as a multi-region cloud database that synchronizes data across different geographic locations to maintain global availability. The project co
Manages concurrent schema changes using table-level locking to ensure consistent data model modifications.
Ccloud-nativecppdatabase
عرض على GitHub10,349
doctrine/orm
doctrine/orm
10,172عرض على GitHub
Doctrine ORM is a PHP object-relational mapper that connects application objects to relational database tables. It uses the data mapper and identity map patterns to decouple the in-memory object model from the database schema, allowing developers to manage data persistence without writing manual SQL. The project features a dedicated object-oriented query language and programmatic builder for retrieving data based on entities rather than tables. It implements a unit-of-work system to track object changes during a request and synchronize them via atomic transactions. The capability surface inc
Tracks incremental schema versions to keep the database structure in sync with the application code.
PHPhacktoberfest
عرض على GitHub10,172

Awesome Data Schema Management GitHub Repositories

twentyhq/twenty

begriffs/postgrest

pubkey/rxdb

Vonng/ddia

datalab-to/surya

mementum/backtrader

milanm/DevOps-Roadmap

elysiajs/elysia

apache/doris

ajv-validator/ajv

Data-Centric-AI-Community/ydata-profiling

dbt-labs/dbt-core

provectus/kafka-ui

linnovate/mean

matrix-org/synapse

keplergl/kepler.gl

realm/realm-java

Mantle/Mantle

yugabyte/yugabyte-db

doctrine/orm

استكشف الوسوم الفرعية