Why is cube-js/cube a recommended Data Model Generation GitHub Repositories repository?

Automates the creation of metrics and dimensions from existing database tables.

Why is faker-js/faker a recommended Data Model Generation GitHub Repositories repository?

Creates basic data types efficiently to minimize memory usage during high-volume generation.

Why is dbt-labs/dbt-core a recommended Data Model Generation GitHub Repositories repository?

Generates database schemas and tables without data to verify structure and dependencies during development.

Why is hypothesisworks/hypothesis a recommended Data Model Generation GitHub Repositories repository?

Generates basic scalar data types such as integers, floats, and booleans for testing.

Why is yangjianxin1/firefly a recommended Data Model Generation GitHub Repositories repository?

Formats raw training datasets into specific prompt structures to ensure conversational consistency across different models.

Why is inventree/inventree a recommended Data Model Generation GitHub Repositories repository?

Builds the entire data layer on Django's ORM, using model introspection to drive API endpoints and serialization.

Why is tortoise/tortoise-orm a recommended Data Model Generation GitHub Repositories repository?

Defines database tables as Python classes with typed fields, mirroring Django's model API for familiar object-oriented data modeling.

Why is querydsl/querydsl a recommended Data Model Generation GitHub Repositories repository?

Creates JavaBean classes based on database tables to simplify data transfer and the population of data clauses.

Why is carltongibson/django-filter a recommended Data Model Generation GitHub Repositories repository?

Automatically creates search filters and user-facing forms based on Django model field definitions.

Why is jqhph/dcat-admin a recommended Data Model Generation GitHub Repositories repository?

Implements a model-driven architecture where database schemas and model attributes drive the generation of CRUD views.

12 Repos

Awesome GitHub RepositoriesData Model Generation

Automated creation of structured data definitions from database tables to define metrics and dimensions.

Distinct from Data Table Generators: Distinct from data table generators: focuses on generating semantic model definitions rather than database table structures.

Explore 12 awesome GitHub repositories matching data & databases · Data Model Generation. Refine with filters or upvote what's useful.

Finde die besten Repos mit KI.Wir suchen mit KI nach den am besten passenden Repositories.

cube-js/cube
cube-js/cube
20,251Auf GitHub ansehen
Cube is a semantic data layer that provides a unified framework for defining business metrics, dimensions, and relationships across diverse data sources. By acting as a headless business intelligence engine, it transforms raw data into a governed model that can be queried via SQL, REST, and GraphQL interfaces. This architecture ensures consistent data definitions and logic across all downstream analytical applications and reporting tools. The platform distinguishes itself through its integrated conversational AI capabilities, which allow users to explore data using natural language. It orches
Automates the creation of metrics and dimensions from existing database tables.
Rustagentic-analyticsagentsai
Auf GitHub ansehen20,251
faker-js/faker
faker-js/faker
14,896Auf GitHub ansehen
Faker is a library for generating synthetic data and mock information to populate development and testing environments. It provides a structured way to create realistic values such as names, addresses, and dates, allowing developers to validate application logic and visualize user interfaces without relying on production data. The library distinguishes itself through its support for deterministic generation, which uses fixed seeds to ensure that data sequences remain identical across multiple test executions. It also features a modular architecture that separates generation logic into indepen
Creates basic data types efficiently to minimize memory usage during high-volume generation.
TypeScriptbrowserdatafake
Auf GitHub ansehen14,896
dbt-labs/dbt-core
dbt-labs/dbt-core
13,051Auf GitHub ansehen
dbt-core is a command-line framework for transforming data within a warehouse using modular SQL and version control. It functions as a data transformation engine that enables users to define data structures and business logic through declarative configuration files, which the system then compiles into executable code. By managing complex data dependencies through a directed acyclic graph, it ensures that transformation tasks execute in the correct order while maintaining a manifest-driven state to track lineage and execution history. The project distinguishes itself through an adapter-based d
Generates database schemas and tables without data to verify structure and dependencies during development.
Rustanalyticsbusiness-intelligencedata-modeling
Auf GitHub ansehen13,051
hypothesisworks/hypothesis
HypothesisWorks/hypothesis
8,717Auf GitHub ansehen
Hypothesis is a Python property-based testing library and data generation engine. It enables the discovery of edge cases and bugs by generating a wide range of randomized inputs based on defined strategies and shrinking complex failing examples to their smallest possible form. It also functions as a state machine testing framework to verify system behavior across sequences of interdependent operations. The project features a fuzzing integration layer that converts raw byte buffers from coverage-guided fuzzers into structured test cases. It includes a persistence mechanism to store and synchro
Generates basic scalar data types such as integers, floats, and booleans for testing.
Pythonfuzzingproperty-based-testingpython
Auf GitHub ansehen8,717
yangjianxin1/firefly
yangjianxin1/Firefly
6,642Auf GitHub ansehen
Firefly is a training framework and inference engine for large language models. It functions as a toolkit for pre-training and fine-tuning various open-weight architectures, providing a system for model alignment and parameter-efficient fine-tuning. The project includes utilities for merging adapter weights back into base models to create standalone files. It also provides a model alignment toolkit to format training data according to specific prompt templates, ensuring conversational consistency across different models. The framework supports distributed model training and preference-based
Formats raw training datasets into specific prompt structures to ensure conversational consistency across different models.
Pythonalpacaaquilabaichuan
Auf GitHub ansehen6,642
inventree/inventree
inventree/InvenTree
6,350Auf GitHub ansehen
InvenTree is an open-source inventory management platform built on Django, designed for tracking parts, stock levels, and supply chain operations through a web interface and REST API. The system uses barcodes—including QR codes, 1D barcodes, and Data Matrix codes—as primary identifiers for scanning, linking, and triggering inventory actions, and extends core functionality through a Python plugin framework supporting custom actions, UI panels, barcode handlers, and scheduled tasks. The platform distinguishes itself through a comprehensive plugin-based extensibility system that allows custom in
Builds the entire data layer on Django's ORM, using model introspection to drive API endpoints and serialization.
Pythondjangohacktoberfestpython
Auf GitHub ansehen6,350
tortoise/tortoise-orm
tortoise/tortoise-orm
5,582Auf GitHub ansehen
Tortoise ORM is an asynchronous object-relational mapper for Python that mirrors Django's model and queryset API while running on asyncio. It defines database tables as Python classes with typed fields and supports foreign key, many-to-many, and one-to-one relations, providing a chainable query API for filtering, annotating, grouping, and prefetching related objects without blocking the event loop. The ORM includes a built-in migration engine that detects model changes, generates migration files, and applies or reverts schema changes through a command-line tool. It connects to PostgreSQL, MyS
Defines database tables as Python classes with typed fields, mirroring Django's model API for familiar object-oriented data modeling.
Pythonasyncasynciomysql
Auf GitHub ansehen5,582
querydsl/querydsl
querydsl/querydsl
4,964Auf GitHub ansehen
Querydsl ist ein Framework für die Konstruktion typsicherer Abfragen. Es verwendet eine Fluent-API und annotationsbasierte Codegenerierung, um Spiegelklassen aus Domänenmodellen zu erstellen, was eine Validierung von Datenbankabfragen zur Kompilierzeit ermöglicht und die Notwendigkeit manueller String-Verkettung beseitigt. Das Projekt bietet eine einheitliche Abfragesyntax, die in spezifische Dialekte für mehrere Backends übersetzt wird, einschließlich SQL, MongoDB, Lucene und JDO. Es unterstützt fortgeschrittene Abfragefunktionen wie Common Table Expressions, Window-Funktionen, geospatiale Operationen und komplexe verschachtelte Subqueries. Über den Datenabruf hinaus deckt das Framework die typsichere DML-Ausführung für Massenaktualisierungen und -löschungen sowie das Result-Mapping in Java-Beans, Konstruktoren oder Tupel ab. Es beinhaltet Unterstützung für In-Memory-Collection-Abfragen und integriert sich in das Spring-Framework für Transaktionsmanagement und Verbindungsverwaltung.
Creates JavaBean classes based on database tables to simplify data transfer and the population of data clauses.
Java
Auf GitHub ansehen4,964
carltongibson/django-filter
carltongibson/django-filter
4,683Auf GitHub ansehen
django-filter ist eine Bibliothek zum Filtern von Django QuerySets durch Zuordnung von URL-Parametern zu Datenbankabfragen unter Verwendung deklarativer Syntax. Sie fungiert als Modellfilter-Generator und Datenbank-Abfragetool, das Anfrageparameter in Abfragen übersetzt, um Datenergebnisse zu verfeinern. Das Projekt ermöglicht die automatische Generierung von Suchfiltern und benutzerorientierten Formularen basierend auf Modellfelddefinitionen. Es unterstützt fortgeschrittene Abfragelogik durch benutzerdefinierte Suchmethoden, Beziehungspfade und Volltextsuche-Integration, während es ein System zur Verfeinerung von Ergebnissen innerhalb von REST-API-Endpunkten bereitstellt. Der Funktionsumfang umfasst die Erstellung interaktiver Filterformulare, das Rendern gefilterter Listenansichten und die Verwendung von Factory-Funktionen für die dynamische Instanziierung von Filtersets. Das System handhabt zudem typspezifische Filterung für Booleans, UUIDs und ISO 8601-Zeitstempel sowie anfragebasierte Filterung unter Verwendung der authentifizierten Benutzeridentität.
Automatically creates search filters and user-facing forms based on Django model field definitions.
Python
Auf GitHub ansehen4,683
jqhph/dcat-admin
jqhph/dcat-admin
4,022Auf GitHub ansehen
dcat-admin ist ein Laravel-Admin-Panel-Framework, das verwendet wird, um datengesteuerte Administrationsschnittstellen schnell zu erstellen. Es fungiert als CRUD-Generator und Backend-Scaffolding-Tool, das automatisch Schnittstellen zum Erstellen, Lesen, Aktualisieren und Löschen basierend auf Datenbanktabellenschemata produziert. Das System zeichnet sich durch eine plugin-basierte Erweiterungsarchitektur und die Möglichkeit aus, mehrere unabhängige administrative Instanzen innerhalb einer einzigen Installation auszuführen. Es bietet spezialisierte Tools für das Mapping externer APIs auf Formulare und Tabellen sowie einen ereignisgesteuerten Formular-Lebenszyklus für die Ausführung benutzerdefinierter Logik während der Auflösung und Übermittlung. Das Framework deckt ein breites Spektrum an Funktionsbereichen ab, einschließlich rollenbasierter Zugriffskontrolle für die Verwaltung hierarchischer Berechtigungen, einer umfassenden Suite von Datenverwaltungs-Grids mit Inline-Bearbeitung und mehrstufigen Formular-Workflows. Es enthält zudem Datenvisualisierungstools für operative Dashboards und eine Vielzahl von Content-Handling-Utilities für stückweise große Datei-Uploads und Rich-Text-Bearbeitung. Kommandozeilen-Utilities werden bereitgestellt, um die Generierung administrativer Komponenten und Action-Klassen zu automatisieren.
Implements a model-driven architecture where database schemas and model attributes drive the generation of CRUD views.
PHPlaravellaravel-admin
Auf GitHub ansehen4,022
xo/dbtpl
xo/dbtpl
3,892Auf GitHub ansehen
dbtpl ist ein Datenbankmodell-Generator und Schema-Introspektor, der typsicheren Quellcode und Datenmodelle aus SQL-Datenbankschemata erzeugt. Er fungiert als Kommandozeilentool, das SQL-Typen auf Go-Structs und Ausführungsfunktionen abbildet, während es ein Template-basiertes System zur Erstellung benutzerdefinierter Quellcodes und Schemadefinitionen bereitstellt. Das Tool unterstützt mehrere Datenbank-Engines, darunter PostgreSQL, MySQL, SQLite, Oracle und SQL Server. Es zeichnet sich dadurch aus, dass Benutzer benutzerdefinierte Ausgabeformate über Text-Templates definieren können und die Möglichkeit besteht, kompatible Modelldefinitionen für das Django-Webframework zu generieren. Seine Funktionen erstrecken sich auf die Erstellung von SQL-Skripten für Datenbankmigrationen und -spiegelungen sowie den Export von Schemadarstellungen in JSON- oder YAML-Formaten. Zusätzlich kann es Dot-Dateien generieren, um Datenbankbeziehungen und Strukturdiagramme zu visualisieren.
Produces compatible model definitions for the Django web framework based on database schemas or SQL queries.
Gocode-generatorgolangmicrosoft-sql-server
Auf GitHub ansehen3,892
wisser/jailer
Wisser/Jailer
3,130Auf GitHub ansehen
Jailer is a suite of specialized tools for AI-assisted SQL management, referential integrity preservation, and relational data browsing. It provides a system for generating referentially intact database subsets, allowing users to extract consistent slices of relational data while preserving foreign key constraints and dependencies. The project features an AI-driven SQL assistant that uses natural language to generate, optimize, and refactor queries based on database schemas. It also includes a data migration tool that analyzes SQL patterns to reverse engineer models and map associations betwe
Generates a relational data extraction model based on rows selected manually within a data browser.
Javadatabasedb2export
Auf GitHub ansehen3,130