awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Data Management & Governance · Awesome GitHub Repositories

26 repos

Awesome GitHub RepositoriesData Management & Governance

Frameworks and policies that ensure data quality, security, compliance, and lifecycle management across an organization.

Explore 26 awesome GitHub repositories matching data & databases · Data Management & Governance. Refine with filters or upvote what's useful.

  1. Home
  2. Data & Databases
  3. Data Governance and Modeling
  4. Data Management & Governance

Awesome Data Management & Governance GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • sindresorhus/awesome

    sindresorhus/awesome

    438,690GitHubView on GitHub↗

    This project is a community-curated knowledge base that organizes vast technical ecosystems into a hierarchical, human-readable directory. It serves as a comprehensive index of libraries, frameworks, and methodologies, designed to facilitate discovery and professional development across the entire spectrum of software

    awesomeawesome-listlists
  • public-apis/public-apis

    public-apis/public-apis

    399,192GitHubView on GitHub↗

    This project is a comprehensive, community-driven directory of public service endpoints designed to facilitate the discovery and integration of external data sources. It serves as a centralized registry where developers can locate reliable third-party APIs to augment their applications with specialized functionality, r

    Pythonapiapisdataset
  • vinta/awesome-python

    vinta/awesome-python

    283,687GitHubView on GitHub↗

    This project is a comprehensive, community-curated directory that organizes a vast landscape of Python software libraries, frameworks, and tools. It serves as a centralized knowledge base designed to facilitate ecosystem navigation and accelerate developer discovery across the entire software development lifecycle. Th

    Pythonawesomecollectionspython
  • awesome-selfhosted/awesome-selfhosted

    awesome-selfhosted/awesome-selfhosted

    274,152GitHubView on GitHub↗

    This project is a comprehensive, curated repository of self-hosted software designed to assist users in discovering and evaluating applications for private server environments. It organizes a vast array of tools into categories spanning communication, infrastructure, media, and productivity, providing a centralized res

    awesomeawesome-listcloud
  • avelino/awesome-go

    avelino/awesome-go

    165,543GitHubView on GitHub↗

    This project serves as a comprehensive language ecosystem index, functioning as a centralized, community-curated directory for the Go programming language. It organizes a vast landscape of software components, libraries, and development tools into a structured, navigable hierarchy, enabling developers to efficiently di

    Goawesomeawesome-listgo
  • Snailclimb/JavaGuide

    Snailclimb/JavaGuide

    153,828GitHubView on GitHub↗

    This project is a comprehensive educational repository providing technical documentation and learning materials across a wide range of computer science and software engineering domains. It serves as a centralized knowledge base for developers, covering core programming concepts, database management, distributed systems

    Javaalgorithmsdistributed-systemsinterview
  • langchain-ai/langchain

    langchain-ai/langchain

    127,015GitHubView on GitHub↗

    LangChain is an orchestration framework designed for building, managing, and deploying applications powered by large language models. It provides a unified integration layer that normalizes disparate model provider APIs into a consistent set of primitives, enabling developers to build complex, multi-step AI workflows t

    Pythonagentsaiai-agents
  • iptv-org/iptv

    iptv-org/iptv

    111,772GitHubView on GitHub↗

    This project is a community-maintained, open-source repository that functions as a centralized directory for streaming metadata. It aggregates publicly available network stream links and organizes them into standardized, machine-readable playlist formats. By acting strictly as a metadata-only index, the platform enable

    TypeScriptiptvm3uplaylist
  • immich-app/immich

    immich-app/immich

    92,953GitHubView on GitHub↗

    Immich is a self-hosted media management platform designed to provide a centralized, private repository for photos and videos. It functions as a comprehensive system for organizing, backing up, and viewing personal media collections across mobile devices, web browsers, and external storage locations. By maintaining ful

    TypeScriptbackup-toolfluttergoogle-photos
  • gin-gonic/gin

    gin-gonic/gin

    88,134GitHubView on GitHub↗

    Gin is a web framework designed for building high-performance web services and APIs. It functions as a middleware-oriented engine that processes incoming HTTP requests through a sequential chain of handlers, allowing for the modular management of cross-cutting concerns such as authentication and logging. The framework

    Goframeworkgingo
  • home-assistant/core

    home-assistant/core

    84,936GitHubView on GitHub↗

    Home Assistant is a centralized home automation platform designed to orchestrate diverse internet-connected devices and services. It functions as a local-first control system that normalizes heterogeneous hardware protocols into a unified set of entities, attributes, and services. The core architecture relies on an eve

    Pythonasynciohacktoberfesthome-automation
  • laravel/laravel

    laravel/laravel

    83,758GitHubView on GitHub↗

    Laravel is a comprehensive full-stack web framework designed for building scalable server-side applications. It provides an integrated development environment that centers on an object-relational mapper for database abstraction, a robust routing system, and a sophisticated service container for dependency injection. Th

    Bladeframeworklaravelphp
  • syncthing/syncthing

    syncthing/syncthing

    80,036GitHubView on GitHub↗

    Syncthing is a decentralized file synchronization engine that maintains consistent data states across multiple devices through peer-to-peer mesh networking. It operates as a background daemon that automatically replicates file creations, modifications, and deletions between trusted nodes without requiring central serve

    Gogop2ppeer-to-peer
  • elastic/elasticsearch

    elastic/elasticsearch

    76,163GitHubView on GitHub↗

    Elasticsearch is a distributed search engine and document store designed for the high-performance indexing and retrieval of massive volumes of unstructured data. It functions as a centralized analytics platform, providing a schema-flexible architecture that organizes information into searchable indices while maintainin

    Javaelasticsearchjavasearch-engine
  • infiniflow/ragflow

    infiniflow/ragflow

    73,425GitHubView on GitHub↗

    This project is a comprehensive retrieval-augmented generation platform designed for building, managing, and deploying knowledge-based AI applications. It provides a unified environment for organizing datasets, configuring conversational chat assistants, and developing autonomous agents that execute multi-step reasonin

    Pythonagentagenticagentic-ai
  • redis/redis

    redis/redis

    73,096GitHubView on GitHub↗

    Redis is an in-memory, key-value database designed to provide sub-millisecond latency for read and write operations. It functions as a versatile data platform, serving as a distributed cache, a message broker, a NoSQL document store, and a vector database. The system utilizes an event-driven, single-threaded loop to pr

    Ccachecachingdatabase
  • apache/superset

    apache/superset

    70,587GitHubView on GitHub↗

    Superset is a web-based business intelligence platform designed for data exploration, visualization, and interactive dashboarding. It functions as a query-driven analytics engine that connects to various SQL databases, allowing users to perform ad-hoc analysis, define virtual metrics, and build complex data visualizati

    TypeScriptanalyticsapacheapache-superset
  • sindresorhus/awesome-nodejs

    sindresorhus/awesome-nodejs

    65,038GitHubView on GitHub↗

    This project is a community-driven directory that aggregates essential software projects and educational content for the Node.js ecosystem. It functions as a centralized knowledge base and discovery index, designed to simplify the navigation of a fragmented technical landscape by providing a structured collection of hi

    awesomeawesome-listjavascript
  • nocodb/nocodb

    nocodb/nocodb

    62,131GitHubView on GitHub↗

    NocoDB is a visual platform that transforms relational databases into collaborative, spreadsheet-style workspaces. By acting as a headless database backend, it provides a unified environment for designing database structures, managing record relationships, and interacting with data without requiring manual SQL queries.

    TypeScriptairtableairtable-alternativeautomatic-api
  • minio/minio

    minio/minio

    60,346GitHubView on GitHub↗

    MinIO is a software-defined, cloud-native object storage server designed to manage large volumes of unstructured data. It functions as a distributed storage cluster that aggregates multiple independent nodes into a unified, scalable pool, providing a high-performance infrastructure compatible with standard cloud storag

    Goamazon-s3cloudcloudnative
Prev12Next

Explore sub-tags

  • Backup and Recovery Systems4 sub-tagsTools focused on point-in-time data protection, system restoration, and disaster recovery workflows.
  • Crowdsourced DatasetsDatasets maintained through collaborative community contributions and peer-reviewed updates.
  • Data Governance1 sub-tagFrameworks and policies for managing the quality, security, and compliance of organizational data assets.
  • Data Integrity and Validation3 sub-tags
Utilities for enforcing structural consistency, schema compliance, and quality assurance across datasets.
  • Data Lifecycle and Retention3 sub-tagsAutomated policies and architectures for managing data aging, storage tiering, and archival compliance.
  • Data Management Tools5 sub-tagsAdministrative tools used to organize, track, and maintain metadata and data assets throughout their lifecycle.
  • Data Migration ServicesCapabilities for moving data between distributed storage clusters.
  • Data Path ConfigurationsSettings and utilities for defining where session data, logs, and agent customizations are stored.
  • Database Infrastructure Components2 sub-tagsFoundational software engines, drivers, and caching mechanisms for persistent data storage and access.
  • Dataset Orchestration APIs2 sub-tagsProgrammatic interfaces for managing, updating, and deleting specific data records or dataset configurations.
  • Distributed IdentifiersSystems for generating unique IDs across multiple nodes or shards.
  • File Synchronization ToolsSoftware for managing, synchronizing, and sharing files across multiple devices and endpoints.
  • File Versioning SystemsMechanisms for tracking file changes, including trash recovery and historical state management.
  • Metadata Management Systems3 sub-tagsSpecialized tools for organizing, indexing, and storing descriptive information about data assets.
  • Statistics Data ManagementTools for viewing, analyzing, and correcting long-term historical entity data.