awesome-repositories.com
© 2026 Bringes Technology SRL · VAT RO45896025 · hello@bringes.io
Data & Databases · Awesome GitHub Repositories

171 repos

This category covers data storage, management, processing, analysis, and various database technologies and their operations.

Explore 171 awesome GitHub repositories in the Data & Databases category.

Awesome Data & Databases GitHub Repositories

  • huggingface/transformers

    156,730 · GitHub

    Transformers is a comprehensive library for machine learning that provides a unified interface for training, fine-tuning, and deploying transformer-based models. It supports a wide range of tasks, including text classification, language modeling, question answering, and sequence-to-sequence translation, while offering

    Python · audio · deep-learning · deepseek
  • Snailclimb/JavaGuide

    153,828 · GitHub

    This project is a comprehensive educational repository providing technical documentation and learning materials across a wide range of computer science and software engineering domains. It serves as a centralized knowledge base for developers, covering core programming concepts, database management, distributed systems

    Java · algorithms · distributed-systems · interview
  • yt-dlp/yt-dlp

    147,702 · GitHub

    This project is a command-line media downloader designed for the systematic retrieval and organization of digital content from diverse online platforms. It functions as an extensible extraction engine that utilizes a declarative format-selection pipeline to automate the identification, merging, and downloading of speci

    Python · cli · downloader · python
  • f/prompts.chat

    145,637 · GitHub

    Prompts.chat is a community-driven repository and management platform for AI prompts and agent skills. It provides a centralized interface for users to search, retrieve, and save prompts, while offering structured storage for multi-file agent skills that include documentation and supporting assets. The platform distin

    HTML · ai · artificial-intelligence · awesome-list
  • 521xueweihan/HelloGitHub

    143,312 · GitHub

    HelloGitHub is a centralized discovery platform and technical knowledge repository designed to help developers identify high-quality open-source projects, libraries, and infrastructure. It functions as a structured directory that aggregates specialized development tools and educational materials, organizing them by tec

    Python · awesome · github · hellogithub
  • golang/go

    132,649 · GitHub

    Go is a statically typed, compiled programming language designed for building scalable, concurrent software. It provides a memory-safe execution environment that combines a high-performance runtime with a self-hosting compiler toolchain, enabling the creation of statically linked machine code binaries without external

    Go · go · golang · language
  • microsoft/PowerToys

    129,929 · GitHub

    PowerToys is a collection of background-resident system utilities designed to extend native operating system functionality and streamline desktop workflows. It operates as a modular toolkit, utilizing a central plugin-based host architecture that allows users to dynamically enable or disable specific features for syste

    C# · advanced-paste · color-picker · command-palette
  • langgenius/dify

    129,826 · GitHub

    Dify is a self-hosted platform designed for the orchestration of multi-container application stacks. It provides a unified environment for managing complex service deployments, coordinating background worker processes, and maintaining database dependencies through standardized configuration files. The platform disting

    TypeScript · agent · agentic-ai · agentic-framework
  • langchain-ai/langchain

    127,015 · GitHub

    LangChain is an orchestration framework designed for building, managing, and deploying applications powered by large language models. It provides a unified integration layer that normalizes disparate model provider APIs into a consistent set of primitives, enabling developers to build complex, multi-step AI workflows t

    Python · agents · ai · ai-agents
  • kubernetes/kubernetes

    120,673 · GitHub

    Kubernetes is a distributed container orchestration platform that automates the deployment, scaling, and management of containerized applications across clusters of computing nodes. It functions as a declarative infrastructure controller, utilizing a control loop architecture that continuously monitors the current syst

    Go · cncf · containers · go
  • ripienaar/free-for-dev

    118,073 · GitHub

    This project is a community-maintained directory of technical resources, tools, and services that offer free tiers for developers. It serves as a centralized reference point for discovering infrastructure, software, and educational materials, helping individuals and teams minimize operational costs while building and s

    HTML · awesome-list · free-for-developers
  • excalidraw/excalidraw

    117,138 · GitHub

    This project is a virtual whiteboard component and vector graphics editor designed for creating diagrams with a hand-drawn aesthetic. It provides a canvas-based drawing engine that can be embedded directly into web applications, allowing users to manipulate shapes, upload images, and export visual data into standard fo

    TypeScript · canvas · collaboration · diagrams
  • justjavac/free-programming-books-zh_CN

    116,327 · GitHub

    This project is a centralized, community-vetted repository that serves as a comprehensive hub for free technical literature and educational resources. It functions as an open-source directory, aggregating links to books, tutorials, and documentation to support developers in mastering diverse programming languages, soft

    android · angular · books
  • nodejs/node

    115,848 · GitHub

    This project is an open-source JavaScript runtime built on the V8 engine. It provides a comprehensive environment for executing JavaScript code outside of a web browser, offering foundational primitives for process management, multi-core load distribution, and parallel execution through worker threads. The runtime inc

    JavaScript · javascript · js · linux
  • d3/d3

    112,379 · GitHub

    D3 is a modular library providing low-level primitives for creating data-driven visualizations. It functions as a flexible framework that allows for direct control over visual presentation by mapping abstract data dimensions to graphical properties, such as position, color, and size, without imposing predefined chart a

    Shell · chart · charts · d3
  • iptv-org/iptv

    111,772 · GitHub

    This project is a community-maintained, open-source repository that functions as a centralized directory for streaming metadata. It aggregates publicly available network stream links and organizes them into standardized, machine-readable playlist formats. By acting strictly as a metadata-only index, the platform enable

    TypeScript · iptv · m3u · playlist
  • godotengine/godot

    106,855 · GitHub

    Godot is a comprehensive, node-based game engine designed for building interactive 2D and 3D applications. It provides an integrated development environment that utilizes a hierarchical scene system to organize objects, propagate spatial transformations, and manage lifecycle events. The engine functions as a cross-plat

    C++ · game-development · game-engine · gamedev
  • denoland/deno

    106,258 · GitHub

    Deno is a high-performance runtime for JavaScript and TypeScript that prioritizes security and developer productivity. Built on the V8 engine, it provides a secure execution environment that enforces a default-deny security model, requiring explicit user authorization for access to system resources like the file system

    Rust · deno · javascript · rust
  • GrowingGit/GitHub-Chinese-Top-Charts

    106,216 · GitHub

    This project functions as a curated software directory and developer resource index, providing a centralized platform for discovering and evaluating high-quality open-source repositories. It serves as an aggregator that monitors trending software and educational resources, organizing them by technical domain and progra

    Java
  • Comfy-Org/ComfyUI

    103,654 · GitHub

    ComfyUI is a node-based generative AI orchestration engine designed for constructing, testing, and executing complex image and video synthesis pipelines. By utilizing a directed acyclic graph execution model, the platform allows users to build reproducible workflows through modular, interconnected processing blocks wit

    Python · ai · comfy · comfyui

Browse tags

  • API Data Management (1 sub-tag): Mechanisms for filtering or selecting specific data fields returned by an application programming interface.
  • API Data Retrieval (1 sub-tag): Tools and logic for managing how large datasets are requested and broken into manageable chunks from remote services.
  • API Layers (1 sub-tag): Middleware and frameworks that provide an interface layer between client applications and underlying data sources.
  • Asynchronous Data Handling (2 sub-tags): Utilities for managing non-blocking data operations and background tasks to maintain application responsiveness.
  • Automation Scripting APIs (2 sub-tags): Programming interfaces designed to automate the manipulation and management of external system data and user records.
  • Cloud Storage Integrations (1 sub-tag): Connectors and drivers that enable applications to interact with remote object storage and cloud-based file systems.
  • Community Analytics (1 sub-tag): Tools for measuring and visualizing the activity, engagement, and contributions of members within a community.
  • Community Data Platforms (1 sub-tag): Platforms that centralize and synchronize data contributed by multiple users or sources into a unified repository.
  • Data Abstraction Layers (5 sub-tags): Software layers that provide a unified interface for interacting with diverse storage backends and data structures.
  • Data Access Patterns (1 sub-tag): Methodologies and low-level techniques for reading from and writing to data storage systems efficiently.
  • Data Access and Querying (8 sub-tags): Interfaces, query languages, and abstraction layers used to interact with and retrieve data from storage systems.
  • Data Analysis & Visualization (12 sub-tags): This group focuses on tools and techniques for analyzing, interpreting, and visually representing data.
  • Data Architectures (2 sub-tags): Structural designs and organizational patterns for managing, partitioning, and modeling complex data systems.
  • Data Categories (1 sub-tag): Collections of structured information categorized by specific themes or temporal characteristics.
  • Data Collection (2 sub-tags): Systems and automated processes designed to gather, harvest, and ingest information from external sources.
  • Data Collection Infrastructure (1 sub-tag): Scalable frameworks and distributed systems built to support large-scale data gathering and web crawling operations.
  • Data Collections & Datasets (12 sub-tags): This group comprises various types of data collections and datasets, including domain-specific and open data.
  • Data Compression (2 sub-tags): Algorithms and utilities that reduce the size of data for efficient storage and transmission.
  • Data Consistency Models (1 sub-tag): Frameworks defining how data updates are propagated and synchronized across distributed nodes.
  • Data Containers (1 sub-tag): Foundational structures and base classes used to encapsulate and organize data for application use.
  • Data Conversion (1 sub-tag): Utilities for transforming data from one representation or encoding to another.
  • Data Deduplication (2 sub-tags): Tools that identify and remove redundant information to optimize storage space and data integrity.
  • Data Distribution Patterns (1 sub-tag): Standardized formats and protocols for sharing and distributing data across different systems and languages.
  • Data Domains (1 sub-tag): Specialized datasets focused on specific industry sectors or subject matter areas.
  • Data Engineering and Infrastructure (5 sub-tags): Foundational tools for large-scale data collection, ingestion, storage management, and reliability.
  • Data Engines (1 sub-tag): Core processing engines that manage data storage, retrieval, and synchronization, often optimized for local environments.
  • Data Export (1 sub-tag): Tools for extracting and formatting data from internal systems for external use or archival.
  • Data Export Formats (1 sub-tag): Specific file types and schemas used for outputting data, including specialized formats like OCR results.
  • Data Extensions (1 sub-tag): Add-ons and plugins that extend the functionality of database systems to support bulk operations.
  • Data Filtering Strategies (1 sub-tag): Logic and rulesets for excluding or including specific data points based on defined criteria.
  • Data Filtering Utilities (1 sub-tag): Functional utilities for processing and refining tabular data or lists based on user-defined filters.
  • Data Formatting (1 sub-tag): Tools that transform raw data into human-readable formats or standardized visual representations.
  • Data Framing (1 sub-tag): Mechanisms for structuring and delimiting data streams to ensure correct parsing during transmission.
  • Data Governance and Modeling (6 sub-tags): Frameworks for defining schemas, ensuring standardization, and managing data assets and sovereignty.
  • Data Handling (1 sub-tag): General-purpose libraries and tools for managing, serializing, and processing data within an application.
  • Data Inspection (1 sub-tag): Utilities for viewing, debugging, and formatting raw data for easier human analysis.
  • Data Integration & Synchronization (13 sub-tags): This group covers tools and strategies for integrating and synchronizing data across different systems.
  • Data Integration Architectures (1 sub-tag): Frameworks and patterns for moving and transforming data between disparate systems and storage environments.
  • Data Interoperability (1 sub-tag): Standards and protocols that enable different software systems to exchange and interpret shared data structures.
  • Data Management (11 sub-tags): Tools and utilities for maintaining, organizing, protecting, and migrating data throughout its operational lifecycle.
  • Data Management Interfaces (1 sub-tag): Graphical or programmatic interfaces designed for viewing, editing, and managing tabular data sets.
  • Data Operations (1 sub-tag): Systems and workflows focused on the routine maintenance and manipulation of individual data records.
  • Data Organization Tools (1 sub-tag): Software designed to categorize, index, and structure information for improved accessibility and retrieval.
  • Data Platforms (3 sub-tags): Comprehensive environments that provide specialized infrastructure for storing, analyzing, and monitoring specific types of data.
  • Data Preparation (1 sub-tag): Tools that clean, format, and segment raw data to prepare it for downstream analysis or ingestion.
  • Data Processing Extensions (1 sub-tag): Add-on components that enhance database functionality by performing specialized data cleaning or refinement tasks.
  • Data Processing Models (1 sub-tag): Architectural approaches for processing data streams, such as handling information in discrete packets.
  • Data Processing Patterns (1 sub-tag): Standardized methods and techniques for converting data structures into formats suitable for storage or transmission.
  • Data Processing Pipelines (18 sub-tags): Systems and workflows for ingesting, transforming, and orchestrating high-throughput data processing tasks.
  • Data Processing Services (1 sub-tag): Managed services that automate the delivery and ingestion of data from external sources.
  • Data Processing Utilities (4 sub-tags): Libraries and algorithms used to perform specific data manipulation tasks like deduplication, streaming, or reduction.
  • Data Recovery Tools (1 sub-tag): Specialized utilities designed to reconstruct or recover corrupted or misaligned data files.
  • Data Redundancy (1 sub-tag): Techniques and algorithms that ensure data availability and fault tolerance through redundant storage methods.
  • Data Resources (1 sub-tag): Datasets and reference materials used to support knowledge discovery and information research.
  • Data Serialization Formats (8 sub-tags): Libraries and protocols that define how data is encoded, structured, and serialized for storage or network transport.
  • Data Sharing (2 sub-tags): Mechanisms that allow controlled access to data sets by sharing specific views or base data structures.
  • Data Stores (1 sub-tag): Storage systems engineered to maintain strict data consistency across distributed environments.
  • Data Synchronization Engines (1 sub-tag): Engines that maintain consistency between multiple data sources by propagating changes in real time.
  • Data Templating (1 sub-tag): Tools for defining and applying patterns to format data, particularly for temporal or string-based values.
  • Data Transfer (1 sub-tag): Infrastructure components designed to move large volumes of data across network boundaries efficiently.
  • Database Access Patterns (1 sub-tag): Standardized methods for retrieving and iterating through database records using cursors or similar mechanisms.
  • Database Concepts (3 sub-tags): Fundamental principles and architectural components that define how databases operate, store, and manage data integrity.
  • Database Design Patterns (2 sub-tags): Best practices for modeling data structures and enforcing attribute validation within database schemas.
  • Database Extensions (1 sub-tag): Plugins and add-ons that provide additional functionality or features for specific database management systems.
  • Database Infrastructure (2 sub-tags): Middleware and routing components that manage connections and traffic between applications and database clusters.
  • Database Management Systems (8 sub-tags): Core engines, storage architectures, and operational configurations for persistent data management.
  • Database Resources (1 sub-tag): Reference materials and documentation specifically focused on relational database systems.
  • Database Services (1 sub-tag): Managed cloud-based offerings that provide database hosting, maintenance, and operational support.
  • Dataset Management (4 sub-tags): Collections of annotated media and structured data specifically curated for training and evaluating machine learning computer vision models.
  • Enterprise Data Platforms (1 sub-tag): Centralized systems that provide organizational access to large-scale data repositories and internal information discovery tools.
  • File Processing (1 sub-tag): Tools designed to transform, convert, or manipulate the structure and format of digital files.
  • Geospatial Data & Services (9 sub-tags): This group includes services, tools, and data related to geographical information and location.
  • Graph Computing Systems (3 sub-tags): Technologies for modeling, processing, and analyzing data based on graph theory and relational connections.
  • Processor Utilities (1 sub-tag): Specialized software components that perform specific data transformations based on the input media or data type.
  • Public Data APIs (1 sub-tag): Interfaces that provide programmatic access to publicly available datasets and government or institutional information services.
  • Public Welfare APIs (1 sub-tag): Programming interfaces that facilitate access to data regarding social services, community support, and charitable initiatives.
  • SQL Development (1 sub-tag): Software environments and utilities that assist developers in writing, testing, and refining structured query language code.
  • Search and Indexing Technologies (3 sub-tags): Specialized tools for indexing, searching, and retrieving information across diverse data stores.
  • Storage Abstraction (3 sub-tags): Middleware layers that provide a unified interface for interacting with diverse underlying storage backends and hardware.
  • Storage Adapters (2 sub-tags): Software connectors that enable applications to interface with specific cloud or local storage systems.
  • Storage Architectures (2 sub-tags): Structural patterns and methodologies for organizing, indexing, and retrieving data within a storage system.
  • Storage Integrations (1 sub-tag): Tools and utilities that connect storage systems to external authentication, security, or management workflows.
  • Storage Management Tools (1 sub-tag): Administrative utilities that allow users to configure, monitor, and maintain storage resources via command-line interfaces.
  • Storage Services (2 sub-tags): Managed infrastructure solutions that provide persistent storage capabilities for files and data objects.
  • Text Processing Utilities (3 sub-tags): Libraries and tools specifically designed for extracting, inspecting, and manipulating textual data.
  • Vector Embeddings (1 sub-tag): Algorithms and services that convert unstructured data into numerical representations for machine learning applications.
  • Visual Data Management (1 sub-tag): Interfaces and dashboards designed to visualize, inspect, and manage complex data structures.