awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Data & Databases · Awesome GitHub Repositories

168 repos

Awesome GitHub RepositoriesData & Databases

This category covers data storage, management, processing, analysis, and various database technologies and their operations.

Explore 168 awesome GitHub repositories matching data & databases · Data & Databases. Refine with filters or upvote what's useful.

  1. Home
  2. Data & Databases

Awesome Data & Databases GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • laravel/laravel

    laravel/laravel

    83,758GitHubView on GitHub↗

    Laravel is a comprehensive full-stack web framework designed for building scalable server-side applications. It provides an integrated development environment that centers on an object-relational mapper for database abstraction, a robust routing system, and a sophisticated service container for dependency injection. Th

    Bladeframeworklaravelphp
  • louislam/uptime-kuma

    louislam/uptime-kuma

    82,999GitHubView on GitHub↗

    Uptime Kuma is a self-hosted monitoring platform designed to track the availability and performance of network services and websites. It functions as a centralized dashboard that executes asynchronous health checks on a scheduled interval, providing real-time visibility into infrastructure health and service uptime. T

    JavaScriptdockermonitormonitoring
  • macrozheng/mall

    macrozheng/mall

    82,926GitHubView on GitHub↗

    This project is an enterprise-grade Java framework designed for building scalable, full-stack e-commerce applications. It provides a comprehensive foundation for microservice-based distributed architectures, enabling the development of complex retail platforms that include product management, order processing, and secu

    Javadockerelasticsearchelk
  • MunGell/awesome-for-beginners

    MunGell/awesome-for-beginners

    82,766GitHubView on GitHub↗

    This project is a curated directory of software repositories specifically selected to help newcomers make their first open-source contributions. It serves as a collaborative knowledge base that aggregates entry-level development opportunities, providing a structured path for novice developers to practice version contro

    awesomeawesome-listbeginner-project
  • bregman-arie/devops-exercises

    bregman-arie/devops-exercises

    81,169GitHubView on GitHub↗

    This project is a comprehensive educational curriculum designed to build proficiency across modern infrastructure, cloud-native technologies, and systems administration. It functions as a reference library and interview preparation resource, offering a structured collection of conceptual questions, practical coding cha

    Pythonansibleawsazure
  • punkpeye/awesome-mcp-servers

    punkpeye/awesome-mcp-servers

    81,101GitHubView on GitHub↗

    This project serves as a centralized directory and interoperability hub for the Model Context Protocol, providing a curated collection of standardized service connectors that bridge artificial intelligence models with external software, databases, and APIs. It facilitates the integration of AI agents with diverse ecosy

    aimcp
  • DopplerHQ/awesome-interview-questions

    DopplerHQ/awesome-interview-questions

    81,035GitHubView on GitHub↗

    This project is a comprehensive, community-sourced repository of technical interview questions and study materials. It serves as a centralized index for software engineers to prepare for technical assessments, benchmark their personal knowledge, and identify gaps in their expertise across a wide range of programming la

    android-interview-questionsangularjs-interview-questionsawesome
  • syncthing/syncthing

    syncthing/syncthing

    80,036GitHubView on GitHub↗

    Syncthing is a decentralized file synchronization engine that maintains consistent data states across multiple devices through peer-to-peer mesh networking. It operates as a background daemon that automatically replicates file creations, modifications, and deletions between trusted nodes without requiring central serve

    Gogop2ppeer-to-peer
  • hacksider/Deep-Live-Cam

    hacksider/Deep-Live-Cam

    79,568GitHubView on GitHub↗

    Deep-Live-Cam is a generative video transformation tool designed for real-time facial manipulation and cinematic enhancement. It functions as a local-first AI runtime, performing all media processing directly on the user's hardware to ensure complete data privacy without external network dependencies. By utilizing a hi

    Pythonaiai-deep-fakeai-face
  • modelcontextprotocol/servers

    modelcontextprotocol/servers

    79,000GitHubView on GitHub↗

    The Model Context Protocol is a standardized communication framework designed to connect language models to external data sources, functional tools, and interactive user interfaces. It provides a vendor-neutral interface layer that enables AI hosts to discover and execute capabilities across heterogeneous service envir

    TypeScript
  • browser-use/browser-use

    browser-use/browser-use

    78,576GitHubView on GitHub↗

    Browser-use is a framework for building autonomous agents that navigate, interact with, and extract data from web interfaces using natural language instructions. By acting as an orchestration layer between large language models and browser automation protocols, it enables the execution of complex, multi-step workflows

    Pythonai-agentsai-toolsbrowser-automation
  • anuraghazra/github-readme-stats

    anuraghazra/github-readme-stats

    78,445GitHubView on GitHub↗

    This project is a serverless service that generates dynamic, themeable visual summaries of software development activity. It functions as an automated metadata visualizer, transforming raw platform logs and repository metrics into resolution-independent vector graphics that can be embedded directly into markdown enviro

    JavaScriptdynamicprofile-readmereadme-generator
  • junegunn/fzf

    junegunn/fzf

    77,987GitHubView on GitHub↗

    This project is a general-purpose command-line filter that provides an interactive interface for processing standard input streams. It enables real-time fuzzy searching, data selection, and transformation, allowing users to navigate complex information or file systems directly within their terminal. By utilizing a pipe

    Gobashclifish
  • hoppscotch/hoppscotch

    hoppscotch/hoppscotch

    77,888GitHubView on GitHub↗

    Hoppscotch is an open-source API development ecosystem designed for building, testing, and debugging REST, GraphQL, and real-time APIs. It provides a unified platform that functions across web browsers, desktop applications, and command-line interfaces, allowing developers to manage the entire API lifecycle from a sing

    TypeScriptapiapi-clientapi-rest
  • netdata/netdata

    netdata/netdata

    77,812GitHubView on GitHub↗

    Netdata is a distributed observability platform designed for real-time infrastructure monitoring and performance tracking. It functions as a high-frequency agent that collects system, container, and application metrics with per-second precision, providing both local visualization and centralized aggregation across comp

    Caialertingcncf
  • nomic-ai/gpt4all

    nomic-ai/gpt4all

    77,146GitHubView on GitHub↗

    GPT4All is a cross-platform runtime environment designed to execute large language models directly on local consumer hardware. By leveraging an optimized C++ inference backend, it enables private, offline AI interactions without requiring an internet connection or external cloud services. The project provides a compreh

    C++ai-chatllm-inference
  • elastic/elasticsearch

    elastic/elasticsearch

    76,163GitHubView on GitHub↗

    Elasticsearch is a distributed search engine and document store designed for the high-performance indexing and retrieval of massive volumes of unstructured data. It functions as a centralized analytics platform, providing a schema-flexible architecture that organizes information into searchable indices while maintainin

    Javaelasticsearchjavasearch-engine
  • d2l-ai/d2l-zh

    d2l-ai/d2l-zh

    75,708GitHubView on GitHub↗

    This project is an open-source, interactive educational platform designed to teach deep learning through a comprehensive, code-first curriculum. It provides a structured learning path that covers foundational mathematics, modern neural network architectures, and practical optimization techniques, enabling practitioners

    Pythonbookchinesecomputer-vision
  • mlabonne/llm-course

    mlabonne/llm-course

    75,340GitHubView on GitHub↗

    This project is a comprehensive educational curriculum and engineering handbook focused on the lifecycle of large language models. It serves as a structured knowledge base for machine learning practitioners, covering the fundamental mathematical and architectural principles of transformer-based sequence modeling, as we

    courselarge-language-modelsllm
  • Stirling-Tools/Stirling-PDF

    Stirling-Tools/Stirling-PDF

    74,357GitHubView on GitHub↗

    Stirling-PDF is a self-hosted document processing suite designed for secure, private file management. It functions as a comprehensive transformation engine that executes complex operations—such as merging, splitting, converting, and redacting documents—directly on the host machine. The platform provides both a browser-

    TypeScriptdockerhacktoberfestjava
Prev1…345…9Next

Browse tags

  • API Data Management1 sub-tagMechanisms for filtering or selecting specific data fields returned by an application programming interface.
  • API Data Retrieval1 sub-tagTools and logic for managing how large datasets are requested and broken into manageable chunks from remote services.
  • API Layers1 sub-tagMiddleware and frameworks that provide an interface layer between client applications and underlying data sources.
  • Asynchronous Data Handling2 sub-tagsUtilities for managing non-blocking data operations and background tasks to maintain application responsiveness.
Automation Scripting APIs2 sub-tags
Programming interfaces designed to automate the manipulation and management of external system data and user records.
  • Cloud Storage Integrations1 sub-tagConnectors and drivers that enable applications to interact with remote object storage and cloud-based file systems.
  • Community Analytics1 sub-tagTools for measuring and visualizing the activity, engagement, and contributions of members within a community.
  • Community Data Platforms1 sub-tagPlatforms that centralize and synchronize data contributed by multiple users or sources into a unified repository.
  • Data Abstraction Layers5 sub-tagsSoftware layers that provide a unified interface for interacting with diverse storage backends and data structures.
  • Data Access Patterns1 sub-tagMethodologies and low-level techniques for reading from and writing to data storage systems efficiently.
  • Data Access and Querying8 sub-tagsInterfaces, query languages, and abstraction layers used to interact with and retrieve data from storage systems.
  • Data Analysis & Visualization12 sub-tagsThis group focuses on tools and techniques for analyzing, interpreting, and visually representing data.
  • Data Architectures2 sub-tagsStructural designs and organizational patterns for managing, partitioning, and modeling complex data systems.
  • Data Categories1 sub-tagCollections of structured information categorized by specific themes or temporal characteristics.
  • Data Collection2 sub-tagsSystems and automated processes designed to gather, harvest, and ingest information from external sources.
  • Data Collection Infrastructure1 sub-tagScalable frameworks and distributed systems built to support large-scale data gathering and web crawling operations.
  • Data Collections & Datasets12 sub-tagsThis group comprises various types of data collections and datasets, including domain-specific and open data.
  • Data Compression2 sub-tagsAlgorithms and utilities that reduce the size of data for efficient storage and transmission.
  • Data Consistency Models1 sub-tagFrameworks defining how data updates are propagated and synchronized across distributed nodes.
  • Data Containers1 sub-tagFoundational structures and base classes used to encapsulate and organize data for application use.
  • Data Conversion1 sub-tagUtilities for transforming data from one representation or encoding to another.
  • Data Deduplication2 sub-tagsTools that identify and remove redundant information to optimize storage space and data integrity.
  • Data Distribution Patterns1 sub-tagStandardized formats and protocols for sharing and distributing data across different systems and languages.
  • Data Domains1 sub-tagSpecialized datasets focused on specific industry sectors or subject matter areas.
  • Data Engineering and Infrastructure5 sub-tagsFoundational tools for large-scale data collection, ingestion, storage management, and reliability.
  • Data Engines1 sub-tagCore processing engines that manage data storage, retrieval, and synchronization, often optimized for local environments.
  • Data Export1 sub-tagTools for extracting and formatting data from internal systems for external use or archival.
  • Data Export Formats1 sub-tagSpecific file types and schemas used for outputting data, including specialized formats like OCR results.
  • Data Extensions1 sub-tagAdd-ons and plugins that extend the functionality of database systems to support bulk operations.
  • Data Filtering Strategies1 sub-tagLogic and rulesets for excluding or including specific data points based on defined criteria.
  • Data Filtering Utilities1 sub-tagFunctional utilities for processing and refining tabular data or lists based on user-defined filters.
  • Data Formatting1 sub-tagTools that transform raw data into human-readable formats or standardized visual representations.
  • Data Framing1 sub-tagMechanisms for structuring and delimiting data streams to ensure correct parsing during transmission.
  • Data Governance and Modeling6 sub-tagsFrameworks for defining schemas, ensuring standardization, and managing data assets and sovereignty.
  • Data Handling1 sub-tagGeneral-purpose libraries and tools for managing, serializing, and processing data within an application.
  • Data Inspection1 sub-tagUtilities for viewing, debugging, and formatting raw data for easier human analysis.
  • Data Integration & Synchronization12 sub-tagsThis group covers tools and strategies for integrating and synchronizing data across different systems.
  • Data Integration Architectures1 sub-tagFrameworks and patterns for moving and transforming data between disparate systems and storage environments.
  • Data Interoperability1 sub-tagStandards and protocols that enable different software systems to exchange and interpret shared data structures.
  • Data Management11 sub-tagsTools and utilities for maintaining, organizing, protecting, and migrating data throughout its operational lifecycle.
  • Data Management Interfaces1 sub-tagGraphical or programmatic interfaces designed for viewing, editing, and managing tabular data sets.
  • Data Operations1 sub-tagSystems and workflows focused on the routine maintenance and manipulation of individual data records.
  • Data Organization Tools1 sub-tagSoftware designed to categorize, index, and structure information for improved accessibility and retrieval.
  • Data Platforms3 sub-tagsComprehensive environments that provide specialized infrastructure for storing, analyzing, and monitoring specific types of data.
  • Data Preparation1 sub-tagTools that clean, format, and segment raw data to prepare it for downstream analysis or ingestion.
  • Data Processing Extensions1 sub-tagAdd-on components that enhance database functionality by performing specialized data cleaning or refinement tasks.
  • Data Processing Models1 sub-tagArchitectural approaches for processing data streams, such as handling information in discrete packets.
  • Data Processing Patterns1 sub-tagStandardized methods and techniques for converting data structures into formats suitable for storage or transmission.
  • Data Processing Pipelines18 sub-tagsSystems and workflows for ingesting, transforming, and orchestrating high-throughput data processing tasks.
  • Data Processing Services1 sub-tagManaged services that automate the delivery and ingestion of data from external sources.
  • Data Processing Utilities4 sub-tagsLibraries and algorithms used to perform specific data manipulation tasks like deduplication, streaming, or reduction.
  • Data Recovery Tools1 sub-tagSpecialized utilities designed to reconstruct or recover corrupted or misaligned data files.
  • Data Redundancy1 sub-tagTechniques and algorithms that ensure data availability and fault tolerance through redundant storage methods.
  • Data Resources1 sub-tagDatasets and reference materials used to support knowledge discovery and information research.
  • Data Serialization Formats8 sub-tagsLibraries and protocols that define how data is encoded, structured, and serialized for storage or network transport.
  • Data Sharing2 sub-tagsMechanisms that allow controlled access to data sets by sharing specific views or base data structures.
  • Data Stores1 sub-tagStorage systems engineered to maintain strict data consistency across distributed environments.
  • Data Synchronization Engines1 sub-tagEngines that maintain consistency between multiple data sources by propagating changes in real time.
  • Data Templating1 sub-tagTools for defining and applying patterns to format data, particularly for temporal or string-based values.
  • Data Transfer1 sub-tagInfrastructure components designed to move large volumes of data across network boundaries efficiently.
  • Database Access Patterns1 sub-tagStandardized methods for retrieving and iterating through database records using cursors or similar mechanisms.
  • Database Concepts3 sub-tagsFundamental principles and architectural components that define how databases operate, store, and manage data integrity.
  • Database Design Patterns2 sub-tagsBest practices for modeling data structures and enforcing attribute validation within database schemas.
  • Database Extensions1 sub-tagPlugins and add-ons that provide additional functionality or features for specific database management systems.
  • Database Infrastructure2 sub-tagsMiddleware and routing components that manage connections and traffic between applications and database clusters.
  • Database Management Systems8 sub-tagsCore engines, storage architectures, and operational configurations for persistent data management.
  • Database Resources1 sub-tagReference materials and documentation specifically focused on relational database systems.
  • Database Services1 sub-tagManaged cloud-based offerings that provide database hosting, maintenance, and operational support.
  • Dataset Management4 sub-tagsCollections of annotated media and structured data specifically curated for training and evaluating machine learning computer vision models.
  • Enterprise Data Platforms1 sub-tagCentralized systems that provide organizational access to large-scale data repositories and internal information discovery tools.
  • File Processing1 sub-tagTools designed to transform, convert, or manipulate the structure and format of digital files.
  • Geospatial Data & Services9 sub-tagsThis group includes services, tools, and data related to geographical information and location.
  • Graph Computing Systems3 sub-tagsTechnologies for modeling, processing, and analyzing data based on graph theory and relational connections.
  • Processor Utilities1 sub-tagSpecialized software components that perform specific data transformations based on the input media or data type.
  • Public Data APIs1 sub-tagInterfaces that provide programmatic access to publicly available datasets and government or institutional information services.
  • Public Welfare APIs1 sub-tagProgramming interfaces that facilitate access to data regarding social services, community support, and charitable initiatives.
  • SQL Development1 sub-tagSoftware environments and utilities that assist developers in writing, testing, and refining structured query language code.
  • Search and Indexing Technologies3 sub-tagsSpecialized tools for indexing, searching, and retrieving information across diverse data stores.
  • Storage Abstraction3 sub-tagsMiddleware layers that provide a unified interface for interacting with diverse underlying storage backends and hardware.
  • Storage Adapters2 sub-tagsSoftware connectors that enable applications to interface with specific cloud or local storage systems.
  • Storage Architectures2 sub-tagsStructural patterns and methodologies for organizing, indexing, and retrieving data within a storage system.
  • Storage Integrations1 sub-tagTools and utilities that connect storage systems to external authentication, security, or management workflows.
  • Storage Management Tools1 sub-tagAdministrative utilities that allow users to configure, monitor, and maintain storage resources via command-line interfaces.
  • Storage Services2 sub-tagsManaged infrastructure solutions that provide persistent storage capabilities for files and data objects.
  • Text Processing Utilities3 sub-tagsLibraries and tools specifically designed for extracting, inspecting, and manipulating textual data.
  • Vector EmbeddingsAlgorithms and services that convert unstructured data into numerical representations for machine learning applications.
  • Visual Data Management1 sub-tagInterfaces and dashboards designed to visualize, inspect, and manage complex data structures.