awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Data & Databases · Awesome GitHub Repositories

168 repos

Awesome GitHub RepositoriesData & Databases

This category covers data storage, management, processing, analysis, and various database technologies and their operations.

Explore 168 awesome GitHub repositories matching data & databases · Data & Databases. Refine with filters or upvote what's useful.

  1. Home
  2. Data & Databases

Awesome Data & Databases GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • OpenHands/OpenHands

    OpenHands/OpenHands

    67,974GitHubView on GitHub↗

    OpenHands is an autonomous agent framework designed for software engineering workflows. It provides a modular platform for orchestrating AI agents that reason, plan, and execute tasks within isolated, containerized development environments. By integrating with standard version control and development tools, the system

    Pythonagentartificial-intelligencechatgpt
  • chartjs/Chart.js

    chartjs/Chart.js

    67,174GitHubView on GitHub↗

    Chart.js is a declarative data visualization framework that renders interactive, responsive charts directly onto an HTML5 canvas element. It functions as a configuration-driven engine, transforming structured datasets into complex graphical representations by merging user-defined settings with global defaults. The libr

    JavaScriptcanvaschartgraph
  • leonardomso/33-js-concepts

    leonardomso/33-js-concepts

    66,252GitHubView on GitHub↗

    This project is a comprehensive educational repository designed to help developers master the core mechanics, runtime behaviors, and browser-native capabilities of the JavaScript language. It provides a structured knowledge base that covers fundamental language features, such as prototype-based inheritance and event-lo

    JavaScriptangularconceptses6
  • scikit-learn/scikit-learn

    scikit-learn/scikit-learn

    65,178GitHubView on GitHub↗

    Scikit-learn is a machine learning library for predictive data analysis that provides a collection of algorithms for supervised and unsupervised learning. It functions as a comprehensive toolkit for data preprocessing, dimensionality reduction, and model selection, allowing users to classify data objects, predict conti

    Pythondata-analysisdata-sciencemachine-learning
  • sindresorhus/awesome-nodejs

    sindresorhus/awesome-nodejs

    65,038GitHubView on GitHub↗

    This project is a community-driven directory that aggregates essential software projects and educational content for the Node.js ecosystem. It functions as a centralized knowledge base and discovery index, designed to simplify the navigation of a fragmented technical landscape by providing a structured collection of hi

    awesomeawesome-listjavascript
  • localstack/localstack

    localstack/localstack

    64,423GitHubView on GitHub↗

    LocalStack is an infrastructure development environment that provides a local simulation of cloud services. By leveraging container-orchestrated service lifecycles, it allows developers to build, test, and debug cloud-native applications on their local machines without requiring remote connectivity or incurring cloud p

    Pythonawscloudcontinuous-integration
  • keras-team/keras

    keras-team/keras

    63,858GitHubView on GitHub↗

    Keras is a high-level deep learning framework designed for constructing and training neural networks through the composition of modular, functional layers. It serves as a comprehensive modeling toolkit that provides standardized procedures for defining, evaluating, and deploying complex architectures. By utilizing a di

    Pythondata-sciencedeep-learningjax
  • toeverything/AFFiNE

    toeverything/AFFiNE

    63,081GitHubView on GitHub↗

    AFFiNE is a collaborative knowledge base and productivity suite designed as a private-first, local-first platform. It provides an integrated workspace that combines structured documents with an infinite digital canvas, allowing users to organize complex information through a block-based model. By prioritizing local dat

    TypeScriptappcrdteditor
  • socketio/socket.io

    socketio/socket.io

    62,883GitHubView on GitHub↗

    Socket.io is a real-time communication engine that enables bidirectional, event-based data exchange between clients and servers. It provides a robust transport-agnostic protocol layer that automatically manages connection lifecycles, including heartbeat signals, automatic reconnection, and seamless fallback between Web

    TypeScriptjavascriptnodejssocket-io
  • prometheus/prometheus

    prometheus/prometheus

    62,853GitHubView on GitHub↗

    Prometheus is a comprehensive monitoring and alerting platform designed to track infrastructure health and application performance. It functions as a time series database that ingests, indexes, and queries high-frequency numerical data points. By utilizing a pull-based model, the system periodically collects multi-dime

    Goalertinggraphinghacktoberfest
  • openinterpreter/open-interpreter

    openinterpreter/open-interpreter

    62,257GitHubView on GitHub↗

    Open Interpreter is an autonomous agent runtime that translates natural language instructions into executable code to interact with local software and operating systems. It functions as an orchestration framework that connects language models to a secure execution environment, enabling the development of agents capable

    Pythonchatgptgpt-4interpreter
  • nocodb/nocodb

    nocodb/nocodb

    62,131GitHubView on GitHub↗

    NocoDB is a visual platform that transforms relational databases into collaborative, spreadsheet-style workspaces. By acting as a headless database backend, it provides a unified environment for designing database structures, managing record relationships, and interacting with data without requiring manual SQL queries.

    TypeScriptairtableairtable-alternativeautomatic-api
  • tldr-pages/tldr

    tldr-pages/tldr

    61,318GitHubView on GitHub↗

    This project is a community-driven repository of simplified, example-based reference guides for command-line tools. It functions as an open-source knowledge base designed to provide concise, practical usage examples that help developers navigate complex software without the need for lengthy technical documentation. Th

    Markdownandroidbsdcheatsheet
  • xingshaocheng/architect-awesome

    xingshaocheng/architect-awesome

    60,831GitHubView on GitHub↗

    This project serves as a comprehensive knowledge base and reference for distributed systems engineering and enterprise software architecture. It provides a structured collection of technical resources, design patterns, and methodologies intended to assist in the design, maintenance, and scaling of complex, high-perform

  • OpenBB-finance/OpenBB

    OpenBB-finance/OpenBB

    60,502GitHubView on GitHub↗

    OpenBB is a financial data platform and investment research terminal designed to aggregate, normalize, and distribute market data across analytical workflows. It functions as a comprehensive ecosystem that bridges disparate financial data providers with custom applications, spreadsheets, and internal modeling infrastru

    Pythonaicryptoderivatives
  • unclecode/crawl4ai

    unclecode/crawl4ai

    60,452GitHubView on GitHub↗

    Crawl4AI is an AI-powered web crawling and data extraction engine designed to transform complex web content into structured formats. It functions as a headless browser orchestrator, enabling the navigation of dynamic websites, the execution of custom scripts, and the capture of visual assets like screenshots and PDFs.

    Python
  • minio/minio

    minio/minio

    60,346GitHubView on GitHub↗

    MinIO is a software-defined, cloud-native object storage server designed to manage large volumes of unstructured data. It functions as a distributed storage cluster that aggregates multiple independent nodes into a unified, scalable pool, providing a high-performance infrastructure compatible with standard cloud storag

    Goamazon-s3cloudcloudnative
  • adam-p/markdown-here

    adam-p/markdown-here

    60,151GitHubView on GitHub↗

    Markdown Here is a browser extension that enables rich text composition within web-based editors that lack native formatting support. By transforming plain text markdown syntax into rendered HTML, it allows users to draft professional emails and documents using standard markup, including headers, tables, and footnotes,

    JavaScript
  • BurntSushi/ripgrep

    BurntSushi/ripgrep

    60,093GitHubView on GitHub↗

    ripgrep is a command-line utility designed for searching through large file trees and source code repositories. It functions as a recursive text processor that traverses directories to locate and display matching patterns, serving as a high-performance alternative to traditional search tools. The tool distinguishes it

    Rustclicommand-linecommand-line-tool
  • scrapy/scrapy

    scrapy/scrapy

    59,824GitHubView on GitHub↗

    Scrapy is a comprehensive framework designed for automated web data extraction and large-scale crawling. It operates on an asynchronous, event-driven engine that manages non-blocking network requests and data processing tasks, allowing for the efficient retrieval of structured information from web documents using path-

    Pythoncrawlercrawlingframework
Prev1…567…9Next

Browse tags

  • API Data Management1 sub-tagMechanisms for filtering or selecting specific data fields returned by an application programming interface.
  • API Data Retrieval1 sub-tagTools and logic for managing how large datasets are requested and broken into manageable chunks from remote services.
  • API Layers1 sub-tagMiddleware and frameworks that provide an interface layer between client applications and underlying data sources.
  • Asynchronous Data Handling2 sub-tagsUtilities for managing non-blocking data operations and background tasks to maintain application responsiveness.
Automation Scripting APIs2 sub-tags
Programming interfaces designed to automate the manipulation and management of external system data and user records.
  • Cloud Storage Integrations1 sub-tagConnectors and drivers that enable applications to interact with remote object storage and cloud-based file systems.
  • Community Analytics1 sub-tagTools for measuring and visualizing the activity, engagement, and contributions of members within a community.
  • Community Data Platforms1 sub-tagPlatforms that centralize and synchronize data contributed by multiple users or sources into a unified repository.
  • Data Abstraction Layers5 sub-tagsSoftware layers that provide a unified interface for interacting with diverse storage backends and data structures.
  • Data Access Patterns1 sub-tagMethodologies and low-level techniques for reading from and writing to data storage systems efficiently.
  • Data Access and Querying8 sub-tagsInterfaces, query languages, and abstraction layers used to interact with and retrieve data from storage systems.
  • Data Analysis & Visualization12 sub-tagsThis group focuses on tools and techniques for analyzing, interpreting, and visually representing data.
  • Data Architectures2 sub-tagsStructural designs and organizational patterns for managing, partitioning, and modeling complex data systems.
  • Data Categories1 sub-tagCollections of structured information categorized by specific themes or temporal characteristics.
  • Data Collection2 sub-tagsSystems and automated processes designed to gather, harvest, and ingest information from external sources.
  • Data Collection Infrastructure1 sub-tagScalable frameworks and distributed systems built to support large-scale data gathering and web crawling operations.
  • Data Collections & Datasets12 sub-tagsThis group comprises various types of data collections and datasets, including domain-specific and open data.
  • Data Compression2 sub-tagsAlgorithms and utilities that reduce the size of data for efficient storage and transmission.
  • Data Consistency Models1 sub-tagFrameworks defining how data updates are propagated and synchronized across distributed nodes.
  • Data Containers1 sub-tagFoundational structures and base classes used to encapsulate and organize data for application use.
  • Data Conversion1 sub-tagUtilities for transforming data from one representation or encoding to another.
  • Data Deduplication2 sub-tagsTools that identify and remove redundant information to optimize storage space and data integrity.
  • Data Distribution Patterns1 sub-tagStandardized formats and protocols for sharing and distributing data across different systems and languages.
  • Data Domains1 sub-tagSpecialized datasets focused on specific industry sectors or subject matter areas.
  • Data Engineering and Infrastructure5 sub-tagsFoundational tools for large-scale data collection, ingestion, storage management, and reliability.
  • Data Engines1 sub-tagCore processing engines that manage data storage, retrieval, and synchronization, often optimized for local environments.
  • Data Export1 sub-tagTools for extracting and formatting data from internal systems for external use or archival.
  • Data Export Formats1 sub-tagSpecific file types and schemas used for outputting data, including specialized formats like OCR results.
  • Data Extensions1 sub-tagAdd-ons and plugins that extend the functionality of database systems to support bulk operations.
  • Data Filtering Strategies1 sub-tagLogic and rulesets for excluding or including specific data points based on defined criteria.
  • Data Filtering Utilities1 sub-tagFunctional utilities for processing and refining tabular data or lists based on user-defined filters.
  • Data Formatting1 sub-tagTools that transform raw data into human-readable formats or standardized visual representations.
  • Data Framing1 sub-tagMechanisms for structuring and delimiting data streams to ensure correct parsing during transmission.
  • Data Governance and Modeling6 sub-tagsFrameworks for defining schemas, ensuring standardization, and managing data assets and sovereignty.
  • Data Handling1 sub-tagGeneral-purpose libraries and tools for managing, serializing, and processing data within an application.
  • Data Inspection1 sub-tagUtilities for viewing, debugging, and formatting raw data for easier human analysis.
  • Data Integration & Synchronization12 sub-tagsThis group covers tools and strategies for integrating and synchronizing data across different systems.
  • Data Integration Architectures1 sub-tagFrameworks and patterns for moving and transforming data between disparate systems and storage environments.
  • Data Interoperability1 sub-tagStandards and protocols that enable different software systems to exchange and interpret shared data structures.
  • Data Management11 sub-tagsTools and utilities for maintaining, organizing, protecting, and migrating data throughout its operational lifecycle.
  • Data Management Interfaces1 sub-tagGraphical or programmatic interfaces designed for viewing, editing, and managing tabular data sets.
  • Data Operations1 sub-tagSystems and workflows focused on the routine maintenance and manipulation of individual data records.
  • Data Organization Tools1 sub-tagSoftware designed to categorize, index, and structure information for improved accessibility and retrieval.
  • Data Platforms3 sub-tagsComprehensive environments that provide specialized infrastructure for storing, analyzing, and monitoring specific types of data.
  • Data Preparation1 sub-tagTools that clean, format, and segment raw data to prepare it for downstream analysis or ingestion.
  • Data Processing Extensions1 sub-tagAdd-on components that enhance database functionality by performing specialized data cleaning or refinement tasks.
  • Data Processing Models1 sub-tagArchitectural approaches for processing data streams, such as handling information in discrete packets.
  • Data Processing Patterns1 sub-tagStandardized methods and techniques for converting data structures into formats suitable for storage or transmission.
  • Data Processing Pipelines18 sub-tagsSystems and workflows for ingesting, transforming, and orchestrating high-throughput data processing tasks.
  • Data Processing Services1 sub-tagManaged services that automate the delivery and ingestion of data from external sources.
  • Data Processing Utilities4 sub-tagsLibraries and algorithms used to perform specific data manipulation tasks like deduplication, streaming, or reduction.
  • Data Recovery Tools1 sub-tagSpecialized utilities designed to reconstruct or recover corrupted or misaligned data files.
  • Data Redundancy1 sub-tagTechniques and algorithms that ensure data availability and fault tolerance through redundant storage methods.
  • Data Resources1 sub-tagDatasets and reference materials used to support knowledge discovery and information research.
  • Data Serialization Formats8 sub-tagsLibraries and protocols that define how data is encoded, structured, and serialized for storage or network transport.
  • Data Sharing2 sub-tagsMechanisms that allow controlled access to data sets by sharing specific views or base data structures.
  • Data Stores1 sub-tagStorage systems engineered to maintain strict data consistency across distributed environments.
  • Data Synchronization Engines1 sub-tagEngines that maintain consistency between multiple data sources by propagating changes in real time.
  • Data Templating1 sub-tagTools for defining and applying patterns to format data, particularly for temporal or string-based values.
  • Data Transfer1 sub-tagInfrastructure components designed to move large volumes of data across network boundaries efficiently.
  • Database Access Patterns1 sub-tagStandardized methods for retrieving and iterating through database records using cursors or similar mechanisms.
  • Database Concepts3 sub-tagsFundamental principles and architectural components that define how databases operate, store, and manage data integrity.
  • Database Design Patterns2 sub-tagsBest practices for modeling data structures and enforcing attribute validation within database schemas.
  • Database Extensions1 sub-tagPlugins and add-ons that provide additional functionality or features for specific database management systems.
  • Database Infrastructure2 sub-tagsMiddleware and routing components that manage connections and traffic between applications and database clusters.
  • Database Management Systems8 sub-tagsCore engines, storage architectures, and operational configurations for persistent data management.
  • Database Resources1 sub-tagReference materials and documentation specifically focused on relational database systems.
  • Database Services1 sub-tagManaged cloud-based offerings that provide database hosting, maintenance, and operational support.
  • Dataset Management4 sub-tagsCollections of annotated media and structured data specifically curated for training and evaluating machine learning computer vision models.
  • Enterprise Data Platforms1 sub-tagCentralized systems that provide organizational access to large-scale data repositories and internal information discovery tools.
  • File Processing1 sub-tagTools designed to transform, convert, or manipulate the structure and format of digital files.
  • Geospatial Data & Services9 sub-tagsThis group includes services, tools, and data related to geographical information and location.
  • Graph Computing Systems3 sub-tagsTechnologies for modeling, processing, and analyzing data based on graph theory and relational connections.
  • Processor Utilities1 sub-tagSpecialized software components that perform specific data transformations based on the input media or data type.
  • Public Data APIs1 sub-tagInterfaces that provide programmatic access to publicly available datasets and government or institutional information services.
  • Public Welfare APIs1 sub-tagProgramming interfaces that facilitate access to data regarding social services, community support, and charitable initiatives.
  • SQL Development1 sub-tagSoftware environments and utilities that assist developers in writing, testing, and refining structured query language code.
  • Search and Indexing Technologies3 sub-tagsSpecialized tools for indexing, searching, and retrieving information across diverse data stores.
  • Storage Abstraction3 sub-tagsMiddleware layers that provide a unified interface for interacting with diverse underlying storage backends and hardware.
  • Storage Adapters2 sub-tagsSoftware connectors that enable applications to interface with specific cloud or local storage systems.
  • Storage Architectures2 sub-tagsStructural patterns and methodologies for organizing, indexing, and retrieving data within a storage system.
  • Storage Integrations1 sub-tagTools and utilities that connect storage systems to external authentication, security, or management workflows.
  • Storage Management Tools1 sub-tagAdministrative utilities that allow users to configure, monitor, and maintain storage resources via command-line interfaces.
  • Storage Services2 sub-tagsManaged infrastructure solutions that provide persistent storage capabilities for files and data objects.
  • Text Processing Utilities3 sub-tagsLibraries and tools specifically designed for extracting, inspecting, and manipulating textual data.
  • Vector EmbeddingsAlgorithms and services that convert unstructured data into numerical representations for machine learning applications.
  • Visual Data Management1 sub-tagInterfaces and dashboards designed to visualize, inspect, and manage complex data structures.