awesome-repositories.com
© 2026 Bringes Technology SRL · VAT RO45896025 · hello@bringes.io
Data & Databases · Awesome GitHub Repositories

168 repos

This category covers data storage, management, processing, analysis, and various database technologies and their operations.

Explore 168 awesome GitHub repositories in the Data & Databases category.

Awesome Data & Databases GitHub Repositories

  • pathwaycom/pathway

    59,684 · GitHub

    Pathway is a high-performance data processing framework designed for building unified batch and streaming pipelines. It functions as an orchestrator for complex data transformations, utilizing a differential dataflow engine to process updates incrementally. By treating static datasets and continuous event streams with

    Python · batch-processing · data-analytics · data-pipelines
  • nuxt/nuxt

    59,659 · GitHub

    Nuxt is a universal web framework designed for building full-stack applications that seamlessly transition between server-side rendering and client-side interactivity. It provides a comprehensive development environment that automates routing, dependency injection, and type generation, allowing developers to focus on a

    TypeScript · csr · framework · full-stack
  • jgraph/drawio-desktop

    59,481 · GitHub

    This project is a cross-platform desktop application designed for creating, editing, and managing structured diagrams and technical workflows. It provides a visual modeling environment that allows users to construct complex charts through a drag-and-drop interface, supporting the documentation of processes, software ar

    JavaScript · diagram-editor · electron-app · graphics
  • CorentinJ/Real-Time-Voice-Cloning

    59,355 · GitHub

    This project is a neural text-to-speech engine and voice cloning toolkit designed to generate synthetic speech that mimics the vocal characteristics of a target speaker. It functions as a real-time audio synthesizer, utilizing a deep learning pipeline to convert written text into high-fidelity speech output with minima

    Python · deep-learning · python · pytorch
  • git/git

    59,192 · GitHub

    Git is a distributed version control system and command-line tool designed for tracking changes in source code and coordinating collaborative software development. It functions as a content-addressable storage platform where project data is maintained as immutable objects indexed by cryptographic hashes, ensuring data

    C · c · hacktoberfest · shell
  • Solido/awesome-flutter

    59,015 · GitHub

    This project is a community-curated directory of resources, libraries, and tools designed to support developers working with the Flutter framework. It functions as a centralized knowledge base, organizing high-quality external references into a structured, human-readable format to assist in the discovery of technical m

    Dart · android · awesome · awesome-list
  • angular/angular.js

    58,970 · GitHub

    AngularJS is a structural framework for building dynamic web applications by extending standard HTML with custom tags and attributes. It operates as a client-side template engine that transforms declarative markup into interactive components, organizing application logic through a model-view-controller pattern. By util

    JavaScript
  • PlexPt/awesome-chatgpt-prompts-zh

    58,347 · GitHub

    This project is a community-driven library of structured text inputs designed to guide large language models into specific roles, behaviors, and operational modes. It functions as a comprehensive repository of prompt engineering resources, providing reusable templates that allow users to override default model tendenci

    chat-gpt · chatgpt · chatgpt3
  • rails/rails

    58,297 · GitHub

    This project is a full-stack web framework designed for building database-backed applications through a standardized architectural pattern. It provides a comprehensive suite of integrated libraries that manage the entire request-response lifecycle, from routing incoming web traffic to rendering dynamic server-side temp

    Ruby · activejob · activerecord · framework
  • sharkdp/bat

    57,298 · GitHub

    This project is a command-line text viewer designed to enhance terminal output through automatic syntax highlighting and integrated file management. It functions as a replacement for standard system pagers, providing a readable interface for large text streams, source code, and markup files by applying color-coded form

    Rust · cli · command-line · git
  • FFmpeg/FFmpeg

    57,281 · GitHub

    FFmpeg is a cross-platform framework and multimedia processing suite designed for the manipulation, transcoding, and streaming of audio and video data. It functions as a comprehensive collection of command-line tools and low-level libraries that provide high-performance encoding and decoding capabilities for a wide ran

    C · audio · c · ffmpeg
  • zylon-ai/private-gpt

    57,116 · GitHub

    This project is a privacy-first backend service designed to facilitate retrieval-augmented generation by processing local documents into searchable vector representations. It provides a modular architecture that allows users to ingest diverse file formats, manage document metadata, and perform semantic searches to prov

    Python
  • usememos/memos

    57,067 · GitHub

    Memos is a self-hosted, container-native knowledge management platform designed for capturing and organizing personal notes. It functions as a private workspace where users can create content using markdown, tags, and media embeds to streamline daily productivity. The system is built to be deployed as a portable servic

    Go · docker · foss · go
  • pmndrs/zustand

    57,057 · GitHub

    Zustand is a state management library that provides a centralized store for managing shared application data. It functions as a reactive container that connects application state to components, allowing them to subscribe to specific slices of data and trigger updates automatically. By utilizing selector-based data acce

    TypeScript · hacktoberfest · hooks · react
  • withastro/astro

    56,962 · GitHub

    Astro is a content-driven web framework designed for building multi-page applications that prioritize performance by shipping minimal JavaScript to the browser. It functions as a static site generator and server-side rendering engine, transforming source files into optimized HTML documents. By utilizing an island archi

    TypeScript · astro · blog · browser
  • soimort/you-get

    56,737 · GitHub

    This project is a command-line utility designed to fetch video, audio, and image content from a wide range of web platforms. It functions by parsing page metadata and utilizing modular, site-specific scripts to extract direct media stream URLs from complex web structures, enabling the local archiving of digital media f

    Python
  • pathwaycom/llm-app

    56,311 · GitHub

    This project is a data processing engine and AI application platform designed for building production-grade machine learning workflows. It provides a unified programming model that handles both historical batch data and live stream ingestion, enabling the development of real-time ETL pipelines and scalable data transfo

    Jupyter Notebook · chatbot · hugging-face · llm
  • remix-run/react-router

    56,250 · GitHub

    React Router is a navigation and data-loading framework that maps URL patterns to nested component hierarchies. It functions as a full-stack router, coordinating server-side resource fetching with client-side hydration to synchronize application state across different environments. By providing a declarative interface

    TypeScript
  • pocketbase/pocketbase

    56,221 · GitHub

    Pocketbase is a backend-as-a-service platform that provides a self-contained, single-binary server for building full-stack applications. It integrates a relational database, authentication, and file storage into one executable process, eliminating the need for external infrastructure or complex server management. The

    Go · authentication · backend · golang
  • meilisearch/meilisearch

    55,992 · GitHub

    Meilisearch is a Rust-based search engine providing typo-tolerant full-text and vector-based semantic search with real-time conversational capabilities.

    Rust · ai · api · app-search

Browse tags

  • API Data Management (1 sub-tag): Mechanisms for filtering or selecting specific data fields returned by an application programming interface.
  • API Data Retrieval (1 sub-tag): Tools and logic for managing how large datasets are requested and broken into manageable chunks from remote services.
  • API Layers (1 sub-tag): Middleware and frameworks that provide an interface layer between client applications and underlying data sources.
  • Asynchronous Data Handling (2 sub-tags): Utilities for managing non-blocking data operations and background tasks to maintain application responsiveness.
  • Automation Scripting APIs (2 sub-tags): Programming interfaces designed to automate the manipulation and management of external system data and user records.
  • Cloud Storage Integrations (1 sub-tag): Connectors and drivers that enable applications to interact with remote object storage and cloud-based file systems.
  • Community Analytics (1 sub-tag): Tools for measuring and visualizing the activity, engagement, and contributions of members within a community.
  • Community Data Platforms (1 sub-tag): Platforms that centralize and synchronize data contributed by multiple users or sources into a unified repository.
  • Data Abstraction Layers (5 sub-tags): Software layers that provide a unified interface for interacting with diverse storage backends and data structures.
  • Data Access Patterns (1 sub-tag): Methodologies and low-level techniques for reading from and writing to data storage systems efficiently.
  • Data Access and Querying (8 sub-tags): Interfaces, query languages, and abstraction layers used to interact with and retrieve data from storage systems.
  • Data Analysis & Visualization (12 sub-tags): This group focuses on tools and techniques for analyzing, interpreting, and visually representing data.
  • Data Architectures (2 sub-tags): Structural designs and organizational patterns for managing, partitioning, and modeling complex data systems.
  • Data Categories (1 sub-tag): Collections of structured information categorized by specific themes or temporal characteristics.
  • Data Collection (2 sub-tags): Systems and automated processes designed to gather, harvest, and ingest information from external sources.
  • Data Collection Infrastructure (1 sub-tag): Scalable frameworks and distributed systems built to support large-scale data gathering and web crawling operations.
  • Data Collections & Datasets (12 sub-tags): This group comprises various types of data collections and datasets, including domain-specific and open data.
  • Data Compression (2 sub-tags): Algorithms and utilities that reduce the size of data for efficient storage and transmission.
  • Data Consistency Models (1 sub-tag): Frameworks defining how data updates are propagated and synchronized across distributed nodes.
  • Data Containers (1 sub-tag): Foundational structures and base classes used to encapsulate and organize data for application use.
  • Data Conversion (1 sub-tag): Utilities for transforming data from one representation or encoding to another.
  • Data Deduplication (2 sub-tags): Tools that identify and remove redundant information to optimize storage space and data integrity.
  • Data Distribution Patterns (1 sub-tag): Standardized formats and protocols for sharing and distributing data across different systems and languages.
  • Data Domains (1 sub-tag): Specialized datasets focused on specific industry sectors or subject matter areas.
  • Data Engineering and Infrastructure (5 sub-tags): Foundational tools for large-scale data collection, ingestion, storage management, and reliability.
  • Data Engines (1 sub-tag): Core processing engines that manage data storage, retrieval, and synchronization, often optimized for local environments.
  • Data Export (1 sub-tag): Tools for extracting and formatting data from internal systems for external use or archival.
  • Data Export Formats (1 sub-tag): Specific file types and schemas used for outputting data, including specialized formats like OCR results.
  • Data Extensions (1 sub-tag): Add-ons and plugins that extend the functionality of database systems to support bulk operations.
  • Data Filtering Strategies (1 sub-tag): Logic and rulesets for excluding or including specific data points based on defined criteria.
  • Data Filtering Utilities (1 sub-tag): Functional utilities for processing and refining tabular data or lists based on user-defined filters.
  • Data Formatting (1 sub-tag): Tools that transform raw data into human-readable formats or standardized visual representations.
  • Data Framing (1 sub-tag): Mechanisms for structuring and delimiting data streams to ensure correct parsing during transmission.
  • Data Governance and Modeling (6 sub-tags): Frameworks for defining schemas, ensuring standardization, and managing data assets and sovereignty.
  • Data Handling (1 sub-tag): General-purpose libraries and tools for managing, serializing, and processing data within an application.
  • Data Inspection (1 sub-tag): Utilities for viewing, debugging, and formatting raw data for easier human analysis.
  • Data Integration & Synchronization (12 sub-tags): This group covers tools and strategies for integrating and synchronizing data across different systems.
  • Data Integration Architectures (1 sub-tag): Frameworks and patterns for moving and transforming data between disparate systems and storage environments.
  • Data Interoperability (1 sub-tag): Standards and protocols that enable different software systems to exchange and interpret shared data structures.
  • Data Management (11 sub-tags): Tools and utilities for maintaining, organizing, protecting, and migrating data throughout its operational lifecycle.
  • Data Management Interfaces (1 sub-tag): Graphical or programmatic interfaces designed for viewing, editing, and managing tabular data sets.
  • Data Operations (1 sub-tag): Systems and workflows focused on the routine maintenance and manipulation of individual data records.
  • Data Organization Tools (1 sub-tag): Software designed to categorize, index, and structure information for improved accessibility and retrieval.
  • Data Platforms (3 sub-tags): Comprehensive environments that provide specialized infrastructure for storing, analyzing, and monitoring specific types of data.
  • Data Preparation (1 sub-tag): Tools that clean, format, and segment raw data to prepare it for downstream analysis or ingestion.
  • Data Processing Extensions (1 sub-tag): Add-on components that enhance database functionality by performing specialized data cleaning or refinement tasks.
  • Data Processing Models (1 sub-tag): Architectural approaches for processing data streams, such as handling information in discrete packets.
  • Data Processing Patterns (1 sub-tag): Standardized methods and techniques for converting data structures into formats suitable for storage or transmission.
  • Data Processing Pipelines (18 sub-tags): Systems and workflows for ingesting, transforming, and orchestrating high-throughput data processing tasks.
  • Data Processing Services (1 sub-tag): Managed services that automate the delivery and ingestion of data from external sources.
  • Data Processing Utilities (4 sub-tags): Libraries and algorithms used to perform specific data manipulation tasks like deduplication, streaming, or reduction.
  • Data Recovery Tools (1 sub-tag): Specialized utilities designed to reconstruct or recover corrupted or misaligned data files.
  • Data Redundancy (1 sub-tag): Techniques and algorithms that ensure data availability and fault tolerance through redundant storage methods.
  • Data Resources (1 sub-tag): Datasets and reference materials used to support knowledge discovery and information research.
  • Data Serialization Formats (8 sub-tags): Libraries and protocols that define how data is encoded, structured, and serialized for storage or network transport.
  • Data Sharing (2 sub-tags): Mechanisms that allow controlled access to data sets by sharing specific views or base data structures.
  • Data Stores (1 sub-tag): Storage systems engineered to maintain strict data consistency across distributed environments.
  • Data Synchronization Engines (1 sub-tag): Engines that maintain consistency between multiple data sources by propagating changes in real time.
  • Data Templating (1 sub-tag): Tools for defining and applying patterns to format data, particularly for temporal or string-based values.
  • Data Transfer (1 sub-tag): Infrastructure components designed to move large volumes of data across network boundaries efficiently.
  • Database Access Patterns (1 sub-tag): Standardized methods for retrieving and iterating through database records using cursors or similar mechanisms.
  • Database Concepts (3 sub-tags): Fundamental principles and architectural components that define how databases operate, store, and manage data integrity.
  • Database Design Patterns (2 sub-tags): Best practices for modeling data structures and enforcing attribute validation within database schemas.
  • Database Extensions (1 sub-tag): Plugins and add-ons that provide additional functionality or features for specific database management systems.
  • Database Infrastructure (2 sub-tags): Middleware and routing components that manage connections and traffic between applications and database clusters.
  • Database Management Systems (8 sub-tags): Core engines, storage architectures, and operational configurations for persistent data management.
  • Database Resources (1 sub-tag): Reference materials and documentation specifically focused on relational database systems.
  • Database Services (1 sub-tag): Managed cloud-based offerings that provide database hosting, maintenance, and operational support.
  • Dataset Management (4 sub-tags): Collections of annotated media and structured data specifically curated for training and evaluating machine learning computer vision models.
  • Enterprise Data Platforms (1 sub-tag): Centralized systems that provide organizational access to large-scale data repositories and internal information discovery tools.
  • File Processing (1 sub-tag): Tools designed to transform, convert, or manipulate the structure and format of digital files.
  • Geospatial Data & Services (9 sub-tags): This group includes services, tools, and data related to geographical information and location.
  • Graph Computing Systems (3 sub-tags): Technologies for modeling, processing, and analyzing data based on graph theory and relational connections.
  • Processor Utilities (1 sub-tag): Specialized software components that perform specific data transformations based on the input media or data type.
  • Public Data APIs (1 sub-tag): Interfaces that provide programmatic access to publicly available datasets and government or institutional information services.
  • Public Welfare APIs (1 sub-tag): Programming interfaces that facilitate access to data regarding social services, community support, and charitable initiatives.
  • SQL Development (1 sub-tag): Software environments and utilities that assist developers in writing, testing, and refining structured query language code.
  • Search and Indexing Technologies (3 sub-tags): Specialized tools for indexing, searching, and retrieving information across diverse data stores.
  • Storage Abstraction (3 sub-tags): Middleware layers that provide a unified interface for interacting with diverse underlying storage backends and hardware.
  • Storage Adapters (2 sub-tags): Software connectors that enable applications to interface with specific cloud or local storage systems.
  • Storage Architectures (2 sub-tags): Structural patterns and methodologies for organizing, indexing, and retrieving data within a storage system.
  • Storage Integrations (1 sub-tag): Tools and utilities that connect storage systems to external authentication, security, or management workflows.
  • Storage Management Tools (1 sub-tag): Administrative utilities that allow users to configure, monitor, and maintain storage resources via command-line interfaces.
  • Storage Services (2 sub-tags): Managed infrastructure solutions that provide persistent storage capabilities for files and data objects.
  • Text Processing Utilities (3 sub-tags): Libraries and tools specifically designed for extracting, inspecting, and manipulating textual data.
  • Vector Embeddings: Algorithms and services that convert unstructured data into numerical representations for machine learning applications.
  • Visual Data Management (1 sub-tag): Interfaces and dashboards designed to visualize, inspect, and manage complex data structures.