awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Data Persistence and Storage · Awesome GitHub Repositories

33 repos

Awesome GitHub RepositoriesData Persistence and Storage

Technologies and architectures dedicated to the durable storage and long-term management of digital information.

Explore 33 awesome GitHub repositories matching data & databases · Data Persistence and Storage. Refine with filters or upvote what's useful.

  1. Home
  2. Data & Databases
  3. Data Engineering and Infrastructure
  4. Data Persistence and Storage

Awesome Data Persistence and Storage GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • vinta/awesome-python

    vinta/awesome-python

    283,687GitHubView on GitHub↗

    This project is a comprehensive, community-curated directory that organizes a vast landscape of Python software libraries, frameworks, and tools. It serves as a centralized knowledge base designed to facilitate ecosystem navigation and accelerate developer discovery across the entire software development lifecycle. Th

    Pythonawesomecollectionspython
  • torvalds/linux

    torvalds/linux

    217,986GitHubView on GitHub↗

    The Linux kernel is a monolithic operating system kernel that serves as the primary interface between computer hardware and software applications. It provides the foundational infrastructure for managing system resources, including memory allocation, process scheduling, and synchronization primitives. The project inclu

    C
  • trimstray/the-book-of-secret-knowledge

    trimstray/the-book-of-secret-knowledge

    206,980GitHubView on GitHub↗

    This project serves as a centralized, community-driven repository of technical knowledge and administrative resources. It provides a structured taxonomy that aggregates disparate information into a searchable framework, supporting continuous learning and rapid problem-solving for system administrators and cybersecurity

    awesomeawesome-listbsd
  • Significant-Gravitas/AutoGPT

    Significant-Gravitas/AutoGPT

    181,891GitHubView on GitHub↗

    AutoGPT is an orchestration platform designed for building, managing, and deploying autonomous agents. It provides a visual canvas-based environment where users can assemble agents by connecting modular blocks that represent actions, data flows, and conditional logic. The platform supports the entire agent lifecycle, i

    Pythonaiartificial-intelligenceautonomous-agents
  • langchain-ai/langchain

    langchain-ai/langchain

    127,015GitHubView on GitHub↗

    LangChain is an orchestration framework designed for building, managing, and deploying applications powered by large language models. It provides a unified integration layer that normalizes disparate model provider APIs into a consistent set of primitives, enabling developers to build complex, multi-step AI workflows t

    Pythonagentsaiai-agents
  • kubernetes/kubernetes

    kubernetes/kubernetes

    120,673GitHubView on GitHub↗

    Kubernetes is a distributed container orchestration platform that automates the deployment, scaling, and management of containerized applications across clusters of computing nodes. It functions as a declarative infrastructure controller, utilizing a control loop architecture that continuously monitors the current syst

    Gocncfcontainersgo
  • excalidraw/excalidraw

    excalidraw/excalidraw

    117,138GitHubView on GitHub↗

    This project is a virtual whiteboard component and vector graphics editor designed for creating diagrams with a hand-drawn aesthetic. It provides a canvas-based drawing engine that can be embedded directly into web applications, allowing users to manipulate shapes, upload images, and export visual data into standard fo

    TypeScriptcanvascollaborationdiagrams
  • pytorch/pytorch

    pytorch/pytorch

    97,601GitHubView on GitHub↗

    PyTorch is a machine learning framework centered on a GPU-ready tensor library that supports multi-dimensional array operations across both CPU and accelerator hardware. It provides a foundational infrastructure for mathematical computation and dynamic neural network construction, utilizing a tape-based automatic diffe

    Pythonautograddeep-learninggpu
  • immich-app/immich

    immich-app/immich

    92,953GitHubView on GitHub↗

    Immich is a self-hosted media management platform designed to provide a centralized, private repository for photos and videos. It functions as a comprehensive system for organizing, backing up, and viewing personal media collections across mobile devices, web browsers, and external storage locations. By maintaining ful

    TypeScriptbackup-toolfluttergoogle-photos
  • ChatGPTNextWeb/NextChat

    ChatGPTNextWeb/NextChat

    87,317GitHubView on GitHub↗

    NextChat is a self-hosted web application that provides a unified interface for interacting with multiple large language models. It functions as a conversational platform where users can manage and switch between diverse AI providers through configurable API backends, maintaining full control over their data and infras

    TypeScriptcalclaudechatgptclaude
  • macrozheng/mall

    macrozheng/mall

    82,926GitHubView on GitHub↗

    This project is an enterprise-grade Java framework designed for building scalable, full-stack e-commerce applications. It provides a comprehensive foundation for microservice-based distributed architectures, enabling the development of complex retail platforms that include product management, order processing, and secu

    Javadockerelasticsearchelk
  • netdata/netdata

    netdata/netdata

    77,812GitHubView on GitHub↗

    Netdata is a distributed observability platform designed for real-time infrastructure monitoring and performance tracking. It functions as a high-frequency agent that collects system, container, and application metrics with per-second precision, providing both local visualization and centralized aggregation across comp

    Caialertingcncf
  • elastic/elasticsearch

    elastic/elasticsearch

    76,163GitHubView on GitHub↗

    Elasticsearch is a distributed search engine and document store designed for the high-performance indexing and retrieval of massive volumes of unstructured data. It functions as a centralized analytics platform, providing a schema-flexible architecture that organizes information into searchable indices while maintainin

    Javaelasticsearchjavasearch-engine
  • redis/redis

    redis/redis

    73,096GitHubView on GitHub↗

    Redis is an in-memory, key-value database designed to provide sub-millisecond latency for read and write operations. It functions as a versatile data platform, serving as a distributed cache, a message broker, a NoSQL document store, and a vector database. The system utilizes an event-driven, single-threaded loop to pr

    Ccachecachingdatabase
  • awesomedata/awesome-public-datasets

    awesomedata/awesome-public-datasets

    72,846GitHubView on GitHub↗

    This project is a community-maintained, open-access directory of high-quality public datasets. It serves as a centralized reference point for researchers, developers, and data scientists to locate reliable information sources across a wide spectrum of industries and scientific fields. By providing a structured index, t

    aaron-swartzawesome-public-datasetsdatasets
  • grafana/grafana

    grafana/grafana

    72,295GitHubView on GitHub↗

    Grafana is an observability data platform designed to aggregate metrics, logs, and traces from diverse sources into a unified environment. It functions as a centralized interface for visualizing complex telemetry data, transforming raw streams into interactive dashboards that support real-time system health tracking an

    TypeScriptalertinganalyticsbusiness-intelligence
  • Eugeny/tabby

    Eugeny/tabby

    68,976GitHubView on GitHub↗

    Tabby is a cross-platform terminal emulator and desktop application suite designed for managing command-line workflows and remote infrastructure. It provides a comprehensive environment for terminal session orchestration, allowing users to organize multiple active sessions through split panes and custom layouts. The ap

    TypeScriptserialssh-clienttelnet-client
  • danielmiessler/SecLists

    danielmiessler/SecLists

    68,943GitHubView on GitHub↗

    SecLists is a comprehensive repository of security testing assets, functioning as a centralized knowledge base and collection of wordlists for professionals conducting vulnerability assessments and penetration testing. It provides a vast array of usernames, passwords, and payloads designed for brute-force and fuzzing a

    PHP
  • OpenHands/OpenHands

    OpenHands/OpenHands

    67,974GitHubView on GitHub↗

    OpenHands is an autonomous agent framework designed for software engineering workflows. It provides a modular platform for orchestrating AI agents that reason, plan, and execute tasks within isolated, containerized development environments. By integrating with standard version control and development tools, the system

    Pythonagentartificial-intelligencechatgpt
  • leonardomso/33-js-concepts

    leonardomso/33-js-concepts

    66,252GitHubView on GitHub↗

    This project is a comprehensive educational repository designed to help developers master the core mechanics, runtime behaviors, and browser-native capabilities of the JavaScript language. It provides a structured knowledge base that covers fundamental language features, such as prototype-based inheritance and event-lo

    JavaScriptangularconceptses6
Prev12Next

Explore sub-tags

  • Data Persistence Management2 sub-tagsSystems that manage the lifecycle and scheduling of data writing operations to ensure reliable storage.
  • Data Persistence Strategies2 sub-tagsApproaches for ensuring data remains available and consistent across system restarts or local storage environments.
  • Data Storage10 sub-tagsComponents and utilities that facilitate the saving, retrieving, and managing of data within an application environment.
Data Storage Architectures
6 sub-tags
Structural designs and patterns that define how data is organized and accessed within a storage system.
  • Data Storage Layers2 sub-tagsSoftware abstractions that provide a dedicated interface for interacting with underlying database or storage systems.
  • Filesystem Abstractions3 sub-tagsComponents focused on low-level file system logic, management, and containerized volume mounting rather than general data storage.
  • Persistence & Durability1 sub-tagMechanisms that ensure data remains intact and accessible over time, even during system failures or interruptions.
  • Specialized Storage Engines3 sub-tagsHigh-performance storage backends optimized for specific data structures like inverted indices or distributed key-value consensus.
  • Storage Command-Line InterfacesCLI tools for managing storage buckets and policies.
  • Storage Solutions2 sub-tagsInfrastructure platforms designed to store large volumes of data, typically in cloud or object-based environments.