19 repositories
Utilities for maintaining database integrity and performance.
Distinguishing note: Focuses on storage optimization and index rebuilding.
Explore 19 GitHub repositories matching Data & Databases · Database Maintenance.
Claude-mem is an agentic memory persistence system designed to provide AI assistants with long-term context across multiple development sessions. It functions as a background orchestrator that captures, summarizes, and indexes interaction history, allowing models to maintain continuity and recall technical decisions from past tasks. By utilizing a vector-augmented context engine, the system injects relevant historical observations into active sessions, ensuring that AI agents remain informed without exceeding finite token budgets. The project distinguishes itself through an endless memory arc
Provides utilities to check integrity, resolve locks, and optimize storage through vacuuming.
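As a rough illustration of the integrity-check and vacuuming steps, here is a minimal sketch that assumes a SQLite-backed store. The listing does not say which database the project uses, and `check_and_vacuum` is a hypothetical helper name, not the project's API. Lock resolution is not covered.

```python
import sqlite3


def check_and_vacuum(db_path: str) -> list[str]:
    """Run SQLite's integrity check and, if it passes, reclaim free pages."""
    conn = sqlite3.connect(db_path)
    try:
        # PRAGMA integrity_check yields the single row "ok" when nothing is wrong,
        # otherwise one row per problem found.
        problems = [row[0] for row in conn.execute("PRAGMA integrity_check")]
        if problems == ["ok"]:
            # VACUUM rewrites the database file and drops unused pages.
            conn.execute("VACUUM")
        return problems
    finally:
        conn.close()
```

Vacuuming only after a clean check avoids rewriting a file that may already be damaged.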
This project is a privacy-focused, self-hosted metasearch engine that aggregates results from a wide array of web, academic, and media sources into a single, unified interface. By acting as a proxy between the user and external search providers, it strips identifying headers and tracking parameters from requests, ensuring that search activity remains anonymous and protected from third-party profiling. The platform distinguishes itself through a modular, plugin-based architecture that allows for extensive customization of search behavior, result filtering, and interface branding. It supports a
Computes and displays hash values for strings to assist in security and file verification tasks.
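Hashing a string for verification can be sketched with Python's standard `hashlib`. The function name `hash_string` and the set of allowed algorithms are assumptions made for illustration, not the project's interface.

```python
import hashlib

# Algorithms accepted by this sketch; an illustrative choice, not the project's list.
SUPPORTED = ("md5", "sha1", "sha256", "sha512")


def hash_string(text: str, algorithm: str = "sha256") -> str:
    """Return the hex digest of `text` (UTF-8 encoded) under the given algorithm."""
    if algorithm not in SUPPORTED:
        raise ValueError(f"unsupported algorithm: {algorithm}")
    return hashlib.new(algorithm, text.encode("utf-8")).hexdigest()
```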
This project is a cross-platform credential management suite designed to store sensitive information in encrypted local databases. It functions as a secure desktop application that provides a unified environment for organizing secrets, generating passwords, and managing multi-factor authentication tokens. By utilizing industry-standard file formats, the application ensures that stored credentials remain secure and interoperable across different operating systems. The software distinguishes itself through deep integration with hardware-backed security and system-level services. It supports phy
Executes administrative operations and security audits through a modular command-line interface.
This project provides a framework for managing multi-agent systems, designed to automate complex software development, infrastructure, and business workflows. It functions as a multi-agent workflow orchestrator that routes tasks to domain-specific workers while maintaining state persistence and infrastructure automation. By leveraging large language models, the system decomposes high-level objectives into actionable plans, ensuring that complex operations are executed with consistency and reliability. The framework distinguishes itself through its hierarchical agent registry and policy-driven
Automates vacuuming and bloat prevention to maintain long-term system health.
This project serves as a comprehensive technical reference and educational platform for the Ethereum ecosystem. It provides a deep dive into the fundamental architecture of decentralized ledger systems, covering the core mechanisms that enable trustless state transitions, cryptographic security, and network consensus. The documentation distinguishes itself by bridging high-level conceptual frameworks with practical implementation details. It details the lifecycle of smart contract development, from source code compilation and bytecode analysis to deployment and interaction patterns. Furthermo
Provides methods for verifying data integrity using cryptographic proofs.
Agent Skills is a centralized registry and management system designed for the discovery, auditing, and integration of reusable procedural modules into automated agent workflows. It provides a structured environment for sourcing verified capabilities that extend the functional range of AI agents, enabling the development and scaling of complex, multi-step automated processes. The platform distinguishes itself through a security-first approach to module integration, utilizing audit-verified data to ensure that capabilities meet safety requirements before they are deployed. It incorporates a dec
Bundles security and performance verification data with procedural modules to ensure safety before execution.
Excelize is a library for reading and writing spreadsheet files in the Office Open XML format. It provides a comprehensive suite of tools for programmatically creating, modifying, and analyzing workbooks, worksheets, and cell data, ensuring compatibility across various office software suites through structured XML serialization. The library distinguishes itself with a built-in formula calculation engine that evaluates complex mathematical and logical expressions directly against workbook data. It also features a memory-mapped streaming architecture, which allows for the efficient processing o
Authenticates DNS records to ensure data integrity and origin during transit.
bbolt is an ACID-compliant embedded key-value store for Go applications. It persists all data in a single memory-mapped file on disk, organizing information using B+ trees to facilitate sorted key iteration and efficient range queries. The project distinguishes itself through a hierarchical data organization model, allowing buckets to be nested within other buckets to create a tree-like structure. It employs a single-writer, multi-reader locking mechanism and copy-on-write transactions to ensure serializable isolation and data integrity. The system includes comprehensive data management capa
Provides a utility to rebuild the page freelist to correctly identify available storage space.
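Conceptually, a freelist can be rebuilt by walking every page reachable from the root and treating all remaining allocated pages as free. The sketch below is a toy model of that idea in Python, not bbolt's Go implementation. The page graph, the `reserved` page ids, and the name `rebuild_freelist` are all hypothetical.

```python
def rebuild_freelist(high_water_mark, root, children, reserved=(0, 1)):
    """Return the sorted ids of pages that are allocated but unreachable.

    high_water_mark: number of pages in the file (ids 0..high_water_mark-1).
    root:            id of the root page of the tree.
    children:        mapping of page id to the ids of its child pages.
    reserved:        ids always in use (for example, metadata pages).
    """
    reachable = set(reserved)
    stack = [root]
    while stack:
        page = stack.pop()
        if page in reachable:
            continue
        reachable.add(page)
        stack.extend(children.get(page, ()))
    # Anything below the high-water mark that was never visited is free.
    return [p for p in range(high_water_mark) if p not in reachable]
```

For example, `rebuild_freelist(8, 3, {3: [4, 5], 5: [7]}, reserved=(0, 1, 2))` reports page 6 as the only free page.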
Soar is a suite of specialized tools designed for analyzing MySQL performance, advising on indexing, and optimizing SQL syntax. It functions as a performance analyzer, index advisor, and query optimizer to identify bottlenecks and suggest structural improvements for faster execution. The project distinguishes itself through a system for rewriting SQL statements into optimized equivalent versions using custom heuristic rules and patterns. It also features a dedicated index advisor that evaluates query patterns and database metadata to recommend the creation of new indexes. Its broader capabil
Consolidates multiple table alteration statements into single operations to maintain performance and reduce downtime.
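The consolidation idea can be sketched as grouping `ALTER TABLE` statements by table and joining their clauses with commas, which MySQL accepts in a single statement. This is a simplified illustration, not Soar's actual rewrite logic. The regular expression handles only plain, unqualified table names, and `merge_alters` is a hypothetical name.

```python
import re

# Matches "ALTER TABLE <name> <clause>[;]" with an optionally backquoted name.
ALTER_RE = re.compile(
    r"^\s*ALTER\s+TABLE\s+(`?\w+`?)\s+(.*?);?\s*$",
    re.IGNORECASE | re.DOTALL,
)


def merge_alters(statements):
    """Merge ALTER TABLE statements that target the same table into one each."""
    grouped = {}  # table name -> list of clauses, in first-seen order
    for stmt in statements:
        match = ALTER_RE.match(stmt)
        if match is None:
            raise ValueError(f"not an ALTER TABLE statement: {stmt!r}")
        table, clause = match.group(1), match.group(2).strip()
        grouped.setdefault(table, []).append(clause)
    return [
        f"ALTER TABLE {table} {', '.join(clauses)};"
        for table, clauses in grouped.items()
    ]
```

Running one combined statement per table means the table is altered once rather than once per change, which is where the downtime saving comes from.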
Noms is a distributed version control database and content-addressable data store. It identifies data by cryptographic hashes to ensure integrity and deduplication, while tracking dataset state changes through a sequence of immutable commits to enable branching, forking, and historical recovery. The system functions as a peer-to-peer data synchronizer, reconciling state between disconnected database instances to ensure all nodes converge on the same data. It distinguishes itself as a schema-flexible document store that supports self-describing types, allowing schemas to evolve and widen as ne
Produces a single unique hash representing the entire database state to verify that data is identical across peers.
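The idea of one hash standing for the whole database state can be illustrated by hashing a canonical encoding of the data, so that two peers holding identical content derive the same digest. This is a simplified stand-in, not Noms' own hashing scheme. The JSON encoding and the name `state_hash` are assumptions for the sketch.

```python
import hashlib
import json


def state_hash(db: dict) -> str:
    """Return one SHA-256 digest over a canonical encoding of the dataset."""
    # Sorting keys and fixing separators makes the encoding independent of
    # insertion order and whitespace, so equal content gives equal bytes.
    canonical = json.dumps(db, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()
```

Two peers can then compare a single short string instead of exchanging their full contents.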
Datahaven is a decentralized storage network and smart contract platform designed for storing verifiable data with cryptographic integrity checks. It functions as a peer-to-peer system that ensures data availability and prevents loss through redundancy and tamper-evident content addressing. The project incorporates a restaking security layer that protects the network from malicious actors by leveraging economic stakes and automated slashing mechanisms. It also features a cross-chain asset bridge that transfers tokens and messages between different blockchain networks using finality proofs. T
Replicates critical data across multiple providers with periodic challenges to verify custody and prevent loss.
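One common way to build a custody challenge is with a Merkle tree: the verifier keeps only a root hash, asks for a randomly chosen chunk, and checks the returned chunk against the root using a short proof. The sketch below shows that general technique. It is not Datahaven's protocol, and every function name in it is hypothetical.

```python
import hashlib


def _h(data: bytes) -> bytes:
    return hashlib.sha256(data).digest()


def merkle_levels(chunks):
    """Build all tree levels, leaves first; the last level holds the root."""
    level = [_h(c) for c in chunks]
    levels = [level]
    while len(level) > 1:
        if len(level) % 2:
            # Pad odd levels by repeating the last node.
            level = level + [level[-1]]
            levels[-1] = level
        level = [_h(level[i] + level[i + 1]) for i in range(0, len(level), 2)]
        levels.append(level)
    return levels


def prove(levels, index):
    """Provider side: collect the sibling hashes on the path to the root."""
    proof = []
    for level in levels[:-1]:
        proof.append(level[index ^ 1])
        index //= 2
    return proof


def verify(root, chunk, index, proof):
    """Verifier side: recompute the root from the chunk and its proof."""
    node = _h(chunk)
    for sibling in proof:
        node = _h(node + sibling) if index % 2 == 0 else _h(sibling + node)
        index //= 2
    return node == root
```

A verifier would pick `index` at random for each periodic challenge, so a provider that has discarded part of the data fails with some probability every round.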
Mox is a self-hosted email server that runs as a single compiled Go binary, handling the full lifecycle of sending and receiving email through SMTP, IMAP4rev2, and a built-in webmail application. It is designed to be operated without external dependencies or runtime plugins, with all mail services — including spam filtering, queue management, and web interfaces for administration and account management — contained in one executable. The server distinguishes itself through automated TLS certificate management via ACME, DNS-based autoconfiguration for email clients, and file-based configuration
Checks the consistency of database files and message files in a data directory, optionally fixing issues.
seL4 is a formally verified microkernel whose C implementation is backed by machine-checked mathematical proofs of correctness, confidentiality, integrity, and availability. It enforces strict isolation between processes through hardware-enforced address space separation and a capability-based access control system, where each process holds explicit rights only to the resources it has been granted. The kernel exposes hardware resources through a minimal API of system calls that manage threads, address spaces, and inter-process communication, with synchronous IPC supporting sender-identifying b
Provides formal proofs that the kernel enforces access control policies and ensures resource availability.
Buildbot is a Python-based continuous integration framework and distributed build orchestrator. It functions as a build automation engine that coordinates the retrieval of source code, the execution of build steps, and the reporting of results through a central controller and a network of remote worker agents. The system is distinguished by a plugin-based extensibility architecture and a master-worker distribution model. It allows for dynamic build modification at runtime and supports a pluggable database backend for persisting system state and historical build data. The project covers a bro
Performs database maintenance tasks, including log optimization and compression, to maintain system health.
rippled is a peer-to-peer node implementation for the XRP Ledger. It serves as a distributed ledger consensus engine and a blockchain API gateway, allowing servers to validate transactions and maintain a shared, immutable sequence of ledger state. The project features a built-in decentralized exchange engine capable of executing atomic cross-currency payments and managing order books. It includes an on-chain asset management system for issuing and controlling fungible tokens and NFTs, as well as a Byzantine Fault Tolerant system for coordinating validators. The platform provides comprehensiv
Runs safety checks separate from transaction processing to validate the internal consistency and health of the ledger state.
Pigsty is a full-stack orchestration suite for deploying, monitoring, and managing high-availability PostgreSQL clusters and their supporting infrastructure. It functions as a cluster management platform and high-availability suite that automates failover, manages virtual IPs, and ensures data consistency through distributed consensus. The project distinguishes itself by providing a comprehensive database infrastructure-as-code framework and a dedicated observability stack. It incorporates a backup and recovery manager supporting point-in-time recovery via S3-compatible object storage, alongs
Provides utilities for reorganizing tables, reclaiming unused space, and verifying relation integrity.
Kanidm is a centralized identity management server designed to handle authentication, authorization, and directory services across distributed infrastructure. It provides a comprehensive framework for managing human and service accounts, utilizing a schema-driven database to store identity records, group memberships, and system attributes. The platform supports a wide range of authentication methods, including passkeys, passwords, and standard protocols like OAuth2, OIDC, LDAP, and RADIUS. The system distinguishes itself through a granular access control engine that enforces security policies
Verifies the consistency of database data, such as referential integrity and membership relationships, to ensure overall data health.
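Orioledb is a cloud-native storage engine designed for PostgreSQL, intended to replace the default storage layer and improve vertical scalability and performance on modern hardware. It acts as an index-organized table store, organizing table rows directly within the primary index to speed up data retrieval. The engine uses an undo log storage system to manage data versioning, which eliminates the need for manual vacuuming and prevents table bloat. It further reduces disk footprint through block-level and page-level data compression. The project provides advanced index management and automated database maintenance features. It includes high-availability recovery through row-level logging, as well as tools for analyzing space utilization and verifying table integrity.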
Includes integrity verifiers that scan indices to detect corruption without blocking concurrent access.
This project provides the formal technical specifications and reference logic for the Ethereum proof-of-stake consensus layer. It defines the standards for block production, state transition rules, and the beacon chain logic required to ensure consistent network agreement. The implementation covers specialized mechanisms for chain security and efficiency, including fork-choice algorithms for canonical chain determination, committee-based signature aggregation, and KZG-based blob commitments for data availability. It also specifies the protocols for light client synchronization using sync comm
Selects and retrieves a subset of data columns at each slot to verify network data availability.