9 Repos
Environments for long-term data persistence and historical state tracking.
Distinguishing note: Focuses on the archival aspect rather than active database management.
Explore 9 GitHub repositories matching Data & Databases · Data Archiving Systems.
This project is a comprehensive dataset and archive of classical Chinese poetry, prose, and Confucian classics. It serves as a digital humanities corpus, providing machine-readable access to hundreds of thousands of poems and detailed poet biographies, specifically spanning the Tang and Song dynasties. The collection is distinguished by its scholarly depth, incorporating textual variation annotations to track disputed characters across different source editions. It also includes tonal pattern mapping to describe the rhythmic and phonetic structures of the verse, alongside a popularity ranking
Maintains a curated collection of poems and poets in a structured database for long-term research.
Trilium is a local-first personal knowledge management system designed to store and organize information on a user's own device. It functions as a hierarchical knowledge base where every note acts as both a standalone document and a container for nested children, allowing for complex information relationships and deep categorization. The platform distinguishes itself through a component-based interface that dynamically renders diverse content types, including rich text, diagrams, and executable scripts. All data is persisted within a single relational database file, which supports a versioned
Maintains historical states and recovery options to protect information integrity.
Letta is a framework for building, deploying, and managing autonomous AI agents that maintain persistent state across long-term interactions. It provides a comprehensive suite of primitives for defining agents with configurable personas, modular memory blocks, and tool-use capabilities, enabling them to retain user preferences and conversation history over extended sessions. The platform distinguishes itself through its advanced memory management and orchestration capabilities. It allows agents to autonomously update their own memory, perform retrieval-augmented generation, and coordinate com
Initializes long-term storage containers for persistent agent memory across sessions.
This repository serves as a public archive for the raw datasets and analytical code used to support journalistic reporting. It functions as a platform for reproducible research, providing the necessary materials for users to verify published findings and conduct independent statistical analysis. The collection utilizes a versioned storage model to track historical changes to both data and processing scripts. By organizing information into a structured directory hierarchy, the repository maps specific journalistic projects to their corresponding inputs and outputs, ensuring that the methodolog
Acts as a public repository for raw data files and processing scripts that allow users to reproduce analysis.
CVAT is an open-source, web-based platform designed for annotating images, videos, and 3D point clouds to create high-quality training datasets for machine learning. It functions as a containerized server that orchestrates the entire lifecycle of computer vision data, from initial task creation and manual labeling to quality assurance and final dataset export. The platform distinguishes itself through deep integration with machine learning models, allowing users to deploy custom AI models as serverless functions for automated object detection, tracking, and skeleton annotation. It supports co
Enables the creation of compressed snapshots for long-term project preservation and disaster recovery.
Sui is a blockchain platform featuring an object-centric state model and resource-oriented smart contracts. It utilizes parallel transaction execution to increase network throughput and supports programmable transaction blocks that bundle multiple operations into single atomic units. The platform distinguishes itself with a capability-based access control system and zero-knowledge login mechanisms, enabling users to authenticate via identity providers without seed phrases. It also implements deterministic object addressing to allow predictable state lookups and supports the creation of soulbo
Provides endpoints for accessing long-term archived network data that is no longer available on full nodes.
FUXA is a web-based SCADA system and industrial HMI software for monitoring and controlling industrial hardware. It acts as a gateway for industrial protocols and as a data historian, ingesting telemetry from programmable logic controllers (PLCs) and sensors over standard communication protocols. The project provides a flow-based automation engine for building complex operational logic in a visual node-and-edge environment. It uses SVG-based rendering for its graphical interfaces, so users can develop their own HMI widgets and dashboards that stay synchronized with real-time device telemetry. The system covers a broad range of operational functions, including alarm management with threshold-based alerts, time-series data archiving in relational or time-series databases, and execution of user-defined logic through an integrated JavaScript engine. It also offers tools for configuring data acquisition, exporting and importing project state, and role-based access control for administrative security. FUXA can be deployed as a full-stack application on various operating systems or as a headless binary on embedded devices.
Stores time-stamped process data in databases like SQLite or InfluxDB for long-term industrial analysis.
Starward is a third-party game launcher and management application for miHoYo titles, serving as an alternative to the official distribution software for installing, launching, and managing these games. The project integrates a game account manager for switching between user profiles and regional servers without manual re-authentication. It also includes a gacha pull tracker for archiving historical summoning data and rewards, and a gameplay statistics monitor that records total playtime and activity history. The application additionally provides a centralized media gallery for organizing and viewing captured gameplay screenshots and reward records. Further features include integration of external utility tools and management of multiple software installations through a single interface.
Saves historical gacha pull records from summoning systems into a local archive for future reference.
This project is a community-driven knowledge base that archives historical statements and insights from influential technology leaders. It functions as a reference library designed to preserve the intellectual history of the information technology field and the evolution of computing philosophy. The repository utilizes a static, flat-file data structure formatted in markdown, which eliminates the need for complex database management or server-side runtime environments. By leveraging a version control system, the project maintains a complete audit trail of all additions and editorial changes,
Archives content in a static markdown format to ensure permanent, database-free access to historical data.