High-Performance Structured Log Databases

Scalable open-source engines designed for rapid ingestion, indexing, and querying of structured log data streams.


SigNoz/signoz
27,355
SigNoz is a full-stack observability platform designed to collect, store, and visualize metrics, logs, and distributed traces in a unified environment. It leverages OpenTelemetry-based data collection to ingest telemetry from diverse sources using vendor-neutral protocols, ensuring interoperability across complex microservices architectures. The platform utilizes a high-performance columnar storage engine to enable rapid aggregation and filtering, providing a centralized backend for monitoring application health and performance. What distinguishes the platform is its focus on automated instru
TypeScript | Observability Platforms | Analytical Databases | Application Performance Monitoring
alibaba/higress
7,558
Higress is an AI API gateway and cloud-native traffic manager that functions as a Kubernetes ingress controller. It provides a centralized system for routing, securing, and optimizing traffic directed toward large language models, AI agents, and microservice architectures. The project distinguishes itself through deep AI orchestration, including the ability to host and manage Model Context Protocol servers that transform REST APIs into tools for AI agents. It features specialized AI infrastructure for model request proxying, protocol translation across multiple providers, and semantic-based c
Go | AI API Gateways | AI Gateway Management | Ingress Controllers
grafana/loki
27,640
Loki is a horizontally scalable, highly available log aggregation engine designed to store and query massive volumes of unstructured log data. It functions as a distributed observability platform that correlates logs, metrics, and traces to provide comprehensive visibility into the health and performance of complex infrastructure. The system distinguishes itself through a distributed query execution model that processes large datasets in parallel across cluster nodes. It utilizes label-based stream indexing and a distributed index to map log data to specific chunks, enabling rapid retrieval w
Go | Distributed Observability Systems | Log Storage Engines | Observability Platforms
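The label-based stream indexing mentioned in the Loki entry can be illustrated with a small sketch. This is not Loki's implementation; the `LabelIndex` class, its method names, and the in-memory layout are all hypothetical, chosen only to show how indexing labels (rather than log content) narrows a query to a few chunks before any line is scanned.

```python
from collections import defaultdict


class LabelIndex:
    """Toy index mapping label pairs to the chunks that hold matching streams.

    Hypothetical illustration only; names and layout are assumptions.
    """

    def __init__(self):
        self._postings = defaultdict(set)  # (label, value) -> chunk IDs
        self._chunks = {}                  # chunk ID -> raw log lines

    def append(self, chunk_id, labels, lines):
        """Store a chunk and register it under each of its stream labels."""
        self._chunks[chunk_id] = list(lines)
        for pair in labels.items():
            self._postings[pair].add(chunk_id)

    def select(self, matchers):
        """Return IDs of chunks whose stream carries every requested label."""
        sets = [self._postings.get(pair, set()) for pair in matchers.items()]
        return set.intersection(*sets) if sets else set(self._chunks)

    def grep(self, matchers, needle):
        """Scan only the selected chunks for lines containing needle."""
        hits = []
        for chunk_id in sorted(self.select(matchers)):
            hits.extend(line for line in self._chunks[chunk_id] if needle in line)
        return hits


index = LabelIndex()
index.append("c1", {"app": "api", "env": "prod"}, ["GET /health 200", "GET /users 500"])
index.append("c2", {"app": "api", "env": "dev"}, ["GET /users 500"])
prod_errors = index.grep({"app": "api", "env": "prod"}, "500")
```

Only the labels are indexed here; the log lines themselves are searched by brute force inside the chunks that the label lookup selects.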
meilisearch/meilisearch
58,118
Meilisearch is a Rust-based search engine providing typo-tolerant full-text and vector-based semantic search with real-time conversational capabilities.
Rust | Developer-Focused Search Tools | Document Indexing Engines | Finite State Transducers
wtfutil/wtf
16,971
This project is a modular, terminal-based dashboard framework designed to aggregate and display real-time information within a grid-aligned interface. It functions as a centralized monitoring tool that translates data from local system resources, infrastructure services, and external web APIs into a unified, text-based display. The dashboard is distinguished by its plugin-based architecture, which allows users to encapsulate distinct data sources and display logic into isolated, independently managed modules. Users define their workspace through declarative configuration files or an interacti
Go | Terminal-Based Dashboards | Infrastructure Monitoring | Plugin-Based Architectures
mastra-ai/mastra
21,221
Mastra is an orchestration framework designed for building, deploying, and managing autonomous AI agents and multi-agent systems. It provides a comprehensive suite of primitives for creating resilient AI applications, including durable workflow orchestration, event-driven agent loops, and semantic memory management. By integrating these core components, the platform enables developers to build complex, multi-step processes that can reason about goals and execute tasks without manual intervention. The framework distinguishes itself through its focus on observability and secure, isolated execut
TypeScript | AI Agent Orchestration Frameworks | AI Agents | AI Application Deployment Platforms
apache/kafka
32,846
Kafka is a distributed event streaming platform designed for capturing, storing, and processing real-time data streams across interconnected nodes. It functions as a distributed commit log, providing a fault-tolerant storage mechanism that records state changes sequentially to ensure data consistency and durability across distributed environments. The platform distinguishes itself through a partitioned commit log architecture that enables horizontal scaling and parallel processing of data streams. It integrates a stream processing engine for continuous transformations and aggregations, while
Java | Distributed Event Streaming Platforms | Distributed Commit Logs | Data Streaming Platforms
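The partitioned commit log described in the Kafka entry can be sketched in a few lines. This is a minimal in-memory illustration, not Kafka's code: the `PartitionedLog` class and its methods are hypothetical, and CRC32 stands in for the hash used to pick a partition. The point is that records with the same key land in the same append-only partition and receive sequential offsets, so separate partitions can be written and read in parallel.

```python
import zlib


class PartitionedLog:
    """Toy append-only log split into partitions (hypothetical sketch)."""

    def __init__(self, num_partitions):
        self._partitions = [[] for _ in range(num_partitions)]

    def append(self, key, value):
        """Append a record and return its (partition, offset) position."""
        # CRC32 is an assumption made for this sketch; any stable hash works.
        partition = zlib.crc32(key.encode("utf-8")) % len(self._partitions)
        log = self._partitions[partition]
        log.append((key, value))
        return partition, len(log) - 1

    def read(self, partition, offset, max_records=10):
        """Read records sequentially from a partition, starting at offset."""
        return self._partitions[partition][offset:offset + max_records]


log = PartitionedLog(num_partitions=3)
p1, o1 = log.append("user-1", "login")
p2, o2 = log.append("user-1", "logout")
replayed = log.read(p1, 0)
```

Because a consumer tracks only a partition number and an offset, it can resume or replay from any position without the log keeping per-consumer state.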
VictoriaMetrics/VictoriaMetrics
16,343
VictoriaMetrics is a high-performance, scalable time series database and observability platform designed for long-term storage and analysis of metric, log, and trace data. It functions as a unified backend for monitoring ecosystems, offering full compatibility with industry-standard protocols and query languages. The system is built to handle massive data volumes through a distributed architecture that supports horizontal scaling and efficient data lifecycle management. The platform distinguishes itself through a storage engine that utilizes consistent hashing for data sharding and log-struct
Go | Time Series Databases | Metric Data Ingestion | Metric Query Languages
grafana/grafana
74,456
Grafana is an observability data platform designed to aggregate metrics, logs, and traces from diverse sources into a unified environment. It functions as a centralized interface for visualizing complex telemetry data, transforming raw streams into interactive dashboards that support real-time system health tracking and performance monitoring. The platform distinguishes itself through a plugin-based modular architecture that integrates disparate databases, cloud services, and monitoring tools via a standardized data abstraction layer. This framework allows for the dynamic loading of external
TypeScript | Observability Data Platforms | Observability Dashboards | Telemetry Collection and Aggregation
karpathy/autoresearch
87,119
Autoresearch is an autonomous machine learning research agent and architecture search framework. It employs a closed-loop system to programmatically rewrite training and architecture source code to discover optimal language model configurations. The system iteratively modifies code and evaluates performance metrics to improve model quality based on a target objective. It optimizes model performance and training efficiency by tracking validation bits per byte, which allows for a fair comparison of architectural changes independently of vocabulary size. The framework manages the full training
Python | Automated Machine Learning | Autonomous Research Agents | Architecture Search Frameworks
uber-go/zap
24,329
Zap is a high-performance structured logging library designed for production environments. It provides a framework for generating machine-readable logs that minimize memory overhead and CPU usage, allowing for efficient event analysis and system monitoring. The library distinguishes itself through a focus on zero-allocation logging, utilizing buffer pooling to reduce garbage collection pressure during high-frequency operations. It enforces strict data typing through compile-time checks and structured field encoding, which ensures consistent output without the performance cost of reflection-ba
Go | Logging Libraries | Structured Logging Frameworks | High-Performance Logging
dotnet/core
21,897
This project is a cross-platform development framework and managed runtime environment designed for building high-performance applications. It provides a comprehensive toolkit for constructing web services, cloud-native microservices, and desktop applications, utilizing a unified runtime that handles memory management and execution across diverse operating systems. The framework distinguishes itself through a native ahead-of-time compilation toolchain that transforms source code into optimized, self-contained machine code binaries. This capability enables fast startup times and reduced memory
PowerShell | Build-Time Compilers | Cross-Platform Desktop Development | Cross-Platform Development Frameworks
ClickHouse/ClickHouse
48,229
ClickHouse is a high-performance, columnar analytical database designed for real-time query execution and large-scale data aggregation. It functions as a distributed data warehouse capable of processing petabytes of information, while also providing an embedded engine that integrates directly into applications for native query capabilities without external dependencies. The system is built to handle high-throughput ingestion and complex analytical workloads, delivering millisecond-level latency for interactive dashboards and operational monitoring. The platform distinguishes itself through ad
C++ | Access Control Systems | Agent Analytics | Agentic Architectures
go-kratos/kratos
25,743
Kratos is a toolkit for building cloud-native microservices in Go. It provides a comprehensive suite of framework primitives, including a dedicated toolset for API-first development using Protobuf to generate server and client code for gRPC and HTTP. The project is distinguished by its pluggable service infrastructure, which allows for the swapping of configuration stores, service registries, and data encoding formats. It utilizes a composable middleware pipeline to inject cross-cutting concerns such as authentication, request validation, and circuit breaking into the service flow. The frame
Go | Cloud Native Development Frameworks | API Code Generators | API-First Design Workflows
BurntSushi/ripgrep
65,112
ripgrep is a command-line utility designed for searching through large file trees and source code repositories. It functions as a recursive text processor that traverses directories to locate and display matching patterns, serving as a high-performance alternative to traditional search tools. The tool distinguishes itself through a focus on execution speed and intelligent file handling. It utilizes a finite automata-based regular expression engine to ensure linear time complexity and employs hardware-level acceleration for literal byte sequence scanning. By integrating with version control sy
Rust | Command Line Search Utilities | Search Utilities | Text Search Tools
modelcontextprotocol/typescript-sdk
12,674
This project provides a TypeScript software development kit for the Model Context Protocol, a standard designed to facilitate bidirectional communication between AI applications and external data sources or tools. It serves as a foundational framework for building both clients and servers, enabling language models to interact with external systems through a unified, decoupled interface. The SDK distinguishes itself by implementing a transport-agnostic connection layer that supports both local standard input-output streams and remote HTTP endpoints. It utilizes a JSON-RPC message bus to manage
TypeScript | Model Context Protocol | AI Agent Tool Integrations | Language Model Orchestration
elastic/elasticsearch
77,012
Elasticsearch is a distributed search engine and document store designed for the high-performance indexing and retrieval of massive volumes of unstructured data. It functions as a centralized analytics platform, providing a schema-flexible architecture that organizes information into searchable indices while maintaining global cluster state through a distributed consensus mechanism. The platform distinguishes itself through its integrated approach to observability, security, and advanced analytics. It combines full-text, vector, and hybrid search capabilities with machine learning-driven insi
Java | Distributed Search Engines | Data Analytics Engines | Distributed Document Stores
go-kit/kit
27,430
Kit is a microservices architectural framework and toolkit for Go. It provides a set of standardized primitives and abstractions for implementing service, endpoint, and transport layers in a decoupled manner. The framework focuses on system instrumentation through integrated distributed request tracing and a service instrumentation toolkit that utilizes counters and gauges to export performance data to external monitoring backends. It includes a structured logging library that records system events as key-value pairs to ensure compatibility with log aggregation tools. The project covers a br
Go | Microservice Architectures | Distributed Tracing | Instrumentation Libraries
typesense/typesense
25,254
Typesense is a distributed search engine designed to provide sub-millisecond query latency across massive datasets. It functions as both a high-performance indexing and retrieval engine and a comprehensive search experience platform, offering built-in typo tolerance and tools for managing relevance through synonym configuration, result curation, and complex filtering. The platform distinguishes itself by utilizing in-memory indexing to maintain high-throughput data retrieval and integrating vector database capabilities to support semantic similarity searches. It ensures data consistency and h
C++ | Distributed Search Engines | Search Engines | Search Experience Platforms
pinojs/pino
17,941
Pino is a high-performance logging library for Node.js applications designed to minimize overhead and prevent blocking the main event loop. It generates machine-readable logs using newline-delimited JSON, facilitating efficient ingestion and analysis by external monitoring and log aggregation platforms. The library distinguishes itself by offloading log processing and formatting to worker threads, ensuring that heavy logging tasks do not impact application responsiveness. It also provides a decoupled command-line utility that transforms structured production logs into human-readable text, sim
JavaScript | Logging Libraries | Node.js Development Utilities | Asynchronous Logging
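The newline-delimited JSON format mentioned in the Pino entry is simple enough to sketch. The snippet below is written in Python rather than with Pino itself, and the helper names (`log_event`, `read_events`) and the field names in each record are hypothetical, not Pino's actual output schema. It shows the one property that makes the format easy to ingest: each record is a single JSON object on its own line.

```python
import io
import json
import time


def log_event(stream, level, msg, **fields):
    """Write one structured record as a single JSON line."""
    # Field names here are assumptions for illustration only.
    record = {"level": level, "time": int(time.time() * 1000), "msg": msg, **fields}
    # json.dumps escapes embedded newlines, so each record stays on one line.
    stream.write(json.dumps(record, separators=(",", ":")) + "\n")


def read_events(stream):
    """Parse a newline-delimited JSON stream back into dictionaries."""
    return [json.loads(line) for line in stream if line.strip()]


buf = io.StringIO()
log_event(buf, "info", "server started", port=8080)
log_event(buf, "error", "request failed\nretrying", status=500)
buf.seek(0)
events = read_events(buf)
```

A downstream collector can split the stream on newlines and parse each line independently, without buffering the whole file.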
prometheus/prometheus
64,569
Prometheus is a comprehensive monitoring and alerting platform designed to track infrastructure health and application performance. It functions as a time series database that ingests, indexes, and queries high-frequency numerical data points. By utilizing a pull-based model, the system periodically collects multi-dimensional metrics from monitored targets, storing them in an optimized block storage format that supports high-throughput ingestion and efficient historical analysis. The platform distinguishes itself through a specialized query engine that enables real-time analysis of performanc
Go | Operational Health and Alerting | Time Series | Time-Series Block Storage
GoogleContainerTools/skaffold
15,856
Skaffold is a command-line tool that automates the build, push, and deployment lifecycle for containerized applications on Kubernetes. It functions as a continuous development engine, monitoring source code for changes to trigger incremental updates, manifest hydration, and automated deployments to a cluster. By abstracting the underlying build and deployment tools, it provides a unified interface for managing the inner development loop. The platform distinguishes itself through its environment-aware configuration and flexible build orchestration. It supports diverse build strategies, includi
Go | Inner Loop Development Tools | Continuous Deployment | Continuous Development Engines
duckdb/duckdb
38,805
DuckDB is an in-process analytical database engine designed to run directly within an application process. As a zero-dependency, embedded system, it provides enterprise-grade SQL data processing capabilities without the overhead of managing a dedicated database server. It is built to handle complex analytical and aggregation tasks by storing and retrieving information in columns, allowing for high-performance relational data manipulation. The engine distinguishes itself through a columnar vectorized execution model that maximizes CPU cache efficiency during query operations. It employs adapti
C++ | Analytical Databases | Columnar Engines | Embedded Databases
addyosmani/agent-skills
60,849
Agent-skills is a collection of structured instructions and behavioral personas designed to standardize how AI coding agents perform engineering tasks. It functions as a workflow orchestrator that maps natural language intent to repeatable technical sequences and verification checklists. The project distinguishes itself through the use of specialized markdown-defined roles, such as security auditors or test engineers, to apply targeted domain expertise. It employs an evidence-based verification model that requires runtime data or passing tests as mandatory exit criteria to ensure AI-generated
Shell | AI Agent Orchestrators | Engineering Standardizations | Engineering Workflow Automation
influxdata/influxdb
31,556
InfluxDB is a specialized time series database platform engineered for the high-speed ingestion, compression, and retrieval of timestamped data at scale. It functions as a distributed metrics platform, providing the infrastructure necessary to organize and analyze massive volumes of time-stamped information to identify trends, patterns, and anomalies within complex data streams. The platform distinguishes itself through a functional dataflow engine that utilizes a specialized programming language for complex analytical transformations and automated tasks. This architecture is supported by a p
Rust | Time Series Databases | Domain Specific Languages | Data Ingestion Plugins
aws-powertools/powertools-lambda-python
3,267
AWS Powertools for Python is a utility framework designed for building production-ready Python functions on AWS Lambda. It provides a comprehensive suite of tools for observability, event parsing, routing, and idempotency management to streamline the development of serverless applications. The project distinguishes itself through specialized capabilities for event-driven architectures and AI agent orchestration. It enables the implementation of AI agents by exposing functions as tools via OpenAPI schemas and managing conversation states. Additionally, it features an idempotency library that p
Python | Serverless Function Development | Agent-Integrated Functions | API Request Handling
jqlang/jq
34,901
This project is a command-line processor designed for the parsing, filtering, and transformation of structured data streams. It functions as a declarative programming environment that treats data as immutable streams, allowing users to perform complex structural modifications through the composition of small, reusable functions. By utilizing a recursive tree traversal engine, the system enables the navigation, inspection, and modification of deeply nested hierarchical data structures. The engine distinguishes itself through a stream-oriented architecture that processes input records one by on
C | Command-Line Data Processors | Data Transformation Languages | Stream Filters
Delgan/loguru
23,964
Loguru is a Python logging library and thread-safe framework designed for recording system events and diagnostic messages. It functions as a structured logging tool that can serialize messages into JSON strings with metadata for automated parsing and analysis. The library includes a specialized exception tracker that captures unhandled crashes across main and background threads, rendering detailed stack traces that include local variable values. It further distinguishes itself through a unified routing pipeline that can intercept messages from the standard library logging module and dispatch
Python | Application Logging | Application Event Loggers | Application Log Routing
pola-rs/polars
38,855
Polars is a high-performance columnar data processing library designed for efficient analytical workflows. It functions as a structured data library that organizes information into typed columns, utilizing the Apache Arrow memory format to enable zero-copy data sharing and cache-friendly, vectorized operations. The engine is built to handle large-scale tabular datasets, providing both local and distributed analytical runtimes that scale from single-machine environments to multi-node clusters. The project distinguishes itself through a sophisticated lazy query engine that constructs abstract e
Rust | Analytical Data Engines | Columnar Data Processors | Distributed Query Engines
chroma-core/chroma
26,198
Chroma is a specialized vector database designed to index and retrieve high-dimensional data representations for semantic similarity search. It functions as a comprehensive platform for information retrieval, enabling the storage and management of unstructured documents alongside structured metadata. By mapping data into numerical representations, the system facilitates rapid similarity lookups across large datasets. The platform distinguishes itself through a hybrid search infrastructure that combines dense vector embeddings with sparse keyword and regular expression matching to balance sema
Rust | Vector Databases | Hybrid Search Engines | Vector Search
