Why is doocs/advanced-java a recommended Materialized Views GitHub Repositories repository?

Constructs read-optimized data projections by aggregating events from multiple services into materialized views.

Why is karanpratapsingh/system-design a recommended Materialized Views GitHub Repositories repository?

Describes the use of pre-computed data sets to optimize query performance.

Why is vonng/ddia a recommended Materialized Views GitHub Repositories repository?

Maintains pre-computed views to provide near-instant responses for frequently accessed aggregate metrics.

Why is timescale/timescaledb a recommended Materialized Views GitHub Repositories repository?

Provides incremental continuous aggregates that refresh materialized views by processing only new or changed data.

Why is letta-ai/letta a recommended Materialized Views GitHub Repositories repository?

Clears associated information by deleting specific data archives.

Why is questdb/questdb a recommended Materialized Views GitHub Repositories repository?

Refreshes downstream aggregates and time-series metrics incrementally as new data arrives to ensure up-to-date analytical insights.

Why is prestodb/presto a recommended Materialized Views GitHub Repositories repository?

Persists query results to disk to accelerate data retrieval and improve read performance.

Why is cachethq/cachet a recommended Materialized Views GitHub Repositories repository?

Provides functionality to delete historical metric data points to maintain accurate performance records.

Why is dbt-labs/dbt-core a recommended Materialized Views GitHub Repositories repository?

Determines how data models are physically stored in the warehouse to balance performance and build efficiency.

Why is starrocks/starrocks a recommended Materialized Views GitHub Repositories repository?

Provides automated materialized views that pre-calculate result sets to accelerate query response times.

26 dépôts

Awesome GitHub RepositoriesMaterialized Views

Pre-computed data sets derived from queries to accelerate read performance.

Distinguishing note: Focuses on the architectural use of cached query results, distinct from standard database views.

Explore 26 awesome GitHub repositories matching data & databases · Materialized Views. Refine with filters or upvote what's useful.

Trouvez les meilleurs dépôts grâce à l'IA.Nous recherchons les dépôts les plus pertinents grâce à l'IA.

doocs/advanced-java
doocs/advanced-java
78,987Voir sur GitHub
This project is a comprehensive Java backend engineering guide and technical reference focused on high-concurrency design, distributed systems, and microservices architecture. It provides detailed strategies for decomposing monolithic applications, managing service discovery, and implementing the architectural patterns required for scalable backend environments. The repository distinguishes itself through an extensive collection of big data algorithmic references and database scaling strategies. It covers memory-efficient techniques for analyzing massive datasets, such as Top-K element extrac
Constructs read-optimized data projections by aggregating events from multiple services into materialized views.
Javaadvanced-javadistributed-search-enginedistributed-systems
Voir sur GitHub78,987
karanpratapsingh/system-design
karanpratapsingh/system-design
44,051Voir sur GitHub
This project is a comprehensive educational resource focused on the principles, patterns, and trade-offs required to design scalable, reliable, and high-performance distributed systems. It provides a structured curriculum that covers the fundamental architectural strategies necessary for building modern software infrastructure, ranging from high-level system decomposition to low-level networking and data management. The repository distinguishes itself by offering deep dives into complex architectural patterns, such as microservices-based decomposition, event-driven communication, and command-
Describes the use of pre-computed data sets to optimize query performance.
architecturedistributed-systemsengineering
Voir sur GitHub44,051
vonng/ddia
Vonng/ddia
22,648Voir sur GitHub
This project serves as a comprehensive technical reference for the architecture and design of data-intensive applications. It provides a structured analysis of the fundamental principles required to build reliable, scalable, and maintainable software systems, covering the core trade-offs inherent in modern data infrastructure. The repository explores the mechanics of distributed data management, including strategies for replication, partitioning, and achieving consensus across multiple nodes. It details the design of storage engines, indexing techniques, and transaction management models, whi
Maintains pre-computed views to provide near-instant responses for frequently accessed aggregate metrics.
Pythonbookdatabaseddia
Voir sur GitHub22,648
timescale/timescaledb
timescale/timescaledb
21,876Voir sur GitHub
TimescaleDB is an open-source PostgreSQL extension that adds native time-series capabilities to the database. At its core, it transforms standard PostgreSQL tables into hypertables—automatically partitioned by time intervals—so data is stored in fixed-size chunks without manual sharding. The extension includes a library of over 200 built-in SQL functions purpose-built for time-series workloads, such as time bucketing, gap filling, percentile estimation, and time-weighted averages. What distinguishes TimescaleDB from generic PostgreSQL is its set of integrated time-series features that work th
Provides incremental continuous aggregates that refresh materialized views by processing only new or changed data.
Canalyticsdatabasefinancial-analysis
Voir sur GitHub21,876
letta-ai/letta
letta-ai/letta
21,168Voir sur GitHub
Letta is a framework for building, deploying, and managing autonomous AI agents that maintain persistent state across long-term interactions. It provides a comprehensive suite of primitives for defining agents with configurable personas, modular memory blocks, and tool-use capabilities, enabling them to retain user preferences and conversation history over extended sessions. The platform distinguishes itself through its advanced memory management and orchestration capabilities. It allows agents to autonomously update their own memory, perform retrieval-augmented generation, and coordinate com
Clears associated information by deleting specific data archives.
Pythonaiai-agentsllm
Voir sur GitHub21,168
questdb/questdb
questdb/questdb
17,062Voir sur GitHub
QuestDB is a high-performance, distributed time-series database designed for the ingestion, storage, and analysis of massive datasets. It functions as a real-time analytics platform that utilizes a columnar storage engine to optimize disk input and output, enabling efficient analytical scans and complex windowing operations on streaming data. The platform distinguishes itself through specialized capabilities for handling asynchronous time-series streams, including advanced join algorithms that align disparate data sets based on precise timestamp lookups. It supports high-volume ingestion thro
Refreshes downstream aggregates and time-series metrics incrementally as new data arrives to ensure up-to-date analytical insights.
Javacapital-marketscppdatabase
Voir sur GitHub17,062
prestodb/presto
prestodb/presto
16,711Voir sur GitHub
Presto is a distributed SQL query engine designed for high-performance analytical processing across heterogeneous data sources. It functions as a data federation platform and massively parallel processing engine, allowing users to execute interactive queries against diverse storage systems without requiring data migration. By mapping remote metadata and structures to a unified relational namespace, it enables seamless cross-platform analysis through a standard SQL interface. The engine distinguishes itself through a pluggable connector architecture and a shared-nothing distributed processing
Persists query results to disk to accelerate data retrieval and improve read performance.
Javabig-datadatahadoop
Voir sur GitHub16,711
cachethq/cachet
cachethq/cachet
14,932Voir sur GitHub
Cachet is a self-hosted, open-source status page system designed to communicate service uptime, incident history, and infrastructure performance to end users. It provides a centralized dashboard for managing the operational lifecycle of system components, tracking service disruptions, and scheduling maintenance windows. The platform distinguishes itself through a comprehensive RESTful API that enables programmatic status page management and automated incident reporting. It supports deep integration with external monitoring tools, allowing for the synchronization of performance metrics and the
Provides functionality to delete historical metric data points to maintain accurate performance records.
PHPcachetlaravelphp
Voir sur GitHub14,932
dbt-labs/dbt-core
dbt-labs/dbt-core
13,051Voir sur GitHub
dbt-core is a command-line framework for transforming data within a warehouse using modular SQL and version control. It functions as a data transformation engine that enables users to define data structures and business logic through declarative configuration files, which the system then compiles into executable code. By managing complex data dependencies through a directed acyclic graph, it ensures that transformation tasks execute in the correct order while maintaining a manifest-driven state to track lineage and execution history. The project distinguishes itself through an adapter-based d
Determines how data models are physically stored in the warehouse to balance performance and build efficiency.
Rustanalyticsbusiness-intelligencedata-modeling
Voir sur GitHub13,051
starrocks/starrocks
StarRocks/starrocks
11,789Voir sur GitHub
StarRocks is a distributed SQL OLAP database engine designed for real-time analytics and high-performance multi-dimensional analysis. It functions as a data lakehouse query engine that enables SQL execution across large datasets and external open table formats without requiring local data imports. The system employs a shared-nothing distributed architecture and utilizes the MySQL protocol to integrate with business intelligence tools. It maintains real-time data consistency through a primary key upsert model and accelerates query response times using vectorized execution and cost-based optimi
Provides automated materialized views that pre-calculate result sets to accelerate query response times.
Javaanalyticsbig-datacloudnative
Voir sur GitHub11,789
electric-sql/electric
electric-sql/electric
9,909Voir sur GitHub
Electric is a Postgres data synchronization engine and replication proxy designed to enable local-first software. It replicates data from Postgres databases to client-side stores in real time using logical replication, allowing applications to maintain a local embedded database for offline access and low-latency updates. The system distinguishes itself by using shapes to filter and authorize specific subsets of database rows and columns before streaming them to clients or edge workers. It further supports multi-user collaboration by integrating a conflict-free replicated data type framework t
Transforms a stream of logical database operations into a local data structure to maintain current state.
Elixircrdtcrdtselixir
Voir sur GitHub9,909
risingwavelabs/risingwave
risingwavelabs/risingwave
9,093Voir sur GitHub
RisingWave is a cloud-native streaming database and real-time analytics engine that uses standard SQL to process continuous data streams. It functions as a streaming data lakehouse, combining the capabilities of a streaming SQL database with a platform that integrates streaming ingestion with open table formats. The system is distinguished by its use of the PostgreSQL wire protocol, allowing it to integrate with existing SQL tools and drivers. It employs a decoupled compute and storage architecture, persisting streaming state and materialized views in cloud object storage to enable independen
Builds and maintains real-time materialized views of streaming data with support for cascading dependencies.
Rustapache-icebergdata-engineeringdatabase
Voir sur GitHub9,093
lancedb/lancedb
lancedb/lancedb
9,031Voir sur GitHub
LanceDB is a vector database and columnar data store designed to function as a versioned dataset manager and vector search engine. It serves as a high-performance backend for indexing and retrieving high-dimensional embeddings, providing the foundation for machine learning data pipelines. The system distinguishes itself through a combination of cloud-native object storage and immutable version tracking, allowing for data time-travel and reproducible AI experiments. It integrates hybrid search capabilities, merging dense vector similarity with BM25 full-text search and SQL-like scalar filters
Creates persisted views that transform source data and store derived embeddings for high-performance search.
HTMLapproximate-nearest-neighbor-searchimage-searchnearest-neighbor-search
Voir sur GitHub9,031
growthbook/growthbook
growthbook/growthbook
7,351Voir sur GitHub
GrowthBook is a feature flagging and experimentation platform that utilizes a warehouse-native approach to data analysis. It serves as a system for managing feature rollouts and conducting A/B tests by executing SQL queries directly against existing data warehouses to calculate experiment results. The platform is distinguished by its integration of a Model Context Protocol server, which allows AI coding assistants and IDEs to manage flags and query analytics using natural language. It also provides specialized capabilities for AI model optimization, enabling the testing of prompts and models
Implements incremental data refreshes for intermediate results to reduce warehouse query costs.
TypeScriptab-testingabtestabtesting
Voir sur GitHub7,351
ibis-project/ibis
ibis-project/ibis
6,574Voir sur GitHub
Ibis is a portable Python dataframe library and multi-backend query engine that provides a unified interface for executing data transformations across diverse compute engines. It functions as a Python SQL expression compiler and dialect transpiler, allowing users to define data logic once and execute it across cloud warehouses, embedded databases, and distributed clusters without rewriting code. The project distinguishes itself through a database backend abstraction that decouples transformation logic from the underlying execution engine. It enables polyglot data workflows by mixing raw SQL s
Builds pre-computed data sets from queries that can be incrementally updated for high-performance access.
Pythonbigqueryclickhousedatabase
Voir sur GitHub6,574
maciek-roboblog/claude-code-usage-monitor
Maciek-roboblog/Claude-Code-Usage-Monitor
6,617Voir sur GitHub
Transforms raw session data into real-time, daily, and monthly summaries through map-reduce operations.
Pythonaianalyticsclaude
Voir sur GitHub6,617
jlfwong/speedscope
jlfwong/speedscope
6,501Voir sur GitHub
Speedscope is a web-based performance profiler that visualizes profiling data through interactive flamegraphs and timeline views. It ingests performance profiles from a wide range of sources, including Chrome, Firefox, Safari, Node.js, .NET Core, Instruments, Hermes, GHC, and Ruby, normalizing them into a common schema for unified analysis. The tool distinguishes itself with a canvas-based rendering engine that draws flamegraphs without DOM nodes for each frame, and a WebAssembly-based rendering pipeline for high-performance drawing. It offers left-heavy stack sorting to surface the most time
Drags and drops a processed profile into Speedscope to view it with sourcemaps applied.
TypeScriptflamegraphflamegraphsperformance-profiling
Voir sur GitHub6,501
materializeinc/materialize
MaterializeInc/materialize
6,314Voir sur GitHub
Materialize is a streaming SQL database that continuously ingests live data from sources such as Kafka, Redpanda, PostgreSQL, and MySQL, and incrementally maintains materialized views. It provides a PostgreSQL-compatible query engine that accepts standard SQL over the PostgreSQL wire protocol, enabling any existing SQL client or BI tool to query real-time data. The system also includes a Model Context Protocol (MCP) server that exposes live materialized view data to AI agents, providing fresh context without polling. Materialize distinguishes itself through its ability to offer configurable c
Updates materialized views by processing only changed rows rather than full dataset recalculation.
Rust
Voir sur GitHub6,314
dimitri/pgloader
dimitri/pgloader
6,295Voir sur GitHub
pgloader is a command-line tool that automates the migration of data and schema from various source databases and file formats into PostgreSQL. It combines schema discovery, parallel data pipelines, and type casting into a single, declarative workflow, using PostgreSQL's COPY protocol for high-throughput bulk loading. The tool distinguishes itself by compiling a dedicated command language into concurrent reader-writer pipelines that handle schema introspection, data transformation, and error-resilient batch processing. It supports migrating entire databases from MySQL, MS SQL, SQLite, and Pos
Converts MS SQL views into PostgreSQL tables during migration.
Common Lispclozure-clcommon-lispcsv
Voir sur GitHub6,295
apache/pinot
apache/pinot
6,098Voir sur GitHub
Pinot is a distributed, columnar analytical database designed for high-concurrency, low-latency query processing. It functions as a real-time OLAP datastore, enabling interactive, user-facing analytics by ingesting and querying massive datasets from both streaming and batch sources. The system architecture relies on a centralized controller for cluster coordination and a distributed segment-based storage model to ensure horizontal scalability. The platform distinguishes itself through a hybrid ingestion pipeline that unifies real-time event streams and historical batch data into a single quer
Provides administrative interfaces to create, inspect, and remove materialized views.
Java
Voir sur GitHub6,098

Awesome Materialized Views GitHub Repositories

doocs/advanced-java

karanpratapsingh/system-design

Vonng/ddia

timescale/timescaledb

letta-ai/letta

questdb/questdb

prestodb/presto

cachethq/cachet

dbt-labs/dbt-core

StarRocks/starrocks

electric-sql/electric

risingwavelabs/risingwave

lancedb/lancedb

growthbook/growthbook

ibis-project/ibis

Maciek-roboblog/Claude-Code-Usage-Monitor

jlfwong/speedscope

MaterializeInc/materialize

dimitri/pgloader

apache/pinot

Explorer les sous-tags