8 dépôts
Techniques for improving database execution plans, such as injecting literal values into SQL statements.
Distinct from Deterministic Value Injection: The candidates focus on language literals or test injection, not database-specific execution plan optimization.
Explore 8 awesome GitHub repositories matching data & databases · Query Plan Optimizations. Refine with filters or upvote what's useful.
Dapper is a high-performance micro-ORM and SQL object mapper for .NET. It functions as an ADO.NET extension library that adds data mapping capabilities directly to database connections, allowing SQL query results to be transformed into typed objects. The project prioritizes execution speed and low memory overhead by using intermediate language generation to map database columns to object properties. It further optimizes performance through the use of concurrent caching for mapping functions and literal value injection to improve database execution plans. The library covers a broad range of d
Allows embedding numeric or boolean constants directly into queries to improve database index efficiency.
Apache DataFusion is an extensible, columnar SQL query engine that runs embedded within a host application without requiring a separate server process. It processes data in columnar batches using Apache Arrow for memory-efficient analytics, and can scale analytic workloads across multiple nodes for parallel execution. The engine supports both SQL and DataFrame queries through a modular, streaming architecture that allows custom operators, data sources, functions, and optimizer rules. The engine distinguishes itself through its modular extension framework, which enables building custom query e
Applies advanced optimizations like filter pushdown, join reordering, and expression simplification automatically.
Soar is a suite of specialized tools designed for analyzing MySQL performance, advising on indexing, and optimizing SQL syntax. It functions as a performance analyzer, index advisor, and query optimizer to identify bottlenecks and suggest structural improvements for faster execution. The project distinguishes itself through a system for rewriting SQL statements into optimized equivalent versions using custom heuristic rules and patterns. It also features a dedicated index advisor that evaluates query patterns and database metadata to recommend the creation of new indexes. Its broader capabil
Applies a fixed set of rewrite rules to transform logical plans and improve execution speed.
Calcite est un framework pour l'analyse, l'optimisation et la traduction de requêtes SQL en algèbre relationnelle pour une exécution sur diverses sources de données. Il fonctionne comme un moteur de requête multi-sources, une bibliothèque d'analyse SQL et un optimiseur d'algèbre relationnelle. Le projet fournit un moteur d'optimisation basé sur les coûts qui transforme les plans de requête logiques en plans d'exécution physiques efficaces à l'aide de règles enfichables. Il utilise des adaptateurs de traduction pour convertir les requêtes SQL standard dans les formats natifs de bases de données et systèmes de messagerie externes, permettant la fédération de données sur des systèmes de stockage hétérogènes. Le système couvre le cycle de vie complet des requêtes, incluant l'analyse SQL et la validation par rapport aux schémas, la traduction d'expressions en opérateurs algébriques et la sélection de plans d'exécution efficaces. Il inclut également une interface en ligne de commande pour exécuter des requêtes et gérer les connexions aux sources de données.
Transforms logical plans into efficient physical plans using pluggable rules and cost-based functions.
Redka is a SQL-backed key-value store that implements a Redis-compatible interface. It acts as a relational data bridge, translating non-relational protocol commands into relational queries to provide persistence and ACID transactions using a SQL database backend. The system allows for relational data introspection by exposing internal key-value data as SQL views, enabling direct querying and analysis via relational syntax. It can be deployed as a standalone network server or as an embedded library within a Go application. The project supports a variety of data structures, including strings,
Exposes internal key-value data as SQL views to enable direct querying and analysis via relational syntax.
Ce projet est un dépôt de localisation chinoise et un projet de traduction technique conçu pour rendre les projets de programmation concis et la documentation technique accessibles aux sinophones. Il fournit une collection de ressources traduites et des mappages curés de terminologie informatique pour assurer une traduction cohérente des concepts techniques. Le projet implémente un flux de travail de localisation logicielle qui convertit les guides techniques et la documentation de base de code en langue anglaise vers le chinois. Ce processus utilise une ressource de glossaire technique et un modèle de localisation piloté par les ressources pour maintenir le sens original et le contexte technique du matériel source. Le dépôt couvre la traduction de documentation technique et la gestion de mappages de traduction standardisés pour assurer une formulation technique cohérente à travers divers projets logiciels.
Transforms query programs into more efficient versions using priority-based restructuring functions.
MiniOB is an open-source educational relational database kernel designed for learning the internals of database systems. It implements a dual-engine storage architecture combining B+ Tree and LSM-Tree, supports SQL parsing and query execution, and provides transactional processing with multi-version concurrency control. The system communicates with clients using the MySQL wire protocol and includes a vector database extension for storing and querying high-dimensional vectors. The project distinguishes itself through its comprehensive coverage of core database concepts in a single, learnable c
Rewrites query syntax trees using rules and statistics to produce faster execution plans.
This project is an educational resource and technical manual for Apache Spark, focused on the architecture and practical application of large-scale data processing. It serves as a guide for big data engineering and distributed computing, covering the principles of parallel processing and fault-tolerant data distribution. The material provides instructional content on designing distributed ETL pipelines and implementing data analysis workflows. It includes tutorials for polyglot data processing, offering patterns and examples for using Python, Scala, and Java within a unified environment. The
Provides detailed explanations of the Catalyst optimizer's rule-based and cost-based relational query transformation.