Why is duckdb/duckdb a recommended SQL Engines GitHub Repositories repository?

Executes standard SQL commands to transform, join, and analyze data from diverse formats.

Why is prestodb/presto a recommended SQL Engines GitHub Repositories repository?

Supports complex data manipulation through advanced SQL operations like UPDATE and MERGE.

Why is harelba/q a recommended SQL Engines GitHub Repositories repository?

Acts as a relational query processor that treats CSV and TSV files as tables for SQL execution.

Why is databendlabs/databend a recommended SQL Engines GitHub Repositories repository?

Implements a SQL-compliant engine that manages complex query execution over large-scale cloud storage.

Why is dinedal/textql a recommended SQL Engines GitHub Repositories repository?

Provides a relational query processor that treats CSV files as tables for SQL execution.

Why is alasql/alasql a recommended SQL Engines GitHub Repositories repository?

Implements a relational query processor that executes SQL against JavaScript arrays and JSON objects.

Why is wireservice/csvkit a recommended SQL Engines GitHub Repositories repository?

Translates SQL queries into in-memory operations on CSV data without requiring a database server.

Why is apache/hive a recommended SQL Engines GitHub Repositories repository?

Provides a SQL-on-Hadoop data warehouse that queries and manages petabytes of data stored in distributed storage.

Why is baserow/baserow a recommended SQL Engines GitHub Repositories repository?

Translates a custom expression language into optimized SQL queries for efficient server-side data calculation.

9 repositorios

Awesome GitHub RepositoriesSQL Engines

Q: What are the best Awesome SQL Engines GitHub Repositories?

Relational query processors for standard SQL execution. **Distinguishing note:** Focuses on the query processing capability rather than the storage engine. Explore 9 awesome GitHub repositories matching data & databases · SQL Engines. Refine with filters or upvote what's useful. Top picks: duckdb/duckdb, prestodb/presto, harelba/q, databendlabs/databend, dinedal/textql, alasql/alasql, wireservice/csvkit, apache/hive, baserow/baserow.

Relational query processors for standard SQL execution.

Distinguishing note: Focuses on the query processing capability rather than the storage engine.

Explore 9 awesome GitHub repositories matching data & databases · SQL Engines. Refine with filters or upvote what's useful.

Encuentra los mejores repositorios con IA.Buscaremos los repositorios que mejor coincidan usando IA.

duckdb/duckdb
duckdb/duckdb
38,805Ver en GitHub
DuckDB is an in-process analytical database engine designed to run directly within an application process. As a zero-dependency, embedded system, it provides enterprise-grade SQL data processing capabilities without the overhead of managing a dedicated database server. It is built to handle complex analytical and aggregation tasks by storing and retrieving information in columns, allowing for high-performance relational data manipulation. The engine distinguishes itself through a columnar vectorized execution model that maximizes CPU cache efficiency during query operations. It employs adapti
Executes standard SQL commands to transform, join, and analyze data from diverse formats.
C++analyticsdatabaseembedded-database
Ver en GitHub38,805
prestodb/presto
prestodb/presto
16,711Ver en GitHub
Presto is a distributed SQL query engine designed for high-performance analytical processing across heterogeneous data sources. It functions as a data federation platform and massively parallel processing engine, allowing users to execute interactive queries against diverse storage systems without requiring data migration. By mapping remote metadata and structures to a unified relational namespace, it enables seamless cross-platform analysis through a standard SQL interface. The engine distinguishes itself through a pluggable connector architecture and a shared-nothing distributed processing
Supports complex data manipulation through advanced SQL operations like UPDATE and MERGE.
Javabig-datadatahadoop
Ver en GitHub16,711
harelba/q
harelba/q
10,353Ver en GitHub
q is a command-line utility for the processing, filtering, and aggregation of tabular text and database files using standard SQL syntax. It functions as a query engine that treats CSV and TSV files, as well as standard input, as relational database tables. The tool distinguishes itself by providing a persistent cache layer that stores processed tabular data in a binary format to accelerate repeated queries on large datasets. It also maps individual filenames or stream identifiers to relational table names, enabling SQL joins across disparate text files. The project covers a broad range of da
Acts as a relational query processor that treats CSV and TSV files as tables for SQL execution.
Pythonclicommand-linecommand-line-tool
Ver en GitHub10,353
databendlabs/databend
databendlabs/databend
9,351Ver en GitHub
Databend is a cloud-native data warehouse and OLAP database designed for large-scale analytics. It functions as a SQL-compliant engine and serverless analytics platform that separates compute from storage to allow for independent scaling. The system integrates vector database capabilities, indexing high-dimensional embeddings to enable semantic, hybrid, and full-text searches across massive datasets. It further distinguishes itself through serverless compute management that automatically scales resources based on demand and shuts them down during idle periods. The platform covers a broad set
Implements a SQL-compliant engine that manages complex query execution over large-scale cloud storage.
Rustaibigdatacloud-native
Ver en GitHub9,351
dinedal/textql
dinedal/textql
9,109Ver en GitHub
TextQL is a command line SQL query engine designed to execute relational queries directly against structured text files, such as CSV and TSV, without requiring a database import. It functions as a relational text file analyzer and a CSV processor that treats plain text files as virtual tables for filtering, joining, and aggregating data. The tool is built as a pipe-compatible data transformation utility, allowing it to process data from standard input and output formatted datasets. It enables relational joins across multiple files or directories within a single query to analyze relationships
Provides a relational query processor that treats CSV files as tables for SQL execution.
Go
Ver en GitHub9,109
alasql/alasql
AlaSQL/alasql
7,278Ver en GitHub
AlaSQL is a JavaScript SQL database engine that allows for the filtering, grouping, and joining of in-memory object arrays and JSON data. It functions as an in-memory SQL database and client-side data processor, enabling the execution of SQL statements against JavaScript arrays and external data sources in both browser and server environments. The project serves as a universal data query tool capable of performing relational joins across diverse sources, such as merging Google Spreadsheets, SQLite files, and remote APIs into a single result set. It also acts as an IndexedDB SQL wrapper, allow
Implements a relational query processor that executes SQL against JavaScript arrays and JSON objects.
JavaScript
Ver en GitHub7,278
wireservice/csvkit
wireservice/csvkit
6,390Ver en GitHub
csvkit is a composable Unix-style command-line toolkit for converting, filtering, and analyzing CSV files directly from the terminal. It provides a suite of focused single-purpose commands that can be combined via pipes to build complex data processing workflows, with a modular architecture that includes a column-type inference engine for automatically detecting data types and a streaming-pipeline design for efficient handling of tabular data. The toolkit distinguishes itself through its SQL-engine abstraction layer, which allows users to run SQL queries directly against CSV files without req
Translates SQL queries into in-memory operations on CSV data without requiring a database server.
Python
Ver en GitHub6,390
apache/hive
apache/hive
6,012Ver en GitHub
Apache Hive is a SQL-on-Hadoop data warehouse that enables querying and managing petabytes of data stored in distributed storage such as HDFS and cloud storage services. It provides a familiar SQL interface for batch analytics and reporting, supported by a core set of components including the HiveServer2 Thrift service for remote query execution, the Hive Metastore Service for central metadata management, the Hive ACID Transaction Engine for concurrent read-write operations, and the Hive LLAP Interactive Engine for low-latency analytical processing. The WebHCat REST API offers an HTTP interfac
Provides a SQL-on-Hadoop data warehouse that queries and manages petabytes of data stored in distributed storage.
Javaapachebig-datadatabase
Ver en GitHub6,012
baserow/baserow
baserow/baserow
4,188Ver en GitHub
Baserow is a self-hosted, no-code relational database platform built on PostgreSQL. It provides a spreadsheet-like interface for structuring and managing data without writing code, while exposing all database resources via a REST API to support headless architectures. The platform distinguishes itself by integrating large language models and embedding servers to power AI assistants and automated data generation. It further extends its utility as a no-code application builder, allowing users to create custom internal portals, dashboards, and business tools using visual logic and managed data.
Translates a custom expression language into optimized SQL queries for efficient server-side data calculation.
Pythonairtableairtable-alternativeairtable-replacement
Ver en GitHub4,188

Awesome SQL Engines GitHub Repositories

duckdb/duckdb

prestodb/presto

harelba/q

databendlabs/databend

dinedal/textql

AlaSQL/alasql

wireservice/csvkit

apache/hive

baserow/baserow

Explorar subetiquetas