AliSQL

AliSQL is a fork of MySQL by Alibaba that extends the relational database management system with enhancements for high performance, scalability, and enterprise-grade availability. It retains the core MySQL identity as a SQL-based database for storing, organizing, and retrieving structured data, while adding optimizations for large-scale transactional and analytical workloads.

The project differentiates itself through a columnar engine that accelerates analytical queries directly on MySQL tables and through vector similarity search with HNSW indexing. It is also listed with capabilities from the wider MySQL ecosystem: a distributed, shared-nothing NDB Cluster engine for horizontal scalability and synchronous replication with automatic failover; InnoDB Cluster, an integrated high-availability solution that combines Group Replication, MySQL Router, and MySQL Shell to deploy fault-tolerant clusters; a NoSQL document store API for JSON collections; and the HeatWave in-memory columnar query accelerator.

Beyond these core differentiators, AliSQL covers the full breadth of MySQL capabilities: comprehensive API integration across .NET, C, C++, Java, Node.js, ODBC, PHP, and Python; data backup and restore with incremental, online, and cloud storage options; data replication and sync via Group Replication and GTID-based replication; and security features including encryption, authentication (LDAP, Kerberos, PAM), data masking, and auditing. It also includes tools for database administration, monitoring, performance optimization, and Kubernetes-based deployment and orchestration.

The project is documented through the standard MySQL documentation, which covers installation, configuration, and administration of the server and its associated tools.

Features

Relational Data Storage - Provides core relational data storage with SQL, transactions, indexes, and constraints for structured data.

Relational Database Management Systems - Provides an enterprise relational database management system for storing and retrieving structured data.

Analytical Query Engines - Executes complex aggregations and joins across large datasets using a columnar engine for faster analysis.

Live Table Analytics - Runs complex analytical queries directly on live MySQL tables using a columnar engine, removing the need for separate ETL pipelines.

Automatic Failover Controllers - Detects node failures and promotes replicas to primary automatically.

Columnar Analytics - Executes analytical SQL queries directly against MySQL tables using a DuckDB-powered columnar engine.

In-Database Analytical Engines - Embeds a DuckDB columnar engine to execute analytical SQL queries directly on MySQL tables.

In-Memory Columnar Accelerators - Ships an in-memory columnar query accelerator that delivers sub-second responses for analytical workloads.

In-Database Columnar Engines - Executes analytical SQL workloads directly on a columnar storage engine, accelerating aggregation and scan-heavy queries.

DuckDB-Powered Engines - Executes analytical SQL queries directly against MySQL tables using an embedded DuckDB columnar engine.

MySQL Server Restorations - Reconstructs a MySQL server from a previously created backup to recover from failures or migrate data.

Point-in-Time Database Snapshots - Creates point-in-time snapshots of data and restores them to recover from failures or migrate environments.

Savepoints - Provides named savepoints within transactions for partial rollback without aborting the entire operation.

Data Replication - Copies changes from one database instance to replicas to distribute read load or provide failover targets.

MySQL Connectors - Ships a standard JDBC driver for connecting Java applications to MySQL servers.

Transaction State Management - Controls transaction commit and rollback operations programmatically from applications.

DB System Provisioning - Creates and manages MySQL database systems through the HeatWave Console for hosting structured data.

Distributed SQL Databases - Deploys a shared-nothing cluster of MySQL servers that automatically partitions data and provides high availability through synchronous replication.

Query Routing - Routes database queries to healthy MySQL servers, skipping failed nodes to maintain availability.

Columnar Query Accelerators - Offloads analytical queries to a columnar engine for faster execution than the standard row-based engine.

Horizontal Database Scaling - Adds new data nodes to an existing cluster to increase storage and throughput.

Horizontal Scaling - Partitions tables automatically across cluster nodes for horizontal scaling without application changes.

Hybrid Transactional Analytical Platforms - Executes complex analytical queries directly on live transactional data using an integrated columnar engine.

JSON Document Storage - Creates, modifies, and queries collections of JSON documents stored in a MySQL database.

X DevAPI Document Stores - Provides a full X DevAPI document store for CRUD operations on JSON collections without SQL.

MySQL Administration Tools - Manages MySQL server instances, user accounts, and security configurations.

MySQL Cluster Management - Provisions and scales a columnar analytics cluster attached to a DB System for high-performance query execution.

Connection Routings - Integrates with MySQL Router to direct client traffic to the appropriate server instance, adapting to topology changes.

InnoDB Cluster Deployments - Provides an integrated high-availability solution combining Group Replication, MySQL Router, and MySQL Shell.

NDB Cluster Engines - Implements a distributed, shared-nothing database engine with horizontal scalability and synchronous replication.

In-Memory Columnar Accelerators - Ships an in-memory columnar query accelerator that runs analytical SQL workloads directly on MySQL tables.

GTID-Based Replications - Tracks and applies transactions across servers using Global Transaction Identifiers.

ReplicaSet Deployments - Creates new asynchronous GTID-based replication topologies with one primary and multiple secondaries using AdminAPI commands.

Server Backup - Creates a consistent physical backup of a MySQL server while it remains online and serving queries.

Cluster Lifecycle Management - Starts, stops, and restarts NDB Cluster nodes and processes from a central management interface.

InnoDB Cluster Lifecycles - Manages the lifecycle of InnoDB Cluster, ReplicaSet, and ClusterSet deployments via AdminAPI.

SQL Placeholder Bindings - Binds values to named or positional placeholders in SQL statements to prevent injection.

Raw SQL Execution - Executes arbitrary SQL commands against relational tables and views.

Traffic Routing - Routes client connections to the appropriate MySQL server based on load or topology, balancing requests across replicas.

SQL CRUD Operations - Executes SQL-style CRUD operations on relational tables within the database.

Group Replication - Replicates data synchronously across multiple MySQL servers for consistency and automatic failover.

Row Deletions - Removes rows from tables based on specified search conditions using SQL.

Row Insertions - Adds new records to database tables with specified column values via SQL commands.

Schema Management - Creates, drops, and lists schemas and accesses their collections and tables programmatically.

Database Session Lifecycles - Provides session management for connecting to MySQL servers with authentication and transaction handling.

SQL-Based Row Updates - Modifies existing records in tables using SQL expressions and conditional predicates.

SQL Query Editors - Executes SQL statements against a connected database, displays results as tables or charts, and supports reusable query files.

HNSW Indexes - Builds and queries Hierarchical Navigable Small World indexes for fast approximate nearest-neighbor search on vector data.

Vector Search - Executes similarity searches over vector columns using HNSW indexing to find nearest neighbors in high-dimensional data.

Vector Similarity Search - Finds rows whose vector embeddings are nearest to a query vector using an HNSW index for approximate nearest-neighbor search.

Database Session Management - Opens and controls connections to MySQL servers, handling authentication, transactions, and session lifecycle.

MySQL Command Line Interfaces - Provides a unified command-line interface for administering and monitoring MySQL server instances.

SQL Query Executions - Executes traditional SQL statements against MySQL Server from C or C++ code.

PHP SQL Executions - Provides PHP extension functions to run SQL statements against a MySQL server.

Database Failover Routers - Redirects traffic away from a failed or unreachable MySQL server to a remaining healthy one to maintain availability.

Distributed Database Clusters - Manages deployments through dedicated agent and client software for high availability and redundancy.

Database High Availability - Configures a group of MySQL server instances with automatic failover and transparent client routing.

Distributed Database Deployment - Spreads data across multiple nodes to eliminate single points of failure and keep the database available during hardware faults.

Load Balancers - Distributes incoming database connections among multiple MySQL servers to improve throughput and resource utilization.

Database Load Balancing - Distributes client connections among multiple MySQL instances to prevent any single server from becoming a bottleneck.

Node.js Database Integration - Provides a driver to connect Node.js applications to MySQL servers for executing queries.

Transit and At-Rest Encryption - Protects stored tables, logs, and network connections using configurable cipher suites and key management.

Transactional Row Locking - Wraps operations in transactions, creates savepoints, and locks rows to ensure data consistency.

Database Permissions - Authenticates users and grants or denies permissions on databases, tables, and columns to control read and write access.

MySQL Connections - Provides official PHP extensions to establish database connections from PHP scripts to MySQL.

SQL Row Selection - Retrieves rows matching search criteria with column projection and sorting using SQL.

Contextual AI Assistants - Provides an AI assistant that answers questions using on-screen context and indexed documentation.

Conversational Document Querying - Provides a fluent API to retrieve JSON documents from collections with projection and sorting.

Generative AI Integration Patterns - Integrates generative AI capabilities for data analysis and processing directly within the database.

Document Removal Operations - Ships a fluent API to delete JSON documents from collections based on search conditions.

In-Database Model Executions - Trains and executes machine learning models directly within the database environment.

Database Lifecycle Operators - Automates the lifecycle of MySQL clusters inside Kubernetes using dedicated operators.

C and C++ APIs - Provides C++ and plain C APIs to build applications that interact with MySQL databases.

Database Value Substitutions - Swaps original database values with predefined or generated alternatives to hide sensitive information.

Activity Auditing - Logs client connections, disconnections, and database actions to a file for compliance with regulatory guidelines.

Object Storage Federated Queries - Queries data residing in object storage alongside MySQL tables through a single SQL interface, enabling unified analysis of diverse datasets.

Multi-Primary Write Operations - Allows every server in a replication group to process write requests concurrently without a single primary bottleneck.

Cluster Restorations - Recreates a MySQL Cluster from a previously taken backup to recover from data loss or corruption.

Cross-Database Schema Migrations - Transfers database objects and data from another database system into MySQL with automated type mapping.

MySQL Target Migrations - Transfers schema and data from other database platforms to MySQL with automated conversion.

Document CRUD Operations - Creates, reads, updates, and deletes documents in a schema-less collection using a fluent query-builder API.

Document Updates - Updates fields of existing documents based on search conditions.

Database Backup Restoration - Restores a MySQL database from a backup image or directory, applying transaction logs to bring the data to a consistent state.

Backup Compression - Reduces the size of backup files using compression to save disk space and transfer time.

AI Workload Data Ingestion - Imports structured and unstructured data into tables optimized for machine learning and generative AI pipelines.

Asynchronous Replication Channels - Keeps replica clusters synchronized with the primary cluster using dedicated asynchronous replication channels.

X Protocol Connections - Opens sessions to MySQL servers using the X Protocol for document store and relational access.

X Protocol Sessions - Opens sessions over the X Protocol to run NoSQL-style document store operations.

Database Design Tools - Provides a tool to create and edit logical and physical data models with forward and reverse engineering support.

Active Cluster Failovers - Switches client traffic from an unavailable primary cluster to a replica cluster.

Active Cluster Routers - Directs client read and write requests to the correct cluster in a multi-datacenter deployment.

Database Object Browsers - Browses schemas, tables, views, routines, users, background tasks, AI models, and vector stores without requiring SQL.

Replication Topology Adoptions - Imports running replication topologies into AdminAPI for unified management.

Database Schema Designers - Provides a graphical interface to visually create, edit, and manage database schemas.

Database Schema Visualizers - Renders an interactive entity-relationship diagram from a live database or a design model.

Cross-Site Cluster Links - Links primary clusters with replicas in alternate locations for site-level failover.

Document Schema Enforcement - Enforces JSON schema validation on documents added to collections to ensure data consistency.

Document Insertion - Adds new JSON documents to a collection with automatic ID generation.

Cloud Storage Backups - Writes a backup directly to a cloud storage bucket (OCI, S3, GCP, OpenStack) for off-site retention.

Cloud Storage Restorations - Downloads and restores a backup from cloud object storage services such as Amazon S3 or Oracle Cloud Infrastructure Object Storage.

HeatWave Data Loaders - Loads data from external sources into the HeatWave engine to make it available for analytical queries.

Fluent Query Builders - Offers a contextual, chainable query-builder interface to construct database operations across the full DML surface.

Natural Language Querying - Translates plain-English or Chinese questions into SQL commands via an AI-powered CLI assistant.

JSON Data Import - Reads JSON files and inserts their contents into MySQL tables or collections using the utility API.

Upgrade Readiness Checks - Checks whether a server is ready to be upgraded to a newer release; the same utility API also imports JSON data into the database.

Script-Based Administrations - Executes JavaScript or Python scripts against a MySQL server to automate database management tasks.

Automated Cluster Operations - Runs NDB Cluster management tasks without manual intervention using the Cluster Manager agent and client.

Kubernetes Deployments - Automates provisioning and lifecycle management of MySQL NDB Clusters inside Kubernetes.

Primary Role Transfers - Performs controlled primary role transfers between clusters for high availability.

MySQL Extensions - Allows adding new functionality by writing and registering plugins that integrate directly with the server.

MySQL Compatibility - Connects to MySQL servers using the classic protocol for backward-compatible SQL operations.

AdminAPI Scripting Interfaces - Provides an administrative scripting interface that automates deployment and lifecycle management of InnoDB Cluster topologies.

Prepared Statements - Supports preparing, binding parameters, and executing parameterized SQL statements.

Query Result Fetching - Fetches rows and column data from executed SQL statements using result-handling functions.

Read Replicas - Offloads read queries to replica servers while keeping all writes on the source to improve throughput.

Passive Replica Reads - Serves read-only queries from replica clusters while they remain passive and do not accept writes.

Analytical Replica Reads - Runs analytical queries on a replica server so heavy analysis does not degrade source performance.

Generative AI Insights - Generates insights from database content using integrated generative AI for natural-language interactions.

Replica Load Balancers - Distributes read queries among multiple MySQL replica servers to improve throughput and reduce latency.

Read Load Balancers - Distributes read queries across multiple MySQL replicas to improve throughput and reduce latency.

Schema Change Logs - Records every schema modification with a timestamp and user identity, enabling rollback to any prior state.

SQL Execution Interfaces - Provides a C++ interface to run SQL statements against a MySQL database and retrieve result sets.

Key-Based Table Partitions - Divides tables into smaller physical segments based on a key to improve query performance and manageability.

Scripting Environment SQL Executions - Issues SQL statements against tables through a programmatic interface for scripting environments.

Database Management Extensions - Provides a native graphical interface within Visual Studio Code for managing MySQL databases and running queries.

Database REST APIs - Exposes tables, views, and procedures as RESTful endpoints with configurable CRUD operations.

Incremental Backup Storage - Captures only changes since the last backup to reduce storage and speed up subsequent backups.

Database Rolling Upgrades - Coordinates rolling upgrades of MySQL Cluster nodes to new software versions.

Manual Promotions - Promotes a secondary instance to primary on demand when the current primary becomes unavailable.

Database Cluster Operators - Runs and operates database clusters inside Kubernetes using a dedicated operator.

Database Cluster Deployments - Provides a dedicated operator for deploying MySQL NDB Cluster on Kubernetes.

NDB Cluster Deployments - Ships a dedicated operator for deploying MySQL NDB Cluster on Kubernetes.

Database Deployments - Deploys and manages MySQL instances within Kubernetes using a dedicated operator.

Version Upgrades - Migrates MySQL installations to newer releases while preserving data.

Automatic Primary Elections - Automatically elects a new primary writer when the current one becomes unavailable.

Database PAM Authentication - Authenticates MySQL users through the Pluggable Authentication Modules interface, supporting Unix passwords and LDAP.

C-API Wrappers - Provides low-level C functions to connect to a MySQL server and execute SQL queries directly from C programs.

SQL Expression Evaluation - Constructs and evaluates arithmetic, logical, and JSON expressions for filtering and projection in database operations.

Multi-Language Script Execution - Executes SQL, JavaScript, and Python scripts against a MySQL server from within the editor.

Multi-Provider Authenticators - Provides REST API authentication against built-in accounts, MySQL accounts, and third-party OAuth2 providers.

Data Hashing Utilities - Applies cryptographic hashing to produce fixed-size digests used for data validation and signing.

Data Masking - De-identifies strings, numbers, phone numbers, and emails through obfuscation, random generation, and dictionary replacement.

Database Value Obfuscation - Replaces real values in database fields with substitutes that remove identifying characteristics while preserving data usability.

Digital Signature Generators - Creates and verifies cryptographic signatures to confirm data origin and integrity.

Encrypted Backups - Encrypts backup images and directories during backup operations to protect data at rest.

Encryption Key Management - Stores and manages encryption keys using Oracle Key Vault as a backend keyring.

Asymmetric Encryption - Implements public-key cryptography for encrypting sensitive data with separate encryption and decryption keys.

REST Resource Authorizers - Controls data access per user or role using ownership, role, group, and hierarchy-based authorization models.

Kerberos Authentication - Authenticates MySQL users using their Kerberos principals for single sign-on.

LDAP Authentication - Authenticates MySQL users against an LDAP directory using password, SASL, or GSSAPI/Kerberos methods.

Database Server Hardening - Secures a MySQL deployment by applying enterprise-grade security features and following a step-by-step hardening procedure on Linux.

SQL Pattern Allowlists - Provides per-account SQL pattern allowlisting with a learning mode to build custom allowlists for enforcement.

User Access Management - Defines user accounts and group permissions to control who can manage or query the HeatWave environment.

Host-Based Access Restrictors - Controls which client hosts and user accounts may connect and what operations they may perform.

Database Upgrade Validations - Migrates MySQL 5.7 to 8.0 with documented steps and pre-upgrade testing.

Machine Learning Pipelines - Automates end-to-end machine learning pipelines, from data preparation to inference, within the database.

Database Node Distribution - Distributes data and processing across independent nodes for horizontal scaling without contention.

Database Server Management - Provides a graphical dashboard to start, stop, and monitor MySQL server processes.

Server Thread Pools - Replaces one-thread-per-connection with a thread pool that reuses threads to reduce overhead under many concurrent clients.

Client-Server Setups - Offers a MySQL branch that integrates analytical engines as native storage engines.

Infrastructure and Backend - Offers an optimized MySQL distribution for high-scale environments.
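
The Savepoints, SQL Placeholder Bindings, and Transaction State Management entries in the list above can be illustrated with a minimal sketch. It assumes SQLite (through Python's standard sqlite3 module) as a stand-in so that it runs without a server; the table name accounts and the savepoint name before_bob are hypothetical. The SAVEPOINT, ROLLBACK TO SAVEPOINT, and RELEASE SAVEPOINT statements are written the same way in MySQL, but placeholder syntax varies by driver (MySQL Connector/Python, for example, uses %s and %(name)s rather than ? and :name).

```python
import sqlite3

# Assumption: SQLite stands in for a MySQL-compatible server so the sketch is
# self-contained. isolation_level=None gives manual transaction control.
conn = sqlite3.connect(":memory:", isolation_level=None)
cur = conn.cursor()
cur.execute(
    "CREATE TABLE accounts (id INTEGER PRIMARY KEY, owner TEXT, balance INTEGER)"
)

cur.execute("BEGIN")

# Positional placeholders: values are bound, never spliced into the SQL text.
cur.execute("INSERT INTO accounts (owner, balance) VALUES (?, ?)", ("alice", 100))

# Mark a point inside the transaction that can be rolled back to later.
cur.execute("SAVEPOINT before_bob")

# Named placeholders bound from a mapping; the value looks like SQL but is
# treated purely as data.
cur.execute(
    "INSERT INTO accounts (owner, balance) VALUES (:owner, :balance)",
    {"owner": "bob'); DROP TABLE accounts; --", "balance": 50},
)

# Undo only the work done since the savepoint; the first insert is kept.
cur.execute("ROLLBACK TO SAVEPOINT before_bob")
cur.execute("RELEASE SAVEPOINT before_bob")

# Commit the remaining work of the transaction.
cur.execute("COMMIT")

owners = [row[0] for row in cur.execute("SELECT owner FROM accounts ORDER BY id")]
print(owners)
```

Only the first row is committed, because the rollback targets the savepoint rather than the whole transaction. The bound owner string, although it resembles SQL, is handled as a value and is never parsed as a statement.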
