Apache Pulsar is a cloud-native distributed pub-sub messaging system designed for high-performance data ingestion. It functions as a geo-replicated data streamer and a multi-tenant event streaming platform, providing a serverless stream processing engine and a tiered storage messaging broker.
The system distinguishes itself by separating serving layers from storage layers to allow independent scaling of compute and data retention. It features native geo-replication to synchronize messages across different geographical regions and employs a multi-layered tenant isolation model using authentication and storage quotas to support multiple organizations on a single cluster.
The platform provides capabilities for atomic transaction management, message offset replay, and strict message ordering guarantees. Its operational surface includes a pluggable connector framework for external system connectivity, tiered storage for offloading historical data, and a REST interface for cluster management and resource provisioning.
The project provides official containerized deployment images and supports horizontal infrastructure scaling.