1 مستودع
Optimization of data read/write paths between compute engines and distributed storage.
Distinct from Big Data Processing: Specifically focuses on the orchestration layer performance rather than general big data processing frameworks.
Explore 1 awesome GitHub repository matching data & databases · Performance Tuning. Refine with filters or upvote what's useful.
Alluxio is a virtual distributed file system and data orchestration layer that serves as a high-performance caching layer between cloud storage and compute clusters. It acts as a distributed data cache designed to accelerate data access for large-scale analytics and machine learning workloads. The system provides a unified interface that presents multiple heterogeneous storage backends as a single coherent namespace. This allows for the unification of diverse storage systems, enabling computation engines to access data from different providers without changing application code. The project c
Accelerates data-intensive workloads by optimizing how computation engines read and write to distributed storage.