2 repositories
Optimizations for reducing data ingestion latency through lazy decoding and parallel metadata retrieval.
Distinct from Parallel Data Loading: candidates there focus on UI lazy loading or audio-specific pre-loading, whereas this category covers general, framework-level acceleration of ML data loading.
Explore 2 awesome GitHub repositories matching data & databases · Data Loading Accelerators.
This project is a dataset management framework and cross-framework data loader that provides a unified interface for reading data formats compatible with TensorFlow, JAX, and PyTorch. It serves as a library of curated public datasets provided as data streams and includes tools for building, versioning, and documenting large-scale datasets. The system differentiates itself through a distributed data processing engine capable of managing massive datasets across clusters using parallelized pipelines. It utilizes builder-based construction to standardize how data is downloaded and prepared, while
Increases loading speed using partial decoding, parallel metadata retrieval, and in-memory caching.
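As a rough illustration of the three techniques named above (partial decoding, parallel metadata retrieval, and in-memory caching), here is a minimal standard-library Python sketch. It is not taken from the repository: the `RECORDS` store and the `fetch_metadata`, `decode_payload`, and `load_index` names are hypothetical, and a real loader would read headers from files or remote storage rather than from a dict.

```python
import json
from concurrent.futures import ThreadPoolExecutor
from functools import lru_cache

# Hypothetical stand-in for storage: each record has a cheap header ("meta")
# and an encoded payload that is expensive to decode.
RECORDS = {
    f"rec{i}": {
        "meta": {"id": i, "label": i % 2},
        "payload": json.dumps(list(range(i, i + 3))).encode(),
    }
    for i in range(4)
}

DECODE_CALLS = []  # records which payloads were actually decoded


def fetch_metadata(key):
    """Read only the header; the payload stays encoded (partial decoding)."""
    return RECORDS[key]["meta"]


@lru_cache(maxsize=128)
def decode_payload(key):
    """Decode a payload on first access and cache the result in memory."""
    DECODE_CALLS.append(key)
    return tuple(json.loads(RECORDS[key]["payload"]))


def load_index(keys, workers=4):
    """Fetch metadata for all keys concurrently (parallel metadata retrieval)."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return dict(zip(keys, pool.map(fetch_metadata, keys)))


index = load_index(sorted(RECORDS))

# Filter on metadata alone, then decode only the records that are needed.
wanted = [key for key, meta in index.items() if meta["label"] == 1]
samples = [decode_payload(key) for key in wanted]

# A repeated request is served from the cache without decoding again.
decode_payload(wanted[0])
```

The point of the design is that filtering happens on metadata only, so records that are never selected are never decoded, and repeated reads of the same record cost a cache lookup.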
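This is a PyTorch model serving framework designed to deploy and scale machine learning models in production through scalable network endpoints. It acts as a high-performance inference server, optimizer, and model lifecycle manager, handling model loading, request batching, and hardware acceleration. The system stands out through advanced orchestration and optimization features, such as chaining multiple models into sequential workflows using execution graphs and applying dynamic batching to raise throughput and reduce latency. It provides dedicated support for generative AI and large language models through continuous batching and tensor parallelism. Broad functional areas include GPU resource management across hardware such as NVIDIA, AMD, and Apple Silicon, as well as comprehensive model lifecycle management for registration, versioning, and worker scaling. It also integrates observability tools that track system health and model performance through Prometheus-compatible metrics. The server is managed through a command-line interface for lifecycle control and runtime parameter configuration.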
Accelerates media data loading and preprocessing using optimized building blocks to eliminate bottlenecks.