1 repo
Systems for tracking the origin, versioning, and transformation history of datasets used in machine learning workflows.
Distinguishing note: Focuses on data provenance and versioning rather than the training process itself.
Explore 1 awesome GitHub repository matching artificial intelligence & ml · Data Lineage. Refine with filters or upvote what's useful.
Track data sources automatically during model training by logging paths, formats, and versions of datasets read from distributed storage systems.