awesome-repositories.com
© 2026 Bringes Technology SRL·VAT RO45896025·hello@bringes.io
MCPSitemapPrivacyTerms
Machine Learning Data Engineering · Awesome GitHub Repositories

1 repo

Awesome GitHub RepositoriesMachine Learning Data Engineering

Tools and processes for cleaning, transforming, and preparing raw data for machine learning model training.

Explore 1 awesome GitHub repository matching artificial intelligence & ml · Machine Learning Data Engineering. Refine with filters or upvote what's useful.

  1. Home
  2. Artificial Intelligence & ML
  3. Machine Learning Pipelines
  4. Machine Learning Data Engineering

Awesome Machine Learning Data Engineering GitHub Repositories

Describe the repository you're looking for…
We'll search the best matching repositories with AI.
  • karpathy/nanoGPT

    karpathy/nanoGPT

    53,461GitHubView on GitHub↗

    nanoGPT is a lightweight engine for training and fine-tuning transformer-based language models from scratch. It provides a minimalist codebase designed for educational exploration and rapid experimentation with neural network architectures, utilizing self-attention and feed-forward layers to process sequences and predi

    Python

Explore sub-tags

  • Dataset Preprocessing UtilitiesTools for converting raw data into optimized binary formats for efficient model ingestion.