2 dépôts
Processes for converting raw data formats into numerical representations.
Distinguishing note: Focuses on the conversion of non-numeric data types for ML consumption.
Explore 2 awesome GitHub repositories matching artificial intelligence & ml · Data Encoding. Refine with filters or upvote what's useful.
This project is an interactive data science environment that combines code execution, rich media visualization, and narrative documentation into a persistent, browser-based platform. It serves as a comprehensive educational resource for scientific computing, providing a framework for iterative data analysis and machine learning prototyping. The environment is distinguished by its focus on high-performance numerical computing, utilizing vectorized array operations and memory-mapped data structures to handle large-scale computations efficiently. It features a unified estimator interface that st
Converts pixel values into numerical features to enable machine learning models to process visual information.
minGPT is a minimal implementation of the Transformer architecture designed for training and experimenting with language models. It functions as a neural network training framework and a text generation engine, providing the necessary tools to manage data loading, backpropagation, and parameter updates for custom deep learning models. The project is structured as an educational resource for understanding how transformer architectures function by building and training models from scratch. It utilizes a modular block architecture and transformer-based self-attention to process sequences, allowi
Converts raw text data into numerical representations suitable for machine learning consumption.