# nyandwi/machine_learning_complete

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/nyandwi-machine-learning-complete).**

4,983 stars · 833 forks · Jupyter Notebook · mit

## Links

- GitHub: https://github.com/Nyandwi/machine_learning_complete
- Homepage: https://nyandwi.com/machine_learning_complete/
- awesome-repositories: https://awesome-repositories.com/repository/nyandwi-machine-learning-complete.md

## Topics

`computer-vision` `data-analysis` `data-science` `data-visualization` `datascience` `deep-learning` `keras` `machine-learning` `matplotlib` `neural-networks` `nlp` `numpy` `open-source` `pandas` `python` `scikit-learn` `seaborn` `tensorflow`

## Description

This is an interactive notebook-based course that teaches machine learning from Python fundamentals through deep learning and natural language processing. It uses real datasets and multiple frameworks within a structured, hands-on curriculum that combines concise explanations with executable code cells, built-in datasets, and embedded exercise checkpoints. Learning progresses through data preparation and exploration, classical machine learning workflows, computer vision with convolutional neural networks, and natural language processing with deep learning, all delivered as a cohesive progression of Jupyter notebooks.

The pedagogical approach uses multiple frameworks—including NumPy, Pandas, scikit-learn, TensorFlow, Keras, and Hugging Face—in a single cohesive sequence. Each concept is introduced with minimal explanatory text and runnable code that can be modified and rerun, and inline tasks require immediate application of newly introduced techniques. The curriculum builds skills across data loading, manipulation, visualization, and preprocessing; classical machine learning algorithms; neural network construction and training; computer vision pipelines; and natural language processing tasks including text classification with transformers.

The entire curriculum is delivered as Jupyter notebooks that combine text, code, and visualizations, and can be run interactively in any notebook environment.

## Tags

### Artificial Intelligence & ML

- [Machine Learning Training](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/machine-learning-training.md) — Provides a comprehensive suite for training, fine-tuning, and aligning machine learning models using historical data. ([source](https://nyandwi.com/machine_learning_complete/12_intro_to_sklearn/))
- [Classical Machine Learning](https://awesome-repositories.com/f/artificial-intelligence-ml/classical-machine-learning.md) — Implements classical algorithms including Random Forests, SVMs, and Linear Regression for structured data.
- [Classification and Clustering Models](https://awesome-repositories.com/f/artificial-intelligence-ml/classification-and-clustering-models.md) — Implements traditional classification and clustering models, including Decision Trees and Random Forests. ([source](https://cdn.jsdelivr.net/gh/nyandwi/machine_learning_complete@main/README.md))
- [Classification Models](https://awesome-repositories.com/f/artificial-intelligence-ml/classification-models.md) — Constructs neural networks and algorithms to assign predefined labels to data points through binary and multiclass classification. ([source](https://nyandwi.com/machine_learning_complete/11_ml_fundamentals/))
- [Convolutional Neural Networks](https://awesome-repositories.com/f/artificial-intelligence-ml/convolutional-neural-networks.md) — Develops convolutional neural networks for extracting visual features and classifying images. ([source](https://nyandwi.com/machine_learning_complete/28_intro_to_computer_vision_and_cnn/))
- [Sequence Padding Utilities](https://awesome-repositories.com/f/artificial-intelligence-ml/convolutional-operations/input-padding-utilities/padding-maskers/sequence-padding-utilities.md) — Standardizes input sequence lengths using truncation or padding with special tokens. ([source](https://nyandwi.com/machine_learning_complete/31_intro_to_nlp_and_text_preprocessing/))
- [Data Preparation](https://awesome-repositories.com/f/artificial-intelligence-ml/data-preparation.md) — Provides utilities for cleaning, normalizing, and structuring raw datasets for machine learning workflows. ([source](https://nyandwi.com/machine_learning_complete/12_intro_to_sklearn/))
- [Data Cleansing](https://awesome-repositories.com/f/artificial-intelligence-ml/dataset-preparation-utilities/data-cleansing.md) — Provides implementations for removing duplicates and handling missing data to ensure high-quality training sets. ([source](https://nyandwi.com/machine_learning_complete/07_intro_to_data_preparation/))
- [Ensemble Methods](https://awesome-repositories.com/f/artificial-intelligence-ml/decision-trees/ensemble-methods.md) — Provides implementations of ensemble methods like Random Forests to improve model accuracy and stability.
- [Deep Learning Architectures](https://awesome-repositories.com/f/artificial-intelligence-ml/deep-learning-architectures.md) — Provides frameworks and guides for constructing multi-layered neural networks using sequential and functional architectures. ([source](https://nyandwi.com/machine_learning_complete/25_intro_to_tensorflow_for_deeplearning/))
- [Deep Learning Development](https://awesome-repositories.com/f/artificial-intelligence-ml/deep-learning-development.md) — Provides comprehensive guides for designing and training CNNs and RNNs using TensorFlow.
- [Feature Scale Normalization](https://awesome-repositories.com/f/artificial-intelligence-ml/feature-scale-normalization.md) — Implements preprocessing techniques to scale numeric features to a standard range for stable model convergence. ([source](https://nyandwi.com/machine_learning_complete/07_intro_to_data_preparation/))
- [Linear Regression](https://awesome-repositories.com/f/artificial-intelligence-ml/linear-regression.md) — Implements statistical methods for modeling relationships between variables using linear equations for continuous value prediction. ([source](https://nyandwi.com/machine_learning_complete/13_linear_models_for_regression/))
- [Machine Learning Implementations](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning-implementations.md) — Provides code-based reference implementations of core classical machine learning algorithms. ([source](https://nyandwi.com/machine_learning_complete/))
- [Computer Vision](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/frameworks/computer-vision.md) — Implements image classification and augmentation pipelines using convolutional neural networks and pretrained models. ([source](https://cdn.jsdelivr.net/gh/nyandwi/machine_learning_complete@main/README.md))
- [Computer Vision Pipelines](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/frameworks/computer-vision/computer-vision-pipelines.md) — Implements image classification pipelines using image augmentation and convolutional neural networks.
- [Layered Architectures](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/frameworks/model-construction/neural-network-layers/convolution-layers/layered-architectures.md) — Builds layered architectures that stack convolutional, pooling, and dense units for image classification.
- [Model Evaluation Metrics](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-evaluation-and-validation/model-evaluation-metrics.md) — Implements the measurement of predictive quality using regression metrics and classification scores on unseen data. ([source](https://nyandwi.com/machine_learning_complete/11_ml_fundamentals/))
- [Training and Evaluation Pipelines](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-training-and-tuning/training-frameworks/training-and-evaluation-pipelines.md) — Provides end-to-end workflows for optimizing network weights and validating performance against test sets. ([source](https://nyandwi.com/machine_learning_complete/28_intro_to_computer_vision_and_cnn/))
- [Model Performance Evaluators](https://awesome-repositories.com/f/artificial-intelligence-ml/model-performance-evaluators.md) — Provides tools to quantify model reliability using accuracy, precision, recall, and confusion matrices. ([source](https://nyandwi.com/machine_learning_complete/16_support_vector_machines_for_classification/))
- [Classical ML Algorithms](https://awesome-repositories.com/f/artificial-intelligence-ml/model-training/classical-ml-algorithms.md) — Provides practical implementations of classical supervised and unsupervised algorithms like decision trees and linear models. ([source](https://nyandwi.com/machine_learning_complete/outline/))
- [Natural Language Processing](https://awesome-repositories.com/f/artificial-intelligence-ml/natural-language-processing.md) — Implements natural language processing techniques including word embeddings and transformer-based text classification. ([source](https://cdn.jsdelivr.net/gh/nyandwi/machine_learning_complete@main/README.md))
- [Text Tokenization](https://awesome-repositories.com/f/artificial-intelligence-ml/natural-language-processing/text-tokenization.md) — Segments raw text into words or characters and encodes them into integers for model compatibility. ([source](https://nyandwi.com/machine_learning_complete/31_intro_to_nlp_and_text_preprocessing/))
- [Word Embeddings](https://awesome-repositories.com/f/artificial-intelligence-ml/natural-language-processing/word-embeddings.md) — Creates vector representations of words that capture semantic relationships in high-dimensional space. ([source](https://nyandwi.com/machine_learning_complete/32_using_word_embeddings_to_represent_texts/))
- [Regression Neural Networks](https://awesome-repositories.com/f/artificial-intelligence-ml/neural-network-construction/regression-neural-networks.md) — Constructs deep learning models with customizable input and output layers to predict continuous numerical values. ([source](https://nyandwi.com/machine_learning_complete/26_neural_networks_for_regresion_with_tensorflow/))
- [Neural Network Training](https://awesome-repositories.com/f/artificial-intelligence-ml/neural-network-training.md) — Implements the process of adjusting internal network weights via iterative cycles to minimize error and recognize patterns. ([source](https://nyandwi.com/machine_learning_complete/27_neural_networks_for_classification_with_tensorflow/))
- [Random Forest Ensembles](https://awesome-repositories.com/f/artificial-intelligence-ml/random-forest-ensembles.md) — Trains random forest ensembles that aggregate multiple decision trees to minimize overfitting. ([source](https://nyandwi.com/machine_learning_complete/20_random_forests_for_classification/))
- [Recurrent Neural Networks](https://awesome-repositories.com/f/artificial-intelligence-ml/recurrent-neural-networks.md) — Implements recurrent neural network architectures like LSTM and GRU for processing sequential and time-series data. ([source](https://nyandwi.com/machine_learning_complete/24_intro_to_neural_networks/))
- [Regression Models](https://awesome-repositories.com/f/artificial-intelligence-ml/regression-models.md) — Implements various models to estimate continuous numeric outcomes based on provided input features. ([source](https://nyandwi.com/machine_learning_complete/11_ml_fundamentals/))
- [Sentiment Analysis Tools](https://awesome-repositories.com/f/artificial-intelligence-ml/sentiment-analysis-tools.md) — Implements general-purpose sentiment labeling tools to classify text as positive, negative, or neutral. ([source](https://nyandwi.com/machine_learning_complete/32_using_word_embeddings_to_represent_texts/))
- [Sequence Modeling](https://awesome-repositories.com/f/artificial-intelligence-ml/sequence-modeling.md) — Implements sequence modeling techniques including attention-based transformers for context-aware text analysis.
- [Sequential Learning](https://awesome-repositories.com/f/artificial-intelligence-ml/sequential-learning.md) — Implements methods for training models on time-series or ordered data sequences to capture temporal dependencies. ([source](https://nyandwi.com/machine_learning_complete/33_recurrent_neural_networks/))
- [Supervised Learning](https://awesome-repositories.com/f/artificial-intelligence-ml/supervised-learning.md) — Includes comprehensive implementations for training models on labeled data to solve classification and regression tasks. ([source](https://github.com/Nyandwi/machine_learning_complete/blob/main/5_intro_to_machine_learning/ml_fundamentals.md))
- [Tabular Feature Engineering](https://awesome-repositories.com/f/artificial-intelligence-ml/tabular-feature-engineering.md) — Implements processes for converting raw tabular data into model-ready formats through encoding and cleaning. ([source](https://nyandwi.com/machine_learning_complete/22_intro_to_unsupervised_learning_with_kmeans_clustering/))
- [Text Classifiers](https://awesome-repositories.com/f/artificial-intelligence-ml/text-classifiers.md) — Provides models and tools for assigning predefined categories or sentiments to text documents. ([source](https://nyandwi.com/machine_learning_complete/33_recurrent_neural_networks/))
- [Text-to-Numeric Transformations](https://awesome-repositories.com/f/artificial-intelligence-ml/text-to-numeric-transformations.md) — Converts raw text into numerical representations through tokenization and padding for machine learning model compatibility. ([source](https://nyandwi.com/machine_learning_complete/34_using_cnns_and_rnns_for_texts_classification/))
- [Training and Testing Splits](https://awesome-repositories.com/f/artificial-intelligence-ml/training-and-testing-splits.md) — Implements methods for dividing datasets into distinct training, validation, and test sets to evaluate generalization. ([source](https://nyandwi.com/machine_learning_complete/07_intro_to_data_preparation/))
- [Transfer Learning](https://awesome-repositories.com/f/artificial-intelligence-ml/transfer-learning.md) — Implements transfer learning to adapt pretrained models to new tasks by freezing existing layers. ([source](https://nyandwi.com/machine_learning_complete/30_cnn_architectures_and_transfer_learning/))
- [Transformer Architecture Implementation](https://awesome-repositories.com/f/artificial-intelligence-ml/transformer-architecture-implementation.md) — Provides implementations of sequence-to-sequence transformer models using self-attention mechanisms, encoders, and decoders. ([source](https://nyandwi.com/machine_learning_complete/24_intro_to_neural_networks/))
- [Unsupervised Learning](https://awesome-repositories.com/f/artificial-intelligence-ml/unsupervised-learning.md) — Implements algorithms for discovering patterns in unlabeled data via clustering and dimensionality reduction. ([source](https://github.com/Nyandwi/machine_learning_complete/blob/main/5_intro_to_machine_learning/ml_fundamentals.md))
- [Activation Functions](https://awesome-repositories.com/f/artificial-intelligence-ml/activation-functions.md) — Configures non-linear activation functions like ReLU and Softmax to dictate neural network output formats. ([source](https://nyandwi.com/machine_learning_complete/24_intro_to_neural_networks/))
- [Boosting Algorithms](https://awesome-repositories.com/f/artificial-intelligence-ml/boosting-algorithms.md) — Trains a sequence of models where each subsequent model is designed to minimize the errors of the previous ones. ([source](https://nyandwi.com/machine_learning_complete/21_ensemble_models/))
- [Centroid-Based Clustering](https://awesome-repositories.com/f/artificial-intelligence-ml/centroid-based-clustering.md) — Implements clustering algorithms that group unlabeled data by iteratively calculating centroids and assigning points. ([source](https://nyandwi.com/machine_learning_complete/22_intro_to_unsupervised_learning_with_kmeans_clustering/))
- [Clustering](https://awesome-repositories.com/f/artificial-intelligence-ml/clustering.md) — Applies unsupervised methods to group data points into similar clusters based on inherent patterns without labels. ([source](https://nyandwi.com/machine_learning_complete/11_ml_fundamentals/))
- [Image Augmentation](https://awesome-repositories.com/f/artificial-intelligence-ml/computer-vision-systems/computer-vision/image-augmentation.md) — Provides image augmentation techniques like random rotation and flipping to increase dataset variety. ([source](https://nyandwi.com/machine_learning_complete/29_cnn_for_real_world_data_and_image_augmentation/))
- [Dataset Distribution Analysis](https://awesome-repositories.com/f/artificial-intelligence-ml/dataset-quality-analyzers/dataset-distribution-analysis.md) — Provides visual interfaces and statistical tools to understand dataset distributions and evaluate model outputs. ([source](https://nyandwi.com/machine_learning_complete/25_intro_to_tensorflow_for_deeplearning/))
- [Decision Trees](https://awesome-repositories.com/f/artificial-intelligence-ml/decision-trees.md) — Trains supervised models using hierarchical tree structures to categorize data into discrete classes. ([source](https://nyandwi.com/machine_learning_complete/18_decision_trees_for_classification/))
- [Model Logic Visualization](https://awesome-repositories.com/f/artificial-intelligence-ml/decision-trees/model-logic-visualization.md) — Generates plots of decision trees to illustrate the rules used to reach specific classifications. ([source](https://nyandwi.com/machine_learning_complete/18_decision_trees_for_classification/))
- [Deep Learning Libraries](https://awesome-repositories.com/f/artificial-intelligence-ml/deep-learning-libraries.md) — Implements neural network architectures, convolutional layers, and transformers using the TensorFlow library.
- [Dense Neural Networks](https://awesome-repositories.com/f/artificial-intelligence-ml/dense-neural-networks.md) — Develops dense neural networks with fully connected layers where every neuron connects to the subsequent layer. ([source](https://nyandwi.com/machine_learning_complete/24_intro_to_neural_networks/))
- [End-to-End Training Pipelines](https://awesome-repositories.com/f/artificial-intelligence-ml/end-to-end-training-pipelines.md) — Orchestrates end-to-end training pipelines encompassing data preparation, training, and validation. ([source](https://nyandwi.com/machine_learning_complete/25_intro_to_tensorflow_for_deeplearning/))
- [Bagging Ensembles](https://awesome-repositories.com/f/artificial-intelligence-ml/ensemble-learning/bagging-ensembles.md) — Implements bagging ensembles to fit base models across different data splits to reduce variance and overfitting. ([source](https://nyandwi.com/machine_learning_complete/21_ensemble_models/))
- [Ensemble Methods](https://awesome-repositories.com/f/artificial-intelligence-ml/ensemble-methods.md) — Implements techniques to combine predictions from multiple classifiers to produce a more accurate single output. ([source](https://nyandwi.com/machine_learning_complete/21_ensemble_models/))
- [Feature Correlation Analysis](https://awesome-repositories.com/f/artificial-intelligence-ml/feature-correlation-analysis.md) — Implements statistical methods to evaluate relationships between variables using correlation matrices. ([source](https://nyandwi.com/machine_learning_complete/06_exploratory_data_analysis/))
- [Feature Extraction Models](https://awesome-repositories.com/f/artificial-intelligence-ml/feature-extraction-models.md) — Implements feature extraction models that transform raw input into high-dimensional vector representations.
- [Feature Importance Attribution](https://awesome-repositories.com/f/artificial-intelligence-ml/feature-importance-attribution.md) — Implements methods for quantifying the relative contribution of individual input variables to model predictions. ([source](https://nyandwi.com/machine_learning_complete/17_decision_trees_for_regression/))
- [Kernel Density Estimation](https://awesome-repositories.com/f/artificial-intelligence-ml/kernel-density-estimation.md) — Implements histograms and kernel density estimates to visualize the probability density and spread of variables. ([source](https://nyandwi.com/machine_learning_complete/04_data_visualization_with_seaborn/))
- [Linear Classifiers](https://awesome-repositories.com/f/artificial-intelligence-ml/linear-classifiers.md) — Implements classification models such as Logistic Regression to predict discrete target labels. ([source](https://nyandwi.com/machine_learning_complete/14_linear_models_for_classification/))
- [Kernel Mappings](https://awesome-repositories.com/f/artificial-intelligence-ml/linear-regression/polynomial-feature-mapping/kernel-mappings.md) — Uses kernel mappings to project data into higher dimensions for non-linear category separation.
- [Machine Learning Operations](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning-operations.md) — Covers the ML lifecycle including data preparation, hyperparameter tuning, and model evaluation.
- [Regression Ensembles](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/algorithms/predictive-machine-learning-analytics/regression-ensembles.md) — Implements ensembles of randomized decision trees to perform regression and minimize overfitting. ([source](https://nyandwi.com/machine_learning_complete/19_random_forests_for_regression/))
- [Model Persistence](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-training-and-tuning/data-and-checkpointing/model-loading/model-persistence.md) — Provides methods for saving trained estimators to disk and reloading them for reuse without retraining. ([source](https://nyandwi.com/machine_learning_complete/26_neural_networks_for_regresion_with_tensorflow/))
- [Classification Fine-Tuning](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-training-and-tuning/fine-tuning-and-customization/model-fine-tuning/fine-tuned-model-deployment/classification-fine-tuning.md) — Implements gradient-based fine-tuning of pretrained BERT transformers for specialized text classification tasks. ([source](https://nyandwi.com/machine_learning_complete/35_using_pretrained_bert_for_text_classification/))
- [Model Architecture Selection](https://awesome-repositories.com/f/artificial-intelligence-ml/model-architecture-selection.md) — Provides criteria and methodologies for choosing between linear, tree-based, or neural network architectures. ([source](https://github.com/Nyandwi/machine_learning_complete/blob/main/5_intro_to_machine_learning/ml_fundamentals.md))
- [Hyperparameter Tuning](https://awesome-repositories.com/f/artificial-intelligence-ml/model-fine-tuning-resources/hyperparameter-tuning.md) — Provides techniques for systematically testing parameter combinations to find optimal model configurations. ([source](https://nyandwi.com/machine_learning_complete/18_decision_trees_for_classification/))
- [Model Performance Optimization](https://awesome-repositories.com/f/artificial-intelligence-ml/model-optimization/profiling-and-benchmarking/model-performance-optimization.md) — Offers methods to optimize model behavior and accuracy through parameter adjustment and processing pipelines. ([source](https://nyandwi.com/machine_learning_complete/12_intro_to_sklearn/))
- [Hyperparameter Optimization](https://awesome-repositories.com/f/artificial-intelligence-ml/model-optimization/training-efficiency/hyperparameter-optimization.md) — Implements automated methods for searching and selecting optimal configuration parameters to improve model performance. ([source](https://nyandwi.com/machine_learning_complete/15_support_vector_machines_for_regression/))
- [Model Performance Visualizations](https://awesome-repositories.com/f/artificial-intelligence-ml/model-performance-visualizations.md) — Implements graphical representations of model evaluation metrics, such as loss curves and confusion matrices, to diagnose performance. ([source](https://nyandwi.com/machine_learning_complete/25_intro_to_tensorflow_for_deeplearning/))
- [Image Labeling Engines](https://awesome-repositories.com/f/artificial-intelligence-ml/model-predictions/prediction-engines/image-labeling-engines.md) — Provides logic to generate class labels and probabilities for image data using trained models. ([source](https://nyandwi.com/machine_learning_complete/29_cnn_for_real_world_data_and_image_augmentation/))
- [Model Training Monitoring](https://awesome-repositories.com/f/artificial-intelligence-ml/model-training-monitoring.md) — Ships tools for tracking and visualizing scalar metrics and loss curves over training epochs. ([source](https://nyandwi.com/machine_learning_complete/35_using_pretrained_bert_for_text_classification/))
- [Model Persistence](https://awesome-repositories.com/f/artificial-intelligence-ml/model-training/model-persistence.md) — Implements mechanisms for saving and retrieving trained model architectures and weights from disk. ([source](https://nyandwi.com/machine_learning_complete/28_intro_to_computer_vision_and_cnn/))
- [Cross-Validation Utilities](https://awesome-repositories.com/f/artificial-intelligence-ml/model-validation-tools/cross-validation-utilities.md) — Provides utilities for splitting datasets and performing iterative model validation using cross-validation techniques. ([source](https://nyandwi.com/machine_learning_complete/13_linear_models_for_regression/))
- [Neural Network Regularization](https://awesome-repositories.com/f/artificial-intelligence-ml/neural-network-regularization.md) — Implements regularization techniques, such as dropout layers, to prevent overfitting and improve generalization. ([source](https://nyandwi.com/machine_learning_complete/28_intro_to_computer_vision_and_cnn/))
- [Model Finetuning](https://awesome-repositories.com/f/artificial-intelligence-ml/object-detection/model-finetuning.md) — Implements model finetuning by unfreezing deep layers of pretrained networks to improve accuracy. ([source](https://nyandwi.com/machine_learning_complete/30_cnn_architectures_and_transfer_learning/))
- [Overfitting Reduction Techniques](https://awesome-repositories.com/f/artificial-intelligence-ml/overfitting-reduction-techniques.md) — Implements techniques to improve model generalization by constraining tree depth or adjusting class weights. ([source](https://nyandwi.com/machine_learning_complete/18_decision_trees_for_classification/))
- [Reinforcement Learning](https://awesome-repositories.com/f/artificial-intelligence-ml/reinforcement-learning.md) — Provides guides on developing strategies for agents to maximize rewards through trial and error. ([source](https://github.com/Nyandwi/machine_learning_complete/blob/main/5_intro_to_machine_learning/ml_fundamentals.md))
- [Semi-supervised Learning Pipelines](https://awesome-repositories.com/f/artificial-intelligence-ml/semi-supervised-learning-pipelines.md) — Provides workflows for training models that combine small amounts of labeled data with large unlabeled datasets. ([source](https://github.com/Nyandwi/machine_learning_complete/blob/main/5_intro_to_machine_learning/ml_fundamentals.md))
- [Support Vector Machines](https://awesome-repositories.com/f/artificial-intelligence-ml/support-vector-machines.md) — Implements support vector machines to create decision boundaries for data classification. ([source](https://nyandwi.com/machine_learning_complete/16_support_vector_machines_for_classification/))
- [Training Callbacks](https://awesome-repositories.com/f/artificial-intelligence-ml/training-callbacks.md) — Implements lifecycle hooks and callbacks to automate model saving and early stopping during neural network training. ([source](https://nyandwi.com/machine_learning_complete/27_neural_networks_for_classification_with_tensorflow/))

### Data & Databases

- [Data Analysis](https://awesome-repositories.com/f/data-databases/data-analysis.md) — Implements a complete suite for examining metrics and identifying patterns through data analysis. ([source](https://nyandwi.com/machine_learning_complete/outline/))
- [Data Visualization](https://awesome-repositories.com/f/data-databases/data-analysis-visualization/visualization-frameworks-libraries/data-visualization.md) — Implements a variety of charts including line, scatter, bar, and pie plots to visualize data relationships. ([source](https://nyandwi.com/machine_learning_complete/03_data_visualizations_with_matplotlib/))
- [Categorical Encoders](https://awesome-repositories.com/f/data-databases/data-categorization/categorical-encoders.md) — Provides tools for converting categorical string data into numeric representations for machine learning compatibility. ([source](https://nyandwi.com/machine_learning_complete/07_intro_to_data_preparation/))
- [Data Cleaning Pipelines](https://awesome-repositories.com/f/data-databases/data-cleaning-pipelines.md) — Converts raw datasets into compatible formats using missing value imputation, categorical encoding, and numeric scaling. ([source](https://nyandwi.com/machine_learning_complete/11_ml_fundamentals/))
- [Array and Tensor Manipulation](https://awesome-repositories.com/f/data-databases/data-processing-pipelines/data-transformation/array-tensor-manipulation.md) — Implements operations for reshaping, filtering, and transforming multi-dimensional tensors. ([source](https://nyandwi.com/machine_learning_complete/25_intro_to_tensorflow_for_deeplearning/))
- [Array Filtering](https://awesome-repositories.com/f/data-databases/data-processing-pipelines/data-transformation/array-tensor-manipulation/array-filtering.md) — Provides tools for retrieving specific data points from arrays using indexing, slicing, and conditional filtering. ([source](https://nyandwi.com/machine_learning_complete/01_intro_to_Numpy_for_data_computation/))
- [Data Visualization Charts](https://awesome-repositories.com/f/data-databases/data-visualization-charts.md) — Generates a variety of charts including bar, histogram, scatter, and pie plots for data analysis. ([source](https://nyandwi.com/machine_learning_complete/05_data_visualization%20with_pandas/))
- [Dataset Loading](https://awesome-repositories.com/f/data-databases/dataset-loading.md) — Provides methods for retrieving datasets from public URLs and converting them into Pandas DataFrames. ([source](https://nyandwi.com/machine_learning_complete/13_linear_models_for_regression/))
- [Exploratory Data Analysis](https://awesome-repositories.com/f/data-databases/exploratory-data-analysis.md) — Guides users through cleaning and manipulating datasets to discover patterns and optimize features for modeling. ([source](https://cdn.jsdelivr.net/gh/nyandwi/machine_learning_complete@main/README.md))
- [Missing Data Imputation](https://awesome-repositories.com/f/data-databases/missing-data-imputation.md) — Implements methods for detecting and filling gaps in datasets using scalar replacement and interpolation. ([source](https://nyandwi.com/machine_learning_complete/02_data_manipulation_with_pandas/))
- [Missing Data Removal](https://awesome-repositories.com/f/data-databases/missing-data-removal.md) — Implements methods for dropping rows or columns containing null values based on specified thresholds. ([source](https://nyandwi.com/machine_learning_complete/10_handling_missing_values/))
- [Missing Value Imputation](https://awesome-repositories.com/f/data-databases/missing-value-imputation.md) — Provides workflows for filling missing data using mean, median, or most frequent values. ([source](https://nyandwi.com/machine_learning_complete/10_handling_missing_values/))
- [Tabular Data Processors](https://awesome-repositories.com/f/data-databases/tabular-data-processors.md) — Provides utilities for filtering, aggregating, and joining tabular data from external files. ([source](https://cdn.jsdelivr.net/gh/nyandwi/machine_learning_complete@main/README.md))
- [Text Vectorizers](https://awesome-repositories.com/f/data-databases/vector-storage/text-vectorizers.md) — Converts raw text strings into numerical vector representations through normalization and tokenization. ([source](https://nyandwi.com/machine_learning_complete/31_intro_to_nlp_and_text_preprocessing/))
- [Distribution Analysis Plots](https://awesome-repositories.com/f/data-databases/box-plots/distribution-analysis-plots.md) — Provides implementations of histograms, count plots, and bar charts for analyzing data modality and spread. ([source](https://nyandwi.com/machine_learning_complete/06_exploratory_data_analysis/))
- [Categorical Data Visualizations](https://awesome-repositories.com/f/data-databases/categorical-data-visualizations.md) — Implements bar, box, and violin plots to compare aggregated values across different data categories. ([source](https://nyandwi.com/machine_learning_complete/04_data_visualization_with_seaborn/))
- [Conditional Data Filters](https://awesome-repositories.com/f/data-databases/conditional-data-filters.md) — Implements logic to extract specific data subsets based on conditional thresholds and predicate logic. ([source](https://nyandwi.com/machine_learning_complete/06_exploratory_data_analysis/))
- [Image Classifiers](https://awesome-repositories.com/f/data-databases/data-categorization/classification-labelers/image-classifiers.md) — Uses pretrained image classifiers to predict categories without requiring additional training. ([source](https://nyandwi.com/machine_learning_complete/30_cnn_architectures_and_transfer_learning/))
- [Data Import and Export](https://awesome-repositories.com/f/data-databases/data-import-and-export.md) — Provides utilities for reading and writing datasets to standard file formats like CSV and Excel. ([source](https://nyandwi.com/machine_learning_complete/02_data_manipulation_with_pandas/))
- [Data Joins](https://awesome-repositories.com/f/data-databases/data-joins.md) — Provides general purpose joining functionality to merge data structures based on shared keys or indexes. ([source](https://nyandwi.com/machine_learning_complete/02_data_manipulation_with_pandas/))
- [Training Data Pipelines](https://awesome-repositories.com/f/data-databases/data-processing-pipelines/data-processing/ml-data-pipelines/training-data-pipelines.md) — Builds training data pipelines to ensure consistent preprocessing across different datasets. ([source](https://nyandwi.com/machine_learning_complete/13_linear_models_for_regression/))
- [Tensor Joining and Splitting](https://awesome-repositories.com/f/data-databases/data-processing-pipelines/data-transformation/array-tensor-manipulation/tensor-transformations/tensor-joining-and-splitting.md) — Provides operations for concatenating, stacking, or dividing tensors and arrays along specified dimensions. ([source](https://nyandwi.com/machine_learning_complete/01_intro_to_Numpy_for_data_computation/))
- [Categorical Distribution Analysis](https://awesome-repositories.com/f/data-databases/data-visualization-charts/categorical-comparison-charts/categorical-distribution-analysis.md) — Analyzes frequency distributions of categorical features using value counts and pie charts. ([source](https://nyandwi.com/machine_learning_complete/14_linear_models_for_classification/))
- [Multivariate Relationship Visualizers](https://awesome-repositories.com/f/data-databases/database-relationship-mappings/multivariate-relationship-visualizers.md) — Creates scatter plots and grouped count plots to analyze joint distributions across multiple variables. ([source](https://nyandwi.com/machine_learning_complete/06_exploratory_data_analysis/))
- [Image Preprocessing Pipelines](https://awesome-repositories.com/f/data-databases/dataset-preparation-tools/image-text-pair-pipelines/image-preprocessing-pipelines.md) — Provides a pipeline to normalize image pixels and create batches for efficient deep learning training. ([source](https://nyandwi.com/machine_learning_complete/28_intro_to_computer_vision_and_cnn/))
- [Dataset Transformations](https://awesome-repositories.com/f/data-databases/dataset-transformations.md) — Provides functions for mapping and manipulating data using custom functions and lambdas across columns. ([source](https://nyandwi.com/machine_learning_complete/02_data_manipulation_with_pandas/))
- [Tabular Data Aggregation](https://awesome-repositories.com/f/data-databases/functional-data-aggregation/tabular-data-aggregation.md) — Provides mathematical reduction of table columns into summary values using group-by operations. ([source](https://nyandwi.com/machine_learning_complete/02_data_manipulation_with_pandas/))
- [Label-Based Data Selection](https://awesome-repositories.com/f/data-databases/label-based-data-selection.md) — Enables retrieval of specific rows and columns using integer positions, labels, or conditional logic. ([source](https://nyandwi.com/machine_learning_complete/02_data_manipulation_with_pandas/))
- [Semi-Supervised Training](https://awesome-repositories.com/f/data-databases/label-based-data-selection/metadata-labelers/model-assisted-labelers/semi-supervised-training.md) — Guides users on improving model performance by leveraging small amounts of labeled data alongside unlabeled data. ([source](https://nyandwi.com/machine_learning_complete/11_ml_fundamentals/))
- [Imputation Methods](https://awesome-repositories.com/f/data-databases/missing-data-imputation/imputation-methods.md) — Implements predictive imputation by modeling missing features as functions of other variables using regression. ([source](https://nyandwi.com/machine_learning_complete/10_handling_missing_values/))
- [Missing Value Detection](https://awesome-repositories.com/f/data-databases/missing-value-detection.md) — Identifies null entries and uses heatmaps to visualize the location of missing data. ([source](https://nyandwi.com/machine_learning_complete/06_exploratory_data_analysis/))

### Education & Learning Resources

- [AI & Machine Learning Education](https://awesome-repositories.com/f/education-learning-resources/technical-domain-education/ai-machine-learning-education.md) — Provides a comprehensive educational suite covering AI and ML algorithms with practical implementation guides.
- [Machine Learning Fundamentals](https://awesome-repositories.com/f/education-learning-resources/technical-domain-education/ai-machine-learning-education/machine-learning-fundamentals.md) — Provides a structured educational curriculum covering foundational machine learning workflows, preprocessing, and model training. ([source](https://nyandwi.com/machine_learning_complete/extras/ack/))
- [Curated Learning Resources](https://awesome-repositories.com/f/education-learning-resources/curated-learning-resources.md) — Aggregates a curated collection of external documentation, books, and courses specifically for learning MLOps. ([source](https://github.com/Nyandwi/machine_learning_complete/blob/main/010_mlops/1_mlops_guide.md))

### Part of an Awesome List

- [Deep Learning](https://awesome-repositories.com/f/awesome-lists/ai/deep-learning.md) — Implements complex neural network architectures specialized for computer vision and natural language processing. ([source](https://nyandwi.com/machine_learning_complete/))
- [Data Preprocessing for Modeling](https://awesome-repositories.com/f/awesome-lists/ai/machine-learning-and-data-science/data-preprocessing-for-modeling.md) — Converts categorical text labels into numerical formats required for training machine learning models. ([source](https://nyandwi.com/machine_learning_complete/20_random_forests_for_classification/))
- [Neural Networks and Deep Learning](https://awesome-repositories.com/f/awesome-lists/ai/neural-networks-and-deep-learning.md) — Provides comprehensive guides on building densely connected and recurrent neural networks for predictive tasks. ([source](https://cdn.jsdelivr.net/gh/nyandwi/machine_learning_complete@main/README.md))
- [Text-to-Sequence Conversion](https://awesome-repositories.com/f/awesome-lists/ai/sequence-to-sequence-models/text-sequence-generators/text-to-sequence-conversion.md) — Transforms sentences into ordered lists of tokens to preserve the original sequence of words. ([source](https://nyandwi.com/machine_learning_complete/31_intro_to_nlp_and_text_preprocessing/))
- [Dimensionality Reduction](https://awesome-repositories.com/f/awesome-lists/data/dimensionality-reduction.md) — Implements techniques to compress high-dimensional data into principal components to reduce redundancy and increase speed. ([source](https://nyandwi.com/machine_learning_complete/23_a_practical_intro_to_principal_components_analysis/))
- [Feature Engineering](https://awesome-repositories.com/f/awesome-lists/data/feature-engineering.md) — Implements techniques for creating and enriching predictive variables to improve model performance. ([source](https://nyandwi.com/machine_learning_complete/07_intro_to_data_preparation/))
- [Numerical Computation](https://awesome-repositories.com/f/awesome-lists/devtools/numerical-computation.md) — Performs high-performance numerical computation using multi-dimensional arrays and vectorized operations.
- [Visual Pattern Recognition](https://awesome-repositories.com/f/awesome-lists/ai/computer-vision-and-image-processing/visual-pattern-recognition.md) — Implements convolutional neural networks and image augmentation to recognize and classify visual patterns. ([source](https://nyandwi.com/machine_learning_complete/outline/))
- [Pre-trained Models](https://awesome-repositories.com/f/awesome-lists/ai/pre-trained-models.md) — Demonstrates how to import and utilize ready-made pre-trained models for image and text classification. ([source](https://nyandwi.com/machine_learning_complete/25_intro_to_tensorflow_for_deeplearning/))
- [Self-Supervised Learning](https://awesome-repositories.com/f/awesome-lists/ai/self-supervised-learning.md) — Implements methods for training models by automatically generating labels from the internal structure of unlabeled data. ([source](https://github.com/Nyandwi/machine_learning_complete/blob/main/5_intro_to_machine_learning/ml_fundamentals.md))

### Programming Languages & Runtimes

- [Built-in Data Collections](https://awesome-repositories.com/f/programming-languages-runtimes/built-in-data-collections.md) — Provides foundational instruction on using Python lists, dictionaries, tuples, and sets to store data. ([source](https://nyandwi.com/machine_learning_complete/00_intro_to_python/))
- [Reshaping](https://awesome-repositories.com/f/programming-languages-runtimes/language-features-paradigms/language-features/array-operations/reshaping.md) — Implements methods for modifying array dimensions and layouts, including axis flipping and reshaping. ([source](https://nyandwi.com/machine_learning_complete/01_intro_to_Numpy_for_data_computation/))
- [Python Educational Fundamentals](https://awesome-repositories.com/f/programming-languages-runtimes/programming-language-varieties/domain-specific-languages/python-for-machine-learning/python-educational-fundamentals.md) — Provides instructional materials covering Python syntax and mathematical principles specifically for AI implementation. ([source](https://nyandwi.com/machine_learning_complete/))

### Scientific & Mathematical Computing

- [Multi-Dimensional Arrays](https://awesome-repositories.com/f/scientific-mathematical-computing/multi-dimensional-arrays.md) — Implements the creation of multi-dimensional arrays and dataframes from basic Python collections. ([source](https://nyandwi.com/machine_learning_complete/02_data_manipulation_with_pandas/))
- [Array Initialization](https://awesome-repositories.com/f/scientific-mathematical-computing/multi-dimensional-arrays/array-initialization.md) — Demonstrates how to generate vectors and matrices using NumPy patterns like zeros and ones. ([source](https://nyandwi.com/machine_learning_complete/01_intro_to_Numpy_for_data_computation/))
- [Numerical Computing](https://awesome-repositories.com/f/scientific-mathematical-computing/numerical-mathematical-foundations/mathematical-libraries-and-utilities/mathematics/numerical-computing.md) — Provides a foundation for matrix operations, basic statistics, and numerical mathematical functions. ([source](https://cdn.jsdelivr.net/gh/nyandwi/machine_learning_complete@main/README.md))
- [Dataset Manipulation Tools](https://awesome-repositories.com/f/scientific-mathematical-computing/research-analysis-workflows/research-and-data-analysis-tools/research-and-analysis-tools/dataset-manipulation-tools.md) — Offers comprehensive guides for cleaning, transforming, and reshaping structured datasets using Pandas. ([source](https://nyandwi.com/machine_learning_complete/))
- [Array Manipulations](https://awesome-repositories.com/f/scientific-mathematical-computing/array-manipulations.md) — Add new elements, repeat existing values, sort data, or extract unique elements from a dataset. ([source](https://nyandwi.com/machine_learning_complete/01_intro_to_Numpy_for_data_computation/))
- [Element-wise Array Operations](https://awesome-repositories.com/f/scientific-mathematical-computing/element-wise-array-operations.md) — Executes high-performance arithmetic, trigonometric, and logarithmic operations across multi-dimensional arrays. ([source](https://nyandwi.com/machine_learning_complete/01_intro_to_Numpy_for_data_computation/))
- [Scientific Computing](https://awesome-repositories.com/f/scientific-mathematical-computing/high-performance-execution-environments/scientific-computing-platforms/scientific-computing.md) — Implements high-performance matrix operations and numerical array manipulations using NumPy.
- [Matrix Operations](https://awesome-repositories.com/f/scientific-mathematical-computing/high-performance-execution-environments/scientific-computing-platforms/scientific-computing/matrix-operations.md) — Provides implementations for high-performance multidimensional array manipulation and linear algebra. ([source](https://nyandwi.com/machine_learning_complete/extras/tools-overview/))
- [Statistical Analysis Libraries](https://awesome-repositories.com/f/scientific-mathematical-computing/numerical-mathematical-foundations/statistics-probability/statistical-analysis-libraries/statistical-metric-calculators/statistical-analysis-libraries.md) — Utilizes libraries to compute fundamental summary metrics including mean, median, and variance. ([source](https://nyandwi.com/machine_learning_complete/01_intro_to_Numpy_for_data_computation/))

### DevOps & Infrastructure

- [Tooling Landscapes](https://awesome-repositories.com/f/devops-infrastructure/devops/mlops/tooling-landscapes.md) — Directs users to catalogs and research papers that categorize tools used for maintaining machine learning systems. ([source](https://github.com/Nyandwi/machine_learning_complete/blob/main/010_mlops/1_mlops_guide.md))

### Graphics & Multimedia

- [Matplotlib](https://awesome-repositories.com/f/graphics-multimedia/chart-generators/matplotlib.md) — Provides a reference for generating statistical plots, heatmaps, and distribution charts.
- [Numerical Statistics Analyzers](https://awesome-repositories.com/f/graphics-multimedia/media-processing-analysis/media-analysis/automated-media-analyzers/library-statistics-analyzers/numerical-statistics-analyzers.md) — Provides tools for calculating descriptive statistics like mean and standard deviation across numerical datasets. ([source](https://nyandwi.com/machine_learning_complete/06_exploratory_data_analysis/))

### Software Engineering & Architecture

- [Sequence Pipelines](https://awesome-repositories.com/f/software-engineering-architecture/modular-program-composition/function-composition/sequence-pipelines.md) — Uses sequence pipelines to encapsulate multiple data transformation steps for consistent training and testing.

### System Administration & Monitoring

- [Model Failure Analyzers](https://awesome-repositories.com/f/system-administration-monitoring/diagnostic-tools/diagnostics/failure-analysis-tools/build-failure-troubleshooting/failure-pattern-analyzers/model-failure-analyzers.md) — Implements methods to identify failure patterns using learning curves to determine if more data or tuning is required. ([source](https://nyandwi.com/machine_learning_complete/11_ml_fundamentals/))
- [Model Training Metrics](https://awesome-repositories.com/f/system-administration-monitoring/monitoring-and-observability/observability-platforms/metric-performance-monitors/model-training-metrics.md) — Plots accuracy and loss curves for training and validation sets to detect and mitigate model overfitting. ([source](https://nyandwi.com/machine_learning_complete/28_intro_to_computer_vision_and_cnn/))
