# ljpzzz/machinelearning

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/ljpzzz-machinelearning).**

8,706 stars · 3,705 forks · Jupyter Notebook · MIT

## Links

- GitHub: https://github.com/ljpzzz/machinelearning
- awesome-repositories: https://awesome-repositories.com/repository/ljpzzz-machinelearning.md

## Topics

`algorithms` `machinelearning` `reinforcementlearning` `scikit-learn`

## Description

This project is a machine learning implementation library featuring a collection of code examples that implement supervised, unsupervised, and reinforcement learning algorithms from scratch. It provides a comprehensive set of toolkits for core machine learning components, including a natural language processing toolkit, a reinforcement learning framework, and suites for data dimensionality reduction and pattern mining.

The library includes specialized implementations for reinforcement learning, such as Q-Learning, Deep Q-Networks, and Actor-Critic agents. The natural language processing capabilities cover text vectorization, semantic analysis, and Chinese text analysis, while the dimensionality reduction suite implements algorithms like Principal Component Analysis and Local Linear Embedding.

The project also covers a wide range of supervised learning models, including classification, regression, and ensemble learning methods. Additional capabilities include unsupervised clustering, data mining for frequent pattern extraction, statistical data sampling using Markov Chain Monte Carlo, and the development of collaborative filtering recommendation systems.

The implementation is provided as a collection of Jupyter Notebooks.

## Tags

### Artificial Intelligence & ML

- [Machine Learning Implementations](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning-implementations.md) — Provides a collection of from-scratch implementations of supervised, unsupervised, and reinforcement learning algorithms.
- [Reinforcement Learning](https://awesome-repositories.com/f/artificial-intelligence-ml/reinforcement-learning.md) — Provides a comprehensive framework for implementing reinforcement learning agents using Q-learning and actor-critic methods. ([source](https://github.com/ljpzzz/machinelearning#readme))
- [Clustering](https://awesome-repositories.com/f/artificial-intelligence-ml/clustering.md) — Provides a framework of unsupervised methods for grouping unlabeled data using distance and density metrics. ([source](https://github.com/ljpzzz/machinelearning#readme))
- [Deep Reinforcement Learning Implementations](https://awesome-repositories.com/f/artificial-intelligence-ml/deep-q-learning-implementations/deep-reinforcement-learning-implementations.md) — Implements a variety of reinforcement learning agents including Q-Learning, DQN, and Actor-Critic methods.
- [Dimensionality Reduction](https://awesome-repositories.com/f/artificial-intelligence-ml/dimensionality-reduction.md) — Implements core dimensionality reduction algorithms including PCA, LDA, and SVD to simplify high-dimensional datasets. ([source](https://github.com/ljpzzz/machinelearning#readme))
- [Experience Replay Buffers](https://awesome-repositories.com/f/artificial-intelligence-ml/experience-replay-buffers.md) — Implements memory buffers to store agent transitions and break temporal correlations during deep reinforcement learning training.
- [Supervised Learning Models](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/algorithms/core-algorithmic-paradigms/supervised-learning-models.md) — Provides implementations of supervised learning models for both classification and regression tasks.
- [Unsupervised Learning Algorithms](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/algorithms/core-algorithmic-paradigms/unsupervised-learning-algorithms.md) — Implements unsupervised algorithms for grouping unlabeled data and discovering hidden structures.
- [Natural Language Processing](https://awesome-repositories.com/f/artificial-intelligence-ml/natural-language-processing.md) — Provides a comprehensive toolkit for analyzing and vectorizing human language data.
- [Temporal Difference Learning](https://awesome-repositories.com/f/artificial-intelligence-ml/temporal-difference-learning.md) — Implements algorithms that update value estimates incrementally by comparing current predictions with subsequent observations.
- [Asynchronous](https://awesome-repositories.com/f/artificial-intelligence-ml/actor-critic-architectures/advantage-actor-critic-implementations/asynchronous.md) — Provides an implementation of the A3C agent that uses multiple parallel workers for improved stability. ([source](https://github.com/ljpzzz/machinelearning/blob/master/reinforcement-learning/a3c.py))
- [Boosting Algorithms](https://awesome-repositories.com/f/artificial-intelligence-ml/boosting-algorithms.md) — Implements sequential ensemble methods where each new model corrects the errors of its predecessors to improve accuracy.
- [Centroid-Based Clustering](https://awesome-repositories.com/f/artificial-intelligence-ml/centroid-based-clustering.md) — Provides unsupervised learning implementations that group data by iteratively calculating central points.
- [Chinese Natural Language Processing](https://awesome-repositories.com/f/artificial-intelligence-ml/chinese-natural-language-processing.md) — Implements natural language processing techniques specifically for analyzing Chinese text. ([source](https://github.com/ljpzzz/machinelearning/blob/master/natural-language-processing/chinese_digging.ipynb))
- [Classification Models](https://awesome-repositories.com/f/artificial-intelligence-ml/classification-models.md) — Provides a variety of classification models, including decision trees and support vector machines. ([source](https://github.com/ljpzzz/machinelearning#readme))
- [BIRCH Clustering](https://awesome-repositories.com/f/artificial-intelligence-ml/clustering-algorithms/birch-clustering.md) — Implements Balanced Iterative Reducing and Clustering for efficient handling of large datasets via hierarchical grouping. ([source](https://github.com/ljpzzz/machinelearning/blob/master/classic-machine-learning/birch_cluster.ipynb))
- [Spectral Clustering](https://awesome-repositories.com/f/artificial-intelligence-ml/clustering-algorithms/spectral-clustering.md) — Implements clustering methods using the eigenvalues of a similarity matrix to identify optimal community structures. ([source](https://github.com/ljpzzz/machinelearning/blob/master/classic-machine-learning/spectral_cluster.ipynb))
- [Collaborative Filtering Models](https://awesome-repositories.com/f/artificial-intelligence-ml/collaborative-filtering-models.md) — Provides model architectures for predicting user-item interactions via collaborative filtering. ([source](https://github.com/ljpzzz/machinelearning#readme))
- [Decision Trees](https://awesome-repositories.com/f/artificial-intelligence-ml/decision-trees.md) — Implements decision tree classifiers that split features based on information gain. ([source](https://github.com/ljpzzz/machinelearning/blob/master/classic-machine-learning/decision_tree_classifier_1.ipynb))
- [Deep Learning Architectures](https://awesome-repositories.com/f/artificial-intelligence-ml/deep-learning-architectures.md) — Implements structural compositions for deep, convolutional, and recurrent neural networks. ([source](https://github.com/ljpzzz/machinelearning#readme))
- [Double DQN Implementations](https://awesome-repositories.com/f/artificial-intelligence-ml/deep-q-learning-frameworks/double-dqn-implementations.md) — Implements Double DQNs to reduce overestimation of action values by decoupling selection from estimation. ([source](https://github.com/ljpzzz/machinelearning/blob/master/reinforcement-learning/ddqn.py))
- [Deep Q-Learning Implementations](https://awesome-repositories.com/f/artificial-intelligence-ml/deep-q-learning-implementations.md) — Implements deep Q-learning algorithms using neural networks and experience replay. ([source](https://github.com/ljpzzz/machinelearning/blob/master/reinforcement-learning/dqn.py))
- [Density-Based Clustering](https://awesome-repositories.com/f/artificial-intelligence-ml/density-based-clustering.md) — Implements clustering methods that group data points based on local density reachability to identify arbitrary shapes. ([source](https://github.com/ljpzzz/machinelearning/blob/master/classic-machine-learning/dbscan_cluster.ipynb))
- [Locally Linear Embeddings](https://awesome-repositories.com/f/artificial-intelligence-ml/dimensionality-reduction/locally-linear-embeddings.md) — Provides an implementation of Locally Linear Embedding to preserve local neighborhood structures during dimensionality reduction. ([source](https://github.com/ljpzzz/machinelearning/blob/master/classic-machine-learning/lle.ipynb))
- [Ensemble Learning](https://awesome-repositories.com/f/artificial-intelligence-ml/ensemble-learning.md) — Provides a framework for combining multiple base learners using boosting and forest techniques. ([source](https://github.com/ljpzzz/machinelearning#readme))
- [XGBoost Integrations](https://awesome-repositories.com/f/artificial-intelligence-ml/ensemble-learning-libraries/gradient-boosting-libraries/xgboost-integrations.md) — Implements optimized gradient boosting decision trees for efficient regression and classification tasks. ([source](https://github.com/ljpzzz/machinelearning/blob/master/readme.md))
- [AdaBoost Classifiers](https://awesome-repositories.com/f/artificial-intelligence-ml/ensemble-learning/adaboost-classifiers.md) — Implements AdaBoost algorithms for aggregating weak learners to improve predictive accuracy. ([source](https://github.com/ljpzzz/machinelearning/blob/master/ensemble-learning/adaboost-classifier.ipynb))
- [Priority-Based Replay Buffers](https://awesome-repositories.com/f/artificial-intelligence-ml/experience-replay-buffers/priority-based-replay-buffers.md) — Implements priority-based replay buffers using TD-error to improve training efficiency. ([source](https://github.com/ljpzzz/machinelearning/blob/master/reinforcement-learning/ddqn_prioritised_replay.py))
- [Frequent Itemset Mining](https://awesome-repositories.com/f/artificial-intelligence-ml/frequent-itemset-mining.md) — Implements algorithms for discovering common associations and recurring patterns within transactional datasets. ([source](https://github.com/ljpzzz/machinelearning/tree/master/classic-machine-learning))
- [Gradient Boosting](https://awesome-repositories.com/f/artificial-intelligence-ml/gradient-boosting.md) — Implements iterative gradient boosting ensembles that build decision trees sequentially to minimize loss. ([source](https://github.com/ljpzzz/machinelearning/blob/master/ensemble-learning/gbdt_classifier.ipynb))
- [K-Means Clustering](https://awesome-repositories.com/f/artificial-intelligence-ml/k-means-clustering.md) — Provides centroid-based partitioning of data into a specified number of clusters. ([source](https://github.com/ljpzzz/machinelearning/blob/master/classic-machine-learning/kmeans_cluster.ipynb))
- [K-Nearest Neighbor Classifiers](https://awesome-repositories.com/f/artificial-intelligence-ml/k-nearest-neighbor-classifiers.md) — Implements K-Nearest Neighbor classifiers based on majority voting of closest training samples. ([source](https://github.com/ljpzzz/machinelearning/blob/master/classic-machine-learning/knn_classifier.ipynb))
- [Linear Regression](https://awesome-repositories.com/f/artificial-intelligence-ml/linear-regression.md) — Implements various linear regression models, including ridge and lasso, for continuous value prediction. ([source](https://github.com/ljpzzz/machinelearning#readme))
- [Interaction Matrix Factorizers](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-deployment-and-serving/model-hubs-and-pre-made-models/model-weights/latent-factor-analyzers/interaction-matrix-factorizers.md) — Provides algorithms to decompose user-item interaction data into latent factor representations for recommendation systems.
- [Markov Chain Monte Carlo Sampling](https://awesome-repositories.com/f/artificial-intelligence-ml/markov-chain-monte-carlo-sampling.md) — Implements MCMC sampling algorithms to estimate complex probability distributions. ([source](https://github.com/ljpzzz/machinelearning/blob/master/mathematics/mcmc_2.ipynb))
- [Gibbs Samplers](https://awesome-repositories.com/f/artificial-intelligence-ml/markov-chain-monte-carlo-sampling/gibbs-samplers.md) — Implements Gibbs sampling and MCMC for synthetic data generation and probability distribution estimation.
- [Hidden Markov Models](https://awesome-repositories.com/f/artificial-intelligence-ml/markov-state-transition-models/hidden-markov-models.md) — Implements Hidden Markov Models for sequence modeling and state estimation. ([source](https://github.com/ljpzzz/machinelearning/blob/master/natural-language-processing/hmm.ipynb))
- [Naive Bayes Classifiers](https://awesome-repositories.com/f/artificial-intelligence-ml/naive-bayes-classifiers.md) — Implements probabilistic classification models based on Bayes theorem from scratch. ([source](https://github.com/ljpzzz/machinelearning/blob/master/classic-machine-learning/native_bayes.ipynb))
- [Word Embeddings](https://awesome-repositories.com/f/artificial-intelligence-ml/natural-language-processing/word-embeddings.md) — Implements Word2Vec to create dense vector representations that capture semantic word relationships. ([source](https://github.com/ljpzzz/machinelearning/tree/master/natural-language-processing))
- [Policy Gradient Implementations](https://awesome-repositories.com/f/artificial-intelligence-ml/policy-gradient-implementations.md) — Implements policy gradient methods to optimize neural networks for action selection. ([source](https://github.com/ljpzzz/machinelearning/blob/master/reinforcement-learning/policy_gradient.py))
- [Deep Deterministic](https://awesome-repositories.com/f/artificial-intelligence-ml/policy-gradient-implementations/deep-deterministic.md) — Implements the DDPG algorithm for solving reinforcement learning problems in continuous action spaces. ([source](https://github.com/ljpzzz/machinelearning/blob/master/reinforcement-learning/ddpg.py))
- [Random Forest Ensembles](https://awesome-repositories.com/f/artificial-intelligence-ml/random-forest-ensembles.md) — Implements random forest ensembles that aggregate predictions from multiple decision trees to reduce variance. ([source](https://github.com/ljpzzz/machinelearning/blob/master/ensemble-learning/random_forest_classifier.ipynb))
- [Dueling Network Architectures](https://awesome-repositories.com/f/artificial-intelligence-ml/reinforcement-learning-value-estimators/dueling-network-architectures.md) — Implements dueling network architectures that separate state value and action advantage estimation. ([source](https://github.com/ljpzzz/machinelearning/blob/master/reinforcement-learning/duel_dqn.py))
- [Sarsa Update Implementations](https://awesome-repositories.com/f/artificial-intelligence-ml/reinforcement-learning-value-estimators/sarsa-update-implementations.md) — Implements the SARSA on-policy temporal difference learning algorithm. ([source](https://github.com/ljpzzz/machinelearning/blob/master/reinforcement-learning/sarsa_windy_world.py))
- [Support Vector Machines](https://awesome-repositories.com/f/artificial-intelligence-ml/support-vector-machines.md) — Provides implementations of support vector machines to establish decision boundaries between data classes. ([source](https://github.com/ljpzzz/machinelearning/blob/master/classic-machine-learning/svm_classifier.ipynb))
- [Tabular Q-Learning](https://awesome-repositories.com/f/artificial-intelligence-ml/tabular-q-learning.md) — Implements tabular Q-learning for finding optimal paths via temporal difference learning. ([source](https://github.com/ljpzzz/machinelearning/blob/master/reinforcement-learning/q_learning_windy_world.py))
- [Topic Modeling Libraries](https://awesome-repositories.com/f/artificial-intelligence-ml/topic-modeling-libraries.md) — Provides a library for discovering latent thematic structures using LDA and Non-negative Matrix Factorization. ([source](https://github.com/ljpzzz/machinelearning/blob/master/natural-language-processing/nmf.ipynb))
- [Latent Dirichlet Allocations](https://awesome-repositories.com/f/artificial-intelligence-ml/topic-modeling-libraries/latent-dirichlet-allocations.md) — Implements Latent Dirichlet Allocation for discovering hidden themes in text corpora. ([source](https://github.com/ljpzzz/machinelearning/blob/master/classic-machine-learning/lda.ipynb))

### Data & Databases

- [Text Vectorizers](https://awesome-repositories.com/f/data-databases/vector-storage/text-vectorizers.md) — Provides tools for converting raw text into numerical vector representations using TF-IDF and hashing.
- [MCMC Sampling](https://awesome-repositories.com/f/data-databases/data-management/sample-data-loaders/statistical-sampling/mcmc-sampling.md) — Implements Markov Chain Monte Carlo algorithms to generate samples from complex posterior probability distributions.
- [Data Mining](https://awesome-repositories.com/f/data-databases/data-mining.md) — Implements algorithms like Apriori and FP-Tree for extracting patterns and actionable insights from large datasets. ([source](https://github.com/ljpzzz/machinelearning#readme))

### Scientific & Mathematical Computing

- [Principal Component Analysis](https://awesome-repositories.com/f/scientific-mathematical-computing/linear-algebra-routines/principal-component-analysis.md) — Implements Principal Component Analysis for variance-based dimensionality reduction. ([source](https://github.com/ljpzzz/machinelearning/blob/master/classic-machine-learning/pca.ipynb))
- [Ridge Regression](https://awesome-repositories.com/f/scientific-mathematical-computing/ridge-regression.md) — Implements L2-regularized ridge regression to prevent overfitting in linear models. ([source](https://github.com/ljpzzz/machinelearning/blob/master/classic-machine-learning/ridge_regression_1.ipynb))