30 open-source projects similar to jwarmenhoven/islr-python, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best ISLR Python alternative.
This project is a Python machine learning library and data science toolkit designed for building predictive models and analyzing complex datasets. It provides a collection of implementations for common supervised and unsupervised algorithms using the Scikit-Learn framework. The toolkit includes a predictive modeling suite for generating predictions from historical data and a statistical analysis framework for applying Bayesian modeling and causality tests. It also features a data visualization suite based on Matplotlib for rendering static charts and graphs to interpret classifier boundaries
This project is a machine learning textbook companion and code reference that translates theoretical statistical learning exercises into executable implementations. It serves as a programmatic study guide for implementing foundational machine learning algorithms and solving structured data problems. The repository provides predictive modeling notebooks that combine narrative explanations with code to derive and validate statistical algorithms. These implementations are available as a reference for both Python and R, utilizing the Scikit-Learn API for model fitting and prediction. The codebas
This project is a machine learning educational curriculum and learning platform delivered through interactive Jupyter Notebooks. It serves as a comprehensive guide for mastering the Python data science toolkit, providing structured tutorials for numerical computing, tabular data manipulation, and statistical visualization. The curriculum includes specific implementation guides for Scikit-Learn and a practical course on TensorFlow for constructing, training, and deploying neural networks and computer vision models. It covers the end-to-end process of building predictive models, from initial pr
100-Days-Of-ML-Code is a machine learning curriculum and instructional resource designed as a structured 100-day learning path. It provides a sequence of daily milestones that cover the mathematical foundations and practical implementations of machine learning algorithms. The project is organized into specialized courses for supervised and unsupervised learning. Supervised learning materials cover the implementation of predictive models such as linear regression, decision trees, and support vector machines. Unsupervised learning materials focus on clustering models, including K-Means and hier
This repository provides a comprehensive educational framework for mastering machine learning and deep learning through a structured curriculum. It integrates theoretical mathematical foundations—including calculus, probability, and linear algebra—with hands-on laboratory implementations that require learners to build algorithms and neural network architectures from scratch. The project distinguishes itself by emphasizing first-principles development, ensuring that students understand the underlying mechanics of backpropagation, layer-wise computation, and model optimization. It covers a broa
This project is a machine learning curriculum and data science educational resource. It provides a structured set of instructional materials and hands-on projects designed for learning machine learning concepts and the implementation of predictive models. The resource functions as a training guide for supervised learning, focusing on the development of models for image classification and digit recognition. It uses a project-based training approach that pairs theoretical lessons with dataset-driven model training and evaluation. The curriculum covers the mathematical foundations of machine le
This repository is a collection of machine learning course materials, providing study notes and Python implementation examples for a professional specialization. It serves as a guide for supervised and unsupervised learning, focusing on the application of fundamental algorithms. The content covers a broad range of machine learning education, including the mathematical foundations and practical prototyping of models. It specifically provides resources for implementing regression, classification, clustering, and dimensionality reduction techniques. The project is organized as a curriculum-base
This project is a comprehensive machine learning educational resource and tutorial series delivered as a collection of interactive Jupyter Notebooks. It provides practical Python implementations for the end-to-end machine learning lifecycle, covering supervised and unsupervised learning, deep learning, and reinforcement learning. The resource distinguishes itself by providing detailed implementation guides for complex architectures, including transformers, generative adversarial networks, and convolutional neural networks. It also features specialized courseware for developing reinforcement l
This project is an interactive data science environment that combines code execution, rich media visualization, and narrative documentation into a persistent, browser-based platform. It serves as a comprehensive educational resource for scientific computing, providing a framework for iterative data analysis and machine learning prototyping. The environment is distinguished by its focus on high-performance numerical computing, utilizing vectorized array operations and memory-mapped data structures to handle large-scale computations efficiently. It features a unified estimator interface that st
This project is a Python data analysis library and exploratory data analysis framework designed for processing raw datasets. It provides a suite of tools for examining data, identifying anomalies, and applying statistical methods to uncover patterns. The repository functions as a machine learning modeling toolkit and a statistical data modeling suite. It includes predictive algorithms and mathematical models used to analyze relationships between data variables and derive insights from complex datasets. The project covers a broad range of capabilities including data science, machine learning
Plotnine is a data visualization library for Python based on the Grammar of Graphics. It serves as a declarative statistical plotting framework and multi-panel plotting engine, allowing users to create complex charts by mapping data variables to visual properties such as position, color, and size. The project is distinguished by its use of a layered composition model and a statistical transformation engine that performs aggregations and computations before rendering visuals. It features a comprehensive system for multi-panel faceting, which enables the splitting of a single visualization into
This is an educational Python implementation of every algorithm from Li Hang's textbook on statistical learning methods. The project provides a comprehensive collection of supervised learning algorithms covering classification, regression, and sequence modeling techniques, implemented from scratch for learning and reference purposes. The repository covers a broad range of foundational machine learning methods, including decision trees built using the ID3 algorithm with information gain, ensemble boosting through AdaBoost that combines threshold-based weak learners, and probabilistic sequence
Lihang is a statistical learning algorithm library and framework providing implementations of supervised and unsupervised machine learning models. It functions as a reference repository that translates statistical learning theories into executable code for data classification and pattern recognition. The project features specialized tools for probabilistic model implementation, utilizing likelihood estimation and Bayesian methods to determine optimal model parameters. It includes a sequential data labeling tool for identifying patterns in ordered data sequences and supports both linear and no
This repository provides a collection of machine learning algorithms implemented from scratch using pure Python. It serves as an educational resource designed to demonstrate the internal logic and mathematical foundations of predictive models without relying on external machine learning frameworks or black-box libraries. The project distinguishes itself by mapping code implementations directly to their underlying statistical and calculus-based formulas. Each model is constructed using base language primitives and manual gradient descent optimization, allowing users to observe the mechanics of
This project is a collection of foundational machine learning algorithms and data science tools implemented in Python. It focuses on building the logic of these tools using basic programming primitives rather than relying on specialized libraries. The implementation covers several core domains, including a linear algebra library for matrix and vector operations, a statistical analysis toolkit for probability and hypothesis testing, and a framework for map-reduce distributed processing. It also includes implementations for natural language processing, graph theory for network analysis, and var
This project is a curated directory of educational roadmaps and resource hubs for artificial intelligence, deep learning, and machine learning. It serves as a centralized collection of academic lectures, instructional videos, and courses designed to provide structured learning paths for AI practitioners. The directory covers specialized academic curricula across several core domains, including computer vision, natural language processing, and reinforcement learning. It also provides access to niche educational content such as medical imaging, Bayesian deep learning, and probabilistic graphica
Smile is a comprehensive JVM machine learning library and statistical computing toolkit. It provides a suite of algorithms for classification, regression, and clustering, implemented natively for Java, Scala, and Kotlin. The project also functions as a deep learning framework, a natural language processing library, and an inference engine for large language models. The library distinguishes itself through GPU acceleration via LibTorch bindings and support for the ONNX model interchange format. It includes specialized capabilities for large language model inference, featuring Byte-Pair Encodin
This project provides a collection of machine learning algorithms implemented from scratch in Python. It serves as an educational resource using interactive notebooks that combine code with mathematical explanations to demonstrate the first principles of data science. The repository includes reference implementations for neural networks, such as multilayer perceptrons with backpropagation, and supervised learning models including linear and logistic regression. It also covers unsupervised learning through k-means clustering and Gaussian anomaly detection. The codebase covers a broad range of
This project is a technical learning resource and algorithm reference guide consisting of pedagogical study notes on machine learning. It provides academic summaries and conceptual breakdowns designed to help students navigate comprehensive machine learning textbooks. The content is structured as a collection of notes covering the theoretical foundations and implementation logic of supervised, unsupervised, semi-supervised, and reinforcement learning algorithms. It focuses on the mathematical foundations and logic behind various algorithmic approaches to solving data problems. The resource u
DeepSpeedExamples is a collection of reference implementations and scripts for training, fine-tuning, and executing inference on large-scale AI models using DeepSpeed optimization. It provides a distributed model training guide and practical workflows for adapting large language models through memory-efficient techniques. The repository includes specialized implementations for pipeline parallelism to handle models exceeding single GPU memory and a suite of examples for ZeRO memory optimization to reduce per-device overhead. It also features standardized test suites for benchmarking the throug
Linfa is a classical machine learning framework and statistical learning suite implemented in Rust. It provides a collection of algorithms for supervised and unsupervised learning, focused on traditional statistical methods such as regression, clustering, and decision trees. The toolkit is distinguished by its ability to be compiled into WebAssembly, enabling analytical models to execute within browser environments. It employs a trait-based algorithm interface to standardize the process of training and prediction across its various models. The library covers a broad range of capabilities, in
This project is a scientific computing framework for the .NET ecosystem, providing a comprehensive suite of libraries for numerical analysis, statistics, and mathematical optimization. It serves as a foundational toolkit for developing applications in machine learning, digital signal processing, and computer vision. The framework provides specialized toolkits for training and deploying predictive models, including neural networks, support vector machines, and decision trees. It further distinguishes itself with deep integrations for real-time visual analysis, such as object tracking and facia
PRML is a Python machine learning library and statistical learning toolkit. It provides code implementations of supervised and unsupervised learning concepts, including regression, classification, and neural network algorithms for statistical data modeling. The project functions as a pattern recognition toolkit used to identify theoretical structures within numerical datasets. It includes a neural network framework for solving nonlinear data mappings and a linear algebra toolkit that utilizes vectorized operations and matrix calculations. The library covers a broad range of capabilities, inc
This project is a comprehensive collection of machine learning educational resources, featuring a Python-based curriculum, study guides for deep learning, and a specialized knowledge base for machine learning operations. It provides structured learning paths that guide users from foundational programming through to advanced neural network implementations. The repository focuses on interactive learning by providing a directory of executable notebooks and cloud-hosted experiments. It maps theoretical research papers and textbooks to practical code implementations and maintains a curated directo
This project is a machine learning coursework repository containing a collection of Python exercises and notebooks. It is designed for implementing foundational machine learning algorithms and completing curriculum assignments through interactive documents that combine instructional text and executable code. The repository provides code formatted for compatibility with automated grading systems, allowing for the submission and validation of technical exercises. It includes predefined environment configurations and dependency locks to ensure consistent execution of data science tools across di
Gonum is a numerical computing library for the Go programming language, providing a collection of packages for scientific computing, linear algebra, statistics, and optimization. It functions as a framework for performing complex numerical computations and solving systems of linear equations. The project includes a dedicated graph analysis framework for modeling network graphs and solving connectivity and pathfinding problems. It also provides a statistical analysis toolkit for computing descriptive and inferential statistics and estimating mixture entropy. The library's capability surface c
This project is a deep learning educational resource providing a collection of TensorFlow tutorials and programming exercises. It serves as a set of machine learning code samples designed for university-level courses on machine learning research. The repository focuses on machine learning education and deep learning research, providing practical examples for implementing neural networks from scratch. It supports neural network prototyping and the development of TensorFlow models to help users apply deep learning theory to software implementations.
This is a machine learning educational repository consisting of a collection of notebooks and code examples. It provides practical implementations of diverse machine learning algorithms and workflows, ranging from traditional scientific computing to deep learning. The project features specific implementations of Scikit-Learn models, such as decision trees, random forests, and support vector machines, as well as TensorFlow examples for building neural networks, convolutional layers, and recurrent architectures. It also includes tutorials on reinforcement learning development and the creation o
This repository serves as a structured educational resource for machine learning and data science, providing a centralized collection of tutorials, lecture notes, and implementation guides. It is designed to support self-directed learning by organizing complex technical concepts into a clear, hierarchical path that spans from foundational statistical methods to advanced deep learning architectures. The project distinguishes itself through a comprehensive approach to skill development, bridging the gap between theoretical algorithmic foundations and functional software applications. It offers
This project is a structured machine learning course and educational program designed to teach data analysis and gradient boosting. It consists of a ten-week curriculum that combines theoretical readings and videos with an interactive learning path. The material is delivered through a searchable documentation site and a course generator that produces book-formatted content for offline study. The curriculum integrates interactive notebooks, demo assignments, and competitive challenges to provide a practice environment for applying concepts to real-world datasets. The project utilizes a markdo