30 open-source projects similar to yandexdataschool/practical_rl, ranked by how many features they have in common. Compare stars, activity and what each one does to find the best Practical RL alternative.
This project is a collection of reinforcement learning implementations and educational materials written in Python. It provides neural network architectures for solving control tasks through deep reinforcement learning, spanning value-based and policy-gradient methods. The repository includes a library of evolutionary strategies and genetic algorithms as alternatives to gradient-based learning. It also features a model-based system for predicting future environment states and rewards to enable internal simulation and offline planning. The codebase covers a wide range of capabilities, includi
This project is a collection of PyTorch learning resources and educational guides designed to teach the construction and training of neural networks. It serves as a comprehensive deep learning tutorial covering various model architectures and practical implementation strategies. The resources provide specific guidance on implementing computer vision tasks, such as image classification and synthetic imagery generation, as well as reinforcement learning agents using value networks and experience replay. It also covers sequential data modeling through recurrent networks and generative modeling u
This project is an educational repository of reinforcement learning agents and tutorials implemented using TensorFlow. It provides a practical codebase for both model-free and model-based learning agents, designed to demonstrate how AI agents learn through trial and error. The collection features detailed implementations of various algorithmic approaches, including Deep Q-Networks and Policy Gradient methods. It specifically covers Actor-Critic architectures for continuous and discrete action spaces, alongside Proximal Policy Optimization and Deep Deterministic Policy Gradients. The framewor
DRL is a curated educational resource that teaches deep reinforcement learning through a structured series of lectures and videos. It covers the three main families of reinforcement learning methods: actor-critic architectures, value-based algorithms like Q-learning and DQN, and policy-based techniques that directly optimize an agent's action-selection strategy. The curriculum extends beyond these core topics to include imitation learning, multi-agent training, and methods for handling continuous action spaces. Content is organized as markdown-driven documentation that generates static, navig
TensorLayer is a backend-agnostic tensor library and deep learning framework designed for building neural network architectures. It provides a neural network abstraction layer that allows model logic to run across different deep learning engines using high-level layers and model components. The project serves as a deep reinforcement learning toolkit for implementing policy-based, value-based, and actor-critic agents. It includes specialized tools for managing experience replay and gradient-based policy optimization to handle both discrete and continuous action spaces. To support reinforcemen
This project is a PyTorch reinforcement learning library and agent training framework. It provides a suite of deep reinforcement learning algorithms, including DQN, PPO, and SAC, to facilitate the development of autonomous agents that optimize behavior through trial and error. The library focuses on the implementation of various actor-critic methods and deep learning architectures for research into autonomous decision making. It enables the training of intelligent agents within diverse environments by leveraging PyTorch-based model implementations. The codebase covers core reinforcement lear
This repository is a collection of practical deep learning implementations and examples built using the TensorFlow framework. It provides a variety of neural network architectures focusing on natural language processing, recommendation systems, reinforcement learning, and time series prediction. The project features a range of specialized models, including sequence-to-sequence and transformer architectures for text processing, and factorization machines for personalized ranking and retrieval. It also includes implementations of reinforcement learning agents using actor-critic and policy gradi
This is a PyTorch-based toolkit for training reinforcement learning agents, providing implementations of standard and hierarchical deep RL algorithms. It is designed as a library for deep reinforcement learning research and experimentation, supporting both discrete and continuous control tasks through a collection of algorithm implementations. The project distinguishes itself by offering a hierarchical reinforcement learning framework that decomposes complex long-horizon tasks into manageable sub-goals using meta-controllers and lower-level policies. It also includes a Hindsight Experience Re
This project is a machine learning implementation library featuring a collection of code examples that implement supervised, unsupervised, and reinforcement learning algorithms from scratch. It provides a comprehensive set of toolkits for core machine learning components, including a natural language processing toolkit, a reinforcement learning framework, and suites for data dimensionality reduction and pattern mining. The library includes specialized implementations for reinforcement learning, such as Q-Learning, Deep Q-Networks, and Actor-Critic agents. The natural language processing capab
Hands-on-RL is an interactive educational resource and collection of Jupyter notebooks designed for learning reinforcement learning. It combines technical theory with practical, runnable code to demonstrate the implementation and training of mainstream reinforcement learning agents. The project focuses on bridging the gap between theory and practice through a tutorial structure that organizes explanations and executable code blocks sequentially. It enables the prototyping of reinforcement learning models to observe their behavior and performance in real-time. The implementation utilizes a mo
DouZero is a deep reinforcement learning framework and training system designed to teach digital agents to master complex card games. It provides the infrastructure to implement high-throughput reinforcement learning pipelines and evaluate the competitive success of game agents. The system utilizes a distributed actor-learner architecture that separates game simulation actors from GPU training devices to accelerate model convergence. It combines Monte Carlo Tree Search with policy-based value estimation to determine optimal moves through recursive evaluation and random sampling. The toolkit
Baselines is a comprehensive suite of frameworks for reinforcement learning algorithm implementation, imitation learning, and training orchestration. It provides a library of standardized learning algorithms used to benchmark and replicate research results, alongside a deep learning policy framework for constructing neural network architectures such as multi-layer perceptrons, convolutional networks, and long short-term memory networks. The project includes a specialized imitation learning toolkit that enables agents to mimic expert behavior through behavior cloning and generative adversarial
This project is a comprehensive collection of practical code examples and implementation libraries for machine learning. It provides a wide array of reference materials for building supervised, unsupervised, and reinforcement learning algorithms. The repository serves as a multi-domain resource, featuring specific implementation suites for financial AI, Bayesian statistical modeling, and deep learning architectures. It includes a framework for training intelligent agents using policy gradients and actor-critic models, as well as practical guides for fine-tuning transformers and utilizing larg
This project is a deep reinforcement learning agent and AI game automation tool designed to master game objectives by analyzing visual input. It implements a Deep Q-Network to train an autonomous bot that learns to play a video game by maximizing rewards through deep Q-learning. The system utilizes a convolutional neural network to process raw pixel data from game frames, identifying patterns to determine optimal real-time actions. Training is stabilized through the use of an experience replay buffer and an epsilon-greedy action selection strategy to balance exploration and exploitation. The
This repository is a comprehensive collection of instructional guides and practical examples for Python development, focusing on machine learning, data science, and web scraping. It provides implementations for neural networks, reinforcement learning algorithms, and deep learning architectures using PyTorch, alongside detailed manuals for scientific computing and data visualization. The project distinguishes itself by offering specialized tutorials on concurrent programming to optimize CPU performance and guides for setting up Linux development environments. It covers the implementation of ad
Leela Zero is a deep learning Go engine and reinforcement learning system that implements the AlphaGo Zero approach. It utilizes deep residual convolutional networks and Monte Carlo Tree Search to determine optimal moves and analyze the game of Go. The project functions as a neural network training tool that generates data through automated self-play. It uses a supervised learning pipeline to refine network weights, allowing the system to improve its game-playing capabilities without relying on human-provided data or expert knowledge. The engine includes game scoring logic to determine winne
Minigo is a TensorFlow-based reinforcement learning engine designed to master the game of Go. It functions as a comprehensive system for training neural networks to predict board policies and game outcomes, utilizing a model trainer to generate self-play data and optimize weights. The project is distinguished by its ability to perform large-scale game simulations using Kubernetes to distribute worker nodes across CPU, GPU, and TPU hardware. It employs a Monte Carlo Tree Search implementation to identify optimal moves and supports specialized hardware acceleration, including inference on Edge
This repository provides a comprehensive library of reinforcement learning algorithms designed for training autonomous agents. It serves as a research-oriented collection of implementations that cover fundamental decision-making strategies, including dynamic programming, temporal difference learning, and policy gradient methods. The project distinguishes itself by offering specialized frameworks for deep reinforcement learning and structured decision modeling. It includes implementations for deep Q-learning that utilize neural networks, experience replay, and prioritized sampling to approxima
This project is an educational platform and research toolkit designed to teach deep learning through a combination of mathematical theory, visual diagrams, and executable code. It provides a comprehensive environment for building, training, and evaluating neural networks, grounding complex concepts in interactive computational notebooks that allow for hands-on experimentation. The framework distinguishes itself by interleaving theoretical foundations—including linear algebra, calculus, and probability—with practical implementations across multiple industry-standard libraries. It supports flex
keras-rl is a reinforcement learning library that enables the training of neural agents using Keras. It serves as a framework for implementing deep reinforcement learning agents that interact with simulated environments to discover optimal behaviors and maximize cumulative rewards. The library provides a system for configuring, training, and managing neural network agents. It handles the interaction loop between agents and environments, allowing models to learn through direct experience and gradient-based optimization. The framework includes capabilities for model weight management, allowing
This repository is a collection of implementation references and solved notebooks covering supervised, unsupervised, and reinforcement learning techniques. It provides practical guides for building predictive models, clustering algorithms, and autonomous agents. The project includes specific implementations for neural network architectures, such as multi-layer perceptrons for digit recognition, and recommender systems using collaborative and content-based filtering. It also features reinforcement learning systems that utilize deep Q-learning to optimize decision-making policies. The codebase
This project is a deep reinforcement learning curriculum providing educational materials and implementation exercises for mastering neural network-based agents. It serves as a framework for building reference versions of value-based and policy-based methods to solve sequential decision problems. The project provides specific implementations for continuous control simulations and multi-agent reinforcement learning, where agents are trained to cooperate or compete in shared environments. It includes a policy gradient framework for optimizing agent behavior through methods such as REINFORCE. Ca
This project is a comprehensive research platform designed for the end-to-end lifecycle of robotic learning. It provides a modular framework for training neural network policies—specifically through imitation and reinforcement learning—and deploying them onto physical robotic hardware. By offering a unified interface for hardware abstraction, the platform decouples high-level control logic from the specific sensors and actuators of diverse robotic systems. The framework distinguishes itself through a standardized approach to data and policy management. It utilizes a consistent schema for reco
Spinning Up is a deep reinforcement learning curriculum designed to teach the theory and implementation of deep reinforcement learning algorithms. It serves as a guided educational resource for understanding how agents interact with environments through mathematical models and code. The project provides a research roadmap consisting of a curated collection of influential research papers and theoretical concepts. This literature study is designed to guide a deeper exploration of specific reinforcement learning domains. The curriculum covers the implementation of reinforcement learning logic t
Dopamine is a reinforcement learning research framework designed for prototyping and testing algorithms across diverse simulated environments. It provides an agent development toolkit that utilizes a flat class hierarchy to facilitate the creation and extension of learning agents. The framework includes a standardization layer via environment wrappers that connect agents to various physics simulations and gaming environments. It also features a high-performance experience replay buffer for storing and sampling transition data to improve training stability, alongside a dedicated hyperparameter
This project is a comprehensive deep reinforcement learning course and training platform. It provides a structured educational curriculum that combines theoretical lessons with hands-on tutorials to teach the implementation of neural networks and agent behavior. The platform integrates a model sharing hub where users can upload, download, and version trained machine learning models. It also features a benchmarking system that uses leaderboards to evaluate and compare agent performance against community standards. The educational experience is delivered through interactive notebooks and inclu
SO-ARM100 is an open-source robot arm hardware project providing 3D-printable designs and assembly guides for building affordable robotic arms. It includes calibration software to synchronize motor communication parameters and arm positions via USB, alongside hardware designs for tactile sensing robotic grippers. The project distinguishes itself through the integration of touch-sensing and flexible filaments for adaptive grasping. It also provides a dedicated imitation learning dataset tool, featuring a web interface for labeling and visualizing robotics data to train machine learning models
Tianshou is a reinforcement learning framework designed for developing and testing agents. It provides a system for implementing custom agents by defining policies and parameter update rules to optimize agent behavior. The framework decouples neural network architectures from update logic through policy-based abstractions and separates data pre-processing from gradient updates. It utilizes a collector-driven pipeline to stream experience from environments into structured memory buffers for sampled learning. The system supports vectorized environment execution to run multiple parallel instanc
Tensorpack is a high-level TensorFlow neural network framework and research library designed for building and training deep learning models. It provides a collection of reproducible neural network architectures for computer vision, generative tasks, reinforcement learning, and natural language processing. The project distinguishes itself through a specialized deep learning data pipeline that uses pure Python for parallel data loading and streaming. It includes a multi-GPU training orchestrator for distributing workloads via data-parallel strategies and a dedicated interpretability toolkit for