This project is an educational repository of reinforcement learning agents and tutorials implemented using TensorFlow. It provides a practical codebase for both model-free and model-based learning agents, designed to demonstrate how AI agents learn through trial and error. The collection features detailed implementations of various algorithmic approaches, including Deep Q-Networks and Policy Gradient methods. It specifically covers Actor-Critic architectures for continuous and discrete action spaces, alongside Proximal Policy Optimization and Deep Deterministic Policy Gradients. The framewor
This repository provides a comprehensive library of reinforcement learning algorithms designed for training autonomous agents. It serves as a research-oriented collection of implementations that cover fundamental decision-making strategies, including dynamic programming, temporal difference learning, and policy gradient methods. The project distinguishes itself by offering specialized frameworks for deep reinforcement learning and structured decision modeling. It includes implementations for deep Q-learning that utilize neural networks, experience replay, and prioritized sampling to approxima
This project is a collection of reinforcement learning implementations and educational materials written in Python. It provides neural network architectures for solving control tasks through deep reinforcement learning, spanning value-based and policy-gradient methods. The repository includes a library of evolutionary strategies and genetic algorithms as alternatives to gradient-based learning. It also features a model-based system for predicting future environment states and rewards to enable internal simulation and offline planning. The codebase covers a wide range of capabilities, includi
Baselines is a comprehensive suite of frameworks for reinforcement learning algorithm implementation, imitation learning, and training orchestration. It provides a library of standardized learning algorithms used to benchmark and replicate research results, alongside a deep learning policy framework for constructing neural network architectures such as multi-layer perceptrons, convolutional networks, and long short-term memory networks. The project includes a specialized imitation learning toolkit that enables agents to mimic expert behavior through behavior cloning and generative adversarial