# dlr-rm/stable-baselines3

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/dlr-rm-stable-baselines3).**

12,765 stars · 2,069 forks · Python · mit

## Links

- GitHub: https://github.com/DLR-RM/stable-baselines3
- Homepage: https://stable-baselines3.readthedocs.io
- awesome-repositories: https://awesome-repositories.com/repository/dlr-rm-stable-baselines3.md

## Topics

`baselines` `gsde` `gym` `machine-learning` `openai` `python` `pytorch` `reinforcement-learning` `reinforcement-learning-algorithms` `robotics` `sb3` `sde` `stable-baselines` `toolbox`

## Description

Stable-baselines3 is a reinforcement learning library built on the PyTorch deep learning framework. It provides a collection of reliable, standardized implementations of reinforcement learning algorithms designed for training, testing, and benchmarking agent policies in diverse simulated environments.

The library functions as an agent training toolkit that emphasizes modularity and reproducibility. It features a unified environment interface and supports vectorized execution to accelerate data collection across multiple simulation instances. Users can customize neural network architectures, feature extractors, and policy definitions to suit specific observation and action spaces, while built-in tools for deterministic seeding ensure consistent results across training runs.

Beyond core training, the project includes comprehensive utilities for managing the agent lifecycle. This encompasses memory-efficient experience replay buffering, advanced exploration strategies for continuous control, and automated monitoring of performance metrics. The framework also supports the export and distribution of trained models, facilitating collaboration and deployment across various hardware and runtime environments.

## Tags

### Artificial Intelligence & ML

- [Reinforcement Learning](https://awesome-repositories.com/f/artificial-intelligence-ml/reinforcement-learning.md) — Provides a collection of reliable implementations of reinforcement learning algorithms for training and benchmarking agents.
- [Agent Training Tools](https://awesome-repositories.com/f/artificial-intelligence-ml/agent-training-tools.md) — Functions as a comprehensive toolkit for executing simulations, managing buffers, and monitoring agent learning.
- [Deep Learning Frameworks](https://awesome-repositories.com/f/artificial-intelligence-ml/deep-learning-frameworks.md) — Built on PyTorch to provide standardized interfaces for creating and training neural network-based policies.
- [Reinforcement Learning Environments](https://awesome-repositories.com/f/artificial-intelligence-ml/reinforcement-learning-environments.md) — Implements a standardized interface for agents to interact with diverse simulation environments for training. ([source](https://stable-baselines3.readthedocs.io/](https://stable-baselines3.readthedocs.io/))
- [Deterministic](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-training-and-tuning/training-utilities/deterministic.md) — Ensures reproducible results across training runs by providing tools for deterministic random seeding. ([source](https://stable-baselines3.readthedocs.io/en/master/guide/migration.html))
- [Policy Architectures](https://awesome-repositories.com/f/artificial-intelligence-ml/modular-training-architectures/policy-architectures.md) — Enables configuration of neural network structures and hidden layers to tailor agent behavior to specific observation spaces. ([source](https://stable-baselines3.readthedocs.io/en/master/))
- [Reinforcement Learning Algorithms](https://awesome-repositories.com/f/artificial-intelligence-ml/reinforcement-learning-algorithms.md) — Implements standard reinforcement learning algorithms to facilitate training and comparative analysis of policies. ([source](https://stable-baselines3.readthedocs.io/en/master/guide/migration.html))
- [Reinforcement Learning Training Pipelines](https://awesome-repositories.com/f/artificial-intelligence-ml/reinforcement-learning-training-pipelines.md) — Provides a consistent interface for implementing and training reinforcement learning agents to solve complex tasks.
- [Experience Replay Buffers](https://awesome-repositories.com/f/artificial-intelligence-ml/experience-replay-buffers.md) — Implements memory-efficient experience replay buffers to decouple data collection from gradient-based optimization.
- [Custom](https://awesome-repositories.com/f/artificial-intelligence-ml/feature-extractors/custom.md) — Supports specialized neural network modules to process raw observations like images or multi-modal data. ([source](https://stable-baselines3.readthedocs.io/en/master/guide/custom_policy.html))
- [Model Exporters](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-deployment-and-serving/serialization-and-export-formats/model-exporters.md) — Provides utilities to export and serialize trained reinforcement learning models for cross-platform deployment and inference. ([source](https://stable-baselines3.readthedocs.io/en/master/))
- [Custom Policy Definitions](https://awesome-repositories.com/f/artificial-intelligence-ml/modular-training-architectures/policy-architectures/custom-policy-definitions.md) — Allows granular control over actor-critic architectures by supporting custom policy class definitions. ([source](https://stable-baselines3.readthedocs.io/en/master/guide/custom_policy.html))
- [Multi-Output Action Architectures](https://awesome-repositories.com/f/artificial-intelligence-ml/policy-gradient-implementations/multi-output-action-architectures.md) — The library handles dictionary action spaces to allow for independent or mixed discrete and continuous action outputs within a single policy. ([source](https://stable-baselines3.readthedocs.io/en/master/misc/projects.html))
- [Reinforcement Learning Algorithm Analyzers](https://awesome-repositories.com/f/artificial-intelligence-ml/reinforcement-learning-algorithms/reinforcement-learning-algorithm-analyzers.md) — Provides tools for comparing and benchmarking the performance of different reinforcement learning algorithms across environments. ([source](https://cdn.jsdelivr.net/gh/DLR-RM/stable-baselines3@master/README.md))
- [Vectorized Environments](https://awesome-repositories.com/f/artificial-intelligence-ml/reinforcement-learning-environments/vectorized-environments.md) — Accelerates data collection by running multiple simulation instances in parallel across CPU cores. ([source](https://stable-baselines3.readthedocs.io/en/master/))
- [Intrinsic Reward Modules](https://awesome-repositories.com/f/artificial-intelligence-ml/exploration-strategies/intrinsic-reward-modules.md) — Integrates intrinsic reward mechanisms to improve agent exploration in environments with sparse feedback. ([source](https://stable-baselines3.readthedocs.io/en/master/misc/projects.html))
- [Reinforcement Learning Environments](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/frameworks/reinforcement-learning-environments.md) — Provides a standardized interface for agent-environment interaction across heterogeneous reinforcement learning tasks.
- [Agent Performance Evaluators](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/frameworks/reinforcement-learning-environments/reinforcement-learning-performance-visualizers/agent-performance-evaluators.md) — Assesses agent behavior and policy stability during training using automated callbacks and video recording. ([source](https://stable-baselines3.readthedocs.io/en/master/))
- [Training Monitoring Tools](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/training-monitoring-and-profiling/training-observability-systems/training-monitoring-tools.md) — Tracks and logs training metrics and hyperparameters to external visualization tools for analysis. ([source](https://stable-baselines3.readthedocs.io/en/master/))
- [Noise](https://awesome-repositories.com/f/artificial-intelligence-ml/exploration-strategies/noise.md) — Enhances training stability in continuous control tasks by utilizing pink noise exploration strategies. ([source](https://stable-baselines3.readthedocs.io/en/master/misc/projects.html))
- [State-Dependent Exploration](https://awesome-repositories.com/f/artificial-intelligence-ml/exploration-strategies/state-dependent-exploration.md) — Improves agent performance in complex environments through specialized state-dependent exploration strategies. ([source](https://stable-baselines3.readthedocs.io/en/master/guide/migration.html))
- [Intrinsic Reward Mechanisms](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/machine-learning-training/objectives-and-optimization/mathematical-training-objectives/reward-functions/intrinsic-reward-mechanisms.md) — Augments standard objective functions with auxiliary signals to encourage exploration in sparse-reward environments.
- [Performance Benchmarking](https://awesome-repositories.com/f/artificial-intelligence-ml/reinforcement-learning-algorithms/reinforcement-learning-simulators/performance-benchmarking.md) — Compares reinforcement learning algorithm performance by evaluating agents and tracking metrics across environments.
- [Optimizer Configurations](https://awesome-repositories.com/f/artificial-intelligence-ml/reinforcement-learning-optimizers/optimizer-configurations.md) — Allows customization of the optimization process by selecting specific optimizer classes and parameters. ([source](https://stable-baselines3.readthedocs.io/en/master/guide/migration.html))
- [Remote Model Hubs](https://awesome-repositories.com/f/artificial-intelligence-ml/machine-learning/infrastructure/model-deployment-and-serving/model-hubs-and-pre-made-models/model-management-utilities/remote-model-hubs.md) — Facilitates community collaboration by enabling the upload, versioning, and distribution of pre-trained agents. ([source](https://stable-baselines3.readthedocs.io/en/master/guide/integrations.html))
- [Model Exporting](https://awesome-repositories.com/f/artificial-intelligence-ml/model-training/model-exporting.md) — Supports exporting and distributing trained agents for use across different hardware and runtime environments.
- [Neural Network Architectures](https://awesome-repositories.com/f/artificial-intelligence-ml/neural-network-architectures.md) — Supports configuration of specialized policy networks and feature extractors for diverse observation spaces.
- [Modular Architectures](https://awesome-repositories.com/f/artificial-intelligence-ml/neural-network-architectures/modular-architectures.md) — Separates feature extraction from decision-making logic to allow flexible neural network configurations.
- [Observation Transformers](https://awesome-repositories.com/f/artificial-intelligence-ml/observation-processing/observation-transformers.md) — Processes complex multi-input observation spaces by transforming them into unified feature vectors for agent consumption. ([source](https://stable-baselines3.readthedocs.io/en/master/guide/custom_policy.html))
- [Training Callbacks](https://awesome-repositories.com/f/artificial-intelligence-ml/training-callbacks.md) — Provides callback mechanisms to inject custom logic into the training loop for monitoring and checkpointing.

### Part of an Awesome List

- [Deep Learning](https://awesome-repositories.com/f/awesome-lists/ai/deep-learning.md) — Listed in the “Deep Learning” section of the Awesome Python awesome list.
- [Reinforcement Learning](https://awesome-repositories.com/f/awesome-lists/ai/reinforcement-learning.md) — Reliable PyTorch implementations of reinforcement learning algorithms.

### Development Tools & Productivity

- [Parallel Execution](https://awesome-repositories.com/f/development-tools-productivity/parallel-execution.md) — Executes multiple simulation instances in parallel to accelerate data collection and improve training efficiency.

### Scientific & Mathematical Computing

- [Research and Data Analysis Tools](https://awesome-repositories.com/f/scientific-mathematical-computing/research-analysis-workflows/research-and-data-analysis-tools.md) — Provides a platform for research into advanced exploration strategies and continuous control tasks.
