This project is a clean fork of the original veRL project to support vision language models, we thank all the authors for providing such a high-performance RL training framework.
Features
Reasoning Models - User-friendly framework for training reasoning-focused models.