←BacklangfengQ/verl-agent0Copy as MarkdownView on GitHub↗1,548 stars·140 forks·Python·apache-2.0·0 viewshuggingface.co/papers/2505.10978↗Verl AgentFeaturesDense Reward Optimization - Group-in-group policy optimization for agent training.Reinforcement Learning Frameworks - Policy optimization framework for training language model agents.