# ruc-nlpir/arpo

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/ruc-nlpir-arpo).**

1,049 stars · 60 forks · Python

## Links

- GitHub: https://github.com/RUC-NLPIR/ARPO
- awesome-repositories: https://awesome-repositories.com/repository/ruc-nlpir-arpo.md

## Description

[ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)

## Tags

### Part of an Awesome List

- [Dense Reward Optimization](https://awesome-repositories.com/f/awesome-lists/ai/dense-reward-optimization.md) — Agentic reinforcement learning for policy optimization.
- [Policy Optimization](https://awesome-repositories.com/f/awesome-lists/ai/policy-optimization.md) — Agentic reinforcement learning for policy optimization.
- [Tool Optimization](https://awesome-repositories.com/f/awesome-lists/ai/tool-optimization.md) — Agentic reinforced policy optimization.
