# elliottyan/luffy

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/elliottyan-luffy).**

0 stars · 0 forks

## Links

- GitHub: https://github.com/ElliottYan/LUFFY
- awesome-repositories: https://awesome-repositories.com/repository/elliottyan-luffy.md

## Description

LUFFY: Learning to Reason Under Off‑Policy Guidance A general framework for off-policy learning in large reasoning models.

## Tags

### Part of an Awesome List

- [Off-Policy Optimization](https://awesome-repositories.com/f/awesome-lists/ai/off-policy-optimization.md) — Reasoning under off-policy guidance for improved performance.
