# anitaleungxx/remix-reincarnated-mix-policy-proximal-policy-gradient

**Attribution required: if you use, quote, or summarise this content, you must credit and link back to [awesome-repositories.com](https://awesome-repositories.com/repository/anitaleungxx-remix-reincarnated-mix-policy-proximal-policy-gradient).**

0 stars · 0 forks

## Links

- GitHub: https://github.com/AnitaLeungxx/ReMix-Reincarnated-Mix-policy-Proximal-Policy-Gradient
- awesome-repositories: https://awesome-repositories.com/repository/anitaleungxx-remix-reincarnated-mix-policy-proximal-policy-gradient.md

## Description

🧽 Squeeze the Soaked Sponge 🌊 Efficient Off-policy Reinforcement Finetuning for Large Language Model

## Tags

### Part of an Awesome List

- [Off-Policy Optimization](https://awesome-repositories.com/f/awesome-lists/ai/off-policy-optimization.md) — Efficient off-policy reinforcement fine-tuning for language models.
