←BackLliziniu/ReMax0Copy as MarkdownView on GitHub↗0 stars·0 forks·0 viewsReMaxReMax is a reinforcement learning method, tailored for reward maximization in RLHF. FeaturesCritic-Free Algorithms - Simple and efficient alignment method for large language models.