L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning

Features

Regularization Objectives - Controlling reasoning depth through reinforcement learning.