rlm is an LLM code execution engine and orchestration framework designed to coordinate multiple language model calls and recursive sub-tasks through a programmable environment. It provides a sandboxed REPL environment and a recursive context processor to handle inputs that exceed standard token limits by programmatically decomposing prompts.
The project differentiates itself through a reinforcement learning training harness used to teach models how to utilize recursive calls and code execution. It includes a reasoning visualization system that records and renders execution trajectories to analyze how models decompose and solve complex tasks.
The system supports secure code execution via pluggable backends, including cloud virtual machines, isolated containers, and local processes. It manages state across multiple turns using a REPL-based environment and allows for the injection of custom tools and external functions into the execution flow.
The framework is implemented in Python.