- Installation — Docker (recommended), pip, or building from source on NVIDIA / AMD.
- Quick Start —
docker pullto a GRPO run on Qwen3-4B in under an hour, on a single 8-GPU node.
Welcome
Getting Started
Install Miles and run your first RL training job — the two pages you need to go from zero to a working loop.
Two pages take you from a fresh machine to a running RL training job:

