0.77 m/s locomotion achieved in simulation
Our reinforcement learning policy reached 0.77 m/s sustained forward velocity on the 18-DOF quadruped model. The policy was trained using Isaac Gym with domain randomization across friction, payload, and terrain parameters. Per-joint clearance constraints keep feet above ground through the full gait cycle, and drift self-corrects without explicit heading control.