Robust Fall Recovery for Wheeled-Legged Robots via Curriculum-Guided Dynamic Reward Shaping

Learning to recover with wheel-leg coordination

Abstract

This paper presents a learning-based framework integrating Episode-based Dynamic Reward Shaping and curriculum learning, which dynamically balances exploration of diverse recovery maneuvers with precise posture refinement. We further demonstrate that synergistic wheel-leg coordination reduces joint torque consumption by 15.85%–26.2% and improves stabilization through energy transfer mechanisms. Extensive evaluations on two distinct platforms achieve recovery success rates up to 99.1% and 97.8% without platform-specific tuning.

Framework and Fall Simulation

The strategy can learn effective recovery actions and demonstrates remarkable robustness.

Face down.

Random state-I Recovery
Random state-II Recovery
Random state-III Recovery
Random state-IV Recovery
Random state-V Recovery

Back down.

Random state-I Recovery
Random state-II Recovery
Random state-III Recovery(Failure)

ED-policy vs Baseline

We show that our ED-policy can recover from the falling postures of the baseline.

ED-policy
Baseline

Recovery under Repeated Perturbations

We show that our policy performs on the Unitree Go2-W robot.

Wheel
Leg-driven

Diverse Terrains Depolyment

We show that our policy performs on the non-flat environments.

Box Grid
Pyramid Slope
Random Rough
Pyramid Stairs
Inverted Pyramid Stairs

Bibtex




  

Acknowledgements:

Template for this Website