Skip to content

aadhithya14/Model-Free-Goal-Conditioned-Quadrupedal-Locomotion

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 

Repository files navigation

Sim2Real RL on Solo12

Environment for learning robust goal conditioned agile locomotion policies on 12DOF quadruped robot Solo12 and Sim2Real transfer to the Real Robot.

The learning comprises of multiple stages.

  • The first stage includes offline imitation of the demonstration using PPO with imitation reward and random initialization at any point in demonstration trajectory.

  • The next stage involves discarding the demonstration and online learning on top of the base policy learnt using imitation. This stage involves task specific rewards, trajectory tracking reward for ensuring the robustness of the PD controller and torque smoothness penalties for easy transfer to the real robot.

  • The third stage involved devising robust Sim2Real transfer approach using careful system identification and domain randomization.

Results on the Real Robot

solo12.gif

The resulutant policy was found to be robust to push recovery, independent of initial position and displacement command driven. The policy also had very less impact on the knee joints during execution of agile locomotion gait like jumping.

About

Sim2Real Goal Conditioned RL for Quadruped Robot Locomotion (Solo12).

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages