Sim2Real RL on Solo12

Environment for learning robust, goal-conditioned, agile locomotion policies on the 12-DOF quadruped robot Solo12, and for transferring them from simulation to the real robot (Sim2Real).

Learning proceeds in multiple stages.

The first stage is offline imitation of the demonstration using PPO with an imitation reward and random initialization at any point in the demonstration trajectory.
The second stage discards the demonstration and continues learning online on top of the base policy learnt through imitation. It uses task-specific rewards, a trajectory-tracking reward to ensure the robustness of the PD controller, and torque-smoothness penalties to ease transfer to the real robot.
The third stage involves devising a robust Sim2Real transfer approach using careful system identification and domain randomization.
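
The ingredients named in the three stages can be illustrated with a minimal, standard-library-only sketch. It is not the code from this repository: every function name, weight, and parameter range below is a hypothetical choice made for illustration, the reward shapes (an exponentiated squared error and a squared torque difference) are common conventions rather than confirmed details of this project, and the PPO update itself is omitted.

```python
import math
import random

# NOTE: all names, weights, and ranges here are illustrative assumptions,
# not values taken from this repository.


def sample_reference_start(demo_length, rng):
    """Stage 1: choose a random index in the demonstration to start the episode from."""
    return rng.randrange(demo_length)


def imitation_reward(joint_pos, demo_joint_pos, scale=5.0):
    """Stage 1: reward in (0, 1] that decays with squared error to the demonstration pose."""
    err = sum((q - q_ref) ** 2 for q, q_ref in zip(joint_pos, demo_joint_pos))
    return math.exp(-scale * err)


def torque_smoothness_penalty(torques, prev_torques, weight=0.01):
    """Stage 2: non-positive penalty on the change in joint torques between steps."""
    return -weight * sum((t - p) ** 2 for t, p in zip(torques, prev_torques))


def sample_randomized_dynamics(rng):
    """Stage 3: draw simulator parameters uniformly from hypothetical ranges."""
    return {
        "ground_friction": rng.uniform(0.5, 1.2),
        "base_mass_scale": rng.uniform(0.9, 1.1),
        "pd_kp": rng.uniform(2.5, 3.5),
        "pd_kd": rng.uniform(0.15, 0.25),
        "action_latency_s": rng.uniform(0.0, 0.01),
    }


if __name__ == "__main__":
    rng = random.Random(0)
    num_joints = 12  # Solo12 has 12 actuated joints

    start = sample_reference_start(demo_length=200, rng=rng)
    demo_pose = [0.1] * num_joints
    robot_pose = [0.12] * num_joints
    print("start index:", start)
    print("imitation reward:", imitation_reward(robot_pose, demo_pose))
    print("smoothness penalty:",
          torque_smoothness_penalty([0.5] * num_joints, [0.4] * num_joints))
    print("episode dynamics:", sample_randomized_dynamics(rng))
```

In a setup like this, the dynamics would typically be resampled at every episode reset, with the ranges centered on values obtained from system identification, so that the policy does not overfit to one simulator configuration.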

Results on the Real Robot

The resultant policy was found to be robust in push recovery, independent of the initial position, and driven by displacement commands. It also produced very little impact on the knee joints while executing agile locomotion gaits such as jumping.

aadhithya14/Model-Free-Goal-Conditioned-Quadrupedal-Locomotion