[Paper] [Project Website] [Data]
Zichen Jeff Cui, Hengkai Pan, Aadhithya Iyer, Siddhant Haldar and Lerrel Pinto, New York University
This repo contains code for DynaMo visual pretraining, and for reproducing sim environment experiments. Datasets will be uploaded soon.
The following assumes our current working directory is the root directory of this project repo; tested on Ubuntu 22.04 LTS (amd64).
- Install the project environment:
conda env create --file=conda_env.yml
- Activate the environment:
conda activate dynamo-repro
- To enable logging, log in with a
wandb
account:Alternatively, to disable logging altogether, set the environment variablewandb login
WANDB_MODE
:export WANDB_MODE=disabled
(Updated Sep 29: sim kitchen dataset now supports lazy loading: set prefetch=False
in the sim kitchen configs. If you encounter errors, try downloading the latest dataset zips from the link above.)
- Download all files in the
datasets
directory and unzip. - In
./configs/env_vars/env_vars.yaml
, setdataset_root
to the unzipped parent directory containing all datasets. - In
./eval_configs/env_vars/env_vars.yaml
, setdataset_root
to the unzipped parent directory containing all datasets. - In
./eval_configs/env_vars/env_vars.yaml
, setsave_path
to where you want to save the rollout results (e.g. root directory of this repo). - Environments:
sim_kitchen
: Franka kitchen environmentblock_push_multiview
: Block push environmentlibero_goal
: LIBERO Goal environmentpusht
: Push-T environment
The following assumes our current working directory is the root directory of this project repo.
To reproduce the experiment results, the overall steps are:
-
Activate the conda environment with
conda activate dynamo-repro
-
Train the visual encoder with
python3 train.py --config-name=train_*
. A model snapshot will be saved to./exp_local/...
; -
In
eval_configs/encoder
, in the corresponding environment config, set the encoder file pathf
to the saved snapshot; -
Eval with
python3 online_eval.py --config-name=train_*
.
See below for detailed steps for each environment.
-
Train the encoder:
python3 train.py --config-name=train_sim_kitchen
Snapshots will be saved to a new timestamped directory
./exp_local/{date}/{time}_train_sim_kitchen_dynamo
.The encoder snapshot will be at
./exp_local/{date}/{time}_train_sim_kitchen_dynamo/encoder.pt
. -
In
eval_configs/encoder/kitchen_dynamo.yaml
, setSNAPSHOT_PATH
to the absolute path of the encoder snapshot above. -
Evaluation:
MUJOCO_GL=egl python3 online_eval.py --config-name=train_sim_kitchen
-
Train the encoder:
python3 train.py --config-name=train_blockpush
Snapshots will be saved to a new timestamped directory
./exp_local/{date}/{time}_train_blockpush_dynamo
.The encoder snapshot will be at
./exp_local/{date}/{time}_train_blockpush_dynamo/encoder.pt
. -
In
eval_configs/encoder/blockpush_dynamo.yaml
, setSNAPSHOT_PATH
to the absolute path of the encoder snapshot above. -
Evaluation:
ASSET_PATH=$(pwd) python3 online_eval.py --config-name=train_blockpush
(Evaluation requires including this repository in
ASSET_PATH
.)
-
Train:
python3 train.py --config-name=train_pusht
Snapshots will be saved to a new timestamped directory
./exp_local/{date}/{time}_train_pusht_dynamo
.The encoder snapshot will be at
./exp_local/{date}/{time}_train_pusht_dynamo/encoder.pt
-
In
eval_configs/encoder/pusht_dynamo.yaml
, setSNAPSHOT_PATH
to the absolute path of the encoder snapshot above. -
Evaluation:
python3 online_eval.py --config-name=train_pusht
-
Train:
python3 train.py --config-name=train_libero_goal
Snapshots will be saved to a new timestamped directory
./exp_local/{date}/{time}_train_libero_goal_dynamo
.The encoder snapshot will be at
./exp_local/{date}/{time}_train_libero_goal_dynamo/encoder.pt
-
In
eval_configs/encoder/libero_dynamo.yaml
, setSNAPSHOT_PATH
to the absolute path of the encoder snapshot above. -
Evaluation:
MUJOCO_GL=egl python3 online_eval.py --config-name=train_libero_goal
-
Plug in your dataset in these files:
datasets/your_dataset.py
configs/env/your_dataset.yaml
configs/env_vars/env_vars.yaml
-
Check the inverse/forward model configs:
configs/train_your_dataset.yaml
- This is the main config.
configs/ssl/dynamo_your_dataset.yaml
- If the model converges slowly, try setting
ema_beta
tonull
to use SimSiam instead of EMA encoder during training.
- If the model converges slowly, try setting
configs/projector/inverse_dynamics_your_dataset.yaml
- We find that setting the inverse dynamics
output_dim
to approximately the underlying state dimension usually works well.- For sim environments, this is the state-based observation dimension.
- For real environments, e.g. a 7DoF robot arm + gripper (1D) manipulating a rigid object (6D), this would be ~16 dimensions.
- We find that setting the inverse dynamics
-
Add linear probes for training diagnostics:
workspaces/your_workspace.py
- This template computes linear probe and nearest neighbor MSE from the image embeddings to states/actions, for monitoring training convergence.
- It assumes that your dataset class has
states
(batch
xtime
xstate_dim
) andactions
(batch
xtime
xaction_dim
) attributes.- For a real-world dataset, you can use proprioception as the state.