This is the source code for the final project of the Reinforcement Learning course.

Training Instructions

We train all the algorithms on the NYU Greene HPC.

sbatch scripts/ppo_train.sh ppo_lr05_ent15_cp2 0

For example, the command above trains the PPO algorithm on the NYU HPC. The first argument, ppo_lr05_ent15_cp2, encodes the hyperparameters: learning rate $5 \cdot 10^{-5}$, entropy coefficient $15 \cdot 0.001$, and clip range $2 \cdot 0.1$. The last argument, 0, sets the seed to 0.
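
The naming convention described above can be decoded mechanically. The following is a minimal sketch in Python, not code from this repository: the function `parse_run_name`, the key names in `NAMES`, and the assumption that the learning-rate unit is $10^{-5}$ are all illustrative. The scale factors are taken from the description of the run name.

```python
import re

# Scale factor for each prefix, following the README's description:
# lr -> multiples of 1e-5, ent -> multiples of 0.001, cp -> multiples of 0.1.
SCALES = {"lr": 1e-5, "ent": 1e-3, "cp": 1e-1}

# Hypothetical human-readable names for the decoded values.
NAMES = {"lr": "learning_rate", "ent": "ent_coef", "cp": "clip_range"}


def parse_run_name(run_name):
    """Decode a run name such as 'ppo_lr05_ent15_cp2' into hyperparameters."""
    algo, *fields = run_name.split("_")
    params = {"algo": algo}
    for field in fields:
        match = re.fullmatch(r"([a-z]+)(\d+)", field)
        if match is None or match.group(1) not in SCALES:
            raise ValueError(f"unrecognized field: {field!r}")
        key, digits = match.groups()
        params[NAMES[key]] = int(digits) * SCALES[key]
    return params


config = parse_run_name("ppo_lr05_ent15_cp2")
print(config)
```

Under these assumptions, the example run name decodes to a learning rate of about 5e-5, an entropy coefficient of about 0.015, and a clip range of about 0.2. Whether scripts/ppo_train.sh parses the name this way is not confirmed here.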


NovTi/RL-Project

Folders and files

Latest commit

History

Repository files navigation

This is source code for the final project of course Reinforcement Learning

Training Instruction

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages