Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question: Did anyone solve MiniGrid-DoorKey-8x8-v0? #53

Open
ErikVester opened this issue Dec 31, 2020 · 1 comment
Open

Question: Did anyone solve MiniGrid-DoorKey-8x8-v0? #53

ErikVester opened this issue Dec 31, 2020 · 1 comment

Comments

@ErikVester
Copy link

Hi all,

Did anyone solve the MiniGrid-DoorKey-8x8-v0 environment with the PPO algorithm and if so, with which hyperparameters, environment steps and for how many frames did you run this?

Thanks! :)

Kind regards,

Erik

@rgobbel
Copy link

rgobbel commented Oct 4, 2022

I changed the default value of max_steps from 10 x size x size (in doorkey.py) to 100 x size x size, and it works fine. I also increased the number of frames to 800000, but it's obvious that it's learning long before it gets that far. The problem with the default settings is that with a room that large, most of the time it never makes it to the goal, which means there's no reward, so nothing is learned. The main thing is to let it keep trying for long enough in a single episode that it gets rewarded frequently enough to learn.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants