Gym examples don't represent ideal solutions #4

Open
NivenT opened this issue Aug 3, 2017 · 0 comments

NivenT (Owner) commented Aug 3, 2017

First off, ideally there would be an example for every training algorithm that gets added. For now, the best way I've found to set these up is to use OpenAI's gym as a source of environments.

Unfortunately, this library only offers tables and linear approximations as possible value functions. These are not always a good fit for gym environments, so a working example depends on choosing features and hyperparameters that let the agent learn well. Doubly unfortunately, I'm still new to RL, so the current solutions are not ideal and can likely be improved.
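To make the constraint concrete, here is a minimal Python sketch of a linear action-value function over hand-chosen features. It is not this library's API: `RBFFeatures`, `LinearQ`, and every parameter name are hypothetical, the radial basis features are just one possible feature choice, and the update is the standard semi-gradient Q-learning rule. The bounds in the usage lines are those of gym's MountainCar-v0 observation space. The point is how much rides on the feature map (`bins_per_dim`, `width`) and the step size (`alpha`).

```python
import numpy as np


class RBFFeatures:
    """Normalized radial basis features on a uniform grid (hypothetical helper)."""

    def __init__(self, low, high, bins_per_dim=5, width=0.2):
        self.low = np.asarray(low, dtype=float)
        self.span = np.asarray(high, dtype=float) - self.low
        dims = len(self.low)
        # Grid of centers in the unit cube, one row per center.
        axes = [np.linspace(0.0, 1.0, bins_per_dim)] * dims
        grid = np.meshgrid(*axes, indexing="ij")
        self.centers = np.stack(grid, axis=-1).reshape(-1, dims)
        self.width = width
        self.n_features = self.centers.shape[0]

    def __call__(self, state):
        # Rescale the state to [0, 1] per dimension so one width fits all axes.
        z = (np.asarray(state, dtype=float) - self.low) / self.span
        d2 = np.sum((self.centers - z) ** 2, axis=1)
        phi = np.exp(-d2 / (2.0 * self.width ** 2))
        # Normalizing keeps ||phi||^2 <= 1, so any alpha in (0, 2) is stable.
        return phi / phi.sum()


class LinearQ:
    """Q(s, a) = w[a] . phi(s), trained with semi-gradient Q-learning."""

    def __init__(self, features, n_actions, alpha=0.5, gamma=0.99):
        self.features = features
        self.w = np.zeros((n_actions, features.n_features))
        self.alpha = alpha
        self.gamma = gamma

    def value(self, state, action):
        return float(self.w[action] @ self.features(state))

    def greedy_action(self, state):
        return int(np.argmax(self.w @ self.features(state)))

    def update(self, s, a, r, s_next, done):
        phi = self.features(s)
        # Bootstrap from the best next action unless the episode ended.
        bootstrap = 0.0 if done else self.gamma * np.max(self.w @ self.features(s_next))
        td_error = r + bootstrap - self.w[a] @ phi
        # For a linear model the gradient of Q(s, a) w.r.t. w[a] is phi.
        self.w[a] += self.alpha * td_error * phi
        return float(td_error)


# Usage with MountainCar-v0's bounds: position in [-1.2, 0.6], velocity in [-0.07, 0.07].
features = RBFFeatures(low=[-1.2, -0.07], high=[0.6, 0.07])
q = LinearQ(features, n_actions=3)
q.update(s=[-0.5, 0.0], a=2, r=-1.0, s_next=[-0.49, 0.01], done=False)
```

With a table, each state gets its own entry; here, nearby states share weights through `phi`, so a grid that is too coarse or a `width` that is too large can blur together states that need different actions.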

For some context, you can

  • compare the performance of the cartpole agent before and after this commit: 47e8d16
  • see how well the mountain car agent did before it was removed in this commit: d3beac9

To sum it up, the issue here is that the example solutions should be designed more carefully.
