This repository has been archived by the owner on Nov 4, 2019. It is now read-only.

LSTM/GRU agent #9

Open
justheuristic opened this issue Jan 17, 2017 · 0 comments

justheuristic commented Jan 17, 2017

The current A3C feedforward policy uses only a 4-frame history to predict the policy, so it may fail when the environment remains partially observable beyond that window.
It would probably be nice to implement a policy with GRU/LSTM hidden units [instead of the frame window or alongside it].
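
To make the idea concrete, here is a rough sketch of a single GRU step feeding policy and value heads, written in plain NumPy so it does not depend on any particular framework. Everything in it is an assumption for illustration: the names `init_params` and `gru_policy_step`, the layer sizes, and the idea that `features` is the flattened output of whatever convolutional encoder the agent already has. It only shows the forward pass, not the A3C training loop.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def softmax(x):
    # Subtract the max for numerical stability.
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def init_params(rng, n_features, n_hidden, n_actions, scale=0.1):
    """Hypothetical parameter layout: gates are stacked as [update, reset, candidate]."""
    def w(*shape):
        return rng.normal(0.0, scale, size=shape)
    return {
        "W": w(n_features, 3 * n_hidden),   # input -> gates
        "U": w(n_hidden, 3 * n_hidden),     # previous hidden state -> gates
        "b": np.zeros(3 * n_hidden),
        "W_pi": w(n_hidden, n_actions), "b_pi": np.zeros(n_actions),
        "W_v": w(n_hidden, 1), "b_v": np.zeros(1),
    }

def gru_policy_step(params, features, h_prev):
    """One recurrent step: (features, h_prev) -> (policy, value, h)."""
    n = h_prev.shape[-1]
    xw = features @ params["W"] + params["b"]
    hu = h_prev @ params["U"][:, :2 * n]
    z = sigmoid(xw[..., :n] + hu[..., :n])            # update gate
    r = sigmoid(xw[..., n:2 * n] + hu[..., n:])       # reset gate
    cand = np.tanh(xw[..., 2 * n:] + (r * h_prev) @ params["U"][:, 2 * n:])
    h = (1.0 - z) * h_prev + z * cand                 # new hidden state
    policy = softmax(h @ params["W_pi"] + params["b_pi"])
    value = (h @ params["W_v"] + params["b_v"])[..., 0]
    return policy, value, h

# Roll the cell over a short episode, carrying the hidden state between steps.
rng = np.random.default_rng(0)
params = init_params(rng, n_features=16, n_hidden=8, n_actions=3)
h = np.zeros((1, 8))  # reset to zeros at the start of every episode
for _ in range(5):
    features = rng.normal(size=(1, 16))  # stand-in for encoder output
    policy, value, h = gru_policy_step(params, features, h)
```

The important difference from the windowed policy is that `h` is carried across steps and has to be reset at episode boundaries, so rollout workers need to store it alongside observations.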

A good proving ground would be a game with a limited field of view, such as Doom (DefendCenter or HealthGathering); just make sure that image preprocessing works fine with it.

Bonus kudos for implementing a soft attention mechanism that actually improves results [or at least does not harm them and gives clues about what the agent looks at]. link link
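
For reference, a minimal sketch of what soft attention over a convolutional feature map could look like, again in plain NumPy. This uses additive (tanh-based) scoring conditioned on the previous hidden state, which is one common choice and not necessarily what the linked references do. The function name `soft_attention`, the weight names `W_f`, `W_h`, `v`, and all sizes are hypothetical.

```python
import numpy as np

def soft_attention(feature_map, h_prev, W_f, W_h, v):
    """Additive soft attention over spatial locations.

    feature_map: (H, W, C) conv features, h_prev: (n_hidden,) recurrent state.
    Returns a context vector (C,) and an attention map (H, W) that sums to 1.
    """
    H, W, C = feature_map.shape
    locations = feature_map.reshape(H * W, C)
    # Score every location against the previous hidden state.
    scores = np.tanh(locations @ W_f + h_prev @ W_h) @ v   # (H*W,)
    scores = scores - scores.max()                          # numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum()
    # Weighted average of location features: this is what the policy would see.
    context = weights @ locations
    return context, weights.reshape(H, W)

rng = np.random.default_rng(0)
feature_map = rng.normal(size=(7, 7, 32))   # stand-in for encoder output
h_prev = np.zeros(8)
W_f = rng.normal(0.0, 0.1, size=(32, 16))
W_h = rng.normal(0.0, 0.1, size=(8, 16))
v = rng.normal(0.0, 0.1, size=16)
context, attention_map = soft_attention(feature_map, h_prev, W_f, W_h, v)
```

The returned `attention_map` can be upsampled and overlaid on the input frame, which is the "clues about what the agent looks at" part.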
