Wrapping universe #6

justheuristic · 2017-01-03T01:14:29Z

By default, Universe provides low-level keyboard control with a very high-dimensional action space, and its observations are high-dimensional images plus occasional extra information.

While tackling the environment as it is would be a noble quest, we had better first try learning something in a more RL-friendly setup:

smaller-resolution image
discrete action space: wrap only the essential actions like "turn car left/right", not "move cursor to that random location"
repeat the same action for N (e.g. 4) timesteps to make effective sessions shorter

So we need to create a wrapper environment that takes an OpenAI Universe game and alters its reset() and step() methods to make the environment easier for an agent to master.
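To make the idea concrete, here is a rough sketch of such a wrapper. It is not the implementation from this repo: the class name `SimplifiedEnv`, its parameters, and the `DummyEnv` stand-in are all made up for illustration. It assumes the wrapped environment has the classic gym-style interface (reset() returns a frame, step(action) returns frame, reward, done, info) and that frames are nested lists; a real Universe environment would need extra handling on top of this.

```python
class SimplifiedEnv:
    """Hypothetical wrapper: discrete actions, action repeat, smaller frames."""

    def __init__(self, env, actions, frame_skip=4, downscale=2):
        self.env = env
        self.actions = list(actions)  # discrete index -> low-level action
        self.frame_skip = frame_skip  # assumed to be >= 1
        self.downscale = downscale

    def _shrink(self, frame):
        # Keep every k-th row and column: crude, but dependency-free.
        k = self.downscale
        return [row[::k] for row in frame[::k]]

    def reset(self):
        return self._shrink(self.env.reset())

    def step(self, action_index):
        low_level = self.actions[action_index]
        total_reward = 0.0
        # Repeat the chosen action, summing rewards, until the episode ends.
        for _ in range(self.frame_skip):
            frame, reward, done, info = self.env.step(low_level)
            total_reward += reward
            if done:
                break
        return self._shrink(frame), total_reward, done, info


class DummyEnv:
    """Stand-in game: 4x4 frames, reward 1 per step, ends after 10 steps."""

    def __init__(self):
        self.t = 0

    def reset(self):
        self.t = 0
        return [[0] * 4 for _ in range(4)]

    def step(self, action):
        self.t += 1
        frame = [[self.t] * 4 for _ in range(4)]
        return frame, 1.0, self.t >= 10, {"action": action}


env = SimplifiedEnv(DummyEnv(), actions=["left", "right"], frame_skip=4)
first_obs = env.reset()
obs, reward, done, info = env.step(0)
```

The agent only ever sees an index into `actions`, so the set of essential actions can be changed without touching the agent code.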

Here's one game that seems simple enough for this; I suggest sticking to it unless we bump into some insurmountable obstacle.

One example of altering an environment this way can be found here

justheuristic · 2017-01-09T20:55:56Z

This is probably achievable through some of the wrappers
https://github.com/openai/universe-starter-agent/blob/master/envs.py#L9

More specifically, something like this
https://github.com/openai/universe-starter-agent/blob/master/envs.py#L211
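For reference, one way a discrete action set over a few keys could be built is sketched below. This is only an illustration, not a copy of the linked code: the helper name `build_key_actions` is hypothetical, and it assumes key actions are lists of ("KeyEvent", key_name, pressed) tuples, as in the Universe README examples.

```python
from itertools import product


def build_key_actions(keys):
    """Return one event list per pressed/released combination of `keys`."""
    table = []
    for pressed in product([False, True], repeat=len(keys)):
        # Every action states the full keyboard state for the chosen keys,
        # so keys held by a previous action get released explicitly.
        table.append([("KeyEvent", key, down) for key, down in zip(keys, pressed)])
    return table


actions = build_key_actions(["ArrowLeft", "ArrowRight"])
```

With two keys this gives four discrete actions (nothing pressed, right only, left only, both), and the agent just picks an index into `actions`.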

justheuristic · 2017-01-18T00:33:31Z

Made the most basic wrapper, which just spawns one container per player: #13
It's probably possible to play much faster with some Universe tricks; that's now a priority.

justheuristic added this to the Less ugly version 2.0 milestone Jan 3, 2017

justheuristic added the help wanted label Jan 6, 2017

justheuristic assigned feygina Jan 9, 2017
