Notes and scripts for SC2LE, released by DeepMind and Blizzard; more details here.
Python library for SC2 API Protocol
Siraj's YouTube tutorial and accompanying code
Steven's Medium articles for a simple scripted agent and one based on Q-tables
pekaalto's work on adapting OpenAI's gym environment to SC2LE and an implementation of the FullyConv algorithm plus results on three minigames
Arthur Juliani's posts and repo for RL agents
Not SC2LE but mentioned here because my agent script was built on Juliani's A3C implementation.
Let me know if anyone else is also working on this and I'll add a link here!
Contains general notes on working with SC2LE.
The entire unfiltered action space for an SC2LE agent.
It contains 524 base actions / functions, with 101,938,719 possible actions given a minimap_resolution of (64, 64) and screen_resolution of (84, 84).
The entire list of action argument types for use in the actions / functions.
It contains 13 argument types with descriptions.
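Both lists can also be regenerated directly from pysc2. The snippet below is a minimal sketch, assuming the attribute names in pysc2's actions module (FUNCTIONS, TYPES, Move_screen), which may differ slightly across versions:

```python
from pysc2.lib import actions

print(len(actions.FUNCTIONS))  # 524 base actions / functions
print(len(actions.TYPES))      # 13 argument types

# Each function lists the argument types it expects,
# e.g. Move_screen takes a queued flag and a screen coordinate.
fn = actions.FUNCTIONS.Move_screen
print(fn.id, fn.name, [arg.name for arg in fn.args])
```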
Notes on running an agent in the pysc2.env.sc2_env.SC2Env environment, in particular details and brief descriptions of the TimeStep object (observation) that is fed to an agent's step function or returned from calling the environment's step function.
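As a quick illustration (not taken from these notes), the sketch below steps SC2Env with a no-op action and prints the TimeStep fields. The constructor arguments and observation keys follow the pysc2 2.x API and are assumptions here; older releases used e.g. screen_size_px / minimap_size_px and different observation key names:

```python
from pysc2.env import sc2_env
from pysc2.lib import actions, features

with sc2_env.SC2Env(
        map_name="DefeatRoaches",
        players=[sc2_env.Agent(sc2_env.Race.terran)],
        agent_interface_format=features.AgentInterfaceFormat(
            feature_dimensions=features.Dimensions(screen=84, minimap=64)),
        step_mul=8) as env:
    timesteps = env.reset()
    while not timesteps[0].last():
        ts = timesteps[0]
        # TimeStep is a namedtuple: (step_type, reward, discount, observation)
        print(ts.step_type, ts.reward, ts.discount)
        print(ts.observation["feature_screen"].shape)  # (n_channels, 84, 84)
        # Take a no-op each step; a real agent would pick from
        # ts.observation["available_actions"] and supply the required arguments.
        timesteps = env.step(
            [actions.FunctionCall(actions.FUNCTIONS.no_op.id, [])])
```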
Contains notes on developing RL agents for SC2LE.
Contains scripts for training and running RL agents in SC2LE.
This script implements the A3C algorithm with the Atari-net architecture described in DeepMind's paper, for SC2LE. The code is based on Arthur Juliani's A3C implementation for the VizDoom environment (see above).
This is a generalized version of PySC2_A3C_old.py that works for all minigames and also contains some bug fixes.
To run the script, use the following command:
python PySC2_A3C_AtariNet.py --map_name CollectMineralShards
If --map_name is not supplied, the script runs DefeatRoaches by default.
This is an initial script that only works for the DefeatRoaches minigame. Check out PySC2_A3C_AtariNet.py for the latest agent that runs on all minigames.
I initially focused on the DefeatRoaches minigame, so the state space only takes in 7 screen features and 3 nonspatial features, and the action space is limited to 17 base actions and their relevant arguments.
For the action space, I modeled the base actions and arguments independently. In addition, I also modeled the x and y coordinates independently for spatial arguments, to further reduce the effective action space.
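To illustrate the factorization, here is a simplified TensorFlow 1.x sketch of an Atari-net-style network with independent heads for the base action and the x / y coordinates of a spatial argument. It is not the script's exact code; layer sizes other than those stated above are placeholders:

```python
import tensorflow as tf

screen_size, n_screen_features = 84, 7
n_nonspatial, n_base_actions = 3, 17

screen = tf.placeholder(tf.float32, [None, screen_size, screen_size, n_screen_features])
nonspatial = tf.placeholder(tf.float32, [None, n_nonspatial])

# Atari-net-style convolutional tower over the screen features
conv1 = tf.layers.conv2d(screen, 16, 8, strides=4, activation=tf.nn.relu)
conv2 = tf.layers.conv2d(conv1, 32, 4, strides=2, activation=tf.nn.relu)
flat = tf.layers.flatten(conv2)

# Nonspatial features pass through a tanh layer, then everything is concatenated
nonspatial_fc = tf.layers.dense(nonspatial, 32, activation=tf.nn.tanh)
latent = tf.layers.dense(tf.concat([flat, nonspatial_fc], axis=1), 256,
                         activation=tf.nn.relu)

# Independent heads: base action, x coordinate, y coordinate, and the value estimate,
# so a spatial argument costs 84 + 84 outputs instead of 84 * 84.
pi_action = tf.layers.dense(latent, n_base_actions, activation=tf.nn.softmax)
pi_x = tf.layers.dense(latent, screen_size, activation=tf.nn.softmax)
pi_y = tf.layers.dense(latent, screen_size, activation=tf.nn.softmax)
value = tf.layers.dense(latent, 1)
```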
The agent currently samples from the distributions returned by the policy networks to choose actions, instead of following an epsilon-greedy policy.
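For example, the base action can be drawn from the policy's probability vector rather than taken as an argmax. This is an illustrative sketch only; whether and how the scripts mask unavailable actions before sampling is an assumption here:

```python
import numpy as np

def sample_base_action(pi_action, available_actions):
    # Zero out unavailable actions, renormalize, then sample.
    probs = np.zeros_like(pi_action)
    probs[available_actions] = pi_action[available_actions]
    probs /= probs.sum()
    return np.random.choice(len(pi_action), p=probs)
```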
Also, the policy networks for the arguments are updated regardless of whether the argument was used (e.g. even if a no_op action is taken, the argument policies are still updated), which should probably be corrected.
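One possible fix, sketched below as a hypothetical change rather than what the script currently does, is to mask each argument's policy-gradient term so it only contributes at timesteps where the chosen base action actually uses that argument:

```python
import tensorflow as tf

# Hypothetical per-timestep inputs for one argument head:
# log-probability of the chosen argument value, a 0/1 mask saying whether the
# chosen base action uses this argument, and the A3C advantage estimate.
arg_log_prob = tf.placeholder(tf.float32, [None])
used_mask = tf.placeholder(tf.float32, [None])
advantage = tf.placeholder(tf.float32, [None])

# Timesteps where the argument was unused contribute zero gradient.
arg_policy_loss = -tf.reduce_sum(used_mask * arg_log_prob * tf.stop_gradient(advantage))
```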
Will be updating this to work with all the minigames.
After 50 million steps on DefeatRoaches, the agent achieved max and average scores of 338 and 65, compared to DeepMind's Atari-net agent, which achieved max and average scores of 351 and 101 after 600 million steps.