RL-MADDPG-navigation-gird-world

An implementation of MADDPG on grid world navigation use multiprocess, and there is worker for sampling, learner for learning policy, and evaluator for evaluate policy at a fixed interval.

├── Env          # 
|   ├── env.py     # base env gird world
├── Core          training core# 
|   ├── HER.py     # Hindsight experience replay, actually not use
|   ├── actor.py     # actor node for sampling data
|   ├── evaluator.py     # evaluate policy node
|   ├── learner.py     # learning process node
|   ├── logger.py     # record necessary info
|   ├── model.py     # define NN model
|   ├── normalizer.py     # normalize sampling data
|   ├── util.py     # some utils
├── arguments                       #train arguments
├── collection_experiments.py       # simple visualize, may use for collecting samples, but not use till now.
├── origin_obstacle_states.txt      # make obstacle same at all evironment, for simplifing problem
├── plot.py                         #plot
├── rollout_test.py                 # rollout saved policy
├── train.py                         #training entrance

python train.py # modify args in arguments.py

The results:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RL-MADDPG-navigation-gird-world

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
Env		Env
core		core
picture		picture
saved_models_1		saved_models_1
README.md		README.md
arguments.py		arguments.py
collection_experiments.py		collection_experiments.py
origin_obstacle_states.txt		origin_obstacle_states.txt
plot.py		plot.py
rollout_test.py		rollout_test.py
train.py		train.py

PiggyCh/RL-MADDPG-navigation-grid-world

Folders and files

Latest commit

History

Repository files navigation

RL-MADDPG-navigation-gird-world

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages