-
Notifications
You must be signed in to change notification settings - Fork 11
Home
Chua Cheow Huan edited this page Sep 30, 2019
·
6 revisions
Welcome to the reinforcement_learning wiki!
This repository contains codes that I have reproduced (while learning RL) for various reinforcement learning algorithms. The codes were tested on Colab.
If Github is not loading the Jupyter notebooks, a known Github issue, click here to view the notebooks on Jupyter's nbviewer.
Algorithms | Discrete | Continuous | Multithreaded | Multiprocessing | Tested on |
---|---|---|---|---|---|
DQN | ✔️ | CartPole-v0 | |||
Double DQN (DDQN) | ✔️ | CartPole-v0 | |||
Dueling DDQN | ✔️ | CartPole-v0 | |||
Dueling DDQN + PER | ✔️ | CartPole-v0 | |||
A3C (1) | ✔️ | ✔️ | ✔️ | ✔️(3) | CartPole-v0, Pendulum-v0 |
DPPO (2) | ✔️ | ✔️(3) | Pendulum-v0 | ||
RND + PPO | ✔️ | MountainCarContinuous-v0 |
(1): N-step returns used for critic's target.
(1): GAE used for computation of TD lambda return (for critic's target) & policy's advantage.
(3): Distributed Tensorflow & Python's multiprocessing package used.
The misc folder contains related example codes that I have put together while learning RL. See the README.md in the misc folder for more details.
Check out my blog for more information on my repositories.