Home

Welcome to the reinforcement_learning wiki!

What's in this repository?

This repository contains codes that I have reproduced (while learning RL) for various reinforcement learning algorithms. The codes were tested on Colab.

If Github is not loading the Jupyter notebooks, a known Github issue, click here to view the notebooks on Jupyter's nbviewer.

Implemented Algorithms

Algorithms	Discrete	Continuous	Multithreaded	Multiprocessing	Tested on
DQN	✔️				CartPole-v0
Double DQN (DDQN)	✔️				CartPole-v0
Dueling DDQN	✔️				CartPole-v0
Dueling DDQN + PER	✔️				CartPole-v0
A3C ⁽¹⁾	✔️	✔️	✔️	✔️⁽³⁾	CartPole-v0, Pendulum-v0
DPPO ⁽²⁾		✔️		✔️⁽³⁾	Pendulum-v0
RND + PPO		✔️			MountainCarContinuous-v0

^{^{(1): N-step returns used for critic's target.}}
^{^{(1): GAE used for computation of TD lambda return (for critic's target) & policy's advantage.}}
^{^{(3): Distributed Tensorflow & Python's multiprocessing package used.}}

misc folder

The misc folder contains related example codes that I have put together while learning RL. See the README.md in the misc folder for more details.

Blog

Check out my blog for more information on my repositories.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Home

What's in this repository?

Implemented Algorithms

misc folder

Blog

Clone this wiki locally