This is a report on the use of policy gradient methods in Reinforcement Learning. Its focus is an experimental comparison of three types of Actor-Critic methods, and the investigation mainly revolves around the effect of eligibility traces on Actor-Critic methods. The three methods, whose update rules are sketched after the list below, are:
- Actor-Critic with Eligibility Traces on both the Actor and the Critic
- Actor-Critic with Eligibility Traces on the Critic only, but not on the Actor
- Actor-Critic without any Eligibility Traces, using one-step returns
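For reference, a standard formulation of the per-step updates for Actor-Critic with eligibility traces (in the style of Sutton and Barto) uses separate trace parameters $\lambda^{\mathbf{w}}$ for the critic and $\lambda^{\boldsymbol{\theta}}$ for the actor; setting a trace parameter to zero recovers the corresponding one-step update:

$$
\begin{aligned}
\delta &\leftarrow R + \gamma\,\hat{v}(S',\mathbf{w}) - \hat{v}(S,\mathbf{w}) \\
\mathbf{z}^{\mathbf{w}} &\leftarrow \gamma\,\lambda^{\mathbf{w}}\,\mathbf{z}^{\mathbf{w}} + \nabla \hat{v}(S,\mathbf{w}) \\
\mathbf{z}^{\boldsymbol{\theta}} &\leftarrow \gamma\,\lambda^{\boldsymbol{\theta}}\,\mathbf{z}^{\boldsymbol{\theta}} + \nabla \ln \pi(A \mid S,\boldsymbol{\theta}) \\
\mathbf{w} &\leftarrow \mathbf{w} + \alpha^{\mathbf{w}}\,\delta\,\mathbf{z}^{\mathbf{w}} \\
\boldsymbol{\theta} &\leftarrow \boldsymbol{\theta} + \alpha^{\boldsymbol{\theta}}\,\delta\,\mathbf{z}^{\boldsymbol{\theta}}
\end{aligned}
$$

The three variants compared here correspond to using both traces, using only the critic trace ($\lambda^{\boldsymbol{\theta}} = 0$), and using no traces at all ($\lambda^{\mathbf{w}} = \lambda^{\boldsymbol{\theta}} = 0$).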
The experiments are carried out on the FrozenLake environment.
- This entire report was produced using Google Colaboratory, and you can view the IPython notebook here.
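The sketch below illustrates how such a tabular Actor-Critic with eligibility traces can be set up on FrozenLake. It is a minimal illustration rather than the report's actual code: the function name `run_actor_critic`, the hyperparameter values, and the use of the `gymnasium` API with the `FrozenLake-v1` environment ID are assumptions for the example.

```python
import numpy as np
import gymnasium as gym  # assumed API; the original notebook may use classic gym


def softmax(prefs):
    e = np.exp(prefs - prefs.max())
    return e / e.sum()


def run_actor_critic(lambda_actor=0.9, lambda_critic=0.9,
                     alpha_actor=0.1, alpha_critic=0.1,
                     gamma=0.99, episodes=2000, seed=0):
    """Tabular Actor-Critic with (accumulating) eligibility traces on FrozenLake."""
    env = gym.make("FrozenLake-v1")
    n_s, n_a = env.observation_space.n, env.action_space.n
    theta = np.zeros((n_s, n_a))   # actor: action preferences (softmax policy)
    v = np.zeros(n_s)              # critic: state values
    rng = np.random.default_rng(seed)

    for _ in range(episodes):
        s, _ = env.reset()
        z_theta = np.zeros_like(theta)  # actor trace
        z_v = np.zeros_like(v)          # critic trace
        done = False
        while not done:
            pi = softmax(theta[s])
            a = int(rng.choice(n_a, p=pi))
            s_next, r, terminated, truncated, _ = env.step(a)
            done = terminated or truncated

            # TD error; bootstrap only from non-terminal successor states
            target = r + (0.0 if terminated else gamma * v[s_next])
            delta = target - v[s]

            # Critic trace and update (lambda_critic = 0 -> one-step TD)
            z_v *= gamma * lambda_critic
            z_v[s] += 1.0
            v += alpha_critic * delta * z_v

            # Actor trace and update (lambda_actor = 0 -> one-step policy gradient)
            grad_log_pi = -pi.copy()
            grad_log_pi[a] += 1.0       # gradient of log pi(a|s) w.r.t. theta[s]
            z_theta *= gamma * lambda_actor
            z_theta[s] += grad_log_pi
            theta += alpha_actor * delta * z_theta

            s = s_next
    return theta, v
```

Under these assumptions, the three compared configurations would be `run_actor_critic(0.9, 0.9)` (traces on both), `run_actor_critic(0.0, 0.9)` (traces on the critic only), and `run_actor_critic(0.0, 0.0)` (one-step returns, no traces).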