Soft-Actor-Critic-Pytorch

[Paper] Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Tuomas Haarnoja, Aurick Zhou, Pieter Abbeel, Sergey Levine

Prerequisites

It is good practice to create a new virtual environment and install the dependencies there, to prevent conflicts with existing packages.
[Installing and creating virtual envs]

Python 3.6 or higher
NumPy
PyTorch 1.0+
Gym 0.17.2+
Matplotlib 3.1.1+
IPython 7.8.0+
Pillow 6.2.0+
Or install everything from requirements.txt with pip install -r requirements.txt

Note:

requirements.txt contains some packages that may not be used in this project.
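
To confirm that an environment meets the minimum versions listed above, a small check like the following can help. This is a minimal sketch, not part of the repo: the names `MINIMUMS`, `parse_version`, `meets_minimum`, and `report` are hypothetical, the distribution names are assumed to match what pip installs, and `importlib.metadata` needs Python 3.8 or newer. NumPy is left out because no minimum version is given.

```python
import re

# Minimum versions copied from the prerequisites list.
# Keys are assumed to be the pip distribution names.
MINIMUMS = {
    "torch": "1.0",
    "gym": "0.17.2",
    "matplotlib": "3.1.1",
    "ipython": "7.8.0",
    "Pillow": "6.2.0",
}


def parse_version(text):
    """Turn '1.13.1+cu117' into (1, 13, 1); non-numeric suffixes are ignored."""
    public = text.split("+")[0]
    parts = []
    for piece in public.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)


def meets_minimum(installed, minimum):
    """Compare two version strings numerically, padding the shorter with zeros."""
    a, b = parse_version(installed), parse_version(minimum)
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


def report():
    """Return {package: 'ok' | 'too old' | 'missing'} for the current environment."""
    # importlib.metadata is in the standard library from Python 3.8 onward.
    from importlib.metadata import PackageNotFoundError, version

    results = {}
    for name, minimum in MINIMUMS.items():
        try:
            installed = version(name)
        except PackageNotFoundError:
            results[name] = "missing"
            continue
        results[name] = "ok" if meets_minimum(installed, minimum) else "too old"
    return results


if __name__ == "__main__":
    for name, status in report().items():
        print(name, status)
```

Run it inside the virtual environment after installing the dependencies; any line that does not end in `ok` points at a package to install or upgrade.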

Getting Started

Install PyTorch
Install torchvision from source

git clone https://github.com/pytorch/vision
cd vision
python setup.py install

Clone this repo (Fork and clone if necessary)

git clone https://github.com/rtharungowda/Soft-Actor-Critic-Pytorch.git
cd Soft-Actor-Critic-Pytorch/

Training

To train the models for 40,000 iterations:

cd pendulumn
python train.py

Iterations midway
All iterations
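
For readers new to the maximum entropy idea in the paper's title, here is a rough sketch of the entropy-regularized target a critic is trained toward. It is an illustration under assumptions, not the code in train.py: the function name `soft_q_target`, the default `gamma` and `alpha` values, and the twin-critic minimum are choices made for this example, and the repo may organize the update differently (for instance with a separate state-value network).

```python
def soft_q_target(reward, done, next_q1, next_q2, next_log_prob, gamma=0.99, alpha=0.2):
    """Entropy-regularized Bellman target for one transition (scalars only).

    next_q1, next_q2: two critic estimates for the next state and sampled action.
    next_log_prob:    log-probability of that sampled action under the policy.
    alpha:            entropy temperature; a larger value rewards more randomness.
    """
    # Taking the smaller critic estimate is a common guard against overestimation.
    # Subtracting alpha * log_prob adds an entropy bonus to the value.
    soft_value = min(next_q1, next_q2) - alpha * next_log_prob
    # No bootstrapping from the next state once the episode has ended.
    not_done = 0.0 if done else 1.0
    return reward + gamma * not_done * soft_value


target = soft_q_target(reward=1.0, done=False, next_q1=2.0, next_q2=3.0, next_log_prob=-1.0)
print(target)
```

With `alpha=0` this reduces to an ordinary Q-learning style target, which is one way to see what the entropy term contributes.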

Load the pre-trained model from checkpoint.pth
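
The exact contents of checkpoint.pth are not documented here, so the following is only a sketch of the usual PyTorch save and load round trip. The dictionary keys `policy_state_dict` and `iteration`, the helper names, and the tiny `nn.Linear` stand-in for the policy network are all assumptions for illustration; check train.py for the real layout before loading the shipped file.

```python
import os
import tempfile

import torch
import torch.nn as nn

# Hypothetical stand-in for the policy network (3 inputs, 1 output).
# The real architecture is defined in the repo.
policy = nn.Linear(3, 1)


def save_checkpoint(path, model, iteration):
    """Store the model weights together with the training iteration."""
    torch.save({"policy_state_dict": model.state_dict(), "iteration": iteration}, path)


def load_checkpoint(path, model):
    """Restore weights into `model` and return the stored iteration."""
    # map_location="cpu" lets a checkpoint saved on a GPU load on a CPU-only machine.
    checkpoint = torch.load(path, map_location="cpu")
    model.load_state_dict(checkpoint["policy_state_dict"])
    model.eval()  # switch to inference mode for testing or visualization
    return checkpoint["iteration"]


# Round trip through a temporary file so nothing in the repo is overwritten.
path = os.path.join(tempfile.mkdtemp(), "checkpoint.pth")
save_checkpoint(path, policy, iteration=40000)

restored = nn.Linear(3, 1)
iteration = load_checkpoint(path, restored)
print("restored iteration:", iteration)
```

The model passed to `load_checkpoint` must have the same layer shapes as the one that was saved, otherwise `load_state_dict` raises an error.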

Visualizing a trained model run (this also acts as a test)

Create a gif of the test run

cd pendulumn
python visualization.py
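
For reference, an animated GIF can be assembled from a list of frames with Pillow roughly as shown below. This is a sketch under assumptions rather than the contents of visualization.py: `save_gif` is a hypothetical helper, and the solid-colour frames are placeholders for rendered environment frames.

```python
import os
import tempfile

from PIL import Image


def save_gif(frames, path, ms_per_frame=50):
    """Write a list of PIL images to an animated GIF that loops forever."""
    first, rest = frames[0], frames[1:]
    first.save(path, save_all=True, append_images=rest, duration=ms_per_frame, loop=0)


# Placeholder frames: solid colours that change from frame to frame.
# In a real run each frame would be an image of the environment, for example
# built with Image.fromarray(...) from an RGB array (an assumption about how
# frames are captured, not something this README states).
frames = [Image.new("RGB", (64, 64), (40 * i, 0, 255 - 40 * i)) for i in range(5)]

gif_path = os.path.join(tempfile.mkdtemp(), "animation.gif")
save_gif(frames, gif_path)
print("wrote", gif_path)
```

`duration` is the display time per frame in milliseconds, and `loop=0` makes the animation repeat indefinitely.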
