Skip to content

A TensorFlow-based framework for learning about and experimenting with reinforcement learning algorithms

License

Notifications You must be signed in to change notification settings

tpbarron/rlflow

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

74 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

RLFlow

Documentation Status

A framework for learning about and experimenting with reinforcement learning algorithms. It is built on top of TensorFlow and interfaces with OpenAI gym (universe should work, too). It aims to be as modular as possible so that new algorithms and ideas can easily be tested. I started it to gain a better understanding of core RL algorithms and maybe it can be useful for others as well.

Features

Algorithms (future algorithms italicized):

  • MDP algorithms

    • Value iteration
    • Policy iteration
  • Temporal Difference Learning

    • SARSA
    • Deep Q-Learning
    • Policy gradient Q-learning
  • Gradient algorithms

    • Vanilla policy gradient
    • Deterministic policy gradient
    • Natural policy gradient
  • Gradient-Free algorithms

    • Cross entropy method

Function approximators (defined by TFLearn model):

  • Linear
  • Neural network
  • RBF

Works with any OpenAI gym environment.

Future Enhancements

  • Improved TensorBoard logging
  • Improved model snapshotting to include exploration states, memories, etc.
  • Any suggestions?

Fixes

  • Errors / warnings on TensorFlow session save

License

About

A TensorFlow-based framework for learning about and experimenting with reinforcement learning algorithms

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages