Meta reinforcement learning model of context switching effects in human memory


andrebeu/mrl


Meta reinforcement learning on a bandit task.

A simple demo implementation of meta-RL in PyTorch.

Repo materials

  • utils.py defines the agent and task environment
  • main.ipynb contains a simple training loop and an evaluation of the trained agent

Task

  • Three variants of meta-learning on bandits are implemented:
    • easy: the bandit switches only between episodes, i.e. within a given episode the bandit with the highest reward probability is fixed
    • medium: the bandit switches once, at a given step of the episode
    • difficult: the bandit switches within an episode with probability p at each step
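
The three variants can be sketched with a small environment. This is a minimal illustration, not the code in utils.py: the class name `SwitchingBandit`, its parameters (`p_high`, `switch_step`, `p_switch`), the two-armed setup, and the Bernoulli rewards are all assumptions made for the example.

```python
import random


class SwitchingBandit:
    """Two-armed Bernoulli bandit whose best arm can switch (hypothetical sketch)."""

    def __init__(self, variant="easy", p_high=0.9, switch_step=50,
                 p_switch=0.05, seed=None):
        assert variant in ("easy", "medium", "difficult")
        self.variant = variant
        self.p_high = p_high            # reward probability of the best arm (assumed)
        self.switch_step = switch_step  # step of the single switch ("medium")
        self.p_switch = p_switch        # per-step switch probability ("difficult")
        self.rng = random.Random(seed)
        self.reset()

    def reset(self):
        # Each episode draws a fresh best arm; in "easy" this is the only switch.
        self.best_arm = self.rng.randrange(2)
        self.t = 0
        return self.best_arm

    def step(self, action):
        # Decide whether the best arm switches before this pull.
        if self.variant == "medium" and self.t == self.switch_step:
            self.best_arm = 1 - self.best_arm
        elif self.variant == "difficult" and self.rng.random() < self.p_switch:
            self.best_arm = 1 - self.best_arm
        # The best arm pays with probability p_high, the other with 1 - p_high.
        p_reward = self.p_high if action == self.best_arm else 1.0 - self.p_high
        reward = 1 if self.rng.random() < p_reward else 0
        self.t += 1
        return reward
```

Calling `reset()` between episodes is what lets a recurrent agent meta-learn: the best arm is not known in advance and has to be inferred from rewards within the episode.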

Agent

  • policy gradient method
    • the target can be the REINFORCE return (Monte Carlo) or the actor-critic target (TD)
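
The difference between the two targets can be shown on a single episode. The sketch below is an illustration under assumptions, not the repo's implementation: the function names `mc_returns` and `td_targets`, the discount `gamma`, and the choice of a one-step TD target with zero value after the last step are all hypothetical.

```python
def mc_returns(rewards, gamma=0.9):
    """Discounted Monte Carlo returns G_t = r_t + gamma * G_{t+1} (REINFORCE target)."""
    returns, g = [], 0.0
    for r in reversed(rewards):
        g = r + gamma * g
        returns.append(g)
    return returns[::-1]


def td_targets(rewards, values, gamma=0.9):
    """One-step TD targets r_t + gamma * V(s_{t+1}), taking V = 0 after the last step."""
    next_values = list(values[1:]) + [0.0]
    return [r + gamma * v for r, v in zip(rewards, next_values)]
```

The Monte Carlo target uses only observed rewards, so it is unbiased but has high variance. The TD target bootstraps from the critic's value estimates, which lowers variance at the cost of bias while the critic is still inaccurate.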

Optimizations

  • feedforward layers are computed in parallel by folding the time dimension into the batch dimension
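
A minimal sketch of the folding trick, assuming hidden states of shape (time, batch, hidden) have already been collected from the recurrent layer. The sizes and the name `policy_head` are illustrative and are not taken from utils.py.

```python
import torch
import torch.nn as nn

T, B, H, A = 5, 3, 8, 2  # time steps, batch size, hidden size, actions (illustrative)
policy_head = nn.Linear(H, A)

# Stand-in for recurrent hidden states collected over an episode: (T, B, H).
hidden = torch.randn(T, B, H)

# Fold time into batch: one pass over T*B rows instead of a Python loop over T.
logits_folded = policy_head(hidden.reshape(T * B, H)).reshape(T, B, A)

# Reference: apply the same layer one time step at a time.
logits_loop = torch.stack([policy_head(hidden[t]) for t in range(T)])
```

Both paths give the same logits, because a feedforward layer treats every row independently. `nn.Linear` also accepts extra leading dimensions directly, so the explicit reshape matters mostly for layers that expect 2D input.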

Resources

Wang et al., 2018 paper
