Implementations of basic concepts that fall under the reinforcement learning umbrella. This project is a collection of assignments from CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay.
reinforcement-learning
linear-programming
thompson-sampling
epsilon-greedy
ucb
policy-evaluation
mdps
multi-armed-bandits
policy-iteration
randomised-algorithms
reinforcement-learning-excercises
kl-divergence
markovian-epidemic-processes
reinforcement-learning-analysis
multiarm-bandit
ucb1
howards-pi
batch-switching
randomized-policy-iteration
Updated May 21, 2018 - Python