diff --git a/README.md b/README.md index 219c230..17378eb 100644 --- a/README.md +++ b/README.md @@ -6,5 +6,8 @@ Implementation of the several bandits algorithms for exploration discussed in th ## Policy Iteration An answer to exercise 4.7 from the book along with the recreation of example 4.2 +## Value Iteration +Recreated example 4.3 and answered question 4.9 + ### Project Goal Reinforce the learning from the book by demonstrating learned skills with exercises from Reinforcement Learning Book by Sutton and Barto and my own exercises.