Deep Reinforcement Learning ODS '23

Homework structure:
  1. Cross-Entropy Method
  2. Cross-Entropy Method
  3. Policy Iteration & Value Iteration
  4. Q-Learning
  5. Deep Q-Network
  6. Proximal Policy Optimization

41st place on the Leaderboard