Skip to content

Latest commit

 

History

History
11 lines (6 loc) · 1.27 KB

README.md

File metadata and controls

11 lines (6 loc) · 1.27 KB

Deep reinforcement learning applied to mario kart

This project was developed by afcruzs and 0xDCA as the final project of the machine learning course at Universidad nacional de colombia.

The agent was trained using vainilla DQN adding the notion of 'check points', when the agent gets stucked for a certain amount of time (achieving a very bad reward) the training is restored to the last checkpoint, this prevents local minima and allows the agent to improve over time. Read more information in our technical report (spanish) here.

The model is executed/trained into a tiny flask server which is called from the Bizhawk emulator executing mario_kart.lua in the lua interpreter.

After roughly two weeks of training the agent was able to learn how to drive in the peach circuit and achieves acceptable performance. This is an example of the model running:

Demo