Inspired by the paper "Playing Blackjack with Deep Q-Learning" by Allen Wu, I developed this Reinforcement Learning project for a university exam.
- Create your own virtual environment
- Install the dependencies in requirements.txt
- Select the algorithm and the number of episodes in main.py (see the sketch after this list)
- Run main.py
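For reference, the configuration step in main.py might look something like the sketch below. This is only an illustration: run, ALGORITHM, and N_EPISODES are hypothetical placeholder names, not the actual identifiers in this repository.

```python
# Hypothetical sketch of the configuration step in main.py.
# run(), ALGORITHM and N_EPISODES are placeholder names, not the
# identifiers actually used in this repository.

def run(algorithm: str, n_episodes: int) -> None:
    """Placeholder for the training/evaluation entry point."""
    print(f"Running {algorithm} for {n_episodes} episodes")

if __name__ == "__main__":
    ALGORITHM = "DQN"      # choose the algorithm here
    N_EPISODES = 150_000   # choose the number of episodes here
    run(ALGORITHM, N_EPISODES)
```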
The first experiment was conducted to get an idea of the best policies: since the state space is not extremely large, I evaluated every naive policy. Each naive policy is defined by a (player value, probability of bust) threshold pair.
Example: Policy (4, 50)
If the total value of the player's hand is less than or equal to 4 and the probability of busting is less than or equal to 50%, the player hits; otherwise, the player stays.
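As a minimal sketch of this decision rule (not the repository's actual code; the function and parameter names are illustrative), a threshold policy can be written as:

```python
HIT, STAY = 0, 1  # action encoding used later in this README

def naive_policy(player_value: float, bust_prob: float,
                 value_limit: float = 4, bust_limit: float = 50) -> int:
    """Threshold policy: hit while both the hand value and the
    bust probability (in %) are at or below their limits."""
    if player_value <= value_limit and bust_prob <= bust_limit:
        return HIT
    return STAY

# Policy (4, 50): hand value 3 with a 40% bust chance -> HIT
print(naive_policy(3, 40))  # 0 (HIT)
print(naive_policy(3, 60))  # 1 (STAY)
```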
Below is a table with the winning percentage for each policy; the percentages were calculated over 150,000 episodes.
We can see that the best results are obtained with policies whose player value limit is between 2.5 and 4 and whose bust probability limit is between 15% and 25%.
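For context, a win percentage like those in the table can be estimated by straightforward simulation. The sketch below is only illustrative and is not the repository's evaluation code: it assumes an infinite deck, a dealer who hits until 17, counts pushes as non-wins, and uses hypothetical names throughout.

```python
# Illustrative Monte Carlo estimate of a threshold policy's win rate.
# Assumptions: infinite deck, dealer hits to 17, pushes count as non-wins.
import random

CARD_VALUES = [2, 3, 4, 5, 6, 7, 8, 9, 10, 10, 10, 10, 11]  # 11 = ace

def hand_value(cards):
    """Best hand value, counting aces as 1 when 11 would bust."""
    total, aces = sum(cards), cards.count(11)
    while total > 21 and aces:
        total -= 10
        aces -= 1
    return total

def bust_probability(cards):
    """Probability (in %) that drawing one more card busts the hand."""
    busts = sum(hand_value(cards + [c]) > 21 for c in CARD_VALUES)
    return 100 * busts / len(CARD_VALUES)

def play_episode(policy):
    """One simplified blackjack hand; returns 1 for a player win, else 0."""
    player = [random.choice(CARD_VALUES) for _ in range(2)]
    dealer = [random.choice(CARD_VALUES) for _ in range(2)]
    # Player follows the threshold policy (0 = HIT, 1 = STAY).
    while policy(hand_value(player), bust_probability(player)) == 0:
        player.append(random.choice(CARD_VALUES))
        if hand_value(player) > 21:
            return 0  # player busts
    # Dealer hits until reaching at least 17.
    while hand_value(dealer) < 17:
        dealer.append(random.choice(CARD_VALUES))
    dealer_total = hand_value(dealer)
    return 1 if dealer_total > 21 or hand_value(player) > dealer_total else 0

def win_percentage(policy, n_episodes=150_000):
    wins = sum(play_episode(policy) for _ in range(n_episodes))
    return 100 * wins / n_episodes

# Illustrative threshold pair; the table's policies would be swept the same way.
policy = lambda value, bust: 0 if value <= 4 and bust <= 50 else 1
print(f"{win_percentage(policy, 10_000):.1f}% wins")
```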
In the following examples, 0 stands for "HIT" and 1 stands for "STAY".