Reimplementation of DeepMind's AlphaGo Zero for the game Viergewinnt, i.e. Connect Four (WIP)
WORK IN PROGRESS
- Viergewinnt game logic
- Basic Monte-Carlo tree search
- Reinforcement learning model as MCTS strategy
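
To make the first two items concrete, here is a minimal, self-contained sketch of Connect Four rules plus a basic MCTS with UCB1 selection and random rollouts. It is not the code from this repository: the names `legal_moves`, `play`, `winner`, `Node`, and `mcts`, the list-of-lists board, and the exploration constant are all assumptions made for illustration.

```python
import math
import random

ROWS, COLS = 6, 7  # standard Connect Four board (assumed)

def legal_moves(board):
    """Columns whose top cell is still empty."""
    return [c for c in range(COLS) if board[0][c] == "."]

def play(board, col, player):
    """Return a new board with the player's piece dropped into col."""
    new = [row[:] for row in board]
    for r in range(ROWS - 1, -1, -1):  # lowest free cell first
        if new[r][col] == ".":
            new[r][col] = player
            return new
    raise ValueError("column is full")

def winner(board):
    """Return 'X' or 'O' if that player has four in a row, else None."""
    for r in range(ROWS):
        for c in range(COLS):
            p = board[r][c]
            if p == ".":
                continue
            for dr, dc in ((0, 1), (1, 0), (1, 1), (1, -1)):
                cells = [(r + i * dr, c + i * dc) for i in range(4)]
                if all(0 <= y < ROWS and 0 <= x < COLS and board[y][x] == p
                       for y, x in cells):
                    return p
    return None

def other(player):
    return "O" if player == "X" else "X"

class Node:
    """Search tree node; wins are counted for the player who moved into it."""
    def __init__(self, board, to_move, parent=None, move=None):
        self.board, self.to_move = board, to_move
        self.parent, self.move = parent, move
        self.children = []
        self.untried = [] if winner(board) else legal_moves(board)
        self.visits, self.wins = 0, 0.0

def rollout(board, to_move):
    """Play uniformly random moves to the end; return the winner or None."""
    while True:
        w = winner(board)
        moves = legal_moves(board)
        if w or not moves:
            return w
        board = play(board, random.choice(moves), to_move)
        to_move = other(to_move)

def mcts(board, to_move, iterations=500, c=1.4):
    """Return the most visited root move after the given number of iterations."""
    root = Node(board, to_move)
    for _ in range(iterations):
        node = root
        # 1. Selection: descend by UCB1 while the node is fully expanded.
        while not node.untried and node.children:
            node = max(node.children, key=lambda ch: ch.wins / ch.visits
                       + c * math.sqrt(math.log(node.visits) / ch.visits))
        # 2. Expansion: add one child for an untried move.
        if node.untried:
            move = node.untried.pop()
            child = Node(play(node.board, move, node.to_move),
                         other(node.to_move), node, move)
            node.children.append(child)
            node = child
        # 3. Simulation: random playout from the new node.
        result = rollout(node.board, node.to_move)
        # 4. Backpropagation: credit the player who moved into each node.
        while node is not None:
            node.visits += 1
            if result is None:
                node.wins += 0.5
            elif result == other(node.to_move):
                node.wins += 1.0
            node = node.parent
    return max(root.children, key=lambda ch: ch.visits).move
```

Calling `mcts(board, "X")` on an empty board (`[["."] * COLS for _ in range(ROWS)]`) returns a column index. An AlphaGo Zero style player would replace the random rollout and the UCB1 term with a learned value and policy network, which is what the third item refers to.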
$ python play_match.py --help
Usage: play_match.py [OPTIONS]

  Play a match

Options:
  --game [tictactoe|viergewinnt]  Game to be played  [required]
  -x [random|human|mcts]          Strategy for player X  [required]
  -o [random|human|mcts]          Strategy for player O  [required]
  --help                          Show this message and exit.
$ python play_match.py -x human -o mcts --game viergewinnt
Visualization of MCTS search tree with networkx
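
As a rough illustration of how a search tree could be turned into a networkx graph, the sketch below converts a nested dict into a `DiGraph`. The dict layout (`move`, `wins`, `visits`, `children`) and the function name `tree_to_graph` are hypothetical and not taken from this repository.

```python
import networkx as nx

def tree_to_graph(root):
    """Convert a nested dict tree (assumed layout) into a networkx DiGraph."""
    graph = nx.DiGraph()
    stack = [(root, None)]
    while stack:
        node, parent_id = stack.pop()
        node_id = graph.number_of_nodes()  # next unused integer id
        label = f"{node['move']}: {node['wins']}/{node['visits']}"
        graph.add_node(node_id, label=label)
        if parent_id is not None:
            graph.add_edge(parent_id, node_id)
        for child in node.get("children", []):
            stack.append((child, node_id))
    return graph

# Hypothetical search statistics, for illustration only.
tree = {
    "move": "root", "wins": 3, "visits": 5,
    "children": [
        {"move": 3, "wins": 2, "visits": 3,
         "children": [{"move": 4, "wins": 1, "visits": 1}]},
        {"move": 0, "wins": 0, "visits": 1},
    ],
}
graph = tree_to_graph(tree)

# Drawing additionally requires matplotlib:
# nx.draw(graph, labels=nx.get_node_attributes(graph, "label"), with_labels=True)
```

Storing the statistics as a node attribute keeps the graph structure independent of how it is rendered, so the same graph can be drawn or exported with other tools.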