AlphaGo

This repository contains a reference implementation of the AlphaGo AI by DeepMind.

How to play

Go

Bot vs. Bot

Run python bot_v_bot.py to let 2 Bots play against each other.

Human vs. Bot

Run python mcts_go.py to play against a bot.

Tic-Tac-Toe

Human vs. Bot

Run python play_ttt.py to play against an unbeatable bot.

Reinforcement Learning

Run python init_ac_agent.py --board-size 9 --output-file ./agents/ac_v1.h5
Run python self_play_ac.py --board-size 9 --learning-agent ./agents/ac_v1.h5 --num-games 5000 --experience-out ./experiences/exp_0001.h5 to let a bot play against itself and store experiences gathered during self play.
Run python train_ac.py --learning-agent ./agents/ac_v1.h5 --agent-out ./agents/ac_v2.h5 ./--lr 0.01 --bs 1024 experiences/exp_0001.h5 to use experience data for agent improvements via Deep Reinforcement Learning.
Run python eval_ac_bot.py --agent1 ./agents/ac_v2.h5 --agent2 ./agents/ac_v1.h5 --num-games 100 to check whether the new bot is stronger.

If the new agent is stronger start with it at 2.

Otherwise go to 2. again to generate more training data. Use multiple experience data files in 3.

Rinse and repeat.

Name		Name	Last commit message	Last commit date
Latest commit History 77 Commits
dlgo		dlgo
generated_games		generated_games
hack		hack
.gitignore		.gitignore
README.md		README.md
alpha_beta_go.py		alpha_beta_go.py
alphago_mcts_play.py		alphago_mcts_play.py
alphago_policy_rl.py		alphago_policy_rl.py
alphago_policy_sl.py		alphago_policy_sl.py
alphago_value.py		alphago_value.py
bot_v_bot.py		bot_v_bot.py
bot_v_gnugo.py		bot_v_gnugo.py
bot_v_x_gtp.py		bot_v_x_gtp.py
download_u_go_data.py		download_u_go_data.py
end_to_end.py		end_to_end.py
eval_ac_bot.py		eval_ac_bot.py
eval_pg_bot.py		eval_pg_bot.py
generate_mcts_games.py		generate_mcts_games.py
generate_zobrist.py		generate_zobrist.py
human_v_bot.py		human_v_bot.py
init_ac_agent.py		init_ac_agent.py
init_q_agent.py		init_q_agent.py
load_training_data.py		load_training_data.py
mcts_go.py		mcts_go.py
play_ttt.py		play_ttt.py
pruned_go.py		pruned_go.py
self_play_ac.py		self_play_ac.py
self_play_pg.py		self_play_pg.py
self_play_zero.py		self_play_zero.py
sgf_to_game_state.py		sgf_to_game_state.py
shell.nix		shell.nix
train_ac.py		train_ac.py
train_generator.py		train_generator.py
train_pg.py		train_pg.py
web_demo.py		web_demo.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AlphaGo

How to play

Go

Bot vs. Bot

Human vs. Bot

Tic-Tac-Toe

Human vs. Bot

Reinforcement Learning

Resources

About

Releases

Packages

Languages

pmuens/alphago

Folders and files

Latest commit

History

Repository files navigation

AlphaGo

How to play

Go

Bot vs. Bot

Human vs. Bot

Tic-Tac-Toe

Human vs. Bot

Reinforcement Learning

Resources

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages