情報処理学会第82回全国大会講演論文集

2D-02

Playing mini-Hanabi card game with Q-learning

○ひいとう（京大），市来正裕（名大），中里研一（ボッシュ）

Hanabi card game is a cooperative card game. Unlike the other games, the players can't see their own cards and can only see other people's. So, it is very challenging for AI players to learn this game. In this study we simulated the Hanabi card game and trained the AI player by using the Q-learning method. However, Q-learning method will take a large amount of time if space states is numerous. Therefore, we parameterized the numbers and kinds of cards to estimate the size of the space states. Finally, we minimized the cards number and trained the AI player by using Q-learning in a short time.