情報処理学会 第82回全国大会 会期:2020年3月5日~7日 会場:金沢工業大学 扇が丘キャンパス 情報処理学会 第82回全国大会 会期:2020年3月5日~7日 会場:金沢工業大学 扇が丘キャンパス

2D-02
Playing mini-Hanabi card game with Q-learning
○ひい とう(京大),市来正裕(名大),中里研一(ボッシュ)
Hanabi card game is a cooperative card game. Unlike the other games, the players can't see their own cards and can only see other people's. So, it is very challenging for AI players to learn this game. In this study we simulated the Hanabi card game and trained the AI player by using the Q-learning method. However, Q-learning method will take a large amount of time if space states is numerous. Therefore, we parameterized the numbers and kinds of cards to estimate the size of the space states. Finally, we minimized the cards number and trained the AI player by using Q-learning in a short time.