5M-09
Comparative analysis of Reinforcement Learning and Evolutionary Strategy in General Video Game Playing
○朱 晉賢,ラック ターウォンマット,原田智広(立命館大)
This paper presents a comparative analysis between Reinforcement Learning (RL) and Evolutionary Strategy (ES) for training rollout bias in General Video Game Playing (GVGP). GVGP has become an emerging research field, where researchers attempt to develop AI programs that can play various types of video game without prior knowledge. Monte-Carlo Tree Search, which does not require explicit evaluation function, has been a popular technique in GVGP, and existing research has succeeded in improving its performance by biasing the rollouts with weights vector, which is trained by ES. This paper compares RL and ES, and investigates the advantages and disadvantages of both techniques as rollout bias training mechanism in the GVGP domain.

footer 著作権について 倫理綱領 プライバシーポリシー セキュリティ 情報処理学会