A new technical report on temporal difference (TD) learning for games and "self-play" algorithms for game-agent training is available. This report by Wolfgang Konen features a gentle introduction to TD learning for game play and gives practitioners hints on implementing such algorithms. It shows the references to the most recent...
Mr. Markus Thill has won first prize in the 2012 OPITZ CONSULTING “Innovation in Informatics” contest. Many congratulations from the CIOP team! Mr. Thill’s thesis advanced the state of the art in reinforcement learning for complex board games, in this case Connect Four. Read more about his work on this page.