


{"id":750,"date":"2015-08-20T22:21:53","date_gmt":"2015-08-20T21:21:53","guid":{"rendered":"http:\/\/lwibs01.gm.fh-koeln.de\/blogs\/ciop\/?p=750"},"modified":"2015-09-02T12:05:08","modified_gmt":"2015-09-02T11:05:08","slug":"new-technical-report-on-temporal-difference-learning-for-games","status":"publish","type":"post","link":"https:\/\/blogs.gm.fh-koeln.de\/ciop\/2015\/08\/20\/new-technical-report-on-temporal-difference-learning-for-games\/","title":{"rendered":"New Technical Report on Temporal Difference Learning for Games"},"content":{"rendered":"<p class=\"lead\">\n\t<a class=\"thickbox\" href=\"http:\/\/lwibs01.gm.fh-koeln.de\/blogs\/ciop\/files\/2015\/08\/TD-TicTacToe2.png\"><img loading=\"lazy\" decoding=\"async\" alt=\"TD-TicTacToe2\" class=\"alignnone size-medium wp-image-755\" height=\"98\" src=\"http:\/\/lwibs01.gm.fh-koeln.de\/blogs\/ciop\/files\/2015\/08\/TD-TicTacToe2-300x74.png\" width=\"387\" \/><\/a>\n<\/p>\n<p>\n\tA new technical report on <strong>temporal difference (TD) learning for games<\/strong> and &quot;self-play&quot; algorithms for game-agent training is available. This report by Wolfgang Konen features a gentle introduction to TD learning for game play and gives hints for the practioner on the implementation of such algorithms . It shows the references to the most recent applications in this field and discusses in an appendix the more advanced topic of <strong>eligibility traces<\/strong> and how and why they work.\n<\/p>\n<p>\n\tThis report should be a help for people starting new in the field of TD learning for games and for people who work already in this field but struggle with specific details. It is an updated English translation of an earlier report in German language.\n<\/p>\n<p>\n\t<strong><span style=\"font-size:20px\">Publications <\/span><\/strong>\n<\/p>\n<div class=\"tp_single_publication\"><span class=\"tp_single_author\">Konen, Wolfgang: <\/span> <span class=\"tp_single_title\">Reinforcement Learning for Board Games: The Temporal Difference Algorithm<\/span>. <span class=\"tp_single_additional\"><span class=\"tp_pub_additional_institution\">Research Center CIOP (Computational Intelligence, Optimization and Data Mining) <\/span><span class=\"tp_pub_additional_address\">Cologne University of Applied Sciences, <\/span><span class=\"tp_pub_additional_year\">2015<\/span>.<\/span><\/div>\n<p>\n\t<a href=\"http:\/\/www.gm.fh-koeln.de\/ciopwebpub\/Kone15c.d\/TR-TDgame_EN.pdf\">PDF English<\/a>\n<\/p>\n<p>\n\t<a href=\"http:\/\/www.gm.fh-koeln.de\/ciopwebpub\/Kone15a.d\/TR-TDgame.pdf\">PDF German<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>A new technical report on temporal difference (TD) learning for games and &quot;self-play&quot; algorithms for game-agent training is available. This report by Wolfgang Konen features a gentle introduction to TD learning for game play and gives hints for the practioner on the implementation of such algorithms . It shows the references to the most recent...  <a href=\"https:\/\/blogs.gm.fh-koeln.de\/ciop\/2015\/08\/20\/new-technical-report-on-temporal-difference-learning-for-games\/\" class=\"more-link\" title=\"Read New Technical Report on Temporal Difference Learning for Games\"><?php _e(\"Read more &raquo;\",\"wpbootstrap\"); ?><\/a><\/p>\n","protected":false},"author":38,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[134,233,10130,224],"tags":[10132,7874,10153],"class_list":["post-750","post","type-post","status-publish","format-standard","hentry","category-allgemein","category-publications","category-reinforcement-learning","category-research","tag-game-learning","tag-games","tag-optimization"],"acf":[],"_links":{"self":[{"href":"https:\/\/blogs.gm.fh-koeln.de\/ciop\/wp-json\/wp\/v2\/posts\/750","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.gm.fh-koeln.de\/ciop\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.gm.fh-koeln.de\/ciop\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.gm.fh-koeln.de\/ciop\/wp-json\/wp\/v2\/users\/38"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.gm.fh-koeln.de\/ciop\/wp-json\/wp\/v2\/comments?post=750"}],"version-history":[{"count":11,"href":"https:\/\/blogs.gm.fh-koeln.de\/ciop\/wp-json\/wp\/v2\/posts\/750\/revisions"}],"predecessor-version":[{"id":769,"href":"https:\/\/blogs.gm.fh-koeln.de\/ciop\/wp-json\/wp\/v2\/posts\/750\/revisions\/769"}],"wp:attachment":[{"href":"https:\/\/blogs.gm.fh-koeln.de\/ciop\/wp-json\/wp\/v2\/media?parent=750"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.gm.fh-koeln.de\/ciop\/wp-json\/wp\/v2\/categories?post=750"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.gm.fh-koeln.de\/ciop\/wp-json\/wp\/v2\/tags?post=750"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}