{"id":5224,"date":"2016-01-28T03:02:07","date_gmt":"2016-01-28T03:02:07","guid":{"rendered":"http:\/\/www.kurzweilai.net\/?p=272197"},"modified":"2016-01-29T09:06:34","modified_gmt":"2016-01-29T09:06:34","slug":"google-machine-learning-system-is-first-to-defeat-professional-go-player","status":"publish","type":"post","link":"https:\/\/hoo.central12.com\/fugic\/2016\/01\/28\/google-machine-learning-system-is-first-to-defeat-professional-go-player\/","title":{"rendered":"Google machine-learning system is first to defeat professional Go player"},"content":{"rendered":"<div id=\"attachment_272225\" class=\"wp-caption aligncenter\" style=\"width: 434px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; display: block; margin-right: auto; margin-left: auto;\"><img class=\"wp-image-272225 \" title=\"Go\" src=\"http:\/\/www.kurzweilai.net\/images\/Go.jpg\" alt=\"\" width=\"424\" height=\"374\" \/><p style=' padding: 0 4px 5px; margin: 0;'  class=\"wp-caption-text\">Go is played on a grid of black lines (usually 19\u00d719). Game pieces, called stones, are played on the line intersections. (credit: Goban1\/Wikipedia)<\/p><\/div>\n<p>A deep-learning computer system called AlphaGo created by Google&#8217;s DeepMind team has defeated reigning three-time European Go champion Fan Hui 5 games to 0 &#8212; the first time a computer program has ever beaten a professional Go player, reports <a href=\"http:\/\/googleresearch.blogspot.com\/2016\/01\/alphago-mastering-ancient-game-of-go.html\" >Google Research blog<\/a> today (Jan. 27) &#8212; a feat previously thought to be at least a decade away.<\/p>\n<p>&#8220;AlphaGo uses general machine-learning techniques to allow it to improve itself, just by watching and playing games,&#8221; according to\u00a0David Silver and Demis Hassabis of Google DeepMind. 
Using a vast collection of more than 30 million Go moves from expert players, DeepMind researchers trained their system to play Go on its own.<\/p>\n<p>To achieve that, AlphaGo, as described in a paper in\u00a0<a href=\"http:\/\/www.nature.com\/nature\/journal\/v529\/n7587\/full\/nature16961.html\">Nature<\/a> today, combines a state-of-the-art tree search with two\u00a0<a href=\"https:\/\/en.wikipedia.org\/wiki\/Deep_learning#Deep_neural_networks\" >deep neural networks<\/a>, each containing many layers with millions of neuron-like connections needed to deal with Go&#8217;s vast search space &#8212; more than a\u00a0<a href=\"https:\/\/en.wikipedia.org\/wiki\/Googol\" >googol<\/a> (10<sup>100<\/sup>) times larger than chess (a number greater than the number of atoms in the universe).<\/p>\n<p>&#8220;We first trained [one of the two networks] on 30 million moves from games played by human experts, until it could predict the human move 57% of the time &#8230;. But our goal is to beat the best human players, not just mimic them,&#8221; Silver and Hassabis said. 
&#8220;To do this, AlphaGo learned to discover new strategies for itself, by playing thousands of games between its neural networks, and gradually improving them using a trial-and-error process known as\u00a0<a href=\"https:\/\/en.wikipedia.org\/wiki\/Reinforcement_learning\" >reinforcement learning<\/a>.&#8221;<\/p>\n<div id=\"attachment_272212\" class=\"wp-caption aligncenter\" style=\"width: 434px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; display: block; margin-right: auto; margin-left: auto;\"><a href=\"http:\/\/www.kurzweilai.net\/google-machine-learning-system-is-first-to-defeat-professional-go-player\/alpha-go-elo-rating\" rel=\"attachment wp-att-272212\"><img class=\" wp-image-272212 noshadow\" title=\"Alpha Go Elo rating\" src=\"http:\/\/www.kurzweilai.net\/images\/Alpha-Go-Elo-rating.jpg\" alt=\"\" width=\"424\" height=\"407\" \/><\/a><p style=' padding: 0 4px 5px; margin: 0;'  class=\"wp-caption-text\">This figure from the Nature article shows the Elo rating (a 230 point gap corresponds to a 79% probability of winning) and approximate rank of AlphaGo (both single machine and distributed versions), the European champion Fan Hui (a professional 2-dan), and the strongest other Go programs, evaluated over thousands of games. Pale pink bars show the performance of other programs when given a four move headstart. (credit: David Silver et al.\/Nature)<\/p><\/div>\n<p>AlphaGo\u2019s next challenge will be to play the top Go player in the world over the last decade,\u00a0<a href=\"http:\/\/senseis.xmp.net\/?YiSeTol\" >Lee Sedol<\/a>. The match will take place this March in Seoul, South Korea.<\/p>\n<p>&#8220;While games are the perfect platform for developing and testing AI algorithms quickly and efficiently, ultimately we want to apply these techniques to important real-world problems,&#8221; the researchers say. 
&#8220;Because the methods we have used are general purpose, our hope is that one day they could be extended to help us address some of society\u2019s toughest and most pressing problems, from climate modelling to complex disease analysis.&#8221;<\/p>\n<p><iframe frameborder=\"0\" height=\"360\" src=\"https:\/\/www.youtube.com\/embed\/SUbqykXVx0A?rel=0\" width=\"640\"><\/iframe><br \/>\n<em>Google DeepMind | Ground-breaking AlphaGo masters the game of Go<\/em><\/p>\n<hr \/>\n<p><strong>Abstract of <em>Mastering the game of Go with deep neural networks and tree search<\/em><\/strong><\/p>\n<p>The game of Go has long been viewed as the most challenging of classic games for artificial intelligence owing to its enormous search space and the difficulty of evaluating board positions and moves. Here we introduce a new approach to computer Go that uses \u2018value networks\u2019 to evaluate board positions and \u2018policy networks\u2019 to select moves. These deep neural networks are trained by a novel combination of supervised learning from human expert games, and reinforcement learning from games of self-play. Without any lookahead search, the neural networks play Go at the level of state-of-the-art Monte Carlo tree search programs that simulate thousands of random games of self-play. We also introduce a new search algorithm that combines Monte Carlo simulation with value and policy networks. Using this search algorithm, our program AlphaGo achieved a 99.8% winning rate against other Go programs, and defeated the human European Go champion by 5 games to 0. 
This is the first time that a computer program has defeated a human professional player in the full-sized game of Go, a feat previously thought to be at least a decade away.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>A deep-learning computer system called AlphaGo created by Google&rsquo;s DeepMind team has defeated reigning three-time European Go champion Fan Hui 5 games to 0 &mdash; the first time a computer program has ever beaten a professional Go player, reports Google Research blog today (Jan. 27) &mdash; a feat previously thought to be at least a [&#8230;]<\/p>\n","protected":false},"author":13,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[46,43],"tags":[],"class_list":["post-5224","post","type-post","status-publish","format-standard","hentry","category-airobotics","category-news"],"_links":{"self":[{"href":"https:\/\/hoo.central12.com\/fugic\/wp-json\/wp\/v2\/posts\/5224"}],"collection":[{"href":"https:\/\/hoo.central12.com\/fugic\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/hoo.central12.com\/fugic\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/hoo.central12.com\/fugic\/wp-json\/wp\/v2\/users\/13"}],"replies":[{"embeddable":true,"href":"https:\/\/hoo.central12.com\/fugic\/wp-json\/wp\/v2\/comments?post=5224"}],"version-history":[{"count":3,"href":"https:\/\/hoo.central12.com\/fugic\/wp-json\/wp\/v2\/posts\/5224\/revisions"}],"predecessor-version":[{"id":5259,"href":"https:\/\/hoo.central12.com\/fugic\/wp-json\/wp\/v2\/posts\/5224\/revisions\/5259"}],"wp:attachment":[{"href":"https:\/\/hoo.central12.com\/fugic\/wp-json\/wp\/v2\/media?parent=5224"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/hoo.central12.com\/fugic\/wp-json\/wp\/v2\/categories?post=5224"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/hoo.central12.com\/fugic\/wp-json\/wp\/v2\/tags?post=5224"}],
"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}