{"id":5931,"date":"2016-03-02T01:06:15","date_gmt":"2016-03-02T01:06:15","guid":{"rendered":"http:\/\/www.kurzweilai.net\/?p=274807"},"modified":"2016-03-03T06:46:14","modified_gmt":"2016-03-03T06:46:14","slug":"how-predicting-shakespeares-writing-could-improve-our-understanding-of-natural-language","status":"publish","type":"post","link":"https:\/\/hoo.central12.com\/fugic\/2016\/03\/02\/how-predicting-shakespeares-writing-could-improve-our-understanding-of-natural-language\/","title":{"rendered":"How predicting Shakespeare&rsquo;s writing could improve our understanding of natural language"},"content":{"rendered":"<div id=\"attachment_274970\" class=\"wp-caption aligncenter\" style=\"width: 498px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; display: block; margin-right: auto; margin-left: auto;\"><img class=\" wp-image-274970\" title=\"Shakespeare\" src=\"http:\/\/www.kurzweilai.net\/images\/Shakespeare1.jpg\" alt=\"\" width=\"488\" height=\"408\" \/><p style=' padding: 0 4px 5px; margin: 0;'  class=\"wp-caption-text\">Google used the writings of 1,000 authors to train a deep neural network to predict writing patterns (credit: Martin Droeshout\/Wikimedia Commons)<\/p><\/div>\n<p>A\u00a0Google <a href=\"https:\/\/en.wikipedia.org\/wiki\/Natural_language_understanding\" >natural language understanding<\/a>\u00a0research group led by Ray Kurzweil is building software systems that can understand natural language at a human level. The goal is to understand and interpret meanings of spoken or written language.<\/p>\n<p>One key to achieving that understanding is establishing <em>context<\/em>, suggest researchers Chris Tar; Marc Pickett, PhD.; and Brian Strope, PhD., on the <em><a href=\"http:\/\/googleresearch.blogspot.co.uk\/2016\/02\/on-personalities-of-dead-authors.html\" ><em>Google Research Blog<\/em><\/a><\/em>.<\/p>\n<p>For example, take the phrase, \u201c<em>Great, ice cream for dinner!\u201d\u00a0 <\/em>If a six-year-old says it, it means something very different than if a parent says it, they point out.<\/p>\n<p>That is, knowing the characteristics of the speaker (or writer) can narrow down the set of possible meanings of a phrase.<\/p>\n<p>Similarly, the researchers suggested, a <a href=\"https:\/\/en.wikipedia.org\/wiki\/Deep_learning\">deep neural network<\/a>\u00a0(DNN) that takes into account the specific author\u2019s style and \u201cpersonality\u201d should be able to predict (with higher accuracy than with a random guess) the next sentence an author would be likely to write in a book.<\/p>\n<p>To test that idea, the researchers imported the text of 1,000 different authors from the\u00a0<a href=\"https:\/\/www.gutenberg.org\/\">Project Gutenberg<\/a> website.<\/p>\n<p>&#8220;The information our system derived is presumably representative of the author\u2019s word choice, thinking, and style,&#8221; say the researchers. &#8220;We call these \u201cAuthor vectors.\u2019&#8221;<\/p>\n<div id=\"attachment_274905\" class=\"wp-caption aligncenter\" style=\"width: 399px;  border: 1px solid #dddddd; background-color: #f3f3f3; padding-top: 4px; margin: 10px; text-align:center; display: block; margin-right: auto; margin-left: auto;\"><img class=\"size-full wp-image-274905\" title=\"author vectors\" src=\"http:\/\/www.kurzweilai.net\/images\/author-vectors.jpg\" alt=\"\" width=\"389\" height=\"240\" \/><p style=' padding: 0 4px 5px; margin: 0;'  class=\"wp-caption-text\">A section of a representation of &#8220;Author vectors&#8221; for some of the authors in the study. Note that contemporaries and influencers tend to be near each other (e.g., Marlowe and Shakespeare vs. Milton and Pope). It uses the t-SNE algorithm. (credit: Google\/Christopher Olah)<\/p><\/div>\n<p>Essentially, the system is saying, \u201cI&#8217;ve been told that this is Shakespeare, who tends to write like this, so I&#8217;ll take that into account when weighing which sentence is more likely to follow.&#8221; In effect, one can chat with a statistical representation of text written by Shakespeare, the researchers note.<\/p>\n<p>(Or in the future, suggest completions to the unfinished works of Philip K. Dick?)<\/p>\n<p>The system could enrich Google products through personalization, the researchers suggest. \u201cFor example, it could help provide more personalized response options for the recently introduced\u00a0<a href=\"http:\/\/googleresearch.blogspot.com\/2015\/11\/computer-respond-to-this-email.html\">Smart Reply feature<\/a>\u00a0in Inbox by Gmail\u201d (a system that could automatically determine if an email was answerable with a short reply, and compose a few suitable responses that a user could edit or send with just a tap).<\/p>\n","protected":false},"excerpt":{"rendered":"<p>A&nbsp;Google natural language understanding&nbsp;research group led by Ray Kurzweil is building software systems that can understand natural language at a human level. The goal is to understand and interpret meanings of spoken or written language. One key to achieving that understanding is establishing context, suggest researchers Chris Tar; Marc Pickett, PhD.; and Brian Strope, PhD., [&#8230;]<\/p>\n","protected":false},"author":13,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[46,43],"tags":[],"class_list":["post-5931","post","type-post","status-publish","format-standard","hentry","category-airobotics","category-news"],"_links":{"self":[{"href":"https:\/\/hoo.central12.com\/fugic\/wp-json\/wp\/v2\/posts\/5931"}],"collection":[{"href":"https:\/\/hoo.central12.com\/fugic\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/hoo.central12.com\/fugic\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/hoo.central12.com\/fugic\/wp-json\/wp\/v2\/users\/13"}],"replies":[{"embeddable":true,"href":"https:\/\/hoo.central12.com\/fugic\/wp-json\/wp\/v2\/comments?post=5931"}],"version-history":[{"count":4,"href":"https:\/\/hoo.central12.com\/fugic\/wp-json\/wp\/v2\/posts\/5931\/revisions"}],"predecessor-version":[{"id":5962,"href":"https:\/\/hoo.central12.com\/fugic\/wp-json\/wp\/v2\/posts\/5931\/revisions\/5962"}],"wp:attachment":[{"href":"https:\/\/hoo.central12.com\/fugic\/wp-json\/wp\/v2\/media?parent=5931"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/hoo.central12.com\/fugic\/wp-json\/wp\/v2\/categories?post=5931"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/hoo.central12.com\/fugic\/wp-json\/wp\/v2\/tags?post=5931"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}