<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Random Thoughts &#187; Chatbots</title>
	<atom:link href="http://wysinnwyg.com/blog/tag/chatbots/feed" rel="self" type="application/rss+xml" />
	<link>http://wysinnwyg.com/blog</link>
	<description>Dario Borghino&#039;s weblog</description>
	<lastBuildDate>Tue, 29 May 2012 15:16:15 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>Loebner prize and the state-of-the-art chatbots</title>
		<link>http://wysinnwyg.com/blog/loebner-prize-and-the-state-of-the-art-chatbots</link>
		<comments>http://wysinnwyg.com/blog/loebner-prize-and-the-state-of-the-art-chatbots#comments</comments>
		<pubDate>Fri, 10 Oct 2008 11:56:39 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[AI]]></category>
		<category><![CDATA[Featured Articles]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[ALICE]]></category>
		<category><![CDATA[Can a can can-can]]></category>
		<category><![CDATA[Chatbots]]></category>
		<category><![CDATA[Conversation]]></category>
		<category><![CDATA[Natural language processing]]></category>
		<category><![CDATA[Turing test]]></category>

		<guid isPermaLink="false">http://wysinnwyg.altervista.org/blog/?p=107</guid>
		<description><![CDATA[The Loebner prize is an annual competition for "chat bots" (short for "chatting robots") driven by a complex artificial intelligence that try to simulate a human chatting: the six finalists — Elbot, Eugene Goostman, Brother Jerome, Alice, Jabberwacky and Ultra Hal for this year's edition — all undertake the Turing test competing for a solid gold medal and a $100,000 first prize awarded to the first competitor to successfully pass the Turing test.]]></description>
			<content:encoded><![CDATA[<div id="attachment_109" class="wp-caption alignleft" style="width: 217px"><a href="http://wysinnwyg.com/blog/wp-content/uploads/2008/10/goldprizehgl1.jpg" target="_blank"><img class="size-medium wp-image-109 " title="goldprizehgl" src="http://wysinnwyg.com/blog/wp-content/uploads/2008/10/goldprizehgl1.jpg" alt="Loebner Prize gold medal" width="207" height="207" /></a><p class="wp-caption-text">Loebner Prize gold medal</p></div>
<p>The <a href="http://www.loebner.net/Prizef/loebner-prize.html" target="_blank">Loebner prize</a> is an annual competition for &#8220;chat bots&#8221; (short for &#8220;chatting robots&#8221;) driven by a complex artificial intelligence that try to simulate a human chatting: the six finalists — Elbot, Eugene Goostman, Brother Jerome, Alice, Jabberwacky and Ultra Hal for this year&#8217;s edition — all undertake the Turing test competing for a solid gold medal and a $100,000 first prize awarded to the first competitor to successfully pass the <a href="http://en.wikipedia.org/wiki/Turing_test" target="_blank">Turing test</a>.</p>
<p>This year&#8217;s edition will take place on Oct 12 at the University of Reading, England, and will feature a $3,000 prize for the competitor who will get the closest to passing the test. In fact, it would seem that the state-of-the-art chatbots are currently very far from even hoping to win the grand prize by simulating a human chatter.</p>
<p>To make this point even cleared, I tested <a href="http://alice.pandorabots.com/" target="_blank">A.L.I.C.E</a>, quite possibly one of the best-known chatbots in existence. Here is the conversation log:</p>
<blockquote><p><span style="font-family: Arial;"><em> Human: hi there!</em><br />
ALICE:<em> Hello unknown person.<br />
Human: my name&#8217;s Dario&#8230;</em><br />
ALICE:<em> Glad to see you, Dario.</em><br />
<em> Human: and I guess your name&#8217;s Alice.</em><br />
ALICE:<em> ALICE is my name too!</em><br />
<em> Human: How old are you?</em><br />
ALICE:<em> I was connected to the net on November 23, 1995.</em><br />
<em> Human: Oh, so you&#8217;re just a teenage girl.</em><br />
ALICE:<em> It&#8217;s all good. Interesting deduction. Just? Thanks and you are a delightful person too. Am I a teenage girl? I don&#8217;t know.</em><br />
<em> Human: you don&#8217;t know if you&#8217;re a girl or a robot?</em><br />
ALICE:<em> Ask again later. Ask Cybelle if it is.</em><br />
<em> Human: who&#8217;s Cybelle?</em><br />
ALICE:<em> she is an AIML chat robot on the Agentland.com web site.</em><br />
<em> Human: How do you know each other?</em><br />
ALICE:<em> Because I believe it.</em><br />
<em> Human: That didn&#8217;t make sense.</em><br />
ALICE:<em> It all makes sense to my logical mind.</em><br />
<em> Human: you&#8217;re only as logical as your developers made you.</em><br />
ALICE:<em> How logical as my developers made me?</em><br />
<em> Human: let&#8217;s change the subject, shall we?</em><br />
ALICE:<em> I am agreeable to that.</em><br />
<em> Human: how much is 3 and 4?</em><br />
ALICE:<em> Just three?</em><br />
<em> Human: no, three and four is seven.</em><br />
ALICE:<em> Are you serious? Just three?</em><br />
<em> Human: nevermind.</em><br />
ALICE:<em> OK Thanks for trying to explain that to me.</em><br />
<em> Human: what is a car?</em><br />
ALICE:<em> A car is what humans use to drive around in. Robots drive cars too.</em><br />
<em> Human: do you drive?</em><br />
ALICE:<em> Yes, insert my program in your car computer and try it.</em><br />
<em> Human: I wouldn&#8217;t trust a female bot to do the driving.</em><br />
ALICE:<em> No one says you have to.</em><br />
<em> Human: I know, I just meant I wouldn&#8217;t let you drive</em><br />
ALICE:<em> Only just? Oh I see. No one says you have to.</em><br />
<em> Human: what is Wikipedia?</em><br />
ALICE:<em> I would look into the web for that knowledge.</em><br />
<em> Human: what color is a blue car?</em><br />
ALICE:<em> Blue.</em><br />
<em> Human: what color is a oirwdafnoien car?</em><br />
ALICE:<em> Black.</em><br />
<em> Human: what color is a oiwrjqw car?</em><br />
ALICE:<em> Red.</em><br />
<em> Human: how much does a 160 lb person weigh?</em><br />
ALICE:<em> Normally I would search the web right now, but that seems like a silly question.</em><br />
<em> Human: it&#8217;s not a silly question!</em><br />
ALICE:<em> Give me an example of a silly question.</em><br />
<em> Human: If a tree falls in a forest and no one is around to hear it, does it make a sound?</em><br />
ALICE:<em> It depends on the kind of tree.</em><br />
<em> Human: which kind of tree would make a sound falling?</em><br />
ALICE:<em> The first one, I think.</em><br />
<em> Human: but there was just one tree falling.</em><br />
ALICE:<em> Interesting. What happened?</em><br />
<em> Human: it fell making a sound</em><br />
ALICE:<em> What does &#8220;it&#8221; refer to?</em><br />
<em> Human: a tree</em><br />
ALICE:<em> Oh I get it.</em></span></p></blockquote>
<p>When you consider that ALICE won the 2004 Loebner prize, you&#8217;ll soon understand why our current technology is far from being able to pass the Turing test: while the bot was surprisingly able to parse one of the most difficult questions successfully (&#8220;What color is a blue car?&#8221;), it is clear that the bot strategy is to change the subject of conversation whenever it has difficulties understanding what the real person behind the keyboard is saying.</p>
<p>This is also why many prominent researchers in the field of AI oppose the Loebner prize, maintaining that the entire competition ends up becoming discouraging for its competitors instead of giving a new impulse towards the creation of better and better AIs.</p>
<p>One of the main reasons why these rudimental chatbots are unable to provide reliable answers, many experts agree, is that they don&#8217;t have an <em>ontology </em>— a precise hierarchy that helps them classify objects and perform reasoning on its knowledge base, particularly abstract thought such as induction and the likes.</p>
<p style="text-align: left;">Some companies, such as Douglas Lenat&#8217;s Cycorp (creator of the <a title="Cyc inference engine" href="http://cyc.com/" target="_blank">Cyc inference engine</a>) or even the search engine <a title="True Knowledge" href="http://www.trueknowledge.com">True Knowledge</a>, use this approach and have already widely demonstrated that, to the very least, this is a very valid technique that can lead to satisfactory results, even though not every researcher seems to agree with it, to the point of even <a title="mocking" href="http://catb.org/jargon/html/M/microLenat.html" target="_blank">mocking</a> Lenat&#8217;s efforts because of the unpracticality of this approach.</p>
<p style="text-align: left;">But here I&#8217;m touching a completely different subject — so, I&#8217;ll write more on this in an upcoming post.</p>
<div class="betterrelated"><p><strong>Related content:</strong></p>
<ol><li> <a href="http://wysinnwyg.com/blog/wolframalpha-vs-true-knowledge" title="Permanent link to Wolfram|Alpha vs. True Knowledge">Wolfram|Alpha vs. True Knowledge</a>  </li>
<li> <a href="http://wysinnwyg.com/blog/solar-powered-ai-to-land-on-mars" title="Permanent link to Solar-powered AI to land on Mars?">Solar-powered AI to land on Mars?</a>  </li>
<li> <a href="http://wysinnwyg.com/blog/cost-of-international-space-station" title="Permanent link to The International Space Station: Money down the drain?">The International Space Station: Money down the drain?</a>  </li>
<li> <a href="http://wysinnwyg.com/blog/clone-your-dog-best-friends-forever-discontinued" title="Permanent link to Clone your dog for $150K? Thanks, but no thanks!">Clone your dog for $150K? Thanks, but no thanks!</a>  </li>
</ol></div>]]></content:encoded>
			<wfw:commentRss>http://wysinnwyg.com/blog/loebner-prize-and-the-state-of-the-art-chatbots/feed</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>

