<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Aligned Left Blog &#187; digitizing</title>
	<atom:link href="http://alignedleft.com/blog/tag/digitizing/feed/" rel="self" type="application/rss+xml" />
	<link>http://alignedleft.com/blog</link>
	<description>Exploring digital culture and dynamic media</description>
	<lastBuildDate>Thu, 15 Jul 2010 23:22:46 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>ReCAPTCHA All That Data</title>
		<link>http://alignedleft.com/blog/2008/02/recaptcha-all-that-data/</link>
		<comments>http://alignedleft.com/blog/2008/02/recaptcha-all-that-data/#comments</comments>
		<pubDate>Wed, 13 Feb 2008 16:43:28 +0000</pubDate>
		<dc:creator>Scott</dc:creator>
				<category><![CDATA[Thesis]]></category>
		<category><![CDATA[books]]></category>
		<category><![CDATA[digitizing]]></category>

		<guid isPermaLink="false">http://alignedleft.com/blog/2008/02/recaptcha-all-that-data/</guid>
		<description><![CDATA[

I was exploring Lawrence Lessig’s blog earlier and noticed that his commenting system uses reCAPTCHA, a CAPTCHA system run out of Carnegie Mellon.

What’s interesting about reCAPTCHA is that the words shown are unknown, even to the system.  A regular CAPTCHA displays known words in distorted text, hopefully in such a way that only humans [...]]]></description>
			<content:encoded><![CDATA[<p><img src='http://alignedleft.com/blog/wp-content/uploads/2008/02/recaptcha.png' alt='recaptcha.png' /></p>

<p>I was exploring <a href="http://lessig.org/blog/2008/02/20_minutes_or_so_on_why_i_am_4.html">Lawrence Lessig’s blog</a> earlier and noticed that his commenting system uses <a href="http://recaptcha.net/learnmore.html">reCAPTCHA</a>, a <a href="http://en.wikipedia.org/wiki/CAPTCHA">CAPTCHA</a> system run out of Carnegie Mellon.</p>

<p>What’s interesting about reCAPTCHA is that the words shown are unknown, even to the system.  A regular CAPTCHA displays known words in distorted text, hopefully in such a way that only humans (and not spam robots) can read them.  The human user types in the characters shown, and the system validates that the entry is correct.</p>

<p>reCAPTCHA uses words scanned from old books, and correlates the user’s input to a portion of the original work.  Like <a href="http://www.mturk.com/">Amazon’s Mechanical Turk</a>, the Carnegie Mellon system deploys micro-tasks and takes advantage of distributed human labor to complete giant projects via minimal contributions of many.  The idea, of course, is to perform OCR on books that are not practical to digitize using software (due to blurry letters, old typefaces, etc.).</p>
]]></content:encoded>
			<wfw:commentRss>http://alignedleft.com/blog/2008/02/recaptcha-all-that-data/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

<!-- Dynamic Page Served (once) in 0.164 seconds -->
