<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Hitesh Sarda &#187; ocr</title>
	<atom:link href="http://hitesh.in/tag/ocr/feed/" rel="self" type="application/rss+xml" />
	<link>http://hitesh.in</link>
	<description>Thoughts on life, technology, education and entrepreneurship</description>
	<lastBuildDate>Thu, 29 Mar 2012 17:13:06 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Tip: Convert an image to text (OCR)</title>
		<link>http://hitesh.in/2009/tip-convert-an-image-to-text-ocr/</link>
		<comments>http://hitesh.in/2009/tip-convert-an-image-to-text-ocr/#comments</comments>
		<pubDate>Fri, 10 Jul 2009 21:01:28 +0000</pubDate>
		<dc:creator>Hitesh</dc:creator>
				<category><![CDATA[Education]]></category>
		<category><![CDATA[ocr]]></category>
		<category><![CDATA[tip]]></category>

		<guid isPermaLink="false">http://hitesh.in/?p=125</guid>
		<description><![CDATA[Haven&#8217;t there been times when you came across some text that you could not copy or edit? It could be for a variety of reasons, whether it is a scanned copy, or an image containing text, a screen shot etc. &#8230; <a href="http://hitesh.in/2009/tip-convert-an-image-to-text-ocr/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Haven&#8217;t there been times when you came across some text that you could not copy or edit? It could be for a variety of reasons, whether it is a scanned copy, or an image containing text, a screen shot etc.</p>
<p>I came across a similar situation yesterday, when I was looking for a word list to create a Pictionary for my kid. &#8220;Stretch mark mama&#8221; had this <a href="http://stretchmarkmama.blogspot.com/2008/12/pictionary-for-kids.html">great list</a>; unfortunately she shared it as an image. I needed to extract the text out of the image and remembered an old trick. So I thought I would as well share it.</p>
<p>The trick is to use the Document Imaging application that comes with MS Office.</p>
<p><span id="more-125"></span></p>
<p>The list looked something like this:</p>
<p><img src="http://hitesh.in/wp-content/uploads/2009/07/071009_2101_TipConverta1.png" alt="" /></p>
<p>I converted the  <abbr title="Joint Photographic Experts Group Image File">jpeg</abbr> file to a <abbr title="Tagged Image File Format">tiff</abbr> file using <a href="http://www.irfanview.com">IrfanView</a>. The other option is to print the file using the <em>MS Document Image Writer</em> printer, as shown below.</p>
<p><img src="http://hitesh.in/wp-content/uploads/2009/07/071009_2101_TipConverta2.png" alt="" /></p>
<p>Once you have a tiff file, open it in Document Imaging application.</p>
<p><img src="http://hitesh.in/wp-content/uploads/2009/07/071009_2101_TipConverta3.png" alt="" /></p>
<p>Click on the send to word icon (one with the little arrow and blue W), which will trigger an OCR process.</p>
<p><img src="http://hitesh.in/wp-content/uploads/2009/07/071009_2101_TipConverta4.png" alt="" /></p>
<p>When the <abbr title="Optical Character Recognition">OCR</abbr> completes, you get this output in Word:</p>
<table border="0">
<tbody>
<tr>
<td>Ankle</td>
<td>Arm</td>
<td>Banana</td>
<td>Bicycle</td>
<td>Big toe</td>
</tr>
<tr>
<td>Boat</td>
<td>Bow tie</td>
<td>Bumblebee</td>
<td>Bunk bed</td>
<td>Butterfly</td>
</tr>
<tr>
<td>Candy cane</td>
<td>Car</td>
<td>Carrot</td>
<td>Chair</td>
<td>Circle</td>
</tr>
</tbody>
</table>
<p>Hope it saves your day, one day <img src='http://hitesh.in/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
<p class="note">Note that MDI app is not installed in the standard Office setup, so you might need to customize your setup.</p>
]]></content:encoded>
			<wfw:commentRss>http://hitesh.in/2009/tip-convert-an-image-to-text-ocr/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

