<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: OCRFeeder 0.7.7 released</title>
	<atom:link href="http://www.joaquimrocha.com/2011/12/10/ocrfeeder-0-7-7-released/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.joaquimrocha.com/2011/12/10/ocrfeeder-0-7-7-released/</link>
	<description>Free Software and travelling.</description>
	<lastBuildDate>Fri, 10 May 2013 13:05:04 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
	<item>
		<title>By: Joaquim Rocha</title>
		<link>http://www.joaquimrocha.com/2011/12/10/ocrfeeder-0-7-7-released/comment-page-1/#comment-6573</link>
		<dc:creator>Joaquim Rocha</dc:creator>
		<pubDate>Wed, 25 Jan 2012 19:43:27 +0000</pubDate>
		<guid isPermaLink="false">http://www.joaquimrocha.com/?p=1214#comment-6573</guid>
		<description><![CDATA[Don&#039;t worry, happens all the time :)]]></description>
		<content:encoded><![CDATA[<p>Don&#8217;t worry, happens all the time <img src='http://www.joaquimrocha.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Russell Hedger</title>
		<link>http://www.joaquimrocha.com/2011/12/10/ocrfeeder-0-7-7-released/comment-page-1/#comment-6572</link>
		<dc:creator>Russell Hedger</dc:creator>
		<pubDate>Wed, 25 Jan 2012 19:42:08 +0000</pubDate>
		<guid isPermaLink="false">http://www.joaquimrocha.com/?p=1214#comment-6572</guid>
		<description><![CDATA[I can&#039;t even spell it correctly, Joaquim.]]></description>
		<content:encoded><![CDATA[<p>I can&#8217;t even spell it correctly, Joaquim.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Russell Hedger</title>
		<link>http://www.joaquimrocha.com/2011/12/10/ocrfeeder-0-7-7-released/comment-page-1/#comment-6571</link>
		<dc:creator>Russell Hedger</dc:creator>
		<pubDate>Wed, 25 Jan 2012 19:40:19 +0000</pubDate>
		<guid isPermaLink="false">http://www.joaquimrocha.com/?p=1214#comment-6571</guid>
		<description><![CDATA[Ha, sorry: Joachim, not Jürgen!]]></description>
		<content:encoded><![CDATA[<p>Ha, sorry: Joachim, not Jürgen!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Joaquim Rocha</title>
		<link>http://www.joaquimrocha.com/2011/12/10/ocrfeeder-0-7-7-released/comment-page-1/#comment-6570</link>
		<dc:creator>Joaquim Rocha</dc:creator>
		<pubDate>Wed, 25 Jan 2012 17:27:24 +0000</pubDate>
		<guid isPermaLink="false">http://www.joaquimrocha.com/?p=1214#comment-6570</guid>
		<description><![CDATA[I can add a plain-text export feature to the CLI for the next version.

BTW, who is Jürgen? :)]]></description>
		<content:encoded><![CDATA[<p>I can add a plain-text export feature to the CLI for the next version.</p>
<p>BTW, who is Jürgen? <img src='http://www.joaquimrocha.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Russell Hedger</title>
		<link>http://www.joaquimrocha.com/2011/12/10/ocrfeeder-0-7-7-released/comment-page-1/#comment-6568</link>
		<dc:creator>Russell Hedger</dc:creator>
		<pubDate>Wed, 25 Jan 2012 16:21:55 +0000</pubDate>
		<guid isPermaLink="false">http://www.joaquimrocha.com/?p=1214#comment-6568</guid>
		<description><![CDATA[Hello Jürgen,

I am using OCRFeeder from the repositories on Ubuntu 11.10, and it works quite well with Cuneiform as the OCR engine. However, when using the latest stable version of tesseract-ocr 3.01 (which I have found to be more accurate at OCR), character recognition does not work properly in OCRFeeder. Perhaps it is related to this issue:

http://code.google.com/p/tesseract-ocr/issues/detail?id=580

Also, I have been experimenting with using the OCRFeeder CLI to feed scanned print into a speech engine as an aid for someone with a visual impairment. The GUI does a pretty good job of recognising text boxes, performing OCR and exporting as plain text. I have found that Tesseract alone reads print quite well but gets confused by artifacts such as staples, edges and line graphics. OCRFeeder is much better at dealing with this issue and at passing the correct image parts to Tesseract to be recognised. I notice that the CLI only exports as HTML and ODT, and it would be very helpful if a future version could output plain text, which could then be passed straight to a speech engine.]]></description>
		<content:encoded><![CDATA[<p>Hello Jürgen,</p>
<p>I am using OCRFeeder from the repositories on Ubuntu 11.10, and it works quite well with Cuneiform as the OCR engine. However, when using the latest stable version of tesseract-ocr 3.01 (which I have found to be more accurate at OCR), character recognition does not work properly in OCRFeeder. Perhaps it is related to this issue:</p>
<p><a href="http://code.google.com/p/tesseract-ocr/issues/detail?id=580" rel="nofollow">http://code.google.com/p/tesseract-ocr/issues/detail?id=580</a></p>
<p>Also, I have been experimenting with using the OCRFeeder CLI to feed scanned print into a speech engine as an aid for someone with a visual impairment. The GUI does a pretty good job of recognising text boxes, performing OCR and exporting as plain text. I have found that Tesseract alone reads print quite well but gets confused by artifacts such as staples, edges and line graphics. OCRFeeder is much better at dealing with this issue and at passing the correct image parts to Tesseract to be recognised. I notice that the CLI only exports as HTML and ODT, and it would be very helpful if a future version could output plain text, which could then be passed straight to a speech engine.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
