<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Joaquim Rocha's Web Page</title>
	<atom:link href="http://www.joaquimrocha.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.joaquimrocha.com</link>
	<description>Linux, technology and art</description>
	<lastBuildDate>Sat, 07 Aug 2010 23:41:27 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.5</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Engaged</title>
		<link>http://www.joaquimrocha.com/2010/08/08/engaged/</link>
		<comments>http://www.joaquimrocha.com/2010/08/08/engaged/#comments</comments>
		<pubDate>Sat, 07 Aug 2010 23:31:25 +0000</pubDate>
		<dc:creator>Joaquim Rocha</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[life]]></category>

		<guid isPermaLink="false">http://www.joaquimrocha.com/?p=534</guid>
		<description><![CDATA[That&#8217;s right, after being seven years with Helena, I proposed to her. It was in the Grand Place, in the beautiful city of Brussels. 
We&#8217;ve been together for a long time, she&#8217;s a beautiful person, I love her and I wanna be with her.
Now, she already knew this but we&#8217;ve been talking about celebrating this [...]]]></description>
			<content:encoded><![CDATA[<p>That&#8217;s right, after being seven years with <a href="http://lenagrosso.wordpress.com">Helena</a>, I proposed to her. It was in the Grand Place, in the beautiful city of Brussels. </p>
<p>We&#8217;ve been together for a long time, she&#8217;s a beautiful person, I love her and I wanna be with her.<br />
Now, she already knew this but we&#8217;ve been talking about celebrating this relation with a party for our friends and family, the thing people usually call wedding; this is what marriage is to us, a celebration (besides the advantages that the government gives), we don&#8217;t see it as a requirement to have a serious relationship.</p>
<p>So, we don&#8217;t want a traditional wedding, we haven&#8217;t decided how it&#8217;s gonna be but we want it to be great. BTW, if you have <a href="http://www.gogolbordello.com/">Gogol Bordello</a>&#8217;s number, we might wanna ask for a budget!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.joaquimrocha.com/2010/08/08/engaged/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>GUADEC and InterRail</title>
		<link>http://www.joaquimrocha.com/2010/08/05/guadec-and-interrail/</link>
		<comments>http://www.joaquimrocha.com/2010/08/05/guadec-and-interrail/#comments</comments>
		<pubDate>Thu, 05 Aug 2010 07:33:24 +0000</pubDate>
		<dc:creator>Joaquim Rocha</dc:creator>
				<category><![CDATA[gnome]]></category>
		<category><![CDATA[grilo]]></category>
		<category><![CDATA[guadec]]></category>
		<category><![CDATA[igalia]]></category>
		<category><![CDATA[planet]]></category>
		<category><![CDATA[travel]]></category>
		<category><![CDATA[trip]]></category>

		<guid isPermaLink="false">http://www.joaquimrocha.com/?p=527</guid>
		<description><![CDATA[I&#8217;ve spent last week in Den Haag, attending GUADEC 2010, with many other fellow Igalians.
Although last year I also attended GUADEC, this year was like a first time to me as last year&#8217;s GUADEC was co-located with aKademy forming Gran Canaria Desktop summit and it felt different.
What do I think of this year&#8217;s? Javascript, web, [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve spent last week in Den Haag, attending <a href="http://www.guadec.org">GUADEC</a> 2010, with many other fellow Igalians.</p>
<p>Although last year I also attended GUADEC, this year was like a first time to me as last year&#8217;s GUADEC was co-located with aKademy forming Gran Canaria Desktop summit and it felt different.</p>
<p>What do I think of this year&#8217;s? Javascript, web, introspection and shell, that&#8217;s what I think.<br />
This is GNOME&#8217;s new route, making desktop development more webbish and it is seems like a smart one too. Now, I don&#8217;t really fall for Javascript, I think it&#8217;s ugly and not really the best choice (imho) for large projects but anyway that&#8217;s the beauty of GObject Introspection, in the future it should be easy to use whatever language one prefers.</p>
<p>As for the talks, I really enjoyed <a href="http://guadec.org/index.php/guadec/2010/paper/view/124">Luis Villa&#8217;s keynote</a>. <a href="http://blogs.gnome.org/xan">Xan</a> and Fernando did a great job getting the tragedy that some times the Foundation&#8217;s mailing list is and turning it into a comedy.<br />
<a href="http://blogs.igalia.com/itoral">Iago</a> gave a good talk about Grilo and <a href="http://blogs.igalia.com/jasuarez">Juan</a> complemented it in a lightning talk about the plugins we did using Rygel-grilo.</p>
<p>This year I gave again a lightening talk, this time about the <a href="http://www.joaquimrocha.com/2010/03/03/text-prediction-on-gnome/">Predictor Input Method</a> which you might one day use in a mobile device or on the desktop itself if you need assisted typing. There must be a GUADEC&#8217;s rule saying that the laptop where people present the lightening talks must be a crappy netbook that takes 2 seconds before it changes a slide&#8230;</p>
<p>For an overall feeling of GUADEC, you can check out <a href="http://blogs.igalia.com/vjaquez/2010/08/03/guadec-2010/" target="_blank">Victor&#8217;s post covering GUADEC</a>, I agree totally with him.</p>
<p>I could also meet and chat with nice people like <a href="http://monotonous.org/" target="_blank">Eitan Isaacson</a>, Patricia and others.</p>
<p>So let&#8217;s see how the projects presented in GUADEC evolve and wait GUADEC 2011 in Berlin.</p>
<p>And what this week? This week I&#8217;m on vacation doing an InterRail across a bunch of European countries together with my girlfriend. I visited Paris already, where I found out my french is good enough for basic stuff. Today we&#8217;re in Brussels, it&#8217;s my third time here but the first one as a turist. I&#8217;m sure the beers will taste as marvellous as always.</p>
<p>I&#8217;m also doing a new thing: travelling without my laptop, the N900 seems to be a perfect replacement, I (still?) love this gadget and it surelly spares some space in my backpack.</p>
<p>See you in some European city, I&#8217;m likely to be wearing a GNOME/Linux/Metal t-shirt&#8230; what else is new&#8230;?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.joaquimrocha.com/2010/08/05/guadec-and-interrail/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>One more step in OCR with OCRFeeder 0.7</title>
		<link>http://www.joaquimrocha.com/2010/07/30/one-more-step-in-ocr-with-ocrfeeder-0-7/</link>
		<comments>http://www.joaquimrocha.com/2010/07/30/one-more-step-in-ocr-with-ocrfeeder-0-7/#comments</comments>
		<pubDate>Fri, 30 Jul 2010 13:44:14 +0000</pubDate>
		<dc:creator>Joaquim Rocha</dc:creator>
				<category><![CDATA[gnome]]></category>
		<category><![CDATA[gtk]]></category>
		<category><![CDATA[guadec]]></category>
		<category><![CDATA[gui]]></category>
		<category><![CDATA[ocr]]></category>
		<category><![CDATA[ocrfeeder]]></category>
		<category><![CDATA[planet]]></category>
		<category><![CDATA[programming]]></category>
		<category><![CDATA[python]]></category>

		<guid isPermaLink="false">http://www.joaquimrocha.com/?p=486</guid>
		<description><![CDATA[I have been hacking on some new and cool features on OCRFeeder for a while and now it is time to show them to the world in a new release.
These features I&#8217;m talking about fall mainly in 2 areas: improving the a11y of the UI and improving the recognition of documents.
A11y Improvement
The improvement of the [...]]]></description>
			<content:encoded><![CDATA[<p>I have been hacking on some new and cool features on <a href="http://live.gnome.org/OCRFeeder" target="_blank">OCRFeeder</a> for a while and now it is time to show them to the world in a new release.</p>
<p>These features I&#8217;m talking about fall mainly in 2 areas: improving the a11y of the UI and improving the recognition of documents.</p>
<p><strong>A11y Improvement</strong></p>
<p>The improvement of the a11y has the typical UI changes to include mnemonics, missing labels and relations, but also other approaches that have more to do with UX like using a progress dialog to inform users that time-taking operations are being carried. This means that now, the PDF importation and OCR won&#8217;t block the UI.<br />
Other changes in this category were the navigation through the content boxes (before, these could only be selected by clicking on them), the selection of all boxes and the deletion of selected boxes.</p>
<p>The following screenshot shows the box editor area of OCRFeeder with its mnemonics highlighted:</p>
<div id="attachment_508" class="wp-caption aligncenter" style="width: 160px"><a href="http://www.joaquimrocha.com/wp-content/uploads/2010/07/ocrfeeder_a11y1.png"><img class="size-medium wp-image-508" title="ocrfeeder_a11y" src="http://www.joaquimrocha.com/wp-content/uploads/2010/07/ocrfeeder_a11y1-150x300.png" alt="Box edition area" width="150" height="300" /></a><p class="wp-caption-text">Box edition area</p></div>
<p><strong>Recognition Improvements</strong></p>
<p>Sometimes, text columns are so close to each other that they end up being recognized as a single paragraph, so I added a post-detection method to solve this issue. This feature is optional and can be toggled from the Preferences dialog.</p>
<p>Here&#8217;s an example of the difference it makes:</p>
<div id="attachment_503" class="wp-caption aligncenter" style="width: 229px"><a href="http://www.joaquimrocha.com/wp-content/uploads/2010/07/ocrfeeder_no_columns1.png"><img class="size-medium wp-image-503" title="ocrfeeder_no_columns" src="http://www.joaquimrocha.com/wp-content/uploads/2010/07/ocrfeeder_no_columns1-219x300.png" alt="Before columns' detection improvements" width="219" height="300" /></a><p class="wp-caption-text">Before columns&#39; detection improvements</p></div>
<div id="attachment_504" class="wp-caption aligncenter" style="width: 230px"><a href="http://www.joaquimrocha.com/wp-content/uploads/2010/07/ocrfeeder_columns1.png"><img class="size-medium wp-image-504" title="ocrfeeder_columns" src="http://www.joaquimrocha.com/wp-content/uploads/2010/07/ocrfeeder_columns1-220x300.png" alt="After columns' detection improvements" width="220" height="300" /></a><p class="wp-caption-text">After columns&#39; detection improvements</p></div>
<p>Scanned document images are usually skewed and this makes it more difficult for the contents to be successfully detected and &#8220;OCRed&#8221;. I decided to implement an algorithm to deskew these images. The algorithm uses the <a href="http://en.wikipedia.org/wiki/Hough_transform" target="_blank">Hough transform</a> to try to find lines in the image and their angles and, while it is a bit slow, it works well:</p>
<div id="attachment_509" class="wp-caption aligncenter" style="width: 201px"><a href="http://www.joaquimrocha.com/wp-content/uploads/2010/07/ocrfeeder_skewed1.png"><img class="size-medium wp-image-509" title="ocrfeeder_skewed" src="http://www.joaquimrocha.com/wp-content/uploads/2010/07/ocrfeeder_skewed1-191x300.png" alt="Skewed image" width="191" height="300" /></a><p class="wp-caption-text">Skewed image</p></div>
<div id="attachment_510" class="wp-caption aligncenter" style="width: 201px"><a href="http://www.joaquimrocha.com/wp-content/uploads/2010/07/ocrfeeder_deskewed1.png"><img class="size-medium wp-image-510" title="ocrfeeder_deskewed" src="http://www.joaquimrocha.com/wp-content/uploads/2010/07/ocrfeeder_deskewed1-191x300.png" alt="Deskewed image" width="191" height="300" /></a><p class="wp-caption-text">Deskewed image</p></div>
<p>This action can be used in a loaded image but can also be configured to be automatically performed before the images are added. The Unpaper tool can now also be set to be clean images before adding them.<br />
This makes it much easier to successfully recognize images obtained from a scanner device.</p>
<p>Some fine tunning of the content boxes&#8217; bounds was done by trying to shorten their margins, that is, lowering the distance between the boxes and their actual contents.</p>
<p>The font size recognition was also tweaked to solve the problem of having paragraphs with initials (you know, the huge starting characters) which were influencing the whole paragraphs&#8217; font size.</p>
<p>To finish the recognition&#8217;s improvements, I have added an optional action to find and fix the text&#8217;s line breaks. Usually, OCR engines don&#8217;t consider &#8220;semantic line-breaks&#8221;, that is, OCR engines always insert a newline in the end of each line.<br />
Using some regular expressions, I try to find these &#8220;fake&#8221; line-breaks and recover the original flow of the text. Like some of the features mentioned above, this one can also be turned on/off from the Preferences dialog.</p>
<p>Here&#8217;s how the Preferences dialog looks like now:</p>
<p><a href="http://www.joaquimrocha.com/wp-content/uploads/2010/07/Preferences_dialog1.png"><img class="aligncenter size-medium wp-image-511" title="Preferences_dialog" src="http://www.joaquimrocha.com/wp-content/uploads/2010/07/Preferences_dialog1-263x300.png" alt="Preferences_dialog" width="263" height="300" /></a></p>
<p><a href="http://www.joaquimrocha.com/wp-content/uploads/2010/07/Preferences_dialog_recognition1.png"><img class="aligncenter size-medium wp-image-512" title="Preferences_dialog_recognition" src="http://www.joaquimrocha.com/wp-content/uploads/2010/07/Preferences_dialog_recognition1-263x300.png" alt="Preferences_dialog_recognition" width="263" height="300" /></a></p>
<p>To finish, images can now be dragged and dropped onto the pages&#8217; area and the mouse wheel can be used to scroll horizontally combining it with the Shift key, thanks to Stefan Löffler, and of course, several bugs were corrected and code was improved.</p>
<p>As you see, this is a &#8220;rich&#8221; new version of OCRFeeder that keeps being the easiest way to use OCR in a desktop. You are welcome to file bugs in <a href="bugzilla.gnome.org/" target="_blank">bugzilla</a> or to send patches and features&#8217; requests to its <a href="http://mail.gnome.org/mailman/listinfo/ocrfeeder-list" target="_blank">mailing list</a> or approaching me if you&#8217;re in <a href="http://www.guadec.org" target="_blank">GUADEC</a>.</p>
<p>Download: <a href="http://ftp.gnome.org/pub/GNOME/sources/ocrfeeder/0.7" target="_blank">OCRFeeder 0.7 tarball on GNOME FTP</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.joaquimrocha.com/2010/07/30/one-more-step-in-ocr-with-ocrfeeder-0-7/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>GUADEC ES, a good beginning for GUADEC</title>
		<link>http://www.joaquimrocha.com/2010/07/24/guadec-es-a-good-beginning-for-guadec/</link>
		<comments>http://www.joaquimrocha.com/2010/07/24/guadec-es-a-good-beginning-for-guadec/#comments</comments>
		<pubDate>Sat, 24 Jul 2010 22:38:00 +0000</pubDate>
		<dc:creator>Joaquim Rocha</dc:creator>
				<category><![CDATA[a coruña]]></category>
		<category><![CDATA[events]]></category>
		<category><![CDATA[gnome]]></category>
		<category><![CDATA[guadec]]></category>
		<category><![CDATA[ocrfeeder]]></category>
		<category><![CDATA[planet]]></category>

		<guid isPermaLink="false">http://www.joaquimrocha.com/?p=455</guid>
		<description><![CDATA[Yesterday was the last day of the 7th edition of GUADEC Hispana, originally to be organized in Chile but due to the disastrous earthquake, it was moved to the city of Corunna, Spain.
Between hacking on OCRFeeder (expect a new version soon), giving a talk about it, attending nice presentations and chatting with people, I had [...]]]></description>
			<content:encoded><![CDATA[<p>Yesterday was the last day of the 7th edition of <a href="http://2010.guadec.es" target="_blank">GUADEC Hispana</a>, originally to be organized in Chile but due to the <a href="http://en.wikipedia.org/wiki/2010_Chile_earthquake" target="_blank">disastrous earthquake</a>, it was moved to the city of <a href="http://en.wikipedia.org/wiki/A_Coru%C3%B1a" target="_blank">Corunna</a>, Spain.</p>
<p>Between hacking on <a href="http://live.gnome.org/OCRFeeder" target="_blank">OCRFeeder</a> (expect a new version soon), giving a talk about it, attending nice presentations and chatting with people, I had a great time.<br />
Diego&#8217;s <a href="http://people.gnome.org/~diegoe/" target="_blank">presentation about Epiphany</a> was simply epic and <a href="http://blogs.igalia.com/mario" arget="_blank">Mario</a> gave a very complete crash course of git.</p>
<p>I guess there&#8217;s a first time for these things but Thursday, while I was giving a demo of the new OCRFeeder&#8217;s features, it crashed on me&#8230; Never again will I laugh at Mr. Gates and friends when their products freeze out of the blue (nah, it is too funny).<br />
Now that I think of it&#8230; was this the first time a Portuguese man gave a talk at GUADEC Hispana?</p>
<p>The presentation was a cut-down version of the one <a href="http://www.joaquimrocha.com/2010/02/09/fosdem-follow-up/" target="_blank">I gave at FOSDEM</a> this year and you can check its slides below (it&#8217;s in Spanish):</p>
<div style="width:425px" id="__ss_4822465"><strong style="display:block;margin:12px 0 4px"><a href="http://www.slideshare.net/j_rocha/ocrfeeder" title="Ocrfeeder">Ocrfeeder</a></strong><object id="__sse4822465" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=ocrfeeder-100723053038-phpapp01&#038;stripped_title=ocrfeeder" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed name="__sse4822465" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=ocrfeeder-100723053038-phpapp01&#038;stripped_title=ocrfeeder" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object>
<div style="padding:5px 0 12px">View more <a href="http://www.slideshare.net/">presentations</a> from <a href="http://www.slideshare.net/j_rocha">Joaquim Rocha</a>.</div>
</div>
<p>(thanks to <a href="http://blogs.igalia.com/mrego/" target="_blank">Manuel Rego</a> for reviewing my Spanish in the slides)</p>
<p>Here&#8217;s the group photo of the GUADEC ES attendants:</p>
<p><img alt="" src="http://farm5.static.flickr.com/4081/4822344878_d030dd0057.jpg" title="GUADEC ES 2010 group photo" class="alignnone" width="500" height="333" /></p>
<p>And from next Monday on, I&#8217;ll be in Den Haag for <a href="http://www.guadec.org" target="_blank">GUADEC 2010</a>. My lightening talk about the <a href="http://www.joaquimrocha.com/2010/03/03/text-prediction-on-gnome/" target="_blank">Predictor Input Method</a> got accepted, so if you&#8217;re into this kind of stuff, I hope to see you there.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.joaquimrocha.com/2010/07/24/guadec-es-a-good-beginning-for-guadec/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SeriesFinale 0.6.1 released</title>
		<link>http://www.joaquimrocha.com/2010/07/19/seriesfinale-0-6-1-released/</link>
		<comments>http://www.joaquimrocha.com/2010/07/19/seriesfinale-0-6-1-released/#comments</comments>
		<pubDate>Mon, 19 Jul 2010 10:05:37 +0000</pubDate>
		<dc:creator>Joaquim Rocha</dc:creator>
				<category><![CDATA[gnome]]></category>
		<category><![CDATA[gtk]]></category>
		<category><![CDATA[hildon]]></category>
		<category><![CDATA[maemo]]></category>
		<category><![CDATA[python]]></category>
		<category><![CDATA[seriesfinale]]></category>

		<guid isPermaLink="false">http://www.joaquimrocha.com/?p=436</guid>
		<description><![CDATA[It&#8217;s been a while now since I released the last version of SeriesFinale.
The truth is that I&#8217;ve been busier than usual these days and of course, this is reflected on pet projects.
As some of you may have experienced, there was a kind of a nasty bug in SeriesFinale&#8217;s last version: it wouldn&#8217;t update certain shows [...]]]></description>
			<content:encoded><![CDATA[<p>It&#8217;s been a while now since I released the last version of <a href="http://maemo.org/downloads/product/Maemo5/seriesfinale/">SeriesFinale</a>.<br />
The truth is that I&#8217;ve been busier than usual these days and of course, this is reflected on pet projects.</p>
<p>As some of you may have experienced, there was a kind of a nasty bug in SeriesFinale&#8217;s last version: it wouldn&#8217;t update certain shows (when they had been added long ago)&#8230; and the good news is that this is one of the things that got fixed in this new version.</p>
<p>One of the good things <a href="http://blogs.igalia.com/jasuarez/">Juan</a> introduced for this version is how the next episodes to be watched are shown. Before, the episodes were shown according to their &#8220;first aired&#8221; date and in case of the same date for two or more episodes, the highest index one would be marked as the one to be watched. In this <a href="http://maemo.org/packages/package_instance/view/fremantle_extras-devel_free_armel/seriesfinale/0.6.1-1/">0.6.1</a> version, the episodes are shown according to their number and season, so, if episode #3 has the same air date as episode #4, #3 will always be shown as the next one to be watched.</p>
<p>It is now using a priority queue to download the series&#8217; covers and info that gives priority to the info. This means you won&#8217;t have to wait for the info AND covers to download when you hit the Update All menu but instead wait only for the info; the covers will then be downloaded in the background while you use the app normally.</p>
<p>I&#8217;m now introducing the new Russian translation, which Misha Ketslah had kindly sent to me a while ago but that I hadn&#8217;t had the time to integrate.</p>
<p>Here&#8217;s the list of major changes for this new version:</p>
<p>* Add Russian translation (thanks to Misha Ketslah)<br />
* Fix updating of shows<br />
* Use a priority queue to differentiate the downloads of covers or series&#8217; info<br />
* Use only one AsyncWorker at most to deal with the series<br />
* Prevent the download and usage of images to generate problems<br />
* Fix showing next episode<br />
* Add TheTVDB credits<br />
* Add THANKS file</p>
<p>So&#8230; what about that GNOME version, you ask? I&#8217;ve already started to port it to GNOME but couldn&#8217;t dedicate much time to it and taking into account that I&#8217;ll be on vacation very soon, it&#8217;s likely it will take a little longer. But I&#8217;m looking forward to use SeriesFinale on GNOME!</p>
<p>As for the N900 owners, I&#8217;ve just promoted the package to <a href="http://maemo.org/packages/package_instance/view/fremantle_extras-testing_free_armel/seriesfinale/0.6.1-1/">Extras-Testing</a> so either use the Extras-Devel repo as usual or wait ~10 days for it to appear in Extras.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.joaquimrocha.com/2010/07/19/seriesfinale-0-6-1-released/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
	</channel>
</rss>
