As has become usual every couple of weeks or so, I released a new version of OCRFeeder!
This is version 0.6.1 and the main changes this time are:
* Now you can increase or decrease the zoom using Ctrl+Mouse wheel. This kind of shortcut is well known in many GNOME applications and even I was missing it;
* Warning dialogs are now shown when something went wrong while opening an image;
* Fixed encoding problem when reading non-ASCII characters;
* Fixed error when configuring a new engine;
* Improved Debian package’s files (thanks to Alberto Garcia)
* Fixed zoom issues (sometimes the allowed zoom would not be consistent among tries);
It was a good week on OCRFeeder’s bug tracker, specially thanks to user Hank who reported important problems.
I am really glad about how OCRFeeder is turning out and I expect to make it even better with the help of its users, either by sending suggestions, reporting bugs or simply by using it you will be helping the project.
You can download OCRFeeder 0.6.1 tarball from GNOME FTP or optionally download a Debian package directly.
I was disappointed with the text completion provided by the N900 (eZiText) that, on top of that, is closed and I wondered if it was possible to have an Open Source solution to provide text prediction and completion.
I searched a bit and besides my original intentions of developing a library to search Free and Open Source dictionaries’ words from a prefix, I found Presage.
Presage is better than most text prediction systems I have seen out there because it really is text prediction, not text completion. This C++ library, retrieves words taking into account the surrounding text, not only the prefix or frequency of words. It uses a database representing N-grams that can be trained with more text; the more you train it, the more accurate it can be.
This means that is you type something like: “I m”
instead of suggesting nonsense things like: “I mouse” “I mother” “I market” or “I more“
it suggests something more like: “I must” “I met” “I mean” or “I might“
The difference is obvious!
So I developed a little wrapper around Presage in C that provides a yet very basic API to get text completion. Then I created a GTK+ Input Method context to control the user’s input in regular GTK+ text widgets and used the wrapper to process the inputted text. I called it: Predictor Input Method (not very original I know…).
The result is that Predictor suggests you words, even if you type a prefix or not, and lets you accept the candidate word or scroll through a list of suggestions as you can see in the video below:
Ctrl+Enter -> Selects the current candidate Ctrl+Up/Down -> Scrolls through the list of candidates Backspace -> Deletes the character previous to the cursor and suggests again Directional arrows -> Move cursor and discard suggestions
Who should use it
This kind of assistance technology can have many applications but the main ones are: the usage in small/mobile devices and the assistance of users with disabilities. Both have the same reasons behind: speeding the input and reducing failed characters, because the input required gets minimized;
Of course, you can as well use it in your GNOME desktop regularly for faster typing your emails, etc.
In the case of users with disabilities, a popup menu could be added to show a complete list of candidates and the bound fast-access keys.
Why is Free Software important in this
This is the kind of technology that everybody should have an interest in using a FOSS solution because of the obvious advantage that is developers from all over the world being able to modify it.
Suppose you’re creating a mobile phone and you choose a closed solution to provide text prediction for your phone. And then you find out you’re disappointing all your users from country X because that library you’re paying for does not support their language and the library owner is not interested that much in adding it. Now if you’re using an open solution, local communities from many places in the world can add support for their languages and your phone can have a better acceptance in places you hadn’t even imagined.
Software that reaches an international audience with different languages is software you want to have open.
If you are not using GTK+ Input Methods then you can use the wrapper text-predictor.cpp which is not tight to the Input Method code itself. And of course, you can copy the little tricks used on the Input Method code and apply it to your source (like delaying the retrieval of the candidates some fractions of a second to not block the input, etc.).
As promised, I’ve added the original air dates of the episodes to SeriesFinale.
This feature had been requested since people first knew of SeriesFinale and it is quite useful. It will tell you, by the color and actual date information, when a show’s episode was first aired in its network. Of course if you happen to watch a show on another network (usually this happens if you’re not from the same country as the TV show itself), then you won’t need this date to warn you to go sit and watch it but at least you see which episodes were already aired on their original country.
This release has not many changes apart from the date thing plus a couple of bug fixed and the Spanish translation (thanks to Juan A. Suárez Romero).
(By the way, as I write this, TheTVDB seems to be down so you are likely to have some trouble updating your shows and getting the air dates on it but I hope it will be up again soon)
FOSDEM was really nice this year. Still too many interesting presentations to attend than our physical condition allows but that’s life.
Like I announced on my last post, I gave two presentations there and I am glad with both of them. People seemed really interested in OCRFeeder and I hope they try it out, send me feedback and spread the word about it.
I could personally meet P. Christeas, who had send me a patch for it, and listen to the questions and suggestions of people about how OCRFeeder works.
I must say the most impressive presentation I attended was by Professor Andrew Tanenbaum himself, about MINIX 3, what a beautiful piece of software it seems.
If you have not attended it, maybe you can watch the video recording once it is available.
Later on I had a nice chat with him regarding web browsers on MINIX and the real portability of applications that are said to be multi-platform.
I have been working for the past months in Hildon Input Methods (HIM) and many things have changed in it (and many will) with only a target in mind: make the experience of typing in N900, physically or virtually, a great experience.
Still, one thing I dislike in HIM is the fact that it is semi-closed source. Now what’s this semi-closed source thingy? It means that some modules are open, others are closed (HIM is a complex project).
(this is not the project logo)
For the open ones, and have you failed noticing it, the sad truth was that they were updated from time to time. No open development was done… but this has changed!
Since last week, HIM’s open source modules are now developed “in the open”, using Gitorious (thanks to Kimmo)!
The modules are hildon-input-method and hildon-input-method-framework.
This constitutes another step of freedom inside Fremantle and I hope you enjoy it as much as I do!