Regarding my enrollment in the SAPO Summerbits program, I created a media filter for DSpace called OCR4DSpace.
With this media filter you are able to submit document images (scanned documents) and be able to search for their contents without having to fill them manually at submission time. The image contents are read by the OCR engines you have on your system.
The media filter is really simple to configure and use and hopefully will make some people’s life easier!
The web page is currently only in Portuguese but will be translated to English soon but rest your soul, the README is included in English.
To checkout the code, all you need is to run:
svn co svn://svn.softwarelivre.sapo.pt/ocrd/trunk/OCR4DSpace
Read the README file carefully and enjoy the automation that Optical Character Recognition can do for you!
Portugal needs more events like these, I hope next year more companies will join SAPO and Associação Ensino Livre and bring up the second edition of Summerbits with more projects!
So, today I continued my work in my SAPO Summerbits project. It consists in developing an OCR plugin for DSpace.
It’s been a while since I touched JAVA, the last time I did something with it was for BluePad which has a much simpler code-base.
Anyway, I don’t think I am a little rust on JAVA, it’s just that if you spend a while doing code in Python, there is some things you just take for granted. I mean, taking a look back at the JAVA API docs (good docs, BTW) just make me think: “Why the hell so much stuff for such a simple task (say, reading from a text file)!?”
But anyway, it is GOOD to switch programming languages once in while so you don’t stick too much to some stuff. One of the things I like doing most is *learning*, so I am looking forward to write more code in C once I get time, maybe learn C++ as well.
The new InstallBuilder – version 5.4.2 – was just released last Friday. It’s the first release of IB since I’ve been working for BitRock and only now I understand how powerful this piece of software is.
Also, JRuby stack was also released. Now you can be ready to develop using Ruby on top of JAVA in a few clicks.