Skip to main content

GImageReader: A Free and Accurate OCR Software

Would you like to extract text from a scanned image of a document? Or wish you didn’t have to retype text from a PDF document? GImageReader, the free OCR (optical character recognition) software helps you does that easily. it’s a GUI frontend to Google’s Tesseract OCR, perhaps one of the most accurate open source OCR engines and can be considered as an open-source alternative to the professional OCR ABBYY FineReader.

Main Features of GImageReader:
  1. It supports popular languages such as English, Spanish, French, German, Japanese, Italian, Korean, etc.
  2. Supports JPEG, PNG, GIF, TIFF images and PDF files
  3. You can directly acquire source image from digital scanners.
  4. Supports spell checking,
 Here is how to install and use GImageReader for Windows.

1. First download and install Tesseract OCR with English language data at here (current version 3, 1.8Mb).

2. Then download and install GImageReader (16Mb) from here. After installation run it. A configuration window will display.


3. In the configuration option, the field ‘Directory containing tesseract’ must be selected automatically, or enter the path C – Program Files – Tesseract-OCR (Windows 7).

4. In the field ‘Directory containing dictionaries’ enter the path C – Program Files – Tesseract-OCR – tessdata and apply the settings. [If problem, right click tessdata, select properties, copy, and paste the location path]

Now run the program select a scanned image or PDF document, select an area that you wan to extract text by dragging and click on ‘Recognize Selection’ button. That’s it.

Note: For spell checking, download spell checker dictionaries from OpenOffice and extract the files to: C – Program Files - gImageReader – Spelling Dictionaries (Windows 7). For best results, the resolution of source image should be between 200 dpi and 300 dpi for normal, 10-12 pt text.

Similar post: Google Doc as OCR

Comments

  1. The specific versions of OpenOffice spell checking dictionaries that are required by gImageReader are no longer available from OpenOffice. HOW CAN I OBTAIN THEM SO I CAN STILL USE gImageReader???????

    ReplyDelete

Post a Comment

Please leave your valuable comment below

Popular posts from this blog

Surf the Web Anonymously With Firefox Add-on Phproxy

There are several web based proxy servers available to surf the internet anonymously or as from another country. The Firefox add-on (Firefox 3 – 4) Phzilla helps you view a webpage or surf the internet using the PHProxy (a type of web based proxy server) proxy servers. It is very easy and convenient to use.

Restore Lost Capacity Of Your USB Flash Drive (How to)

Some malware can hide full capacity of your USB flash drive. For example, a 4GB pen drive sometimes shows only 500kb or less. An interesting part of this situation is that, even after removing the malware or formatting the USB flash drive, you will not get back its original capacity. Therefore, the question here is how to restore a USB flash drive to its full capacity.

PaperBus-free & fast web proxy solution for anonymous internet surfing

There are several free proxy solutions available for downloading (we had covered few of them in the previous posts), but from my personal experience most of them are very slow in my country.If you are looking for a free, fast and reliable proxy solution for anonymous surfing, Here is a new multi-platform application, PaperBus (ad-supported) which lets you surf anonymously and bypass internet filters.Paper bus (brought you by Open Terrace Ltd the same company that made commercial proxy service Freedur) is very easy to use. Simply install, and run. No registration required. The only down side is there will be an ad web browser tab popping up in every twenty minutes while you are using it.Another interesting feature of PaperBus is that you can create a list of websites you don't want to surf through PaperBus.PaperBus is compatible with Windows, Mac and Linux systems. Download appropriate version from here. (via)