
GImageReader: A Free and Accurate OCR Software

Would you like to extract text from a scanned image of a document? Or do you wish you didn’t have to retype text from a PDF document? GImageReader, a free OCR (optical character recognition) program, helps you do that easily. It’s a GUI frontend to Google’s Tesseract OCR, perhaps one of the most accurate open-source OCR engines, and can be considered an open-source alternative to the professional OCR software ABBYY FineReader.

Main Features of GImageReader:
  1. Supports popular languages such as English, Spanish, French, German, Japanese, Italian, Korean, etc.
  2. Supports JPEG, PNG, GIF and TIFF images, as well as PDF files.
  3. Can acquire source images directly from digital scanners.
  4. Supports spell checking.

Here is how to install and use GImageReader on Windows.

1. First, download and install Tesseract OCR with the English language data from here (current version 3, 1.8 MB).

2. Then download and install GImageReader (16 MB) from here. After installation, run it. A configuration window will appear.


3. In the configuration window, the field ‘Directory containing tesseract’ should be filled in automatically; if it is not, enter the path C – Program Files – Tesseract-OCR (Windows 7).

4. In the field ‘Directory containing dictionaries’, enter the path C – Program Files – Tesseract-OCR – tessdata and apply the settings. [If this does not work, right-click the tessdata folder, select Properties, then copy the location path and paste it into the field.]
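
If the configuration window rejects a path, it can help to confirm that the folders from steps 3 and 4 actually exist. The following is a minimal Python sketch, not part of GImageReader. The function name check_tesseract_dirs is made up for illustration, the install path is the Windows 7 default mentioned above, and it assumes language data files are named like eng.traineddata, which may differ between Tesseract versions.

```python
from pathlib import Path


def check_tesseract_dirs(install_dir):
    """Return (tessdata_dir, languages) for a Tesseract install directory.

    Raises FileNotFoundError if the install directory or its
    tessdata subfolder is missing.
    """
    install = Path(install_dir)
    tessdata = install / "tessdata"
    if not install.is_dir():
        raise FileNotFoundError(f"Tesseract directory not found: {install}")
    if not tessdata.is_dir():
        raise FileNotFoundError(f"tessdata folder not found: {tessdata}")
    # Assumption: each language ships as <lang>.traineddata in tessdata.
    languages = sorted(p.stem for p in tessdata.glob("*.traineddata"))
    return tessdata, languages


if __name__ == "__main__":
    # Assumed default location (Windows 7); adjust for your system.
    try:
        folder, langs = check_tesseract_dirs(r"C:\Program Files\Tesseract-OCR")
        print("tessdata folder:", folder)
        print("language data found:", langs)
    except FileNotFoundError as err:
        print(err)
```

If the script reports a missing folder, Tesseract was probably installed somewhere else, and that location is what belongs in the two configuration fields.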

Now run the program, open a scanned image or PDF document, drag to select the area you want to extract text from, and click the ‘Recognize Selection’ button. That’s it.
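
Since GImageReader is a frontend to Tesseract, roughly the same recognition can be started without the GUI. The sketch below is only an illustration, under several assumptions: the tesseract executable is on your PATH, the file names scan.png and result are placeholders, and the helper names are made up here. Unlike ‘Recognize Selection’, it processes the whole image rather than a selected area.

```python
import subprocess


def build_tesseract_command(image_path, output_base, lang="eng"):
    """Build the command line: tesseract <image> <outputbase> -l <lang>."""
    return ["tesseract", str(image_path), str(output_base), "-l", lang]


def recognize(image_path, output_base, lang="eng"):
    """Run Tesseract and return the name of the text file it writes."""
    # check=True raises CalledProcessError if Tesseract reports a failure.
    subprocess.run(build_tesseract_command(image_path, output_base, lang), check=True)
    # Tesseract appends .txt to the output base name.
    return f"{output_base}.txt"


if __name__ == "__main__":
    # Placeholder file names; requires Tesseract to be installed and on PATH.
    print(recognize("scan.png", "result"))
```

For picking out just part of a page, the GUI remains the easier route.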

Note: For spell checking, download spell checker dictionaries from OpenOffice and extract the files to C – Program Files – gImageReader – Spelling Dictionaries (Windows 7). For best results, the resolution of the source image should be between 200 dpi and 300 dpi for normal 10-12 pt text.
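
To check whether a scan falls in that 200 to 300 dpi range, divide its width in pixels by the width of the paper in inches. Here is a small Python sketch of that arithmetic. The function names are made up for illustration, and the page widths are example values (8.5 inches is US Letter).

```python
def effective_dpi(pixels, inches):
    """Resolution of a scan: pixels along one side divided by that side in inches."""
    return pixels / inches


def in_recommended_range(dpi, low=200, high=300):
    """True if the resolution is within the range suggested in the note above."""
    return low <= dpi <= high


if __name__ == "__main__":
    # Example: a US Letter page (8.5 inches wide) scanned at two different sizes.
    for width_px in (2550, 1275):
        dpi = effective_dpi(width_px, 8.5)
        print(width_px, "px wide:", dpi, "dpi, recommended:", in_recommended_range(dpi))
```

If the result is well below 200 dpi, rescanning at a higher resolution is likely to help more than enlarging the existing image.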

Similar post: Google Doc as OCR

Comments

  1. The specific versions of OpenOffice spell checking dictionaries that are required by gImageReader are no longer available from OpenOffice. HOW CAN I OBTAIN THEM SO I CAN STILL USE gImageReader???????

