
GImageReader: A Free and Accurate OCR Software

Would you like to extract text from a scanned image of a document? Or do you wish you didn’t have to retype text from a PDF document? GImageReader, a free OCR (optical character recognition) program, helps you do that easily. It is a GUI front end to Google’s Tesseract OCR, perhaps one of the most accurate open-source OCR engines, and can be considered an open-source alternative to the commercial OCR product ABBYY FineReader.

Main Features of GImageReader:
  1. It supports popular languages such as English, Spanish, French, German, Japanese, Italian, Korean, etc.
  2. It supports JPEG, PNG, GIF, and TIFF images as well as PDF files.
  3. You can acquire source images directly from digital scanners.
  4. It supports spell checking.

Here is how to install and use GImageReader on Windows.

1. First, download and install Tesseract OCR with English language data from here (current version 3, 1.8 MB).

2. Then download and install GImageReader (16 MB) from here. After installation, run it. A configuration window will appear.


3. In the configuration window, the field ‘Directory containing tesseract’ should be filled in automatically. If it is not, enter the path C:\Program Files\Tesseract-OCR (Windows 7).

4. In the field ‘Directory containing dictionaries’, enter the path C:\Program Files\Tesseract-OCR\tessdata and apply the settings. [If this does not work, right-click the tessdata folder, select Properties, copy the location path, and paste it into the field.]
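If the configuration window rejects the paths, it can help to confirm that the folders from steps 3 and 4 actually exist. The following is a minimal Python sketch, not part of GImageReader. The function name `check_tesseract_setup` is hypothetical, and it assumes the English data file is named `eng.traineddata` inside `tessdata`, which is the usual Tesseract 3 layout but may differ on your installation.

```python
from pathlib import Path


def check_tesseract_setup(tesseract_dir, lang="eng"):
    """Report whether the Tesseract folder, its tessdata subfolder,
    and the language data file are present.

    Assumption: language data is stored as <lang>.traineddata
    inside the tessdata folder.
    """
    base = Path(tesseract_dir)
    tessdata = base / "tessdata"
    lang_file = tessdata / f"{lang}.traineddata"
    return {
        "tesseract_dir_exists": base.is_dir(),
        "tessdata_dir_exists": tessdata.is_dir(),
        "language_data_exists": lang_file.is_file(),
    }
```

For example, calling `check_tesseract_setup(r"C:\Program Files\Tesseract-OCR")` returns a dictionary of three booleans. Any `False` value points to the folder or file that needs fixing before the paths are entered in the configuration window.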

Now run the program, open a scanned image or PDF document, drag to select the area you want to extract text from, and click the ‘Recognize Selection’ button. That’s it.
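The same recognition can also be run without the GUI, through the Tesseract command-line tool installed in step 1. The sketch below is an illustration only and is not necessarily how GImageReader calls the engine. It assumes the `tesseract` executable is on your PATH, and the helper names `build_tesseract_command` and `run_ocr` are made up for this example. Unlike the GUI, this processes the whole image file rather than a selected area.

```python
import shutil
import subprocess


def build_tesseract_command(image_path, output_base, lang="eng"):
    """Build the command line: tesseract <image> <output base> -l <lang>."""
    return ["tesseract", str(image_path), str(output_base), "-l", lang]


def run_ocr(image_path, output_base, lang="eng"):
    """Run Tesseract on one image and return the name of the text file.

    Assumption: the tesseract executable can be found on PATH.
    Tesseract appends .txt to the output base name.
    """
    if shutil.which("tesseract") is None:
        raise FileNotFoundError("tesseract executable not found on PATH")
    subprocess.run(build_tesseract_command(image_path, output_base, lang), check=True)
    return f"{output_base}.txt"
```

Calling `run_ocr("scan.png", "result")` would write the recognized text to `result.txt`, where `scan.png` stands for any scanned image of your own.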

Note: For spell checking, download the spell checker dictionaries from OpenOffice and extract the files to C:\Program Files\gImageReader\Spelling Dictionaries (Windows 7). For best results, the resolution of the source image should be between 200 dpi and 300 dpi for normal 10-12 pt text.
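To judge whether an existing scan falls within the 200 to 300 dpi range, divide its width in pixels by the width of the paper in inches. Here is a small Python sketch of that arithmetic. The function names are made up for this example, and it assumes you know the physical size of the page that was scanned.

```python
def estimate_dpi(pixels, inches):
    """Estimate scan resolution as pixels divided by physical inches."""
    if inches <= 0:
        raise ValueError("inches must be positive")
    return pixels / inches


def in_recommended_range(dpi, low=200, high=300):
    """Check a resolution against the 200-300 dpi range suggested above."""
    return low <= dpi <= high
```

For example, a US Letter page is 8.5 inches wide, so a scan that is 2550 pixels wide works out to 300 dpi, while one that is 1275 pixels wide is only 150 dpi and would be worth rescanning at a higher setting.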

Similar post: Google Doc as OCR

Comments

  1. The specific versions of OpenOffice spell checking dictionaries that are required by gImageReader are no longer available from OpenOffice. HOW CAN I OBTAIN THEM SO I CAN STILL USE gImageReader???????

