OcrGui Manual 0.1

This manual is for OcrGui 0.2.1

Contents

Introduction
How to install OcrGui
Quick start
The main window
Saving multiple texts
The preferences window

Introduction

OcrGui is a G.U.I. (Graphic User Interface) for O.C.R. (Optical Character Recongnition). This program will help you to extract text from scanned images. It is developed in C language using GLib and GTK+ frameworks and supports two open source OCR engines:

OcrGui also provides a spell check using Hunspell, an open source spell checker.


How to install OcrGui

Open a terminal and change to the directory in which the file ocrgui-0.2.1.tar.gz was saved, then type the following commands:
  1. tar -xvf ocrgui-0.2.1.tar.gz
  2. cd ocrgui-0.2.1
  3. ./configure
  4. make
  5. make install (run with root privileges)

Quick start

  1. Run OcrGui clicking on the menu entry or typing ocrgui in a terminal window
  2. Click on Open an image file
  3. Select the image file to open
  4. Open preferences window: File → Preferences
  5. Select a dictionary for Tesseract (OCR engine)
  6. Select a dictionary for Hunspell (spellcheck program)
  7. Click on Extract text from image. Text will apper on left panel
  8. Click on Spellcheck with Hunspell to check the text
  9. Hunspell searches similar words in dictionary. If Hunspell doesn't find anything, or the found words are, inserting a new word is possible

  10. Click on Save text in a file

The main window

Run OcrGui clicking on the menu entry or typing ocrgui in a terminal window.
  1. Main toolbar
  2. Image toolbar
  3. Text toolbar
  4. List of opened images
  5. Image panel
  6. Text panel
The most left panel (4) shows all opened images.

To open an image, select File → Open or click on the Open an image button in the main toolbar (1). OcrGui permits to open more than one image at the same time. In the figure below the files text1.jpg and text2.jpg were opened.
To select an image, double click on its icon. It is possible to select one or more images, duoble clicking using Ctrl key. Once an image is selected, it can be closed or processed.

To close an image, select File → Close or just click on the Close button in the main toolbar (1).

To process an image, select Image → Recognition or click on the Extract text button in the text toolbar (2).

To check the extracted text, select Text → Spell check or click on the Spell check button in the text toolbar (3).

To save the text in a file, select Text → Save or click on the Save text button in the text toolbar (3).

Saving multiple texts

It is possible to save more than extracted text in the same file:
  1. Open two or more images
  2. Extract the texts, selecting one image at a time
  3. Select all images in the list, clicking on icons using Ctrl key
  4. Select Text → Save multiple text or click on Save button in the main toolbar (1)

The preferences window

To open the preferences window, select File → Preferences.