Beruflich Dokumente
Kultur Dokumente
Reference Manual
eFlow 4.5
www.TopImageSystems.com
Copyright Integra Run Time User Guide
This manual and the software described in it are furnished under license, and may be used or copied only
in accordance with the terms of such license. The content of this manual is provided for informational
use only, and is subject to change without notice. It should not in and of itself be construed as a
commitment by Top Image Systems Ltd, which assumes no responsibility of liability for any errors or
inaccuracies that may appear in this book.
The software that accompanies this manual is licensed for use by the Licensee only in strict accordance
with the software license agreement, which the Licensee should read carefully before commencing use of
the software. Except as permitted by the license, no part of this publication may be reproduced, stored in
retrieval system, or transmitted in any form of by any means, electronic, mechanical, recording, or
otherwise, without the prior written permission of Top Image Systems Ltd.
ii
eFLOW Run Time Modules Manual
Table of Contents
Chapter 1 Overview of eFLOW.................................................................................................... 1
The eFLOW System .................................................................................................................... 1
Chapter 2 Enterprise Manager .................................................................................................. 10
Overview................................................................................................................................... 10
Invoking and Using the Enterprise Manager ............................................................................ 11
Site Management....................................................................................................................... 12
Applications Management ........................................................................................................ 14
Security Management ............................................................................................................... 23
Statistics Settings ...................................................................................................................... 28
Licensing and Version Information .......................................................................................... 30
Chapter 3 Introduction to Module Activator............................................................................ 33
Overview................................................................................................................................... 33
Invoking the Controller............................................................................................................. 34
Chapter 4 The Controller ........................................................................................................... 35
About the Controller ................................................................................................................. 35
Controller Desktop Layout........................................................................................................ 36
Using the Menu Bar .................................................................................................................. 36
Using the Tool Bar.................................................................................................................... 39
Controller Options..................................................................................................................... 40
Mode Selection ......................................................................................................................... 42
Workload Mode ........................................................................................................................ 43
Workload Run-Time Module Information................................................................................ 43
Collections Mode ...................................................................................................................... 49
Stations Mode ........................................................................................................................... 52
Viewer Mode ............................................................................................................................ 53
Domain Mode ........................................................................................................................... 54
The Collection Viewer .............................................................................................................. 55
Using the Menu Bar .................................................................................................................. 55
Chapter 5 Input - Getting Started ............................................................................................. 59
Overview................................................................................................................................... 59
Features ..................................................................................................................................... 59
Activating the Portal Modules .................................................................................................. 59
Chapter 6 Scan Portal ................................................................................................................. 60
Chapter 7 File Portal................................................................................................................... 61
Overview................................................................................................................................... 61
Chapter 8 Page OCR................................................................................................................... 62
Overview................................................................................................................................... 62
Getting Started .......................................................................................................................... 62
Manual or Standby Processing.................................................................................................. 65
Chapter 9 Content Recognition - Processing ............................................................................ 66
Overview................................................................................................................................... 66
The Processing module ............................................................................................................. 66
Getting Started .......................................................................................................................... 67
Initiating the OCR Process........................................................................................................ 70
Chapter 10 Content Recognition - Form ID ............................................................................... 71
Overview................................................................................................................................... 71
Getting Started .......................................................................................................................... 72
FormID Toolbar ........................................................................................................................ 74
Initiating the Form Recognition Process................................................................................... 74
Chapter 11 Content Recognition - Manual Form ID ................................................................. 75
Overview................................................................................................................................... 75
Getting Started .......................................................................................................................... 76
iii
Copyright Integra Run Time User Guide
iv
eFLOW Run Time Modules Manual
All TiS product solutions plug into the eFLOW Unified Content Platform and take
advantages of the common services the platform provides.
1
Copyright Integra Run Time User Guide
Integra™
Captures data from all types of structured forms including paper, fax, image files, the
Internet and mobile devices. Integra automatically validates and delivers the data to the
target enterprise application. Integra’s exclusive advanced features deliver:
Greater operator productivity
Unprecedented character recognition and error detection
Powerful engines with an exclusive voting algorithm to achieve the most accurate
results
Faster and more efficient forms processing
Quick and simple “point and click” application design
Minimum human intervention means reduced labor costs and higher data integrity.
Freedom™
Reads, understands and processes semi-structured forms, such as incoming supplier
invoices, without the need to predefine the structure of all of the various forms. Freedom
captures, validates and delivers the data from the semi-structured form to the required
applications.
Freedom handles any type of semi-structured forms, including forms with
complicated tables and multi-page invoices
Freedom’s high-speed content recognition replaces the expensive manual process of
keying in data from the invoice
Freedom recognizes the semi-structured form and the fields according to keywords
and key terms situated anywhere on the page, such as Payment Date, Sum, Name
Offers unprecedented character recognition and error detection
Offers powerful “super engines” that vote to achieve the most accurate results
Offers faster and more efficient invoice processing
Delivers minimum human intervention to reduce labor costs and increased data
integrity:
Completion station for manually identifying field locations and field content
FreeCollect station for analyzing Completion data in order to update the
learning database.
Smart™
Smart performs fully-automatic classification of documents flowing into an organization.
Smart acts as a mailroom application, deciding which department, business unit or subject
each document belongs to, and subsequently, to whom the document should be directed.
Smart can classify incoming documents as invoices, change-of-address letters, and
purchase orders, or it can classify into more general categories, such as documents
belonging to the sales, finance or customer service department. These categories are called
classes.
2
Copyright Integra Run Time User Guide
Smart uses a technique called supervised learning to classify documents with little or no
prior knowledge about them. The software examines a limited sample of documents from
each class, and automatically learns the unique characteristics of that class. This process is
called training. From that point onwards, Smart can classify any incoming document into
the trained classes, with a high level of accuracy, and with no human intervention.
Once a decision is made to add stations to the system (for example, more content
recognition or content validation stations), the architectural design facilitates an increase in
computing and processing power. eFLOW and its products are fully scalable.
eFLOW’s flexible configuration and design allows for easy adaptation to the rapidly
changing needs of the computer market.
3
Copyright Integra Run Time User Guide
4
Copyright Integra Run Time User Guide
Content Validation
Completion
The Completion module receives data from the Processing
station. The exact data it receives depends upon the Visual
Designer definitions, but typically is invalid or unrecognizable
data. Unrecognized data is completed manually, with or without
the help of data dictionaries. The entire layout of the screen can
be customized for one or more stations.
Exceptions
Most forms have rules governing how various fields are filled in.
For example, one field may require digits only, while another
may require a valid address. Once these rules have been defined,
the operator cannot leave the data completion process until they
are obeyed. This can be a problem if an entry is illegible or
incomprehensible. The Exceptions module is designed to deal
with these cases. An Exception is a mark on the batch that allows
the operator to leave invalid data and advance to the next form.
The exceptions are reported later in the Exception Report. Batches
marked with exceptions will be sent to the Exceptions module.
Tile Validation
The Tile Validation module is used for mass validation and
decreasing the substitution errors of the recognition engines. The
module allows for the immediate viewing of recognition engine
results grouped in order by numeric and/or alpha characters. The
Tile module usually receives data produced by the Recognition
module, but can be applied anywhere in the system (using
Dynamic Workflow) according to the specific needs of each
project. Grouping the results by fields within flows, for example
by grouping machine-type and handprint fields separately, makes
it easier to spot false results and send them for correction in the
Completion module. Thus substitutions can be more quickly
eliminated.
Content Delivery
The Export module exports data to XML and to the common and
standard format, ASCII, and additionally exports both B/W and
color images. During run time, the Export module receives the
data, converts it to the defined format, and transfers it to the
export directory.
OCR Analyzer
The OCR Analyzer module calculates statistical data for Integra
form recognition. The statistics are prepared by comparing the
batch state after the Processing station phase (i.e. following
automatic recognition) with the batch state after the Export phase
(i.e. following human corrections). These statistics reflect the
success of the recognition (rejects and false positives) in terms of
several levels of recognition:
Form identification – FormOut! FeaturesID
Field recognition - the success of OCR per field
Character recognition - the success of OCR per character
5
Copyright Integra Run Time User Guide
6
Copyright Integra Run Time User Guide
Let us now look at a table with the basic objects and concepts of the design process:
7
Copyright Integra Run Time User Guide
The following diagram demonstrates the concepts of the logical parts (flows and forms) and
physical (EFI’s) parts of an Integra application. Notice that Flow 1 and Flow 3 use two
forms, while Flow 2 uses only a single form. Notice also that Form 2 is used in two flows,
both Flow 1 and Flow 3, and that Form 1 is associated with more than one EFI.
E F I1
E F I2 F o rm 1 F lo w 1
E F I4 F o rm 2
E F I4 F o rm 2
F lo w 2
E F I5 F o rm 3
E F I6 F o rm 4 F lo w 3
The form documents to be processed may be obtained from a number of different sources:
they may be scanned, faxed, or taken from computer image files.
Sample applications have been supplied with the system in the directory “…\sample
Applications”, for use as reference applications. For your own work and tests, however, you
should create other applications.
A batch is a set of forms (single or multiple pages) that are separated by a separator (a
specially designed page or a blank page). For example, if you want to group sets of 100
forms during input, then inserting a batch separator after such group will mark it as a batch.
A collection is the work unit, which is passed on to any station. A collection can be a batch
or subset of a batch and is defined in each station definition (in the DW designer).
8
Copyright Integra Run Time User Guide
Another way to understand the concepts of batches and separators is in terms of how they
are scanned in, as shown in the following diagram.
page
page
form
separator
page
page
form
form
separator
form batch
separator
form
form
form
separator
9
eFLOW Run Time Modules Manual
Overview
The Enterprise Manager manages all the top-level actions of the eFlow systems. Use the
Enterprise Manager to create, copy and delete entire applications, to set NT group and
user security, obtain licensing and version information or to simply switch between
applications. In the Visual Designer, the system integrator can define but not create an
application.
The Enterprise Manager performs several main tasks:
Allows you to create, copy, delete, duplicate and export applications
Sets the default application
Allows you to create and delete eFlow sites
Sets NT group and user security
Provides licensing information
Provides version information
Provides statistics setting per application
Sets autorun stations
The EDC provides the ability to manage any workflow from any location, as long as that
workflow’s application is contained within the same domain. Previously, remote access to a
local workflow was conditioned on specific security clearance to the server.
The eFlow domain has three tiers. At the top level is the EDC, which is defined during the
installation.
The second tier in the eFlow domain is the local eFlow application server. Because the
EDC functions as a universal domain server, it has the ability to distribute and synchronize
tasks to its subservient servers. For example, let’s assume an EDC is located the United
States, while the company receives TIFFs from its branch servers in India, Britain and,
South America. If the US and Britain receive and process the most data, while India and
South America receive significantly less, then the EDC will reroute and redistribute the
flow so that the US and Britain receive a greater portion of cache memory. At the same
time, the processing of tasks at all stations is seamlessly synchronized, meaning that the
eFlow domain appears to complete all eFlow tasks simultaneously.
The lowest and final tier in the domain is the client or worker level. Any operator, with
sufficient security permissions, may access the Enterprise Manager locally and receive
node updates in real time. This operator-admistrator can view the status of all nodes in the
eFlow domain, including the EDC itself.
10
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
Enterprise
Manager
Scope
Pane
Enterprise
Manager
Result
Pane
Status
Bar
11
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
Site Management
The same application can be used in different locations or sites in the eFlow system. The
Enterprise Manager allows you to add and delete sites.
Click the Sites Explorer tree node to display all the sites that exist in the system. To the
left, the tree will display the sites as tree nodes. To the right, the Enterprise Manager
Information Display Area will show the sites in list form:
Sites
Explorer
T
12
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
5. Click OK.
6. A new site will appear in the eFlow Sites Explorer tree and in the Sites
Information Display Area:
To delete a site:
1. Right-click the site name in the eFlow Site Explorer.
3. You will be prompted to confirm that you wish to delete the site.
13
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
Applications Management
The Enterprise Manager allows you to create, delete, duplicate, and export applications. It
also allows you to select which application will be the default application in the eFlow
system.
Note:
eFlow allows for several applications to run in the system at the same time.
However, for the system to work properly, a default application must be selected.
Click the Applications Explorer tree node to display all the applications that exist in the
system. To the left, the tree will display the applications as tree nodes. To the right, the
Enterprise Manager Information Display Area will show the applications in icon or list
form:
Enterprise
Manager
Explorer
Enterprise
Manager
Information
Display Area
14
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
Applications
Explorer
Tree
15
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
Choose… To…
6.
Figure: Select archive file window
6. Import security info 1. Click to mark this check box if you want to
import all the roles, permissions and system
users’ definitions from the existing CAB file.
5. When you have finished making your selection, click Open. You will return to the New
Application Installation window.
6. Click the Next button to continue.
7. Select the application primary server .
8. Click the Next button to continue. The application is ready to be installed.
16
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
A status bar will provide information regarding the current status of the application creation.
17
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
10. A new application will appear in the Application Explorer tree and in the Applications
Information Display Area:
18
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
Stop an Application
1. To stop an application, first select the application in the Application Explorer.
.
2. Click the right-mouse button to obtain the application drop-down menu.
OR
Click the Stop button in the Enterprise Manager toolbar.
OR
Choose Stop from the Application menu.
19
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
4. Select the primary node that you want the station to run on.
9. To run autorun stations, in the Application Tree, select the application in which you
defined the autorun stations, right click and select Enable Autorun. The autorun
stations will begin running in the background.
20
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
Note: In order to run a station as autorun, you must have selected the following
station properties in the Visual Designer Object Inspector: StandBy and Allow
Unattended (checkboxes must be checked).
Delete an Application
Note: This is an irreversible procedure! Make sure that you really want to delete an
application before you take this step.
3. Select the Delete option. You will be prompted to confirm that you wish to delete
the application:
Duplicate an Application
1. To duplicate an application, first select the application in the Application Explorer.
2. Click the right-mouse button to obtain the application drop-down menu.
3. Select the Duplicate option. The Duplicate Application Wizard will open.
Choose… To…
1. Application Name 1. Enter a name for the application
2. Set as default 2. Click to mark this check box if you
21
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
Choose… To…
application wish to set this application as the
default application.
3. Duplicate users 3. Click to mark this check box if you
security info want to duplicate all the roles,
permissions and system users’
definitions from the existing CAB file.
5. Click OK.
Export To CAB
1. To export a new application’s definitions to a CAB (compressed) file, first select the
application in the Application Explorer
2. Click the right-mouse button to obtain the application drop-down menu.
3. Select the Export option.
OR
Choose Export from the Application menu.
OR
Click the Export current application button on the toolbar.
4. The Export Application window will open. You will be prompted to select a name
for the application’s CAB file:
5. Enter the path and name for the file and click Save. The system will produce a CAB
file for the application.
22
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
Security Management
eFlow allows you to define exactly which operator workstations, services, and roles will be
accessible throughout the application.
In the Enterprise Manager, you add users or groups from your network. You then use the
Visual Designer to define roles – administrator, supervisor, operator, etc. – and specify
which functions and stations can be accessed by each role. The last stage is to map NT
users from the Enterprise Manager to roles defined in the Visual designer.
By default every user has all permissions. However, as soon as you define an administrator
of a domain or an application, all other users lose all of their permissions. The administrator
must assign permissions to users and groups.
If you want to create a user group, which is different from those that already exist in your
domain or locally, you must first create the group using Windows Administrative Tools.
23
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
4. Click Add. The name is added to the list in the box at the bottom of the dialog box.
5. Click OK. The new users or groups are added to the workspace.
6. In the drop-down menu on the right of the workspace, select a role for each user or
group listed.
24
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
Only those users defined as domain administrators can perform domain actions.
4. Type a name of a user in the text box or select a name from the list.
5. Click Add. The name is added to the list in the box at the bottom of the dialog box.
6. Click OK. The new user with its role is added under the NT Users/groups node.
25
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
The first step is to create a role for each group of users in your project. You can then assign
permissions for different project elements, by adding the relevant roles under the object
representing that element in the Application Tree.
3. A new object appears under Roles. Select it and define the relevant properties in
the Object Inspector pane on the right.
4. Add new permissions to the role. For each flow and station the options are Execute,
Read, and Write.
26
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
Note: eFlow core checks permissions defined for Flows, Stations and certain eFlow services,
such as the WorkFlow engine.
If it is necessary to check permissions for Forms, Pages, Field Groups, and Fields, these
permissions should be set in the Visual Designer, and the customization code that will check
these permissions should be used.
27
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
Statistics Settings
Statistics are generated per application. You must first define the statistics settings for the application.
You can then generate and view the statistics using a statistics reporting tool of your choice.
28
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
7. In the Cleanup Settings tab, you can change set the application to purge statistics
data after a given number of days, or perform a manual cleanup of statistics data
immediately.
29
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
30
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
The License Id, issue date, expiration date and features (name, quantity allowed and
quantity used) are available for viewing in the list view format.
The License Feature for a station is in the form ef[StationName]. A license is assigned to each
engine with a character-per-second (CPS) limit, so the engine license name contains this acronym.
Engine licenses come in the form OCR_CPS_[EngineName].
31
eFLOW Run Time Modules Manual Chapter 2 Enterprise Manager
Note: device name and device Id are not filled, as this view is not associated with a
specific device.
There are two types of license currently available: concurrent and total.
The Concurrent Counters value provides the number of simultaneous logins allowed for the particular
module or engine. For example, for efDesigner (the Visual Designer), five different users may
concurrently access the module at different computer stations. This limitation does not refer to the total
number of users, but rather to users simultaneously logged in.
All OCR engine license checking is done using a “supported characters-per-second” value provided for
each engine. The total required license must include a sum of all the CPS-s used by all OCR stations for
each OCR engine type. The OCR engine licenses also use concurrent counters.
Total Usage Counters (sometimes called “per click” counters) refer to the maximum number of actions
allowed. Currently, only the “PageCount” functionality is supported. It represents the number of pages
that may be processed by the system.
32
eFLOW Run Time Modules Manual
Overview
The Module Activator is eFlow’s central module. It is the interface to the run-time part of
the system and is responsible for linking all its various components. Through the Module
Activator, you can activate several eFlow modules, according to defined permissions, thus
controlling the flow of data in the system.
The module types that you can launch from the Module Activator include all runtime
stations defined in the Workflow Designer. In addition, it is possible to launch the
following modules:
Visual Designer
Freedom Designer
Smart Designer
Controller
From the Windows Programs > TiS > eFlow menu, click the Module Activator option.
The Module Activator window appears:
33
eFLOW Run Time Modules Manual Chapter 3 Introduction to Module Activator
The Module Activator main window features a module panel. The module panel allows
activation of any of the available eFlow modules according to the user’s permissions. Use
the Controller to display information about the current session, and all system information.
34
eFLOW Run Time Modules Manual
When you open the Controller and select any of the stations from the workflow, you will
automatically see a display of the list of collections associated with the station, and their
status. This is the Workload display mode. You can also view the status of all the
collections in the system, and the status of active stations.
The Controller is essentially a production line manager. The Controller receives and displays
online information about the status of the system. It can be used not only to monitor system
activity but also to detect problematic bottlenecks, and solve them.
35
eFLOW Run Time Modules Manual Chapter 4 The Controller
Tool Bar
Mode
Selecti
Mode-
Dependent
Display
Collection
or Station
Details
Status
Bar
36
eFLOW Run Time Modules Manual Chapter 4 The Controller
37
eFLOW Run Time Modules Manual Chapter 4 The Controller
38
eFLOW Run Time Modules Manual Chapter 4 The Controller
To find out the name of the button, place the cursor anywhere on the button. A
tool tip appears below it.
39
eFLOW Run Time Modules Manual Chapter 4 The Controller
Controller Options
Controller settings define primary module, station, system and collection workload
settings. To change the Controller options, select the Options menu > Configure options.
The Controller Options window appears:
The Queue Thresholds Tab allows the user to select the maximum number of forms that may
be held “waiting” in the system per station:
40
eFLOW Run Time Modules Manual Chapter 4 The Controller
The Session Thresholds Tab allows you specify a minimum activity rate for each of the
run-time modules in a session:
41
eFLOW Run Time Modules Manual Chapter 4 The Controller
Min Speed
Use the edit box to specify the minimum activity speed expected for per module (rate units are module-
specific: for example, forms per minute, typed characters per minute). If work proceeds at a rate slower
than the specified threshold rate, the Controller will display the problem in the Stations View of the
Controller.
For example, if the rate is lower than a pre-defined threshold such as the quota of 50 pages scanned per
minute, the rate bar is displayed in red - telling the Controller that there has been a slow-down at one of
the scanning stations. For more information about the Stations View, see page 44.
Mode Selection
The left side of the Controller Desktop contains a display mode selection panel. Select the desired
display mode by clicking one of the icons on this panel. The following five display modes are available:
42
eFLOW Run Time Modules Manual Chapter 4 The Controller
Workload Mode
The default display mode is the Workload mode, which offers a general view of the
system. This view allows you to quickly detect potential problems in the system.
Workflow Diagram
The upper part of the screen provides a general overview of the system in the form of a
graphical chart of the system architecture: the workflow.
The number of collections and forms awaiting processing is indicated in numbers beneath the
run-time module. The number of collections appears on the left and the number of forms,
separated by a slash, appears on the right.
Underneath the run-time module, a load bar indicates whether the run-time module is working (white) or
paused (red).
43
eFLOW Run Time Modules Manual Chapter 4 The Controller
44
eFLOW Run Time Modules Manual Chapter 4 The Controller
Note: You cannot drag and drop a collection that has been split to another
module. (When a collection is split, forms are separated into different collections
that are individually processed/undergo separate Completion stages.)
You may not drag and drop a file from the Portals to another module, or from
another module to the Portals. The formats are not compatible. Files must
undergo Portal conversion before they become collections.
You cannot drag and drop locked collections, that is collections that are
currently running.
A drop-down mouse menu is available in the collections table via a right mouse click:
Figure:
Right Mouse Menu
The right-mouse menu allows you to quickly access the following commands: (Note that they are also
available through the toolbar and menu bar.)
Priorities
eFlow allows you to raise/restore the priority level of each collection in the system from the
dialog obtained by right-clicking on the collection or via the menu options.
45
eFLOW Run Time Modules Manual Chapter 4 The Controller
Find Collections
eFlow allows you to search for collections on the system by means of the full or partial
collection name. To find a collection, click on the following icon:
Find Collections Button
Filters
eFlow allows you to define certain parameters for the list of collection items as well as the
meta tags that are to be displayed in the Collections Table. To define these filters, click on
one of the following icons:
Collections Filter Button
The Collection filter allows you to define a filter for the list of collections that are
displayed. If no filter is used, ALL the collections will be displayed. If a filter is used, only
the collections that meet the criteria defined will be displayed.
In the Collection Filter dialog box, you can define the filter using an expression with the
following syntax: <name of the meta-tag> <logical operator> <value>. For example:
NumberOfForms = 4.
If more than one condition is required, each of them should be enclosed in parentheses and
AND or OR should be used to connect them. For example: (NumberOfForms> = 4) AND
(AnyBadPageOrForm = 0).
Notes:
Press CTRL + Space to open the drop-down list of meta-tags instead of
entering them manually. After selecting an item, press ENTER.
String values should be double quoted.
Boolean values should be 1 or 0 (1=true; 0=false)
Use this button to delete all active filters from the system.
46
eFLOW Run Time Modules Manual Chapter 4 The Controller
You can define which meta tag column will be displayed in the Collections table. The
default mode is when ALL meta tags are displayed. Meta tags that are removed will not be
visible in the table.
To modify the meta tag column, click on the Modify Collection Columns button.
To add a meta tag from the list, select the meta-tag from the Available Meta tags list and
then click the Add button. The meta tag now appears on the Visible Meta tags list. Click
OK. The meta tag is now displayed in the Collection table.
You may change the sequencing of visible meta tags in the collections table by selecting
from the Visible MetaTags list and using the up and down arrow buttons (to the right of the
list) to move the selected MetaTag to the desired position.
Click the Reset button to revert back to the original lists before changes were made.
47
eFLOW Run Time Modules Manual Chapter 4 The Controller
Each line in the Stations View represents one logged-in instance of the selected run-time module. The
table contains the following columns:
Status - The status of the current station. The status icon may be one of the following:
Green - this position is currently working.
Yellow - this position is currently idle. In case of a manual (typist) position, this status tells the
Controller that this typist is not working.
Red - Not-responding. This indication is typically used in case of a computerized position (not
typist), and indicates a program or computer malfunction.
User – The name of the user logged-in on this station.
Machine Name - The name of the computer.
Station Name - eFlow station name.
Quality – The quality of work in the station. For example: quality of recognition in Processing station
(recognized pages percentage).
Speed – The speed of the collection at the station.
Current PPM – Current rate of pages per minute at the station.
Average PPM – Average rate of pages per minute at the station.
You may click on a column title to sort the table by this column.
48
eFLOW Run Time Modules Manual Chapter 4 The Controller
Collections Mode
This mode shows the entire system from the perspective of the processed collections. It
shows all the collections currently in the system, and allows the Controller to check the
status of each collection.
You can filter the collection data by specific values via the drop-down list in each column heading, and
using the (Custom) option in the drop-down list you can define a customized filter for the data:
49
eFLOW Run Time Modules Manual Chapter 4 The Controller
Collection Parameters
Priority - a red flag in this column means that this collection has high priority, and will be processed
before non-priority collections.
Collection – Unique ID of the collection.
Flow - Name and icon of the flow to which the collection belongs. If a specific flow was selected in the
Flow Selection combo-box, then only this flow will be displayed in the table.
Status - The status of the collection. May be one of the following:
In process: the collection is currently being processed by one of the workstations. The name of the
workstation appears next to the icon.
Ready: the collection is ready to be processed. It will be processed when some position will pick it.
On Hold: The collection is placed on hold and must be released before it can be processed further.
Status may also be undetermined.
Time in System - the elapsed time since this collection entered the system. If this time is larger than a
pre-defined threshold, this number is displayed in red.
Meta Tags – Information about the collection and specific fields in the collection; for example,
DateCreated, AnyBadPages, and MissingTotal.
50
eFLOW Run Time Modules Manual Chapter 4 The Controller
51
eFLOW Run Time Modules Manual Chapter 4 The Controller
Stations Mode
This mode shows the entire system from the workers (typists) point-of-view. It allows the Controller to
monitor the performance of all typists.
You may sort the table by each of the columns by clicking on the column header.
52
eFLOW Run Time Modules Manual Chapter 4 The Controller
A drop-down mouse menu is available in the collections table via a right mouse
click:
The right-mouse menu allows you to quickly access the following commands:
Viewer Mode
Viewer Mode allows the user to view overall system usage and progress in large display. You can use
this view to immediately spot problems with station thresholds. For example, if not enough stations are
working to perform FormID and Processing, the Viewer Mode display will indicate this.
53
eFLOW Run Time Modules Manual Chapter 4 The Controller
Domain Mode
Domain Mode allows the user to view overall status of the applications on the eFLOW
domain:
54
eFLOW Run Time Modules Manual Chapter 4 The Controller
55
eFLOW Run Time Modules Manual Chapter 4 The Controller
Rotate right.
Rotate left.
56
eFLOW Run Time Modules Manual Chapter 4 The Controller
The Toolbar
You can locate collection images, enlarge or shrink an image for viewing efficiency,
and print the image by using the following buttons.
57
eFLOW Run Time Modules Manual Chapter 4 The Controller
58
eFLOW Run Time Modules Manual
Overview
eFLOW Portals provide data input to the system from two possible sources. Using the
Scan Portal, you can scan forms. Use the File Portal to select previously scanned forms
from existing image files.
Note: If you choose Scan Portal input, make sure you are connected to a scanner
before you begin.
Features
eFlow Scan Portal module supports all ISIS-standard scanner types. It also supports
multipage TIF files, color scanning, grey scale scanning, and black and white.
The Module Activator icons indicate the type of input you can select:
Scanning
Select the desired input type and click the appropriate icon.
59
eFLOW Run Time Modules Manual
For full details on using the Scan Portal see the online Help for the Scan Portal. (The Scan
Portal has yet to be documented.)
60
eFLOW Run Time Modules Manual
Overview
The File Portal allows you to select existing image files as your input data. The process
involves choosing one or more files from the list of available image files in a directory and
sending them onwards to the next queue.
For full details on using the File Portal see the online Help for the File Portal. (The File
Portal has yet to be documented.)
61
eFLOW Run Time Modules Manual
Overview
The Page OCR station receives Collections and creates a PRD (page-recognition data) file
for each page in the TIF.
By using this station the user can separate the two processes done by the FreeMatch station:
creating PRD files and running Freedom scripts. When the PageOCR station is run before
the FreeMatch station, FreeMatch will not create PRDs for the collections, and will run the
Freedom script(s) immediately, using the PRDs created in the PageOCR station.
Getting Started
Activating the Page OCR module Main Window
The Page OCR station can be activated through the Module Activator only by a user who has
permission to access the Page OCR module. For more information on the Module Activator,
refer to Chapter 2 Enterprise Manager of this manual.
It can also run in Unattended mode.
To invoke the Page OCR module:
From the Module Activator, click the Page OCR module icon. The following window
appears:
Menu
Bar
Toolbar
Page OCR
Progress
Log
Original
Image
Viewer
Status
Bar
62
eFLOW Run Time Modules Manual Chapter 8 Page OCR
The maxworkunits parameter defined in the Design stage (Visual Designer) determines
how many collections are retrieved for processing in each “Run”.
During processing, the page undergoing processing in the TIF file is displayed (by default,
on the right side of the screen). In addition, you can navigate via the interactive progress
log on the left which is created during processing, by clicking on the desired page. (Clicking
on the entry for the entire collection causes the first page in the collection to be displayed.)
During a run, the Progress Log first indicates that “Collections were locked”, and displays
the current collection number and, for each page, the page number and recognition rate:
average of confidence level of all words in the PRD.
If there is an error, the Progress Log displays an error message in red. If a run is aborted,
the log displays “Collections were discarded”.
After processing of the last collection (as specified in MaxWorkUnits) is complete, the
Progress Log indicates that collections have been freed for subsequent processing
(“Collections were sent to next station”). The log is cleared for every new Run.
Button Purpose
File Use the File menu to
Run: get the next set of available collections for processing
Pause/Resume: Temporarily stop processing of the current
collection, and resume a paused session
Abort: Stop processing of the current set of collections and discard
any changes
Exit: exit the Page OCR module.
Edit Use the Edit menu – Clear Log Messages – to clear the Progress Log.
View Use the View menu to select which information is displayed in the
central working area. You may toggle display of the Report of the
Processing progress, which will appear to the left, and the Image
File being processed, which will appear to the right.
63
eFLOW Run Time Modules Manual Chapter 8 Page OCR
Button Purpose
Options Use the Options menu to:
Help Use this command to open online help for the station.
Use this option to return the set of collections to the Page OCR
queue untouched. As in Pause, the process is stopped only
after it finishes a page.
64
eFLOW Run Time Modules Manual Chapter 8 Page OCR
Manual Operation
If the Options > StandBy menu option is not checked, the OCR process can be initiated
manually in one of two ways:
After the set of collections is processed, the station is idle until you issue a new Run
command to request a new set from the station queue.
Automatic Operation
Processing can also be invoked automatically.
If you set the Process > StandBy menu option, Page OCR continually polls to see if there are
collections in the queue, and will fetch a set of collections and start processing it as soon as
it detects it in the station’s queue. See the StandByInterval propery specified in Visual
Designer regarding how often the queue is polled.
65
eFLOW Run Time Modules Manual
Overview
The Processing module can perform both form identification and field content Processing.
Although separate Integra modules exist to perform form identification, it is not necessary
to run them previous to imputing image files to the Processing module.
Image files may be received from an Enterprise Portal module, the FormID module, or the
Manual FormID module. After receiving the image files, the Processing module processes
them and then sends them together with the text results to the Completion queue or any other
queue specified in the workflow.
Processing Stages
Processing involves the following stages:
Form image enhancement
Form identification and removal
Field image enhancement
Field segmentation
Optical character recognition (OCR) and/or intelligent character recognition (ICR)
Voting for optimal results
66
eFLOW Run Time Modules Manual Chapter 9 Content Recognition - Processing
Getting Started
Activating the Processing module Main Window
The Processing window controls the operation of the image Processing engine. It can be
activated through the Module Activator only by a user who has permission to access the
Processing module. For more information on the Module Activator, refer to Chapter 3 of this
manual.
To invoke the Processing module:
From the Module Activator, click the Process module icon. The following window appears:
Menu
Bar
Toolbar
Recognition
Progress
Report
Original
Image
Viewer
Status
Bar
Figure: Processing Module Window
When you click the Run button, the Processing module automatically starts to process the first
batch that has “waiting” status in the Processing queue.
The Status panel
Left rectangle - Processing status.
Right rectangle –Time of day displayed (from the last time that the CPU was available).
Button Purpose
File Use the File menu to exit the Processing module.
Edit Use the Edit menu to Copy report information, Select All the report
information, or Clear the report information. These commands also
can be activated using standard Windows keyboard shortcuts.
Search Use the Search menu to Find specific information in the report, Find
Next (find the next identical piece of information in the report), or
Go to line (jump to a particular line in the log). These commands
also can be activated using standard Windows keyboard shortcuts.
67
eFLOW Run Time Modules Manual Chapter 9 Content Recognition - Processing
Button Purpose
View Use the View menu to select which information is displayed in the
central working area. You may toggle display of the report of the
Processing progress, which will appear to the left, and the image
file being processed, which will appear to the right part.
You may also choose to display the debug log, which appears in a
separate window.
Further, it is possible to select which report items will be shown:
Show All
Show Pages and Final Results
Show Pages only
Button Purpose
Process Use the Process menu to initiate the OCR process. Select Go to
start, Pause, Abort or Standby. These commands can also be
activated using the toolbar buttons.
The command Run can also be activated using the keyboard
shortcut F5. Pause is activated using the keyboard shortcut Shift-
F5.
Zoom Use the Zoom menu to select the clarification level at which you
will see images files displayed.
Options
Use the Option to:
Standby – enter/leave stand-by mode
Report font – select the report font
Reconstruct image – reconstruct the image file
Reset Statistics - reset the statistics displayed
View Log Information – opens the logger window
Debug Script - use this option to debug custom events
Help Use this command to open the Integra Processing online help.
68
eFLOW Run Time Modules Manual Chapter 9 Content Recognition - Processing
Processing Toolbar
To find out the function of the button, place the cursor anywhere on the button; a tool tip
appears below it.
The Processing module toolbar contains the following buttons:
69
eFLOW Run Time Modules Manual Chapter 9 Content Recognition - Processing
If the default workflow configuration is used, and there are no errors and no data needs to
be filled in manually, the batch continues on to the next queue, for example, the Export
queue. Otherwise (if some data needs to be filled in manually), the batch is sent to the
Completion queue.
If there is an error, the report and Status Bar display an error message. If you try to reprocess
the batch without correcting the error, the report will repeatedly display an error message
until the error is corrected.
Automatic Operation
Processing is also invoked automatically by making the appropriate definition in the Options
menu. If you set the Process menu, StandBy option,” Integra will fetch a batch and start the
Processing process for the batch as soon as it appears in the Content Processing queue.
70
eFLOW Run Time Modules Manual
Overview
The FormID module performs form identification only. It is not necessary to run the FormID
module separately, as form identification can also be accomplished by the Processing
module. However, in the case of extremely heavy work load or many problematic forms
which are difficult to identify, it may be useful to have forms identified first through the
FormID module, with unrecognized forms processed through the Manual FormID module, and
field contents then recognized through the Processing module.
Image files may be directly received from an Enterprise Portal module. After receiving the
image files, the FormID module rapidly identifies them and then sends them to the Processing
queue, or any other queue specified in the workflow.
71
eFLOW Run Time Modules Manual Chapter 10 Content Recognition - Form ID
Getting Started
Activating FormID
The FormID window controls the operation of the FormID module. It is activated through the
Module Activator only by a user who has permission to access the FormID module. For more
information about the Module Activator, refer to Chapter 3 of this manual.
Menu
Bar
Toolbar
Recognition
Progress
Report
Original
Image
Viewer
Status
Bar
Figure: FormID Module Window
72
eFLOW Run Time Modules Manual Chapter 10 Content Recognition - Form ID
When you click the Run button, the FormID module automatically starts to process the
first batch that has “waiting” status in the FormID queue.
Button Purpose
File Same as for Processing module.
Help Use this command to open the Integra FormID online help.
73
eFLOW Run Time Modules Manual Chapter 10 Content Recognition - Form ID
FormID Toolbar
The FormID module toolbar contains the following buttons:
If the default workflow configuration is used, and there are no errors and no data needs to
be filled in manually, the batch continues on to the Processing queue. Otherwise (if some
data needs to be filled in manually), the batch is sent to the Manual FormID queue.
If there is an error, the report and Status Bar display an error message. If you try to reprocess
the batch without correcting the error, the report will repeatedly display an error message
until the error is corrected.
Automatic Operation
FormID is also invoked automatically by making the appropriate definition in the Options
menu. If you set the Options menu StandBy option, eFlow will start FormID a batch as soon as
it appears in the FormID queue.
74
eFLOW Run Time Modules Manual
Overview
The Manual FormID module is used to manually perform form identification for those forms
whose scanned images may be difficult to read or cut off, and cannot be identified
automatically.
Forms may fail to be identified for a number of reasons. A particular form may have been
scanned slightly to the left or right, meaning that the placement of identifying fields is in
the wrong space. Or, one or more pages in a form may have poor scanning resolution.
The Manual FormID module offers useful tools to resolve these difficulties, quickly and
simply. Using the Manual FormID module, it is possible to in fact shift the location of all
identifying fields to compensate for shifted scanning. Or, if only one or more pages in a
form have scanned poorly, it is possible simply to associate a page correctly with the form
that it belongs to.
Image files are received from FormID module, or even the Processing module. After
receiving the image files, the Manual FormID module allows the user to use different tools
and viewers to identify the pages and forms. Identified forms may then be sent to the
Processing queue or any other queue specified in the workflow.
75
eFLOW Run Time Modules Manual Chapter 11 Content Recognition - Manual Form ID
Getting Started
Activating the Manual FormID module Main Window
The Manual FormID window controls the operation of the image Manual FormID module. It is
activated only through the Integra Module Activator by a user who has permission to access
the Manual FormID module. For more information on the Module Activator, refer to Chapter 3
of this manual.
Menu
Bar
Tool
Bar
Page
Information
Area
Default
Action
Button
Status
Bar
Figure: Manual FormID Module Window
76
eFLOW Run Time Modules Manual Chapter 11 Content Recognition - Manual Form ID
Send After you have identified the form, send the form back to
the next queue.
77
eFLOW Run Time Modules Manual Chapter 11 Content Recognition - Manual Form ID
Debug Events Use the Debug Events option to debug custom Events.
Disable Events Use the Disable Events option to disable all the custom
events
78
eFLOW Run Time Modules Manual Chapter 11 Content Recognition - Manual Form ID
After you have identified the form, send the form back to the next
queue.
Note: This button will automatically change its functionality to provide the
appropriate next step for each procedure in the manual form-ID process.
79
eFLOW Run Time Modules Manual Chapter 11 Content Recognition - Manual Form ID
Page
Information
Get/Send
Button
Figure: Current Form Pages Information Area
2. The first batch waiting in the queue will be fetched. In the Page Information area, a
listing of the pages in the first form in the sequence will appear.
3. The original image will appear in the window to the right, as in the following figure:
Original
Image
Viewer
80
eFLOW Run Time Modules Manual Chapter 11 Content Recognition - Manual Form ID
81
eFLOW Run Time Modules Manual Chapter 11 Content Recognition - Manual Form ID
You may move between pages by selecting them in the Page Processing Information Area
Page column. Simply click a page row to select it.
The Page Processing Information Area provides information about the pages in the form,
their order, and whether or not they have been identified.
82
eFLOW Run Time Modules Manual Chapter 11 Content Recognition - Manual Form ID
In the Page column, the image order of the pages within the current form’s image file is
provided.
In the Order column, the logical Integra order of the pages within the form is specified.
In the Master column, the EFI with which the page is identified is specified.
1. In the Page Processing Information Area, click the page entry that you want to
change.
2. Click once in the Order column to reveal the drop-down list box.
3. Select the correct page number for the page.
Note: If only the number 1 is displayed in the Page Order Column, there is only
one page in that form.
Although a form may be correctly identified, one or more of the form pages may have
failed identification. For example, pages 1 and 3 of a particular form may be identified
correctly, and associated with the right EFI. But page 2 may remain unidentified. Using
the Original Image Viewer you may determine which EFI is the correct one for the
unidentified page.
1. In the Page Processing Information Area, click the page entry that you want to
change.
2. Click once in the Master Form column to reveal the drop-down list box.
3. Select the appropriate EFI for the page.
83
eFLOW Run Time Modules Manual Chapter 11 Content Recognition - Manual Form ID
4. The EFI will be displayed in the EFI Viewer in the bottom left. If you have made an
incorrect EFI selection, simply reselect the appropriate EFI from the Master Form
drop-down list box.
84
eFLOW Run Time Modules Manual Chapter 11 Content Recognition - Manual Form ID
Errors
If there is an error in activity, the Status Bar displays an error message. If you try to
reprocess the batch without correcting the error, the report will repeatedly display an error
message until the error is corrected.
Correcting Field ROI Placement
When you examine the image of a page/form in the Original Image Viewer, you may find that
the image is right- or left-shifted, or that the fields and OCR field regions (ROI’s) are
misaligned. (For more information about ROI’s, please see the Application Visual Designer
manual). During scanning, it may be that the form was misplaced to the left or right of
center into the scanner feeder tray. This may be why the form could not be identified, and
could also cause problems later in identifying the contents of various fields, as the ROI’s
may not be correctly located.
The Manual FormID module offers an important feature to correct field placement errors.
1. Scroll through the image by using the scroll bars or place the cursor on the image
and hold the mouse button down. The cursor turns into a hand. Drag the hand in the
desired direction.
2. Select a field and move it to the correct location. All other fields will be moved at
the same time to the new location.
If the form’s ROI placement cannot be changed, than it probably has successfully been
AutoIDed. In this case, see the next section for information on how to override the
automatic identification settings.
1. Select the Options menu > Enable Modifying AutoID Pages selection.
2. You will be prompted to confirm that you wish to override the automatic
identification system.
3. You may scroll through the image by using the scroll bars or place the cursor on the
image and hold the mouse button down. The cursor turns into a hand. Drag the hand
in the desired direction.
4. Select a field and move it to the correct location. All other fields will be moved at
the same time to the new location.
5. To disable any further changes, select the Options menu > Enable Modifying AutoID
Pages selection again.
85
eFLOW Run Time Modules Manual Chapter 11 Content Recognition - Manual Form ID
Send
When you have finished identifying the pages, click the Send button, or select the File
menu > Send selection, to return the identified form to the next queue.
Save
Click the Save button, or select the File menu > Save selection, to save the identified
form.
Discard
Click the Discard button, or select the File menu > Discard selection, to return the form to
the Manual FormID queue.
86
eFLOW Run Time Modules Manual
Overview
The Free Match module performs form identification only. Image files may be directly
received from an Enterprise Portal modules or from the OCRPage station. After receiving the
image files, the Free Match module rapidly identifies them and then sends them to the
Recognition queue, or to the Manual Match station if fields are incomplete.
87
eFLOW Run Time Modules Manual Chapter 12 Free Match
Getting Started
Activating Free Match
The Free Match window controls the operation of the Free Match module. It is activated
through the Module Activator only by a user who has permission to access the Free Match
module.
To invoke the Free Match module:
From the Module Activator, click the Free Match module icon. The following window appears:
Menu
Toolbar
Bar
Recognition
Progress
Report
Original
Image
Viewer
Status
Bar
Figure: Free Match Module Window
88
eFLOW Run Time Modules Manual Chapter 12 Free Match
When you click the Run button, the Free Match module automatically starts to process
the first batch that has “waiting” status in the Free Match queue.
Button Purpose
File Use the File menu to initiate the OCR process. Select Run to
start, Pause or Abort the OCR process. These commands can
also be activated using the toolbar buttons.
The command Run can also be activated using the keyboard
shortcut F5. Pause is activated using the keyboard shortcut
Shift-F5.
Edit Use the Edit menu to Copy report information, Select All the
report information, or Clear Log the report information. These
commands also can be activated using standard Windows
keyboard shortcuts.
Search Use the Search menu to Find specific information in the
report, Find Next (find the next identical piece of information
in the report), or Go to line (jump to a particular line in the
log). These commands also can be activated using standard
Windows keyboard shortcuts.
View Use the View menu to select which information is displayed
in the central working area. You may toggle display of the
report of the Processing progress, which will appear to the
left, and the image file being processed, which will appear to
the right part.
Further, it is possible to select which report items will be
shown:
Show All
Show Pages and Final Results
Show Pages only
Zoom Use the Zoom menu to select the clarification level at which
you will see images files displayed.
Options Use the Option to select standby, the report font, reconstruct
an image file, reset the statistics displayed, view log
information, or display the Debug Script.
Debug Script
To invoke the Debug Script function, open the Options menu. This debugging option allows you to track
problems in Scripts attached to the Free Match module during the run time stage.
89
eFLOW Run Time Modules Manual Chapter 12 Free Match
From the Integra Free Match window, click the Run button
OR
From the menu bar, select File > Run.
OR
Press the keyboard shortcut F5.
If the default workflow configuration is used, and there are no errors and no data needs to
be filled in manually, the batch continues on to the Recognition queue. Otherwise (if some
data needs to be filled in manually), the batch is sent to the Manual Match queue.
If there is an error, the report and Status Bar display an error message. If you try to reprocess
the batch without correcting the error, the report will repeatedly display an error message
until the error is corrected.
Automatic Operation
Free Match is also invoked automatically by making the appropriate definition in the Options
menu. If you set the Options menu StandBy option, the Free Match will start processing a
batch as soon as it appears in the Free Match queue.
90
eFLOW Run Time Modules Manual
Overview
The Manual Match module is used to manually find different scripts that will match different
types of documents.
Documents will reach this station for a number of reasons. A particular document may have
been scanned slightly to the left or right, meaning that the Free Match could not identify
any of the fields on this image. Or, some mandatory fields were not recognized.
The Manual Match module offers useful tools to resolve these difficulties, quickly and
simply. Using the Manual Match module, it is possible to in fact relocate all identified fields
and locate unidentified fields. Or, if some mandatory fields were not identified, it is
possible to set these fields and let Freedom calculate the rest of the fields.
Image files are received from Free Match module. After receiving the image files, the Manual
Match module allows the user to use different tools and viewers to identify the pages and
forms. Identified forms may then be sent to the Recognition queue or any other queue
specified in the workflow.
91
eFLOW Run Time Modules Manual Chapter 13 Manual Match
Getting Started
Activating the Manual Match Module Main Window
The Manual Match window controls the operation of the image Manual Match module. It is
activated only through the eFLOW Module Activator by a user who has permission to access
the Manual Match module.
To invoke the Manual Match module:
From the Module Activator, click the Manual Match module icon. The following window
appears:
Menu Bar
Toolbar
Page
Information
Area
Defau
lt
Actio
Status
Bar
Figure: Manual Match Module Window
The Status panel
Left rectangle – Manual Match status.
Right rectangle –Time of day displayed (from the last time that the CPU was available).
Manual Match Menu Bar
Seven menus are accessed from the Menu Bar: File, View, Actions, Navigate, Image,
Options and Help.
92
eFLOW Run Time Modules Manual Chapter 13 Manual Match
93
eFLOW Run Time Modules Manual Chapter 13 Manual Match
94
eFLOW Run Time Modules Manual Chapter 13 Manual Match
Note: This button will automatically change its functionality to provide the
appropriate next step for each procedure in the Manual Match process.
Page
Information
Get/Send
Button
Figure: Current Document Pages Information Area
2. The first batch waiting in the queue will be fetched. In the Page Information area, a
listing of the pages in the first document in the sequence will appear.
95
eFLOW Run Time Modules Manual Chapter 13 Manual Match
3. The original image will appear in the window to the left, as in the following figure:
4. Using the Matching Fields Viewer at the right you may manually locate fields on the
document. See the next section for further information.
5. Using the information in the Page Processing Information Area, you can solve a
number of different problems. You may select the script to which the page belongs
to, correct the page order, and position fields on the image. See the following
sections for further information.
96
eFLOW Run Time Modules Manual Chapter 13 Manual Match
97
eFLOW Run Time Modules Manual Chapter 13 Manual Match
You may move between pages by selecting them in the Page Processing Information Area
Page column. Simply click a page row to select it.
The Page Processing Information Area provides information about the pages in the
document, their order, and whether or not they have been identified.
In the Page column, the image order of the pages within the current document’s image file
is provided.
In the Order column, the logical Freedom order of the pages within the document is
specified.
In the Master column, the Freedom script with which the page is specified.
98
eFLOW Run Time Modules Manual Chapter 13 Manual Match
1. In the Page Processing Information Area, click the page entry that you want to
change.
2. Click once in the Order column to reveal the drop-down list box.
3. Select the correct page number for the page.
Note: If only the number 1 is displayed in the Page Order Column, there is only
one page in that document.
Although a page may be correctly identified, one or more of the form pages may have
failed identification. For example, pages 1 and 3 of a particular document may be
identified correctly, and associated with the right script. But page 2 may remain
unidentified. Using the Original Image Viewer you may determine which script is the correct
one for the unidentified page.
1. In the Page Processing Information Area, click the page entry that you want to
change.
2. Click once in the Master Form column to reveal the drop-down list box.
3. The drop down list contains several options, as shown below:
99
eFLOW Run Time Modules Manual Chapter 13 Manual Match
If the field has unsuccessfully undergone Free Match identification, re-placement of the
field on the image is a very simple procedure, and you may adjust ROI placement as
necessary:
When you have finished identifying the pages, click the Send button, or select the File
menu > Send selection, to return the identified form to the next queue.
100
eFLOW Run Time Modules Manual Chapter 13 Manual Match
Save
Click the Save button, or select the File menu > Save selection, to save the identified
document.
Discard
Click the Discard button, or select the File menu > Discard selection, to return the
document to the Manual Match queue.
101
eFLOW Run Time Modules Manual
Overview
The Tile module is usually the station used before the Completion module to make the
Completion module more efficient. The Tile module collects all the characters that were
recognized by the system and offers useful tools to determine which characters were falsely
recognized, quickly and simply. Using the Tile module, it is possible to identify, which
characters are correct and which are not. The Tile module displays the characters in a tile
format, which allows the user to browse through each character and accept it or reject it. All
characters are grouped together to allow for easy identification.
102
eFLOW Run Time Modules Manual Chapter 14 Tile Module
Getting Started
Activating the Tile Module Main Window
The Tile window controls the operation of the image Tile module. It is activated only through
the Integra Module Activator by a user who has permission to access the Tile module. For
more information on the Module Activator, refer to of this manual.
Menu
Tool
Page
Information
Area
Status
Figure: Tile Module Window
103
eFLOW Run Time Modules Manual Chapter 14 Tile Module
Debug Events Use the Debug Events option to debug custom Events.
Disable Events Use the Disable Events option to disable all the custom
events
104
eFLOW Run Time Modules Manual Chapter 14 Tile Module
Tile Toolbar
The Tile module tool bar contains the following buttons:
After you have identified the batch, send the batch back to the next
queue.
Tells you which character you are currently viewing and which
page you are on.
105
eFLOW Run Time Modules Manual Chapter 14 Tile Module
NOTE: Once you have gotten a batch the button becomes inactive.
Get
Butt
Chara
cter
106
eFLOW Run Time Modules Manual Chapter 14 Tile Module
4. Navigation is also possible using the arrow keys on the keyboard. Use the Enter
button to left click and type any character to right click. Left click the character and
the background turns red. It is now marked so instead of a falsely recognized
character it will appear as an * (asterisk). This character is not connected to the
recognized character and will override recognition results.
Right click the character to edit it. See the section
5. Editing the Characters for more information.
Rejected
Character
Accepted
Character
107
eFLOW Run Time Modules Manual Chapter 14 Tile Module
display by 10%. Using the Zoom In button and the Zoom Out button in the
tool bar. Click the View button, then right click the character. The Image Viewer
window opens as shown below:
Image
Viewer
T
Character o
o
Character
In Form
Click the Save button, or select the File menu > Save selection, to save the identified
form.
Discard
Click the Discard button, or select the File menu > Discard selection, to discard the
batch.
108
eFLOW Run Time Modules Manual
Overview
The Completion module receives data produced by the Processing module. The
unrecognized data are then corrected and completed manually. The forms are displayed in
page mode only, where the typist sees a full page and can move from field to field in the
page.
This station is used in cases where the processing (and the operator) can anticipate which
data will be on a form, but the exact location on a page is not known. The input can then be
used for Learning in order to improve subsequent processing.
The administrator or application designer uses the Visual Designer’s Screen Builder to define
what data is brought to the user’s attention and how it is displayed.
109
eFLOW Run Time Modules Manual Chapter 15 Content Validation in Completion
Getting Started
110
eFLOW Run Time Modules Manual Chapter 15 Content Validation in Completion
In some cases, the user must correct immediately; in other cases, it is permitted to move
within a defined range of fields and all field in the range must be completed before the user
can leave the range and continue working on other fields.
In the data completion screen, the operator is shown all or part of the form and is given text
boxes in which to enter all or part of the form data, as appropriate.
The user can specify the correct data in several ways: for example, typing, selection from a
lookup table, or demarcation on the page image of a field region containing the required
text.
To start the Completion process on a batch in the Completion queue, click the Get
Batch button. (Alternatively, if the station is running in Automatic mode instead of
Standby mode, the batch is retrieved automatically.)
If the Completion queue is populated, the first page requiring completion appears in the
Completion main window:
Original
Form
Image
Recognized
Field Data
111
eFLOW Run Time Modules Manual Chapter 15 Content Validation in Completion
Note that in the Integra Completion module, you can work in both Field Group mode and
Page mode. In Completion, all processing is in Page mode only.
The station is designed to allow uninterrupted field completion. In some cases the user is
permitted to move through all fields on a page despite problematic fields, and return to
attempt to resolve the problems at a later time.
When the last field requiring completion has been filled in, the batch is automatically sent
(Put) into the workflow. To cancel a Put operation, click in the bottom right corner of the
station screen on the “Click here” label within several seconds of the action.
Data Indicators
The following summarizes color coding for various types of fields. The selection of specific
colors used is done in the Application Visual Designer.
Note that in this document reference is made to the default set of colors, as well as the
default set of keyboard shortcuts.
View
Table Display
Two display modes are provided for Table data:
• Grid Mode: data is displayed in a standard grid with no direct relation to the dimensions
of the table in the original form
• Table Layout Mode: the locations of the table elements in this mode match the layout of
the table in the original form.
112
eFLOW Run Time Modules Manual Chapter 15 Content Validation in Completion
Layouts
The following screen layouts are available via the pop-up menu on the right bottom of the
Completion screen:
• Tab layout: each group is displayed in a tab
• Other customized layouts as listed in the menu.
113
eFLOW Run Time Modules Manual Chapter 15 Content Validation in Completion
Data Fields
The following illustrates valid and invalid data fields:
Data Manipulation
Field Completion
In order to finalize and fill in missing form data, the typist must fill in correct data for any
unrecognized characters. Whenever an unrecognized character (*) appears in the data field,
a character must be filled in to replace it.
Some types of fields have an input mask defined for them: for example, ##.##.#### for a
date field. This mask constrains the format of the data that may be entered in a field.
The original scanned field contents appear in the Form Image in yellow, with a red frame
surrounding the field. The Form Image is provided to ensure that quick reference to the
original data is available.
The typist may use this original form information to read the field contents and fill in the
appropriate missing information.
Note that in this document reference is made to the default set of keyboard shortcuts.
114
eFLOW Run Time Modules Manual Chapter 15 Content Validation in Completion
115
eFLOW Run Time Modules Manual Chapter 15 Content Validation in Completion
ROI Selection
If the data that should be specified in a field can be found on the page, it is possible to
highlight it and eliminate the need for typing it in. This action has two results: (a) populates
the field and (b) creates an ROI (rectangular region-of-interest) that is now associated with
the field, to be used to improve future processing.
To select ROIs:
• click on a specific word, or
• Press/Drag/Release (as text is highlighted in a word-processing application).
Note that when selecting text with the <Ctrl> key pressed, you can select words from
different lines (the default selection selects information from a single line only.)
Marking Exceptions
Refer to the chapter on
116
eFLOW Run Time Modules Manual
Content Validation in Completion module for instructions on how to mark exceptions. The exception list
is opened via the icon:
Table Manupulation
The following operations are available for table completion:
• Delete Row – Delete the current row in a data table and its associated row on
the image
• Insert Row Above – Add a new row to a table above the current row
• Insert Row Below – Add a new row to a table below the current row
• Copy Row from Above – Copy the data in the row above to the current row.
To auto-recognize a table: Move focus to a field in the table, and click the tool
button.
117
eFLOW Run Time Modules Manual
Document Merge
Attachments
An attachment page accompanies a form but it cannot be recognized or modified as can
other pages in the form.
By pressing the document merge button ( ) you can perform manual document
merging, and mark pages as attachments.
Remove the New Document indication from what is incorrectly considered the first
page in a form, to specify that it should actually be a continuation of the previous form.
Batch Manipulations
After the fields in the batch have been completed, the batch can be returned or “put” to the
If you want to cancel a batch-manipulation action before it is committed, click the “Cancel”
button that appears on the right bottom corner of the Completion screen; this must be done
within a few seconds of the action.
Navigation
The operator typically moves sequentially from one field to the next on a page and from
one form to the next in a batch. However, the work mode can vary as well: see Skipping
Level and Special Cases below.
Navigation Operations
The keystrokes specified in this section are based on the default shortcut definitions. See
the end of this chapter for information on customizing the shortcut-key mappings.
118
eFLOW Run Time Modules Manual Chapter 15 Content Validation in Completion
Skipping Level
The navigation behavior may vary, depending on the definition of a maximum “skipping
level” for your station. The skipping level defines how much freedom you have to navigate
among the fields in a range such as a field group, or an entire batch:
• Field Level: the user may only leave a field once the field contains valid data.
• Group Level: the user may move among the fields in a group but cannot leave the group
until all the fields in the group have been validated.
• Form Level: the user may move among the fields in a form but
cannot leave the form until all the fields in the form have been validated.
• Batch Level: the user may move among all the fields in a batch.
Example:
In the case where Group-level skipping is defined and there is a group containing three
fields – Price, VAT and Total – in this order.
Price and Total are invalid and only VAT is a valid field.
When focus is in Price, pressing Next Invalid Field moves the operator to Total. If Next
Invalid Field is now pressed in the Total field, focus is moved to Price, as this is the next
field requiring attention in the group.
In several special cases, a field behaves as an invalid field, even though its data is valid and
it does not appear as an invalid field (i.e., there is no Error Provider indication near the
field).
• One-time Stop: this is a field with data that needs to be confirmed, so you must review it
at least once. After you have stopped at the field once, it is no longer considered a one-
time stop field.
• Stop Always: Fields of this type must be stopped at every time you pass them.
Navigation Pane
The forms in a batch, and the pages in each form, must be reviewed and validated in
sequence. Using the Forms Navigation Pane ( ) tool button, you can view and access
specific forms and pages in the batch. In order to navigate to a specific page or form simply
double click on it.
The navigation will be permitted according to the navigation rules and policy in your
application.
119
eFLOW Run Time Modules Manual Chapter 15 Content Validation in Completion
• Forms and pages that contain invalid fields are marked with
• Forms and pages that contain valid fields are marked with
120
eFLOW Run Time Modules Manual Chapter 15 Content Validation in Completion
Customization
Keyboard Shortcut Customization
To customize the keyboard shortcut mapping, open the Shortcut Editor via the tool button
( ):
You can select an entry to modify, specify a new shortcut-key combination, and click
Assign to save the new definition.
Press the Print button to print the full list of current shortcut definitions.
121
eFLOW Run Time Modules Manual Chapter 15 Content Validation in Completion
Put Batch
Discard Batch
Exception – opens the list of exceptions for setting an exception for the
current field
Shortcut Editor – displays the shortcuts and enables editing of the shortcuts
Fit Width – Fits the image to the image viewer width area.
Fit to Screen – Fits the image to the viewer based on the screens resolution
Fit Page – Fits the image to the viewer area so that the entire image is
displayed without scrolling (see Figure above).
Delete Row – Deletes the current row in a data table and its associated row
on the image
Insert Row Above – adds a new row to a table above the current row
Insert Row Below – adds a new row to a table below the current row
Copy Row from Above – copies the data in the row above to the current row
Manual Merge - opens a Set Attachment (form organize) window, so that you
can remove the New Document indication from the second page to include it
in the same form as the first page
122
eFLOW Run Time Modules Manual Chapter 15 Content Validation in Completion
123
eFLOW Run Time Modules Manual
The station can be run in one of the following two modes, as specified via Visual Designer:
Mode 1: In this mode, FreeCollect learns only from fully-recognized fields: i.e.,
where both the field data (contents) and the field rectangle (position) have been
determined: either automatically, without user itervention, or where the user has
demarcated the field rectangle and possibly also entered field data.
Mode 2: Here, FreeCollect attempts to learn from fields that are not fully
recognized: i.e., a rectangle was not demarcated for the field.
The station tries to match text from the recognized page image to the completion data
entered by the operator. The algorithm looks for close matchs, and not only a perfect
match, allowing for OCR inaccuracies and possible format variations (i.e Date &
Amount ).
When a single plausible match is identified: the rectangle on the matching word(s)
is added to the Learning database.
If the matching results are not definitive enough: this field is not used for
learning.
The following parameters can be set in Visual Designer for the FreeCollect station:
CommandParams Mode:1
Mode:2
(see above for descriptions)
124
eFLOW Run Time Modules Manual
Overview
The Export module converts captured data into ASCII, XML or database output, and stores
the images and indices in the systems export destinations.
In the Application Visual Designer, the system integrator defined the format that
translates the data from the internal representation used throughout eFlow into the format
required by the client. Settings in the Application Visual Designer delineate the Export
file destinations, Export file formats, and report destinations and types. It is possible to use
or override these settings. During runtime processing, the Export module receives the data,
converts it to the previously defined format, and transfers it for export.
Converts and exports data which has finished the Completion and Exception stages
Exports image files
Data export to user-definable ASCII, INI or XML format
Data export to remote database.
125
eFLOW Run Time Modules Manual Chapter 17 Content Delivery - Export
Installation
The Export module is installed automatically during the eFlow installation.
Menu Bar
Tool Bar
Channels
Activity
Area
Export
Status
Display
Status
Bar
Figure: Export main window
2. Set Export Options. To change the export options click the Configure
button or select the Options menu > Configure selection. It is important to set the
export options before running the export processes. For more information about
Export Options, see the next section.
3. Click the Run button on the toolbar to start the Export process, or select the
File menu > Run option, or press F5. The Export module automatically starts to
process the collections waiting in the Export queue.
4. If a channel is exporting, it will display a light-green icon in the Channels Activity
area. If the channel’s icon is a dark green light, the channel is in idle mode or has
finished transmitting. If the channel’s icon is a red light the channel has stopped
working.
5. Information about the export status of the collections appears in the Export Status
Display area.
126
eFLOW Run Time Modules Manual Chapter 17 Content Delivery - Export
6. To stop the export process, click the Stop button on the toolbar, or select the
File menu > Stop option, or simply press the Esc key.
Extensions are:
127
eFLOW Run Time Modules Manual Chapter 17 Content Delivery - Export
XML Files
The XML file format is designed to offer the maximum amount of information regarding the flow and
batch. The following actual XML file tree-structure outlines of the information available. A table listing
object names and definitions follows.
TisDynamicDatabase
Collection
TIS_00000005
Properties
BatchName
CreationTime
FlowType
FormsInBatch
PagesOfBatch
CollectionSource
ExportPassCount
MTFFile
Form
FormName
Properties
FormType
Page
PageName
Properties
PageId
MatchCode
IdentificationMethod
IsAttachment
Field
FieldName
Properties
Contents
OriginalContentsNumberOfCompletions
FieldTable
TableName
Properties
FieldArray
NewColumn
FieldName
Properties
Contents
OriginalContents
NumberOfCompletions
FileCreationTime
DateTime
GMTOffset
StandardName
128
eFLOW Run Time Modules Manual Chapter 17 Content Delivery - Export
Most of the XML entry names are self-explanatory. Further definitions for XML entries are listed
below:
Entry Description
NOTE: If a field has an exception, it is created as an entry under the field entry.
A field can have more then one exception, but each will be written on a different
line.
INI Files
The INI file format specifies that batch information is separated into sections named
[BatchDetails], which contain batch properties.
129
eFLOW Run Time Modules Manual Chapter 17 Content Delivery - Export
Similarly, form information is contained in separate sections. Each form information section is assigned
the name of the specific form it describes.
File Storage
The Export module creates temporary storage for each flow with a separate directory using the flow
name, created under a directory named “ExportData”. Under each flow name directory a sub
directory is created for each batch with the name of the batch, containing batch-exporting files, waiting
to be launched. After transmission this directory is deleted.
Permanent storage is designed in exactly the same way. At the destination, the storage tree
structure is identical to temporary storage, except that flow name directories are created
under the directory specified in the options screen. If no path is specified, then the directory
tree structure is created in the TiS temporary directory.
Database Structure
Data can be exported to Microsoft SQL server RDBMS via OLE DB/ADO mechanism.
Export Configuration
Export configuration defines export process display, debugging, and working conditions.
They also allow you to change the export destinations, overriding Application Visual
130
eFLOW Run Time Modules Manual Chapter 17 Content Delivery - Export
Note: To use the definitions set in the Application Visual Designer, click the Load
Defaults button. When finished, click the OK button to apply the default settings.
The Export Configuration window contains two tabs. In the General Tab the following settings
can be defined:
View
Select the number of rows of information to be shown in the status list of the Export main
window.
Export to Database
In order to export data to Ms SQL server, first click the Export to Database check box.
Before activating the export process for the first time, a new database should be created.
First define a connection string to the remote database computer: Click the “…” button to
open the Data Link Properties dialog box and set the connection parameters. We
recommend that you test connection in order to verify connection parameters. Press the
Test Connection button to activate testing.
After connection to the remote database computer was succesfully tested, create the
database by clicking the Create Database button. The following dialog box opens:
131
eFLOW Run Time Modules Manual Chapter 17 Content Delivery - Export
132
eFLOW Run Time Modules Manual Chapter 17 Content Delivery - Export
Flow
Selector
Path
Type in the destination path. Or use the file browser button to open a file browser,
with which you can locate the destination directory.
If you select an FTP server as the export files’ destination, you will be prompted to
provide further information:
133
eFLOW Run Time Modules Manual Chapter 17 Content Delivery - Export
Figure: TP Options
134
eFLOW Run Time Modules Manual Chapter 17 Content Delivery - Export
FTP Parameters:
FTP Server
Provide the name or IP address of the host you wish to connect to.
User ID
Specify the user ID to use when logging on to the remote FTP host.
Password
Provide the password you use to connect to the Host specified in HostName with the user name described
in property User ID.
Remote Dir
Provide the full path of the local directory that will be used as the download path.
Port
Specify the FTP connection port. The default setting is 21. If you are not sure, it is recommended that
you not change this setting.
Timeout
Specify the time in seconds to wait for a response from the socket before the raising a system exception
flag, which will timeout the connection.
Connection Retries
Select how many connection retries to perform when a connection cannot be established.
Test FTP
You can immediately test and modify or correct your FTP settings by using the built in testing process
available through eFlow. Simply press the Test FTP button and the Test FTP Connection window opens,
revealing the results of your test:
Load Defaults
Reset the FTP settings to the default settings specified in the Application Visual Designer.
135
eFLOW Run Time Modules Manual Chapter 17 Content Delivery - Export
In the Channels Tab, select how many channels will be created to launch export
processes.
Thread Priority
Choose the thread’s level of Priority by selecting a level from the list box. A thread’s
Priority indicates the Windows CPU schedule priority used when scheduling the thread.
Adjust the priority higher or lower as needed. The Default setting is Normal. Settings are
defined below:
136
eFLOW Run Time Modules Manual Chapter 17 Content Delivery - Export
Setting Means
Idle The thread executes only when the system is idle-Windows
won't interrupt other threads to execute a thread with tpIdle
priority.
Note: Be cautious when selecting a high priority level. Setting a high thread
priority for a CPU intensive operation may “starve” the other threads in the
module. Only apply priority boosts to threads that spend most of their time
waiting for external events.
137
eFLOW Run Time Modules Manual Chapter 17 Content Delivery - Export
Export Queue
Data Retrieval
Offline
Data Conversion
If there are no
available objects
Batch stack
Online
Launching
Objects
138
eFLOW Run Time Modules Manual Chapter 17 Content Delivery - Export
Export
Status
Display
Channels
Activity
139
eFLOW Run Time Modules Manual Chapter 17 Content Delivery - Export
Batch error.
140
eFLOW Run Time Modules Manual Chapter 17 Content Delivery - Export
Status:
Batch status messages may be one of the following:
Status Bar
The Status Bar provides information about the status of the Export module itself, including icons that
show the whether the Export module is running an Export process, in idle, or closing:
141
eFLOW Run Time Modules Manual Chapter 17 Content Delivery - Export
During conversion, if there is a problem with the conversion process, an error message will appear and
the Export module will stop processing.
However, if an error occurs during transmission, the reaction of the Export module will depend on
whether or not you have enabled Offline mode in the Options window.
eFlow uses several procedures to avoid data loss, corruption or duplication due to failures during data
conversion or transmission. When errors occur, processing is stopped gracefully or continued in Offline
mode. When the Run command or button is activated, if the problem is resolved, the Export module
converts any file in queue, and exports any files left in the local, temporary, export directory.
How does this work? Say that a failure occurs during the data conversion to a file format. A batch is
removed only after conversion is finished, and all temporary files are created. That is, only after all the
data is converted to file format is a batch removed from the “conversion” queue. If conversion is not
completed, when you next Run the Export process, the batch will still be in queue so will be re-fetched
and converted again. The old temporary storage files, including the failed data conversion, will be
overwritten with the new, converted files. These files will then be exported.
Say that a failure occurs during transmission. Temporary converted files are removed only after
successful transmission of all the files in a batch. So, if some of the files are not successfully
transmitted, they will not be removed, and therefore are still in the queue to be “exported.” When you
next Run the Export process, the batch will still be in queue so will be re-transmitted, and old permanent
files will be overwritten.
Thus, eFlow ensures that full batch data is always stored and available, in the local storage directories
and/or in the TSD or at the remote site.
Offline Mode
If you do not select the Offline mode in the Options window Export Configurations, when there is a
problem exporting over the LAN or FTP that you have selected, the Export module will stop processing.
An error message will appear, and any conversions or export of batches assigned to export to the failed
LAN/FTP will be suspended.
When you select the Offline mode, in the case of a LAN/FTP failure, data is converted but not
transmitted. Note that this applies to both to local exporting (LAN) and to remote exporting (FTP).
You may decide to enable Offline mode when a network connection is down or overloaded. In this case
files that are waiting to be transmitted will still be converted from raw data, and stored temporarily.
When the network connection is re-established, once the Run command is issued, all the batches
converted to files will be transmitted.
Restart Conversion/Export
When you receive an Error batch status, check if the problem is with a CSM or network connection
failure. This information will be recorded in the Debug Logger. Once you have resolved the problem,
and/or the connection is reestablished, execute the Run command.
Click the Run button on the toolbar to start the Export process, or select the File menu > Run
option, or press F5.
Error Handling
142
eFLOW Run Time Modules Manual Chapter 17 Content Delivery - Export
If an error occurs during conversion, or during transmission when the Offline mode is not activated, the
error will be logged to the Debug Logger. An error message will appear in the Export Status Display area,
and the Export module will shut down after handling administrative tasks relating to data storage and
conversion.
Errors during conversion will stop the engine and an error message will appear.
The OnError event is triggered when a conversion or launching error occurs. Other conversion errors are:
GetCollection, GetImages, CreateXML, WriteFile, WriteExceptions.
If an error occurs during transmission when Offline mode is activated, the error will be logged to the
Debug Logger, but the Export module will continue processing.
Errors that may occur during transmission are: Connect, MakeDir, ChangeDir, and CopyFile.
143
eFLOW Run Time Modules Manual
Use the information in this chapter to examine the statistics for the engines, virtual engines
and voting methods, and determine what fine-tuning is necessary. Use statistics results to
locate segmentation and processing problems to improve how engines handle information.
The OCR Analyzer contains a powerful time-saving option called Refresh. Once you change
engine parameters in the Designer, the user may click the Refresh button to reload the
engine parameters without running all the run-time modules again. This considerably
simplifies testing any changes in the engine parameters, making fine-tuning quicker and
simpler.
This chapter deals with the OCR Analyzer module itself, a product used to test the
effectiveness of the OCR engines. The actual calibration of the engines is done in the Visual
Designer.
144
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
145
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
Once the initial Processing OCR Analyzer assessment is made, the user may perform engine tuning using
the Visual Designer. Once changes are made to the engines, the user proceeds through the rest of the steps
to analyze the results of these changes, and then performs any further fine-tuning as necessary.
Data Preparation
Before using this module, batches that we want to test must be sent through the entire
run-time process. This starts with the input process using the Scan Portal or File
Portal, and then continues with data Processing, Completion and storage though to the
Processing, Completion and Export steps.
We need the Processing results in order to evaluate engine processing capabilities, and
the Completion results for comparison purposes. Corrected Completion results are
therefore a necessary precondition to producing any assessments of engine results. The
OCR Analyzer module uses these Completion results for comparison purposes, but only
after the results are exported. The Export process prepares many different types of
information about the data processed and the methods (parameters) used to process the
data. The Export process also converts this information into a format, which is
appropriate for OCR Analyzer analysis. All of this information is used by the OCR Analyzer
to perform comparisons and analyze the quality of data Processing.
Once the OCR Analyzer produces information about the accuracy of data Processing, if
engine results are not sufficient, relevant engines may be retuned and the batches may
be processed again for further evaluation.
There are a number of issues to consider when starting the entire procedure. It is
important to determine what sort of testing you would like to do. You should select
whether to test a particular engine’s capabilities, or the recognition accuracy of an
EFI—that is, form recognition, or a particular set of fields. You may select to test whole
batches, or instead, individual forms. In order to perform effective analysis, consider the
recognition issues, which you wish to examine and tailor your analysis process to the
type of data that you wish to test.
146
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
The entire OCR Analysis data preparation procedure includes the following steps:
Design Phase
1. Using the Visual Designer, specify that the flow that you wish to test has
Flow Export properties set to produce DIF and REG files.
REG Page
DIF image
147
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
During the Processing step, you should choose exactly which data you want to analyze,
and at which design level it will be analyzed. For example, do you wish to test engine
processing of particular fields, which is testing at the field level, or do you wish to test
the processing capabilities of actual forms, which involves testing at the form level.
Data Results
Statistics are prepared by comparing a batch’s state after the Processing phase
with the batch state after the Export phase (following human corrections). The
statistics reflect the success of the processing at two levels:
Character recognition – the success of OCR voting per character.
Field recognition – the success of OCR voting per field.
You may display the results in either Field Recognition mode or Character
Recognition mode. Character Recognition mode is the default setting.
Character Recognition Mode
Total – Number of characters processed for each field.
Recognized – Number of characters correctly identified by OCR
voting.
False Recognition – Automatic OCR voting produced a character that
was incorrect, a false positive; requiring override by the typist.
Not Recognized – Characters that were not identified by OCR voting,
e.g. returned as unrecognized character.
148
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
Menu Bar
Select Flow
Mode
Selector EFI’s available Flow Tree Select Tree
in the flow Viewer display Viewer
mode Buttons
Open
the
Visual
Designer
Refresh
the
Engine
Parame
Status Bar
149
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
Desktop Layout
Using the Menu Bar
Three menus are accessed from the OCR Analyzer Menu bar: Select, View,
and Help.
150
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
151
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
Process Mode
In this mode the user selects the parameters for the testing process. The actual
processing is then performed.
Field Statistics
Testing results are compiled into statistical tables, which are shown in different
views in this mode.
152
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
153
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
Process Mode
In this mode the user selects the parameters for the testing process. The actual processing is performed.
Once you have configured all the parameters, press the Process
tool to start analysis.
154
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
After defining all the testing parameters, press the Process button to activate the Process
Activity dialog box:
155
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
Field
Processing
Information
Character
Processing
Information
Progress
Bar
Status Bar
156
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
In this dialog box, processing is actually activated. Start the processing by clicking the Run
button.
Control Buttons
Control buttons are used to activate and stop processing:
Status Bar
The status bar at the bottom offers information about the following:
Indication of the processing stage (Processing, Dumping, etc.).
Information about the current batch number, current page number, and total number of pages in the
batch.
Information Displays
A run-time summary of the number of fields and characters processed are displayed in the Field
Processing Information and Character Processing Information areas. The following information is
displayed:
Total number of processed fields and characters.
Percentage of recognized fields and characters.
Percentage of unrecognized fields and characters.
Percentage of false positive fields and characters.
157
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
To select this mode on the OCR Analyzer’s desktop, either click the Form View tool on the left side of the
desktop, or from the Select menu choose the Form View option. The Form View desktop appears:
Menu
Bar
Tool
Bar
Log
Viewer
Image
Viewer
Status
Bar
Figure: Form View Mode
158
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
159
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
On the right side of the desktop, the image of the form currently being evaluated is displayed. The field
currently tested is framed in the form image with color-coding identical to that in the log, indicating
which type of recognition is taking place for each field.
160
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
To examine which a field in the image that appears in the log, click that field. Once you
click a field in the log list, this action will highlight the field line, which also highlights a
frame around the same field in the image in a matching color. Any unrecognized characters
are displayed in deep blue. False positive characters are displayed in red.
Similarly, clicking a field in the image will highlight it in the log list. To view the image in
greater detail, use the zoom controls to enlarge or shrink the image.
161
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
Field/
Region
Identifier
Image
Viewer
Field
information
Status
Bar
To navigate to another field in the same form, simply select a different field from the combo box. You
may navigate to the previous/next instance of the same field within the current batch or the following
batch using the blue arrow keys in the tool bar.
Region Selection
The name of the ROI currently being examined is displayed in the number combo box directly to the
right of the Field Identification box. If a field is comprised of multiple regions, you may choose which
region to examine. The region is chosen by its place in the field (1st, 2nd, 3rd, etc.). To choose another
region simply type the appropriate location number in the box, or use the spin-arrows to advance from
region to region, or click on the region. If the field contains only one region, this text box is disabled.
You may select which image information to display by using a right mouse button menu, available by
putting the cursor anywhere within the image and clicking the image. Click the right mouse button to
activate a pop-up menu that contains the image display options. You may select to zoom in on a Field
Frame, Region Frame, Region Number or Character Frame.
162
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
Voting result
Voting results confidence levels (in brackets)
Unrecognized or False Positive characters are marked in blue or red, respectively.
Engine results
Engine results confidence levels (in brackets).
Engine guesses for the selected region
Engine guesses confidence level (in brackets). Up to 3 guesses are displayed.
You may navigate through cells in the table using the scroll bar.
If you select a cell in the table that pertains to a different character than the one highlighted in the image,
the segmentation rectangle changes in the image accordingly.
You may zoom in, zoom out, or use the Show Advanced Viewing button to obtain a more
detailed view of the image. To see the REG version of the image, click the View REG button.
An image can be viewed in the Advanced View screen, allowing advanced actions such as
rotating and saving part of the image to a file. The toolbar features tools widely available in
the Integra system.
163
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
To select this mode on the OCR Analyzer’s desktop, either click the Field Statistics tool on the left side of
the desktop, or from the Select menu choose the Field Statistics option. The Field Statistics desktop appears:
Menu Bar
Select
required
results view
Select
required
tab
Log
Viewer
Status
Bar
Figure: Field Statistics Mode
The Tools
The View Field Results button opens the View Field Results
window, which allows you to see more information about the
selected field.
164
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
The Last Statistic Result tab allows you to view how many false positive, unrecognized and recognized
results were found for each field.
View Results By
Use the View Results By list-box to select which way you would like to view the statistics:
Percentage and Numbers
Numbers (only)
Percentage (only)
Sort By
Use the Sort By list-box to select how to view the statistics:
Numbers
Percentage
The statistics table can be sorted by any column, using numbers or percentage.
The View Field Results button opens the field window, when the cursor is pointed at a cell within a
field row. Or, simply double clicking on any of the cells opens the field-window of the relevant field.
This opens the View Field Results window, whose functionality has been explained previously in this
chapter.
Clicking the Additional Statistical Details button will activate the Additional Statistical Details window
containing additional statistical details of the selected cell:
165
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
This window allows you to see information regarding each field including:
Percent of recognized characters
Percent of unrecognized characters
False positives
Empty results (no OCR results)
Empty export results
Number of unrecognized characters (asterisks) found per field
The total number of updated fields
The total number of fields.
This window allows you to see information regarding each character including:
Percent of recognized characters
Percent of unrecognized characters
False positives
Number of characters that the OCR recognition engine added
Number of characters that the OCR recognition engine deleted
Empty results (no OCR results)
Empty export results
The total number of characters.
Menu Bar
Select
required
Select
required
tab
Statistics
Viewer
Status
B
Figure: Comparison To Previous Results Tab
The statistics table display current and previous results listed by false positives, unrecognized characters,
and recognized characters.
If there is a difference between the last result and the current, the improvement in the results is
represented by a green arrow pointing down.
A worse result is represented by a red arrow pointing up.
166
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
The Match Code tab offers details regarding match code statistics Match codes are internal codes used
to determine the identification method that is used to recognize and EFI. A view of the Match Code tab
is shown in the figure below:
167
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
168
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
The fields that have undergone processing are listed in alphabetical order in the top area.
Following this is information regarding the EFI and EFI processing.
EFI Name lists the name of the EFI processed underneath it.
Total Time shows the processing time for all the fields.
Num of EFIs lists the total number of EFI’s processed in this processing run.
Avg. Process Time is the total processing time for all the fields divided by the number of EFIs
processed (Total Time/Num of EFIs).
Licensed Engines
To view engine licensing information, from the View Menu select the Licensed Engines option. A
window will appear listing the engine licenses currently resident in the Integra system:
169
eFLOW Run Time Modules Manual Chapter 18 OCR Analyzer
OCR Engine changes are listed beneath Virtual engine changes. In the figure above, a change has been
made to the Keep Enhancement property of the OCR Engine named OCRBCOD.
170
eFLOW Run Time Modules Manual
Messages are transferred to logs that can be displayed in the log-window and/or saved to a log-file. The
Debug Logger window’s main menu bar allows the user to configure the log displays, the filters, and the
logging method (window or file).
The main menu bar drop down menus allows you to perform the following actions:
File Menu
The FIle menu allows you to manipulate the information shown in the debug logger window.
Messages Menu
The Display menu allows you to manipulate the information shown in the debug logger window.
171
eFLOW Run Time Modules Manual Chapter 19 Debug Logger
Set max lines Set the maximum number of lines that the
debug logger window buffer can hold.
172
eFLOW Run Time Modules Manual Chapter 19 Debug Logger
Use the Filter Text option to select which debug messages to show (Include) or hide (Exclude).
Click on a column title in order to sort the messages according to this column.
Click on the drop down list of a column in order to filter the messages according to its values.
Drag one of the column titles to the designated area in order to group the messages by this column.
173
eFLOW Run Time Modules Manual
Note: Never modify any parameter using this utility unless you are 100%
sure of what you are doing (or if you are instructed to do so).
Basic Configuration
Launches the Basic Configuration application which allows the user to configure
eflow platform main definitions (paths, communication settings etc.).
eFLOW Folder
The main folder under which all "fixed" eFLOW data is installed (BIN, Help,
OCRs etc.)
OCR Viewer
Lannches an application that displays a PRD file.
174
eFLOW Run Time Modules Manual
Index
main tasks.................................................10
A
examine ROI’s in batches .............................57
activate Portal module .................................. 59 Exceptions module..........................................5
application ...................................................... 8 Export
Application Visual Designer .......................... 3 main tasks...............................................125
applications management ............................. 14
F
ASCII output .............................................. 125
File Portal
B
Image gathering........................................61
Batch File Portal module...........................................4
adjusting the view .................................... 57 form
batch separators .............................................. 7 capturing.....................................................3
batch status icons.......................................... 35 storing ........................................................3
Batch Viewer................................................... 57 FormID
Examine ROIs.......................................... 57 invoke FormID module ...................... 88 ,72
Open Image ............................................... 55 forms
Print .......................................................... 55 batch ...........................................................8
Printer Setup .............................................. 55 organization of ...........................................8
separator .....................................................8
C
I
calculates statistical data................................. 5
color-code flows ........................................... 35 ICR..................................................................6
Completion interpret form entries.......................................8
Always ................................................... 111
M
data completion process......................... 109
invoking module .................................... 110 Module Activator components......................33
move back a field................................... 119 monitor collections........................................35
Never ..................................................... 111
O
Reject..................................................... 111
skip to next error.................................... 118 OCR ................................................................6
unrecognized data .................................. 109 OCR Analyzer design phase .......................147
Completion module ................................ 109 ,5 OCR Analyzer fine tuning ..........................144
Content Delivery module ....................... 125 ,5 OCR Analyzer menus.....................................150
Controller ....................................................... 4 OCR Analyzer module ....................................5
Delete All Batches .................................. 37 OCR Analyzer process................................145
Delete Batch...................................... 39 ,37 OCR Analyzer steps....................................146
Flow Filter .............................................. 39 OCR engines ...................................................8
Hold Batch.................................. 45 ,39 ,37 OCR Visual Designer .....................................3
Release Batch ....................... 53 ,45 ,39 ,37 OMR ...............................................................6
View Batch........................................ 39 ,37
convert captured data.................................. 125 P
create user groups ................................... 27 ,23 previously scanned forms..............................59
Process
D
default configuration .................... 90 ,74 ,70
data preparation .......................................... 145 FormID stages .................................... 87 ,71
data processing ........................................... 145 invoke processing........... 103 ,92 ,76 ,67 ,62
data query ................................................... 145 ManualFormID stages ................ 102 ,91 ,75
data results.................................................. 145 Recognition stages ...................................66
Design StandBy.................................. 90 ,74 ,70 ,65
basic objects and concepts ......................... 7 Process module...............................................4
digital images ............................................... 35
S
E
sample applications.........................................8
EFI.................................................................. 7 scan forms .....................................................59
Empty Form Image...............................See EFI Scan Portal module .........................................4
engine tuning .............................................. 145 scanner
Enterprise Manager.................................. 10 ,6 ISIS compatible........................................59
175
eFLOW Run Time Modules Manual
176