You are on page 1of 3

--------------------------------------------------------------------Outline of the software

--------------------------------------------------------------------InftyReader Version 3.1


(C) Copyright 2000-2016:
Masakazu Suzuki (Kyushu Universirty),
InftyProject (http://www.inftyproject.org/),
Science Accessibility Net (http://www.sciaccess.net/).
--------------------------------------------------------------------1. About the package
This is a full setup package of InftyReader Version 3.1.
If you execute the installer
- InftyReaderE31x_Setup.exe,
then InftyReader will be installed to your PC.
If you want to use InftyReader on command prompt window, please use Infty.exe
refering to the file InftyHelpE.txt included in the package.
2. Outline of the Software
InftyReader is a software application to recognize scientific documents includin
g
mathematical expressions, developed in Masakazu Suzuki Laboratory,
Graduate school of Mathematics, Kyushu University in collaboration
with several cooperation partners.
The software recognizes only images carefully scanned in either 600 DPI
or 400 DPI. Image files have to be prepared in either TIF, PNG or GIF format.
InftyReader outputs the recognition results in various format:
IML, LaTeX, HR-TeX, XHTML(MathML) and MS Word document(XML).
IML is a XML file format related to InftyEditor, an editor of mathematical
documents developed in InftyProject and released from Science Accessibility
Net.
Using InftyEditor, user can correct and edit the recognition results
of InftyReader comparing the results with original images, and convert
the results into various formats mentioned above.
HR-TeX is a simplified LaTeX-like notation easier "to read" specially
designed for the blinds.
In XHTML output, mathematical expressions are output using MathML notation.
Here are some features of InftyReader:
1. It uses the OCR engines of Toshiba Corporation, "ExpressReaderPro",
and of MediaDrive Corporation, "WinReader", simultaneously to recognize
characters in ordinary text areas. (As for the characters and math
symbols in formulae, it uses Infty's OCR).
2. It can recognize tables including math expressions in the cells.

3. It can convert PDF files into LaTeX or XHTML(MathML) including


mathematical expressions.
It recognizes the page images of PDF files refering the text information
imbedded in PDF.
3. Caution ---- Important!
1. The source documents have to be clearly printed.
2. It should be scanned in 600dpi or 400dpi.
The scanning as black and white binary image in 600DPI is <<recomended>>.
3. InftyReader erases small noises, segments page images into picture areas,
table areas and text areas automatically, and then recognizes text and
table areas including mathematical expressions.
However, to get better recognition results, users are <<recommended>>
to erase noises and pictures before the recognition.
4. In binary scanning,
it is important to adjust the binarization threshold of the scanner so
that the number of the touched or broken characters is less than 1% of
the total number of the characters in each scanned page image.
4. Operating Environment
InftyReader runs on Windows XP, Windows Vista, Windows 7 and Windows 8, if the
PC is equipped with enough memory.
Note that it does not run on Windows 98, 2000 nor Me.
5. How to use InftyReader?
1. Select input file(s) or folder.
2. Input/select output docuent name
3. Press the "Start" button.
Then, the recognition results of the selected image files are saved in to
the file you specified by the "output docuent name". When, you select a
folder instead of files, all the image files in the folder of the specified
file type (TIF/GIF/PNG/BMP/PDF) are recognized and the results are output
into the files having the name(s) of the folders.
If you set check to the "Search Sub Folders" item under the "Option" menu,
InftyReader recognizes all the image files in the sub folders of the selected
folder. For example, if you select the folder "foldertop" having the subfolder
structure below,
foldertop
|-- subfolder1
|
|-- a.tif
|
|-- b.tif
|
|-- subfolder2
|-- c.tif
|-- d.tif
and if you select the file type "IML" for the output file type, then, you will
get the files "subfolder1.iml", "subfolder2.iml" in the folder "foldertop".
The recognition results of a.tif and b.tif (resp. c.tif and d.tif) are saved
in the file subfolder1.iml (resp. subfolder2.iml).

If you select LaTeX as output file type, you will get "subfolder1.tex",
"subfolder2.tex", and it is similar for other file types HR-TeX and XHTML.
6. License
To use InftyReader, you need to purchase a License Key and register it
to your PC. To purchase a license key, please go to the website
http://www.sciaccess.net/en/InftyReader/, or to the website
http://inftyreader.org of IDEAL group, Inc.
The license is basically limited to personal use, including the case
purchased by an institution for a specified user. Shared use by a small
group members in a section of an organization is also allowed.
In the default setting, the number of the pages recognizable by this
license is limited to 10000 pages per monthe. In case an institution
uses the software to service a number of clients or to digitize huge
numbers of volumes, please purchase an enterprise license package.
As for the details about enterprise license, please contact us:
Science Accessibility Net (office@mail.sciaccess.net).
InftyReader is usable under the following agreement.
(1) You may not modify the software in any manner. You may not reverse
engineer, decompile or disassemble the software.
(2) You may not sell the software without making a formal agreement
with Science Accessibility Net. You may distribute the software only
free of charge, without modifying the zip-package of the software.
(3) The author(s) and Science Accessibility Net shall have no obligation
to correct errors or inconveniences of the software.
(4) The author(s) and Science Accessibility Net shall not be responsible
for any lost and damage caused by the use of the software.
7. Report
Any report about the software will be welcome.
-------------------------------------Non Profit Organization
Science Accessibility Net (sAccessNet)
e-mail: support@mail.sciaccess.net
URL: http://www.sciaccess.net/
--------------------------------------