Beruflich Dokumente
Kultur Dokumente
Images
Olivia Grubert
Linyi Gao
Email: ogrubert@stanford.edu.
Email: linyigao@stanford.edu.
1
table. However, since our bag of expected words is quite limited, we are interested in
comparing these results with our own template matching or multi-class SVM classication
algorithms. We believe we may be able to perform better word matching on our line items
than the direct output of an OCR system.
2.1 Preliminary OCR Results
To test the feasibility of using OCR, we took several images of a nutrition facts label on
a mobile phone and passed these images directly to Abbyy Finereader 11. These images
had various resolutions and levels of perspective distortion. For the highest-resolution image
without distortion, OCR was able to correctly identify the table layout and recognize the
text without any preprocessing. However, as perspective distortion was increased, the ability
of OCR to detect the table layout deteriorated. Likewise, character recognition accuracy
decreased with image resolution. These preliminary results indicate that images taken from
a mobile phone will require signicant preprocessing to accurately convert information from
a nutrition facts image into an electronic database.
References
[1] W. Bieniecki, S. Grabowski, and W. Rosenberg, Image Preprocessing for Improving
OCR Accuracy, in Proceedings of the International Conference on Perspective Tech-
nologies in MEMS Design, 2007, pp. 75-80.
[2] R. Smith, An Overview of the Tesseract OCR Engine, in Proceedings of the Ninth
International Conference on Document Analysis and Recognition, vol. 2, 2007, pp. 629-
633.
[3] H. Jiang, C. Han, K. Fan, A Fast Approach to Detect and Correct Skew Documents,
in Proceedings of the 13th International Conference on Pattern Recognition, vol. 3,
1996, pp. 742-746.
2