Abstract
The goal of this work is the fast extraction of relevant information from document images. Examples of interesting information are the type of document (e.g. form, report, letter), the title of an article or the sender of a business letter, and a logo or figure on a page. The basic idea is to use non-textual cues from the document image before any OCR/ICR or word recognition is performed. The approach is based on a compact runlength representation of the binary image and allows a document type classification by white space analysis in a time comparable with the input of the compressed image. Graphics related information extraction needs approximately the same time.
Chapter PDF
References
D. S. Bloomberg, Textured Reductions for Document Image Analysis, Proc. SPIE, Vol. 2660, 1996, pp. 160–174.
Gerd Maderlechner, Symbolic Subtraction of Fixed Formatted Graphics and Text from Filled in Forms, Proc. IAPR Workshop on Machine Vision and Applications, Tokyo, Nov. 1990, pp. 457–459.
M. Ozaki, Logical Tagging of Document Images by White Space Pattern Matching, in Shape, Structure and Pattern Recognition by D. Dori and A. Bruckstein (editors), World Scientific, Singapore, 1995, pp. 350–359.
T. Pavlidis and J. Zhou, Page Segmentation and Classification, CVGIP: Graphical Models and Image Processing, Vol. 54, No. 6, 1992, pp. 484–496.
D. Rus and K. Summers, Using Non-Textual Cues for Electronic Document Browsing, in Digital Libraries: Current Issues by N. R. Adam, K. Bhargava, and Y. Yesha (editors), Lecture Notes in Computer Science, Springer Verlag Berlin, New York 1995, pp. 129–162.
P. Suda, C. Bridoux, B. Kämmerer, G. Maderlechner, Logo and word matching using a general approach to signal registration, Proc. ICDAR'97, pp. 61–65.
G. Maderlechner, T. Brückner, and P. Suda, Classification of documents by form and content, Pattern Recognition Letters, Vol. 18, No. 11-13, 1997, pp. 1225–1231.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Maderlechner, G., Suda, P. (1998). Information extraction from document images using white space and graphics analysis. In: Amin, A., Dori, D., Pudil, P., Freeman, H. (eds) Advances in Pattern Recognition. SSPR /SPR 1998. Lecture Notes in Computer Science, vol 1451. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0033268
Download citation
DOI: https://doi.org/10.1007/BFb0033268
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64858-1
Online ISBN: 978-3-540-68526-5
eBook Packages: Springer Book Archive