Abstract
In this paper, we propose an information extraction method that restores handwritten character information from prescribed, skewed form documents. The proposed method includes the following aspects: detection of the boundary and successive internal lines, accurate skew angle measurement, line removal, and restoration of broken characters using a morphological analysis model of crossing shapes. Using the proposed method, more than 95% of the horizontal and vertical crossing lines are correctly restored.
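The abstract does not give the algorithms themselves, so the following is only a minimal sketch of two of the listed steps under simplifying assumptions, not the authors' method. It assumes a binary image stored as a list of rows of 0/1 values and a set of pixel coordinates already known to lie on one nominally horizontal form line. The function names `skew_angle_degrees` and `remove_long_horizontal_runs` and the `min_run` threshold are hypothetical, and a least-squares fit and run-length filtering stand in for whatever measurement and removal techniques the paper actually uses.

```python
import math


def skew_angle_degrees(points):
    """Estimate skew from (x, y) pixels on one nominally horizontal line.

    Assumption: a least-squares line fit is used here for illustration;
    the angle of the fitted slope is returned in degrees.
    """
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    return math.degrees(math.atan2(sxy, sxx))


def remove_long_horizontal_runs(image, min_run):
    """Clear every horizontal run of 1-pixels that is at least min_run long.

    Assumption: the image is already deskewed, so form lines appear as long
    runs within a single row. Short runs (character strokes) are kept.
    """
    out = [row[:] for row in image]
    for r, row in enumerate(image):
        start = None
        # A trailing 0 acts as a sentinel so a run touching the edge is closed.
        for c, value in enumerate(row + [0]):
            if value and start is None:
                start = c
            elif not value and start is not None:
                if c - start >= min_run:
                    for k in range(start, c):
                        out[r][k] = 0
                start = None
    return out
```

Note that this naive removal also deletes the pixels where a character stroke crosses the line, leaving the stroke broken. That gap is what the restoration step named in the abstract is meant to repair, and it is not attempted in this sketch.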
© 1998 Springer-Verlag Berlin Heidelberg
Yoo, JY., Kim, MK., Yong Han, S., Kwon, YB. (1998). Information extraction from a skewed form document in the presence of crossing characters. In: Tombre, K., Chhabra, A.K. (eds) Graphics Recognition Algorithms and Systems. GREC 1997. Lecture Notes in Computer Science, vol 1389. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-64381-8_45
DOI: https://doi.org/10.1007/3-540-64381-8_45
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64381-4
Online ISBN: 978-3-540-69766-4
eBook Packages: Springer Book Archive