OCRdroid: A Framework to Digitize Text Using Mobile Phones

Mi Zhang, Anand Joshi, Ritesh Kadmawala, Karthik Dantu, Sameera Poduri & Gaurav S. Sukhatme

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering (LNICST, volume 35)

Included in the following conference series:

International Conference on Mobile Computing, Applications, and Services

Abstract

As demand grows for mobile phone applications, research in optical character recognition (OCR), a technology well developed for scanned documents, is shifting focus to the recognition of text embedded in digital photographs. In this paper, we present OCRdroid, a generic framework for developing OCR-based applications on mobile phones. OCRdroid combines a lightweight image preprocessing suite installed on the mobile phone with an OCR engine connected to a backend server. We demonstrate the power and functionality of this framework by implementing two applications, PocketPal and PocketReader, based on OCRdroid on the HTC Android G1 mobile phone. Initial evaluations of these pilot experiments demonstrate the potential of using the OCRdroid framework for real-world OCR-based mobile applications.

This work was supported in part by NSF grant CCR-0120778 (CENS: Center for Embedded Networked Sensing), and by a gift from the Okawa Foundation.

Author information

Authors and Affiliations

Computer Science Department, USA
Anand Joshi, Ritesh Kadmawala, Karthik Dantu, Sameera Poduri & Gaurav S. Sukhatme
Electrical Engineering Department, University of Southern California, Los Angeles, CA, 90089, USA
Mi Zhang & Gaurav S. Sukhatme

Editor information

Editors and Affiliations

Microsoft Corporation, One Microsoft Way, 98052, Redmond, WA, USA
Thomas Phan
DEIS - University of Bologna, Via Risorgimento, 2, 40137, Bologna, Italy
Rebecca Montanari
IBM T.J. Watson Research Center, 19 Skyline Drive, 10532, Hawthorne, NY, USA
Petros Zerfos

Cite this paper

Zhang, M., Joshi, A., Kadmawala, R., Dantu, K., Poduri, S., Sukhatme, G.S. (2010). OCRdroid: A Framework to Digitize Text Using Mobile Phones. In: Phan, T., Montanari, R., Zerfos, P. (eds) Mobile Computing, Applications, and Services. MobiCASE 2009. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 35. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12607-9_18

DOI: https://doi.org/10.1007/978-3-642-12607-9_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12606-2
Online ISBN: 978-3-642-12607-9
eBook Packages: Computer Science, Computer Science (R0)
