Abstract
Handwriting recognition aims to predict a sequence of characters from an image of handwritten text. The main approaches rely on learning statistical models such as Hidden Markov Models or Conditional Random Fields. The quality of these models is measured by character and word error rates, yet they are usually not trained to optimize these criteria. We propose an efficient method for learning Hidden Conditional Random Fields that optimizes the error rate within the large-margin framework.
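The character and word error rates mentioned above are commonly computed from the edit (Levenshtein) distance between the reference transcription and the recognizer output, normalized by the reference length. The following is a minimal sketch under that assumption; the function names and the unit insertion, deletion, and substitution costs are illustrative choices and are not taken from the paper.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences, with unit costs (assumed)."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to the empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete r from the reference
                curr[j - 1] + 1,     # insert h into the reference
                prev[j - 1] + cost,  # substitute r by h, or match
            ))
        prev = curr
    return prev[-1]


def error_rate(ref, hyp):
    """Edit distance normalized by the reference length."""
    if len(ref) == 0:
        raise ValueError("reference sequence must not be empty")
    return edit_distance(ref, hyp) / len(ref)


def character_error_rate(ref_text, hyp_text):
    """Error rate over characters."""
    return error_rate(list(ref_text), list(hyp_text))


def word_error_rate(ref_text, hyp_text):
    """Error rate over whitespace-separated words."""
    return error_rate(ref_text.split(), hyp_text.split())
```

Because the distance counts insertions as well as deletions and substitutions, the resulting rate can exceed 1 when the hypothesis is much longer than the reference.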
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Vinel, A., Artières, T. (2013). Maximizing Edit Distance Accuracy with Hidden Conditional Random Fields. In: Wilson, R., Hancock, E., Bors, A., Smith, W. (eds) Computer Analysis of Images and Patterns. CAIP 2013. Lecture Notes in Computer Science, vol 8047. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40261-6_5
DOI: https://doi.org/10.1007/978-3-642-40261-6_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40260-9
Online ISBN: 978-3-642-40261-6
eBook Packages: Computer Science, Computer Science (R0)