Speaker Verification Using Adapted User-Dependent Multilevel Fusion

Julian Fierrez-Aguilar²⁰,
Daniel Garcia-Romero²⁰,
Javier Ortega-Garcia²⁰ &
…
Joaquin Gonzalez-Rodriguez²⁰

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3541))

Included in the following conference series:

International Workshop on Multiple Classifier Systems

1932 Accesses
3 Citations
3 Altmetric

Abstract

In this paper we study the application of user-dependent score fusion to multilevel speaker recognition. After reviewing related works in multimodal biometric authentication, a new score fusion technique is described. The method is based on a form of Bayesian adaptation to derive the personalized fusion functions from prior user-independent data. Experimental results are reported using the MIT Lincoln Laboratory’s multilevel speaker verification system. It is experimentally shown that the proposed adapted fusion method outperforms both user independent and non-adapted user-dependent fusion approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Improving Performance of Speaker Identification Systems Using Score Level Fusion of Two Modes of Operation

Unifying Probabilistic Linear Discriminant Analysis Variants in Biometric Authentication

Improving Robustness of Speaker Verification by Fusion of Prompted Text-Dependent and Text-Independent Operation Modalities

References

Reynolds, D.A., Quatieri, T.F., Dunn, R.B.: Speaker verification using adapted Gaussian mixture models. Digital Signal Processing 10, 19–41 (2000)
Article Google Scholar
Campbell, W.M.: A SVM/HMM system for speaker recognition. In: Proc. ICASSP, pp. 209–302 (2003)
Google Scholar
Campbell, W.M., Reynolds, D.A., Campbell, J.: Fusing discriminative and generative methods for speaker recognition: Experiments on Switchboard and NFI/TNO field data. In: Proc. ODYSSEY, pp. 41–44 (2004)
Google Scholar
Reynolds, D.A., et al.: The SuperSID project: Exploiting high-level information for high-accuracy speaker recognition. In: Proc. ICASSP, pp. 784–787 (2003)
Google Scholar
Reynolds, D.A., et al.: The 2004 MIT Lincoln Laboratory Speaker Recognition System. In: Proc. ICASSP (2005) (to appear)
Google Scholar
NIST SRE Web, http://www.nist.gov/speech/tests/spk/2004/index.htm
Doddington, G., et al.: Sheeps, goats, lambs and wolves: A statistical analysis of speaker performance in the NIST 1998 SRE. In: Proc. ICSLP (1998)
Google Scholar
Bigun, E.S., Bigun, J., et al.: Expert conciliation for multi modal person authentication systems by Bayesian statistics. In: Bigün, J., Borgefors, G., Chollet, G. (eds.) AVBPA 1997. LNCS, vol. 1206, pp. 291–300. Springer, Heidelberg (1997)
Chapter Google Scholar
Jain, A.K., Ross, A.: Learning user-specific parameters in a multibiometric system. In: Proc. ICIP, pp. 57–60 (2002)
Google Scholar
Fierrez-Aguilar, J., et al.: A comparative evaluation of fusion strategies for multimodal biometric verification. In: Kittler, J., Nixon, M.S. (eds.) AVBPA 2003. LNCS, vol. 2688, pp. 830–837. Springer, Heidelberg (2003)
Chapter Google Scholar
Fierrez-Aguilar, J., et al.: Exploiting general knowledge in user-dependent fusion strategies for multimodal biometric verification. In: Proc. ICASSP, pp. 617–620 (2004)
Google Scholar
Toh, K.A., Jiang, X., Yau, W.Y.: Exploiting local and global decisions for multimodal biometrics verification. IEEE Trans. on SP 52, 3059–3072 (2004)
Article Google Scholar
Fierrez-Aguilar, J., et al.: Bayesian adaptation for user-dependent multimodal biometric authentication. Pattern Recognition (2005) (to appear)
Google Scholar
Kumar, A., Zhang, D.: Integrating palmprint with face for user authentication. In: Proc. MMUA (2003), available at http://mmua.cs.ucsb.edu/
Snelick, R., et al.: Large scale evaluation of multimodal biometric authentication using state-of-the-art systems. IEEE Trans. PAMI 27, 450–455 (2005)
Google Scholar
Poh, N., Bengio, S.: An Investigation of F-ratio client-dependent normalisation on biometric authentication tasks. In: Proc. ICASSP (2005) (to appear)
Google Scholar
Lee, C.H., Huo, Q.: On adaptive decision rules and decision parameter adaptation for automatic speech recognition. Proc. IEEE, 88, 1241–1269 (2000)
Google Scholar
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification. Wiley, Chichester (2001)
MATH Google Scholar
Gauvain, J.L., Lee, C.H.: Maximum a Posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Trans. on SAP 2, 291–298 (1994)
Google Scholar
Reynolds, D.A.: Channel robust speaker verification via feature mapping. In: Proc. ICASSP, pp. 53–56 (2003)
Google Scholar
Auckenthaler, R., et al.: Score normalization for text-independent speaker verification systems. Digital Signal Processing 10, 42–54 (2000)
Article Google Scholar
Doddington, G.: Speaker recognition based on idiolectal differences between speakers. In: Proc. EUROSPEECH, pp. 2521–2524 (2001)
Google Scholar
Adami, A., Mihaescu, R., Reynolds, D.A., Godfrey, J.: Modeling prosodic dynamics for speaker recognition. In: Proc. ICASSP, pp. 788–791 (2003)
Google Scholar
Adami, A.G.: Modeling prosodic differences for speaker and language recognition. PhD thesis, OGI (2004)
Google Scholar
Martin, A., Doddington, G., et al.: The DET curve in assessment of decision task performance. In: Proc. EUROSPEECH 1997, pp. 1895–1898 (1997)
Google Scholar
Jain, A.K., Duin, R.P.W., Mao, J.: Statistical pattern recognition: A review. IEEE Trans. on PAMI 22, 4–37 (2000)
Google Scholar
Fierrez-Aguilar, J., Ortega-Garcia, J., Gonzalez-Rodriguez, J.: Target dependent score normalization techniques and their application to signature verification. IEEE Trans. on SMC-C 35 (2005) (to appear)
Google Scholar

Download references

Author information

Authors and Affiliations

Biometrics Research Lab./ATVS, Escuela Politecnica Superior, Universidad Autonoma de Madrid, Campus de Cantoblanco, C/ Francisco Tomas y Valiente 11, 28049, Madrid, Spain
Julian Fierrez-Aguilar, Daniel Garcia-Romero, Javier Ortega-Garcia & Joaquin Gonzalez-Rodriguez

Authors

Julian Fierrez-Aguilar
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Garcia-Romero
View author publications
You can also search for this author in PubMed Google Scholar
Javier Ortega-Garcia
View author publications
You can also search for this author in PubMed Google Scholar
Joaquin Gonzalez-Rodriguez
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

NASA Ames Research Center, Mail Stop 269-1, 94035-1000, Moffett Field, CA, USA
Nikunj C. Oza
Signal Processing and Pattern Recognition Laboratory, Electrical and Computer Engineering, Rowan University, 08028, Glassboro, NJ, USA
Robi Polikar
Centre for Vision, Speech and Signal Processing, University of Surrey, GU2 7XH, Guildford, UK
Josef Kittler
University of Cagliari, Department of Electrical and Electronic Engineering, Piazza d’Armi, 09123, Cagliari, Italy
Fabio Roli

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Fierrez-Aguilar, J., Garcia-Romero, D., Ortega-Garcia, J., Gonzalez-Rodriguez, J. (2005). Speaker Verification Using Adapted User-Dependent Multilevel Fusion. In: Oza, N.C., Polikar, R., Kittler, J., Roli, F. (eds) Multiple Classifier Systems. MCS 2005. Lecture Notes in Computer Science, vol 3541. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11494683_36

Download citation

DOI: https://doi.org/10.1007/11494683_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26306-7
Online ISBN: 978-3-540-31578-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Speaker Verification Using Adapted User-Dependent Multilevel Fusion

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Improving Performance of Speaker Identification Systems Using Score Level Fusion of Two Modes of Operation

Unifying Probabilistic Linear Discriminant Analysis Variants in Biometric Authentication

Improving Robustness of Speaker Verification by Fusion of Prompted Text-Dependent and Text-Independent Operation Modalities

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Speaker Verification Using Adapted User-Dependent Multilevel Fusion

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Improving Performance of Speaker Identification Systems Using Score Level Fusion of Two Modes of Operation

Unifying Probabilistic Linear Discriminant Analysis Variants in Biometric Authentication

Improving Robustness of Speaker Verification by Fusion of Prompted Text-Dependent and Text-Independent Operation Modalities

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation