Can Judges Trust the I-Vectors Scores?: A Comparative Study of Voices Comparison in the Forensic Domain

  • Conference paper
Advances in Computing Systems and Applications (CSA 2020)

Abstract

Phonetic forensics is now widely used; in some cases, a voice recording may be the only potential evidence available to an investigation. To evaluate the performance of Forensic Voice Comparison (FVC), we study two factors that limit the robustness of the verification task: the type of transmission channel (telephone, microphone, etc.) and the physiological differences between speakers’ voices. Our work consists of adapting “ALIZE”, an open-source platform for Automatic Speaker Recognition (ASR), for use in the forensic domain, so that the voice can be evaluated and presented as an exhibit. To this end, we study the GMM-UBM and i-vector models and compare their performance.

Author information

Corresponding author

Correspondence to Kawthar Yasmine Zergat.

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper

Zergat, K.Y., Kahil, Y., Amrouche, A. (2021). Can Judges Trust the I-Vectors Scores?: A Comparative Study of Voices Comparison in the Forensic Domain. In: Senouci, M.R., Boudaren, M.E.Y., Sebbak, F., Mataoui, M. (eds) Advances in Computing Systems and Applications. CSA 2020. Lecture Notes in Networks and Systems, vol 199. Springer, Cham. https://doi.org/10.1007/978-3-030-69418-0_6
