Can Judges Trust the I-Vectors Scores?: A Comparative Study of Voices Comparison in the Forensic Domain

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 199))

Included in the following conference series:

International Conference on Computing Systems and Applications

Abstract

Phonetic forensics is now widely used; in some cases, the voice may be the only potential evidence available to an investigation. To evaluate the performance of Forensic Voice Comparison (FVC), we study two factors that limit the robustness of the verification task: the type of transmission channel (telephone, microphone, etc.) and the physiological differences between the speakers’ voices. Our work consists of adapting “ALIZE”, an open-source platform for Automatic Speaker Recognition (ASR), for use in the forensic domain, where the voice is estimated and represented as an exhibit. To this end, we compare the performance of two models, GMM-UBM and i-vectors.

Author information

Authors and Affiliations

USTHB, FEI, LCPTS, Speech Com. and Signal Proc. Lab, P.O. Box 32, El Alia, 16111, Bab Ezzouar, Algeria
Kawthar Yasmine Zergat, Yazid Kahil & Abderrahmane Amrouche

Corresponding author

Correspondence to Kawthar Yasmine Zergat.

Editor information

Editors and Affiliations

Ecole Militaire Polytechnique, Algiers, Algeria
Mustapha Reda Senouci
Ecole Militaire Polytechnique, Algiers, Algeria
Mohamed El Yazid Boudaren
Ecole Militaire Polytechnique, Algiers, Algeria
Faouzi Sebbak
Ecole Militaire Polytechnique, Algiers, Algeria
M'hamed Mataoui

About this paper

Cite this paper

Zergat, K.Y., Kahil, Y., Amrouche, A. (2021). Can Judges Trust the I-Vectors Scores?: A Comparative Study of Voices Comparison in the Forensic Domain. In: Senouci, M.R., Boudaren, M.E.Y., Sebbak, F., Mataoui, M. (eds) Advances in Computing Systems and Applications. CSA 2020. Lecture Notes in Networks and Systems, vol 199. Springer, Cham. https://doi.org/10.1007/978-3-030-69418-0_6

DOI: https://doi.org/10.1007/978-3-030-69418-0_6
Published: 21 February 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-69417-3
Online ISBN: 978-3-030-69418-0
eBook Packages: Intelligent Technologies and Robotics (R0)
