6th IberSPEECH 2022: Granada, Spain
- Antonio M. Peinado, Ángel M. Gómez, José L. Pérez-Córdoba, Jose A. Gonzalez-Lopez:
6th International Conference, IberSPEECH 2022, Granada, Spain, 14-16 November 2022, Proceedings. ISCA 2022
Speech Synthesis
- Marek Strelec, Jonas Rohnke, Antonio Bonafonte, Mateusz Lajszczak, Trevor Wood:
Discrete Acoustic Space for an Efficient Sampling in Neural Text-To-Speech. 1-5
- Marc Arnela, Leonardo Pereira-Vivas, Jorge Egea:
An animated realistic head with vocal tract for the finite element simulation of vowel /a/. 6-10
- Ander González-Docasal, Aitor Álvarez, Haritz Arzelus:
Exploring the limits of neural voice cloning: A case study on two well-known personalities. 11-15
- Marc Freixes, Joan Claudi Socoró, Francesc Alías:
Analysis of iterative adaptive and quasi closed phase inverse filtering techniques on OPENGLOT synthetic vowels. 16-20
Automatic Speech Recognition
- José Manuel Ramírez Sánchez, Laura Docío Fernández, Carmen García-Mateo:
Galician's Language Technologies in the Digital Age. 21-25
- Alejandro Gomez-Alanis, Lukas Drude, Andreas Schwarz, Rupak Vignesh Swaminathan, Simon Wiesler:
Contextual-Utterance Training for Automatic Speech Recognition. 26-30
- Eder del Blanco, Inge Salomons, Eva Navas, Inma Hernáez:
Phone classification using electromyographic signals. 31-35
- Mikel Peñagarikano, Amparo Varona, Germán Bordel, Luis Javier Rodríguez-Fuentes:
Semisupervised training of a fully bilingual ASR system for Basque and Spanish. 36-40
- David Gimeno-Gómez, Carlos David Martínez-Hinarejos:
Speaker-Adapted End-to-End Visual Speech Recognition for Continuous Spanish. 41-45
- Fernando López, Jordi Luque:
Iterative pseudo-forced alignment by acoustic CTC loss for self-supervised ASR domain adaptation. 46-50
Speech and Audio Processing
- Wanying Ge, Hemlata Tak, Massimiliano Todisco, Nicholas W. D. Evans:
On the potential of jointly-optimised solutions to spoofing attack detection and automatic speaker verification. 51-55
- Pablo Gimeno, Alfonso Ortega, Antonio Miguel, Eduardo Lleida:
A Study on the Use of wav2vec Representations for Multiclass Audio Segmentation. 56-60
- Noelia Salor-Burdalo, Ascensión Gallardo-Antolín:
Respiratory Sound Classification Using an Attention LSTM Model with Mixup Data Augmentation. 61-65
- Juan Manuel Martín-Doñas, Iván González Torre, Aitor Álvarez, Joaquín Arellano:
The Vicomtech Spoofing-Aware Biometric System for the SASV Challenge. 66-70
- John Mendonça, Isabel Trancoso:
VoxCeleb-PT - a dataset for a speech processing course. 71-75
Affective Computing and Applications
- Miguel A. Pastor, Dayana Ribas, Alfonso Ortega, Antonio Miguel, Eduardo Lleida:
Cross-Corpus Speech Emotion Recognition with HuBERT Self-Supervised Representation. 76-80
- Cristina Luna Jiménez, Ricardo Kleinlein, Syaheerah Lebai Lutfi, Juan Manuel Montero, Fernando Fernández Martínez:
Analysis of Trustworthiness Recognition models from an aural and emotional perspective. 81-85
- Edward L. Campbell, Laura Docío Fernández, Nicholas Cummins, Carmen García-Mateo:
Speech and Text Processing for Major Depressive Disorder Detection. 86-90
- Clara Luis-Mingueza, Esther Rituerto-González, Carmen Peláez-Moreno:
Bridging the Semantic Gap with Affective Acoustic Scene Analysis: an Information Retrieval-based Approach. 91-95
- Emma Reyner-Fuentes, Esther Rituerto-González, Clara Luis-Mingueza, Carmen Peláez-Moreno, Celia López-Ongil:
Detecting Gender-based Violence aftereffects from Emotional Speech Paralinguistic Features. 96-100
- Rodrigo Sousa, Helena Sofia Pinto, Alberto Abad, Daniel Neto, Joaquim Gago:
Extraction of structural and semantic features for the identification of Psychosis in European Portuguese. 101-105
Natural Language Processing
- José-Ángel González, Encarna Segarra, Fernando García-Granada, Emilio Sanchis, Lluís F. Hurtado:
An Attentional Extractive Summarization Framework. 106-110
- Rui Ribeiro, Luísa Coheur:
SUMBot: Summarizing Context in Open-Domain Dialogue Systems. 111-115
- Jorge Mira Prats, Marcos Estecha-Garitagoitia, Mario Rodríguez-Cantelar, Luis Fernando D'Haro:
Automatic Detection of Inconsistencies in Open-Domain Chatbots. 116-120
- Andrés Piñeiro Martín, Carmen García-Mateo, Laura Docío Fernández, Maria del Carmen Lopez-Perez:
Ethics Guidelines for the Development of Virtual Assistants for e-Health. 121-125
- Asier Gutiérrez-Fandiño, David Pérez Fernández, Jordi Armengol-Estapé, David Griol, Zoraida Callejas:
esCorpius: A Massive Spanish Crawling Corpus. 126-130
Topics on Speech and Language Technologies
- Iván López-Espejo, Zheng-Hua Tan, Jesper Jensen:
An Experimental Study on Light Speech Features for Small-Footprint Keyword Spotting. 131-135
- Dayana Ribas, Miguel Ángel Pastor Yoldi, Antonio Miguel, David Martínez, Alfonso Ortega, Eduardo Lleida:
S3prl-Disorder: Open-Source Voice Disorder Detection System based in the Framework of S3PRL-toolkit. 136-140
- Filipe Reynaud, Eugénio Ribeiro, David Martins de Matos:
Active Learning Improves the Teacher's Experience: A Case Study in a Language Grounding Scenario. 141-145
- Celia García-Ruiz, Angel M. Gomez, Juan M. Martín-Doñas:
The role of window length and shift in complex-domain DNN-based speech enhancement. 146-150
- Yongjian Chen, Mireia Farrús:
Neural Detection of Cross-lingual Syntactic Knowledge. 151-155
- Sergio Izquierdo del Alamo, Beltrán Labrador, Alicia Lozano-Diez, Doroteo T. Toledano:
Efficient Transformers for End-to-End Neural Speaker Diarization. 156-160
- Vinícius G. dos Santos, Caroline Alves, Bruno Baldissera Carlotto, Bruno A. Papa Dias, Lucas Rafael Stefanel Gris, Renan de Lima Izaias, Maria Luiza Azevedo de Morais, Paula Marin de Oliveira, Rafael Sicoli, Flaviane Romani Fernandes Svartman, Marli Quadros Leite, Sandra Maria Aluísio:
CORAA NURC-SP Minimal Corpus: a manually annotated corpus of Brazilian Portuguese spontaneous speech. 161-165
- Federico Costa, Miquel India, Javier Hernando:
Speaker Characterization by means of Attention Pooling. 166-170
- Marina Escobar-Planas, Emilia Gómez, Carlos D. Martínez-Hinarejos:
Enhancing the Design of a Conversational Agent for an Ethical Interaction with Children. 171-175
- Isabel Carvalho, Hugo Gonçalo Oliveira, Catarina Silva:
Sentiment Analysis in Portuguese Dialogues. 176-180
- Eros Rosello, Alejandro Gómez Alanís, Manuel Chica, Angel M. Gomez, José A. González, Antonio M. Peinado:
On the application of conformers to logical access voice spoofing attack detection. 181-185
- Irune Zubiaga, Raquel Justo, M. Inés Torres, Mikel de Velasco:
Speech emotion recognition in Spanish TV Debates. 186-190
- Emanuel Matos, Mário Rodrigues, António J. S. Teixeira:
Assessing Transfer Learning and automatically annotated data in the development of Named Entity Recognizers for new domains. 191-195
- Anna Pompili, Tiago Luís, Nuno Monteiro, João Miranda, Carlos Mendes, Sérgio Paulo:
On the detection of acoustic events for public security: the challenges of the counter-terrorism domain. 196-200
- Manuel Chica, Alejandro Gómez Alanís, Eros Rosello, Angel M. Gomez, José A. González, Antonio M. Peinado:
Database dependence comparison in detection of physical access voice spoofing attacks. 201-205
- Cristina Luna Jiménez, Syaheerah Lebai Lutfi, Manuel Gil-Martín, Ricardo Kleinlein, Juan Manuel Montero, Fernando Fernández Martínez:
Measuring trust at zero-acquaintance using acted-emotional videos. 206-210
Special Session: Research and Development Projects, Demos, Ph.D. Thesis, Entrepreneurship
- Victoria Mingote, Antonio Miguel:
Representation and Metric Learning Advances for Deep Neural Network Face and Speaker Biometric Systems. 211-215
- Alejandro Gómez Alanís, José Andrés González López, Antonio Miguel Peinado Herreros:
Voice Biometric Systems based on Deep Neural Networks: A Ph.D. Thesis Overview. 216-220
- Juan Manuel Martín-Doñas, Antonio M. Peinado, Angel M. Gomez:
Online Multichannel Speech Enhancement combining Statistical Signal Processing and Deep Neural Networks: A Ph.D. Thesis Overview. 221-225
- Inma Hernáez, José Andrés González López, Eva Navas, José Luis Pérez-Córdoba, Ibon Saratxaga, Gonzalo Olivares, Jon Sánchez de la Fuente, Alberto Galdón, Victor García, Jesús del Castillo, Inge Salomons, Eder del Blanco Sierra:
ReSSInt project: voice restoration using Silent Speech Interfaces. 226-230
- Itziar Aldabe, Aritz Farwell, Eva Navas, Inma Hernáez, German Rigau:
ELE Project: an overview of the desk research. 231-234
- Mike Rizkalla, Thomas Chan, Emilio Granell, Chara Tsoukala, Aitor Carricondo, Carlos Bailon, María Teresa González, Vicent Alabau:
Snorble: An Interactive Children Companion. 235-236
- Ángel M. Gómez, Victoria E. Sánchez, Antonio M. Peinado, Juan M. Martín-Doñas, Alejandro Gómez Alanís, Amelia Villegas-Morcillo, Eros Rosello, Manuel Chica, Celia García, Iván López-Espejo:
Fusion of Classical Digital Signal Processing and Deep Learning methods (FTCAPPS). 237-240
- Carlos David Martínez-Hinarejos, David Gimeno-Gómez, Francisco Casacuberta, Emilio Granell, Roberto Paredes, Moisés Pastor, Enrique Vidal:
Spanish Lipreading in Realistic Scenarios: the LLEER project. 241-245
- José Andrés González López, Alberto Galdón, Gonzalo Olivares, Sneha Raman, David Murcia, Daniela Paolieri, Pedro Macizo, José L. Pérez-Córdoba, Antonio M. Peinado, Angel Gomez, Victoria E. Sánchez, Ana B. Chica:
Clinical Applications of Neuroscience: Locating Language Areas in Epileptic Patients and Restoring Speech in Paralyzed People. 246-250
- Juan Alos, Julien Boullié, M. Inés Torres, Eneko Ruiz, Andoni Beristain, Jacobo López Fernández, Iñaki Tellería, Janeth Carolina Carreño, Iker Garay, Arkaitz Carbajo, Amaia Santamaría, Urtzi Zubiate, Jon Ander Arzallus, Francisco Martínez, Adriana Martínez:
ORKESTA Comprehensive Solution for the Orchestration of Services and Soci-Sanitary Care at Home. 251-253
- Mikel Tainta, Javier Mikel Olaso, M. Inés Torres, Mirian Ecay-Torres, Nekane Balluerka, Naia Ros, Mikel Izquierdo, Mikel Saéz de Asteasu, Usune Etxebarria, Lucía Gayoso, Maider Mateo, Oliver Ibarrondo, Elena Alberdi, Estíbaliz Capetillo-Zárate, Jesus Angel Bravo, Pablo Martinez-Lage:
The CITA GO-ON trial: A person-centered, digital, intergenerational, and cost-effective dementia prevention multi-modal intervention model to guide strategic policies facing the demographic challenges of progressive aging. 254-256
- Antonio M. Peinado, Alejandro Gomez-Alanis, José Andrés González López, Angel M. Gomez, Eros Rosello, Manuel Chica-Villar, Jose C. Sanchez-Valera, José L. Pérez-Córdoba, Victoria E. Sánchez:
The BioVoz Project: Secure Speech Biometrics by Deep Processing Techniques. 257-261
- César González Ferreras, Valentín Cardeñoso-Payo, David Escudero Mancebo, Carlos Vivaracho-Pascual, Lourdes Aguilar, Valle Flores-Lucas, Mario Corrales-Astorgano:
Automatic evaluation of the pronunciation of people with Down syndrome in an educational video game (EvaProDown). 262-263
- Dayana Ribas, Antonio Miguel, Luis Guillen, Jose Javier Castejon, Juan Antonio Navarro, Alfonso Ortega, Luis Benavente:
SONOC Platform for Audio and Speech Analytics in Call Centers. 264-265
Albayzin Evaluations
- Haritz Arzelus, Iván G. Torres, Juan Manuel Martín-Doñas, Ander González-Docasal, Aitor Álvarez:
The Vicomtech-UPM Speech Transcription Systems for the Albayzín-RTVE 2022 Speech to Text Transcription Challenge. 266-270
- Fernando López, Jordi Luque:
TID Spanish ASR system for the Albayzin 2022 Speech-to-Text Transcription Challenge. 271-275
- Martin Kocour, Jahnavi Umesh, Martin Karafiát, Jan Svec, Fernando López, Jordi Luque, Karel Benes, Mireia Díez, Igor Szöke, Karel Veselý, Lukás Burget, Jan Cernocký:
BCN2BRNO: ASR System Fusion for Albayzin 2022 Speech to Text Challenge. 276-280
- Roman Shrestha, Cornelius Glackin, Julie A. Wall, Nigel Cannings:
Intelligent Voice Speaker Recognition and Diarization System for IberSpeech 2022 Albayzin Evaluations Speaker Diarization and Identity Assignment Challenge. 281-283
- Antonio Miguel, Alfonso Ortega, Eduardo Lleida:
ViVoLAB System Description for the S2TC IberSPEECH-RTVE 2022 challenge. 284
- Germán Bordel, Luis Javier Rodríguez-Fuentes, Mikel Peñagarikano, Amparo Varona:
GTTS Systems for the Albayzin 2022 Speech and Text Alignment Challenge. 285-289