Pavel Denisov
2020 – today
- 2024
- [c20] Sakshi Deo Shukla, Pavel Denisov, Tugtekin Turan:
Advancing Topic Segmentation of Broadcasted Speech with Multilingual Semantic Embeddings. ECAI 2024: 3956-3963
- [c19] Xuankai Chang, Brian Yan, Kwanghee Choi, Jee-Weon Jung, Yichen Lu, Soumi Maiti, Roshan S. Sharma, Jiatong Shi, Jinchuan Tian, Shinji Watanabe, Yuya Fujita, Takashi Maekaku, Pengcheng Guo, Yao-Fei Cheng, Pavel Denisov, Kohei Saijo, Hsiu-Hsuan Wang:
Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study. ICASSP 2024: 11481-11485
- [c18] Pavel Denisov, Thang Vu:
Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training. NAACL-HLT (Findings) 2024: 814-834
- [i18] Pavel Denisov, Ngoc Thang Vu:
Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training. CoRR abs/2404.10922 (2024)
- [i17] Sakshi Deo Shukla, Pavel Denisov, Tugtekin Turan:
Advancing Topic Segmentation of Broadcasted Speech with Multilingual Semantic Embeddings. CoRR abs/2409.06222 (2024)
- [i16] Mehdi Ali, Michael Fromm, Klaudia Thellmann, Jan Ebert, Alexander Arno Weber, Richard Rutmann, Charvi Jain, Max Lübbering, Daniel Steinigen, Johannes Leveling, Katrin Klug, Jasper Schulze Buschhoff, Lena Jurkschat, Hammam Abdelwahab, Benny Jörg Stein, Karl-Heinz Sylla, Pavel Denisov, Nicolo' Brandizzi, Qasid Saleem, Anirban Bhowmick, Lennard Helmer, Chelsea Maria John, Pedro Ortiz Suarez, Malte Ostendorff, Alex Jude, Lalith Manjunath, Samuel Weinbach, Carolin Penke, Oleg Filatov, Shima Asaadi, Fabio Barth, Rafet Sifa, Fabian Küch, Andreas Herten, René Jäkel, Georg Rehm, Stefan Kesselheim, Joachim Köhler, Nicolas Flores-Herr:
Teuken-7B-Base & Teuken-7B-Instruct: Towards European LLMs. CoRR abs/2410.03730 (2024)
- 2023
- [c17] Pavel Denisov, Ngoc Thang Vu:
Leveraging Multilingual Self-Supervised Pretrained Models for Sequence-to-Sequence End-to-End Spoken Language Understanding. ASRU 2023: 1-8
- [c16] Florian Lux, Julia Koch, Sarina Meyer, Thomas Bott, Nadja Schauffler, Pavel Denisov, Antje Schweitzer, Ngoc Thang Vu:
The IMS Toucan System for the Blizzard Challenge 2023. Blizzard Challenge 2023
- [c15] Sarina Meyer, Florian Lux, Julia Koch, Pavel Denisov, Pascal Tilli, Ngoc Thang Vu:
Prosody Is Not Identity: A Speaker Anonymization Approach Using Prosody Cloning. ICASSP 2023: 1-5
- [i15] Xuankai Chang, Brian Yan, Kwanghee Choi, Jee-Weon Jung, Yichen Lu, Soumi Maiti, Roshan S. Sharma, Jiatong Shi, Jinchuan Tian, Shinji Watanabe, Yuya Fujita, Takashi Maekaku, Pengcheng Guo, Yao-Fei Cheng, Pavel Denisov, Kohei Saijo, Hsiu-Hsuan Wang:
Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study. CoRR abs/2309.15800 (2023)
- [i14] Pavel Denisov, Ngoc Thang Vu:
Leveraging Multilingual Self-Supervised Pretrained Models for Sequence-to-Sequence End-to-End Spoken Language Understanding. CoRR abs/2310.06103 (2023)
- [i13] Florian Lux, Julia Koch, Sarina Meyer, Thomas Bott, Nadja Schauffler, Pavel Denisov, Antje Schweitzer, Ngoc Thang Vu:
The IMS Toucan System for the Blizzard Challenge 2023. CoRR abs/2310.17499 (2023)
- 2022
- [j1] Injy Hamed, Pavel Denisov, Chia-Yu Li, Mohamed Elmahdy, Slim Abdennadher, Ngoc Thang Vu:
Investigations on speech recognition systems for low-resource dialectal Arabic-English code-switching speech. Comput. Speech Lang. 72: 101278 (2022)
- [c14] Siddhant Arora, Siddharth Dalmia, Pavel Denisov, Xuankai Chang, Yushi Ueda, Yifan Peng, Yuekai Zhang, Sujay Kumar, Karthik Ganesan, Brian Yan, Ngoc Thang Vu, Alan W. Black, Shinji Watanabe:
ESPnet-SLU: Advancing Spoken Language Understanding Through ESPnet. ICASSP 2022: 7167-7171
- [c13] Sarina Meyer, Florian Lux, Pavel Denisov, Julia Koch, Pascal Tilli, Ngoc Thang Vu:
Speaker Anonymization with Phonetic Intermediate Representations. INTERSPEECH 2022: 4925-4929
- [c12] Sarina Meyer, Pascal Tilli, Pavel Denisov, Florian Lux, Julia Koch, Ngoc Thang Vu:
Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy. SLT 2022: 912-919
- [i12] Sarina Meyer, Florian Lux, Pavel Denisov, Julia Koch, Pascal Tilli, Ngoc Thang Vu:
Speaker Anonymization with Phonetic Intermediate Representations. CoRR abs/2207.04834 (2022)
- [i11] Sarina Meyer, Pascal Tilli, Pavel Denisov, Florian Lux, Julia Koch, Ngoc Thang Vu:
Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy. CoRR abs/2210.07002 (2022)
- 2021
- [c11] Pavel Denisov, Manuel Mager, Ngoc Thang Vu:
IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task. IWSLT 2021: 175-181
- [c10] Abteen Ebrahimi, Manuel Mager, Adam Wiemerslage, Pavel Denisov, Arturo Oncevay, Danni Liu, Sai Koneru, Enes Yavuz Ugan, Zhaolin Li, Jan Niehues, Monica Romero, Iván G. Torre, Tanel Alumäe, Jiaming Kong, Sergey Polezhaev, Yury Belousov, Wei-Rui Chen, Peter Sullivan, Ife Adebara, Bashar Talafha, Alcides Alcoba Inciarte, Muhammad Abdul-Mageed, Luis Chiruzzo, Rolando Coto-Solano, Hilaria Cruz, Sofía Flores-Solórzano, Aldo Andrés Alvarez López, Iván V. Meza-Ruíz, John E. Ortega, Alexis Palmer, Rodolfo Zevallos, Kristine Stenzel, Thang Vu, Katharina Kann:
Findings of the Second AmericasNLP Competition on Speech-to-Text Translation. NeurIPS (Competition and Demos) 2021: 217-232
- [c9] Desh Raj, Pavel Denisov, Zhuo Chen, Hakan Erdogan, Zili Huang, Maokui He, Shinji Watanabe, Jun Du, Takuya Yoshioka, Yi Luo, Naoyuki Kanda, Jinyu Li, Scott Wisdom, John R. Hershey:
Integration of Speech Separation, Diarization, and Recognition for Multi-Speaker Meetings: System Description, Comparison, and Analysis. SLT 2021: 897-904
- [i10] Pavel Denisov, Manuel Mager, Ngoc Thang Vu:
IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task. CoRR abs/2106.16055 (2021)
- [i9] Injy Hamed, Pavel Denisov, Chia-Yu Li, Mohamed Elmahdy, Slim Abdennadher, Ngoc Thang Vu:
Investigations on Speech Recognition Systems for Low-Resource Dialectal Arabic-English Code-Switching Speech. CoRR abs/2108.12881 (2021)
- [i8] Siddhant Arora, Siddharth Dalmia, Pavel Denisov, Xuankai Chang, Yushi Ueda, Yifan Peng, Yuekai Zhang, Sujay Kumar, Karthik Ganesan, Brian Yan, Ngoc Thang Vu, Alan W. Black, Shinji Watanabe:
ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet. CoRR abs/2111.14706 (2021)
- 2020
- [c8] Chia-Yu Li, Daniel Ortega, Dirk Väth, Florian Lux, Lindsey Vanderlyn, Maximilian Schmidt, Michael Neumann, Moritz Völkel, Pavel Denisov, Sabrina Jenne, Zorica Kacarevic, Ngoc Thang Vu:
ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and Socially-engaged Conversational Agents. ACL (demo) 2020: 279-286
- [c7] Pavel Denisov, Ngoc Thang Vu:
Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning. INTERSPEECH 2020: 881-885
- [i7] Chia-Yu Li, Daniel Ortega, Dirk Väth, Florian Lux, Lindsey Vanderlyn, Maximilian Schmidt, Michael Neumann, Moritz Völkel, Pavel Denisov, Sabrina Jenne, Zorica Kacarevic, Ngoc Thang Vu:
ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and Socially-engaged Conversational Agents. CoRR abs/2005.01777 (2020)
- [i6] Pavel Denisov, Ngoc Thang Vu:
Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning. CoRR abs/2007.01836 (2020)
- [i5] Desh Raj, Pavel Denisov, Zhuo Chen, Hakan Erdogan, Zili Huang, Mao-Kui He, Shinji Watanabe, Jun Du, Takuya Yoshioka, Yi Luo, Naoyuki Kanda, Jinyu Li, Scott Wisdom, John R. Hershey:
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis. CoRR abs/2011.02014 (2020)
2010 – 2019
- 2019
- [c6] Daniel Ortega, Chia-Yu Li, Gisela Vallejo, Pavel Denisov, Ngoc Thang Vu:
Context-aware Neural-based Dialog Act Classification on Automatically Generated Transcriptions. ICASSP 2019: 7265-7269
- [c5] Alexander Zakharov, Pavel Denisov:
Advantages and Limitations of Forward Squint SAR In Single Pass Interferometric Mapping Of Topography. IGARSS 2019: 8614-8616
- [c4] Pavel Denisov, Ngoc Thang Vu:
End-to-End Multi-Speaker Speech Recognition Using Speaker Embeddings and Transfer Learning. INTERSPEECH 2019: 4425-4429
- [i4] Daniel Ortega, Chia-Yu Li, Gisela Vallejo, Pavel Denisov, Ngoc Thang Vu:
Context-aware Neural-based Dialog Act Classification on Automatically Generated Transcriptions. CoRR abs/1902.11060 (2019)
- [i3] Pavel Denisov, Ngoc Thang Vu:
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning. CoRR abs/1908.04737 (2019)
- [i2] Pavel Denisov, Ngoc Thang Vu:
IMS-Speech: A Speech to Text Tool. CoRR abs/1908.04743 (2019)
- 2018
- [c3] Pavel Denisov, Ngoc Thang Vu, Marc Ferras Font:
Unsupervised Domain Adaptation by Adversarial Learning for Robust Speech Recognition. ITG Symposium on Speech Communication 2018: 1-5
- [c2] Valentina Kustikova, Mikhail Krivonosov, Alexey S. Pimashkin, Pavel Denisov, Alexey Zaikin, Mikhail Ivanchenko, Iosif B. Meyerov, Alexey V. Semyanov:
CalciumCV: Computer Vision Software for Calcium Signaling in Astrocytes. AIST 2018: 168-179
- [c1] Alexander Zakharov, Ludmila Zakharova, Polina Mikhaylyukova, Pavel Denisov:
Atmospheric Effects on Radarsat-2 Interferograms of Tolbachik Volcanic Complex. IGARSS 2018: 2192-2195
- [i1] Pavel Denisov, Ngoc Thang Vu, Marc Ferras Font:
Unsupervised Domain Adaptation by Adversarial Learning for Robust Speech Recognition. CoRR abs/1807.11284 (2018)
last updated on 2024-11-15 19:35 CET by the dblp team
all metadata released as open data under CC0 1.0 license