default search action
Srikanth Ronanki
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c25]Raghuveer Peri, Sai Muralidhar Jayanthi, Srikanth Ronanki, Anshu Bhatia, Karel Mundnich, Saket Dingliwal, Nilaksh Das, Zejiang Hou, Goeric Huybrechts, Srikanth Vishnubhotla, Daniel Garcia-Romero, Sundararajan Srinivasan, Kyu J. Han, Katrin Kirchhoff:
SpeechGuard: Exploring the Adversarial Robustness of Multi-modal Large Language Models. ACL (Findings) 2024: 10018-10035 - [i19]Nilaksh Das, Saket Dingliwal, Srikanth Ronanki, Rohit Paturi, Zhaocheng Huang, Prashant Mathur, Jie Yuan, Dhanush Bekal, Xing Niu, Sai Muralidhar Jayanthi, Xilai Li, Karel Mundnich, Monica Sunkara, Sundararajan Srinivasan, Kyu J. Han, Katrin Kirchhoff:
SpeechVerse: A Large-scale Generalizable Audio Language Model. CoRR abs/2405.08295 (2024) - [i18]Raghuveer Peri, Sai Muralidhar Jayanthi, Srikanth Ronanki, Anshu Bhatia, Karel Mundnich, Saket Dingliwal, Nilaksh Das, Zejiang Hou, Goeric Huybrechts, Srikanth Vishnubhotla, Daniel Garcia-Romero, Sundararajan Srinivasan, Kyu J. Han, Katrin Kirchhoff:
SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models. CoRR abs/2405.08317 (2024) - [i17]Devang Kulshreshtha, Saket Dingliwal, Brady Houston, Nikolaos Pappas, Srikanth Ronanki:
Sequential Editing for Lifelong Training of Speech Recognition Models. CoRR abs/2406.17935 (2024) - 2023
- [c24]Veera Raghavendra Elluru, Devang Kulshreshtha, Rohit Paturi, Sravan Bodapati, Srikanth Ronanki:
Generalized Zero-Shot Audio-to-Intent Classification. ASRU 2023: 1-8 - [c23]Tyler Vuong, Karel Mundnich, Dhanush Bekal, Veera Raghavendra Elluru, Srikanth Ronanki, Sravan Bodapati:
AdaBERT-CTC: Leveraging BERT-CTC for Text-Only Domain Adaptation in ASR. EMNLP (Industry Track) 2023: 364-371 - [c22]Sai Muralidhar Jayanthi, Devang Kulshreshtha, Saket Dingliwal, Srikanth Ronanki, Sravan Bodapati:
Retrieve and Copy: Scaling ASR Personalization to Large Catalogs. EMNLP (Industry Track) 2023: 631-639 - [c21]Xilai Li, Goeric Huybrechts, Srikanth Ronanki, Jeff Farris, Sravan Bodapati:
Dynamic Chunk Convolution for Unified Streaming and Non-Streaming Conformer ASR. ICASSP 2023: 1-5 - [c20]Goeric Huybrechts, Srikanth Ronanki, Xilai Li, Hadis Nosrati, Sravan Bodapati, Katrin Kirchhoff:
DCTX-Conformer: Dynamic context carry-over for low latency unified streaming and non-streaming Conformer. INTERSPEECH 2023: 1658-1662 - [c19]Dhanush Bekal, Karthik Gopalakrishnan, Karel Mundnich, Srikanth Ronanki, Sravan Bodapati, Katrin Kirchhoff:
A Metric-Driven Approach to Conformer Layer Pruning for Efficient ASR Inference. INTERSPEECH 2023: 4079-4083 - [i16]Xilai Li, Goeric Huybrechts, Srikanth Ronanki, Jeff Farris, Sravan Bodapati:
Dynamic Chuck Convolution for Unified Streaming And Non-streaming Conformer ASR. CoRR abs/2304.09325 (2023) - [i15]Goeric Huybrechts, Srikanth Ronanki, Xilai Li, Hadis Nosrati, Sravan Bodapati, Katrin Kirchhoff:
DCTX-Conformer: Dynamic context carry-over for low latency unified streaming and non-streaming Conformer. CoRR abs/2306.08175 (2023) - [i14]Veera Raghavendra Elluru, Devang Kulshreshtha, Rohit Paturi, Sravan Bodapati, Srikanth Ronanki:
Generalized zero-shot audio-to-intent classification. CoRR abs/2311.02482 (2023) - [i13]Sai Muralidhar Jayanthi, Devang Kulshreshtha, Saket Dingliwal, Srikanth Ronanki, Sravan Bodapati:
Retrieve and Copy: Scaling ASR Personalization to Large Catalogs. CoRR abs/2311.08402 (2023) - 2022
- [c18]Dhanush Bekal, Sundararajan Srinivasan, Srikanth Ronanki, Sravan Bodapati, Katrin Kirchhoff:
Contextual Acoustic Barge-In Classification for Spoken Dialog Systems. INTERSPEECH 2022: 1091-1095 - [c17]Saket Dingliwal, Monica Sunkara, Srikanth Ronanki, Jeff Farris, Katrin Kirchhoff, Sravan Bodapati:
Personalization of CTC Speech Recognition Models. SLT 2022: 302-309 - [i12]Saket Dingliwal, Monica Sunkara, Srikanth Ronanki, Jeff Farris, Katrin Kirchhoff, Sravan Bodapati:
Personalization of CTC Speech Recognition Models. CoRR abs/2210.09510 (2022) - [i11]Dhanush Bekal, Sundararajan Srinivasan, Sravan Bodapati, Srikanth Ronanki, Katrin Kirchhoff:
Device Directedness with Contextual Cues for Spoken Dialog Systems. CoRR abs/2211.13280 (2022) - 2021
- [c16]Siddharth Dalmia, Yuzong Liu, Srikanth Ronanki, Katrin Kirchhoff:
Transformer-Transducers for Code-Switched Speech Recognition. ICASSP 2021: 5859-5863 - [c15]Ashish Shenoy, Sravan Bodapati, Monica Sunkara, Srikanth Ronanki, Katrin Kirchhoff:
Adapting Long Context NLM for ASR Rescoring in Conversational Agents. Interspeech 2021: 3246-3250 - [i10]Ashish Shenoy, Sravan Bodapati, Monica Sunkara, Srikanth Ronanki, Katrin Kirchhoff:
"What's The Context?" : Long Context NLM Adaptation for ASR Rescoring in Conversational Agents. CoRR abs/2104.11070 (2021) - 2020
- [c14]Monica Sunkara, Srikanth Ronanki, Dhanush Bekal, Sravan Bodapati, Katrin Kirchhoff:
Multimodal Semi-Supervised Learning Framework for Punctuation Prediction in Conversational Speech. INTERSPEECH 2020: 4911-4915 - [i9]Monica Sunkara, Srikanth Ronanki, Kalpit Dixit, Sravan Bodapati, Katrin Kirchhoff:
Robust Prediction of Punctuation and Truecasing for Medical ASR. CoRR abs/2007.02025 (2020) - [i8]Monica Sunkara, Srikanth Ronanki, Dhanush Bekal, Sravan Bodapati, Katrin Kirchhoff:
Multimodal Semi-supervised Learning Framework for Punctuation Prediction in Conversational Speech. CoRR abs/2008.00702 (2020) - [i7]Siddharth Dalmia, Yuzong Liu, Srikanth Ronanki, Katrin Kirchhoff:
Transformer-Transducers for Code-Switched Speech Recognition. CoRR abs/2011.15023 (2020)
2010 – 2019
- 2019
- [b1]Srikanth Ronanki:
Prosody generation for text-to-speech synthesis. University of Edinburgh, UK, 2019 - [c13]Javier Latorre, Jakub Lachowicz, Jaime Lorenzo-Trueba, Thomas Merritt, Thomas Drugman, Srikanth Ronanki, Viacheslav Klimkov:
Effect of Data Reduction on Sequence-to-sequence Neural TTS. ICASSP 2019: 7075-7079 - [c12]Viacheslav Klimkov, Srikanth Ronanki, Jonas Rohnke, Thomas Drugman:
Fine-Grained Robust Prosody Transfer for Single-Speaker Neural Text-To-Speech. INTERSPEECH 2019: 4440-4444 - [c11]Nishant Prateek, Mateusz Lajszczak, Roberto Barra-Chicote, Thomas Drugman, Jaime Lorenzo-Trueba, Thomas Merritt, Srikanth Ronanki, Trevor Wood:
In Other News: a Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data. NAACL-HLT (2) 2019: 205-213 - [i6]Nishant Prateek, Mateusz Lajszczak, Roberto Barra-Chicote, Thomas Drugman, Jaime Lorenzo-Trueba, Thomas Merritt, Srikanth Ronanki, Trevor Wood:
In Other News: A Bi-style Text-to-speech Model for Synthesizing Newscaster Voice with Limited Data. CoRR abs/1904.02790 (2019) - [i5]Viacheslav Klimkov, Srikanth Ronanki, Jonas Rohnke, Thomas Drugman:
Fine-grained robust prosody transfer for single-speaker neural text-to-speech. CoRR abs/1907.02479 (2019) - [i4]Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Fergus Henderson, Rob Clark, Yu Zhang, Quan Wang, Ye Jia, Kai Onuma, Koji Mushika, Takashi Kaneda, Yuan Jiang, Li-Juan Liu, Yi-Chiao Wu, Wen-Chin Huang, Tomoki Toda, Kou Tanaka, Hirokazu Kameoka, Ingmar Steiner, Driss Matrouf, Jean-François Bonastre, Avashna Govender, Srikanth Ronanki, Jing-Xuan Zhang, Zhen-Hua Ling:
The ASVspoof 2019 database. CoRR abs/1911.01601 (2019) - 2018
- [c10]Zack Hodari, Oliver Watts, Srikanth Ronanki, Simon King:
Learning Interpretable Control Dimensions for Speech Synthesis by Using External Data. INTERSPEECH 2018: 32-36 - [i3]Javier Latorre, Jakub Lachowicz, Jaime Lorenzo-Trueba, Thomas Merritt, Thomas Drugman, Srikanth Ronanki, Klimkov Viacheslav:
Effect of data reduction on sequence-to-sequence neural TTS. CoRR abs/1811.06315 (2018) - 2017
- [c9]Srikanth Ronanki, Manuel Sam Ribeiro, Felipe Espic, Oliver Watts:
The CSTR entry to the Blizzard Challenge 2017. Blizzard Challenge 2017 - [c8]Srikanth Ronanki, Oliver Watts, Simon King:
A Hierarchical Encoder-Decoder Model for Statistical Parametric Speech Synthesis. INTERSPEECH 2017: 1133-1137 - 2016
- [c7]Thomas Merritt, Srikanth Ronanki, Zhizheng Wu, Oliver Watts:
The CSTR entry to the Blizzard Challenge 2016. Blizzard Challenge 2016 - [c6]Gustav Eje Henter, Srikanth Ronanki, Oliver Watts, Mirjam Wester, Zhizheng Wu, Simon King:
Robust TTS duration modelling using DNNS. ICASSP 2016: 5130-5134 - [c5]Srikanth Ronanki, Gustav Eje Henter, Zhizheng Wu, Simon King:
A Template-Based Approach for Speech Synthesis Intonation Generation Using LSTMs. INTERSPEECH 2016: 2463-2467 - [c4]Srikanth Ronanki, Oliver Watts, Simon King, Gustav Eje Henter:
Median-based generation of synthetic speech durations using a non-parametric approach. SLT 2016: 686-692 - [c3]Srikanth Ronanki, Siva Reddy Gangireddy, Bajibabu Bollepalli, Simon King:
DNN-based Speech Synthesis for Indian Languages from ASCII text. SSW 2016: 70-75 - [c2]Srikanth Ronanki, Zhizheng Wu, Oliver Watts, Simon King:
A Demonstration of the Merlin Open Source Neural Network Speech Synthesis System. SSW 2016: 124 - [i2]Srikanth Ronanki, Siva Reddy Gangireddy, Bajibabu Bollepalli, Simon King:
DNN-based Speech Synthesis for Indian Languages from ASCII text. CoRR abs/1608.05374 (2016) - [i1]Srikanth Ronanki, Oliver Watts, Simon King, Gustav Eje Henter:
Median-Based Generation of Synthetic Speech Durations using a Non-Parametric Approach. CoRR abs/1608.06134 (2016) - 2015
- [c1]Oliver Watts, Srikanth Ronanki, Zhizeng Wu, Tuomi Raito, Attni Suni:
The NST-GlottHMM entry to the Blizzard Challenge 2015. Blizzard Challenge 2015
Coauthor Index
aka: Sravan Bodapati
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-09-29 00:25 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint