Speech Communication, Volume 51
Volume 51, Number 1, January 2009
- Ioannis Andrianakis, Paul R. White:
Speech spectral amplitude estimators using optimally shaped Gamma and Chi priors. 1-14
- Hyung-Min Park, Richard M. Stern:
Spatial separation of speech signals using amplitude estimation based on interaural comparisons of zero-crossings. 15-25
- René Carré:
Dynamic properties of an acoustic tube: Prediction of vowel systems. 26-41
- Randy Gomez, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Techniques in rapid unsupervised speaker adaptation based on HMM-Sufficient Statistics. 42-57
- Serajul Haque, Roberto Togneri, Anthony Zaknich:
Perceptual features for automatic speech recognition in noisy environments. 58-75
- Shigeaki Amano, Shuichi Sakamoto, Tadahisa Kondo, Yôiti Suzuki:
Development of familiarity-controlled word lists 2003 (FW03) to assess spoken-word intelligibility in Japanese. 76-82
Volume 51, Number 2, February 2009
- Wooil Kim, John H. L. Hansen:
Feature compensation in the cepstral domain employing model combination. 83-96
- Julie Fontecave Jallon, Frédéric Berthommier:
A semi-automatic method for extracting vocal tract movements from X-ray films. 97-115
- Hanny den Ouden, Leo G. M. Noordman, Jacques M. B. Terken:
Prosodic realizations of global and local structure and rhetorical relations in read aloud news reports. 116-129
- Annika Hämäläinen, Louis ten Bosch, Lou Boves:
Modelling pronunciation variation with single-path and multi-path syllable models: Issues to consider. 130-150
- Franz Pernkopf, Tuan Van Pham, Jeff A. Bilmes:
Broad phonetic classification using discriminative Bayesian networks. 151-166
- Wilbert Heeringa, Keith Johnson, Charlotte Gooskens:
Measuring Norwegian dialect distances using acoustic features. 167-183
- Tobias S. Andersen, Kaisa Tiippana, Jari Laarni, Ilpo Kojo, Mikko Sams:
The role of visual spatial attention in audiovisual speech perception. 184-193
Volume 51, Number 3, March 2009
- Hedvig Kjellström, Olov Engwall:
Audiovisual-to-articulatory inversion. 195-209
- Monja A. Knoll, Maria Uther, Alan Costall:
Effects of low-pass filtering on the judgment of vocal affect in speech directed to infants, adults and foreigners. 210-216
- Mónica Caballero, Asunción Moreno, Albino Nogueiras:
Multidialectal Spanish acoustic modeling for speech recognition. 217-229
- Yipeng Li, DeLiang Wang:
On the optimality of ideal binary time-frequency masks. 230-239
- Daniel Recasens, Aina Espinosa:
Dispersion and variability in Catalan five and six peripheral vowel systems. 240-258
- Huijun Ding, Ing Yann Soon, Soo Ngee Koh, Chai Kiat Yeo:
A spectral filtering method based on hybrid wiener filters for speech enhancement. 259-267
- Zeynep Inanoglu, Steve J. Young:
Data-driven emotion conversion in spoken English. 268-283
- Catherine Breslin, Mark J. F. Gales:
Directed decision trees for generating complementary systems. 284-295
- Monja A. Knoll, Lisa Scharrer, Alan Costall:
Are actresses better simulators than female students? The effects of simulation on prosodic modifications of infant- and foreigner-directed speech. 296-305
Volume 51, Number 4, April 2009
- Edward Ozimek, Dariusz Kutzner, Aleksander Sek, Andrzej Wicher:
Development and evaluation of Polish digit triplet test for auditory screening. 307-316
- Joseph Keshet, David Grangier, Samy Bengio:
Discriminative keyword spotting. 317-329
- Suphattharachai Chomphan, Takao Kobayashi:
Tone correctness improvement in speaker-independent average-voice-based Thai speech synthesis. 330-343
- D. Sciamarella, Guillermo Artana:
A water hammer analysis of pressure and flow in the voice production system. 344-351
- John-Paul Hosom:
Speaker-independent phoneme alignment using transition-dependent states. 352-368
- Adrian Garcia-Sierra, Randy L. Diehl, Craig Champlin:
Testing the double phonemic boundary in bilinguals. 369-378
- Markpong Jongtaveesataporn, Issara Thienlikit, Chai Wutiwiwatchai, Sadaoki Furui:
Lexical units for Thai LVCSR. 379-389
- Angel M. Gomez, Antonio M. Peinado, Victoria E. Sánchez, José L. Carmona:
A robust scheme for distributed speech recognition over loss-prone packet channels. 390-400
Volume 51, Number 5, May 2009
- Carlo Magi, Jouni Pohjalainen, Tom Bäckström, Paavo Alku:
Stabilised weighted linear prediction. 401-411
- Minwoo Jeong, Gary Geunbae Lee:
Multi-domain spoken language understanding with transfer learning. 412-424
- Andreas K. Maier, Tino Haderlein, Ulrich Eysholdt, Frank Rosanowski, Anton Batliner, Maria Schuster, Elmar Nöth:
PEAKS - A system for the automatic evaluation of voice and speech disorders. 425-437
- Peter Jancovic, Münevver Köküer:
Incorporating the voicing information into HMM-based automatic speech recognition in noisy environments. 438-451
- Francisco Campillo Díaz, Jan P. H. van Santen, Eduardo Rodríguez Banga:
Integrating phrasing and intonation modelling using syntactic and morphosyntactic information. 452-465
- Cheongjae Lee, Sangkeun Jung, Seokhwan Kim, Gary Geunbae Lee:
Example-based dialog modeling for practical multi-domain dialog system. 466-484
Volume 51, Number 6, June 2009
- Carlos Molina, Néstor Becerra Yoma, Jorge Wuth, Hiram Vivanco:
ASR based pronunciation evaluation with automatically generated competing vocabulary and classifier fusion. 485-498
- Matteo Gerosa, Diego Giuliani, Fabio Brugnara:
Towards age-independent acoustic modeling. 499-509
- Shigeaki Amano, Tadahisa Kondo, Kazumi Kato, Tomohiro Nakatani:
Development of Japanese infant speech database from longitudinal recordings. 510-520
- Kihyeon Kim, Robert H. Baran, Hanseok Ko:
Extension of two-channel transfer function based generalized sidelobe canceller for dealing with both background and point-source noise. 521-533
- Shi-Xiong Zhang, Man-Wai Mak:
A new adaptation approach to high-level speaker-model creation in speaker verification. 534-550
- Matthias Wölfel:
Signal adaptive spectral envelope estimation for robust speech recognition. 551-561
Volume 51, Number 7, July 2009
- Mats Blomberg, Kjell Elenius, David House, Inger Karlsson:
Note from the Guest Editors. 563
- Gunnar Fant:
A personal note from Gunnar Fant. 564-568
- Joseph-Jean Mariani:
Research infrastructures for Human Language Technologies: A vision from France. 569-584
- Yoko Greenberg, Nagisa Shibuya, Minoru Tsuzaki, Hiroaki Kato, Yoshinori Sagisaka:
Analysis on paralinguistic prosody control in perceptual impression space using multiple dimensional scaling. 585-593
- Nina Grønnum:
A Danish phonetically annotated spontaneous speech corpus (DanPASS). 594-603
- Dominic W. Massaro, Alexandra Jesse:
Read my lips: speech distortions in musical lyrics can be overcome (slightly) by facial information. 604-621
- Björn Lindblom, Randy L. Diehl, Carl Creeger:
Do 'Dominant Frequencies' explain the listener's response to formant and spectrum shape variations? 622-629
- Catherine Pelachaud:
Studies on gesture expressivity for a virtual agent. 630-639
- Andrew Rosenberg, Julia Hirschberg:
Charisma perception from text and speech. 640-655
Volume 51, Number 8, August 2009
- Yang Shao, DeLiang Wang:
Sequential organization of speech in computational auditory scene analysis. 657-667
- David P. Messing, Lorraine Delhorne, Ed Bruckert, Louis D. Braida, Oded Ghitza:
A non-linear efferent-inspired model of the auditory system; matching human confusions in stationary noise. 668-683
- Soyoung Lee, Gregory K. Iverson:
Vowel development in English and Korean: Similarities and differences in linguistic and non-linguistic factors. 684-694
- Philip J. B. Jackson, Veena D. Singampalli:
Statistical identification of articulation constraints in the production of speech. 695-710
- Chai Wutiwiwatchai, Sadaoki Furui:
Corrigendum to: "Thai speech processing technology: A review" [Speech Communication 49 (1) (2007) 8-27]. 711
Volume 51, Number 9, September 2009
- Mohamed Chetouani, Marcos Faúndez-Zanuy, Amir Hussain, Bruno Gas, Jean-Luc Zarader, Kuldip K. Paliwal:
Special issue on non-linear and non-conventional speech processing. 713
- Heyun Huang, Fuhuei Lin:
A speech feature extraction method using complexity measure for voice activity detection in WGN. 714-723
- Christophe Charbuillet, Bruno Gas, Mohamed Chetouani, Jean-Luc Zarader:
Optimizing feature complementarity by evolution strategy: Application to automatic speaker verification. 724-731
- Leila Zouari, Gérard Chollet:
Efficient codebooks for fast and accurate low resource ASR systems. 732-743
- Ignasi Iriondo Sanz, Santiago Planet, Joan Claudi Socoró, Elisa Martínez, Francesc Alías, Carlos Monzo:
Automatic refinement of an expressive speech corpus assembling subjective perception and automatic classification. 744-758
- Pedro Gómez Vilda, Roberto Fernández-Baíllo, María Victoria Rodellar Biarge, Victor Nieto Lluis, Agustín Álvarez Marquina, Luis Miguel Mazaira-Fernández, Rafael Martínez-Olalla, Juan Ignacio Godino-Llorente:
Glottal Source biometrical signature for voice pathology detection. 759-781
- Aïcha Bouzid, Noureddine Ellouze:
Voice source parameter measurement based on multi-scale analysis of electroglottographic signal. 782-792
- Bernd J. Kröger, Jim Kannampuzha, Christiane Neuschaefer-Rube:
Towards a neurocomputational model of speech production and perception. 793-809
- Cheng-Hsiung Hsieh, Ting-Yu Feng, Po-Chin Huang:
Energy-based VAD with grey magnitude spectral subtraction. 810-819
- Enric Monte-Moreno, Mohamed Chetouani, Marcos Faúndez-Zanuy, Jordi Solé i Casals:
Maximum likelihood linear programming data fusion for speaker recognition. 820-830
Volume 51, Number 10, October 2009
- Maxine Eskénazi:
Guest Editorial. 831
- Maxine Eskénazi:
An overview of spoken language technology for education. 832-844
- Helmer Strik, Khiet P. Truong, Febe de Wet, Catia Cucchiarini:
Comparing different approaches for automatic pronunciation error detection. 845-852
- Catia Cucchiarini, Ambra Neri, Helmer Strik:
Oral proficiency training in Dutch L2: The contribution of ASR-based corrective feedback. 853-863
- Febe de Wet, Christa van der Walt, Thomas Niesler:
Automatic assessment of oral language proficiency and listening comprehension. 864-874
- Yuichi Ohkawa, Motoyuki Suzuki, Hirokazu Ogasawara, Akinori Ito, Shozo Makino:
A speaker adaptation method for non-native speech using learners' native utterances for computer-assisted language learning systems. 875-882
- Klaus Zechner, Derrick Higgins, Xiaoming Xi, David M. Williamson:
Automatic scoring of non-native spontaneous speech in tests of spoken English. 883-895
- Si Wei, Guoping Hu, Yu Hu, Ren-Hua Wang:
A new method for mispronunciation detection using Support Vector Machine based on Pronunciation Space Models. 896-905
- Zöe Handley:
Is text-to-speech synthesis ready for use in computer-assisted language learning? 906-919
- Daniel Felps, Heather Bortfeld, Ricardo Gutierrez-Osuna:
Foreign accent conversion in computer assisted pronunciation training. 920-932
- Maria Paola Bissiri, Hartmut R. Pfitzinger:
Italian speakers learn lexical stress of German morphologically complex words. 933-947
- Oscar Saz, Shou-Chun Yin, Eduardo Lleida, Richard C. Rose, Carlos Vaquero, William Ricardo Rodríguez:
Tools and Technologies for Computer-Aided Speech and Language Therapy. 948-967
- Patti Price, Joseph Tepperman, Markus Iseli, Thao Duong, Matthew Black, Shizhen Wang, Christy Kim Boscardin, Margaret Heritage, P. David Pearson, Shrikanth S. Narayanan, Abeer Alwan:
Assessment of emerging reading skills in young native speakers and language learners. 968-984
- Jacques Duchateau, Yuk On Kong, Leen Cleuren, Lukas Latacz, Jan Roelens, Abdurrahman Samir, Kris Demuynck, Pol Ghesquière, Werner Verhelst, Hugo Van hamme:
Developing a reading tutor: Design and evaluation of dedicated speech recognition and synthesis modules. 985-994
- Hongcui Wang, Christopher J. Waple, Tatsuya Kawahara:
Computer Assisted Language Learning system based on dynamic question generation and error prediction for automatic speech recognition. 995-1005
- Ian McGraw, Brandon Yoshimoto, Stephanie Seneff:
Speech-enabled card games for incidental vocabulary acquisition in a foreign language. 1006-1023
- Preben Wik, Anna Hjalmarsson:
Embodied conversational agents in computer assisted language learning. 1024-1037
Volume 51, Number 11, November 2009
- Heiga Zen, Keiichi Tokuda, Alan W. Black:
Statistical parametric speech synthesis. 1039-1064
- Gilles Gonon, Frédéric Bimbot, Rémi Gribonval:
Probabilistic scoring using decision trees for fast and scalable speaker recognition. 1065-1081
- Jan P. H. van Santen, Emily Tucker Prud'hommeaux, Lois M. Black:
Automated assessment of prosody production. 1082-1097
- Nima Derakhshan, Ahmad Akbari, Ahmad Ayatollahi:
Noise power spectrum estimation using constrained variance spectral smoothing and minima tracking. 1098-1113
- Daniel A. Taft, David B. Grayden, Anthony N. Burkitt:
Speech coding with traveling wave delays: Desynchronizing cochlear implant frequency bands with cochlea-like group delays. 1114-1123
- Maarten Van Segbroeck, Hugo Van hamme:
Unsupervised learning of time-frequency patches as a noise-robust representation of speech. 1124-1138
- Sabato Marco Siniscalchi, Chin-Hui Lee:
A study on integrating acoustic-phonetic information into lattice rescoring for automatic speech recognition. 1139-1153
Volume 51, Number 12, December 2009
- Aren Jansen, Partha Niyogi:
Point process models for event-based speech recognition. 1155-1168
- Yao Qian, Frank K. Soong:
A Multi-Space Distribution (MSD) and two-stream tone modeling approach to Mandarin speech recognition. 1169-1179
- Jianfeng Chen, Koksoon Phua, Louis Shue, Hanwu Sun:
Performance evaluation of adaptive dual microphone systems. 1180-1193
- Veronique Stouten, Hugo Van hamme:
Automatic voice onset time estimation from reassignment spectra. 1194-1205
- Vassilis Pitsikalis, Petros Maragos:
Analysis and classification of speech signals by generalized fractal dimension features. 1206-1223
- Om Deshmukh, Ashish Verma:
Nucleus-level clustering for word-independent syllable stress classification. 1224-1233
- Klaus-Peter Engelbrecht, Michael Quade, Sebastian Möller:
Analysis of a new simulation approach to dialog system evaluation. 1234-1252
- Youyi Lu, Martin Cooke:
The contribution of changes in F0 and spectral tilt to increased intelligibility of speech produced in noise. 1253-1262
- K. Sreenivasa Rao, B. Yegnanarayana:
Duration modification using glottal closure instants and vowel onset points. 1263-1269