This work uses Mel-scale delta cepstral coefficients, together with wavelet and wavelet packet transforms, to feed a neural-network-based automatic speaker identification system. Different alternatives are tested for...
The detection of changes in the parameter values of a nonlinear dynamic system is a branch of study with multiple applications. In this paper, we explore a variant of an automatic detector and clustering of slight parameter variations in...
This project involved the design and development of a relational SQL-based database to generate an intonational model for an Argentine Spanish text-to-speech system. The first stage in the population of the database involved the massive...
This work evaluates the efficiency of different word classes (parts of speech) and of normalized vs. non-normalized counts of syllable and word occurrences in predicting non-orthographic breaks in an Argentine Spanish database designed for the...
We evaluate here the application of two intonational models (quantitative and phonetic) to the analysis of an Argentine Spanish database of 741 broad-focus declarative sentences. The analytic model is the superpositional model proposed by...
The goal of this project was the design and realisation of a database to be used in an automatic speech recognition system for a fixed telephone network. One thousand speakers, native to five Argentine dialectal regions, were recorded....
Synthesis by concatenation of natural speech improves perceptual results when phonemes and syllables are segmented at places where spectral variations are small [Klatt, D., 1987. Review of text-to-speech conversion for English. J. Acoust....
Wavelet packets (WP) are a recently introduced extension of the wavelet transform (WT). Although they have been widely applied to compression, filtering, and noise-suppression problems, little has been done on the problem...
The aim of this work is the design and application of a rapid speech intelligibility test for noisy environments. The test is designed for children aged 6 to 12 who attend schools or institutions that do not have...
This work presents two new approaches for parameter estimation of the superpositional intonation model for German. These methods introduce linguistic assumptions, which make it possible to initialize a previous standard method. Also, eliminates...
In this paper we model the segmental duration of Spanish spoken in Buenos Aires, considering its application in a text-to-speech system. The work was performed on two hand-labeled databases. We use artificial neural networks as predictor,...
This paper introduces the Aromo text-to-speech system for Argentine Spanish, which was designed for telephony applications and is based on unit selection and concatenation. The system operates as a client-server engine that supports MRCP, SIP...
This work uses Mel-scale delta cepstral coefficients to feed a neural-network-based automatic speaker identification system. Different alternatives are tested for the...