Thèse Année : 2013

Informed approach applied to sound and music analysis Approche informée pour l'analyse du son et de la musique

(1)

1 (Domaine Universitaire 351, cours de la Libération 33405 Talence Cedex - France) 3102

UB - Université de Bordeaux (35, place Pey Berland - 33076 Bordeaux - France) 259761
École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB) (France) 300366
CNRS - Centre National de la Recherche Scientifique : UMR5800 / URA1304 (France) 441569

"> LaBRI - Laboratoire Bordelais de Recherche en Informatique

Dominique Fourer

Fonction : Auteur
PersonId : 915960

Laboratoire Bordelais de Recherche en Informatique

Résumé

In the ﬁeld of audio signal processing, analysis is an essential step which allows interactions with existing signals. In fact, the quality of transformed or synthesized audio signals depends on the accuracy over the estimated model parameters. However, theoretical limits exist and show that the best accuracy which can be reached by a classic estimator can be insufﬁcient for the most demanding applications (e.g. active listening of music). The work which is developed in this thesis revisits well known audio analysis problems like spectral analysis, automatic transcription of music and audio source separation using the novel "informed" approach. This approach takes advantage of a speciﬁc conﬁguration where the parameters of the elementary signals which compose a mixture are known before the mixing process. Using the tools which are proposed in this thesis, the minimal side information is computed and transmitted with the mixture signal. This allows any kind of transformation of the mixture signal with a constraint over the resulting quality. When the compatibility with existing audio formats is required, the side information is embedded directly into the analyzed audio signal using a watermarking technique. This work describes several theoretical and practical aspects of audio signal processing. We show that a classic estimator combined with the sufﬁcient side information can obtain better performance than classic approaches (classic estimation or pure coding).

En traitement du signal audio, l'analyse est une étape essentielle permettant de comprendre et d'interagir avec les signaux existants. En effet, la qualité des signaux obtenus par transformation ou par synthèse des paramètres estimés dépend de la précision des estimateurs utilisés. Cependant, des limitations théoriques existent et démontrent que la qualité maximale pouvant être atteinte avec une approche classique peut s'avérer insufﬁsante dans les applications les plus exigeantes (e.g. écoute active de la musique). Le travail présenté dans cette thèse revisite certains problèmes d'analyse usuels tels que l'analyse spectrale, la transcription automatique et la séparation de sources en utilisant une approche dite "informée". Cette nouvelle approche exploite la conﬁguration des studios de musique actuels qui maîtrisent la chaîne de traitement avant l'étape de création du mélange. Dans les solutions proposées, de l'information complémentaire minimale calculée est transmise en même temps que le signal de mélange aﬁn de permettre certaines transformations sur celui-ci tout en garantissant le niveau de qualité. Lorsqu'une compatibilité avec les formats audio existants est nécessaire, cette information est cachée à l'intérieur du mélange lui-même de manière inaudible grâce au tatouage audionumérique. Ce travail de thèse présente de nombreux aspects théoriques et pratiques dans lesquels nous montrons que la combinaison d'un estimateur avec de l'information complémentaire permet d'améliorer les performances des approches usuelles telles que l'estimation non informée ou le codage pur.

Mots clés

informed spectral analysis sinusoidal modeling audio coding source separation automatic transcription

estimation codage audio séparation de sources transcription automatique analyse spectrale informée modèle sinusoïdal

Domaines

Traitement du signal et de l'image [eess.SP] Traitement du signal et de l'image [eess.SP] Théorie de l'information et codage [math.IT] Théorie de l'information [cs.IT] Acoustique [physics.class-ph] Acoustique [physics.class-ph]

Fichier principal

these.pdf (1.58 Mo)

Résumé

Mots clés

Domaines

Dates et versions

Citer

Exporter

Collections

Partager