Hendriks et al., 2007

DFT domain subspace based noise tracking for speech enhancement.

Document ID: 1589136635153368242
Author: Hendriks R; Jensen J; Heusdens R
Publication year: 2007
Publication venue: Interspeech

Snippet

Most DFT domain based speech enhancement methods are dependent on an estimate of the noise power spectral density (PSD). For non-stationary noise sources it is desirable to estimate the noise PSD also in spectral regions where speech is present. In this paper a …

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L21/0232—Processing in the frequency domain
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
- G10L25/09—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being zero crossing rates
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
- G10L25/06—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients
- G10L25/90—Pitch determination of speech signals
- G10L25/78—Detection of presence or absence of voice signals
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G10L2015/0635—Training updating or merging of old and new templates; Mean values; Weighting
- G10L2015/0636—Threshold criteria for the updating

Similar Documents

Publication	Publication Date	Title
Hendriks et al.	2008	Noise tracking using DFT domain subspace decompositions
Taghia et al.	2011	An evaluation of noise power spectral density estimation algorithms in adverse acoustic environments
Cohen	2005	Relaxed statistical model for speech enhancement and a priori SNR estimation
KR100330230B1 (en)	2002-05-09	Noise suppression for low bitrate speech coder
Eaton et al.	2013	Noise-robust reverberation time estimation using spectral decay distributions with reduced computational cost
Suh et al.	2012	Multiple acoustic model-based discriminative likelihood ratio weighting for voice activity detection
Fraser et al.	1994	Multiple window spectrogram and time-frequency distributions
Li et al.	2016	Non-stationary noise power spectral density estimation based on regional statistics
Jaramillo et al.	2019	A study on how pre-whitening influences fundamental frequency estimation
Hendriks et al.	2007	DFT domain subspace based noise tracking for speech enhancement.
Gerkmann et al.	2010	Speech presence probability estimation based on temporal cepstrum smoothing
Batina et al.	2006	Noise power spectrum estimation for speech enhancement using an autoregressive model for speech power spectrum dynamics
Bavkar et al.	2013	PCA based single channel speech enhancement method for highly noisy environment
Wei et al.	2013	A novel prewhitening subspace method for enhancing speech corrupted by colored noise
KR100798056B1 (en)	2008-01-28	Speech Processing Method for Improving Sound Quality in Highly Negative Noise Environments
Nasr et al.	2018	Efficient implementation of adaptive wiener filter for pitch detection from noisy speech signals
Thakare	2010	Voice activity detector and noise trackers for speech recognition system in noisy environment
Eaton et al.	2013	A comparison of non-intrusive SNR estimation algorithms and the use of mapping functions
Farsi	2010	Improvement of minimum tracking in minimum statistics noise estimation method
Tao et al.	2023	Single Channel Speech Presence Probability Estimation based on Hybrid Global-Local Information
Hansen et al.	1989	Use of objective speech quality measures in selecting effective spectral estimation techniques for speech enhancement
Cohen	2005	From volatility modeling of financial time-series to stochastic modeling and enhancement of speech signals
Zavarehei et al.	2006	Inter-frame modeling of DFT trajectories of speech and noise for speech enhancement using Kalman filters
Wu et al.	2019	A Time Domain Estimation Algorithm for Speech Signal Pitch Period
Zavarehei et al.	2005	Speech enhancement using Kalman filters for restoration of short-time DFT trajectories