McAulay et al., 2003 - Google Patents

Speech enhancement using a soft-decision noise suppression filter

Document ID: 9014905591625698995
Author: McAulay R; Malpass M
Publication year: 2003
Publication venue: IEEE Transactions on Acoustics, Speech, and Signal Processing

Snippet

One way of enhancing speech in an additive acoustic noise environment is to perform a spectral decomposition of a frame of noisy speech and to attenuate a particular spectral line depending on how much the measured speech plus noise power exceeds an estimate of the …
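The per-line attenuation idea in the snippet can be illustrated with a minimal sketch. This is not the soft-decision filter of the paper, whose details are not given in the snippet; it is a generic power-subtraction gain with a spectral floor, chosen only as an assumption for illustration. The function names (`dft`, `idft`, `suppression_gains`, `enhance_frame`) and the `floor` parameter are hypothetical, and a plain DFT is used so the example needs only the standard library.

```python
import cmath
import math


def dft(frame):
    """Naive discrete Fourier transform of one frame (O(n^2), illustration only)."""
    n = len(frame)
    return [
        sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def idft(spectrum):
    """Inverse of dft(); returns the real part of each time-domain sample."""
    n = len(spectrum)
    return [
        (sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n).real
        for t in range(n)
    ]


def suppression_gains(noisy_power, noise_power, floor=0.1):
    """Per-line gain from measured power versus a noise power estimate.

    Assumed rule (power subtraction, not the paper's filter):
    gain^2 = max(1 - noise / noisy, floor^2).
    Lines whose power barely exceeds the noise estimate are attenuated
    down to `floor`; lines well above it pass almost unchanged.
    """
    gains = []
    for p, n in zip(noisy_power, noise_power):
        ratio = n / p if p > 0 else 1.0
        gains.append(math.sqrt(max(1.0 - ratio, floor ** 2)))
    return gains


def enhance_frame(frame, noise_power, floor=0.1):
    """Decompose a frame, attenuate each spectral line, and resynthesize."""
    spectrum = dft(frame)
    noisy_power = [abs(x) ** 2 for x in spectrum]
    gains = suppression_gains(noisy_power, noise_power, floor)
    return idft([g * x for g, x in zip(gains, spectrum)])
```

For example, `suppression_gains([4.0, 1.0], [1.0, 1.0])` leaves the first line largely intact and pushes the second down to the floor. In practice the noise power estimate would have to come from a separate procedure, such as averaging over frames judged to contain no speech.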

Classifications

- G—PHYSICS
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
      - G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
        - G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
          - G10L21/0208—Noise filtering
            - G10L21/0216—Noise filtering characterised by the method used for estimating noise
              - G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
                - G10L2021/02166—Microphone arrays; Beamforming
              - G10L21/0232—Processing in the frequency domain
      - G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
        - G10L25/78—Detection of presence or absence of voice signals
        - G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters
          - G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
        - G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
        - G10L25/90—Pitch determination of speech signals
      - G10L15/00—Speech recognition
        - G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
        - G10L15/08—Speech classification or search
      - G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
        - G10L19/04—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using predictive techniques
      - G10L17/00—Speaker identification or verification

Similar Documents

Publication	Publication Date	Title
McAulay et al.	1980	Speech enhancement using a soft-decision noise suppression filter
Lim et al.	2005	Enhancement and bandwidth compression of noisy speech
US7313518B2 (en)	2007-12-25	Noise reduction method and device using two pass filtering
Cohen	2003	Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging
Yegnanarayana et al.	2002	Enhancement of reverberant speech using LP residual signal
EP1891624B1 (en)	2011-05-04	Multi-sensory speech enhancement using a speech-state model
US7117148B2 (en)	2006-10-03	Method of noise reduction using correction vectors based on dynamic aspects of speech and noise normalization
Cohen et al.	2008	Spectral enhancement methods
Soon et al.	2003	Speech enhancement using 2-D Fourier transform
Cohen	2005	Speech enhancement using super-Gaussian speech models and noncausal a priori SNR estimation
Yu et al.	2019	A deep neural network based Kalman filter for time domain speech enhancement
Krishnamoorthy et al.	2009	Reverberant speech enhancement by temporal and spectral processing
Roy et al.	2021	DeepLPC: A deep learning approach to augmented Kalman filter-based single-channel speech enhancement
O'Shaughnessy	2002	Enhancing speech degraded by additive noise or interfering speakers
WO2009043066A1 (en)	2009-04-09	Method and device for low-latency auditory model-based single-channel speech enhancement
Wisdom et al.	2015	Enhancement and recognition of reverberant and noisy speech by extending its coherence
Taşmaz et al.	2008	Speech enhancement based on undecimated wavelet packet-perceptual filterbanks and MMSE–STSA estimation in various noise environments
Kim et al.	2011	Mask classification for missing-feature reconstruction for robust speech recognition in unknown background noise
Krishnamoorthy et al.	2009	Temporal and spectral processing methods for processing of degraded speech: a review
Shao et al.	2005	A versatile speech enhancement system based on perceptual wavelet denoising
Nidhyananthan et al.	2014	A review on speech enhancement algorithms and why to combine with environment classification
Saleem et al.	2020	Machine learning approach for improving the intelligibility of noisy speech
Ju et al.	2006	A perceptually constrained GSVD-based approach for enhancing speech corrupted by colored noise
Selvi et al.	2017	A New Hybridized Speech Enhancement Technique for Stationary and Non-Stationary Noisy Environments
Kingsbury et al.	1997	Improving ASR performance for reverberant speech