Georganti et al., 2013 - Google Patents

Sound source distance estimation in rooms based on statistical properties of binaural signals

Georganti et al., 2013

Document ID: 7679757822981946421
Author: Georganti E; May T; Van De Par S; Mourjopoulos J
Publication year: 2013
Publication venue: IEEE transactions on audio, speech, and language processing

External Links

Cited by

Snippet

A novel method for the estimation of the distance of a sound source from binaural speech signals is proposed. The method relies on several statistical features extracted from such signals and their binaural cues. Firstly, the standard deviation of the difference of the …

Continue reading at www.researchgate.net (PDF) (other versions)

238000001514 detection method 0 abstract description 85

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding, i.e. using interchannel correlation to reduce redundancies, e.g. joint-stereo, intensity-coding, matrixing
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S3/00—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received
- G01S3/80—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received using ultrasonic, sonic or infrasonic waves
- G01S3/802—Systems for determining direction or deviation from predetermined direction
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/301—Automatic calibration of stereophonic sound system, e.g. with test microphone
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/15—Aspects of sound capture and related signal processing for recording or reproduction
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00

Similar Documents

Publication	Publication Date	Title
Georganti et al.	2013	Sound source distance estimation in rooms based on statistical properties of binaural signals
May et al.	2010	A probabilistic model for robust localization based on a binaural auditory front-end
Pang et al.	2019	Multitask learning of time-frequency CNN for sound source localization
Mandel et al.	2006	An EM algorithm for localizing multiple sound sources in reverberant environments
Mandel et al.	2009	Model-based expectation-maximization source separation and localization
Epain et al.	2016	Spherical harmonic signal covariance and sound field diffuseness
US11943604B2 (en)	2024-03-26	Spatial audio processing
CN109839612A (en)	2019-06-04	Sounnd source direction estimation method based on time-frequency masking and deep neural network
Vesa	2009	Binaural sound source distance learning in rooms
Chen et al.	2015	Reverberant speech separation with probabilistic time–frequency masking for B-format recordings
Chen et al.	2018	A source counting method using acoustic vector sensor based on sparse modeling of DOA histogram
Goli et al.	2023	Deep learning-based speech specific source localization by using binaural and monaural microphone arrays in hearing aids
Delikaris-Manias et al.	2016	3D localization of multiple audio sources utilizing 2D DOA histograms
Hu et al.	2022	Decoupled multiple speaker direction-of-arrival estimator under reverberant environments
Dadvar et al.	2019	Robust binaural speech separation in adverse conditions based on deep neural network with modified spatial features and training target
Manocha et al.	2021	Dplm: A deep perceptual spatial-audio localization metric
Georganti et al.	2011	Speaker distance detection using a single microphone
Al-Karawi et al.	2024	The effects of distance and reverberation time on speaker recognition performance
Mandel et al.	2007	EM localization and separation using interaural level and phase cues
Tokgoz et al.	2021	Robust three-microphone speech source localization using randomized singular value decomposition
Krause et al.	2021	Data diversity for improving DNN-based localization of concurrent sound events
Aarabi et al.	2003	Robust sound localization using conditional time–frequency histograms
Georganti et al.	2013	Extracting sound-source-distance information from binaural signals
Georganti et al.	2014	Room statistics and direct-to-reverberant ratio estimation from dual-channel signals
Massicotte et al.	2022	LSTM with scattering decomposition-based feature extraction for binaural sound source localization