Georganti et al., 2013 - Google Patents

Sound source distance estimation in rooms based on statistical properties of binaural signals

Document ID
7679757822981946421
Author
Georganti E
May T
Van De Par S
Mourjopoulos J
Publication year
2013
Publication venue
IEEE Transactions on Audio, Speech, and Language Processing

Snippet

A novel method for the estimation of the distance of a sound source from binaural speech signals is proposed. The method relies on several statistical features extracted from such signals and their binaural cues. Firstly, the standard deviation of the difference of the …

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161 Number of inputs available containing the signal or the noise to be suppressed
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding, i.e. using interchannel correlation to reduce redundancies, e.g. joint-stereo, intensity-coding, matrixing
    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01S RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S3/00 Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received
    • G01S3/80 Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received using ultrasonic, sonic or infrasonic waves
    • G01S3/802 Systems for determining direction or deviation from predetermined direction
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S7/00 Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30 Control circuits for electronic adaptation of the sound field
    • H04S7/301 Automatic calibration of stereophonic sound system, e.g. with test microphone
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/15 Aspects of sound capture and related signal processing for recording or reproduction
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00 Circuits for transducers, loudspeakers or microphones
    • H04R3/005 Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00

Similar Documents

Publication | Title
Georganti et al. | Sound source distance estimation in rooms based on statistical properties of binaural signals
May et al. | A probabilistic model for robust localization based on a binaural auditory front-end
Pang et al. | Multitask learning of time-frequency CNN for sound source localization
Mandel et al. | An EM algorithm for localizing multiple sound sources in reverberant environments
Mandel et al. | Model-based expectation-maximization source separation and localization
Epain et al. | Spherical harmonic signal covariance and sound field diffuseness
US11943604B2 (en) | Spatial audio processing
CN109839612A (en) | Sound source direction estimation method based on time-frequency masking and deep neural network
Vesa | Binaural sound source distance learning in rooms
Chen et al. | Reverberant speech separation with probabilistic time–frequency masking for B-format recordings
Chen et al. | A source counting method using acoustic vector sensor based on sparse modeling of DOA histogram
Goli et al. | Deep learning-based speech specific source localization by using binaural and monaural microphone arrays in hearing aids
Delikaris-Manias et al. | 3D localization of multiple audio sources utilizing 2D DOA histograms
Hu et al. | Decoupled multiple speaker direction-of-arrival estimator under reverberant environments
Dadvar et al. | Robust binaural speech separation in adverse conditions based on deep neural network with modified spatial features and training target
Manocha et al. | DPLM: A deep perceptual spatial-audio localization metric
Georganti et al. | Speaker distance detection using a single microphone
Al-Karawi et al. | The effects of distance and reverberation time on speaker recognition performance
Mandel et al. | EM localization and separation using interaural level and phase cues
Tokgoz et al. | Robust three-microphone speech source localization using randomized singular value decomposition
Krause et al. | Data diversity for improving DNN-based localization of concurrent sound events
Aarabi et al. | Robust sound localization using conditional time–frequency histograms
Georganti et al. | Extracting sound-source-distance information from binaural signals
Georganti et al. | Room statistics and direct-to-reverberant ratio estimation from dual-channel signals
Massicotte et al. | LSTM with scattering decomposition-based feature extraction for binaural sound source localization