Sound source distance estimation in rooms based on statistical properties of binaural signals
Georganti et al., 2013
- Document ID
- 7679757822981946421
- Author
- Georganti E
- May T
- Van De Par S
- Mourjopoulos J
- Publication year
- 2013
- Publication venue
- IEEE transactions on audio, speech, and language processing
Snippet
A novel method for the estimation of the distance of a sound source from binaural speech signals is proposed. The method relies on several statistical features extracted from such signals and their binaural cues. Firstly, the standard deviation of the difference of the …
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding, i.e. using interchannel correlation to reduce redundancies, e.g. joint-stereo, intensity-coding, matrixing
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S3/00—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received
- G01S3/80—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received using ultrasonic, sonic or infrasonic waves
- G01S3/802—Systems for determining direction or deviation from predetermined direction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/301—Automatic calibration of stereophonic sound system, e.g. with test microphone
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/15—Aspects of sound capture and related signal processing for recording or reproduction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| Georganti et al. | Sound source distance estimation in rooms based on statistical properties of binaural signals | |
| May et al. | A probabilistic model for robust localization based on a binaural auditory front-end | |
| Pang et al. | Multitask learning of time-frequency CNN for sound source localization | |
| Mandel et al. | An EM algorithm for localizing multiple sound sources in reverberant environments | |
| Mandel et al. | Model-based expectation-maximization source separation and localization | |
| Epain et al. | Spherical harmonic signal covariance and sound field diffuseness | |
| US11943604B2 (en) | Spatial audio processing | |
| CN109839612A (en) | Sound source direction estimation method based on time-frequency masking and deep neural network | |
| Vesa | Binaural sound source distance learning in rooms | |
| Chen et al. | Reverberant speech separation with probabilistic time–frequency masking for B-format recordings | |
| Chen et al. | A source counting method using acoustic vector sensor based on sparse modeling of DOA histogram | |
| Goli et al. | Deep learning-based speech specific source localization by using binaural and monaural microphone arrays in hearing aids | |
| Delikaris-Manias et al. | 3D localization of multiple audio sources utilizing 2D DOA histograms | |
| Hu et al. | Decoupled multiple speaker direction-of-arrival estimator under reverberant environments | |
| Dadvar et al. | Robust binaural speech separation in adverse conditions based on deep neural network with modified spatial features and training target | |
| Manocha et al. | Dplm: A deep perceptual spatial-audio localization metric | |
| Georganti et al. | Speaker distance detection using a single microphone | |
| Al-Karawi et al. | The effects of distance and reverberation time on speaker recognition performance | |
| Mandel et al. | EM localization and separation using interaural level and phase cues | |
| Tokgoz et al. | Robust three-microphone speech source localization using randomized singular value decomposition | |
| Krause et al. | Data diversity for improving DNN-based localization of concurrent sound events | |
| Aarabi et al. | Robust sound localization using conditional time–frequency histograms | |
| Georganti et al. | Extracting sound-source-distance information from binaural signals | |
| Georganti et al. | Room statistics and direct-to-reverberant ratio estimation from dual-channel signals | |
| Massicotte et al. | LSTM with scattering decomposition-based feature extraction for binaural sound source localization |