Wang et al., 2018 - Google Patents
Pseudo-determined blind source separation for ad-hoc microphone networksWang et al., 2018
View PDF- Document ID
- 6938251877395262311
- Author
- Wang L
- Cavallaro A
- Publication year
- Publication venue
- IEEE/ACM Transactions on Audio, Speech, and Language Processing
External Links
Snippet
We propose a pseudo-determined blind source separation framework that exploits the information from a large number of microphones in an ad-hoc network to extract and enhance sound sources in a reverberant scenario. After compensating for the time offsets …
- 238000000926 separation method 0 title abstract description 57
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R25/00—Deaf-aid sets providing an auditory perception; Electric tinnitus maskers providing an auditory perception
- H04R25/40—Arrangements for obtaining a desired directivity characteristic
- H04R25/407—Circuits for combining signals of a plurality of transducers
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S3/00—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received
- G01S3/80—Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received using ultrasonic, sonic or infrasonic waves
- G01S3/802—Systems for determining direction or deviation from predetermined direction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R29/00—Monitoring arrangements; Testing arrangements
- H04R29/004—Monitoring arrangements; Testing arrangements for microphones
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US10602267B2 (en) | Sound signal processing apparatus and method for enhancing a sound signal | |
| Zohourian et al. | Binaural speaker localization integrated into an adaptive beamformer for hearing aids | |
| Souden et al. | A multichannel MMSE-based framework for speech source separation and noise reduction | |
| Wang et al. | Over-determined source separation and localization using distributed microphones | |
| Himawan et al. | Clustered blind beamforming from ad-hoc microphone arrays | |
| CN109830245A (en) | A kind of more speaker's speech separating methods and system based on beam forming | |
| Wang et al. | Pseudo-determined blind source separation for ad-hoc microphone networks | |
| Ren et al. | A novel multiple sparse source localization using triangular pyramid microphone array | |
| Sivasankaran et al. | Keyword-based speaker localization: Localizing a target speaker in a multi-speaker environment | |
| Koldovský et al. | Semi-blind noise extraction using partially known position of the target source | |
| Thuene et al. | Maximum-likelihood approach to adaptive multichannel-Wiener postfiltering for wind-noise reduction | |
| Kovalyov et al. | Dsenet: Directional signal extraction network for hearing improvement on edge devices | |
| Mirabilii et al. | Spatial coherence-aware multi-channel wind noise reduction | |
| Zohourian et al. | GSC-based binaural speaker separation preserving spatial cues | |
| Li et al. | Local relative transfer function for sound source localization | |
| Zhang et al. | Directional gain based noise covariance matrix estimation for MVDR beamforming | |
| Huang et al. | A regression approach to speech source localization exploiting deep neural network | |
| Shujau et al. | Separation of speech sources using an acoustic vector sensor | |
| Yang et al. | Interference-controlled maximum noise reduction beamformer based on deep-learned interference manifold | |
| Zohny et al. | Modelling interaural level and phase cues with Student's t-distribution for robust clustering in MESSL | |
| Zhu et al. | Modified complementary joint sparse representations: a novel post-filtering to MVDR beamforming | |
| Taghia et al. | Dual-channel noise reduction based on a mixture of circular-symmetric complex Gaussians on unit hypersphere | |
| Levi et al. | A robust method to extract talker azimuth orientation using a large-aperture microphone array | |
| Pasha et al. | A survey on ad hoc signal processing: Applications, challenges and state-of-the-art techniques | |
| Pang et al. | The SEUEE System for the CHiME-8 MMCSG Challenge |