Datasets for Detection and Localization of Speech Buried in Drone Noise
Ko et al., 2023
- Document ID
- 4425907190844999217
- Author
- Ko J
- Chang J
- Rho D
- Kim T
- Publication year
- 2023
- Publication venue
- INTER-NOISE and NOISE-CON Congress and Conference Proceedings
Snippet
This paper introduces datasets for the detection and localization of speech buried in drone noise, based on a scenario in which a drone with onboard microphones is used to search for and rescue victims in disaster situations. Since the distances between the blades …
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/20—Arrangements for obtaining desired frequency or directional characteristics
- H04R1/32—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
- H04R1/40—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/301—Automatic calibration of stereophonic sound system, e.g. with test microphone
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
Similar Documents
Publication | Title |
---|---|
Cobos et al. | Frequency-sliding generalized cross-correlation: A sub-band time delay estimation approach |
CN106251877B (en) | Voice Sound source direction estimation method and device |
JP5587396B2 (en) | System, method and apparatus for signal separation |
US9093078B2 (en) | Acoustic source separation |
Gunel et al. | Acoustic source separation of convolutive mixtures based on intensity vector statistics |
CN109490822B (en) | Voice DOA estimation method based on ResNet |
CN110610718B (en) | Method and device for extracting expected sound source voice signal |
JP2007513530A (en) | Voice input system |
US11818557B2 (en) | Acoustic processing device including spatial normalization, mask function estimation, and mask processing, and associated acoustic processing method and storage medium |
Buchner et al. | TRINICON-based blind system identification with application to multiple-source localization and separation |
Koldovský et al. | Semi-blind noise extraction using partially known position of the target source |
WO2022256577A1 (en) | A method of speech enhancement and a mobile computing device implementing the method |
Choi et al. | Convolutional neural network-based direction-of-arrival estimation using stereo microphones for drone |
Beit-On et al. | Speaker localization using the direct-path dominance test for arbitrary arrays |
CN115620739A (en) | Speech enhancement method for specified direction, electronic device and storage medium |
Girin et al. | Audio source separation into the wild |
Ko et al. | Datasets for Detection and Localization of Speech Buried in Drone Noise |
CN115862632A (en) | Voice recognition method and device, electronic equipment and storage medium |
Firoozabadi et al. | Combination of nested microphone array and subband processing for multiple simultaneous speaker localization |
Gelderblom et al. | Deep complex convolutional recurrent network for multi-channel speech enhancement and dereverberation |
Mittal et al. | Low Latency Two Stage Beamforming with Distributed Microphone Arrays Using a Planewave Decomposition |
Kawase et al. | Real-time integration of statistical model-based speech enhancement with unsupervised noise PSD estimation using microphone array |
Hammer et al. | FCN approach for dynamically locating multiple speakers |
Pasha et al. | A survey on ad hoc signal processing: Applications, challenges and state-of-the-art techniques |
Milano et al. | Sector-based interference cancellation for robust keyword spotting applications using an informed mpdr beamformer |