Ko et al., 2023 - Google Patents

Datasets for Detection and Localization of Speech Buried in Drone Noise

Ko et al., 2023

Document ID: 4425907190844999217
Author: Ko J; Chang J; Rho D; Kim T
Publication year: 2023
Publication venue: INTER-NOISE and NOISE-CON Congress and Conference Proceedings

External Links

Cited by

Snippet

This paper introduces datasets for detection and localization of speech buried in drone noise based on a scenario that a drone is used to search and rescue victims in disaster situations with microphones mounted in the drone. Since the distances between the blades …

Continue reading at www.ingentaconnect.com (other versions)

230000004807 localization 0 title abstract description 27

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/20—Arrangements for obtaining desired frequency or directional characteristics
- H04R1/32—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
- H04R1/40—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/301—Automatic calibration of stereophonic sound system, e.g. with test microphone
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building

Similar Documents

Publication	Publication Date	Title
Cobos et al.	2020	Frequency-sliding generalized cross-correlation: A sub-band time delay estimation approach
CN106251877B (en)	2019-09-06	Voice Sounnd source direction estimation method and device
JP5587396B2 (en)	2014-09-10	System, method and apparatus for signal separation
US9093078B2 (en)	2015-07-28	Acoustic source separation
Gunel et al.	2008	Acoustic source separation of convolutive mixtures based on intensity vector statistics
CN109490822B (en)	2022-12-20	Voice DOA estimation method based on ResNet
CN110610718B (en)	2021-10-08	Method and device for extracting expected sound source voice signal
JP2007513530A (en)	2007-05-24	Voice input system
US11818557B2 (en)	2023-11-14	Acoustic processing device including spatial normalization, mask function estimation, and mask processing, and associated acoustic processing method and storage medium
Buchner et al.	2007	TRINICON-based blind system identification with application to multiple-source localization and separation
Koldovský et al.	2013	Semi-blind noise extraction using partially known position of the target source
WO2022256577A1 (en)	2022-12-08	A method of speech enhancement and a mobile computing device implementing the method
Choi et al.	2020	Convolutional neural network-based direction-of-arrival estimation using stereo microphones for drone
Beit-On et al.	2018	Speaker localization using the direct-path dominance test for arbitrary arrays
CN115620739A (en)	2023-01-17	Speech enhancement method for specified direction, electronic device and storage medium
Girin et al.	2019	Audio source separation into the wild
Ko et al.	2023	Datasets for Detection and Localization of Speech Buried in Drone Noise
CN115862632A (en)	2023-03-28	Voice recognition method and device, electronic equipment and storage medium
Firoozabadi et al.	2012	Combination of nested microphone array and subband processing for multiple simultaneous speaker localization
Gelderblom et al.	2021	Deep complex convolutional recurrent network for multi-channel speech enhancement and dereverberation
Mittal et al.	2024	Low Latency Two Stage Beamforming with Distributed Microphone Arrays Using a Planewave Decomposition
Kawase et al.	2016	Real-time integration of statistical model-based speech enhancement with unsupervised noise PSD estimation using microphone array
Hammer et al.	2020	FCN approach for dynamically locating multiple speakers
Pasha et al.	2019	A survey on ad hoc signal processing: Applications, challenges and state-of-the-art techniques
Milano et al.	2024	Sector-based interference cancellation for robust keyword spotting applications using an informed mpdr beamformer