Ko et al., 2023 - Google Patents

Datasets for Detection and Localization of Speech Buried in Drone Noise

Document ID
4425907190844999217
Author
Ko J
Chang J
Rho D
Kim T
Publication year
2023
Publication venue
INTER-NOISE and NOISE-CON Congress and Conference Proceedings

Snippet

This paper introduces datasets for the detection and localization of speech buried in drone noise, based on a scenario in which a drone with mounted microphones is used to search for and rescue victims in disaster situations. Since the distances between the blades …
Continue reading at www.ingentaconnect.com

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161 Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166 Microphone arrays; Beamforming
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00 Circuits for transducers, loudspeakers or microphones
    • H04R3/005 Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00 Details of transducers, loudspeakers or microphones
    • H04R1/20 Arrangements for obtaining desired frequency or directional characteristics
    • H04R1/32 Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
    • H04R1/40 Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S7/00 Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30 Control circuits for electronic adaptation of the sound field
    • H04S7/301 Automatic calibration of stereophonic sound system, e.g. with test microphone
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification
    • G10L17/04 Training, enrolment or model building

Similar Documents

Publication Publication Date Title
Cobos et al. Frequency-sliding generalized cross-correlation: A sub-band time delay estimation approach
CN106251877B (en) Voice Sound source direction estimation method and device
JP5587396B2 (en) System, method and apparatus for signal separation
US9093078B2 (en) Acoustic source separation
Gunel et al. Acoustic source separation of convolutive mixtures based on intensity vector statistics
CN109490822B (en) Voice DOA estimation method based on ResNet
CN110610718B (en) Method and device for extracting expected sound source voice signal
JP2007513530A (en) Voice input system
US11818557B2 (en) Acoustic processing device including spatial normalization, mask function estimation, and mask processing, and associated acoustic processing method and storage medium
Buchner et al. TRINICON-based blind system identification with application to multiple-source localization and separation
Koldovský et al. Semi-blind noise extraction using partially known position of the target source
WO2022256577A1 (en) A method of speech enhancement and a mobile computing device implementing the method
Choi et al. Convolutional neural network-based direction-of-arrival estimation using stereo microphones for drone
Beit-On et al. Speaker localization using the direct-path dominance test for arbitrary arrays
CN115620739A (en) Speech enhancement method for specified direction, electronic device and storage medium
Girin et al. Audio source separation into the wild
Ko et al. Datasets for Detection and Localization of Speech Buried in Drone Noise
CN115862632A (en) Voice recognition method and device, electronic equipment and storage medium
Firoozabadi et al. Combination of nested microphone array and subband processing for multiple simultaneous speaker localization
Gelderblom et al. Deep complex convolutional recurrent network for multi-channel speech enhancement and dereverberation
Mittal et al. Low Latency Two Stage Beamforming with Distributed Microphone Arrays Using a Planewave Decomposition
Kawase et al. Real-time integration of statistical model-based speech enhancement with unsupervised noise PSD estimation using microphone array
Hammer et al. FCN approach for dynamically locating multiple speakers
Pasha et al. A survey on ad hoc signal processing: Applications, challenges and state-of-the-art techniques
Milano et al. Sector-based interference cancellation for robust keyword spotting applications using an informed MPDR beamformer