[go: up one dir, main page]

Yoshida et al., 2020 - Google Patents

Sound quality improvement of extracted sound from video with rolling-shuttered camera

Yoshida et al., 2020

Document ID
14832864881564815771
Author
Yoshida A
Iwai K
Nishiur T
Publication year
Publication venue
2020 IEEE 9th Global Conference on Consumer Electronics (GCCE)

External Links

Snippet

Recent researches have proposed to photograph surface vibration of object with a rolling- shuttered camera to extract sound from video. These methods measure the surface of an object that vibrates according to target sound, which are more robust to noise than air …
Continue reading at ieeexplore.ieee.org (other versions)

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/222Studio circuitry; Studio devices; Studio equipment; Cameras comprising an electronic image sensor, e.g. digital cameras, video cameras, TV cameras, video cameras, camcorders, webcams, camera modules for embedding in other devices, e.g. mobile phones, computers or vehicles
    • H04N5/225Television cameras; Cameras comprising an electronic image sensor, e.g. digital cameras, video cameras, video cameras, camcorders, webcams, camera modules for embedding in other devices, e.g. mobile phones, computers or vehicles
    • H04N5/232Devices for controlling television cameras, e.g. remote control; Control of cameras comprising an electronic image sensor, e.g. digital cameras, video cameras, TV cameras, video cameras, camcorders, webcams, camera modules for embedding in, e.g. mobile phones, computers or vehicles
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L21/10Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
    • G10L2021/105Synthesis of the lips movements from speech, e.g. for talking heads

Similar Documents

Publication Publication Date Title
US10129658B2 (en) Method and apparatus for recovering audio signals from images
Ephrat et al. Looking to listen at the cocktail party: A speaker-independent audio-visual model for speech separation
Kaneko et al. Cyclegan-vc2: Improved cyclegan-based non-parallel voice conversion
Slaney et al. Facesync: A linear operator for measuring synchronization of video facial images and audio tracks
Tan et al. Audio-visual speech separation and dereverberation with a two-stage multimodal network
US9595259B2 (en) Sound source-separating device and sound source-separating method
Sun et al. UltraSE: single-channel speech enhancement using ultrasound
Almajai et al. Visually derived wiener filters for speech enhancement
DE60319796T2 (en) Noise reduction and audiovisual voice activity detection
JP2008263498A (en) Wind noise reducing device, sound signal recorder and imaging apparatus
CN118486318B (en) A method, medium and system for eliminating noise in outdoor live broadcast environment
US9165182B2 (en) Method and apparatus for using face detection information to improve speaker segmentation
JP6610725B2 (en) Sound processing apparatus and sound processing program
Yoshida et al. Sound quality improvement of extracted sound from video with rolling-shuttered camera
Peng et al. Bandwidth extension for speech acquired by laser Doppler vibrometer with an auxiliary microphone
JP7515121B2 (en) Speech activity detection device, speech activity detection method, and speech activity detection program
Yoshida et al. Interpolation of acoustic signals in sound capture with rolling-shuttered visual camera
JP5782402B2 (en) Voice quality objective evaluation apparatus and method
Yoshizawa et al. Speech extraction with RGB-intensity gradient on rolling-shutter video
Terano et al. Sound capture from rolling-shuttered visual camera based on edge detection
CN111933174A (en) Voice processing method, device, equipment and system
Nakano et al. Speech Quality Improvement Utilizing Out-of-Focus Areas in Rolling-Shutter Video on Speech Extraction
Anderson et al. Robust tri-modal automatic speech recognition for consumer applications
Nakano et al. Singular spectral analysis-based interpolation for missing segments of speech signals extracted from videos captured by dual rolling-shutter cameras
Shindo et al. Noise-reducing sound capture based on exposure-time of still camera