Sound quality improvement of extracted sound from video with rolling-shuttered camera (Yoshida et al., 2020)
- Document ID
- 14832864881564815771
- Author
- Yoshida A
- Iwai K
- Nishiura T
- Publication year
- 2020
- Publication venue
- 2020 IEEE 9th Global Conference on Consumer Electronics (GCCE)
Snippet
Recent research has proposed photographing the surface vibration of an object with a rolling-shuttered camera to extract sound from video. These methods measure the surface of an object that vibrates according to the target sound, which are more robust to noise than air …
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment; Cameras comprising an electronic image sensor, e.g. digital cameras, video cameras, TV cameras, video cameras, camcorders, webcams, camera modules for embedding in other devices, e.g. mobile phones, computers or vehicles
- H04N5/225—Television cameras; Cameras comprising an electronic image sensor, e.g. digital cameras, video cameras, video cameras, camcorders, webcams, camera modules for embedding in other devices, e.g. mobile phones, computers or vehicles
- H04N5/232—Devices for controlling television cameras, e.g. remote control; Control of cameras comprising an electronic image sensor, e.g. digital cameras, video cameras, TV cameras, video cameras, camcorders, webcams, camera modules for embedding in, e.g. mobile phones, computers or vehicles
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
- G10L2021/105—Synthesis of the lips movements from speech, e.g. for talking heads
Similar Documents
Publication | Title |
---|---|
US10129658B2 (en) | Method and apparatus for recovering audio signals from images |
Ephrat et al. | Looking to listen at the cocktail party: A speaker-independent audio-visual model for speech separation |
Kaneko et al. | CycleGAN-VC2: Improved CycleGAN-based non-parallel voice conversion |
Slaney et al. | FaceSync: A linear operator for measuring synchronization of video facial images and audio tracks |
Tan et al. | Audio-visual speech separation and dereverberation with a two-stage multimodal network |
US9595259B2 (en) | Sound source-separating device and sound source-separating method |
Sun et al. | UltraSE: single-channel speech enhancement using ultrasound |
Almajai et al. | Visually derived Wiener filters for speech enhancement |
DE60319796T2 (en) | Noise reduction and audiovisual voice activity detection |
JP2008263498A (en) | Wind noise reducing device, sound signal recorder and imaging apparatus |
CN118486318B (en) | A method, medium and system for eliminating noise in outdoor live broadcast environment |
US9165182B2 (en) | Method and apparatus for using face detection information to improve speaker segmentation |
JP6610725B2 (en) | Sound processing apparatus and sound processing program |
Yoshida et al. | Sound quality improvement of extracted sound from video with rolling-shuttered camera |
Peng et al. | Bandwidth extension for speech acquired by laser Doppler vibrometer with an auxiliary microphone |
JP7515121B2 (en) | Speech activity detection device, speech activity detection method, and speech activity detection program |
Yoshida et al. | Interpolation of acoustic signals in sound capture with rolling-shuttered visual camera |
JP5782402B2 (en) | Voice quality objective evaluation apparatus and method |
Yoshizawa et al. | Speech extraction with RGB-intensity gradient on rolling-shutter video |
Terano et al. | Sound capture from rolling-shuttered visual camera based on edge detection |
CN111933174A (en) | Voice processing method, device, equipment and system |
Nakano et al. | Speech Quality Improvement Utilizing Out-of-Focus Areas in Rolling-Shutter Video on Speech Extraction |
Anderson et al. | Robust tri-modal automatic speech recognition for consumer applications |
Nakano et al. | Singular spectral analysis-based interpolation for missing segments of speech signals extracted from videos captured by dual rolling-shutter cameras |
Shindo et al. | Noise-reducing sound capture based on exposure-time of still camera |