Sexton et al., 2021 - Google Patents
Automatic CNN-based enhancement of 360° video experience with multisensorial effectsSexton et al., 2021
View PDF- Document ID
- 4543776444978540675
- Author
- Sexton J
- Simiscuka A
- Mcguinness K
- Muntean G
- Publication year
- Publication venue
- IEEE Access
External Links
Snippet
High-resolution audio-visual virtual reality (VR) technologies currently offer satisfying experiences for both sight and hearing senses in the world of multimedia. However, the delivery of truly immersive experiences requires the incorporation of other senses such as …
- 230000000694 effects 0 title abstract description 60
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/017—Gesture based interaction, e.g. based on a set of recognized hand gestures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30017—Multimedia data retrieval; Retrieval of more than one type of audiovisual media
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television, VOD [Video On Demand]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television, VOD [Video On Demand]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11158102B2 (en) | Method and apparatus for processing information | |
US10120565B2 (en) | Methods and devices for presenting interactive media items | |
EP3889912B1 (en) | Method and apparatus for generating video | |
Sexton et al. | Automatic CNN-based enhancement of 360° video experience with multisensorial effects | |
US20130076788A1 (en) | Apparatus, method and software products for dynamic content management | |
US20160012136A1 (en) | Simultaneous Local and Cloud Searching System and Method | |
JP2013527947A5 (en) | ||
CN113299312B (en) | Image generation method, device, equipment and storage medium | |
US9129602B1 (en) | Mimicking user speech patterns | |
CN113923462A (en) | Video generation method, live broadcast processing method, video generation device, live broadcast processing device and readable medium | |
CN117241063B (en) | Live broadcast interaction method and system based on virtual reality technology | |
CN113316078B (en) | Data processing method and device, computer equipment and storage medium | |
US20180275861A1 (en) | Apparatus and Associated Methods | |
KR20120099814A (en) | Augmented reality contents service system and apparatus and method | |
JP6367748B2 (en) | Recognition device, video content presentation system | |
JP2021101252A (en) | Information processing method, information processing apparatus, and program | |
CN115359409B (en) | Video splitting method and device, computer equipment and storage medium | |
CN116755590A (en) | Virtual image processing method, device, enhancement realization equipment and storage medium | |
US11948599B2 (en) | Audio event detection with window-based prediction | |
KR20160069663A (en) | System And Method For Producing Education Cotent, And Service Server, Manager Apparatus And Client Apparatus using therefor | |
CN115104296A (en) | Audio message interface on a message platform | |
CN113762056A (en) | Singing video recognition method, device, equipment and storage medium | |
CN117593473B (en) | Method, apparatus and storage medium for generating motion image and video | |
CN115237248B (en) | Virtual object display method, device, equipment, storage medium and program product | |
JP6619072B2 (en) | SOUND SYNTHESIS DEVICE, SOUND SYNTHESIS METHOD, AND PROGRAM THEREOF |