Deep 360 pilot: Learning a deep agent for piloting through 360 sports videos
Hu et al., 2017
- Document ID
- 16766917386257626387
- Author
- Hu H
- Lin Y
- Liu M
- Cheng H
- Chang Y
- Sun M
- Publication year
- 2017
- Publication venue
- 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Snippet
Watching a 360° sports video requires a viewer to continuously select a viewing angle, either through a sequence of mouse clicks or head movements. To relieve the viewer from this “360 piloting” task, we propose “deep 360 pilot”, a deep learning-based agent for piloting …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| Hu et al. | Deep 360 pilot: Learning a deep agent for piloting through 360 sports videos | |
| Liu et al. | Pku-mmd: A large scale benchmark for continuous multi-modal human action understanding | |
| Weinzaepfel et al. | Dope: Distillation of part experts for whole-body 3d pose estimation in the wild | |
| Hu et al. | Deep 360 pilot: Learning a deep agent for piloting through 360° sports videos | |
| Li et al. | Tracking in low frame rate video: A cascade particle filter with discriminative observers of different life spans | |
| Molchanov et al. | Online detection and classification of dynamic hand gestures with recurrent 3d convolutional neural network | |
| Höferlin et al. | Inter-active learning of ad-hoc classifiers for video visual analytics | |
| Zhou et al. | Cascaded interactional targeting network for egocentric video analysis | |
| Chan et al. | Player identification in hockey broadcast videos | |
| Gupta et al. | Online detection and classification of dynamic hand gestures with recurrent 3d convolutional neural networks | |
| Mesbahi et al. | Hand gesture recognition based on various deep learning YOLO models | |
| Khaire et al. | RGB+D and deep learning-based real-time detection of suspicious event in Bank-ATMs | |
| Haq et al. | Improving badminton player detection using YOLOv3 with different training heuristic | |
| Baisware et al. | Review on recent advances in human action recognition in video data | |
| Xia et al. | Kiwifruit counting using KiwiDetector and KiwiTracker | |
| Wan et al. | Instance-level moving object segmentation from a single image with events | |
| Makris et al. | Robust 3d human pose estimation guided by filtered subsets of body keypoints | |
| Meshgi et al. | The state-of-the-art in handling occlusions for visual object tracking | |
| Zhang et al. | An inpainting SLAM approach for detecting and recovering regions with dynamic objects | |
| Suzuki et al. | Runner re-identification from single-view running video in the open-world setting | |
| Fujii | Computer vision for sports analytics | |
| Bazin et al. | Actionsnapping: Motion-based video synchronization | |
| Tian et al. | Object segmentation and key-pose based summarization for motion video | |
| Chaudhary et al. | Learning to segment generic handheld objects using class-agnostic deep comparison and segmentation network | |
| Wang et al. | A short survey on deep learning for skeleton-based action recognition |