Hu et al., 2017 - Google Patents

Deep 360 pilot: Learning a deep agent for piloting through 360 sports videos

Hu et al., 2017

Document ID: 16766917386257626387
Author: Hu H; Lin Y; Liu M; Cheng H; Chang Y; Sun M
Publication year: 2017
Publication venue: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

External Links

Cited by

Snippet

Watching a 360° sports video requires a viewer to continuously select a viewing angle, either through a sequence of mouse clicks or head movements. To relieve the viewer from this “360 piloting” task, we propose “deep 360 pilot”-a deep learning-based agent for piloting …

Continue reading at ieeexplore.ieee.org (other versions)

238000000034 method 0 abstract description 61

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer

Similar Documents

Publication	Publication Date	Title
Hu et al.	2017	Deep 360 pilot: Learning a deep agent for piloting through 360 sports videos
Liu et al.	2017	Pku-mmd: A large scale benchmark for continuous multi-modal human action understanding
Weinzaepfel et al.	2020	Dope: Distillation of part experts for whole-body 3d pose estimation in the wild
Hu et al.	2017	Deep 360 pilot: Learning a deep agent for piloting through 360deg sports videos
Li et al.	2008	Tracking in low frame rate video: A cascade particle filter with discriminative observers of different life spans
Molchanov et al.	2016	Online detection and classification of dynamic hand gestures with recurrent 3d convolutional neural network
Höferlin et al.	2012	Inter-active learning of ad-hoc classifiers for video visual analytics
Zhou et al.	2016	Cascaded interactional targeting network for egocentric video analysis
Chan et al.	2021	Player identification in hockey broadcast videos
Gupta et al.	2016	Online detection and classification of dynamic hand gestures with recurrent 3d convolutional neural networks
Mesbahi et al.	2023	Hand gesture recognition based on various deep learning YOLO models
Khaire et al.	2021	RGB+ D and deep learning-based real-time detection of suspicious event in Bank-ATMs
Haq et al.	2023	Improving badminton player detection using YOLOv3 with different training heuristic
Baisware et al.	2019	Review on recent advances in human action recognition in video data
Xia et al.	2023	Kiwifruit counting using KiwiDetector and KiwiTracker
Wan et al.	2025	Instance-level moving object segmentation from a single image with events
Makris et al.	2019	Robust 3d human pose estimation guided by filtered subsets of body keypoints
Meshgi et al.	2015	The state-of-the-art in handling occlusions for visual object tracking
Zhang et al.	2025	An inpainting SLAM approach for detecting and recovering regions with dynamic objects
Suzuki et al.	2024	Runner re-identification from single-view running video in the open-world setting
Fujii	2025	Computer vision for sports analytics
Bazin et al.	2016	Actionsnapping: Motion-based video synchronization
Tian et al.	2014	Object segmentation and key-pose based summarization for motion video
Chaudhary et al.	2018	Learning to segment generic handheld objects using class-agnostic deep comparison and segmentation network
Wang et al.	2021	A short survey on deep learning for skeleton-based action recognition