[go: up one dir, main page]

Hu et al., 2017 - Google Patents

Deep 360 pilot: Learning a deep agent for piloting through 360 sports videos

Hu et al., 2017

Document ID
16766917386257626387
Author
Hu H
Lin Y
Liu M
Cheng H
Chang Y
Sun M
Publication year
Publication venue
2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

External Links

Snippet

Watching a 360° sports video requires a viewer to continuously select a viewing angle, either through a sequence of mouse clicks or head movements. To relieve the viewer from this “360 piloting” task, we propose “deep 360 pilot”-a deep learning-based agent for piloting …
Continue reading at ieeexplore.ieee.org (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/36Image preprocessing, i.e. processing the image information without deciding about the identity of the image
    • G06K9/46Extraction of features or characteristics of the image
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6217Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • G06K9/6201Matching; Proximity measures
    • G06K9/6202Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20112Image segmentation details
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/00221Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/00624Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30781Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F17/30784Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
    • G06F17/30799Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30244Information retrieval; Database structures therefor; File system structures therefor in image databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer

Similar Documents

Publication Publication Date Title
Hu et al. Deep 360 pilot: Learning a deep agent for piloting through 360 sports videos
Liu et al. Pku-mmd: A large scale benchmark for continuous multi-modal human action understanding
Weinzaepfel et al. Dope: Distillation of part experts for whole-body 3d pose estimation in the wild
Hu et al. Deep 360 pilot: Learning a deep agent for piloting through 360deg sports videos
Li et al. Tracking in low frame rate video: A cascade particle filter with discriminative observers of different life spans
Molchanov et al. Online detection and classification of dynamic hand gestures with recurrent 3d convolutional neural network
Höferlin et al. Inter-active learning of ad-hoc classifiers for video visual analytics
Zhou et al. Cascaded interactional targeting network for egocentric video analysis
Chan et al. Player identification in hockey broadcast videos
Gupta et al. Online detection and classification of dynamic hand gestures with recurrent 3d convolutional neural networks
Mesbahi et al. Hand gesture recognition based on various deep learning YOLO models
Khaire et al. RGB+ D and deep learning-based real-time detection of suspicious event in Bank-ATMs
Haq et al. Improving badminton player detection using YOLOv3 with different training heuristic
Baisware et al. Review on recent advances in human action recognition in video data
Xia et al. Kiwifruit counting using KiwiDetector and KiwiTracker
Wan et al. Instance-level moving object segmentation from a single image with events
Makris et al. Robust 3d human pose estimation guided by filtered subsets of body keypoints
Meshgi et al. The state-of-the-art in handling occlusions for visual object tracking
Zhang et al. An inpainting SLAM approach for detecting and recovering regions with dynamic objects
Suzuki et al. Runner re-identification from single-view running video in the open-world setting
Fujii Computer vision for sports analytics
Bazin et al. Actionsnapping: Motion-based video synchronization
Tian et al. Object segmentation and key-pose based summarization for motion video
Chaudhary et al. Learning to segment generic handheld objects using class-agnostic deep comparison and segmentation network
Wang et al. A short survey on deep learning for skeleton-based action recognition