Boosting image-based localization via randomly geometric data augmentation (Wan et al., 2020)
- Document ID
- 4678195604553990225
- Author
- Wan Y
- Gao W
- Han S
- Wu Y
- Publication year
- 2020
- Publication venue
- 2020 IEEE International Conference on Image Processing (ICIP)
Snippet
Visual localization is a fundamental problem in computer vision and robotics. Recently, deep learning has been shown to be effective for robust monocular localization. Most deep learning-based methods use a convolutional neural network (CNN) to regress global 6 degree-of …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T5/00—Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Poggi et al. | On the synergies between machine learning and binocular stereo for depth estimation from images: A survey | |
Laga et al. | A survey on deep learning techniques for stereo-based depth estimation | |
Hu et al. | Deep depth completion from extremely sparse data: A survey | |
Kim et al. | Deep monocular depth estimation via integration of global and local predictions | |
Jiang et al. | Self-supervised relative depth learning for urban scene understanding | |
Marion et al. | Label fusion: A pipeline for generating ground truth labels for real rgbd data of cluttered scenes | |
CN109643368B (en) | Detecting objects in video data | |
Hu et al. | Single-image real-time rain removal based on depth-guided non-local features | |
Liu | Beyond pixels: exploring new representations and applications for motion analysis | |
Bešić et al. | Dynamic object removal and spatio-temporal RGB-D inpainting via geometry-aware adversarial learning | |
Zhu et al. | Ponderv2: Pave the way for 3d foundation model with a universal pre-training paradigm | |
Yin et al. | Or-nerf: Object removing from 3d scenes guided by multiview segmentation with neural radiance fields | |
Tao et al. | Indoor 3D semantic robot VSLAM based on mask regional convolutional neural network | |
Abdulwahab et al. | Adversarial learning for depth and viewpoint estimation from a single image | |
Wan et al. | Boosting image-based localization via randomly geometric data augmentation | |
Zhang et al. | Exploring semantic information extraction from different data forms in 3D point cloud semantic segmentation | |
Alletto et al. | Self-supervised optical flow estimation by projective bootstrap | |
Tian et al. | Monocular depth estimation based on a single image: a literature review | |
Zhao et al. | SG-GS: Photo-realistic Animatable Human Avatars with Semantically-Guided Gaussian Splatting | |
Teng et al. | Reconstructing three-dimensional models of objects using a Kinect sensor | |
Chen et al. | An improved BIM aided indoor localization method via enhancing cross-domain image retrieval based on deep learning | |
CN118071932A (en) | Three-dimensional static scene image reconstruction method and system | |
Wang et al. | 3D object detection algorithm for panoramic images with multi-scale convolutional neural network | |
Hou et al. | Octree-based approach for real-time 3d indoor mapping using rgb-d video data | |
Jang et al. | Two-phase approach for monocular object detection and 6-dof pose estimation |