Ren et al., 2022 - Google Patents

Multi-scale convolutional feature fusion for 6D pose estimation

Ren et al., 2022

Document ID: 6097157231323106715
Author: Ren Y; Liu J
Publication year: 2022
Publication venue: Proceedings of the 2022 6th International Conference on Video and Image Processing

External Links

Cited by

Snippet

In order to obtain accurate pose estimate and satisfy real-time needs, 6D pose estimation of objects is an important task to handle challenging under certain circumstances, such as noisy background, and lighting fluctuations. We propose a 6D pose estimation method with …

Continue reading at dl.acm.org (other versions)

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00281—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS

Similar Documents

Publication	Publication Date	Title
Truong et al.	2023	Pdc-net+: Enhanced probabilistic dense correspondence network
Schmidt et al.	2016	Self-supervised visual descriptor learning for dense correspondence
Dvornik et al.	2019	On the importance of visual context for data augmentation in scene understanding
Tewari et al.	2018	High-fidelity monocular face reconstruction based on an unsupervised model-based face autoencoder
Tu et al.	2023	Consistent 3d hand reconstruction in video via self-supervised learning
US12106554B2 (en)	2024-10-01	Image sequence processing using neural networks
Zeng et al.	2021	Reference-based defect detection network
Liu et al.	2022	Explicit occlusion reasoning for multi-person 3d human pose estimation
Zhong et al.	2021	Mv-ton: Memory-based video virtual try-on network
Joung et al.	2019	Unsupervised stereo matching using confidential correspondence consistency
He et al.	2023	ContourPose: Monocular 6-D pose estimation method for reflective textureless metal parts
Jeon et al.	2022	Struct-MDC: Mesh-refined unsupervised depth completion leveraging structural regularities from visual SLAM
Goyal et al.	2023	Emotionally enhanced talking face generation
Yang et al.	2023	Deep face video inpainting via UV mapping
Kaskman et al.	2020	6 dof pose estimation of textureless objects from multiple rgb frames
Liu et al.	2021	FAMINet: Learning real-time semisupervised video object segmentation with steepest optimized optical flow
Cao et al.	2022	CMAN: Leaning global structure correlation for monocular 3D object detection
Agarwal et al.	2024	Unmasking the potential: evaluating image inpainting techniques for masked face reconstruction
Lee et al.	2020	Learning semantic correspondence exploiting an object-level prior
Tang et al.	2021	Image dataset creation and networks improvement method based on CAD model and edge operator for object detection in the manufacturing industry
Qin et al.	2025	Self-supervised single-image 3D face reconstruction method based on attention mechanism and attribute refinement
CN114943747A (en)	2022-08-26	Image analysis method and device, video editing method and device, and medium
Ren et al.	2022	Multi-scale convolutional feature fusion for 6D pose estimation
Wang et al.	2023	Versatile face animator: Driving arbitrary 3d facial avatar in rgbd space
Cho et al.	2023	Synthesizing industrial defect images under data imbalance