Ren et al., 2022 - Google Patents
Multi-scale convolutional feature fusion for 6D pose estimationRen et al., 2022
- Document ID
- 6097157231323106715
- Author
- Ren Y
- Liu J
- Publication year
- Publication venue
- Proceedings of the 2022 6th International Conference on Video and Image Processing
External Links
Snippet
In order to obtain accurate pose estimate and satisfy real-time needs, 6D pose estimation of objects is an important task to handle challenging under certain circumstances, such as noisy background, and lighting fluctuations. We propose a 6D pose estimation method with …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00221—Acquiring or recognising human faces, facial parts, facial sketches, facial expressions
- G06K9/00268—Feature extraction; Face representation
- G06K9/00281—Local features and components; Facial parts ; Occluding parts, e.g. glasses; Geometrical relationships
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20112—Image segmentation details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2200/00—Indexing scheme for image data processing or generation, in general
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T17/00—Three dimensional [3D] modelling, e.g. data description of 3D objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Truong et al. | Pdc-net+: Enhanced probabilistic dense correspondence network | |
Schmidt et al. | Self-supervised visual descriptor learning for dense correspondence | |
Dvornik et al. | On the importance of visual context for data augmentation in scene understanding | |
Tewari et al. | High-fidelity monocular face reconstruction based on an unsupervised model-based face autoencoder | |
Tu et al. | Consistent 3d hand reconstruction in video via self-supervised learning | |
US12106554B2 (en) | Image sequence processing using neural networks | |
Zeng et al. | Reference-based defect detection network | |
Liu et al. | Explicit occlusion reasoning for multi-person 3d human pose estimation | |
Zhong et al. | Mv-ton: Memory-based video virtual try-on network | |
Joung et al. | Unsupervised stereo matching using confidential correspondence consistency | |
He et al. | ContourPose: Monocular 6-D pose estimation method for reflective textureless metal parts | |
Jeon et al. | Struct-MDC: Mesh-refined unsupervised depth completion leveraging structural regularities from visual SLAM | |
Goyal et al. | Emotionally enhanced talking face generation | |
Yang et al. | Deep face video inpainting via UV mapping | |
Kaskman et al. | 6 dof pose estimation of textureless objects from multiple rgb frames | |
Liu et al. | FAMINet: Learning real-time semisupervised video object segmentation with steepest optimized optical flow | |
Cao et al. | CMAN: Leaning global structure correlation for monocular 3D object detection | |
Agarwal et al. | Unmasking the potential: evaluating image inpainting techniques for masked face reconstruction | |
Lee et al. | Learning semantic correspondence exploiting an object-level prior | |
Tang et al. | Image dataset creation and networks improvement method based on CAD model and edge operator for object detection in the manufacturing industry | |
Qin et al. | Self-supervised single-image 3D face reconstruction method based on attention mechanism and attribute refinement | |
CN114943747A (en) | Image analysis method and device, video editing method and device, and medium | |
Ren et al. | Multi-scale convolutional feature fusion for 6D pose estimation | |
Wang et al. | Versatile face animator: Driving arbitrary 3d facial avatar in rgbd space | |
Cho et al. | Synthesizing industrial defect images under data imbalance |