Shen et al., 2023 - Google Patents
FlowFormer: 3D scene flow estimation for point clouds with transformersShen et al., 2023
- Document ID
- 9128533427160518126
- Author
- Shen Y
- Hui L
- Publication year
- Publication venue
- Knowledge-Based Systems
External Links
Snippet
Since estimating scene flow from point clouds is challenging, some methods involve the robust Transformer. However, there are two problems with these methods:(1) Dense connectivity of global attention reaches the efficiency bottleneck.(2) The lack of adaptive flow …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6267—Classification techniques
- G06K9/6268—Classification techniques relating to the classification paradigm, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108509978B (en) | Multi-class target detection method and model based on CNN (CNN) multi-level feature fusion | |
CN110674688B (en) | Face recognition model acquisition method, system and medium for video monitoring scene | |
CN106157307B (en) | A kind of monocular image depth estimation method based on multiple dimensioned CNN and continuous CRF | |
CN110163286B (en) | Hybrid pooling-based domain adaptive image classification method | |
Cherabier et al. | Learning priors for semantic 3d reconstruction | |
CN108132968A (en) | Network text is associated with the Weakly supervised learning method of Semantic unit with image | |
CN115049841A (en) | Depth unsupervised multistep anti-domain self-adaptive high-resolution SAR image surface feature extraction method | |
Li et al. | Example-based image super-resolution with class-specific predictors | |
US20230019972A1 (en) | Systems and methods of contrastive point completion with fine-to-coarse refinement | |
CN117058437B (en) | A flower classification method, system, equipment and medium based on knowledge distillation | |
Shen et al. | FlowFormer: 3D scene flow estimation for point clouds with transformers | |
CN112801107B (en) | Image segmentation method and electronic equipment | |
CN109522831B (en) | Real-time vehicle detection method based on micro-convolution neural network | |
CN113139468A (en) | Video abstract generation method fusing local target features and global features | |
CN103049340A (en) | Image super-resolution reconstruction method of visual vocabularies and based on texture context constraint | |
Wang et al. | TF-SOD: A novel transformer framework for salient object detection | |
CN116452862A (en) | Image classification method based on domain generalization learning | |
Zhu et al. | Real-time crowd counting via lightweight scale-aware network | |
CN117523194A (en) | An image segmentation method based on sparse annotation | |
Yu et al. | Machine learning and signal processing for big multimedia analysis | |
CN118015276A (en) | A semi-supervised semantic segmentation method based on dual-path multi-scale | |
Zheng et al. | DCU-NET: Self-supervised monocular depth estimation based on densely connected U-shaped convolutional neural networks | |
Wang et al. | HQDec: self-supervised monocular depth estimation based on a high-quality decoder | |
Zhang et al. | Three things we need to know about transferring stable diffusion to visual dense prediction tasks | |
Ye et al. | GFSCompNet: remote sensing image compression network based on global feature-assisted segmentation |