Xu et al., 2022 - Google Patents
Refined marine object detector with attention-based spatial pyramid pooling networks and bidirectional feature fusion strategyXu et al., 2022
- Document ID
- 12783109108184095314
- Author
- Xu F
- Wang H
- Sun X
- Fu X
- Publication year
- Publication venue
- Neural Computing and Applications
External Links
Snippet
Marine object detection has become increasingly important in intelligent underwater robot. Because of color cast and blur in underwater images, features directly extracted from backbone networks usually lack interesting and discriminative characters, that affects …
- 230000004927 fusion 0 title abstract description 51
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
- G06F17/30247—Information retrieval; Database structures therefor; File system structures therefor in image databases based on features automatically derived from the image data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Xu et al. | Refined marine object detector with attention-based spatial pyramid pooling networks and bidirectional feature fusion strategy | |
| Zhang et al. | Global context aware RCNN for object detection | |
| Ge et al. | A review of deep learning based target detection algorithms | |
| Ning et al. | Small object detection based on YOLOv8 in UAV perspective | |
| Zhou et al. | RSANet: towards real-time object detection with residual semantic-guided attention feature pyramid network | |
| Zhou et al. | Discriminative attention-augmented feature learning for facial expression recognition in the wild | |
| Hou et al. | M-YOLO: an object detector based on global context information for infrared images | |
| Jiao et al. | RS-YOLO: An efficient object detection algorithm for road scenes | |
| Wu et al. | YOLOv5_mamba: unmanned aerial vehicle object detection based on bidirectional dense feedback network and adaptive gate feature fusion | |
| Wang et al. | Multi-scale dense and attention mechanism for image semantic segmentation based on improved DeepLabv3+ | |
| Shi et al. | Umg-clip: A unified multi-granularity vision generalist for open-world understanding | |
| Luo et al. | Ebc-yolo: A remote sensing target recognition model adapted for complex environments | |
| Zhang et al. | An effective CNN and Transformer fusion network for camouflaged object detection | |
| Li et al. | Refine-fpn: Instance segmentation based on a non-local multi-feature aggregation mechanism | |
| Zeng et al. | C4D-YOLOv8: improved YOLOv8 for object detection on drone-captured images | |
| Xu et al. | Semantic-Orthogonal Multi-modal Attention Network for RGB-D Salient Object Detection: J. Xu et al. | |
| Aghaee et al. | MDSSD-MobV2: An embedded deconvolutional multispectral pedestrian detection based on SSD-MobileNetV2 | |
| Wang et al. | Insulator defect detection based on improved you-only-look-once v4 in complex scenarios | |
| Ding et al. | Learning efficient single stage pedestrian detection by squeeze-and-excitation network | |
| Hu et al. | ULAF-Net: Ultra lightweight attention fusion network for real-time semantic segmentation | |
| Fan et al. | Global contextual attention for pure regression object detection | |
| Ban et al. | Real-Time object detection based on convolutional block attention module | |
| Liu et al. | Siamese network with bidirectional feature pyramid for small target tracking | |
| Ren et al. | AF-DETR: efficient UAV small object detector via assemble-and-fusion mechanism | |
| Zhong et al. | DEEPAM: toward deeper attention module in residual convolutional neural networks |