Xu et al., 2022 - Google Patents

Refined marine object detector with attention-based spatial pyramid pooling networks and bidirectional feature fusion strategy

Document ID: 12783109108184095314
Author: Xu F; Wang H; Sun X; Fu X
Publication year: 2022
Publication venue: Neural Computing and Applications

Snippet

Marine object detection has become increasingly important for intelligent underwater robots. Because of color cast and blur in underwater images, features extracted directly from backbone networks usually lack interesting and discriminative characteristics, which affects …

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
- G06F17/30247—Information retrieval; Database structures therefor; File system structures therefor in image databases based on features automatically derived from the image data
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30799—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using low-level visual features of the video content
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass

Similar Documents

Publication	Publication Date	Title
Xu et al.	2022	Refined marine object detector with attention-based spatial pyramid pooling networks and bidirectional feature fusion strategy
Zhang et al.	2021	Global context aware RCNN for object detection
Ge et al.	2024	A review of deep learning based target detection algorithms
Ning et al.	2024	Small object detection based on YOLOv8 in UAV perspective
Zhou et al.	2021	RSANet: towards real-time object detection with residual semantic-guided attention feature pyramid network
Zhou et al.	2022	Discriminative attention-augmented feature learning for facial expression recognition in the wild
Hou et al.	2022	M-YOLO: an object detector based on global context information for infrared images
Jiao et al.	2025	RS-YOLO: An efficient object detection algorithm for road scenes
Wu et al.	2024	YOLOv5_mamba: unmanned aerial vehicle object detection based on bidirectional dense feedback network and adaptive gate feature fusion
Wang et al.	2022	Multi-scale dense and attention mechanism for image semantic segmentation based on improved DeepLabv3+
Shi et al.	2024	UMG-CLIP: A unified multi-granularity vision generalist for open-world understanding
Luo et al.	2025	EBC-YOLO: A remote sensing target recognition model adapted for complex environments
Zhang et al.	2025	An effective CNN and Transformer fusion network for camouflaged object detection
Li et al.	2023	Refine-FPN: Instance segmentation based on a non-local multi-feature aggregation mechanism
Zeng et al.	2025	C4D-YOLOv8: improved YOLOv8 for object detection on drone-captured images
Xu et al.	2025	Semantic-Orthogonal Multi-modal Attention Network for RGB-D Salient Object Detection
Aghaee et al.	2024	MDSSD-MobV2: An embedded deconvolutional multispectral pedestrian detection based on SSD-MobileNetV2
Wang et al.	2023	Insulator defect detection based on improved you-only-look-once v4 in complex scenarios
Ding et al.	2021	Learning efficient single stage pedestrian detection by squeeze-and-excitation network
Hu et al.	2024	ULAF-Net: Ultra lightweight attention fusion network for real-time semantic segmentation
Fan et al.	2022	Global contextual attention for pure regression object detection
Ban et al.	2020	Real-Time object detection based on convolutional block attention module
Liu et al.	2021	Siamese network with bidirectional feature pyramid for small target tracking
Ren et al.	2024	AF-DETR: efficient UAV small object detector via assemble-and-fusion mechanism
Zhong et al.	2024	DEEPAM: toward deeper attention module in residual convolutional neural networks