Huo et al., 2025 - Google Patents

PR-DETR: Extracting and utilizing prior knowledge for improved end-to-end object detection

Document ID: 13674316253581472264
Author: Huo Y; Yao M; Wang T; Tian Q; Zhao J; Liu X; Wang H
Publication year: 2025
Publication venue: Image and Vision Computing

Snippet

In Transformer-based object detection algorithms, query initialization is static, which limits the model's ability to flexibly adjust how much attention it pays to different image features during learning. In addition, without the guidance of global …

Classifications

- G—PHYSICS
  - G06—COMPUTING; CALCULATING; COUNTING
    - G06F—ELECTRICAL DIGITAL DATA PROCESSING
      - G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
        - G06F17/20—Handling natural language data
        - G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
        - G06F17/10—Complex mathematical operations
    - G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
      - G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
        - G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
          - G06K9/46—Extraction of features or characteristics of the image
        - G06K9/62—Methods or arrangements for recognition using electronic means
    - G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
      - G06N99/00—Subject matter not provided for in other groups of this subclass
    - G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
      - G06T11/00—2D [Two Dimensional] image generation
      - G06T7/00—Image analysis

Similar Documents

Publication	Publication Date	Title
Hu et al.	2024	DGW‐YOLOv8: A small insulator target detection algorithm based on deformable attention backbone and WIoU loss function
Li et al.	2017	Facial expression recognition with Faster R-CNN
Rochan et al.	2022	Unsupervised domain adaptation in LiDAR semantic segmentation with self-supervision and gated adapters
Li et al.	2020	SS-YOLO: An object detection algorithm based on YOLOv3 and ShuffleNet
CN117218102A (en)	2023-12-12	Insulator defect detection method and system based on improved YOLOv5
Shi et al.	2024	A multitask network and two large-scale datasets for change detection and captioning in remote sensing images
Su et al.	2022	EpNet: Power lines foreign object detection with Edge Proposal Network and data composition
Li et al.	2021	UDA‐Net: Densely attention network for underwater image enhancement
Zhou et al.	2024	DTKD-Net: Dual-teacher knowledge distillation lightweight network for water-related optics image enhancement
CN116010578A (en)	2023-04-25	Answer positioning method and device based on weak supervision double-flow visual language interaction
Li et al.	2022	Hybrid Convolutional-Transformer framework for drone-based few-shot weakly supervised object detection
Kyem et al.	2024	Weather-adaptive synthetic data generation for enhanced power line inspection using StarGAN
Huo et al.	2025	PR-DETR: Extracting and utilizing prior knowledge for improved end-to-end object detection
Xu et al.	2024	Generalization boosted adapter for open-vocabulary segmentation
Dong et al.	2025	Leveraging large-scale pretrained vision foundation models for label-efficient 3D point cloud segmentation
Pei et al.	2024	Lightweight transmission line defect identification method based on OFN network and distillation method
Wang et al.	2025	LLaVA-SG: Leveraging scene graphs as visual semantic expression in vision-language models
Zhang et al.	2024	MIMTracking: Masked image modeling enhanced vision transformer for visual object tracking
Xiang et al.	2024	BN-YOLO: a lightweight method for bird’s nest detection on transmission lines
CN117173401B (en)	2024-05-03	Semi-supervised medical image segmentation method and system based on cross guidance and feature level consistency dual regularization
Huang et al.	2025	ToSA: Token merging with spatial awareness
Ke et al.	2020	Vehicle logo recognition with small sample problem in complex scene based on data augmentation
Wang et al.	2025	DSFDcd: joint distribution sampling and feature decoupling deep network for remote sensing change detection
Liao et al.	2023	A Transformer-Based Framework for Tiny Object Detection
Zhang et al.	2025	CR2PQ: Continuous relative rotary positional query for dense visual representation learning