Huo et al., 2025 - Google Patents
PR-DETR: Extracting and utilizing prior knowledge for improved end-to-end object detectionHuo et al., 2025
- Document ID
- 13674316253581472264
- Author
- Huo Y
- Yao M
- Wang T
- Tian Q
- Zhao J
- Liu X
- Wang H
- Publication year
- Publication venue
- Image and Vision Computing
External Links
Snippet
The query initialization in the Transformer-based target detection algorithm has static characteristics, resulting in a limitation to flexibly adjust the degree of attention to different image features during the learning process. In addition, without the guidance of global …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Hu et al. | DGW‐YOLOv8: A small insulator target detection algorithm based on deformable attention backbone and WIoU loss function | |
| Li et al. | Facial expression recognition with faster R-CNN | |
| Rochan et al. | Unsupervised domain adaptation in lidar semantic segmentation with self-supervision and gated adapters | |
| Li et al. | Ss-yolo: An object detection algorithm based on YOLOv3 and shufflenet | |
| CN117218102A (en) | Insulator defect detection method and system based on improved YOLOv5 | |
| Shi et al. | A multitask network and two large-scale datasets for change detection and captioning in remote sensing images | |
| Su et al. | EpNet: Power lines foreign object detection with Edge Proposal Network and data composition | |
| Li et al. | UDA‐Net: Densely attention network for underwater image enhancement | |
| Zhou et al. | DTKD-Net: Dual-teacher knowledge distillation lightweight network for water-related optics image enhancement | |
| CN116010578A (en) | Answer positioning method and device based on weak supervision double-flow visual language interaction | |
| Li et al. | Hybrid Convolutional-Transformer framework for drone-based few-shot weakly supervised object detection | |
| Kyem et al. | Weather-adaptive synthetic data generation for enhanced power line inspection using stargan | |
| Huo et al. | PR-DETR: Extracting and utilizing prior knowledge for improved end-to-end object detection | |
| Xu et al. | Generalization boosted adapter for open-vocabulary segmentation | |
| Dong et al. | Leveraging large-scale pretrained vision foundation models for label-efficient 3d point cloud segmentation | |
| Pei et al. | Lightweight transmission line defect identification method based on OFN network and distillation method | |
| Wang et al. | Llava-sg: Leveraging scene graphs as visual semantic expression in vision-language models | |
| Zhang et al. | Mimtracking: Masked image modeling enhanced vision transformer for visual object tracking | |
| Xiang et al. | BN-YOLO: a lightweight method for bird’s nest detection on transmission lines | |
| CN117173401B (en) | Semi-supervised medical image segmentation method and system based on cross guidance and feature level consistency dual regularization | |
| Huang et al. | Tosa: Token merging with spatial awareness | |
| Ke et al. | Vehicle logo recognition with small sample problem in complex scene based on data augmentation | |
| Wang et al. | DSFDcd: joint distribution sampling and feature decoupling deep network for remote sensing change detection | |
| Liao et al. | A Transformer-Based Framework for Tiny Object Detection | |
| Zhang et al. | Cr2pq: Continuous relative rotary positional query for dense visual representation learning |