[go: up one dir, main page]

Huo et al., 2025 - Google Patents

PR-DETR: Extracting and utilizing prior knowledge for improved end-to-end object detection

Huo et al., 2025

Document ID
13674316253581472264
Author
Huo Y
Yao M
Wang T
Tian Q
Zhao J
Liu X
Wang H
Publication year
Publication venue
Image and Vision Computing

External Links

Snippet

The query initialization in the Transformer-based target detection algorithm has static characteristics, resulting in a limitation to flexibly adjust the degree of attention to different image features during the learning process. In addition, without the guidance of global …
Continue reading at www.sciencedirect.com (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/20Handling natural language data
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor; File system structures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/36Image preprocessing, i.e. processing the image information without deciding about the identity of the image
    • G06K9/46Extraction of features or characteristics of the image
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06NCOMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/002D [Two Dimensional] image generation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis

Similar Documents

Publication Publication Date Title
Hu et al. DGW‐YOLOv8: A small insulator target detection algorithm based on deformable attention backbone and WIoU loss function
Li et al. Facial expression recognition with faster R-CNN
Rochan et al. Unsupervised domain adaptation in lidar semantic segmentation with self-supervision and gated adapters
Li et al. Ss-yolo: An object detection algorithm based on YOLOv3 and shufflenet
CN117218102A (en) Insulator defect detection method and system based on improved YOLOv5
Shi et al. A multitask network and two large-scale datasets for change detection and captioning in remote sensing images
Su et al. EpNet: Power lines foreign object detection with Edge Proposal Network and data composition
Li et al. UDA‐Net: Densely attention network for underwater image enhancement
Zhou et al. DTKD-Net: Dual-teacher knowledge distillation lightweight network for water-related optics image enhancement
CN116010578A (en) Answer positioning method and device based on weak supervision double-flow visual language interaction
Li et al. Hybrid Convolutional-Transformer framework for drone-based few-shot weakly supervised object detection
Kyem et al. Weather-adaptive synthetic data generation for enhanced power line inspection using stargan
Huo et al. PR-DETR: Extracting and utilizing prior knowledge for improved end-to-end object detection
Xu et al. Generalization boosted adapter for open-vocabulary segmentation
Dong et al. Leveraging large-scale pretrained vision foundation models for label-efficient 3d point cloud segmentation
Pei et al. Lightweight transmission line defect identification method based on OFN network and distillation method
Wang et al. Llava-sg: Leveraging scene graphs as visual semantic expression in vision-language models
Zhang et al. Mimtracking: Masked image modeling enhanced vision transformer for visual object tracking
Xiang et al. BN-YOLO: a lightweight method for bird’s nest detection on transmission lines
CN117173401B (en) Semi-supervised medical image segmentation method and system based on cross guidance and feature level consistency dual regularization
Huang et al. Tosa: Token merging with spatial awareness
Ke et al. Vehicle logo recognition with small sample problem in complex scene based on data augmentation
Wang et al. DSFDcd: joint distribution sampling and feature decoupling deep network for remote sensing change detection
Liao et al. A Transformer-Based Framework for Tiny Object Detection
Zhang et al. Cr2pq: Continuous relative rotary positional query for dense visual representation learning