Computer Science > Computer Vision and Pattern Recognition

arXiv:2304.04264 (cs)

[Submitted on 9 Apr 2023 (v1), last revised 18 Apr 2023 (this version, v4)]

Title:RGB-T Tracking Based on Mixed Attention

Authors:Yang Luo, Xiqing Guo, Mingtao Dong, Jin Yu

View PDF

Abstract:RGB-T tracking involves the use of images from both visible and thermal modalities. The primary objective is to adaptively leverage the relatively dominant modality in varying conditions to achieve more robust tracking compared to single-modality tracking. An RGB-T tracker based on mixed attention mechanism to achieve complementary fusion of modalities (referred to as MACFT) is proposed in this paper. In the feature extraction stage, we utilize different transformer backbone branches to extract specific and shared information from different modalities. By performing mixed attention operations in the backbone to enable information interaction and self-enhancement between the template and search images, it constructs a robust feature representation that better understands the high-level semantic features of the target. Then, in the feature fusion stage, a modality-adaptive fusion is achieved through a mixed attention-based modality fusion network, which suppresses the low-quality modality noise while enhancing the information of the dominant modality. Evaluation on multiple RGB-T public datasets demonstrates that our proposed tracker outperforms other RGB-T trackers on general evaluation metrics while also being able to adapt to longterm tracking scenarios.

Comments:	14 pages, 10 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2304.04264 [cs.CV]
	(or arXiv:2304.04264v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2304.04264
Journal reference:	Sensors 23, no. 14: 6609 (2023)
Related DOI:	https://doi.org/10.3390/s23146609

Submission history

From: Yang Luo [view email]
[v1] Sun, 9 Apr 2023 15:59:41 UTC (1,751 KB)
[v2] Tue, 11 Apr 2023 01:13:05 UTC (1,751 KB)
[v3] Mon, 17 Apr 2023 08:35:20 UTC (1,752 KB)
[v4] Tue, 18 Apr 2023 02:00:25 UTC (1,752 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:RGB-T Tracking Based on Mixed Attention

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:RGB-T Tracking Based on Mixed Attention

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators