Computer Science > Computer Vision and Pattern Recognition

arXiv:2405.16094 (cs)

[Submitted on 25 May 2024 (v1), last revised 3 Jun 2024 (this version, v2)]

Title:PLUG: Revisiting Amodal Segmentation with Foundation Model and Hierarchical Focus

Authors:Zhaochen Liu, Limeng Qiao, Xiangxiang Chu, Tingting Jiang

Abstract:Aiming to predict the complete shapes of partially occluded objects, amodal segmentation is an important step towards visual intelligence. With crucial significance, practical prior knowledge derives from sufficient training, while limited amodal annotations pose challenges to achieve better performance. To tackle this problem, utilizing the mighty priors accumulated in the foundation model, we propose the first SAM-based amodal segmentation approach, PLUG. Methodologically, a novel framework with hierarchical focus is presented to better adapt the task characteristics and unleash the potential capabilities of SAM. In the region level, due to the association and division in visible and occluded areas, inmodal and amodal regions are assigned as the focuses of distinct branches to avoid mutual disturbance. In the point level, we introduce the concept of uncertainty to explicitly assist the model in identifying and focusing on ambiguous points. Guided by the uncertainty map, a computation-economic point loss is applied to improve the accuracy of predicted boundaries. Experiments are conducted on several prominent datasets, and the results show that our proposed method outperforms existing methods with large margins. Even with fewer total parameters, our method still exhibits remarkable advantages.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2405.16094 [cs.CV]
	(or arXiv:2405.16094v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2405.16094

Submission history

From: Zhaochen Liu [view email]
[v1] Sat, 25 May 2024 06:58:20 UTC (3,534 KB)
[v2] Mon, 3 Jun 2024 08:27:09 UTC (3,534 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:PLUG: Revisiting Amodal Segmentation with Foundation Model and Hierarchical Focus

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:PLUG: Revisiting Amodal Segmentation with Foundation Model and Hierarchical Focus

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators