Computer Science > Computer Vision and Pattern Recognition

arXiv:2311.10121 (cs)

[Submitted on 16 Nov 2023 (v1), last revised 16 Apr 2024 (this version, v3)]

Title: Slide-SAM: Medical SAM Meets Sliding Window

Authors: Quan Quan, Fenghe Tang, Zikang Xu, Heqin Zhu, S. Kevin Zhou

Abstract: The Segment Anything Model (SAM) has achieved notable success in two-dimensional image segmentation of natural images. However, the substantial gap between medical and natural images hinders its direct application to medical image segmentation tasks. In 3D medical images in particular, SAM struggles to learn contextual relationships between slices, which limits its practical applicability. Moreover, applying 2D SAM to 3D images requires prompting the entire volume, which is time- and label-consuming. To address these problems, we propose Slide-SAM, which treats a stack of three adjacent slices as a prediction window. It first takes three slices from a 3D volume, together with point or bounding-box prompts on the central slice, as input and predicts segmentation masks for all three slices. The masks of the top and bottom slices are then used to generate new prompts for adjacent slices. Finally, step-wise prediction is achieved by sliding the prediction window forward or backward through the entire volume. Our model is trained on multiple public and private medical datasets and demonstrates its effectiveness with minimal prompts in extensive 3D segmentation experiments. Code is available at this https URL.
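
The sliding-window procedure described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that the next window is centered on the edge slice of the previous window, that a new prompt is the bounding box of that slice's predicted mask, and that propagation stops once a mask comes back empty. The names `mask_to_box`, `slide_segment`, and `toy_predict` are hypothetical, and `toy_predict` is a simple thresholding stand-in for the real three-slice model.

```python
import numpy as np


def mask_to_box(mask):
    """Bounding box (x_min, y_min, x_max, y_max) of a binary mask, or None if empty."""
    ys, xs = np.nonzero(mask)
    if ys.size == 0:
        return None
    return int(xs.min()), int(ys.min()), int(xs.max()), int(ys.max())


def toy_predict(window, box):
    """Hypothetical stand-in for the model: threshold the (3, H, W) window inside the box."""
    x0, y0, x1, y1 = box
    inside = np.zeros(window.shape, dtype=bool)
    inside[:, y0:y1 + 1, x0:x1 + 1] = True
    return (window > 0.5) & inside


def slide_segment(volume, start, box, predict_window):
    """Propagate a box prompt given on slice `start` through a (D, H, W) volume.

    `predict_window(window, box)` must return (3, H, W) boolean masks for a
    three-slice window, given a box prompt on the window's central slice.
    """
    depth = volume.shape[0]
    if not 1 <= start <= depth - 2:
        raise ValueError("start must have a neighbouring slice on both sides")

    masks = np.zeros(volume.shape, dtype=bool)
    # Initial window: the prompted slice and its two neighbours.
    masks[start - 1:start + 2] = predict_window(volume[start - 1:start + 2], box)

    for step in (1, -1):  # slide forward, then backward
        center = start + step  # assumption: re-center on the window's edge slice
        while 1 <= center <= depth - 2:
            prompt = mask_to_box(masks[center])
            if prompt is None:
                break  # assumption: an empty mask means the object has ended
            pred = predict_window(volume[center - 1:center + 2], prompt)
            new_slice = center + step
            # Keep only the newly reached slice (index 2 forward, 0 backward).
            masks[new_slice] = pred[1 + step]
            center = new_slice
    return masks
```

With this arrangement a single prompt on one slice is enough to reach every slice the object occupies. How the actual model merges overlapping window predictions or derives point prompts is not stated in the abstract, so the sketch leaves those choices out.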

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2311.10121 [cs.CV]
	(or arXiv:2311.10121v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2311.10121

Submission history

From: Quan Quan
[v1] Thu, 16 Nov 2023 10:45:46 UTC (6,293 KB)
[v2] Tue, 5 Dec 2023 07:10:25 UTC (6,377 KB)
[v3] Tue, 16 Apr 2024 14:35:13 UTC (8,383 KB)
