Computer Science > Computer Vision and Pattern Recognition

arXiv:2402.17521 (cs)

[Submitted on 27 Feb 2024 (v1), last revised 5 Aug 2024 (this version, v3)]

Title:AVS-Net: Point Sampling with Adaptive Voxel Size for 3D Scene Understanding

Authors:Hongcheng Yang, Dingkang Liang, Dingyuan Zhang, Zhe Liu, Zhikang Zou, Xingyu Jiang, Yingying Zhu

Abstract:The recent advancements in point cloud learning have enabled intelligent vehicles and robots to comprehend 3D environments better. However, processing large-scale 3D scenes remains a challenging problem, such that efficient downsampling methods play a crucial role in point cloud learning. Existing downsampling methods either require a huge computational burden or sacrifice fine-grained geometric information. For such purpose, this paper presents an advanced sampler that achieves both high accuracy and efficiency. The proposed method utilizes voxel centroid sampling as a foundation but effectively addresses the challenges regarding voxel size determination and the preservation of critical geometric cues. Specifically, we propose a Voxel Adaptation Module that adaptively adjusts voxel sizes with the reference of point-based downsampling ratio. This ensures that the sampling results exhibit a favorable distribution for comprehending various 3D objects or scenes. Meanwhile, we introduce a network compatible with arbitrary voxel sizes for sampling and feature extraction while maintaining high efficiency. The proposed approach is demonstrated with 3D object detection and 3D semantic segmentation. Compared to existing state-of-the-art methods, our approach achieves better accuracy on outdoor and indoor large-scale datasets, e.g. Waymo and ScanNet, with promising efficiency.

Comments:	11 pages, 7 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2402.17521 [cs.CV]
	(or arXiv:2402.17521v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2402.17521

Submission history

From: Hongchneg Yang [view email]
[v1] Tue, 27 Feb 2024 14:05:05 UTC (888 KB)
[v2] Tue, 16 Apr 2024 03:02:04 UTC (1,683 KB)
[v3] Mon, 5 Aug 2024 03:16:03 UTC (1,671 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:AVS-Net: Point Sampling with Adaptive Voxel Size for 3D Scene Understanding

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:AVS-Net: Point Sampling with Adaptive Voxel Size for 3D Scene Understanding

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators