Abstract
In recent years, the use of deep learning models for point cloud classification and segmentation tasks has increasingly become a hot topic in 3D point cloud research. However, the sparsity and inhomogeneity of point cloud data make it difficult to extract point cloud features. Meanwhile, how to effectively extract fine-grained local features becomes crucial in point cloud understanding. Therefore, in this study, we propose a novel FDA-PointNet+ + point cloud classification model based on fusion downsampling strategy and attention module. Firstly, the method proposes a fusion downsampling strategy, which performs hierarchical downsampling on the initial point cloud data, and then repeats the downsampling operation on the sampling results and performs feature fusion to form feature maps with multi-scale information to enhance the richness of local spatial point cloud feature information. Secondly, we incorporate a channel attention mechanism into PointNet+ + and propose a Local Feature Aggregation (LFA) module for point cloud local feature extraction. This method improves the local feature extraction capability of the network model by amplifying the relevant local features and suppressing the non-relevant features. Experimental results on the ModelNet40 dataset demonstrate that FDA-PointNet+ + achieves higher classification accuracy and robustness, with a 1.3% increase in overall accuracy (OA) and a 1.4% improvement in class accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Qi, C.R., Su, H., Ma, K., et al.: PointNet: deep learning on point sets for 3D classification and segmentation. In: Computer Vision and Pattern Recognition 2017, LNCS, pp. 72–85. Springer, Heidelberg (2017)
Qi, C.R., Yi, L., Su, H., et al.: PointNet++: deep hierarchical feature learning on point sets in a metric space. In: Advances in Neural Information Processing Systems 2017, LNCS, pp. 1–13. Springer, Heidelberg (2017)
Gu, P., et al.: Multi-head self-attention model for classification of temporal lobe epilepsy subtypes. In: Proceedings of the Frontiers in Physiology 11 2020, LNCS, pp. 1–13. Springer, Heidelberg (2020)
Su, H., Maji, S., Kalogerakis, E., et al.: Multi-view convolutional neural networks for 3D shape recognition. In: International Conference on Computer Vision 2015, LNCS, pp. 945–953. Springer, Heidelberg (2015)
Qi, C.R., Su, H., Niebner, M., et al.: Volumetric and multi-view CNNs for object classification on 3d data. In: Computer Vision and Pattern Recognition 2016, LNCS, pp. 5648–5656. Springer, Heidelberg (2016)
Wu, Z., Song, S., Khosla, A., et al.: 3D ShapeNets: a deep representation for volumetric shapes. In: Computer Vision and Pattern Recognition 2015, LNCS, pp. 1912–1920. Springer, Heidelberg (2015)
Simonovsky, M., Komodakis, N. Dynamic edge-conditioned filters in convolutional neural networks on graphs. In: Proceeding of CVPR 2017, LNCS, pp. 1–13. Springer, Heidelberg (2017)
Wang, Y., Sun, Y., Liu, Z., et al.: Dynamic graph CNN for learning on point clouds. In: ACM Transactions on Graphics (TOG) 2019, LNCS, pp. 1–12. Springer, Heidelberg (2019)
Guo, M.H., Cai, J.X., Liu, Z.N., et al.: PCT: point cloud transformer. In: Computational Visual Media 2021, LNCS, pp. 187–199. Springer, Heidelberg (2021)
Xu, Y., Fan, T., Xu, M., et al.: SpiderCNN: deep learning on point sets with parameterized convolutional filters. In: Proceedings of the European Conference on Computer Vision (ECCV) 2018, LNCS, pp. 87–102. Springer, Heidelberg (2018)
Qian, G., Li, Y., Peng, H., et al.: PointNext: revisiting PointNet++ with improved training and scaling strategies. In: Advances in Neural Information Processing Systems 2022, LNCS, pp. 23192–23204. Springer, Heidelberg (2022)
Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2018, LNCS, pp. 7132–7141. Springer, Heidelberg (2018)
Woo, S., Park, J., Lee, J.Y., et al.: CBAM: convolutional block attention module. In: Proceedings of the European Conference on Computer Vision (ECCV) 2018, LNCS, pp. 3–19. Springer, Heidelberg (2018)
Guo, J., et al.: Automatic and accurate epilepsy ripple and fast ripple detection via virtual sample generation and attention neural networks. In: IEEE Transactions on Neural Systems and Rehabilitation Engineering 2020, LNCS, pp. 1710–1719. Springer, Heidelberg (2020)
Guo, J.: Detecting high-frequency oscillations for Stereoelectroencephalography in epilepsy via hypergraph learning. In: IEEE Transactions on Neural Systems and Rehabilitation Engineering 2021, LNCS, pp. 587–596. Springer, Heidelberg (2021)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: International Conference on Learning Representations (ICLR) 2015, LNCS, pp. 1–13. Springer, Heidelberg (2015)
Li, Y., Bu, R., Sun, M., et al.: PointCNN: convolution on x-transformed points. In: Advances in Neural Information Processing Systems 2018, LNCS, pp. 1–13. Springer, Heidelberg (2018)
Peng, X., Long, G., Shen, T., Wang, S., Jiang, J.: Sequential diagnosis prediction with transformer and ontological representation. In: 2021 IEEE International Conference on Data Mining (ICDM), LNCS, pp. 489–498. Springer, Heidelberg (2021)
Peng, X., Long, G., Yan, P., et al.: COVID-19 impact analysis on patients with complex health conditions: a literature review. In: 2023, LNCS, pp. 1–13. Springer, Heidelberg (2023)
Chen, D., et al.: Scalp EEG-based pain detection using convolutional neural network. In: IEEE Transactions on Neural Systems and Rehabilitation Engineering 2022, LNCS, pp. 1–13. Springer, Heidelberg (2022)
Peng, X., Long, G., Shen, T., Wang, S., Jiang, J., Zhang, C.: BiteNet: bidirectional temporal encoder network to predict medical outcomes. In: 2020 IEEE International Conference on Data Mining (ICDM), LNCS, pp. 1–13. Springer, Heidelberg (2020)
Niu, K., Guo, Z., Peng, X., et al.: P-ResUNet: segmentation of brain tissue with purified residual UNet. In: Computers in Biology and Medicine 2022, LNCS, pp. 1–13. Springer, Heidelberg (2022)
Niu, K., Lu, Y., Peng, X., et al.: Fusion of Sequential Visits and Medical Ontology for Mortality Prediction. In: Journal of Biomedical Informatics 2022, LNCS, pp. 1–13. Springer, Heidelberg (2022)
Acknowledgements
This work was supported by Young Tech Innovation Leading Talent Program of Ningbo City under Grant No. 2023QL008; Innovation Consortium Program for Green and Efficient Intelligent Appliance of Ningbo City under Grant No. 2022H002; The Industrial Science and Technology Research Project of Henan Province under Grants 232102210088, 232102210125, 222102210024.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sun, W., Gu, P., Pan, Y., Ma, J., Cui, J., Han, P. (2024). FDA-PointNet++: A Point Cloud Classification Model Based on Fused Downsampling Strategy and Attention Module. In: Huang, DS., Premaratne, P., Yuan, C. (eds) Applied Intelligence. ICAI 2023. Communications in Computer and Information Science, vol 2014. Springer, Singapore. https://doi.org/10.1007/978-981-97-0903-8_24
Download citation
DOI: https://doi.org/10.1007/978-981-97-0903-8_24
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0902-1
Online ISBN: 978-981-97-0903-8
eBook Packages: Computer ScienceComputer Science (R0)