FDA-PointNet++: A Point Cloud Classification Model Based on Fused Downsampling Strategy and Attention Module

Wei Sun⁸,
Peipei Gu⁸,
Yijie Pan⁹,
Junxia Ma⁸,
Jiantao Cui⁸ &
…
Pujie Han⁸

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 2014))

Included in the following conference series:

International Conference on Applied Intelligence

552 Accesses

Abstract

In recent years, the use of deep learning models for point cloud classification and segmentation tasks has increasingly become a hot topic in 3D point cloud research. However, the sparsity and inhomogeneity of point cloud data make it difficult to extract point cloud features. Meanwhile, how to effectively extract fine-grained local features becomes crucial in point cloud understanding. Therefore, in this study, we propose a novel FDA-PointNet+ + point cloud classification model based on fusion downsampling strategy and attention module. Firstly, the method proposes a fusion downsampling strategy, which performs hierarchical downsampling on the initial point cloud data, and then repeats the downsampling operation on the sampling results and performs feature fusion to form feature maps with multi-scale information to enhance the richness of local spatial point cloud feature information. Secondly, we incorporate a channel attention mechanism into PointNet+ + and propose a Local Feature Aggregation (LFA) module for point cloud local feature extraction. This method improves the local feature extraction capability of the network model by amplifying the relevant local features and suppressing the non-relevant features. Experimental results on the ModelNet40 dataset demonstrate that FDA-PointNet+ + achieves higher classification accuracy and robustness, with a 1.3% increase in overall accuracy (OA) and a 1.4% improvement in class accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 64.99; Price excludes VAT (USA)

Softcover Book: USD 84.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

SDANet: spatial deep attention-based for point cloud classification and segmentation

Article 30 March 2022

Multi-scale Spatial Offset-Attention Network for 3D Point Clouds Classification

Point cloud upsampling network based on pyramid pooling and self-attention mechanism

Article Open access 16 October 2024

References

Qi, C.R., Su, H., Ma, K., et al.: PointNet: deep learning on point sets for 3D classification and segmentation. In: Computer Vision and Pattern Recognition 2017, LNCS, pp. 72–85. Springer, Heidelberg (2017)
Google Scholar
Qi, C.R., Yi, L., Su, H., et al.: PointNet++: deep hierarchical feature learning on point sets in a metric space. In: Advances in Neural Information Processing Systems 2017, LNCS, pp. 1–13. Springer, Heidelberg (2017)
Google Scholar
Gu, P., et al.: Multi-head self-attention model for classification of temporal lobe epilepsy subtypes. In: Proceedings of the Frontiers in Physiology 11 2020, LNCS, pp. 1–13. Springer, Heidelberg (2020)
Google Scholar
Su, H., Maji, S., Kalogerakis, E., et al.: Multi-view convolutional neural networks for 3D shape recognition. In: International Conference on Computer Vision 2015, LNCS, pp. 945–953. Springer, Heidelberg (2015)
Google Scholar
Qi, C.R., Su, H., Niebner, M., et al.: Volumetric and multi-view CNNs for object classification on 3d data. In: Computer Vision and Pattern Recognition 2016, LNCS, pp. 5648–5656. Springer, Heidelberg (2016)
Google Scholar
Wu, Z., Song, S., Khosla, A., et al.: 3D ShapeNets: a deep representation for volumetric shapes. In: Computer Vision and Pattern Recognition 2015, LNCS, pp. 1912–1920. Springer, Heidelberg (2015)
Google Scholar
Simonovsky, M., Komodakis, N. Dynamic edge-conditioned filters in convolutional neural networks on graphs. In: Proceeding of CVPR 2017, LNCS, pp. 1–13. Springer, Heidelberg (2017)
Google Scholar
Wang, Y., Sun, Y., Liu, Z., et al.: Dynamic graph CNN for learning on point clouds. In: ACM Transactions on Graphics (TOG) 2019, LNCS, pp. 1–12. Springer, Heidelberg (2019)
Google Scholar
Guo, M.H., Cai, J.X., Liu, Z.N., et al.: PCT: point cloud transformer. In: Computational Visual Media 2021, LNCS, pp. 187–199. Springer, Heidelberg (2021)
Google Scholar
Xu, Y., Fan, T., Xu, M., et al.: SpiderCNN: deep learning on point sets with parameterized convolutional filters. In: Proceedings of the European Conference on Computer Vision (ECCV) 2018, LNCS, pp. 87–102. Springer, Heidelberg (2018)
Google Scholar
Qian, G., Li, Y., Peng, H., et al.: PointNext: revisiting PointNet++ with improved training and scaling strategies. In: Advances in Neural Information Processing Systems 2022, LNCS, pp. 23192–23204. Springer, Heidelberg (2022)
Google Scholar
Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2018, LNCS, pp. 7132–7141. Springer, Heidelberg (2018)
Google Scholar
Woo, S., Park, J., Lee, J.Y., et al.: CBAM: convolutional block attention module. In: Proceedings of the European Conference on Computer Vision (ECCV) 2018, LNCS, pp. 3–19. Springer, Heidelberg (2018)
Google Scholar
Guo, J., et al.: Automatic and accurate epilepsy ripple and fast ripple detection via virtual sample generation and attention neural networks. In: IEEE Transactions on Neural Systems and Rehabilitation Engineering 2020, LNCS, pp. 1710–1719. Springer, Heidelberg (2020)
Google Scholar
Guo, J.: Detecting high-frequency oscillations for Stereoelectroencephalography in epilepsy via hypergraph learning. In: IEEE Transactions on Neural Systems and Rehabilitation Engineering 2021, LNCS, pp. 587–596. Springer, Heidelberg (2021)
Google Scholar
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: International Conference on Learning Representations (ICLR) 2015, LNCS, pp. 1–13. Springer, Heidelberg (2015)
Google Scholar
Li, Y., Bu, R., Sun, M., et al.: PointCNN: convolution on x-transformed points. In: Advances in Neural Information Processing Systems 2018, LNCS, pp. 1–13. Springer, Heidelberg (2018)
Google Scholar
Peng, X., Long, G., Shen, T., Wang, S., Jiang, J.: Sequential diagnosis prediction with transformer and ontological representation. In: 2021 IEEE International Conference on Data Mining (ICDM), LNCS, pp. 489–498. Springer, Heidelberg (2021)
Google Scholar
Peng, X., Long, G., Yan, P., et al.: COVID-19 impact analysis on patients with complex health conditions: a literature review. In: 2023, LNCS, pp. 1–13. Springer, Heidelberg (2023)
Google Scholar
Chen, D., et al.: Scalp EEG-based pain detection using convolutional neural network. In: IEEE Transactions on Neural Systems and Rehabilitation Engineering 2022, LNCS, pp. 1–13. Springer, Heidelberg (2022)
Google Scholar
Peng, X., Long, G., Shen, T., Wang, S., Jiang, J., Zhang, C.: BiteNet: bidirectional temporal encoder network to predict medical outcomes. In: 2020 IEEE International Conference on Data Mining (ICDM), LNCS, pp. 1–13. Springer, Heidelberg (2020)
Google Scholar
Niu, K., Guo, Z., Peng, X., et al.: P-ResUNet: segmentation of brain tissue with purified residual UNet. In: Computers in Biology and Medicine 2022, LNCS, pp. 1–13. Springer, Heidelberg (2022)
Google Scholar
Niu, K., Lu, Y., Peng, X., et al.: Fusion of Sequential Visits and Medical Ontology for Mortality Prediction. In: Journal of Biomedical Informatics 2022, LNCS, pp. 1–13. Springer, Heidelberg (2022)
Google Scholar

Download references

Acknowledgements

This work was supported by Young Tech Innovation Leading Talent Program of Ningbo City under Grant No. 2023QL008; Innovation Consortium Program for Green and Efficient Intelligent Appliance of Ningbo City under Grant No. 2022H002; The Industrial Science and Technology Research Project of Henan Province under Grants 232102210088, 232102210125, 222102210024.

Author information

Authors and Affiliations

Zhengzhou University of Light Industry, Zhengzhou, 450000, China
Wei Sun, Peipei Gu, Junxia Ma, Jiantao Cui & Pujie Han
Eastern Institute for Advanced Study, Eastern Institute of Technology, Ningbo, 315000, China
Yijie Pan

Authors

Wei Sun
View author publications
You can also search for this author in PubMed Google Scholar
Peipei Gu
View author publications
You can also search for this author in PubMed Google Scholar
Yijie Pan
View author publications
You can also search for this author in PubMed Google Scholar
Junxia Ma
View author publications
You can also search for this author in PubMed Google Scholar
Jiantao Cui
View author publications
You can also search for this author in PubMed Google Scholar
Pujie Han
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yijie Pan .

Editor information

Editors and Affiliations

Eastern Institute of Technology, Zhejiang, China
De-Shuang Huang
University of Wollongong, North Wollongong, NSW, Australia
Prashan Premaratne
Guangxi Academy of Sciences, Guangxi, China
Changan Yuan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sun, W., Gu, P., Pan, Y., Ma, J., Cui, J., Han, P. (2024). FDA-PointNet++: A Point Cloud Classification Model Based on Fused Downsampling Strategy and Attention Module. In: Huang, DS., Premaratne, P., Yuan, C. (eds) Applied Intelligence. ICAI 2023. Communications in Computer and Information Science, vol 2014. Springer, Singapore. https://doi.org/10.1007/978-981-97-0903-8_24

Download citation

DOI: https://doi.org/10.1007/978-981-97-0903-8_24
Published: 01 March 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-0902-1
Online ISBN: 978-981-97-0903-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics