[go: up one dir, main page]

Skip to main content

FDA-PointNet++: A Point Cloud Classification Model Based on Fused Downsampling Strategy and Attention Module

  • Conference paper
  • First Online:
Applied Intelligence (ICAI 2023)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 2014))

Included in the following conference series:

  • 552 Accesses

Abstract

In recent years, the use of deep learning models for point cloud classification and segmentation tasks has increasingly become a hot topic in 3D point cloud research. However, the sparsity and inhomogeneity of point cloud data make it difficult to extract point cloud features. Meanwhile, how to effectively extract fine-grained local features becomes crucial in point cloud understanding. Therefore, in this study, we propose a novel FDA-PointNet+ + point cloud classification model based on fusion downsampling strategy and attention module. Firstly, the method proposes a fusion downsampling strategy, which performs hierarchical downsampling on the initial point cloud data, and then repeats the downsampling operation on the sampling results and performs feature fusion to form feature maps with multi-scale information to enhance the richness of local spatial point cloud feature information. Secondly, we incorporate a channel attention mechanism into PointNet+ + and propose a Local Feature Aggregation (LFA) module for point cloud local feature extraction. This method improves the local feature extraction capability of the network model by amplifying the relevant local features and suppressing the non-relevant features. Experimental results on the ModelNet40 dataset demonstrate that FDA-PointNet+ + achieves higher classification accuracy and robustness, with a 1.3% increase in overall accuracy (OA) and a 1.4% improvement in class accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 64.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 84.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Qi, C.R., Su, H., Ma, K., et al.: PointNet: deep learning on point sets for 3D classification and segmentation. In: Computer Vision and Pattern Recognition 2017, LNCS, pp. 72–85. Springer, Heidelberg (2017)

    Google Scholar 

  2. Qi, C.R., Yi, L., Su, H., et al.: PointNet++: deep hierarchical feature learning on point sets in a metric space. In: Advances in Neural Information Processing Systems 2017, LNCS, pp. 1–13. Springer, Heidelberg (2017)

    Google Scholar 

  3. Gu, P., et al.: Multi-head self-attention model for classification of temporal lobe epilepsy subtypes. In: Proceedings of the Frontiers in Physiology 11 2020, LNCS, pp. 1–13. Springer, Heidelberg (2020)

    Google Scholar 

  4. Su, H., Maji, S., Kalogerakis, E., et al.: Multi-view convolutional neural networks for 3D shape recognition. In: International Conference on Computer Vision 2015, LNCS, pp. 945–953. Springer, Heidelberg (2015)

    Google Scholar 

  5. Qi, C.R., Su, H., Niebner, M., et al.: Volumetric and multi-view CNNs for object classification on 3d data. In: Computer Vision and Pattern Recognition 2016, LNCS, pp. 5648–5656. Springer, Heidelberg (2016)

    Google Scholar 

  6. Wu, Z., Song, S., Khosla, A., et al.: 3D ShapeNets: a deep representation for volumetric shapes. In: Computer Vision and Pattern Recognition 2015, LNCS, pp. 1912–1920. Springer, Heidelberg (2015)

    Google Scholar 

  7. Simonovsky, M., Komodakis, N. Dynamic edge-conditioned filters in convolutional neural networks on graphs. In: Proceeding of CVPR 2017, LNCS, pp. 1–13. Springer, Heidelberg (2017)

    Google Scholar 

  8. Wang, Y., Sun, Y., Liu, Z., et al.: Dynamic graph CNN for learning on point clouds. In: ACM Transactions on Graphics (TOG) 2019, LNCS, pp. 1–12. Springer, Heidelberg (2019)

    Google Scholar 

  9. Guo, M.H., Cai, J.X., Liu, Z.N., et al.: PCT: point cloud transformer. In: Computational Visual Media 2021, LNCS, pp. 187–199. Springer, Heidelberg (2021)

    Google Scholar 

  10. Xu, Y., Fan, T., Xu, M., et al.: SpiderCNN: deep learning on point sets with parameterized convolutional filters. In: Proceedings of the European Conference on Computer Vision (ECCV) 2018, LNCS, pp. 87–102. Springer, Heidelberg (2018)

    Google Scholar 

  11. Qian, G., Li, Y., Peng, H., et al.: PointNext: revisiting PointNet++ with improved training and scaling strategies. In: Advances in Neural Information Processing Systems 2022, LNCS, pp. 23192–23204. Springer, Heidelberg (2022)

    Google Scholar 

  12. Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2018, LNCS, pp. 7132–7141. Springer, Heidelberg (2018)

    Google Scholar 

  13. Woo, S., Park, J., Lee, J.Y., et al.: CBAM: convolutional block attention module. In: Proceedings of the European Conference on Computer Vision (ECCV) 2018, LNCS, pp. 3–19. Springer, Heidelberg (2018)

    Google Scholar 

  14. Guo, J., et al.: Automatic and accurate epilepsy ripple and fast ripple detection via virtual sample generation and attention neural networks. In: IEEE Transactions on Neural Systems and Rehabilitation Engineering 2020, LNCS, pp. 1710–1719. Springer, Heidelberg (2020)

    Google Scholar 

  15. Guo, J.: Detecting high-frequency oscillations for Stereoelectroencephalography in epilepsy via hypergraph learning. In: IEEE Transactions on Neural Systems and Rehabilitation Engineering 2021, LNCS, pp. 587–596. Springer, Heidelberg (2021)

    Google Scholar 

  16. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. In: International Conference on Learning Representations (ICLR) 2015, LNCS, pp. 1–13. Springer, Heidelberg (2015)

    Google Scholar 

  17. Li, Y., Bu, R., Sun, M., et al.: PointCNN: convolution on x-transformed points. In: Advances in Neural Information Processing Systems 2018, LNCS, pp. 1–13. Springer, Heidelberg (2018)

    Google Scholar 

  18. Peng, X., Long, G., Shen, T., Wang, S., Jiang, J.: Sequential diagnosis prediction with transformer and ontological representation. In: 2021 IEEE International Conference on Data Mining (ICDM), LNCS, pp. 489–498. Springer, Heidelberg (2021)

    Google Scholar 

  19. Peng, X., Long, G., Yan, P., et al.: COVID-19 impact analysis on patients with complex health conditions: a literature review. In: 2023, LNCS, pp. 1–13. Springer, Heidelberg (2023)

    Google Scholar 

  20. Chen, D., et al.: Scalp EEG-based pain detection using convolutional neural network. In: IEEE Transactions on Neural Systems and Rehabilitation Engineering 2022, LNCS, pp. 1–13. Springer, Heidelberg (2022)

    Google Scholar 

  21. Peng, X., Long, G., Shen, T., Wang, S., Jiang, J., Zhang, C.: BiteNet: bidirectional temporal encoder network to predict medical outcomes. In: 2020 IEEE International Conference on Data Mining (ICDM), LNCS, pp. 1–13. Springer, Heidelberg (2020)

    Google Scholar 

  22. Niu, K., Guo, Z., Peng, X., et al.: P-ResUNet: segmentation of brain tissue with purified residual UNet. In: Computers in Biology and Medicine 2022, LNCS, pp. 1–13. Springer, Heidelberg (2022)

    Google Scholar 

  23. Niu, K., Lu, Y., Peng, X., et al.: Fusion of Sequential Visits and Medical Ontology for Mortality Prediction. In: Journal of Biomedical Informatics 2022, LNCS, pp. 1–13. Springer, Heidelberg (2022)

    Google Scholar 

Download references

Acknowledgements

This work was supported by Young Tech Innovation Leading Talent Program of Ningbo City under Grant No. 2023QL008; Innovation Consortium Program for Green and Efficient Intelligent Appliance of Ningbo City under Grant No. 2022H002; The Industrial Science and Technology Research Project of Henan Province under Grants 232102210088, 232102210125, 222102210024.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yijie Pan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sun, W., Gu, P., Pan, Y., Ma, J., Cui, J., Han, P. (2024). FDA-PointNet++: A Point Cloud Classification Model Based on Fused Downsampling Strategy and Attention Module. In: Huang, DS., Premaratne, P., Yuan, C. (eds) Applied Intelligence. ICAI 2023. Communications in Computer and Information Science, vol 2014. Springer, Singapore. https://doi.org/10.1007/978-981-97-0903-8_24

Download citation

  • DOI: https://doi.org/10.1007/978-981-97-0903-8_24

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-97-0902-1

  • Online ISBN: 978-981-97-0903-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics