Abstract
Integration of multi-omics data is essential for obtaining comprehensive insights into molecular mechanisms of complex diseases. While several methods have been proposed for analyzing multi-omics data in various applications, challenges persist in effectively handling heterogeneous and rich multi-omics data. In this paper, a Sparse Gating Enhanced Graph Convolutional AutoEncoder, named SGEGCAE, is proposed for multi-omics data integration and classification. Specifically, an enhanced graph convolutional autoencoder is developed, which integrates a basic autoencoder with a sparse gating strategy, aiming to combine attribute information with topological structure information of the graph for obtaining more comprehensive feature representations. To address the inherent variability and fluctuations in different omics data quality among samples, true class probability is introduced into the SGEGCAE to acquire reliable classification confidence. Furthermore, a tensor fusion network is designed to explore both inter-omics and intra-omics relationships in the label space to achieve ultimately multi-omics integration and classification. Extensive biomedical classification experiments are carried out on four datasets. In these experiments, the superior performance of the SGEGCAE is clearly validated compared to some state-of-the-art integrative analysis methods, demonstrating that the SGEGCAE might serve as an alternative method for multi-omics data integration and classification. The code and datasets for the SGEGCAE are available online at https://github.com/CDMBlab/SGEGCAE.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Subramanian, I., Verma, S., Kumar, S., Jere, A., Anamika, K.: Multi-omics data integration, interpretation, and its application. Bioinform. Biol. Insights 14, 1177932219899051 (2020)
Zhu, Y.T., Qiu, P., Ji, Y.: TCGA-assembler: open-source software for retrieving and processing TCGA data. Nat. Methods 11(6), 599–600 (2014)
Chai, H., et al.: Integrating multi-omics data through deep learning for accurate cancer prognosis prediction. Comput. Biol. Med. 134, 104481 (2021)
Acharjee, A., Kloosterman, B., Visser, R.G.F. Maliepaard, C.: Integration of multi-omics data for prediction of phenotypic traits using random forest. Bmc Bioinf. 17, 363-373 (2016)
Ma, B.S., et al.: Diagnostic classification of cancers using extreme gradient boosting algorithm and multi-omics data. Comput. Biol. Med. 121, 103761 (2020)
Leng, D.J., et al.: A benchmark study of deep learning-based multi-omics data fusion methods for cancer. Genome Biol. 23(1), 171 (2022)
Yang, Y., Tian, S., Qiu, Y., Zhao, P. Zou, Q.: MDICC: novel method for multi-omics data integration and cancer subtype identification. Brief Bioinform 23(3), bbac132 (2022)
Moon, S., Lee, H.: MOMA: a multi-task attention learning algorithm for multi-omics data interpretation and classification. Bioinformatics 38(8), 2287–2296 (2022)
Han, Z., Yang, F., Huang, J., Zhang, C. Yao, J.: Multimodal dynamics: dynamical fusion for trustworthy multimodal classification. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 20707–20717 (2022)
Zhao, C., et al.: CLCLSA: cross-omics linked embedding with contrastive learning and self attention for integration with incomplete multi-omics data. Comput. Biol. Med. 170, 108058 (2024)
Zhang, S., Tong, H., Xu, J., Maciejewski, R.: Graph convolutional networks: a comprehensive review. Comput. Soc. Netw. 6(1), 1–23 (2019)
Ouyang, D., et al.: Integration of multi-omics data using adaptive graph learning and attention mechanism for patient classification and biomarker identification. Comput. Biol. Med. 164, 107303 (2023)
Wang, T.X. et al.: MOGONET integrates multi-omics data using graph convolutional networks allowing patient classification and biomarker identification. Nature Commun. 12(1), 3445 (2021)
Li, X., et al.: MoGCN: a Multi-omics integration method based on graph convolutional network for cancer subtype analysis. Front. Genet. 13(2), 806842 (2022)
Tu, W. et al.: Deep fusion clustering network. In: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 9978–9987 (2021)
Gan, Y., Huang, X., Zou, G., Zhou, S. Guan, J.: Deep structural clustering for single-cell RNA-seq data jointly through autoencoder and graph neural network. Brief Bioinform 23(2), bbac018 (2022)
Corbière, C., Thome, N., Bar-Hen, A., Cord, M., Pérez, P.: Addressing failure prediction by learning model confidence. Adv. Neural Inf. Process. Syst. 32 (2019)
Na, S.: Tensor fusion network for multimodal sentiment analysis. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp. 1103–1114 (2017)
Acknowledgments
This study was funded by the National Natural Science Foundation of China (61972226 and 62172254).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Shang, J. et al. (2024). SGEGCAE: A Sparse Gating Enhanced Graph Convolutional Autoencoder for Multi-omics Data Integration and Classification. In: Huang, DS., Zhang, Q., Guo, J. (eds) Advanced Intelligent Computing in Bioinformatics. ICIC 2024. Lecture Notes in Computer Science(), vol 14881. Springer, Singapore. https://doi.org/10.1007/978-981-97-5689-6_12
Download citation
DOI: https://doi.org/10.1007/978-981-97-5689-6_12
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-5688-9
Online ISBN: 978-981-97-5689-6
eBook Packages: Computer ScienceComputer Science (R0)