[go: up one dir, main page]

CN111968742A - Cross-modal prediction system and method for lung cancer gene mutation - Google Patents

Cross-modal prediction system and method for lung cancer gene mutation Download PDF

Info

Publication number
CN111968742A
CN111968742A CN202010819881.5A CN202010819881A CN111968742A CN 111968742 A CN111968742 A CN 111968742A CN 202010819881 A CN202010819881 A CN 202010819881A CN 111968742 A CN111968742 A CN 111968742A
Authority
CN
China
Prior art keywords
lung cancer
gene mutation
prediction
real
mutation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010819881.5A
Other languages
Chinese (zh)
Inventor
武春燕
候立坤
佘云浪
陈昶
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Pulmonary Hospital
Original Assignee
Shanghai Pulmonary Hospital
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Pulmonary Hospital filed Critical Shanghai Pulmonary Hospital
Priority to CN202010819881.5A priority Critical patent/CN111968742A/en
Publication of CN111968742A publication Critical patent/CN111968742A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/30ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B15/00ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
    • G16B15/30Drug targeting using structural data; Docking or binding prediction
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B20/00ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
    • G16B20/50Mutagenesis
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B40/00ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B5/00ICT specially adapted for modelling or simulations in systems biology, e.g. gene-regulatory networks, protein interaction networks or metabolic networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Medical Informatics (AREA)
  • Theoretical Computer Science (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Biotechnology (AREA)
  • Data Mining & Analysis (AREA)
  • Biophysics (AREA)
  • Public Health (AREA)
  • Chemical & Material Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Artificial Intelligence (AREA)
  • Epidemiology (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • General Physics & Mathematics (AREA)
  • Analytical Chemistry (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Biomedical Technology (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Pathology (AREA)
  • General Engineering & Computer Science (AREA)
  • Medicinal Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Primary Health Care (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Bioethics (AREA)
  • Software Systems (AREA)
  • Physiology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention provides a cross-modal prediction system and a cross-modal prediction method for lung cancer gene mutation, which relate to the field of neural networks and comprise the following steps: the acquisition module is used for acquiring computed tomography images, digital pathological images, real gene mutation types and real tumor mutation loads of patients before receiving targeted therapy and immunotherapy; the marking module is used for marking the computed tomography image and the digital pathological image to obtain a corresponding marked image; the training module is used for taking the labeled images of computed tomography and digital pathology as input, taking the real gene mutation type and the real tumor mutation load as output, and training to obtain a lung cancer gene mutation prediction model; and the prediction module is used for inputting the computed tomography image and the digital pathological image of the patient to be predicted into the lung cancer gene mutation prediction model to obtain the predicted gene mutation type and the predicted tumor mutation load. The invention can accurately predict the lung cancer gene type and the tumor mutation load, and provide doctors to evaluate the curative effects of targeted therapy and immunotherapy.

Description

一种肺癌基因突变的跨模态预测系统及方法A cross-modal prediction system and method for gene mutation in lung cancer

技术领域technical field

本发明涉及神经网络领域,尤其涉及一种肺癌基因突变的跨模态预测系统及方法。The invention relates to the field of neural networks, in particular to a cross-modality prediction system and method for gene mutation of lung cancer.

背景技术Background technique

肺癌是全球癌症相关死亡的首要病因。近年来,随着分子生物学的发展以及基因测序技术的进步,肺癌治疗已经从基于临床特征,病理分型的传统治疗模式发展到基于基因分子改变的精准治疗时代,驱动基因指导下的靶向治疗和针对免疫检查点的免疫治疗为肺癌患者提供了新的可能。然而,在未经选择的治疗人群中,只有少部分患者能从中获益。因此,准确分析基因突变状态,明确治疗靶点,筛选潜在获益人群对于肺癌的个性化精准治疗至关重要。Lung cancer is the leading cause of cancer-related death worldwide. In recent years, with the development of molecular biology and the advancement of gene sequencing technology, the treatment of lung cancer has developed from the traditional treatment mode based on clinical features and pathological typing to the era of precise treatment based on gene and molecular changes, driving gene-guided targeting Treatment and immunotherapy targeting immune checkpoints offer new possibilities for lung cancer patients. However, in unselected treatment populations, only a minority of patients benefit from it. Therefore, accurate analysis of gene mutation status, clear therapeutic targets, and screening of potential benefit groups are crucial for personalized and precise treatment of lung cancer.

当前,肺癌基因突变的检测主要依靠穿刺活检这一有创性操作,且后续的测序流程需要大量的时间和高昂的费用,无法满足临床大规模推广的需求。此外,由于肺癌的高度异质性,穿刺获取的少量标本难以反映肿瘤整体特征,无法实现对肿瘤基因突变状态的全面准确评估。At present, the detection of lung cancer gene mutation mainly relies on the invasive operation of needle biopsy, and the subsequent sequencing process requires a lot of time and high cost, which cannot meet the needs of large-scale clinical promotion. In addition, due to the high heterogeneity of lung cancer, a small number of specimens obtained by puncture cannot reflect the overall characteristics of the tumor, and cannot achieve a comprehensive and accurate assessment of tumor gene mutation status.

基于患者CT图像的影像组学模型可预测肺癌驱动基因突变,但多采用单一模态、单一时间序列的肿瘤影像数据,无法反映肿瘤及其微环境动态变化过程、无法解决肿瘤影像特点的地区差异性的问题;此外,该模型局限于肺癌的基因突变预测,未能评估靶向治疗与免疫治疗的生存获益,对肺癌患者精准治疗的指导价值有限在临床实际应用中具有较大的局限性。Radiomics models based on patient CT images can predict lung cancer driver gene mutations, but most of them use a single modality and a single time series of tumor imaging data, which cannot reflect the dynamic change process of tumors and their microenvironment, and cannot resolve regional differences in tumor imaging characteristics. In addition, the model is limited to gene mutation prediction of lung cancer, fails to evaluate the survival benefit of targeted therapy and immunotherapy, and has limited guiding value for precise treatment of lung cancer patients, which has great limitations in clinical application. .

因此,如何无创高效地分析基因突变特征,从而准确动态地指导靶向治疗与免疫治疗策略,是肺癌精准治疗领域中亟待解决的难题。Therefore, how to non-invasively and efficiently analyze gene mutation characteristics, so as to accurately and dynamically guide targeted therapy and immunotherapy strategies, is an urgent problem to be solved in the field of lung cancer precision therapy.

发明内容SUMMARY OF THE INVENTION

针对现有技术中存在的问题,本发明提供一种肺癌基因突变的跨模态预测系统,具体包括:In view of the problems existing in the prior art, the present invention provides a cross-modal prediction system for lung cancer gene mutation, which specifically includes:

采集模块,用于分别采集接受靶向治疗和免疫治疗前的若干肺癌患者的计算机断层扫描影像,数字病理图像,真实基因突变类型和以及每种所述真实基因突变类型对应的真实肿瘤突变负荷;an acquisition module, used for respectively acquiring computed tomography images, digital pathology images, real gene mutation types and real tumor mutation loads corresponding to each of the real gene mutation types of several lung cancer patients before receiving targeted therapy and immunotherapy;

标注模块,连接所述采集模块,用于分别对所述计算机断层扫描影像和所述数字病理图像中的病灶区域进行标注得到计算机断层扫描标注图像以及数字病理标注图像;a labeling module, connected to the acquisition module, for labeling the lesion area in the computed tomography image and the digital pathology image respectively to obtain a computed tomography labeling image and a digital pathology labeling image;

训练模块,分别连接所述采集模块和所述标注模块,用于将各所述肺癌患者的所述计算机断层扫描标注图像和所述数字病理标注图像作为输入,将对应的所述真实基因突变类型和所述真实肿瘤突变负荷作为输出,训练得到肺癌基因突变预测模型;A training module, which is respectively connected to the acquisition module and the labeling module, is configured to use the CT annotated image and the digital pathology annotated image of each lung cancer patient as input, and use the corresponding real gene mutation type as input. and the real tumor mutation load as output, and train a lung cancer gene mutation prediction model;

预测模块,连接所述训练模块,用于将待预测肺癌患者的所述计算机断层扫描影像和所述数字病理图像输入到所述肺癌基因突变预测模型得到所述待预测肺癌患者的预测基因突变类型以及对应的预测肿瘤突变负荷,作为医生给出靶向治疗以及免疫治疗意见的参考数据。A prediction module, connected to the training module, for inputting the computed tomography image and the digital pathological image of the lung cancer patient to be predicted into the lung cancer gene mutation prediction model to obtain the predicted gene mutation type of the lung cancer patient to be predicted And the corresponding predicted tumor mutation load, as reference data for doctors to give targeted therapy and immunotherapy opinions.

优选的,所述采集模块包括:Preferably, the collection module includes:

第一采集单元,用于采集各所述肺癌患者的所述计算机断层扫描影像;a first acquisition unit, configured to acquire the computed tomography images of each of the lung cancer patients;

第二采集单元,用于获取各所述肺癌患者的病理切片,并对所述病理切片进行扫描得到所述数字病理图像;a second acquisition unit, configured to acquire a pathological slice of each of the lung cancer patients, and scan the pathological slice to obtain the digital pathological image;

第三采集单元,用于对各所述肺癌患者进行基因测序得到所述真实基因突变类型和以及每种所述真实基因突变类型对应的所述真实肿瘤突变负荷。A third collection unit, configured to perform gene sequencing on each of the lung cancer patients to obtain the real gene mutation type and the real tumor mutation load corresponding to each of the real gene mutation types.

优选的,所述训练模块包括:Preferably, the training module includes:

分组单元,用于将每个所述肺癌患者的所述计算机断层扫描标注图像、所述数字病理标注图像、所述真实基因突变类型和所述真实肿瘤突变负荷作为一数据集合,并根据各所述数据集合形成一训练集和一验证集;The grouping unit is configured to use the computed tomography annotated image, the digital pathology annotated image, the real gene mutation type and the real tumor mutation load of each lung cancer patient as a data set, and according to each The data set forms a training set and a validation set;

训练单元,连接所述分组单元,用于将所述训练集中的所述计算机断层扫描标注图像和所述数字病理标注图像作为输入,将所述真实基因突变类型和所述真实肿瘤突变负荷作为输出,训练得到一肺癌基因突变初始模型;A training unit, connected to the grouping unit, configured to take the CT annotated image and the digital pathology annotated image in the training set as input, and take the real gene mutation type and the real tumor mutation load as output , training to obtain an initial model of lung cancer gene mutation;

验证单元,分别连接所述分组单元和所述训练单元,用于根据所述验证集对所述肺癌基因突变初始模型进行参数优化调整,得到所述肺癌基因突变预测模型。A verification unit, which is respectively connected to the grouping unit and the training unit, is configured to perform parameter optimization and adjustment on the initial model of lung cancer gene mutation according to the verification set to obtain the lung cancer gene mutation prediction model.

优选的,所述训练模块还包括一测试单元,分别连接所述分组单元和所述验证单元,所述测试单元包括:Preferably, the training module further includes a test unit, which is respectively connected to the grouping unit and the verification unit, and the test unit includes:

第一子单元,用于根据各所述数据集合形成一测试集;a first subunit, configured to form a test set according to each of the data sets;

第二子单元,连接所述第一子单元,用于将所述测试集中的各所述计算机断层扫描标注图像和所述数字病理标注图像输入到所述肺癌基因突变预测模型得到相应的基因突变类型预测结果和肿瘤突变负荷预测结果;The second subunit is connected to the first subunit, and is used for inputting each of the CT annotated images and the digital pathology annotated images in the test set into the lung cancer gene mutation prediction model to obtain corresponding gene mutations Type prediction results and tumor mutation burden prediction results;

第三子单元,连接所述第二子单元,用于根据各所述基因突变类型预测结果与对应的所述真实基因突变类型处理得到一第一预测准确率,以及根据各所述肿瘤突变负荷预测结果与对应的所述真实肿瘤突变负荷处理得到一第二预测准确率,作为医生给出靶向治疗以及免疫治疗意见的参考数据。The third subunit is connected to the second subunit, and is configured to process a first prediction accuracy rate according to the prediction result of each gene mutation type and the corresponding real gene mutation type, and obtain a first prediction accuracy rate according to each of the tumor mutation loads The prediction result and the corresponding real tumor mutation load are processed to obtain a second prediction accuracy rate, which is used as reference data for doctors to give advice on targeted therapy and immunotherapy.

优选的,采用受试者工作特性曲线及曲线下面积处理得到所述第一预测准确率和所述第二预测准确率。Preferably, the receiver operating characteristic curve and the area under the curve are processed to obtain the first prediction accuracy rate and the second prediction accuracy rate.

优选的,将所有所述数据集合按照预设比例划分形成所述训练集,所述验证集和所述测试集,所述预设比例为3:1:1。Preferably, all the data sets are divided according to a preset ratio to form the training set, the verification set and the test set, and the preset ratio is 3:1:1.

一种肺癌基因突变的跨模态预测方法,应用于上述肺癌基因突变的跨模态预测系统,所述肺癌基因突变的跨模态预测方法具体包括以下步骤:A cross-modal prediction method for lung cancer gene mutation is applied to the above-mentioned cross-modal prediction system for lung cancer gene mutation, and the cross-modal prediction method for lung cancer gene mutation specifically includes the following steps:

步骤S1,所述的肺癌基因突变的跨模态预测系统分别采集接受靶向治疗和免疫治疗前的若干肺癌患者的计算机断层扫描影像,数字病理图像,真实基因突变类型和以及每种所述真实基因突变类型对应的真实肿瘤突变负荷;Step S1, the cross-modality prediction system for lung cancer gene mutation respectively collects computed tomography images, digital pathology images, real gene mutation types and each type of real gene mutation before receiving targeted therapy and immunotherapy. The true tumor mutation burden corresponding to the gene mutation type;

步骤S2,所述的肺癌基因突变的跨模态预测系统分别对所述计算机断层扫描影像和所述数字病理图像中的病灶区域进行标注得到计算机断层扫描标注图像以及数字病理标注图像;Step S2, the cross-modality prediction system for lung cancer gene mutation respectively annotates the lesion area in the computed tomography image and the digital pathology image to obtain a computed tomography annotated image and a digital pathology annotated image;

步骤S3,所述的肺癌基因突变的跨模态预测系统将各所述肺癌患者的所述计算机断层扫描标注图像和所述数字病理标注图像作为输入,将对应的所述真实基因突变类型和所述真实肿瘤突变负荷作为输出,训练得到肺癌基因突变预测模型;Step S3, the cross-modal prediction system for lung cancer gene mutation takes the CT annotated image and the digital pathology annotated image of each lung cancer patient as input, and uses the corresponding real gene mutation type and all The real tumor mutation load is used as the output, and the lung cancer gene mutation prediction model is obtained by training;

步骤S4,所述的肺癌基因突变的跨模态预测系统将待预测肺癌患者的所述计算机断层扫描影像和所述数字病理图像输入到所述肺癌基因突变预测模型得到所述待预测肺癌患者的预测基因突变类型以及对应的预测肿瘤突变负荷,作为医生给出靶向治疗以及免疫治疗意见的参考数据。Step S4, the cross-modal prediction system for lung cancer gene mutation inputs the computed tomography image and the digital pathology image of the lung cancer patient to be predicted into the lung cancer gene mutation prediction model to obtain the lung cancer patient to be predicted. The predicted gene mutation type and the corresponding predicted tumor mutation load are used as reference data for doctors to give targeted therapy and immunotherapy opinions.

优选的,所述步骤S1包括:Preferably, the step S1 includes:

步骤S11,所述肺癌基因突变的跨模态预测系统采集各所述肺癌患者的所述计算机断层扫描影像;Step S11, the cross-modality prediction system for lung cancer gene mutation collects the computed tomography images of each of the lung cancer patients;

步骤S12,所述肺癌基因突变的跨模态预测系统获取各所述肺癌患者的病理切片,并对所述病理切片进行扫描得到所述数字病理图像;Step S12, the cross-modality prediction system for lung cancer gene mutation obtains the pathological slices of each lung cancer patient, and scans the pathological slices to obtain the digital pathological image;

步骤S13,所述肺癌基因突变的跨模态预测系统对各所述肺癌患者进行基因测序得到所述真实基因突变类型和以及每种所述真实基因突变类型对应的所述真实肿瘤突变负荷。Step S13, the cross-modal prediction system for lung cancer gene mutation performs gene sequencing on each of the lung cancer patients to obtain the true gene mutation type and the true tumor mutation load corresponding to each true gene mutation type.

优选的,所述步骤S3包括:Preferably, the step S3 includes:

步骤S31,所述肺癌基因突变的跨模态预测系统将每个所述肺癌患者的所述计算机断层扫描标注图像、所述数字病理标注图像、所述真实基因突变类型和所述真实肿瘤突变负荷作为一数据集合,并根据各所述数据集合形成一训练集和一验证集;Step S31, the cross-modality prediction system for lung cancer gene mutation annotates the computed tomography scan image, the digital pathology annotated image, the real gene mutation type and the real tumor mutation load of each lung cancer patient. as a data set, and form a training set and a verification set according to each of the data sets;

步骤S32,所述肺癌基因突变的跨模态预测系统将所述训练集中的所述计算机断层扫描标注图像和所述数字病理标注图像作为输入,将所述真实基因突变类型和所述真实肿瘤突变负荷作为输出,训练得到一肺癌基因突变初始模型;Step S32, the cross-modality prediction system for lung cancer gene mutation uses the computed tomography annotated image and the digital pathology annotated image in the training set as input, and uses the real gene mutation type and the real tumor mutation as input. The load is used as the output, and an initial model of lung cancer gene mutation is obtained by training;

步骤S33,所述肺癌基因突变的跨模态预测系统根据所述验证集对所述肺癌基因突变初始模型进行参数优化调整,得到所述肺癌基因突变预测模型。Step S33, the cross-modal prediction system for lung cancer gene mutation performs parameter optimization and adjustment on the initial model of lung cancer gene mutation according to the validation set, to obtain the lung cancer gene mutation prediction model.

优选的,所述步骤S3还包括一获取模型预测准确率的过程,具体包括:Preferably, the step S3 further includes a process of obtaining the model prediction accuracy, which specifically includes:

步骤A1,所述肺癌基因突变的跨模态预测系统根据各所述数据集合形成一测试集;Step A1, the cross-modal prediction system for lung cancer gene mutation forms a test set according to each of the data sets;

步骤A2,所述肺癌基因突变的跨模态预测系统将所述测试集中的各所述计算机断层扫描标注图像和所述数字病理标注图像输入到所述肺癌基因突变预测模型得到相应的基因突变类型预测结果和肿瘤突变负荷预测结果;Step A2, the cross-modality prediction system for lung cancer gene mutation inputs each of the CT annotated images and the digital pathology annotated images in the test set into the lung cancer gene mutation prediction model to obtain the corresponding gene mutation type prediction results and tumor mutation burden prediction results;

步骤A3,所述肺癌基因突变的跨模态预测系统根据各所述基因突变类型预测结果与对应的所述真实基因突变类型处理得到一第一预测准确率,以及根据各所述肿瘤突变负荷预测结果与对应的所述真实肿瘤突变负荷处理得到一第二预测准确率,作为医生给出靶向治疗以及免疫治疗意见的参考数据。Step A3, the cross-modal prediction system for lung cancer gene mutation obtains a first prediction accuracy rate according to the prediction result of each gene mutation type and the corresponding real gene mutation type, and predicts the tumor mutation load according to each The result is processed with the corresponding real tumor mutation load to obtain a second prediction accuracy rate, which is used as reference data for doctors to give targeted therapy and immunotherapy opinions.

上述技术方案具有如下优点或有益效果:The above-mentioned technical scheme has the following advantages or beneficial effects:

本方案融合了影像组学与病理组学实现跨模态预测,通过肺癌基因突变预测模型实现精准预测突变基因类型和肿瘤突变负荷,医生根据预测突变基因类型和预测肿瘤突变负荷评估靶向治疗和免疫治疗疗效,进而为肺癌患者的精准治疗策略提供个性化指导。This solution integrates radiomics and pathomics to achieve cross-modal prediction, and achieves accurate prediction of mutated genotype and tumor mutation load through a lung cancer gene mutation prediction model. The efficacy of immunotherapy can provide personalized guidance for precise treatment strategies for lung cancer patients.

附图说明Description of drawings

图1为本发明的较佳的实施例中,一种肺癌基因突变的跨模态预测系统的结构示意图;1 is a schematic structural diagram of a cross-modality prediction system for lung cancer gene mutation in a preferred embodiment of the present invention;

图2为本发明的较佳的实施例中,一种肺癌基因突变的跨模态预测方法的流程示意图;2 is a schematic flowchart of a method for cross-modality prediction of lung cancer gene mutation in a preferred embodiment of the present invention;

图3为本发明的较佳的实施例中,一种肺癌基因突变的跨模态预测方法的数据采集过程的流程示意图;3 is a schematic flowchart of a data collection process of a method for cross-modality prediction of lung cancer gene mutation in a preferred embodiment of the present invention;

图4为本发明的较佳的实施例中,一种肺癌基因突变的跨模态预测方法的模型训练过程的流程示意图;4 is a schematic flowchart of a model training process of a method for cross-modality prediction of lung cancer gene mutation in a preferred embodiment of the present invention;

图5为本发明的较佳的实施例中,一种肺癌基因突变的跨模态预测方法的获取模型准确率过程的流程示意图;5 is a schematic flowchart of a process of obtaining model accuracy of a method for cross-modal prediction of lung cancer gene mutation in a preferred embodiment of the present invention;

具体实施方式Detailed ways

下面结合附图和具体实施例对本发明进行详细说明。本发明并不限定于该实施方式,只要符合本发明的主旨,则其他实施方式也可以属于本发明的范畴。The present invention will be described in detail below with reference to the accompanying drawings and specific embodiments. The present invention is not limited to this embodiment, and other embodiments may belong to the scope of the present invention as long as it conforms to the gist of the present invention.

本发明的较佳的实施例中,基于现有技术中存在的上述问题,现提供一种肺癌基因突变的跨模态预测系统,如图1所示,具体包括:In a preferred embodiment of the present invention, based on the above-mentioned problems in the prior art, a cross-modal prediction system for lung cancer gene mutation is provided, as shown in Figure 1, which specifically includes:

采集模块1,用于分别采集接受靶向治疗和免疫治疗前的若干肺癌患者的计算机断层扫描影像,数字病理图像,真实基因突变类型和以及每种真实基因突变类型对应的真实肿瘤突变负荷;The acquisition module 1 is used to respectively acquire the computed tomography images, digital pathology images, real gene mutation types and real tumor mutation loads corresponding to each real gene mutation type of several lung cancer patients before receiving targeted therapy and immunotherapy;

标注模块2,连接采集模块1,用于分别对计算机断层扫描影像和数字病理图像中的病灶区域进行标注得到计算机断层扫描标注图像以及数字病理标注图像;The labeling module 2 is connected to the acquisition module 1, and is used for labeling the lesion area in the computed tomography image and the digital pathology image respectively to obtain the computed tomography scanning image and the digital pathology labeling image;

训练模块3,分别连接采集模块1和标注模块2,用于将各肺癌患者的计算机断层扫描标注图像和数字病理标注图像作为输入,将对应的真实基因突变类型和真实肿瘤突变负荷作为输出,训练得到肺癌基因突变预测模型;The training module 3, which is connected to the acquisition module 1 and the labeling module 2 respectively, is used to take the CT annotated images and digital pathology annotated images of each lung cancer patient as input, and the corresponding real gene mutation type and real tumor mutation load as the output. Obtain lung cancer gene mutation prediction model;

预测模块4,连接训练模块3,用于将待预测肺癌患者的计算机断层扫描影像和数字病理图像输入到肺癌基因突变预测模型得到待预测肺癌患者的预测基因突变类型以及对应的预测肿瘤突变负荷,作为医生给出靶向治疗以及免疫治疗意见的参考数据。The prediction module 4 is connected to the training module 3, and is used for inputting the computed tomography images and digital pathology images of the lung cancer patients to be predicted into the lung cancer gene mutation prediction model to obtain the predicted gene mutation types of the lung cancer patients to be predicted and the corresponding predicted tumor mutation load, As a reference data for doctors to give targeted therapy and immunotherapy opinions.

具体地,本实施例中,纳入325名上海市肺科医院接受靶向治疗或免疫治疗并进行肺癌基因类型和肿瘤突变负荷检测的肺癌患者,通过采集模块1采集这些肺癌患者的计算机断层扫描影像,数字病理图像,真实基因突变值和真实肿瘤突变负荷;并通过标注模块2对分别对计算机断层扫描影像和数字病理图像中的病灶区域进行标注得到计算机断层扫描标注图像以及数字病理标注图像,进而将计算机断层扫描标注图像以及数字病理标注图像作为训练模块3中的训练模型的输入,将对应的真实基因突变类型和真实肿瘤突变负荷作为训练模型的输出,进而训练得到肺癌基因突变预测模型,本实施例通过获取计算机断层扫描影像和数字病理图像实现跨模态预测,有效提升肺癌基因突变预测模型的预测准确率;通过预测模块4将待预测肺癌患者的计算机断层扫描影像和数字病理图像输入到肺癌基因突变预测模型得到待预测肺癌患者的预测基因突变类型以及对应的预测肿瘤突变负荷;预测基因突变类型用于表征靶向治疗的有效性,预测肿瘤突变负荷用于表征免疫治疗的有效性,因此医生能够根据预测基因突变类型和预测肿瘤突变负荷给出治疗意见。Specifically, in this example, 325 lung cancer patients who received targeted therapy or immunotherapy in Shanghai Pulmonary Hospital and underwent lung cancer genotype and tumor mutation load detection were included, and the computed tomography images of these lung cancer patients were collected through acquisition module 1 , digital pathology image, real gene mutation value and real tumor mutation load; and through the annotation module 2, the lesion area in the computed tomography image and the digital pathology image are respectively annotated to obtain the computed tomography annotated image and the digital pathology annotated image, and then The annotated images of computed tomography and digital pathology are used as the input of the training model in training module 3, and the corresponding real gene mutation type and real tumor mutation load are used as the output of the training model, and then the lung cancer gene mutation prediction model is obtained by training. In the embodiment, cross-modal prediction is realized by acquiring computed tomography images and digital pathology images, which effectively improves the prediction accuracy of the lung cancer gene mutation prediction model; the computed tomography images and digital pathology images of lung cancer patients to be predicted are input into the prediction module 4. The lung cancer gene mutation prediction model obtains the predicted gene mutation type and the corresponding predicted tumor mutation load of the lung cancer patients to be predicted; the predicted gene mutation type is used to characterize the effectiveness of targeted therapy, and the predicted tumor mutation load is used to characterize the effectiveness of immunotherapy. Therefore, doctors can give treatment advice based on the predicted gene mutation type and predicted tumor mutation burden.

可以通过对接受靶向治疗和免疫治疗的患者进行生存分析对模型的效能进行外部验证:根据待预测肺癌患者的预测基因突变类型以及对应的预测肿瘤突变负荷通过深度学习算法计算得到待预测肺癌患者接受靶向治疗获益评分和免疫治疗获益评分;并根据待预测肺癌患者接受靶向治疗获益评分的中位值对待预测患者进行治疗获益分层,将靶向治疗获益评分高于中位值的患者划分入靶向治疗高分组,将靶向治疗获益评分不高于中位值的患者划分入靶向治疗低分组;以及根据待预测肺癌患者接受免疫治疗获益评分的中位值对待预测患者进行治疗获益分层,将免疫治疗获益评分高于中位值的患者划分入免疫治疗高分组,将免疫治疗获益评分不高于中位值的患者划分入免疫治疗低分组;并通过K-M生存分析和Cox回归分析分别对靶向治疗高分组、靶向治疗低分组、免疫治疗高分组和免疫治疗低分组患者的总体生存和无进展生存进行统计分析,结果表明靶向治疗高分组和免疫治疗高分组的患者具有更好的总体生存和无进展生存,进而表明肺癌基因突变预测模型的预测有效性。The efficacy of the model can be externally verified by performing survival analysis on patients receiving targeted therapy and immunotherapy: according to the predicted gene mutation type and the corresponding predicted tumor mutation load of the lung cancer patients to be predicted, the lung cancer patients to be predicted can be calculated by deep learning algorithms Targeted therapy benefit score and immunotherapy benefit score; and according to the median value of the target therapy benefit score for lung cancer patients to be predicted, the treatment benefit stratification of the patients to be predicted, the target therapy benefit score is higher than the target therapy benefit score. The patients with the median value were divided into the high targeted therapy group, the patients with targeted therapy benefit scores not higher than the median value were divided into the targeted therapy low group; The median value is used to stratify the treatment benefit of the patients to be predicted, and the patients with the immunotherapy benefit score higher than the median value are divided into the high immunotherapy group, and the patients with the immunotherapy benefit score not higher than the median value are divided into the immunotherapy treatment group. Low grouping; K-M survival analysis and Cox regression analysis were used to analyze the overall survival and progression-free survival of patients with targeted therapy high grouping, targeted therapy low grouping, immunotherapy high grouping and immunotherapy low grouping patients respectively. Patients with high allocation to treatment and high allocation to immunotherapy had better overall survival and progression-free survival, thus demonstrating the predictive validity of the lung cancer gene mutation prediction model.

本发明的较佳的实施例中,采集模块1包括:In a preferred embodiment of the present invention, the collection module 1 includes:

第一采集单元11,用于采集各肺癌患者的计算机断层扫描影像;The first acquisition unit 11 is used for acquiring computed tomography images of each lung cancer patient;

第二采集单元12,用于获取各肺癌患者的病理切片,并对病理切片进行扫描得到数字病理图像;The second acquisition unit 12 is configured to acquire pathological slices of each lung cancer patient, and scan the pathological slices to obtain digital pathological images;

第三采集单元13,用于对各肺癌患者进行基因测序得到真实基因突变类型和以及每种真实基因突变类型对应的真实肿瘤突变负荷。The third collection unit 13 is configured to perform gene sequencing on each lung cancer patient to obtain the real gene mutation type and the real tumor mutation load corresponding to each real gene mutation type.

具体地,本实施例中,通过第一采集单元11对各肺癌患者的计算机断层扫描影像的采集和第二采集单元12对数字病理图像的采集得到肺癌基因突变预测模型的输入;通过第三采集单元13对真实基因突变类型和真实肿瘤突变负荷的采集得到肺癌基因突变预测模型的输出,为肺癌基因突变预测模型的构建打下基础。Specifically, in this embodiment, the first collection unit 11 collects the computed tomography images of each lung cancer patient and the second collection unit 12 collects the digital pathological images to obtain the input of the lung cancer gene mutation prediction model; through the third collection Unit 13 collects the real gene mutation type and the real tumor mutation load to obtain the output of the lung cancer gene mutation prediction model, which lays the foundation for the construction of the lung cancer gene mutation prediction model.

本发明的较佳的实施例中,训练模块3包括:In a preferred embodiment of the present invention, the training module 3 includes:

分组单元31,用于将每个肺癌患者的计算机断层扫描标注图像、数字病理标注图像、真实基因突变类型和真实肿瘤突变负荷作为一数据集合,并根据各数据集合形成一训练集和一验证集;The grouping unit 31 is used to use the computed tomography annotated image, digital pathology annotated image, real gene mutation type and real tumor mutation load of each lung cancer patient as a data set, and form a training set and a validation set according to each data set ;

训练单元32,连接分组单元31,用于将训练集中的计算机断层扫描标注图像和数字病理标注图像作为输入,将真实基因突变类型和真实肿瘤突变负荷作为输出,训练得到一肺癌基因突变初始模型;The training unit 32 is connected to the grouping unit 31, and is used for taking the CT annotated image and digital pathology annotated image in the training set as input, and using the real gene mutation type and the real tumor mutation load as the output, and training to obtain an initial model of lung cancer gene mutation;

验证单元33,分别连接分组单元31和训练单元32,用于根据验证集对肺癌基因突变初始模型进行参数优化调整,得到肺癌基因突变预测模型。The verification unit 33 is respectively connected to the grouping unit 31 and the training unit 32, and is configured to optimize and adjust the parameters of the initial model of the lung cancer gene mutation according to the verification set, so as to obtain the prediction model of the lung cancer gene mutation.

具体地,本实施例中,通过设置分组单元31对将每个肺癌患者的计算机断层扫描标注图像、数字病理标注图像、真实基因突变类型和真实肿瘤突变负荷合并到一个数据集合,并将部分数据集合中的数据划分到训练集和验证集内;通过训练单元32根据训练集中的数据训练得到肺癌基因突变初始模型,并通过验证单元33根据验证集中的数据对肺癌基因突变初始模型进行参数优化调整,得到肺癌基因突变预测模型。Specifically, in this embodiment, the grouping unit 31 is set to combine the computed tomography annotated image, digital pathology annotated image, real gene mutation type and real tumor mutation load of each lung cancer patient into one data set, and some data are combined into one data set. The data in the set is divided into a training set and a verification set; the initial model of lung cancer gene mutation is obtained by training according to the data in the training set by the training unit 32, and the parameters of the initial model of lung cancer gene mutation are optimized and adjusted by the verification unit 33 according to the data in the verification set , to obtain a lung cancer gene mutation prediction model.

本发明的较佳的实施例中,训练模块3还包括一测试单元34,分别连接分组单元31和验证单元33,测试单元34包括:In a preferred embodiment of the present invention, the training module 3 further includes a test unit 34, which is respectively connected to the grouping unit 31 and the verification unit 33, and the test unit 34 includes:

第一子单元341,用于根据各数据集合形成一测试集;The first subunit 341 is used to form a test set according to each data set;

第二子单元342,连接第一子单元341,用于将测试集中的各计算机断层扫描标注图像和数字病理标注图像输入到肺癌基因突变预测模型得到相应的基因突变类型预测结果和肿瘤突变负荷预测结果;The second subunit 342 is connected to the first subunit 341, and is used for inputting each CT scan annotated image and digital pathology annotated image in the test set into the lung cancer gene mutation prediction model to obtain the corresponding gene mutation type prediction result and tumor mutation load prediction result;

第三子单元343,连接第二子单元342,用于根据各基因突变类型预测结果与对应的真实基因突变类型处理得到一第一预测准确率,以及根据各肿瘤突变负荷预测结果与对应的真实肿瘤突变负荷处理得到一第二预测准确率,作为医生给出靶向治疗以及免疫治疗意见的参考数据。The third subunit 343 is connected to the second subunit 342, and is configured to obtain a first prediction accuracy rate according to the prediction results of each gene mutation type and the corresponding real gene mutation type, and to obtain a first prediction accuracy rate according to the prediction results of each gene mutation type and the corresponding real gene mutation load prediction results. The tumor mutation burden is processed to obtain a second prediction accuracy rate, which is used as reference data for doctors to give targeted therapy and immunotherapy opinions.

本发明的较佳的实施例中,采用受试者工作特性曲线及曲线下面积处理得到第一预测准确率和第二预测准确率。In a preferred embodiment of the present invention, the receiver operating characteristic curve and the area under the curve are used to obtain the first prediction accuracy rate and the second prediction accuracy rate.

具体地,本实施例中,通过采用受试者工作特性曲线及曲线下面积对第一准确率和第二准确率进行评价,采用受试者工作特性曲线及曲线下面积能简单、直观地通过图示观察分析第一准确率和第二准确率,并可用肉眼作出判断;受试者工作特性曲线将灵敏度与特异性以图示方法结合在一起,可准确反映特异性和敏感性的关系,使得对第一准确率和第二准确率进行评价更加客观准确。Specifically, in this embodiment, the receiver operating characteristic curve and the area under the curve are used to evaluate the first accuracy rate and the second accuracy rate, and the receiver operating characteristic curve and the area under the curve can be used to easily and intuitively pass Graphical observation and analysis of the first accuracy rate and second accuracy rate can be used to make judgments with the naked eye; the receiver operating characteristic curve combines sensitivity and specificity in a graphical way, which can accurately reflect the relationship between specificity and sensitivity. This makes it more objective and accurate to evaluate the first accuracy rate and the second accuracy rate.

本发明的较佳的实施例中,将所有数据集合按照预设比例划分形成训练集,验证集和测试集,预设比例为3:1:1。In a preferred embodiment of the present invention, all data sets are divided according to a preset ratio to form a training set, a verification set and a test set, and the preset ratio is 3:1:1.

具体地,本实施例中,其中训练集包括195名接受检查的患者的数据,验证集和测试集均包括65名接受检查的患者的数据。Specifically, in this embodiment, the training set includes data of 195 patients undergoing examination, and both the validation set and the test set include data of 65 patients undergoing examination.

一种肺癌基因突变的跨模态预测方法,应用于上述肺癌基因突变的跨模态预测系统,如图2所示,肺癌基因突变的跨模态预测方法具体包括以下步骤:A cross-modal prediction method for lung cancer gene mutation is applied to the above-mentioned cross-modal prediction system for lung cancer gene mutation. As shown in Figure 2, the cross-modal prediction method for lung cancer gene mutation specifically includes the following steps:

步骤S1,肺癌基因突变的跨模态预测系统分别采集接受靶向治疗和免疫治疗前的若干肺癌患者的计算机断层扫描影像,数字病理图像,真实基因突变类型和以及每种真实基因突变类型对应的真实肿瘤突变负荷;Step S1, the cross-modality prediction system for lung cancer gene mutation collects computed tomography images, digital pathology images, real gene mutation types, and the corresponding data of each real gene mutation type of several lung cancer patients before receiving targeted therapy and immunotherapy, respectively. True tumor mutational burden;

步骤S2,肺癌基因突变的跨模态预测系统分别对计算机断层扫描影像和数字病理图像中的病灶区域进行标注得到计算机断层扫描标注图像以及数字病理标注图像;Step S2, the cross-modality prediction system for lung cancer gene mutation respectively annotates the lesion area in the computed tomography image and the digital pathology image to obtain the computed tomography annotated image and the digital pathology annotated image;

步骤S3,肺癌基因突变的跨模态预测系统将各肺癌患者的计算机断层扫描标注图像和数字病理标注图像作为输入,将对应的真实基因突变类型和真实肿瘤突变负荷作为输出,训练得到肺癌基因突变预测模型;Step S3, the cross-modality prediction system for lung cancer gene mutation takes the computed tomography annotated image and digital pathology annotated image of each lung cancer patient as input, and uses the corresponding real gene mutation type and real tumor mutation load as output, and trains to obtain lung cancer gene mutation prediction model;

步骤S4,肺癌基因突变的跨模态预测系统将待预测肺癌患者的计算机断层扫描影像和数字病理图像输入到肺癌基因突变预测模型得到待预测肺癌患者的预测基因突变类型以及对应的预测肿瘤突变负荷,作为医生给出靶向治疗以及免疫治疗意见的参考数据。Step S4, the cross-modality prediction system for lung cancer gene mutation inputs the computed tomography image and digital pathology image of the lung cancer patient to be predicted into the lung cancer gene mutation prediction model to obtain the predicted gene mutation type of the lung cancer patient to be predicted and the corresponding predicted tumor mutation load , as reference data for doctors to give targeted therapy and immunotherapy opinions.

本发明的较佳的实施例中,如图3所示,步骤S1包括:In a preferred embodiment of the present invention, as shown in FIG. 3 , step S1 includes:

步骤S11,肺癌基因突变的跨模态预测系统采集各肺癌患者的计算机断层扫描影像;Step S11, the cross-modality prediction system for lung cancer gene mutation collects computed tomography images of each lung cancer patient;

步骤S12,肺癌基因突变的跨模态预测系统获取各肺癌患者的病理切片,并对病理切片进行扫描得到数字病理图像;Step S12, the cross-modality prediction system for lung cancer gene mutation obtains the pathological slices of each lung cancer patient, and scans the pathological slices to obtain digital pathological images;

步骤S13,肺癌基因突变的跨模态预测系统对各肺癌患者进行基因测序得到真实基因突变类型和以及每种真实基因突变类型对应的真实肿瘤突变负荷。Step S13, the cross-modal prediction system for lung cancer gene mutation performs gene sequencing on each lung cancer patient to obtain the real gene mutation type and the real tumor mutation load corresponding to each real gene mutation type.

本发明的较佳的实施例中,如图4所示,步骤S3包括:In a preferred embodiment of the present invention, as shown in FIG. 4 , step S3 includes:

步骤S31,肺癌基因突变的跨模态预测系统将每个肺癌患者的计算机断层扫描标注图像、数字病理标注图像、真实基因突变类型和真实肿瘤突变负荷作为一数据集合,并根据各数据集合形成一训练集和一验证集;Step S31, the cross-modality prediction system for lung cancer gene mutation takes the CT annotated image, digital pathology annotated image, real gene mutation type and real tumor mutation load of each lung cancer patient as a data set, and forms a data set according to each data set. training set and a validation set;

步骤S32,肺癌基因突变的跨模态预测系统将训练集中的计算机断层扫描标注图像和数字病理标注图像作为输入,将真实基因突变类型和真实肿瘤突变负荷作为输出,训练得到一肺癌基因突变初始模型;Step S32, the cross-modality prediction system for lung cancer gene mutation takes the CT annotated image and digital pathology annotated image in the training set as input, and uses the real gene mutation type and real tumor mutation load as output, and trains to obtain an initial model of lung cancer gene mutation ;

步骤S33,肺癌基因突变的跨模态预测系统根据验证集对肺癌基因突变初始模型进行参数优化调整,得到肺癌基因突变预测模型。Step S33, the cross-modality prediction system for lung cancer gene mutation optimizes and adjusts the parameters of the initial lung cancer gene mutation model according to the validation set to obtain a lung cancer gene mutation prediction model.

本发明的较佳的实施例中,步骤S3还包括一获取模型预测准确率的过程,如图5所示,具体包括:In a preferred embodiment of the present invention, step S3 further includes a process of obtaining the model prediction accuracy, as shown in FIG. 5 , specifically including:

步骤A1,肺癌基因突变的跨模态预测系统根据各数据集合形成一测试集;Step A1, the cross-modal prediction system for lung cancer gene mutation forms a test set according to each data set;

步骤A2,肺癌基因突变的跨模态预测系统将测试集中的各计算机断层扫描标注图像和数字病理标注图像输入到肺癌基因突变预测模型得到相应的基因突变类型预测结果和肿瘤突变负荷预测结果;Step A2, the cross-modality prediction system for lung cancer gene mutation inputs each CT scan annotated image and digital pathology annotated image in the test set into the lung cancer gene mutation prediction model to obtain the corresponding gene mutation type prediction result and tumor mutation load prediction result;

步骤A3,肺癌基因突变的跨模态预测系统根据各基因突变类型预测结果与对应的真实基因突变类型处理得到一第一预测准确率,以及根据各肿瘤突变负荷预测结果与对应的真实肿瘤突变负荷处理得到一第二预测准确率,作为医生给出靶向治疗以及免疫治疗意见的参考数据。Step A3, the cross-modal prediction system for lung cancer gene mutation processes according to the prediction result of each gene mutation type and the corresponding real gene mutation type to obtain a first prediction accuracy rate, and according to the prediction result of each tumor mutation load and the corresponding real tumor mutation load After processing, a second prediction accuracy rate is obtained, which is used as reference data for doctors to give opinions on targeted therapy and immunotherapy.

以上仅为本发明较佳的实施例,并非因此限制本发明的实施方式及保护范围,对于本领域技术人员而言,应当能够意识到凡运用本说明书及图示内容所作出的等同替换和显而易见的变化所得到的方案,均应当包含在本发明的保护范围内。The above are only preferred embodiments of the present invention, and are not intended to limit the embodiments and protection scope of the present invention. For those skilled in the art, they should be able to realize that all equivalents and obvious substitutions made by using the contents of the description and the drawings are obvious. The solutions obtained by the changes of the above should be included in the protection scope of the present invention.

Claims (10)

1. A cross-modal prediction system for lung cancer gene mutation is characterized by specifically comprising:
the system comprises an acquisition module, a detection module and a control module, wherein the acquisition module is used for respectively acquiring computed tomography images, digital pathological images, real gene mutation types and real tumor mutation loads corresponding to each real gene mutation type of a plurality of lung cancer patients before receiving targeted therapy and immunotherapy;
the marking module is connected with the acquisition module and is used for marking the focus areas in the computed tomography image and the digital pathology image respectively to obtain a computed tomography marking image and a digital pathology marking image;
the training module is respectively connected with the acquisition module and the labeling module and is used for taking the computed tomography labeling image and the digital pathology labeling image of each lung cancer patient as input, taking the corresponding real gene mutation type and the real tumor mutation load as output, and training to obtain a lung cancer gene mutation prediction model;
and the prediction module is connected with the training module and used for inputting the computed tomography image and the digital pathological image of the lung cancer patient to be predicted into the lung cancer gene mutation prediction model to obtain the predicted gene mutation type and the corresponding predicted tumor mutation load of the lung cancer patient to be predicted, and the predicted gene mutation type and the corresponding predicted tumor mutation load are used as reference data for doctors to give targeted therapy and immunotherapy opinions.
2. The system of claim 1, wherein the collection module comprises:
the first acquisition unit is used for acquiring the computed tomography image of each lung cancer patient;
the second acquisition unit is used for acquiring pathological sections of the lung cancer patients and scanning the pathological sections to obtain the digital pathological images;
and the third acquisition unit is used for carrying out gene sequencing on each lung cancer patient to obtain the real gene mutation type and the real tumor mutation load corresponding to each real gene mutation type.
3. The system of claim 1, wherein the training module comprises:
a grouping unit, configured to use the computed tomography labeling image, the digital pathology labeling image, the true gene mutation type, and the true tumor mutation load of each lung cancer patient as a data set, and form a training set and a verification set according to each data set;
the training unit is connected with the grouping unit and used for taking the computed tomography labeling image and the digital pathology labeling image in the training set as input, taking the real gene mutation type and the real tumor mutation load as output and training to obtain a lung cancer gene mutation initial model;
and the verification unit is respectively connected with the grouping unit and the training unit and is used for carrying out parameter optimization adjustment on the lung cancer gene mutation initial model according to the verification set to obtain the lung cancer gene mutation prediction model.
4. The system of claim 3, wherein the training module further comprises a testing unit connected to the grouping unit and the verification unit, respectively, the testing unit comprising:
a first subunit, configured to form a test set according to each data set;
the second subunit is connected with the first subunit and is used for inputting each computed tomography labeling image and the digital pathology labeling image in the test set into the lung cancer gene mutation prediction model to obtain a corresponding gene mutation type prediction result and a tumor mutation load prediction result;
and the third subunit is connected with the second subunit and used for processing according to each gene mutation type prediction result and the corresponding real gene mutation type to obtain a first prediction accuracy rate and processing according to each tumor mutation load prediction result and the corresponding real tumor mutation load to obtain a second prediction accuracy rate which is used as reference data for doctors to give targeted therapy and immunotherapy opinions.
5. The system of claim 4, wherein the first prediction accuracy and the second prediction accuracy are obtained by processing a characteristic curve of the subject and an area under the curve.
6. The system of claim 4, wherein the training set, the validation set, and the testing set are formed by dividing all the data sets according to a predetermined ratio, the predetermined ratio being 3: 1: 1.
7. a cross-modal lung cancer gene mutation prediction method is applied to the lung cancer gene mutation prediction system of any one of claims 1 to 6, and specifically comprises the following steps:
step S1, the lung cancer gene mutation cross-modal prediction system respectively collects the computed tomography images, digital pathological images, real gene mutation types and real tumor mutation loads corresponding to each real gene mutation type of a plurality of lung cancer patients before receiving targeted therapy and immunotherapy;
step S2, the cross-modal prediction system of the lung cancer gene mutation labels the focus areas in the computed tomography image and the digital pathology image respectively to obtain a computed tomography labeling image and a digital pathology labeling image;
step S3, the lung cancer gene mutation cross-modal prediction system takes the computed tomography labeling image and the digital pathology labeling image of each lung cancer patient as input, takes the corresponding real gene mutation type and the real tumor mutation load as output, and trains to obtain a lung cancer gene mutation prediction model;
step S4, the cross-modal lung cancer gene mutation prediction system inputs the computed tomography image and the digital pathological image of the lung cancer patient to be predicted into the lung cancer gene mutation prediction model to obtain the predicted gene mutation type and the corresponding predicted tumor mutation load of the lung cancer patient to be predicted, and the predicted gene mutation type and the corresponding predicted tumor mutation load are used as reference data for doctors to give targeted therapy and immunotherapy opinions.
8. The method of claim 7, wherein the cross-modal prediction of lung cancer gene mutation is performed,
the step S1 includes:
step S11, the cross-modal lung cancer gene mutation prediction system collects the computed tomography images of the lung cancer patients;
step S12, the cross-modal lung cancer gene mutation prediction system acquires pathological sections of the lung cancer patients and scans the pathological sections to obtain the digital pathological images;
step S13, the cross-modal lung cancer gene mutation prediction system performs gene sequencing on each lung cancer patient to obtain the true gene mutation types and the true tumor mutation loads corresponding to each of the true gene mutation types.
9. The method for cross-modal prediction of lung cancer gene mutation according to claim 7, wherein the step S3 comprises:
step S31, the cross-modal lung cancer gene mutation prediction system takes the computed tomography labeling image, the digital pathology labeling image, the real gene mutation type and the real tumor mutation load of each lung cancer patient as a data set, and forms a training set and a verification set according to the data sets;
step S32, the cross-modal prediction system of the lung cancer gene mutation takes the computed tomography labeling image and the digital pathology labeling image in the training set as input, takes the real gene mutation type and the real tumor mutation load as output, and trains to obtain a lung cancer gene mutation initial model;
and step S33, the cross-modal lung cancer gene mutation prediction system performs parameter optimization adjustment on the lung cancer gene mutation initial model according to the verification set to obtain the lung cancer gene mutation prediction model.
10. The method for cross-modal prediction of lung cancer gene mutation of claim 9, wherein the step S3 further comprises a process for obtaining model prediction accuracy, specifically comprising:
step A1, the cross-modal system for lung cancer gene mutation prediction forms a test set according to each data set;
step A2, the cross-modal system for lung cancer gene mutation prediction inputs each computed tomography labeling image and the digital pathology labeling image in the test set into the lung cancer gene mutation prediction model to obtain a corresponding gene mutation type prediction result and a tumor mutation load prediction result;
step A3, the cross-modal lung cancer gene mutation prediction system obtains a first prediction accuracy rate by processing according to each gene mutation type prediction result and the corresponding real gene mutation type, and obtains a second prediction accuracy rate by processing according to each tumor mutation load prediction result and the corresponding real tumor mutation load, and the second prediction accuracy rate is used as reference data for doctors to give targeted therapy and immunotherapy opinions.
CN202010819881.5A 2020-08-14 2020-08-14 Cross-modal prediction system and method for lung cancer gene mutation Pending CN111968742A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010819881.5A CN111968742A (en) 2020-08-14 2020-08-14 Cross-modal prediction system and method for lung cancer gene mutation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010819881.5A CN111968742A (en) 2020-08-14 2020-08-14 Cross-modal prediction system and method for lung cancer gene mutation

Publications (1)

Publication Number Publication Date
CN111968742A true CN111968742A (en) 2020-11-20

Family

ID=73387722

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010819881.5A Pending CN111968742A (en) 2020-08-14 2020-08-14 Cross-modal prediction system and method for lung cancer gene mutation

Country Status (1)

Country Link
CN (1) CN111968742A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113077875A (en) * 2021-03-23 2021-07-06 零氪智慧医疗科技(天津)有限公司 CT image processing method and device
CN113409888A (en) * 2021-06-21 2021-09-17 中国科学院自动化研究所 Tumor microenvironment and tumor gene mutation detection system, method and equipment
CN113744801A (en) * 2021-09-09 2021-12-03 首都医科大学附属北京天坛医院 Method, device and system for determining tumor type, electronic equipment and storage medium
CN115661152A (en) * 2022-12-27 2023-01-31 四川大学华西医院 Analysis Method of Target Development Based on Model Prediction

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160188800A1 (en) * 2014-12-30 2016-06-30 Covidien Lp System and method for cytopathological and genetic data based treatment protocol identification and tracking
CN109033749A (en) * 2018-06-29 2018-12-18 深圳裕策生物科技有限公司 A kind of Tumor mutations load testing method, device and storage medium
US20190087532A1 (en) * 2017-09-15 2019-03-21 Case Western Reserve University Predicting recurrence in early stage non-small cell lung cancer (nsclc) with integrated radiomic and pathomic features
US20190159744A1 (en) * 2017-11-27 2019-05-30 Toshiba America Medical Systems, Inc. Computed tomography perfusion (ctp) method and apparatus using blood flow for discriminating types of cancer
CN110604550A (en) * 2019-09-24 2019-12-24 广州医科大学附属肿瘤医院 A method for predicting complications of normal tissues and organs after tumor radiotherapy
CN111370059A (en) * 2020-03-05 2020-07-03 上海市肺科医院(上海市职业病防治院) Prediction system and method for tumor mutation load
CN111445946A (en) * 2020-03-26 2020-07-24 北京易康医疗科技有限公司 An Algorithm for Calculating Lung Cancer Genotyping Using PET/CT Images

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160188800A1 (en) * 2014-12-30 2016-06-30 Covidien Lp System and method for cytopathological and genetic data based treatment protocol identification and tracking
US20190087532A1 (en) * 2017-09-15 2019-03-21 Case Western Reserve University Predicting recurrence in early stage non-small cell lung cancer (nsclc) with integrated radiomic and pathomic features
US20190159744A1 (en) * 2017-11-27 2019-05-30 Toshiba America Medical Systems, Inc. Computed tomography perfusion (ctp) method and apparatus using blood flow for discriminating types of cancer
CN109033749A (en) * 2018-06-29 2018-12-18 深圳裕策生物科技有限公司 A kind of Tumor mutations load testing method, device and storage medium
CN110604550A (en) * 2019-09-24 2019-12-24 广州医科大学附属肿瘤医院 A method for predicting complications of normal tissues and organs after tumor radiotherapy
CN111370059A (en) * 2020-03-05 2020-07-03 上海市肺科医院(上海市职业病防治院) Prediction system and method for tumor mutation load
CN111445946A (en) * 2020-03-26 2020-07-24 北京易康医疗科技有限公司 An Algorithm for Calculating Lung Cancer Genotyping Using PET/CT Images

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
王荃等: "基于深度学习和组织形态分析的肺癌基因突变预测", 《生物医学工程学杂志》 *
胡玉川等: "影像组学在肺癌中的应用研究进展", 《放射学实践》 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113077875A (en) * 2021-03-23 2021-07-06 零氪智慧医疗科技(天津)有限公司 CT image processing method and device
CN113409888A (en) * 2021-06-21 2021-09-17 中国科学院自动化研究所 Tumor microenvironment and tumor gene mutation detection system, method and equipment
CN113744801A (en) * 2021-09-09 2021-12-03 首都医科大学附属北京天坛医院 Method, device and system for determining tumor type, electronic equipment and storage medium
CN115661152A (en) * 2022-12-27 2023-01-31 四川大学华西医院 Analysis Method of Target Development Based on Model Prediction
CN115661152B (en) * 2022-12-27 2023-04-07 四川大学华西医院 Analysis Method of Target Development Based on Model Prediction

Similar Documents

Publication Publication Date Title
CN111968742A (en) Cross-modal prediction system and method for lung cancer gene mutation
Shamai et al. Artificial intelligence algorithms to assess hormonal status from tissue microarrays in patients with breast cancer
Warsinske et al. Assessment of validity of a blood-based 3-gene signature score for progression and diagnosis of tuberculosis, disease severity, and treatment response
Wang et al. Predicting EGFR and PD-L1 status in NSCLC patients using multitask AI system based on CT images
CN111640503B (en) System and method for predicting tumor mutation load of advanced lung cancer patient
Sharifi et al. Deep learning on ultrasound images of thyroid nodules
Brugnara et al. Automated volumetric assessment with artificial neural networks might enable a more accurate assessment of disease burden in patients with multiple sclerosis
CN106650256A (en) Precise medical platform for molecular diagnosis and treatment
Raajan et al. Non-invasive technique-based novel corona (COVID-19) virus detection using CNN
JP7124265B2 (en) Biomarker detection method, disease determination method, biomarker detection device, and biomarker detection program
CN113077875B (en) CT image processing method and device
Chang et al. Wrist joint synovial hypertrophy and effusion detection in musculoskeletal ultrasound images using self-attention U-Net
Orhan et al. Development and validation of an ultrasonography-based machine learning model for predicting outcomes of bruxism treatments
Zhang et al. A model-agnostic framework for universal anomaly detection of multi-organ and multi-modal images
JP2010029659A (en) Method and device for obtaining parameter related to prostate of patient
Huang et al. Preliminary discrimination and evaluation of clinical application value of ChatGPT4o in bone tumors
JP2010029659A5 (en)
JP2021007659A (en) Medical information processing device and program
Wang et al. A generalized deep learning approach for evaluating secondary pulmonary tuberculosis on chest computed tomography
Farid et al. Diminished muscle integrity in patients with fibrodysplasia ossificans progressiva assessed with at-home electrical impedance myography
Wang et al. Automated detection of otosclerosis with interpretable deep learning using temporal bone computed tomography images
Wang et al. Mut-Detecter: An EGFR activating mutation type classification method with a deep convolutional neural network
CN119130916B (en) Universal organ lymph node metastasis analysis system based on joint deep learning model
Naren et al. A Platform to Assist Pathologists in the Cancer Detection Process
Webster et al. Revolutionizing Dermatology and Healthcare through Advanced Artificial Intelligence: Emerging Trends, Applications, and Future Prospects

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20201120