CN113186287B - Biomarker for non-small cell lung cancer typing and application thereof - Google Patents
Biomarker for non-small cell lung cancer typing and application thereof Download PDFInfo
- Publication number
- CN113186287B CN113186287B CN202110505178.1A CN202110505178A CN113186287B CN 113186287 B CN113186287 B CN 113186287B CN 202110505178 A CN202110505178 A CN 202110505178A CN 113186287 B CN113186287 B CN 113186287B
- Authority
- CN
- China
- Prior art keywords
- lung cancer
- small cell
- adenocarcinoma
- cell lung
- biomarker
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 239000000090 biomarker Substances 0.000 title claims abstract description 34
- 208000002154 non-small cell lung carcinoma Diseases 0.000 title claims abstract description 28
- 208000029729 tumor suppressor gene on chromosome 11 Diseases 0.000 title claims abstract description 28
- 208000009956 adenocarcinoma Diseases 0.000 claims abstract description 40
- 206010041823 squamous cell carcinoma Diseases 0.000 claims abstract description 33
- 101000628562 Homo sapiens Serine/threonine-protein kinase STK11 Proteins 0.000 claims abstract description 8
- 108010011536 PTEN Phosphohydrolase Proteins 0.000 claims abstract description 8
- 102100026715 Serine/threonine-protein kinase STK11 Human genes 0.000 claims abstract description 8
- 108010078814 Tumor Suppressor Protein p53 Proteins 0.000 claims abstract description 8
- 102000014160 PTEN Phosphohydrolase Human genes 0.000 claims abstract 2
- 102000015098 Tumor Suppressor Protein p53 Human genes 0.000 claims abstract 2
- -1 NFE2L Proteins 0.000 claims description 8
- 101000584612 Homo sapiens GTPase KRas Proteins 0.000 claims description 7
- 239000003153 chemical reaction reagent Substances 0.000 claims description 7
- 108010009392 Cyclin-Dependent Kinase Inhibitor p16 Proteins 0.000 claims description 6
- 102100030708 GTPase KRas Human genes 0.000 claims description 6
- 101000605639 Homo sapiens Phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit alpha isoform Proteins 0.000 claims description 6
- 101000984753 Homo sapiens Serine/threonine-protein kinase B-raf Proteins 0.000 claims description 6
- 101000633632 Homo sapiens Teashirt homolog 3 Proteins 0.000 claims description 6
- 102100027103 Serine/threonine-protein kinase B-raf Human genes 0.000 claims description 6
- 102100029222 Teashirt homolog 3 Human genes 0.000 claims description 6
- 102100027708 Astrotactin-1 Human genes 0.000 claims description 5
- 102100032742 Histone-lysine N-methyltransferase SETD2 Human genes 0.000 claims description 5
- 101000936741 Homo sapiens Astrotactin-1 Proteins 0.000 claims description 5
- 101000654725 Homo sapiens Histone-lysine N-methyltransferase SETD2 Proteins 0.000 claims description 5
- 101000623901 Homo sapiens Mucin-16 Proteins 0.000 claims description 5
- 101000609959 Homo sapiens Protein piccolo Proteins 0.000 claims description 5
- 101000728110 Homo sapiens Putative Polycomb group protein ASXL3 Proteins 0.000 claims description 5
- 101000854044 Homo sapiens Retinitis pigmentosa 1-like 1 protein Proteins 0.000 claims description 5
- 101000881267 Homo sapiens Spectrin alpha chain, erythrocytic 1 Proteins 0.000 claims description 5
- 101000804908 Homo sapiens Xin actin-binding repeat-containing protein 2 Proteins 0.000 claims description 5
- 102100023123 Mucin-16 Human genes 0.000 claims description 5
- 102100038332 Phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit alpha isoform Human genes 0.000 claims description 5
- 102100039154 Protein piccolo Human genes 0.000 claims description 5
- 102100029749 Putative Polycomb group protein ASXL3 Human genes 0.000 claims description 5
- 102000004912 RYR2 Human genes 0.000 claims description 5
- 108060007241 RYR2 Proteins 0.000 claims description 5
- 102100035670 Retinitis pigmentosa 1-like 1 protein Human genes 0.000 claims description 5
- 102100037608 Spectrin alpha chain, erythrocytic 1 Human genes 0.000 claims description 5
- 239000012472 biological sample Substances 0.000 claims description 5
- 208000014488 papillary tumor of the pineal region Diseases 0.000 claims description 2
- 238000002360 preparation method Methods 0.000 claims description 2
- 102100033254 Tumor suppressor ARF Human genes 0.000 claims 1
- 102000052116 epidermal growth factor receptor activity proteins Human genes 0.000 claims 1
- 108700015053 epidermal growth factor receptor activity proteins Proteins 0.000 claims 1
- YOHYSYJDKVYCJI-UHFFFAOYSA-N n-[3-[[6-[3-(trifluoromethyl)anilino]pyrimidin-4-yl]amino]phenyl]cyclopropanecarboxamide Chemical compound FC(F)(F)C1=CC=CC(NC=2N=CN=C(NC=3C=C(NC(=O)C4CC4)C=CC=3)C=2)=C1 YOHYSYJDKVYCJI-UHFFFAOYSA-N 0.000 claims 1
- 108090000623 proteins and genes Proteins 0.000 abstract description 14
- 238000003745 diagnosis Methods 0.000 abstract description 13
- 238000001514 detection method Methods 0.000 abstract description 12
- 208000000587 small cell lung carcinoma Diseases 0.000 abstract description 4
- 206010041067 Small cell lung cancer Diseases 0.000 abstract description 3
- 238000005516 engineering process Methods 0.000 description 22
- 238000012163 sequencing technique Methods 0.000 description 20
- 238000000034 method Methods 0.000 description 17
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 12
- 238000004458 analytical method Methods 0.000 description 12
- 201000005202 lung cancer Diseases 0.000 description 12
- 208000020816 lung neoplasm Diseases 0.000 description 12
- 230000001575 pathological effect Effects 0.000 description 12
- 206010028980 Neoplasm Diseases 0.000 description 10
- 102100032543 Phosphatidylinositol 3,4,5-trisphosphate 3-phosphatase and dual-specificity protein phosphatase PTEN Human genes 0.000 description 9
- 238000007637 random forest analysis Methods 0.000 description 9
- 210000001519 tissue Anatomy 0.000 description 9
- 102000001301 EGF receptor Human genes 0.000 description 8
- 108060006698 EGF receptor Proteins 0.000 description 8
- 206010064571 Gene mutation Diseases 0.000 description 7
- 238000012360 testing method Methods 0.000 description 7
- 238000011282 treatment Methods 0.000 description 7
- 102100025064 Cellular tumor antigen p53 Human genes 0.000 description 6
- 210000004027 cell Anatomy 0.000 description 6
- 239000000523 sample Substances 0.000 description 6
- 238000010200 validation analysis Methods 0.000 description 6
- 102100024458 Cyclin-dependent kinase inhibitor 2A Human genes 0.000 description 5
- 201000011510 cancer Diseases 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 230000035772 mutation Effects 0.000 description 5
- 102100036955 Xin actin-binding repeat-containing protein 2 Human genes 0.000 description 4
- 239000003814 drug Substances 0.000 description 4
- 238000001959 radiotherapy Methods 0.000 description 4
- 238000012216 screening Methods 0.000 description 4
- 102100032296 A disintegrin and metalloproteinase with thrombospondin motifs 12 Human genes 0.000 description 3
- 102100032292 A disintegrin and metalloproteinase with thrombospondin motifs 17 Human genes 0.000 description 3
- 108091005671 ADAMTS12 Proteins 0.000 description 3
- 108091005674 ADAMTS17 Proteins 0.000 description 3
- 102100024378 AF4/FMR2 family member 2 Human genes 0.000 description 3
- 102000000872 ATM Human genes 0.000 description 3
- 102100021501 ATP-binding cassette sub-family B member 5 Human genes 0.000 description 3
- 101150020330 ATRX gene Proteins 0.000 description 3
- 102100022117 Abnormal spindle-like microcephaly-associated protein Human genes 0.000 description 3
- 102100040036 Adhesion G-protein coupled receptor G4 Human genes 0.000 description 3
- 102100031366 Ankyrin-1 Human genes 0.000 description 3
- 108010004586 Ataxia Telangiectasia Mutated Proteins Proteins 0.000 description 3
- 102100022983 B-cell lymphoma/leukemia 11B Human genes 0.000 description 3
- 102100035080 BDNF/NT-3 growth factors receptor Human genes 0.000 description 3
- 102100029963 Beta-galactoside alpha-2,6-sialyltransferase 2 Human genes 0.000 description 3
- 102000014811 CACNA1E Human genes 0.000 description 3
- 102100040750 CUB and sushi domain-containing protein 1 Human genes 0.000 description 3
- 102100028914 Catenin beta-1 Human genes 0.000 description 3
- 102100035595 Cohesin subunit SA-2 Human genes 0.000 description 3
- 102100030505 Coiled-coil domain-containing protein 178 Human genes 0.000 description 3
- 102100027442 Collagen alpha-1(XII) chain Human genes 0.000 description 3
- 102100024203 Collagen alpha-1(XIV) chain Human genes 0.000 description 3
- 102100029813 D(1B) dopamine receptor Human genes 0.000 description 3
- 101700026669 DACH1 Proteins 0.000 description 3
- 102100028735 Dachshund homolog 1 Human genes 0.000 description 3
- 102100024230 Dendritic cell-specific transmembrane protein Human genes 0.000 description 3
- 102100024108 Dystrophin Human genes 0.000 description 3
- 102100034674 E3 ubiquitin-protein ligase HECW1 Human genes 0.000 description 3
- 102100031918 E3 ubiquitin-protein ligase NEDD4 Human genes 0.000 description 3
- 102100034568 E3 ubiquitin-protein ligase PDZRN3 Human genes 0.000 description 3
- 102100021604 Ephrin type-A receptor 6 Human genes 0.000 description 3
- 102100031984 Ephrin type-B receptor 6 Human genes 0.000 description 3
- 102100027623 FERM and PDZ domain-containing protein 4 Human genes 0.000 description 3
- 102100031510 Fibrillin-2 Human genes 0.000 description 3
- 102100023593 Fibroblast growth factor receptor 1 Human genes 0.000 description 3
- 108090000369 Glutamate Carboxypeptidase II Proteins 0.000 description 3
- 102100041003 Glutamate carboxypeptidase 2 Human genes 0.000 description 3
- 102100027768 Histone-lysine N-methyltransferase 2D Human genes 0.000 description 3
- 101000833172 Homo sapiens AF4/FMR2 family member 2 Proteins 0.000 description 3
- 101000677872 Homo sapiens ATP-binding cassette sub-family B member 5 Proteins 0.000 description 3
- 101000900939 Homo sapiens Abnormal spindle-like microcephaly-associated protein Proteins 0.000 description 3
- 101000959604 Homo sapiens Adhesion G-protein coupled receptor G4 Proteins 0.000 description 3
- 101000796140 Homo sapiens Ankyrin-1 Proteins 0.000 description 3
- 101000903697 Homo sapiens B-cell lymphoma/leukemia 11B Proteins 0.000 description 3
- 101000596896 Homo sapiens BDNF/NT-3 growth factors receptor Proteins 0.000 description 3
- 101000863891 Homo sapiens Beta-galactoside alpha-2,6-sialyltransferase 2 Proteins 0.000 description 3
- 101000892017 Homo sapiens CUB and sushi domain-containing protein 1 Proteins 0.000 description 3
- 101000916173 Homo sapiens Catenin beta-1 Proteins 0.000 description 3
- 101000642968 Homo sapiens Cohesin subunit SA-2 Proteins 0.000 description 3
- 101000772635 Homo sapiens Coiled-coil domain-containing protein 178 Proteins 0.000 description 3
- 101000861874 Homo sapiens Collagen alpha-1(XII) chain Proteins 0.000 description 3
- 101000909626 Homo sapiens Collagen alpha-1(XIV) chain Proteins 0.000 description 3
- 101000865210 Homo sapiens D(1B) dopamine receptor Proteins 0.000 description 3
- 101000832060 Homo sapiens Dendritic cell-specific transmembrane protein Proteins 0.000 description 3
- 101001053946 Homo sapiens Dystrophin Proteins 0.000 description 3
- 101000872869 Homo sapiens E3 ubiquitin-protein ligase HECW1 Proteins 0.000 description 3
- 101000636713 Homo sapiens E3 ubiquitin-protein ligase NEDD4 Proteins 0.000 description 3
- 101001131834 Homo sapiens E3 ubiquitin-protein ligase PDZRN3 Proteins 0.000 description 3
- 101000898696 Homo sapiens Ephrin type-A receptor 6 Proteins 0.000 description 3
- 101001064451 Homo sapiens Ephrin type-B receptor 6 Proteins 0.000 description 3
- 101000862373 Homo sapiens FERM and PDZ domain-containing protein 4 Proteins 0.000 description 3
- 101000846890 Homo sapiens Fibrillin-2 Proteins 0.000 description 3
- 101000827746 Homo sapiens Fibroblast growth factor receptor 1 Proteins 0.000 description 3
- 101000917159 Homo sapiens Filaggrin Proteins 0.000 description 3
- 101001008894 Homo sapiens Histone-lysine N-methyltransferase 2D Proteins 0.000 description 3
- 101000994375 Homo sapiens Integrin alpha-4 Proteins 0.000 description 3
- 101000605528 Homo sapiens Kallikrein-2 Proteins 0.000 description 3
- 101001052076 Homo sapiens Maltase-glucoamylase Proteins 0.000 description 3
- 101000636209 Homo sapiens Matrix-remodeling-associated protein 5 Proteins 0.000 description 3
- 101001071437 Homo sapiens Metabotropic glutamate receptor 1 Proteins 0.000 description 3
- 101000623904 Homo sapiens Mucin-17 Proteins 0.000 description 3
- 101000589015 Homo sapiens Myomesin-2 Proteins 0.000 description 3
- 101001000104 Homo sapiens Myosin-11 Proteins 0.000 description 3
- 101000624956 Homo sapiens Nesprin-2 Proteins 0.000 description 3
- 101000775053 Homo sapiens Neuroblast differentiation-associated protein AHNAK Proteins 0.000 description 3
- 101000601047 Homo sapiens Nidogen-1 Proteins 0.000 description 3
- 101000634529 Homo sapiens Nuclear pore-associated protein 1 Proteins 0.000 description 3
- 101001122435 Homo sapiens Olfactory receptor 4C15 Proteins 0.000 description 3
- 101001137111 Homo sapiens Olfactory receptor 8H2 Proteins 0.000 description 3
- 101001087045 Homo sapiens Phosphatidylinositol 3,4,5-trisphosphate 3-phosphatase and dual-specificity protein phosphatase PTEN Proteins 0.000 description 3
- 101000905839 Homo sapiens Phospholipid-transporting ATPase VA Proteins 0.000 description 3
- 101001126417 Homo sapiens Platelet-derived growth factor receptor alpha Proteins 0.000 description 3
- 101001062793 Homo sapiens Protein FAM171A1 Proteins 0.000 description 3
- 101000801295 Homo sapiens Protein O-mannosyl-transferase TMTC3 Proteins 0.000 description 3
- 101000736906 Homo sapiens Protein prune homolog 2 Proteins 0.000 description 3
- 101000824415 Homo sapiens Protocadherin Fat 3 Proteins 0.000 description 3
- 101000613366 Homo sapiens Protocadherin-11 X-linked Proteins 0.000 description 3
- 101000580092 Homo sapiens RNA-binding protein 10 Proteins 0.000 description 3
- 101001062093 Homo sapiens RNA-binding protein 15 Proteins 0.000 description 3
- 101001130509 Homo sapiens Ras GTPase-activating protein 1 Proteins 0.000 description 3
- 101000738771 Homo sapiens Receptor-type tyrosine-protein phosphatase C Proteins 0.000 description 3
- 101000606537 Homo sapiens Receptor-type tyrosine-protein phosphatase delta Proteins 0.000 description 3
- 101000650697 Homo sapiens Roundabout homolog 2 Proteins 0.000 description 3
- 101000654718 Homo sapiens SET-binding protein Proteins 0.000 description 3
- 101000740205 Homo sapiens Sal-like protein 1 Proteins 0.000 description 3
- 101000962322 Homo sapiens Sodium leak channel NALCN Proteins 0.000 description 3
- 101000864761 Homo sapiens Splicing factor 1 Proteins 0.000 description 3
- 101000585255 Homo sapiens Steroidogenic factor 1 Proteins 0.000 description 3
- 101000626112 Homo sapiens Telomerase protein component 1 Proteins 0.000 description 3
- 101000837130 Homo sapiens Tenascin-R Proteins 0.000 description 3
- 101000669447 Homo sapiens Toll-like receptor 4 Proteins 0.000 description 3
- 101000702545 Homo sapiens Transcription activator BRG1 Proteins 0.000 description 3
- 101000864342 Homo sapiens Tyrosine-protein kinase BTK Proteins 0.000 description 3
- 101001050476 Homo sapiens Tyrosine-protein kinase ITK/TSK Proteins 0.000 description 3
- 101000662020 Homo sapiens Ubiquitin-like modifier-activating enzyme 6 Proteins 0.000 description 3
- 101000860430 Homo sapiens Versican core protein Proteins 0.000 description 3
- 101000867844 Homo sapiens Voltage-dependent R-type calcium channel subunit alpha-1E Proteins 0.000 description 3
- 101000744897 Homo sapiens Zinc finger homeobox protein 4 Proteins 0.000 description 3
- 101000782145 Homo sapiens Zinc finger protein 226 Proteins 0.000 description 3
- 101000723615 Homo sapiens Zinc finger protein 536 Proteins 0.000 description 3
- 101000976250 Homo sapiens Zinc finger protein 804A Proteins 0.000 description 3
- 101000772560 Homo sapiens Zinc finger transcription factor Trps1 Proteins 0.000 description 3
- 102100032818 Integrin alpha-4 Human genes 0.000 description 3
- 102100038356 Kallikrein-2 Human genes 0.000 description 3
- 108090000484 Kelch-Like ECH-Associated Protein 1 Proteins 0.000 description 3
- 102000004034 Kelch-Like ECH-Associated Protein 1 Human genes 0.000 description 3
- 102100024295 Maltase-glucoamylase Human genes 0.000 description 3
- 102100030776 Matrix-remodeling-associated protein 5 Human genes 0.000 description 3
- 102100036834 Metabotropic glutamate receptor 1 Human genes 0.000 description 3
- 102100023125 Mucin-17 Human genes 0.000 description 3
- 102100032965 Myomesin-2 Human genes 0.000 description 3
- 102100036639 Myosin-11 Human genes 0.000 description 3
- 108010071382 NF-E2-Related Factor 2 Proteins 0.000 description 3
- 102100029166 NT-3 growth factor receptor Human genes 0.000 description 3
- 102100023305 Nesprin-2 Human genes 0.000 description 3
- 102100031837 Neuroblast differentiation-associated protein AHNAK Human genes 0.000 description 3
- 102100037369 Nidogen-1 Human genes 0.000 description 3
- 102000001753 Notch4 Receptor Human genes 0.000 description 3
- 108010029741 Notch4 Receptor Proteins 0.000 description 3
- 102100031701 Nuclear factor erythroid 2-related factor 2 Human genes 0.000 description 3
- 102100029048 Nuclear pore-associated protein 1 Human genes 0.000 description 3
- 102100027146 Olfactory receptor 4C15 Human genes 0.000 description 3
- 102100035642 Olfactory receptor 8H2 Human genes 0.000 description 3
- 102100023496 Phospholipid-transporting ATPase VA Human genes 0.000 description 3
- 102100030485 Platelet-derived growth factor receptor alpha Human genes 0.000 description 3
- 102100030534 Protein FAM171A1 Human genes 0.000 description 3
- 102100033736 Protein O-mannosyl-transferase TMTC3 Human genes 0.000 description 3
- 102100036040 Protein prune homolog 2 Human genes 0.000 description 3
- 102100022134 Protocadherin Fat 3 Human genes 0.000 description 3
- 102100040913 Protocadherin-11 X-linked Human genes 0.000 description 3
- 102100027514 RNA-binding protein 10 Human genes 0.000 description 3
- 102100029244 RNA-binding protein 15 Human genes 0.000 description 3
- 102000004913 RYR1 Human genes 0.000 description 3
- 108060007240 RYR1 Proteins 0.000 description 3
- 102100031426 Ras GTPase-activating protein 1 Human genes 0.000 description 3
- 102100037422 Receptor-type tyrosine-protein phosphatase C Human genes 0.000 description 3
- 102100039666 Receptor-type tyrosine-protein phosphatase delta Human genes 0.000 description 3
- 102100027739 Roundabout homolog 2 Human genes 0.000 description 3
- 102100032741 SET-binding protein Human genes 0.000 description 3
- 102100032735 SH3 and multiple ankyrin repeat domains protein 1 Human genes 0.000 description 3
- 101710101742 SH3 and multiple ankyrin repeat domains protein 1 Proteins 0.000 description 3
- 108091006731 SLCO1B1 Proteins 0.000 description 3
- 101001053942 Saccharolobus solfataricus (strain ATCC 35092 / DSM 1617 / JCM 11322 / P2) Diphosphomevalonate decarboxylase Proteins 0.000 description 3
- 102100037204 Sal-like protein 1 Human genes 0.000 description 3
- 102100039242 Sodium leak channel NALCN Human genes 0.000 description 3
- 102100027233 Solute carrier organic anion transporter family member 1B1 Human genes 0.000 description 3
- 102100029856 Steroidogenic factor 1 Human genes 0.000 description 3
- 102100028644 Tenascin-R Human genes 0.000 description 3
- 102100039360 Toll-like receptor 4 Human genes 0.000 description 3
- 102100031027 Transcription activator BRG1 Human genes 0.000 description 3
- 102100029823 Tyrosine-protein kinase BTK Human genes 0.000 description 3
- 102100023345 Tyrosine-protein kinase ITK/TSK Human genes 0.000 description 3
- 102100037939 Ubiquitin-like modifier-activating enzyme 6 Human genes 0.000 description 3
- 102100028437 Versican core protein Human genes 0.000 description 3
- 102000056014 X-linked Nuclear Human genes 0.000 description 3
- 108700042462 X-linked Nuclear Proteins 0.000 description 3
- 102100039968 Zinc finger homeobox protein 4 Human genes 0.000 description 3
- 102100036559 Zinc finger protein 226 Human genes 0.000 description 3
- 102100027858 Zinc finger protein 536 Human genes 0.000 description 3
- 102100023875 Zinc finger protein 804A Human genes 0.000 description 3
- 102100030619 Zinc finger transcription factor Trps1 Human genes 0.000 description 3
- ZPCCSZFPOXBNDL-ZSTSFXQOSA-N [(4r,5s,6s,7r,9r,10r,11e,13e,16r)-6-[(2s,3r,4r,5s,6r)-5-[(2s,4r,5s,6s)-4,5-dihydroxy-4,6-dimethyloxan-2-yl]oxy-4-(dimethylamino)-3-hydroxy-6-methyloxan-2-yl]oxy-10-[(2r,5s,6r)-5-(dimethylamino)-6-methyloxan-2-yl]oxy-5-methoxy-9,16-dimethyl-2-oxo-7-(2-oxoe Chemical compound O([C@H]1/C=C/C=C/C[C@@H](C)OC(=O)C[C@H]([C@@H]([C@H]([C@@H](CC=O)C[C@H]1C)O[C@H]1[C@@H]([C@H]([C@H](O[C@@H]2O[C@@H](C)[C@H](O)[C@](C)(O)C2)[C@@H](C)O1)N(C)C)O)OC)OC(C)=O)[C@H]1CC[C@H](N(C)C)[C@@H](C)O1 ZPCCSZFPOXBNDL-ZSTSFXQOSA-N 0.000 description 3
- 238000002512 chemotherapy Methods 0.000 description 3
- 238000013210 evaluation model Methods 0.000 description 3
- 238000010827 pathological analysis Methods 0.000 description 3
- 230000007170 pathology Effects 0.000 description 3
- 238000001356 surgical procedure Methods 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 238000007671 third-generation sequencing Methods 0.000 description 3
- 108010064892 trkC Receptor Proteins 0.000 description 3
- 238000012070 whole genome sequencing analysis Methods 0.000 description 3
- 102100033793 ALK tyrosine kinase receptor Human genes 0.000 description 2
- 108091005932 CCKBR Proteins 0.000 description 2
- 102100031011 Chemerin-like receptor 1 Human genes 0.000 description 2
- 102100031457 Collagen alpha-1(V) chain Human genes 0.000 description 2
- 101000837192 Drosophila melanogaster Teneurin-m Proteins 0.000 description 2
- 102100036016 Gastrin/cholecystokinin type B receptor Human genes 0.000 description 2
- 102100028893 Hemicentin-1 Human genes 0.000 description 2
- 101000779641 Homo sapiens ALK tyrosine kinase receptor Proteins 0.000 description 2
- 101000919756 Homo sapiens Chemerin-like receptor 1 Proteins 0.000 description 2
- 101000941708 Homo sapiens Collagen alpha-1(V) chain Proteins 0.000 description 2
- 101000839060 Homo sapiens Hemicentin-1 Proteins 0.000 description 2
- 101001072259 Homo sapiens Protocadherin-15 Proteins 0.000 description 2
- 101800000618 Protein kinase C delta type catalytic subunit Proteins 0.000 description 2
- 102100021004 Protein sidekick-1 Human genes 0.000 description 2
- 102100036382 Protocadherin-15 Human genes 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 238000003384 imaging method Methods 0.000 description 2
- 238000009169 immunotherapy Methods 0.000 description 2
- 238000002493 microarray Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 102000004169 proteins and genes Human genes 0.000 description 2
- 230000001225 therapeutic effect Effects 0.000 description 2
- 102100024153 Cadherin-15 Human genes 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 230000007064 DNA hydrolysis Effects 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 101000762242 Homo sapiens Cadherin-15 Proteins 0.000 description 1
- 101001121408 Homo sapiens L-amino-acid oxidase Proteins 0.000 description 1
- 101000827703 Homo sapiens Polyphosphoinositide phosphatase Proteins 0.000 description 1
- 101000800633 Homo sapiens Teneurin-2 Proteins 0.000 description 1
- 238000012351 Integrated analysis Methods 0.000 description 1
- 102100026388 L-amino-acid oxidase Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 208000007433 Lymphatic Metastasis Diseases 0.000 description 1
- 206010027476 Metastases Diseases 0.000 description 1
- 102100030569 Nuclear receptor corepressor 2 Human genes 0.000 description 1
- 101710153660 Nuclear receptor corepressor 2 Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 102100023591 Polyphosphoinositide phosphatase Human genes 0.000 description 1
- 101100012902 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) FIG2 gene Proteins 0.000 description 1
- 101100233916 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) KAR5 gene Proteins 0.000 description 1
- 102100033227 Teneurin-2 Human genes 0.000 description 1
- 238000011226 adjuvant chemotherapy Methods 0.000 description 1
- 238000011353 adjuvant radiotherapy Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 229940044683 chemotherapy drug Drugs 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 230000008094 contradictory effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000010219 correlation analysis Methods 0.000 description 1
- 238000002405 diagnostic procedure Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 1
- 235000011180 diphosphates Nutrition 0.000 description 1
- 238000012850 discrimination method Methods 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003596 drug target Substances 0.000 description 1
- 238000007672 fourth generation sequencing Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000012268 genome sequencing Methods 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 230000002055 immunohistochemical effect Effects 0.000 description 1
- 230000009545 invasion Effects 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 201000005249 lung adenocarcinoma Diseases 0.000 description 1
- 230000007257 malfunction Effects 0.000 description 1
- 230000003211 malignant effect Effects 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- 230000009401 metastasis Effects 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 229940028444 muse Drugs 0.000 description 1
- 230000036438 mutation frequency Effects 0.000 description 1
- 238000012148 non-surgical treatment Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000002980 postoperative effect Effects 0.000 description 1
- GMVPRGQOIOIIMI-DWKJAMRDSA-N prostaglandin E1 Chemical compound CCCCC[C@H](O)\C=C\[C@H]1[C@H](O)CC(=O)[C@@H]1CCCCCCC(O)=O GMVPRGQOIOIIMI-DWKJAMRDSA-N 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 230000011514 reflex Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000007841 sequencing by ligation Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000011521 systemic chemotherapy Methods 0.000 description 1
- 238000002626 targeted therapy Methods 0.000 description 1
- 238000011269 treatment regimen Methods 0.000 description 1
- 210000004881 tumor cell Anatomy 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
- G16B20/20—Allele or variant detection, e.g. single nucleotide polymorphism [SNP] detection
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B25/00—ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
- G16B25/20—Polymerase chain reaction [PCR]; Primer or probe design; Probe optimisation
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
- G16B40/10—Signal processing, e.g. from mass spectrometry [MS] or from PCR
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/20—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/112—Disease subtyping, staging or classification
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/156—Polymorphic or mutational markers
Landscapes
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Physics & Mathematics (AREA)
- Chemical & Material Sciences (AREA)
- Medical Informatics (AREA)
- General Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- Genetics & Genomics (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Analytical Chemistry (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Organic Chemistry (AREA)
- Pathology (AREA)
- Public Health (AREA)
- Theoretical Computer Science (AREA)
- Immunology (AREA)
- Biomedical Technology (AREA)
- Data Mining & Analysis (AREA)
- Epidemiology (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Databases & Information Systems (AREA)
- Primary Health Care (AREA)
- General Engineering & Computer Science (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Oncology (AREA)
- Hospice & Palliative Care (AREA)
- Signal Processing (AREA)
- Artificial Intelligence (AREA)
- Bioethics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Computation (AREA)
Abstract
Description
技术领域Technical Field
本发明涉及医学检测技术领域,特别是涉及一种用于非小细胞肺癌分型的生物标志物及其应用。The present invention relates to the field of medical detection technology, and in particular to a biomarker for non-small cell lung cancer typing and application thereof.
背景技术Background Art
肺癌是一种异质性疾病,现有肺癌的治疗方式选择的主要依据是病理分型及分期诊断。病理分型一般是通过组织学来确定其亚型:重要的分型例如:小细胞vs非小细胞,腺癌vs鳞状细胞癌等。肺癌的各种形态学亚型之间的区分在指导患者管理中是必需的,不同的病理亚型,其相应的治疗策略也有所差异,例如:小细胞未分化肺癌恶性度高,早期易转移,对放化疗敏感,全身化疗及局部放疗的非手术治疗是其主要治疗手段。而非小细胞肺癌主要包括鳞癌和腺癌,Ⅰ、Ⅱ期非小细胞肺癌主要选择手术,部分结合术后辅助放化疗能够治愈。而对于非小细胞肺癌来说,腺癌的肿瘤细胞生长速度比较快,大多在早期就出现转移,以血行转移为主,所以对化疗药物更敏感,而反射治疗效果欠佳,所以常常选择手术、化疗、免疫、靶向治疗等方式。而鳞癌相对而言稍慢一些,早期多数是局部的侵犯,以淋巴结转移途径为主,远处转移发生比较晚,所以针对鳞癌放射治疗的敏感性更高,一般会采用手术、放射、免疫治疗等方式。Lung cancer is a heterogeneous disease. The main basis for the selection of existing lung cancer treatment methods is pathological classification and staging diagnosis. Pathological classification is generally determined by histology: important classifications include small cell vs non-small cell, adenocarcinoma vs squamous cell carcinoma, etc. The distinction between various morphological subtypes of lung cancer is necessary in guiding patient management. Different pathological subtypes have different corresponding treatment strategies. For example, small cell undifferentiated lung cancer is highly malignant, easy to metastasize in the early stage, sensitive to radiotherapy and chemotherapy, and non-surgical treatment with systemic chemotherapy and local radiotherapy is its main treatment method. Non-small cell lung cancer mainly includes squamous cell carcinoma and adenocarcinoma. Surgery is the main choice for stage I and II non-small cell lung cancer, and some can be cured by postoperative adjuvant radiotherapy and chemotherapy. For non-small cell lung cancer, the tumor cells of adenocarcinoma grow faster, and most of them metastasize in the early stage, mainly in the blood, so they are more sensitive to chemotherapy drugs, and the reflex treatment effect is poor, so surgery, chemotherapy, immunotherapy, targeted therapy and other methods are often chosen. Squamous cell carcinoma develops relatively slowly. In the early stages, most invasions are local, with lymph node metastasis being the main route. Distant metastasis occurs relatively late, so squamous cell carcinoma is more sensitive to radiotherapy, and surgery, radiation, immunotherapy, and other methods are generally used.
而常规肺癌病理判断大部分情况仅依赖于组织学的病理诊断结果,但由于医师个人经验和多种原因等,导致了病理亚型分型错误的结果,错误率高,且没有相关的其他质控手段。Conventional lung cancer pathology judgment mostly relies on histological pathological diagnosis results. However, due to the personal experience of doctors and various reasons, the pathological subtype classification is incorrect, the error rate is high, and there are no other relevant quality control measures.
除了常规的放化疗手段,肺癌治疗目前已经逐步迈入精准医疗的时代:传统的影像学诊断和病理诊断已经满足不了精准医疗的需求。目前,肺癌的精准诊断,在传统诊断方法之外,在分子层面可检测特定治疗靶标志物,即检测肿瘤特异性基因突变和特征性蛋白质、RNA、代谢产物等分子标记物,其重要性日益突出。In addition to conventional radiotherapy and chemotherapy, lung cancer treatment has gradually entered the era of precision medicine: traditional imaging diagnosis and pathological diagnosis can no longer meet the needs of precision medicine. At present, in addition to traditional diagnostic methods, the precision diagnosis of lung cancer can detect specific therapeutic target markers at the molecular level, that is, detecting tumor-specific gene mutations and characteristic proteins, RNA, metabolites and other molecular markers, and its importance is becoming increasingly prominent.
基因变异检测已经成为常规治疗的必备流程之一,其中最为著名的如癌细胞表皮生长因子受体(Epidermal Growth Factor Receptor,EGFR)检测,该蛋白功能失常(通常由基因突变导致)的肺癌患者,对一种特异性靶向EGFR蛋白质的药物呈显著反应,该靶点的相关基因变异检测已进入相关肺癌治疗指南。Gene mutation detection has become one of the essential processes in routine treatment. The most famous example is the detection of the epidermal growth factor receptor (EGFR) in cancer cells. Lung cancer patients with malfunction of this protein (usually caused by gene mutation) show a significant response to a drug that specifically targets the EGFR protein. The detection of related gene mutations of this target has been included in the relevant lung cancer treatment guidelines.
并且,药物靶点相关变异基因的检测是肺癌患者,尤其是晚期患者,的一个几乎是必检项目。而有限的组织样品以及对日益增多的治疗靶向标志物的评估的需要大大提高了当前的诊断需求,组织学诊断再现性的研究已经显示了病理学家内和病理学家间的判定差异性:病理判定错误的结果、分化不良的肿瘤及矛盾的免疫组织化学结果等等,对当前肺癌的精准医疗准确性提出了挑战。因此,需要一个可靠的用于确定肺癌病理亚型的手段。In addition, the detection of drug target-related variant genes is an almost mandatory test item for lung cancer patients, especially those in the advanced stage. However, limited tissue samples and the need to evaluate an increasing number of therapeutic target markers have greatly increased the current diagnostic needs. Studies on the reproducibility of histological diagnosis have shown differences in judgments within and between pathologists: incorrect results of pathological judgments, poorly differentiated tumors, and contradictory immunohistochemical results, etc., have challenged the accuracy of current precision medicine for lung cancer. Therefore, a reliable means of determining the pathological subtypes of lung cancer is needed.
发明内容Summary of the invention
基于此,有必要针对上述问题,提供一种用于非小细胞肺癌分型的生物标志物,通过变异基因在非小细胞肺癌的腺癌vs鳞癌不同表达图谱,得到可对非小细胞肺癌中腺癌和鳞癌进行分型的生物标志物,提供基于分子水平、用于两种病理亚型的一个判别方法。Based on this, it is necessary to provide a biomarker for the classification of non-small cell lung cancer to address the above problems. By using the different expression profiles of variant genes in adenocarcinoma vs squamous cell carcinoma of non-small cell lung cancer, a biomarker that can be used to classify adenocarcinoma and squamous cell carcinoma in non-small cell lung cancer can be obtained, providing a molecular-based discrimination method for the two pathological subtypes.
一种用于非小细胞肺癌分型的生物标志物,包括:NFE2L2,TP53,CDKN2A,PTEN,MUC16,PIK3CA,RYR2,ATP10A,SLCO1B1,RASA1,ZFHX4,KMT2D,KRAS,EGFR,ANK1,BRAF,PCLO,PTPRD,ASTN1,ADGRG4,NOTCH4,FAT3,PCDH15,ROBO2,KEAP1,TENM2,TSHZ3,SETBP1,CACNA1E,XIRP2,ASXL3,ZNF804A,NALCN,FBN2,SPTA1,MUC17,RBM10,SETD2,MXRA5,ST6GAL2,RP1L1,ASPM,FLG,HECW1,COL12A1,COL14A1,AFF2,SMARCA4,SDK1,EPHB6,UBA6,SF1,MGAM,PCDH11X,COL5A1,ALK,ATM,VCAN,ZNF536,EPHA6,PDZRN3,PTPRC,ITGA4,DRD5,MYH11,DACH1,CTNNB1,TNR,NPAP1,TLR4,F8,ABCB5,RYR1,OR4C15,MYOM2,DMD,FOLH1,FRMPD4,ADAMTS17,SHANK1,FAM171A1,CCKBR,TRPS1,HMCN1,NTRK3,ATRX,AHNAK,SALL1,PRUNE2,CSMD1,PDGFRA,ADAMTS12,GRM1,SYNE2,OR8H2,TEP1,CCDC178,STK11,NID1,DCSTAMP,STAG2,MET,BCL11B,ZNF226,NTRK2,NEDD4,BTK,TMTC3,RBM15,KLK2,ITK,CMKLR1基因中的至少五种。A biomarker for non-small cell lung cancer classification, including: NFE2L2, TP53, CDKN2A, PTEN, MUC16, PIK3CA, RYR2, ATP10A, SLCO1B1, RASA1, ZFHX4, KMT2D, KRAS, EGFR, ANK1, BRAF, PCLO, PTPRD, ASTN1, ADGRG4, NOTCH4, FAT3, PCDH15, ROBO2, KEAP1, TENM 2, TSHZ3, SETBP1, CACNA1E, XIRP2, ASXL3, ZNF804A, NALCN, FBN2, SPTA1, MUC17, RBM10, SETD2, MXRA5, ST6GAL2, RP1L1, ASPM, FLG, HECW1, COL12A1, COL14A1, AFF2, SMARCA4, SDK1, EPHB6, UBA6, SF1, MGAM,PCDH11X,CO L5A1, ALK, ATM, VCAN, ZNF536, EPHA6, PDZRN3, PTPRC, ITGA4, DRD5, MYH11, DACH1, CTNNB1, TNR, NPAP1, TLR4, F8, ABCB5, RYR1, OR4C15, MYOM2, DMD, FOLH1, FRMPD4, ADAMTS17, SHANK1, FAM171A1, CCKBR , TRPS1, HMCN1, At least five of the following genes: NTRK3, ATRX, AHNAK, SALL1, PRUNE2, CSMD1, PDGFRA, ADAMTS12, GRM1, SYNE2, OR8H2, TEP1, CCDC178, STK11, NID1, DCSTAMP, STAG2, MET, BCL11B, ZNF226, NTRK2, NEDD4, BTK, TMTC3, RBM15, KLK2, ITK, CMKLR1.
本发明人通过对非小细胞肺癌的基因图谱进行了充分的研究,分别获得了在鳞癌中发生突变频率高于腺癌的相关基因(NFE2L2,TP53,CDKN2A,PTEN,MUC16,PIK3CA,RYR2,ATP10A,SLCO1B1,RASA1,ZFHX4,KMT2D),在腺癌中发生突变频率高于鳞癌的相关基因(KRAS,EGFR,ANK1,BRAF,PCLO,PTPRD,ASTN1,ADGRG4,NOTCH4,FAT3,PCDH15,ROBO2,KEAP1,TENM2,TSHZ3,SETBP1,CACNA1E,XIRP2,ASXL3,ZNF804A,NALCN,FBN2,SPTA1,MUC17,RBM10,SETD2,MXRA5,ST6GAL2,RP1L1,ASPM,FLG,HECW1,COL12A1,COL14A1,AFF2,SMARCA4,SDK1,EPHB6,UBA6,SF1,MGAM,PCDH11X,COL5A1,ALK,ATM,VCAN,ZNF536,EPHA6,PDZRN3,PTPRC,ITGA4,DRD5,MYH11,DACH1,CTNNB1,TNR,NPAP1,TLR4,F8,ABCB5,RYR1,OR4C15,MYOM2,DMD,FOLH1,FRMPD4,ADAMTS17,SHANK1,FAM171A1,CCKBR,TRPS1,HMCN1,NTRK3,ATRX,AHNAK,SALL1,PRUNE2,CSMD1,PDGFRA,ADAMTS12,GRM1,SYNE2,OR8H2,TEP1,CCDC178),以及仅在腺癌中发现的突变基因(STK11,NID1,DCSTAMP,STAG2,MET,BCL11B,ZNF226,NTRK2,NEDD4,BTK,TMTC3,RBM15,KLK2,ITK,CMKLR1),并对上述基因进行区分判断模型的建立,通过采用以上生物标志物,可有效判别非小细胞肺癌中的鳞癌和腺癌两个病理亚型。The present inventors have conducted a thorough study of the gene map of non-small cell lung cancer and obtained the related genes (NFE2L2, TP53, CDKN2A, PTEN, MUC16, PIK3CA, RYR2, ATP10A, SLCO1B1, RASA1, ZFHX4, KMT2D) that have a higher mutation frequency in squamous cell carcinoma than in adenocarcinoma, and the related genes (KRAS, EGFR, ANK1, BRAF, PCLO, PTPRD, ASTN1, ADGRG4, NOTCH4, FAT3, P CDH15, ROBO2, KEAP1, TENM2, TSHZ3, SETBP1, CACNA1E, XIRP2, ASXL3, ZNF804A, NALCN, FBN2, SPTA1, MUC17, RBM10, SETD2, MXRA5, ST6GAL2, RP1L1, ASPM, FLG, HECW1, COL12A1, COL14A1, AFF2, SMARCA4, SD K1, EPHB6, UBA6, SF1, MGAM, PCDH11X, COL5A1, A LK, ATM, VCAN, ZNF536, EPHA6, PDZRN3, PTPRC, ITGA4, DRD5, MYH11, DACH1, CTNNB1, TNR, NPAP1, TLR4, F8, ABCB5, RYR1, OR4C15, MYOM2, DMD, FOLH1, FRMPD4, ADAMTS17, SHANK1, FAM171A1, CCKBR, TRPS1, HMCN1, NTRK3, ATRX, AHNAK, SALL1, PRUNE2, CSMD1 , PDGFRA, ADAMTS12, GRM1, SYNE2, OR8H2, TEP1, CCDC178), and mutant genes only found in adenocarcinoma (STK11, NID1, DCSTAMP, STAG2, MET, BCL11B, ZNF226, NTRK2, NEDD4, BTK, TMTC3, RBM15, KLK2, ITK, CMKLR1), and a differentiation and judgment model for the above genes was established. By using the above biomarkers, the two pathological subtypes of squamous cell carcinoma and adenocarcinoma in non-small cell lung cancer can be effectively distinguished.
在其中一个实施例中,所述生物标志物包括:TP53,STK11,PTEN,NFE2L和KRAS基因。以In one embodiment, the biomarkers include: TP53, STK11, PTEN, NFE2L and KRAS genes.
对于分子诊断而言,在保证诊断敏感性和特异性的前提下,尽量减少诊断标志物的数量,可有效降低操作的复杂程度,保证检测结果的可重复性及可靠性,而且也可以大大降低患者的经济负担。上述生物标志物提供5种生物标志物的联合应用,降低了应用成本,且用于分型判断模型RCO曲线的AUC值可达0.700。For molecular diagnosis, minimizing the number of diagnostic markers while ensuring diagnostic sensitivity and specificity can effectively reduce the complexity of the operation, ensure the repeatability and reliability of the test results, and greatly reduce the economic burden on patients. The above biomarkers provide a combined application of 5 biomarkers, which reduces the application cost, and the AUC value of the RCO curve used for the typing judgment model can reach 0.700.
在其中一个实施例中,所述生物标志物包括:TP53,STK11,PTEN,NFE2L,KRAS,EGFR,CDKN2A,BRAF,TSHZ3和PIK3CA基因。采用上述10种生物标志物进行分型,所得判断模型RCO曲线的AUC值可达0.734。In one embodiment, the biomarkers include: TP53, STK11, PTEN, NFE2L, KRAS, EGFR, CDKN2A, BRAF, TSHZ3 and PIK3CA genes. The AUC value of the RCO curve of the judgment model obtained by typing with the above 10 biomarkers can reach 0.734.
在其中一个实施例中,所述生物标志物包括:TP53,STK11,PTEN,NFE2L,KRAS,EGFR,CDKN2A,BRAF,TSHZ3,PIK3CA,SETD2,MUC16,RYR2,PTPR,PCLO,RP1L1,ASTN1,SPTA1,ASXL3和XIRP2。采用上述20种生物标志物进行分型,所得判断模型RCO曲线的AUC值可达0.747。In one embodiment, the biomarkers include: TP53, STK11, PTEN, NFE2L, KRAS, EGFR, CDKN2A, BRAF, TSHZ3, PIK3CA, SETD2, MUC16, RYR2, PTPR, PCLO, RP1L1, ASTN1, SPTA1, ASXL3 and XIRP2. The AUC value of the RCO curve of the judgment model obtained by typing with the above 20 biomarkers can reach 0.747.
在其中一个实施例中,所述生物标志物包括:NFE2L2,TP53,CDKN2A,PTEN,MUC16,PIK3CA,RYR2,ATP10A,SLCO1B1,RASA1,ZFHX4,KMT2D,KRAS,EGFR,ANK1,BRAF,PCLO,PTPRD,ASTN1,ADGRG4,NOTCH4,FAT3,PCDH15,ROBO2,KEAP1,TENM2,TSHZ3,SETBP1,CACNA1E,XIRP2,ASXL3,ZNF804A,NALCN,FBN2,SPTA1,MUC17,RBM10,SETD2,MXRA5,ST6GAL2,RP1L1,ASPM,FLG,HECW1,COL12A1,COL14A1,AFF2,SMARCA4,SDK1,EPHB6,UBA6,SF1,MGAM,PCDH11X,COL5A1,ALK,ATM,VCAN,ZNF536,EPHA6,PDZRN3,PTPRC,ITGA4,DRD5,MYH11,DACH1,CTNNB1,TNR,NPAP1,TLR4,F8,ABCB5,RYR1,OR4C15,MYOM2,DMD,FOLH1,FRMPD4,ADAMTS17,SHANK1,FAM171A1,CCKBR,TRPS1,HMCN1,NTRK3,ATRX,AHNAK,SALL1,PRUNE2,CSMD1,PDGFRA,ADAMTS12,GRM1,SYNE2,OR8H2,TEP1,CCDC178,STK11,NID1,DCSTAMP,STAG2,MET,BCL11B,ZNF226,NTRK2,NEDD4,BTK,TMTC3,RBM15,KLK2,ITK和CMKLR1。In one embodiment, the biomarkers include: NFE2L2, TP53, CDKN2A, PTEN, MUC16, PIK3CA, RYR2, ATP10A, SLCO1B1, RASA1, ZFHX4, KMT2D, KRAS, EGFR, ANK1, BRAF, PCLO, PTPRD, ASTN1, ADGRG4, NOTCH4, FAT3, PCDH15, ROBO2, KEAP1, TENM 2, TSHZ3, SETBP1, CACNA1E, XIRP2, ASXL3, ZNF804A, NALCN, FBN2, SPTA1, MUC17, RBM10, SETD2, MXRA5, ST6GAL2, RP1L1, ASPM, FLG, HECW1, COL12A1, COL14A1, AFF2, SMARCA4, SDK1, EPHB6, UBA6, SF1, MGAM,PCDH11X , COL5A1, ALK, ATM, VCAN, ZNF536, EPHA6, PDZRN3, PTPRC, ITGA4, DRD5, MYH11, DACH1, CTNNB1, TNR, NPAP1, TLR4, F8, ABCB5, RYR1, OR4C15, MYOM2, DMD, FOLH1, FRMPD4, ADAMTS17, SHANK1, FAM171A1, CC KBR,TRPS1,H MCN1, NTRK3, ATRX, AHNAK, SALL1, PRUNE2, CSMD1, PDGFRA, ADAMTS12, GRM1, SYNE2, OR8H2, TEP1, CCDC178, STK11, NID1, DCSTAMP, STAG2, MET, BCL11B, ZNF226, NTRK2, NEDD4, BTK, TMTC3, RBM15, KLK2, ITK and CM KLR1.
采用所有生物标志物进行分型,所得判断模型RCO曲线的AUC值可达0.786。Using all biomarkers for typing, the AUC value of the RCO curve of the judgment model can reach 0.786.
本发明还公开了上述的生物标志物在非小细胞肺癌患者中腺癌与鳞癌分型诊断中的应用。The present invention also discloses the application of the above biomarker in the diagnosis of adenocarcinoma and squamous cell carcinoma in patients with non-small cell lung cancer.
可以理解的,在检测上述生物标志物(即基因变异)时,可采用本领域中能够用于检测基因变异的各种方法,例如:It is understandable that when detecting the above biomarkers (i.e., gene mutations), various methods that can be used to detect gene mutations in the art can be used, for example:
1)测序技术,包括二代测序技术与三代测序技术。二代测序技术原理为大规模平行测序(massive parallel sequencing,MPS),在其中一些实施例中,二代测序技术可以是:1.基于DNA聚合酶合成测序技术(Sequencing by synthesis technology,SBS),代表公司为Illumina(可逆终止测序,reversible terminator sequencing)、Thermo Fisher/Life Technologies(Ion Torrent),GenapSys,罗氏诊断(454焦磷酸测序)等;2.基于DNA连接酶连接测序技术(Sequencing by ligation technology,SBL),代表公司为华大基因/Complete Genomics(复合探针-锚定分子连接,cPAL)、Thermo Fisher/AppliedBiosystems(Sequencing by Oligonucleotide Ligation and Detection,SOLiD)等。三代测序技术为单分子测序技术,在其中一些实施例中,三代测序技术可以是:单分子实时荧光测序技术(SMRT,Pacific Biosciences)、纳米孔测序技术[Oxford NanoporeTechnologies(ONT),Genia Technologies and Stratos Genomics(罗氏诊断)]、纳米门测序技术(Nanogate,Quantum Biosystems)、基于DNA水解测序技术(Sequencing by de-synthesis,pyrophosphorolysis,Base4)以及本领域已知的任何其他测序方法。1) Sequencing technology, including second-generation sequencing technology and third-generation sequencing technology. The principle of second-generation sequencing technology is massive parallel sequencing (MPS). In some embodiments, the second-generation sequencing technology can be: 1. Based on DNA polymerase synthesis sequencing technology (Sequencing by synthesis technology, SBS), represented by Illumina (reversible terminator sequencing), Thermo Fisher/Life Technologies (Ion Torrent), GenapSys, Roche Diagnostics (454 pyrophosphate sequencing), etc.; 2. Based on DNA ligase ligation sequencing technology (Sequencing by ligation technology, SBL), represented by BGI/Complete Genomics (composite probe-anchor molecule ligation, cPAL), Thermo Fisher/Applied Biosystems (Sequencing by Oligonucleotide Ligation and Detection, SOLiD), etc. The third-generation sequencing technology is a single-molecule sequencing technology. In some embodiments, the third-generation sequencing technology can be: single-molecule real-time fluorescence sequencing technology (SMRT, Pacific Biosciences), nanopore sequencing technology [Oxford Nanopore Technologies (ONT), Genia Technologies and Stratos Genomics (Roche Diagnostics)], nanogate sequencing technology (Nanogate, Quantum Biosystems), DNA hydrolysis-based sequencing technology (Sequencing by de-synthesis, pyrophosphorolysis, Base4) and any other sequencing methods known in the art.
2)微阵列杂交技术:例如SNP微阵列等;2) Microarray hybridization technology: such as SNP microarray;
3)基于PCR对变异位点的相关检测技术:例如KASP分型法,连接酶检测反应(LDR)分型方法、Taqman探针方法等。3) PCR-based detection technologies for variant sites: such as KASP typing method, ligase detection reaction (LDR) typing method, Taqman probe method, etc.
在其中一个实施例中,所述生物标志物作为血液和/或组织检测的生物标志物。In one embodiment, the biomarker is detected as a biomarker in blood and/or tissue.
可以理解的,上述检测也同样适用于其他生物样本类型。但采用上述样本,具有样本易得,适用性广等优势。It is understandable that the above detection is also applicable to other types of biological samples. However, the use of the above samples has the advantages of easy sample acquisition and wide applicability.
本发明还公开了检测生物样本中上述的生物标志物的试剂在制备非小细胞肺癌分型诊断试剂或诊断设备中的应用。The present invention also discloses the use of a reagent for detecting the above-mentioned biomarkers in a biological sample in the preparation of a non-small cell lung cancer typing diagnostic reagent or diagnostic equipment.
可以理解的,上述试剂例如是试剂盒,设备例如是一体化检测设备,可根据具体应用需求调整。It is understandable that the above reagents, such as a test kit, and the equipment, such as an integrated detection equipment, can be adjusted according to specific application requirements.
本发明还公开了一种用于非小细胞肺癌分型诊断的检测试剂盒,包括用于检测权利要求上述的生物标志物的试剂。The present invention also discloses a detection kit for non-small cell lung cancer typing diagnosis, comprising a reagent for detecting the above-mentioned biomarker in the claim.
本发明还公开了一种非小细胞肺癌分型诊断的系统,包括:The present invention also discloses a system for diagnosing non-small cell lung cancer typing, comprising:
分析装置:用于获取待评估对象生物样本中上述的生物标志物的基因变异情况,输入评估模型进行分型评估;Analytical device: used to obtain the gene variation of the above-mentioned biomarkers in the biological sample of the subject to be evaluated, and input it into the evaluation model for typing evaluation;
输出装置:用于将上述评估结果输出。Output device: used to output the above evaluation results.
在其中一个实施例中,所述评估模型通过以下方法建立:所述评估模型通过以下方法建立:获取若干腺癌和鳞癌生物样本,测序得到所述生物标志物的基因突变情况,以随机森林模型建立分型模型,模型中mportance=TRUE且ntree=100,mtry=2,即得非小细胞肺癌分型诊断模型。In one embodiment, the evaluation model is established by the following method: the evaluation model is established by the following method: obtaining a number of adenocarcinoma and squamous cell carcinoma biological samples, sequencing to obtain the gene mutation of the biomarkers, and establishing a typing model using a random forest model, in which importance = TRUE and ntree = 100, mtry = 2, to obtain a non-small cell lung cancer typing diagnosis model.
与现有技术相比,本发明具有以下有益效果:Compared with the prior art, the present invention has the following beneficial effects:
本发明的用于非小细胞肺癌分型的生物标志物,可利用上述生物标志物的组合,对肺小细胞癌中的鳞癌和腺癌进行分型,最少使用5个标志物时,其分型诊断ROC曲线的AUC为0.700,进一步增加标志物至10个时,分型诊断ROC曲线的AUC为0.734,当使用所有生物标志物时,其AUC可达0.786,具有优异的诊断能力。The biomarkers for non-small cell lung cancer classification of the present invention can utilize the combination of the above-mentioned biomarkers to classify squamous cell carcinoma and adenocarcinoma in small cell lung cancer. When at least 5 markers are used, the AUC of the classification diagnosis ROC curve is 0.700. When the markers are further increased to 10, the AUC of the classification diagnosis ROC curve is 0.734. When all biomarkers are used, the AUC can reach 0.786, which has excellent diagnostic ability.
对肺小细胞癌患者,可通过上述生物标志物的基因检测报告结果,对病理结果有一个相互验证的过程,确保病理诊断结果无误,对下一步精准治疗起到了重要作用。For patients with small cell lung cancer, the genetic test report results of the above-mentioned biomarkers can be used to conduct a mutual verification process with the pathological results to ensure that the pathological diagnosis results are correct, which plays an important role in the next step of precise treatment.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1为使用所有标志物建立模型,在验证集中区分效果的ROC-AUC图。FIG1 is a ROC-AUC diagram of the model established using all markers to distinguish the effects in the validation set.
图2为使用20个标志物组合建立模型,在验证集中区分效果的ROC-AUC图。FIG2 is a ROC-AUC graph showing the effect of differentiating between the models established using a combination of 20 markers in the validation set.
图3为使用10个标志物组合建立模型,在验证集中区分效果的ROC-AUC图。FIG3 is a ROC-AUC graph showing the effect of distinguishing the models established using a combination of 10 markers in the validation set.
图4为使用5个标志物组合建立模型,在验证集中区分效果的ROC-AUC图。FIG4 is a ROC-AUC graph showing the effect of distinguishing the models established using a combination of five markers in the validation set.
具体实施方式DETAILED DESCRIPTION
为了便于理解本发明,下面将参照相关附图对本发明进行更全面的描述。附图中给出了本发明的较佳实施例。但是,本发明可以以许多不同的形式来实现,并不限于本文所描述的实施例。相反地,提供这些实施例的目的是使对本发明的公开内容的理解更加透彻全面。In order to facilitate the understanding of the present invention, the present invention will be described more fully below with reference to the relevant drawings. The preferred embodiments of the present invention are given in the drawings. However, the present invention can be implemented in many different forms and is not limited to the embodiments described herein. On the contrary, the purpose of providing these embodiments is to make the understanding of the disclosure of the present invention more thorough and comprehensive.
除非另有定义,本文所使用的所有的技术和科学术语与属于本发明的技术领域的技术人员通常理解的含义相同。本文中在本发明的说明书中所使用的术语只是为了描述具体的实施例的目的,不是旨在于限制本发明。本文所使用的术语“和/或”包括一个或多个相关的所列项目的任意的和所有的组合。Unless otherwise defined, all technical and scientific terms used herein have the same meaning as those generally understood by those skilled in the art of the present invention. The terms used herein in the specification of the present invention are only for the purpose of describing specific embodiments and are not intended to limit the present invention. The term "and/or" used herein includes any and all combinations of one or more related listed items.
以下实施例所用试剂,如非特别说明,均为市售可得;以下实施例所用方法,如非特别说明,均为常规方法实现。The reagents used in the following examples, unless otherwise specified, are all commercially available; the methods used in the following examples, unless otherwise specified, are all implemented by conventional methods.
说明:illustrate:
TCGA:全称为The Cancer Genome Atlas,包括了30+种肿瘤的数据。是美国国家癌症研究所(National Cancer Institute,NCI)和国家人类基因组研究所(NationalHumanGenome Research Institute,NHGRI)发起的癌症基因组图谱(The Cancer GenomeAtlas,TCGA)计划。是一个全面的、多维的,针对多种癌症基因组的图谱。涉及的领域不仅包括基因组测序,还包括转录组、甲基化等表观组学测序以及最终的整合分析,并将它们与临床和影像数据相关联。TCGA: The full name is The Cancer Genome Atlas, which includes data on more than 30 types of tumors. It is a Cancer Genome Atlas (TCGA) project initiated by the National Cancer Institute (NCI) and the National Human Genome Research Institute (NHGRI). It is a comprehensive, multidimensional atlas for a variety of cancer genomes. The areas involved include not only genome sequencing, but also epigenomic sequencing such as transcriptome and methylation, as well as final integrated analysis, and linking them with clinical and imaging data.
本发明中,腺癌患者,指病理检测结果得到2位及以上病理专家共同认定的非小细胞腺癌患者;鳞癌患者,指病理检测结果得到2位及以上病理专家共同认定的非小细胞鳞癌患者。In the present invention, an adenocarcinoma patient refers to a non-small cell adenocarcinoma patient whose pathological test results are jointly confirmed by two or more pathology experts; a squamous cell carcinoma patient refers to a non-small cell squamous cell carcinoma patient whose pathological test results are jointly confirmed by two or more pathology experts.
实施例1Example 1
基于公共数据库,对非小细胞肺癌病理亚型分型的变异基因标记物初筛,具体包括以下步骤:Based on the public database, the initial screening of variant gene markers for pathological subtype classification of non-small cell lung cancer includes the following steps:
1、候选突变位点筛选。1. Screening of candidate mutation sites.
从TCGA数据库(https://portal.gdc.cancer.gov/)获取非小细胞肿瘤患者的肿瘤组织全基因组测序数据:本研究共下载了561例非小细胞肺癌患者(其中腺癌286例,鳞癌275例)全基因组测序数据,用四种不同软件(mutect,varscan,muse和somaticsniper)分别计算突变位点,取四种突变软件至少两个软件在样本中同时call到突变位点作为候选突变位点。The whole genome sequencing data of tumor tissues of patients with non-small cell tumors were obtained from the TCGA database (https://portal.gdc.cancer.gov/): In this study, the whole genome sequencing data of 561 patients with non-small cell lung cancer (including 286 cases of adenocarcinoma and 275 cases of squamous cell carcinoma) were downloaded, and the mutation sites were calculated using four different software (mutect, varscan, muse and somaticsniper). The mutation sites that were simultaneously called by at least two of the four mutation software in the sample were taken as candidate mutation sites.
2、潜在标志物筛选。2. Screening of potential markers.
根据腺癌组别数据集和鳞癌组别数据集进行差异分析:采用fisher.test分析,选取p≤0.05的变异基因作为潜在标志物,见下表。Differential analysis was performed based on the adenocarcinoma group data set and the squamous cell carcinoma group data set: fisher.test analysis was used, and variant genes with p≤0.05 were selected as potential markers, see the table below.
表1.潜在标志物Table 1. Potential markers
实施例2Example 2
本实施例对上述得到的潜在标志物,对临床样本进行分析验证,包括以下步骤:This example analyzes and verifies the potential markers obtained above on clinical samples, including the following steps:
1、组织样本获取:1. Tissue sample acquisition:
从暨南大学收集374例病理经相关专家鉴定为非小细胞肺癌(191例腺癌,183例鳞癌)的相关FFPE切片样本。A total of 374 FFPE section samples were collected from Jinan University, which were identified as non-small cell lung cancer (191 adenocarcinomas and 183 squamous cell carcinomas) by relevant experts.
2、样本测序分析:2. Sample sequencing analysis:
FFPE组织样品由第三方(明码生物技术公司)进行全基因组测序分析。FFPE tissue samples were analyzed by whole genome sequencing by a third party (NextCode Biotech).
3、模型建立。3. Model establishment.
3.1模型初步建立。3.1 Preliminary establishment of the model.
使用上述实施例1得到的所有潜在标志物,对独立的验证集,即上述191例腺癌和183例鳞癌的非小细胞肺癌病人组织样本进行检测判断,利用随机森林模型进行建模分析(R包randomForest),按照6:4的切分,进行20次重复,通过对模型建立条件的反复摸索,尝试,设置其中importance=TRUE且ntree=100,mtry=2,得到模型的ROC曲线AUC高达0.786,见图1。All potential markers obtained in Example 1 above were used to detect and judge an independent validation set, i.e., the tissue samples of the 191 adenocarcinoma and 183 squamous cell carcinoma non-small cell lung cancer patients mentioned above. The random forest model was used for modeling analysis (R package randomForest), and 20 repetitions were performed according to the 6:4 split. Through repeated exploration and trial of the model establishment conditions, importance = TRUE, ntree = 100, and mtry = 2 were set, and the ROC curve AUC of the model was obtained to be as high as 0.786, as shown in Figure 1.
3.2对标志物进行筛选。3.2 Screening of markers.
通过利用随机森林模型进行建模分析,在191例腺癌和183例鳞癌的非小细胞肺癌病人组织样本进行检测判断,利用随机森林模型进行建模分析,按照6:4的切分,进行20次重复,再以pearson correlation对所有marker进行两两相关性分析,并通过穷尽随机组合方法,获得20个MARKER最优组合,得到模型的ROC曲线AUC可高达0.747,见图2。By using the random forest model for modeling and analysis, 191 adenocarcinoma and 183 squamous cell carcinoma tissue samples of non-small cell lung cancer patients were tested and judged. The random forest model was used for modeling and analysis, and 20 repetitions were performed according to the 6:4 split. Pearson correlation was used to perform pairwise correlation analysis on all markers, and the optimal combination of 20 MARKERs was obtained through the exhaustive random combination method. The ROC curve AUC of the model can be as high as 0.747, as shown in Figure 2.
3.2对标志物进行优选。3.2 Optimize markers.
通过利用随机森林模型进行建模分析,在191例腺癌和183例鳞癌的非小细胞肺癌病人组织样本进行检测判断,利用随机森林模型进行建模分析,按照6:4的切分,进行20次重复,同时通过pearson correlation对所有marker进行两两相关性分析,并通过穷尽随机组合方法,获得10个MARKER最优组合,得到模型的ROC曲线AUC可高达0.734,见图3。By using the random forest model for modeling and analysis, 191 adenocarcinoma and 183 squamous cell carcinoma tissue samples of non-small cell lung cancer patients were tested and judged. The random forest model was used for modeling and analysis, and 20 repetitions were performed according to the 6:4 split. At the same time, all markers were analyzed pairwise by Pearson correlation, and the optimal combination of 10 MARKERs was obtained by exhaustive random combination method. The ROC curve AUC of the model can be as high as 0.734, as shown in Figure 3.
3.3对标志物进行进一步优选。3.3 Further optimization of markers.
通过利用随机森林模型进行建模分析,在191例腺癌和183例鳞癌的非小细胞肺癌病人组织样本进行检测判断,利用随机森林模型进行建模分析,按照6:4的切分,进行20次重复,通过pearson correlation对所有marker进行两两相关性分析,并通过穷尽随机组合方法,获得5个MARKER最优组合,得到模型的ROC曲线AUC可高达0.700,见图4。By using the random forest model for modeling and analysis, 191 adenocarcinoma and 183 squamous cell carcinoma tissue samples of non-small cell lung cancer patients were tested and judged. The random forest model was used for modeling and analysis, and 20 repetitions were performed according to the 6:4 split. Pearson correlation was used to analyze the correlation between all markers, and the optimal combination of 5 MARKERs was obtained through the exhaustive random combination method. The ROC curve AUC of the model can be as high as 0.700, as shown in Figure 4.
实施例3Example 3
选取来源于暨南大学,不同于实施例2样本集的13例临床判定为非小细胞肺癌的样本,采用上述实施例2中建立得到20个MARKER组合模型进行分析,并将其分析结果与临床专家判断结果进行比较,结果如下表所示。Thirteen samples from Jinan University that were clinically diagnosed as non-small cell lung cancer, which were different from the sample set in Example 2, were selected for analysis using the 20 MARKER combination models established in Example 2. The analysis results were compared with the clinical expert judgment results. The results are shown in the following table.
表2.临床验证结果Table 2. Clinical validation results
从上述结果可以看出,采用本发明的生物标志物,以上述模型对肺小细胞癌中鳞癌或腺癌的分型判断较为准确,与专家判断一致性达76.9%以上。From the above results, it can be seen that the biomarkers of the present invention and the above model are more accurate in the classification of squamous cell carcinoma or adenocarcinoma in small cell lung carcinoma, with a consistency of more than 76.9% with the expert judgment.
以上所述实施例的各技术特征可以进行任意的组合,为使描述简洁,未对上述实施例中的各个技术特征所有可能的组合都进行描述,然而,只要这些技术特征的组合不存在矛盾,都应当认为是本说明书记载的范围。The technical features of the above-described embodiments may be arbitrarily combined. To make the description concise, not all possible combinations of the technical features in the above-described embodiments are described. However, as long as there is no contradiction in the combination of these technical features, they should be considered to be within the scope of this specification.
以上所述实施例仅表达了本发明的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对发明专利范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本发明构思的前提下,还可以做出若干变形和改进,这些都属于本发明的保护范围。因此,本发明专利的保护范围应以所附权利要求为准。The above-mentioned embodiments only express several implementation methods of the present invention, and the descriptions thereof are relatively specific and detailed, but they cannot be understood as limiting the scope of the invention patent. It should be pointed out that, for ordinary technicians in this field, several variations and improvements can be made without departing from the concept of the present invention, and these all belong to the protection scope of the present invention. Therefore, the protection scope of the patent of the present invention shall be subject to the attached claims.
Claims (2)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110505178.1A CN113186287B (en) | 2021-05-10 | 2021-05-10 | Biomarker for non-small cell lung cancer typing and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110505178.1A CN113186287B (en) | 2021-05-10 | 2021-05-10 | Biomarker for non-small cell lung cancer typing and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113186287A CN113186287A (en) | 2021-07-30 |
CN113186287B true CN113186287B (en) | 2023-03-24 |
Family
ID=76988688
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110505178.1A Active CN113186287B (en) | 2021-05-10 | 2021-05-10 | Biomarker for non-small cell lung cancer typing and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113186287B (en) |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU2013337277B2 (en) | 2012-11-05 | 2018-03-08 | Foundation Medicine, Inc. | Novel NTRK1 fusion molecules and uses thereof |
CA2898326C (en) | 2013-01-18 | 2022-05-17 | Foundation Medicine, Inc. | Methods of treating cholangiocarcinoma |
US20240093304A1 (en) * | 2020-12-30 | 2024-03-21 | Foundation Medicine, Inc. | Alk fusion genes and uses thereof |
CN114295706B (en) * | 2021-09-28 | 2024-11-01 | 岛津企业管理(中国)有限公司 | Statistical non-targeted non-small cell lung cancer pathological typing method |
CN114134228B (en) * | 2021-10-09 | 2024-05-03 | 复旦大学附属中山医院 | Kit, system and storage medium for evaluating PI3K/Akt/mTOR pathway related gene mutation and application thereof |
CN113881777B (en) * | 2021-11-12 | 2023-12-15 | 首都医科大学附属北京胸科医院 | Kit applied to environmental pollution and cancerogenic risk assessment |
CN114214409B (en) * | 2021-12-23 | 2024-03-12 | 深圳康华君泰生物科技有限公司 | Biomarker for esophageal carcinoma typing and application thereof |
CN115595370A (en) * | 2022-11-11 | 2023-01-13 | 常州国药医学检验实验室有限公司(Cn) | A combination of gene transcript markers and a typing diagnostic device for typing and diagnosing non-small cell lung cancer |
GB2630774A (en) * | 2023-06-07 | 2024-12-11 | Curenetics Ltd | Lung cancer biomarkers |
CN117535402B (en) * | 2023-12-28 | 2024-05-31 | 湖南家辉生物技术有限公司 | Application of FRMPD gene mutant as detection target, detection reagent with FRMPD gene mutant and detection kit |
CN118064596B (en) * | 2024-04-22 | 2024-06-28 | 北京市肿瘤防治研究所 | Prognosis prediction marker for small cell lung cancer immunotherapy and application thereof |
CN119193823A (en) * | 2024-04-29 | 2024-12-27 | 西安交通大学医学院第二附属医院 | Use of a reagent for detecting methylation levels of molecular markers in preparing a product for diagnosing lung adenocarcinoma |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2015137406A1 (en) * | 2014-03-12 | 2015-09-17 | 学校法人順天堂 | Method for differentiating between lung squamous cell carcinoma and lung adenocarcinoma |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2806274A1 (en) * | 2013-05-24 | 2014-11-26 | AIT Austrian Institute of Technology GmbH | Lung cancer diagnostic method and means |
CA2950623A1 (en) * | 2014-05-30 | 2015-12-03 | Myla LAI-GOLDMAN | Methods for typing of lung cancer |
KR102086935B1 (en) * | 2015-11-05 | 2020-03-09 | 비지아이 션전 | Biomarkers and Their Uses for Detecting Lung Adenocarcinoma |
KR101853118B1 (en) * | 2016-09-02 | 2018-04-30 | 주식회사 바이오인프라생명과학 | Complex biomarker group for detecting lung cancer in a subject, lung cancer diagnostic kit using the same, method for detecting lung cancer using information on complex biomarker and computing system executing the method |
CN108548929A (en) * | 2018-04-11 | 2018-09-18 | 谢丽 | Detect application of the articles for use of biomarker expression level in indicating cancerous state |
CN112375826B (en) * | 2020-12-03 | 2021-08-27 | 远见生物科技(上海)有限公司 | Circular RNA composition marker for identifying non-small cell lung cancer subtype and application thereof |
-
2021
- 2021-05-10 CN CN202110505178.1A patent/CN113186287B/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2015137406A1 (en) * | 2014-03-12 | 2015-09-17 | 学校法人順天堂 | Method for differentiating between lung squamous cell carcinoma and lung adenocarcinoma |
Also Published As
Publication number | Publication date |
---|---|
CN113186287A (en) | 2021-07-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113186287B (en) | Biomarker for non-small cell lung cancer typing and application thereof | |
JP7169002B2 (en) | Use of size and number abnormalities in plasma DNA for cancer detection | |
WO2018137678A1 (en) | Second generation sequencing-based method for simultaneously detecting microsatellite locus stability and genomic changes | |
US20150038376A1 (en) | Thyroid cancer biomarker | |
US12234515B2 (en) | Enhancement of cancer screening using cell-free viral nucleic acids | |
TW202039860A (en) | Cell-free dna end characteristics | |
US20200219587A1 (en) | Systems and methods for using fragment lengths as a predictor of cancer | |
WO2020224159A1 (en) | Next generation sequencing-based panel for detecting glioma, detection kit, detection method, and application thereof | |
CN114214409B (en) | Biomarker for esophageal carcinoma typing and application thereof | |
CN116162702A (en) | A CRISPR-based digital quantitative detection method for gene mutation frequency and its application | |
US20240327924A1 (en) | Method of mutation detection in a liquid biopsy | |
TW201250245A (en) | DNA methylation biomarkers for prognosis prediction of lung adenocarcinoma and a use thereof in clinical molecular prognosis of lung adenocarcinoma | |
CN115472294A (en) | A model for predicting the transformation rate of patients with small cell transformed lung adenocarcinoma and its construction method | |
Beaver et al. | Circulating cell-free DNA for molecular diagnostics and therapeutic monitoring | |
CN118675611A (en) | A comprehensive analytical approach to identify miRNA markers of peripheral nerve invasion in patients with colorectal cancer | |
BEAVER et al. | CIRCULATING CELL-FREE DNA |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20241017 Address after: 215334 Station 33, Room 605, Science and Technology Plaza, Qianjin East Road, Kunshan Development Zone, Suzhou City, Jiangsu Province Patentee after: Kanghua Juntai (Kunshan) Biotechnology Co.,Ltd. Country or region after: China Address before: 518064 B215, building 7, Shenzhen Bay science and technology ecological park, 1819 Shahe West Road, high tech Zone community, Yuehai street, Nanshan District, Shenzhen, Guangdong Patentee before: Shenzhen Kanghua Juntai Biotechnology Co.,Ltd. Country or region before: China |
|
TR01 | Transfer of patent right |