[go: up one dir, main page]

CN115575635A - A diagnostic marker for cholangiocarcinoma and its screening method and application - Google Patents

A diagnostic marker for cholangiocarcinoma and its screening method and application Download PDF

Info

Publication number
CN115575635A
CN115575635A CN202211197099.XA CN202211197099A CN115575635A CN 115575635 A CN115575635 A CN 115575635A CN 202211197099 A CN202211197099 A CN 202211197099A CN 115575635 A CN115575635 A CN 115575635A
Authority
CN
China
Prior art keywords
bile
cholangiocarcinoma
clu
markers
diagnosis
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211197099.XA
Other languages
Chinese (zh)
Inventor
孟文勃
高龙
岳平
李书艳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
First Hospital of Lanzhou University
Original Assignee
First Hospital of Lanzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by First Hospital of Lanzhou University filed Critical First Hospital of Lanzhou University
Priority to CN202211197099.XA priority Critical patent/CN115575635A/en
Publication of CN115575635A publication Critical patent/CN115575635A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/53Immunoassay; Biospecific binding assay; Materials therefor
    • G01N33/574Immunoassay; Biospecific binding assay; Materials therefor for cancer
    • G01N33/57484Immunoassay; Biospecific binding assay; Materials therefor for cancer involving compounds serving as markers for tumor, cancer, neoplasia, e.g. cellular determinants, receptors, heat shock/stress proteins, A-protein, oligosaccharides, metabolites
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N30/00Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
    • G01N30/02Column chromatography
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N30/00Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
    • G01N30/02Column chromatography
    • G01N30/04Preparation or injection of sample to be analysed
    • G01N30/06Preparation
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N30/00Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
    • G01N30/02Column chromatography
    • G01N30/62Detectors specially adapted therefor
    • G01N30/72Mass spectrometers
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N30/00Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
    • G01N30/02Column chromatography
    • G01N30/86Signal analysis
    • G01N30/8675Evaluation, i.e. decoding of the signal into analytical information
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/53Immunoassay; Biospecific binding assay; Materials therefor
    • G01N33/573Immunoassay; Biospecific binding assay; Materials therefor for enzymes or isoenzymes
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/53Immunoassay; Biospecific binding assay; Materials therefor
    • G01N33/574Immunoassay; Biospecific binding assay; Materials therefor for cancer
    • G01N33/57407Specifically defined cancers
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/68Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/72Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving blood pigments, e.g. haemoglobin, bilirubin or other porphyrins; involving occult blood
    • G01N33/728Bilirubin; including biliverdin
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/92Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving lipids, e.g. cholesterol, lipoproteins, or their receptors
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/20ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N2333/00Assays involving biological materials from specific organisms or of a specific nature
    • G01N2333/435Assays involving biological materials from specific organisms or of a specific nature from animals; from humans
    • G01N2333/46Assays involving biological materials from specific organisms or of a specific nature from animals; from humans from vertebrates
    • G01N2333/47Assays involving proteins of known structure or function as defined in the subgroups
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N2333/00Assays involving biological materials from specific organisms or of a specific nature
    • G01N2333/90Enzymes; Proenzymes
    • G01N2333/91Transferases (2.)
    • G01N2333/91045Acyltransferases (2.3)
    • G01N2333/91074Aminoacyltransferases (general) (2.3.2)
    • G01N2333/9108Aminoacyltransferases (general) (2.3.2) with definite EC number (2.3.2.-)

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Immunology (AREA)
  • Chemical & Material Sciences (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Hematology (AREA)
  • Urology & Nephrology (AREA)
  • General Health & Medical Sciences (AREA)
  • Pathology (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • Analytical Chemistry (AREA)
  • Cell Biology (AREA)
  • Biotechnology (AREA)
  • Medicinal Chemistry (AREA)
  • Food Science & Technology (AREA)
  • Microbiology (AREA)
  • Public Health (AREA)
  • Hospice & Palliative Care (AREA)
  • Medical Informatics (AREA)
  • Oncology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Biophysics (AREA)
  • Endocrinology (AREA)
  • Library & Information Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Epidemiology (AREA)
  • Primary Health Care (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

The invention relates to the field of clinical examination and diagnosis, in particular to a bile duct cancer diagnosis marker and a screening method and application thereof, wherein the diagnosis marker comprises the following 6 marker combinations: clusterin, indirect bilirubin, low density lipoprotein cholesterol, gamma glutamyltransferase, carbohydrate antigen 19-9, triglyceride; the invention adopts proteomics technology and artificial intelligence data analysis technology to obtain a plurality of diagnostic markers suitable for bile duct cancer diagnosis, the diagnostic markers comprise markers in bile and serum, the AUC of the marker combination is obviously higher than that of the independent CLU, the screening method of the diagnostic markers has strong operability, the model construction method is simple, the sensitivity of the obtained diagnostic markers is high, the specificity is good, the diagnostic markers are suitable for bile duct cancer diagnosis, the defects of the existing imaging diagnosis mode can be well made up, the diagnosis of the invention is simple and quick, the early diagnosis and early treatment of bile duct cancer are facilitated, and the invention has good clinical use and popularization values.

Description

一种胆管癌诊断标志物及其筛选方法和应用A diagnostic marker for cholangiocarcinoma and its screening method and application

技术领域technical field

本发明涉及临床检验诊断领域,具体涉及一种胆管癌诊断标志物及其筛选方法和应用。The invention relates to the field of clinical examination and diagnosis, in particular to a diagnostic marker for cholangiocarcinoma and a screening method and application thereof.

背景技术Background technique

胆管癌(CCA)被认为是一种高度侵袭性的恶性肿瘤。根据病变的解剖部位,CCA可分为肝内胆管癌(iCCA)、肝门部胆管癌(pCCA)和远端胆管癌(dCCA)[1]。作为肝胆系统第二常见的恶性肿瘤,胆管癌预后较差,5年生存率低(7%-20%),病死率高(约占全球每年癌症相关死亡人数的2%),主要归因于早期诊断困难[1]。由于其临床特征不明显,解剖位置较深,使得诊断胆管癌很困难。同时,多项研究的术后病理结果指出,10%-25%接收手术治疗的疑似胆管癌患者最终没有癌细胞[4,5],因此迫切地要求研究人员开发出更准确的胆管癌诊断工具。Cholangiocarcinoma (CCA) is considered a highly aggressive malignancy. According to the anatomical location of the lesion, CCA can be divided into intrahepatic cholangiocarcinoma (iCCA), hilar cholangiocarcinoma (pCCA) and distal cholangiocarcinoma (dCCA) [1] . As the second most common malignant tumor of the hepatobiliary system, cholangiocarcinoma has a poor prognosis, a low 5-year survival rate (7%-20%), and a high mortality rate (about 2% of the annual global cancer-related deaths), mainly due to Early diagnosis is difficult [1] . Diagnosis of cholangiocarcinoma is difficult due to its indistinct clinical features and deep anatomical location. At the same time, the postoperative pathological results of several studies pointed out that 10%-25% of patients with suspected cholangiocarcinoma who received surgical treatment ended up with no cancer cells [4,5] , so researchers are urgently required to develop more accurate diagnostic tools for cholangiocarcinoma .

目前,胆管癌主要通过影像学方法检测,但其诊断效果并不理想,准确度不高,敏感性仅为6%-71.9%[2]。血清CA19-9常用于胆管癌临床诊断,但其敏感性和特异性较差[3]。胆汁主要由肝细胞和胆管上皮细胞分泌,当发生胆道疾病时常常会出现胆汁成分的异常变化。胆管癌组织中的癌症相关蛋白可以被分泌到胆汁中,并可能具有成为诊断标志物的潜力[6]。例如,发明专利CN108982854B公开了蛋白质MUC13在制备诊断肝内胆管癌的试剂中的应用;发明专利CN111413447A公开了鹅去氧胆酸或/和牛磺酸鹅去氧胆酸在胆管癌诊断方面的应用。At present, cholangiocarcinoma is mainly detected by imaging methods, but its diagnostic effect is not satisfactory, the accuracy is not high, and the sensitivity is only 6%-71.9% [2] . Serum CA19-9 is commonly used in the clinical diagnosis of cholangiocarcinoma, but its sensitivity and specificity are poor [3] . Bile is mainly secreted by hepatocytes and bile duct epithelial cells, and abnormal changes in bile components often occur when biliary tract diseases occur. Cancer-associated proteins in cholangiocarcinoma tissue can be secreted into bile and may have the potential to be diagnostic markers [6] . For example, invention patent CN108982854B discloses the application of protein MUC13 in the preparation of reagents for diagnosing intrahepatic cholangiocarcinoma; invention patent CN111413447A discloses the application of chenodeoxycholic acid or/and taurine chenodeoxycholic acid in the diagnosis of cholangiocarcinoma.

在发明人的前期研究中,首次发现并证实了胆汁中的癌症相关蛋白簇集蛋白CLU能够被用于诊断胆管癌,同时,发明人还研究了其与CA19-9结合在胆管癌中的诊断效果,并申请了发明专利CLU及其组合物在诊断胆管癌中的应用及胆管癌诊断试剂盒(2021103642823),前期的研究结果显示,胆汁中的癌症相关蛋白簇集蛋白CLU和CA19-9表达量在胆管癌患者胆汁中的表达量显著高于胆管结石患者胆汁中的表达量,说明CLU可以作为肿瘤标志物用于诊断胆管癌。In the inventor's previous research, it was first discovered and confirmed that the cancer-related protein clusterin CLU in bile can be used to diagnose cholangiocarcinoma. At the same time, the inventor also studied its combination with CA19-9 in the diagnosis of cholangiocarcinoma effect, and applied for the invention patent of CLU and its composition in the diagnosis of cholangiocarcinoma and the diagnostic kit for cholangiocarcinoma (2021103642823). The previous research results showed that the expression of cancer-related proteins clusterin CLU and CA19-9 in bile The expression level of CLU in the bile of patients with cholangiocarcinoma was significantly higher than that in the bile of patients with bile duct stones, indicating that CLU can be used as a tumor marker for the diagnosis of cholangiocarcinoma.

发明人在后续研究过程中发现,许多血清标志物,如碱性磷酸酶和胆红素,在胆管癌和良性胆道疾病中都可能发生变化,因此它们不能单独用于胆管癌诊断[7]。胆汁中的差异蛋白主要反映局部变化,而血清标志物主要反映胆管癌进展的全身系统性变化[2]。在前期研究的基础上,发明人为了进一步提高诊断的准确性,将部分血清指标加入,结合机器学习模型,筛选出了胆汁和血清的标志物组合,所述的标志物组合可以提高鉴别胆管癌和良性胆道狭窄的准确性,并具体公开了标志物的筛选方法。During follow-up research, the inventors found that many serum markers, such as alkaline phosphatase and bilirubin, may change in cholangiocarcinoma and benign biliary tract diseases, so they cannot be used alone for the diagnosis of cholangiocarcinoma [7] . Differential proteins in bile mainly reflect local changes, while serum markers mainly reflect systemic changes in the progression of cholangiocarcinoma [2] . On the basis of the previous research, in order to further improve the accuracy of diagnosis, the inventor added some serum indicators, combined with the machine learning model, and screened out the marker combination of bile and serum, which can improve the identification of cholangiocarcinoma and the accuracy of benign biliary strictures, and specifically disclosed a screening method for markers.

参考文献:references:

[1]Banales J M,Marin J J G,Lamarca A,et al.Cholangiocarcinoma 2020:the next horizon in mechanisms and management[J].Nat Rev GastroenterolHepatol,2020,17(9):557-88.[1] Banales J M, Marin J J G, Lamarca A, et al. Cholangiocarcinoma 2020: the next horizon in mechanisms and management [J]. Nat Rev Gastroenterol Hepatol, 2020, 17(9): 557-88.

[2]

Figure BDA0003870385930000021
T,Metzger J,Husi H,et al.Bile and urine peptide markerprofiles:access keys to molecular pathways and biological processes incholangiocarcinoma[J].J Biomed Sci,2020,27(1):13.[2]
Figure BDA0003870385930000021
T, Metzger J, Husi H, et al. Bile and urine peptide marker profiles: access keys to molecular pathways and biological processes incholangiocarcinoma[J].J Biomed Sci,2020,27(1):13.

[3]Blechacz B,Komuta M,Roskams T,et al.Clinical diagnosis and stagingof cholangiocarcinoma[J].Nat Rev Gastroenterol Hepatol,2011,8(9):512-22.[3] Blechacz B, Komuta M, Roskams T, et al. Clinical diagnosis and staging of cholangiocarcinoma [J]. Nat Rev Gastroenterol Hepatol, 2011, 8(9): 512-22.

[4]Nuzzo G,Giuliante F,Ardito F,et al.Improvement in perioperativeand long-term outcome after surgical treatment of hilar cholangiocarcinoma:results of an Italian multicenter analysis of 440patients[J].Arch Surg,2012,147(1):26-34.[4] Nuzzo G, Giuliante F, Ardito F, et al. Improvement in perioperative and long-term outcome after surgical treatment of hilar cholangiocarcinoma: results of an Italian multicenter analysis of 440 patients [J]. Arch Surg, 2012, 147(1) :26-34.

[5]Banales J M,Cardinale V,Carpino G,et al.Expert consensus document:Cholangiocarcinoma:current knowledge and future perspectives consensusstatement from the European Network for the Study of Cholangiocarcinoma(ENS-CCA)[J].Nat Rev Gastroenterol Hepatol,2016,13(5):261-80.[5] Banales J M, Cardinale V, Carpino G, et al. Expert consensus document: Cholangiocarcinoma: current knowledge and future perspectives consensus statement from the European Network for the Study of Cholangiocarcinoma (ENS-CCA) [J]. Nat Rev Gastroenterol Hepatol, 2016,13(5):261-80.

[6]Lankisch T O,Metzger J,Negm A A,et al.Bile proteomic profilesdifferentiate cholangiocarcinoma from primary sclerosing cholangitis andcholedocholithiasis[J].Hepatology,2011,53(3):875-84.[6] Lankisch T O, Metzger J, Negm A A, et al. Bile proteomic profiles differentiate cholangiocarcinoma from primary sclerosing cholangitis and choledocholithiasis [J]. Hepatology, 2011, 53(3): 875-84.

[7]Rizvi S,Gores G J.Pathogenesis,diagnosis,and management ofcholangiocarcinoma[J].Gastroenterology,2013,145(6):1215-29.[7] Rizvi S, Gores G J. Pathogenesis, diagnosis, and management of cholangiocarcinoma [J]. Gastroenterology, 2013, 145(6): 1215-29.

发明内容Contents of the invention

针对上述技术问题,本发明提供了一种胆管癌诊断标志物,所述的诊断标志物包括以下3种物质组成的组合:簇集蛋白(CLU),间接胆红素(IBIL),低密度脂蛋白胆固醇(LDLC)。In view of the above technical problems, the present invention provides a diagnostic marker for cholangiocarcinoma, which comprises a combination of the following three substances: clusterin (CLU), indirect bilirubin (IBIL), low-density lipoprotein Protein cholesterol (LDLC).

优选的,所述的诊断标志物包括以下4种物质的组合:簇集蛋白(CLU),间接胆红素(IBIL),低密度脂蛋白胆固醇(LDLC),γ-谷氨酰基转移酶(GGT)。Preferably, the diagnostic markers include a combination of the following four substances: clusterin (CLU), indirect bilirubin (IBIL), low-density lipoprotein cholesterol (LDLC), gamma-glutamyl transferase (GGT ).

优选的,所述的诊断标志物包括以下5种物质的组合:簇集蛋白(CLU),间接胆红素(IBIL),低密度脂蛋白胆固醇(LDLC),γ-谷氨酰基转移酶(GGT),糖类抗原19-9(CA19-9)。Preferably, the diagnostic markers include a combination of the following five substances: clusterin (CLU), indirect bilirubin (IBIL), low-density lipoprotein cholesterol (LDLC), gamma-glutamyl transferase (GGT ), carbohydrate antigen 19-9 (CA19-9).

优选的,所述的诊断标志物包括以下6种物质的组合:簇集蛋白(CLU),间接胆红素(IBIL),低密度脂蛋白胆固醇(LDLC),γ-谷氨酰基转移酶(GGT),糖类抗原19-9(CA19-9),甘油三酯(TG)。Preferably, the diagnostic markers include a combination of the following 6 substances: clusterin (CLU), indirect bilirubin (IBIL), low-density lipoprotein cholesterol (LDLC), gamma-glutamyl transferase (GGT ), carbohydrate antigen 19-9 (CA19-9), triglyceride (TG).

本发明还提供了一种所述的胆管癌诊断标志物的筛选方法,包含以下步骤:The present invention also provides a screening method for the diagnostic markers of cholangiocarcinoma, comprising the following steps:

(1)胆汁标志物的筛选:采用液相色谱质谱联用技术对胆汁和细胞上清液进行蛋白质组学分析,分析鉴定胆管癌和对照组之间的差异表达蛋白;对胆管癌的胆汁和细胞上清液中异常高表达的蛋白取交集,得到胆汁标志物;(1) Screening of bile markers: Proteomic analysis of bile and cell supernatant was carried out by liquid chromatography-mass spectrometry, and the differentially expressed proteins between cholangiocarcinoma and control group were analyzed and identified; The abnormally highly expressed proteins in the cell supernatant were intersected to obtain bile markers;

(2)对步骤(1)筛选得到的胆汁标志物与血清指标混合,使用随机森林方法建立分类预测模型,并将每个标志物按照交叉验证集中预测结果的重要性进行排序,利用R语言的glment软件包,基于10倍交叉验证分类法,将287例患者的所有指标数据分为10组不重叠的部分,其中2组用于测试队列,8组用于训练队列;(2) Mix the bile markers and serum indicators screened in step (1), use the random forest method to establish a classification prediction model, and rank each marker according to the importance of the prediction results in the cross-validation set, and use the R language The glment software package, based on the 10-fold cross-validation classification method, divides all index data of 287 patients into 10 non-overlapping parts, of which 2 groups are used for the test cohort and 8 groups are used for the training cohort;

(3)根据基尼指数≥0.25筛选出12个标志物,将所述的12种标志物纳入Lasso分类器训练集的初始输入变量,只有对分类有贡献的变量被赋予非零权重,当增加标志物的数量,Lasso分类器中准确度、灵敏度和特异度不再上升时,Lasso分类器的性能在测试集上达到最佳的准确度、灵敏度和特异度;(3) According to the Gini index ≥ 0.25, 12 markers are screened out, and the 12 markers are included in the initial input variables of the Lasso classifier training set. Only the variables that contribute to the classification are given non-zero weights. When adding markers The number of objects, when the accuracy, sensitivity and specificity of the Lasso classifier no longer increase, the performance of the Lasso classifier reaches the best accuracy, sensitivity and specificity on the test set;

(4)引用受试者工作特征ROC曲线来评价Lasso模型的最佳诊断性能,以ROC曲线上准确度、灵敏度和特异度最佳为截断点,得到相对最优特征数及组合方式;(4) The receiver operating characteristic ROC curve was used to evaluate the best diagnostic performance of the Lasso model, and the best accuracy, sensitivity and specificity on the ROC curve were used as the cut-off point to obtain the relatively optimal number of features and their combination;

(5)在外部验证集中利用ROC曲线验证步骤(4)得到的相对最优特征数及组合方式,得到适合于胆管癌诊断的标志物组合。(5) Using the relatively optimal number of features and combination methods obtained in the ROC curve verification step (4) in the external verification set to obtain a marker combination suitable for the diagnosis of cholangiocarcinoma.

优选的,所述的随机森林和LASSO在glment版本4.1-3中进行。Preferably, the random forest and LASSO are performed in glment version 4.1-3.

优选的,步骤(2)中共种植了2000棵决策树。Preferably, a total of 2000 decision trees are planted in step (2).

优选的,步骤(2)中所述的血清指标包括37个血液生化指标、24个常规血液指标和两个肿瘤生物标志物。Preferably, the serum indicators described in step (2) include 37 blood biochemical indicators, 24 routine blood indicators and two tumor biomarkers.

本发明还提供了所述的诊断标志物在制备用于胆管癌诊断产品中的应用。The present invention also provides the application of the diagnostic markers in the preparation of diagnostic products for cholangiocarcinoma.

优选的,所述的诊断产品包括试剂盒、试剂或芯片。Preferably, the diagnostic products include kits, reagents or chips.

本发明还提供了一种胆管癌诊断试剂盒,包含所述的诊断标志物。The present invention also provides a diagnostic kit for cholangiocarcinoma, comprising the diagnostic marker.

本发明的有益效果是:(1)本发明的优点是采用蛋白质组学技术以及人工智能数据分析技术得到适合于胆管癌诊断的诊断标志物,所述的诊断标志物包括胆汁和血清中的标志物,实验结果显示,Six-panel表现出良好的预测能力,AUC为0.926,灵敏度为86.2%,特异性为85.3%,明显高于单独的CLU(AUC为0.840)。(2)本发明诊断标志物筛选方法可操作性强,模型构建方法简单,所得诊断标志物灵敏度高,特异性好,适合于胆管癌的诊断。(3)本发明将胆汁和血清标志物结合,进一步增加了诊断的可信度,能够很好地弥补现有影像学诊断模式的不足,并且本发明诊断简单快速,有利于胆管癌的早诊早治,具有很好的临床使用和推广价值。The beneficial effect of the present invention is: (1) the advantage of the present invention is to adopt proteomic technology and artificial intelligence data analysis technology to obtain the diagnostic marker that is suitable for the diagnosis of cholangiocarcinoma, and described diagnostic marker includes the marker in bile and serum The experimental results showed that Six-panel showed good predictive ability, with AUC of 0.926, sensitivity of 86.2%, and specificity of 85.3%, which were significantly higher than that of CLU alone (AUC of 0.840). (2) The diagnostic marker screening method of the present invention has strong operability, simple model construction method, and the obtained diagnostic marker has high sensitivity and good specificity, and is suitable for the diagnosis of cholangiocarcinoma. (3) The present invention combines bile and serum markers to further increase the reliability of diagnosis, and can well make up for the shortcomings of existing imaging diagnosis modes, and the present invention is simple and fast in diagnosis, which is conducive to the early diagnosis of cholangiocarcinoma Early treatment has good clinical application and promotion value.

附图说明Description of drawings

图1胆汁和细胞上清蛋白质组学筛选胆管癌候选标志物的流程图Figure 1 Flowchart of bile and cell supernatant proteomic screening for candidate markers of cholangiocarcinoma

图2胆汁中差异表达蛋白的热图和蛋白聚类的弦树图Figure 2 Heat map of differentially expressed proteins in bile and string dendrogram of protein clustering

注:B利用LC-MS/MS分析胆汁中差异表达蛋白的热图,N1-N4代表4例良性胆管狭窄患者的胆汁,T1-T5代表5例胆管癌患者的胆汁。C通过KEGG分析获得的胆汁中差异表达。Note: B The heat map of differentially expressed proteins in bile analyzed by LC-MS/MS, N1-N4 represent the bile of 4 patients with benign biliary stricture, and T1-T5 represent the bile of 5 patients with cholangiocarcinoma. C Differential expression in bile obtained by KEGG analysis.

图3细胞上清中差异表达蛋白的热图和蛋白聚类的弦树图注:D细胞上清中差异表达蛋白的热图;E通过KEGG分析获得的上清液中差异表达蛋白质的弦树图。Fig. 3 Heat map of differentially expressed proteins in cell supernatant and chord tree of protein clustering Note: D Heat map of differentially expressed proteins in cell supernatant; E String tree of differentially expressed proteins in supernatant obtained by KEGG analysis picture.

图4使用LC-MS/MS分析得到的细胞上清液上调蛋白的维恩图Figure 4 Venn diagram of up-regulated proteins in cell supernatants analyzed by LC-MS/MS

注:F使用LC-MS/MS分析得到的四种胆管癌细胞上清液中54个上调蛋白的维恩图。G胆汁和细胞上清液中的五种上调蛋白,它们的基因名称列在右侧。H CLU蛋白也在另一个外部胆汁组的蛋白质组学数据中上调。NOTE: F Venn diagram of 54 upregulated proteins in supernatants of four cholangiocarcinoma cells analyzed using LC-MS/MS. G five upregulated proteins in bile and cell supernatant, and their gene names are listed on the right. H CLU proteins were also upregulated in proteomic data from another external bile group.

图5胆汁中CLU的免疫印迹和免疫组化图像Figure 5 Western blot and immunohistochemical images of CLU in bile

注:A 8例胆管癌患者和8例良性胆管狭窄患者胆汁中CLU的免疫印迹分析。B代表性免疫组化图像,胆管癌组织和小叶间胆管组织中CLU表达水平(Normal),红色箭头指向小叶间胆管。Note: A Western blot analysis of CLU in the bile of 8 patients with cholangiocarcinoma and 8 patients with benign biliary strictures. B Representative immunohistochemical images, the expression level of CLU in cholangiocarcinoma tissue and interlobular bile duct tissue (Normal), the red arrow points to the interlobular bile duct.

图6CLU在胆管癌中的总生存(OS)和无复发生存曲线Figure 6 Overall survival (OS) and recurrence-free survival curves of CLU in cholangiocarcinoma

图7四种胆管癌细胞系和HIBEpiC细胞系的细胞和细胞上清液中CLU的免疫印迹分析和mRNA水平Figure 7 Western blot analysis and mRNA levels of CLU in cells and cell supernatants of four cholangiocarcinoma cell lines and HIBEpiC cell line

注:E和F来自四种胆管癌细胞系和HIBEpiC细胞系的细胞和细胞上清液中CLU的免疫印迹分析。G四种胆管癌细胞系和HIBEpiC细胞系中CLU的mRNA水平。NOTE: E and F Immunoblot analysis of CLU in cells and cell supernatants from four cholangiocarcinoma cell lines and the HIBEpiC cell line. G mRNA levels of CLU in four cholangiocarcinoma cell lines and HIBEpiC cell line.

图8五个原代胆管癌细胞和HIBEpiC细胞的细胞和细胞上清液中CLU的免疫印迹分析和mRNA水平Figure 8 Western blot analysis and mRNA levels of CLU in cells and cell supernatants of five primary cholangiocarcinoma cells and HIBEpiC cells

注:H和I来自五个原代胆管癌细胞和HIBEpiC细胞的细胞和细胞上清液中CLU的免疫印迹分析。J 5个原代胆管癌细胞和HIBEpiC细胞中CLU的mRNA水平。*P<0.05,**P<0.01,***P<0.001。NOTE: H and I Immunoblot analysis of CLU in cells and cell supernatants from five primary cholangiocarcinoma cells and HIBEpiC cells. J CLU mRNA levels in 5 primary cholangiocarcinoma cells and HIBEpiC cells. *P<0.05, **P<0.01, ***P<0.001.

图9胆汁和血清中CLU或CA19-9的ELISA测定。Figure 9 ELISA determination of CLU or CA19-9 in bile and serum.

注:A用于CLU初步研究的患者队列。B ELISA分析来自40名胆管癌患者、40名良性胆管狭窄患者和40名健康志愿者的血清中的CLU。C ELISA分析40例胆管癌患者和40例良性胆管狭窄患者胆汁中的CLU。D用于CA19-9初步研究的患者队列。E和F40例胆管癌患者和40例良性胆管狭窄患者血清和胆汁中CA19-9的ELISA分析。G交叉验证集中的患者队列。H和I来自123名胆管癌患者和164名良性胆管狭窄患者的胆汁CLU和血清CA19-9的ELISA分析。Note: A Patient cohort used for the CLU pilot study. B ELISA analysis of CLU in serum from 40 patients with cholangiocarcinoma, 40 patients with benign biliary stricture and 40 healthy volunteers. C ELISA analysis of CLU in bile of 40 patients with cholangiocarcinoma and 40 patients with benign biliary stricture. D Patient cohort used for the CA19-9 pilot study. E and F ELISA analysis of CA19-9 in serum and bile of 40 patients with cholangiocarcinoma and 40 patients with benign biliary stricture. G Patient cohort in the cross-validation set. H and I ELISA analysis of bile CLU and serum CA19-9 from 123 patients with cholangiocarcinoma and 164 patients with benign biliary strictures.

图10胆汁和血清中CLU或CA19-9的ROC曲线注:J CLU、CA19-9和CLU&CA19-9的ROC曲线,蓝色曲线代表CLU,绿色曲线代表CA19-9,黄色曲线代表CLU&CA19-9。K CLU&CA19-9的tSNE分析,橙色代表良性胆管狭窄,浅蓝色代表胆管癌。L CLU、CA19-9和CLU&CA19-9的DCA分析,绿色代表CLU&CA19-9,蓝色代表CLU,红色代表CA19-9。*P<0.05,**P<0.01,***P<0.001。Figure 10 ROC curves of CLU or CA19-9 in bile and serum Note: ROC curves of J CLU, CA19-9 and CLU&CA19-9, the blue curve represents CLU, the green curve represents CA19-9, and the yellow curve represents CLU&CA19-9. tSNE analysis of K CLU&CA19-9, orange represents benign bile duct stricture, and light blue represents cholangiocarcinoma. DCA analysis of L CLU, CA19-9 and CLU&CA19-9, green represents CLU&CA19-9, blue represents CLU, red represents CA19-9. *P<0.05, **P<0.01, ***P<0.001.

图11随机森林模型筛选出排名前30的标志物Figure 11 The random forest model screened out the top 30 markers

注:按照准确率(左)和基尼指数(右)排序,越靠近右上方的特征越重要。Note: According to the accuracy rate (left) and Gini index (right), the features closer to the upper right are more important.

图12LASSO模型筛选的多标志物联合诊断模型的准确性Figure 12 The accuracy of the multi-marker joint diagnostic model screened by the LASSO model

注:B LASSO模型筛选的的不同生物标志物组合的多标志物联合诊断模型的AUC和准确度(ACC)值;C多标志物联合诊断模型的敏感性和特异性。Note: B The AUC and accuracy (ACC) values of the multi-marker combined diagnostic model of different biomarker combinations screened by the LASSO model; C The sensitivity and specificity of the multi-marker combined diagnostic model.

图13Six-panel联合诊断模型的准确性分析Figure 13 Accuracy analysis of Six-panel joint diagnosis model

注:D Six-panel及其成员的ROC曲线;E Six-panel的六个标志物之间的相关矩阵,包括CLU、IBIL、LDL-C、CA19-9、GGT和TG,数字代表两个特征之间的相关系数(r);F Six-panel的tSNE分析;G Six-panel、CLU和CA19-9的DCA分析,AUC是曲线下的面积。r≥0.8表示高相关,0.5≤r<0.8表示强相关,0.3≤r<0.5表示弱相关,r<0.3表示无相关。Note: ROC curve of D Six-panel and its members; E Correlation matrix between six markers of Six-panel, including CLU, IBIL, LDL-C, CA19-9, GGT and TG, numbers represent two features The correlation coefficient (r) between; F Six-panel tSNE analysis; G Six-panel, CLU and CA19-9 DCA analysis, AUC is the area under the curve. r≥0.8 means high correlation, 0.5≤r<0.8 means strong correlation, 0.3≤r<0.5 means weak correlation, and r<0.3 means no correlation.

具体实施方式detailed description

下面结合具体实施实例,进一步阐释本发明,本发明的实施例仅用于解释本发明,并不意味着限制本发明的保护范围。The present invention will be further explained below in conjunction with specific implementation examples. The embodiments of the present invention are only used to explain the present invention, and are not meant to limit the protection scope of the present invention.

以下实施例所述的糖类抗原19-9(CA19-9)指一种与胰腺癌、胆囊癌、结肠癌和胃癌等相关的肿瘤标志物,又称胃肠道相关抗原。糖类抗原19-9对胰腺癌有较高的灵敏度和较好的特异性,其阳性率在85%-95%之间。Carbohydrate antigen 19-9 (CA19-9) described in the following examples refers to a tumor marker related to pancreatic cancer, gallbladder cancer, colon cancer and gastric cancer, also known as gastrointestinal tract-related antigen. Carbohydrate antigen 19-9 has higher sensitivity and better specificity for pancreatic cancer, and its positive rate is between 85% and 95%.

以下实施例所述的AUC(Area Under Curve)被定义为ROC曲线下与坐标轴围成的面积,显然这个面积的数值不会大于1。又由于ROC曲线一般都处于y=x这条直线的上方,所以AUC的取值范围在0.5和1之间。AUC越接近1.0,检测方法真实性越高;等于0.5时,则真实性最低,无应用价值。The AUC (Area Under Curve) described in the following embodiments is defined as the area enclosed by the ROC curve and the coordinate axis, and obviously the value of this area will not be greater than 1. And because the ROC curve is generally above the straight line y=x, the value range of AUC is between 0.5 and 1. The closer the AUC is to 1.0, the higher the authenticity of the detection method; when it is equal to 0.5, the authenticity is the lowest and has no application value.

本文所述的“良性胆管狭窄”指由于胆管损伤和复发性胆管炎所致或是先天性而导致的胆管腔瘢痕性缩窄。良性胆管狭窄可由医源性损伤、腹部外伤和胆囊结石、胆管结石、胆管炎症等刺激,导致胆管壁纤维组织增生、管壁变厚、胆管内腔逐渐缩窄,胆管结石和胆管炎是临床上最常见的良性胆管狭窄。临床表现为腹痛、寒战、高热、间歇性黄疸等。早期可行抗生素治疗,但手术治疗是本病的根本治疗方法。"Benign biliary stricture" as used herein refers to cicatricial narrowing of the bile duct lumen due to bile duct injury and recurrent cholangitis or congenital. Benign biliary strictures can be stimulated by iatrogenic injury, abdominal trauma, gallbladder stones, bile duct stones, and bile duct inflammation, leading to the proliferation of fibrous tissue in the bile duct wall, thickening of the tube wall, and gradual narrowing of the lumen of the bile duct. Bile duct stones and cholangitis are clinically The most common benign biliary stricture. Clinical manifestations include abdominal pain, chills, high fever, and intermittent jaundice. Antibiotic treatment is feasible in the early stage, but surgical treatment is the fundamental treatment method for this disease.

本文所述的“胆管癌”指发生在肝外胆管,即左右肝管至胆总管下段的恶性肿瘤。本病病因仍不明确,多发生于50~70岁,男性略多于女性,本病可能与肝胆管结石、原发性硬化性胆管炎有关,胆管癌患者大多数会出现黄疸,通常为无痛性、进行性加重的黄疸,二便异常,大便灰白呈白陶土样,尿液颜色加深如浓茶样,胆管中下段癌可出现胆囊肿大。The "cholangiocarcinoma" mentioned herein refers to malignant tumors that occur in the extrahepatic bile duct, that is, the left and right hepatic ducts to the lower part of the common bile duct. The etiology of this disease is still not clear, mostly occurs in 50-70 years old, men are slightly more than women, this disease may be related to hepatolithiasis and primary sclerosing cholangitis, most patients with cholangiocarcinoma will have jaundice, usually without Painful, progressive jaundice, abnormal stools, grayish-white stools like kaolin, darkened urine like strong tea, and gallbladder enlargement due to cancer in the middle and lower bile ducts.

决策树是比较经典的机器学习算法,通常是以递归形式选择最优特征,并根据该特征对训练数据进行分割,使得各个子数据集有一个最好的分类的过程。而随机森林(RF)其实就是多棵决策树,纳入的特征对目标变量预测的相对重要性可以通过特征使用的相对顺序(即深度)来进行评估。Decision tree is a relatively classic machine learning algorithm. It usually selects the optimal feature recursively, and divides the training data according to the feature, so that each sub-data set has the best classification process. The random forest (RF) is actually multiple decision trees, and the relative importance of the included features to the prediction of the target variable can be evaluated by the relative order (ie depth) of feature usage.

Gini指数是常用的一种特征重要性评估方法,是节点杂志的一种度量指标,Gini系数的下降幅度越大,该变量对节点拆分中的高纯度的贡献就越大。The Gini index is a commonly used feature importance evaluation method and a measure of node journals. The greater the decline in the Gini coefficient, the greater the contribution of this variable to the high purity in node splitting.

LASSO回归是在线性回归模型的代价函数后面加上L1范数的约束项的模型,可以通过控制参数lambda进行变量筛选和复杂度调整,在本文中通过对不同数量标志物的多种组合进行验证,LASSO regression is a model in which the constraint term of the L1 norm is added after the cost function of the linear regression model. Variable screening and complexity adjustment can be performed by controlling the parameter lambda. In this paper, various combinations of different numbers of markers are verified. ,

以下实施例所述的ROC(Receiver Operating Characteristic)曲线,又称接受者操作特征曲线。The ROC (Receiver Operating Characteristic) curve described in the following examples is also called the receiver operating characteristic curve.

实施例一、胆汁标志物的筛选及其在诊断胆管癌中的应用Example 1. Screening of bile markers and their application in the diagnosis of cholangiocarcinoma

1.病例及样本收集1. Case and sample collection

本研究经兰州大学第一医院人类研究伦理委员会(LDYYLL2022-381)批准,豁免知情同意,并按照赫尔辛基原则宣言进行。临床标本来自两个中心。This study was approved by the Human Research Ethics Committee of the First Hospital of Lanzhou University (LDYYLL2022-381), exempted from informed consent, and conducted in accordance with the Declaration of Helsinki Principles. Clinical specimens were obtained from two centers.

共有514名患者被分为发现集、交叉验证集和外部验证集进行研究。在发现集中,收集了来自兰州大学第一医院的9例胆汁进行蛋白质组学分析。A total of 514 patients were divided into discovery set, cross-validation set and external validation set for study. In the discovery set, nine cases of bile from the First Hospital of Lanzhou University were collected for proteomic analysis.

在发现集中,有患者9例。In the discovery set, there were 9 patients.

在交叉验证集中,2019年1月至2022年3月兰州大学第一医院普外科招募了287例患者,其中胆管癌患者123例,胆道良性狭窄患者164例。其中良性胆道狭窄主要分为三类,一类是胆总管(CBD)结石合并胆管炎,一类是肝内胆管(IBD)结石合并胆管炎,另一类是CBD和IBD结石合并胆管炎。胆管癌患者主要通过病理结果确诊,患者的具体信息如表1所示In the cross-validation set, 287 patients were recruited from January 2019 to March 2022 in the Department of General Surgery of Lanzhou University First Hospital, including 123 patients with cholangiocarcinoma and 164 patients with benign biliary strictures. Among them, benign biliary strictures are mainly divided into three categories, one is common bile duct (CBD) stones with cholangitis, one is intrahepatic bile duct (IBD) stones with cholangitis, and the other is CBD and IBD stones with cholangitis. The diagnosis of patients with cholangiocarcinoma is mainly based on pathological results, and the specific information of the patients is shown in Table 1

在外部验证集中,从2021年1月至2022年5月从中国医学科学院肿瘤医院招募了87名胆管癌患者,从2022年1月至2022年5月从兰州大学第一医院招募了131名良性胆道狭窄患者。In the external validation set, 87 patients with cholangiocarcinoma were recruited from the Cancer Hospital of the Chinese Academy of Medical Sciences from January 2021 to May 2022, and 131 patients with benign cholangiocarcinoma were recruited from the First Hospital of Lanzhou University from January 2022 to May 2022. Patients with biliary strictures.

胆汁主要在ERCP或PTCD或手术期间收集。胆汁和血清样品在获得后立即冰上运输,然后在4℃以3000×g离心15分钟,收集上清液并储存在-80℃直至实验。Bile is mainly collected during ERCP or PTCD or surgery. Bile and serum samples were transported on ice immediately after acquisition, then centrifuged at 3000 × g for 15 min at 4 °C, and the supernatants were collected and stored at −80 °C until experimentation.

表1患者基本信息Table 1 Basic information of patients

Figure BDA0003870385930000071
Figure BDA0003870385930000071

Figure BDA0003870385930000081
Figure BDA0003870385930000081

注:BBS:良性胆道梗阻;CBD:胆总管结石IBD:肝内胆管结石;TBIL:总胆红素;GGT:γ-谷氨酰转移酶Note: BBS: benign biliary obstruction; CBD: common bile duct stones; IBD: intrahepatic bile duct stones; TBIL: total bilirubin; GGT: γ-glutamyl transferase

2.LC-MS/MS2.LC-MS/MS

(1)蛋白样品制备(1) Protein sample preparation

①将保存的胆汁取出,置于冰上操作。对丙酮进行预冷,向胆汁中加入适量的丙酮震荡,使其充分混合,置于-20℃冰箱过夜沉淀。① Take out the preserved bile and place it on ice for operation. The acetone was pre-cooled, and an appropriate amount of acetone was added to the bile for shock, to make it fully mixed, and placed in a -20°C refrigerator overnight for precipitation.

②第二天在4℃、12000g的条件下对上述混合物离心10分钟,用移液枪弃去上清液,注意过程中避免冲散底部沉淀物,保留管底沉淀物。②The next day, centrifuge the above mixture at 4°C and 12000g for 10 minutes, discard the supernatant with a pipette, and pay attention to avoid washing away the sediment at the bottom during the process, and keep the sediment at the bottom of the tube.

③取适量的预冷丙酮加入上述沉淀物,将混合物置于涡旋仪震荡充分混匀,再次于上述同样的离心条件下离心15分钟,弃去上清收集沉淀物。再次重复该步骤一次。③ Take an appropriate amount of pre-cooled acetone and add the above precipitate, place the mixture in a vortex apparatus to shake and mix thoroughly, then centrifuge again under the same centrifugation conditions as above for 15 minutes, discard the supernatant to collect the precipitate. Repeat this step one more time.

④将上述所得到的沉淀物置于通风橱中室温干燥,向其中加入适量的组织裂解液,利用涡旋仪震荡混合,充分裂解蛋白。④Put the precipitate obtained above in a fume hood to dry at room temperature, add an appropriate amount of tissue lysate to it, and use a vortex instrument to oscillate and mix to fully lyse the protein.

⑤将上述混合物于室温下12000g离心15分钟,取上清液并将其置于新的EP管中,并再次离心该上清,保证充分去除不溶性杂质。⑤ Centrifuge the above mixture at 12000g for 15 minutes at room temperature, take the supernatant and place it in a new EP tube, and centrifuge the supernatant again to ensure that insoluble impurities are fully removed.

⑥最后得到的上清液即为胆汁的总蛋白溶液,然后测量每个标本的蛋白浓度并分装保存。⑥ The supernatant obtained at last is the total protein solution of bile, and then the protein concentration of each sample is measured and stored in aliquots.

(2)检测蛋白浓度(2) Detection of protein concentration

①取出胆汁蛋白样品和超滤过的细胞上清并置于冰上操作。① Take out the bile protein sample and the ultrafiltered cell supernatant and place them on ice for operation.

②根据BCA试剂盒说明书配制BCA工作液,按照每个孔200ul进行配制(配制的时候可多配200-300ul,从而保证工作液充足)。② Prepare the BCA working solution according to the instructions of the BCA kit, and prepare according to 200ul per well (200-300ul can be added when preparing to ensure that the working solution is sufficient).

③按照BSA说明书配制适量的0.5mg/ml的蛋白质标准液,然后按照相应的稀释倍数将蛋白标准品和稀释液加入96孔板的标准品孔中,为提高准确度可设置两组复孔。③Prepare an appropriate amount of 0.5mg/ml protein standard solution according to the BSA instructions, and then add the protein standard product and diluent to the standard wells of the 96-well plate according to the corresponding dilution factor. Two sets of duplicate wells can be set up to improve accuracy.

④在每个样品孔中加入2ul待测样品,为减少误差可以每个样品设置2个或3个复孔,并于每个孔中加18ul标准品稀释液,共计20ul。④ Add 2ul of the sample to be tested in each sample well. In order to reduce the error, 2 or 3 replicate wells can be set up for each sample, and 18ul of standard dilution solution is added to each well, for a total of 20ul.

⑤将上述各试剂添加结束后才能加BCA工作液,每孔200ul(加样过程中避免产生气泡),然后将加样完毕的96孔板置于37℃环境中孵育30分钟。⑤ After adding the above reagents, add the BCA working solution, 200ul per well (to avoid air bubbles during the addition process), and then place the 96-well plate after the addition of the samples at 37°C for 30 minutes.

⑥使用酶标仪或紫外分光光度计测各孔A562nm处的吸光度值,利用标准孔的浓度和对应的OD值制作标准曲线,然后利用每个样品孔中的OD值计算蛋白浓度。⑥ Use a microplate reader or an ultraviolet spectrophotometer to measure the absorbance value at A562nm of each well, use the concentration of the standard well and the corresponding OD value to make a standard curve, and then use the OD value in each sample well to calculate the protein concentration.

(3)SDS-PAGE电泳(3) SDS-PAGE electrophoresis

①将样品全程置于冰上操作,根据样品的蛋白浓度和体积加入一定量的3×上样缓冲液,充分混合后置于100℃金属浴中5分钟,使其完全变性。① Put the sample on ice for the whole process, add a certain amount of 3× loading buffer according to the protein concentration and volume of the sample, mix well and place it in a metal bath at 100°C for 5 minutes to completely denature it.

②配制浓度为12%的SDS-PAGE胶,每个样品取10ug进行电泳检测实验,电泳条件设置为浓缩胶70V,分离胶120V,然后在设定好的恒压条件下电泳。② Prepare SDS-PAGE gel with a concentration of 12%, take 10ug of each sample for electrophoresis detection experiment, set the electrophoresis conditions as stacking gel 70V, separation gel 120V, and then electrophoresis under the set constant voltage conditions.

③电泳结束后,打开玻璃板,轻轻取下凝胶并切除上下冗余的凝胶,将剩余的凝胶置于考马斯亮蓝染色液中进行染色,结束后持续水洗直至背景清晰。③ After electrophoresis, open the glass plate, gently remove the gel and cut off the upper and lower redundant gels, place the remaining gel in Coomassie brilliant blue staining solution for staining, and continue washing with water until the background is clear.

④水洗结束后用ImageScanner扫描仪对凝胶进行扫描。④ After washing with water, scan the gel with an ImageScanner scanner.

(4)蛋白还原烷基化及酶解(4) Protein reductive alkylation and enzymatic hydrolysis

①蛋白定量后每个样品各取100ug,每个样品中加入5倍体积的预冷丙酮,置于-20℃冰箱1小时,充分沉淀蛋白。① After protein quantification, take 100ug of each sample, add 5 times the volume of pre-cooled acetone to each sample, and place it in a -20°C refrigerator for 1 hour to fully precipitate the protein.

②将上述混合物于4℃、12000rpm的条件下离心10分钟,用移液枪吸取上清液并弃掉,并对底部沉淀物进行真空冷冻处理,使其干燥。②Centrifuge the above mixture at 4°C and 12,000 rpm for 10 minutes, suck up the supernatant with a pipette gun and discard it, and vacuum freeze the bottom sediment to make it dry.

③为了溶解干燥后的沉淀,向其中加入Dissolution Buffer,待蛋白充分溶解后向其中加入4ul Reducing Reagent,随后将混合物置于60℃反应1小时。③In order to dissolve the dried precipitate, add Dissolution Buffer to it, after the protein is fully dissolved, add 4ul Reducing Reagent to it, and then place the mixture at 60°C for 1 hour.

④向上述反应混合物中加入Cysteine-Blocking Reagent,让其在室温下进行还原烷基化反应,反应结束后将蛋白溶液转存于超滤管中,并在12000rpm条件下离心过滤25分钟。④ Add Cysteine-Blocking Reagent to the above reaction mixture, let it undergo reductive alkylation reaction at room temperature, transfer the protein solution to an ultrafiltration tube after the reaction, and centrifuge at 12000rpm for 25 minutes.

⑤向上述收集到的超滤液中加入100μl Dissolution Buffer,待其充分混合后于12000rpm条件下离心20分钟,重复该步骤3次。⑤ Add 100 μl Dissolution Buffer to the ultrafiltrate collected above, and centrifuge at 12,000 rpm for 20 minutes after it is fully mixed, and repeat this step 3 times.

⑥将测序级胰蛋白酶溶剂加入上述收集到的溶液中充分反应,然后向收集到的肽段中加入Dissolution Buffer,离心并收集管底溶液。⑥ Add sequencing-grade trypsin solvent to the collected solution to fully react, then add Dissolution Buffer to the collected peptide, centrifuge and collect the solution at the bottom of the tube.

(5)蛋白标记(5) Protein labeling

①取出iTRAQ试剂并按照说明书向胆汁样品中加入一定量的异丙醇,待充分溶解后进行下列操作。①Take out the iTRAQ reagent and add a certain amount of isopropanol to the bile sample according to the instructions, and perform the following operations after it is fully dissolved.

②还原烷基化及酶解的蛋白样品各取50μl,加入iTRAQ试剂并充分混匀,于室温下将混合物置于摇床充分反应2小时,然后向反应混合物中加入100ul蒸馏水终止反应。② Take 50 μl of protein samples for reductive alkylation and enzymatic hydrolysis, add iTRAQ reagent and mix thoroughly, place the mixture on a shaker at room temperature to fully react for 2 hours, then add 100ul distilled water to the reaction mixture to terminate the reaction.

③按照上述实验操作混合所有标记的胆汁样品,并涡旋振荡使其充分混合,然后离心至管底。③Mix all labeled bile samples according to the above experimental operation, and vortex to mix thoroughly, and then centrifuge to the bottom of the tube.

④将上述处理好的样品真空冷冻干燥并保存备用。④ Vacuum freeze-dry the above-treated samples and save them for future use.

(6)LC-MS/MS分析(6) LC-MS/MS analysis

①将干燥后的各组蛋白样品溶解于流动相A溶液,利用涡旋震荡仪充分混合。① Dissolve the dried histone samples in the mobile phase A solution and mix thoroughly with a vortex shaker.

②利用Agilent 1200HPLC仪器对上述混合物进行肽段分离实验。②Agilent 1200HPLC instrument was used to conduct peptide separation experiments on the above mixture.

③肽段分离结束后,联合TripleTOF5600系统(AB SCIEX)和纳升喷雾III离子源(AB SCIEX)进行质谱鉴定和分析。③ After the separation of the peptides, combined with TripleTOF5600 system (AB SCIEX) and nanospray III ion source (AB SCIEX) for mass spectrometric identification and analysis.

(7)蛋白定性、定量和识别(7) Protein Qualitative, Quantitative and Identification

得到LC-MS/MS原始数据后,利用MaxQuant(1.6版)对原始文件进行分析,并通过与Swiss-Prot人类蛋白质序列数据库(2019年2月更新,20,413个蛋白质序列)中的比较来识别相应的蛋白质。在蛋白质和肽水平上,蛋白质的错误发现率小于1%(FDR<1%),并至少识别出两个肽进行进一步的数据处理。After obtaining the LC-MS/MS raw data, use MaxQuant (version 1.6) to analyze the raw file, and compare it with the Swiss-Prot human protein sequence database (updated in February 2019, 20,413 protein sequences) Identify the corresponding protein. At the protein and peptide levels, the false discovery rate of proteins was less than 1% (FDR<1%), and at least two peptides were identified for further data processing.

3.蛋白质印迹:3. Western blot:

(1)蛋白样品制备并测量浓度(1) Protein sample preparation and concentration measurement

向细胞培养皿中加入细胞裂解液200ul,室温下充分裂解细胞15分钟后,将其转移至EP管中后放置,全程于冰上操作,然后将裂解完毕的蛋白溶液以12000rpm的速度在4℃条件下离心20分钟,离心后EP管中混合物分为3层,下层为沉淀,中间层为细胞蛋白,上层为脂质,吸取中间层作为细胞蛋白并将其分装保存;利用丙酮法提取胆汁中的蛋白(相同于LC-MS/MS部分);对细胞上清液进行超滤得到细胞上清液蛋白。利用BCA方法对上述三种蛋白溶液进行浓度测量。Add 200ul of cell lysate to the cell culture dish, fully lyse the cells at room temperature for 15 minutes, transfer them to an EP tube and place them on ice for the whole process, then put the lysed protein solution at 4°C at a speed of 12000rpm After centrifugation, the mixture in the EP tube is divided into 3 layers, the lower layer is sediment, the middle layer is cell protein, and the upper layer is lipid. The middle layer is absorbed as cell protein and stored in separate packages; the bile is extracted by acetone method Protein in (same as LC-MS/MS part); cell supernatant protein was obtained by ultrafiltration of cell supernatant. The concentrations of the above three protein solutions were measured using the BCA method.

(2)SDS-PAGE电泳(2) SDS-PAGE electrophoresis

①制胶:擦洗玻璃板,检查玻璃板正反面是否干净(严格保证清洁度),调整好玻璃板正反面,用夹子紧紧夹住玻璃板,然后将夹子紧紧的垂直卡在水平架上。配制适量的12%分离胶,然后用移液枪吸取分离胶进行灌胶,当胶面达到浓缩胶位置时停止灌胶,灌胶全程动作要轻柔,防止产生气泡。然后用200ul移液枪轻轻的在胶上加一层蒸馏水,当液面高出玻璃板顶端时即可。30分钟后分离胶已经凝集,然后弃去上层蒸馏水。按照说明书配制适量的5%浓缩胶,用移液枪灌注浓缩胶直至溢出,然后插入洗干净的梳子,大约20分钟后浓缩胶已经凝集。① Glue making: Scrub the glass plate, check whether the front and back of the glass plate are clean (strictly ensure the cleanliness), adjust the front and back of the glass plate, clamp the glass plate tightly with clips, and then firmly clamp the clips vertically on the horizontal frame . Prepare an appropriate amount of 12% separation gel, and then use a pipette gun to absorb the separation gel for glue filling. When the glue surface reaches the position of the concentrated gel, stop the glue filling. The whole process of glue filling should be done gently to prevent air bubbles. Then use a 200ul pipette gun to gently add a layer of distilled water on the glue, when the liquid level is higher than the top of the glass plate. After 30 minutes, the separating gel had coagulated, and then the upper layer of distilled water was discarded. Prepare an appropriate amount of 5% stacking gel according to the instructions, fill the stacking gel with a pipette until it overflows, and then insert a clean comb, and the stacking gel has agglutinated after about 20 minutes.

②上样:取下玻璃板,洗干净后放入电泳槽,向电泳槽中注入电泳液,首先在内槽中加满,然后再向外槽中加电泳液,电泳液注入完毕后拔出梳子。根据蛋白浓度提前计算出每组样品需要上样的总体积,对样品进行震荡混合后,依次向孔中加入marker和各组样品。②Sample loading: remove the glass plate, wash it, put it into the electrophoresis tank, inject the electrophoretic liquid into the electrophoretic tank, first fill up the inner tank, and then add the electrophoretic liquid to the outer tank, pull out the electrophoretic liquid after injection comb. According to the protein concentration, the total volume of each group of samples to be loaded is calculated in advance, and after the samples are shaken and mixed, the marker and each group of samples are added to the wells in turn.

③电泳:盖好电泳槽盖子,接通电源,于恒压条件下电泳,刚开始将电压设置为80V,当蛋白样品电泳至分离胶时将条件设置为恒压120V。时刻观察溴酚蓝条带位置,如若该条带已经到达玻璃板下缘时即可终止电泳,关闭电源,打开电泳槽盖子准备取下玻璃板。③ Electrophoresis: Cover the electrophoresis tank, turn on the power, and run electrophoresis under constant voltage conditions. At the beginning, set the voltage to 80V. When the protein sample is electrophoresed to the separation gel, set the condition to constant voltage 120V. Always observe the position of the bromophenol blue band. If the band has reached the lower edge of the glass plate, the electrophoresis can be terminated. Turn off the power, open the cover of the electrophoresis tank and prepare to remove the glass plate.

(3)转膜(3) transfer film

①电泳结束后用切割板轻轻撬起小玻璃板,然后切去浓缩胶和分离胶上无用的部分。在转膜夹黑色的一侧制备海绵和滤纸,然后将胶挪放到滤纸上,裁剪适当大小的PVDF膜放于胶上,用玻璃棒轻轻擀去膜上的气泡,保证滤纸和PVDF膜上均无气泡时关上转膜夹。① After the electrophoresis, use a cutting board to gently pry up the small glass plate, and then cut off the useless parts of the stacking gel and separating gel. Prepare a sponge and filter paper on the black side of the transfer clip, then move the glue onto the filter paper, cut a PVDF membrane of appropriate size and place it on the glue, gently roll out the air bubbles on the membrane with a glass rod to ensure that the filter paper and PVDF membrane Close the transfer clamp when there are no air bubbles.

②将转膜夹置于转膜槽中,向转膜槽中倒入配制好的1×转膜液,预估转膜液完全淹没滤纸最高点时停止加液。根据分子量大小将转膜条件设置为恒流200mA 90分钟(不同的分子量对应不同的电流值和转膜时间)。②Put the transfer holder in the transfer tank, pour the prepared 1× transfer solution into the transfer tank, and stop adding liquid when the transfer solution completely submerges the highest point of the filter paper. According to the molecular weight, the membrane transfer conditions were set as a constant current of 200 mA for 90 minutes (different molecular weights correspond to different current values and membrane transfer times).

(4)封闭(4) closed

取下转膜夹,并在PVDF膜上标记正反面,然后将膜正面朝上投放于装有脱脂奶粉的孵育盒中,室温下置于摇床上封闭1小时。Remove the transfer clip, mark the front and back sides on the PVDF membrane, then place the membrane face up in an incubation box filled with skimmed milk powder, and place it on a shaker at room temperature to seal for 1 hour.

(5)免疫反应(5) immune response

用5%的脱脂奶粉将一抗和二抗稀释至适当的浓度,将GAPDH设置为内参。然后在4℃下与兔抗CLU抗体(1:1000,Cell Signaling)或小鼠抗GAPDH(1:2000,Proteintech)一起孵育过夜。第二天将膜取出并用TBST缓冲液在摇床上清洗三遍,每遍5分钟,然后将膜与山羊抗小鼠或抗兔IgG(1:2000,Cell Signaling)二抗室温下共同孵育1小时。The primary and secondary antibodies were diluted to appropriate concentrations with 5% skimmed milk powder, and GAPDH was set as an internal reference. Then incubated overnight at 4°C with rabbit anti-CLU antibody (1:1000, Cell Signaling) or mouse anti-GAPDH (1:2000, Proteintech). The next day, the membrane was removed and washed three times with TBST buffer on a shaker, each time for 5 minutes, and then the membrane was incubated with goat anti-mouse or anti-rabbit IgG (1:2000, Cell Signaling) secondary antibody for 1 hour at room temperature .

(6)化学发光显影(6) Chemiluminescence development

二抗孵育结束后将膜洗干净,除去膜上残留的抗体和脱脂奶粉,然后将膜浸泡于TBST缓冲液中等待显影。配制适量的化学发光液,利用化学发光分析仪采集并分析图像。After the secondary antibody incubation, the membrane was washed to remove residual antibody and skimmed milk powder on the membrane, and then soaked in TBST buffer for development. Prepare an appropriate amount of chemiluminescence liquid, and use a chemiluminescence analyzer to collect and analyze images.

4.免疫组织化学(IHC)染色:4. Immunohistochemical (IHC) staining:

(1)烤片:为防止组织脱片和更好的脱蜡,在操作前需要将胆管癌组织切片放入75℃烤箱中烘烤2小时(烤片时间不能过短)。(1) Baking slices: In order to prevent tissue detachment and better dewaxing, the cholangiocarcinoma tissue slices need to be baked in a 75°C oven for 2 hours before operation (the baking time should not be too short).

(2)石蜡切片脱蜡及水化:烤片结束后立即将载玻片放入二甲苯试剂中脱蜡,每次10分钟,共3次;然后将载玻片放入浓度由高到低排列的乙醇溶液中进行水化,最后用蒸馏水持续冲洗载玻片2分钟。(2) Dewaxing and hydration of paraffin sections: Immediately after the baking, place the slides in xylene reagent for dewaxing, 10 minutes each time, 3 times in total; then put the slides into the concentration from high to low Aligned ethanol solution was hydrated, and finally the slides were continuously rinsed with distilled water for 2 minutes.

(3)高温高压抗原修复:配制柠檬酸盐修复液约1000ml,置于高压锅中并用电磁炉加热至沸腾,然后将装有切片的塑料架垂直放入锅中(防止塑料架跌倒),待高压锅压力阀门开始喷气2分钟后停止加热,10分钟后打开锅盖并用持续流水冲洗高压锅降温(降温过程不宜过快)。(3) High-temperature and high-pressure antigen repair: Prepare about 1000ml of citrate repair solution, place it in a pressure cooker and heat it to boiling with an induction cooker, then put the plastic frame with slices into the pot vertically (to prevent the plastic frame from falling), and wait for the pressure of the pressure cooker After the valve starts to spray air for 2 minutes, stop heating. After 10 minutes, open the lid and rinse the pressure cooker with continuous running water to cool down (the cooling process should not be too fast).

(4)内源性过氧化物酶阻断:将切片从高压锅中取出,用蒸馏水冲洗去除残存的柠檬酸盐修复液,然后将切片置于PBS缓冲液中并于摇床上冲洗5分钟,该步骤重复3次。甩干切片后在组织上方滴加内源性过氧化物酶阻断剂(阻断剂要完全盖住组织),盖上湿盒盖子让其在室温条件下反应10分钟。(4) Endogenous peroxidase blocking: take the slice out of the pressure cooker, rinse with distilled water to remove the remaining citrate repair solution, then place the slice in PBS buffer and wash it on a shaker for 5 minutes. The steps are repeated 3 times. After drying the slices, add an endogenous peroxidase blocking agent dropwise on top of the tissue (the blocking agent should completely cover the tissue), cover the wet box and allow it to react at room temperature for 10 minutes.

(5)血清封闭:阻断结束后将载玻片置于装有PBS缓冲液的清洗罐中,放于摇床之上清洗3次。甩干切片并在组织上方滴加适量的山羊血清,将切片平放于湿盒中,盖上盖子于室温条件下封闭10分钟。(5) Serum blocking: After the blocking, the slides were placed in a cleaning tank filled with PBS buffer, and placed on a shaker to wash 3 times. Dry the slices and drop an appropriate amount of goat serum on top of the tissue, place the slices flat in a wet box, cover the lid and seal at room temperature for 10 minutes.

(6)一抗孵育:轻轻甩去切片上的封闭血清(不需要PBS冲洗),将切片置于湿盒中并滴加一抗(GRP78稀释浓度为1:1000),每张切片约150ul抗体,然后将其置于4℃冰箱中过夜孵育(该过程轻拿轻放,防止抗体从切片边缘流出)。(6) Primary antibody incubation: Gently shake off the blocking serum on the slices (no need to wash with PBS), place the slices in a wet box and drop the primary antibody (GRP78 dilution concentration is 1:1000), about 150ul per slice Antibody, and then place it in a 4°C refrigerator for overnight incubation (this process should be handled gently to prevent the antibody from flowing out from the edge of the section).

(7)二抗孵育:第二天将湿盒取出置于室温下,当湿盒内切片的温度恢复至室温后,甩去一抗,用PBS缓冲液充分冲洗切片。冲洗干净后甩干切片,在每张载玻片的组织上方滴加150ul辣根酶标羊抗小鼠/兔IgG聚合物。(7) Secondary antibody incubation: the next day, take out the wet box and place it at room temperature. When the temperature of the slices in the wet box returns to room temperature, shake off the primary antibody and wash the slices with PBS buffer. After rinsing and drying the sections, add 150ul horseradish enzyme-labeled goat anti-mouse/rabbit IgG polymer dropwise on top of the tissue on each slide.

(8)DAB显色:待二抗在室温下孵育40分钟后,倾倒掉载玻片上残存的二抗,用PBS缓冲液在摇床上充分冲洗切片,甩干切片后在组织上快速滴加足量的DAB显色剂(滴加过程要快,并且做好计时工作),判断组织染色结束后立即将切片放于自来水下冲洗,终止显色。(8) DAB color development: After the secondary antibody was incubated at room temperature for 40 minutes, pour off the residual secondary antibody on the slide, wash the slices with PBS buffer on a shaker, dry the slices, and quickly drop enough DAB chromogenic agent (dropping process should be fast, and do a good job of timing), and after judging the end of tissue staining, rinse the slice under tap water immediately to stop the color development.

(9)苏木素复染:将组织切片放入苏木素试剂中进行苏木素染色(不同的苏木素染色时间不同),然后置于1%盐酸酒精分化并用自来水持续漂洗返蓝。(9) Hematoxylin counterstaining: Put tissue sections into hematoxylin reagent for hematoxylin staining (different hematoxylin staining time is different), then place in 1% hydrochloric acid alcohol to differentiate and rinse with tap water continuously to turn blue.

(10)脱水:将切片放入浓度由低到高排列的乙醇溶液中进行脱水,每个浓度浸泡2分钟。然后将切片置于通风橱中的二甲苯溶液中(该步骤在通风橱中操作)。(10) Dehydration: Dehydrate the slices in ethanol solutions with concentrations ranging from low to high, and soak for 2 minutes at each concentration. Sections were then placed in a xylene solution in a fume hood (this step was performed in a fume hood).

(11)封片:吸取适量的中性树胶对吹风机吹干的切片进行封片(根据切片中组织大小选取不同规格的载玻片,封片时防止产生气泡),结束后置于通风良好的地方。(11) Sealing: absorb an appropriate amount of neutral gum to seal the slices dried by the hair dryer (select slides of different specifications according to the size of the tissue in the slices, and prevent air bubbles when sealing the slices), and place them in a well-ventilated place after finishing. place.

(12)阅片:待切片晾干后进行结果判读。常用的方法主要有显微镜直接观察图像或者数字病理扫描仪扫描切片后观察图像。(12) Image reading: Interpret the results after the slices are dried. Commonly used methods mainly include direct observation of images with a microscope or observation of images after scanning slices with a digital pathology scanner.

(13)结果分析:图像通过Image pro plus 6.0软件进行分析。CLU的表达强度由两位资深病理学家在不了解任何临床和病理数据的情况下独立判断。其染色强度分为4级,0代表阴性表达(阴性),1代表弱表达(weak),2代表中表达(moderate),3代表强表达(strong)。最后,为了方便统计分析数据,我们将阴性、中度和弱表达(0-2分)定义为低表达,强表达(3分)定义为高表达。(13) Result analysis: The images were analyzed by Image pro plus 6.0 software. The expression intensity of CLU was independently judged by two senior pathologists without knowledge of any clinical and pathological data. The staining intensity is divided into 4 grades, 0 represents negative expression (negative), 1 represents weak expression (weak), 2 represents medium expression (moderate), and 3 represents strong expression (strong). Finally, in order to facilitate statistical analysis of data, we defined negative, moderate and weak expression (0-2 points) as low expression, and strong expression (3 points) as high expression.

5.酶联免疫吸附试验-ELISA5. Enzyme-linked immunosorbent assay-ELISA

ELISA试剂盒用于检测胆汁或血清中CLU(E-TSEL-H0014,Elabscience)和CA19-9(E-EL-H0637c,Elabscience)的水平。ELISA实验按照制造商提供的说明进行。检测前需稀释胆汁和血清,测定CLU水平,将胆汁和血清分别稀释100倍和5000倍;为了测量CA19-9的水平,将胆汁和血清分别稀释10000倍和5倍。ELISA kits were used to detect the levels of CLU (E-TSEL-H0014, Elabscience) and CA19-9 (E-EL-H0637c, Elabscience) in bile or serum. ELISA experiments were performed according to the instructions provided by the manufacturer. Bile and serum need to be diluted before testing to measure CLU level, and the bile and serum are diluted 100 times and 5000 times respectively; in order to measure the level of CA19-9, the bile and serum are diluted 10000 times and 5 times respectively.

6.统计分析6. Statistical Analysis

连续变量表示为中位数(四分位间距)或平均值±SD(标准差),并使用Mann-Whitney U检验或学生t检验进行比较。分类变量表示为比率,并通过卡方检验相互比较。ROC曲线用于评估各标志物或者标志物组合的诊断性能,并使用约登指数计算阈值。曲线下面积(AUC)采用梯形法计算,数值越大,诊断性能越好。敏感性、特异性和准确性(ACC)通过标准的2×2列联表计算,它们是评估诊断性能的主要指标。DCA(决策曲线分析)用于比较不同临床诊断模型或标志物的诊断价值。采用t分布随机邻域嵌入(tSNE)算法直观地评估诊断模型的效果。P值小于0.05被认为具有统计学意义。所有分析均使用SPSS Statistics20、GraphPad Prism 7.0版和R 4.1.0版(R Foundation for Statistical Computing;http://www.R-project.org)进行Continuous variables were presented as median (interquartile range) or mean ± SD (standard deviation) and compared using the Mann-Whitney U test or Student's t-test. Categorical variables were expressed as ratios and compared with each other by chi-square test. The ROC curve was used to evaluate the diagnostic performance of each marker or marker combination, and the Youden index was used to calculate the threshold. The area under the curve (AUC) was calculated by the trapezoidal method, and the larger the value, the better the diagnostic performance. Sensitivity, specificity, and accuracy (ACC) were calculated by standard 2 × 2 contingency tables, which are the main indicators for assessing diagnostic performance. DCA (Decision Curve Analysis) was used to compare the diagnostic value of different clinical diagnostic models or markers. The t-distributed stochastic neighborhood embedding (tSNE) algorithm was used to visually evaluate the effectiveness of the diagnostic model. A P value of less than 0.05 was considered statistically significant. All analyzes were performed using SPSS Statistics20, GraphPad Prism version 7.0 and R version 4.1.0 (R Foundation for Statistical Computing; http://www.R-project.org)

7.结果分析7. Result Analysis

(1)胆管癌胆汁和细胞上清液的蛋白质组学(1) Proteomics of bile and cell supernatant of cholangiocarcinoma

如图1-4所示,胆汁和细胞上清液蛋白质组学用于筛选可作为诊断胆管癌的候选生物标志物(图1A)。在胆汁蛋白质组学中,胆管癌与良性胆管狭窄比较,以Fold Change≥5.0或Fold Change≤0.2为标准,共鉴定出1585个蛋白,筛选出差异表达蛋白167个,其中上调蛋白130个,下调蛋白37个(图1A和2B)。通过GO分析和KEGG分析对差异表达蛋白的生物学功能和关键途径进行注释(图2C)。结果表明,这些差异表达的蛋白质主要与肿瘤发生和细胞间相互作用有关,包括趋化因子信号通路、炎症和免疫、内吞作用和溶酶体等。As shown in Figures 1-4, bile and cell supernatant proteomics were used to screen candidate biomarkers for the diagnosis of cholangiocarcinoma (Figure 1A). In bile proteomics, comparing cholangiocarcinoma with benign bile duct strictures, a total of 1585 proteins were identified based on the standard of Fold Change ≥ 5.0 or Fold Change ≤ 0.2, and 167 differentially expressed proteins were screened out, including 130 up-regulated proteins and down-regulated proteins. 37 proteins (Fig. 1A and 2B). The biological functions and key pathways of differentially expressed proteins were annotated by GO analysis and KEGG analysis (Fig. 2C). The results showed that these differentially expressed proteins were mainly related to tumorigenesis and intercellular interactions, including chemokine signaling pathways, inflammation and immunity, endocytosis and lysosomes, etc.

从四种胆管癌细胞系(TFK-1、HuCCT-1、RBE和HCCC-9810)和一种正常人肝内胆管上皮细胞系(HIBEpiC)收集的细胞上清液用于无标记定量分析,共鉴定出932种蛋白质,包括273种上调蛋白和659种下调蛋白(图1A和3D)。GO和KEGG分析表明,这些差异表达的蛋白质与肿瘤进展过程中的信号转导和免疫调节有关,包括ECM-受体相互作用、内吞囊泡和内吞作用(图3E)。与HIBEpiC细胞系相比,胆管癌细胞系中有54种蛋白质升高(图4F)。将胆汁和上清液中的上调蛋白相交时共筛选出五种蛋白(图4G),包括CLU、COL6A1、GOLM1、QSOX1和IGFBP1。考虑到我们的胆汁标本数量有限,引用了Marut Laohaviroj等人研究中的另一个胆汁蛋白质组数据集(External bile 1)[8]。基于倍数≥1.5的标准比较胆管癌和对照组,在External bilis 1中鉴定出63个上调蛋白,但在5个候选蛋白中只有CLU在Externalbile 1中升高(图4H)。最后,选择CLU进行进一步研究。Cell supernatants collected from four cholangiocarcinoma cell lines (TFK-1, HuCCT-1, RBE, and HCCC-9810) and one normal human intrahepatic biliary epithelial cell line (HIBEpiC) were used for label-free quantitative analysis. 932 proteins were identified, including 273 up-regulated proteins and 659 down-regulated proteins (Fig. 1A and 3D). GO and KEGG analyzes revealed that these differentially expressed proteins were associated with signal transduction and immune regulation during tumor progression, including ECM-receptor interactions, endocytic vesicles, and endocytosis (Fig. 3E). Fifty-four proteins were elevated in the cholangiocarcinoma cell line compared with the HIBEpiC cell line (Fig. 4F). A total of five proteins were screened when intersecting upregulated proteins in bile and supernatant (Fig. 4G), including CLU, COL6A1, GOLM1, QSOX1, and IGFBP1. Considering the limited number of our bile samples, another bile proteome dataset (External bile 1) [8] from the study by Marut Laohaviroj et al. was cited. Comparing cholangiocarcinoma and controls based on the criterion of fold ≥ 1.5, 63 upregulated proteins were identified in External bilis 1, but only CLU was elevated in External bilis 1 among the 5 candidate proteins (Fig. 4H). Finally, CLU is selected for further research.

(2)CLU在胆管癌中的高表达(2) High expression of CLU in cholangiocarcinoma

如图5-图8所示,在临床标本和细胞中验证了胆管癌中CLU蛋白和mRNA的水平。收集了16份胆汁样本(8份来自胆管癌,8份来自良性胆管狭窄)用于验证CLU的蛋白质水平。如图5A所示,胆管癌中CLU蛋白水平较高,但在良性胆管狭窄的胆汁中表达很少或没有表达。包含90例胆管癌组织和31例小叶间胆管组织的组织微阵列(TMA)用于免疫组织化学染色。CLU主要位于细胞质中(图5B)。在90例胆管癌组织中,89例为CLU阳性(98.9%)。在阳性染色病例中,弱、中、强表达的病例数分别为4例(4.5%)、36例(40.4%)和49例(55.1%)。31例小叶间胆管组织中,染色阴性12例(38.7%)。免疫组化图像分析显示胆管癌中CLU蛋白明显升高(P<0.001)(图5B)。Kaplan-Meier生存分析表明,具有高CLU水平的胆管癌患者的总生存(OS)时间(p<0.0001)和无复发生存(RFS)时间更短(p<0.001)(图6C和5D)。总之,CLU的高表达可以促进CCA的进展。如图7E、7F和7G所示,CLU蛋白和mRNA在4个胆管癌细胞系中均呈高表达(P<0.05)。我们已经成功地从术后组织中提取了五个原代胆管癌细胞,并且CLU在其中也高表达(图8H、8I和8J)。As shown in Figures 5-8, the levels of CLU protein and mRNA in cholangiocarcinoma were verified in clinical specimens and cells. Sixteen bile samples (8 from cholangiocarcinoma and 8 from benign biliary strictures) were collected for validation of CLU protein levels. As shown in Figure 5A, CLU protein levels were higher in cholangiocarcinoma, but little or no expression was expressed in bile with benign biliary strictures. Tissue microarrays (TMA) containing 90 cholangiocarcinoma tissues and 31 interlobular bile duct tissues were used for immunohistochemical staining. CLU was mainly located in the cytoplasm (Fig. 5B). Among 90 cases of cholangiocarcinoma tissues, 89 cases were positive for CLU (98.9%). Among the positively stained cases, the numbers of cases with weak, medium and strong expression were 4 cases (4.5%), 36 cases (40.4%) and 49 cases (55.1%), respectively. Of the 31 cases of interlobular bile duct tissue, 12 cases (38.7%) were negatively stained. Immunohistochemical image analysis showed that CLU protein was significantly increased in cholangiocarcinoma (P<0.001) (Fig. 5B). Kaplan-Meier survival analysis showed that cholangiocarcinoma patients with high CLU levels had shorter overall survival (OS) time (p<0.0001) and recurrence-free survival (RFS) time (p<0.001) (Fig. 6C and 5D). In conclusion, high expression of CLU can promote the progression of CCA. As shown in Figures 7E, 7F and 7G, CLU protein and mRNA were highly expressed in the four cholangiocarcinoma cell lines (P<0.05). We have successfully isolated five primary cholangiocarcinoma cells from postoperative tissues, and CLU was also highly expressed in them (Fig. 8H, 8I and 8J).

(3)胆汁CLU和血清CA19-9对CCA的诊断价值(3) Diagnostic value of bile CLU and serum CA19-9 for CCA

为了验证胆汁CLU和血清CLU哪个更适合用于胆管癌诊断,收集了胆管癌或良性胆管狭窄患者的胆汁和血液各40例,以及40例健康志愿者的血液进行初步研究,如图9和10所示。如图9B和9C所示,与对照组相比,胆管癌患者血清和胆汁中的CLU均呈高表达。CLU在胆管癌血清中的表达水平特别高,即使在健康人中的平均表达也为102,028.5±36,784.1ng/ml。然而,胆管癌或良性胆管狭窄胆汁中CLU的平均水平分别仅为2,458.1±3,366.0ng/ml和302.5±283.2ng/ml。高丰度的蛋白质由于其敏感性低而不适合作为诊断标志物。因此,胆汁CLU被确定为CCA的候选生物标志物。CA19-9在血清和胆汁中均有表达(图9D)。如图9E和9F所示,胆汁或血清中CA19-9的水平在胆管癌中较高。胆管癌和良性胆管狭窄患者胆汁平均水平分别为688307.0±803859.0IU/ml和293463.6±321862.1IU/ml,血清中分别为270.6±361.8IU/ml和42.5±50.0IU/ml。同样,血清中的CA19-9更适合作为胆管癌的诊断生物标志物。然后将287名患者纳入交叉验证集进行进一步研究(图9G)。如图9H和10J所示,胆汁CLU在胆管癌中高表达,ROC分析结果表现出了良好的的诊断能力,AUC为0.857(敏感性为73.98%,特异性为93.29%)。血清CA19-9在胆管癌中高表达,其AUC值为0.809(敏感性为84.55%,特异性为67.68%)(图9I和10J)。由于胆汁CLU特异性高、敏感性低,而CA19-9正好相反,我们考虑建立一个包含胆汁CLU和血清CA19-9的模型以获得更好的准确性。如图10J所示,CLU&CA19-9模型的诊断价值显著增加,AUC为0.917,远高于它的两个成员。其敏感性和特异性分别提高到88.6%和82.9%,表明诊断性能更好。tSNE算法可用于简化复杂的混淆矩阵,可以帮助我们可视化疾病的分布。如图10K所示,胆管癌组和对照组利用tSNE形成了不同的簇,说明CLU&CA19-9可以很好的鉴别胆管癌。采用决策曲线分析(DCA)观察CLU、CA19-9和CLU&CA19-9的临床表现,结果表明CLU&CA19-9模型比单独使用CLU或CA19-9模型增加了更多的临床综合效益预测CCA(图10L)。In order to verify which of bile CLU and serum CLU is more suitable for the diagnosis of cholangiocarcinoma, 40 cases of bile and blood of patients with cholangiocarcinoma or benign biliary stricture, and blood of 40 healthy volunteers were collected for preliminary research, as shown in Figures 9 and 10 shown. As shown in Figures 9B and 9C, CLU was highly expressed in serum and bile of patients with cholangiocarcinoma compared with controls. The expression level of CLU in serum of cholangiocarcinoma is particularly high, even in healthy people the average expression is 102,028.5±36,784.1ng/ml. However, the mean levels of CLU in bile from cholangiocarcinoma or benign biliary stricture were only 2,458.1±3,366.0 ng/ml and 302.5±283.2 ng/ml, respectively. Highly abundant proteins are not suitable as diagnostic markers due to their low sensitivity. Therefore, bile CLU was identified as a candidate biomarker for CCA. CA19-9 was expressed in both serum and bile (Fig. 9D). As shown in Figures 9E and 9F, the level of CA19-9 in bile or serum was higher in cholangiocarcinoma. The average levels of bile in patients with cholangiocarcinoma and benign biliary stricture were 688307.0±803859.0IU/ml and 293463.6±321862.1IU/ml respectively, and the serum levels were 270.6±361.8IU/ml and 42.5±50.0IU/ml respectively. Similarly, CA19-9 in serum is more suitable as a diagnostic biomarker for cholangiocarcinoma. 287 patients were then included in the cross-validation set for further study (Fig. 9G). As shown in Figures 9H and 10J, bile CLU is highly expressed in cholangiocarcinoma, and the results of ROC analysis showed a good diagnostic ability, with an AUC of 0.857 (sensitivity 73.98%, specificity 93.29%). Serum CA19-9 was highly expressed in cholangiocarcinoma, with an AUC value of 0.809 (84.55% sensitivity, 67.68% specificity) (Fig. 9I and 10J). Since bile CLU has high specificity and low sensitivity, while CA19-9 is just the opposite, we considered building a model including bile CLU and serum CA19-9 for better accuracy. As shown in Figure 10J, the diagnostic value of the CLU&CA19-9 model increased significantly, with an AUC of 0.917, much higher than its two members. Its sensitivity and specificity increased to 88.6% and 82.9%, respectively, indicating better diagnostic performance. The tSNE algorithm can be used to simplify complex confusion matrices, which can help us visualize the distribution of diseases. As shown in Figure 10K, the cholangiocarcinoma group and the control group formed different clusters using tSNE, indicating that CLU & CA19-9 can well differentiate cholangiocarcinoma. Decision curve analysis (DCA) was used to observe the clinical performance of CLU, CA19-9, and CLU&CA19-9, and the results showed that the CLU&CA19-9 model added more clinical combined benefits than CLU or CA19-9 models alone to predict CCA (Fig. 10L) .

实施例二、通过机器学习筛选生物标志物组合Example 2. Screening biomarker combinations by machine learning

1.筛选方法1. Screening method

包含以下步骤:Contains the following steps:

(1)胆汁标志物的筛选:采用液相色谱质谱联用技术对胆汁和细胞上清液进行蛋白质组学分析,分析鉴定胆管癌和对照组之间的差异表达蛋白;对胆管癌的胆汁和细胞上清液中异常高表达的蛋白取交集,得到胆汁标志物;具体的筛选步骤与实施例一相同。(1) Screening of bile markers: Proteomic analysis of bile and cell supernatant was carried out by liquid chromatography-mass spectrometry, and the differentially expressed proteins between cholangiocarcinoma and control group were analyzed and identified; The abnormally highly expressed proteins in the cell supernatant were intersected to obtain bile markers; the specific screening steps were the same as in Example 1.

(2)对步骤(1)筛选得到的胆汁标志物与血清指标混合,使用随机森林方法建立分类预测模型,并将每个标志物按照交叉验证集中预测结果的重要性进行排序,利用R语言的glment软件包,基于10倍交叉验证分类法,将287例患者的所有指标数据分为10组不重叠的部分,其中2组用于测试队列,8组用于训练队列;共种植了2000棵决策树;所述的血清指标包括37个血液生化指标、24个常规血液指标和两个肿瘤生物标志物。(2) Mix the bile markers and serum indicators screened in step (1), use the random forest method to establish a classification prediction model, and rank each marker according to the importance of the prediction results in the cross-validation set, and use the R language The glment software package, based on the 10-fold cross-validation classification method, divides all indicator data of 287 patients into 10 non-overlapping parts, of which 2 groups are used for the test cohort and 8 groups are used for the training cohort; a total of 2000 decision trees were planted tree; the serum indicators include 37 blood biochemical indicators, 24 routine blood indicators and two tumor biomarkers.

(3)根据基尼指数≥0.25筛选出12个标志物,将所述的12种标志物纳入Lasso分类器训练集的初始输入变量,只有对分类有贡献的变量被赋予非零权重,当增加标志物的数量,Lasso分类器中准确度、灵敏度和特异度不再上升时,Lasso分类器的性能在测试集上达到最佳的准确度、灵敏度和特异度;所述的随机森林和LASSO在glment版本4.1-3中进行。(3) According to the Gini index ≥ 0.25, 12 markers are screened out, and the 12 markers are included in the initial input variables of the Lasso classifier training set. Only the variables that contribute to the classification are given non-zero weights. When adding markers The number of objects, when the accuracy, sensitivity and specificity of the Lasso classifier no longer increase, the performance of the Lasso classifier reaches the best accuracy, sensitivity and specificity on the test set; the random forest and LASSO described in glment Made in version 4.1-3.

(4)引用受试者工作特征ROC曲线来评价Lasso模型的最佳诊断性能,以ROC曲线上准确度、灵敏度和特异度最佳为截断点,得到相对最优特征数及组合方式;(4) The receiver operating characteristic ROC curve was used to evaluate the best diagnostic performance of the Lasso model, and the best accuracy, sensitivity and specificity on the ROC curve were used as the cut-off point to obtain the relatively optimal number of features and their combination;

(5)在外部验证集中利用ROC曲线验证步骤(4)得到的相对最优特征数及组合方式,得到适合于胆管癌诊断的标志物组合。(5) Using the relatively optimal number of features and combination methods obtained in the ROC curve verification step (4) in the external verification set to obtain a marker combination suitable for the diagnosis of cholangiocarcinoma.

2.结果分析2. Result analysis

表2标志物和多指标组合的诊断价值Table 2 Diagnostic value of markers and multi-index combinations

Figure BDA0003870385930000151
Figure BDA0003870385930000151

Figure BDA0003870385930000161
Figure BDA0003870385930000161

通过将生物标志物与不同类型的循环生物标志物相结合,可以提高其诊断性能。用于机器学习的每位患者的数据包含胆汁CLU和常见的63个血液指标,包括37个血液生化指标、24个常规血液指标和两个肿瘤生物标志物。在交叉验证集中,使用随机森林(RF)模型对上述特征进行分类。如图11A所示,根据其准确性(左)和基尼指数(右)进行筛选,前30个标志物被陈列出来。为了选择最合适的特征,选择了12个基尼指数≥0.25的标志物进行进一步研究,包括CLU、DBIL、TBIL、CA19-9、IBIL、LDLC、GGT、ALP、TG、AST、CL和ALT。并且它们的AUC值大约等于或高于0.7(表2)。By combining biomarkers with different types of circulating biomarkers, their diagnostic performance can be improved. The data of each patient used for machine learning contains bile CLU and common 63 blood indicators, including 37 blood biochemical indicators, 24 routine blood indicators and two tumor biomarkers. In the cross-validation set, the above features are classified using a random forest (RF) model. As shown in Figure 11A, the top 30 markers were displayed according to their accuracy (left) and Gini index (right). To select the most suitable features, 12 markers with Gini index ≥0.25 were selected for further study, including CLU, DBIL, TBIL, CA19-9, IBIL, LDLC, GGT, ALP, TG, AST, CL, and ALT. And their AUC values were about equal to or higher than 0.7 (Table 2).

选择最合适的标志物数量对于建立最终分类模型至关重要。基于上述12个选定的标志物,应用LASSO方法进行筛选。当组合包含不同数量的标志物时,LASSO筛选出了最佳组合。ROC分析显示,five-panel的AUC值最高(0.958)。Six-ten panel的AUC值相同,均为0.954,但ACC值最高(图12B和表2)。如图12C所示,Six-ten panel的灵敏度和特异性均比较理想(灵敏度:90.2%,特异性:89.0%)。综上所述,Six-panel模型在复杂性和准确性之间表现出良好的平衡,因此被确定为诊断胆管癌的最佳模型。Selecting the most appropriate number of markers is crucial for building the final classification model. Based on the above 12 selected markers, the LASSO method was used for screening. LASSO screened out the best combinations when the combinations contained different numbers of markers. ROC analysis showed that five-panel had the highest AUC value (0.958). The AUC value of the Six-ten panel is the same, both are 0.954, but the ACC value is the highest (Figure 12B and Table 2). As shown in FIG. 12C , the sensitivity and specificity of the Six-ten panel are relatively ideal (sensitivity: 90.2%, specificity: 89.0%). In summary, the Six-panel model exhibited a good balance between complexity and accuracy, and thus was identified as the best model for diagnosing cholangiocarcinoma.

Six-panel模型的AUC值显著高于它的6个成员(图13D)。并且六种生物标志物之间几乎没有相关性(r<0.5),表明它们可以形成一个很好的诊断模型(图13E)。Six-panel的tSNE结果显示,胆管癌组和对照组形成了不同的聚类(图13F),表明即使在可视化条件下,Six-panel也能很好地区分胆管癌。DCA结果表明,Six-panel在鉴别胆管癌和良性胆道狭窄方面提高了更多的临床整体益处(图13G)。为了进一步评估Six-panel的稳定性和可靠性,我们将其应用于独立的外部验证集。在外部验证集中,Six-panel表现出良好的的预测能力,AUC为0.926,灵敏度为86.2%,特异性为85.3%,明显高于单独的CLU(AUC为0.840)。tSNE和DCA 分析也显示出很好的诊断能力。The AUC value of the Six-panel model was significantly higher than that of its six members (Fig. 13D). And there was almost no correlation (r<0.5) among the six biomarkers, indicating that they could form a good diagnostic model (Fig. 13E). The tSNE results of Six-panel showed that the cholangiocarcinoma group and the control group formed different clusters (Fig. 13F), indicating that Six-panel could well distinguish cholangiocarcinoma even under the visualization condition. The results of DCA showed that Six-panel improved more overall clinical benefits in distinguishing cholangiocarcinoma from benign biliary strictures (Fig. 13G). To further evaluate the stability and reliability of Six-panel, we apply it to an independent external validation set. In the external validation set, Six-panel showed good predictive ability, with AUC of 0.926, sensitivity of 86.2%, and specificity of 85.3%, significantly higher than that of CLU alone (AUC of 0.840). tSNE and DCA analyzes also showed good diagnostic capabilities.

综上所述,本发明采用蛋白质组学技术以及人工智能数据分析技术得到适合于胆管癌诊断的诊断标志物组合,所述的诊断标志物包括胆汁和血清中的标志物,实验结果显示,Six-panel表现出良好的的预测能力,AUC为0.926,灵敏度为86.2%,特异性为85.3%,明显高于单独的CLU(AUC为0.840)。本发明诊断标志物筛选方法可操作性强,模型构建方法简单,所得诊断标志物灵敏度高,特异性好,适合于胆管癌的诊断。本发明将胆汁和血清标志物结合,进一步增加了诊断的可信度,能够很好地替代现有影像学诊断模式,并且本发明诊断简单快速,有利于胆管癌的早诊早治,具有很好的临床使用和推广价值。In summary, the present invention uses proteomics technology and artificial intelligence data analysis technology to obtain a combination of diagnostic markers suitable for the diagnosis of cholangiocarcinoma. The diagnostic markers include markers in bile and serum. The experimental results show that Six -panel showed good predictive ability, with AUC of 0.926, sensitivity of 86.2%, and specificity of 85.3%, significantly higher than that of CLU alone (AUC of 0.840). The diagnostic marker screening method of the invention has strong operability, simple model construction method, high sensitivity and good specificity of the obtained diagnostic marker, and is suitable for the diagnosis of cholangiocarcinoma. The present invention combines bile and serum markers, further increases the reliability of diagnosis, can well replace the existing imaging diagnosis mode, and the present invention is simple and fast in diagnosis, which is beneficial to the early diagnosis and early treatment of cholangiocarcinoma, and has great advantages. Good clinical use and promotion value.

Claims (10)

1. A diagnostic marker for cholangiocarcinoma, which comprises the following combination of 3 substances: clusterin (CLU), indirect Bilirubin (IBIL), low Density Lipoprotein Cholesterol (LDLC).
2. The diagnostic marker of claim 1, wherein said diagnostic marker comprises a combination of 4 of: clusterin (CLU), indirect Bilirubin (IBIL), low Density Lipoprotein Cholesterol (LDLC), gamma-glutamyl transferase (GGT).
3. The diagnostic marker of claim 2, wherein said diagnostic marker comprises a combination of5 of: clusterin (CLU), indirect Bilirubin (IBIL), low Density Lipoprotein Cholesterol (LDLC), gamma-glutamyltransferase (GGT), and carbohydrate antigen 19-9 (CA 19-9).
4. The diagnostic marker of claim 3, wherein said diagnostic marker comprises a combination of 6 of: clusterin (CLU), indirect Bilirubin (IBIL), low Density Lipoprotein Cholesterol (LDLC), gamma-glutamyl transferase (GGT), carbohydrate antigen 19-9 (CA 19-9), triglycerides (TG).
5. A screening method for the biliary duct cancer diagnostic marker according to any one of claims 1 to 4, comprising the steps of:
(1) Screening of bile markers: performing proteomics analysis on the bile and cell supernatant by adopting a liquid chromatography-mass spectrometry technology, and analyzing and identifying differentially expressed proteins between the bile duct cancer and a control group; taking intersection of bile of the bile duct cancer and protein with abnormally high expression in cell supernatant to obtain a bile marker;
(2) Mixing the bile markers obtained by screening in the step (1) with serum indexes, establishing a classification prediction model by using a random forest method, sequencing each marker according to the importance of a cross validation centralized prediction result, and dividing all index data of 287 patients into 10 groups of non-overlapping parts by using a comment software package of an R language and based on a 10-fold cross validation classification method, wherein 2 groups are used for testing queues, and 8 groups are used for training queues;
(3) Screening 12 markers according to the Gini index of more than or equal to 0.25, bringing the 12 markers into an initial input variable of a Lasso classifier training set, and only giving non-zero weight to the variables contributing to classification; when the number of the markers is increased and the accuracy, the sensitivity and the specificity in the Lasso classifier are not increased any more, the performance of the Lasso classifier achieves the optimal accuracy, sensitivity and specificity on a test set;
(4) The optimal diagnostic performance of the Lasso model is evaluated by referring to an ROC curve of the working characteristics of a testee, and the optimal accuracy, sensitivity and specificity on the ROC curve are taken as cut-off points to obtain the relatively optimal characteristic number and the combination mode;
(5) And (3) obtaining a marker combination suitable for diagnosing the bile duct cancer by using the relatively optimal characteristic number and combination mode obtained in the ROC curve verification step (4) in an external verification set.
6. The screening method of claim 5, wherein said random forest and LASSO are performed in comment version 4.1-3.
7. The screening method according to claim 5, wherein 2000 decision trees are co-planted in step (2), and the serum markers comprise 37 blood biochemical markers, 24 conventional blood markers and two tumor biomarkers.
8. Use of a diagnostic marker as defined in any one of claims 1 to 4 in the manufacture of a product for the diagnosis of biliary duct cancer.
9. The use of claim 8, wherein the diagnostic product comprises a kit, reagent or chip.
10. A kit for early diagnosis of cholangiocarcinoma, comprising the diagnostic marker according to any one of claims 1 to 4.
CN202211197099.XA 2022-09-28 2022-09-28 A diagnostic marker for cholangiocarcinoma and its screening method and application Pending CN115575635A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211197099.XA CN115575635A (en) 2022-09-28 2022-09-28 A diagnostic marker for cholangiocarcinoma and its screening method and application

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211197099.XA CN115575635A (en) 2022-09-28 2022-09-28 A diagnostic marker for cholangiocarcinoma and its screening method and application

Publications (1)

Publication Number Publication Date
CN115575635A true CN115575635A (en) 2023-01-06

Family

ID=84582599

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211197099.XA Pending CN115575635A (en) 2022-09-28 2022-09-28 A diagnostic marker for cholangiocarcinoma and its screening method and application

Country Status (1)

Country Link
CN (1) CN115575635A (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090222282A1 (en) * 2005-09-15 2009-09-03 Trustees Of Tufts College Method for Personalized Diet Design
CN102216268A (en) * 2008-09-19 2011-10-12 卡罗生物股份公司 Novel estrogen receptor ligands
US20120116685A1 (en) * 2009-04-03 2012-05-10 The Johns Hopkins University Methods, System, And Medium For Associating Rheumatoid Arthritis Subjects With Cardiovascular Disease
CN104093830A (en) * 2011-04-15 2014-10-08 吉恩勒克斯公司 Clonal strains of attenuated vaccinia viruses and methods of use thereof
CN110709936A (en) * 2017-04-04 2020-01-17 肺癌蛋白质组学有限责任公司 Plasma-based protein profiling for early-stage lung cancer prognosis
CN112881547A (en) * 2021-01-12 2021-06-01 中国科学院大学宁波华美医院 Screening method of early liver cancer diagnosis marker for liver cirrhosis and hepatitis population
CN114264828A (en) * 2022-01-28 2022-04-01 中国科学院基础医学与肿瘤研究所(筹) Biomarker for identifying benign thyroid nodule and thyroid cancer and application thereof

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090222282A1 (en) * 2005-09-15 2009-09-03 Trustees Of Tufts College Method for Personalized Diet Design
CN102216268A (en) * 2008-09-19 2011-10-12 卡罗生物股份公司 Novel estrogen receptor ligands
US20120116685A1 (en) * 2009-04-03 2012-05-10 The Johns Hopkins University Methods, System, And Medium For Associating Rheumatoid Arthritis Subjects With Cardiovascular Disease
CN104093830A (en) * 2011-04-15 2014-10-08 吉恩勒克斯公司 Clonal strains of attenuated vaccinia viruses and methods of use thereof
CN110709936A (en) * 2017-04-04 2020-01-17 肺癌蛋白质组学有限责任公司 Plasma-based protein profiling for early-stage lung cancer prognosis
CN112881547A (en) * 2021-01-12 2021-06-01 中国科学院大学宁波华美医院 Screening method of early liver cancer diagnosis marker for liver cirrhosis and hepatitis population
CN114264828A (en) * 2022-01-28 2022-04-01 中国科学院基础医学与肿瘤研究所(筹) Biomarker for identifying benign thyroid nodule and thyroid cancer and application thereof

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
GABRIELLA ANDREOTTI 等: "Serum Lipid Levels and the Risk of Biliary Tract Cancers and Biliary Stones: A Population-based Study in China" *
JARINYA KHOONTAWAD 等: "Discovering proteins for chemoprevention and chemotherapy by curcumin in liver fluke infection-induced bile duct cancer" *
JOY CUENCO 等: "Identification of a serum biomarker panel for the differential diagnosis of cholangiocarcinoma and primary sclerosing cholangitis" *
MIKEL RUIZ DE GAUNA 等: "Cholangiocarcinoma progression depends on the uptake and metabolization of extracellular lipids" *
SUMERA RIZVI 等: "Pathogenesis, Diagnosis, and Management of Cholangiocarcinoma" *
刘秋艳: "血清CA19-9和CEA及IBIL对胆管癌患者的诊断效果研究" *

Similar Documents

Publication Publication Date Title
CN110579611B (en) A combined detection serum marker, kit and detection method for early screening and diagnosis of lung cancer
CN110286235B (en) Combined detection serum marker, kit and detection method for early screening and diagnosis of liver cancer
CN104422777B (en) Application of ANO1 protein in prognosis of esophageal cancer and prediction of precancerous lesion risk
CN110283909A (en) The application of ZBTB20 albumen or its specific antibody in cardia cancer detection kit
CN109975549A (en) Purposes of the tumour source IgG in diagnosis of pancreatic cancer or prognosis
CN113777311A (en) ELISA kit for auxiliary diagnosis of esophageal squamous cell carcinoma
WO2020200323A1 (en) Very early cell marker of gastric cancer and early cell marker of gastric precancerous lesion and application thereof in diagnostic kit
US20240418724A1 (en) Molecular marker for determining very-early-stage onset risk of gastric cancer and evaluating progression risk of pre-cancerous lesion of gastric cancer, and use thereof in diagnostic kit
CN115575635A (en) A diagnostic marker for cholangiocarcinoma and its screening method and application
CN113238052B (en) Application of MG7-Ag, hTERT and TFF2 expression analysis in intestinal epithelial metaplasia risk stratification and gastric cancer early warning
CN115078726A (en) A biomarker and detection kit for ovarian cancer diagnosis
CN107015002A (en) A kind of Human colorectal carcinoma protein markers CLCA1 kits and its application
CN112229997A (en) Prognostic diagnosis marker Claudin23 for ovarian cancer and application thereof
CN112229998A (en) Prognostic diagnosis marker Claudin22 for ovarian cancer and application thereof
CN111323604A (en) A group of prognostic markers for cardiac adenocarcinoma and their applications
CN106399464A (en) Human colorectal carcinoma molecular marker COL3A1 and application thereof
CN105785004A (en) Application of cell cycle division associated protein 2 to diagnosis or prognosis of pancreatic cancer
CN110095605B (en) Lung cancer screening kit
US20170168059A1 (en) Biomarkers for assessing cancer patients for treatment
CN112462066B (en) Application of FAM3C protein and its specific antibody in preparation of detection kit for nasopharyngeal carcinoma, kit and application thereof
CN110095606B (en) Lung cancer screening kit
EP2850209B1 (en) Methods to predict progression of berret&#39;s esophagus to high grade dysplasia or esophageal adenocarcinoma
CN111208304B (en) Use of tumor-derived IgG in diagnosis and prognosis of parathyroid cancer
CN119001101A (en) Use of anti-GSDMD autoantibodies in the preparation of products for diagnosing lung adenocarcinoma
CN113721021A (en) Application of PRKCZ autoantibody in auxiliary diagnosis of esophageal squamous cell carcinoma

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination