[go: up one dir, main page]

WO2023082820A1 - Marker for lung adenocarcinoma diagnosis and application thereof - Google Patents

Marker for lung adenocarcinoma diagnosis and application thereof Download PDF

Info

Publication number
WO2023082820A1
WO2023082820A1 PCT/CN2022/118708 CN2022118708W WO2023082820A1 WO 2023082820 A1 WO2023082820 A1 WO 2023082820A1 CN 2022118708 W CN2022118708 W CN 2022118708W WO 2023082820 A1 WO2023082820 A1 WO 2023082820A1
Authority
WO
WIPO (PCT)
Prior art keywords
lung adenocarcinoma
phase
metabolites
diagnosing
metabolic
Prior art date
Application number
PCT/CN2022/118708
Other languages
French (fr)
Chinese (zh)
Inventor
娄加陶
王琳
郭巧梅
乔理华
Original Assignee
上海市第一人民医院
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 上海市第一人民医院 filed Critical 上海市第一人民医院
Publication of WO2023082820A1 publication Critical patent/WO2023082820A1/en

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N30/00Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
    • G01N30/02Column chromatography
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N27/00Investigating or analysing materials by the use of electric, electrochemical, or magnetic means
    • G01N27/62Investigating or analysing materials by the use of electric, electrochemical, or magnetic means by investigating the ionisation of gases, e.g. aerosols; by investigating electric discharges, e.g. emission of cathode
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N30/00Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
    • G01N30/02Column chromatography
    • G01N30/04Preparation or injection of sample to be analysed
    • G01N30/06Preparation
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N30/00Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
    • G01N30/02Column chromatography
    • G01N30/26Conditioning of the fluid carrier; Flow patterns
    • G01N30/28Control of physical parameters of the fluid carrier
    • G01N30/34Control of physical parameters of the fluid carrier of fluid composition, e.g. gradient
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N30/00Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
    • G01N30/02Column chromatography
    • G01N30/62Detectors specially adapted therefor
    • G01N30/72Mass spectrometers
    • G01N30/7233Mass spectrometers interfaced to liquid or supercritical fluid chromatograph
    • G01N30/724Nebulising, aerosol formation or ionisation
    • G01N30/7266Nebulising, aerosol formation or ionisation by electric field, e.g. electrospray
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/68Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
    • G01N33/6803General methods of protein analysis not limited to specific proteins or families of proteins
    • G01N33/6848Methods of protein analysis involving mass spectrometry

Definitions

  • the invention relates to the technical field of detection and diagnosis, in particular to a marker for lung adenocarcinoma diagnosis and its screening method and application.
  • lung cancer is mainly divided into two types: non-small cell lung cancer (NSCLC) and small cell lung cancer (SCLC).
  • NSCLC non-small cell lung cancer
  • SCLC small cell lung cancer
  • the proportion of non-small cell lung cancer is as high as 85% to 90%.
  • NSCLC further includes lung adenocarcinoma, lung squamous cell carcinoma, and large cell carcinoma. Compared with small cell carcinoma, the growth and division of cancer cells are slower, and the spread and metastasis are relatively late.
  • the most common subtype of lung cancer is lung adenocarcinoma.
  • Clinically, non-small cell lung cancer is often diagnosed at an advanced stage. More than half of NSCLC patients die within 1 year after diagnosis, and the 5-year survival rate is less than 20%. However, the 5-year survival rate of patients with early lung cancer can be as high as 90%. Therefore, early diagnosis of lung cancer is an important method for lung cancer patients to obtain a good prognosis and reduce mortality.
  • the means of clinical diagnosis of lung cancer mainly rely on ultrasound imaging and lung puncture.
  • the sensitivity of ultrasound diagnosis is low, and lung puncture is harmful to the lungs of patients, which is risky and difficult to promote.
  • many patients are not diagnosed until the decompensated stage of lung cancer.
  • gene molecules can be used as markers for the diagnosis of lung cancer, but the sensitivity and specificity of single gene diagnosis need to be improved urgently.
  • LDCT low-dose computed tomography
  • Metabolomics is an emerging discipline after genomics and proteomics, and is an important part of systems biology. Metabolomics has developed and rapidly penetrated into many fields, and its purpose is to study the overall metabolic differences in biological systems by monitoring the levels of small molecule metabolites in biological fluids or tissues, and to find the relative relationship between metabolites and pathophysiological changes. The occurrence of tumors is bound to be accompanied by metabolic changes, but in the early stages, the changes of small molecule metabolites are very weak and not easy to be found (Pei-Hsuan, C., Ling, C., Kenneth, H. et al. Metabolic diversity in human non-small cell lung cancer cells. Molecular Cell. 2019, 76, 1-14.
  • Lung cancer diagnostic biomarkers such as Mathe, E.A., Patterson, A.D., Haznadar, M. et al.
  • Noninvasive urinary metabolomic profiling identifies diagnostic and prognostic markers in lung cancer. Cancer Res. 2014, 74:3259-3270. William, R.W.
  • the metabolic marker obtained through screening has great clinical application value, and is especially suitable for the prediction and diagnosis of early lung cancer.
  • the present invention adopts following technical scheme:
  • the present invention provides a marker for diagnosing or monitoring lung adenocarcinoma, wherein the metabolic marker is at least selected from D-galactose, homocitrulline, N6-acetyl-L-lysine, 4-hydroxy-L - at least one of proline, hexadecanedioic acid, guanine, glutamic acid, creatine, alanine, kynuric acid.
  • the metabolic marker is at least selected from D-galactose, homocitrulline, N6-acetyl-L-lysine, 4-hydroxy-L - at least one of proline, hexadecanedioic acid, guanine, glutamic acid, creatine, alanine, kynuric acid.
  • the present invention provides a marker for diagnosing or monitoring lung adenocarcinoma
  • the combination of metabolic markers is at least selected from D-galactose, homocitrulline, N6-acetyl-L-lysine, 4-hydroxy- At least one of L-proline, hexadecanedioic acid, and guanine.
  • the marker combination is also selected from at least one of glutamic acid, creatine, alanine, and kynuric acid.
  • the AUC value of the area under the ROC curve of a single metabolic marker in the present invention is 0.7-0.9.
  • the performance of multiple metabolite groups is significantly better than that of a single metabolite, and the area under the ROC curve AUC value is 0.86-0.99, which can effectively diagnose patients with early lung adenocarcinoma.
  • the present invention also provides a reagent product or kit, including the above-mentioned standard product of metabolic markers for early diagnosis or monitoring of lung adenocarcinoma.
  • the reagent product or kit also includes solvents and/or internal standards for extracting and enriching the metabolic markers.
  • the present invention also provides a method for screening metabolic markers for diagnosing or monitoring lung adenocarcinoma, comprising the following steps:
  • the samples of the lung adenocarcinoma group and the serum samples of the healthy group were collected separately, and the TNM staging of the patients in the lung adenocarcinoma group included stage I, stage II, stage III, and stage IV;
  • the screening criteria with a P value less than 0.05 were used to obtain candidate differential metabolites; the candidate differential
  • the 10 metabolic markers screened by the present invention can effectively diagnose lung adenocarcinoma patients.
  • the invention can realize the diagnosis of lung adenocarcinoma only by taking blood for detection, without additional collection of tissue samples, can well replace the existing tissue biopsy and imaging diagnosis modes, and reduce trauma and radiation risks.
  • Fig. 1 is an S-plot diagram of the metabolite OPLS-DA provided in Example 1 of the present invention.
  • the "local standard substance database” in the present invention refers to the mass spectrometric detection of a large number of related metabolite molecular standards in the present invention, and the collection of mass spectrometry information of these metabolites, thereby forming a localized standard substance database.
  • This embodiment provides a method for screening metabolic markers for lung adenocarcinoma diagnosis, comprising the following steps:
  • peripheral vein samples from 242 patients with lung adenocarcinoma (lung adenocarcinoma group, including 172 patients with early lung adenocarcinoma) and 150 healthy people (healthy group) were collected from Shanghai Chest Hospital. blood serum.
  • lung adenocarcinoma group including 172 patients with early lung adenocarcinoma
  • healthy people healthy group
  • the diagnostic standard for patients with lung adenocarcinoma is confirmed by postoperative pathology.
  • TNM staging patients with stage I and II lung adenocarcinoma were defined as patients with early-stage lung adenocarcinoma.
  • step S1 Take out the sample collected in step S1 from the -80°C refrigerator, and thaw it on ice until there are no ice cubes in the sample (subsequent operations are required to be carried out on ice);
  • Into a numbered centrifuge tube add 300 ⁇ L pure methanol internal standard extraction solution (containing 100 ppm L-phenylalanine internal standard); vortex for 5 min, let stand for 24 h, and then centrifuge at 12000 r/min, 4 °C for 10 min ; Take 270 ⁇ L of the supernatant and concentrate for 24 hours; then add 100 ⁇ L of acetonitrile and water in a complex solution with a volume ratio of 1:1 for LC-MS/MS analysis. 20 ⁇ L of each sample was mixed to form a quality control sample (QC), which was collected every 15 samples.
  • QC quality control sample
  • Chromatographic column Waters ACQUITY UPLC HSS T3 C18 1.8 ⁇ m, 2.1mm*100mm; column temperature is 40°C; injection volume is 2 ⁇ L.
  • Phase A is an aqueous solution containing 0.1% acetic acid
  • phase B is an acetonitrile solution containing 0.1% acetic acid.
  • the elution gradient program is: 0min, the volume ratio of phase A to B is 95:5; 11.0min, the volume ratio of phase A to B is 10:90; 12.0min, the volume ratio of phase A to B is 10 :90; 12.1min, the volume ratio of A phase and B phase is 95:5; 14.0min, the volume ratio of A phase and B phase is 95:5V/V.
  • Flow rate 0.4mL/min.
  • electrospray ionization temperature 500°C
  • mass spectrometer voltage 5500V (positive) or -4500V (negative)
  • ion source gas I GS I
  • gas II GS II
  • gas curtain Gas curtain gas, CUR
  • collision-induced ionization collision-activated dissociation, CAD
  • each ion pair is scanned in MRM mode according to the optimized declustering potential (DP) and collision energy (collision energy, CE).
  • DP declustering potential
  • CE collision energy
  • the samples were analyzed and detected according to the determined liquid chromatography conditions and mass spectrometry conditions: 20% of the samples in the lung adenocarcinoma group and the healthy group were randomly selected, and enhanced ion scanning mass spectrometry (MIM-EPI) and time-of-flight mass spectrometry (TOF) were used.
  • MIM-EPI enhanced ion scanning mass spectrometry
  • TOF time-of-flight mass spectrometry
  • the local standard database was integrated to construct the lung adenocarcinoma serum metabolite database.
  • the collected serum samples were analyzed to obtain the original mass spectrometry data of each serum sample.
  • the metabolites of the samples were qualitatively and quantitatively analyzed by mass spectrometry. Metabolites of different molecular weights can be separated by liquid chromatography. The characteristic ions of each substance are screened out by the multiple reaction monitoring mode (MRM) of the triple quadrupole, and the signal intensity (CPS) of the characteristic ions is obtained in the detector.
  • MRM multiple reaction monitoring mode
  • CPS signal intensity
  • Use MultiQuant software to open the mass spectrum file of the sample off-machine, preprocess and correct the original mass spectrum data according to the mass-to-charge ratio and retention time, and perform the integration and calibration of the chromatographic peaks.
  • the peak area (Area) of each chromatographic peak represents the corresponding substance.
  • the repeatability of metabolite extraction and detection can be judged by overlaying and displaying the total ion chromatograms (TIC charts) of different quality control QC samples for mass spectrometry detection and analysis, that is, technical repetition.
  • the high stability of the instrument provides an important guarantee for the repeatability and reliability of the data.
  • the CV value is the coefficient of variation (Coefficient of Variation), which is the ratio of the standard deviation of the original data to the mean of the original data, which can reflect the degree of dispersion of the data.
  • the Empirical Cumulative Distribution Function (ECDF) can be used to analyze the frequency of the CV of substances less than the reference value.
  • the proportion of substances with 0.5 is higher than 85%, indicating that the experimental data is relatively stable; the proportion of substances with QC sample CV value less than 0.3 is higher than 75%, indicating that the experimental data is very stable.
  • the change of the CV value of the internal standard L-phenylalanine during the detection process was monitored, and the change of the CV value of the internal standard was less than 20%, indicating that the instrument was stable during the detection process.
  • the peak area integration data were imported into SIMCA software (Version 14.1, Sweden) for multivariate statistical analysis.
  • OPLS-DA orthogonal-partial least squares discriminant
  • Ten differential metabolites screened by binary logistic regression forward stepwise method can diagnose and distinguish lung adenocarcinoma patients from healthy people: D-galactose, homocitrulline, N6-acetyl-L-lysine, 4-hydroxy -L-proline, hexadecanedioic acid, guanine, glutamic acid, creatine, alanine, kynuric acid. See Table 3 to Table 5 for specific information on metabolites:
  • ROC receiver operating characteristic curve
  • Table 7 The AUC value of any differential metabolite combined for the diagnosis of lung adenocarcinoma
  • Example metabolic marker combinations are as follows:
  • a further preferred combination of metabolic markers is: D-galactose, homocitrulline, and N6-acetyl-L-lysine to construct a model for diagnosing lung adenocarcinoma.
  • the AUC value of these three metabolites combined for the diagnosis of lung adenocarcinoma reached 0.925. Under the optimal cutoff value, the sensitivity and specificity were 90.8% and 90.3%, respectively.
  • a further preferred combination of metabolic markers is: D-galactose, homocitrulline, N6-acetyl-L-lysine, 4-hydroxy-L-proline, hexadecanedioic acid, and guanine to construct a diagnostic lung gland cancer model.
  • the AUC value of these 6 metabolites combined to diagnose benign and malignant pulmonary nodules reached 0.956. Under the optimal cutoff value, the sensitivity and specificity were 94.1% and 93.3%, respectively.
  • Table 8 The AUC value of a single metabolite for the diagnosis of early lung adenocarcinoma
  • Example metabolic marker combinations are as follows:
  • a further preferred combination of metabolic markers is: D-galactose, homocitrulline, and N6-acetyl-L-lysine to construct a model for diagnosing early lung adenocarcinoma.
  • the AUC value of these three metabolites combined to diagnose early lung adenocarcinoma reached 0.925. Under the optimal cutoff value, the sensitivity and specificity were 90.8% and 90.3%, respectively.
  • a further preferred combination of metabolic markers is: D-galactose, homocitrulline, N6-acetyl-L-lysine, 4-hydroxy-L-proline, hexadecanedioic acid, guanine, constructing an early diagnosis Lung adenocarcinoma model.
  • the combined AUC value of these 6 metabolites in the diagnosis of early lung adenocarcinoma reached 0.956. Under the optimal cutoff value, the sensitivity and specificity were 94.1% and 93.3%, respectively.
  • Example 2 Validation of diagnostic markers for lung adenocarcinoma
  • the subjects of this study included 266 serum samples of patients with lung adenocarcinoma from 2 independent medical centers, including 118 patients with early lung adenocarcinoma; and 149 serum samples of healthy people, which were from the same source as the feature screening samples (150 cases) .
  • the diagnostic standard of lung adenocarcinoma is lung adenocarcinoma diagnosed by postoperative pathology; healthy people are healthy people without lung diseases after physical examination. All lung adenocarcinoma patients and healthy samples had no history of any other malignant tumors, no other major systemic diseases, and no chronic medical history of long-term medication.
  • patients with stage I and II lung adenocarcinoma were defined as patients with early-stage lung adenocarcinoma.
  • the detection and data analysis methods of this example are the same as those of Example 1, and the differential metabolites detected and analyzed are the following 10 kinds: D-galactose, homocitrulline, N6-acetyl-L-lysine, 4-hydroxy - L-proline, hexadecanedioic acid, guanine, glutamic acid, creatine, alanine, kynuric acid for the diagnosis of lung adenocarcinoma.
  • These 10 differential metabolites are individually used to diagnose and distinguish between patients with lung adenocarcinoma and healthy people.
  • the ability of early lung adenocarcinoma patients and healthy people is relatively strong, and the area under the ROC curve (AUC) is greater than 0.7, which has clinical diagnostic significance;
  • AUC area under the ROC curve
  • the AUC is further improved, and the AUC value of 10 joint diagnosis of lung adenocarcinoma reaches 0.955.
  • the sensitivity and specificity are 91.5% and 93.6%, respectively; the diagnosis of early lung adenocarcinoma
  • the AUC of cancer was 0.983, and the sensitivity and specificity were 93.2% and 96.0% at the best cutoff value. See Table 11 to Table 14 for the AUC of a single and any combination of 2 to 9 metabolites used for diagnosis:
  • Table 12 The AUC value of any differential metabolite combined for the diagnosis of lung adenocarcinoma
  • D-galactose D-galactose, homocitrulline, N6-acetyl-L-lysine, 4-hydroxy-L-proline, hexadecanedioic acid, and guanine were used to construct a model for diagnosing lung adenocarcinoma.
  • the AUC value of these 6 metabolites combined to diagnose benign and malignant pulmonary nodules reached 0.931. Under the optimal cutoff value, the sensitivity and specificity were 90.2% and 91.4%, respectively.
  • Table 13 The AUC value of a single metabolite for the diagnosis of early lung adenocarcinoma
  • Table 14 The AUC value of any differential metabolite combined for the diagnosis of early lung adenocarcinoma
  • Metabolic markers D-galactose, homocitrulline, and N6-acetyl-L-lysine were further optimized to construct a model for diagnosing early lung adenocarcinoma.
  • the AUC value of these three metabolites combined to diagnose early lung adenocarcinoma reached 0.916. Under the optimal cutoff value, the sensitivity and specificity were 89.3% and 90.1%, respectively.
  • This embodiment provides a detection kit, comprising:
  • 50% acetonitrile in water can be used as a solvent to dissolve the standards.

Landscapes

  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Immunology (AREA)
  • Analytical Chemistry (AREA)
  • Biochemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Pathology (AREA)
  • Engineering & Computer Science (AREA)
  • Molecular Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biomedical Technology (AREA)
  • Urology & Nephrology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Hematology (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Biophysics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Electrochemistry (AREA)
  • Biotechnology (AREA)
  • Cell Biology (AREA)
  • Dispersion Chemistry (AREA)
  • Microbiology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Food Science & Technology (AREA)
  • Medicinal Chemistry (AREA)
  • Other Investigation Or Analysis Of Materials By Electrical Means (AREA)

Abstract

The present invention provides a metabolic marker for diagnosing or monitoring lung adenocarcinoma, a screening method therefor and an application thereof. The metabolic marker system is selected from one or more combinations of D-galactose, homocitrulline, N6-acetyl-L-lysine, 4-hydroxy-L-proline, hexadecanedioic acid, guanine, glutamic acid, creatine, alanine and kynuric acid. The metabolic marker provided by the present invention can accurately diagnose patients with lung adenocarcinoma, can distinguish patients with early lung adenocarcinoma from healthy persons, is high in sensitivity and strong in specificity, can well replace the existing tissue biopsy and imaging diagnosis modes, and reduces trauma and radiation risks.

Description

用于肺腺癌诊断的标志物及其应用Markers for the diagnosis of lung adenocarcinoma and its application 技术领域technical field
本发明涉及检测诊断技术领域,具体涉及一种用于肺腺癌诊断的标志物及其筛选方法和应用。The invention relates to the technical field of detection and diagnosis, in particular to a marker for lung adenocarcinoma diagnosis and its screening method and application.
背景技术Background technique
根据组织病理学,肺癌主要分为非小细胞肺癌(NSCLC)和小细胞肺癌(SCLC)两类,非小细胞肺癌占比高达85%~90%。NSCLC进一步包括肺腺癌、肺鳞癌和大细胞癌,与小细胞癌相比其癌细胞生长分裂较慢,扩散转移相对较晚,而其中最常见的肺癌亚型是肺腺癌。临床上,非小细胞肺癌在诊断时常常已进入为晚期。超过半数的非小细胞肺癌患者在确诊后1年内死亡,5年生存率不到20%。但是,早期肺癌患者5年生存率可高达90%以上。因此,对肺癌的早期诊断是肺癌患者获得良好预后以及减少死亡率的重要方法。According to histopathology, lung cancer is mainly divided into two types: non-small cell lung cancer (NSCLC) and small cell lung cancer (SCLC). The proportion of non-small cell lung cancer is as high as 85% to 90%. NSCLC further includes lung adenocarcinoma, lung squamous cell carcinoma, and large cell carcinoma. Compared with small cell carcinoma, the growth and division of cancer cells are slower, and the spread and metastasis are relatively late. Among them, the most common subtype of lung cancer is lung adenocarcinoma. Clinically, non-small cell lung cancer is often diagnosed at an advanced stage. More than half of NSCLC patients die within 1 year after diagnosis, and the 5-year survival rate is less than 20%. However, the 5-year survival rate of patients with early lung cancer can be as high as 90%. Therefore, early diagnosis of lung cancer is an important method for lung cancer patients to obtain a good prognosis and reduce mortality.
临床上确诊肺癌的手段主要依靠超声影像和肺穿刺。其中,超声诊断的灵敏度较低,而肺穿刺对患者的肺部有损伤,存在风险,不易推广,导致很多患者直到肺癌失代偿期才被确诊。有研究发现:基因分子可以作为肺癌诊断的标志物,但是单个基因诊断的敏感度与特异性亟待提高。The means of clinical diagnosis of lung cancer mainly rely on ultrasound imaging and lung puncture. Among them, the sensitivity of ultrasound diagnosis is low, and lung puncture is harmful to the lungs of patients, which is risky and difficult to promote. As a result, many patients are not diagnosed until the decompensated stage of lung cancer. Studies have found that gene molecules can be used as markers for the diagnosis of lung cancer, but the sensitivity and specificity of single gene diagnosis need to be improved urgently.
另外,美国国家肺部筛查试验(NLST)的数据表明,利用低剂量计算机断层扫描(LDCT)对高危人群进行早期肺癌筛查,可将肺癌死亡率降低20%,总死亡率降低7%(Bethesda,et al.Reduced lung-cancer mortality with low-dose computed tomographic screening.N Engl J Med.2011;365:395-409.)。但是,LDCT存在辐射暴露和假阳性率高等问题,影响基于LDCT筛查在全球范围内的实用性。In addition, data from the National Lung Screening Trial (NLST) in the United States showed that early lung cancer screening in high-risk groups using low-dose computed tomography (LDCT) can reduce lung cancer mortality by 20% and overall mortality by 7% ( Bethesda, et al. Reduced lung-cancer mortality with low-dose computed tomographic screening. N Engl J Med. 2011; 365:395-409.). However, LDCT has problems such as radiation exposure and high false positive rate, which affect the practicality of LDCT-based screening on a global scale.
代谢组学是继基因组学和蛋白质组学之后的一门新兴学科,是系统生物学的重要组成部分。代谢组学已经发展并迅速渗透到许多领域,其目的是通 过监测生物液或组织中小分子代谢物的水平来研究生物系统中的整体代谢差异,并寻找代谢物与病理生理变化的相对关系。肿瘤的发生必然伴随有代谢的改变,但是在早期阶段,小分子代谢物的变化非常微弱,不容易被发现(Pei-Hsuan,C.,Ling,C.,Kenneth,H.et al.Metabolic diversity in human non-small cell lung cancer cells.Molecular Cell.2019,76,1-14.Brandon,F.,Ashley,S.,Ralph,J.D.Metabolic reprogramming and cancer progression.Science.2020,April 10;368.)。大量研究表明,肿瘤的发生和发展与能量代谢密切相关,比如瓦博格效应(Warburg effect)、三羧酸循环(TCA)、糖酵解途径等,为满足癌细胞增殖提供能量需求(Vander Heiden,MG.,Cantley,LC.,Thompson,CB.Understanding the Warburg effect:the metabolic requirements of cell proliferation.Science 2009;324(5930):1029-33.)。因此,基于血液或尿液的生物标记物或多重标记物组合可以补充LDCT筛查的缺陷,可能能够在实施肺癌筛查方面作出重大贡献。Metabolomics is an emerging discipline after genomics and proteomics, and is an important part of systems biology. Metabolomics has developed and rapidly penetrated into many fields, and its purpose is to study the overall metabolic differences in biological systems by monitoring the levels of small molecule metabolites in biological fluids or tissues, and to find the relative relationship between metabolites and pathophysiological changes. The occurrence of tumors is bound to be accompanied by metabolic changes, but in the early stages, the changes of small molecule metabolites are very weak and not easy to be found (Pei-Hsuan, C., Ling, C., Kenneth, H. et al. Metabolic diversity in human non-small cell lung cancer cells. Molecular Cell. 2019, 76, 1-14. Brandon, F., Ashley, S., Ralph, J.D. Metabolic reprogramming and cancer progression. Science. 2020, April 10; 368.) . A large number of studies have shown that the occurrence and development of tumors are closely related to energy metabolism, such as the Warburg effect (Warburg effect), tricarboxylic acid cycle (TCA), glycolysis pathway, etc., which provide energy requirements for cancer cell proliferation (Vander Heiden , MG., Cantley, LC., Thompson, CB. Understanding the Warburg effect: the metabolic requirements of cell proliferation. Science 2009; 324(5930): 1029-33.). Therefore, blood- or urine-based biomarkers or multiple marker combinations can complement the deficiencies of LDCT screening and may be able to make a significant contribution in the implementation of lung cancer screening.
在过去十年中,也有一些科学家尝试在肺癌筛查、诊断、预后等领域应用代谢组学,研究发现了一些在肺癌发生和发展过程中发生改变的代谢物和代谢通路,获得了一些可靠的肺癌诊断生物标志物,例如Mathe,E.A.,Patterson,A.D.,Haznadar,M.et al.Noninvasive urinary metabolomic profiling identifies diagnostic and prognostic markers in lung cancer.Cancer Res.2014,74:3259-3270.William,R.W.,Samir,H.,Brian,D.et al.Diacetylspermine is a novel prediagnostic serum biomarker for non-small-cell lung cancer and has additive performance with pro-surfactant protein B.J Clin Oncol.2015,Nov 20;33(33):3880-6.Agnieszka,K.,Szymon,P.,Mariusz,K.et al.Serum lipidome screening in patients with stage I non-small cell lung cancer.Clin Exp Med.2019;19(4):505-513.)。但这些研究基本上都是利用公共数据库的代谢物信息,大多数研究样本量不大,并且样本来源单一,筛选得到的代谢物针对早期肺癌筛查特异性不强,在实际临床应用中价值不大。In the past ten years, some scientists have also tried to apply metabolomics in the fields of lung cancer screening, diagnosis, and prognosis. Studies have discovered some metabolites and metabolic pathways that change during the occurrence and development of lung cancer, and obtained some reliable data. Lung cancer diagnostic biomarkers, such as Mathe, E.A., Patterson, A.D., Haznadar, M. et al. Noninvasive urinary metabolomic profiling identifies diagnostic and prognostic markers in lung cancer. Cancer Res. 2014, 74:3259-3270. William, R.W. , Samir, H., Brian, D.et Al.DiaceTylSpermine is a Novel Prediagnostic Serum Biomarker for Non-Small-Cell LUNG CANGER and Has AdditIVANCE PRO-SURFActan T propein b.j clin oncol.2015, nov 20; 33 (33): 3880-6.Agnieszka,K.,Szymon,P.,Mariusz,K.et al.Serum lipidome screening in patients with stage I non-small cell lung cancer.Clin Exp Med.2019;19(4):505-513 .). However, these studies basically use metabolite information in public databases. Most of the studies have small sample sizes and a single source of samples. The metabolites obtained from the screening are not specific for early lung cancer screening and have little value in actual clinical applications. big.
发明的公开disclosure of invention
基于此,有必要提供一种用于肺腺癌诊断的标志物及其筛选方法和应用,筛选得到的代谢标志物临床应用价值大,尤其能够适用于早期肺癌预测诊断。Based on this, it is necessary to provide a marker for the diagnosis of lung adenocarcinoma and its screening method and application. The metabolic marker obtained through screening has great clinical application value, and is especially suitable for the prediction and diagnosis of early lung cancer.
本发明采用如下技术方案:The present invention adopts following technical scheme:
本发明提供一种用于诊断或监测肺腺癌的标志物,所述代谢标志物至少选自D-半乳糖、高瓜氨酸、N6-乙酰-L-赖氨酸、4-羟基-L-脯氨酸、十六烷二酸、鸟嘌呤、谷氨酸、肌酸、丙氨酸、犬尿酸中的至少一种。The present invention provides a marker for diagnosing or monitoring lung adenocarcinoma, wherein the metabolic marker is at least selected from D-galactose, homocitrulline, N6-acetyl-L-lysine, 4-hydroxy-L - at least one of proline, hexadecanedioic acid, guanine, glutamic acid, creatine, alanine, kynuric acid.
本发明提供一种用于诊断或监测肺腺癌的标志物,所述代谢标志物组合至少选自D-半乳糖、高瓜氨酸、N6-乙酰-L-赖氨酸、4-羟基-L-脯氨酸、十六烷二酸、鸟嘌呤中的至少一种。进一步地,所述标志物组合还选自谷氨酸、肌酸、丙氨酸、犬尿酸中的至少一种。The present invention provides a marker for diagnosing or monitoring lung adenocarcinoma, the combination of metabolic markers is at least selected from D-galactose, homocitrulline, N6-acetyl-L-lysine, 4-hydroxy- At least one of L-proline, hexadecanedioic acid, and guanine. Further, the marker combination is also selected from at least one of glutamic acid, creatine, alanine, and kynuric acid.
在ROC曲线评价方法中,本发明中单个代谢标志物在ROC曲线下的面积AUC值为0.7~0.9。多个代谢物组的性能明显优于单个代谢物,ROC曲线下的面积AUC值为0.86~0.99,能够对早期肺腺癌患者进行有效诊断。In the ROC curve evaluation method, the AUC value of the area under the ROC curve of a single metabolic marker in the present invention is 0.7-0.9. The performance of multiple metabolite groups is significantly better than that of a single metabolite, and the area under the ROC curve AUC value is 0.86-0.99, which can effectively diagnose patients with early lung adenocarcinoma.
上述所述用于诊断或监测肺腺癌的代谢标志物在制备诊断或监测肺腺癌的代谢物数据库、试剂产品或者试剂盒中的应用。Application of the above-mentioned metabolic markers for diagnosing or monitoring lung adenocarcinoma in the preparation of a metabolite database, reagent product or kit for diagnosing or monitoring lung adenocarcinoma.
本发明还提供一种试剂产品或者试剂盒,包括上述所述的用于早期诊断或监测肺腺癌的代谢标志物的标准品。The present invention also provides a reagent product or kit, including the above-mentioned standard product of metabolic markers for early diagnosis or monitoring of lung adenocarcinoma.
进一步地,所述试剂产品或者试剂盒还包括提取富集所述代谢标志物的溶剂和/或内标物。Further, the reagent product or kit also includes solvents and/or internal standards for extracting and enriching the metabolic markers.
本发明还提供一种用于诊断或监测肺腺癌的代谢标志物的筛选方法,包括如下步骤:The present invention also provides a method for screening metabolic markers for diagnosing or monitoring lung adenocarcinoma, comprising the following steps:
分别采集肺腺癌组样本和健康组血清样本,肺腺癌组样本的患者的TNM分期包括为I期、II期、III期、IV期;The samples of the lung adenocarcinoma group and the serum samples of the healthy group were collected separately, and the TNM staging of the patients in the lung adenocarcinoma group included stage I, stage II, stage III, and stage IV;
随机挑选肺腺癌组和健康组各20%的样本,采用增强离子扫描质谱和飞行时间质谱结合多反应监测采集模式的代谢组学方法,以及整合本地标准品数据库进行肺腺癌血清代谢物数据库构建;采用构建肺腺癌血清代谢物数据库和LC-MS检测对采集的血清样本进行分析,得到各血清样本的原始质谱数据;使用MultiQuant软件,根据质荷比、保留时间对原始质谱数据进行预处理和校正;根据质谱峰强度计算峰面积得到代谢物相对含量信息;将代谢物相对含量信息进行多元统计正交-偏最小二乘法判别分析,并根据变量权重值大于1及单变量统计分析的P值小于0.05的筛选标准,得到候选差异代谢物;将候选差异代谢物进行二元逻辑回归建模,筛选优异代谢物及其组合对 肺腺癌患者进行诊断,对筛选的优异代谢物及其组合进行受试者工作特征曲线分析,确定用于诊断或监测肺腺癌的代谢标志物。Randomly select 20% of the samples from the lung adenocarcinoma group and the healthy group, adopt the metabolomics method of enhanced ion scanning mass spectrometry and time-of-flight mass spectrometry combined with multiple reaction monitoring acquisition mode, and integrate the local standard database for lung adenocarcinoma serum metabolite database Construction; using the construction of lung adenocarcinoma serum metabolite database and LC-MS detection to analyze the collected serum samples to obtain the original mass spectrometry data of each serum sample; use MultiQuant software to predict the original mass spectrometry data according to the mass-to-charge ratio and retention time Processing and correction; calculate the peak area according to the mass spectrum peak intensity to obtain the relative content information of metabolites; conduct multivariate statistical orthogonal-partial least squares discriminant analysis on the relative content information of metabolites, and according to the variable weight value greater than 1 and univariate statistical analysis The screening criteria with a P value less than 0.05 were used to obtain candidate differential metabolites; the candidate differential metabolites were subjected to binary logistic regression modeling, and excellent metabolites and their combinations were screened for diagnosis of lung adenocarcinoma patients. Receiver operating characteristic curve analysis was performed in combination to identify metabolic markers for diagnosis or monitoring of lung adenocarcinoma.
与现有技术相比,本发明筛选的10种代谢标志物能够对肺腺癌患者进行有效诊断。本发明仅通过取血检测就能实现肺腺癌诊断,无需额外采集组织样本,能够很好地替代现有的组织活检及影像学诊断模式,减少创伤和辐射风险。Compared with the prior art, the 10 metabolic markers screened by the present invention can effectively diagnose lung adenocarcinoma patients. The invention can realize the diagnosis of lung adenocarcinoma only by taking blood for detection, without additional collection of tissue samples, can well replace the existing tissue biopsy and imaging diagnosis modes, and reduce trauma and radiation risks.
附图的简要说明Brief description of the drawings
图1为本发明实施例1所提供的代谢物OPLS-DA的S-plot图。Fig. 1 is an S-plot diagram of the metabolite OPLS-DA provided in Example 1 of the present invention.
实现本发明的最佳方式BEST MODE FOR CARRYING OUT THE INVENTION
下面结合具体实施例对本发明作进一步的详细说明,以使本领域的技术人员更加清楚地理解本发明。The present invention will be further described in detail below in conjunction with specific embodiments, so that those skilled in the art can understand the present invention more clearly.
以下各实施例,仅用于说明本发明,但不止用来限制本发明的范围。基于本发明中的具体实施例,本领域普通技术人员在没有做出创造性劳动的情况下,所获得的其他所有实施例,都属于本发明的保护范围。The following examples are only used to illustrate the present invention, but not to limit the scope of the present invention. Based on the specific embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.
在本发明实施例中,若无特殊说明,所有原料组分均为本领域技术人员熟知的市售产品;在本发明实施例中,若未具体指明,所用的技术手段均为本领域技术人员所熟知的常规手段。关键仪器信息分别见下表1:In the embodiments of the present invention, unless otherwise specified, all raw material components are commercially available products well known to those skilled in the art; in the embodiments of the present invention, if not specifically specified, the technical means used are all well-known conventional means. The key instrument information is shown in Table 1 below:
表1实验仪器信息Table 1 Experimental instrument information
名称name 型号model 品牌brand
LC-MS/MSLC-MS/MS QTRAP 6500+QTRAP 6500+ SCIEXSCIEX
离心机centrifuge 5424R5424R EppendorfEppendorf
离心浓缩仪Centrifugal concentrator CentriVapCentriVap LABCONCOLABCONCO
涡旋混合器Vortex mixer VORTEX-5VORTEX-5 Kyllin-Be11Kyllin-Be11
术语说明Glossary
本发明所述“本地标准品数据库”是指本发明将大量相关代谢物分子标准品进行质谱检测后,收集这些代谢物的质谱信息,由此形成一个本地化的标 准品数据库。The "local standard substance database" in the present invention refers to the mass spectrometric detection of a large number of related metabolite molecular standards in the present invention, and the collection of mass spectrometry information of these metabolites, thereby forming a localized standard substance database.
实施例1Example 1
本实施例提供一种肺腺癌诊断代谢标志物的筛选方法,包括如下步骤:This embodiment provides a method for screening metabolic markers for lung adenocarcinoma diagnosis, comprising the following steps:
S1,采集样品S1, collecting samples
本研究在取得患者同意后,收集上海市胸科医院的242例肺腺癌患者(肺腺癌组,其中包括172例早期肺腺癌患者)和150例健康人(健康组)样本的外周静脉血血清。其中,包括肺腺癌患者的诊断标准是经术后病理确诊。根据TNM分期,将I期和II期肺腺癌患者定义为早期肺腺癌患者。所有肺腺癌患者和非肺腺癌组样本均无其它恶性肿瘤病史,无其他全身性重大疾病,无长期用药的慢性病史。采血时间均为清晨空腹状态。所有血清样本离心后置于-80℃冰箱内保存,研究时分别取出血清样品解冻后进行后续分析。In this study, after obtaining the consent of the patients, peripheral vein samples from 242 patients with lung adenocarcinoma (lung adenocarcinoma group, including 172 patients with early lung adenocarcinoma) and 150 healthy people (healthy group) were collected from Shanghai Chest Hospital. blood serum. Among them, the diagnostic standard for patients with lung adenocarcinoma is confirmed by postoperative pathology. According to TNM staging, patients with stage I and II lung adenocarcinoma were defined as patients with early-stage lung adenocarcinoma. All patients with lung adenocarcinoma and samples from the non-lung adenocarcinoma group had no history of other malignant tumors, no other major systemic diseases, and no chronic medical history of long-term medication. The time of blood collection was in the morning on an empty stomach. All serum samples were centrifuged and stored in a -80°C refrigerator. During the study, serum samples were taken out and thawed for subsequent analysis.
S2,血清广泛靶向代谢组学分析S2, Extensive targeted metabolomics analysis of serum
(1)样品预处理(1) Sample pretreatment
从-80℃冰箱中取出步骤S1采集的样品,于冰上解冻至样本中没有冰块(后续操作都要求在冰上进行);样本解冻后,涡旋10s混匀,取样本50μL加入到对应编号的离心管中;加入300μL纯甲醇内标提取液(含100ppm浓度的L-苯基丙氨酸内标);涡旋5min,静置24h,再于12000r/min、4℃条件下离心10min;吸取上清液270μL浓缩24h;再加入100μL由乙腈和水按照体积比1:1组成的复溶液中,用于LC-MS/MS分析。每个样本各取20μL混合成质控样本(QC),每间隔15个样本采集一次。Take out the sample collected in step S1 from the -80°C refrigerator, and thaw it on ice until there are no ice cubes in the sample (subsequent operations are required to be carried out on ice); Into a numbered centrifuge tube; add 300 μL pure methanol internal standard extraction solution (containing 100 ppm L-phenylalanine internal standard); vortex for 5 min, let stand for 24 h, and then centrifuge at 12000 r/min, 4 °C for 10 min ; Take 270 μL of the supernatant and concentrate for 24 hours; then add 100 μL of acetonitrile and water in a complex solution with a volume ratio of 1:1 for LC-MS/MS analysis. 20 μL of each sample was mixed to form a quality control sample (QC), which was collected every 15 samples.
(2)样品代谢物检测分析(2) Sample metabolite detection and analysis
表2实验试剂Table 2 Experimental Reagents
化合物compound CAS编号CAS number 品牌brand
甲醇Methanol 67-56-167-56-1 MerckMerck
乙腈Acetonitrile 75-05-875-05-8 MerckMerck
乙酸Acetic acid 64-19-764-19-7 AladdinAladdin
L-苯基丙氨酸L-Phenylalanine 63-91-263-91-2 isoreagisoreag
确定液相色谱条件如下:Determine the liquid chromatography conditions as follows:
色谱柱:Waters ACQUITY UPLC HSS T3 C18 1.8μm,2.1mm*100mm;柱温为40℃;进样量为2μL。Chromatographic column: Waters ACQUITY UPLC HSS T3 C18 1.8μm, 2.1mm*100mm; column temperature is 40°C; injection volume is 2μL.
流动相:A相为含0.1%乙酸水溶液,B相为含0.1%乙酸的乙腈溶液。洗脱梯度程序为:0min,A相与B相的体积比为95:5;11.0min,A相与B相的体积比为10:90;12.0min,A相与B相的体积比为10:90;12.1min,A相与B相的体积比为95:5;14.0min,A相与B相的体积比为95:5V/V。流速0.4mL/min。Mobile phase: Phase A is an aqueous solution containing 0.1% acetic acid, and phase B is an acetonitrile solution containing 0.1% acetic acid. The elution gradient program is: 0min, the volume ratio of phase A to B is 95:5; 11.0min, the volume ratio of phase A to B is 10:90; 12.0min, the volume ratio of phase A to B is 10 :90; 12.1min, the volume ratio of A phase and B phase is 95:5; 14.0min, the volume ratio of A phase and B phase is 95:5V/V. Flow rate 0.4mL/min.
确定质谱条件:电喷雾离子源(electrospray ionization,ESI)温度500℃,质谱电压5500V(positive)或者-4500V(negative),离子源气体I(GS I)55psi,气体II(GS II)60psi,气帘气(curtain gas,CUR)25psi,碰撞诱导电离(collision-activated dissociation,CAD)参数设置为高。Determine the mass spectrometry conditions: electrospray ionization (ESI) temperature 500°C, mass spectrometer voltage 5500V (positive) or -4500V (negative), ion source gas I (GS I) 55psi, gas II (GS II) 60psi, gas curtain Gas (curtain gas, CUR) 25psi, collision-induced ionization (collision-activated dissociation, CAD) parameters were set to high.
在三重四极杆(Qtrap)中,每个离子对是根据优化的去簇电压(declustering potential,DP)和碰撞能(collision energy,CE)进行MRM模式扫描检测。In the triple quadrupole (Qtrap), each ion pair is scanned in MRM mode according to the optimized declustering potential (DP) and collision energy (collision energy, CE).
按照确定的液相色谱条件和质谱条件分别对样本进行分析检测:随机挑选肺腺癌组和健康组中各20%的样本,采用增强离子扫描质谱(MIM-EPI)和飞行时间质谱(TOF)结合多反应监测采集模式的代谢组学方法,整合本地标准品数据库进行肺腺癌血清代谢物数据库构建,用液相色谱-质谱联用代谢组学方法和构建的肺腺癌血清代谢物数据库对采集的血清样本进行分析,得到各血清样本的原始质谱数据。The samples were analyzed and detected according to the determined liquid chromatography conditions and mass spectrometry conditions: 20% of the samples in the lung adenocarcinoma group and the healthy group were randomly selected, and enhanced ion scanning mass spectrometry (MIM-EPI) and time-of-flight mass spectrometry (TOF) were used. Combined with the metabolomics method of multiple reaction monitoring acquisition mode, the local standard database was integrated to construct the lung adenocarcinoma serum metabolite database. The collected serum samples were analyzed to obtain the original mass spectrometry data of each serum sample.
(3)图谱峰面积预处理和积分(3) Spectrum peak area preprocessing and integration
基于肺腺癌血清代谢物数据库,对样本的代谢物进行质谱定性定量分析。通过液相色谱能够分离不同分子量的代谢物。通过三重四极杆的多反应监测模式(MRM)筛选出每个物质的特征离子,在检测器中获得特征离子的信号强度(CPS)。用MultiQuant软件打开样本下机质谱文件,根据质荷比、保留时间对原始质谱数据进行预处理和校正,进行色谱峰的积分和校正工作,每个色谱峰的峰面积(Area)代表对应物质的相对含量,设置S/N>5,保留时间偏移不超过0.2min的峰保留;根据质谱峰强度计算峰面积得到代谢物相对含量信息,最后导出所有色谱峰面积积分数据保存,用于下一步统计分析。Based on the lung adenocarcinoma serum metabolite database, the metabolites of the samples were qualitatively and quantitatively analyzed by mass spectrometry. Metabolites of different molecular weights can be separated by liquid chromatography. The characteristic ions of each substance are screened out by the multiple reaction monitoring mode (MRM) of the triple quadrupole, and the signal intensity (CPS) of the characteristic ions is obtained in the detector. Use MultiQuant software to open the mass spectrum file of the sample off-machine, preprocess and correct the original mass spectrum data according to the mass-to-charge ratio and retention time, and perform the integration and calibration of the chromatographic peaks. The peak area (Area) of each chromatographic peak represents the corresponding substance. For relative content, set S/N>5, and retain peaks whose retention time shift does not exceed 0.2min; calculate the peak area according to the mass spectrum peak intensity to obtain the relative content information of metabolites, and finally export all chromatographic peak area integral data and save them for the next step Statistical Analysis.
(4)实验质量控制(4) Experimental quality control
通过对不同质控QC样本质谱检测分析的总离子流图(TIC图)进行重 叠展示分析,可以判断代谢物提取和检测的重复性,即技术重复。仪器的高稳定性为数据的重复性和可靠性提供了重要的保障。CV值即变异系数(Coefficient of Variation),是原始数据标准差与原始数据平均数的比,可反映数据离散程度。使用经验累积分布函数(Empirical Cumulative Distribution Function,ECDF)可以分析小于参考值的物质CV出现的频率,QC样本的CV值较低的物质占比越高,代表实验数据越稳定:QC样本CV值小于0.5的物质占比高于85%,表明实验数据较稳定;QC样本CV值小于0.3的物质占比高于75%,表明实验数据非常稳定。同时监测检测过程中L-苯基丙氨酸内标CV值变化情况,内标CV值的变化小于20%,表明检测过程中仪器稳定性好。The repeatability of metabolite extraction and detection can be judged by overlaying and displaying the total ion chromatograms (TIC charts) of different quality control QC samples for mass spectrometry detection and analysis, that is, technical repetition. The high stability of the instrument provides an important guarantee for the repeatability and reliability of the data. The CV value is the coefficient of variation (Coefficient of Variation), which is the ratio of the standard deviation of the original data to the mean of the original data, which can reflect the degree of dispersion of the data. The Empirical Cumulative Distribution Function (ECDF) can be used to analyze the frequency of the CV of substances less than the reference value. The higher the proportion of substances with lower CV values in the QC sample, the more stable the experimental data: the CV value of the QC sample is less than The proportion of substances with 0.5 is higher than 85%, indicating that the experimental data is relatively stable; the proportion of substances with QC sample CV value less than 0.3 is higher than 75%, indicating that the experimental data is very stable. At the same time, the change of the CV value of the internal standard L-phenylalanine during the detection process was monitored, and the change of the CV value of the internal standard was less than 20%, indicating that the instrument was stable during the detection process.
(5)数据处理、分析及标志物筛选(5) Data processing, analysis and marker screening
将峰面积积分数据导入SIMCA软件(Version 14.1,Sweden)进行多元统计分析。通过建立正交-偏最小二乘法判别(OPLS-DA)模型,寻找健康人与肺腺癌患者之间贡献较大的代谢物(VIP>1.0)。如图1中,黑色标记的点为VIP>1.0的代谢物,浅灰色标记的点为VIP<1.0的代谢物。然后通过T-test检验,设置p<0.05为差异显著性标准。最终筛选出VIP>1.0且p<0.05的差异代谢物。The peak area integration data were imported into SIMCA software (Version 14.1, Sweden) for multivariate statistical analysis. By establishing an orthogonal-partial least squares discriminant (OPLS-DA) model, we searched for metabolites with greater contribution (VIP>1.0) between healthy people and lung adenocarcinoma patients. As shown in Figure 1, the points marked in black are metabolites with VIP>1.0, and the points marked in light gray are metabolites with VIP<1.0. Then through T-test test, set p<0.05 as the standard of significant difference. Finally, differential metabolites with VIP>1.0 and p<0.05 were screened out.
上述分析筛选的潜在肺腺癌代谢标志物,根据其保留时间,一级和二级质谱推测标志物的分子质量和分子式,并且与代谢物谱图数据库中的谱图信息进行比对,从而对代谢物进行定性鉴定。最终通过购买标准品,用标准品的分子量、色谱保留时间和相应的多级MS裂解谱比对,验证代谢标志物的结构。The potential lung adenocarcinoma metabolic markers screened by the above analysis, according to their retention time, primary and secondary mass spectrometry, speculate the molecular mass and molecular formula of the marker, and compare it with the spectral information in the metabolite spectrum database, so as to Metabolites were qualitatively identified. Finally, the structures of metabolic markers were verified by purchasing standard products and comparing their molecular weights, chromatographic retention times, and corresponding multilevel MS fragmentation profiles.
利用二元逻辑回归向前逐步法筛选到10种差异代谢物能够诊断区分肺腺癌患者和健康人:D-半乳糖、高瓜氨酸、N6-乙酰-L-赖氨酸、4-羟基-L-脯氨酸、十六烷二酸、鸟嘌呤、谷氨酸、肌酸、丙氨酸、犬尿酸。代谢物具体信息见表3至表5:Ten differential metabolites screened by binary logistic regression forward stepwise method can diagnose and distinguish lung adenocarcinoma patients from healthy people: D-galactose, homocitrulline, N6-acetyl-L-lysine, 4-hydroxy -L-proline, hexadecanedioic acid, guanine, glutamic acid, creatine, alanine, kynuric acid. See Table 3 to Table 5 for specific information on metabolites:
表3 10种血清代谢标志物Table 3 10 serum metabolic markers
Figure PCTCN2022118708-appb-000001
Figure PCTCN2022118708-appb-000001
Figure PCTCN2022118708-appb-000002
Figure PCTCN2022118708-appb-000002
表4肺腺癌患者VS健康人代谢物Table 4 Metabolites of lung adenocarcinoma patients VS healthy people
中文名称Chinese name 差异倍数multiple of difference VIPVIP P valueP value
D-半乳糖D-galactose 1.311.31 2.062.06 2.71E-052.71E-05
高瓜氨酸high citrulline 1.251.25 1.851.85 5.06E-045.06E-04
N6-乙酰-L-赖氨酸N6-Acetyl-L-lysine 0.870.87 1.881.88 3.03E-023.03E-02
4-羟基-L-脯氨酸4-Hydroxy-L-proline 1.151.15 1.351.35 6.77E-036.77E-03
十六烷二酸hexadecanedioic acid 0.840.84 1.411.41 3.88E-023.88E-02
鸟嘌呤Guanine 0.920.92 1.221.22 2.11E-022.11E-02
谷氨酸glutamic acid 0.860.86 1.051.05 4.50E-024.50E-02
肌酸Creatine 1.181.18 1.161.16 3.04E-023.04E-02
丙氨酸Alanine 1.021.02 1.081.08 4.63E-024.63E-02
犬尿酸Kynuric acid 0.940.94 1.041.04 3.48E-023.48E-02
表5早期肺腺癌患者VS健康人代谢物Table 5 Metabolites of early lung adenocarcinoma patients VS healthy people
中文名称Chinese name 差异倍数multiple of difference VIPVIP P valueP value
D-半乳糖D-galactose 1.431.43 2.112.11 2.50E-052.50E-05
高瓜氨酸high citrulline 1.211.21 1.781.78 3.16E-033.16E-03
N6-乙酰-L-赖氨酸N6-Acetyl-L-lysine 0.890.89 1.661.66 6.20E-026.20E-02
4-羟基-L-脯氨酸4-Hydroxy-L-proline 1.171.17 1.351.35 1.64E-031.64E-03
十六烷二酸hexadecanedioic acid 0.840.84 1.411.41 1.56E-021.56E-02
鸟嘌呤Guanine 0.910.91 1.221.22 3.82E-023.82E-02
谷氨酸glutamic acid 0.870.87 1.051.05 4.54E-024.54E-02
肌酸Creatine 1.131.13 1.161.16 3.12E-023.12E-02
丙氨酸Alanine 1.061.06 1.081.08 4.65E-024.65E-02
犬尿酸Kynuric acid 0.980.98 1.041.04 9.77E-029.77E-02
采用受试者工作特征曲线(ROC)分析代谢物对肺腺癌患者的诊断性能。结果表明,D-半乳糖、高瓜氨酸、N6-乙酰-L-赖氨酸、4-羟基-L-脯氨酸、十六烷二酸、鸟嘌呤、谷氨酸、肌酸、丙氨酸、犬尿酸这10个差异代谢物单个用于诊断区分肺腺癌能力较强,ROC曲线下面积(AUC)均大于0.7,具有临床诊断意义。The diagnostic performance of metabolites in patients with lung adenocarcinoma was analyzed by receiver operating characteristic curve (ROC). The results showed that D-galactose, homocitrulline, N6-acetyl-L-lysine, 4-hydroxy-L-proline, hexadecanedioic acid, guanine, glutamic acid, creatine, propane The 10 differential metabolites such as aminoacid and kynuric acid have a strong ability to diagnose and distinguish lung adenocarcinoma, and the area under the ROC curve (AUC) is greater than 0.7, which has clinical diagnostic significance.
这10个差异代谢物联合用于诊断时,AUC进一步提高,10个联合起来诊断肺腺癌的AUC值达到0.988,在最佳cutoff值下,灵敏度和特异性分别为97.7%和94.0%;诊断早期肺腺癌的AUC达0.995,在最佳cutoff值下,灵敏度和特异性分别为98.2%和95.7%。具体数据统计见下表6至表9:When these 10 differential metabolites are combined for diagnosis, the AUC is further improved, and the AUC value of the 10 joint diagnosis of lung adenocarcinoma reaches 0.988. Under the optimal cutoff value, the sensitivity and specificity are 97.7% and 94.0% respectively; diagnosis The AUC of early lung adenocarcinoma reached 0.995, and the sensitivity and specificity were 98.2% and 95.7% respectively under the optimal cutoff value. Please refer to Table 6 to Table 9 below for specific statistics:
表6单个代谢物用于肺腺癌诊断的AUC值Table 6 AUC values of individual metabolites for the diagnosis of lung adenocarcinoma
编号serial number 中文名称Chinese name AUCAUC 灵敏度sensitivity 特异性specificity
11 D-半乳糖D-galactose 0.8580.858 85.5%85.5% 85.1%85.1%
22 高瓜氨酸high citrulline 0.8440.844 85.2%85.2% 84.3%84.3%
33 N6-乙酰-L-赖氨酸N6-Acetyl-L-lysine 0.8320.832 83.1%83.1% 83.0%83.0%
44 4-羟基-L-脯氨酸4-Hydroxy-L-proline 0.7980.798 80.4%80.4% 81.1%81.1%
55 十六烷二酸hexadecanedioic acid 0.7730.773 79.3%79.3% 77.8%77.8%
66 鸟嘌呤Guanine 0.7610.761 78.5%78.5% 77.6%77.6%
77 谷氨酸glutamic acid 0.7340.734 72.5%72.5% 75.5%75.5%
88 肌酸Creatine 0.7200.720 71.8%71.8% 74.3%74.3%
99 丙氨酸Alanine 0.7080.708 71.7%71.7% 72.5%72.5%
1010 犬尿酸Kynuric acid 0.7030.703 70.3%70.3% 71.8%71.8%
表7任意差异代谢物联合用于肺腺癌诊断的AUC值Table 7 The AUC value of any differential metabolite combined for the diagnosis of lung adenocarcinoma
联合个数joint number AUCAUC 灵敏度sensitivity 特异性specificity
任意二个any two ≥0.752≥0.752 ≥74.2%≥74.2% ≥73.3%≥73.3%
任意三个any three ≥0.775≥0.775 ≥76.4%≥76.4% ≥75.0%≥75.0%
任意四个any four ≥0.794≥0.794 ≥78.1%≥78.1% ≥76.5%≥76.5%
任意五个any five ≥0.835≥0.835 ≥81.9%≥81.9% ≥80.2%≥80.2%
任意六个any six ≥0.877≥0.877 ≥86.2%≥86.2% ≥85.1%≥85.1%
任意七个any seven ≥0.903≥0.903 ≥88.9%≥88.9% ≥87.2%≥87.2%
任意八个any eight ≥0.934≥0.934 ≥92.0%≥92.0% ≥90.1%≥90.1%
任意九个any nine ≥0.951≥0.951 ≥93.6%≥93.6% ≥92.3%≥92.3%
示例代谢标志物组合如下:Example metabolic marker combinations are as follows:
进一步优选代谢标志物组合为:D-半乳糖、高瓜氨酸、N6-乙酰-L-赖氨酸,构建诊断肺腺癌模型。这3个代谢物联合起来诊断肺腺癌的AUC值达到0.925,在最佳cutoff值下,灵敏度和特异性分别为90.8%和90.3%。A further preferred combination of metabolic markers is: D-galactose, homocitrulline, and N6-acetyl-L-lysine to construct a model for diagnosing lung adenocarcinoma. The AUC value of these three metabolites combined for the diagnosis of lung adenocarcinoma reached 0.925. Under the optimal cutoff value, the sensitivity and specificity were 90.8% and 90.3%, respectively.
进一步优选代谢标志物组合为:D-半乳糖、高瓜氨酸、N6-乙酰-L-赖氨酸、4-羟基-L-脯氨酸、十六烷二酸、鸟嘌呤构建诊断肺腺癌模型。这6个代谢物联合起来诊断肺部良恶性结节的AUC值达到0.956,在最佳cutoff值下,灵敏度和特异性分别为94.1%和93.3%。A further preferred combination of metabolic markers is: D-galactose, homocitrulline, N6-acetyl-L-lysine, 4-hydroxy-L-proline, hexadecanedioic acid, and guanine to construct a diagnostic lung gland cancer model. The AUC value of these 6 metabolites combined to diagnose benign and malignant pulmonary nodules reached 0.956. Under the optimal cutoff value, the sensitivity and specificity were 94.1% and 93.3%, respectively.
表8单个代谢物用于早期肺腺癌诊断的AUC值Table 8 The AUC value of a single metabolite for the diagnosis of early lung adenocarcinoma
编号serial number 中文名称Chinese name AUCAUC 灵敏度sensitivity 特异性specificity
11 D-半乳糖D-galactose 0.8650.865 87.5%87.5% 84.1%84.1%
22 高瓜氨酸high citrulline 0.8570.857 86.2%86.2% 83.3%83.3%
33 N6-乙酰-L-赖氨酸N6-Acetyl-L-lysine 0.8380.838 84.8%84.8% 83.0%83.0%
44 4-羟基-L-脯氨酸4-Hydroxy-L-proline 0.8160.816 82.6%82.6% 81.1%81.1%
55 十六烷二酸hexadecanedioic acid 0.7950.795 80.4%80.4% 77.8%77.8%
66 鸟嘌呤Guanine 0.7660.766 78.5%78.5% 75.6%75.6%
77 谷氨酸glutamic acid 0.7540.754 76.5%76.5% 74.3%74.3%
88 肌酸Creatine 0.7410.741 75.2%75.2% 73.7%73.7%
99 丙氨酸Alanine 0.7220.722 72.5%72.5% 71.8%71.8%
1010 犬尿酸Kynuric acid 0.7090.709 71.7%71.7% 70.5%70.5%
表9任意差异代谢物联合用于早期肺腺癌诊断的AUC值Table 9 The AUC value of any differential metabolites combined for the diagnosis of early lung adenocarcinoma
联合个数joint number AUCAUC 灵敏度sensitivity 特异性specificity
任意二个any two ≥0.766≥0.766 ≥75.6%≥75.6% ≥74.8%≥74.8%
任意三个any three ≥0.790≥0.790 ≥77.8%≥77.8% ≥76.5%≥76.5%
任意四个any four ≥0.818≥0.818 ≥80.3%≥80.3% ≥79.5%≥79.5%
任意五个any five ≥0.841≥0.841 ≥83.1%≥83.1% ≥81.8%≥81.8%
任意六个any six ≥0.885≥0.885 ≥86.2%≥86.2% ≥85.1%≥85.1%
任意七个any seven ≥0.916≥0.916 ≥90.2%≥90.2% ≥89.3%≥89.3%
任意八个any eight ≥0.947≥0.947 ≥93.0%≥93.0% ≥92.1%≥92.1%
任意九个any nine ≥0.971≥0.971 ≥95.6%≥95.6% ≥94.3%≥94.3%
示例代谢标志物组合如下:Example metabolic marker combinations are as follows:
进一步优选代谢标志物组合为:D-半乳糖、高瓜氨酸、N6-乙酰-L-赖氨酸,构建诊断早期肺腺癌模型。这3个代谢物联合起来诊断早期肺腺癌的AUC值达到0.925,在最佳cutoff值下,灵敏度和特异性分别为90.8%和90.3%。A further preferred combination of metabolic markers is: D-galactose, homocitrulline, and N6-acetyl-L-lysine to construct a model for diagnosing early lung adenocarcinoma. The AUC value of these three metabolites combined to diagnose early lung adenocarcinoma reached 0.925. Under the optimal cutoff value, the sensitivity and specificity were 90.8% and 90.3%, respectively.
进一步优选代谢标志物组合为:D-半乳糖、高瓜氨酸、N6-乙酰-L-赖氨酸、4-羟基-L-脯氨酸、十六烷二酸、鸟嘌呤,构建诊断早期肺腺癌模型。这6个代谢物联合起来诊断早期肺腺癌的AUC值达到0.956,在最佳cutoff值下,灵敏度和特异性分别为94.1%和93.3%。A further preferred combination of metabolic markers is: D-galactose, homocitrulline, N6-acetyl-L-lysine, 4-hydroxy-L-proline, hexadecanedioic acid, guanine, constructing an early diagnosis Lung adenocarcinoma model. The combined AUC value of these 6 metabolites in the diagnosis of early lung adenocarcinoma reached 0.956. Under the optimal cutoff value, the sensitivity and specificity were 94.1% and 93.3%, respectively.
实施例2:肺腺癌诊断标志物验证Example 2: Validation of diagnostic markers for lung adenocarcinoma
本研究对象共包含来自于2个独立医学中心的266例肺腺癌患者血清样本,其中包括118例早期肺腺癌患者;以及149例健康人血清样本,与特征筛选样本(150例)同一来源。其中肺腺癌的诊断标准是经术后病理确诊的肺腺癌;健康人为体检后无肺部疾病的健康人群。所有肺腺癌患者和健康人样本均无其它任何恶性肿瘤病史,无其他全身性重大疾病,无长期用药的慢性病史。根据TNM分期,将I期和II期肺腺癌患者定义为早期肺腺癌患者。The subjects of this study included 266 serum samples of patients with lung adenocarcinoma from 2 independent medical centers, including 118 patients with early lung adenocarcinoma; and 149 serum samples of healthy people, which were from the same source as the feature screening samples (150 cases) . Among them, the diagnostic standard of lung adenocarcinoma is lung adenocarcinoma diagnosed by postoperative pathology; healthy people are healthy people without lung diseases after physical examination. All lung adenocarcinoma patients and healthy samples had no history of any other malignant tumors, no other major systemic diseases, and no chronic medical history of long-term medication. According to TNM staging, patients with stage I and II lung adenocarcinoma were defined as patients with early-stage lung adenocarcinoma.
采血时间均为清晨空腹状态。所有血清样本离心后置于-80℃冰箱内保存,研究时取出血清样本解冻后进行后续分析。The time of blood collection was in the morning on an empty stomach. All serum samples were centrifuged and stored in a -80°C refrigerator. During the study, serum samples were taken out and thawed for subsequent analysis.
本实施例与实施例1的检测和数据分析方法相同,检测和分析的差异代谢物为以下10种:D-半乳糖、高瓜氨酸、N6-乙酰-L-赖氨酸、4-羟基-L-脯氨酸、十六烷二酸、鸟嘌呤、谷氨酸、肌酸、丙氨酸、犬尿酸,用于肺腺癌诊断。这10个差异代谢物单个用于诊断区分肺腺癌患者和健康人,早期肺腺癌患者和健康人能力较强,ROC曲线下面积(AUC)均大于0.7,具有临床诊断意义;这10个差异代谢物联合用于诊断时,AUC进一步提高,10个联合起来诊断肺腺癌的AUC值达到0.955,在最佳cutoff值下,灵敏度和特异性 分别为91.5%和93.6%;诊断早期肺腺癌的AUC达0.983,在最佳cutoff值下,灵敏度和特异性分别为93.2%和96.0%。单个及任意2~9个代谢物联合用于诊断时的AUC见表11至表14:The detection and data analysis methods of this example are the same as those of Example 1, and the differential metabolites detected and analyzed are the following 10 kinds: D-galactose, homocitrulline, N6-acetyl-L-lysine, 4-hydroxy - L-proline, hexadecanedioic acid, guanine, glutamic acid, creatine, alanine, kynuric acid for the diagnosis of lung adenocarcinoma. These 10 differential metabolites are individually used to diagnose and distinguish between patients with lung adenocarcinoma and healthy people. The ability of early lung adenocarcinoma patients and healthy people is relatively strong, and the area under the ROC curve (AUC) is greater than 0.7, which has clinical diagnostic significance; When the differential metabolites are combined for diagnosis, the AUC is further improved, and the AUC value of 10 joint diagnosis of lung adenocarcinoma reaches 0.955. Under the optimal cutoff value, the sensitivity and specificity are 91.5% and 93.6%, respectively; the diagnosis of early lung adenocarcinoma The AUC of cancer was 0.983, and the sensitivity and specificity were 93.2% and 96.0% at the best cutoff value. See Table 11 to Table 14 for the AUC of a single and any combination of 2 to 9 metabolites used for diagnosis:
表11单个代谢物用于肺腺癌诊断的AUC值Table 11 AUC values of individual metabolites for the diagnosis of lung adenocarcinoma
编号serial number 中文名称Chinese name AUCAUC 灵敏度sensitivity 特异性specificity
11 D-半乳糖D-galactose 0.8420.842 83.2%83.2% 85.1%85.1%
22 高瓜氨酸high citrulline 0.8320.832 82.5%82.5% 84.8%84.8%
33 N6-乙酰-L-赖氨酸N6-Acetyl-L-lysine 0.8250.825 82.2%82.2% 83.9%83.9%
44 4-羟基-L-脯氨酸4-Hydroxy-L-proline 0.8010.801 79.5%79.5% 82.2%82.2%
55 十六烷二酸hexadecanedioic acid 0.7620.762 75.3%75.3% 79.5%79.5%
66 鸟嘌呤Guanine 0.7550.755 74.4%74.4% 77.6%77.6%
77 谷氨酸glutamic acid 0.7330.733 72.5%72.5% 75.8%75.8%
88 丙氨酸Alanine 0.7160.716 71.1%71.1% 74.0%74.0%
99 犬尿酸Kynuric acid 0.7110.711 70.7%70.7% 73.2%73.2%
1010 肌酸Creatine 0.7050.705 70.1%70.1% 72.5%72.5%
表12任意差异代谢物联合用于肺腺癌诊断的AUC值Table 12 The AUC value of any differential metabolite combined for the diagnosis of lung adenocarcinoma
联合个数joint number AUCAUC 灵敏度sensitivity 特异性specificity
任意二个any two ≥0.756≥0.756 ≥71.8%≥71.8% ≥73.6%≥73.6%
任意三个any three ≥0.780≥0.780 ≥74.0%≥74.0% ≥76.1%≥76.1%
任意四个any four ≥0.797≥0.797 ≥76.1%≥76.1% ≥77.5%≥77.5%
任意五个any five ≥0.825≥0.825 ≥78.9%≥78.9% ≥80.4%≥80.4%
任意六个any six ≥0.857≥0.857 ≥81.2%≥81.2% ≥83.1%≥83.1%
任意七个any seven ≥0.888≥0.888 ≥84.9%≥84.9% ≥86.2%≥86.2%
任意八个any eight ≥0.913≥0.913 ≥88.0%≥88.0% ≥90.3%≥90.3%
任意九个any nine ≥0.931≥0.931 ≥90.6%≥90.6% ≥91.5%≥91.5%
进一步优选代谢标志物D-半乳糖、高瓜氨酸、N6-乙酰-L-赖氨酸,构建诊断肺腺癌模型。这3个代谢物联合起来诊断肺腺癌的AUC值达到0.913,在最佳cutoff值下,灵敏度和特异性分别为88.7%和89.6%。Further optimized metabolic markers D-galactose, homocitrulline, and N6-acetyl-L-lysine were used to construct a model for diagnosing lung adenocarcinoma. The AUC value of these three metabolites combined for the diagnosis of lung adenocarcinoma reached 0.913. Under the optimal cutoff value, the sensitivity and specificity were 88.7% and 89.6%, respectively.
进一步优选代谢标志物D-半乳糖、高瓜氨酸、N6-乙酰-L-赖氨酸、4-羟基-L-脯氨酸、十六烷二酸、鸟嘌呤构建诊断肺腺癌模型。这6个代谢物联合起来诊断肺部良恶性结节的AUC值达到0.931,在最佳cutoff值下,灵敏度和特异性分别为90.2%和91.4%。Further preferred metabolic markers D-galactose, homocitrulline, N6-acetyl-L-lysine, 4-hydroxy-L-proline, hexadecanedioic acid, and guanine were used to construct a model for diagnosing lung adenocarcinoma. The AUC value of these 6 metabolites combined to diagnose benign and malignant pulmonary nodules reached 0.931. Under the optimal cutoff value, the sensitivity and specificity were 90.2% and 91.4%, respectively.
表13单个代谢物用于早期肺腺癌诊断的AUC值Table 13 The AUC value of a single metabolite for the diagnosis of early lung adenocarcinoma
编号serial number 中文名称Chinese name AUCAUC 灵敏度sensitivity 特异性specificity
11 D-半乳糖D-galactose 0.8580.858 83.2%83.2% 85.1%85.1%
22 高瓜氨酸high citrulline 0.8420.842 82.5%82.5% 84.8%84.8%
33 4-羟基-L-脯氨酸4-Hydroxy-L-proline 0.8290.829 82.2%82.2% 83.9%83.9%
44 N6-乙酰-L-赖氨酸N6-Acetyl-L-lysine 0.8150.815 79.5%79.5% 82.2%82.2%
55 鸟嘌呤Guanine 0.7880.788 75.3%75.3% 79.5%79.5%
66 十六烷二酸hexadecanedioic acid 0.7610.761 74.4%74.4% 77.6%77.6%
77 谷氨酸glutamic acid 0.7450.745 72.5%72.5% 75.8%75.8%
88 丙氨酸Alanine 0.7280.728 71.1%71.1% 74.0%74.0%
99 犬尿酸Kynuric acid 0.7150.715 70.7%70.7% 73.2%73.2%
1010 肌酸Creatine 0.7080.708 70.1%70.1% 72.5%72.5%
表14任意差异代谢物联合用于早期肺腺癌诊断的AUC值Table 14 The AUC value of any differential metabolite combined for the diagnosis of early lung adenocarcinoma
联合个数joint number AUCAUC 灵敏度sensitivity 特异性specificity
任意二个any two ≥0.759≥0.759 ≥72.6%≥72.6% ≥74.5%≥74.5%
任意三个any three ≥0.781≥0.781 ≥74.8%≥74.8% ≥76.1%≥76.1%
任意四个any four ≥0.803≥0.803 ≥76.4%≥76.4% ≥78.7%≥78.7%
任意五个any five ≥0.832≥0.832 ≥79.1%≥79.1% ≥80.8%≥80.8%
任意六个any six ≥0.875≥0.875 ≥82.2%≥82.2% ≥84.4%≥84.4%
任意七个any seven ≥0.906≥0.906 ≥85.2%≥85.2% ≥87.3%≥87.3%
任意八个any eight ≥0.937≥0.937 ≥88.5%≥88.5% ≥91.1%≥91.1%
任意九个any nine ≥0.951≥0.951 ≥91.7%≥91.7% ≥93.5%≥93.5%
进一步优选代谢标志物D-半乳糖、高瓜氨酸、N6-乙酰-L-赖氨酸,构建诊断早期肺腺癌模型。这3个代谢物联合起来诊断早期肺腺癌的AUC值达到0.916,在最佳cutoff值下,灵敏度和特异性分别为89.3%和90.1%。Metabolic markers D-galactose, homocitrulline, and N6-acetyl-L-lysine were further optimized to construct a model for diagnosing early lung adenocarcinoma. The AUC value of these three metabolites combined to diagnose early lung adenocarcinoma reached 0.916. Under the optimal cutoff value, the sensitivity and specificity were 89.3% and 90.1%, respectively.
进一步优选代谢标志物D-半乳糖、高瓜氨酸、N6-乙酰-L-赖氨酸、4-羟基-L-脯氨酸、十六烷二酸、鸟嘌呤构建诊断早期肺腺癌模型。这6个代谢物联合起来诊断早期肺腺癌的AUC值达到0.951,在最佳cutoff值下,灵敏度和特异性分别为91.8%和93.2%。Further optimization of metabolic markers D-galactose, homocitrulline, N6-acetyl-L-lysine, 4-hydroxy-L-proline, hexadecanedioic acid, and guanine to construct a model for diagnosing early lung adenocarcinoma . The AUC value of these 6 metabolites combined to diagnose early lung adenocarcinoma reached 0.951, and the sensitivity and specificity were 91.8% and 93.2% respectively under the optimal cutoff value.
实施例3检测试剂盒Embodiment 3 detection kit
本实施例提供一种检测试剂盒,包括:This embodiment provides a detection kit, comprising:
(1)代谢标志物的标准品:D-半乳糖、高瓜氨酸、N6-乙酰-L-赖氨酸、4-羟基-L-脯氨酸、十六烷二酸、鸟嘌呤、谷氨酸、肌酸、丙氨酸、犬尿酸, 各标准品分别封装或标准品混合溶液封装。(1) Standards for metabolic markers: D-galactose, homocitrulline, N6-acetyl-L-lysine, 4-hydroxy-L-proline, hexadecanedioic acid, guanine, gluten amino acid, creatine, alanine, kynuric acid, each standard product is packaged separately or a mixed solution of standard products is packaged.
(2)提取溶剂:(2) Extraction solvent:
100%纯甲醇和50%乙腈水溶液用于样品制备;100% pure methanol and 50% acetonitrile in water for sample preparation;
50%乙腈水溶液可以用作溶解标准品的溶剂。50% acetonitrile in water can be used as a solvent to dissolve the standards.
(3)内标物::L-苯基丙氨酸。(3) Internal standard:: L-phenylalanine.
在此有必要指出的是,以上实施例仅限于对本发明的技术方案做进一步的阐述和说明,并不是对本发明的技术方案的进一步的限制,本发明的方法仅为较佳的实施方案,并非用于限定本发明的保护范围。凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。It must be pointed out here that the above examples are only limited to further elaboration and description of the technical solution of the present invention, and are not further limitations on the technical solution of the present invention. The method of the present invention is only a preferred implementation, not a Used to limit the protection scope of the present invention. Any modifications, equivalent replacements, improvements, etc. made within the spirit and principles of the present invention shall be included within the protection scope of the present invention.

Claims (10)

  1. 一种用于诊断或监测肺腺癌的代谢标志物,其特征在于,所述代谢标志物至少选D-半乳糖、高瓜氨酸、N6-乙酰-L-赖氨酸、4-羟基-L-脯氨酸、十六烷二酸、鸟嘌呤、谷氨酸、肌酸、丙氨酸、犬尿酸中的至少一种。A metabolic marker for diagnosing or monitoring lung adenocarcinoma, characterized in that the metabolic marker is at least selected from D-galactose, homocitrulline, N6-acetyl-L-lysine, 4-hydroxy- At least one of L-proline, hexadecanedioic acid, guanine, glutamic acid, creatine, alanine, and kynuric acid.
  2. 根据权利要求1所述的用于诊断或监测肺腺癌的代谢标志物,其特征在于,所述代谢标志物选自D-半乳糖、高瓜氨酸、N6-乙酰-L-赖氨酸中至少一种。The metabolic marker for diagnosing or monitoring lung adenocarcinoma according to claim 1, wherein the metabolic marker is selected from D-galactose, homocitrulline, N6-acetyl-L-lysine at least one of.
  3. 根据权利要求2所述的用于诊断或监测肺腺癌的代谢标志物,其特征在于,所述代谢标志物还选自4-羟基-L-脯氨酸、十六烷二酸、鸟嘌呤中的至少一种。The metabolic marker for diagnosing or monitoring lung adenocarcinoma according to claim 2, wherein the metabolic marker is also selected from 4-hydroxy-L-proline, hexadecandioic acid, guanine at least one of the
  4. 根据权利要求2或3所述的用于诊断或监测肺腺癌的代谢标志物,其特征在于,所述代谢标志物还选自谷氨酸、肌酸、丙氨酸、犬尿酸中的至少一种。The metabolic marker for diagnosing or monitoring lung adenocarcinoma according to claim 2 or 3, wherein the metabolic marker is also selected from at least one of glutamic acid, creatine, alanine, and kynuric acid. A sort of.
  5. 权利要求1至4任一项所述的用于诊断或监测肺腺癌的代谢标志物在制备诊断或监测肺癌的代谢物数据库、试剂产品或者试剂盒中的应用。Application of the metabolic markers for diagnosing or monitoring lung adenocarcinoma according to any one of claims 1 to 4 in preparing metabolite databases, reagent products or kits for diagnosing or monitoring lung cancer.
  6. 一种试剂产品或者试剂盒,其特征在于,包括权利要求1至4任一项所述的用于诊断或监测肺腺癌的代谢标志物的标准品。A reagent product or kit, characterized in that it comprises the standard product of metabolic markers for diagnosing or monitoring lung adenocarcinoma according to any one of claims 1 to 4.
  7. 根据权利要求6所述的试剂产品或者试剂盒,其特征在于,还包括内标物和/或提取试剂。The reagent product or kit according to claim 6, further comprising an internal standard and/or an extraction reagent.
  8. 根据权利要求7所述的试剂产品或者试剂盒,其特征在于,所述内标物为L-苯基丙氨酸。The reagent product or kit according to claim 7, wherein the internal standard is L-phenylalanine.
  9. 一种根据权利要求1至4任一项所述的用于诊断或监测肺腺癌的代谢标志物的筛选方法,其特征在于,包括如下步骤:A method for screening metabolic markers for diagnosing or monitoring lung adenocarcinoma according to any one of claims 1 to 4, characterized in that it comprises the steps of:
    分别采集肺腺癌组样本和健康组血清样本,肺腺癌组样本的患者的TNM分期包括为I期、II期、III期、IV期;The samples of the lung adenocarcinoma group and the serum samples of the healthy group were collected separately, and the TNM staging of the patients in the lung adenocarcinoma group included stage I, stage II, stage III, and stage IV;
    随机挑选肺腺癌组和健康组各20%的样本,采用增强离子扫描质谱和飞行时间质谱结合多反应监测采集模式的代谢组学方法,以及整合本地标准品数据库进行肺腺癌血清代谢物数据库构建;Randomly select 20% of the samples from the lung adenocarcinoma group and the healthy group, adopt the metabolomics method of enhanced ion scanning mass spectrometry and time-of-flight mass spectrometry combined with multiple reaction monitoring acquisition mode, and integrate the local standard database for lung adenocarcinoma serum metabolite database Construct;
    采用构建肺腺癌血清代谢物数据库和LC-MS检测对采集的血清样本 进行分析,得到各血清样本的原始质谱数据;The collected serum samples were analyzed by constructing a lung adenocarcinoma serum metabolite database and LC-MS detection, and the original mass spectrometry data of each serum sample were obtained;
    使用MultiQuant软件,根据质荷比、保留时间对原始质谱数据进行预处理和校正;Use MultiQuant software to preprocess and correct the original mass spectrometry data according to the mass-to-charge ratio and retention time;
    根据质谱峰强度计算峰面积得到代谢物相对含量信息;将代谢物相对含量信息进行多元统计正交-偏最小二乘法判别分析,并根据变量权重值大于1及单变量统计分析的P值小于0.05的筛选标准,得到候选差异代谢物;Calculate the peak area according to the mass spectrum peak intensity to obtain the relative content information of metabolites; conduct multivariate statistical orthogonal-partial least squares discriminant analysis on the relative content information of metabolites, and according to the variable weight value greater than 1 and the P value of univariate statistical analysis less than 0.05 screening criteria to obtain candidate differential metabolites;
    将候选差异代谢物进行二元逻辑回归建模,筛选优异代谢物及其组合对肺腺癌患者进行诊断,对筛选的优异代谢物及其组合进行受试者工作特征曲线分析,确定用于诊断或监测肺腺癌的代谢标志物。Carry out binary logistic regression modeling of candidate differential metabolites, screen excellent metabolites and their combinations for diagnosis of lung adenocarcinoma patients, perform receiver operating characteristic curve analysis on screened excellent metabolites and their combinations, and determine them for diagnosis Or monitor metabolic markers in lung adenocarcinoma.
  10. 根据权利要求9所述的用于诊断或监测肺癌的代谢标志物的筛选方法,其特征在于,LC-MS检测的条件为:The method for screening metabolic markers for diagnosis or monitoring lung cancer according to claim 9, wherein the conditions for LC-MS detection are:
    色谱柱:Waters ACQUITY UPLC HSS T3 C18 1.8μm,2.1mm*100mm;Chromatographic column: Waters ACQUITY UPLC HSS T3 C18 1.8μm, 2.1mm*100mm;
    流动相:A相为含0.1%乙酸水溶液,B相为含0.1%乙酸的乙腈溶液,流速0.4mL/min;Mobile phase: Phase A is an aqueous solution containing 0.1% acetic acid, phase B is an acetonitrile solution containing 0.1% acetic acid, and the flow rate is 0.4mL/min;
    洗脱梯度程序为:The elution gradient program is:
    0min,A相与B相的体积比为95:5;0min, the volume ratio of phase A to phase B is 95:5;
    11.0min,A相与B相的体积比为10:90;11.0min, the volume ratio of phase A and phase B is 10:90;
    12.0min,A相与B相的体积比为10:90;12.0min, the volume ratio of phase A and phase B is 10:90;
    12.1min,A相与B相的体积比为95:5;12.1min, the volume ratio of phase A to phase B is 95:5;
    14.0min,A相与B相的体积比为95:5。14.0min, the volume ratio of phase A to phase B is 95:5.
PCT/CN2022/118708 2021-11-09 2022-09-14 Marker for lung adenocarcinoma diagnosis and application thereof WO2023082820A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111318756.7 2021-11-09
CN202111318756.7A CN113960215B (en) 2021-11-09 2021-11-09 Marker for lung adenocarcinoma diagnosis and application thereof

Publications (1)

Publication Number Publication Date
WO2023082820A1 true WO2023082820A1 (en) 2023-05-19

Family

ID=79469627

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/118708 WO2023082820A1 (en) 2021-11-09 2022-09-14 Marker for lung adenocarcinoma diagnosis and application thereof

Country Status (2)

Country Link
CN (1) CN113960215B (en)
WO (1) WO2023082820A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117368479A (en) * 2023-11-13 2024-01-09 郑州大学 Biomarker and detection kit for lung adenocarcinoma diagnosis
CN118150738A (en) * 2024-05-09 2024-06-07 中国医学科学院阜外医院 A biomarker for chronic unpredictable stress model and its application

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113960215B (en) * 2021-11-09 2024-03-26 上海市第一人民医院 Marker for lung adenocarcinoma diagnosis and application thereof
CN114778729B (en) * 2022-04-20 2023-10-27 中国医学科学院药用植物研究所云南分所 A method for identifying turmeric varieties using broadly targeted metabolomics technology
CN114636774B (en) * 2022-04-22 2024-05-17 苏州市疾病预防控制中心 Biological metabolism marker composition for predicting adolescent hypertension disease risk
CN115876991B (en) * 2023-03-08 2023-06-23 中国医学科学院北京协和医院 Sugar chain markers for the diagnosis of pulmonary embolism and their application
CN116106534B (en) * 2023-04-11 2023-06-27 南京品生医学检验实验室有限公司 Application of biomarker combination in preparation of lung cancer prediction product
CN117368476B (en) * 2023-09-25 2024-03-08 西安交通大学医学院第一附属医院 Application of detection reagents for seven metabolic markers in the preparation of breast cancer diagnosis and prognosis products
CN117388495B (en) * 2023-12-13 2024-02-09 哈尔滨脉图精准技术有限公司 Application of metabolic marker for diagnosing lung cancer stage and kit
CN118421799A (en) * 2024-05-28 2024-08-02 南通大学 A group of specific methylation diagnostic markers related to lung adenocarcinoma and their application
CN118615289A (en) * 2024-06-19 2024-09-10 中国人民解放军军事科学院军事医学研究院 Application of P4Hs inhibitors in the preparation of lung cancer drugs

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040063120A1 (en) * 2002-07-10 2004-04-01 The Regents Of The University Of Michigan Expression profile of lung cancer
CN101923080A (en) * 2010-03-22 2010-12-22 苏州大学附属第一医院 A Serum Marker Protein of Lung Adenocarcinoma
CN108139381A (en) * 2015-06-26 2018-06-08 百奥马克癌症系统公司 Lung cancer detection method
CN108414660A (en) * 2018-03-08 2018-08-17 中国药科大学 One group early diagnoses relevant blood plasma metabolism small molecule marker and its application with lung cancer
CN109725072A (en) * 2017-10-27 2019-05-07 中国医学科学院药物研究所 A targeted qualitative and quantitative metabolomic analysis method for screening cancer biomarkers based on LC-MS/MS technology
CN113267586A (en) * 2021-04-30 2021-08-17 上海交通大学医学院 Application of purine metabolic marker in preparation of lung cancer molecular targeted drug acquired resistance screening and diagnostic reagent
CN113960215A (en) * 2021-11-09 2022-01-21 上海市第一人民医院 Marker for lung adenocarcinoma diagnosis and application thereof

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040063120A1 (en) * 2002-07-10 2004-04-01 The Regents Of The University Of Michigan Expression profile of lung cancer
CN101923080A (en) * 2010-03-22 2010-12-22 苏州大学附属第一医院 A Serum Marker Protein of Lung Adenocarcinoma
CN108139381A (en) * 2015-06-26 2018-06-08 百奥马克癌症系统公司 Lung cancer detection method
CN109725072A (en) * 2017-10-27 2019-05-07 中国医学科学院药物研究所 A targeted qualitative and quantitative metabolomic analysis method for screening cancer biomarkers based on LC-MS/MS technology
CN108414660A (en) * 2018-03-08 2018-08-17 中国药科大学 One group early diagnoses relevant blood plasma metabolism small molecule marker and its application with lung cancer
CN113267586A (en) * 2021-04-30 2021-08-17 上海交通大学医学院 Application of purine metabolic marker in preparation of lung cancer molecular targeted drug acquired resistance screening and diagnostic reagent
CN113960215A (en) * 2021-11-09 2022-01-21 上海市第一人民医院 Marker for lung adenocarcinoma diagnosis and application thereof

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
"Doctoral Dissertation", 1 May 2019, NANCHANG UNIVERSITY, CN, article YU, JIANGQING: "Exploring the Potential Biomarkers for Diagnosis and Classification of Lung Cancer by Non-Targeting and Targeted Metabonomics", pages: 1 - 88, XP009545602 *
QIAN, XIANG ET AL.: "Multi-Omics Analysis of Yin Deficiency And Internal Heat Syndrome of Non-Small Cell Lung Cancer Based on System Biology", CHINA JOURNAL OF TRADITIONAL CHINESE MEDICINE AND PHARMACY, vol. 36, no. 11, 1 November 2021 (2021-11-01), XP009546360, ISSN: 1673-1727 *
ZHANG JIN-SHENG: "The Role of Lysine Acetylation of Protein in Lung Cancer And Pulmonary Fibrosis", FUDAN UNIV J MED SCI, vol. 38, no. 4, 1 July 2011 (2011-07-01), pages 287 - 293, XP093066854 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117368479A (en) * 2023-11-13 2024-01-09 郑州大学 Biomarker and detection kit for lung adenocarcinoma diagnosis
CN118150738A (en) * 2024-05-09 2024-06-07 中国医学科学院阜外医院 A biomarker for chronic unpredictable stress model and its application

Also Published As

Publication number Publication date
CN113960215B (en) 2024-03-26
CN113960215A (en) 2022-01-21

Similar Documents

Publication Publication Date Title
WO2023082820A1 (en) Marker for lung adenocarcinoma diagnosis and application thereof
WO2023082821A1 (en) Serum metabolism marker for diagnosing benign and malignant pulmonary nodules and use thereof
CN105044361B (en) A kind of diagnostic marker and its screening technique for being suitable for esophageal squamous cell carcinoma early diagnosis
Shen et al. Developing urinary metabolomic signatures as early bladder cancer diagnostic markers
CN102323351B (en) Bladder cancer patient urine specific metabolite spectrum, establishing method and application
CA2619732C (en) Methods for the diagnosis of colorectal cancer and ovarian cancer health states
CN114373510B (en) Metabolic marker for diagnosing or monitoring lung cancer and screening method and application thereof
CN108414660B (en) Application of group of plasma metabolism small molecule markers related to early diagnosis of lung cancer
US20170097355A1 (en) Biomarkers and methods to distinguish ovarian cancer from benign tumors
Guo et al. Evaluation of significant features discovered from different data acquisition modes in mass spectrometry-based untargeted metabolomics
CN114924073B (en) Tumor diagnosis marker combination in colorectal progression stage and application thereof
WO2013177222A1 (en) Metabolite biomarkers for the detection of liver cancer
CN111562338A (en) Application of transparent renal cell carcinoma metabolic marker in renal cell carcinoma early screening and diagnosis product
Liu et al. Free amino acid profiling of gastric juice as a method for discovering potential biomarkers of early gastric cancer
Lv et al. Exploratory study on application of MALDI‑TOF‑MS to detect serum and urine peptides related to small cell lung carcinoma
Issaq et al. Biomarker discovery: study design and execution
Yang et al. Proteomic profiling of invasive ductal carcinoma (IDC) using magnetic beads‐based serum fractionation and MALDI‐TOF MS
CN112255335B (en) Plasma metabolic markers for distinguishing benign from malignant ovarian tumors and their applications
CN118011003A (en) A biomarker composition for diagnosing gastric cancer and its application
CN105044240A (en) Diagnosis marker suitable for early-stage esophageal squamous cell cancer diagnosis
CN114280202A (en) Biomarker for diagnosing cadmium poisoning and application thereof
CN114965733B (en) Colorectal advanced adenoma diagnosis metabolic marker combination and application thereof
CN109387593B (en) Acute myelocytic leukemia chemotherapy sensitivity evaluation kit and application thereof
CN114705782B (en) Plasma metabolism marker combination for diagnosing or monitoring colorectal cancer and application
CN105044342B (en) A kind of diagnostic marker for being suitable for cancer of the esophagus early diagnosis

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22891625

Country of ref document: EP

Kind code of ref document: A1

122 Ep: pct application non-entry in european phase

Ref document number: 22891625

Country of ref document: EP

Kind code of ref document: A1