[go: up one dir, main page]

CN113151460A - Gene marker for identifying lung adenocarcinoma tumor cells and application thereof - Google Patents

Gene marker for identifying lung adenocarcinoma tumor cells and application thereof Download PDF

Info

Publication number
CN113151460A
CN113151460A CN202110127761.3A CN202110127761A CN113151460A CN 113151460 A CN113151460 A CN 113151460A CN 202110127761 A CN202110127761 A CN 202110127761A CN 113151460 A CN113151460 A CN 113151460A
Authority
CN
China
Prior art keywords
tumor cells
lung adenocarcinoma
genes
cyb5a
mal2
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110127761.3A
Other languages
Chinese (zh)
Other versions
CN113151460B (en
Inventor
梁嘉琪
陈振淙
郑元生
黄宜炜
卞赟艺
毕国澍
詹成
王群
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhongshan Hospital Fudan University
Original Assignee
Zhongshan Hospital Fudan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhongshan Hospital Fudan University filed Critical Zhongshan Hospital Fudan University
Priority to CN202110127761.3A priority Critical patent/CN113151460B/en
Publication of CN113151460A publication Critical patent/CN113151460A/en
Application granted granted Critical
Publication of CN113151460B publication Critical patent/CN113151460B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • C12Q1/6886Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K45/00Medicinal preparations containing active ingredients not provided for in groups A61K31/00 - A61K41/00
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P35/00Antineoplastic agents
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/158Expression markers

Landscapes

  • Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • General Health & Medical Sciences (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Zoology (AREA)
  • Animal Behavior & Ethology (AREA)
  • Public Health (AREA)
  • Veterinary Medicine (AREA)
  • Wood Science & Technology (AREA)
  • Engineering & Computer Science (AREA)
  • Genetics & Genomics (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Analytical Chemistry (AREA)
  • Immunology (AREA)
  • Medicinal Chemistry (AREA)
  • Pathology (AREA)
  • Oncology (AREA)
  • Hospice & Palliative Care (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • General Chemical & Material Sciences (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Epidemiology (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention relates to a gene marker for identifying lung adenocarcinoma tumor cells and application thereof, belonging to the technical field of molecular biology. The gene marker is selected from at least one of genes ERRFI1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1A1, SMIM22 and TSPAN 3; wherein, the expression quantity of three genes of MAL2, CYB5A and ATP11A in lung adenocarcinoma tumor cells is 3-6 times of that of normal cells, the invention also establishes a risk assessment model by a Logistic regression method so as to more accurately identify the lung adenocarcinoma tumor cells with single cell type, and the specificity and the sensitivity of the model are both close to 90 percent. The gene marker for identifying lung adenocarcinoma tumor cells provides a new molecular target for clinically developing a method for diagnosing lung adenocarcinoma and a medicament for treating lung adenocarcinoma.

Description

Gene marker for identifying lung adenocarcinoma tumor cells and application thereof
Technical Field
The invention relates to a gene marker for identifying lung adenocarcinoma tumor cells and application thereof, belonging to the technical field of molecular biology.
Background
Lung cancer is the most common and most fatal tumor, with morbidity and mortality ranking first for all tumors. In recent years, the advent of single cell RNA sequencing has enabled researchers to analyze tumors with higher resolution, examine gene expression of each single cell, and map tumor views including the surrounding environment, thus uncovering a rich tumor ecosystem. However, without a reliable calculation method, it is difficult to distinguish between tumor cells and normal cells. Currently, in single cell data analysis, a mainstream method is to annotate cells by using a database such as CellMarker and the like through a cell type and cell marker gene table comparison mode, namely, according to the relationship between the marker gene table and an analyzed difference gene table and the abundance of genes, to infer which cell type a certain cell cluster belongs.
However, different databases or annotation methods can generally only distinguish cell types with significant differences, and for cell types with certain similarities and subtypes in the same category, the difficulty in correctly predicting and distinguishing the cell types is high, and at present, a relatively perfect method is not available for realizing high accuracy in different data sets. Therefore, there may still exist a gene which is convenient, accurate, and has higher sensitivity, specificity and application value, and can be used for distinguishing tumor cells from non-tumor cells.
Disclosure of Invention
The technical problem solved by the invention is as follows: the technical problem of how to accurately distinguish tumor cells from non-tumor cells in lung adenocarcinoma.
In order to solve the above problems, the present invention provides a gene marker for identifying lung adenocarcinoma tumor cells, selected from at least one of genes erfi 1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1a1, SMIM22, and TSPAN 3; the gene is differentially expressed in lung adenocarcinoma tumor cells and non-tumor cells.
Preferably, the gene marker for identifying lung adenocarcinoma tumor cells is a combination of genes ERRFI1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1A1, SMIM22 and TSPAN 3.
Preferably, the gene marker for identifying lung adenocarcinoma tumor cells is selected from one of genes MAL2, CYB5A and ATP 11A.
The invention also provides application of the gene marker for identifying lung adenocarcinoma tumor cells, and the application is application other than diagnosis and treatment.
Preferably, the use comprises use in the manufacture of a medicament for the treatment of lung adenocarcinoma.
Preferably, the use comprises use in the manufacture of a kit for diagnosing lung adenocarcinoma.
Compared with the prior art, the invention has the following beneficial effects:
1. the gene marker for identifying the lung adenocarcinoma tumor cells can accurately identify the lung adenocarcinoma tumor cells, and provides a new molecular target for clinically developing a method for diagnosing lung adenocarcinoma and a medicament for treating lung adenocarcinoma.
2. The risk assessment model established by the Logistic regression method is used for identifying the lung adenocarcinoma tumor cells of the single cell type, and the specificity and the sensitivity of the risk assessment model are close to 90 percent.
Drawings
FIG. 1 is a diagram of a single cell sample of lung adenocarcinoma and a result of cell type clustering;
FIG. 2 is a ROC plot of 11 genes such as ERRFI1, wherein the abscissa represents specificity and the ordinate represents sensitivity, in distinguishing tumor cells from non-tumor cells;
FIG. 3 shows the expression of 11 genes such as ERRFI1 in different cell types;
FIG. 4 is a ROC plot of a risk assessment model constructed based on 11 genes such as ERRFI1, wherein the abscissa represents specificity and the ordinate represents sensitivity;
FIG. 5 is a schematic representation of the results of flow cytometry validation of cell surface markers and sorting of tumor and non-tumor cells;
FIG. 6 is a graph showing the results of RT-qPCR method to detect the difference of multiple mRNA expression of 11 genes such as ERRFI1 in tumor cell samples and non-tumor cell samples, wherein the ordinate represents the difference of multiple mRNA expression.
Detailed Description
In order to make the invention more comprehensible, preferred embodiments are described in detail below with reference to the accompanying drawings.
Example 1
Single cell sequencing result analysis of lung adenocarcinoma tissue:
1.1 by analyzing the single-cell RNA sequencing data results of 17 tumor tissue samples of lung adenocarcinoma and 12 normal lung tissue samples, 204157 single-cell gene expression data were obtained, wherein 22491 tumor cells and 181666 non-tumor cells (alveolar cells, B cells, endothelial cells, epithelial cells, fibroblasts, mast cells, myeloid cells, and T cells) were included, and the single-cell sample of lung adenocarcinoma and the cell type clustering results are shown in FIG. 1.
1.2 Using the R language Seurat package, the gene expression differences between the tumor and non-tumor cells were analyzed after normalization of the expression values of each gene of each cell in the single-cell sample by the "ScaleData" function, and. + -. 0.5 was chosen as log2And F, FC threshold value, and finally screening 1655 genes with differential expression, wherein 949 genes with expression remarkably higher than that of non-tumor cells in tumor cells and 706 genes with expression remarkably higher than that of tumor cells in non-tumor cells. Then, 949 genes which were selected to be significantly highly expressed in tumor cells were analyzed by using a ROC curve, and the results of 51 gene expression cases with an area under the ROC curve (AUC) value of more than 0.80 in tumor cell differentiation and ROC analysis are shown in Table 1.
TABLE 1 Gene expression profiles and ROC analysis for specific high expression in lung adenocarcinoma tumor cells
Figure BDA0002924064170000031
Figure BDA0002924064170000041
Figure BDA0002924064170000051
Since some of the 51 genes have been used as markers for identifying tumor cells, and some genes have been studied in the field of lung cancer, and the existing genes were deleted, eleven genes, ERRFI1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1A1, SMIM22, and TSPAN3, were selected and further studied, and the data in Table 1 revealed that the ROC curves of these eleven genes all had an area of 0.80 or more. The ROC curves of erfi 1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1a1, SMIM22, and TSPAN3 genes for distinguishing between tumor cells and non-tumor cells are shown in fig. 2, and the distribution of the expression levels of each of them in the respective cells in the single-cell cluster map is shown in fig. 3.
Example 2
A Logistic regression method is applied to establish a risk assessment model to distinguish single tumor cells from non-tumor cells:
2.1 the ERRFI1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1A1, SMIM22 and TSPAN3 genes are included in a Logistic regression model, and the obtained risk assessment model is as follows:
Figure BDA0002924064170000052
wherein a, b, c, d, e, f, g, h, i, j, and k represent the normalized expression values of mRNA of eleven genes, i.e., ERRFI1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1A1, SMIM22, and TSPAN3, respectively.
2.2 the results of the above model were verified using the gene expression data of single cell of 1.1 in example 1, and the results showed that the specificity and sensitivity of the model were close to 90% compared to the gene alone, and the ROC curve is shown in fig. 4, therefore the prediction model has further improved prediction performance compared to the single gene, and can more accurately distinguish tumor cells from non-tumor cells. Thus, the risk assessment model can be applied to the analysis of cell types of any lung adenocarcinoma single cell type: according to the above model, the expression values normalized by the mrnas of the erfi 1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1a1, SMIM22, and TSPAN3 genes in the single-cell data are substituted into the above model, and the obtained P value is closer to 1, and it is proved that the higher the probability that the P value is a tumor cell, the closer to 0 the P value, the lower the probability that the P value is a tumor cell, and the higher the probability that the P value is a non-tumor cell.
Example 3
Flow cytometry sorting of tumor cells and non-tumor cells to verify gene expression:
3.1 flow assay validation of tumor and normal tissues of 5 patients with lung adenocarcinoma was performed by using flow cytometric sorting. EPCAM, FLOR1 and CD45 are used as surface markers of tumor cells, normal epithelial cells and lymphocytes respectively, 5 pairs of sample tissues are dissociated into cells, the cells are stained by corresponding fluorescein-coupled antibodies respectively, and the tumor cells of EPCAM + CD 45-and the non-tumor cells of EPCAM-FLOR 1+/CD45+ are separated by a flow cytometry method, and the result is shown in FIG. 5.
3.2 detection of fold difference in mRNA expression of these 11 genes in sorted tumor and non-tumor cells by RT-qPCR method, the results are shown in FIG. 6. From the results in fig. 6, it is understood that the expression levels of 11 genes of erfi 1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1a1, SMIM22, and TSPAN3 in tumor cells are higher than those of non-tumor cells, particularly three genes of MAL2, CYB5A, and ATP11A, and the expression levels in tumor cells are 3 to 6 times higher than those in non-tumor cells. Therefore, 11 genes such as ERRFI1 can be used for distinguishing lung adenocarcinoma tumor cells from non-tumor cells. Especially three genes of MAL2, CYB5A and ATP11A, if one or more of the three genes are expressed in high amount in the lung cancer sample, the sample is tumor.
While the invention has been described with respect to a preferred embodiment, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention.

Claims (6)

1. A genetic marker that recognizes lung adenocarcinoma tumor cells, selected from at least one of the genes erfi 1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1a1, SMIM22, and TSPAN 3.
2. The genetic marker for identifying lung adenocarcinoma tumor cells according to claim 1, which is a combination of genes ERRFI1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1A1, SMIM22 and TSPAN 3.
3. The genetic marker for identifying lung adenocarcinoma tumor cells according to claim 1, wherein the gene is selected from one of the genes MAL2, CYB5A and ATP 11A.
4. Use of a genetic marker for the identification of lung adenocarcinoma tumor cells according to any one of claims 1 to 3, characterized in that said use is in addition to diagnosis and therapy.
5. The use according to claim 4, wherein the use comprises use in the manufacture of a medicament for the treatment of lung adenocarcinoma.
6. The use according to claim 4, wherein said use comprises use in the preparation of a kit for the diagnosis of lung adenocarcinoma.
CN202110127761.3A 2021-01-29 2021-01-29 A gene marker for identifying lung adenocarcinoma tumor cells and its application Active CN113151460B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110127761.3A CN113151460B (en) 2021-01-29 2021-01-29 A gene marker for identifying lung adenocarcinoma tumor cells and its application

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110127761.3A CN113151460B (en) 2021-01-29 2021-01-29 A gene marker for identifying lung adenocarcinoma tumor cells and its application

Publications (2)

Publication Number Publication Date
CN113151460A true CN113151460A (en) 2021-07-23
CN113151460B CN113151460B (en) 2022-10-18

Family

ID=76879035

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110127761.3A Active CN113151460B (en) 2021-01-29 2021-01-29 A gene marker for identifying lung adenocarcinoma tumor cells and its application

Country Status (1)

Country Link
CN (1) CN113151460B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113755602A (en) * 2021-10-20 2021-12-07 复旦大学附属中山医院 A gene marker for identifying lung adenocarcinoma tumor stem cells and its application
WO2023004624A1 (en) * 2021-07-28 2023-02-02 北京沙东生物技术有限公司 Myeloma biomarker lgals3bp and use thereof
CN115993453A (en) * 2022-07-22 2023-04-21 四川大学 Methods and kits for the diagnosis and treatment of RDAA-positive diseases

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103874770A (en) * 2011-08-08 2014-06-18 卡里斯生命科学卢森堡控股有限责任公司 Biomarker compositions and methods
WO2014151117A1 (en) * 2013-03-15 2014-09-25 The Board Of Trustees Of The Leland Stanford Junior University Identification and use of circulating nucleic acid tumor markers
US20180106806A1 (en) * 2016-10-13 2018-04-19 Regents Of The University Of Minnesota Tumor Analytical Methods

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103874770A (en) * 2011-08-08 2014-06-18 卡里斯生命科学卢森堡控股有限责任公司 Biomarker compositions and methods
WO2014151117A1 (en) * 2013-03-15 2014-09-25 The Board Of Trustees Of The Leland Stanford Junior University Identification and use of circulating nucleic acid tumor markers
US20180106806A1 (en) * 2016-10-13 2018-04-19 Regents Of The University Of Minnesota Tumor Analytical Methods

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
CHUNHUA WEI ET AL.: ""LPCAT1 promotes brain metastasis of lung adenocarcinoma by up-regulating PI3K/AKT/MYC pathway"", 《JOURNAL OF EXPERIMENTAL & CLINICAL CANCER RESEARCH》 *
JIAQI LIANG ET AL.: ""Signatures of malignant cells and novel therapeutic targets revealed by single-cell sequencing in lung adenocarcinoma"", 《CANCER MEDICINE》 *
YING LIU ET AL.: ""Identifcation of feature genes for smoking-related lung adenocarcinoma based on gene expression profle data"", 《ONCOTARGETS AND TERAPY》 *
Z-H YANG ET AL.: ""Abnormal gene expression and gene fusion in lung adenocarcinoma with high-throughput RNA sequencing"", 《CANCER GENE THERAPY》 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2023004624A1 (en) * 2021-07-28 2023-02-02 北京沙东生物技术有限公司 Myeloma biomarker lgals3bp and use thereof
CN113755602A (en) * 2021-10-20 2021-12-07 复旦大学附属中山医院 A gene marker for identifying lung adenocarcinoma tumor stem cells and its application
CN115993453A (en) * 2022-07-22 2023-04-21 四川大学 Methods and kits for the diagnosis and treatment of RDAA-positive diseases
CN115993453B (en) * 2022-07-22 2025-03-25 四川大学 Methods and kits for diagnosis and treatment of RDAA-positive diseases

Also Published As

Publication number Publication date
CN113151460B (en) 2022-10-18

Similar Documents

Publication Publication Date Title
CN113151460A (en) Gene marker for identifying lung adenocarcinoma tumor cells and application thereof
US20100099093A1 (en) Biomarkers for the Identification Monitoring and Treatment of Head and Neck Cancer
CN114203256B (en) MIBC typing and prognosis prediction model construction method based on microbial abundance
CN110577998A (en) Construction of molecular model for predicting postoperative early recurrence risk of liver cancer and application evaluation thereof
CN113355419B (en) Breast cancer prognosis risk prediction marker composition and application
US20130122499A1 (en) System and method of detecting local copy number variation in dna samples
CN113355422A (en) Gene combination for human tumor classification and application thereof
CN112831562A (en) A biomarker combination and kit for predicting the recurrence risk of liver cancer patients after resection
CN111863250A (en) A combined diagnostic model and system for early breast cancer
CN115410713A (en) Hepatocellular carcinoma prognosis risk prediction model construction based on immune-related gene
CN113215254A (en) Immune-clinical characteristic combined prediction model for evaluating lung adenocarcinoma prognosis
CN115807089A (en) Prognostic biomarkers and applications of hepatocellular carcinoma
SG190466A1 (en) Methods for diagnosis and/or prognosis of ovarian cancer
CN113061655A (en) A set of gene signatures and applications for predicting neoadjuvant chemotherapy sensitivity in breast cancer
WO2019232361A1 (en) Personalized treatment of pancreatic cancer
Aanei et al. Database-guided analysis for immunophenotypic diagnosis and follow-up of acute myeloid leukemia with recurrent genetic abnormalities
CN115862876B (en) A device for predicting the prognosis of patients with lung adenocarcinoma based on the immune microenvironment gene group
CN109880905B (en) Genes for immunohistochemical typing of triple negative breast cancer and application thereof
Loi et al. Discriminating lymphomas and reactive lymphadenopathy in lymph node biopsies by gene expression profiling
CN117269495B (en) A marker protein combination for prostate cancer protein molecular typing and its application
CN112760383B (en) qRT-PCR internal reference gene applied to lung adenocarcinoma cell subgroup and application thereof
CN115637292B (en) A model for predicting the risk of small cell transformation in patients with lung adenocarcinoma and its establishment method
Becker et al. Path-19. Tumor heterogeneity in gliomas: A pilot study of histopathology-associated proteome profiles assessed by liquid chromatography tandem mass spectrometry of ffpe samples
CN118932060A (en) Gene composition for immunohistochemical typing of HR+/HER2+ breast cancer and its application
CN119799894A (en) A set of genes for immunohistochemical typing of BRAF V600E mutant colorectal cancer and its application

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant