CN113151460B - Gene marker for identifying lung adenocarcinoma tumor cells and application thereof - Google Patents

Gene marker for identifying lung adenocarcinoma tumor cells and application thereof Download PDF

Info

Publication number
CN113151460B
CN113151460B CN202110127761.3A CN202110127761A CN113151460B CN 113151460 B CN113151460 B CN 113151460B CN 202110127761 A CN202110127761 A CN 202110127761A CN 113151460 B CN113151460 B CN 113151460B
Authority
CN
China
Prior art keywords
tumor cells
lung adenocarcinoma
genes
gene marker
adenocarcinoma tumor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110127761.3A
Other languages
Chinese (zh)
Other versions
CN113151460A (en
Inventor
梁嘉琪
陈振淙
郑元生
黄宜炜
卞赟艺
毕国澍
詹成
王群
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhongshan Hospital Fudan University
Original Assignee
Zhongshan Hospital Fudan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhongshan Hospital Fudan University filed Critical Zhongshan Hospital Fudan University
Priority to CN202110127761.3A priority Critical patent/CN113151460B/en
Publication of CN113151460A publication Critical patent/CN113151460A/en
Application granted granted Critical
Publication of CN113151460B publication Critical patent/CN113151460B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • C12Q1/6886Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K45/00Medicinal preparations containing active ingredients not provided for in groups A61K31/00 - A61K41/00
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61PSPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
    • A61P35/00Antineoplastic agents
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/158Expression markers

Landscapes

  • Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • General Health & Medical Sciences (AREA)
  • Medicinal Chemistry (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Engineering & Computer Science (AREA)
  • Animal Behavior & Ethology (AREA)
  • Genetics & Genomics (AREA)
  • Public Health (AREA)
  • Veterinary Medicine (AREA)
  • Wood Science & Technology (AREA)
  • Analytical Chemistry (AREA)
  • Immunology (AREA)
  • Zoology (AREA)
  • Pathology (AREA)
  • Oncology (AREA)
  • Hospice & Palliative Care (AREA)
  • Epidemiology (AREA)
  • Physics & Mathematics (AREA)
  • Biophysics (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • General Chemical & Material Sciences (AREA)
  • Chemical Kinetics & Catalysis (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The invention relates to a gene marker for identifying lung adenocarcinoma tumor cells and application thereof, belonging to the technical field of molecular biology. The gene marker is selected from at least one of genes ERRFI1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1A1, SMIM22 and TSPAN 3; the expression quantity of three genes of MAL2, CYB5A and ATP11A in lung adenocarcinoma tumor cells is 3-6 times of that of normal cells, and the risk assessment model is established by a Logistic regression method to more accurately identify the lung adenocarcinoma tumor cells of single cell type, wherein the specificity and the sensitivity of the model are close to 90%. The gene marker for identifying lung adenocarcinoma tumor cells provides a new molecular target for clinically developing a method for diagnosing lung adenocarcinoma and a medicament for treating lung adenocarcinoma.

Description

Gene marker for identifying lung adenocarcinoma tumor cells and application thereof
Technical Field
The invention relates to a gene marker for identifying lung adenocarcinoma tumor cells and application thereof, belonging to the technical field of molecular biology.
Background
Lung cancer is the most common and most fatal tumor, with morbidity and mortality ranking all first. In recent years, the advent of single cell RNA sequencing has enabled researchers to analyze tumors with higher resolution, examine gene expression of each single cell, and map tumor views including the surrounding environment, thus uncovering a rich tumor ecosystem. However, without a reliable calculation method, it is difficult to distinguish between tumor cells and normal cells. Currently, in single cell data analysis, a mainstream method is to annotate cells by using a database such as CellMarker and the like through a cell type and cell marker gene table comparison mode, namely, according to the relationship between the marker gene table and an analyzed difference gene table and the abundance of genes, to infer which cell type a certain cell cluster belongs.
However, different databases or annotation methods can generally only distinguish cell types with significant differences, and for cell types with certain similarities and subtypes in the same category, the difficulty in correctly predicting and distinguishing the cell types is high, and at present, a relatively perfect method is not available for realizing high accuracy in different data sets. Therefore, there still exists a gene which is convenient and accurate, and has higher sensitivity, specificity and application value, and can be used for distinguishing tumor cells from non-tumor cells.
Disclosure of Invention
The technical problem solved by the invention is as follows: the technical problem of how to accurately distinguish tumor cells from non-tumor cells in lung adenocarcinoma.
In order to solve the above problems, the present invention provides a gene marker for identifying lung adenocarcinoma tumor cells, selected from at least one of genes erfi 1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1A1, SMIM22, and TSPAN 3; the gene is differentially expressed in lung adenocarcinoma tumor cells and non-tumor cells.
Preferably, the gene marker for identifying lung adenocarcinoma tumor cells is a combination of genes ERRFI1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1A1, SMIM22 and TSPAN 3.
Preferably, the gene marker for identifying lung adenocarcinoma tumor cells is selected from one of genes MAL2, CYB5A and ATP 11A.
The invention also provides application of the gene marker for identifying lung adenocarcinoma tumor cells, and the application is application other than diagnosis and treatment.
Preferably, the use comprises use in the manufacture of a medicament for the treatment of lung adenocarcinoma.
Preferably, the use comprises use in the manufacture of a kit for diagnosing lung adenocarcinoma.
Compared with the prior art, the invention has the following beneficial effects:
1. the gene marker for identifying lung adenocarcinoma tumor cells can accurately identify the lung adenocarcinoma tumor cells, and provides a new molecular target for clinically developing a method for diagnosing lung adenocarcinoma and a medicament for treating lung adenocarcinoma.
2. The risk assessment model established by the Logistic regression method is used for identifying the lung adenocarcinoma tumor cells of single cell type, and the specificity and the sensitivity of the risk assessment model are close to 90%.
Drawings
FIG. 1 is a diagram of the result of single cell sample and cell type clustering of lung adenocarcinoma;
FIG. 2 is a ROC plot of 11 genes such as ERRFI1, wherein the abscissa represents specificity and the ordinate represents sensitivity, in differentiating tumor cells from non-tumor cells;
FIG. 3 shows the expression of 11 genes such as ERRFI1 in different cell types;
FIG. 4 is a ROC plot of a risk assessment model constructed based on 11 genes such as ERRFI1, wherein the abscissa represents specificity and the ordinate represents sensitivity;
FIG. 5 is a schematic representation of the results of flow cytometry validation of cell surface markers and sorting of tumor and non-tumor cells;
FIG. 6 is a graph showing the results of RT-qPCR method for detecting the fold difference of mRNA expression of 11 genes such as ERRFI1 in tumor cell samples and non-tumor cell samples, wherein the ordinate represents the fold difference of mRNA expression.
Detailed Description
In order to make the invention more comprehensible, preferred embodiments are described in detail below with reference to the accompanying drawings.
Example 1
Single cell sequencing result analysis of lung adenocarcinoma tissue:
1.1 by analyzing the single cell RNA sequencing data results of 17 tumor tissue samples of lung adenocarcinoma and 12 normal lung tissue samples, 204157 single cell gene expression data were obtained, wherein 22491 tumor cells and 181666 non-tumor cells (alveolar cells, B cells, endothelial cells, epithelial cells, fibroblasts, mast cells, myeloid cells, and T cells) were included, and the single cell samples and cell type clustering results of lung adenocarcinoma are shown in FIG. 1.
1.2 Using the R language Seurat package, the differences in gene expression between the tumor and non-tumor cells were analyzed and. + -. 0.5 log was chosen, after normalization of the expression values of each gene in each cell in the single cell sample by the "ScaleData" function 2 And F, FC threshold value, and finally screening 1655 genes with differential expression, wherein 949 genes with expression remarkably higher than that of non-tumor cells in tumor cells and 706 genes with expression remarkably higher than that of tumor cells in non-tumor cells. Then, 949 genes which were selected to be significantly highly expressed in tumor cells were analyzed by using a ROC curve, and the results of 51 gene expression cases with an area under the ROC curve (AUC) value of more than 0.80 in tumor cell differentiation and ROC analysis are shown in Table 1.
TABLE 1 Gene expression profiles and ROC analysis for specific high expression in lung adenocarcinoma tumor cells
Figure BDA0002924064170000031
Figure BDA0002924064170000041
Figure BDA0002924064170000051
Since some of the 51 genes have been used as markers for identifying tumor cells, and some genes have been studied in the field of lung cancer, and the expression of the genes has been deleted, eleven genes ERRFI1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1A1, SMIM22, and TSPAN3 were screened and further studied, and it was found from the data in Table 1 that the areas under the ROC curve of the eleven genes were all 0.80 or more. The ROC curves of erfi 1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1A1, SMIM22, and TSPAN3 genes in distinguishing between tumor cells and non-tumor cells are shown in fig. 2, and the distribution of their expression levels in each type of cells in the single-cell cluster map is shown in fig. 3.
Example 2
A Logistic regression method is applied to establish a risk assessment model to distinguish single tumor cells from non-tumor cells:
2.1 the ERRFI1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1A1, SMIM22 and TSPAN3 genes are included in a Logistic regression model, and the obtained risk assessment model is as follows:
Figure BDA0002924064170000052
wherein a, b, c, d, e, f, g, h, i, j, and k represent the normalized expression values of mRNA of eleven genes, i.e., ERRFI1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1A1, SMIM22, and TSPAN3, respectively.
2.2 the results of the above model were verified using the gene expression data of the single cell of 1.1 in example 1, and the results showed that the specificity and sensitivity of the model were close to 90% compared to the single gene, and the ROC curve is shown in fig. 4, so that the prediction model was further improved in prediction performance compared to the single gene, and tumor cells and non-tumor cells could be more accurately distinguished. Therefore, the risk assessment model can be applied to the analysis of cell types of any lung adenocarcinoma single cell type: according to the model, the expression values normalized by the mrnas of the erfi 1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1A1, SMIM22, and TSPAN3 genes in the single-cell data are substituted into the model, and the P value obtained is closer to 1, and the higher the probability of being a tumor cell, the closer to 0 the P value, the lower the probability of being a tumor cell, and the higher the probability of being a non-tumor cell.
Example 3
Flow cytometry sorting of tumor cells and non-tumor cells to verify gene expression:
3.1 flow analysis of tumor and normal tissues of 5 patients with lung adenocarcinoma was verified by using flow cytometric sorting. Firstly, EPCAM, FLOR1 and CD45 are respectively used as surface markers of tumor cells, normal epithelial cells and lymphocytes, 5 pairs of sample tissues are dissociated into cells, the cells are respectively stained by corresponding fluorescein coupled antibodies, and then the tumor cells of EPCAM + CD 45-and the non-tumor cells of EPCAM-FLOR1+/CD45+ are separated by adopting a flow cytometry method, and the result is shown in figure 5.
3.2 detection of fold difference in mRNA expression of these 11 genes by RT-qPCR method in the sorted tumor cells and non-tumor cells, the results are shown in FIG. 6. From the results shown in FIG. 6, it is understood that 11 genes of ERRFI1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1A1, SMIM22 and TSPAN3 are expressed in tumor cells at a higher level than in non-tumor cells, particularly three genes of MAL2, CYB5A and ATP11A, and that the expression level in tumor cells is 3 to 6 times higher than that in non-tumor cells. Therefore, 11 genes such as ERRFI1 can be used for distinguishing lung adenocarcinoma tumor cells from non-tumor cells. Especially three genes of MAL2, CYB5A and ATP11A, if one or more of the three genes are expressed in high amount in the lung cancer sample, the sample is tumor.
While the invention has been described with respect to a preferred embodiment, it will be understood by those skilled in the art that the foregoing and other changes, omissions and deviations in the form and detail thereof may be made without departing from the scope of this invention.

Claims (2)

1. A genetic marker for identifying lung adenocarcinoma tumor cells, which is a combination of genes erfi 1, MAL2, RNASE1, ATP11A, CYB5A, LGALS3BP, LPCAT1, SPINT2, ATP1A1, SMIM22 and TSPAN 3.
2. Use of a reagent for detecting a genetic marker for identifying lung adenocarcinoma tumor cells according to claim 1 in the preparation of a kit for diagnosing lung adenocarcinoma.
CN202110127761.3A 2021-01-29 2021-01-29 Gene marker for identifying lung adenocarcinoma tumor cells and application thereof Active CN113151460B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110127761.3A CN113151460B (en) 2021-01-29 2021-01-29 Gene marker for identifying lung adenocarcinoma tumor cells and application thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110127761.3A CN113151460B (en) 2021-01-29 2021-01-29 Gene marker for identifying lung adenocarcinoma tumor cells and application thereof

Publications (2)

Publication Number Publication Date
CN113151460A CN113151460A (en) 2021-07-23
CN113151460B true CN113151460B (en) 2022-10-18

Family

ID=76879035

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110127761.3A Active CN113151460B (en) 2021-01-29 2021-01-29 Gene marker for identifying lung adenocarcinoma tumor cells and application thereof

Country Status (1)

Country Link
CN (1) CN113151460B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117858965A (en) * 2021-07-28 2024-04-09 北京沙东生物技术有限公司 Myeloma biomarker LGALS3BP and application thereof
CN113755602A (en) * 2021-10-20 2021-12-07 复旦大学附属中山医院 Gene marker for identifying lung adenocarcinoma tumor stem cells and application thereof

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103874770A (en) * 2011-08-08 2014-06-18 卡里斯生命科学卢森堡控股有限责任公司 Biomarker compositions and methods
WO2014151117A1 (en) * 2013-03-15 2014-09-25 The Board Of Trustees Of The Leland Stanford Junior University Identification and use of circulating nucleic acid tumor markers

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180106806A1 (en) * 2016-10-13 2018-04-19 Regents Of The University Of Minnesota Tumor Analytical Methods

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103874770A (en) * 2011-08-08 2014-06-18 卡里斯生命科学卢森堡控股有限责任公司 Biomarker compositions and methods
WO2014151117A1 (en) * 2013-03-15 2014-09-25 The Board Of Trustees Of The Leland Stanford Junior University Identification and use of circulating nucleic acid tumor markers

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
"Abnormal gene expression and gene fusion in lung adenocarcinoma with high-throughput RNA sequencing";Z-H Yang et al.;《Cancer Gene Therapy》;20140207;第21卷;第74-82页 *
"Identifcation of feature genes for smoking-related lung adenocarcinoma based on gene expression profle data";Ying liu et al.;《OncoTargets and Terapy》;20161231;第9卷;7937-7407页 *
"LPCAT1 promotes brain metastasis of lung adenocarcinoma by up-regulating PI3K/AKT/MYC pathway";Chunhua Wei et al.;《Journal of Experimental & Clinical Cancer Research》;20190221;第38卷;第1-16页 *
"Signatures of malignant cells and novel therapeutic targets revealed by single-cell sequencing in lung adenocarcinoma";Jiaqi Liang et al.;《Cancer Medicine》;20220731;第11卷;2255-2258 *

Also Published As

Publication number Publication date
CN113151460A (en) 2021-07-23

Similar Documents

Publication Publication Date Title
CN113151460B (en) Gene marker for identifying lung adenocarcinoma tumor cells and application thereof
CN111394456B (en) Early lung adenocarcinoma patient prognosis evaluation system and application thereof
CN111128385B (en) Prognosis early warning system for esophageal squamous carcinoma and application thereof
WO2009140542A1 (en) Biomarkers for the identification, monitoring, and treatment of head and neck cancer
US20130122499A1 (en) System and method of detecting local copy number variation in dna samples
CN113355419B (en) Breast cancer prognosis risk prediction marker composition and application
CN111653314B (en) Method for analyzing and identifying lymphatic infiltration
CN113215254B (en) Immune-clinical characteristic combined prediction model for evaluating lung adenocarcinoma prognosis
CN113025716A (en) Gene combination for human tumor classification and application thereof
CN113061655B (en) Gene labels for predicting breast cancer neoadjuvant chemotherapy sensitivity and application thereof
CN114203256B (en) MIBC typing and prognosis prediction model construction method based on microbial abundance
CN115410713A (en) Hepatocellular carcinoma prognosis risk prediction model construction based on immune-related gene
CN112831562A (en) Biomarker combination and kit for predicting recurrence risk of liver cancer patient after resection
CN116259360B (en) Identification and characteristic gene set of hyperproliferative tumor subgroup in lung adenocarcinoma and application
US20210215700A1 (en) Personalized treatment of pancreatic cancer
CN116189904A (en) Gene methylation diagnosis model of differentiated thyroid cancer and construction method thereof
CN114022472B (en) Tumor infiltrating lymphocyte analysis method, device and storage medium
CN110408706A (en) It is a kind of assess recurrent nasopharyngeal carcinoma biomarker and its application
CN116121392A (en) Methods and reagents for diagnosis of pancreatic cystic tumours
CN114350799A (en) Application of HTR2C in prognosis of low-grade glioma
CN109880905B (en) Genes for immunohistochemical typing of triple negative breast cancer and application thereof
Loi et al. Discriminating lymphomas and reactive lymphadenopathy in lymph node biopsies by gene expression profiling
CN112760383B (en) qRT-PCR internal reference gene applied to lung adenocarcinoma cell subgroup and application thereof
CN116403648B (en) Small cell lung cancer immune novel typing method established based on multidimensional analysis
CN117269495B (en) Marker protein combination for molecular typing of prostatic cancer proteins and application thereof

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant