CN117778566A - Marker for predicting thyroid cancer metastasis and application thereof - Google Patents
Marker for predicting thyroid cancer metastasis and application thereof Download PDFInfo
- Publication number
- CN117778566A CN117778566A CN202211145670.3A CN202211145670A CN117778566A CN 117778566 A CN117778566 A CN 117778566A CN 202211145670 A CN202211145670 A CN 202211145670A CN 117778566 A CN117778566 A CN 117778566A
- Authority
- CN
- China
- Prior art keywords
- thyroid cancer
- cancer metastasis
- risk
- marker
- group
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 208000024770 Thyroid neoplasm Diseases 0.000 title claims abstract description 139
- 201000002510 thyroid cancer Diseases 0.000 title claims abstract description 134
- 206010027476 Metastases Diseases 0.000 title claims abstract description 99
- 230000009401 metastasis Effects 0.000 title claims abstract description 99
- 239000003550 marker Substances 0.000 title claims abstract description 61
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 80
- 238000001514 detection method Methods 0.000 claims abstract description 32
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 25
- 108020004999 messenger RNA Proteins 0.000 claims abstract description 20
- 239000002299 complementary DNA Substances 0.000 claims abstract description 18
- 239000003153 chemical reaction reagent Substances 0.000 claims abstract description 15
- 230000014509 gene expression Effects 0.000 claims description 33
- 238000000034 method Methods 0.000 claims description 22
- 206010033701 Papillary thyroid cancer Diseases 0.000 claims description 13
- 208000030045 thyroid gland papillary carcinoma Diseases 0.000 claims description 13
- 102100028550 40S ribosomal protein S4, Y isoform 1 Human genes 0.000 claims description 12
- 102100022546 Bone morphogenetic protein 8A Human genes 0.000 claims description 12
- 102100038503 Cellular retinoic acid-binding protein 1 Human genes 0.000 claims description 12
- 102100030831 Fibrocystin-L Human genes 0.000 claims description 12
- 101000696103 Homo sapiens 40S ribosomal protein S4, Y isoform 1 Proteins 0.000 claims description 12
- 101000899364 Homo sapiens Bone morphogenetic protein 8A Proteins 0.000 claims description 12
- 101001099865 Homo sapiens Cellular retinoic acid-binding protein 1 Proteins 0.000 claims description 12
- 101000869896 Homo sapiens Death-inducer obliterator 1 Proteins 0.000 claims description 12
- 101000583237 Homo sapiens Fibrocystin-L Proteins 0.000 claims description 12
- 101001037139 Homo sapiens Immunoglobulin heavy variable 3-30 Proteins 0.000 claims description 12
- 101001027938 Homo sapiens Metallothionein-1G Proteins 0.000 claims description 12
- 101000969031 Homo sapiens Nuclear protein 1 Proteins 0.000 claims description 12
- 101001053773 Homo sapiens Type I iodothyronine deiodinase Proteins 0.000 claims description 12
- 102100040219 Immunoglobulin heavy variable 3-30 Human genes 0.000 claims description 12
- 102100037512 Metallothionein-1G Human genes 0.000 claims description 12
- 102100021133 Nuclear protein 1 Human genes 0.000 claims description 12
- 102100024063 Type I iodothyronine deiodinase Human genes 0.000 claims description 12
- 238000012216 screening Methods 0.000 claims description 10
- 238000004458 analytical method Methods 0.000 claims description 7
- 238000012360 testing method Methods 0.000 claims description 7
- 102100022712 Alpha-1-antitrypsin Human genes 0.000 claims description 4
- 102100035263 Anion exchange transporter Human genes 0.000 claims description 4
- 101000823116 Homo sapiens Alpha-1-antitrypsin Proteins 0.000 claims description 4
- 101000669513 Homo sapiens Metalloproteinase inhibitor 1 Proteins 0.000 claims description 4
- 102100039364 Metalloproteinase inhibitor 1 Human genes 0.000 claims description 4
- 108091006516 SLC26A7 Proteins 0.000 claims description 4
- 108091006576 SLC34A2 Proteins 0.000 claims description 4
- 102100038437 Sodium-dependent phosphate transport protein 2B Human genes 0.000 claims description 4
- 238000011065 in-situ storage Methods 0.000 claims description 4
- 238000002965 ELISA Methods 0.000 claims description 2
- 238000012545 processing Methods 0.000 claims description 2
- 230000004952 protein activity Effects 0.000 claims description 2
- 230000035945 sensitivity Effects 0.000 abstract description 4
- 230000001225 therapeutic effect Effects 0.000 abstract description 3
- 201000010099 disease Diseases 0.000 abstract description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 2
- 206010061818 Disease progression Diseases 0.000 abstract 1
- 230000005750 disease progression Effects 0.000 abstract 1
- 208000007433 Lymphatic Metastasis Diseases 0.000 description 27
- 210000004027 cell Anatomy 0.000 description 25
- 239000000523 sample Substances 0.000 description 19
- 238000012163 sequencing technique Methods 0.000 description 11
- 238000003745 diagnosis Methods 0.000 description 9
- 210000001519 tissue Anatomy 0.000 description 9
- 239000012634 fragment Substances 0.000 description 8
- 210000004881 tumor cell Anatomy 0.000 description 8
- 206010028980 Neoplasm Diseases 0.000 description 7
- 239000002773 nucleotide Substances 0.000 description 7
- 125000003729 nucleotide group Chemical group 0.000 description 7
- 208000033781 Thyroid carcinoma Diseases 0.000 description 6
- 208000013077 thyroid gland carcinoma Diseases 0.000 description 6
- 230000001394 metastastic effect Effects 0.000 description 5
- 206010061289 metastatic neoplasm Diseases 0.000 description 5
- 108091033319 polynucleotide Proteins 0.000 description 5
- 102000040430 polynucleotide Human genes 0.000 description 5
- 239000002157 polynucleotide Substances 0.000 description 5
- 208000013076 thyroid tumor Diseases 0.000 description 5
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 238000003556 assay Methods 0.000 description 4
- 230000027455 binding Effects 0.000 description 4
- 150000007523 nucleic acids Chemical class 0.000 description 4
- 238000007637 random forest analysis Methods 0.000 description 4
- 238000006467 substitution reaction Methods 0.000 description 4
- 239000000758 substrate Substances 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- 208000009458 Carcinoma in Situ Diseases 0.000 description 3
- 238000000018 DNA microarray Methods 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- 201000011510 cancer Diseases 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 108020004707 nucleic acids Proteins 0.000 description 3
- 102000039446 nucleic acids Human genes 0.000 description 3
- 238000010839 reverse transcription Methods 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 238000012795 verification Methods 0.000 description 3
- VYZAMTAEIAYCRO-UHFFFAOYSA-N Chromium Chemical compound [Cr] VYZAMTAEIAYCRO-UHFFFAOYSA-N 0.000 description 2
- 108020004414 DNA Proteins 0.000 description 2
- 102000012804 EPCAM Human genes 0.000 description 2
- 101150084967 EPCAM gene Proteins 0.000 description 2
- 108020005196 Mitochondrial DNA Proteins 0.000 description 2
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 2
- 238000011529 RT qPCR Methods 0.000 description 2
- 101150057140 TACSTD1 gene Proteins 0.000 description 2
- 102000011923 Thyrotropin Human genes 0.000 description 2
- 108010061174 Thyrotropin Proteins 0.000 description 2
- 239000011324 bead Substances 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 229910052804 chromium Inorganic materials 0.000 description 2
- 239000011651 chromium Substances 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 238000000684 flow cytometry Methods 0.000 description 2
- 229940127121 immunoconjugate Drugs 0.000 description 2
- 239000003446 ligand Substances 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 239000002751 oligonucleotide probe Substances 0.000 description 2
- 238000010827 pathological analysis Methods 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 239000003761 preservation solution Substances 0.000 description 2
- 238000003753 real-time PCR Methods 0.000 description 2
- 238000010188 recombinant method Methods 0.000 description 2
- 238000003757 reverse transcription PCR Methods 0.000 description 2
- 238000002560 therapeutic procedure Methods 0.000 description 2
- 238000002604 ultrasonography Methods 0.000 description 2
- 239000013598 vector Substances 0.000 description 2
- 101150025032 13 gene Proteins 0.000 description 1
- 101150090724 3 gene Proteins 0.000 description 1
- 208000004434 Calcinosis Diseases 0.000 description 1
- 108091006146 Channels Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108010044091 Globulins Proteins 0.000 description 1
- 102000006395 Globulins Human genes 0.000 description 1
- 108020005004 Guide RNA Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 238000010222 PCR analysis Methods 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- 238000001793 Wilcoxon signed-rank test Methods 0.000 description 1
- 125000003275 alpha amino acid group Chemical group 0.000 description 1
- 150000001413 amino acids Chemical class 0.000 description 1
- 230000003208 anti-thyroid effect Effects 0.000 description 1
- 229940043671 antithyroid preparations Drugs 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 239000012620 biological material Substances 0.000 description 1
- 239000000090 biomarker Substances 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 238000002790 cross-validation Methods 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000010195 expression analysis Methods 0.000 description 1
- 230000004547 gene signature Effects 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 238000003119 immunoblot Methods 0.000 description 1
- 238000009830 intercalation Methods 0.000 description 1
- 230000002687 intercalation Effects 0.000 description 1
- 210000001165 lymph node Anatomy 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 230000036210 malignancy Effects 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000002493 microarray Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 230000000683 nonmetastatic effect Effects 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 238000004393 prognosis Methods 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 239000002096 quantum dot Substances 0.000 description 1
- 230000000630 rising effect Effects 0.000 description 1
- 238000012502 risk assessment Methods 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 210000001685 thyroid gland Anatomy 0.000 description 1
- 238000012549 training Methods 0.000 description 1
- 208000024719 uterine cervix neoplasm Diseases 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
Landscapes
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The invention provides a marker combination for predicting thyroid cancer metastasis and application thereof. In particular, the present invention provides the use of a gene, mRNA, cDNA, protein or a detection reagent thereof for a risk marker for thyroid cancer metastasis assessment for preparing/establishing a diagnostic reagent or kit/device for judging the risk of thyroid cancer metastasis occurrence. Research shows that the thyroid cancer metastasis risk marker can be used as a marker for early judging thyroid cancer metastasis of thyroid cancer patients, has high sensitivity and specificity, can rapidly diagnose thyroid cancer metastasis at a relatively early disease progression stage, and provides powerful assistance for early therapeutic intervention of diseases.
Description
Technical Field
The invention relates to the field of medical diagnosis, in particular to a marker for predicting thyroid cancer metastasis and application thereof.
Background
The number of the global incidences of thyroid cancer is about 56.7 ten thousand cases each year, and the incidence rate is ninth to various cancer types. The global female incidence rate is about 10.2/10 ten thousand people, which is 3 times that of men. Since the 1980 s, the incidence of thyroid cancer has been continuously rising in many countries. This is mainly due to the improved detection and diagnosis of Papillary Thyroid Carcinoma (PTC) which increases the detection rate.
About 84% of thyroid cancer patients are PTC, which is the most common thyroid malignancy. The average incidence of PTC in the united states was about 6.66% and mortality based on morbidity was 0.20% between 1974 and 2013. Although PTC is considered an indolent tumor, some cancer cells metastasize to perithyroidic lymph nodes, mainly including central Lymph Node Metastasis (LNM) and cervical lateral LNM. Typically, the LNM occurs first in the central region and then in the outer regions. LNM is an important indicator of PTC prognosis, range and surgical mode, as well as an important risk factor for high recurrence rate and low survival rate in patients.
Several studies have been proposed for LNM risk assessment of PTC patients. Tumor size, tumor location, tumor expansion, microcalcifications and hashimoto are considered independent risk factors for LNM. Some people combine the risk factors with Thyroid Stimulating Hormone (TSH), antithyroid globulin antibody (TGAb) and other blood markers to construct an LNM predictive model of a PTC patient.
In recent years, radiology has been attracting attention in terms of accurate diagnosis. Radiohistology-based methods have also been proposed to predict the LNM of PTC patients by converting ultrasound images into data that can be mined. These methods extract the intensity, boundary, texture, wavelet, etc. features from the ultrasound image and establish a relationship between these high-throughput features and the LNM state.
In the above studies, the area under the curve (AUC) test set for individual subject operating characteristics (ROC) was approximately between 0.67 and 0.78, both based on clinical statistics and on imaging histology, since the integrity of the extracted image features was difficult to guarantee, and the predictive performance of LNM was not ideal.
Thus, there is an urgent need in the art to develop methods that enable more effective, early and accurate early screening and diagnosis of thyroid cancer lymphatic metastasis, enabling timely intervention therapy for thyroid cancer lymphatic metastasis.
Disclosure of Invention
It is an object of the present invention to provide a more efficient, early and accurate method for early screening and diagnosis of thyroid cancer metastasis.
In a first aspect of the invention, there is provided the use of a gene, mRNA, cDNA, protein, or a detection reagent thereof, for a marker for risk of thyroid cancer metastasis, for the preparation of a diagnostic reagent or kit for diagnosing whether a thyroid cancer patient has thyroid cancer metastasis;
wherein the thyroid cancer metastasis risk marker is selected from the group consisting of:
(A) Any one of the markers selected from A1 to A8, or a combination thereof: (A1) PKHD1L1; (A2) CRABP1; (A3) MT1G; (A4) NUPR1; (A5) BMP8A; (A6) DIO1; (A7) RPS4Y1; (A8) IGHV3-30.
In another preferred embodiment, the thyroid cancer metastasis risk marker further comprises any marker selected from the group consisting of: (B1) TPO; (B2) TIMP1; (B3) SLC26A7; (B4) SERPINA1; (B5) SLC34A2.
In another preferred embodiment, the markers A1 to A8 are selected from table a:
table A
In another preferred embodiment, the markers of group B are selected from table B:
table B
In another preferred embodiment, the gene, mRNA, cDNA, or protein of any one of the markers shown in table a and/or table B of the thyroid cancer metastasis risk markers is of human origin.
In another preferred embodiment, the thyroid cancer metastasis risk marker further comprises a combination of 8 markers selected from group a and 5 markers selected from group B.
In another preferred embodiment, the diagnostic reagent or kit is used to detect a thyroid cancer metastasis risk marker in thyroid carcinoma in situ tissue.
In another preferred embodiment, the thyroid cancer comprises papillary thyroid cancer.
In another preferred embodiment, the thyroid cancer metastasis comprises thyroid cancer lymphatic metastasis;
in another preferred embodiment, the thyroid cancer lymphatic metastasis comprises: central lymphatic metastasis of thyroid cancer, cervical lateral lymphatic metastasis of thyroid cancer, or a combination thereof.
In another preferred embodiment, the detection reagent comprises:
(a) A specific antibody, a specific binding molecule, directed against the thyroid cancer metastasis risk marker; and/or
(b) Primers or primer pairs, probes or chips (e.g., nucleic acid chips or protein chips) for specifically amplifying the mRNA or cDNA of the thyroid cancer metastasis risk marker.
In a second aspect of the present invention, there is provided a kit comprising a detection reagent for detecting a gene, mRNA, cDNA, protein, or a combination thereof, of a thyroid cancer metastasis risk marker in a test sample;
wherein the thyroid cancer metastasis risk marker is selected from the group consisting of:
(A) Any one of the markers selected from A1 to A8, or a combination thereof: (A1) PKHD1L1; (A2) CRABP1; (A3) MT1G; (A4) NUPR1; (A5) BMP8A; (A6) DIO1; (A7) RPS4Y1; (A8) IGHV3-30.
In another preferred embodiment, the detection reagent is coupled to or carries a detectable label.
In another preferred embodiment, the detectable label is selected from the group consisting of: chromophores, chemiluminescent groups, fluorophores, isotopes or enzymes.
In another preferred embodiment, the antibody is a monoclonal antibody or a polyclonal antibody.
In another preferred embodiment, the diagnostic reagent comprises an antibody, a primer, a probe, a sequencing library, a nucleic acid chip (e.g., a DNA chip), or a protein chip.
In another preferred embodiment, the nucleic acid chip comprises a substrate and specific oligonucleotide probes spotted on the substrate, wherein the specific oligonucleotide probes comprise probes specifically binding to polynucleotides (mRNA or cDNA) of any of the thyroid cancer metastasis risk markers.
In another preferred embodiment, the protein chip comprises a substrate and specific antibodies spotted on the substrate, wherein the specific antibodies comprise specific antibodies against the thyroid cancer metastasis risk markers.
In another preferred embodiment, the antibody is a monoclonal antibody or a polyclonal antibody.
In another preferred embodiment, the kit contains the gene, mRNA, cDNA and/or protein of the thyroid cancer metastasis risk marker as a control or quality control.
In another preferred embodiment, the kit further comprises a label or instructions stating that the kit is used to (a) determine the risk of developing thyroid cancer metastasis and/or (b) evaluate the effectiveness of a treatment for thyroid cancer metastasis.
In another preferred embodiment, the reagents comprise primers, probes, gRNA or a combination thereof, more preferably a primer pair or probe for PCR, qPCR, RT-PCR.
In another preferred embodiment, the detection of the thyroid cancer metastasis risk marker can be performed by the following method: sequencing, PCR, or a combination thereof.
In another preferred embodiment, the detection of the thyroid cancer metastasis risk marker is quantitatively detectable.
In a third aspect of the present invention, there is provided a detection method comprising the steps of:
(a) Providing a detection sample;
(b) Detecting the expression level of a thyroid cancer metastasis risk marker gene in the detection sample, and marking the expression level as C1; and
(c) Comparing the concentration of the marker for risk of thyroid cancer metastasis C1 with a control reference value C0, wherein the marker for risk of thyroid cancer metastasis is selected from the group consisting of:
(A) Any one of the markers selected from A1 to A8, or a combination thereof: (A1) PKHD1L1; (A2) CRABP1; (A3) MT1G; (A4) NUPR1; (A5) BMP8A; (A6) DIO1; (A7) RPS4Y1; (A8) IGHV3-30;
if the detection result of the thyroid cancer metastasis risk of the detected object meets the following conditions, prompting that the thyroid cancer metastasis risk of the object is high:
when the expression level of a certain marker in the test subjects of table a and table B is significantly different from the reference value or standard value, the risk of thyroid cancer metastasis in the thyroid cancer patient is high.
In a fourth aspect of the invention, there is provided an apparatus for early screening for thyroid cancer metastasis, the apparatus comprising:
(a) The input module is used for inputting the expression data of the characteristic genes of a certain thyroid cancer patient;
wherein the characteristic gene comprises a gene selected from the group consisting of:
(A) Any one of the markers selected from A1 to A8, or a combination thereof: (A1) PKHD1L1; (A2) CRABP1; (A3) MT1G; (A4) NUPR1; (A5) BMP8A; (A6) DIO1; (A7) RPS4Y1; (A8) IGHV3-30;
(b) The processing module is used for analyzing and calculating the input characteristic genes so as to obtain risk judgment; through the analysis of differential expression conditions, the thyroid cancer metastasis risk of the thyroid cancer patient is indicated to be high when the obvious differential expression conditions appear; when the differential expression condition is not obvious, the thyroid cancer metastasis risk of the thyroid cancer patient is low; and
(c) And the output module is used for outputting the auxiliary screening result.
In another preferred embodiment, the device further comprises a detection module for detecting mRNA levels, protein levels, or protein activity of the risk marker.
In another preferred embodiment, the detection module is selected from the group consisting of: ELISA analyzers, PCR detectors, sequencers, or combinations thereof.
In a fifth aspect of the invention, there is provided a method of detecting the combined expression level of a thyroid cancer metastasis risk marker combination, comprising the steps of:
(a) Providing a detection sample;
(b) Extracting total RNA of the sample;
(c) Performing reverse transcription on the product RNA obtained in step (b);
(d) Performing fluorescent quantitative PCR on the reverse transcription product obtained in the step (c), thereby obtaining the expression level of the risk marker gene;
wherein the risk marker is selected from the group consisting of:
(A) Any one of the markers selected from A1 to A8, or a combination thereof: (A1) PKHD1L1; (A2) CRABP1; (A3) MT1G; (A4) NUPR1; (A5) BMP8A; (A6) DIO1; (A7) RPS4Y1; (A8) IGHV3-30.
In another preferred embodiment, the method is a non-diagnostic, non-therapeutic method.
In another preferred embodiment, the method is an in vitro method.
It is understood that within the scope of the present invention, the above-described technical features of the present invention and technical features specifically described below (e.g., in the examples) may be combined with each other to constitute new or preferred technical solutions. And are limited to a space, and are not described in detail herein.
Drawings
FIG. 1 shows two-dimensional intercalation of thyroid tumor cells (principal components 1 and 2); cells are based on whether they are derived from patients with central/cervical tumor metastasis (positive) or not (negative).
FIG. 2 shows AUROC values of a random forest model using different numbers of genes as inputs; genes are selected based on the mutual information between gene expression and class tags.
Fig. 3 shows AUROC curves of the final random forest model using 13 genes as inputs.
Fig. 4 shows the expression levels of the final 13 gene signature of tumor cells from the positive/negative group.
Detailed Description
Through extensive and intensive studies, the present inventors have for the first time opened a method and apparatus for efficiently and accurately screening for thyroid cancer metastasis in an early stage based on a characteristic gene. Specifically, the inventor constructs a simple, accurate and efficient early screening system for thyroid cancer metastasis by constructing single cell transcriptome maps of central metastatic thyroid cancer patients, lateral metastatic thyroid cancer patients and in-situ cancer tissues without any metastasis, and carrying out weight sequencing on genes according to mutual information scores of the genes, so as to unexpectedly screen out a plurality of characteristic genes closely related to thyroid cancer metastasis. The method and the screening system can effectively distinguish the patients with the thyroid cancer metastasis from the patients with the thyroid cancer metastasis, and facilitate corresponding therapeutic intervention on the patients with the thyroid cancer metastasis as early as possible. The present invention has been completed on the basis of this finding.
The invention uses the tumor cell expression data of in-situ thyroid cancer as the input for constructing a prediction model, and uses at least 13 genes as the input, and finally the model has 92.9 percent of accuracy for classifying the metastasis and non-metastasis of the patients just admitted to the hospital, which shows that the model has strong performance. Implementation of this model may help clinicians make timely diagnoses of thyroid cancer metastasis. Therapeutic intervention may be performed early and prevent the progression of thyroid cancer metastasis. The significance of the work is to have the opportunity to solve the classification screening of early thyroid cancer metastasis, and has great clinical significance for the management of thyroid cancer metastasis of patients and hospitals.
Terminology
The term "sample" or "specimen" as used herein refers to a material that is specifically associated with a subject from which particular information about the subject can be determined, calculated, or inferred. The sample may be composed in whole or in part of biological material from the subject.
As used herein, the term "expression" includes the production of mRNA from a gene or gene portion, and includes the production of a protein encoded by RNA or gene portion, and also includes the presence of a detection substance associated with expression. For example, cDNA, binding of a binding ligand (e.g., an antibody) to a gene or other oligonucleotide, protein or protein fragment, and chromogenic portions of the binding ligand are included within the term "expressed". Thus, an increase in half-pel density on immunoblots, such as Western blots, is also within the term "expression" based on biological molecules.
As used herein, the term "reference value" or "control reference value" refers to a value that is statistically relevant to a particular result when compared to the result of an analysis. In a preferred embodiment, the reference value is determined based on the expression of mRNA and/or protein in comparison to a thyroid cancer lymphatic metastasis risk marker and statistically analyzed. Some of these studies are shown in the examples section herein. However, the studies from the literature and the user experience of the methods disclosed herein can also be used to produce or adjust the reference value. Reference values may also be determined by considering conditions and results that are particularly relevant to the patient's population, medical history, genetics, age, and other factors.
Thyroid cancer lymphatic metastasis risk marker
As used herein, the term "thyroid cancer metastasis risk marker of the present invention" refers to one or more markers shown in table a and/or table B.
In the present invention, the terms "a thyroid cancer metastasis risk marker protein of the present invention", "a thyroid cancer lymphatic metastasis risk marker protein of the present invention", "a polypeptide of the present invention", or "a marker shown in table a and/or table B" are used interchangeably, and refer to any one or more of the thyroid cancer lymphatic metastasis risk markers of the present invention.
In the present invention, the terms "thyroid cancer metastasis risk marker gene", "thyroid cancer lymphatic metastasis risk marker gene", "polynucleotide of thyroid cancer lymphatic metastasis risk marker" are used interchangeably, and refer to the nucleotide sequence of any one of the thyroid cancer metastasis risk markers shown in table a and/or table B.
It is understood that substitution of nucleotides in the codon is acceptable when encoding the same amino acid. It is further understood that nucleotide substitutions are also acceptable when conservative amino acid substitutions are made by the nucleotide substitutions.
In case that information of a thyroid cancer metastasis risk marker is obtained, a nucleic acid sequence encoding it can be constructed therefrom, and a specific probe can be designed based on the nucleotide sequence. The full-length nucleotide sequence or a fragment thereof can be obtained by PCR amplification, recombinant methods or artificial synthesis. For the PCR amplification method, primers can be designed based on the nucleotide sequence of the thyroid cancer metastasis risk marker disclosed in the present invention, particularly, the open reading frame sequence, and amplified to obtain the relevant sequence using a commercially available cDNA library or a cDNA library prepared according to a conventional method known to those skilled in the art as a template. When the sequence is longer, it is often necessary to perform two or more PCR amplifications, and then splice the amplified fragments together in the correct order.
Once the relevant sequences are obtained, recombinant methods can be used to obtain the relevant sequences in large quantities. This is usually done by cloning it into a vector, transferring it into a cell, and isolating the relevant sequence from the propagated host cell by conventional methods.
Furthermore, the sequences concerned, in particular fragments of short length, can also be synthesized by artificial synthesis. In general, fragments of very long sequences are obtained by first synthesizing a plurality of small fragments and then ligating them.
At present, it is entirely possible to obtain the DNA sequences encoding the proteins of the invention (or fragments, derivatives thereof) by chemical synthesis. The DNA sequence may then be introduced into a variety of existing DNA molecules (e.g., vectors) and cells known in the art.
The polynucleotide sequences of the present invention can be used to express or produce recombinant thyroid cancer metastasis risk markers by conventional recombinant DNA techniques.
Detection method
Based on differential expression in-situ cancer cells of thyroid cancer metastasis risk markers, the invention also provides a corresponding method for judging thyroid cancer metastasis risk.
The present invention relates to diagnostic assays for the quantitative and positional detection of protein levels or mRNA levels of thyroid cancer metastasis risk markers. Such tests are well known in the art. The level of human thyroid cancer metastasis risk marker protein or mRNA level detected in the assay can be used to determine (including aid in determining) whether there is a risk of thyroid cancer metastasis.
A preferred method is to perform a quantitative PCR/qPCR/RT-PCR assay on mRNA or cDNA.
One preferred method is to quantitatively detect mRNA or cDNA, sequencing.
Polynucleotides of a thyroid cancer lymphatic metastasis risk marker are useful for diagnosis of thyroid cancer lymphatic metastasis risk. A part or all of the polynucleotides of the present invention can be immobilized as probes on a microarray or DNA chip for differential expression analysis and gene diagnosis of genes in analysis.
In addition, the invention can also detect at the protein level. For example, antibodies to a thyroid cancer lymphatic metastasis risk marker may be immobilized on a protein chip for detection of thyroid cancer lymphatic metastasis risk proteins in a sample.
Detection kit
Based on the correlation between the risk of thyroid cancer metastasis and the risk of thyroid cancer metastasis, the risk of thyroid cancer metastasis marker can be used as a judgment marker for the risk of thyroid cancer metastasis.
The invention also provides a kit for judging the risk of thyroid cancer metastasis, which comprises a detection reagent for detecting the gene, mRNA, cDNA, protein or the combination thereof of the thyroid cancer metastasis risk marker. Preferably, the kit contains an antibody or immunoconjugate of the invention against a risk marker for thyroid cancer metastasis, or an active fragment thereof; or a primer or primer pair, probe or chip containing mRNA or cDNA specifically amplifying a thyroid cancer metastasis risk marker.
In another preferred embodiment, the kit further comprises a label or instructions.
The main advantages of the invention include:
(1) Compared with the existing thyroid cancer metastasis detection method, the marker combination established by the invention has higher specificity and more accurate detection result.
(2) By adopting the prediction model provided by the invention, the multi-part puncture of a patient with high risk of thyroid cancer metastasis is avoided, the pain of the patient is greatly reduced, and medical resources are saved.
(3) The prediction model provided by the invention can be used for guiding the selection of the clinical operation scheme more accurately, so that the treatment scheme is more accurate.
The invention will be further illustrated with reference to specific examples. It is to be understood that these examples are illustrative of the present invention and are not intended to limit the scope of the present invention. The experimental procedure, which does not address the specific conditions in the examples below, is generally followed by routine conditions, such as, for example, sambrook et al, molecular cloning: conditions described in the laboratory Manual (New York: cold Spring Harbor Laboratory Press, 1989) or as recommended by the manufacturer. Percentages and parts are weight percentages and parts unless otherwise indicated.
Materials and methods
The invention uses EPCAM and TG double gene expression as pointer gene to separate all tumor cells. According to the gene expression profile of tumor cells, in order to construct a predictive model, first, a selection of highly variable genes is adopted to remove less informative genes and increase computational efficiency. We then use the mutual information score as a metric to select the largest duty cycle gene for constructing the random forest classifier. AUROC is used to evaluate model performance to finalize model selection, while other metrics are also calculated, including recall scores, precision scores, etc. All models were evaluated using 5-fold cross-validation.
Example 1 sequencing sample preparation
The samples were selected from six central metastatic patients of thyroid cancer, eight lateral cervical metastatic patients, and four in situ cancer patients who did not find any metastasis pathologically, and tissue samples were obtained by:
1. after the pathological diagnosis of papillary thyroid carcinoma, the central part of the thyroid tumor in operation is cut after the patient passes through the ethics of a hospital, 5mm x 5mm tissues with the size are cut, the tissue is preserved in tissue preservation solution and is preserved in liquid nitrogen, and then flow cytometry and single cell sequencing analysis are carried out.
2. After the pathological diagnosis of papillary thyroid cancer patients passes through the ethics of a hospital, the central part of a thyroid tumor in operation is cut off, so that an in-vitro thyroid tumor tissue which is penetrated by ultrasonic guidance is obtained, and the in-vitro thyroid tumor tissue is stored in tissue preservation solution and in liquid nitrogen, and then flow cytometry and single cell sequencing analysis are carried out.
EXAMPLE 2 Single cell sequencing
Single cell sequencing and V (D) J library A10X Genomics Chromium Controller Instrument, chromium Single Cell 5' library, gel bead kit and V (D) J enrichment kit (10X Genomics, pleasanton, calif.) were used. The cell suspension was concentrated to 1,000 cells/μl and about 10,000 of them were loaded into each channel to generate single cell gel beads in emulsion (GEM). The output is mRNA barcodes of 6,000 cells per sample. After the reverse transcription step, the GEM was destroyed, the barcode cDNA was purified, amplified, and used to construct 5' gene expression and TCR and BCR enrichment libraries. For 5' library construction, the amplified barcode cDNA was fragmented, A-tailed, ligated to adaptors and index PCR amplified. For the V (D) J library, human T cell and B cell V (D) J sequences were enriched from amplified cdnas, fragmented, a tail, adaptor ligated and index PCR amplified.
The final library was quantified using a Qubit high sensitivity DNA assay (No. q33231; thermo Fisher Scientific, waltham, MA, USA). Library size distribution was determined in a Bioanalyzer 2200 (Agilent Technologies, santa Clara, CA, USA) using a high sensitivity DNA chip. All libraries were sequenced on an Illumina sequencer (Illumina, san Diego, CA, USA) using a 150bp paired-end run.
Example 3 Gene selection and clustering
After obtaining sequencing raw data, we aligned and barcode demultiplexed single Cell sequencing data using the Cell Ranger v.3.0.2vdj procedure (10X Genomics). The expression matrix data was then analyzed under Scanpy computational framework. The data were filtered according to QC standards. Cells with <500 detected genes and <5 cells/dataset detected genes were removed. Cells with >10% of total expression levels of mitochondrial gene expression were also excluded. The double peak was avoided by removing cells with the first 5% total transcripts (UMI). The Scrublets software is also used to remove the doublet.
Genes were then selected for down-scaling and clustering based on their variability. All data were pooled, the first 2,000 variable genes were selected, and total UMI counts and mitochondrial gene expression ratios per cell were regressed to eliminate the impact of these factors on clustering. The BBKNN method is used to perform batch correction using a donor as a batch key. Leiden clustering was performed in Scanpy, with a resolution set to 1.
Example 4 modeling
The model construction of the invention adopts a basic result based on a machine learning algorithm, and thyroid carcinoma cells in situ are selected as the input of the model.
(1) Feasibility of finding disease predictors: all tumor cells were sorted from single cell transcriptome data using EPCAM and TG dual gene expression as indicator genes and PCR analysis was performed (fig. 1); the results show that these tumor cells show PCR spatial differences, proving feasibility.
(2) The genes are ordered according to their mutual information score, limiting the number of genes to a relatively small range.
(3) Random forest classifiers are constructed based on different numbers of features (genes) and performance is evaluated, with the observation that at feature basis factors of 13, the performance evaluation of the model is inflection points (fig. 2).
(4) The first 13 genes with the highest mutual information scores were selected to obtain a classifier model under which the area of AUROC reached 0.929 (fig. 3).
(5) The expression of these 13 genes was plotted in tumor cells from the metastatic/non-metastatic group, and was found to be all different based on the Wilcoxon test (fig. 4). The 10 genes are 8 genes selected from table a and 5 genes selected from table B, respectively: (A1) PKHD1L1; (A2) CRABP1; (A3) MT1G; (A4) NUPR1; (A5) BMP8A; (A6) DIO1; (A7) RPS4Y1; (A8) IGHV3-30; (B1) TPO; (B2) TIMP1; (B3) SLC26A7; (B4) SERPINA1; (B5) SLC34A2. Wherein the weight of each gene is shown in the tables A and B.
Example 5 model verification
Model inputs were 8 genes selected from table a and 5 genes selected from table B: (A1) PKHD1L1; (A2) CRABP1; (A3) MT1G; (A4) NUPR1; (A5) BMP8A; (A6) DIO1; (A7) RPS4Y1; (A8) IGHV3-30; (B1) TPO; (B2) TIMP1; (B3) SLC26A7; (B4) SERPINA1; (B5) SLC34A2.
The invention uses the established model to carry out 100 verification tests on different sample sets (80% test set and 20% training set) sorted by 4:1, and the AUROC value of each time is above 92%, which proves that the model has better robustness and supports the reliability of conclusion (the experimental result is shown in Table C).
TABLE C verification of result data
Discussion of the invention
In the present invention, a determination method is provided for analyzing the risk of metastasis of thyroid cancer, and an analysis apparatus for predicting the occurrence risk of metastasis of thyroid cancer is developed.
In the present invention, the present inventors have unexpectedly found a novel thyroid cancer metastasis risk marker for the first time, which comprises (A1) PKHD1L1 in group (a); (A2) CRABP1; (A3) MT1G; (A4) NUPR1; (A5) BMP8A; (A6) DIO1; (A7) RPS4Y1; (A8) IGHV3-30.
In the invention, by selecting the markers selected from the group (A) and the group (B) in thyroid carcinoma cells as biomarkers to evaluate the risk of thyroid carcinoma metastasis, objective diagnosis of thyroid carcinoma metastasis is realized, and the diagnosis of thyroid carcinoma metastasis has higher sensitivity and specificity.
All documents mentioned in this application are incorporated by reference as if each were individually incorporated by reference. Further, it will be appreciated that various changes and modifications may be made by those skilled in the art after reading the above teachings, and such equivalents are intended to fall within the scope of the claims appended hereto.
Claims (10)
1. The use of a gene, mRNA, cDNA, protein, or a detection reagent thereof for a risk marker for thyroid cancer metastasis, for preparing a diagnostic reagent or kit for diagnosing whether a thyroid cancer patient has thyroid cancer metastasis;
wherein the thyroid cancer metastasis risk marker is selected from the group consisting of:
(A) Any one of the markers selected from A1 to A8, or a combination thereof: (A1) PKHD1L1; (A2) CRABP1; (A3) MT1G; (A4) NUPR1; (A5) BMP8A; (A6) DIO1; (A7) RPS4Y1; (A8) IGHV3-30.
2. The use of claim 1, wherein the thyroid cancer metastasis risk marker further comprises any marker selected from the group B: (B1) TPO; (B2) TIMP1; (B3) SLC26A7; (B4) SERPINA1; (B5) SLC34A2.
3. The use of claim 1, wherein the thyroid cancer metastasis risk marker further comprises a combination of 8 markers selected from group a and 5 markers selected from group B.
4. The use according to claim 1, wherein the diagnostic reagent or kit is for detecting a thyroid cancer metastasis risk marker in thyroid cancer tissue in situ.
5. The use of claim 1, wherein the thyroid cancer comprises papillary thyroid cancer.
6. A kit comprising a detection reagent for detecting a gene, mRNA, cDNA, protein, or a combination thereof for a thyroid cancer metastasis risk marker in a sample to be tested;
wherein the thyroid cancer metastasis risk marker is selected from the group consisting of:
(A) Any one of the markers selected from A1 to A8, or a combination thereof: (A1) PKHD1L1; (A2) CRABP1; (A3) MT1G; (A4) NUPR1; (A5) BMP8A; (A6) DIO1; (A7) RPS4Y1; (A8) IGHV3-30.
7. A method of detection comprising the steps of:
(a) Providing a detection sample;
(b) Detecting the expression level of a thyroid cancer metastasis risk marker gene in the detection sample, and marking the expression level as C1; and
(c) Comparing the concentration of the marker for risk of thyroid cancer metastasis C1 with a control reference value C0, wherein the marker for risk of thyroid cancer metastasis is selected from the group consisting of:
(A) Any one of the markers selected from A1 to A8, or a combination thereof: (A1) PKHD1L1; (A2) CRABP1; (A3) MT1G; (A4) NUPR1; (A5) BMP8A; (A6) DIO1; (A7) RPS4Y1; (A8) IGHV3-30;
if the detection result of the thyroid cancer metastasis risk of the detected object meets the following conditions, prompting that the thyroid cancer metastasis risk of the object is high:
when the expression level of a certain marker in the test subjects of table a and table B is significantly different from the reference value or standard value, the risk of thyroid cancer metastasis in the thyroid cancer patient is high.
8. An apparatus for early screening for thyroid cancer metastasis, the apparatus comprising:
(a) The input module is used for inputting the expression data of the characteristic genes of a certain thyroid cancer patient;
wherein the characteristic gene comprises a gene selected from the group consisting of:
(A) Any one of the markers selected from A1 to A8, or a combination thereof: (A1) PKHD1L1; (A2) CRABP1; (A3) MT1G; (A4) NUPR1; (A5) BMP8A; (A6) DIO1; (A7) RPS4Y1; (A8) IGHV3-30;
(b) The processing module is used for analyzing and calculating the input characteristic genes so as to obtain risk judgment; through the analysis of differential expression conditions, the thyroid cancer metastasis risk of the thyroid cancer patient is indicated to be high when the obvious differential expression conditions appear; when the differential expression condition is not obvious, the thyroid cancer metastasis risk of the thyroid cancer patient is low; and
(c) And the output module is used for outputting the auxiliary screening result.
9. The apparatus of claim 8, further comprising a detection module for detecting mRNA levels, protein levels, or protein activity of the risk markers.
10. The apparatus of claim 9, wherein the detection module is selected from the group consisting of: ELISA analyzers, PCR detectors, sequencers, or combinations thereof.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211145670.3A CN117778566A (en) | 2022-09-20 | 2022-09-20 | Marker for predicting thyroid cancer metastasis and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211145670.3A CN117778566A (en) | 2022-09-20 | 2022-09-20 | Marker for predicting thyroid cancer metastasis and application thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117778566A true CN117778566A (en) | 2024-03-29 |
Family
ID=90378582
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202211145670.3A Pending CN117778566A (en) | 2022-09-20 | 2022-09-20 | Marker for predicting thyroid cancer metastasis and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117778566A (en) |
-
2022
- 2022-09-20 CN CN202211145670.3A patent/CN117778566A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6666852B2 (en) | Gene expression panel for prognosis of prostate cancer recurrence | |
US10670610B2 (en) | Biomarker test for prediction or early detection of preeclampsia and/or HELLP syndrome | |
US20200219587A1 (en) | Systems and methods for using fragment lengths as a predictor of cancer | |
US11929148B2 (en) | Systems and methods for enriching for cancer-derived fragments using fragment size | |
WO2023143326A1 (en) | Biomarker for predicting risk of pancreatic cancer, method, and diagnostic device | |
CN107208131A (en) | Method for lung cancer parting | |
CN113201590B (en) | LncRNA for evaluating early recurrence risk of hepatocellular carcinoma, evaluation method and device | |
WO2015079060A2 (en) | Mirnas as advanced diagnostic tool in patients with cardiovascular disease, in particular acute myocardial infarction (ami) | |
CN117265092A (en) | Marker combination for predicting severe acute pancreatitis and application thereof | |
CN114164273B (en) | Squamous carcinoma prognosis marker, establishment method of prognosis risk assessment model and application of prognosis risk assessment model | |
CN112852969B (en) | Epigenetically modified lncRNA as tumor diagnosis or tumor progression prediction marker | |
CN111164701A (en) | Fixed-point noise model for target sequencing | |
CN117778566A (en) | Marker for predicting thyroid cancer metastasis and application thereof | |
JP2024527461A (en) | Artificial intelligence-based early cancer diagnosis method using cell-free DNA distribution of tissue-specific regulatory regions | |
TWI725248B (en) | Primary site of metastatic cancer identification method and system thereof | |
EP3339861A1 (en) | Biomarker test for prediction or early detection of preeclampsia and/or hellp syndrome | |
JP2015522258A (en) | Biomarker testing for the prediction or early detection of pre-eclampsia and / or HELLP syndrome | |
US20220042108A1 (en) | Systems and methods of assessing breast cancer | |
CN113151469B (en) | Tumor classification marker combination and application thereof | |
WO2023102786A1 (en) | Application of gene marker in prediction of premature birth risk of pregnant woman | |
TWI676688B (en) | The cell type identification method and system thereof | |
CN117233389A (en) | Marker for rapidly identifying CEBPA double mutation in acute myeloid leukemia | |
CN113584158A (en) | Use of biomarkers for diagnosing diabetic nephropathy | |
CN116246710A (en) | Colorectal cancer prediction model based on cluster molecules and application | |
CN113322326A (en) | Lung cancer prognosis marker, prognosis model and related application |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication |