CN117007810B - Biomarker for predicting progression of intravenous smooth muscle tumor and application thereof - Google Patents
Biomarker for predicting progression of intravenous smooth muscle tumor and application thereof Download PDFInfo
- Publication number
- CN117007810B CN117007810B CN202310690937.5A CN202310690937A CN117007810B CN 117007810 B CN117007810 B CN 117007810B CN 202310690937 A CN202310690937 A CN 202310690937A CN 117007810 B CN117007810 B CN 117007810B
- Authority
- CN
- China
- Prior art keywords
- fragment
- intravenous
- ighj4
- proteins
- ighv
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000001990 intravenous administration Methods 0.000 title claims abstract description 24
- 239000000090 biomarker Substances 0.000 title claims abstract description 17
- 208000010485 smooth muscle tumor Diseases 0.000 title description 2
- 239000012634 fragment Substances 0.000 claims abstract description 52
- 238000000034 method Methods 0.000 claims abstract description 17
- 206010061692 Benign muscle neoplasm Diseases 0.000 claims abstract description 11
- 201000004458 Myoma Diseases 0.000 claims abstract description 11
- 210000002966 serum Anatomy 0.000 claims abstract description 11
- 238000001514 detection method Methods 0.000 claims abstract description 8
- 108090000623 proteins and genes Proteins 0.000 claims description 103
- 102000004169 proteins and genes Human genes 0.000 claims description 96
- 235000018102 proteins Nutrition 0.000 claims description 81
- 230000014509 gene expression Effects 0.000 claims description 23
- 201000010260 leiomyoma Diseases 0.000 claims description 22
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 16
- 206010046798 Uterine leiomyoma Diseases 0.000 claims description 13
- 239000000523 sample Substances 0.000 claims description 10
- 239000007853 buffer solution Substances 0.000 claims description 9
- 238000012216 screening Methods 0.000 claims description 7
- 230000004186 co-expression Effects 0.000 claims description 6
- 238000011033 desalting Methods 0.000 claims description 6
- 229940074200 diamode Drugs 0.000 claims description 6
- PGYPOBZJRVSMDS-UHFFFAOYSA-N loperamide hydrochloride Chemical compound Cl.C=1C=CC=CC=1C(C=1C=CC=CC=1)(C(=O)N(C)C)CCN(CC1)CCC1(O)C1=CC=C(Cl)C=C1 PGYPOBZJRVSMDS-UHFFFAOYSA-N 0.000 claims description 6
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 6
- 238000000108 ultra-filtration Methods 0.000 claims description 6
- 101710132601 Capsid protein Proteins 0.000 claims description 5
- 239000003153 chemical reaction reagent Substances 0.000 claims description 5
- 238000012417 linear regression Methods 0.000 claims description 5
- 229920001184 polypeptide Polymers 0.000 claims description 5
- 230000008569 process Effects 0.000 claims description 5
- 238000002331 protein detection Methods 0.000 claims description 5
- 230000001419 dependent effect Effects 0.000 claims description 4
- 230000007717 exclusion Effects 0.000 claims description 4
- 238000001294 liquid chromatography-tandem mass spectrometry Methods 0.000 claims description 4
- 238000001819 mass spectrum Methods 0.000 claims description 4
- 238000003556 assay Methods 0.000 claims description 3
- 238000009835 boiling Methods 0.000 claims description 3
- 239000000872 buffer Substances 0.000 claims description 3
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 claims description 3
- 239000003599 detergent Substances 0.000 claims description 3
- 238000002347 injection Methods 0.000 claims description 3
- 239000007924 injection Substances 0.000 claims description 3
- PGLTVOMIXTUURA-UHFFFAOYSA-N iodoacetamide Chemical compound NC(=O)CI PGLTVOMIXTUURA-UHFFFAOYSA-N 0.000 claims description 3
- 230000014759 maintenance of location Effects 0.000 claims description 3
- 238000003012 network analysis Methods 0.000 claims description 3
- 238000002360 preparation method Methods 0.000 claims description 3
- 238000000611 regression analysis Methods 0.000 claims description 3
- 230000002441 reversible effect Effects 0.000 claims description 3
- 238000000926 separation method Methods 0.000 claims description 3
- 239000000725 suspension Substances 0.000 claims description 3
- 238000004885 tandem mass spectrometry Methods 0.000 claims description 3
- 238000005119 centrifugation Methods 0.000 claims description 2
- 230000007423 decrease Effects 0.000 claims description 2
- 239000013062 quality control Sample Substances 0.000 claims description 2
- 235000012538 ammonium bicarbonate Nutrition 0.000 claims 1
- 238000002731 protein assay Methods 0.000 claims 1
- 238000004458 analytical method Methods 0.000 abstract description 17
- 238000003745 diagnosis Methods 0.000 abstract description 6
- 206010028980 Neoplasm Diseases 0.000 abstract description 4
- 210000004369 blood Anatomy 0.000 abstract description 3
- 239000008280 blood Substances 0.000 abstract description 3
- 238000005516 engineering process Methods 0.000 abstract description 3
- 238000013399 early diagnosis Methods 0.000 abstract description 2
- 230000002349 favourable effect Effects 0.000 abstract description 2
- 238000012544 monitoring process Methods 0.000 abstract description 2
- 230000003902 lesion Effects 0.000 description 11
- 238000001356 surgical procedure Methods 0.000 description 9
- 210000003462 vein Anatomy 0.000 description 8
- 230000003187 abdominal effect Effects 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 230000001105 regulatory effect Effects 0.000 description 5
- 210000004204 blood vessel Anatomy 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 210000003111 iliac vein Anatomy 0.000 description 4
- 230000002980 postoperative effect Effects 0.000 description 4
- 238000004393 prognosis Methods 0.000 description 4
- 230000004850 protein–protein interaction Effects 0.000 description 4
- 230000019491 signal transduction Effects 0.000 description 4
- 238000002604 ultrasonography Methods 0.000 description 4
- 210000004291 uterus Anatomy 0.000 description 4
- 210000001631 vena cava inferior Anatomy 0.000 description 4
- 108060003951 Immunoglobulin Proteins 0.000 description 3
- 230000002596 correlated effect Effects 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- 238000009826 distribution Methods 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 102000018358 immunoglobulin Human genes 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 230000037361 pathway Effects 0.000 description 3
- 210000004197 pelvis Anatomy 0.000 description 3
- 108020001580 protein domains Proteins 0.000 description 3
- 230000000306 recurrent effect Effects 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 238000001228 spectrum Methods 0.000 description 3
- 208000010579 uterine corpus leiomyoma Diseases 0.000 description 3
- 201000007954 uterine fibroid Diseases 0.000 description 3
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 208000008035 Back Pain Diseases 0.000 description 2
- 208000023514 Barrett esophagus Diseases 0.000 description 2
- 208000005623 Carcinogenesis Diseases 0.000 description 2
- 206010061818 Disease progression Diseases 0.000 description 2
- 102000010638 Kinesin Human genes 0.000 description 2
- 108010063296 Kinesin Proteins 0.000 description 2
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 2
- 230000027455 binding Effects 0.000 description 2
- 230000031018 biological processes and functions Effects 0.000 description 2
- 230000036952 cancer formation Effects 0.000 description 2
- 231100000504 carcinogenesis Toxicity 0.000 description 2
- 238000010219 correlation analysis Methods 0.000 description 2
- 230000000875 corresponding effect Effects 0.000 description 2
- 238000002790 cross-validation Methods 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 230000005750 disease progression Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000010201 enrichment analysis Methods 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 238000003384 imaging method Methods 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 150000002500 ions Chemical class 0.000 description 2
- 238000010801 machine learning Methods 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 239000001301 oxygen Substances 0.000 description 2
- 229910052760 oxygen Inorganic materials 0.000 description 2
- 230000001681 protective effect Effects 0.000 description 2
- 230000004960 subcellular localization Effects 0.000 description 2
- 208000024891 symptom Diseases 0.000 description 2
- 206010042772 syncope Diseases 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- 208000004998 Abdominal Pain Diseases 0.000 description 1
- 206010000077 Abdominal mass Diseases 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 102000006306 Antigen Receptors Human genes 0.000 description 1
- 108010083359 Antigen Receptors Proteins 0.000 description 1
- 108091008875 B cell receptors Proteins 0.000 description 1
- 102000004506 Blood Proteins Human genes 0.000 description 1
- 108010017384 Blood Proteins Proteins 0.000 description 1
- 208000000059 Dyspnea Diseases 0.000 description 1
- 206010013975 Dyspnoeas Diseases 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 102000006395 Globulins Human genes 0.000 description 1
- 108010044091 Globulins Proteins 0.000 description 1
- 102000018310 Immunoglobulin C1-set Human genes 0.000 description 1
- 108050007427 Immunoglobulin C1-set Proteins 0.000 description 1
- 102000006496 Immunoglobulin Heavy Chains Human genes 0.000 description 1
- 108010019476 Immunoglobulin Heavy Chains Proteins 0.000 description 1
- 102000013463 Immunoglobulin Light Chains Human genes 0.000 description 1
- 108010065825 Immunoglobulin Light Chains Proteins 0.000 description 1
- 206010071677 Intravenous leiomyomatosis Diseases 0.000 description 1
- 208000008930 Low Back Pain Diseases 0.000 description 1
- 206010030124 Oedema peripheral Diseases 0.000 description 1
- 101800001646 Protein n Proteins 0.000 description 1
- 108010026552 Proteome Proteins 0.000 description 1
- 208000010378 Pulmonary Embolism Diseases 0.000 description 1
- 101710203837 Replication-associated protein Proteins 0.000 description 1
- 238000011869 Shapiro-Wilk test Methods 0.000 description 1
- 206010042434 Sudden death Diseases 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 1
- 125000000218 acetic acid group Chemical group C(C)(=O)* 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 238000002583 angiography Methods 0.000 description 1
- 230000001746 atrial effect Effects 0.000 description 1
- 208000019804 backache Diseases 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000002146 bilateral effect Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 210000000601 blood cell Anatomy 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 206010008118 cerebral infarction Diseases 0.000 description 1
- 208000026106 cerebrovascular disease Diseases 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000002591 computed tomography Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 238000002592 echocardiography Methods 0.000 description 1
- 230000002124 endocrine Effects 0.000 description 1
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 1
- 229940011871 estrogen Drugs 0.000 description 1
- 239000000262 estrogen Substances 0.000 description 1
- ZHNUHDYFZUAESO-UHFFFAOYSA-N formamide Substances NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 238000000265 homogenisation Methods 0.000 description 1
- 238000009802 hysterectomy Methods 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 230000006651 lactation Effects 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 230000036210 malignancy Effects 0.000 description 1
- 230000003211 malignant effect Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 208000007106 menorrhagia Diseases 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- 230000002438 mitochondrial effect Effects 0.000 description 1
- 230000009149 molecular binding Effects 0.000 description 1
- 230000004879 molecular function Effects 0.000 description 1
- 230000002632 myometrial effect Effects 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 230000002611 ovarian Effects 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 230000000242 pagocytic effect Effects 0.000 description 1
- 238000010239 partial least squares discriminant analysis Methods 0.000 description 1
- 238000010827 pathological analysis Methods 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 210000002381 plasma Anatomy 0.000 description 1
- 230000005368 positive regulation of B cell activation Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 230000002797 proteolythic effect Effects 0.000 description 1
- 238000000575 proteomic method Methods 0.000 description 1
- 208000020016 psychiatric disease Diseases 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 238000011158 quantitative evaluation Methods 0.000 description 1
- 230000007974 regulation of B cell activation Effects 0.000 description 1
- 230000008844 regulatory mechanism Effects 0.000 description 1
- 238000002271 resection Methods 0.000 description 1
- 210000005245 right atrium Anatomy 0.000 description 1
- 210000005241 right ventricle Anatomy 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 208000013220 shortness of breath Diseases 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 238000010257 thawing Methods 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 230000005748 tumor development Effects 0.000 description 1
- 231100000588 tumorigenic Toxicity 0.000 description 1
- 230000000381 tumorigenic effect Effects 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 230000002792 vascular Effects 0.000 description 1
- 230000009790 vascular invasion Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
- G01N33/6893—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids related to diseases not provided for elsewhere
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N30/00—Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
- G01N30/02—Column chromatography
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N30/00—Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
- G01N30/02—Column chromatography
- G01N30/62—Detectors specially adapted therefor
- G01N30/72—Mass spectrometers
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N30/00—Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
- G01N30/02—Column chromatography
- G01N30/88—Integrated analysis systems specially adapted therefor, not covered by a single one of the groups G01N30/04 - G01N30/86
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
- G01N33/6854—Immunoglobulins
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N30/00—Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
- G01N30/02—Column chromatography
- G01N30/88—Integrated analysis systems specially adapted therefor, not covered by a single one of the groups G01N30/04 - G01N30/86
- G01N2030/8809—Integrated analysis systems specially adapted therefor, not covered by a single one of the groups G01N30/04 - G01N30/86 analysis specially adapted for the sample
- G01N2030/8813—Integrated analysis systems specially adapted therefor, not covered by a single one of the groups G01N30/04 - G01N30/86 analysis specially adapted for the sample biological materials
- G01N2030/8831—Integrated analysis systems specially adapted therefor, not covered by a single one of the groups G01N30/04 - G01N30/86 analysis specially adapted for the sample biological materials involving peptides or proteins
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2800/00—Detection or diagnosis of diseases
- G01N2800/54—Determining the risk of relapse
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2800/00—Detection or diagnosis of diseases
- G01N2800/70—Mechanisms involved in disease identification
- G01N2800/7023—(Hyper)proliferation
- G01N2800/7028—Cancer
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Immunology (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Pathology (AREA)
- General Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Analytical Chemistry (AREA)
- Physics & Mathematics (AREA)
- Molecular Biology (AREA)
- Urology & Nephrology (AREA)
- Hematology (AREA)
- Biomedical Technology (AREA)
- Food Science & Technology (AREA)
- Medicinal Chemistry (AREA)
- Microbiology (AREA)
- Cell Biology (AREA)
- Biotechnology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Investigating Or Analysing Biological Materials (AREA)
Abstract
The invention relates to the technical field of biological detection, and discloses a biomarker for predicting the progress of intravenous smooth myoma and application thereof. Specifically, a biomarker for predicting the progression of the intravenous smooth myoma is obtained by adopting a serum proteomics technology and a data statistics analysis technology, wherein the biomarker is any one or a combination of more than one of [ IGLc601_light_IGLV2-14_IGLJ3 (Fragment) ] [ IGc843_heavy_IGHV 3-49_IGHD3-3_IGHJ6 (Fragment) ] [ IGc678_heavy_IGHV 3-23_IGHD1-14_IGHJ4 (Fragment) ] [ IGH+IGLc492_heavy_IGHV 3-20_IGHD3-22_IGHJ4 (Fragment) ]. When the method is used for predicting the progress of the intravenous smooth myoma, the operability is strong, the accuracy is high, the diagnosis and the prediction can be realized by taking blood, the method is rapid and convenient, belongs to an innovated method, is favorable for the early diagnosis and the treatment of the intravenous smooth myoma and the monitoring of tumor recurrence, and has good clinical value.
Description
Technical Field
The invention relates to the technical field of biological detection, in particular to a biomarker for predicting the progress of intravenous smooth myoma and application thereof.
Background
Intravenous leiomyomas (intravenous leiomyomatosis, IVL) are a rare endocrine-related tumor with intravascular invasive properties. Studies have shown that IVL has a tendency to be estrogen dependent and is common in women of childbearing age, particularly in patients with a history of uterine fibroid surgery (incidence of about 25%). The mechanism of its occurrence is currently controversial, but most scholars believe that IVL originates from the uterine vein wall or pelvis outside the uterus, can extend into the venous passages of the uterus or pelvis, invade through the iliac or ovarian veins and extend to the inferior vena cava, and even the heart. Although the histological pathology of IVL is classified as benign, its continued growth may lead to serious circulatory disorders such as: syncope or sudden death, pulmonary embolism or cerebral infarction, so IVL should be emphasized in clinic and scientific research. Currently, the preferred treatment for IVL is surgery, but there is still a lack of reliable assessment and prediction as to whether surgery is sufficiently clean and whether there is recurrence after surgery. In addition, clinical symptoms and imaging (e.g., echocardiography, computed tomography-based angiography, etc.) are ambiguous when IVL is small, whether initially, recurrently, or for post-operative residues, with a high rate of missed diagnosis, and it is likely that IVL has been prolonged to a large extent by the time it is found, or even has invaded the heart. Therefore, a more sensitive screening index is urgently needed for diagnosis and prognosis prediction of clinical IVL.
Disclosure of Invention
The present invention provides a biomarker for predicting the progression of intravenous leiomyoma and application thereof. The method aims at screening and identifying key proteins affecting IVL progression (complete health, healthy uterine fibroids, no recurrence after IVL operation and recurrence after IVL operation) from serum of IVL patients, explaining potential regulatory mechanisms of IVL recurrence, providing a promising target for IVL diagnosis and/or prognosis prediction, and providing a theoretical basis for subsequent research.
In order to achieve the above purpose, the present invention adopts the following technical scheme:
in a first aspect, the present invention provides biomarkers for predicting the progression of an intravenous smooth myoma, which may be any one or a combination of [ iglc601_light_igl2-14_iglj3 (Fragment) ] (a0a5c2fue 5), [ igc843_weave_ighv3-49_ighd3-3_ighj6 (Fragment) ] (a0a5c GPQ 1), [ igc678_weave_ighv3-23_ighd1-14_ighj4 (Fragment) ] (a0a5c2gnc 7), [ igh_weave_ighv3-20_ighd3-22_ighj4 (Fragment) ].
Further, the expression level of the biomarker [ IGLc601_light_IGLV2-14_IGLJ3 (Fragment) ] increases with the overall trend of the progression of the intravenous leiomyoma, and the expression level of [ IGc843_heel_IGHV 3-49_IGHD3-3_IGHJ6 (Fragment) ], [ IGc678_heel_IGHV 3-23_IGHD1-14_IGHJ4 (Fragment) ] and [ IGH+IGLc492_heel_IGHV 3-20_IGH3-22_IGHJ4 (Fragment) ] decreases with the overall trend of the progression of the intravenous leiomyoma.
Further, the progression of the intravenous leiomyoma sequentially comprises 4 stages, namely complete health, healthy uterine fibroids, no recurrence after IVL operation and recurrence after IVL operation.
In a second aspect, the invention provides the use of a biomarker, which is any one or a combination of more of [ iglc601_light_igl2-14_iglj3 (Fragment) ], [ igc843_heav_ighv3-49_ighd3-3_ighj6 (Fragment) ], [ igc678_heav_ighv3-23_ighd1-14_ighj4 (Fragment) ], [ igh+iglc492_heav_ighv3-20_ighd3-22_ighj4 (Fragment) ], in the manufacture of a kit for predicting the progression of an intravenous leiomyoma.
Further, the kit includes reagents for detecting the expression level of [ IGLc601_light_IGLV2-14_IGLJ3 (Fragment) ], [ IGc843_weave_IGHV 3-49_IGHD3-3_IGHJ6 (Fragment) ], [ IGc678_weave_IGHV 3-23_IGHV 3-23_IGH1-14_IGHJ4 (Fragment) ], [ IGH+IGLc492_weave_IGHV 3-20_IGHD3-22_IGHJ4 (Fragment) ].
Further, the samples used to detect the expression levels of [ IGLc601_light_IGLV2-14_IGLJ3 (Fragment) ], [ IGc843_weave_IGHV 3-49_IGHD3-3_IGHJ6 (Fragment) ], [ IGc678_weave_IGHV 3-23_IGHD1-14_IGHJ4 (Fragment) ], [ IGH+IGLc492_weave_IGHV 3-20_IGHD3-22_IGHJ4 (Fragment) ] were serum.
In a third aspect, the invention provides a method of screening for biomarkers for predicting the progression of an intravenous leiomyoma comprising the steps of:
step 1, respectively collecting serum of a patient without recurrence after IVL operation, a patient with recurrence after IVL operation, a completely healthy control person and a healthy person with hysteromyoma control;
step 2, protein detection is carried out through proteomics based on independent data acquisition mass spectrum;
and 3, screening core proteins involved in IVL progression by using weighted gene co-expression network analysis (WGCNA), lasso penalty Cox regression analysis (Lasso), trend clustering and generalized linear regression model (GLM).
Further, the specific process of protein detection in step 2 through the proteomics based on data independent acquisition mass spectrum is as follows:
step 2.1, separating the most abundant proteins in a serum pool by using a multiple affinity removal system chromatographic column, respectively separating high-abundance proteins and low-abundance proteins, and then desalting and concentrating the high-abundance components and the low-abundance components by using a 5kDa ultrafiltration tube; adding SDT buffer solution, boiling for 15 minutes; centrifugation was then carried out at 14000g for 20 minutes, and the BCA assay was used;
step 2.2, digesting high-abundance proteins and low-abundance proteins, specifically: repeated ultrafiltration of 200 μg protein using UA buffer to remove detergents, DTT and other low molecular weight components; then 100 μl iodoacetamide was added to block the reducing cysteine residues; then sequentially using 100 mu L of UA buffer solution and NH 4 HCO 3 Rinsing the filter tank by using a buffer solution; finally, the protein suspension is digested for further desalting and concentration;
step 2.3, separating the digested Chi Tai from the low abundance component using a high pH reverse phase peptide separation kit; iRT kit was added to correct iRT standard peptide to sample peptide volume ratio of 1:3, relative retention time difference between runs;
step 2.4, analyzing all scores for data dependent acquisition library generation by using a mass spectrometer;
step 2.5, further detecting the polypeptide by liquid chromatography-tandem mass spectrometry in DIA mode.
Further, the MS detection method in the step 2.4 is positive ions, the scanning range is 300-1800m/z, the MS1 scanning resolution is 60000, the scanning frequency is 200m/z, the automatic gain control target is 3e6, the maximum IT is 25MS, and the dynamic exclusion is 30.0s; the MS2 scanning resolution is 15000, the automatic gain control target is 5e4, the maximum IT is 25MS, and the normalized collision energy is 30eV;
further, each DIA period in step 2.5 includes one complete MS-SIM scan and 30 DIA scans, and the SIM full scan resolution is 120000; the automatic gain control target is 3e6; maximum IT 50ms; a configuration mode; the resolution of DIA scan is set to 15000; the automatic gain control target is 3e6; max IT auto; the normalized collision energy was 30eV.
Further, the DIA mode was injected after every six injections of the quality control sample at the beginning of the MS study and throughout the protein detection process, thereby ensuring the stability of the detection system and the reliability of experimental data.
Compared with the prior art, the invention has the following advantages:
the screening method provided by the invention can well obtain the biomarker for IVL progress prediction, has strong operability and high accuracy when being used for IVL progress prediction, can realize diagnosis and prediction by taking blood, is quick and convenient, belongs to an innovative-free method, is favorable for early diagnosis and treatment of IVL and monitoring of tumor recurrence, and has good clinical value.
Drawings
FIG. 1 is an identification of differentially expressed proteins. A. Enumeration of all identified peptides or proteomes in each sample. The dotted line on the abscissa indicates that the amount of protein or polypeptide is 50% of the total maximum identified amount. B. A volcanic plot of the differential changes of all proteins, wherein up-regulated protein is on the right side and down-regulated protein is on the left side, with Fold Change (FC) > 1.5 or < 0.67, p-value < 0.05; C. the first 10 differentially expressed proteins are shown as right (increased expression) or left (decreased content); IVL panel versus control OPLS-DA score plot showing the degree of separation between the two panels. t1 represents principal component one, to1 represents principal component two, and the distribution of dots represents the degree of difference between groups and within a group.
FIG. 2 is a structural property and functional enrichment analysis of all differentially expressed proteins. Subcellular localization and distribution of differentially expressed proteins between IVL group and control group. B. The major structure of the protein domain of all differentially expressed proteins. C. GO annotation analysis of all differentially expressed proteins, with the highlighted item marked with a dashed box. The KEGG pathway is enriched and the size of the bubbles reflects the number of proteins involved in the KEGG pathway.
Fig. 3 is a determination of WGCNA and key modules. A. Soft threshold (power=7) analysis. B. Cluster dendrograms based on modular signature genes. C. Recording of correlation of module signature genes with IVL tumorigenic clinical signatures, color blocks on the left ordinate are of different module types, large middle color blocks represent each protein module, and p-values and correlation coefficient values are marked on the color blocks. D. Expression patterns of proteins contained in the four core hub modules.
FIG. 4 is an analysis of protein co-expression networks in module and trend cluster evaluation. A. Each hub module is involved in PPI analysis of the gene; the node size represents the degree of connection, the higher the connection, the larger the node. B. Identifying clusters according to the expression trend of each protein in the disease progression process; the abscissa indicates the different groups and the ordinate indicates the variation in expression after homogenization.
FIG. 5 shows the construction of Lasso regression model and related analysis. A. Cross-validation evaluates partial likelihood bias, with solid vertical lines representing corresponding 95% CI and partial likelihood bias. Lasso coefficient spectra for 20 progress-related proteins, each curve representing a protein. C. Correlation analysis of the screened proteins based on Lasso analysis. Relative expression levels of the 12 proteins.
FIG. 6 is a generalized linear regression model based identification of hubcan. Forest charts show the dominance ratios of the four proteins. Proteins and p-values ∈0.05 are considered as independent risk factors or protective factors for IVL progression.
FIG. 7 is a graph showing the verification of the prediction accuracy of four core protein models. The reliability of the model is verified by an ROC machine learning algorithm based on the multi-classification variable micro-average and macro-average. 0-3 represent CO-no, CO-um, IVL-no and IVL-re subgroups, respectively. The abscissa represents the false positive rate and the ordinate represents the true positive rate. The AUC values were used to characterize the performance of the model.
Detailed Description
The following examples are illustrative of the invention and are not intended to limit the scope of the invention. The technical means used in the examples are conventional means well known to those skilled in the art unless otherwise indicated.
The experimental methods used in the examples below are conventional methods unless otherwise specified.
All materials, reagents, etc. in the examples described below are commercially available unless otherwise specified.
Examples
1. Sample collection
Patient: the groups were divided into recurrent IVL subgroup (IVL-re) and non-recurrent IVL subgroup (IVL-no), 15 cases each.
Healthy control group: half is defined as having no uterine fibroid (CO-no) and the other half is defined as having uterine fibroid (CO-um).
The patient inclusion criteria were:
1) Preoperative imaging or intraoperative discovery of celiac veins, pelvic veins (inferior vena cava, iliac veins, parauterine veins) or right atrial occupancy lesions;
2) Patients undergoing surgery treatment in our hospital;
3) Post-operative pathological diagnosis is IVL concomitant vascular invasion;
4) Age 18 years old or older.
The exclusion criteria were:
1) Lack of clinical data;
2) Women in lactation or pregnant women;
3) Mental disorder or inability to self-care;
4) Is not willing to participate in the study;
5) Other malignant patients were pooled.
The standards for the control group were:
1) Age above 18 years old;
2) No history of other malignancy;
3) Women in non-pregnant and lactating period;
4) There was no history of hysteromyectomy or hysterectomy related gynecological surgery.
All subjects were confirmed by ultrasound examination (including abdominal and pelvic blood vessels, examination was preceded by ordering the subject to adequately hold urine to fill the bladder in order to better assess uterus and double appendages) for abdominal or pelvic vascular occupancy lesions and uterine fibroids. The IVL group first examined the abdominal large vessels, including the patency of the inferior vena cava and bilateral iliac veins, with or without occupancy, and then observed whether there were occupancy lesions in the pelvic cavity. The ultrasonic examination of the inferior vena cava, the iliac vein and the gynaecology proves that the abdominal and pelvic blood vessels have no space occupying lesion and the uterus has myoma. If uterine fibroids are present, the location and size of the largest leiomyomas are recorded simultaneously. After examination, two other senior doctors independently evaluate the report, and if there is inconsistency, the decision is made through discussion. The inspection result is: the CO-no subgroup is characterized by a homogeneous myometrial echo without hysteromyoma or pelvic placeholder lesions. In contrast, the CO-um subgroup of myolayers showed hypoechogenic masses (maximum diameter. Gtoreq.2 cm) and ultrasound exhibited a typical swirl-like structure. The IVL-re subgroup found either a space-occupying lesion (not less than 1 cm) in the pelvic vein or abdominal vein, or a residual lesion greater than the previous size after at least two consecutive ultrasound examinations. When no placeholder lesions were found in the pelvis or blood vessels, the patients were divided into the IVL-no subgroup.
For all subjects after the ultrasonic examination, venous blood was collected from the elbow vein, plasma, serum and blood cells were extracted separately and packaged into 0.5 ml cryo-tubes and relevant information was marked on the tubes for subsequent retrieval and searching of samples. All aliquots were stored in-80 ℃ ice bins, which were equipped with appropriate alarm systems and emergency back-up power to prevent accidental thawing.
2. Baseline characteristics of the subject
From the above sample collection, it can be seen that: the study was divided into two groups, 30 IVL patients and 30 healthy controls, as shown in Table 1. Both groups were aged around 49.0 years, and the differences were not statistically significant. The median of the beginner ages of the IVL and CO groups were 14.0 years and 15.0 years, respectively (p-value=0.07). All patients had various and atypical symptoms before surgical resection, mainly manifested as shortness of breath (most common), edema of lower extremities, lumbago or backache, few patients had complained about menorrhagia, abdominal pain, syncope, abdominal mass, and as expected, still some cases were asymptomatic (n=7). In fact, all patients enrolled in the study had uterine fibroids, with more than two-thirds of the patients having a history of uterine surgery. The iliac veins account for the most prominent (93.3%) of the extended path of IVL. Notably, the affected extension sites of IVL are mainly right atrium (n=14), right ventricle (n=4), and sub-renal IVC (n=4), in a cast or lumen shape. As shown in Table 1, 23 IVL patients underwent primary surgery and the other patients underwent secondary surgery, except for 5 cases with intravascular or pelvic residual lesions, all lesions were completely resected. More seriously, half of IVL patients recur post-operatively and lesions appear in the blood vessels.
TABLE 1 baseline characteristics of all subjects
3. Detection of samples
After taking the ultrasound examination, the serum samples taken from each participant were subjected to further proteomic testing. The specific process is as follows:
separating the most abundant proteins in the serum pool by using a multiple affinity removal system chromatographic column (Agilent technologies company), separating high-abundance proteins and low-abundance proteins respectively, and desalting and concentrating the high-abundance and low-abundance components by using a 5kDa ultrafiltration tube (Sartorius); adding SDT buffer solution, boiling for 15 minutes; and then centrifuged at 14000g for 20 minutes, and quantified using BCA (Bio-Rad, USA).
Digestion of high and low abundance proteins both employs a digestion program modified from the Filtration Assisted Sample Preparation (FASP) protocol. Briefly, 200. Mu.g of protein was subjected to repeated ultrafiltration using UA buffer (8M urea, 150mM Tris-HCl pH 8.0) to remove detergents, DTT and other low molecular weight components; then 100 μl iodoacetamide was added to block the reducing cysteine residues; then sequentially using 100 mu L of UA buffer solution and NH 4 HCO 3 Rinsing the filter tank by using a buffer solution; finally, the protein suspension is digested for further desalting and concentration.
Use of a high pH reverse phase peptide isolation kit (Thermo Scientific) TM Pierce TM ) Separating the digest Chi Tai from the low abundance component; iRT kit (Biognosys) was added to correct iRT standard peptide to sample peptide volume ratio of 1:3, the relative retention time difference between runs.
All fractions used for Data Dependent Acquisition (DDA) library generation were analyzed using a Thermo Scientific Q Exactive HF X mass spectrometer. The MS detection method is positive ions, the scanning range is 300-1800m/z, the MS1 scanning resolution is 60000, the scanning frequency is 200m/z, the Automatic Gain Control (AGC) target is 3e6, the maximum IT is 25MS, and the dynamic exclusion is 30.0s. The MS2 scan resolution was 15000, the agc target was 5e4, the maximum IT was 25MS, and the normalized collision energy was 30eV.
The polypeptides were further detected by liquid chromatography-tandem mass spectrometry (LC-MS/MS) in DIA mode. Each DIA cycle includes one complete MS-SIM scan and 30 DIA scans (SIM full scan resolution of 120000; agc 3e6; max IT 50MS; configuration mode; resolution of DIA scan set to 15000; agc target 3e6; max IT auto; normalized collision energy of 30 eV).
4. Quality control and analysis
QC samples (equal amounts of mixed samples from each sample in the experiment) were used to observe MS performance, and DIA mode was injected at the beginning of the MS study and after every six injections throughout the experiment.
DDA library data the FASTA sequence database was retrieved using spectrobautm software. The database is the uniprot_human database. The parameters are set as follows: the enzyme is trypsin, two maximum cleavage is carried out, the immobilized modification is formamide methyl (C), the dynamic modification is oxidation modification (M), and the acetyl (protein n) is obtained. All the generated data were protein identified based on 99% confidence, with False Discovery Rate (FDR) 1%. The original raw file and DDA search results were imported into Spectronaut Pulsar XTM (biogosys) to construct a spectral library. The spectrum library constructed as described above was searched using SpectronautTM and the DIA data was analyzed. All results were filtered at a Q cutoff of 0.01.
5. Data processing
Statistical analysis was performed using SPSS software (version.22, chicago, IL, USA). To describe the baseline characteristics of all participants in detail, the data were expressed in numbers (percent) of the count variable or in median (upper quartile, lower quartile) of the measured data, as determined by the Shapiro-Wilk test in combination with a normal map, based on the normal distribution of the data. The comparison between groups uses a non-parametric test. Only at p values <0.05, the results were statistically significant.
6. Analysis of results
(1) Identification of differentially expressed proteins (FIG. 1)
To distinguish between Differentially Expressed Proteins (DEPs) in IVL and CO groups, the present study introduced the DIA technique. After proteolytic preparation of the samples, the processed raw files from the DIA analysis were imported into Spectronaut for qualitative and quantitative evaluation. As shown in fig. 1A, we enumerate the recognizable peptide and protein groups in each sample, ultimately defining 2582 proteins for subsequent analysis. Based on this, IVL groups up-regulated 54 proteins and down-regulated 39 proteins depending on the filtration conditions (fold change (FC) > 1.5 or < 0.67, p-value < 0.05), as shown in FIG. 1B. Among these DEPs, the top ten proteins were recorded, with A0A5C2GK72, Q0ZCH9, A0A5C2G577, A0A5C2FV72, A0A5C2FVH2, A0A5C2GFF7, A0A5C2GQR5, A0A5C2GQ71, A0A5C2GTT5, A0A5C2FWG8 being the most prominent proteins expressed in IVL patients. In contrast, A0A5C2GIl4, A0A5C2GEY, Q562R1, A0A5C2GNC7, A0A5C2FWY9, A0A5C2GBR3, A0A5C2G2C3, A0A5C2G6T6, A0A5C2GHL8, and A8K061 were expressed primarily in healthy women (fig. 1C). Orthogonal partial least squares discriminant analysis (OPLS-DA) was performed on the two sets of protein spectra by building a discriminant model to evaluate the variability of the protein expression patterns. Our study shows that samples from the same group have similar protein expression trends, and the model established using the supervised discriminant analysis method has reliable stability and predictability, r2y=0.912, q2y=0.448, as shown in fig. 1D.
(2) Structural property and functional enrichment analysis of differentially expressed proteins
To elucidate the potential features of these identified DEPs, a multicellular localization was predicted using the multi-class SVM classification system CELLO (http:// CELLO. Life. Nctu. Edu. Tw /). It is well known that proteins behave differently in different organelles (such as mitochondria and endoplasmic reticulum), and thus analysis of subcellular localization helps to further explore the function of the protein. As shown in fig. 2A, these DEPs mostly consist of extracellular or secreted proteins (n=82), with more than one third located in the nucleus. In addition, some of them belong to mitochondrial or cytoplasmic proteins. Subsequently, the protein sequence was retrieved and mapped using the interprerscan software to identify protein domain features. A protein domain is defined as two or more spatially distinct localized regions of a large protein molecule that are tightly linked by adjacent supersecondary structures on the polypeptide chain. Each region has its own unique spatial structure and assumes different biological functions. Here, the main features of the domain enrichment are the immunoglobulin V-set and immunoglobulin C1-set domains, followed by globulins, actin and intermediate silk proteins (FIG. 2B). To determine the functional enrichment of these DEPs, we mapped the ontology (GO) pattern and annotated the sequences based on Blast2GO software. After the annotation step, the studied proteins were aligned to the on-line Kyoto genes and genome encyclopedia (KEGG) database (http:// geneontologiy. Org /), their KEGG ortholog identity was retrieved, and then mapped to the pathways in KEGG. As shown in fig. 2C, these DEPs are mainly enriched in antigen receptor mediated signaling pathways, regulation of B cell activation, positive regulation of B cell activation, B cell receptor signaling pathways, phagocytic recognition, etc. biological processes. In addition, these DEPs are primarily involved in molecular functions such as small molecule binding, oxygen carrier activity, oxygen binding, and molecular carrier activity. For cellular components, these DEPs are immunoglobulin complexes, circulation and the central part of immunoglobulin complexes. KEGG labeling showed that nitrogen metabolism, IL-17 signaling pathway, and Rap1 signaling pathway were most correlated with these DEPs (fig. 2D).
(3) Determination of WGCNA and critical modules
Weighted gene co-expression network analysis (WGCNA) is an algorithm for constructing a co-expression network using a pattern of interactions between targets. Module relevance is identified by computing adjacency matrices and Topology Overlap Matrices (TOMs) based on hierarchical clustering and dynamic tree-chopping. Kinesins are ultimately selected from target modules associated with IVL tumorigenesis. This study utilized WGCNA binding to DEPs to determine the IVL phenotype associated kinesins. As shown in fig. 3aa, b, power=7 was chosen as the optimal soft threshold, and the co-expressed protein modules were identified from a scaleless topology model fit (scaleless r2=085) and average connectivity. Then, 8 modules (black, brown, blue, yellow, green, red, cyan, and gray) were identified in the cluster dendrogram (fig. 3B). As shown in fig. 3C, 4 of the modules were significantly correlated with tumorigenesis of IVL (brown module correlation coefficient=0.28, p-value=0.03; green module correlation coefficient= -0.48, p-value=1e-04; red module correlation coefficient=0.52, p-value=2e-5; correlation coefficient= -0.47, yellow module p-value=1e-04 when considering the relation of module to sample properties). Then, we analyzed the expression patterns of the proteins contained in these modules, and our results indicate that the expression levels of the proteins in the brown and red modules are relatively increased in IVL cases. In contrast, the proteins in the green and yellow modules were down-regulated (fig. 3D). Based on the protein-protein interaction (PPI) information of the four hinge modules, 347 proteins were finally identified, establishing an interaction network. The results are imported into Gephi software (https:// Gephi. Org /) to visualize and further analyze the PPI network. It was observed that the brown and yellow modules involved 188 and 114 junction proteins, respectively. The remaining 26 hub proteins are arranged in green modules, 19 belonging to the red module.
(4) Analysis of protein co-expression networks in module and trend cluster evaluation
We further combined the results of WGCNA and DEPs, and the overlapping results indicated that 31 proteins were ultimately confirmed, as shown in figure 4A (figure 4A). As the disease progresses, the dynamic expression patterns of these proteins are studied, revealing their potential modes of action is essential. Thus, the CO and IVL groups are divided into four subgroups, CO-no, CO-um, IVL-no and IVL-re, depending on the presence or absence of uterine fibroids and postoperative recurrence of IVL. The fuzzy c-means (FCM) algorithm of the Mfuzz software is applied to divide all proteins into corresponding clusters according to the trend of the change of the protein expression. Interestingly, cluster 2 and cluster 3 were noted because the overall change exhibited a relatively consistent upward or downward trend, 7 proteins in cluster 2 participated, and 13 proteins in cluster 3 participated (fig. 4B).
(5) Based on Lasso regression analysis
In the following analysis, confounding factors between the above screened proteins were removed using Lasso-penalized Cox regression (Lasso), and finally 12 proteins were determined and verified by lambda value (λ) and partial likelihood bias cross-validation (G3 GAU4, A0A5C2G2C3, A0A5C2GPQ1, P07737, A0A5C2FZ98, A0A5C2GNC7, A0A5C2GBR3, P05109, A0A5C2FVH4, A0A5C2FUE5, A0A5C2GVV3, A0A5C2FVK 9) (fig. 5a, b.) to elucidate the relationship between these proteins, the correlation between them was known by correlation analysis. Proteins with expression correlation may be co-involved in biological processes, i.e. functional correlation. As shown in fig. 5C, P07737 is positively correlated with A0A5C2G2C3 (correlation coefficient=0.42). A0A5C2FZ98 is also associated with G3GAU 4. A0A5C2GVV3 had a significant positive correlation with P05109 (correlation coefficient was 0.45). A0A5C2FVK9 also has a close correlation (correlation coefficient 0.48 or 0.49, respectively) with A0A5C2FUE5 or A0A5C2GVV 3. P05109 is inversely related to the abundance of A0A5C2FZ 98. The abundance of A0A5C2FVH4 and A0A5C2GBR3 are inversely related. FIG. 5D shows the relative expression levels of the 12 proteins described above, and our results indicate that these proteins are clearly distinguishable between the IVL group and the control group. Of which 5 proteins were significantly enhanced in the IVL-no and IVL-re subgroups compared to the CO-no and CO-um subgroups, while the remaining 7 proteins were the opposite.
(6) Hub protein identification based on generalized linear regression model
Considering the interactions between these 12 proteins, we finally used a generalized linear regression model (GLM) to determine the characteristic key proteins. Our data demonstrate that four proteins, IGLc601_light_IGLV2-14_IGLJ3 (Fragment), [ IGc843_weave_IGHV 3-49_IGHD3-3_IGHJ6 (Fragment), [ IGc678_weave_IGHV 3-23_IGHD1-14_IGHJ4 (Fragment), [ IGH+IGLc492_weave_IGHV 3-20_IGHD3-22_IGHJ4 (Fragment), [ A0A5C2GBR 3) ] are closely related to the progression of IVL, as shown in FIG. 6. Iglc601_light_iglv2-14_iglj3 (Fragment) ((A0 A5C2FUE 5)) may be an independent risk factor (or=2.64) that promotes disease progression; [ IGc 843_weave_IGHV 3-49_IGHD3-3_IGHJ6 (Fragment) ], (A0A 5C2GPQ 1), [ IGc 678_weave_IGHV 3-23_IGHD1-14_IGHJ4 (Fragment) ], (A0A 5C2GNC 7), [ IGH+IGLc492_weave_IGHV 3-20_IGHD3-22_IGHJ4 (Fragment) ], especially the former may be protective indicators (OR 0.32, 0.60, 0.53, respectively) for slowing down the progression of IVL. These results indicate that these four proteins are important factors in the future for hopefully predicting IVL prognosis and progression.
(7) Verification of core protein prediction accuracy
To verify the predictive value of these four proteins, the study was analyzed using a subject work profile (receiver operator characteristic curve, ROC) based on the Python module Scikt-learn (https:// Scikit-learn. Org /). Two integrated machine learning algorithms were introduced, including micro-average ROC (micro-average ROC: globally computing the index and treating each element of the index matrix as a label) and macro-average ROC (macro-average ROC: computing the matrix for each label and evaluating its unweighted average). Our findings indicate that these representative key proteins play a critical role in distinguishing between cases of different pathological conditions, with area under the curve (AUC) of 0.69 or 0.68, respectively, in either micro-or macro-average ROC analysis (fig. 7). Furthermore, it should be noted that these four key proteins show reliable discriminatory ability in identifying post-operative recurrent IVL patients (AUC value=0.92). These results indicate that the model built from the final identified key proteins is of great significance and predictive value for the progression of IVL.
The above research results are integrated to obtain: IVL patients can be effectively distinguished from healthy persons based on DIA proteomic analysis of their serum proteins. The possibility of tumor development or recurrence of IVL was demonstrated by the development of a panel consisting of 4 core proteins, which was further confirmed by a multi-class ROC analysis. Among these characteristic proteins, the expression of the immunoglobulin light chain protein iglc601_light_igl2-14_iglj3 (Fragment) (A0 A5C2FUE 5), a potential risk factor, increases with the progression of the disease. In contrast, the expression of three immunoglobulin heavy chain proteins ([ IGc843_weave_IGHV 3-49_IGHD3-3_IGHJ6 (fragments) ] (A0A 5C2GPQ 1), [ IGc678_weave_IGHV 3-23_IGHD1-14_IGHJ4 (fragments) ] (A0A 5C2GNC 7) and [ IGH+IGLc492_weave_IGHV 3-20 IGHD3-22_IGHJ4 (fragments) ] (A0A 5C2GBR 3)) as potential protection indicators was in a decreasing trend. Overall, these 4 proteins are likely to be promising biomarkers for predicting IVL progression in this study, and can be used for diagnosis and prognosis prediction of IVL.
While the invention has been described in detail in the foregoing general description and with reference to specific embodiments thereof, it will be apparent to one skilled in the art that modifications and improvements can be made thereto. Accordingly, such modifications or improvements may be made without departing from the spirit of the invention and are intended to be within the scope of the invention as claimed.
Claims (9)
1. Use of a combination of biomarkers, wherein the biomarkers are [ iglc601_light_igl2-14_iglj3 (Fragment) ], [ igc843_weave_ighv3-49_ighd3-3_ighj6 (Fragment) ], [ igc678_weave_ighv3-23_ighd1-14_ighj4 (Fragment) ], [ igh+iglc492_weave_ighv3-20_ighd3-22_ighj4 (Fragment) ], for the preparation of a reagent or kit for predicting the progression of an intravenous leiomyoma.
2. The use according to claim 1, characterized in that: the expression level of the biomarker [ IGLc601_light_IGLV2-14_IGLJ3 (Fragment) ] increases with the general trend of the progression of the intravenous leiomyoma, and the expression level of [ IGc843_heel_IGHV 3-49_IGHD3-3_IGHJ6 (Fragment) ], [ IGc678_heel_IGHV 3-23_IGH1-14_IGHJ4 (Fragment) ] and [ IGH+IGLc492_heel_IGHV 3-20_IGH3-22_IGHJ4 (Fragment) ] decreases with the general trend of the progression of the intravenous leiomyoma.
3. The use according to any one of claims 1 or 2, characterized in that: the intravenous smooth myoma comprises 4 stages in sequence before and after the intravenous smooth myoma is developed, wherein the stages are respectively completely healthy, healthy uterine myoma, no recurrence after the intravenous smooth myoma operation and recurrence after the intravenous smooth myoma operation.
4. The use according to claim 1, wherein the reagent or kit comprises a reagent for detecting the expression level of [ iglc601_light_igl2-14_iglj3 (Fragment) ], [ igc843_weave_ighv3-49_ighd3-3_ighj6 (Fragment) ], [ igc678_weave_ighv3-23_ighd1-14_ighj4 (Fragment) ], [ igh+iglc492_weave_ighv3-20_ighd3-22_ighj4 (Fragment) ].
5. The use according to claim 4, characterized in that: the samples of the expression levels of the detection [ IGLc601_light_IGLV2-14_IGLJ3 (Fragment) ], [ IGc843_heav_IGHV 3-49_IGHD3-3_IGHJ6 (Fragment) ], [ IGc678_heav_IGHV 3-23_IGHD1-14_IGHJ4 (Fragment) ], [ IGH+IGLc492_heav_IGHV 3-20_IGH3-22_IGHJ4 (Fragment) ] are serum.
6. The use according to claim 1, characterized in that the method of screening for biomarkers comprises the steps of:
step 1, respectively collecting serum of a patient without recurrence after intravenous leiomyoma operation, a patient with recurrence after intravenous leiomyoma operation, a completely healthy control person and a healthy person with hysteromyoma control;
step 2, protein detection is carried out through proteomics based on independent data acquisition mass spectrum;
and 3, screening the core protein participating in IVL progress by using weighted gene co-expression network analysis, lasso penalty Cox regression analysis, trend clustering and generalized linear regression model.
7. The use according to claim 6, wherein the specific process of protein detection in step 2 by proteomics based on data independent acquisition of mass spectrum is:
step 2.1, separating the most abundant proteins in a serum pool by using a multiple affinity removal system chromatographic column, respectively separating high-abundance proteins and low-abundance proteins, and then desalting and concentrating the high-abundance components and the low-abundance components by using a 5kDa ultrafiltration tube; adding SDT buffer solution, boiling for 15 minutes; centrifugation was then carried out at 14000g for 20 minutes, and the BCA assay was used;
step 2.2, digesting high-abundance proteins and low-abundance proteins, specifically: repeated ultrafiltration of 200 μg protein using UA buffer to remove detergents, DTT and other low molecular weight components; then 100 μl iodoacetamide was added to block the reducing cysteine residues; then rinsing the filter tank with 100 mu L of UA buffer solution and NH4HCO3 buffer solution; finally, the protein suspension is digested for further desalting and concentration;
step 2.3, separating the digested Chi Tai from the low abundance component using a high pH reverse phase peptide separation kit; iRT kit was added to correct iRT standard peptide to sample peptide volume ratio of 1:3, relative retention time difference between runs;
step 2.4, analyzing all scores for data dependent acquisition library generation by using a mass spectrometer;
step 2.5, further detecting the polypeptide by liquid chromatography-tandem mass spectrometry in DIA mode.
8. The method of claim 7, wherein the MS detection method in step 2.4 is positive ion, the scanning range is 300-1800m/z, the MS1 scanning resolution is 60000, the scanning frequency is 200m/z, and the automatic gain control target is 3e 6 Maximum IT25ms, dynamic exclusion 30.0s; MS2 scanning resolution is 15000, automatic gain control target is 5e 4 Maximum IT is 25ms, normalized collision energy is 30eV; each DIA period in step 2.5 includes one complete MS-SIM scan and 30 DIA scans, and the SIM full scan resolution is 120000; the automatic gain control target is 3e 6 The method comprises the steps of carrying out a first treatment on the surface of the Maximum IT 50ms; a configuration mode; the resolution of DIA scan is set to 15000; the automatic gain control target is 3e 6 The method comprises the steps of carrying out a first treatment on the surface of the Max IT auto; the normalized collision energy was 30eV.
9. The use according to claim 8, wherein the DIA mode is injected after every six injections of a quality control sample at the beginning of the MS study and throughout the protein assay, thereby ensuring the stability of the assay system and the reliability of the experimental data.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310690937.5A CN117007810B (en) | 2023-06-12 | 2023-06-12 | Biomarker for predicting progression of intravenous smooth muscle tumor and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310690937.5A CN117007810B (en) | 2023-06-12 | 2023-06-12 | Biomarker for predicting progression of intravenous smooth muscle tumor and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN117007810A CN117007810A (en) | 2023-11-07 |
CN117007810B true CN117007810B (en) | 2024-04-05 |
Family
ID=88566371
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310690937.5A Active CN117007810B (en) | 2023-06-12 | 2023-06-12 | Biomarker for predicting progression of intravenous smooth muscle tumor and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117007810B (en) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108949989A (en) * | 2018-08-06 | 2018-12-07 | 中国医学科学院北京协和医院 | One group of biomarker is preparing the purposes in IVL detection, diagnosis or Prognosis scoveillance product |
CN113189346A (en) * | 2021-04-27 | 2021-07-30 | 嘉兴学院 | Serum protein markers for detecting quality of pulse-activating injection and application thereof |
CN113325117A (en) * | 2021-06-25 | 2021-08-31 | 中国医学科学院北京协和医院 | Group of biomarkers for predicting progression of intravenous smooth myopathy |
CN114441770A (en) * | 2022-01-27 | 2022-05-06 | 浙江省肿瘤医院 | Plasma protein biomarker of malignant mesothelioma and application thereof |
CN114755421A (en) * | 2022-04-07 | 2022-07-15 | 华中科技大学同济医学院附属同济医院 | Protein biomarker for lymphangioleiomyomatosis and application |
CN115267214A (en) * | 2022-08-10 | 2022-11-01 | 南方医科大学珠江医院 | Lipid metabolism marker, screening method thereof and application of lipid metabolism marker in intracranial aneurysm |
-
2023
- 2023-06-12 CN CN202310690937.5A patent/CN117007810B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108949989A (en) * | 2018-08-06 | 2018-12-07 | 中国医学科学院北京协和医院 | One group of biomarker is preparing the purposes in IVL detection, diagnosis or Prognosis scoveillance product |
CN113189346A (en) * | 2021-04-27 | 2021-07-30 | 嘉兴学院 | Serum protein markers for detecting quality of pulse-activating injection and application thereof |
CN113325117A (en) * | 2021-06-25 | 2021-08-31 | 中国医学科学院北京协和医院 | Group of biomarkers for predicting progression of intravenous smooth myopathy |
CN114441770A (en) * | 2022-01-27 | 2022-05-06 | 浙江省肿瘤医院 | Plasma protein biomarker of malignant mesothelioma and application thereof |
CN114755421A (en) * | 2022-04-07 | 2022-07-15 | 华中科技大学同济医学院附属同济医院 | Protein biomarker for lymphangioleiomyomatosis and application |
CN115267214A (en) * | 2022-08-10 | 2022-11-01 | 南方医科大学珠江医院 | Lipid metabolism marker, screening method thereof and application of lipid metabolism marker in intracranial aneurysm |
Also Published As
Publication number | Publication date |
---|---|
CN117007810A (en) | 2023-11-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Ferreira et al. | Integration of spatial and single-cell transcriptomics localizes epithelial cell–immune cross-talk in kidney injury | |
Rodríguez-Suárez et al. | Urine as a source for clinical proteome analysis: from discovery to clinical application | |
CN110579611B (en) | Combined detection serum marker, kit and detection method for early screening and diagnosis of lung cancer | |
CN110196329A (en) | A kind of cancer of the esophagus early stage combined detection kit | |
CN111863250A (en) | Combined diagnosis model and system for early breast cancer | |
US20220336043A1 (en) | cfDNA CLASSIFICATION METHOD, APPARATUS AND APPLICATION | |
CN113325117B (en) | Application of a group of biomarkers in preparation of kit for predicting progression of intravenous smooth sarcomatosis | |
CN116908474B (en) | Biomarker related to atrial fibrillation and application thereof | |
CN117007810B (en) | Biomarker for predicting progression of intravenous smooth muscle tumor and application thereof | |
CN113391072A (en) | Ovarian cancer urine marker combination and application thereof | |
US20230235410A1 (en) | Identification system of circulating biomarkers for cancer detection, development method of circulating biomarkers for cancer detection, cancer detection method and kit | |
US20050202485A1 (en) | Method and compositions for detection of liver cancer | |
CN115128285A (en) | Kit and system for identifying and evaluating thyroid follicular tumor by protein combination | |
CN114252612A (en) | Liver cancer metastasis prediction system and method based on nerve cell adhesion molecules | |
Zhao et al. | Serum metabolites as early detection markers of non-muscle invasive bladder cancer in Chinese patients | |
CN116735889B (en) | Protein marker for early colorectal cancer screening, kit and application | |
CN113252899A (en) | Application of marker in preparation of colorectal cancer detection kit | |
CN112037852A (en) | Method and system for predicting lymph node metastasis of colorectal cancer at stage T1 | |
Shi et al. | Peptidome profiling of human serum of uveal melanoma patients based on magnetic bead fractionation and mass spectrometry | |
Hayran et al. | Elevated neutrophil to lymphocyte ratio as an indicator of secondary erythema nodosum, a retrospective observational study | |
CN115792247B (en) | Application of protein combination in preparation of thyroid papillary carcinoma risk auxiliary layering system | |
Ge et al. | Identification of novel serum protein biomarkers in the context of 3P medicine for intravenous leiomyomatosis: a data-independent acquisition mass spectrometry-based proteomics study | |
CN116246710A (en) | Colorectal cancer prediction model based on cluster molecules and application | |
CN116386716B (en) | Metabolomics and methods for gastric cancer diagnosis | |
Yang et al. | Diagnosis of c4d+ renal allograft acute humoral rejection by urine protein fingerprint analysis |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |