CN117229371A - Novel S protein mutant of coronavirus variant strain, genetically engineered mRNA thereof and vaccine composition - Google Patents
Novel S protein mutant of coronavirus variant strain, genetically engineered mRNA thereof and vaccine composition Download PDFInfo
- Publication number
- CN117229371A CN117229371A CN202210633828.5A CN202210633828A CN117229371A CN 117229371 A CN117229371 A CN 117229371A CN 202210633828 A CN202210633828 A CN 202210633828A CN 117229371 A CN117229371 A CN 117229371A
- Authority
- CN
- China
- Prior art keywords
- protein
- mrna
- leu
- val
- ser
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 101710139375 Corneodesmosin Proteins 0.000 title claims abstract description 139
- 102100031673 Corneodesmosin Human genes 0.000 title claims abstract description 138
- 108020004999 messenger RNA Proteins 0.000 title claims abstract description 100
- 229960005486 vaccine Drugs 0.000 title claims description 30
- 239000000203 mixture Substances 0.000 title claims description 19
- 241000711573 Coronaviridae Species 0.000 title abstract description 10
- 230000035772 mutation Effects 0.000 claims abstract description 24
- 239000013638 trimer Substances 0.000 claims abstract description 11
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims abstract description 10
- 201000010099 disease Diseases 0.000 claims abstract description 6
- 208000035475 disorder Diseases 0.000 claims abstract description 4
- 239000002773 nucleotide Substances 0.000 claims description 88
- 125000003729 nucleotide group Chemical group 0.000 claims description 85
- 150000002632 lipids Chemical class 0.000 claims description 50
- -1 cationic lipid Chemical class 0.000 claims description 36
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 35
- 108020004414 DNA Proteins 0.000 claims description 30
- 210000004027 cell Anatomy 0.000 claims description 24
- 239000002105 nanoparticle Substances 0.000 claims description 24
- 108020005345 3' Untranslated Regions Proteins 0.000 claims description 21
- 241001678559 COVID-19 virus Species 0.000 claims description 21
- 108700026244 Open Reading Frames Proteins 0.000 claims description 20
- 108020003589 5' Untranslated Regions Proteins 0.000 claims description 19
- 102000053602 DNA Human genes 0.000 claims description 19
- 230000015572 biosynthetic process Effects 0.000 claims description 19
- 150000007523 nucleic acids Chemical class 0.000 claims description 19
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 17
- 208000025721 COVID-19 Diseases 0.000 claims description 16
- 150000001413 amino acids Chemical class 0.000 claims description 15
- 239000012634 fragment Substances 0.000 claims description 14
- 102000039446 nucleic acids Human genes 0.000 claims description 14
- 108020004707 nucleic acids Proteins 0.000 claims description 14
- 102100021519 Hemoglobin subunit beta Human genes 0.000 claims description 11
- 108091005904 Hemoglobin subunit beta Proteins 0.000 claims description 11
- 150000001875 compounds Chemical class 0.000 claims description 10
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Natural products C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 claims description 9
- 235000012000 cholesterol Nutrition 0.000 claims description 8
- 210000005220 cytoplasmic tail Anatomy 0.000 claims description 8
- 239000013604 expression vector Substances 0.000 claims description 7
- 230000007935 neutral effect Effects 0.000 claims description 7
- 108091026898 Leader sequence (mRNA) Proteins 0.000 claims description 6
- 102220590621 Spindlin-1_T19R_mutation Human genes 0.000 claims description 5
- 108010053584 alpha-Globins Proteins 0.000 claims description 5
- 238000003776 cleavage reaction Methods 0.000 claims description 5
- 230000007017 scission Effects 0.000 claims description 5
- 101710189104 Fibritin Proteins 0.000 claims description 4
- 102100034013 Gamma-glutamyl phosphate reductase Human genes 0.000 claims description 4
- 102100027685 Hemoglobin subunit alpha Human genes 0.000 claims description 4
- 108091005902 Hemoglobin subunit alpha Proteins 0.000 claims description 4
- 101001133924 Homo sapiens Gamma-glutamyl phosphate reductase Proteins 0.000 claims description 4
- 239000000546 pharmaceutical excipient Substances 0.000 claims description 4
- 238000005829 trimerization reaction Methods 0.000 claims description 4
- 108010077333 CAP1-6D Proteins 0.000 claims description 3
- 108090001126 Furin Proteins 0.000 claims description 3
- 108010031970 prostasin Proteins 0.000 claims description 3
- 206010011224 Cough Diseases 0.000 claims description 2
- 206010019233 Headaches Diseases 0.000 claims description 2
- 206010028748 Nasal obstruction Diseases 0.000 claims description 2
- 206010035664 Pneumonia Diseases 0.000 claims description 2
- 208000036071 Rhinorrhea Diseases 0.000 claims description 2
- 206010039101 Rhinorrhoea Diseases 0.000 claims description 2
- 206010040047 Sepsis Diseases 0.000 claims description 2
- 208000037883 airway inflammation Diseases 0.000 claims description 2
- 208000009190 disseminated intravascular coagulation Diseases 0.000 claims description 2
- 231100000869 headache Toxicity 0.000 claims description 2
- 239000000568 immunological adjuvant Substances 0.000 claims description 2
- 230000002265 prevention Effects 0.000 claims description 2
- 108091036066 Three prime untranslated region Proteins 0.000 claims 3
- 102100027241 Adenylyl cyclase-associated protein 1 Human genes 0.000 claims 1
- 102100035233 Furin Human genes 0.000 claims 1
- 230000001571 immunoadjuvant effect Effects 0.000 claims 1
- 238000000338 in vitro Methods 0.000 abstract description 25
- 238000013518 transcription Methods 0.000 abstract description 25
- 230000035897 transcription Effects 0.000 abstract description 24
- 239000013598 vector Substances 0.000 abstract description 23
- 125000000539 amino acid group Chemical group 0.000 abstract description 10
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 abstract description 6
- 230000028993 immune response Effects 0.000 abstract description 4
- 229940096437 Protein S Drugs 0.000 abstract description 3
- 101710198474 Spike protein Proteins 0.000 abstract description 3
- 125000001433 C-terminal amino-acid group Chemical group 0.000 abstract description 2
- 208000001528 Coronaviridae Infections Diseases 0.000 abstract 1
- 230000027455 binding Effects 0.000 description 27
- 108700021021 mRNA Vaccine Proteins 0.000 description 27
- 229940126582 mRNA vaccine Drugs 0.000 description 27
- 108090000623 proteins and genes Proteins 0.000 description 23
- 238000000034 method Methods 0.000 description 22
- 239000013612 plasmid Substances 0.000 description 22
- 108090000765 processed proteins & peptides Proteins 0.000 description 21
- 238000002649 immunization Methods 0.000 description 19
- 102000004169 proteins and genes Human genes 0.000 description 19
- 230000004927 fusion Effects 0.000 description 18
- 230000003053 immunization Effects 0.000 description 18
- 230000005764 inhibitory process Effects 0.000 description 18
- 230000003472 neutralizing effect Effects 0.000 description 18
- 235000018102 proteins Nutrition 0.000 description 17
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 16
- 210000004369 blood Anatomy 0.000 description 16
- 239000008280 blood Substances 0.000 description 16
- 102000053723 Angiotensin-converting enzyme 2 Human genes 0.000 description 15
- 108090000975 Angiotensin-converting enzyme 2 Proteins 0.000 description 15
- YMWUJEATGCHHMB-UHFFFAOYSA-N Dichloromethane Chemical compound ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 description 15
- 241000700605 Viruses Species 0.000 description 15
- 238000001514 detection method Methods 0.000 description 15
- 235000001014 amino acid Nutrition 0.000 description 14
- 239000002609 medium Substances 0.000 description 14
- 229940024606 amino acid Drugs 0.000 description 13
- 230000014509 gene expression Effects 0.000 description 13
- 241000282560 Macaca mulatta Species 0.000 description 12
- 238000002474 experimental method Methods 0.000 description 12
- 229920001184 polypeptide Polymers 0.000 description 12
- 102000004196 processed proteins & peptides Human genes 0.000 description 12
- 241000254158 Lampyridae Species 0.000 description 11
- 238000003786 synthesis reaction Methods 0.000 description 11
- 210000001519 tissue Anatomy 0.000 description 11
- 238000011725 BALB/c mouse Methods 0.000 description 10
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 10
- 108091007433 antigens Proteins 0.000 description 10
- 102000036639 antigens Human genes 0.000 description 10
- 239000002585 base Substances 0.000 description 10
- 238000006243 chemical reaction Methods 0.000 description 10
- 102000005962 receptors Human genes 0.000 description 10
- 108020003175 receptors Proteins 0.000 description 10
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 10
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 9
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 9
- YXFVVABEGXRONW-UHFFFAOYSA-N Toluene Chemical compound CC1=CC=CC=C1 YXFVVABEGXRONW-UHFFFAOYSA-N 0.000 description 9
- 210000004072 lung Anatomy 0.000 description 9
- 238000002360 preparation method Methods 0.000 description 9
- 239000002904 solvent Substances 0.000 description 9
- 238000011830 transgenic mouse model Methods 0.000 description 9
- 230000003612 virological effect Effects 0.000 description 9
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 8
- 238000002965 ELISA Methods 0.000 description 8
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 8
- 241000699670 Mus sp. Species 0.000 description 8
- 241001112090 Pseudovirus Species 0.000 description 8
- 238000012217 deletion Methods 0.000 description 8
- 230000037430 deletion Effects 0.000 description 8
- 239000002953 phosphate buffered saline Substances 0.000 description 8
- 230000002829 reductive effect Effects 0.000 description 8
- 238000013519 translation Methods 0.000 description 8
- 241000699660 Mus musculus Species 0.000 description 7
- 239000000427 antigen Substances 0.000 description 7
- 108010050848 glycylleucine Proteins 0.000 description 7
- 238000003780 insertion Methods 0.000 description 7
- 230000037431 insertion Effects 0.000 description 7
- 238000007918 intramuscular administration Methods 0.000 description 7
- 238000004519 manufacturing process Methods 0.000 description 7
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 7
- 108010061238 threonyl-glycine Proteins 0.000 description 7
- UHOVQNZJYSORNB-UHFFFAOYSA-N Benzene Chemical compound C1=CC=CC=C1 UHOVQNZJYSORNB-UHFFFAOYSA-N 0.000 description 6
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 6
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- 241001465754 Metazoa Species 0.000 description 6
- JRNVZBWKYDBUCA-UHFFFAOYSA-N N-chlorosuccinimide Chemical compound ClN1C(=O)CCC1=O JRNVZBWKYDBUCA-UHFFFAOYSA-N 0.000 description 6
- WYURNTSHIVDZCO-UHFFFAOYSA-N Tetrahydrofuran Chemical compound C1CCOC1 WYURNTSHIVDZCO-UHFFFAOYSA-N 0.000 description 6
- 230000005847 immunogenicity Effects 0.000 description 6
- 208000015181 infectious disease Diseases 0.000 description 6
- 230000003902 lesion Effects 0.000 description 6
- 239000002504 physiological saline solution Substances 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- 238000006467 substitution reaction Methods 0.000 description 6
- 238000001890 transfection Methods 0.000 description 6
- 108091026890 Coding region Proteins 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- 102220606761 Gap junction beta-1 protein_N30V_mutation Human genes 0.000 description 5
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 5
- 241000880493 Leptailurus serval Species 0.000 description 5
- 238000001704 evaporation Methods 0.000 description 5
- 230000008020 evaporation Effects 0.000 description 5
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 5
- 102000040430 polynucleotide Human genes 0.000 description 5
- 108091033319 polynucleotide Proteins 0.000 description 5
- 239000002157 polynucleotide Substances 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 229960002429 proline Drugs 0.000 description 5
- 239000001226 triphosphate Substances 0.000 description 5
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 4
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 4
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 4
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 4
- 102100038132 Endogenous retrovirus group K member 6 Pro protein Human genes 0.000 description 4
- UYTPUPDQBNUYGX-UHFFFAOYSA-N Guanine Natural products O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 4
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 4
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 4
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 4
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 4
- 241000699666 Mus <mouse, genus> Species 0.000 description 4
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 4
- OFBQJSOFQDEBGM-UHFFFAOYSA-N Pentane Chemical compound CCCCC OFBQJSOFQDEBGM-UHFFFAOYSA-N 0.000 description 4
- 108091005804 Peptidases Proteins 0.000 description 4
- 239000004365 Protease Substances 0.000 description 4
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 4
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 4
- 229960001230 asparagine Drugs 0.000 description 4
- 235000009582 asparagine Nutrition 0.000 description 4
- 238000003556 assay Methods 0.000 description 4
- 239000003054 catalyst Substances 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 239000003638 chemical reducing agent Substances 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 230000006870 function Effects 0.000 description 4
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 4
- 108010037850 glycylvaline Proteins 0.000 description 4
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 4
- 108010057821 leucylproline Proteins 0.000 description 4
- 239000002502 liposome Substances 0.000 description 4
- 230000011987 methylation Effects 0.000 description 4
- 238000007069 methylation reaction Methods 0.000 description 4
- 239000013642 negative control Substances 0.000 description 4
- 239000012074 organic phase Substances 0.000 description 4
- 229920000642 polymer Polymers 0.000 description 4
- LEHBURLTIWGHEM-UHFFFAOYSA-N pyridinium chlorochromate Chemical compound [O-][Cr](Cl)(=O)=O.C1=CC=[NH+]C=C1 LEHBURLTIWGHEM-UHFFFAOYSA-N 0.000 description 4
- 239000000725 suspension Substances 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 231100000419 toxicity Toxicity 0.000 description 4
- 230000001988 toxicity Effects 0.000 description 4
- SDXQFSSGNUPJIG-HZJYTTRNSA-N (9Z,12Z)-2-chlorooctadeca-9,12-dien-1-ol Chemical compound CCCCC\C=C/C\C=C/CCCCCCC(Cl)CO SDXQFSSGNUPJIG-HZJYTTRNSA-N 0.000 description 3
- HXLZULGRVFOIDK-HZJYTTRNSA-N (9z,12z)-octadeca-9,12-dienal Chemical compound CCCCC\C=C/C\C=C/CCCCCCCC=O HXLZULGRVFOIDK-HZJYTTRNSA-N 0.000 description 3
- RYHBNJHYFVUHQT-UHFFFAOYSA-N 1,4-Dioxane Chemical compound C1COCCO1 RYHBNJHYFVUHQT-UHFFFAOYSA-N 0.000 description 3
- 102100030988 Angiotensin-converting enzyme Human genes 0.000 description 3
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 3
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 3
- 241000282693 Cercopithecidae Species 0.000 description 3
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 3
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 3
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 3
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 3
- 108060001084 Luciferase Proteins 0.000 description 3
- 239000005089 Luciferase Substances 0.000 description 3
- 108010079364 N-glycylalanine Proteins 0.000 description 3
- 108090001074 Nucleocapsid Proteins Proteins 0.000 description 3
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 3
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 3
- KWYUFKZDYYNOTN-UHFFFAOYSA-M Potassium hydroxide Chemical compound [OH-].[K+] KWYUFKZDYYNOTN-UHFFFAOYSA-M 0.000 description 3
- 229930185560 Pseudouridine Natural products 0.000 description 3
- PTJWIQPHWPFNBW-UHFFFAOYSA-N Pseudouridine C Natural products OC1C(O)C(CO)OC1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-UHFFFAOYSA-N 0.000 description 3
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 3
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 3
- 150000001299 aldehydes Chemical class 0.000 description 3
- 239000007864 aqueous solution Substances 0.000 description 3
- WGDUUQDYDIIBKT-UHFFFAOYSA-N beta-Pseudouridine Natural products OC1OC(CN2C=CC(=O)NC2=O)C(O)C1O WGDUUQDYDIIBKT-UHFFFAOYSA-N 0.000 description 3
- 210000005013 brain tissue Anatomy 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 210000000170 cell membrane Anatomy 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 230000006957 competitive inhibition Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 125000000118 dimethyl group Chemical group [H]C([H])([H])* 0.000 description 3
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 3
- 239000003814 drug Substances 0.000 description 3
- 238000003818 flash chromatography Methods 0.000 description 3
- 108010087823 glycyltyrosine Proteins 0.000 description 3
- 210000002216 heart Anatomy 0.000 description 3
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- HXLZULGRVFOIDK-UHFFFAOYSA-N linoleic aldehyde Natural products CCCCCC=CCC=CCCCCCCCC=O HXLZULGRVFOIDK-UHFFFAOYSA-N 0.000 description 3
- 210000004185 liver Anatomy 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 239000000178 monomer Substances 0.000 description 3
- 239000007800 oxidant agent Substances 0.000 description 3
- 239000002245 particle Substances 0.000 description 3
- 239000013600 plasmid vector Substances 0.000 description 3
- 108010029020 prolylglycine Proteins 0.000 description 3
- PTJWIQPHWPFNBW-GBNDHIKLSA-N pseudouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-GBNDHIKLSA-N 0.000 description 3
- 239000011541 reaction mixture Substances 0.000 description 3
- 238000003030 reporter gene method Methods 0.000 description 3
- 210000000952 spleen Anatomy 0.000 description 3
- 238000003756 stirring Methods 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- YLQBMQCUIZJEEH-UHFFFAOYSA-N tetrahydrofuran Natural products C=1C=COC=1 YLQBMQCUIZJEEH-UHFFFAOYSA-N 0.000 description 3
- 238000004809 thin layer chromatography Methods 0.000 description 3
- 235000011178 triphosphate Nutrition 0.000 description 3
- 108010073969 valyllysine Proteins 0.000 description 3
- 108010009962 valyltyrosine Proteins 0.000 description 3
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 description 2
- GVJHHUAWPYXKBD-UHFFFAOYSA-N (±)-α-Tocopherol Chemical compound OC1=C(C)C(C)=C2OC(CCCC(C)CCCC(C)CCCC(C)C)(C)CCC2=C1C GVJHHUAWPYXKBD-UHFFFAOYSA-N 0.000 description 2
- SCYULBFZEHDVBN-UHFFFAOYSA-N 1,1-Dichloroethane Chemical compound CC(Cl)Cl SCYULBFZEHDVBN-UHFFFAOYSA-N 0.000 description 2
- NRJAVPSFFCBXDT-HUESYALOSA-N 1,2-distearoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCCCCCCCCCCCC NRJAVPSFFCBXDT-HUESYALOSA-N 0.000 description 2
- UOXJNGFFPMOZDM-UHFFFAOYSA-N 2-[di(propan-2-yl)amino]ethylsulfanyl-methylphosphinic acid Chemical compound CC(C)N(C(C)C)CCSP(C)(O)=O UOXJNGFFPMOZDM-UHFFFAOYSA-N 0.000 description 2
- BBGNINPPDHJETF-UHFFFAOYSA-N 5-heptadecylresorcinol Chemical compound CCCCCCCCCCCCCCCCCC1=CC(O)=CC(O)=C1 BBGNINPPDHJETF-UHFFFAOYSA-N 0.000 description 2
- SFHYNDMGZXWXBU-LIMNOBDPSA-N 6-amino-2-[[(e)-(3-formylphenyl)methylideneamino]carbamoylamino]-1,3-dioxobenzo[de]isoquinoline-5,8-disulfonic acid Chemical compound O=C1C(C2=3)=CC(S(O)(=O)=O)=CC=3C(N)=C(S(O)(=O)=O)C=C2C(=O)N1NC(=O)N\N=C\C1=CC=CC(C=O)=C1 SFHYNDMGZXWXBU-LIMNOBDPSA-N 0.000 description 2
- 208000035657 Abasia Diseases 0.000 description 2
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 2
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 2
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 2
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 2
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 2
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 2
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 2
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 2
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 2
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 2
- QNJIRRVTOXNGMH-GUBZILKMSA-N Asn-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(N)=O QNJIRRVTOXNGMH-GUBZILKMSA-N 0.000 description 2
- QPTAGIPWARILES-AVGNSLFASA-N Asn-Gln-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QPTAGIPWARILES-AVGNSLFASA-N 0.000 description 2
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 2
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 2
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 2
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 2
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 2
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- XUVTWGPERWIERB-IHRRRGAJSA-N Asp-Pro-Phe Chemical compound N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O XUVTWGPERWIERB-IHRRRGAJSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 208000031648 Body Weight Changes Diseases 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 2
- IAYPIBMASNFSPL-UHFFFAOYSA-N Ethylene oxide Chemical compound C1CO1 IAYPIBMASNFSPL-UHFFFAOYSA-N 0.000 description 2
- 102000004961 Furin Human genes 0.000 description 2
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 2
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 2
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 2
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 2
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 2
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 2
- FQFWFZWOHOEVMZ-IHRRRGAJSA-N Glu-Phe-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FQFWFZWOHOEVMZ-IHRRRGAJSA-N 0.000 description 2
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 2
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 2
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 2
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 2
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 2
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 2
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 2
- WTUSRDZLLWGYAT-KCTSRDHCSA-N Gly-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN WTUSRDZLLWGYAT-KCTSRDHCSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- ILUVWFTXAUYOBW-CUJWVEQBSA-N His-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N)O ILUVWFTXAUYOBW-CUJWVEQBSA-N 0.000 description 2
- 101000929928 Homo sapiens Angiotensin-converting enzyme 2 Proteins 0.000 description 2
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 2
- APDIECQNNDGFPD-PYJNHQTQSA-N Ile-His-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N APDIECQNNDGFPD-PYJNHQTQSA-N 0.000 description 2
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- 102100033421 Keratin, type I cytoskeletal 18 Human genes 0.000 description 2
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 2
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 2
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 2
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 2
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 2
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 2
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 2
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 2
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 2
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 2
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 2
- 239000012097 Lipofectamine 2000 Substances 0.000 description 2
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 2
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 2
- QFSYGUMEANRNJE-DCAQKATOSA-N Lys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N QFSYGUMEANRNJE-DCAQKATOSA-N 0.000 description 2
- 229930195725 Mannitol Natural products 0.000 description 2
- 102000012750 Membrane Glycoproteins Human genes 0.000 description 2
- 108010090054 Membrane Glycoproteins Proteins 0.000 description 2
- BZLVMXJERCGZMT-UHFFFAOYSA-N Methyl tert-butyl ether Chemical compound COC(C)(C)C BZLVMXJERCGZMT-UHFFFAOYSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- OMHMIXFFRPMYHB-SRVKXCTJSA-N Phe-Cys-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OMHMIXFFRPMYHB-SRVKXCTJSA-N 0.000 description 2
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 2
- PMKIMKUGCSVFSV-CQDKDKBSSA-N Phe-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PMKIMKUGCSVFSV-CQDKDKBSSA-N 0.000 description 2
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 2
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 2
- ZLMJMSJWJFRBEC-UHFFFAOYSA-N Potassium Chemical compound [K] ZLMJMSJWJFRBEC-UHFFFAOYSA-N 0.000 description 2
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 2
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 2
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 2
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 2
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 2
- 102100029500 Prostasin Human genes 0.000 description 2
- 102220537314 Protein NDRG2_N30E_mutation Human genes 0.000 description 2
- 102220537299 Protein NDRG2_N30R_mutation Human genes 0.000 description 2
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- 108700008625 Reporter Genes Proteins 0.000 description 2
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 2
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 2
- FRPNVPKQVFHSQY-BPUTZDHNSA-N Ser-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FRPNVPKQVFHSQY-BPUTZDHNSA-N 0.000 description 2
- 101000629318 Severe acute respiratory syndrome coronavirus 2 Spike glycoprotein Proteins 0.000 description 2
- PMZURENOXWZQFD-UHFFFAOYSA-L Sodium Sulfate Chemical compound [Na+].[Na+].[O-]S([O-])(=O)=O PMZURENOXWZQFD-UHFFFAOYSA-L 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 2
- 229930006000 Sucrose Natural products 0.000 description 2
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 2
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 2
- UTSWGQNAQRIHAI-UNQGMJICSA-N Thr-Arg-Phe Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UTSWGQNAQRIHAI-UNQGMJICSA-N 0.000 description 2
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 2
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 2
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 2
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 2
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 2
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 2
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 2
- WVGKPKDWYQXWLU-BZSNNMDCSA-N Tyr-His-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WVGKPKDWYQXWLU-BZSNNMDCSA-N 0.000 description 2
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 2
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 2
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 2
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 2
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 2
- PGBMPFKFKXYROZ-UFYCRDLUSA-N Val-Tyr-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N PGBMPFKFKXYROZ-UFYCRDLUSA-N 0.000 description 2
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 2
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 2
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 2
- 238000002835 absorbance Methods 0.000 description 2
- 238000001042 affinity chromatography Methods 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- 125000003172 aldehyde group Chemical group 0.000 description 2
- HDTRYLNUVZCQOY-LIZSDCNHSA-N alpha,alpha-trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 description 2
- 150000001412 amines Chemical class 0.000 description 2
- 238000010171 animal model Methods 0.000 description 2
- 230000000845 anti-microbial effect Effects 0.000 description 2
- 229940121375 antifungal agent Drugs 0.000 description 2
- 239000003429 antifungal agent Substances 0.000 description 2
- 239000004599 antimicrobial Substances 0.000 description 2
- 239000008346 aqueous phase Substances 0.000 description 2
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 229940031567 attenuated vaccine Drugs 0.000 description 2
- 239000011324 bead Substances 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- WPYMKLBDIGXBTP-UHFFFAOYSA-N benzoic acid Chemical compound OC(=O)C1=CC=CC=C1 WPYMKLBDIGXBTP-UHFFFAOYSA-N 0.000 description 2
- 230000004579 body weight change Effects 0.000 description 2
- 210000004556 brain Anatomy 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 230000009137 competitive binding Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 239000002577 cryoprotective agent Substances 0.000 description 2
- 238000005520 cutting process Methods 0.000 description 2
- 108010069495 cysteinyltyrosine Proteins 0.000 description 2
- 210000000805 cytoplasm Anatomy 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- UQLDLKMNUJERMK-UHFFFAOYSA-L di(octadecanoyloxy)lead Chemical compound [Pb+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O UQLDLKMNUJERMK-UHFFFAOYSA-L 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- 231100000673 dose–response relationship Toxicity 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000005538 encapsulation Methods 0.000 description 2
- 235000019441 ethanol Nutrition 0.000 description 2
- 239000012065 filter cake Substances 0.000 description 2
- 239000000706 filtrate Substances 0.000 description 2
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 2
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 2
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 2
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 150000008282 halocarbons Chemical class 0.000 description 2
- 102000048657 human ACE2 Human genes 0.000 description 2
- 238000011577 humanized mouse model Methods 0.000 description 2
- 229930195733 hydrocarbon Natural products 0.000 description 2
- 150000002430 hydrocarbons Chemical class 0.000 description 2
- 230000036039 immunity Effects 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 229940031551 inactivated vaccine Drugs 0.000 description 2
- 230000000968 intestinal effect Effects 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 210000003734 kidney Anatomy 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 239000008101 lactose Substances 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- NUJOXMJBOLGQSY-UHFFFAOYSA-N manganese dioxide Chemical compound O=[Mn]=O NUJOXMJBOLGQSY-UHFFFAOYSA-N 0.000 description 2
- 239000000594 mannitol Substances 0.000 description 2
- 235000010355 mannitol Nutrition 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- LXCFILQKKLGQFO-UHFFFAOYSA-N methylparaben Chemical compound COC(=O)C1=CC=C(O)C=C1 LXCFILQKKLGQFO-UHFFFAOYSA-N 0.000 description 2
- 239000011259 mixed solution Substances 0.000 description 2
- 239000012046 mixed solvent Substances 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 238000006386 neutralization reaction Methods 0.000 description 2
- FIYYMXYOBLWYQO-UHFFFAOYSA-N ortho-iodylbenzoic acid Chemical compound OC(=O)C1=CC=CC=C1I(=O)=O FIYYMXYOBLWYQO-UHFFFAOYSA-N 0.000 description 2
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 2
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 229910052698 phosphorus Inorganic materials 0.000 description 2
- 239000013641 positive control Substances 0.000 description 2
- 229910052700 potassium Inorganic materials 0.000 description 2
- 239000003755 preservative agent Substances 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 238000003908 quality control method Methods 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 239000011734 sodium Substances 0.000 description 2
- 239000012279 sodium borohydride Substances 0.000 description 2
- 229910000033 sodium borohydride Inorganic materials 0.000 description 2
- CDBYLPFSWZWCQE-UHFFFAOYSA-L sodium carbonate Substances [Na+].[Na+].[O-]C([O-])=O CDBYLPFSWZWCQE-UHFFFAOYSA-L 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 235000019333 sodium laurylsulphate Nutrition 0.000 description 2
- 229910052938 sodium sulfate Inorganic materials 0.000 description 2
- 235000011152 sodium sulphate Nutrition 0.000 description 2
- HPALAKNZSZLMCH-UHFFFAOYSA-M sodium;chloride;hydrate Chemical class O.[Na+].[Cl-] HPALAKNZSZLMCH-UHFFFAOYSA-M 0.000 description 2
- 239000005720 sucrose Substances 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical group CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- UNXRWKVEANCORM-UHFFFAOYSA-N triphosphoric acid Chemical compound OP(O)(=O)OP(O)(=O)OP(O)(O)=O UNXRWKVEANCORM-UHFFFAOYSA-N 0.000 description 2
- 229910052721 tungsten Inorganic materials 0.000 description 2
- 229950010342 uridine triphosphate Drugs 0.000 description 2
- 229910052720 vanadium Inorganic materials 0.000 description 2
- 239000013603 viral vector Substances 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- 229910052727 yttrium Inorganic materials 0.000 description 2
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 1
- QZNNVYOVQUKYSC-JEDNCBNOSA-N (2s)-2-amino-3-(1h-imidazol-5-yl)propanoic acid;hydron;chloride Chemical compound Cl.OC(=O)[C@@H](N)CC1=CN=CN1 QZNNVYOVQUKYSC-JEDNCBNOSA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- MPCAJMNYNOGXPB-UHFFFAOYSA-N 1,5-anhydrohexitol Chemical class OCC1OCC(O)C(O)C1O MPCAJMNYNOGXPB-UHFFFAOYSA-N 0.000 description 1
- HVPUUBKTSZTLGO-JNECXHHRSA-N 1-[(2r,3r,4s,5r)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-5-methylpyrimidine-2,4-dione Chemical compound O=C1NC(=O)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1.O=C1NC(=O)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 HVPUUBKTSZTLGO-JNECXHHRSA-N 0.000 description 1
- FKKAGFLIPSSCHT-UHFFFAOYSA-N 1-dodecoxydodecane;sulfuric acid Chemical compound OS(O)(=O)=O.CCCCCCCCCCCCOCCCCCCCCCCCC FKKAGFLIPSSCHT-UHFFFAOYSA-N 0.000 description 1
- WCNYVTLGUXBXLV-UHFFFAOYSA-N 13,14,15-trihydroxyheptacosane-12,16-dione Chemical compound CCCCCCCCCCCC(=O)C(O)C(O)C(O)C(=O)CCCCCCCCCCC WCNYVTLGUXBXLV-UHFFFAOYSA-N 0.000 description 1
- HKJAWHYHRVVDHK-UHFFFAOYSA-N 15,16,17-trihydroxyhentriacontane-14,18-dione Chemical compound CCCCCCCCCCCCCC(=O)C(O)C(O)C(O)C(=O)CCCCCCCCCCCCC HKJAWHYHRVVDHK-UHFFFAOYSA-N 0.000 description 1
- BVKFQEAERCHBTG-UHFFFAOYSA-N 17,18,19-trihydroxypentatriacontane-16,20-dione Chemical compound CCCCCCCCCCCCCCCC(=O)C(O)C(O)C(O)C(=O)CCCCCCCCCCCCCCC BVKFQEAERCHBTG-UHFFFAOYSA-N 0.000 description 1
- NHBKXEKEPDILRR-UHFFFAOYSA-N 2,3-bis(butanoylsulfanyl)propyl butanoate Chemical compound CCCC(=O)OCC(SC(=O)CCC)CSC(=O)CCC NHBKXEKEPDILRR-UHFFFAOYSA-N 0.000 description 1
- LDGWQMRUWMSZIU-LQDDAWAPSA-M 2,3-bis[(z)-octadec-9-enoxy]propyl-trimethylazanium;chloride Chemical compound [Cl-].CCCCCCCC\C=C/CCCCCCCCOCC(C[N+](C)(C)C)OCCCCCCCC\C=C/CCCCCCCC LDGWQMRUWMSZIU-LQDDAWAPSA-M 0.000 description 1
- KSXTUUUQYQYKCR-LQDDAWAPSA-M 2,3-bis[[(z)-octadec-9-enoyl]oxy]propyl-trimethylazanium;chloride Chemical compound [Cl-].CCCCCCCC\C=C/CCCCCCCC(=O)OCC(C[N+](C)(C)C)OC(=O)CCCCCCC\C=C/CCCCCCCC KSXTUUUQYQYKCR-LQDDAWAPSA-M 0.000 description 1
- XLGVHAQDCFITCH-UHFFFAOYSA-N 2,3-dihydroxypropanamide Chemical compound NC(=O)C(O)CO XLGVHAQDCFITCH-UHFFFAOYSA-N 0.000 description 1
- GVNVAWHJIKLAGL-UHFFFAOYSA-N 2-(cyclohexen-1-yl)cyclohexan-1-one Chemical compound O=C1CCCCC1C1=CCCCC1 GVNVAWHJIKLAGL-UHFFFAOYSA-N 0.000 description 1
- NWFLONJLUJYCNS-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-3-phenylpropanoyl)amino]acetyl]amino]acetyl]amino]-3-phenylpropanoic acid Chemical compound C=1C=CC=CC=1CC(C(O)=O)NC(=O)CNC(=O)CNC(=O)C(N)CC1=CC=CC=C1 NWFLONJLUJYCNS-UHFFFAOYSA-N 0.000 description 1
- WDNZYZQNVCYYON-JNECXHHRSA-N 4-amino-1-[(2R,3R,4S,5R)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-5-methylpyrimidin-2-one Chemical compound CC=1C(=NC(N([C@H]2[C@H](O)[C@H](O)[C@@H](CO)O2)C1)=O)N.CC=1C(=NC(N([C@H]2[C@H](O)[C@H](O)[C@@H](CO)O2)C1)=O)N WDNZYZQNVCYYON-JNECXHHRSA-N 0.000 description 1
- FJKROLUGYXJWQN-UHFFFAOYSA-N 4-hydroxybenzoic acid Chemical compound OC(=O)C1=CC=C(O)C=C1 FJKROLUGYXJWQN-UHFFFAOYSA-N 0.000 description 1
- LZINOQJQXIEBNN-UHFFFAOYSA-N 4-hydroxybutyl dihydrogen phosphate Chemical class OCCCCOP(O)(O)=O LZINOQJQXIEBNN-UHFFFAOYSA-N 0.000 description 1
- XYVLZAYJHCECPN-UHFFFAOYSA-N 6-aminohexyl phosphate Chemical class NCCCCCCOP(O)(O)=O XYVLZAYJHCECPN-UHFFFAOYSA-N 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 1
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 1
- ZDYNWWQXFRUOEO-XDTLVQLUSA-N Ala-Gln-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDYNWWQXFRUOEO-XDTLVQLUSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 1
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 1
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 1
- FAJIYNONGXEXAI-CQDKDKBSSA-N Ala-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 FAJIYNONGXEXAI-CQDKDKBSSA-N 0.000 description 1
- FOHXUHGZZKETFI-JBDRJPRFSA-N Ala-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N FOHXUHGZZKETFI-JBDRJPRFSA-N 0.000 description 1
- NMXKFWOEASXOGB-QSFUFRPTSA-N Ala-Ile-His Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NMXKFWOEASXOGB-QSFUFRPTSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 1
- XAXHGSOBFPIRFG-LSJOCFKGSA-N Ala-Pro-His Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XAXHGSOBFPIRFG-LSJOCFKGSA-N 0.000 description 1
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- XMIAMUXIMWREBJ-HERUPUMHSA-N Ala-Trp-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XMIAMUXIMWREBJ-HERUPUMHSA-N 0.000 description 1
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- 241000180579 Arca Species 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 1
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 1
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 1
- VEAIMHJZTIDCIH-KKUMJFAQSA-N Arg-Phe-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEAIMHJZTIDCIH-KKUMJFAQSA-N 0.000 description 1
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 1
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 1
- YNSCBOUZTAGIGO-ZLUOBGJFSA-N Asn-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N YNSCBOUZTAGIGO-ZLUOBGJFSA-N 0.000 description 1
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 1
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- VWJFQGXPYOPXJH-ZLUOBGJFSA-N Asn-Cys-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)C(=O)N VWJFQGXPYOPXJH-ZLUOBGJFSA-N 0.000 description 1
- CZIXHXIJJZLYRJ-SRVKXCTJSA-N Asn-Cys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CZIXHXIJJZLYRJ-SRVKXCTJSA-N 0.000 description 1
- AYKKKGFJXIDYLX-ACZMJKKPSA-N Asn-Gln-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AYKKKGFJXIDYLX-ACZMJKKPSA-N 0.000 description 1
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 1
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 1
- SUEIIIFUBHDCCS-PBCZWWQYSA-N Asn-His-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUEIIIFUBHDCCS-PBCZWWQYSA-N 0.000 description 1
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 1
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 1
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 1
- RTFWCVDISAMGEQ-SRVKXCTJSA-N Asn-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N RTFWCVDISAMGEQ-SRVKXCTJSA-N 0.000 description 1
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 1
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 1
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 1
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- HBUJSDCLZCXXCW-YDHLFZDLSA-N Asn-Val-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HBUJSDCLZCXXCW-YDHLFZDLSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- FTNVLGCFIJEMQT-CIUDSAMLSA-N Asp-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N FTNVLGCFIJEMQT-CIUDSAMLSA-N 0.000 description 1
- DZQKLNLLWFQONU-LKXGYXEUSA-N Asp-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)O DZQKLNLLWFQONU-LKXGYXEUSA-N 0.000 description 1
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 1
- LTXGDRFJRZSZAV-CIUDSAMLSA-N Asp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N LTXGDRFJRZSZAV-CIUDSAMLSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 1
- WZUZGDANRQPCDD-SRVKXCTJSA-N Asp-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N WZUZGDANRQPCDD-SRVKXCTJSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 1
- BKOIIURTQAJHAT-GUBZILKMSA-N Asp-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 BKOIIURTQAJHAT-GUBZILKMSA-N 0.000 description 1
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 1
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- 239000005711 Benzoic acid Substances 0.000 description 1
- 239000004255 Butylated hydroxyanisole Substances 0.000 description 1
- 101150077194 CAP1 gene Proteins 0.000 description 1
- 101710180456 CD-NTase-associated protein 4 Proteins 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 102100026548 Caspase-8 Human genes 0.000 description 1
- 101150065749 Churc1 gene Proteins 0.000 description 1
- 108091033380 Coding strand Proteins 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- XMTDCXXLDZKAGI-ACZMJKKPSA-N Cys-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N XMTDCXXLDZKAGI-ACZMJKKPSA-N 0.000 description 1
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 1
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 1
- WDQXKVCQXRNOSI-GHCJXIJMSA-N Cys-Asp-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WDQXKVCQXRNOSI-GHCJXIJMSA-N 0.000 description 1
- WXKWQSDHEXKKNC-ZKWXMUAHSA-N Cys-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N WXKWQSDHEXKKNC-ZKWXMUAHSA-N 0.000 description 1
- ATPDEYTYWVMINF-ZLUOBGJFSA-N Cys-Cys-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O ATPDEYTYWVMINF-ZLUOBGJFSA-N 0.000 description 1
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 1
- SKSJPIBFNFPTJB-NKWVEPMBSA-N Cys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CS)N)C(=O)O SKSJPIBFNFPTJB-NKWVEPMBSA-N 0.000 description 1
- VPQZSNQICFCCSO-BJDJZHNGSA-N Cys-Leu-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VPQZSNQICFCCSO-BJDJZHNGSA-N 0.000 description 1
- UCSXXFRXHGUXCQ-SRVKXCTJSA-N Cys-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N UCSXXFRXHGUXCQ-SRVKXCTJSA-N 0.000 description 1
- QQOWCDCBFFBRQH-IXOXFDKPSA-N Cys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N)O QQOWCDCBFFBRQH-IXOXFDKPSA-N 0.000 description 1
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 1
- JLZCAZJGWNRXCI-XKBZYTNZSA-N Cys-Thr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O JLZCAZJGWNRXCI-XKBZYTNZSA-N 0.000 description 1
- JTEGHEWKBCTIAL-IXOXFDKPSA-N Cys-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N)O JTEGHEWKBCTIAL-IXOXFDKPSA-N 0.000 description 1
- FCXJJTRGVAZDER-FXQIFTODSA-N Cys-Val-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O FCXJJTRGVAZDER-FXQIFTODSA-N 0.000 description 1
- JRZMCSIUYGSJKP-ZKWXMUAHSA-N Cys-Val-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JRZMCSIUYGSJKP-ZKWXMUAHSA-N 0.000 description 1
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 1
- ZZZCUOFIHGPKAK-UHFFFAOYSA-N D-erythro-ascorbic acid Natural products OCC1OC(=O)C(O)=C1O ZZZCUOFIHGPKAK-UHFFFAOYSA-N 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 238000006646 Dess-Martin oxidation reaction Methods 0.000 description 1
- 238000012286 ELISA Assay Methods 0.000 description 1
- 238000008157 ELISA kit Methods 0.000 description 1
- 101710091045 Envelope protein Proteins 0.000 description 1
- 101710204837 Envelope small membrane protein Proteins 0.000 description 1
- 239000004593 Epoxy Substances 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 206010059410 Faecaluria Diseases 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 1
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 1
- KZEUVLLVULIPNX-GUBZILKMSA-N Gln-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N KZEUVLLVULIPNX-GUBZILKMSA-N 0.000 description 1
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 1
- LVNILKSSFHCSJZ-IHRRRGAJSA-N Gln-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LVNILKSSFHCSJZ-IHRRRGAJSA-N 0.000 description 1
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 1
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 1
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 1
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 1
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 1
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 1
- JKGHMESJHRTHIC-SIUGBPQLSA-N Gln-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JKGHMESJHRTHIC-SIUGBPQLSA-N 0.000 description 1
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 1
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 1
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 1
- QBEWLBKBGXVVPD-RYUDHWBXSA-N Gln-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N QBEWLBKBGXVVPD-RYUDHWBXSA-N 0.000 description 1
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 1
- UWMDGPFFTKDUIY-HJGDQZAQSA-N Gln-Pro-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWMDGPFFTKDUIY-HJGDQZAQSA-N 0.000 description 1
- DCWNCMRZIZSZBL-KKUMJFAQSA-N Gln-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O DCWNCMRZIZSZBL-KKUMJFAQSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- GHAXJVNBAKGWEJ-AVGNSLFASA-N Gln-Ser-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GHAXJVNBAKGWEJ-AVGNSLFASA-N 0.000 description 1
- DYVMTEWCGAVKSE-HJGDQZAQSA-N Gln-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O DYVMTEWCGAVKSE-HJGDQZAQSA-N 0.000 description 1
- YRHZWVKUFWCEPW-GLLZPBPUSA-N Gln-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YRHZWVKUFWCEPW-GLLZPBPUSA-N 0.000 description 1
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 1
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- KVBPDJIFRQUQFY-ACZMJKKPSA-N Glu-Cys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O KVBPDJIFRQUQFY-ACZMJKKPSA-N 0.000 description 1
- FKGNJUCQKXQNRA-NRPADANISA-N Glu-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O FKGNJUCQKXQNRA-NRPADANISA-N 0.000 description 1
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 1
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 1
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 1
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 1
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 1
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 1
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- NMROINAYXCACKF-WHFBIAKZSA-N Gly-Cys-Cys Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O NMROINAYXCACKF-WHFBIAKZSA-N 0.000 description 1
- PEZZSFLFXXFUQD-XPUUQOCRSA-N Gly-Cys-Val Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O PEZZSFLFXXFUQD-XPUUQOCRSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 1
- LUJVWKKYHSLULQ-ZKWXMUAHSA-N Gly-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN LUJVWKKYHSLULQ-ZKWXMUAHSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 1
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 1
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 1
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- JZNWSCPGTDBMEW-UHFFFAOYSA-N Glycerophosphorylethanolamin Natural products NCCOP(O)(=O)OCC(O)CO JZNWSCPGTDBMEW-UHFFFAOYSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 101710114810 Glycoprotein Proteins 0.000 description 1
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 1
- HTZKFIYQMHJWSQ-INTQDDNPSA-N His-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HTZKFIYQMHJWSQ-INTQDDNPSA-N 0.000 description 1
- LSQHWKPPOFDHHZ-YUMQZZPRSA-N His-Asp-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LSQHWKPPOFDHHZ-YUMQZZPRSA-N 0.000 description 1
- MJUUWJJEUOBDGW-IHRRRGAJSA-N His-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MJUUWJJEUOBDGW-IHRRRGAJSA-N 0.000 description 1
- UPJODPVSKKWGDQ-KLHWPWHYSA-N His-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O UPJODPVSKKWGDQ-KLHWPWHYSA-N 0.000 description 1
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 1
- KECFCPNPPYCGBL-PMVMPFDFSA-N His-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CN=CN4)N KECFCPNPPYCGBL-PMVMPFDFSA-N 0.000 description 1
- GYXDQXPCPASCNR-NHCYSSNCSA-N His-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N GYXDQXPCPASCNR-NHCYSSNCSA-N 0.000 description 1
- 101000897856 Homo sapiens Adenylyl cyclase-associated protein 2 Proteins 0.000 description 1
- 101000998020 Homo sapiens Keratin, type I cytoskeletal 18 Proteins 0.000 description 1
- 101000836079 Homo sapiens Serpin B8 Proteins 0.000 description 1
- 101000836075 Homo sapiens Serpin B9 Proteins 0.000 description 1
- 101000661807 Homo sapiens Suppressor of tumorigenicity 14 protein Proteins 0.000 description 1
- 101000798702 Homo sapiens Transmembrane protease serine 4 Proteins 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 1
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 1
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- PFTFEWHJSAXGED-ZKWXMUAHSA-N Ile-Cys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N PFTFEWHJSAXGED-ZKWXMUAHSA-N 0.000 description 1
- LLHYWBGDMBGNHA-VGDYDELISA-N Ile-Cys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LLHYWBGDMBGNHA-VGDYDELISA-N 0.000 description 1
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 1
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 1
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 1
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 1
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- CEPIAEUVRKGPGP-DSYPUSFNSA-N Ile-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 CEPIAEUVRKGPGP-DSYPUSFNSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- RTSQPLLOYSGMKM-DSYPUSFNSA-N Ile-Trp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N RTSQPLLOYSGMKM-DSYPUSFNSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- IPFKIGNDTUOFAF-CYDGBPFRSA-N Ile-Val-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IPFKIGNDTUOFAF-CYDGBPFRSA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 108010066327 Keratin-18 Proteins 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- GZAUZBUKDXYPEH-CIUDSAMLSA-N Leu-Cys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N GZAUZBUKDXYPEH-CIUDSAMLSA-N 0.000 description 1
- YORLGJINWYYIMX-KKUMJFAQSA-N Leu-Cys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YORLGJINWYYIMX-KKUMJFAQSA-N 0.000 description 1
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- HMDDEJADNKQTBR-BZSNNMDCSA-N Leu-His-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMDDEJADNKQTBR-BZSNNMDCSA-N 0.000 description 1
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- ORWTWZXGDBYVCP-BJDJZHNGSA-N Leu-Ile-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(C)C ORWTWZXGDBYVCP-BJDJZHNGSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 1
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- OYHQOLUKZRVURQ-HZJYTTRNSA-N Linoleic acid Chemical compound CCCCC\C=C/C\C=C/CCCCCCCC(O)=O OYHQOLUKZRVURQ-HZJYTTRNSA-N 0.000 description 1
- BTSXLXFPMZXVPR-DLOVCJGASA-N Lys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BTSXLXFPMZXVPR-DLOVCJGASA-N 0.000 description 1
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 1
- XFBBBRDEQIPGNR-KATARQTJSA-N Lys-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O XFBBBRDEQIPGNR-KATARQTJSA-N 0.000 description 1
- DZQYZKPINJLLEN-KKUMJFAQSA-N Lys-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O DZQYZKPINJLLEN-KKUMJFAQSA-N 0.000 description 1
- OPTCSTACHGNULU-DCAQKATOSA-N Lys-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCCN OPTCSTACHGNULU-DCAQKATOSA-N 0.000 description 1
- VSRXPEHZMHSFKU-IUCAKERBSA-N Lys-Gln-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VSRXPEHZMHSFKU-IUCAKERBSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- IRRZDAIFYHNIIN-JYJNAYRXSA-N Lys-Gln-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IRRZDAIFYHNIIN-JYJNAYRXSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- XREQQOATSMMAJP-MGHWNKPDSA-N Lys-Ile-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XREQQOATSMMAJP-MGHWNKPDSA-N 0.000 description 1
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 1
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 1
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- ZVXSESPJMKNIQA-YXMSTPNBSA-N Lys-Thr-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ZVXSESPJMKNIQA-YXMSTPNBSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 1
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 1
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 101710145006 Lysis protein Proteins 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 101710085938 Matrix protein Proteins 0.000 description 1
- 108091027974 Mature messenger RNA Proteins 0.000 description 1
- 102000018697 Membrane Proteins Human genes 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- 101710127721 Membrane protein Proteins 0.000 description 1
- BVXXDMUMHMXFER-BPNCWPANSA-N Met-Ala-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVXXDMUMHMXFER-BPNCWPANSA-N 0.000 description 1
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 1
- KLFPZIUIXZNEKY-DCAQKATOSA-N Met-Gln-Met Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O KLFPZIUIXZNEKY-DCAQKATOSA-N 0.000 description 1
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 1
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 1
- LIIXIZKVWNYQHB-STECZYCISA-N Met-Tyr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LIIXIZKVWNYQHB-STECZYCISA-N 0.000 description 1
- 101100245221 Mus musculus Prss8 gene Proteins 0.000 description 1
- VQAYFKKCNSOZKM-IOSLPCCCSA-N N(6)-methyladenosine Chemical compound C1=NC=2C(NC)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O VQAYFKKCNSOZKM-IOSLPCCCSA-N 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- PCLIMKBDDGJMGD-UHFFFAOYSA-N N-bromosuccinimide Chemical compound BrN1C(=O)CCC1=O PCLIMKBDDGJMGD-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- WCZKTUCDHDAAGU-UHFFFAOYSA-L N1=CC=CC=C1.[Cr](=O)(=O)(Cl)Cl Chemical compound N1=CC=CC=C1.[Cr](=O)(=O)(Cl)Cl WCZKTUCDHDAAGU-UHFFFAOYSA-L 0.000 description 1
- VQAYFKKCNSOZKM-UHFFFAOYSA-N NSC 29409 Natural products C1=NC=2C(NC)=NC=NC=2N1C1OC(CO)C(O)C1O VQAYFKKCNSOZKM-UHFFFAOYSA-N 0.000 description 1
- 101710141454 Nucleoprotein Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 1
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 1
- UUWCIPUVJJIEEP-SRVKXCTJSA-N Phe-Asn-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N UUWCIPUVJJIEEP-SRVKXCTJSA-N 0.000 description 1
- KAHUBGWSIQNZQQ-KKUMJFAQSA-N Phe-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KAHUBGWSIQNZQQ-KKUMJFAQSA-N 0.000 description 1
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 1
- ALHULIGNEXGFRM-QWRGUYRKSA-N Phe-Cys-Gly Chemical compound OC(=O)CNC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=CC=C1 ALHULIGNEXGFRM-QWRGUYRKSA-N 0.000 description 1
- PSBJZLMFFTULDX-IXOXFDKPSA-N Phe-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N)O PSBJZLMFFTULDX-IXOXFDKPSA-N 0.000 description 1
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 1
- NKLDZIPTGKBDBB-HTUGSXCWSA-N Phe-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O NKLDZIPTGKBDBB-HTUGSXCWSA-N 0.000 description 1
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 1
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 1
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 1
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 1
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- KPEIBEPEUAZWNS-ULQDDVLXSA-N Phe-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KPEIBEPEUAZWNS-ULQDDVLXSA-N 0.000 description 1
- CJAHQEZWDZNSJO-KKUMJFAQSA-N Phe-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CJAHQEZWDZNSJO-KKUMJFAQSA-N 0.000 description 1
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 1
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 1
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 1
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 1
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- LUGOKRWYNMDGTD-FXQIFTODSA-N Pro-Cys-Asn Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O LUGOKRWYNMDGTD-FXQIFTODSA-N 0.000 description 1
- HQVPQXMCQKXARZ-FXQIFTODSA-N Pro-Cys-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O HQVPQXMCQKXARZ-FXQIFTODSA-N 0.000 description 1
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 1
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 1
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 1
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 1
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 1
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 1
- KLOQCCRTPHPIFN-DCAQKATOSA-N Pro-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 KLOQCCRTPHPIFN-DCAQKATOSA-N 0.000 description 1
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 1
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 1
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 1
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 1
- CNUIHOAISPKQPY-HSHDSVGOSA-N Pro-Thr-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O CNUIHOAISPKQPY-HSHDSVGOSA-N 0.000 description 1
- DIDLUFMLRUJLFB-FKBYEOEOSA-N Pro-Trp-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC4=CC=C(C=C4)O)C(=O)O DIDLUFMLRUJLFB-FKBYEOEOSA-N 0.000 description 1
- 102100038239 Protein Churchill Human genes 0.000 description 1
- 102220537313 Protein NDRG2_N30K_mutation Human genes 0.000 description 1
- 101710188315 Protein X Proteins 0.000 description 1
- 102220558462 Proteinase-activated receptor 2_N30S_mutation Human genes 0.000 description 1
- 230000006819 RNA synthesis Effects 0.000 description 1
- 238000011529 RT qPCR Methods 0.000 description 1
- MUPFEKGTMRGPLJ-RMMQSMQOSA-N Raffinose Natural products O(C[C@H]1[C@@H](O)[C@H](O)[C@@H](O)[C@@H](O[C@@]2(CO)[C@H](O)[C@@H](O)[C@@H](CO)O2)O1)[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 MUPFEKGTMRGPLJ-RMMQSMQOSA-N 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 1
- UCXDHBORXLVBNC-ZLUOBGJFSA-N Ser-Asn-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O UCXDHBORXLVBNC-ZLUOBGJFSA-N 0.000 description 1
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 1
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 1
- SNNSYBWPPVAXQW-ZLUOBGJFSA-N Ser-Cys-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)O SNNSYBWPPVAXQW-ZLUOBGJFSA-N 0.000 description 1
- RNFKSBPHLTZHLU-WHFBIAKZSA-N Ser-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)O RNFKSBPHLTZHLU-WHFBIAKZSA-N 0.000 description 1
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 1
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 1
- CXBFHZLODKPIJY-AAEUAGOBSA-N Ser-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N CXBFHZLODKPIJY-AAEUAGOBSA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 1
- SRKMDKACHDVPMD-SRVKXCTJSA-N Ser-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N SRKMDKACHDVPMD-SRVKXCTJSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 1
- HJAXVYLCKDPPDF-SRVKXCTJSA-N Ser-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N HJAXVYLCKDPPDF-SRVKXCTJSA-N 0.000 description 1
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 1
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 description 1
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 1
- VEVYMLNYMULSMS-AVGNSLFASA-N Ser-Tyr-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEVYMLNYMULSMS-AVGNSLFASA-N 0.000 description 1
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- KEAYESYHFKHZAL-UHFFFAOYSA-N Sodium Chemical compound [Na] KEAYESYHFKHZAL-UHFFFAOYSA-N 0.000 description 1
- 239000004283 Sodium sorbate Substances 0.000 description 1
- 101710167605 Spike glycoprotein Proteins 0.000 description 1
- 102100037942 Suppressor of tumorigenicity 14 protein Human genes 0.000 description 1
- 230000024932 T cell mediated immunity Effects 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- 101710137500 T7 RNA polymerase Proteins 0.000 description 1
- RZCIEJXAILMSQK-JXOAFFINSA-N TTP Chemical compound O=C1NC(=O)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 RZCIEJXAILMSQK-JXOAFFINSA-N 0.000 description 1
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 1
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 1
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 1
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 1
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 1
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- FDQXPJCLVPFKJW-KJEVXHAQSA-N Thr-Met-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O FDQXPJCLVPFKJW-KJEVXHAQSA-N 0.000 description 1
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 1
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 1
- DNCUODYZAMHLCV-XGEHTFHBSA-N Thr-Pro-Cys Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N)O DNCUODYZAMHLCV-XGEHTFHBSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 1
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 1
- 241000124703 Torilis Species 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- 102100032471 Transmembrane protease serine 4 Human genes 0.000 description 1
- ICNFHVUVCNWUAB-SZMVWBNQSA-N Trp-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ICNFHVUVCNWUAB-SZMVWBNQSA-N 0.000 description 1
- IBBBOLAPFHRDHW-BPUTZDHNSA-N Trp-Asn-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N IBBBOLAPFHRDHW-BPUTZDHNSA-N 0.000 description 1
- UKINEYBQXPMOJO-UBHSHLNASA-N Trp-Asn-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N UKINEYBQXPMOJO-UBHSHLNASA-N 0.000 description 1
- UQHPXCFAHVTWFU-BVSLBCMMSA-N Trp-Phe-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UQHPXCFAHVTWFU-BVSLBCMMSA-N 0.000 description 1
- YXSSXUIBUJGHJY-SFJXLCSZSA-N Trp-Thr-Phe Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)[C@H](O)C)C(O)=O)C1=CC=CC=C1 YXSSXUIBUJGHJY-SFJXLCSZSA-N 0.000 description 1
- NMOIRIIIUVELLY-WDSOQIARSA-N Trp-Val-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)C(C)C)=CNC2=C1 NMOIRIIIUVELLY-WDSOQIARSA-N 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- FBVGQXJIXFZKSQ-GMVOTWDCSA-N Tyr-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N FBVGQXJIXFZKSQ-GMVOTWDCSA-N 0.000 description 1
- DYEGCOJHFNJBKB-UFYCRDLUSA-N Tyr-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 DYEGCOJHFNJBKB-UFYCRDLUSA-N 0.000 description 1
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 1
- PEVVXUGSAKEPEN-AVGNSLFASA-N Tyr-Asn-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PEVVXUGSAKEPEN-AVGNSLFASA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 1
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 1
- QHEGAOPHISYNDF-XDTLVQLUSA-N Tyr-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHEGAOPHISYNDF-XDTLVQLUSA-N 0.000 description 1
- CKHQKYHIZCRTAP-SOUVJXGZSA-N Tyr-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CKHQKYHIZCRTAP-SOUVJXGZSA-N 0.000 description 1
- MPKPIWFFDWVJGC-IRIUXVKKSA-N Tyr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O MPKPIWFFDWVJGC-IRIUXVKKSA-N 0.000 description 1
- IWRMTNJCCMEBEX-AVGNSLFASA-N Tyr-Glu-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O IWRMTNJCCMEBEX-AVGNSLFASA-N 0.000 description 1
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 1
- LHTGRUZSZOIAKM-SOUVJXGZSA-N Tyr-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O LHTGRUZSZOIAKM-SOUVJXGZSA-N 0.000 description 1
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 1
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 1
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 1
- PSALWJCUIAQKFW-ACRUOGEOSA-N Tyr-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N PSALWJCUIAQKFW-ACRUOGEOSA-N 0.000 description 1
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 1
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 1
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- XFEMMSGONWQACR-KJEVXHAQSA-N Tyr-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XFEMMSGONWQACR-KJEVXHAQSA-N 0.000 description 1
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 1
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 1
- MUPFEKGTMRGPLJ-UHFFFAOYSA-N UNPD196149 Natural products OC1C(O)C(CO)OC1(CO)OC1C(O)C(O)C(O)C(COC2C(C(O)C(O)C(CO)O2)O)O1 MUPFEKGTMRGPLJ-UHFFFAOYSA-N 0.000 description 1
- 229910052770 Uranium Inorganic materials 0.000 description 1
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 1
- KXUKIBHIVRYOIP-ZKWXMUAHSA-N Val-Asp-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KXUKIBHIVRYOIP-ZKWXMUAHSA-N 0.000 description 1
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 1
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 1
- 108010003533 Viral Envelope Proteins Proteins 0.000 description 1
- 230000010530 Virus Neutralization Effects 0.000 description 1
- 229930003268 Vitamin C Natural products 0.000 description 1
- 229930003427 Vitamin E Natural products 0.000 description 1
- NRLNQCOGCKAESA-KWXKLSQISA-N [(6z,9z,28z,31z)-heptatriaconta-6,9,28,31-tetraen-19-yl] 4-(dimethylamino)butanoate Chemical compound CCCCC\C=C/C\C=C/CCCCCCCCC(OC(=O)CCCN(C)C)CCCCCCCC\C=C/C\C=C/CCCCC NRLNQCOGCKAESA-KWXKLSQISA-N 0.000 description 1
- AGTYLGHUHWJTSE-FDDDBJFASA-N [[(2R,3S,4R,5R)-5-(5-ethynyl-2,4-dioxopyrimidin-1-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate Chemical compound P(O)(=O)(OP(=O)(O)OP(=O)(O)O)OC[C@@H]1[C@H]([C@H]([C@@H](O1)N1C(=O)NC(=O)C(=C1)C#C)O)O AGTYLGHUHWJTSE-FDDDBJFASA-N 0.000 description 1
- YIJVOACVHQZMKI-JXOAFFINSA-N [[(2r,3s,4r,5r)-5-(4-amino-5-methyl-2-oxopyrimidin-1-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate Chemical compound O=C1N=C(N)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 YIJVOACVHQZMKI-JXOAFFINSA-N 0.000 description 1
- OLRONOIBERDKRE-XUTVFYLZSA-N [[(2r,3s,4r,5s)-3,4-dihydroxy-5-(1-methyl-2,4-dioxopyrimidin-5-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate Chemical compound O=C1NC(=O)N(C)C=C1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 OLRONOIBERDKRE-XUTVFYLZSA-N 0.000 description 1
- YDHWWBZFRZWVHO-UHFFFAOYSA-H [oxido-[oxido(phosphonatooxy)phosphoryl]oxyphosphoryl] phosphate Chemical class [O-]P([O-])(=O)OP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O YDHWWBZFRZWVHO-UHFFFAOYSA-H 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 125000002015 acyclic group Chemical group 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N adenyl group Chemical group N1=CN=C2N=CNC2=C1N GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 210000001552 airway epithelial cell Anatomy 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 229910052783 alkali metal Inorganic materials 0.000 description 1
- 150000001340 alkali metals Chemical class 0.000 description 1
- 125000003342 alkenyl group Chemical group 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- OENHQHLEOONYIE-UKMVMLAPSA-N all-trans beta-carotene Natural products CC=1CCCC(C)(C)C=1/C=C/C(/C)=C/C=C/C(/C)=C/C=C/C=C(C)C=CC=C(C)C=CC1=C(C)CCCC1(C)C OENHQHLEOONYIE-UKMVMLAPSA-N 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 235000006708 antioxidants Nutrition 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 235000010323 ascorbic acid Nutrition 0.000 description 1
- 239000011668 ascorbic acid Substances 0.000 description 1
- 229960005070 ascorbic acid Drugs 0.000 description 1
- 150000001507 asparagine derivatives Chemical class 0.000 description 1
- 125000000613 asparagine group Chemical group N[C@@H](CC(N)=O)C(=O)* 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 238000003149 assay kit Methods 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 108010028263 bacteriophage T3 RNA polymerase Proteins 0.000 description 1
- 229960000686 benzalkonium chloride Drugs 0.000 description 1
- UREZNYTWGJKWBI-UHFFFAOYSA-M benzethonium chloride Chemical compound [Cl-].C1=CC(C(C)(C)CC(C)(C)C)=CC=C1OCCOCC[N+](C)(C)CC1=CC=CC=C1 UREZNYTWGJKWBI-UHFFFAOYSA-M 0.000 description 1
- 229960001950 benzethonium chloride Drugs 0.000 description 1
- 235000010233 benzoic acid Nutrition 0.000 description 1
- 229960004365 benzoic acid Drugs 0.000 description 1
- CADWTSSKOVRVJC-UHFFFAOYSA-N benzyl(dimethyl)azanium;chloride Chemical compound [Cl-].C[NH+](C)CC1=CC=CC=C1 CADWTSSKOVRVJC-UHFFFAOYSA-N 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 235000013734 beta-carotene Nutrition 0.000 description 1
- 239000011648 beta-carotene Substances 0.000 description 1
- TUPZEYHYWIEDIH-WAIFQNFQSA-N beta-carotene Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1=C(C)CCCC1(C)C)C=CC=C(/C)C=CC2=CCCCC2(C)C TUPZEYHYWIEDIH-WAIFQNFQSA-N 0.000 description 1
- 229960002747 betacarotene Drugs 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 238000010170 biological method Methods 0.000 description 1
- SIPUZPBQZHNSDW-UHFFFAOYSA-N bis(2-methylpropyl)aluminum Chemical compound CC(C)C[Al]CC(C)C SIPUZPBQZHNSDW-UHFFFAOYSA-N 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 230000037396 body weight Effects 0.000 description 1
- 239000004067 bulking agent Substances 0.000 description 1
- 235000019282 butylated hydroxyanisole Nutrition 0.000 description 1
- CZBZUDVBLSSABA-UHFFFAOYSA-N butylated hydroxyanisole Chemical compound COC1=CC=C(O)C(C(C)(C)C)=C1.COC1=CC=C(O)C=C1C(C)(C)C CZBZUDVBLSSABA-UHFFFAOYSA-N 0.000 description 1
- 229940043253 butylated hydroxyanisole Drugs 0.000 description 1
- 125000002837 carbocyclic group Chemical group 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 239000002738 chelating agent Substances 0.000 description 1
- 238000009388 chemical precipitation Methods 0.000 description 1
- WLNARFZDISHUGS-MIXBDBMTSA-N cholesteryl hemisuccinate Chemical compound C1C=C2C[C@@H](OC(=O)CCC(O)=O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 WLNARFZDISHUGS-MIXBDBMTSA-N 0.000 description 1
- 229960004106 citric acid Drugs 0.000 description 1
- 235000015165 citric acid Nutrition 0.000 description 1
- 238000005354 coacervation Methods 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 239000013256 coordination polymer Substances 0.000 description 1
- 230000010485 coping Effects 0.000 description 1
- 239000012043 crude product Substances 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- NKLCNNUWBJBICK-UHFFFAOYSA-N dess–martin periodinane Chemical compound C1=CC=C2I(OC(=O)C)(OC(C)=O)(OC(C)=O)OC(=O)C2=C1 NKLCNNUWBJBICK-UHFFFAOYSA-N 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 150000001982 diacylglycerols Chemical class 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- WPUMTJGUQUYPIV-JIZZDEOASA-L disodium (S)-malate Chemical compound [Na+].[Na+].[O-]C(=O)[C@@H](O)CC([O-])=O WPUMTJGUQUYPIV-JIZZDEOASA-L 0.000 description 1
- 239000002270 dispersing agent Substances 0.000 description 1
- 239000002612 dispersion medium Substances 0.000 description 1
- 239000002552 dosage form Substances 0.000 description 1
- 239000003651 drinking water Substances 0.000 description 1
- 235000020188 drinking water Nutrition 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 238000001493 electron microscopy Methods 0.000 description 1
- 238000002330 electrospray ionisation mass spectrometry Methods 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 238000006735 epoxidation reaction Methods 0.000 description 1
- BEFDCLMNVWHSGT-UHFFFAOYSA-N ethenylcyclopentane Chemical compound C=CC1CCCC1 BEFDCLMNVWHSGT-UHFFFAOYSA-N 0.000 description 1
- 150000002170 ethers Chemical class 0.000 description 1
- 229960001617 ethyl hydroxybenzoate Drugs 0.000 description 1
- 235000010228 ethyl p-hydroxybenzoate Nutrition 0.000 description 1
- 239000004403 ethyl p-hydroxybenzoate Substances 0.000 description 1
- NUVBSKCKDOMJSU-UHFFFAOYSA-N ethylparaben Chemical compound CCOC(=O)C1=CC=C(O)C=C1 NUVBSKCKDOMJSU-UHFFFAOYSA-N 0.000 description 1
- 230000017188 evasion or tolerance of host immune response Effects 0.000 description 1
- 239000000945 filler Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000000796 flavoring agent Substances 0.000 description 1
- 235000013355 food flavoring agent Nutrition 0.000 description 1
- 230000037406 food intake Effects 0.000 description 1
- 235000003599 food sweetener Nutrition 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- WIGCFUFOHFEKBI-UHFFFAOYSA-N gamma-tocopherol Natural products CC(C)CCCC(C)CCCC(C)CCCC1CCC2C(C)C(O)C(C)C(C)C2O1 WIGCFUFOHFEKBI-UHFFFAOYSA-N 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 125000003976 glyceryl group Chemical group [H]C([*])([H])C(O[H])([H])C(O[H])([H])[H] 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 239000003979 granulating agent Substances 0.000 description 1
- 230000002140 halogenating effect Effects 0.000 description 1
- 230000026030 halogenation Effects 0.000 description 1
- 238000005658 halogenation reaction Methods 0.000 description 1
- PHNWGDTYCJFUGZ-UHFFFAOYSA-N hexyl dihydrogen phosphate Chemical class CCCCCCOP(O)(O)=O PHNWGDTYCJFUGZ-UHFFFAOYSA-N 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 230000028996 humoral immune response Effects 0.000 description 1
- 150000004678 hydrides Chemical class 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 150000004679 hydroxides Chemical class 0.000 description 1
- 230000006058 immune tolerance Effects 0.000 description 1
- 230000002163 immunogen Effects 0.000 description 1
- 230000016784 immunoglobulin production Effects 0.000 description 1
- 230000006054 immunological memory Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 210000004969 inflammatory cell Anatomy 0.000 description 1
- 230000002757 inflammatory effect Effects 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 230000009545 invasion Effects 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 239000007951 isotonicity adjuster Substances 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- OYHQOLUKZRVURQ-IXWMQOLASA-N linoleic acid Natural products CCCCC\C=C/C\C=C\CCCCCCCC(O)=O OYHQOLUKZRVURQ-IXWMQOLASA-N 0.000 description 1
- 235000020778 linoleic acid Nutrition 0.000 description 1
- 239000012280 lithium aluminium hydride Substances 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 230000033001 locomotion Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 239000000314 lubricant Substances 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 230000017156 mRNA modification Effects 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 210000004779 membrane envelope Anatomy 0.000 description 1
- 230000034217 membrane fusion Effects 0.000 description 1
- 230000003340 mental effect Effects 0.000 description 1
- 235000010270 methyl p-hydroxybenzoate Nutrition 0.000 description 1
- 239000004292 methyl p-hydroxybenzoate Substances 0.000 description 1
- 229960002216 methylparaben Drugs 0.000 description 1
- YACKEPLHDIMKIO-UHFFFAOYSA-N methylphosphonic acid Chemical group CP(O)(O)=O YACKEPLHDIMKIO-UHFFFAOYSA-N 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- HYSQEYLBJYFNMH-UHFFFAOYSA-N n'-(2-aminoethyl)-n'-methylethane-1,2-diamine Chemical compound NCCN(C)CCN HYSQEYLBJYFNMH-UHFFFAOYSA-N 0.000 description 1
- KFIGICHILYTCJF-UHFFFAOYSA-N n'-methylethane-1,2-diamine Chemical compound CNCCN KFIGICHILYTCJF-UHFFFAOYSA-N 0.000 description 1
- 239000007923 nasal drop Substances 0.000 description 1
- 229940100662 nasal drops Drugs 0.000 description 1
- 150000002825 nitriles Chemical class 0.000 description 1
- 231100000252 nontoxic Toxicity 0.000 description 1
- 230000003000 nontoxic effect Effects 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 238000010534 nucleophilic substitution reaction Methods 0.000 description 1
- 238000001543 one-way ANOVA Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 239000003002 pH adjusting agent Substances 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 231100000915 pathological change Toxicity 0.000 description 1
- 230000036285 pathological change Effects 0.000 description 1
- 239000003208 petroleum Substances 0.000 description 1
- 239000008194 pharmaceutical composition Substances 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- 238000005191 phase separation Methods 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 150000008105 phosphatidylcholines Chemical class 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 235000010235 potassium benzoate Nutrition 0.000 description 1
- 239000004300 potassium benzoate Substances 0.000 description 1
- 229940103091 potassium benzoate Drugs 0.000 description 1
- 235000010241 potassium sorbate Nutrition 0.000 description 1
- 239000004302 potassium sorbate Substances 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 239000008213 purified water Substances 0.000 description 1
- MUPFEKGTMRGPLJ-ZQSKZDJDSA-N raffinose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO[C@@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)O1 MUPFEKGTMRGPLJ-ZQSKZDJDSA-N 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000029058 respiratory gaseous exchange Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 239000003161 ribonuclease inhibitor Substances 0.000 description 1
- 238000007142 ring opening reaction Methods 0.000 description 1
- 102220280971 rs1253463092 Human genes 0.000 description 1
- 102220289727 rs1253463092 Human genes 0.000 description 1
- 102220176789 rs368574479 Human genes 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 238000013207 serial dilution Methods 0.000 description 1
- 239000012679 serum free medium Substances 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 235000015424 sodium Nutrition 0.000 description 1
- 235000019265 sodium DL-malate Nutrition 0.000 description 1
- WXMKPNITSTVMEF-UHFFFAOYSA-M sodium benzoate Chemical compound [Na+].[O-]C(=O)C1=CC=CC=C1 WXMKPNITSTVMEF-UHFFFAOYSA-M 0.000 description 1
- 235000010234 sodium benzoate Nutrition 0.000 description 1
- 239000004299 sodium benzoate Substances 0.000 description 1
- 229910000029 sodium carbonate Inorganic materials 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- BEOOHQFXGBMRKU-UHFFFAOYSA-N sodium cyanoborohydride Chemical compound [Na+].[B-]C#N BEOOHQFXGBMRKU-UHFFFAOYSA-N 0.000 description 1
- 239000012312 sodium hydride Substances 0.000 description 1
- 229910000104 sodium hydride Inorganic materials 0.000 description 1
- 239000001394 sodium malate Substances 0.000 description 1
- 239000001488 sodium phosphate Substances 0.000 description 1
- 229910000162 sodium phosphate Inorganic materials 0.000 description 1
- JXKPEJDQGNYQSM-UHFFFAOYSA-M sodium propionate Chemical compound [Na+].CCC([O-])=O JXKPEJDQGNYQSM-UHFFFAOYSA-M 0.000 description 1
- 235000010334 sodium propionate Nutrition 0.000 description 1
- 239000004324 sodium propionate Substances 0.000 description 1
- 229960003212 sodium propionate Drugs 0.000 description 1
- LROWVYNUWKVTCU-STWYSWDKSA-M sodium sorbate Chemical compound [Na+].C\C=C\C=C\C([O-])=O LROWVYNUWKVTCU-STWYSWDKSA-M 0.000 description 1
- 235000019250 sodium sorbate Nutrition 0.000 description 1
- 229940074404 sodium succinate Drugs 0.000 description 1
- ZDQYSKICYIVCPN-UHFFFAOYSA-L sodium succinate (anhydrous) Chemical compound [Na+].[Na+].[O-]C(=O)CCC([O-])=O ZDQYSKICYIVCPN-UHFFFAOYSA-L 0.000 description 1
- 239000012321 sodium triacetoxyborohydride Substances 0.000 description 1
- 238000000935 solvent evaporation Methods 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 235000010199 sorbic acid Nutrition 0.000 description 1
- 239000004334 sorbic acid Substances 0.000 description 1
- 229940075582 sorbic acid Drugs 0.000 description 1
- 238000001179 sorption measurement Methods 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 238000001694 spray drying Methods 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 239000011550 stock solution Substances 0.000 description 1
- 239000012089 stop solution Substances 0.000 description 1
- 238000012916 structural analysis Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 239000003765 sweetening agent Substances 0.000 description 1
- 230000008719 thickening Effects 0.000 description 1
- 239000002562 thickening agent Substances 0.000 description 1
- 239000005451 thionucleotide Substances 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 231100000820 toxicity test Toxicity 0.000 description 1
- 239000012096 transfection reagent Substances 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 210000002845 virion Anatomy 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 235000019154 vitamin C Nutrition 0.000 description 1
- 239000011718 vitamin C Substances 0.000 description 1
- 235000019165 vitamin E Nutrition 0.000 description 1
- 229940046009 vitamin E Drugs 0.000 description 1
- 239000011709 vitamin E Substances 0.000 description 1
- 229940045997 vitamin a Drugs 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
- 238000005303 weighing Methods 0.000 description 1
- OENHQHLEOONYIE-JLTXGRSLSA-N β-Carotene Chemical compound CC=1CCCC(C)(C)C=1\C=C\C(\C)=C\C=C\C(\C)=C\C=C\C=C(/C)\C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C OENHQHLEOONYIE-JLTXGRSLSA-N 0.000 description 1
Abstract
The invention provides an S protein mutant with stable conformation trimer, constructs mRNA encoding the S protein mutant and a vector capable of preparing the mRNA by in vitro transcription, wherein the mutation of a plurality of amino acid residues into proline is carried out in C-terminal amino acid of an extracellular domain of spike protein (S protein) of a novel coronavirus variant strain. The S protein mutant, as well as mRNA encoding it (including further optimized mRNA), can be used to induce an immune response against a novel coronavirus in a subject, thereby preventing and/or treating a disease or disorder associated with the novel coronavirus infection.
Description
Technical Field
The invention belongs to the technical field of biological medicines and vaccines, and particularly relates to a recombinant antigen for preparing a vaccine against a novel coronavirus (2019-nCoV) variant strain, genetically engineered mRNA and a vector thereof, and an mRNA vaccine composition thereof.
Background
Traditional inactivated vaccines, attenuated vaccines and polypeptide vaccines have long development cycle and complex production process, but mRNA vaccines are based on the development of mRNA modification and delivery tools, once viral antigen sequences are obtained, mRNA vaccines with clinical scale can be rapidly designed and manufactured within weeks, standardized production can be achieved, and the mRNA vaccines are attractive in coping with pandemic outbreaks of infectious diseases. And the mRNA vaccine does not have potential reversion hazard of attenuated vaccine; the problem of restoring mutation of the inactivated vaccine does not exist. In immunogenicity, mRNA vaccines can induce B-cell and T-cell immune responses, can elicit an immune memory effect, and can express multiple antigens at a time, delivering more potent antigens. In addition, mRNA only needs to cross the cell membrane to efficiently express antigen proteins in the cytoplasm without risk of gene integration into the genome. And thirdly, mRNA is easily degraded after being translated into protein, and the transient expression characteristic of the mRNA ensures the safety of the mRNA medicament, enables the dosage to be controllable, and avoids antigen immune tolerance caused by long-term exposure of vaccine medicaments. Thus, mRNA vaccines have a subverted advantage in terms of safety, rapid preparation and immunogenicity.
mRNA is transcribed from the template strand of DNA and has the same sequence as the coding strand and is complementary to the template strand. Unlike prokaryotes, mRNA carrying genetic information in eukaryotes consists of a spacer arrangement of exons encoding proteins and introns with no encoding functions. Only the correctly modified, spliced mature mRNA can be transported as a informative template into the cytoplasm for further translation to produce protein.
The novel coronavirus (2019-nCoV, also known as SARS-CoV-2) has a spherical ellipsoidal shape with a diameter of 80-120nm. Under electron microscopy, the virion surface had a globular projection consisting of trimeric Spike glycoprotein (Spike, S). The envelope of the virus is composed of membrane glycoproteins (membrane glycoprotein, M) embedded in the viral envelope by three transmembrane domains. In addition, small amounts of small transmembrane protein-envelope (E) proteins are also present in the envelope. Finally, nucleocapsid (N) proteins bind to the RNA genome in the form of beads, forming a helically symmetric nucleocapsid. The research results show that S, M, E and N proteins are main components of coronaviruses for inducing immune responses of organisms. In addition, the receptor binding domain (receptor binding domain, RBD) in the S protein infects human airway epithelial cells by interacting with the human ACE2 protein.
2019-nCoV mutation has higher occurrence frequency, and certain dominant mutant strains can be formed after flowing in the crowd for a period of time, and mRNA vaccine aiming at the dominant mutant strains is designed and produced by utilizing the development advantages of the mRNA vaccine, so that the immune escape of the crowd with new mutant strains can be dealt with.
Disclosure of Invention
The invention provides an S protein mutant with stable conformation trimer, constructs mRNA encoding the S protein mutant and a vector capable of preparing the mRNA by in vitro transcription, wherein the mutation of a plurality of amino acid residues into proline is carried out in C-terminal amino acid of an extracellular domain of spike protein (S protein) of 2019-nCoV variant strain. These S protein mutants, as well as the mRNA encoding them (including further optimized mRNA), can be used to induce an immune response in a subject against strains including 2019-nCoV wild-type and variant strains, thereby preventing and/or treating diseases or disorders associated with 2019-nCoV infection.
The major structures of currently known 2019-nCoV virus particles include single strand positive strand nucleic acid, spike protein (S), membrane protein (M), envelope protein (E), and nucleocapsid protein (nucelocapsid protein, N). As shown in FIG. 1, the S protein can be divided into a receptor binding subunit S1 and a membrane fusion subunit S2. The process of adsorption invasion of 2019-nCoV virus to cells relies primarily on the S protein, which assembles in the form of homotrimers, whose cytoplasmic tail and transmembrane domains anchor the S protein into the viral membrane. By analyzing the S protein pre-fusion structure, the RBD of the S1 subunit is found to undergo hinge-like conformational movement to hide or expose key sites for receptor binding, wherein the downward state is a receptor non-binding state, and the up state is a receptor binding state and is in a relatively unstable state. This conformation allows the S protein to bind readily to the host receptor angiotensin converting enzyme 2 (ACE 2). Upon binding of RBD to the receptor, the S2 subunit is altered to a post-fusion conformation by insertion of FP into the host cell membrane. Using a cryoelectron microscope experiment, a large number of trimeric S protein domains were determined in a pre-fusion conformation, with a large number of neutralizing antibody sensitive epitopes present on the pre-fusion S protein, while the post-fusion conformation minimizes exposure of neutralizing sensitive epitopes present only in the pre-fusion conformation.
Thus, if to be used as an antigen for a vaccine, the optimized S protein mutant should be able to retain the epitope present in the S protein pre-fusion conformational form and induce antibodies capable of inhibiting viral fusion.
The mutant S protein is produced by amino acid mutation of a parent S protein, and the mutation can be substitution, deletion and/or insertion of amino acid. The parent S protein may be the S protein of a wild-type strain of 2019-nCoV, or the S protein of any mutant strain of 2019-nCoV (the mutation of any mutant strain of 2019-nCoV may occur in the region of the S protein or in the region other than the S protein). The parent S protein may be a full-length S protein, or a fragment of a full-length S protein (e.g., a sequence truncated to the full-length S protein (e.g., deletion of cytoplasmic tails and/or transmembrane domains), etc.).
In the present invention, the amino acid positions of both the S protein mutant and the parent S protein are described based on the amino acid sequence of the wild-type S protein, which can be obtained at NCBI GeneID 43740568, having a total of 1273 amino acids, the sequence of which is shown below and is designated as SEQ ID NO 1 in the present invention.
In one embodiment of the invention, the parent S protein is the S protein of the 2019-nCoV B.1.617.2 mutant strain, the S protein of the 2019-nCoV B.1.617.2 mutant strain having the following mutations compared to the S protein of the 2019-nCoV wild strain: T19R, G142D, EF 156-157 del, R158G, L452R, T478K, D614G, P6811R, D950N (said positions are depicted as positions of the amino acid sequence shown in SEQ ID NO: 1).
The first aspect of the present invention is to provide an S protein mutant.
According to the invention, the S protein mutant comprises at least an extracellular domain comprising an amino acid mutation at a position relative to the extracellular domain of the parent S protein: F817P, A892P, A899P, A942P, and KV986_987PP, which are described by the position of the amino acid sequence shown in SEQ ID NO: 1. The amino acid mutation can improve the stability of the S protein mutant.
In some embodiments of the invention, the S protein mutant further has the following mutations relative to the parent S protein: T19R, G142D, EF 156-157 del, R158G, L452R, T478K, D614G, P6811R, D950N, said positions being depicted as positions of the amino acid sequence shown in SEQ ID NO: 1.
According to the invention, in some embodiments, the S protein mutant has a mutation to the Furin cleavage site relative to the parent S protein, and RRARs at amino acids 682-685 (which are depicted as being located at the positions of the amino acid sequence shown in SEQ ID NO: 1) are mutated to lose the ability to be cleaved by Furin like (furilike) proteases. In one embodiment of the invention the RRAR is mutated to GSAS. By mutating the enzyme cutting site in the S protein, the S protein mutant can be prevented from being cut by protease, and the stability of the S protein mutant is further improved.
According to the invention, in some embodiments, the S protein mutant does not comprise the transmembrane domain and/or cytoplasmic tail of the S protein.
According to the invention, in some embodiments, the S protein mutant may also have an amino acid mutation in the fusion peptide domain relative to the parent S protein. Substitution, deletion and/or insertion of one or more amino acid residues in this region results in the fusion peptide domain losing its natural function, i.e., the function of mediating fusion of the virus with the host cell membrane. In some embodiments, the S protein mutant does not comprise a fusion peptide domain. By causing fusion peptide domain mutations in the S protein mutant to render it nonfunctional, the stability of the S protein mutant pre-fusion conformation can be increased such that a large number of neutralizing antibody sensitive epitopes present on the S protein pre-fusion conformation are retained and exposed.
According to the invention, in some embodiments, the S protein mutant is directly fused at the C-terminus of the extracellular region (amino acids 1-1209, said sites being depicted as being located at the position of the amino acid sequence shown in SEQ ID NO: 1) to aid in the formation of the domain of the trimer. "domain that facilitates trimer formation" refers to a protein or polypeptide domain that is capable of spontaneously or induced trimer formation when expressed. A variety of such domains are known in the art. By including domains in the S protein mutant that assist in trimer formation (e.g., by constructing a fusion protein), the S protein mutant can be promoted to form a trimeric conformation, and/or the trimeric conformation of the S protein mutant can be stabilized. In one embodiment of the invention, the domain that aids in trimer formation is T4 Fibritin Foldon Trimerization Motif. In one embodiment of the invention, the amino acid sequence of T4 Fibritin Foldon Trimerization Motif is shown in SEQ ID NO. 3.
In some preferred embodiments of the invention, the S protein mutants of the invention have 6 proline mutations in the extracellular domain of the S protein, depicted in the positions of the amino acid sequence shown in SEQ ID NO. 1: F817P, A892P, A899P, A942P, and KV986_987PP; the following mutations: T19R, G142D, EF 156-157 del, R158G, L452R, T478K, D614G, P6811R, D950N; mutating 682-685 amino acids RRAR into GSAS; and transmembrane domain and cytoplasmic tail that do not contain S protein. In one embodiment of the invention, the S protein mutant comprises the amino acid sequence as shown in SEQ ID NO. 2.
In some preferred embodiments of the invention, the S protein mutants of the invention have 6 proline mutations in the extracellular domain of the S protein, depicted in the positions of the amino acid sequence shown in SEQ ID NO. 1: F817P, A892P, A899P, A942P, and KV986_987PP; the following mutations: T19R, G142D, EF 156-157 del, R158G, L452R, T478K, D614G, P6811R, D950N; mutating 682-685 amino acids RRAR into GSAS; and a transmembrane domain and cytoplasmic tail that does not contain an S protein; the domain T4 Fibritin Foldon Trimerization Motif that assists in trimer formation is fused directly at the C-terminus of the extracellular region. In one embodiment of the invention, the S protein mutant comprises the amino acid sequence of SEQ ID NO. 2 and the amino acid sequence of SEQ ID NO. 3 directly linked from the N-terminus to the C-terminus. In one embodiment of the invention, the amino acid sequence of the S protein mutant is the amino acid sequence of SEQ ID NO:2 and the amino acid sequence of SEQ ID NO:3 directly linked from the N-terminus to the C-terminus.
In a second aspect, the invention provides a DNA molecule encoding an S protein mutant according to the first aspect of the invention, an expression vector or a cell comprising said DNA molecule.
According to the invention, the DNA molecule may be present in an expression vector, such as a plasmid vector or a viral vector, and transfected into an engineered cell for expression to obtain the S protein mutant of the invention. Or the DNA molecule can be recombined into the genome of an engineering cell, and expressed in the engineering cell to obtain the S protein mutant.
In some embodiments of the invention, the nucleotide sequence of the DNA molecule comprises a nucleotide sequence that is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or about 100% homologous to the nucleotide sequence set forth in SEQ ID NO. 4, which encodes the amino acid sequence set forth in SEQ ID NO. 2.
In some embodiments of the invention, the nucleotide sequence of the DNA molecule comprises a nucleotide sequence that is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or about 100% homologous to the nucleotide sequence set forth in SEQ ID NO. 5, which encodes the amino acid sequence set forth in SEQ ID NO. 3.
In some embodiments of the invention, the nucleotide sequence of the DNA molecule comprises a nucleotide sequence that is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or about 100% homologous to the nucleotide sequence of SEQ ID NO. 4, and a nucleotide sequence that is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or about 100% homologous to the nucleotide sequence of SEQ ID NO. 5, directly linked from the 5 'end to the 3' end. In a specific embodiment of the invention, the nucleotide sequence of the DNA molecule comprises the nucleotide sequence of SEQ ID NO. 4 and the nucleotide sequence of SEQ ID NO. 5 directly linked from the 5 'end to the 3' end.
An expression vector comprising said DNA molecule. According to the invention, the expression vector may be a prokaryotic or eukaryotic expression vector.
A cell comprising the DNA molecule. According to the invention, the DNA molecule may be present outside the genome of the cell or may be recombined into the genome of the cell.
In a third aspect, the invention provides an mRNA encoding the mutant S protein of the first aspect of the invention.
According to the invention, the mRNA comprises an Open Reading Frame (ORF) encoding an S protein mutant.
According to the invention, the mRNA may comprise, from the 5' end to the 3' end, a 5' cap structure, a 5' UTR, an Open Reading Frame (ORF) encoding an S protein mutant, a 3' UTR and a poly-A tail.
5' cap structure: the 5 'cap is typically a modified nucleotide (especially a guanine nucleotide) added at the 5' end of the mRNA molecule, and also includes atypical cap analogs. Preferably, the 5' cap is added using a 5' -5' -triphosphate linkage (also known as m7 GpppN). In some embodiments of the invention, the 5' CAP structure is CAP1 (additional methylation of ribose of adjacent nucleotides of m7 GpppN), CAP2 (additional methylation of ribose of a second nucleotide downstream of m7 GpppN), CAP3 (additional methylation of ribose of a third nucleotide downstream of m7 GpppN), CAP4 (additional methylation of ribose of a fourth nucleotide downstream of m7 GpppN).
The 5' cap structure can be formed in chemical RNA synthesis using cap analogs, or RNA in vitro transcription (co-transcription capping), or can be formed in vitro using capping enzymes (e.g., commercially available capping kits).
In one embodiment of the invention, the 5' Cap structure is a Cap1 structure.
According to the invention, the 5'UTR may comprise a 5' UTR of β -globin or α -globin or a homologue, fragment thereof. In some embodiments of the invention, the 5'utr comprises a 5' utr of β -globin or a homolog, fragment thereof. In some embodiments of the invention, the 5'UTR comprises a nucleotide sequence that is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or about 100% homologous to the 5' UTR nucleotide sequence of the β -globin shown in SEQ ID NO. 6. In a specific embodiment of the invention, the 5'UTR comprises the 5' UTR nucleotide sequence of the β -globin as shown in SEQ ID NO. 6.
In some embodiments of the invention, the 5' utr further comprises a Kozak sequence. In one embodiment of the invention, the Kozak sequence is GCCACC.
According to the invention, the 3'UTR may comprise the 3' UTR of β -globin or α -globin or a homologue, fragment or combination of fragments thereof. In some embodiments of the invention, the 3'UTR comprises a nucleotide sequence which is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or about 100% homologous to a fragment of the α2-globin 3' UTR shown in SEQ ID NO. 7. In other embodiments of the invention, the 3'UTR comprises 2 nucleotide sequences joined end to end that are at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or about 100% homologous to a fragment of the α2-globin 3' UTR shown in SEQ ID NO. 7. In a specific embodiment of the invention, the 3' UTR comprises 2 nucleotide sequences as shown in SEQ ID NO. 7, joined end to end.
According to the invention, the poly-A tail may be 50-200 nucleotides, preferably 100-150 nucleotides, for example 110-120 nucleotides, for example about 110 nucleotides, about 120 nucleotides, about 130 nucleotides, about 140 nucleotides, about 150 nucleotides in length.
In one embodiment of the invention, the nucleotide sequence of the Open Reading Frame (ORF) of the S protein mutant is a nucleotide sequence that is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or about 100% homologous to the nucleotide sequence set forth in SEQ ID NO. 8. The amino acid sequence of the S protein mutant after ORF translation consists of an amino acid sequence shown in SEQ ID NO. 2 and an amino acid sequence shown in SEQ ID NO. 3 which are directly connected from the N end to the C end. In one embodiment of the present invention, the nucleotide sequence of the Open Reading Frame (ORF) of the S protein mutant is shown in SEQ ID NO. 8.
In one embodiment of the invention, the mRNA comprises a nucleotide sequence that is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or about 100% homologous to the nucleotide sequence set forth in SEQ ID NO. 9. In one embodiment of the invention, the mRNA comprises the nucleotide sequence set forth in SEQ ID NO. 9.
According to the invention, one or more nucleotides in the mRNA may be modified. For example, one or more nucleotides (e.g., all nucleotides) in the mRNA can each independently be replaced with a naturally occurring nucleotide analog or an artificially synthesized nucleotide analog.
In a fourth aspect, the invention provides a nucleic acid molecule encoding an mRNA according to the third aspect of the invention. The nucleic acid molecule may be in the form of a vector, for example a plasmid vector or a viral vector. In some embodiments, the nucleic acid molecules can be used to prepare the mRNA of the invention by transcription in vitro.
In one embodiment of the invention, the nucleic acid molecule is an in vitro transcription vector comprising operably linked nucleotide sequences encoding a 5'UTR, a 3' UTR and a poly-A tail. The 5'UTR comprises a nucleotide sequence that is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or about 100% homologous to the 5' UTR nucleotide sequence of the β -globin shown in SEQ ID NO. 6. The 3'UTR comprises 2 nucleotide sequences joined end to end with at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or about 100% homology to the fragment of the alpha 2-globin 3' UTR shown in SEQ ID NO. 7. The poly-A tail may be 50-200 nucleotides, preferably 100-150 nucleotides, for example 110-120 nucleotides, such as about 110 nucleotides, about 120 nucleotides, about 130 nucleotides, about 140 nucleotides, about 150 nucleotides in length.
According to the present invention, the vector for in vitro transcription further comprises a nucleotide sequence encoding an ORF of the S protein mutant. The nucleotide sequence of the Open Reading Frame (ORF) of the S protein mutant is a nucleotide sequence that is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or about 100% homologous to the nucleotide sequence set forth in SEQ ID NO. 8.
In a specific embodiment of the invention, the vector for in vitro transcription comprises operably linked nucleotide sequences encoding an ORF of a 5'UTR, an S protein mutant, a 3' UTR and a poly-A tail; the 5'UTR comprises a 5' UTR nucleotide sequence of the beta-globin shown in SEQ ID NO. 6; the 3' UTR comprises 2 nucleotide sequences shown in SEQ ID NO. 7 connected end to end; the poly-A tail is 50-200 nucleotides in length; the nucleotide sequence of the Open Reading Frame (ORF) of the S protein mutant is shown as SEQ ID NO. 8.
According to the present invention, a conventional plasmid can be used as a vector. In some embodiments of the invention, the plasmid is psp73 or pUC57-kana.
The mRNA of the present invention can be prepared by methods known in the art, including but not limited to chemical synthesis or in vitro transcription, and the like. In some embodiments of the invention, a nucleic acid molecule encoding an mRNA may be synthesized artificially, cloned into a vector, and constructed into a plasmid for in vitro transcription. And (3) transforming the constructed plasmid into host bacteria for culture and amplification, and extracting the plasmid. The extracted plasmid was digested into linear molecules by enzyme digestion. mRNA was prepared using in vitro transcription using the prepared linearized plasmid molecule as a template. In Vitro Transcription (IVT) systems typically comprise a transcription buffer, nucleotide Triphosphates (NTPs), an RNase inhibitor, and a polymerase. NTP may be selected from, but is not limited to, natural and non-natural (modified) NTP. The polymerase may be selected from, but is not limited to, T7 RNA polymerase, T3RNA polymerase, and mutant polymerase. The cap structure analogue can be added in the in vitro transcription process to directly obtain mRNA with a cap structure; capping enzymes and dimethyl transferases may also be used to add a capping structure to the mRNA after in vitro transcription is complete. The resulting mRNA may be purified by methods conventional in the art, such as chemical precipitation, magnetic bead, affinity chromatography, and the like.
The S protein mutant of the first aspect of the invention can be directly used as an antigen for preparing vaccines.
The mRNA according to the third aspect of the present invention may be prepared into a liposome or a lipid nanoparticle or the like encapsulating the mRNA together with a lipid compound, and then into a vaccine.
Accordingly, in a fifth aspect the present invention provides a vaccine composition comprising an S protein mutant according to the first aspect of the invention, or an mRNA according to the third aspect of the invention.
According to the invention, the vaccine or vaccine composition may contain pharmaceutically acceptable excipients, and/or immunological adjuvants in addition to the S protein mutant or mRNA, the lipid compound used to form the liposome or lipid nanoparticle.
Lipid nanoparticles can be prepared using methods known in the art. For example: the lipid nanoparticles are prepared by dissolving lipid molecules in an organic solvent at a molar ratio to obtain a lipid-mixed solution, mixing the lipid-mixed solution with an aqueous solution of the object to be delivered (e.g., nucleic acid) as an organic phase and the aqueous phase. Lipid nanoparticles may be prepared using other methods including, but not limited to, spray drying, single and double emulsion solvent evaporation, solvent extraction, phase separation, nano-precipitation, microfluidic, simple and complex coacervation, and others well known to those of ordinary skill in the art. The preparation method may further comprise the step of separating and purifying to obtain the lipid nanoparticle. The preparation method may further comprise the step of lyophilizing the lipid nanoparticle.
According to the invention, in the vaccine or vaccine composition, when lipid nanoparticles are used as a carrier, mRNA is located in the lipid nanoparticles, and the lipid nanoparticles contain 30-60mol% of ionizable/cationic lipid molecules, 5-30mol% of neutral lipid molecules, 30-50mol% of cholesterol lipid molecules, and 0.4-10mol% of PEGylated lipid molecules, which account for the total lipid molecules; preferably contains 32-55 mole% of ionizable/cationic lipid molecules, 8-20 mole% of neutral lipid molecules, 35-50 mole% of cholesterol lipid molecules, 0.5-5 mole% of PEGylated lipid molecules; more preferably, it contains 39-51 mole% of ionizable/cationic lipid molecules, 9-16 mole% of neutral lipid molecules, 37-49 mole% of cholesterol lipid molecules, 1.3-2.7 mole% of PEGylated lipid molecules.
The ionizable/cationic lipid molecules may be selected from commercial molecules such as DLin-MC3-DMA, DOTAP, DOTMA, and the ionizable lipid molecules represented by formula C:
c (C)Wherein each n 3 Are independent of each other and may be the same or different, each n 3 Selected from integers from 1 to 8, each m 3 Are independent of each other and may be the same or different, each m 3 An integer selected from 0 to 8; preferably, each n 3 Selected from integers from 4 to 8, each m 3 An integer selected from 4 to 8; preferably, each n 3 Are all identical to each other, each m 3 Are identical to each other.
In one embodiment of the invention, n is preferably 3 Is 6, m 3 4, the molecular structure is as follows:
the neutral lipid molecule may be selected from, for example, phosphatidylcholines represented by formula EE, phosphatidylethanolamine compound shown in formula FF, wherein Ra, rb, rc, rd is independently selected from the group consisting of linear or branched C10-30 alkyl, linear or branched C10-30 alkenyl, preferably CH 3 (CH 2 ) 17 CH 2 -、CH 3 (CH 2 ) 15 CH 2 -、CH 3 (CH 2 ) 13 CH 2 -、CH 3 (CH 2 ) 11 CH 2 -、CH 3 (CH 2 ) 9 CH 2 -、CH 3 (CH 2 ) 7 CH 2 -、CH 3 (CH 2 ) 7 -CH=CH-(CH 2 ) 7 -、CH 3 (CH 2 ) 4 CH=CHCH 2 CH=CH(CH 2 ) 7 -、CH 3 (CH 2 ) 7 -CH=CH-(CH 2 ) 9 -。
The cholesterol lipid molecule may be selected from cholesterol, 5-heptadecylresorcinol and cholesterol hemisuccinate, for example.
The pegylated lipid molecule comprises a lipid moiety and a PEG-based polymer moiety, denoted as "lipid moiety-PEG-number average molecular weight", said lipid moiety being a diacylglycerol or diacylglycerol amide selected from dilauroylglycerol, dimyristoylglycerol, dipalmitoylglycerol, distearoyl glycerol, dilaurylglycerol amide, dimyristoylglycerol amide, dipalmitoylglycerol amide, distearoyl glyceramide, 1, 2-distearoyl-sn-glycerol-3-phosphoethanolamine, 1, 2-dimyristoyl-sn-glycerol-3-phosphoethanolamine; the PEG has a number average molecular weight of about 130 to about 50,000, such as about 150 to about 30,000, about 150 to about 20,000, about 150 to about 15,000, about 150 to about 10,000, about 150 to about 6,000, about 150 to about 5,000, about 150 to about 4,000, about 150 to about 3,000, about 300 to about 3,000, about 1,000 to about 3,000, about 1,500 to about 2,500, such as about 2000.
In the vaccine composition, the mass ratio of the total mass of lipid molecules to mRNA is 5-20:1.
The S protein mutant of the first aspect of the invention or the mRNA of the third aspect of the invention is applied to the preparation of vaccines.
According to the invention, the vaccine or vaccine composition may be used for the prevention and/or treatment of 2019-nCoV infection or a disease or disorder associated with 2019-nCoV infection, which 2019-nCoV may be a wild strain or a mutant of any one thereof. In one embodiment of the invention, the 2019-nCoV is a B.1.617.2 mutant.
The diseases or conditions associated with 2019-nCoV infection include, but are not limited to, pneumonia caused by 2019-nCoV infection, headache, nasal obstruction, runny nose, cough or/and airway inflammation caused by 2019-nCoV infection, disseminated intravascular coagulation caused by 2019-nCoV infection, and sepsis caused by 2019-nCoV infection.
In a sixth aspect, the invention provides the use of a DNA molecule according to the second aspect of the invention for the preparation of a mutant S protein, and the use of a nucleic acid molecule according to the fourth aspect of the invention for the preparation of an mRNA according to the third aspect of the invention.
List of sequences according to the invention:
/>
/>
/>
/>
the ionizable lipid compounds of formula C of the present invention may be synthesized using methods known in the art, for example, by reacting one or more equivalents of an amine with one or more equivalents of an epoxy-terminated compound under suitable conditions. The synthesis of the ionizable lipid compounds is performed with or without a solvent, and the synthesis may be performed at a higher temperature in the range of 25-100 ℃. The resulting ionizable lipid compound may optionally be purified.
In some embodiments of the invention, the ionizable lipid compounds of the invention may be prepared using the following general preparation methods.
Step 1: reduction of
The carboxyl group of the compound A1 is reduced to a hydroxyl group in the presence of a reducing agent to obtain a compound A2. Examples of reducing agents include, but are not limited to, lithium aluminum hydride, diisobutylaluminum hydride, and the like. Examples of the solvent used in the reaction include, but are not limited to, ethers (such as diethyl ether, tetrahydrofuran, dioxane, etc.), halogenated hydrocarbons (such as chloroform, methylene chloride, dichloroethane, etc.), hydrocarbons (such as n-pentane, n-hexane, benzene, toluene, etc.), and mixed solvents of two or more of these solvents.
Step 2: oxidation
The hydroxyl group of the compound A2 is oxidized to an aldehyde group in the presence of an oxidizing agent to obtain a compound A3. Examples of oxidizing agents include, but are not limited to, 2-iodoxybenzoic acid (IBX), pyridinium chlorochromate (PCC), pyridinium Dichlorochromate (PDC), dess-martin oxidizing agent, manganese dioxide, and the like. Examples of the solvent used in the reaction include, but are not limited to, halogenated hydrocarbons (such as chloroform, methylene chloride, dichloroethane, etc.), hydrocarbons (such as n-pentane, n-hexane, benzene, toluene, etc.), nitriles (such as acetonitrile, etc.), and mixed solvents of two or more of these solvents.
Step 3: halo-reduction
First, the aldehyde α -hydrogen of the compound A3 is subjected to halogenation with a halogenating agent under acidic conditions to obtain an α -halogenated aldehyde intermediate, and then the aldehyde group of the α -halogenated aldehyde is reduced to a hydroxyl group in the presence of a reducing agent to obtain the compound A4. Examples of conditions that provide acidity include, but are not limited to, DL-proline. Examples of halogenated agents include, but are not limited to, N-chlorosuccinimide (NCS) and N-bromosuccinimide (NBS). Examples of reducing agents include, but are not limited to, sodium borohydride, sodium cyanoborohydride, and sodium triacetoxyborohydride.
Step 4: epoxidation
The compound A4 is subjected to intramolecular nucleophilic substitution reaction in the presence of a base to obtain an epoxy compound A5. Examples of bases include, but are not limited to, hydroxides or hydrides of alkali metals, such as sodium hydroxide, potassium hydroxide, and sodium hydride. Examples of solvents used in the reaction include, but are not limited to, mixtures of dioxane and water.
Step 5: ring opening reaction
Compound A5 is ring-opened with an amine (e.g., N-bis (2-aminoethyl) methylamine) to obtain the final compound. Examples of the solvent for the reaction include, but are not limited to, ethanol, methanol, isopropanol, tetrahydrofuran, chloroform, hexane, toluene, diethyl ether, etc.
The raw material A1 in the preparation method can be obtained commercially or synthesized by a conventional method.
Description of the terminology:
in the present application, the meanings of novel coronaviruses, 2019-nCoV and SARS-CoV-2 are the same.
In the present description and claims, conventional single-letter or three-letter codes for amino acid residues are used. Unless otherwise indicated, amino acid sequences are written in an amino-to-carboxyl orientation from left to right.
For ease of reference, the S protein mutants of the present application are described using the following naming convention: original amino acid, position, substituted amino acid. According to this naming convention, for example, substitution of asparagine with alanine at position 30 is expressed as: asn30Ala or N30A; the absence of asparagine at the same position is expressed as: asn30 or n30; insertion of another amino acid residue, e.g., lysine, is denoted: asn30AsnLys or N30NK; deletion of consecutive stretch of amino acid residues, e.g., deletion of amino acid residues 242-244, denoted as (242-244) ×or Δ (242-244) or 242_244del; if an S protein mutant contains a "deletion" and an insertion at that position, as compared to the other S protein parents, it is expressed as: *36Asp or 36D, indicates the deletion at position 36 with simultaneous insertion of aspartic acid. When one or more alternative amino acid residues may be inserted at a given position, this is expressed as: N30A, E, or N30A or N30E. In addition, when a position suitable for modification is identified herein without any particular modification being suggested, it is to be understood that any amino acid residue may be substituted for the amino acid residue at that position. Thus, for example, where reference is made to modifying an asparagine at position 30, but not specified, it is to be understood that the asparagine may be deleted or substituted with any one of the other amino acids, i.e., R, D, A, C, Q, E, G, H, I, L, K, M, F, P, S, T, W, Y, V. Further, "N30X" refers to any one of the following substitutions: N30R, N30D, N30C, N30Q, N30E, N30G, N30H, N30I, N30L, N30K, N30M, N30F, N30P, N30S, N30T, N30W, N30Y, or N30V; or abbreviated as: N30R, D, C, Q, E, G, H, I, L, K, M, F, P, S, T, W, Y, V.
Domain: as used herein, the term "domain" when referring to a polypeptide refers to a motif of the polypeptide that has one or more identifiable structural or functional features or properties (e.g., binding capacity, serving as a site for protein-protein interaction).
The term "protein mutant" or "polypeptide mutant" refers to a molecule whose amino acid sequence differs from a native or reference sequence. Amino acid sequence mutants may have substitutions, deletions and/or insertions, etc., at certain positions within the amino acid sequence, as compared to the native or reference sequence. Typically, the mutant will have at least about 50% identity, at least about 60% identity, at least about 70% identity, at least about 80% identity, at least about 90% identity, at least about 95% identity, at least about 99% identity to the native or reference sequence.
In the present description and claims, nucleotides are referred to by their commonly accepted single letter codes. Unless otherwise indicated, nucleotide sequences are written in the 5 'to 3' direction from left to right. Nucleobases are represented herein by commonly known single letter symbols recommended by the IUPAC-IUB biochemical nomenclature committee. Thus, A represents adenine, C represents cytosine, G represents guanine, T represents thymine, and U represents uracil. The skilled artisan will appreciate that the T base in the codons disclosed herein is present in DNA, whereas the T base will be substituted with a U base in the corresponding RNA. For example, a codon-nucleotide sequence in the form of DNA disclosed herein, such as a vector or an In Vitro Translation (IVT) template, has its T base transcribed into a U base in its corresponding transcribed mRNA. In this regard, both codon-optimized DNA sequences (comprising T) and their corresponding mRNA sequences (comprising U) are considered codon-optimized nucleotide sequences of the present disclosure. Those skilled in the art will also appreciate that equivalent codon patterns can be generated by substituting one or more bases with non-natural bases.
The terms "nucleic acid sequence", "nucleotide sequence" or "polynucleotide sequence" are used interchangeably and refer to a contiguous nucleic acid sequence. The sequence may be single-or double-stranded DNA or RNA, such as mRNA.
"nucleotide sequence encoding …" refers to a nucleic acid (e.g., mRNA or DNA molecule) encoding a polypeptide. The coding sequence may further comprise initiation and termination signals operably linked to regulatory elements including promoters and polyadenylation signals capable of directing expression in cells of the individual or mammal to which the nucleic acid is administered.
Homology: as used herein, the term "homology" refers to the overall relatedness between polymer molecules, e.g., between nucleic acid molecules (e.g., DNA molecules and/or RNA molecules) and/or between polypeptide molecules. In general, the term "homology" means the evolutionary relationship between two molecules. Thus, two homologous molecules will have a common evolutionary ancestor. In the context of the present disclosure, the term homology includes identity and similarity.
In some embodiments, polymer molecules are considered "homologous" to each other if at least 25%,30%,35%,40%,45%,50%,55%,60%,65%,70%,75%,80%,85%,90%,95%,96%,97%,98%,99% or 100% of the monomers in the molecule are identical (identical monomers) or similar (conservative substitutions). The term "homologous" necessarily refers to a comparison between at least two sequences (polynucleotide or polypeptide sequences).
Identity: as used herein, the term "identity" refers to overall monomer conservation between polymer molecules, e.g., between polynucleotide molecules (e.g., DNA molecules and/or RNA molecules) and/or between polypeptide molecules. For example, the calculation of the percent identity of two polynucleotide sequences can be performed by aligning the two sequences for optimal comparison purposes (e.g., gaps can be introduced in one or both of the first and second nucleic acid sequences for optimal alignment and non-identical sequences can be abandoned for comparison purposes, in certain embodiments, the length of the sequences aligned for comparison purposes is at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or 100% of the length of the reference sequence.
Suitable software programs are available from a variety of sources and are used for alignment of both protein and nucleotide sequences. For example, one suitable program for determining percent sequence identity is the Bl2seq, which is part of the BLAST suite of programs available from the national center for Biotechnology information of the United states government (BLAST. Ncbi. Lm. Nih. Gov). Other suitable programs are parts of the bioinformatics EMBOSS program suite, for example Needle, stretcher, water or Matcher, and are also available from European Bioinformatics Institute (EBI) of www.ebi.ac.uk/Tools/psa. Sequence alignment may be performed using methods known in the art, such as MAFFT, clustal (ClustalW, clustal X or Clustal Omega), MUSCLE, and the like.
The terms "coding region" and "coding region" refer to the Open Reading Frame (ORF) in a polynucleotide that, when expressed, produces a polypeptide or protein.
"operably linked" refers to a functional linkage between two or more molecules, constructs, transcripts, entities, moieties, and the like.
Expression: as used herein, "expression" of a nucleic acid sequence refers to one or more of the following events: (1) Generating an mRNA template from the DNA sequence (e.g., by transcription); (2) Processing of mRNA transcripts (e.g., by splicing, editing, 5 'cap formation, and/or 3' end processing); (3) translating the mRNA into a polypeptide or protein; and (4) post-translational modification of the polypeptide or protein.
5' cap structure: the 5 'cap is typically a modified nucleotide (especially a guanine nucleotide) added at the 5' end of the mRNA molecule, and also includes atypical cap analogs. The 5' cap may be added using a 5' -5' -triphosphate linkage (also known as m7 gppppn). Additional examples of 5 'cap structures include glyceryl, inverted deoxyabasic residues (moieties), 4',5 '-methylene nucleotides, 1- (. Beta. -D-erythro furanosyl) nucleotides, 4' -thio nucleotides, carbocyclic nucleotides, 1, 5-anhydrohexitol nucleotides, L-nucleotides, alpha-nucleotides, modified base nucleotides, threo-pentofuranosyl nucleotides, acyclic 3',4' -amethonucleotides, acyclic 3, 4-dihydroxybutyl nucleotides, acyclic 3, 5-dihydroxypentyl nucleotides, 3'-3' -inverted nucleotide moieties, 3'-3' -inverted abasic moieties, 3'-2' -inverted nucleotide moieties, 3'-2' -inverted abasic moieties, 1, 4-butanediol phosphates, 3 '-phosphoramidates, hexyl phosphates, aminohexyl phosphates, 3' -phosphorothioates, dithiophosphates or bridged or unbridged methylphosphonate moieties. These modified 5' cap structures can be used in the context of the present invention to modify the mRNA sequences of the present invention.
Cap analogue: cap analogs refer to non-polymerizable dinucleotides that function as caps, in that they facilitate translation or localization, and/or prevent degradation of RNA molecules when incorporated at the 5' end of the RNA molecule. Non-polymerizable means that the cap analogue will be incorporated only at the 5' end, as it does not have a 5' triphosphate and therefore cannot be extended in the 3' direction by a template dependent RNA polymerase. Cap analogs include, but are not limited to, chemical structures selected from the group consisting of: m7GpppG, m7GpppA, m7GpppC; unmethylated cap analogs (e.g., gpppG); a dimethyl cap analogue (e.g., m2,7 GpppG), a trimethyl cap analogue (e.g., m2,7 GpppG), a dimethyl symmetrical cap analogue (e.g., m7Gpppm 7G), or an anti-reverse cap analogue (e.g., ARCA; m7,2'OmeGpppG, m7,2' dGpppG, m7,3'OmeGpppG, m7,3' dGpppG, and tetraphosphate derivatives thereof) (stepfski et al, 2001.RNA 7 (10): 1486-95).
Naturally occurring nucleotide analogs or synthetic nucleotide analogs, for example, are selected from the group consisting of pseudouridine (pseudouridine), 2-thiouridine (2-thiouridine), 5-methyluridine (5-methyluridine), 5-methylcytidine (5-methylcytidine), N6-methyladenosine (N6-methylpseudouridine), N1-methylpseudouridine (N1-methylpseudouridine), 5-ethynyluridine (5-ethylpseudouridine), pseudouridine triphosphate (pseudouridine-UTP), 1-methyl-pseudouridine triphosphate (N1-methyl-pseudouridine-UTP), 5-ethynyl uridine triphosphate (5-methyl-UTP), 5-methylcytidine triphosphate (5-methyl-CTP), and the like.
By "pharmaceutically acceptable excipient" is meant any ingredient other than the S protein mutants or mrnas described herein, and which has substantially non-toxic and non-inflammatory properties in the patient, including, but not limited to, any and all solvents, dispersion media or other liquid carriers, dispersing or suspending aids, surfactants, isotonic agents, thickening or emulsifying agents, preservatives, binders, lubricants, antioxidants, diluents, granulating and/or dispersing agents, antimicrobial or antifungal agents, osmolality adjusting agents, pH adjusting agents, colorants, sweeteners or flavoring agents, stabilizers, buffers, chelating agents, cryoprotectants, and/or fillers, as appropriate for the particular dosage form desired. Various excipients for formulating pharmaceutical compositions and techniques for preparing the compositions are known in the art. Exemplary antimicrobial or antifungal agents include, but are not limited to, benzalkonium chloride, benzethonium chloride, methylparaben, ethylparaben, benzoic acid, hydroxybenzoic acid, potassium or sodium benzoate, potassium or sodium sorbate, sodium propionate, sorbic acid, and the like, and combinations thereof. Exemplary preservatives include, but are not limited to, beta-carotene, citric acid, ascorbic acid, butylated hydroxyanisole, sodium Lauryl Sulfate (SLS), vitamin a, vitamin C, vitamin E, sodium dodecyl ether sulfate (SLES), and the like, and combinations thereof. Exemplary buffers to control pH may include, but are not limited to, sodium phosphate, sodium succinate, histidine (or histidine-HCl), sodium malate, sodium citrate, sodium carbonate, and the like, and/or combinations thereof. Exemplary cryoprotectants include, but are not limited to, trehalose, lactose, glycerol, mannitol, sucrose, dextrose, and the like, and combinations thereof. Exemplary bulking agents can include, but are not limited to, mannitol, glycine, lactose, sucrose, trehalose, raffinose, and combinations thereof.
And/or is to be taken as a specific disclosure of each of two specified features or components with or without the other. Thus, the term "and/or" as used in phrases such as "a and/or B" is intended to include "a and B", "a or B", "a" (alone) and "B" (alone). Likewise, the term "and/or" as used in phrases such as "A, B and/or C" is intended to encompass each of the following aspects: A. b and C; A. b or C; a or C; a or B; b or C; a and C; a and B; b and C; a (alone); b (alone); and C (alone).
"comprising" and "including" have the same meaning and are intended to be open and allow for the inclusion of additional elements or steps but not required. When the terms "comprising" or "including" are used herein, the terms "consisting of" and/or "consisting essentially of … …" are therefore also included and disclosed.
"about": the term "about" as used in conjunction with numerical values throughout the specification and claims means a range of accuracy that is familiar and acceptable to those skilled in the art. Typically, this accuracy is in the interval of + -10%.
Drawings
Fig. 1: schematic of the primary structure of 2019-nCoV S protein and conformational structure prior to pre-fusion. In the figure, the A part is a primary structure schematic diagram of S protein, SS (signal sequence) -signal peptide sequence, NTD (N-terminal domain) -N terminal region, RBD (receptor binding domain) -receptor binding domain, S2'-S2' protease cleavage site, FP (fusion peptide) -fusion peptide, HR1 (head repeat 1) -7 peptide repeat 1, CH (central helix) -central helix, CD (connector domain) -connecting domain, HR2 (head repeat 2) -7 peptide repeat 2, TM (transmembrane domain) -transmembrane domain, CT (cytoplasmic tail) -cytoplasmic tail, and arrow is protease cleavage site. S1/S2 is preceded by an S1 subunit and followed by an S2 subunit; part B of the figure is a side view and a top view of the S protein pre-fusion construct.
Fig. 2: statistical graphs of the amount of intracellular protein expression of mRNA prepared from different in vitro transcription vectors using Firefly Luc as reporter protein.
Fig. 3: the b.1.617.2mrna integrity results were analyzed on a 2100 bioanalyzer using an RNA 6000 nano chip.
Fig. 4: ELISA method for detecting S protein mutant expression level in supernatant of CHO-K1 cell transfected with nucleic acid.
Fig. 5: three-dimensional structure model diagram of the S protein mutant.
Fig. 6: II-37 (also known as C2) lipid nanoparticles encapsulate S protein expression levels in supernatants after transfection of mRNA according to the invention.
Fig. 7: BALB/c mouse immunization strategy
Fig. 7-1: mRNA vaccine specific IgG binding antibody detection results after immunization of BALB/c mice. BALB/c mice (n=4) were intramuscular injected with different doses of vaccine or phosphate buffered saline (PBS, control, n=4) on day 0 and day 21. Blood was collected on day 35 and the concentration of SARS-CoV-2 B.1.617.2 strain S protein-specific IgG-binding antibody in the blood was determined by ELISA. Each dot represents a single animal, the same number of dots being covered, the figures shown in the figures being median.
Fig. 7-2: results of competitive inhibition assay of ACE2 after immunization of BALB/c mice with mRNA vaccine. BALB/c mice (n=4) were intramuscular injected with different doses of vaccine or phosphate buffered saline (PBS, control, n=4) on day 0 and day 21. Blood was collected on day 35, and the titer of neutralizing antibodies that competitively bound to SARS-CoV-2 B.1.617.2 strain S protein in the blood sample was measured, and the results were expressed as inhibition (%). The figures show the median value and 20% the inhibition ratio cut-off.
Fig. 7-3: mRNA vaccine detection results of pseudovirus neutralizing antibodies after immunization of BALB/c mice. BALB/c mice (n=4) were intramuscular injected with different doses of vaccine or phosphate buffered saline (PBS, control, n=4) on day 0 and day 21. Day 35 blood was collected and strain pVNT50 based on SARS-CoV-2 was determined by the reporter gene method (Vazyme) at B.1.617.2. The numbers shown in the figures are median values.
Fig. 8: rhesus monkey immunization strategy schematic
Fig. 8-1: detection results of specific IgG binding antibodies after immunization of rhesus monkeys with mRNA vaccine. Female and male rhesus monkeys (9-22 years) were intramuscular injected with 10 μg, 30 μg or 100 μg mRNA vaccine (n=3) on day 0 and day 28, and control group was physiological saline (n=2). Blood was collected on day 35 and the concentration of the S protein-specific IgG-binding antibody of the SARS-CoV-2 B.1.617.2 strain in the blood was determined by ELISA. The numbers shown in the figures are median values.
Fig. 8-2: results of competitive inhibition assay of rhesus ACE2 following mRNA vaccine immunization. Female and male rhesus monkeys (9-22 years) were intramuscular injected with 10 μg, 30 μg or 100 μg mRNA vaccine (n=3) on day 0 and day 28, and control group was physiological saline (n=2). Day 35 was bled and the neutralizing antibody titer of the blood sample for S protein that competitively bound to ACE2 in the b.1.617.2 strain was measured, and the result was expressed as inhibition (%). The figures show the median value and 20% the cut-off value.
Fig. 8-3: results of detection of neutralizing antibodies to rhesus pseudovirus after immunization with mRNA vaccine. Female and male rhesus monkeys (9-22 years) were intramuscular injected with 10 μg, 30 μg or 100 μg mRNA vaccine (n=3) on day 0 and day 28, and control group was physiological saline (n=2). Day 35 blood was collected and strain pVNT50 based on SARS-CoV-2 was determined by the reporter gene method (Vazyme) at B.1.617.2. The numbers shown in the figures are median values.
Fig. 9: h11 K18-hACE2 transgenic mouse immunization strategy diagram
Fig. 9-1: h11 Detection results of specific IgG binding antibodies after immunization of the K18-hACE2 transgenic mice with mRNA vaccine. Mice (n=10) were intramuscular injected with different doses of mRNA vaccine or physiological saline on day 0 and day 25 (control group, n=10); the challenge control group was not injected (n=8). Blood samples were collected on day 32 and the concentration of the S protein-specific IgG-binding antibody of the SARS-CoV-2 B.1.617.2 strain in the blood samples was determined by ELISA. Each dot represents a single animal, the same number of dots being covered, the figures shown in the figures being median. P-values were analyzed using one-way analysis of variance (ns, P >0.05; P <0.01; P <0.001; P < 0.0001).
Fig. 9-2: h11 Results of neutralizing antibody titer after immunization of K18-hACE2 transgenic mice with mRNA vaccine. Blood samples were collected on day 32 and the neutralizing antibody titer of the S protein in the blood samples that competitively bound to ACE2 with the b.1.617.2 strain was measured, and the results were expressed as inhibition ratio. The figures show the median value and 20% the inhibition ratio cut-off.
Fig. 10: h11 Statistical graphs of viral load of each tissue in challenge test after immunization of mRNA vaccine by K18-hACE2 transgenic mice
Detailed Description
The technical scheme of the invention will be further described in detail below with reference to specific embodiments. It is to be understood that the following examples are illustrative only and are not to be construed as limiting the scope of the invention. All techniques implemented based on the above description of the invention are intended to be included within the scope of the invention.
Unless otherwise indicated, the starting materials and reagents used in the following examples were either commercially available or may be prepared by known methods. The experimental method is a conventional molecular biological method in the field, and can be operated by referring to the instruction of a molecular biological experimental manual or a kit product instruction in the field.
Example 1 efficiency comparison experiment of IVT vector of the present invention
In the embodiment, firefly Luc is taken as a reporter protein, different IVT vectors are constructed for in vitro transcription synthesis of mRNA capable of translating the Firefly Luc, and the translation efficiency of the synthesized mRNA with different sequence characteristics is compared.
The coding sequence of Firefly Luc was cloned into the multiple cloning site of the corresponding vector by means of a plasmid vector construction technique conventional in the art to obtain vectors numbered IVT1, IVT2, IVT3 and IVT4, respectively, after which corresponding Firefly Luc mRNA samples were prepared by in vitro transcription from the aforementioned vectors using a T7 in vitro transcription kit (cat#AM1344, available from Simer-Feisher).
The vectors IVT 1-IVT 4 are all modified on the basis of a commercial vector psp73, the following sequences are inserted into the vector psp73 at the XhoI/NdeI enzyme cutting sites, wherein UTR sequences are not added into the IVT1, and the length of polyA tails is 64A; the 3' UTR sequences of the 5' UTR and GCTCGCTTTCTTGCTGTCCAATTTCTATTAAAGGTTCCTTTGTTCCCTAAGTCCAACTACTAAACTGGGGGATATTATGAAGGGCCTTGAGCATCTGGATTCTGCCTAATAAAAAACATTTATTTTCATTGC shown in SEQ ID NO. 6 (the 3' UTR sequence of beta globin) were used for IVT2, and the polyA length was 120A; the IVT3 uses the 5'UTR shown in SEQ ID NO. 6 and the 3' UTR sequence shown in SEQ ID NO. 7, and the polyA length is 120A; the 5'UTR shown in SEQ ID NO. 6 and the 3' UTR sequence shown in SEQ ID NO. 7 of 2 tandem repeats were used in IVT4, with a polyA length of 120A. A multiple cloning site comprising the common cleavage sites HindIII and EcoRI is inserted between the sequences of the above 5'UTR and 3' UTR, and the coding sequence of Firefly Luc is cloned into the multiple cloning sites HindIII and EcoRI. All vectors were constructed by the company Jinsri using the method of gene synthesis.
Each Firefly Luc mRNA sample was transfected into CHO cells using Lipofectamine2000 (cat# 11668030, available from Semer Feishan) as a transfection reagent using Dual-Lumi TM Double luciferase reporter gene detection kit (at#RG08)8S, available from Shanghai Biyun biotechnology Co., ltd.) for detection of luciferase. The DNA of Firefly Luc was transferred into psicheck2 plasmid as a positive control (psicheck 2 plasmid, cat#60908-6151, available from Beijing Tian Enzem Gene technologies Co., ltd.). The method comprises the following steps: on the first day, CHO cells were seeded into 96 well plates at 1.5X10 per well 4 Cells were cultured overnight with f12k+10% fbs; the following day, the medium was changed to serum-free F12K medium prior to transfection, and mRNA or DNA was transfected into CHO cells using Lipofectamine 2000; the amount of nucleic acid used per well was 100ng, the amount of liposome was 0.3. Mu.l, the total volume per well was 100. Mu.l, and the cells were cultured overnight; on the third day, the serum-free medium was changed to complete medium (f12k+10% fbs) and the culture was continued for 24 hours; on the fourth day (48 hours post-transfection), firefly Luc fluorescence values were measured.
The results are shown in FIG. 2. In the figure, "DNA" is a positive control (psicheck 2 plasmid carrying the Firefly Luc gene), "IVT1-Luc", "IVT2-Luc", "IVT3-Luc", "IVT4-Luc" represent the corresponding Firefly Luc mRNA transcribed in vitro from the vectors of IVT1, IVT2, IVT3 and IVT4, respectively, "negative control" is a negative control. As can be seen from FIG. 2, the protein expression level of IVT4-Luc is far higher than that of the other three mRNAs by 2-3 times under the same transfection level of the mRNAs, which indicates that the stability of IVT4-Luc is good and the translation efficiency is high.
Example 2 B.1.617.2 preparation of mRNA and translation thereof
1. A nucleic acid sequence encoding the mRNA shown in SEQ ID No.8 was synthesized and cloned into pUC57-kana vector behind the T7 promoter, which had been previously engineered to contain sequences encoding SEQ ID No. 6, a Kozak sequence, 2 end-to-end SEQ ID No. 7, and a polyA tail. The nucleic acid sequence encoding the mRNA shown in SEQ ID No.8 was cloned into the multiple cloning site between the Kozak sequence and 2 end-to-end SEQ ID No. 7, and a plasmid for in vitro transcription was constructed.
2. And (3) transforming the constructed plasmid into escherichia coli Dh5a, culturing and amplifying the plasmid, and extracting the plasmid.
3. The extracted plasmid was digested into linear molecules using the restriction enzyme SpeI immediately following the polyA tail.
4. The prepared linearized plasmid molecule is used as a template, an in vitro transcription method (in vitro transcription kit A45975 of Thermo company) is used for preparing mRNA, the sequence of the mRNA is shown as SEQ ID NO:9, the mRNA is hereinafter abbreviated as B.1.617.2 mRNA, and the S protein mutant is obtained after translation of the mRNA, wherein the amino acid sequence of the S protein mutant is the amino acid sequence of SEQ ID NO:2 and the amino acid sequence of SEQ ID NO:3 which are directly connected from the N end to the C end. After the end of in vitro transcription, CAP structures of CAP1 are added to mRNA using capping enzymes and dimethyltransferase.
Purification of mRNA: the mRNA stock solution obtained was purified by affinity chromatography.
Quality control of mrna: the prepared mRNA was analyzed for mRNA integrity on a 2100 bioanalyzer using an RNA 6000 nano chip, and the results are shown in FIG. 3, where the transcribed mRNA bands were single and no significant degradation was observed.
In addition, spike fragments were excised from the commercial plasmid pCMV3-Spike by restriction enzymes HindIII and EcoRI, and inserted between the HindIII and EcoRI sites of the IVT1 vector of example 1 to give an IVT1-Spike plasmid. And then carrying out point mutation on the plasmid to obtain IVT1-spike-D614G plasmid, and carrying out in vitro transcription by taking the plasmid as a template to obtain spike-D614G mRNA, thereby expressing the full-length S protein containing the D614G mutation.
B.1.617.2 mRNA cellular level expression assay: the CHO-K1 cell line was used as an expression system, mRNA was transfected with Lipofectamine Messenger MAX Reagent (Invitrogen, cat # 1168-027), after 48 hours of culture, cell culture supernatants were collected, and the S protein expression level was detected using an ELISA kit for detecting S protein to evaluate whether mRNA was translatable into protein. The results are shown in FIG. 4. In FIG. 4, "spike DNA" is a commercial plasmid pCMV3-spike (purchased from Soy Severe Inc.) expressing full-length wild-type S protein; "spike-D614G mRNA" is mRNA expressing the full-length S protein containing the D614G mutation, and "spike B.1.617.2 mRNA" is mRNA expressed as B.1.617.2, and the result shows that the mRNA of the invention can highly express the S protein mutant in cells.
After purifying the obtained S protein mutant, carrying out structural analysis by adopting a freeze electron microscope, wherein the 3D structure of the S protein is shown in figure 5, and the S protein mutant is a stable structure of pre-fusion (prefusion spike structure). B.1.617.2 the sequence of the mutant strain and the sequence of the wild strain differ by 9 mutation sites, 2 of which are in the RBD region. The RBD domain status of the pre-fusion S protein of the wild strain has been reported to be mainly 1 OPEN, 2 CLOSE structures. The structure of the S protein mutant of the invention is mainly in flexible state of 2 OPEN and 1 CLOSE. This structural difference is the structural basis for the enhanced binding capacity of the virus to the receptor ACE2 and the enhanced infectivity, and it also leads to a significant difference in the immunogenic epitopes of the S protein, and thus based on the significant differences in antibodies induced by the different structures, in particular neutralizing antibodies.
EXAMPLE 3 construction of LNP-entrapped mRNA
mRNA-entrapped nanoparticles were prepared using II-37 (also known as C2) as an ionizable lipid. Accurately weighing the compounds II-37 and DSPC, CHOL, DMG-PEG2000, and fully dissolving each lipid in absolute ethyl alcohol in a proper container for standby. The specific molar ratio is as follows: II-37:DSPC:CHOL:DMG-PEG 2000=45:15:38.5:1.5; the lipid solutions were mixed uniformly in proportion, and the b.1.617.2 mRNA of example 2 was prepared as an aqueous solution (purified water as solvent) at aqueous phase ph=4 as an organic phase.
Mixing the organic phase and the water phase in a volume ratio of 3:1, and preparing the lipid nanoparticle suspension on a microfluidic platform (such as PNI Ignite). And centrifugally filtering the obtained lipid nanoparticle suspension through a 100kDa ultrafiltration centrifuge tube, purifying and concentrating, and sub-packaging the concentrated liquid.
The prepared lipid nanoparticles were measured for particle size, PDI, potential using a laser nanoparticle analyzer, encapsulation efficiency (EE%) using an ultraviolet spectrophotometer in combination with a RiboGreen RNA kit, and a portion of the samples were transfected into cells a549 in the manner of example 2 and the cell transfection efficiency was measured by Elisa.
The physical and chemical quality control data of the prepared lipid nanoparticle are shown in the following table:
sample information | Particle size (nm) | PDI | Zeta potential | Encapsulation efficiency |
mRNA-LNP | 147.8±20.6 | 0.0651 | 34.28 | 100% |
As a control, liposomes were also made with lipofectamine max entrapped with b.1.617.2 mRNA of example 2, and the above lipid nanoparticles were transfected into cell a549 separately, the negative control being lipid nanoparticles prepared from II-37 without mRNA. As shown in FIG. 6, after the lipid nanoparticle carries mRNA to transfect cells, the expression level of protein in the cells is very high compared with the control reagent Lipofectamine Max, which indicates that the transfection efficiency of the cells of the prepared lipid nanoparticle is very high.
EXAMPLE 4 determination of immunogenicity of protein mutants
The mRNA vaccine used was in a 45:15:38.5:1.5 molar ratio with the mRNA-entrapped LNP lipid nanoparticle prepared in example 3, lipid component II-37:dspc: chol: dmg-PEG 2000. The experimental method comprises the following steps:
ELISA method for detecting specific IgG binding antibody (IgG Binding Antibody)
The content of 2019-nCoV specific IgG antibodies in the plasma of the immunized animal is detected by an indirect ELISA method. Spike antigen protein of the 2019-nCoV B.1.617.2 mutant strain (0.05. Mu.g) was coated on an ELISA plate (Thermo, catalog number.# 442404) at 2-8deg.C overnight. Blocking with 3% BSA (SIGMA, catalog number.#A7030) for 1h at room temperature, adding diluted mouse plasma (1:50), monkey plasma (1:500) incubation for 2h, PBST washing 5 times. Then adding HRP conjugated goat anti-mouse/monkey secondary antibody, incubating for 30-45min at room temperature, and washing with PBST for 5 times. Color development was performed with TMB (thermo filter, catalyst number.# 34029), incubated at room temperature for 7min, stopped by adding stop solution (Solarbio, catalyst number.# C1058), and the antibody content was determined by measuring absorbance at a wavelength of 450 nm. A standard curve was fitted by a polynomial method of selecting positive antibodies (mice: yiqiao Shenzhou cat#40591-MM43, rhesus: ACRO cat#SPD-M201), and the total amount of antibodies was calibrated.
ELISA assay for detecting competitive binding of neutralizing antibodies to antigen proteins in samples for ACE2 (ACE 2 Binding Inhibition)
The neutralizing antibody (Spike RBD) of the 2019-nCoV b.1.617.2 mutant strain in the plasma was diluted and added to the microplate on a plate pre-coated with Human ACE2 Protein using ELISA Anti-SARS-CoV-2 Neutralizing Antibody Titer Serologic Assay Kit (ACRO, catalyst number.#ras-N031/RAS-N040/RAS-N056), and after incubation with HRP-SARS-CoV-2 spike.37 ℃ for 1 hour at constant temperature, incubation with substrate 37 ℃ for 20min at constant temperature, followed by termination with a termination solution. Sample absorbance values (OD 450 nm/OD 630 nm) were determined using a microplate reader (BioTek, SLXFATS) at 450nm/630 nm. OD450nm minus OD630 nm readings for each well reduced background interference. The inhibition rate calculation method comprises the following steps: OD450nm inhibition= (1-sample OD450 nm/Negative Control OD450 nm). Times.100%.
Pseudo virus neutralization experiment (reporter gene method)
The neutralizing antibody can block the binding of the S protein and ACE2 on the surface of the novel coronavirus, thereby preventing the infection of host cells by the pseudovirus. By detecting the expression level of the reporter luciferase, the degree to which the virus is blocked can be deduced. Plasma/serum samples were taken from mice/monkeys at different time points before and after vaccine injection, all samples were heat-inactivated in a water bath at 56 ℃ for 30min before use. Serum-free DMEM (Gibco Catalog Number.#c) 11995500 CP) medium was diluted 20-fold and filter sterilized with a 0.22 μm filter, and 3-fold serial dilutions were made in DMEM medium containing 10% fbs (Gibco Catalog Number.# 10099-141C) for a total of 6 gradients. SARS-CoV-2-Fluc pseudovirus (Vazyme) was transferred from-80℃to 4℃refrigerator or ice until thawed, and the virus was diluted to 1-2X 10 with DMEM medium containing 10% FBS serum before use 4 TCID50/ml. The virus suspension was mixed with equal amounts of plasma in 96-well plates and incubated for 1h at 37℃50. Mu.L of 2X 10 density was added to each well 4 ACE 2-overexpressing 293 cells of cells/well were cultured for 48 hours, then 96-well plates were removed, 100. Mu.L of medium was aspirated from the well plates, 100. Mu.L of a room temperature equilibrated Bio-Lite reporter gene (Vazyme, catalyst number.#DD 1201) detection reagent was added, the plates were shaken for 2 minutes, and after standing at room temperature for 5 minutes, chemiluminescent values (RLU) were detected with a multifunctional microplate reader (TECAN, spark).
The prior research work of the inventor proves that the ACE2 competitive inhibition method and the pseudo-virus neutralization method can well represent the neutralization degree of live viruses in rhesus experiments, and have important reference significance for judging the immunogenicity of vaccines.
BALB/c mice used in this experiment were purchased from Beijing vitamin Torili laboratory animal technologies Co., ltd (animal production license: SCXK (Beijing) 2021-0006), and BALB/c female mice (SPF grade) were subjected to the experiment at 6-8 weeks of age. H11-K18-hACE2 transgenic mice were purchased from Jiangsu Jiujia kang biotechnology Co., ltd (production animal license: SCXK (Su) 2018-0008), 6 week old, SPG grade; the ACE2 humanized mouse model is prepared by preparing an ACE2 humanized mouse on a C57BL/6JGpt background mouse, and driving hACE2 to be overexpressed at the H11 site of a safety island by regulating and controlling a promoter through a human Cytokeratin 18 (Cytokeratin 18, K18) promoter, so as to simulate the human severe COVID-19 phenotype. The age range of the rhesus monkey is 9-22 years old, the rhesus monkey is healthy, and the rhesus monkey is not abnormal in appearance, mental condition, posture, respiration, fecaluria condition, ingestion and drinking water condition during the environment adaptation and quarantine period, so that the rhesus monkey meets the experimental requirements.
1. mRNA vaccine immunogenicity assay in BALB/c mice
BALB/c mice immunization strategy is shown in FIG. 7,2 inter-immunization intervals of 21 days, and conventional blood collection was used for antibody detection.
The experiment set up 6 dose groups, and a PBS control group alone. The 6 dose group search range was sequentially increased 4-fold from the lowest dose of 0.02 μg, i.e., 0.02,0.08,0.3,1.25,5 to the highest dose of 20 μg.
Results:
specific IgG binding antibody detection is shown in FIG. 7-1. All dose groups significantly induced the production of an S protein specific IgG antibody against the b.1.617.2 strain compared to the PBS control group. The median of 20 mug of the highest dose group antibody concentration is 14802ng/mL, and the median of 0.02-5 mug of the group antibody concentration is 165, 1355, 4015, 1809, 7234ng/mL respectively, which show a dose-effect relationship.
The results of inhibition of ACE2 by S protein competitively binding to strain b.1.617.2 are shown in fig. 7-2, expressed as inhibition (%). The results showed that the median inhibition rates were 58%,80%,79%,90% and 91% from 0.08 μg group, 0.3 μg group, 1.25 μg group, 5 μg group to 20 μg group, respectively. Wherein the inhibition rate of the 20 mug high dose group is up to more than 91 percent.
The results of pseudo-virus neutralizing antibody levels are shown in FIGS. 7-3. From the figure, it can be seen that from the 0.08 μg group, higher levels of neutralizing antibody production were induced.
2. mRNA vaccine immunogenicity Pre-test in rhesus monkey
Rhesus immunization strategies are shown in fig. 8, with 28 days between 2 immunizations, and blood was routinely drawn for antibody detection.
The experiment set up 3 dose groups, 10, 30, 100 μg respectively low, medium and high dose groups, and a physiological saline control group was independently established.
Specific IgG binding antibody detection as seen in fig. 8-1, all dose groups induced the production of an S protein specific IgG antibody against the b.1.617.2 strain compared to the saline control group. The median of the high, medium and low dose group antibody concentrations was 141553, 63249, 82458ng/mL, respectively.
The results of inhibition of ACE2 by S protein competitively binding to strain b.1.617.2 are shown in fig. 8-2, expressed as inhibition (%). The results showed that the median levels of inhibition of rhesus ACE2 competitive binding were 89%,88% and 98% for the 10 μg,30 μg,100 μg dose group, respectively.
The results of pseudo-virus neutralizing antibody levels are shown in FIGS. 8-3. The high, medium and low dose group GMT was 3000, 392, 434.
The levels of antibodies in rhesus monkeys showed dose-dependent effects, i.e., low to high vaccine immunity, and could induce dose-dependent humoral and cellular immune responses.
The mice can produce high-level neutralizing antibodies in the group of 0.08 mug at minimum, and the rhesus monkeys can produce high-efficiency antibodies at the minimum of 10 mug.
3. In vivo efficacy test and toxicity test of mRNA vaccine in H11K 18-hACE2 transgenic mice
H11 The immunization strategy of the K18-hACE2 transgenic mice is shown in FIG. 9, and the immunization interval between 2 times is 25 days, and the mice are routinely bled for antibody detection and transferred to a P3 laboratory for toxicity attack experiments 14 days after the secondary immunization.
The experiment set up 3 dose groups, and a physiological saline control group and a blank mouse control group were independently established. 0.8 μg,4 μg,20 μg for low, medium and high dose groups, respectively. The challenge control group was not injected.
Specific IgG binding antibody detection as seen in fig. 9-1, all dose groups significantly induced the production of an S protein specific IgG antibody against the b.1.617.2 strain compared to the saline control group. The median of the antibody concentration of the 20 mug highest dose group is 2621ng/mL, the median of the antibody concentration of the 4 mug group and the median of the antibody concentration of the 0.8 mug group is 1121, 155ng/mL, and the highest dose group have no statistical difference with the 20 mug group and are in dose-effect relation.
The results of inhibition of ACE2 by S protein competitively binding to strain b.1.617.2 are shown in fig. 9-2, expressed as inhibition rate. The results show that different inhibition effects are shown under different dose groups due to individual differences.
B.1.617.2 strain challenge test: the mice are challenged by nasal drops, the virus suspension volume is 20 mu l/mouse, and the challenge dose is 1000TCID 50 The challenge observation period was 5 days, and the blank groups on day 3 (3 dpi) and day 5 (5 dpi) were euthanized 4/time after challenge, and each challenge group was euthanized 5/time. Taking lung, brain, intestinal tissue, heart, liver, kidney and spleen, detecting viral load of each tissue by qPCR method, and taking each tissue for carrying outPathological HE detection.
Each immune group (3 # low dose group, 4# medium dose group, 5# high dose group) showed a significant decrease in viral load of the 5# lung tissue by more than 2 Log10 values on both day 3 and day 5 post infection, as compared to the challenge control group 2 #; the low dose group 3# lung tissue viral load decreased by more than 2 Log10 values on day 5 post infection. The viral load of brain tissue in each immune group (3 #, 4#, 5 #) decreased significantly by more than 2 Log10 values at both day 3 and day 5 post infection. Each immune group (3 #, 4#, 5 #) showed a different degree of decline in viral load in heart, liver, spleen, kidney and intestine tissue compared to the challenge control group 2#, with individual tissue individual time points declining by more than 2 Log10 values (fig. 10).
Histopathological (HE) changes: the 10 mice with lung lesions of the toxicity attack control group are moderate or severe lesions, the lung interval widens the blood stasis, and inflammatory cells infiltrate. The immune high, medium and low dose groups are mild and moderate lesions, severe lesions do not occur, and the incidence rate of moderate lesions is lower than that of the toxicity attack control group. The contrast of the lesion degrees of the heart and the spleen shows that the high, medium and low dose groups of the immunity are lightened to different degrees compared with the toxicity attack control group. Other organs (liver, intestinal tissue, brain) were slightly diseased.
In summary, in the SARS-CoV-2 B.1.617.2 strain infection H11K 18-hACE2 transgenic mouse model, the body weight change of the challenge control group accords with clinical manifestations, and the body weight change of experimental animals in each immune group (3#, 4#, 5#) at 5dpi shows different degrees of improvement, and the high-dose group 5# shows body weight increase. The virus load of the lung and brain tissues is obviously reduced, the virus load of the 5# lung in the 3dpi and 5dpi high-dose group is reduced by more than 2 Log10 values, the virus load of the 3# lung in the 5dpi low-dose group is obviously reduced by more than 2 Log10 values, the virus load of the brain tissues of each immune group is obviously reduced by more than 2 Log10 values, and the virus load of other tissues is reduced to different degrees. The immune groups (3 #, 4#, 5 #) for pathological changes of lung tissue are improved to different degrees compared with the control group. Thus, mRNA vaccine of SARS-CoV-2 B.1.617.2 strain has protective effect against infection of H11-K18-hACE2 transgenic mice with SARS-CoV-2 B.1.617.2 strain.
EXAMPLE 5 Synthesis of ionizable lipid specific Compounds II-37 of formula C
Synthesis of linolenol (a 2): liAlH was added to 950mL of tetrahydrofuran at 0deg.C 4 (7.20 g), linoleic acid (50 g, a 1), after which the mixture was stirred at 25℃for 2h. After completion of the reaction, which was shown by Thin Layer Chromatography (TLC), the reaction mixture was quenched with water (7.2 mL), naOH aqueous solution (7.2 mL, mass fraction 15%) and water (21.6 mL), and an appropriate amount of Na was added 2 SO 4 After stirring for 15 minutes, the filter cake was filtered through a buchner funnel and washed with ethyl acetate, the filtrate was collected and concentrated by evaporation to give 47.4g of the target product linolenol (a 2).
1 H NMR(400MHz,CDCl 3 ):δ5.27-5.44(m,4H),3.63(t,J=6.63Hz,2H),2.77(t,J=6.44Hz,2H),1.97-2.12(m,4H),1.57-1.63(m,1H),1.20-1.46(m,18H),0.83-0.95(m,3H)
Synthesis of (9Z, 12Z) -octadeca-9, 12-dienal (a 3): linolenol (25.0 g, a 2) and 2-iodoxybenzoic acid (39.4 g) were added to 170mL of acetonitrile at room temperature, and the mixture was stirred at 85 ℃ for 4h. The reaction solution was filtered through a buchner funnel and the filter cake was washed with methylene chloride, and the filtrate was collected and concentrated by evaporation to give 24.0g of the objective (9Z, 12Z) -octadeca-9, 12-dienal (a 3).
1 H NMR(400MHz,CDCl 3 ):δ9.76(t,J=1.76Hz,1H),5.25-5.43(m,4H),2.76(t,J=6.17Hz,2H),2.41(td,J=7.33,1.87Hz,2H),2.04(q,J=6.84Hz,4H),1.56-1.68(m,2H),1.22-1.36(m,14H),0.88(t,J=6.73Hz,3H)
Synthesis of (9Z, 12Z) -2-chloro-octadeca-9, 12-dien-1-ol (a 4): to 246mL of acetonitrile at 0℃were added (9Z, 12Z) -octadeca-9, 12-dienal (43.0 g, a 3), DL-proline (5.62 g) and N-chlorosuccinimide, followed by stirring at 0℃for 2h. After completion of the reaction, the reaction mixture was diluted with absolute ethanol (246 mL), and sodium borohydride (8.8 g) was added thereto, followed by stirring at 0℃for 4 hours. The reaction mixture was quenched with water (120 mL) and extracted with methyl tert-butyl ether, the combined organic phases were washed with saturated brine, dried over sodium sulfate, filtered and concentrated by evaporation to give the desired product (9 z,12 z) -2-chloro-octadeca-9, 12-dien-1-ol (a 4,46 g) which was used directly in the next step.
1 H NMR(400MHz,CDCl 3 ):δ5.25-5.51(m,4H),3.97-4.07(m,1H),3.79(dd,J=12.01,3.63Hz,1H),3.59-3.70(m,1H),2.67-2.90(m,2H),1.96-2.15(m,5H),1.64-1.82(m,1H),1.20-1.49(m,15H),0.89(br t,J=6.75Hz,3H)
Synthesis of 2- [ (7 z,10 z) -hexadecane-7, 10-diene ] oxirane (a 5): to 450mL of 1, 4-dioxane were added (9Z, 12Z) -2-chloro-octadeca-9, 12-dien-1-ol (45 g, a 4) and aqueous sodium hydroxide solution (120 g of sodium hydroxide in 585mL of water) at room temperature, and after the addition was completed, the mixture was stirred at 35℃for 2 hours. TLC showed that after the reaction was completed, the reaction solution was separated by a separating funnel and washed with saturated brine, dried over sodium sulfate, filtered and concentrated by evaporation, and then the residue was purified by flash column chromatography eluting with petroleum ether/ethyl acetate to give the target product 2- [ (7 z,10 z) -hexadecane-7, 10-diene ] oxirane (a 5) 29.11g.
1 H NMR(400MHz,CDCl 3 ):δ5.27-5.46(m,4H),2.87-2.98(m,1H),2.70-2.85(m,3H),2.46(dd,J=5.00,2.75Hz,1H),1.94-2.21(m,4H),1.24-1.58(m,17H),0.78-1.00(m,3H)
II-37 synthesis: 2- [ (7Z, 10Z) -hexadecane-7, 10-diene ] oxirane (5 g) and N, N-bis (2-aminoethyl) methylamine (739 mg) were added to 10mL of ethanol at room temperature, and the mixture was stirred at 90℃for 36h. The reaction solution was concentrated by evaporation, and the residue was purified by flash column chromatography eluting with methylene chloride/methanol to give crude product II-37 (4 g). The target product was purified again by flash column chromatography with dichloromethane/methanol to give II-37 (2.2 g).
1 H NMR(400MHz,CDCl 3 ):δ5.27-5.44(m,12H),3.48-3.79(m,3H),2.63-3.00(m,12H),2.16-2.61(m,12H),2.05(q,J=6.80Hz,12H),1.18-1.57(m,51H),0.89(t,J=6.88Hz,9H)
ESI-MS:m/z 910.8[M+H] + ,911.8[M+2H] + ,912.8[M+3H] +
The embodiments of the present invention have been described above. However, the present invention is not limited to the above embodiment. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.
SEQUENCE LISTING
<110> Beijing Qihen Biotechnology Co., ltd
<120> S protein mutant of novel coronavirus variant strain, genetically engineered mRNA thereof and vaccine composition
<130> CPCN22410423
<160> 9
<170> PatentIn version 3.5
<210> 1
<211> 1273
<212> PRT
<213> Unknown
<220>
<223> 2019-nCoV wild-type S protein
<400> 1
Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
1 5 10 15
Asn Leu Thr Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
20 25 30
Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
35 40 45
His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
50 55 60
Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
65 70 75 80
Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Thr Glu
85 90 95
Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
100 105 110
Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
115 120 125
Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Gly Val Tyr
130 135 140
Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Phe Arg Val Tyr
145 150 155 160
Ser Ser Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu
165 170 175
Met Asp Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe
180 185 190
Val Phe Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr
195 200 205
Pro Ile Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu
210 215 220
Pro Leu Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr
225 230 235 240
Leu Leu Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser
245 250 255
Gly Trp Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro
260 265 270
Arg Thr Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala
275 280 285
Val Asp Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys
290 295 300
Ser Phe Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val
305 310 315 320
Gln Pro Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys
325 330 335
Pro Phe Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala
340 345 350
Trp Asn Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu
355 360 365
Tyr Asn Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro
370 375 380
Thr Lys Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe
385 390 395 400
Val Ile Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly
405 410 415
Lys Ile Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys
420 425 430
Val Ile Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn
435 440 445
Tyr Asn Tyr Leu Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe
450 455 460
Glu Arg Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Thr Pro Cys
465 470 475 480
Asn Gly Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly
485 490 495
Phe Gln Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val
500 505 510
Leu Ser Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys
515 520 525
Lys Ser Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn
530 535 540
Gly Leu Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu
545 550 555 560
Pro Phe Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val
565 570 575
Arg Asp Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe
580 585 590
Gly Gly Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val
595 600 605
Ala Val Leu Tyr Gln Asp Val Asn Cys Thr Glu Val Pro Val Ala Ile
610 615 620
His Ala Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser
625 630 635 640
Asn Val Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val
645 650 655
Asn Asn Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala
660 665 670
Ser Tyr Gln Thr Gln Thr Asn Ser Pro Arg Arg Ala Arg Ser Val Ala
675 680 685
Ser Gln Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser
690 695 700
Val Ala Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile
705 710 715 720
Ser Val Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val
725 730 735
Asp Cys Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu
740 745 750
Leu Leu Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr
755 760 765
Gly Ile Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln
770 775 780
Val Lys Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe
785 790 795 800
Asn Phe Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser
805 810 815
Phe Ile Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly
820 825 830
Phe Ile Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp
835 840 845
Leu Ile Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu
850 855 860
Leu Thr Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly
865 870 875 880
Thr Ile Thr Ser Gly Trp Thr Phe Gly Ala Gly Ala Ala Leu Gln Ile
885 890 895
Pro Phe Ala Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr
900 905 910
Gln Asn Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn
915 920 925
Ser Ala Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Ala Ser Ala
930 935 940
Leu Gly Lys Leu Gln Asp Val Val Asn Gln Asn Ala Gln Ala Leu Asn
945 950 955 960
Thr Leu Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val
965 970 975
Leu Asn Asp Ile Leu Ser Arg Leu Asp Lys Val Glu Ala Glu Val Gln
980 985 990
Ile Asp Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val
995 1000 1005
Thr Gln Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn
1010 1015 1020
Leu Ala Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys
1025 1030 1035
Arg Val Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro
1040 1045 1050
Gln Ser Ala Pro His Gly Val Val Phe Leu His Val Thr Tyr Val
1055 1060 1065
Pro Ala Gln Glu Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His
1070 1075 1080
Asp Gly Lys Ala His Phe Pro Arg Glu Gly Val Phe Val Ser Asn
1085 1090 1095
Gly Thr His Trp Phe Val Thr Gln Arg Asn Phe Tyr Glu Pro Gln
1100 1105 1110
Ile Ile Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys Asp Val
1115 1120 1125
Val Ile Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln Pro
1130 1135 1140
Glu Leu Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn
1145 1150 1155
His Thr Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile Asn
1160 1165 1170
Ala Ser Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn Glu
1175 1180 1185
Val Ala Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu Leu
1190 1195 1200
Gly Lys Tyr Glu Gln Tyr Ile Lys Trp Pro Trp Tyr Ile Trp Leu
1205 1210 1215
Gly Phe Ile Ala Gly Leu Ile Ala Ile Val Met Val Thr Ile Met
1220 1225 1230
Leu Cys Cys Met Thr Ser Cys Cys Ser Cys Leu Lys Gly Cys Cys
1235 1240 1245
Ser Cys Gly Ser Cys Cys Lys Phe Asp Glu Asp Asp Ser Glu Pro
1250 1255 1260
Val Leu Lys Gly Val Lys Leu His Tyr Thr
1265 1270
<210> 2
<211> 1206
<212> PRT
<213> Unknown
<220>
<223> 2019-nCoV S protein mutant
<400> 2
Met Phe Val Phe Leu Val Leu Leu Pro Leu Val Ser Ser Gln Cys Val
1 5 10 15
Asn Leu Arg Thr Arg Thr Gln Leu Pro Pro Ala Tyr Thr Asn Ser Phe
20 25 30
Thr Arg Gly Val Tyr Tyr Pro Asp Lys Val Phe Arg Ser Ser Val Leu
35 40 45
His Ser Thr Gln Asp Leu Phe Leu Pro Phe Phe Ser Asn Val Thr Trp
50 55 60
Phe His Ala Ile His Val Ser Gly Thr Asn Gly Thr Lys Arg Phe Asp
65 70 75 80
Asn Pro Val Leu Pro Phe Asn Asp Gly Val Tyr Phe Ala Ser Ile Glu
85 90 95
Lys Ser Asn Ile Ile Arg Gly Trp Ile Phe Gly Thr Thr Leu Asp Ser
100 105 110
Lys Thr Gln Ser Leu Leu Ile Val Asn Asn Ala Thr Asn Val Val Ile
115 120 125
Lys Val Cys Glu Phe Gln Phe Cys Asn Asp Pro Phe Leu Asp Val Tyr
130 135 140
Tyr His Lys Asn Asn Lys Ser Trp Met Glu Ser Glu Val Tyr Ser Ser
145 150 155 160
Ala Asn Asn Cys Thr Phe Glu Tyr Val Ser Gln Pro Phe Leu Met Asp
165 170 175
Leu Glu Gly Lys Gln Gly Asn Phe Lys Asn Leu Arg Glu Phe Val Phe
180 185 190
Lys Asn Ile Asp Gly Tyr Phe Lys Ile Tyr Ser Lys His Thr Pro Ile
195 200 205
Asn Leu Val Arg Asp Leu Pro Gln Gly Phe Ser Ala Leu Glu Pro Leu
210 215 220
Val Asp Leu Pro Ile Gly Ile Asn Ile Thr Arg Phe Gln Thr Leu Leu
225 230 235 240
Ala Leu His Arg Ser Tyr Leu Thr Pro Gly Asp Ser Ser Ser Gly Trp
245 250 255
Thr Ala Gly Ala Ala Ala Tyr Tyr Val Gly Tyr Leu Gln Pro Arg Thr
260 265 270
Phe Leu Leu Lys Tyr Asn Glu Asn Gly Thr Ile Thr Asp Ala Val Asp
275 280 285
Cys Ala Leu Asp Pro Leu Ser Glu Thr Lys Cys Thr Leu Lys Ser Phe
290 295 300
Thr Val Glu Lys Gly Ile Tyr Gln Thr Ser Asn Phe Arg Val Gln Pro
305 310 315 320
Thr Glu Ser Ile Val Arg Phe Pro Asn Ile Thr Asn Leu Cys Pro Phe
325 330 335
Gly Glu Val Phe Asn Ala Thr Arg Phe Ala Ser Val Tyr Ala Trp Asn
340 345 350
Arg Lys Arg Ile Ser Asn Cys Val Ala Asp Tyr Ser Val Leu Tyr Asn
355 360 365
Ser Ala Ser Phe Ser Thr Phe Lys Cys Tyr Gly Val Ser Pro Thr Lys
370 375 380
Leu Asn Asp Leu Cys Phe Thr Asn Val Tyr Ala Asp Ser Phe Val Ile
385 390 395 400
Arg Gly Asp Glu Val Arg Gln Ile Ala Pro Gly Gln Thr Gly Lys Ile
405 410 415
Ala Asp Tyr Asn Tyr Lys Leu Pro Asp Asp Phe Thr Gly Cys Val Ile
420 425 430
Ala Trp Asn Ser Asn Asn Leu Asp Ser Lys Val Gly Gly Asn Tyr Asn
435 440 445
Tyr Arg Tyr Arg Leu Phe Arg Lys Ser Asn Leu Lys Pro Phe Glu Arg
450 455 460
Asp Ile Ser Thr Glu Ile Tyr Gln Ala Gly Ser Lys Pro Cys Asn Gly
465 470 475 480
Val Glu Gly Phe Asn Cys Tyr Phe Pro Leu Gln Ser Tyr Gly Phe Gln
485 490 495
Pro Thr Asn Gly Val Gly Tyr Gln Pro Tyr Arg Val Val Val Leu Ser
500 505 510
Phe Glu Leu Leu His Ala Pro Ala Thr Val Cys Gly Pro Lys Lys Ser
515 520 525
Thr Asn Leu Val Lys Asn Lys Cys Val Asn Phe Asn Phe Asn Gly Leu
530 535 540
Thr Gly Thr Gly Val Leu Thr Glu Ser Asn Lys Lys Phe Leu Pro Phe
545 550 555 560
Gln Gln Phe Gly Arg Asp Ile Ala Asp Thr Thr Asp Ala Val Arg Asp
565 570 575
Pro Gln Thr Leu Glu Ile Leu Asp Ile Thr Pro Cys Ser Phe Gly Gly
580 585 590
Val Ser Val Ile Thr Pro Gly Thr Asn Thr Ser Asn Gln Val Ala Val
595 600 605
Leu Tyr Gln Gly Val Asn Cys Thr Glu Val Pro Val Ala Ile His Ala
610 615 620
Asp Gln Leu Thr Pro Thr Trp Arg Val Tyr Ser Thr Gly Ser Asn Val
625 630 635 640
Phe Gln Thr Arg Ala Gly Cys Leu Ile Gly Ala Glu His Val Asn Asn
645 650 655
Ser Tyr Glu Cys Asp Ile Pro Ile Gly Ala Gly Ile Cys Ala Ser Tyr
660 665 670
Gln Thr Gln Thr Asn Ser Arg Gly Ser Ala Ser Ser Val Ala Ser Gln
675 680 685
Ser Ile Ile Ala Tyr Thr Met Ser Leu Gly Ala Glu Asn Ser Val Ala
690 695 700
Tyr Ser Asn Asn Ser Ile Ala Ile Pro Thr Asn Phe Thr Ile Ser Val
705 710 715 720
Thr Thr Glu Ile Leu Pro Val Ser Met Thr Lys Thr Ser Val Asp Cys
725 730 735
Thr Met Tyr Ile Cys Gly Asp Ser Thr Glu Cys Ser Asn Leu Leu Leu
740 745 750
Gln Tyr Gly Ser Phe Cys Thr Gln Leu Asn Arg Ala Leu Thr Gly Ile
755 760 765
Ala Val Glu Gln Asp Lys Asn Thr Gln Glu Val Phe Ala Gln Val Lys
770 775 780
Gln Ile Tyr Lys Thr Pro Pro Ile Lys Asp Phe Gly Gly Phe Asn Phe
785 790 795 800
Ser Gln Ile Leu Pro Asp Pro Ser Lys Pro Ser Lys Arg Ser Pro Ile
805 810 815
Glu Asp Leu Leu Phe Asn Lys Val Thr Leu Ala Asp Ala Gly Phe Ile
820 825 830
Lys Gln Tyr Gly Asp Cys Leu Gly Asp Ile Ala Ala Arg Asp Leu Ile
835 840 845
Cys Ala Gln Lys Phe Asn Gly Leu Thr Val Leu Pro Pro Leu Leu Thr
850 855 860
Asp Glu Met Ile Ala Gln Tyr Thr Ser Ala Leu Leu Ala Gly Thr Ile
865 870 875 880
Thr Ser Gly Trp Thr Phe Gly Ala Gly Pro Ala Leu Gln Ile Pro Phe
885 890 895
Pro Met Gln Met Ala Tyr Arg Phe Asn Gly Ile Gly Val Thr Gln Asn
900 905 910
Val Leu Tyr Glu Asn Gln Lys Leu Ile Ala Asn Gln Phe Asn Ser Ala
915 920 925
Ile Gly Lys Ile Gln Asp Ser Leu Ser Ser Thr Pro Ser Ala Leu Gly
930 935 940
Lys Leu Gln Asn Val Val Asn Gln Asn Ala Gln Ala Leu Asn Thr Leu
945 950 955 960
Val Lys Gln Leu Ser Ser Asn Phe Gly Ala Ile Ser Ser Val Leu Asn
965 970 975
Asp Ile Leu Ser Arg Leu Asp Pro Pro Glu Ala Glu Val Gln Ile Asp
980 985 990
Arg Leu Ile Thr Gly Arg Leu Gln Ser Leu Gln Thr Tyr Val Thr Gln
995 1000 1005
Gln Leu Ile Arg Ala Ala Glu Ile Arg Ala Ser Ala Asn Leu Ala
1010 1015 1020
Ala Thr Lys Met Ser Glu Cys Val Leu Gly Gln Ser Lys Arg Val
1025 1030 1035
Asp Phe Cys Gly Lys Gly Tyr His Leu Met Ser Phe Pro Gln Ser
1040 1045 1050
Ala Pro His Gly Val Val Phe Leu His Val Thr Tyr Val Pro Ala
1055 1060 1065
Gln Glu Lys Asn Phe Thr Thr Ala Pro Ala Ile Cys His Asp Gly
1070 1075 1080
Lys Ala His Phe Pro Arg Glu Gly Val Phe Val Ser Asn Gly Thr
1085 1090 1095
His Trp Phe Val Thr Gln Arg Asn Phe Tyr Glu Pro Gln Ile Ile
1100 1105 1110
Thr Thr Asp Asn Thr Phe Val Ser Gly Asn Cys Asp Val Val Ile
1115 1120 1125
Gly Ile Val Asn Asn Thr Val Tyr Asp Pro Leu Gln Pro Glu Leu
1130 1135 1140
Asp Ser Phe Lys Glu Glu Leu Asp Lys Tyr Phe Lys Asn His Thr
1145 1150 1155
Ser Pro Asp Val Asp Leu Gly Asp Ile Ser Gly Ile Asn Ala Ser
1160 1165 1170
Val Val Asn Ile Gln Lys Glu Ile Asp Arg Leu Asn Glu Val Ala
1175 1180 1185
Lys Asn Leu Asn Glu Ser Leu Ile Asp Leu Gln Glu Leu Gly Lys
1190 1195 1200
Tyr Glu Gln
1205
<210> 3
<211> 28
<212> PRT
<213> Artificial Sequence
<220>
<223> domain aiding in trimer formation
<400> 3
Gly Tyr Ile Pro Glu Ala Pro Arg Asp Gly Gln Ala Tyr Val Arg Lys
1 5 10 15
Asp Gly Glu Trp Val Leu Leu Ser Thr Phe Leu Gly
20 25
<210> 4
<211> 3618
<212> DNA
<213> Artificial Sequence
<220>
<223> Artificial sequence
<400> 4
atgttcgtgt tcctcgtgct ccttccgctg gtctcgagcc agtgcgtcaa tttgcgcacg 60
aggacgcagt tgccccccgc gtacacgaac tcgtttacgc ggggggtgta ctacccggac 120
aaggtcttcc gcagctctgt cctgcacagc actcaggacc tcttcctccc gttcttctcg 180
aacgtgacgt ggttccacgc cattcacgtg tcggggacga acgggacgaa gaggttcgac 240
aaccctgttc tgccgttcaa cgacggggtg tacttcgctt cgatcgagaa gtccaacatt 300
attcgcgggt ggatattcgg gaccactctc gattcgaaga ctcagtcctt gctgatagtg 360
aacaacgcca cgaacgtggt cattaaggtc tgcgagttcc agttctgtaa tgacccgttc 420
ctggacgttt actatcacaa gaacaacaag tcttggatgg agagtgaggt gtattcgtcc 480
gcgaataatt gtaccttcga gtatgtctcg cagccattct tgatggatct tgagggcaag 540
cagggaaatt tcaagaatct ccgcgagttt gtcttcaaga acatcgacgg gtacttcaag 600
atatactcga agcacacgcc gatcaacctc gtccgtgatc tcccgcaggg cttcagcgct 660
ctggagccgc tggtggatct cccgatcggg atcaacatca cgcggttcca gacgctgctg 720
gccctgcaca ggagttacct gacgccgggt gactccagta gtgggtggac tgcgggtgcc 780
gcggcgtact acgtcgggta cctgcagccg cgcacgttct tgttgaagta caacgagaac 840
gggacgatca cggacgcggt tgattgcgcg ttggaccctc tgtcggagac gaagtgcacc 900
ctgaagtcgt tcacggtgga gaagggtatc tatcagacct cgaacttccg ggtccagccg 960
actgagagta tcgttcggtt cccgaacatt acgaacctgt gtccgttcgg ggaggtcttc 1020
aacgcgacgc ggttcgcgag tgtgtacgct tggaaccgga agaggatctc gaattgtgtg 1080
gcggactaca gtgtgctgta caattcggcg tccttttcca cgttcaagtg ctacggggtg 1140
tcgcccacga agttgaacga cctctgcttc accaacgtgt atgcggattc cttcgtcatc 1200
cgtggtgacg aggtgcgtca gattgcgccg gggcagacgg ggaagatagc ggactataat 1260
tataagttgc ccgacgactt tactggctgc gttattgctt ggaacagcaa taacctggac 1320
agtaaggtcg ggggcaacta taattatcgg taccgtctgt tccggaagag caatctgaag 1380
cccttcgagc gcgatatctc gaccgagatc taccaggccg gctcgaagcc gtgcaacggc 1440
gtcgaggggt ttaattgtta ctttccgtta cagagctacg ggtttcagcc cacgaacggg 1500
gtggggtacc agccctaccg cgtcgtggtg ctgagcttcg agctgctgca cgccccggcc 1560
acggtgtgcg gtccgaagaa aagtacaaac cttgtgaaga acaagtgtgt gaactttaac 1620
ttcaacgggc tcaccgggac gggggtgttg acggagagta acaagaagtt cctgccgttc 1680
cagcagttcg gtcgggatat cgcggacacc acggatgccg tgagggatcc gcagacgctt 1740
gagattctgg acatcacgcc ctgcagcttc gggggcgtca gtgtgatcac gcctggtacg 1800
aacaccagca accaggttgc ggtgttgtac cagggtgtga attgcactga ggtccccgta 1860
gcgatccacg cggatcagct gaccccgacg tggagggtgt actcgacggg gagtaatgtc 1920
ttccagactc gcgcgggttg cctgattggc gctgagcacg tgaacaactc gtacgagtgc 1980
gacattccca ttggggcggg gatctgcgcg tcgtaccaga cccagacgaa cagccggggc 2040
agcgctagca gcgtcgcgtc gcagtcgatc atcgcgtaca cgatgagcct gggggcggag 2100
aacagtgtgg cctattcgaa caacagcata gctatcccca cgaattttac gatcagtgtg 2160
acgaccgaga tcttgcccgt gtcgatgacc aagacctcgg tcgattgcac gatgtacatt 2220
tgtggggata gcactgagtg ttctaacctc ctgctccagt acggcagttt ctgtacgcag 2280
ctcaaccggg cgcttacggg gattgccgtg gagcaggaca agaacactca ggaggtgttt 2340
gcgcaggtca agcagatcta caagacgcct ccgatcaagg atttcggggg gttcaatttc 2400
tcccagatac tccccgaccc ttcgaagccc agcaagcgta gccctattga ggacctgctc 2460
ttcaataagg ttacgcttgc ggacgcgggc ttcatcaagc agtacgggga ctgtctgggg 2520
gacattgccg cccgggacct gatctgtgct cagaagttca atgggctcac tgttctgccg 2580
cccctgctca cggacgagat gatcgcgcag tacacgtcgg cgctcctcgc cggcacgatc 2640
acgtcgggct ggacgtttgg ggctggtcct gcgctgcaga tcccgttccc tatgcagatg 2700
gcgtaccgct tcaatgggat cggggtgacc cagaatgtcc tgtacgagaa tcagaagctc 2760
atcgccaatc agttcaactc ggcgatcggg aagatacagg actccctgtc gagtacgcct 2820
tccgcgttgg ggaagctgca gaacgtggtg aaccagaatg ctcaggcgtt gaacacgttg 2880
gtgaagcagc tgtcgtccaa cttcggggcg atatcctcgg tgctgaacga tattctcagt 2940
cggctggacc cgccggaggc ggaggttcag atcgatagac tcatcactgg tcgcctccag 3000
agtttgcaga cgtacgtgac tcagcagctc atccgggctg ctgagatacg tgcgtctgcg 3060
aacctggcgg cgaccaagat gagtgagtgc gtgctggggc agagcaagcg ggtggacttt 3120
tgcgggaagg gctatcacct gatgtccttc ccgcagtccg cccctcacgg ggtggtcttc 3180
ctgcacgtga cgtatgtgcc ggcgcaggag aagaacttca ccacggcgcc ggccatatgt 3240
cacgacggga aggcccactt cccccgtgag ggggtcttcg tgtcgaatgg gacgcactgg 3300
ttcgtgacgc agcggaattt ctatgagccg cagataatta cgactgacaa cacgtttgtc 3360
agtggtaatt gtgatgtggt catagggatt gttaacaaca ccgtgtatga tcccctccag 3420
ccggagctgg acagcttcaa ggaggagctg gataagtact tcaagaatca cacgtcgccg 3480
gacgtggatc ttggggacat atcggggatc aacgcgagtg ttgttaacat acagaaggag 3540
atcgaccggc tcaatgaggt tgcgaagaac ctcaatgagt cgttgatcga ccttcaggag 3600
ctcggcaagt atgagcag 3618
<210> 5
<211> 84
<212> DNA
<213> Artificial Sequence
<220>
<223> Artificial sequence
<400> 5
ggctatatcc cagaggcccc tagagatggc caggcctacg ttagaaagga cggcgagtgg 60
gtcctgctga gcacattcct gggc 84
<210> 6
<211> 50
<212> RNA
<213> Artificial Sequence
<220>
<223> Artificial sequence
<400> 6
acauuugcuu cugacacaac uguguucacu agcaaccuca aacagacacc 50
<210> 7
<211> 88
<212> RNA
<213> Artificial Sequence
<220>
<223> Artificial sequence
<400> 7
gcuggagccu cgguagccgu uccuccugcc cgcugggccu cccaacgggc ccuccucccc 60
uccuugcacc ggcccuuccu ggucuuug 88
<210> 8
<211> 3705
<212> RNA
<213> Artificial Sequence
<220>
<223> Artificial sequence
<400> 8
auguucgugu uccucgugcu ccuuccgcug gucucgagcc agugcgucaa uuugcgcacg 60
aggacgcagu ugccccccgc guacacgaac ucguuuacgc ggggggugua cuacccggac 120
aaggucuucc gcagcucugu ccugcacagc acucaggacc ucuuccuccc guucuucucg 180
aacgugacgu gguuccacgc cauucacgug ucggggacga acgggacgaa gagguucgac 240
aacccuguuc ugccguucaa cgacggggug uacuucgcuu cgaucgagaa guccaacauu 300
auucgcgggu ggauauucgg gaccacucuc gauucgaaga cucaguccuu gcugauagug 360
aacaacgcca cgaacguggu cauuaagguc ugcgaguucc aguucuguaa ugacccguuc 420
cuggacguuu acuaucacaa gaacaacaag ucuuggaugg agagugaggu guauucgucc 480
gcgaauaauu guaccuucga guaugucucg cagccauucu ugauggaucu ugagggcaag 540
cagggaaauu ucaagaaucu ccgcgaguuu gucuucaaga acaucgacgg guacuucaag 600
auauacucga agcacacgcc gaucaaccuc guccgugauc ucccgcaggg cuucagcgcu 660
cuggagccgc ugguggaucu cccgaucggg aucaacauca cgcgguucca gacgcugcug 720
gcccugcaca ggaguuaccu gacgccgggu gacuccagua guggguggac ugcgggugcc 780
gcggcguacu acgucgggua ccugcagccg cgcacguucu uguugaagua caacgagaac 840
gggacgauca cggacgcggu ugauugcgcg uuggacccuc ugucggagac gaagugcacc 900
cugaagucgu ucacggugga gaaggguauc uaucagaccu cgaacuuccg gguccagccg 960
acugagagua ucguucgguu cccgaacauu acgaaccugu guccguucgg ggaggucuuc 1020
aacgcgacgc gguucgcgag uguguacgcu uggaaccgga agaggaucuc gaauugugug 1080
gcggacuaca gugugcugua caauucggcg uccuuuucca cguucaagug cuacggggug 1140
ucgcccacga aguugaacga ccucugcuuc accaacgugu augcggauuc cuucgucauc 1200
cguggugacg aggugcguca gauugcgccg gggcagacgg ggaagauagc ggacuauaau 1260
uauaaguugc ccgacgacuu uacuggcugc guuauugcuu ggaacagcaa uaaccuggac 1320
aguaaggucg ggggcaacua uaauuaucgg uaccgucugu uccggaagag caaucugaag 1380
cccuucgagc gcgauaucuc gaccgagauc uaccaggccg gcucgaagcc gugcaacggc 1440
gucgaggggu uuaauuguua cuuuccguua cagagcuacg gguuucagcc cacgaacggg 1500
gugggguacc agcccuaccg cgucguggug cugagcuucg agcugcugca cgccccggcc 1560
acggugugcg guccgaagaa aaguacaaac cuugugaaga acaagugugu gaacuuuaac 1620
uucaacgggc ucaccgggac ggggguguug acggagagua acaagaaguu ccugccguuc 1680
cagcaguucg gucgggauau cgcggacacc acggaugccg ugagggaucc gcagacgcuu 1740
gagauucugg acaucacgcc cugcagcuuc gggggcguca gugugaucac gccugguacg 1800
aacaccagca accagguugc gguguuguac caggguguga auugcacuga gguccccgua 1860
gcgauccacg cggaucagcu gaccccgacg uggagggugu acucgacggg gaguaauguc 1920
uuccagacuc gcgcggguug ccugauuggc gcugagcacg ugaacaacuc guacgagugc 1980
gacauuccca uuggggcggg gaucugcgcg ucguaccaga cccagacgaa cagccggggc 2040
agcgcuagca gcgucgcguc gcagucgauc aucgcguaca cgaugagccu gggggcggag 2100
aacagugugg ccuauucgaa caacagcaua gcuaucccca cgaauuuuac gaucagugug 2160
acgaccgaga ucuugcccgu gucgaugacc aagaccucgg ucgauugcac gauguacauu 2220
uguggggaua gcacugagug uucuaaccuc cugcuccagu acggcaguuu cuguacgcag 2280
cucaaccggg cgcuuacggg gauugccgug gagcaggaca agaacacuca ggagguguuu 2340
gcgcagguca agcagaucua caagacgccu ccgaucaagg auuucggggg guucaauuuc 2400
ucccagauac uccccgaccc uucgaagccc agcaagcgua gcccuauuga ggaccugcuc 2460
uucaauaagg uuacgcuugc ggacgcgggc uucaucaagc aguacgggga cugucugggg 2520
gacauugccg cccgggaccu gaucugugcu cagaaguuca augggcucac uguucugccg 2580
ccccugcuca cggacgagau gaucgcgcag uacacgucgg cgcuccucgc cggcacgauc 2640
acgucgggcu ggacguuugg ggcugguccu gcgcugcaga ucccguuccc uaugcagaug 2700
gcguaccgcu ucaaugggau cggggugacc cagaaugucc uguacgagaa ucagaagcuc 2760
aucgccaauc aguucaacuc ggcgaucggg aagauacagg acucccuguc gaguacgccu 2820
uccgcguugg ggaagcugca gaacguggug aaccagaaug cucaggcguu gaacacguug 2880
gugaagcagc ugucguccaa cuucggggcg auauccucgg ugcugaacga uauucucagu 2940
cggcuggacc cgccggaggc ggagguucag aucgauagac ucaucacugg ucgccuccag 3000
aguuugcaga cguacgugac ucagcagcuc auccgggcug cugagauacg ugcgucugcg 3060
aaccuggcgg cgaccaagau gagugagugc gugcuggggc agagcaagcg gguggacuuu 3120
ugcgggaagg gcuaucaccu gauguccuuc ccgcaguccg ccccucacgg gguggucuuc 3180
cugcacguga cguaugugcc ggcgcaggag aagaacuuca ccacggcgcc ggccauaugu 3240
cacgacggga aggcccacuu cccccgugag ggggucuucg ugucgaaugg gacgcacugg 3300
uucgugacgc agcggaauuu cuaugagccg cagauaauua cgacugacaa cacguuuguc 3360
agugguaauu gugauguggu cauagggauu guuaacaaca ccguguauga uccccuccag 3420
ccggagcugg acagcuucaa ggaggagcug gauaaguacu ucaagaauca cacgucgccg 3480
gacguggauc uuggggacau aucggggauc aacgcgagug uuguuaacau acagaaggag 3540
aucgaccggc ucaaugaggu ugcgaagaac cucaaugagu cguugaucga ccuucaggag 3600
cucggcaagu augagcaggg cuauauccca gaggccccua gagauggcca ggccuacguu 3660
agaaaggacg gcgagugggu ccugcugagc acauuccugg gcuga 3705
<210> 9
<211> 4098
<212> RNA
<213> Artificial Sequence
<220>
<223> Artificial sequence
<400> 9
gggagaccgg ccucgagaca uuugcuucug acacaacugu guucacuagc aaccucaaac 60
agacaccaag cuugccacca uguucguguu ccucgugcuc cuuccgcugg ucucgagcca 120
gugcgucaau uugcgcacga ggacgcaguu gccccccgcg uacacgaacu cguuuacgcg 180
ggggguguac uacccggaca aggucuuccg cagcucuguc cugcacagca cucaggaccu 240
cuuccucccg uucuucucga acgugacgug guuccacgcc auucacgugu cggggacgaa 300
cgggacgaag agguucgaca acccuguucu gccguucaac gacggggugu acuucgcuuc 360
gaucgagaag uccaacauua uucgcgggug gauauucggg accacucucg auucgaagac 420
ucaguccuug cugauaguga acaacgccac gaacgugguc auuaaggucu gcgaguucca 480
guucuguaau gacccguucc uggacguuua cuaucacaag aacaacaagu cuuggaugga 540
gagugaggug uauucguccg cgaauaauug uaccuucgag uaugucucgc agccauucuu 600
gauggaucuu gagggcaagc agggaaauuu caagaaucuc cgcgaguuug ucuucaagaa 660
caucgacggg uacuucaaga uauacucgaa gcacacgccg aucaaccucg uccgugaucu 720
cccgcagggc uucagcgcuc uggagccgcu gguggaucuc ccgaucggga ucaacaucac 780
gcgguuccag acgcugcugg cccugcacag gaguuaccug acgccgggug acuccaguag 840
uggguggacu gcgggugccg cggcguacua cgucggguac cugcagccgc gcacguucuu 900
guugaaguac aacgagaacg ggacgaucac ggacgcgguu gauugcgcgu uggacccucu 960
gucggagacg aagugcaccc ugaagucguu cacgguggag aaggguaucu aucagaccuc 1020
gaacuuccgg guccagccga cugagaguau cguucgguuc ccgaacauua cgaaccugug 1080
uccguucggg gaggucuuca acgcgacgcg guucgcgagu guguacgcuu ggaaccggaa 1140
gaggaucucg aauugugugg cggacuacag ugugcuguac aauucggcgu ccuuuuccac 1200
guucaagugc uacggggugu cgcccacgaa guugaacgac cucugcuuca ccaacgugua 1260
ugcggauucc uucgucaucc guggugacga ggugcgucag auugcgccgg ggcagacggg 1320
gaagauagcg gacuauaauu auaaguugcc cgacgacuuu acuggcugcg uuauugcuug 1380
gaacagcaau aaccuggaca guaaggucgg gggcaacuau aauuaucggu accgucuguu 1440
ccggaagagc aaucugaagc ccuucgagcg cgauaucucg accgagaucu accaggccgg 1500
cucgaagccg ugcaacggcg ucgagggguu uaauuguuac uuuccguuac agagcuacgg 1560
guuucagccc acgaacgggg ugggguacca gcccuaccgc gucguggugc ugagcuucga 1620
gcugcugcac gccccggcca cggugugcgg uccgaagaaa aguacaaacc uugugaagaa 1680
caagugugug aacuuuaacu ucaacgggcu caccgggacg gggguguuga cggagaguaa 1740
caagaaguuc cugccguucc agcaguucgg ucgggauauc gcggacacca cggaugccgu 1800
gagggauccg cagacgcuug agauucugga caucacgccc ugcagcuucg ggggcgucag 1860
ugugaucacg ccugguacga acaccagcaa ccagguugcg guguuguacc agggugugaa 1920
uugcacugag guccccguag cgauccacgc ggaucagcug accccgacgu ggagggugua 1980
cucgacgggg aguaaugucu uccagacucg cgcggguugc cugauuggcg cugagcacgu 2040
gaacaacucg uacgagugcg acauucccau uggggcgggg aucugcgcgu cguaccagac 2100
ccagacgaac agccggggca gcgcuagcag cgucgcgucg cagucgauca ucgcguacac 2160
gaugagccug ggggcggaga acaguguggc cuauucgaac aacagcauag cuauccccac 2220
gaauuuuacg aucaguguga cgaccgagau cuugcccgug ucgaugacca agaccucggu 2280
cgauugcacg auguacauuu guggggauag cacugagugu ucuaaccucc ugcuccagua 2340
cggcaguuuc uguacgcagc ucaaccgggc gcuuacgggg auugccgugg agcaggacaa 2400
gaacacucag gagguguuug cgcaggucaa gcagaucuac aagacgccuc cgaucaagga 2460
uuucgggggg uucaauuucu cccagauacu ccccgacccu ucgaagccca gcaagcguag 2520
cccuauugag gaccugcucu ucaauaaggu uacgcuugcg gacgcgggcu ucaucaagca 2580
guacggggac ugucuggggg acauugccgc ccgggaccug aucugugcuc agaaguucaa 2640
ugggcucacu guucugccgc cccugcucac ggacgagaug aucgcgcagu acacgucggc 2700
gcuccucgcc ggcacgauca cgucgggcug gacguuuggg gcugguccug cgcugcagau 2760
cccguucccu augcagaugg cguaccgcuu caaugggauc ggggugaccc agaauguccu 2820
guacgagaau cagaagcuca ucgccaauca guucaacucg gcgaucggga agauacagga 2880
cucccugucg aguacgccuu ccgcguuggg gaagcugcag aacgugguga accagaaugc 2940
ucaggcguug aacacguugg ugaagcagcu gucguccaac uucggggcga uauccucggu 3000
gcugaacgau auucucaguc ggcuggaccc gccggaggcg gagguucaga ucgauagacu 3060
caucacuggu cgccuccaga guuugcagac guacgugacu cagcagcuca uccgggcugc 3120
ugagauacgu gcgucugcga accuggcggc gaccaagaug agugagugcg ugcuggggca 3180
gagcaagcgg guggacuuuu gcgggaaggg cuaucaccug auguccuucc cgcaguccgc 3240
cccucacggg guggucuucc ugcacgugac guaugugccg gcgcaggaga agaacuucac 3300
cacggcgccg gccauauguc acgacgggaa ggcccacuuc ccccgugagg gggucuucgu 3360
gucgaauggg acgcacuggu ucgugacgca gcggaauuuc uaugagccgc agauaauuac 3420
gacugacaac acguuuguca gugguaauug ugaugugguc auagggauug uuaacaacac 3480
cguguaugau ccccuccagc cggagcugga cagcuucaag gaggagcugg auaaguacuu 3540
caagaaucac acgucgccgg acguggaucu uggggacaua ucggggauca acgcgagugu 3600
uguuaacaua cagaaggaga ucgaccggcu caaugagguu gcgaagaacc ucaaugaguc 3660
guugaucgac cuucaggagc ucggcaagua ugagcagggc uauaucccag aggccccuag 3720
agauggccag gccuacguua gaaaggacgg cgaguggguc cugcugagca cauuccuggg 3780
cugagaauuc gcuggagccu cgguagccgu uccuccugcc cgcugggccu cccaacgggc 3840
ccuccucccc uccuugcacc ggcccuuccu ggucuuuggc uggagccucg guagccguuc 3900
cuccugcccg cugggccucc caacgggccc uccuccccuc cuugcaccgg cccuuccugg 3960
ucuuuguuaa uuaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4020
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4080
aaaaaaaaaa aaaacuag 4098
Claims (12)
1. An S protein mutant of 2019-nCoV, comprising at least an extracellular domain comprising an amino acid mutation at a position relative to the extracellular domain of the parent S protein: F817P, A892P, A899P, A942P and KV986_987PP, and T19R, G142D, EF 156-157 del, R158G, L452R, T478K, D614G, P6811R, D950N, the amino acid positions of which are depicted in the amino acid sequence shown in SEQ ID NO: 1.
2. The mutant S protein of 2019-nCoV of claim 1, further comprising a mutation at amino acids RRAR from positions 682 to 685 relative to the amino acid sequence set forth in SEQ ID No. 1 to disable cleavage by furin; preferably, the RRAR is mutated to GSAS;
preferably, the S protein mutant of 2019-nCoV does not comprise the transmembrane domain and/or cytoplasmic tail of the S protein;
Preferably, the S protein mutant of 2019-nCoV is directly fused at the C end of the extracellular domain to assist in forming a domain of a trimer; preferably, the domain that assists in trimer formation is T4 Fibritin Foldon Trimerization Motif.
3. The mutant S protein of 2019-nCoV according to claim 1 or 2, comprising an amino acid sequence as shown in SEQ ID No. 2;
preferably, the amino acid sequence of the S protein mutant of 2019-nCoV comprises the amino acid sequence of SEQ ID NO. 2 and the amino acid sequence of SEQ ID NO. 3 which are directly connected from the N end to the C end.
4. A DNA molecule encoding the S protein mutant of 2019-nCoV of any one of claims 1-3;
preferably, the nucleotide sequence of the DNA molecule comprises a nucleotide sequence that is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or about 100% homologous to the nucleotide sequence of SEQ ID NO. 4, and a nucleotide sequence that is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or about 100% homologous to the nucleotide sequence of SEQ ID NO. 5, directly linked from the 5 'end to the 3' end.
5. An expression vector comprising the DNA molecule of claim 4.
6. A cell comprising the DNA molecule of claim 4 or the expression vector of claim 5.
7. An mRNA molecule comprising an open reading frame encoding the S protein mutant of 2019-nCoV of any one of claims 1-3;
preferably, the nucleotide sequence of the open reading frame is a nucleotide sequence that is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or about 100% homologous to the nucleotide sequence set forth in SEQ ID NO. 8.
8. The mRNA molecule of claim 7, wherein the mRNA comprises, from 5 'to 3' ends, a 5'utr, an open reading frame encoding an S protein mutant of 2019-nCoV, a 3' utr, and a poly-a tail;
preferably, the 5'utr comprises a 5' utr of β -globin or α -globin or a homolog or fragment thereof; preferably, the 5'UTR comprises a nucleotide sequence that is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or about 100% homologous to the 5' UTR nucleotide sequence of the β -globin shown in SEQ ID NO. 6;
Preferably, the 3'utr comprises a 3' utr of β -globin or α -globin or a homologue or fragment or combination of fragments thereof; preferably, the 3'UTR comprises 1 nucleotide sequence that is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or about 100% homologous to a fragment of the α2-globin 3' UTR shown in SEQ ID NO. 7; alternatively, 2 or more nucleotide sequences joined end-to-end that are at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or about 100% homologous to a fragment of the 3' UTR of the alpha 2-globin shown in SEQ ID NO. 7;
preferably, the poly-A tail is 50-200 nucleotides in length, preferably 100-150 nucleotides in length;
preferably, the mRNA further contains a Kozak sequence, preferably the Kozak sequence is GCCACC;
preferably, the mRNA further comprises a 5 'CAP, preferably, the 5' CAP is CAP1.
9. The mRNA molecule of claim 7 or 8, comprising a nucleotide sequence that is at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or about 100% homologous to the nucleotide sequence set forth in SEQ ID No. 9.
10. A nucleic acid molecule encoding an mRNA molecule according to any one of claims 7 to 9.
11. A vaccine composition comprising the S protein mutant of 2019-nCoV of any one of claims 1-3, or the mRNA molecule of any one of claims 7-9; preferably, the vaccine composition further comprises a pharmaceutically acceptable excipient, and/or an immunoadjuvant; preferably, the vaccine or vaccine composition is for use in the prevention and/or treatment of 2019-nCoV infection or a disease or disorder associated with 2019-nCoV infection; preferably, the disease or condition associated with 2019-nCoV infection is selected from pneumonia caused by 2019-nCoV infection, headache, nasal obstruction, runny nose, cough or/and airway inflammation caused by 2019-nCoV infection, disseminated intravascular coagulation caused by 2019-nCoV infection, and sepsis caused by 2019-nCoV infection.
12. The vaccine composition of claim 11, further comprising a lipid nanoparticle in which the mRNA is located, the lipid nanoparticle comprising 30-60mol% ionizable/cationic lipid molecules, 5-30mol% neutral lipid molecules, 30-50mol% cholesterol lipid molecules, 0.4-10mol% pegylated lipid molecules of its total lipid molecules; preferably contains 32-55 mole% of ionizable/cationic lipid molecules, 8-20 mole% of neutral lipid molecules, 35-50 mole% of cholesterol lipid molecules, 0.5-5 mole% of PEGylated lipid molecules; more preferably 39-51 mole% of ionizable/cationic lipid molecules, 9-16 mole% of neutral lipid molecules, 37-49 mole% of cholesterol lipid molecules, 1.3-2.7 mole% of pegylated lipid molecules;
Preferably, the ionizable/cationic lipid molecule is a compound of formula CWherein each n 3 Are independent of each other and may be the same or different, each n 3 Selected from integers from 1 to 8, each m 3 Are independent of each other and may be the same or different, each m 3 An integer selected from 0 to 8; preferably, each n 3 Selected from integers from 4 to 8, each m 3 An integer selected from 4 to 8; preferably, each n 3 Are all identical to each other, each m 3 Are all identical to each other;
preferably, the ionizable/cationic lipid molecular structure is as follows:
preferably, the ratio of the total mass of the lipid molecules to the mass of the mRNA is 5-20:1.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210633828.5A CN117229371A (en) | 2022-06-06 | 2022-06-06 | Novel S protein mutant of coronavirus variant strain, genetically engineered mRNA thereof and vaccine composition |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210633828.5A CN117229371A (en) | 2022-06-06 | 2022-06-06 | Novel S protein mutant of coronavirus variant strain, genetically engineered mRNA thereof and vaccine composition |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117229371A true CN117229371A (en) | 2023-12-15 |
Family
ID=89097211
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210633828.5A Pending CN117229371A (en) | 2022-06-06 | 2022-06-06 | Novel S protein mutant of coronavirus variant strain, genetically engineered mRNA thereof and vaccine composition |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117229371A (en) |
-
2022
- 2022-06-06 CN CN202210633828.5A patent/CN117229371A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112480217B (en) | Vaccines and compositions based on S antigen protein of SARS-CoV-2 | |
KR102655641B1 (en) | Compositions and methods for enhancing gene expression | |
EP3247722B1 (en) | Cytomegalovirus antigens and uses thereof | |
CN112292395A (en) | Novel RSV RNA molecules and compositions for vaccination | |
EP3813874A1 (en) | Novel lassa virus rna molecules and compositions for vaccination | |
KR20230008801A (en) | Optimized nucleotide sequences encoding SARS-COV-2 antigens | |
AU710756B2 (en) | DNA construct comprising a muscle specific regulatory element for immunization or gene therapy | |
AU2021242030A1 (en) | Coronavirus vaccine | |
JP2019513377A (en) | Stabilized soluble pre-fusion RSV F protein | |
EA018174B1 (en) | Replication deficient influenza virus for the expression of heterologous sequences | |
CN113736801B (en) | mRNA and novel coronavirus mRNA vaccine comprising same | |
WO2023051701A1 (en) | Mrna, protein and vaccine against sars-cov-2 infection | |
JP2021534182A (en) | Immunogenic compositions and their use | |
EP4313152A1 (en) | Immunogenic compositions | |
CN112641937B (en) | Application of recombinant adenovirus in preparation of medicaments for preventing viruses | |
JP2023523423A (en) | Vaccine against SARS-CoV-2 and its preparation | |
CN115960180A (en) | 2019-nCoV S protein mutant and genetically engineered mRNA and vaccine composition thereof | |
WO2023098679A1 (en) | Novel coronavirus mrna vaccine against mutant strains | |
WO2023056872A1 (en) | Lipid nanoparticle composition and drug delivery system prepared thereby | |
WO2023064993A1 (en) | Chimeric betacoronavirus spike polypeptides | |
CN117229371A (en) | Novel S protein mutant of coronavirus variant strain, genetically engineered mRNA thereof and vaccine composition | |
KR20230008707A (en) | Vaccine composition for treatment of coronavirus | |
WO2007104979A1 (en) | Virus-like particles of rift valley fever virus | |
KR20150074714A (en) | Infectious clone comprising full-length nucleotide of porcine epidemic diarrhea virus | |
CN111732667B (en) | Peste des petits ruminants virus genetic engineering subunit vaccine |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |