CN107098979A - PPR virus H F fusion proteins and its relevant biological material and application - Google Patents
PPR virus H F fusion proteins and its relevant biological material and application Download PDFInfo
- Publication number
- CN107098979A CN107098979A CN201710483527.8A CN201710483527A CN107098979A CN 107098979 A CN107098979 A CN 107098979A CN 201710483527 A CN201710483527 A CN 201710483527A CN 107098979 A CN107098979 A CN 107098979A
- Authority
- CN
- China
- Prior art keywords
- protein
- leu
- rmhf1
- ile
- val
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 239000012620 biological material Substances 0.000 title claims abstract description 17
- 241000700605 Viruses Species 0.000 title abstract description 30
- 108020001507 fusion proteins Proteins 0.000 title abstract description 10
- 102000037865 fusion proteins Human genes 0.000 title abstract description 10
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 171
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 113
- 238000000034 method Methods 0.000 claims abstract description 56
- 239000000427 antigen Substances 0.000 claims abstract description 28
- 108091007433 antigens Proteins 0.000 claims abstract description 28
- 102000036639 antigens Human genes 0.000 claims abstract description 28
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract description 12
- 125000000539 amino acid group Chemical group 0.000 claims abstract description 6
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 claims abstract description 3
- 238000006467 substitution reaction Methods 0.000 claims abstract description 3
- 230000014509 gene expression Effects 0.000 claims description 68
- 244000005700 microbiome Species 0.000 claims description 41
- 241000588724 Escherichia coli Species 0.000 claims description 31
- 239000013598 vector Substances 0.000 claims description 24
- 201000010099 disease Diseases 0.000 claims description 21
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 21
- 108020004707 nucleic acids Proteins 0.000 claims description 17
- 102000039446 nucleic acids Human genes 0.000 claims description 17
- 150000007523 nucleic acids Chemical class 0.000 claims description 17
- 239000012634 fragment Substances 0.000 claims description 16
- 241000894006 Bacteria Species 0.000 claims description 15
- 210000004027 cell Anatomy 0.000 claims description 10
- 210000004102 animal cell Anatomy 0.000 claims description 8
- 230000009261 transgenic effect Effects 0.000 claims description 8
- 238000003259 recombinant expression Methods 0.000 claims description 5
- 241000588722 Escherichia Species 0.000 claims description 4
- 238000009007 Diagnostic Kit Methods 0.000 claims description 2
- 238000002360 preparation method Methods 0.000 claims description 2
- 230000000840 anti-viral effect Effects 0.000 claims 1
- 238000002965 ELISA Methods 0.000 abstract description 53
- 238000001514 detection method Methods 0.000 abstract description 12
- 238000012544 monitoring process Methods 0.000 abstract description 3
- 229920005630 polypropylene random copolymer Polymers 0.000 description 52
- 210000002966 serum Anatomy 0.000 description 35
- 239000000243 solution Substances 0.000 description 32
- 239000007788 liquid Substances 0.000 description 26
- 238000005457 optimization Methods 0.000 description 25
- 108020004414 DNA Proteins 0.000 description 23
- 238000000746 purification Methods 0.000 description 23
- 108010077805 Bacterial Proteins Proteins 0.000 description 20
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 18
- 241001494479 Pecora Species 0.000 description 17
- 238000010790 dilution Methods 0.000 description 17
- 239000012895 dilution Substances 0.000 description 17
- 239000002773 nucleotide Substances 0.000 description 17
- 125000003729 nucleotide group Chemical group 0.000 description 17
- 108010068327 4-hydroxyphenylpyruvate dioxygenase Proteins 0.000 description 16
- 238000003149 assay kit Methods 0.000 description 16
- 239000002808 molecular sieve Substances 0.000 description 15
- URGAHOPLAPQHLN-UHFFFAOYSA-N sodium aluminosilicate Chemical compound [Na+].[Al+3].[O-][Si]([O-])=O.[O-][Si]([O-])=O URGAHOPLAPQHLN-UHFFFAOYSA-N 0.000 description 15
- 239000006228 supernatant Substances 0.000 description 15
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 14
- 238000012360 testing method Methods 0.000 description 14
- 101710169105 Minor spike protein Proteins 0.000 description 13
- 101710081079 Minor spike protein H Proteins 0.000 description 13
- 238000001556 precipitation Methods 0.000 description 13
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical class O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 12
- 229930027917 kanamycin Natural products 0.000 description 12
- 238000005215 recombination Methods 0.000 description 12
- 230000006798 recombination Effects 0.000 description 12
- 239000012530 fluid Substances 0.000 description 11
- 229910052759 nickel Inorganic materials 0.000 description 9
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 9
- 101000911390 Homo sapiens Coagulation factor VIII Proteins 0.000 description 8
- 238000004140 cleaning Methods 0.000 description 8
- 102000057593 human F8 Human genes 0.000 description 8
- 229940047431 recombinate Drugs 0.000 description 8
- 241000880493 Leptailurus serval Species 0.000 description 7
- 108010089804 glycyl-threonine Proteins 0.000 description 7
- 235000015097 nutrients Nutrition 0.000 description 7
- 239000011780 sodium chloride Substances 0.000 description 7
- 241000196324 Embryophyta Species 0.000 description 6
- 229920002684 Sepharose Polymers 0.000 description 6
- 239000007983 Tris buffer Substances 0.000 description 6
- 150000001413 amino acids Chemical class 0.000 description 6
- 230000001580 bacterial effect Effects 0.000 description 6
- 238000006243 chemical reaction Methods 0.000 description 6
- 239000011248 coating agent Substances 0.000 description 6
- 238000000576 coating method Methods 0.000 description 6
- 150000002460 imidazoles Chemical class 0.000 description 6
- 239000002904 solvent Substances 0.000 description 6
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 6
- 241001465754 Metazoa Species 0.000 description 5
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 5
- 235000013601 eggs Nutrition 0.000 description 5
- 108010050848 glycylleucine Proteins 0.000 description 5
- 210000003000 inclusion body Anatomy 0.000 description 5
- 108010064235 lysylglycine Proteins 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- PXIPVTKHYLBLMZ-UHFFFAOYSA-N Sodium azide Chemical compound [Na+].[N-]=[N+]=[N-] PXIPVTKHYLBLMZ-UHFFFAOYSA-N 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 4
- 108010092854 aspartyllysine Proteins 0.000 description 4
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 229920000136 polysorbate Polymers 0.000 description 4
- 239000013049 sediment Substances 0.000 description 4
- 108010061238 threonyl-glycine Proteins 0.000 description 4
- FRXSZNDVFUDTIR-UHFFFAOYSA-N 6-methoxy-1,2,3,4-tetrahydroquinoline Chemical compound N1CCCC2=CC(OC)=CC=C21 FRXSZNDVFUDTIR-UHFFFAOYSA-N 0.000 description 3
- 208000031295 Animal disease Diseases 0.000 description 3
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 3
- 241000193468 Clostridium perfringens Species 0.000 description 3
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 3
- WSEITRHJRVDTRX-QTKMDUPCSA-N His-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N)O WSEITRHJRVDTRX-QTKMDUPCSA-N 0.000 description 3
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 3
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 239000007853 buffer solution Substances 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 239000004744 fabric Substances 0.000 description 3
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 3
- 230000008676 import Effects 0.000 description 3
- 210000002429 large intestine Anatomy 0.000 description 3
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 230000002265 prevention Effects 0.000 description 3
- 238000004153 renaturation Methods 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 3
- 201000008827 tuberculosis Diseases 0.000 description 3
- 108010051110 tyrosyl-lysine Proteins 0.000 description 3
- QCVGEOXPDFCNHA-UHFFFAOYSA-N 5,5-dimethyl-2,4-dioxo-1,3-oxazolidine-3-carboxamide Chemical compound CC1(C)OC(=O)N(C(N)=O)C1=O QCVGEOXPDFCNHA-UHFFFAOYSA-N 0.000 description 2
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 2
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 2
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 2
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 2
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 2
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 2
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 2
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 2
- 241000193830 Bacillus <bacterium> Species 0.000 description 2
- 241000283707 Capra Species 0.000 description 2
- 102000002322 Egg Proteins Human genes 0.000 description 2
- 108010000912 Egg Proteins Proteins 0.000 description 2
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 2
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 2
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 2
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 2
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 2
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 2
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 2
- JWLWNCVBBSBCEM-NKIYYHGXSA-N His-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N)O JWLWNCVBBSBCEM-NKIYYHGXSA-N 0.000 description 2
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 2
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 2
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 2
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- CUXRXAIAVYLVFD-ULQDDVLXSA-N Leu-Arg-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUXRXAIAVYLVFD-ULQDDVLXSA-N 0.000 description 2
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 2
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 2
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 2
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 2
- RXWPLVRJQNWXRQ-IHRRRGAJSA-N Met-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 RXWPLVRJQNWXRQ-IHRRRGAJSA-N 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 2
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 2
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 2
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 2
- KCFKKAQKRZBWJB-ZLUOBGJFSA-N Ser-Cys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O KCFKKAQKRZBWJB-ZLUOBGJFSA-N 0.000 description 2
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 2
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 2
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 2
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 2
- UIIMBOGNXHQVGW-UHFFFAOYSA-M Sodium bicarbonate Chemical compound [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 description 2
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 2
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 2
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 2
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 2
- 241000723848 Tobamovirus Species 0.000 description 2
- KLQPIEVIKOQRAW-IZPVPAKOSA-N Tyr-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KLQPIEVIKOQRAW-IZPVPAKOSA-N 0.000 description 2
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 2
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 2
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 2
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 238000005336 cracking Methods 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 235000014103 egg white Nutrition 0.000 description 2
- 210000000969 egg white Anatomy 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 238000010828 elution Methods 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 2
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010038983 glycyl-histidyl-lysine Proteins 0.000 description 2
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 239000012535 impurity Substances 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 238000007689 inspection Methods 0.000 description 2
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 238000011068 loading method Methods 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 238000005374 membrane filtration Methods 0.000 description 2
- 239000000178 monomer Substances 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 230000001902 propagating effect Effects 0.000 description 2
- 238000004445 quantitative analysis Methods 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 229960005486 vaccine Drugs 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- ORFISQWLBSTJAE-KTHKBMNISA-N (2s)-2-[[(2s,3s)-2-[[(2s)-2-[[(2s)-1-[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-methylpentanoyl]amino]-4-methylpentanoic acid Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CCCN=C(N)N)C1=CC=C(O)C=C1 ORFISQWLBSTJAE-KTHKBMNISA-N 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- QMOQBVOBWVNSNO-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QMOQBVOBWVNSNO-UHFFFAOYSA-N 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 1
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 1
- MBWYUTNBYSSUIQ-HERUPUMHSA-N Ala-Asn-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N MBWYUTNBYSSUIQ-HERUPUMHSA-N 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- DXTYEWAQOXYRHZ-KKXDTOCCSA-N Ala-Phe-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DXTYEWAQOXYRHZ-KKXDTOCCSA-N 0.000 description 1
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 1
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- ANNKVZSFQJGVDY-XUXIUFHCSA-N Ala-Val-Pro-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ANNKVZSFQJGVDY-XUXIUFHCSA-N 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- MAISCYVJLBBRNU-DCAQKATOSA-N Arg-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N MAISCYVJLBBRNU-DCAQKATOSA-N 0.000 description 1
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 1
- FBLMOFHNVQBKRR-IHRRRGAJSA-N Arg-Asp-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FBLMOFHNVQBKRR-IHRRRGAJSA-N 0.000 description 1
- OANWAFQRNQEDSY-DCAQKATOSA-N Arg-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N OANWAFQRNQEDSY-DCAQKATOSA-N 0.000 description 1
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- JEXPNDORFYHJTM-IHRRRGAJSA-N Arg-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCN=C(N)N JEXPNDORFYHJTM-IHRRRGAJSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- JCROZIFVIYMXHM-GUBZILKMSA-N Arg-Met-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N JCROZIFVIYMXHM-GUBZILKMSA-N 0.000 description 1
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 1
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 1
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 1
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 1
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 1
- AUIJUTGLPVHIRT-FXQIFTODSA-N Arg-Ser-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AUIJUTGLPVHIRT-FXQIFTODSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 1
- OGZBJJLRKQZRHL-KJEVXHAQSA-N Arg-Thr-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OGZBJJLRKQZRHL-KJEVXHAQSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- QJWLLRZTJFPCHA-STECZYCISA-N Arg-Tyr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QJWLLRZTJFPCHA-STECZYCISA-N 0.000 description 1
- CNBIWSCSSCAINS-UFYCRDLUSA-N Arg-Tyr-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNBIWSCSSCAINS-UFYCRDLUSA-N 0.000 description 1
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 1
- FMYQECOAIFGQGU-CYDGBPFRSA-N Arg-Val-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMYQECOAIFGQGU-CYDGBPFRSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- HUZGPXBILPMCHM-IHRRRGAJSA-N Asn-Arg-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HUZGPXBILPMCHM-IHRRRGAJSA-N 0.000 description 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 1
- XXAOXVBAWLMTDR-ZLUOBGJFSA-N Asn-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N XXAOXVBAWLMTDR-ZLUOBGJFSA-N 0.000 description 1
- QRHYAUYXBVVDSB-LKXGYXEUSA-N Asn-Cys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QRHYAUYXBVVDSB-LKXGYXEUSA-N 0.000 description 1
- HJRBIWRXULGMOA-ACZMJKKPSA-N Asn-Gln-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJRBIWRXULGMOA-ACZMJKKPSA-N 0.000 description 1
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 1
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 1
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 1
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 1
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 1
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 1
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 1
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- MUWDILPCTSMUHI-ZLUOBGJFSA-N Asp-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)O MUWDILPCTSMUHI-ZLUOBGJFSA-N 0.000 description 1
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- VZNOVQKGJQJOCS-SRVKXCTJSA-N Asp-Asp-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VZNOVQKGJQJOCS-SRVKXCTJSA-N 0.000 description 1
- FTNVLGCFIJEMQT-CIUDSAMLSA-N Asp-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N FTNVLGCFIJEMQT-CIUDSAMLSA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- QHHVSXGWLYEAGX-GUBZILKMSA-N Asp-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QHHVSXGWLYEAGX-GUBZILKMSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 1
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 1
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 1
- HCOQNGIHSXICCB-IHRRRGAJSA-N Asp-Tyr-Arg Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)O HCOQNGIHSXICCB-IHRRRGAJSA-N 0.000 description 1
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 1
- PRVVCRZLTJNPCS-FXQIFTODSA-N Cys-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N PRVVCRZLTJNPCS-FXQIFTODSA-N 0.000 description 1
- OIMUAKUQOUEPCZ-WHFBIAKZSA-N Cys-Asn-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIMUAKUQOUEPCZ-WHFBIAKZSA-N 0.000 description 1
- SMYXEYRYCLIPIL-ZLUOBGJFSA-N Cys-Cys-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O SMYXEYRYCLIPIL-ZLUOBGJFSA-N 0.000 description 1
- LWTTURISBKEVAC-CIUDSAMLSA-N Cys-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N LWTTURISBKEVAC-CIUDSAMLSA-N 0.000 description 1
- RFHGRMMADHHQSA-KBIXCLLPSA-N Cys-Gln-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RFHGRMMADHHQSA-KBIXCLLPSA-N 0.000 description 1
- KABHAOSDMIYXTR-GUBZILKMSA-N Cys-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N KABHAOSDMIYXTR-GUBZILKMSA-N 0.000 description 1
- AOZBJZBKFHOYHL-AVGNSLFASA-N Cys-Glu-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O AOZBJZBKFHOYHL-AVGNSLFASA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 1
- VDUPGIDTWNQAJD-CIUDSAMLSA-N Cys-Lys-Cys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O VDUPGIDTWNQAJD-CIUDSAMLSA-N 0.000 description 1
- MXZYQNJCBVJHSR-KATARQTJSA-N Cys-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)O MXZYQNJCBVJHSR-KATARQTJSA-N 0.000 description 1
- QVLKXRMFNGHDRO-FXQIFTODSA-N Cys-Met-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O QVLKXRMFNGHDRO-FXQIFTODSA-N 0.000 description 1
- GFMJUESGWILPEN-MELADBBJSA-N Cys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CS)N)C(=O)O GFMJUESGWILPEN-MELADBBJSA-N 0.000 description 1
- XBELMDARIGXDKY-GUBZILKMSA-N Cys-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CS)N XBELMDARIGXDKY-GUBZILKMSA-N 0.000 description 1
- ZGERHCJBLPQPGV-ACZMJKKPSA-N Cys-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N ZGERHCJBLPQPGV-ACZMJKKPSA-N 0.000 description 1
- HRMMVZISPQOKMU-KKUMJFAQSA-N Cys-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CS)N)O HRMMVZISPQOKMU-KKUMJFAQSA-N 0.000 description 1
- 241000305071 Enterobacterales Species 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 1
- WVUZERSNWGUKJY-BPUTZDHNSA-N Gln-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N WVUZERSNWGUKJY-BPUTZDHNSA-N 0.000 description 1
- YXQCLIVLWCKCRS-RYUDHWBXSA-N Gln-Gly-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N)O YXQCLIVLWCKCRS-RYUDHWBXSA-N 0.000 description 1
- GLAPJAHOPFSLKL-SRVKXCTJSA-N Gln-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N GLAPJAHOPFSLKL-SRVKXCTJSA-N 0.000 description 1
- GIVHPCWYVWUUSG-HVTMNAMFSA-N Gln-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GIVHPCWYVWUUSG-HVTMNAMFSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 1
- ILKYYKRAULNYMS-JYJNAYRXSA-N Gln-Lys-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ILKYYKRAULNYMS-JYJNAYRXSA-N 0.000 description 1
- DSRVQBZAMPGEKU-AVGNSLFASA-N Gln-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DSRVQBZAMPGEKU-AVGNSLFASA-N 0.000 description 1
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- UTKICHUQEQBDGC-ACZMJKKPSA-N Glu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UTKICHUQEQBDGC-ACZMJKKPSA-N 0.000 description 1
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 1
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- SOEPMWQCTJITPZ-SRVKXCTJSA-N Glu-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N SOEPMWQCTJITPZ-SRVKXCTJSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- PYUCNHJQQVSPGN-BQBZGAKWSA-N Gly-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)CN=C(N)N PYUCNHJQQVSPGN-BQBZGAKWSA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 1
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 1
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- MVORZMQFXBLMHM-QWRGUYRKSA-N Gly-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 MVORZMQFXBLMHM-QWRGUYRKSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- OJNZVYSGVYLQIN-BQBZGAKWSA-N Gly-Met-Asp Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O OJNZVYSGVYLQIN-BQBZGAKWSA-N 0.000 description 1
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- ISSDODCYBOWWIP-GJZGRUSLSA-N Gly-Pro-Trp Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISSDODCYBOWWIP-GJZGRUSLSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- 101710154606 Hemagglutinin Proteins 0.000 description 1
- XINDHUAGVGCNSF-QSFUFRPTSA-N His-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XINDHUAGVGCNSF-QSFUFRPTSA-N 0.000 description 1
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 1
- ZIMTWPHIKZEHSE-UWVGGRQHSA-N His-Arg-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O ZIMTWPHIKZEHSE-UWVGGRQHSA-N 0.000 description 1
- UCDWNBFOZCZSNV-AVGNSLFASA-N His-Arg-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O UCDWNBFOZCZSNV-AVGNSLFASA-N 0.000 description 1
- TTZAWSKKNCEINZ-AVGNSLFASA-N His-Arg-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O TTZAWSKKNCEINZ-AVGNSLFASA-N 0.000 description 1
- MDBYBTWRMOAJAY-NHCYSSNCSA-N His-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MDBYBTWRMOAJAY-NHCYSSNCSA-N 0.000 description 1
- QQQHYJFKDLDUNK-CIUDSAMLSA-N His-Asp-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N QQQHYJFKDLDUNK-CIUDSAMLSA-N 0.000 description 1
- HQKADFMLECZIQJ-HVTMNAMFSA-N His-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N HQKADFMLECZIQJ-HVTMNAMFSA-N 0.000 description 1
- GHAFKUCRIVBLDJ-IHRRRGAJSA-N His-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N GHAFKUCRIVBLDJ-IHRRRGAJSA-N 0.000 description 1
- BZKDJRSZWLPJNI-SRVKXCTJSA-N His-His-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O BZKDJRSZWLPJNI-SRVKXCTJSA-N 0.000 description 1
- MFQVZYSPCIZFMR-MGHWNKPDSA-N His-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N MFQVZYSPCIZFMR-MGHWNKPDSA-N 0.000 description 1
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 1
- UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 1
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 1
- FCPSGEVYIVXPPO-QTKMDUPCSA-N His-Thr-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FCPSGEVYIVXPPO-QTKMDUPCSA-N 0.000 description 1
- JVEKQAYXFGIISZ-HOCLYGCPSA-N His-Trp-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JVEKQAYXFGIISZ-HOCLYGCPSA-N 0.000 description 1
- FBOMZVOKCZMDIG-XQQFMLRXSA-N His-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBOMZVOKCZMDIG-XQQFMLRXSA-N 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 1
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 1
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 1
- JRYQSFOFUFXPTB-RWRJDSDZSA-N Ile-Gln-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N JRYQSFOFUFXPTB-RWRJDSDZSA-N 0.000 description 1
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 1
- QRTVJGKXFSYJGW-KBIXCLLPSA-N Ile-Glu-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N QRTVJGKXFSYJGW-KBIXCLLPSA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 1
- KIAOPHMUNPPGEN-PEXQALLHSA-N Ile-Gly-His Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KIAOPHMUNPPGEN-PEXQALLHSA-N 0.000 description 1
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 1
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 1
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 1
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- USXAYNCLFSUSBA-MGHWNKPDSA-N Ile-Phe-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N USXAYNCLFSUSBA-MGHWNKPDSA-N 0.000 description 1
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 1
- BJECXJHLUJXPJQ-PYJNHQTQSA-N Ile-Pro-His Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N BJECXJHLUJXPJQ-PYJNHQTQSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- JCGMFFQQHJQASB-PYJNHQTQSA-N Ile-Val-His Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O JCGMFFQQHJQASB-PYJNHQTQSA-N 0.000 description 1
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- XYUBOFCTGPZFSA-WDSOQIARSA-N Leu-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 XYUBOFCTGPZFSA-WDSOQIARSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- FOEHRHOBWFQSNW-KATARQTJSA-N Leu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N)O FOEHRHOBWFQSNW-KATARQTJSA-N 0.000 description 1
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 1
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- ORWTWZXGDBYVCP-BJDJZHNGSA-N Leu-Ile-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(C)C ORWTWZXGDBYVCP-BJDJZHNGSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 1
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 1
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- QQXJROOJCMIHIV-AVGNSLFASA-N Leu-Val-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O QQXJROOJCMIHIV-AVGNSLFASA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 1
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 1
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- KSFQPRLZAUXXPT-GARJFASQSA-N Lys-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)C(=O)O KSFQPRLZAUXXPT-GARJFASQSA-N 0.000 description 1
- DZQYZKPINJLLEN-KKUMJFAQSA-N Lys-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O DZQYZKPINJLLEN-KKUMJFAQSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 1
- SQJSXOQXJYAVRV-SRVKXCTJSA-N Lys-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N SQJSXOQXJYAVRV-SRVKXCTJSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 1
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 1
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- VWJFOUBDZIUXGA-AVGNSLFASA-N Lys-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N VWJFOUBDZIUXGA-AVGNSLFASA-N 0.000 description 1
- XBAJINCXDBTJRH-WDSOQIARSA-N Lys-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N XBAJINCXDBTJRH-WDSOQIARSA-N 0.000 description 1
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 1
- DJBCKVNHEIJLQA-GMOBBJLQSA-N Met-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCSC)N DJBCKVNHEIJLQA-GMOBBJLQSA-N 0.000 description 1
- CAEZLMGDJMEBKP-AVGNSLFASA-N Met-Pro-His Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC=N1 CAEZLMGDJMEBKP-AVGNSLFASA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- RIIFMEBFDDXGCV-VEVYYDQMSA-N Met-Thr-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O RIIFMEBFDDXGCV-VEVYYDQMSA-N 0.000 description 1
- IQJMEDDVOGMTKT-SRVKXCTJSA-N Met-Val-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IQJMEDDVOGMTKT-SRVKXCTJSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 101710141454 Nucleoprotein Proteins 0.000 description 1
- 102000011931 Nucleoproteins Human genes 0.000 description 1
- 108010061100 Nucleoproteins Proteins 0.000 description 1
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 1
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 1
- 208000007634 Peste-des-Petits-Ruminants Diseases 0.000 description 1
- VHWOBXIWBDWZHK-IHRRRGAJSA-N Phe-Arg-Asp Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHWOBXIWBDWZHK-IHRRRGAJSA-N 0.000 description 1
- UEEVBGHEGJMDDV-AVGNSLFASA-N Phe-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEEVBGHEGJMDDV-AVGNSLFASA-N 0.000 description 1
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 1
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 1
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 1
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 1
- XMQSOOJRRVEHRO-ULQDDVLXSA-N Phe-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMQSOOJRRVEHRO-ULQDDVLXSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 1
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 1
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 1
- BSJCSHIAMSGQGN-BVSLBCMMSA-N Phe-Pro-Trp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O BSJCSHIAMSGQGN-BVSLBCMMSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 1
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 1
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 1
- 108010089430 Phosphoproteins Proteins 0.000 description 1
- 102000007982 Phosphoproteins Human genes 0.000 description 1
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 1
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 1
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 description 1
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 1
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 1
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- FZXSYIPVAFVYBH-KKUMJFAQSA-N Pro-Tyr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O FZXSYIPVAFVYBH-KKUMJFAQSA-N 0.000 description 1
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- 101710176177 Protein A56 Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 1
- COAHUSQNSVFYBW-FXQIFTODSA-N Ser-Asn-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O COAHUSQNSVFYBW-FXQIFTODSA-N 0.000 description 1
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 1
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- WTPKKLMBNBCCNL-ACZMJKKPSA-N Ser-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N WTPKKLMBNBCCNL-ACZMJKKPSA-N 0.000 description 1
- SWIQQMYVHIXPEK-FXQIFTODSA-N Ser-Cys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O SWIQQMYVHIXPEK-FXQIFTODSA-N 0.000 description 1
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- UICKAKRRRBTILH-GUBZILKMSA-N Ser-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N UICKAKRRRBTILH-GUBZILKMSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- ZFVFHHZBCVNLGD-GUBZILKMSA-N Ser-His-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFVFHHZBCVNLGD-GUBZILKMSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- HJAXVYLCKDPPDF-SRVKXCTJSA-N Ser-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N HJAXVYLCKDPPDF-SRVKXCTJSA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 1
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 1
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 1
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 1
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 1
- 241001428894 Small ruminant morbillivirus Species 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- 108700005078 Synthetic Genes Proteins 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 1
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 1
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 1
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 1
- ODSAPYVQSLDRSR-LKXGYXEUSA-N Thr-Cys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O ODSAPYVQSLDRSR-LKXGYXEUSA-N 0.000 description 1
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- ZBKDBZUTTXINIX-RWRJDSDZSA-N Thr-Ile-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZBKDBZUTTXINIX-RWRJDSDZSA-N 0.000 description 1
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- NBIIPOKZPUGATB-BWBBJGPYSA-N Thr-Ser-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O NBIIPOKZPUGATB-BWBBJGPYSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- AXEJRUGTOJPZKG-XGEHTFHBSA-N Thr-Val-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O AXEJRUGTOJPZKG-XGEHTFHBSA-N 0.000 description 1
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 241000218636 Thuja Species 0.000 description 1
- 241000710914 Totivirus Species 0.000 description 1
- TWJDQTTXXZDJKV-BPUTZDHNSA-N Trp-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O TWJDQTTXXZDJKV-BPUTZDHNSA-N 0.000 description 1
- KOVOKXBHGVXQMG-BPUTZDHNSA-N Trp-Cys-Met Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(O)=O)=CNC2=C1 KOVOKXBHGVXQMG-BPUTZDHNSA-N 0.000 description 1
- HXMJXDNSFVNSEH-IHPCNDPISA-N Trp-Cys-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N HXMJXDNSFVNSEH-IHPCNDPISA-N 0.000 description 1
- UIRPULWLRODAEQ-QEJZJMRPSA-N Trp-Ser-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 UIRPULWLRODAEQ-QEJZJMRPSA-N 0.000 description 1
- SJWLQICJOBMOGG-PMVMPFDFSA-N Trp-Tyr-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)N[C@@H](CC4=CN=CN4)C(=O)O)N SJWLQICJOBMOGG-PMVMPFDFSA-N 0.000 description 1
- YTHWAWACWGWBLE-MNSWYVGCSA-N Trp-Tyr-Thr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 YTHWAWACWGWBLE-MNSWYVGCSA-N 0.000 description 1
- MXKUGFHWYYKVDV-SZMVWBNQSA-N Trp-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(C)C)C(O)=O MXKUGFHWYYKVDV-SZMVWBNQSA-N 0.000 description 1
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 1
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 1
- ZNFPUOSTMUMUDR-JRQIVUDYSA-N Tyr-Asn-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZNFPUOSTMUMUDR-JRQIVUDYSA-N 0.000 description 1
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 1
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 1
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 1
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 1
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 1
- YMUQBRQQCPQEQN-CXTHYWKRSA-N Tyr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YMUQBRQQCPQEQN-CXTHYWKRSA-N 0.000 description 1
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 1
- WOAQYWUEUYMVGK-ULQDDVLXSA-N Tyr-Lys-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOAQYWUEUYMVGK-ULQDDVLXSA-N 0.000 description 1
- FASACHWGQBNSRO-ZEWNOJEFSA-N Tyr-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FASACHWGQBNSRO-ZEWNOJEFSA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- XOVDRAVPGHTYLP-JYJNAYRXSA-N Tyr-Pro-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O XOVDRAVPGHTYLP-JYJNAYRXSA-N 0.000 description 1
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 1
- JHDZONWZTCKTJR-KJEVXHAQSA-N Tyr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JHDZONWZTCKTJR-KJEVXHAQSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 1
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 1
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- NWDOPHYLSORNEX-QXEWZRGKSA-N Val-Asn-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N NWDOPHYLSORNEX-QXEWZRGKSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 1
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 1
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- CHWRZUGUMAMTFC-IHRRRGAJSA-N Val-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CNC=N1 CHWRZUGUMAMTFC-IHRRRGAJSA-N 0.000 description 1
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- QPPZEDOTPZOSEC-RCWTZXSCSA-N Val-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N)O QPPZEDOTPZOSEC-RCWTZXSCSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 1
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 1
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 1
- HVRRJRMULCPNRO-BZSNNMDCSA-N Val-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 HVRRJRMULCPNRO-BZSNNMDCSA-N 0.000 description 1
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 1
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- 108010081404 acein-2 Proteins 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 238000009835 boiling Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 239000011545 carbonate/bicarbonate buffer Substances 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000000875 corresponding effect Effects 0.000 description 1
- 108010069495 cysteinyltyrosine Proteins 0.000 description 1
- 239000008367 deionised water Substances 0.000 description 1
- 229910021641 deionized water Inorganic materials 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- BNIILDVGGAEEIG-UHFFFAOYSA-L disodium hydrogen phosphate Chemical compound [Na+].[Na+].OP([O-])([O-])=O BNIILDVGGAEEIG-UHFFFAOYSA-L 0.000 description 1
- 229910000397 disodium phosphate Inorganic materials 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 1
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 101150118163 h gene Proteins 0.000 description 1
- XLYOFNOQVPJJNP-ZSJDYOACSA-N heavy water Substances [2H]O[2H] XLYOFNOQVPJJNP-ZSJDYOACSA-N 0.000 description 1
- 239000000185 hemagglutinin Substances 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- 239000012160 loading buffer Substances 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- -1 nitrite ions Chemical class 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 230000012743 protein tagging Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 238000001073 sample cooling Methods 0.000 description 1
- 238000007789 sealing Methods 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 238000003307 slaughter Methods 0.000 description 1
- 229910000030 sodium bicarbonate Inorganic materials 0.000 description 1
- CDBYLPFSWZWCQE-UHFFFAOYSA-L sodium carbonate Substances [Na+].[Na+].[O-]C([O-])=O CDBYLPFSWZWCQE-UHFFFAOYSA-L 0.000 description 1
- 229910000029 sodium carbonate Inorganic materials 0.000 description 1
- AJPJDKMHJJGVTQ-UHFFFAOYSA-M sodium dihydrogen phosphate Chemical compound [Na+].OP(O)([O-])=O AJPJDKMHJJGVTQ-UHFFFAOYSA-M 0.000 description 1
- 229910000162 sodium phosphate Inorganic materials 0.000 description 1
- POECFFCNUXZPJT-UHFFFAOYSA-M sodium;carbonic acid;hydrogen carbonate Chemical compound [Na+].OC(O)=O.OC([O-])=O POECFFCNUXZPJT-UHFFFAOYSA-M 0.000 description 1
- 238000010183 spectrum analysis Methods 0.000 description 1
- 238000005728 strengthening Methods 0.000 description 1
- 238000004448 titration Methods 0.000 description 1
- 108010060175 trypsinogen activation peptide Proteins 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 239000012498 ultrapure water Substances 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/08—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from viruses
- C07K16/10—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from viruses from RNA viruses
- C07K16/1027—Paramyxoviridae, e.g. respiratory syncytial virus
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/569—Immunoassay; Biospecific binding assay; Materials therefor for microorganisms, e.g. protozoa, bacteria, viruses
- G01N33/56983—Viruses
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2760/00—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA ssRNA viruses negative-sense
- C12N2760/00011—Details
- C12N2760/18011—Paramyxoviridae
- C12N2760/18411—Morbillivirus, e.g. Measles virus, canine distemper
- C12N2760/18422—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2333/00—Assays involving biological materials from specific organisms or of a specific nature
- G01N2333/005—Assays involving biological materials from specific organisms or of a specific nature from viruses
- G01N2333/08—RNA viruses
- G01N2333/115—Paramyxoviridae, e.g. parainfluenza virus
- G01N2333/12—Mumps virus; Measles virus
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Virology (AREA)
- Biomedical Technology (AREA)
- Immunology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Medicinal Chemistry (AREA)
- Biotechnology (AREA)
- Zoology (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Urology & Nephrology (AREA)
- Hematology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Wood Science & Technology (AREA)
- Cell Biology (AREA)
- Tropical Medicine & Parasitology (AREA)
- Pulmonology (AREA)
- Plant Pathology (AREA)
- Gastroenterology & Hepatology (AREA)
- Food Science & Technology (AREA)
- Analytical Chemistry (AREA)
- General Physics & Mathematics (AREA)
- Pathology (AREA)
- Peptides Or Proteins (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The invention discloses PPR virus H F fusion proteins and its relevant biological material and application.PPR virus H F fusion proteins are protein a) or b):A) protein being made up of SEQ ID No.2 amino acid sequence;B) soluble protein that substitution and/or missing and/or addition by the amino acid sequence shown in SEQ ID No.2 by one or several amino acid residues is obtained.The method for the indirect ELISA detection PPR virus antibody that PPR virus H F fusion proteins are set up as envelope antigen has higher specificity, sensitiveness and good accuracy, and quick, the easy operation of energy, be conducive to the monitoring clinically to PPR.
Description
Technical field
The present invention relates to PPR virus H-F fusion proteins in biological technical field and its relevant biological material with answering
With.
Background technology
PPR (Peste des Petits Ruminants, PPR) is by PPR virus (Peste
Des Petits Ruminants Virus, PPRV) caused by a kind of acute, hot, contagious disease, with high incidence
And the characteristics of high mortality.At present, it there is no effective ways to treat PPR, can only take after vaccine inoculation, epidemic situation generation
The method for slaughtering and strengthening regular sero monitoring controls the disease.Most common report is with PPRV N, H eggs after purification both at home and abroad
(total length) or the part comprising N protein Main Antigenic recombinate N, H protein fragment as envelope antigen in vain, also have with concentration
PPRV totivirus sets up indirect ELISA method detection PPR serum antibodies (Qiu Wenying, 2011 as envelope antigen;
Balamurugan et al., 2007), but there is not yet set up indirect ELISA side using H-F fused in tandem albumen as envelope antigen
The report of method.
PPRV genomes encode 6 kinds of structural proteins, i.e. nucleoprotein (N), phosphoprotein (P), polymerase large protein (L), base altogether
Matter albumen (M), fusion protein (F) and hemagglutinin (H).Wherein, H gene total length 1852bp, by 609 amino acid encodings,
Molecular weight is 68kDa.F full length gene 2321bp, containing an ORF, F protein contains 546 amino acid, and molecular weight is about
59kDa, it is highly conserved in numb epidemic disease Tobamovirus.F full length gene 2321bp, containing an ORF, F protein contains 546 amino
Acid, molecular weight is about 59kDa, highly conserved in numb epidemic disease Tobamovirus.The antigenicity of H and F protein is stable, in virus infection
Antibody in animal blood serum for H, F fusion protein is occupied an leading position, and is two good target genes as diagnostic antigen.But
It is, PPR H, the usual inclusion bodies with insoluble monovalent albumen of F protein expression product of majority research expression
In the presence of with soluble H-F fusion-activities protein expression, there is not been reported.The expression of the inclusion body protein of unit price is due to existing
Expression quantity is not enough and space structure has false folding, it is impossible to forms high protein structure and natural space conformation epitope, leads
It, which is caused, as diagnostic antigen can not form immune predicting space epitope well.Meanwhile, the expression product in monovalent inclusion body does not have
There is biological activity, thus need to be denatured and renaturation process.The denaturation of albumen and renaturation are an extremely complex processes,
The denaturing conditions of different albumen are different, and renaturation yield is difficult often to improve.This is the main restricting factor for limiting its application.
The content of the invention
A technical problem to be solved by this invention be how clinical quick detection PPR.
In order to solve the above technical problems, the invention provides the restructuring PPR virus H-F of solubility expression fusions
Albumen and its relevant biological material.
In order to solve the above technical problems, the invention provides restructuring PPR virus H-F fusion proteins a) or b):
A) protein being made up of SEQ ID No.2 amino acid sequence;
B) by the amino acid sequence shown in SEQ ID No.2 is by the substitution of one or several amino acid residues and/or lacks
Lose and/or add obtained soluble protein.
In above-mentioned protein, protein name a) is referred to as rmHF1-Y (also known as rmHF1), is the PPRV H-F of restructuring series connection
Albumen.SEQ ID No.2 are made up of 1184 amino acid residues.
In above-mentioned protein, the protein can be prepared according to the method comprised the following steps:Make the volume of the protein
Code gene in biology express obtaining the protein;It is described biological for microorganism, plant or non-human animal.
In above-mentioned protein, the encoding gene of the protein is carried out expression in biology is included described protein
Encoding gene import recipient microorganism, obtain expressing the recombinant microorganism of the protein, cultivate the recombinant microorganism, table
Reach the protein.
Any of in above-mentioned protein, the recipient microorganism can be C1)-C4):
C1) prokaryotic micro-organisms;
C2) gramnegative bacterium;
C3) Escherichia bacteria;
C4) e. coli bl21 (DE3).
In above-mentioned protein, the encoding gene of the protein for it is following 1) or 2) shown in DNA molecular:
1) coded sequence is the DNA molecular shown in SEQ ID No.1 4-3558;
2) DNA molecular of the DNA molecular with more than 90% homogeneity and coding rmHF1-Y with 1) limiting.
Wherein, SEQ ID No.1 are made up of 3564 nucleotides, and SEQ ID No.1 4-3558 are rmHF1-Y genes
Nucleotide sequence, the rmHF1-Y gene coding amino acid sequences shown in SEQ ID No.1 4-3558 are SEQ ID
No.2 protein rmHF1-Y.
In above-mentioned protein, 2) concretely nucleotide sequence and 1) have at least 91%, 92%, 95%, 96%, 98%
Or the DNA molecular of 99% homogeneity.
In above-mentioned protein, the recombinant microorganism is that pET30a-rmHF1-Y is imported into e. coli bl21 (DE3) to obtain
To express amino acid sequence be SEQ ID No.2 protein recombinant microorganism, the recombinant microorganism is named as BL21
(DE3)/pET30a-rmHF1-Y, the pET30a-rmHF1-Y are to replace pET30a (+) with the DNA shown in SEQ ID No.1
Nde I and XhoI recognition sites between fragment (small fragment including Nde I recognition sites and XhoI recognition sites), protect
The other sequences for holding pET30a (+) are constant, obtained rmHF1-Y gene recombinant vectors.
Described to be expressed as induced expression in above-mentioned protein, the induced expression is to be lured with 0.75mM IPTG at 16 DEG C
Lead 13-16 hours or 13-24 hours or 13 hours or 16 hours.
Protection scope of the present invention is fallen within the biomaterial of the albumen qualitative correlation, the biomaterial can be following
Any of B1) to B16):
B1) the nucleic acid molecules of code for said proteins;
B2 B1) is contained) expression cassettes of the nucleic acid molecules;
B3 B1) is contained) recombinant vectors of the nucleic acid molecules;
B4 B2) is contained) recombinant vector of the expression cassette;
B5 B1) is contained) recombinant microorganisms of the nucleic acid molecules;
B6 B2) is contained) recombinant microorganism of the expression cassette;
B7 B3) is contained) recombinant microorganism of the recombinant vector;
B8 B4) is contained) recombinant microorganism of the recombinant vector;
B9 B1) is contained) the transgenetic animal cell systems of the nucleic acid molecules;
B10 B2) is contained) the transgenetic animal cell system of the expression cassette;
B11 B3) is contained) the transgenetic animal cell system of the recombinant vector;
B12 B4) is contained) the transgenetic animal cell system of the recombinant vector;
B13 B1) is contained) the transgenic plant cells systems of the nucleic acid molecules;
B14 B2) is contained) the transgenic plant cells system of the expression cassette;
B15 B3) is contained) the transgenic plant cells system of the recombinant vector;
B16 B4) is contained) the transgenic plant cells system of the recombinant vector.
Wherein, the nucleic acid molecules can be DNA, such as cDNA, genomic DNA or recombinant DNA;The nucleic acid molecules also may be used
To be RNA, such as mRNA or hnRNA.
Herein, term " homogeneity " refers to the sequence similarity with native sequence nucleic acid.Homogeneity can with the naked eye or meter
Calculation machine software is evaluated.Using computer software, the homogeneity between two or more sequences can use percentage (%) table
Show, it can be for the homogeneity between evaluation correlated series.
In above-mentioned biomaterial, the nucleic acid molecules can for it is following 1) or 2) shown in DNA molecular:
1) coded sequence is the DNA molecular shown in SEQ ID No.1 4-3558;
2) DNA molecular of the DNA molecular with more than 90% homogeneity and coding rmHF1-Y with 1) limiting.
Wherein, SEQ ID No.1 are made up of 3564 nucleotides, and SEQ ID No.1 4-3558 are rmHF1-Y genes
Nucleotide sequence, the rmHF1-Y gene coding amino acid sequences shown in SEQ ID No.1 4-3558 are SEQ ID
No.2 protein rmHF1-Y.
In above-mentioned biomaterial, 2) concretely nucleotide sequence and 1) have at least 91%, 92%, 95%, 96%,
The DNA molecular of 98% or 99% homogeneity.
In above-mentioned biomaterial, the recombinant vector can be the pET30a-rmHF1-Y;
The recombinant microorganism can be E1) or E2):
E1) recombinant microorganism is that the encoding gene of described protein is imported into recipient microorganism, obtains expressing institute
Any of the recombinant microorganism of protein is stated, the recipient microorganism is C1)-C4):
C1) prokaryotic micro-organisms;
C2) gramnegative bacterium;
C3) Escherichia bacteria;
C4) e. coli bl21 (DE3).
E2) recombinant microorganism is that the pET30a-rmHF1-Y is imported into the table that e. coli bl21 (DE3) is obtained
It is the recombinant microorganism of SEQ ID No.2 protein up to amino acid sequence.
In above-mentioned biomaterial, B9) to B16) it may include that propagating materials also may not include propagating materials.
D1 application) or D2) falls within protection scope of the present invention:
D1) application in PPR diagnostic antigen is prepared of the protein or the biomaterial, preparing inspection
The application or the application in PPR diagnostic kit is prepared surveyed in the kit of PPR virus antibody;
D2) application of the protein in monoclonal antibody is prepared.
The present invention carries out PPR virus H and F protein gene to merge after series connection in large intestine bar after codon optimization
Soluble fusion overexpression protection original rmHF1-Y (restructuring PPR virus H-F fusion proteins) is obtained in bacterium:The present invention is used
DNA shown in SEQ ID No.1 replaces fragment (including the Nde I identifications between pET30a (+) Nde I and XhoI recognition sites
Small fragment including site and XhoI recognition sites), keep pET30a (+) other sequences constant, obtained rmHF1-Y genes
Recombinant expression carrier pET30a-rmHF1-Y, will
PET30a-rmHF1-Y imports the destination protein rmHF1 that e. coli bl21 (DE3) obtains solubility.With
0.75mM IPTG is induced 16 hours at 16 DEG C, and rmHF1 content reaches the 45% of bacterial protein, the rmHF1 63% of expression
It is solvable.
It is demonstrated experimentally that detecting PPR virus antibody using rmHF1-Y as the indirect ELISA that envelope antigen is set up
Method has higher specificity, sensitiveness and good accuracy, and quick, the easy operation of energy, and it is clinically right to be conducive to
The monitoring of PPR.Using the rmHF1-Y indirect ELISA detection PPR virus antibody set up as envelope antigen
Method and France ID-VET PPR antibody assay kit coincidence rate are 94.5%, and to recombinate PPRV H proteins
The method for the indirect ELISA detection PPR virus antibody that rmH is set up as envelope antigen is small anti-with France ID-VET's
The coincidence rate of hay epizootic disease antibody assay kit only has 83.3%, is set up as envelope antigen using to recombinate PPRV F proteins rmF
The method and France ID-VET PPR antibody assay kit of indirect ELISA detection PPR virus antibody
Coincidence rate only has 72.6%.Illustrate to resist rmHF1-Y as the indirect ELISA detection PPR virus that envelope antigen is set up
The method of body and the coincidence rate of France ID-VET PPR antibody assay kit are significantly higher than to recombinate PPRV H eggs
The indirect ELISA that white rmH is set up as envelope antigen detects the method for PPR virus antibody and to recombinate PPRV F eggs
The method that the indirect ELISA that white rmF is set up as envelope antigen detects PPR virus antibody.It regard rmHF1-Y as bag
The sensitiveness of the method for the indirect ELISA detection PPR virus antibody set up by antigen is significantly higher than to recombinate PPRV H
The indirect ELISA that albumen rmH is set up as envelope antigen detects the method for PPR virus antibody and to recombinate PPRV F
The method that the indirect ELISA that albumen rmF is set up as envelope antigen detects PPR virus antibody.Using rmHF1-Y as
The method for the indirect ELISA detection PPR virus antibody that envelope antigen is set up cannot be only used for examining for PPR
It is disconnected, and evaluable immune effect of vaccine, in addition, this method can fast and accurately detect PPR, so that for China more
The propagation of good control PPR will produce positive role.
Brief description of the drawings
Fig. 1 composes for the SDS-PAGE of each bacterial strain expressing protein.
In figure, M is Marker, from top to bottom respectively 180kDa, 150kDa, 130kDa, 92kDa, 62kDa, 43kDa,
26kDa、10kDa;1st, the recipient bacterium whole bacterial protein liquid of induced expression, 2, BL21 (DE3)/pET30a- of induced expression
RmHF1-Y contains albumen supernatant, 3, BL21 (DE3)/pET30a-rmHF1-Y of induced expression contain albumen precipitation, 4, induced expression
BL21 (DE3)/pET30a-rmHF1-Y whole bacterial protein liquid, 5, the supernatant soluble protein after ni-sepharose purification, 6, molecule
Sieve supernatant soluble protein after purification, 7, BL21 (DE3)/pET30a-rmHF1-W whole bacterial protein liquid of induced expression, 8,
The BL21 (DE3) of induced expression/pET30a-rmHF2-Y whole bacterial protein liquid.
Molecular sieve purification identifications and Structural Identification of the Fig. 2 for recombinant protein rmHF1-Y.The signified purpose for purifying of arrow
Protein peak.
Embodiment
The present invention is further described in detail with reference to embodiment, the embodiment provided is only for explaining
The bright present invention, the scope being not intended to be limiting of the invention.Experimental method in following embodiments, unless otherwise specified, be
Conventional method.Material, reagent used etc., unless otherwise specified, are commercially obtained in following embodiments.
PET30a (+) is Novagen Products.PET28a (+) is Novagen Products.
Embodiment 1, solubility expression rmHF1
1st, synthetic gene
The application devises 3 kinds of PPR virus H and F protein fusion, respectively SEQ ID No.1 4-
The rmHF2-Y genes and SEQ ID No.5 shown in rmHF1-Y genes, SEQ ID No.3 1-3684 shown in 3558
RmHF1-W genes shown in 4-3558.RmHF1-Y genes and rmHF2-Y genes differing only on nucleotide sequence
5' ends are different, and SEQ ID No.1 24-3564 are identical with SEQ ID No.3 150-3690 nucleotides.
RmHF1-Y genes and rmHF1-W genes encode protein rmHF1, the rmHF2-Y base shown in SEQ ID No.2
Because of the protein rmHF2 shown in coding SEQ ID No.3.RmHF1 and rmHF2 differ only in amino on amino acid sequence
End is different, and SEQ ID No.2 8-1184 are identical with SEQ ID No.3 51-1227 amino acids residues.
2nd, the structure of recombinant expression carrier and recombinant bacterium
With shown in SEQ ID No.1 DNA replace pET30a (+) Nde I and XhoI recognition sites between fragment (including
Small fragment including Nde I recognition sites and XhoI recognition sites), keep pET30a (+) other sequences constant, obtain
RmHF1-Y gene recombinant vectors, are named as pET30a-rmHF1-Y.PET30a-rmHF1-Y contains rmHF1-Y genes,
The nucleotide sequence of rmHF1-Y genes is SEQ ID No.1 4-3558 nucleotides, rmHF1-Y gene code SEQ ID
Protein rmHF1 shown in No.2.
With shown in SEQ ID No.5 DNA replace pET30a (+) Nde I and XhoI recognition sites between fragment (including
Small fragment including Nde I recognition sites and XhoI recognition sites), keep pET30a (+) other sequences constant, obtain
RmHF1-W gene recombinant vectors, are named as pET30a-rmHF1-W.PET30a-rmHF1-W contains rmHF1-W genes,
The nucleotide sequence of rmHF1-W genes is SEQ ID No.5 4-3558 nucleotides, rmHF1-W gene code SEQ ID
Protein rmHF1 shown in No.2.
PET30a (+) BamHI and XhoI recognition sites are replaced with the DNA shown in SEQ ID No.3 145-3690
Between fragment (small fragment including BamHI recognition sites and XhoI recognition sites), keep pET30a (+) other sequences
It is constant, rmHF2-Y gene recombinant vectors are obtained, pET30a-rmHF2-Y is named as.PET30a-rmHF2-Y contains
RmHF2-Y genes, the nucleotide sequence of rmHF2-Y genes is SEQ ID No.3 1-3684 nucleotides, rmHF2-Y bases
Because of the protein rmHF2 shown in coding SEQ ID No.4.
PET30a-rmHF1-Y, pET30a-rmHF1-W and pET30a-rmHF2-Y are individually converted into Escherichia coli
BL21 (DE3) competent cell.It is spread evenly across on the LB flat boards containing kanamycins, 37 DEG C are cultivated 16 hours.Single bacterium colony
Shaken cultivation is stayed overnight, and is extracted plasmid and is sequenced, sequencing result is shown into the recombination bacillus coli containing pET30a-rmHF1-Y
BL21 (DE3)/pET30a-rmHF1-Y is named as, sequencing result is shown into the recombination bacillus coli containing pET30a-rmHF1-W
BL21 (DE3)/pET30a-rmHF1-W is named as, sequencing result is shown into the recombination bacillus coli containing pET30a-rmHF2-Y
It is named as BL21 (DE3)/pET30a-rmHF2-Y.
3rd, the analysis and identification of protein expression form
By BL21 (DE3)/pET30a-rmHF1-Y, BL21 (DE3)/pET30a-rmHF1-W, BL21 (DE3)/pET30a-
RmHF2-Y and e. coli bl21 (DE3) (abbreviation recipient bacterium) this four bacterial strains are individually inoculated in containing 50 μ g/ml cards that is mould
(concentration that kanamycins to kanamycins are added in LB fluid nutrient mediums is what 50 μ g/ml were obtained to the LB fluid nutrient mediums of element
Culture medium) in, 37 DEG C, using the complete warm oscillator 200rpm shaken cultivations of Thermo MaxQ6000 types to 0D600Value is (with containing 50 μ
The LB fluid nutrient mediums of g/ml kanamycins are blank control) when reaching 0.6, add isopropylthio-β-D-galactoside
(IPTG) induced expression is carried out.The induced expression of above-mentioned four plants of bacterium is to induce (be somebody's turn to do at 16 DEG C within 16 hours with 0.75mM IPTG
Induced expression condition is by optimizing obtained highly-soluble induced expression to conditions such as temperature, time, IPTG concentration
Condition).
Induced expression zymotic fluid is taken to be used for protein expression form analysis.Concretely comprise the following steps, take 1m L zymotic fluids to be placed in 1.5m
In L centrifuge tubes, 8500rpm/min centrifugation 45min under the conditions of mark, 4 DEG C are carried out, supernatant is discarded, collects bacterial sediment.Add
Precipitation is resuspended in 1m L PBS, and 8000rpm/min centrifugation 5min discard supernatant.200 μ L are added into the bacterial sediment washed
PBS, high pressure crushes thalline, and cracking is no longer sticky to bacterium solution, obtains whole bacterial protein liquid.By whole bacterial protein liquid in 4 DEG C of centrifugations
18000rpm/min centrifuges 45min in machine, and supernatant (being named as containing albumen supernatant) is collected respectively and precipitation (is named as containing egg
White precipitation), 50 μ L PBS are added in containing albumen precipitation washing precipitation is resuspended.To whole bacterial protein liquid, containing albumen supernatant and
Containing 10 μ L 5 × SDS-PAGE loading Buffer are added in albumen precipitation, after fully mixing, put in boiling water bath and boil
5min, after sample cooling after, with hand held centrifuge wink from.6 μ L are taken to be used for SDS-PAGE electrophoretic analysis, and associated proteins gray scale
Analysis software initial analysis protein content.Gel after electrophoresis is needed on NC films, using the sheep anti-mouse antibody of anti-His labels as knot
Antibody DAB colour developings are closed, Western-blot identifications are carried out.Filtered by above-mentioned whole bacterial protein liquid and containing albumen supernatant with 0.22 μm
Loading is to solution 1, (solute and its concentration are as follows in advance after membrane filtration:20mM Tris, 150mM NaCl, solvent are water,
PH8.0 solution) the nickel post that has balanced.Nickel post is accessed on AKTA machines, respectively with the solution 1 of 10 column volumes and 10 posts
(solute and its concentration are as follows for the solution 2 of volume:20mM Tris, 150mM NaCl, 50mM imidazoles, solvent are water, and pH8.0's is molten
Liquid) impurity protein in cleaning nickel post, and monitor protein peak on AKTA machines.With solution 3, (solute and its concentration are as follows:
20mM Tris, 150mM NaCl, 300mM imidazoles, solvent are water, pH8.0 solution) destination protein hung on nickel post is rinsed,
And the elution samples for destination protein peak occur are collected using AKTA, the sample is referred to as ni-sepharose purification destination protein sample.
The Superdex200 gel columns that the destination protein sample of ni-sepharose purification is produced with GE companies enter one by molecular sieve
Step purifying.Mobile phase uses solution 1.By that can remove a large amount of imidazoles contained in sample after molecular sieve purification, collection is washed
De- peak, obtains the destination protein sample of molecular sieve purification, right using NanoDrop2000 ultramicrospectrophotometers (ND2000)
The content of albumen (i.e. soluble destination protein) carries out quantitative analysis in the destination protein sample of obtained molecular sieve purification.It is used in combination
NanoDrop2000 ultramicrospectrophotometers (ND2000) determine the protein content in whole bacterial protein liquid, obtain thalline total
Protein content.It will contain after albumen precipitation urea dissolves, be determined with NanoDrop2000 ultramicrospectrophotometers (ND2000)
Content containing the protein in albumen precipitation.
As a result BL21 (DE3)/pET30a-rmHF1-Y of induced expression whole bacterial protein liquid is shown, containing albumen supernatant
With containing in albumen precipitation containing size be 131kDa destination protein rmHF1;BL21 (DE3)/pET30a- of induced expression
Destination protein rmHF1 in rmHF1-Y whole bacterial protein liquid accounts for the 45% of bacterial protein (full bacterium total protein), induced expression
BL21 (DE3)/pET30a-rmHF1-Y the BL21 that induced expression is accounted for containing the destination protein rmHF1 in albumen supernatant
(DE3) the 63% of destination protein HF-his in/pET30a-rmHF1-Y whole bacterial protein liquid, 63% destination protein
RmHF1 is soluble protein;BL21 (DE3)/pET30a-rmHF1-Y of induced expression containing the destination protein in albumen precipitation
RmHF1 accounts for 37% of destination protein rmHF1 in BL21 (DE3)/pET30a-rmHF1-Y of induced expression whole bacterial protein liquid,
The 37% destination protein rmHF1 is insoluble inclusion body albumen;Illustrate BL21 (DE3)/pET30a-rmHF1- of induced expression
Y destination protein rmHF1 accounts for the destination protein of 45%, BL21 (DE3)/pET30a-rmHF1-Y expression of bacterial protein
63% is soluble protein in rmHF1, and 37% is insoluble inclusion body albumen.BL21 (DE3)/pET30a- of induced expression
The destination protein rmHF1 that size is 131kD is not contained in rmHF1-W whole bacterial protein liquid, illustrates BL21 (DE3)/pET30a-
RmHF1-W does not have express express target protein rmHF1.BL21 (DE3)/pET30a-rmHF2-Y of induced expression whole bacterial protein liquid
In do not contain size be 131kD destination protein rmHF2, illustrate that BL21 (DE3)/pET30a-rmHF2-Y does not express purpose egg
White rmHF2.The whole bacterial protein liquid of the e. coli bl21 (DE3) of induced expression does not contain the destination protein that size is 131kD
The rmHF1 and destination protein rmHF2 that size is 131kD;Illustrate that e. coli bl21 (DE3) does not have express express target protein rmHF1
With rmHF2 (Fig. 1).It can be seen that, although use same expression vector pET30a (+) and same Host Strains e. coli bl21
(DE3), the expression of different external source target gene is differed greatly, and rmHF1-Y genes are imported into large intestine by pET30a (+)
Bacillus BL21 (DE3) can obtain the solution expression with high efficiency of rmHF1-Y genes, and rmHF1-W genes are imported by pET30a (+)
In e. coli bl21 (DE3), rmHF1-W genes are not expressed but, and rmHF2-Y genes are imported into large intestine by pET30a (+)
In bacillus BL21 (DE3), rmHF2-Y genes are not also expressed.
In addition, according to the method described above, the sequence between pET28a (+) restriction enzyme NheI and NotI sites is replaced
The rmHF1-Y genes shown in SEQ ID No.1 4-3558 are changed to, keeps pET28a (+) other sequences constant, is contained
There is the recombinant expression carrier of rmHF1-Y genes, the recombinant expression carrier is named as pET28a-rmHF1-Y.By pET28a-
RmHF1-Y is transferred to e. coli bl21 (DE3) competent cell, by obtained recombination bacillus coli be named as BL21 (DE3)/
pET28a-rmHF1-Y.BL21 (DE3)/pET28a-rmHF1-Y is inoculated in the LB Liquid Cultures containing 50 μ g/ml kanamycins
In base (concentration that kanamycins to kanamycins are added in LB fluid nutrient mediums is the culture medium that 50 μ g/ml are obtained), 37 DEG C,
Using the complete warm oscillator 200rpm shaken cultivations of Thermo MaxQ6000 types to 0D600Value is (with the LB containing 50 μ g/ml kanamycins
Fluid nutrient medium is blank control) when reaching 0.6, add isopropylthio-β-D-galactoside (IPTG) and carry out induced expression.
The induced expression is induced 16 hours at 16 DEG C with 0.75mM IPTG.The zymotic fluid of induced expression is taken to carry out according to the method described above
Protein expression form analysis.As a result show do not have in BL21 (DE3)/pET28a-rmHF1-Y whole bacterial protein liquid of induced expression
The expression of purposeful albumen.It can be seen that, although it is big using same external source target gene (rmHF1-Y genes) and same Host Strains
Enterobacteria BL21 (DE3), in different BL21 (DE3) expression vector-pET28a (+) and pET30a (+), external source target gene
Expression differ greatly, by rmHF1-Y genes by pET30a (+) import e. coli bl21 (DE3) can obtain
The solution expression with high efficiency of rmHF1-Y genes, e. coli bl21 (DE3) is imported by rmHF1-Y genes by pET28a (+)
In, rmHF1-Y genes are not expressed but.
4th, rmHF1 solubility expression and purifying
BL21 (DE3)/pET30a-rmHF1-Y is inoculated in the LB fluid nutrient mediums containing 50 μ g/ml kanamycins (in LB
The concentration that kanamycins to kanamycins are added in fluid nutrient medium is the culture medium that 50 μ g/ml are obtained) in, 37 DEG C, use
The complete warm oscillator 200rpm shaken cultivations of Thermo MaxQ6000 types are to 0D600Value is (with the LB liquid containing 50 μ g/ml kanamycins
Culture medium is blank control) when reaching 0.6, add IPTG and carry out induced expression.The induced expression is with 0.75mM IPTG 16
DEG C induction 16h.The zymotic fluid after IPTG induced expressions 16h is taken to collect bacterial sediment.Add PBS and precipitation, 8000rpm/min is resuspended
5min is centrifuged, supernatant is discarded.PBS is added into the bacterial sediment washed, high pressure crushes thalline, and cracking to bacterium solution is no longer glued
It is thick, 16000 in 4 DEG C of centrifuges
Rpm/min centrifuges 30min, collects supernatant (being named as containing albumen supernatant), abandons precipitation.Albumen supernatant will be contained
With loading after 0.22 μm of membrane filtration to advance solution 1, (solute and its concentration are as follows:20mM Tris, 150mM NaCl, it is molten
Agent is water, pH8.0 solution) the nickel post that has balanced.Nickel post is accessed on AKTA machines, respectively with the solution 1 of 10 column volumes
(solute and its concentration are as follows with the solution 2 of 10 column volumes:20mM Tris, 150mM NaCl, 50mM imidazoles, solvent is water,
PH8.0 solution) impurity protein in cleaning nickel post, and monitor protein peak on AKTA machines.With (the solute and its dense of solution 3
Degree is as follows:20mM Tris, 150mM NaCl, 300mM imidazoles, solvent are water, pH8.0 solution) rinse nickel post hang on nickel post
Destination protein, and collected using AKTA and the elution samples at destination protein peak occur, the sample is referred to as to the rmHF1 of ni-sepharose purification
Albumen (ni-sepharose purification destination protein sample).
The Superdex200 gel columns that the rmHF1 albumen of ni-sepharose purification is produced with GE companies are further by molecular sieve
Purifying, obtains the rmHF1 albumen of molecular sieve purification.Mobile phase in the molecular sieve purification uses above-mentioned solution 1.Pass through molecular sieve
The a large amount of imidazoles contained in sample can be removed after purification, and the structure of rmHF1 albumen is monomer structure (Fig. 2).Collect monomer
The eluting peak of structure, obtains the rmHF1 albumen (molecular sieve purification destination protein sample) of molecular sieve purification, uses
The purity of albumen of the NanoDrop2000 ultramicrospectrophotometers (ND2000) to obtaining carries out quantitative analysis.
By its amino acid sequence of the rmHF1 albumen progress mass spectral analysis of molecular sieve purification, as a result show rmHF1 amino acid
Sequence is as shown in SEQ ID No.2.
Embodiment 2, the detection PPR virus antibody by envelope antigen indirect ELISA method of rmHF1 albumen
Related solution in the present embodiment is as follows:
Concentration is 0.01M, and pH value is the preparation of 7.4 PBS:8.5g NaCl、0.2g KCl、2.9g
Na2HPO4·12H2O、0.59g NaH2PO4·2H2O, 1L deionized water.
It is coated with buffer solution:0.05mol/L sodium carbonate-bicarbonate buffer solution (pH9.6), solvent is water, solute and its
Concentration is as follows:Na2CO31.59g/L and NaHCO3 2.93g/L。
Cleaning solution:0.5% tween cleaning solution.0.5% tween cleaning solution is prepared as follows:It is 0.01M in concentration,
The content of addition polysorbas20 and sodium azide to sodium azide is the content of 5g/L, polysorbas20 in the PBS that pH value is 7.4
For 5mL/L, 0.5% tween cleaning solution is obtained.
Confining liquid:1%BSA confining liquids.1%BSA confining liquids are prepared as follows:It is 0.01M in concentration, pH value is
The BSA solution for Jia 10% in 7.4 PBS, is diluted to 1% (volumn concentration), obtains 1%BSA confining liquids.
Secondary antibody dilution:Addition BSA to BSA content is in concentration is 0.01M, the PBS that pH value is 7.4
1% (volumn concentration), obtains secondary antibody dilution.
1st, the foundation and optimization of indirect ELISA reaction condition
RmHF1 albumen (the hereinafter referred to as rmHF1 eggs of molecular sieve purification in embodiment 1 are determined using chessboard square formation titration
Most suitable coating concentration and optimal serum dilution in vain), optimal confining liquid concentration is determined with the BSA closing ELISA Plates of various concentrations,
Serum and ELIAS secondary antibody best effort time are determined, optimizes ELIAS secondary antibody working concentration, criterion is:Positive serum and feminine gender
The OD of serum450Reaction condition corresponding to the maximum hole of ratio (P/N) is the optimum reaction condition of ELISA method.
By coated elisa plate after the rmHF1 protein ladders dilution of molecular sieve purification in embodiment 1, titrated using chessboard square formation
Method determines the package amount and serum dilution of rmHF1 albumen;And it is excellent on the basis of antigen is most preferably coated with concentration and serum dilution
Change ELISA testing conditions.As a result as shown in table 1, the mass concentration of rmHF1 albumen is 1.0 μ g/mL, test serum extension rate
For 1:When 20, P/N values are maximum, it is thus determined that the optimal coating concentration of antigen (rmHF1 albumen) is 1.0 μ g/mL, test serum is dilute
It is 1 to release multiple:20.This experiment simultaneously determines that the rabbit-anti goat IgG of HRP marks presses 1:20000 dilution proportion, 37 DEG C of effects
0.5h, adds OD values after TMB nitrite ions 10min optimal.
The reaction condition optimization result of table 1, ELISA detection method
Coated antigen | Sealing condition | Test serum | Secondary antibody IgG-HRP | |
Optimize dilution factor | 1.0μg/mL | 1%BSA | 1:20 | 1:20000 |
Reaction condition | 4℃/16h | 37℃/2h | 37℃/1h | 37℃/0.5h |
What the step was determined detects PPR virus antibody by envelope antigen indirect ELISA method of rmHF1 albumen
Optimal Experimental method (hereinafter referred to as recombinate rmHF1 optimization indirect ELISA method) it is as follows:
1.1 coating:With rmHF1 albumen (the hereinafter referred to as rmHF1 of molecular sieve purification in coating buffer solution dilution embodiment 1
Albumen) to rmHF1 albumen concentration be 1.0 μ g/ml, obtain be coated with original solution, with the coating original solution be coated with experimental port, 100 μ
L/ holes add to ELISA Plate, 4 DEG C of incubation 16h.
1.2 washing:Incline hole endoperidium original solution, is washed 5 times with 0.5% tween cleaning solution, each 3min;Pat dry.
1.3 closing:Add 1%BSA confining liquids, 100ul/ holes, 37 DEG C of incubation 2h.
1.4 sample-adding:
1.4.1 sample well
PPRV positive serums are diluted 20 times with coating buffer solution, test serum is obtained.100 μ L test serums are added to enzyme
On target, 37 DEG C are reacted liquid in 1h, hole of inclining, and are then washed with cleaning solution 5 times, each 3min.Wherein, positive serum is method
State ID-VET PPR antibody assay kit (IDPPR Competition PPRs antibody is examined
Test agent box, production code member is PPRC-4P) it is detected as the sheep blood serum of PPRV antibody positives.
1.4.2 blank control wells
Test serum is replaced with to isometric high purity water with differing only in for 1.4.1, other steps are constant.
1.5 add ELIAS secondary antibody:Addition carries out 1 with secondary antibody dilution:The HRP mark rabbit-anti goat IgGs of 20000 dilutions,
100ul/ holes, 37 DEG C of 30min.
1.6 colour developing:TMB is added, 10min is reacted in 100ul/ holes.
1.7 terminate:Add 0.2mol/L H2SO4Solution terminating reaction, 100ul/ holes.
1.8 determine:Each hole OD is read with ELIASA450nmNumerical value.
2nd, the determination of ELISA yin and yang attributes critical value
1000 parts that China Animal Disease Control And Prevention Center plant-eating animal and infecting both domestic animals and human ward are preserved are through French ID-
VET PPR antibody assay kit (IDPPR Competition PPR antibody test reagents
Box, production code member is PPRC-4P) it is detected as restructuring rmHF1 optimization indirect ELISA of the negative sheep blood serums of PPRV using step 1
Method (the PPRV positive serums in 1.4.1 being replaced with into 1000 parts of PPRV negative serums respectively, other operations are identical) is carried out
Indirect ELISA is detected, calculates the average value (X) and standard deviation (SD) of 1000 parts of PPRV negative serums.
It is judged to the positive;It is judged to feminine gender.
As a result show, 1000 parts of PPRV negative serum average valuesIt is 0.053 for 0.162, SD, therefore yin and yang attribute is critical
ValueFor 0.321.
3rd, specific test
Using step 1 restructuring rmHF1 optimization indirect ELISA method to 10 parts of sheep johne's diseases, sheep cloth disease, tuberculosis,
C.perfringens disease and aftosa positive serum are detected that observation has no cross reaction with Other diseases.Wherein, will
1.4.1 the PPRV positive serums in replace with above-mentioned serum respectively, and other operations are identical.As a result show that restructuring rmHF1 optimizes
Cause of disease (sheep johne's disease, sheep cloth disease, tuberculosis, C.perfringens disease and aftosa of the indirect ELISA method to several Yang Yuan
Positive serum) positive serum detected, its OD450Value is respectively:0.169th, 0.171,0.122,0.194,0.226, it is respectively less than
Critical value 0.321, shows rmHF1 albumen and sheep johne's disease, sheep cloth disease, tuberculosis, C.perfringens disease and aftosa sun
Property the equal no cross reaction of serum, step 1 restructuring rmHF1 optimization indirect ELISA method there is good specificity.
4th, sensitivity tests
RmHF1 in the optimization indirect ELISA method of step 1 is replaced with into restructuring H protein rmH, other operations are constant, build
Vertical restructuring H protein rmH optimization indirect ELISA methods.
RmHF1 in the optimization indirect ELISA method of step 1 is replaced with into restructuring F protein rmF, other operations are constant, build
Vertical restructuring F protein rmF optimization indirect ELISA methods.
Wherein, above-mentioned restructuring H protein rmH according to rmHF1 in the step 4 of embodiment 1 solubility expression and purification process system
It is standby, differ only in will carry out the recombination bacillus coli of soluble induced expression by recombination bacillus coli BL21 (DE3)/
PET30a-rmHF1-Y replaces with recombination bacillus coli BL21 (DE3)/pET30a-rmH-Y.Recombination bacillus coli BL21 (DE3)/
PET30a-rmH-Y is that pET30a-rmH-Y is imported into the recombinant bacterium that e. coli bl21 (DE3) is obtained.PET30a-rmH-Y is
With rmH-Y replace pET30a (+) Nde I and XhoI recognition sites between fragment (including Nde I recognition sites and XhoI identification
Small fragment including site), keep pET30a (+) other sequences constant, obtained rmH-Y gene recombinant vectors, life
Entitled pET30a-rmH-Y.
PET30a-rmH-Y contains rmH-Y genes, and rmH-Y genes are by SEQ ID 1852-3555 nucleotides of No.1
Missing, the DNA that other invariant nucleotides are obtained, rmH-Y DNA encoding the protein rmH.RmH is by the of SEQ ID No.3
The constant obtained protein of the other amino acid residues of 617-1184 amino acids residue deletions.
Above-mentioned restructuring F protein rmF is prepared according to the solubility expression and purification process of rmHF1 in the step 4 of embodiment 1, area
It is not only that the recombination bacillus coli by soluble induced expression is carried out by recombination bacillus coli BL21 (DE3)/pET30a-
RmHF1-Y replaces with recombination bacillus coli BL21 (DE3)/pET30a-rmF-Y.Recombination bacillus coli BL21 (DE3)/pET30a-
RmF-Y is that pET30a-rmF-Y is imported into the recombinant bacterium that e. coli bl21 (DE3) is obtained.PET30a-rmF-Y is to use rmF-Y
(including Nde I recognition sites and XhoI recognition sites exist the fragment for replacing between pET30a (+) Nde I and XhoI recognition sites
Interior small fragment), holding pET30a (+) other sequences are constant, and obtained rmF-Y gene recombinant vectors are named as
pET30a-rmF-Y.PET30a-rmF-Y contains rmF-Y genes, and rmF-Y genes are by SEQ ID 25-1896 cores of No.1
Thuja acid is lacked, the DNA that other invariant nucleotides are obtained, rmF-Y DNA encoding the protein rmF.RmF is by SEQ ID No.3
The constant obtained protein of the other amino acid residues of 8-631 amino acids residue deletions.
By the PPR antibody assay kit (ID through French ID-VETPPR Competition are small
Epizootic disease antibody assay kit is ruminated, production code member is PPRC-4P) the positive sheep blood serum progress doubling dilutions of PPRV are detected as,
Detected using the restructuring rmHF1 optimization indirect ELISA methods of step 1, while indirect using the rmH optimizations of restructuring H protein
The PPR antibody test reagent of ELISA method, restructuring F protein rmF optimization indirect ELISA methods and the French ID-VET
Box is compared, and draws greatest dilution during positive critical value.
As a result show sheep blood serum positive PPRV from 1:25 proceed by doubling dilution, using the restructuring rmHF1 of step 1
Optimization indirect ELISA method is detected that the dilution factor of sheep blood serum positive PPRV is still positive when being 1: 1600;Using restructuring
H protein rmH optimization indirect ELISA methods are detected that the greatest dilution of sheep blood serum positive PPRV is 1:800;Using weight
Group F protein rmF optimization indirect ELISA methods are detected that the greatest dilution of sheep blood serum positive PPRV is 1:800;Show
The restructuring rmHF1 optimizations indirect ELISA method sensitiveness of step 1 is preferable, hence it is evident that higher than restructuring H protein rmH and restructuring F protein
RmF as detection method during envelope antigen sensitiveness.
5th, replica test
Indirect ELISA method is optimized to 6 parts of sheep blood serums in same batch plate and different batches using the restructuring rmHF1 of step 1
Detected respectively on plate, parallel determination 5 times, calculate batch interior, interassay coefficient of variation (CV).As a result show, repeat to make a variation in batch
Coefficient repeats the coefficient of variation between 2%~8%, between batch and is less than 9% (table 2).As a result show, the restructuring rmHF1 optimizations of step 1
Indirect ELISA method has good repeatability.
The restructuring rmHF1 optimization indirect ELISA method replica tests of the step 1 of table 2
6th, accordance is tested
Detected using the restructuring rmHF1 optimization indirect ELISA methods of step 1, while using the restructuring H eggs of step 4
White rmH optimizes the small anti-of indirect ELISA method, the restructuring F protein rmF optimization indirect ELISA methods of step 4 and France ID-VET
Hay epizootic disease antibody assay kit (IDPPR Competition PPR antibody assay kits, product is compiled
Number it is PPRC-4P) 500 parts of sheep blood serums being preserved to China Animal Disease Control And Prevention Center plant-eating animal and infecting both domestic animals and human ward
Detected, calculate former three and the coincidence rate of France ID-VET PPR antibody assay kit.
As a result show to optimize the positive that indirect ELISA method detects 500 parts of sheep blood serums using the restructuring rmHF1 of step 1
Rate is 44.19%;French ID-VET PPR antibody assay kit Positive rate is 42.25%, the weight of step 1
Group rmHF1 optimization indirect ELISA methods and France ID-VET PPR antibody assay kit coincidence rate are 94.5%.
And the PPR antibody assay kit of the restructuring H protein rmH optimization indirect ELISA methods and France ID-VET of step 4
Coincidence rate there was only 83.3%, the restructuring F protein rmF optimization indirect ELISA methods of step 4 ruminate beast with France's the small of ID-VET
The coincidence rate of epidemic disease antibody assay kit only has 72.6%.Illustrate the indirect ELISA inspection for setting up rmHF1-Y as envelope antigen
The method and the coincidence rate of France ID-VET PPR antibody assay kit for surveying PPR virus antibody are notable
Higher than the side that the indirect ELISA set up using recombinating PPRV H proteins rmH as envelope antigen detects PPR virus antibody
Method and the indirect ELISA set up using recombinating PPRV F proteins rmF as envelope antigen detect the side of PPR virus antibody
Method.
<110>China Animal Disease Control And Prevention Center
<160> 5
<170> PatentIn version 3.5
<210> 1
<211> 3564
<212> DNA
<213>Artificial sequence
<220>
<221> CDS
<222> (4)..(3558)
<223>
<400> 1
catatgcacc atcaccacca tcacatgagc gcacaacgcg aacgcatcaa cgcgttctac 60
aaagacaacc tgcacaacaa aacccatcgc gttatcctgg atcgcgaacg tctgaccatt 120
gaacgtccgt atatcctgct gggggttctg ctggttatgt ttctgagcct gatcggtctg 180
ctggcaattg ctggtattcg tctgcatcgc gcaaccgttg gtaccgcaga aattcaaagc 240
cgtctgaaca ccaacatcga actgaccgaa agcatcgacc atcagaccaa agacgttctg 300
accccgctgt ttaaaatcat cggcgacgaa gtcggcattc gtattccgca gaaattcagc 360
gacctggtca aattcatcag cgacaaaatc aaattcctga acccggaccg cgaatacgat 420
tttcgcgatc tgcgttggtg tatgaatccg ccggaacgcg tcaaaatcaa cttcgaccag 480
ttctgcgaat acaaagcggc ggttaaaagc gtcgaacaca tcttcgaaag cagcctgaac 540
cgttctgaac gtctgcgtct gctgaccctg ggtccgggta ccggttgtct gggtcgtacc 600
gttacccgtg cacaatttag cgaactgacc ctgaccctga tggatctgga cctggagatg 660
aaacataacg tcagcagcgt ctttaccgtt gtggaagaag gtctgtttgg ccgtacctat 720
accgtttggc gttctgatac cggtaaaccg agtaccagtc cgggtattgg tcattttctg 780
cgcgtcttcg aaattggtct ggttcgcgat ctggaactgg gcgctccgat ttttcacatg 840
accaactacc tgaccgtcaa catgagcgac gattaccgtt cttgtctgct ggcagttggc 900
gaactgaaac tgaccgcact gtgtaccccg tctgaaaccg ttaccctgtc tgaatctggc 960
gttccgaaac gcgaaccgct ggttgtcgtt attctgaatc tggcaggtcc gaccctgggc 1020
ggcgaactgt attctgttct gccgaccacc gatccgaccg ttgaaaaact gtatctgagc 1080
agccatcgcg gtatcatcaa agacaacgaa gcgaattggg ttgttccgtc taccgacgtt 1140
cgtgatctgc aaaacaaagg cgagtgcctg gttgaagctt gtaaaacccg tccgccgagc 1200
ttttgtaacg gtaccggtat tggtccgtgg tctgaaggtc gtattccggc ttacggcgtt 1260
attcgcgttt ctctggatct ggcatctgat ccgggcgtag ttattaccag cgtttttggt 1320
ccgctgattc cgcatctgtc tggcatggac ctgtataata atccgtttag ccgcgcggct 1380
tggctggcag ttccgccgta cgaacagtct tttctgggca tgatcaacac cattggtttt 1440
ccggatcgcg ctgaagttat gccgcatatt ctgaccaccg aaattcgcgg tccgcgcggt 1500
cgttgtcatg ttccgattga actgagcagc cgtatcgacg acgacatcaa aatcggcagc 1560
aacatggttg tcctgccgac caaagatctg cgttatatca ccgcgaccta cgacgtttct 1620
cgtagcgaac acgcgatcgt ctactacatc tacgataccg gccgtagcag cagctatttt 1680
tacccggttc gtctgaactt tcgcggtaat ccgctgtctc tgcgtattga gtgctttccg 1740
tggtaccaca aagtctggtg ttaccacgac tgcctgatct acaacaccat caccaacgaa 1800
gaagtgcata cccgcggtct gaccggtatt gaagtcacct gcaacccggt tggtggcggt 1860
ggaatcggag gtggtggaag cggaggaggt ggaagcatgc accatcacca ccatcacatg 1920
acccgcgttg cgaccctggt ctttctgttt ctgttcccga acaccgttac ctgtcagatt 1980
cattggggca acctgagcaa aatcggcatt gttggtaccg gtagcgcttc ctacaaagtt 2040
atgacccgtc cgagtcatca gaccctggtt atcaaactga tgccgaacat caccgcgatt 2100
gataactgca ccaaaagcga gatcagcgag tacaaacgtc tgctgatcac cgttctgaaa 2160
ccggttgaag acgcactgag cgtcatcacc aaaaacgtcc gtccgattca aaccctgacc 2220
ccgggtcgtc gtacccgtcg ttttgtaggc gctgttctgg caggtgttgc actgggcgtt 2280
gcaaccgcag cacaaattac cgcaggcgtt gcactgcatc agtctctgat gaacagccag 2340
gcgattgaaa gcctgaaaac cagcctggag aaaagcaacc aggcaattga agaaatccgt 2400
ctggcgaaca aagaaaccat cctggcagtt cagggcgtcc aggattacat caacaacgag 2460
ctggtcccgt ctgttcatcg tatgagctgc gaactggtcg gtcataaact gagcctgaaa 2520
ctgctgcgct actacaccga gatcctgagc atctttggtc cgagtctgcg cgatccgatt 2580
gcagcggaaa ttagcattca ggcgctgagt tacgcgctgg gcggcgatat taacaaaatc 2640
ctggacaaac tgggctattc tggcggcgat tttctggcga ttctggaatc caaaggcatc 2700
aaagcgcgcg ttacctacgt tgatacccgc gattatttca tcatcctgag cattgcgtat 2760
ccgaccctga gcgaaatcaa aggcgttatc gtccacaaaa tcgaggcgat cagctacaac 2820
atcggcgctc aagagtggta taccacgatt ccgcgttacg ttgcgaccca gggttatctg 2880
attagcaact tcgacgagac cagctgcgtt tttaccccgg aaggtaccgt ttgtagccaa 2940
aacgctctgt atccgatgag tccgctgctg caagagtgtt ttcgcggtag caccaaaagt 3000
tgcgcacgta ccctggtttc tggtaccacc agtaatcgct tcatcctgag caaaggcaac 3060
ctgatcgcaa attgcgcctc tgttctgtgc aaatgctaca ccaccgaaac cgtcatcaac 3120
caggatccgg ataaactgct gaccgttatt gcctctgata aatgcccggt agttgaagtt 3180
gacggcgtca ccattcaggt tggtagccgc gaatatccgg atagcgtcta tctgcacgag 3240
attgatctgg gtccggcaat ttctctggag aaactggatg tcggtaccaa tctgggtaac 3300
gcagttacgc gtctggaaaa cgcaaaagaa ctgctggacg cgagcgatca gattctgaaa 3360
accgttaaag gcgttccgtt cagcggtaac atctacattg ccctggcagc ttgtattggc 3420
gttagtctgg gtctggttac cctgatttgc tgttgcaaag gccgttgtcg caacaaagag 3480
attccggcaa gcaaaattaa tccgggcctg aaaccggatc tgaccggtac cagcaaaagc 3540
tacgttcgta gcctgtaact cgag 3564
<210> 2
<211> 1184
<212> PRT
<213>Artificial sequence
<220>
<223>
<400> 2
Met His His His His His His Met Ser Ala Gln Arg Glu Arg Ile Asn
1 5 10 15
Ala Phe Tyr Lys Asp Asn Leu His Asn Lys Thr His Arg Val Ile Leu
20 25 30
Asp Arg Glu Arg Leu Thr Ile Glu Arg Pro Tyr Ile Leu Leu Gly Val
35 40 45
Leu Leu Val Met Phe Leu Ser Leu Ile Gly Leu Leu Ala Ile Ala Gly
50 55 60
Ile Arg Leu His Arg Ala Thr Val Gly Thr Ala Glu Ile Gln Ser Arg
65 70 75 80
Leu Asn Thr Asn Ile Glu Leu Thr Glu Ser Ile Asp His Gln Thr Lys
85 90 95
Asp Val Leu Thr Pro Leu Phe Lys Ile Ile Gly Asp Glu Val Gly Ile
100 105 110
Arg Ile Pro Gln Lys Phe Ser Asp Leu Val Lys Phe Ile Ser Asp Lys
115 120 125
Ile Lys Phe Leu Asn Pro Asp Arg Glu Tyr Asp Phe Arg Asp Leu Arg
130 135 140
Trp Cys Met Asn Pro Pro Glu Arg Val Lys Ile Asn Phe Asp Gln Phe
145 150 155 160
Cys Glu Tyr Lys Ala Ala Val Lys Ser Val Glu His Ile Phe Glu Ser
165 170 175
Ser Leu Asn Arg Ser Glu Arg Leu Arg Leu Leu Thr Leu Gly Pro Gly
180 185 190
Thr Gly Cys Leu Gly Arg Thr Val Thr Arg Ala Gln Phe Ser Glu Leu
195 200 205
Thr Leu Thr Leu Met Asp Leu Asp Leu Glu Met Lys His Asn Val Ser
210 215 220
Ser Val Phe Thr Val Val Glu Glu Gly Leu Phe Gly Arg Thr Tyr Thr
225 230 235 240
Val Trp Arg Ser Asp Thr Gly Lys Pro Ser Thr Ser Pro Gly Ile Gly
245 250 255
His Phe Leu Arg Val Phe Glu Ile Gly Leu Val Arg Asp Leu Glu Leu
260 265 270
Gly Ala Pro Ile Phe His Met Thr Asn Tyr Leu Thr Val Asn Met Ser
275 280 285
Asp Asp Tyr Arg Ser Cys Leu Leu Ala Val Gly Glu Leu Lys Leu Thr
290 295 300
Ala Leu Cys Thr Pro Ser Glu Thr Val Thr Leu Ser Glu Ser Gly Val
305 310 315 320
Pro Lys Arg Glu Pro Leu Val Val Val Ile Leu Asn Leu Ala Gly Pro
325 330 335
Thr Leu Gly Gly Glu Leu Tyr Ser Val Leu Pro Thr Thr Asp Pro Thr
340 345 350
Val Glu Lys Leu Tyr Leu Ser Ser His Arg Gly Ile Ile Lys Asp Asn
355 360 365
Glu Ala Asn Trp Val Val Pro Ser Thr Asp Val Arg Asp Leu Gln Asn
370 375 380
Lys Gly Glu Cys Leu Val Glu Ala Cys Lys Thr Arg Pro Pro Ser Phe
385 390 395 400
Cys Asn Gly Thr Gly Ile Gly Pro Trp Ser Glu Gly Arg Ile Pro Ala
405 410 415
Tyr Gly Val Ile Arg Val Ser Leu Asp Leu Ala Ser Asp Pro Gly Val
420 425 430
Val Ile Thr Ser Val Phe Gly Pro Leu Ile Pro His Leu Ser Gly Met
435 440 445
Asp Leu Tyr Asn Asn Pro Phe Ser Arg Ala Ala Trp Leu Ala Val Pro
450 455 460
Pro Tyr Glu Gln Ser Phe Leu Gly Met Ile Asn Thr Ile Gly Phe Pro
465 470 475 480
Asp Arg Ala Glu Val Met Pro His Ile Leu Thr Thr Glu Ile Arg Gly
485 490 495
Pro Arg Gly Arg Cys His Val Pro Ile Glu Leu Ser Ser Arg Ile Asp
500 505 510
Asp Asp Ile Lys Ile Gly Ser Asn Met Val Val Leu Pro Thr Lys Asp
515 520 525
Leu Arg Tyr Ile Thr Ala Thr Tyr Asp Val Ser Arg Ser Glu His Ala
530 535 540
Ile Val Tyr Tyr Ile Tyr Asp Thr Gly Arg Ser Ser Ser Tyr Phe Tyr
545 550 555 560
Pro Val Arg Leu Asn Phe Arg Gly Asn Pro Leu Ser Leu Arg Ile Glu
565 570 575
Cys Phe Pro Trp Tyr His Lys Val Trp Cys Tyr His Asp Cys Leu Ile
580 585 590
Tyr Asn Thr Ile Thr Asn Glu Glu Val His Thr Arg Gly Leu Thr Gly
595 600 605
Ile Glu Val Thr Cys Asn Pro Val Gly Gly Gly Gly Ile Gly Gly Gly
610 615 620
Gly Ser Gly Gly Gly Gly Ser Met His His His His His His Met Thr
625 630 635 640
Arg Val Ala Thr Leu Val Phe Leu Phe Leu Phe Pro Asn Thr Val Thr
645 650 655
Cys Gln Ile His Trp Gly Asn Leu Ser Lys Ile Gly Ile Val Gly Thr
660 665 670
Gly Ser Ala Ser Tyr Lys Val Met Thr Arg Pro Ser His Gln Thr Leu
675 680 685
Val Ile Lys Leu Met Pro Asn Ile Thr Ala Ile Asp Asn Cys Thr Lys
690 695 700
Ser Glu Ile Ser Glu Tyr Lys Arg Leu Leu Ile Thr Val Leu Lys Pro
705 710 715 720
Val Glu Asp Ala Leu Ser Val Ile Thr Lys Asn Val Arg Pro Ile Gln
725 730 735
Thr Leu Thr Pro Gly Arg Arg Thr Arg Arg Phe Val Gly Ala Val Leu
740 745 750
Ala Gly Val Ala Leu Gly Val Ala Thr Ala Ala Gln Ile Thr Ala Gly
755 760 765
Val Ala Leu His Gln Ser Leu Met Asn Ser Gln Ala Ile Glu Ser Leu
770 775 780
Lys Thr Ser Leu Glu Lys Ser Asn Gln Ala Ile Glu Glu Ile Arg Leu
785 790 795 800
Ala Asn Lys Glu Thr Ile Leu Ala Val Gln Gly Val Gln Asp Tyr Ile
805 810 815
Asn Asn Glu Leu Val Pro Ser Val His Arg Met Ser Cys Glu Leu Val
820 825 830
Gly His Lys Leu Ser Leu Lys Leu Leu Arg Tyr Tyr Thr Glu Ile Leu
835 840 845
Ser Ile Phe Gly Pro Ser Leu Arg Asp Pro Ile Ala Ala Glu Ile Ser
850 855 860
Ile Gln Ala Leu Ser Tyr Ala Leu Gly Gly Asp Ile Asn Lys Ile Leu
865 870 875 880
Asp Lys Leu Gly Tyr Ser Gly Gly Asp Phe Leu Ala Ile Leu Glu Ser
885 890 895
Lys Gly Ile Lys Ala Arg Val Thr Tyr Val Asp Thr Arg Asp Tyr Phe
900 905 910
Ile Ile Leu Ser Ile Ala Tyr Pro Thr Leu Ser Glu Ile Lys Gly Val
915 920 925
Ile Val His Lys Ile Glu Ala Ile Ser Tyr Asn Ile Gly Ala Gln Glu
930 935 940
Trp Tyr Thr Thr Ile Pro Arg Tyr Val Ala Thr Gln Gly Tyr Leu Ile
945 950 955 960
Ser Asn Phe Asp Glu Thr Ser Cys Val Phe Thr Pro Glu Gly Thr Val
965 970 975
Cys Ser Gln Asn Ala Leu Tyr Pro Met Ser Pro Leu Leu Gln Glu Cys
980 985 990
Phe Arg Gly Ser Thr Lys Ser Cys Ala Arg Thr Leu Val Ser Gly Thr
995 1000 1005
Thr Ser Asn Arg Phe Ile Leu Ser Lys Gly Asn Leu Ile Ala Asn
1010 1015 1020
Cys Ala Ser Val Leu Cys Lys Cys Tyr Thr Thr Glu Thr Val Ile
1025 1030 1035
Asn Gln Asp Pro Asp Lys Leu Leu Thr Val Ile Ala Ser Asp Lys
1040 1045 1050
Cys Pro Val Val Glu Val Asp Gly Val Thr Ile Gln Val Gly Ser
1055 1060 1065
Arg Glu Tyr Pro Asp Ser Val Tyr Leu His Glu Ile Asp Leu Gly
1070 1075 1080
Pro Ala Ile Ser Leu Glu Lys Leu Asp Val Gly Thr Asn Leu Gly
1085 1090 1095
Asn Ala Val Thr Arg Leu Glu Asn Ala Lys Glu Leu Leu Asp Ala
1100 1105 1110
Ser Asp Gln Ile Leu Lys Thr Val Lys Gly Val Pro Phe Ser Gly
1115 1120 1125
Asn Ile Tyr Ile Ala Leu Ala Ala Cys Ile Gly Val Ser Leu Gly
1130 1135 1140
Leu Val Thr Leu Ile Cys Cys Cys Lys Gly Arg Cys Arg Asn Lys
1145 1150 1155
Glu Ile Pro Ala Ser Lys Ile Asn Pro Gly Leu Lys Pro Asp Leu
1160 1165 1170
Thr Gly Thr Ser Lys Ser Tyr Val Arg Ser Leu
1175 1180
<210> 3
<211> 3690
<212> DNA
<213>Artificial sequence
<220>
<221> CDS
<222> (1)..(3684)
<223>
<400> 3
atgcaccatc atcatcatca ttcttctggt ctggtgccac gcggttctgg tatgaaagaa 60
accgctgctg ctaaattcga acgccagcac atggacagcc cagatctggg taccgacgac 120
gacgacaagg ccatggctga tatcggatcc atgagcgcac aacgcgaacg catcaacgcg 180
ttctacaaag acaacctgca caacaaaacc catcgcgtta tcctggatcg cgaacgtctg 240
accattgaac gtccgtatat cctgctgggg gttctgctgg ttatgtttct gagcctgatc 300
ggtctgctgg caattgctgg tattcgtctg catcgcgcaa ccgttggtac cgcagaaatt 360
caaagccgtc tgaacaccaa catcgaactg accgaaagca tcgaccatca gaccaaagac 420
gttctgaccc cgctgtttaa aatcatcggc gacgaagtcg gcattcgtat tccgcagaaa 480
ttcagcgacc tggtcaaatt catcagcgac aaaatcaaat tcctgaaccc ggaccgcgaa 540
tacgattttc gcgatctgcg ttggtgtatg aatccgccgg aacgcgtcaa aatcaacttc 600
gaccagttct gcgaatacaa agcggcggtt aaaagcgtcg aacacatctt cgaaagcagc 660
ctgaaccgtt ctgaacgtct gcgtctgctg accctgggtc cgggtaccgg ttgtctgggt 720
cgtaccgtta cccgtgcaca atttagcgaa ctgaccctga ccctgatgga tctggacctg 780
gagatgaaac ataacgtcag cagcgtcttt accgttgtgg aagaaggtct gtttggccgt 840
acctataccg tttggcgttc tgataccggt aaaccgagta ccagtccggg tattggtcat 900
tttctgcgcg tcttcgaaat tggtctggtt cgcgatctgg aactgggcgc tccgattttt 960
cacatgacca actacctgac cgtcaacatg agcgacgatt accgttcttg tctgctggca 1020
gttggcgaac tgaaactgac cgcactgtgt accccgtctg aaaccgttac cctgtctgaa 1080
tctggcgttc cgaaacgcga accgctggtt gtcgttattc tgaatctggc aggtccgacc 1140
ctgggcggcg aactgtattc tgttctgccg accaccgatc cgaccgttga aaaactgtat 1200
ctgagcagcc atcgcggtat catcaaagac aacgaagcga attgggttgt tccgtctacc 1260
gacgttcgtg atctgcaaaa caaaggcgag tgcctggttg aagcttgtaa aacccgtccg 1320
ccgagctttt gtaacggtac cggtattggt ccgtggtctg aaggtcgtat tccggcttac 1380
ggcgttattc gcgtttctct ggatctggca tctgatccgg gcgtagttat taccagcgtt 1440
tttggtccgc tgattccgca tctgtctggc atggacctgt ataataatcc gtttagccgc 1500
gcggcttggc tggcagttcc gccgtacgaa cagtcttttc tgggcatgat caacaccatt 1560
ggttttccgg atcgcgctga agttatgccg catattctga ccaccgaaat tcgcggtccg 1620
cgcggtcgtt gtcatgttcc gattgaactg agcagccgta tcgacgacga catcaaaatc 1680
ggcagcaaca tggttgtcct gccgaccaaa gatctgcgtt atatcaccgc gacctacgac 1740
gtttctcgta gcgaacacgc gatcgtctac tacatctacg ataccggccg tagcagcagc 1800
tatttttacc cggttcgtct gaactttcgc ggtaatccgc tgtctctgcg tattgagtgc 1860
tttccgtggt accacaaagt ctggtgttac cacgactgcc tgatctacaa caccatcacc 1920
aacgaagaag tgcatacccg cggtctgacc ggtattgaag tcacctgcaa cccggttggt 1980
ggcggtggaa tcggaggtgg tggaagcgga ggaggtggaa gcatgcacca tcaccaccat 2040
cacatgaccc gcgttgcgac cctggtcttt ctgtttctgt tcccgaacac cgttacctgt 2100
cagattcatt ggggcaacct gagcaaaatc ggcattgttg gtaccggtag cgcttcctac 2160
aaagttatga cccgtccgag tcatcagacc ctggttatca aactgatgcc gaacatcacc 2220
gcgattgata actgcaccaa aagcgagatc agcgagtaca aacgtctgct gatcaccgtt 2280
ctgaaaccgg ttgaagacgc actgagcgtc atcaccaaaa acgtccgtcc gattcaaacc 2340
ctgaccccgg gtcgtcgtac ccgtcgtttt gtaggcgctg ttctggcagg tgttgcactg 2400
ggcgttgcaa ccgcagcaca aattaccgca ggcgttgcac tgcatcagtc tctgatgaac 2460
agccaggcga ttgaaagcct gaaaaccagc ctggagaaaa gcaaccaggc aattgaagaa 2520
atccgtctgg cgaacaaaga aaccatcctg gcagttcagg gcgtccagga ttacatcaac 2580
aacgagctgg tcccgtctgt tcatcgtatg agctgcgaac tggtcggtca taaactgagc 2640
ctgaaactgc tgcgctacta caccgagatc ctgagcatct ttggtccgag tctgcgcgat 2700
ccgattgcag cggaaattag cattcaggcg ctgagttacg cgctgggcgg cgatattaac 2760
aaaatcctgg acaaactggg ctattctggc ggcgattttc tggcgattct ggaatccaaa 2820
ggcatcaaag cgcgcgttac ctacgttgat acccgcgatt atttcatcat cctgagcatt 2880
gcgtatccga ccctgagcga aatcaaaggc gttatcgtcc acaaaatcga ggcgatcagc 2940
tacaacatcg gcgctcaaga gtggtatacc acgattccgc gttacgttgc gacccagggt 3000
tatctgatta gcaacttcga cgagaccagc tgcgttttta ccccggaagg taccgtttgt 3060
agccaaaacg ctctgtatcc gatgagtccg ctgctgcaag agtgttttcg cggtagcacc 3120
aaaagttgcg cacgtaccct ggtttctggt accaccagta atcgcttcat cctgagcaaa 3180
ggcaacctga tcgcaaattg cgcctctgtt ctgtgcaaat gctacaccac cgaaaccgtc 3240
atcaaccagg atccggataa actgctgacc gttattgcct ctgataaatg cccggtagtt 3300
gaagttgacg gcgtcaccat tcaggttggt agccgcgaat atccggatag cgtctatctg 3360
cacgagattg atctgggtcc ggcaatttct ctggagaaac tggatgtcgg taccaatctg 3420
ggtaacgcag ttacgcgtct ggaaaacgca aaagaactgc tggacgcgag cgatcagatt 3480
ctgaaaaccg ttaaaggcgt tccgttcagc ggtaacatct acattgccct ggcagcttgt 3540
attggcgtta gtctgggtct ggttaccctg atttgctgtt gcaaaggccg ttgtcgcaac 3600
aaagagattc cggcaagcaa aattaatccg ggcctgaaac cggatctgac cggtaccagc 3660
aaaagctacg ttcgtagcct gtaactcgag 3690
<210> 4
<211> 1227
<212> PRT
<213>Artificial sequence
<220>
<223>
<400> 4
Met His His His His His His Ser Ser Gly Leu Val Pro Arg Gly Ser
1 5 10 15
Gly Met Lys Glu Thr Ala Ala Ala Lys Phe Glu Arg Gln His Met Asp
20 25 30
Ser Pro Asp Leu Gly Thr Asp Asp Asp Asp Lys Ala Met Ala Asp Ile
35 40 45
Gly Ser Met Ser Ala Gln Arg Glu Arg Ile Asn Ala Phe Tyr Lys Asp
50 55 60
Asn Leu His Asn Lys Thr His Arg Val Ile Leu Asp Arg Glu Arg Leu
65 70 75 80
Thr Ile Glu Arg Pro Tyr Ile Leu Leu Gly Val Leu Leu Val Met Phe
85 90 95
Leu Ser Leu Ile Gly Leu Leu Ala Ile Ala Gly Ile Arg Leu His Arg
100 105 110
Ala Thr Val Gly Thr Ala Glu Ile Gln Ser Arg Leu Asn Thr Asn Ile
115 120 125
Glu Leu Thr Glu Ser Ile Asp His Gln Thr Lys Asp Val Leu Thr Pro
130 135 140
Leu Phe Lys Ile Ile Gly Asp Glu Val Gly Ile Arg Ile Pro Gln Lys
145 150 155 160
Phe Ser Asp Leu Val Lys Phe Ile Ser Asp Lys Ile Lys Phe Leu Asn
165 170 175
Pro Asp Arg Glu Tyr Asp Phe Arg Asp Leu Arg Trp Cys Met Asn Pro
180 185 190
Pro Glu Arg Val Lys Ile Asn Phe Asp Gln Phe Cys Glu Tyr Lys Ala
195 200 205
Ala Val Lys Ser Val Glu His Ile Phe Glu Ser Ser Leu Asn Arg Ser
210 215 220
Glu Arg Leu Arg Leu Leu Thr Leu Gly Pro Gly Thr Gly Cys Leu Gly
225 230 235 240
Arg Thr Val Thr Arg Ala Gln Phe Ser Glu Leu Thr Leu Thr Leu Met
245 250 255
Asp Leu Asp Leu Glu Met Lys His Asn Val Ser Ser Val Phe Thr Val
260 265 270
Val Glu Glu Gly Leu Phe Gly Arg Thr Tyr Thr Val Trp Arg Ser Asp
275 280 285
Thr Gly Lys Pro Ser Thr Ser Pro Gly Ile Gly His Phe Leu Arg Val
290 295 300
Phe Glu Ile Gly Leu Val Arg Asp Leu Glu Leu Gly Ala Pro Ile Phe
305 310 315 320
His Met Thr Asn Tyr Leu Thr Val Asn Met Ser Asp Asp Tyr Arg Ser
325 330 335
Cys Leu Leu Ala Val Gly Glu Leu Lys Leu Thr Ala Leu Cys Thr Pro
340 345 350
Ser Glu Thr Val Thr Leu Ser Glu Ser Gly Val Pro Lys Arg Glu Pro
355 360 365
Leu Val Val Val Ile Leu Asn Leu Ala Gly Pro Thr Leu Gly Gly Glu
370 375 380
Leu Tyr Ser Val Leu Pro Thr Thr Asp Pro Thr Val Glu Lys Leu Tyr
385 390 395 400
Leu Ser Ser His Arg Gly Ile Ile Lys Asp Asn Glu Ala Asn Trp Val
405 410 415
Val Pro Ser Thr Asp Val Arg Asp Leu Gln Asn Lys Gly Glu Cys Leu
420 425 430
Val Glu Ala Cys Lys Thr Arg Pro Pro Ser Phe Cys Asn Gly Thr Gly
435 440 445
Ile Gly Pro Trp Ser Glu Gly Arg Ile Pro Ala Tyr Gly Val Ile Arg
450 455 460
Val Ser Leu Asp Leu Ala Ser Asp Pro Gly Val Val Ile Thr Ser Val
465 470 475 480
Phe Gly Pro Leu Ile Pro His Leu Ser Gly Met Asp Leu Tyr Asn Asn
485 490 495
Pro Phe Ser Arg Ala Ala Trp Leu Ala Val Pro Pro Tyr Glu Gln Ser
500 505 510
Phe Leu Gly Met Ile Asn Thr Ile Gly Phe Pro Asp Arg Ala Glu Val
515 520 525
Met Pro His Ile Leu Thr Thr Glu Ile Arg Gly Pro Arg Gly Arg Cys
530 535 540
His Val Pro Ile Glu Leu Ser Ser Arg Ile Asp Asp Asp Ile Lys Ile
545 550 555 560
Gly Ser Asn Met Val Val Leu Pro Thr Lys Asp Leu Arg Tyr Ile Thr
565 570 575
Ala Thr Tyr Asp Val Ser Arg Ser Glu His Ala Ile Val Tyr Tyr Ile
580 585 590
Tyr Asp Thr Gly Arg Ser Ser Ser Tyr Phe Tyr Pro Val Arg Leu Asn
595 600 605
Phe Arg Gly Asn Pro Leu Ser Leu Arg Ile Glu Cys Phe Pro Trp Tyr
610 615 620
His Lys Val Trp Cys Tyr His Asp Cys Leu Ile Tyr Asn Thr Ile Thr
625 630 635 640
Asn Glu Glu Val His Thr Arg Gly Leu Thr Gly Ile Glu Val Thr Cys
645 650 655
Asn Pro Val Gly Gly Gly Gly Ile Gly Gly Gly Gly Ser Gly Gly Gly
660 665 670
Gly Ser Met His His His His His His Met Thr Arg Val Ala Thr Leu
675 680 685
Val Phe Leu Phe Leu Phe Pro Asn Thr Val Thr Cys Gln Ile His Trp
690 695 700
Gly Asn Leu Ser Lys Ile Gly Ile Val Gly Thr Gly Ser Ala Ser Tyr
705 710 715 720
Lys Val Met Thr Arg Pro Ser His Gln Thr Leu Val Ile Lys Leu Met
725 730 735
Pro Asn Ile Thr Ala Ile Asp Asn Cys Thr Lys Ser Glu Ile Ser Glu
740 745 750
Tyr Lys Arg Leu Leu Ile Thr Val Leu Lys Pro Val Glu Asp Ala Leu
755 760 765
Ser Val Ile Thr Lys Asn Val Arg Pro Ile Gln Thr Leu Thr Pro Gly
770 775 780
Arg Arg Thr Arg Arg Phe Val Gly Ala Val Leu Ala Gly Val Ala Leu
785 790 795 800
Gly Val Ala Thr Ala Ala Gln Ile Thr Ala Gly Val Ala Leu His Gln
805 810 815
Ser Leu Met Asn Ser Gln Ala Ile Glu Ser Leu Lys Thr Ser Leu Glu
820 825 830
Lys Ser Asn Gln Ala Ile Glu Glu Ile Arg Leu Ala Asn Lys Glu Thr
835 840 845
Ile Leu Ala Val Gln Gly Val Gln Asp Tyr Ile Asn Asn Glu Leu Val
850 855 860
Pro Ser Val His Arg Met Ser Cys Glu Leu Val Gly His Lys Leu Ser
865 870 875 880
Leu Lys Leu Leu Arg Tyr Tyr Thr Glu Ile Leu Ser Ile Phe Gly Pro
885 890 895
Ser Leu Arg Asp Pro Ile Ala Ala Glu Ile Ser Ile Gln Ala Leu Ser
900 905 910
Tyr Ala Leu Gly Gly Asp Ile Asn Lys Ile Leu Asp Lys Leu Gly Tyr
915 920 925
Ser Gly Gly Asp Phe Leu Ala Ile Leu Glu Ser Lys Gly Ile Lys Ala
930 935 940
Arg Val Thr Tyr Val Asp Thr Arg Asp Tyr Phe Ile Ile Leu Ser Ile
945 950 955 960
Ala Tyr Pro Thr Leu Ser Glu Ile Lys Gly Val Ile Val His Lys Ile
965 970 975
Glu Ala Ile Ser Tyr Asn Ile Gly Ala Gln Glu Trp Tyr Thr Thr Ile
980 985 990
Pro Arg Tyr Val Ala Thr Gln Gly Tyr Leu Ile Ser Asn Phe Asp Glu
995 1000 1005
Thr Ser Cys Val Phe Thr Pro Glu Gly Thr Val Cys Ser Gln Asn
1010 1015 1020
Ala Leu Tyr Pro Met Ser Pro Leu Leu Gln Glu Cys Phe Arg Gly
1025 1030 1035
Ser Thr Lys Ser Cys Ala Arg Thr Leu Val Ser Gly Thr Thr Ser
1040 1045 1050
Asn Arg Phe Ile Leu Ser Lys Gly Asn Leu Ile Ala Asn Cys Ala
1055 1060 1065
Ser Val Leu Cys Lys Cys Tyr Thr Thr Glu Thr Val Ile Asn Gln
1070 1075 1080
Asp Pro Asp Lys Leu Leu Thr Val Ile Ala Ser Asp Lys Cys Pro
1085 1090 1095
Val Val Glu Val Asp Gly Val Thr Ile Gln Val Gly Ser Arg Glu
1100 1105 1110
Tyr Pro Asp Ser Val Tyr Leu His Glu Ile Asp Leu Gly Pro Ala
1115 1120 1125
Ile Ser Leu Glu Lys Leu Asp Val Gly Thr Asn Leu Gly Asn Ala
1130 1135 1140
Val Thr Arg Leu Glu Asn Ala Lys Glu Leu Leu Asp Ala Ser Asp
1145 1150 1155
Gln Ile Leu Lys Thr Val Lys Gly Val Pro Phe Ser Gly Asn Ile
1160 1165 1170
Tyr Ile Ala Leu Ala Ala Cys Ile Gly Val Ser Leu Gly Leu Val
1175 1180 1185
Thr Leu Ile Cys Cys Cys Lys Gly Arg Cys Arg Asn Lys Glu Ile
1190 1195 1200
Pro Ala Ser Lys Ile Asn Pro Gly Leu Lys Pro Asp Leu Thr Gly
1205 1210 1215
Thr Ser Lys Ser Tyr Val Arg Ser Leu
1220 1225
<210> 5
<211> 3564
<212> PRT
<213>Artificial sequence
<220>
<223>
<400> 5
catatgcatc atcaccatca ccatatgtcc gcacaaaggg aaaggatcaa tgccttctac 60
aaagacaatc ttcataataa gacccatagg gtaatcctgg atagggaacg cttaactatt 120
gaaagaccct acatcttact tggagtccta ctggtaatgt ttctgagtct aatcgggctg 180
ctggccattg cagggatcag gcttcaccga gccaccgttg ggactgcgga gatccagagt 240
cggctgaata ccaacattga gttgaccgaa tccattgatc atcaaactaa ggatgtctta 300
actcccctgt ttaaaatcat tggtgatgaa gtcggcatca gaattccaca gaagttcagt 360
gatcttgtca agttcatctc cgataagatt aagttcctca accctgacag agaatatgat 420
tttagggatc tccggtggtg tatgaacccc cctgagagag tcaaaattaa ctttgaccag 480
ttttgtgaat acaaagccgc agtcaagtca gttgaacata tatttgagtc atcactcaac 540
aggtcagaaa ggttgcgatt attgactctt gggcccggaa caggctgtct cggcaggaca 600
gtaacaagag ctcagttctc agagcttacg ctgaccctga tggacctgga tctcgagatg 660
aagcacaacg tgtcctcagt gtttaccgta gtcgaagagg gattattcgg aagaacatat 720
actgtctgga gatccgacac cggaaaaccg agcaccagtc caggtattgg ccatttttta 780
agagtcttcg agatcgggct ggtgagagat ctcgagctgg gtgcccctat tttccatatg 840
accaactacc tcacagtaaa catgagtgat gactatcgga gctgtctttt agcagtaggg 900
gagttgaagc tgacagccct atgcacccca tctgagactg tgactctgag tgagagtgga 960
gttccaaaga gagagcctct tgtggttgtg atactcaacc tagctgggcc tactctaggg 1020
ggcgaactat acagtgtatt gcctaccact gaccccacgg tggagaaact ctatttatcc 1080
tcacataggg ggattatcaa agataacgag gccaattggg tagtaccgtc gaccgatgtt 1140
cgtgatcttc aaaacaaagg agaatgtctg gtggaagcat gcaagactcg acctccttca 1200
ttttgcaatg gcacaggaat aggcccatgg tcagagggga gaatccctgc ctacggggtg 1260
atcagggtca gtcttgactt agctagtgac ccgggtgtag ttatcacttc agtgtttggc 1320
ccattgatac ctcacctatc cggtatggat ctttacaaca atccgttttc aagagctgca 1380
tggttggctg taccacctta tgagcagtca tttctaggaa tgataaatac aattggcttc 1440
ccggacagag cagaggttat gccgcacatt ttgaccacag agatcagagg gcctcgaggt 1500
cgttgtcatg ttcctataga gttgtccagc aggattgatg atgatatcaa gatcgggtcc 1560
aacatggttg tattgccgac gaaggacctg aggtacataa cagccactta tgatgtttcc 1620
aggagcgagc atgcaatcgt gtactatatc tatgacacgg gtcgctcatc atcttacttc 1680
tacccagttc gattgaattt caggggcaat cctctctctc tgaggataga gtgttttccc 1740
tggtatcata aggtgtggtg ctaccatgat tgtcttatat acaacaccat aacaaacgaa 1800
gaagtccaca cgagagggct gaccggtata gaggtaacat gtaatccagt cggtggcggt 1860
ggaatcggag gtggtggaag cggaggaggt ggaagcatgc atcatcacca tcaccatatg 1920
acacgggtcg caaccttagt atttctgttt cttttcccaa acactgtcac gtgccagatt 1980
cactggggca atctatccaa gatcgggatt gtaggaacgg ggagtgccag ctacaaggtg 2040
atgactaggc caagccacca aactctggtc ataaagttga tgccaaatat aacagccatc 2100
gacaattgta cgaaatcaga gatttcagag tacaaaagat tgctgatcac agtgttaaag 2160
cctgtagagg atgccctgtc agtgataacc aagaatgtaa gaccaattca aactctaaca 2220
cctgggcgca ggacccgccg ttttgtcggg gctgttctgg ccggagtagc acttggagtc 2280
gcgacagccg ctcaaataac tgccggagtc gcactccatc agtcattgat gaattcccaa 2340
gcaattgaaa gtttaaaaac cagtcttgag aagtcgaatc aggcaataga agaaatcaga 2400
cttgcaaata aggagaccat actggcggta cagggcgtcc aagactatat caacaatgag 2460
cttgtcccct ctgttcatag aatgtcatgt gagcttgtag gtcacaaact cagtctcaag 2520
ctccttaggt attataccga gatcctgtct atattcgggc ctagccttcg agacccgata 2580
gctgctgaaa tatcaatcca ggcactcagc tatgcattag gcggagacat caataaaatt 2640
ctggacaagc ttgggtatag cggcggggat ttccttgcta tcctagaaag caaggggata 2700
aaggcccggg tcacatatgt ggacacaaga gattacttta taattcttag catagcctac 2760
ccaaccttat ctgagatcaa gggggtgata gttcataaga tagaagctat atcctacaat 2820
attggggcac aggaatggta tactactatc cctagatatg tagccactca gggatatctg 2880
atatcgaatt tcgatgagac gtcatgcgtc ttcactccag aggggacagt ctgcagccag 2940
aatgcgttgt atccaatgag cccattgctt caggaatgct tcagggggtc gacaaaatcg 3000
tgcgccagaa ccctagtttc agggaccaca agtaatagat ttatcctatc aaaagggaac 3060
ttgattgcaa attgtgcgtc agttttgtgc aagtgttaca caacggagac agttatcaac 3120
caagatcctg ataaactact aactgttata gcctccgata agtgtcccgt agtcgaggtg 3180
gatggagtga caatacaggt cggcagtcga gagtacccag attctgtata cctacatgaa 3240
atagacttag gcccagccat ctccctggag aaactggatg taggcaccaa tttaggcaat 3300
gcagtcacaa gactggagaa tgcaaaggag ctactagatg catcagacca gatactgaag 3360
actgttaaag gggtaccttt cagtggcaat atatacatag cactggcagc ttgcattggg 3420
gtatccctag ggcttgtcac attaatatgc tgctgtaagg ggagatgtag gaacaaggag 3480
attcctgcct ccaaaatcaa cccagggctc aaacccgacc taaccgggac ttcaaagtcg 3540
tacgtgagat cactgtagct cgag 3564
Claims (10)
- Protein 1.a) or b):A) protein being made up of SEQ ID No.2 amino acid sequence;B) by the amino acid sequence shown in SEQ ID No.2 by one or several amino acid residues substitution and/or missing and/ Or the soluble protein that addition is obtained.
- 2. protein according to claim 1, it is characterised in that:The protein is according to the method system comprised the following steps It is standby:The encoding gene of the protein is set in biology express obtaining the protein;It is described biological for microorganism, plant Or non-human animal.
- 3. protein according to claim 2, it is characterised in that:The encoding gene of the protein is set to be carried out in biology Expression includes the encoding gene of the protein described in claim 1 importing recipient microorganism, obtains described in expression claim 1 The recombinant microorganism of protein, cultivates the recombinant microorganism, and expression obtains protein described in claim 1.
- 4. protein according to claim 3, it is characterised in that:Any of the recipient microorganism is C1)-C4):C1) prokaryotic micro-organisms;C2) gramnegative bacterium;C3) Escherichia bacteria;C4) e. coli bl21 (DE3).
- 5. according to any described protein in claim 2-4, it is characterised in that:The encoding gene of the protein is as follows Or 2) 1) DNA molecular shown in:1) coded sequence is the DNA molecular shown in SEQ ID No.1 4-3558;2) DNA molecular with 1) restriction is with protein DNA point described in more than 90% homogeneity and coding claim 1 Son.
- 6. according to any described protein in claim 3-5, it is characterised in that:The recombinant microorganism is by pET30a- The express amino acid sequence that rmHF1-Y importing e. coli bl21s (DE3) are obtained is that the restructuring of SEQ ID No.2 protein is micro- Biology, the recombinant microorganism is named as BL21 (DE3)/pET30a-rmHF1-Y, and the pET30a-rmHF1-Y is to use SEQ DNA shown in ID No.1 replaces the fragment between pET30a (+) Nde I and XhoI recognition sites, keeps pET30a (+) its Its sequence is constant, obtained recombinant expression carrier.
- 7. the biomaterial with the albumen qualitative correlation described in claim 1, the biomaterial is following B1) to B16) in appoint It is a kind of:B1) the nucleic acid molecules of protein described in coding claim 1;B2 B1) is contained) expression cassettes of the nucleic acid molecules;B3 B1) is contained) recombinant vectors of the nucleic acid molecules;B4 B2) is contained) recombinant vector of the expression cassette;B5 B1) is contained) recombinant microorganisms of the nucleic acid molecules;B6 B2) is contained) recombinant microorganism of the expression cassette;B7 B3) is contained) recombinant microorganism of the recombinant vector;B8 B4) is contained) recombinant microorganism of the recombinant vector;B9 B1) is contained) the transgenetic animal cell systems of the nucleic acid molecules;B10 B2) is contained) the transgenetic animal cell system of the expression cassette;B11 B3) is contained) the transgenetic animal cell system of the recombinant vector;B12 B4) is contained) the transgenetic animal cell system of the recombinant vector;B13 B1) is contained) the transgenic plant cells systems of the nucleic acid molecules;B14 B2) is contained) the transgenic plant cells system of the expression cassette;B15 B3) is contained) the transgenic plant cells system of the recombinant vector;B16 B4) is contained) the transgenic plant cells system of the recombinant vector.
- 8. biomaterial according to claim 7, it is characterised in that:The nucleic acid molecules for it is following 1) or 2) shown in DNA molecular:1) coded sequence is the DNA molecular shown in SEQ ID No.1 4-3558;2) DNA molecular with 1) restriction is with protein DNA point described in more than 90% homogeneity and coding claim 1 Son.
- 9. biomaterial according to claim 8 or claim 9, it is characterised in that:The recombinant vector is described in claim 6 pET30a-rmHF1-Y;The recombinant microorganism is E1) or E2):E1) recombinant microorganism is that the encoding gene of the protein described in claim 1 is imported into recipient microorganism, obtains table Any of up to the recombinant microorganism of protein described in claim 1, the recipient microorganism is C1)-C4):C1) prokaryotic micro-organisms;C2) gramnegative bacterium;C3) Escherichia bacteria;C4) e. coli bl21 (DE3).E2) recombinant microorganism is that pET30a-rmHF1-Y described in claim 6 is imported into e. coli bl21 (DE3) to obtain To express amino acid sequence be SEQ ID No.2 protein recombinant microorganism.
- 10.D1) or D2) application:D1 any described biomaterial detects small ruminate in preparation in protein or claim 7-9) described in claim 1 Application in the kit of epizootic disease antiviral antibody, the application in PPR diagnostic antigen is prepared are preparing small ruminate Application in epizootic disease diagnostic kit;D2) application of the protein described in claim 1 in monoclonal antibody is prepared.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710483527.8A CN107098979B (en) | 2017-06-22 | 2017-06-22 | Peste des petits ruminants virus H-F fusion protein and related biological material and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710483527.8A CN107098979B (en) | 2017-06-22 | 2017-06-22 | Peste des petits ruminants virus H-F fusion protein and related biological material and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107098979A true CN107098979A (en) | 2017-08-29 |
CN107098979B CN107098979B (en) | 2020-05-19 |
Family
ID=59663149
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710483527.8A Expired - Fee Related CN107098979B (en) | 2017-06-22 | 2017-06-22 | Peste des petits ruminants virus H-F fusion protein and related biological material and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107098979B (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110346566A (en) * | 2018-04-03 | 2019-10-18 | 苏州工业园区强东医药科技有限公司 | PPR virus antibody assay kit |
CN110498844A (en) * | 2018-05-18 | 2019-11-26 | 中国兽医药品监察所 | Peste des petits ruminants diagnostic kit |
CN110498845A (en) * | 2018-05-18 | 2019-11-26 | 中国兽医药品监察所 | Peste des petits ruminants diagnostic kit |
CN110498842A (en) * | 2018-05-18 | 2019-11-26 | 中国兽医药品监察所 | Peste des petits ruminants diagnostic kit |
CN110498843A (en) * | 2018-05-18 | 2019-11-26 | 中国兽医药品监察所 | Peste des petits ruminants diagnostic kit |
CN110684782A (en) * | 2019-10-29 | 2020-01-14 | 新疆畜牧科学院兽医研究所(新疆畜牧科学院动物临床医学研究中心) | Codon-optimized peste des petits ruminants virus F gene nucleic acid vaccine |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101422607A (en) * | 2008-10-08 | 2009-05-06 | 中国农业科学院哈尔滨兽医研究所 | Peste des petis ruminants recombined goat pox virus movable carrier vaccine |
CN102178947A (en) * | 2011-03-16 | 2011-09-14 | 中国人民解放军军事医学科学院军事兽医研究所 | Live vector vaccine for expressing peste des petits ruminants virus (PPRV) H gene and preparation method thereof |
CN102559707A (en) * | 2011-12-27 | 2012-07-11 | 中国农业科学院兰州兽医研究所 | Peste des petits ruminants virus H and F protein antigenic epitope gene fragment, and expression and application thereof |
WO2012098465A1 (en) * | 2011-01-18 | 2012-07-26 | Dalhousie University | Pvrl4 (nectin 4) is a receptor for measles virus |
CN104498527A (en) * | 2014-11-20 | 2015-04-08 | 青岛农业大学 | Method for constructing peste des petits ruminant transgenic plant vaccine efficient expression vector |
CN105158480A (en) * | 2015-08-05 | 2015-12-16 | 中国农业科学院兰州兽医研究所 | Kit for detecting peste des petits ruminants virus hemagglutinin protein antibody and application method of kit |
-
2017
- 2017-06-22 CN CN201710483527.8A patent/CN107098979B/en not_active Expired - Fee Related
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101422607A (en) * | 2008-10-08 | 2009-05-06 | 中国农业科学院哈尔滨兽医研究所 | Peste des petis ruminants recombined goat pox virus movable carrier vaccine |
WO2012098465A1 (en) * | 2011-01-18 | 2012-07-26 | Dalhousie University | Pvrl4 (nectin 4) is a receptor for measles virus |
CN102178947A (en) * | 2011-03-16 | 2011-09-14 | 中国人民解放军军事医学科学院军事兽医研究所 | Live vector vaccine for expressing peste des petits ruminants virus (PPRV) H gene and preparation method thereof |
CN102559707A (en) * | 2011-12-27 | 2012-07-11 | 中国农业科学院兰州兽医研究所 | Peste des petits ruminants virus H and F protein antigenic epitope gene fragment, and expression and application thereof |
CN104498527A (en) * | 2014-11-20 | 2015-04-08 | 青岛农业大学 | Method for constructing peste des petits ruminant transgenic plant vaccine efficient expression vector |
CN105158480A (en) * | 2015-08-05 | 2015-12-16 | 中国农业科学院兰州兽医研究所 | Kit for detecting peste des petits ruminants virus hemagglutinin protein antibody and application method of kit |
Non-Patent Citations (1)
Title |
---|
石晓妮: "小反刍兽疫病毒H、F蛋白抗原表位的鉴定和免疫学特性研究", 《中国优秀硕士学位论文全文数据库 农业科技辑》 * |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110346566A (en) * | 2018-04-03 | 2019-10-18 | 苏州工业园区强东医药科技有限公司 | PPR virus antibody assay kit |
CN110346566B (en) * | 2018-04-03 | 2022-04-12 | 苏州工业园区强东医药科技有限公司 | Peste des petits ruminants virus antibody detection kit |
CN110498844A (en) * | 2018-05-18 | 2019-11-26 | 中国兽医药品监察所 | Peste des petits ruminants diagnostic kit |
CN110498845A (en) * | 2018-05-18 | 2019-11-26 | 中国兽医药品监察所 | Peste des petits ruminants diagnostic kit |
CN110498842A (en) * | 2018-05-18 | 2019-11-26 | 中国兽医药品监察所 | Peste des petits ruminants diagnostic kit |
CN110498843A (en) * | 2018-05-18 | 2019-11-26 | 中国兽医药品监察所 | Peste des petits ruminants diagnostic kit |
CN110498842B (en) * | 2018-05-18 | 2021-01-26 | 中国兽医药品监察所 | Peste des petits ruminants diagnostic kit |
CN110498843B (en) * | 2018-05-18 | 2021-01-26 | 中国兽医药品监察所 | Peste des petits ruminants diagnostic kit |
CN110498844B (en) * | 2018-05-18 | 2021-03-09 | 中国兽医药品监察所 | Peste des petits ruminants diagnostic kit |
CN110498845B (en) * | 2018-05-18 | 2021-03-09 | 中国兽医药品监察所 | Peste des petits ruminants diagnostic kit |
CN110684782A (en) * | 2019-10-29 | 2020-01-14 | 新疆畜牧科学院兽医研究所(新疆畜牧科学院动物临床医学研究中心) | Codon-optimized peste des petits ruminants virus F gene nucleic acid vaccine |
CN110684782B (en) * | 2019-10-29 | 2023-04-14 | 新疆畜牧科学院兽医研究所(新疆畜牧科学院动物临床医学研究中心) | Codon-optimized peste des petits ruminants virus F gene nucleic acid vaccine |
Also Published As
Publication number | Publication date |
---|---|
CN107098979B (en) | 2020-05-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107238702B (en) | Detect the enzyme linked immunological kit of PPR virus antibody | |
CN107098979A (en) | PPR virus H F fusion proteins and its relevant biological material and application | |
CN107894506B (en) | Detect enzyme linked immunological kit and its application of capripox virus antibody | |
CN102964435B (en) | Truncated-form streptococcus hemolyticus bacteriolysin O and detection kit using same | |
CN104725490A (en) | Porcine circovirus 2 type ELISA antibody detection kit | |
CN110746495A (en) | Recombinant protein E2 and application thereof | |
CN107759674A (en) | A kind of Mycoplasma bovis immune-related albumen, the detection kit containing the albumen and its purposes in Mycoplasma bovis antibody test | |
CN105273065A (en) | 1-type duck hepatitis A virus VP2 recombinant protein, ELISA kit and preparation method thereof | |
CN107236047A (en) | Recombinate the solubility expression method of PPR virus H F fusion proteins | |
CN103467584A (en) | Method for obtaining and fermenting prokaryotic genetic-engineered hybrid cationic antimicrobial peptide CC | |
CN109212230B (en) | Sensitized polystyrene nano-microsphere for detecting canine parvovirus structural protein VP2 antibody and preparation method and application thereof | |
CN102680699B (en) | ELISA (enzyme-linked immunosorbent assay) detection method for identifying fowl adenovirus group I (FAVI) infection | |
CN101982777B (en) | Duck plague virus antigen capturing ELISA method based on anti-recombination UL51 albumen antibody | |
CN102618557B (en) | Recombinant avian flavivirus E protein and application thereof | |
CN109851675A (en) | A kind of aftosa diagnostic kit and its aftosa diagnostic antigen used | |
CN109856396A (en) | Detect enzyme linked immunological kit and its application of mouth disease virus infection antibody | |
CN108776225A (en) | Pig parvoviral VLPs antibody assay kits and preparation method thereof, application | |
CN108101995A (en) | Recombinate capripox virus fusion protein and its application | |
CN1325649C (en) | Gene recombinant human cytomegalovirus fusion protein pp150/MDBP, preparation process and application thereof | |
CN110484516A (en) | It is a kind of with CRISPR system by the quantum dot-labeled methods and applications to viral nucleic acid | |
CN105949320A (en) | Preparation and application of echovirus type 1 VP1 protein specific antigen epitope and fusion protein thereof | |
CN108164603A (en) | Recombinate the solubility expression method of capripox virus fusion protein and its biomaterial used | |
CN102360013B (en) | ELISA kit for detecting duck plague virus antibody, and antibody detection method thereof | |
CN109851662A (en) | Foot and mouth disease virus recombinant protein and its relevant biological material and application | |
CN104391112A (en) | Expression protein detecting bovine parainfluenza virus 3 antibody and ELISA kit |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20200519 |