RU2818229C2 - Recovered nucleic acid which codes fviii-bdd-based fusion protein and heterologous signal peptide, and use thereof - Google Patents
Recovered nucleic acid which codes fviii-bdd-based fusion protein and heterologous signal peptide, and use thereof Download PDFInfo
- Publication number
- RU2818229C2 RU2818229C2 RU2022111695A RU2022111695A RU2818229C2 RU 2818229 C2 RU2818229 C2 RU 2818229C2 RU 2022111695 A RU2022111695 A RU 2022111695A RU 2022111695 A RU2022111695 A RU 2022111695A RU 2818229 C2 RU2818229 C2 RU 2818229C2
- Authority
- RU
- Russia
- Prior art keywords
- leu
- ser
- val
- glu
- gly
- Prior art date
Links
- 150000007523 nucleic acids Chemical class 0.000 title claims abstract description 108
- 102000039446 nucleic acids Human genes 0.000 title claims abstract description 103
- 108020004707 nucleic acids Proteins 0.000 title claims abstract description 103
- 108010076504 Protein Sorting Signals Proteins 0.000 title claims abstract description 79
- 108020001507 fusion proteins Proteins 0.000 title claims description 70
- 102000037865 fusion proteins Human genes 0.000 title claims description 70
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 86
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 63
- 208000009292 Hemophilia A Diseases 0.000 claims abstract description 48
- 101000911390 Homo sapiens Coagulation factor VIII Proteins 0.000 claims abstract description 38
- 102100026735 Coagulation factor VIII Human genes 0.000 claims abstract description 37
- 201000003542 Factor VIII deficiency Diseases 0.000 claims abstract description 37
- 239000008194 pharmaceutical composition Substances 0.000 claims abstract description 14
- 239000013604 expression vector Substances 0.000 claims description 82
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 51
- 229960000301 factor viii Drugs 0.000 claims description 36
- 238000000034 method Methods 0.000 claims description 20
- 239000002773 nucleotide Substances 0.000 claims description 19
- 125000003729 nucleotide group Chemical group 0.000 claims description 19
- 239000000203 mixture Substances 0.000 claims description 17
- 238000011282 treatment Methods 0.000 claims description 10
- 239000003114 blood coagulation factor Substances 0.000 claims description 9
- 230000008488 polyadenylation Effects 0.000 claims description 7
- 239000000546 pharmaceutical excipient Substances 0.000 claims description 6
- 241000702421 Dependoparvovirus Species 0.000 claims description 5
- 241000702423 Adeno-associated virus - 2 Species 0.000 claims description 3
- 241001634120 Adeno-associated virus - 5 Species 0.000 claims description 3
- 241000972680 Adeno-associated virus - 6 Species 0.000 claims description 3
- 241001655883 Adeno-associated virus - 1 Species 0.000 claims description 2
- 241000202702 Adeno-associated virus - 3 Species 0.000 claims description 2
- 241000580270 Adeno-associated virus - 4 Species 0.000 claims description 2
- 241001164823 Adeno-associated virus - 7 Species 0.000 claims description 2
- 241001164825 Adeno-associated virus - 8 Species 0.000 claims description 2
- 241000649045 Adeno-associated virus 10 Species 0.000 claims description 2
- 241000649046 Adeno-associated virus 11 Species 0.000 claims description 2
- 241000649047 Adeno-associated virus 12 Species 0.000 claims description 2
- 241000300529 Adeno-associated virus 13 Species 0.000 claims description 2
- 102100035426 DnaJ homolog subfamily B member 7 Human genes 0.000 claims description 2
- 101100285903 Drosophila melanogaster Hsc70-2 gene Proteins 0.000 claims description 2
- 101100178718 Drosophila melanogaster Hsc70-4 gene Proteins 0.000 claims description 2
- 101100178723 Drosophila melanogaster Hsc70-5 gene Proteins 0.000 claims description 2
- 101000804114 Homo sapiens DnaJ homolog subfamily B member 7 Proteins 0.000 claims description 2
- 101150090950 Hsc70-1 gene Proteins 0.000 claims description 2
- 101100150366 Schizosaccharomyces pombe (strain 972 / ATCC 24843) sks2 gene Proteins 0.000 claims description 2
- 239000013646 rAAV2 vector Substances 0.000 claims 1
- 108010054218 Factor VIII Proteins 0.000 abstract description 51
- 102000001690 Factor VIII Human genes 0.000 abstract description 42
- 239000013598 vector Substances 0.000 abstract description 33
- 230000000694 effects Effects 0.000 abstract description 26
- 239000000126 substance Substances 0.000 abstract description 5
- 229940105778 coagulation factor viii Drugs 0.000 abstract description 4
- 210000004027 cell Anatomy 0.000 description 119
- 108020004414 DNA Proteins 0.000 description 25
- 238000001890 transfection Methods 0.000 description 25
- 239000003814 drug Substances 0.000 description 18
- 229940079593 drug Drugs 0.000 description 17
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 15
- 108090000765 processed proteins & peptides Proteins 0.000 description 15
- 238000010361 transduction Methods 0.000 description 15
- 230000026683 transduction Effects 0.000 description 15
- 108010050848 glycylleucine Proteins 0.000 description 14
- 108010056582 methionylglutamic acid Proteins 0.000 description 14
- 239000000243 solution Substances 0.000 description 14
- 102000004407 Lactalbumin Human genes 0.000 description 12
- 108090000942 Lactalbumin Proteins 0.000 description 12
- 241001465754 Metazoa Species 0.000 description 12
- 108010077245 asparaginyl-proline Proteins 0.000 description 12
- 238000001415 gene therapy Methods 0.000 description 12
- 238000004519 manufacturing process Methods 0.000 description 12
- 108010051242 phenylalanylserine Proteins 0.000 description 12
- 108091033319 polynucleotide Proteins 0.000 description 12
- 102000040430 polynucleotide Human genes 0.000 description 12
- 239000002157 polynucleotide Substances 0.000 description 12
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 11
- 208000031220 Hemophilia Diseases 0.000 description 11
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 11
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 11
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 11
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 11
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 11
- 238000002347 injection Methods 0.000 description 11
- 239000007924 injection Substances 0.000 description 11
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 11
- 210000002381 plasma Anatomy 0.000 description 11
- 239000013612 plasmid Substances 0.000 description 11
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 10
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 10
- 241000880493 Leptailurus serval Species 0.000 description 10
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 10
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 10
- 108091028043 Nucleic acid sequence Proteins 0.000 description 10
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 10
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 10
- 102000004196 processed proteins & peptides Human genes 0.000 description 10
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 9
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 9
- 108010079364 N-glycylalanine Proteins 0.000 description 9
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 9
- 229940027941 immunoglobulin g Drugs 0.000 description 9
- 108010057821 leucylproline Proteins 0.000 description 9
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 9
- 108010044292 tryptophyltyrosine Proteins 0.000 description 9
- 101000600434 Homo sapiens Putative uncharacterized protein encoded by MIR7-3HG Proteins 0.000 description 8
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 8
- UIJVKVHLCQSPOJ-XIRDDKMYSA-N Lys-Ser-Trp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O UIJVKVHLCQSPOJ-XIRDDKMYSA-N 0.000 description 8
- 102100037401 Putative uncharacterized protein encoded by MIR7-3HG Human genes 0.000 description 8
- 108010093581 aspartyl-proline Proteins 0.000 description 8
- 238000001727 in vivo Methods 0.000 description 8
- 108010073969 valyllysine Proteins 0.000 description 8
- 230000003612 virological effect Effects 0.000 description 8
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 7
- 238000002965 ELISA Methods 0.000 description 7
- CUEQQFOGARVNHU-VGDYDELISA-N His-Ser-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUEQQFOGARVNHU-VGDYDELISA-N 0.000 description 7
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 7
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 7
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 7
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 7
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 7
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 7
- VNRTXOUAOUZCFW-WDSOQIARSA-N Trp-Val-His Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O VNRTXOUAOUZCFW-WDSOQIARSA-N 0.000 description 7
- 239000012531 culture fluid Substances 0.000 description 7
- 239000012634 fragment Substances 0.000 description 7
- 108010040030 histidinoalanine Proteins 0.000 description 7
- 108010054155 lysyllysine Proteins 0.000 description 7
- 108010017391 lysylvaline Proteins 0.000 description 7
- 229920001184 polypeptide Polymers 0.000 description 7
- 108010004914 prolylarginine Proteins 0.000 description 7
- 108010071207 serylmethionine Proteins 0.000 description 7
- 108010061238 threonyl-glycine Proteins 0.000 description 7
- 238000013518 transcription Methods 0.000 description 7
- 230000035897 transcription Effects 0.000 description 7
- 108010027345 wheylin-1 peptide Proteins 0.000 description 7
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 6
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 6
- OWSMKCJUBAPHED-JYJNAYRXSA-N Arg-Pro-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OWSMKCJUBAPHED-JYJNAYRXSA-N 0.000 description 6
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 6
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 6
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 6
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 6
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 6
- 241000699670 Mus sp. Species 0.000 description 6
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 6
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 6
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 6
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 6
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 6
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 6
- 108010087924 alanylproline Proteins 0.000 description 6
- 108010008355 arginyl-glutamine Proteins 0.000 description 6
- 108010038633 aspartylglutamate Proteins 0.000 description 6
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 6
- 108010078144 glutaminyl-glycine Proteins 0.000 description 6
- 108010037850 glycylvaline Proteins 0.000 description 6
- 108010025306 histidylleucine Proteins 0.000 description 6
- 108010085325 histidylproline Proteins 0.000 description 6
- 238000000338 in vitro Methods 0.000 description 6
- 239000007788 liquid Substances 0.000 description 6
- 108010003700 lysyl aspartic acid Proteins 0.000 description 6
- 210000004962 mammalian cell Anatomy 0.000 description 6
- 230000035772 mutation Effects 0.000 description 6
- 108010078580 tyrosylleucine Proteins 0.000 description 6
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 5
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 5
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 5
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 5
- LZRMPXRYLLTAJX-GUBZILKMSA-N Gln-Arg-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZRMPXRYLLTAJX-GUBZILKMSA-N 0.000 description 5
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 5
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 5
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 5
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 5
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 5
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 5
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 5
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 5
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 5
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 5
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 5
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 5
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 5
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 5
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 5
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 5
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 5
- OFCKFBGRYHOKFP-IHPCNDPISA-N Trp-Asp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N OFCKFBGRYHOKFP-IHPCNDPISA-N 0.000 description 5
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 5
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 5
- 108010013835 arginine glutamate Proteins 0.000 description 5
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 5
- 108010062796 arginyllysine Proteins 0.000 description 5
- 230000001580 bacterial effect Effects 0.000 description 5
- 230000002950 deficient Effects 0.000 description 5
- 201000010099 disease Diseases 0.000 description 5
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 5
- 238000001990 intravenous administration Methods 0.000 description 5
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 5
- 108010076718 lysyl-glutamyl-tryptophan Proteins 0.000 description 5
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 5
- 239000013642 negative control Substances 0.000 description 5
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 5
- 108010090894 prolylleucine Proteins 0.000 description 5
- 108010053725 prolylvaline Proteins 0.000 description 5
- 238000012360 testing method Methods 0.000 description 5
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 5
- 230000014616 translation Effects 0.000 description 5
- 108010029384 tryptophyl-histidine Proteins 0.000 description 5
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 4
- WYPUMLRSQMKIJU-BPNCWPANSA-N Ala-Arg-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WYPUMLRSQMKIJU-BPNCWPANSA-N 0.000 description 4
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 4
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 4
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 4
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 4
- DEAGTWNKODHUIY-MRFFXTKBSA-N Ala-Tyr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DEAGTWNKODHUIY-MRFFXTKBSA-N 0.000 description 4
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 4
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 4
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 4
- FLYANDHDFRGGTM-PYJNHQTQSA-N Arg-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FLYANDHDFRGGTM-PYJNHQTQSA-N 0.000 description 4
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 4
- OGSQONVYSTZIJB-WDSOQIARSA-N Arg-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OGSQONVYSTZIJB-WDSOQIARSA-N 0.000 description 4
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 4
- IGFJVXOATGZTHD-UHFFFAOYSA-N Arg-Phe-His Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccccc1)C(=O)NC(Cc2c[nH]cn2)C(=O)O IGFJVXOATGZTHD-UHFFFAOYSA-N 0.000 description 4
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 4
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 4
- JPAWCMXVNZPJLO-IHRRRGAJSA-N Arg-Ser-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JPAWCMXVNZPJLO-IHRRRGAJSA-N 0.000 description 4
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 4
- CNBIWSCSSCAINS-UFYCRDLUSA-N Arg-Tyr-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNBIWSCSSCAINS-UFYCRDLUSA-N 0.000 description 4
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 4
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 4
- FODVBOKTYKYRFJ-CIUDSAMLSA-N Asn-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N FODVBOKTYKYRFJ-CIUDSAMLSA-N 0.000 description 4
- HMUKKNAMNSXDBB-CIUDSAMLSA-N Asn-Met-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMUKKNAMNSXDBB-CIUDSAMLSA-N 0.000 description 4
- KYQJHBWHRASMKG-ZLUOBGJFSA-N Asn-Ser-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O KYQJHBWHRASMKG-ZLUOBGJFSA-N 0.000 description 4
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 4
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 4
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 4
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 4
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 4
- YRZIYQGXTSBRLT-AVGNSLFASA-N Asp-Phe-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YRZIYQGXTSBRLT-AVGNSLFASA-N 0.000 description 4
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 4
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 4
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 4
- 102000015081 Blood Coagulation Factors Human genes 0.000 description 4
- 108010039209 Blood Coagulation Factors Proteins 0.000 description 4
- DCXGXDGGXVZVMY-GHCJXIJMSA-N Cys-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CS DCXGXDGGXVZVMY-GHCJXIJMSA-N 0.000 description 4
- YMBAVNPKBWHDAW-CIUDSAMLSA-N Cys-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N YMBAVNPKBWHDAW-CIUDSAMLSA-N 0.000 description 4
- 108010014173 Factor X Proteins 0.000 description 4
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 4
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 4
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 4
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 4
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 4
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 4
- RQNYYRHRKSVKAB-GUBZILKMSA-N Glu-Cys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O RQNYYRHRKSVKAB-GUBZILKMSA-N 0.000 description 4
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 4
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 4
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 4
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 4
- RQZGFWKQLPJOEQ-YUMQZZPRSA-N Gly-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN)CN=C(N)N RQZGFWKQLPJOEQ-YUMQZZPRSA-N 0.000 description 4
- YYQGVXNKAXUTJU-YUMQZZPRSA-N Gly-Cys-His Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O YYQGVXNKAXUTJU-YUMQZZPRSA-N 0.000 description 4
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 4
- OCPPBNKYGYSLOE-IUCAKERBSA-N Gly-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN OCPPBNKYGYSLOE-IUCAKERBSA-N 0.000 description 4
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 4
- 208000032843 Hemorrhage Diseases 0.000 description 4
- XINDHUAGVGCNSF-QSFUFRPTSA-N His-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XINDHUAGVGCNSF-QSFUFRPTSA-N 0.000 description 4
- JJHWJUYYTWYXPL-PYJNHQTQSA-N His-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CN=CN1 JJHWJUYYTWYXPL-PYJNHQTQSA-N 0.000 description 4
- ORERHHPZDDEMSC-VGDYDELISA-N His-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ORERHHPZDDEMSC-VGDYDELISA-N 0.000 description 4
- RNMNYMDTESKEAJ-KKUMJFAQSA-N His-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 RNMNYMDTESKEAJ-KKUMJFAQSA-N 0.000 description 4
- IGBBXBFSLKRHJB-BZSNNMDCSA-N His-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 IGBBXBFSLKRHJB-BZSNNMDCSA-N 0.000 description 4
- OWYIDJCNRWRSJY-QTKMDUPCSA-N His-Pro-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O OWYIDJCNRWRSJY-QTKMDUPCSA-N 0.000 description 4
- FLXCRBXJRJSDHX-AVGNSLFASA-N His-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O FLXCRBXJRJSDHX-AVGNSLFASA-N 0.000 description 4
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 4
- FBVHRDXSCYELMI-PBCZWWQYSA-N His-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O FBVHRDXSCYELMI-PBCZWWQYSA-N 0.000 description 4
- VXZZUXWAOMWWJH-QTKMDUPCSA-N His-Thr-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VXZZUXWAOMWWJH-QTKMDUPCSA-N 0.000 description 4
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 4
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 description 4
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 4
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 4
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 4
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 4
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 4
- ORWTWZXGDBYVCP-BJDJZHNGSA-N Leu-Ile-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(C)C ORWTWZXGDBYVCP-BJDJZHNGSA-N 0.000 description 4
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 4
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 4
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 4
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 4
- FOBUGKUBUJOWAD-IHPCNDPISA-N Leu-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FOBUGKUBUJOWAD-IHPCNDPISA-N 0.000 description 4
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 4
- YESNGRDJQWDYLH-KKUMJFAQSA-N Leu-Phe-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YESNGRDJQWDYLH-KKUMJFAQSA-N 0.000 description 4
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 4
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 4
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 4
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 4
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 4
- OEYKVQKYCHATHO-SZMVWBNQSA-N Lys-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N OEYKVQKYCHATHO-SZMVWBNQSA-N 0.000 description 4
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 4
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 4
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 4
- RZJOHSFAEZBWLK-CIUDSAMLSA-N Met-Gln-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N RZJOHSFAEZBWLK-CIUDSAMLSA-N 0.000 description 4
- MVMNUCOHQGYYKB-PEDHHIEDSA-N Met-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCSC)N MVMNUCOHQGYYKB-PEDHHIEDSA-N 0.000 description 4
- JCMMNFZUKMMECJ-DCAQKATOSA-N Met-Lys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JCMMNFZUKMMECJ-DCAQKATOSA-N 0.000 description 4
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 4
- MUDYEFAKNSTFAI-JYJNAYRXSA-N Met-Tyr-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O MUDYEFAKNSTFAI-JYJNAYRXSA-N 0.000 description 4
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 4
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 4
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 4
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 4
- BSTPNLNKHKBONJ-HTUGSXCWSA-N Phe-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O BSTPNLNKHKBONJ-HTUGSXCWSA-N 0.000 description 4
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 4
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 4
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 4
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 4
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 4
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 4
- PUQRDHNIOONJJN-AVGNSLFASA-N Pro-Lys-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PUQRDHNIOONJJN-AVGNSLFASA-N 0.000 description 4
- GRSLLFZTTLBOQX-CIUDSAMLSA-N Ser-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N GRSLLFZTTLBOQX-CIUDSAMLSA-N 0.000 description 4
- ZFVFHHZBCVNLGD-GUBZILKMSA-N Ser-His-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFVFHHZBCVNLGD-GUBZILKMSA-N 0.000 description 4
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 4
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 4
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 4
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 4
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 4
- ZSLFCBHEINFXRS-LPEHRKFASA-N Ser-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ZSLFCBHEINFXRS-LPEHRKFASA-N 0.000 description 4
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 4
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 4
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 4
- PKXHGEXFMIZSER-QTKMDUPCSA-N Thr-Arg-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PKXHGEXFMIZSER-QTKMDUPCSA-N 0.000 description 4
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 4
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 4
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 4
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 4
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 4
- HTGJDTPQYFMKNC-VFAJRCTISA-N Trp-Thr-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 HTGJDTPQYFMKNC-VFAJRCTISA-N 0.000 description 4
- STKZKWFOKOCSLW-UMPQAUOISA-N Trp-Thr-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 STKZKWFOKOCSLW-UMPQAUOISA-N 0.000 description 4
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 4
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 4
- HVPPEXXUDXAPOM-MGHWNKPDSA-N Tyr-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HVPPEXXUDXAPOM-MGHWNKPDSA-N 0.000 description 4
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 4
- VBFVQTPETKJCQW-RPTUDFQQSA-N Tyr-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VBFVQTPETKJCQW-RPTUDFQQSA-N 0.000 description 4
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 4
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 4
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 4
- MHAHQDBEIDPFQS-NHCYSSNCSA-N Val-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C MHAHQDBEIDPFQS-NHCYSSNCSA-N 0.000 description 4
- HQYVQDRYODWONX-DCAQKATOSA-N Val-His-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N HQYVQDRYODWONX-DCAQKATOSA-N 0.000 description 4
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 4
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 4
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 4
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 4
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 4
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 4
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 4
- 108010044940 alanylglutamine Proteins 0.000 description 4
- 108010068380 arginylarginine Proteins 0.000 description 4
- 108010068265 aspartyltyrosine Proteins 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 4
- 208000034158 bleeding Diseases 0.000 description 4
- 230000000740 bleeding effect Effects 0.000 description 4
- 210000004369 blood Anatomy 0.000 description 4
- 239000008280 blood Substances 0.000 description 4
- 238000004113 cell culture Methods 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 4
- 230000000295 complement effect Effects 0.000 description 4
- 239000003623 enhancer Substances 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 108010079547 glutamylmethionine Proteins 0.000 description 4
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 4
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 4
- 108010077515 glycylproline Proteins 0.000 description 4
- 108010087823 glycyltyrosine Proteins 0.000 description 4
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 4
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 4
- 108010072591 lysyl-leucyl-alanyl-arginine Proteins 0.000 description 4
- 108010075702 lysyl-valyl-aspartyl-leucine Proteins 0.000 description 4
- 108010009298 lysylglutamic acid Proteins 0.000 description 4
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 4
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 4
- 108010073101 phenylalanylleucine Proteins 0.000 description 4
- 108010079317 prolyl-tyrosine Proteins 0.000 description 4
- 238000003259 recombinant expression Methods 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 108010026333 seryl-proline Proteins 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 238000013519 translation Methods 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- 241000701161 unidentified adenovirus Species 0.000 description 4
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 4
- SADYNMDJGAWAEW-JKQORVJESA-N (2s)-2-[[(2s)-3-carboxy-2-[[(2s)-2-[[(2s)-2,6-diaminohexanoyl]amino]-3-methylbutanoyl]amino]propanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN SADYNMDJGAWAEW-JKQORVJESA-N 0.000 description 3
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 3
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 3
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 3
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 3
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 3
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 3
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 3
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 3
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 3
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 3
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 3
- JNJHNBXBGNJESC-KKXDTOCCSA-N Ala-Tyr-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JNJHNBXBGNJESC-KKXDTOCCSA-N 0.000 description 3
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 3
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 3
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 3
- OLDOLPWZEMHNIA-PJODQICGSA-N Arg-Ala-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OLDOLPWZEMHNIA-PJODQICGSA-N 0.000 description 3
- NUBPTCMEOCKWDO-DCAQKATOSA-N Arg-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N NUBPTCMEOCKWDO-DCAQKATOSA-N 0.000 description 3
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 3
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 3
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 3
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 3
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 3
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 3
- FBXMCPLCVYUWBO-BPUTZDHNSA-N Arg-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N FBXMCPLCVYUWBO-BPUTZDHNSA-N 0.000 description 3
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 3
- NTXNUXPCNRDMAF-WFBYXXMGSA-N Asn-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC(N)=O)C)C(O)=O)=CNC2=C1 NTXNUXPCNRDMAF-WFBYXXMGSA-N 0.000 description 3
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 3
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 3
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 3
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 3
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 3
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 3
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 3
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 3
- LVHMEJJWEXBMKK-GMOBBJLQSA-N Asn-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N LVHMEJJWEXBMKK-GMOBBJLQSA-N 0.000 description 3
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 3
- KIJLEFNHWSXHRU-NUMRIWBASA-N Asp-Gln-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KIJLEFNHWSXHRU-NUMRIWBASA-N 0.000 description 3
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 3
- RQYMKRMRZWJGHC-BQBZGAKWSA-N Asp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N RQYMKRMRZWJGHC-BQBZGAKWSA-N 0.000 description 3
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 3
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 3
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 3
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 3
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 3
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 3
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 3
- FIAKNCXQFFKSSI-ZLUOBGJFSA-N Asp-Ser-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O FIAKNCXQFFKSSI-ZLUOBGJFSA-N 0.000 description 3
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 3
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 3
- VNXXMHTZQGGDSG-CIUDSAMLSA-N Cys-His-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O VNXXMHTZQGGDSG-CIUDSAMLSA-N 0.000 description 3
- OZHXXYOHPLLLMI-CIUDSAMLSA-N Cys-Lys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OZHXXYOHPLLLMI-CIUDSAMLSA-N 0.000 description 3
- 241000701022 Cytomegalovirus Species 0.000 description 3
- 241000196324 Embryophyta Species 0.000 description 3
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 3
- BTSPOOHJBYJRKO-CIUDSAMLSA-N Gln-Asp-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BTSPOOHJBYJRKO-CIUDSAMLSA-N 0.000 description 3
- LFIVHGMKWFGUGK-IHRRRGAJSA-N Gln-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LFIVHGMKWFGUGK-IHRRRGAJSA-N 0.000 description 3
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 3
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 3
- GFLNKSQHOBOMNM-AVGNSLFASA-N Gln-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GFLNKSQHOBOMNM-AVGNSLFASA-N 0.000 description 3
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 3
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 3
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 3
- QMVCEWKHIUHTSD-GUBZILKMSA-N Gln-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QMVCEWKHIUHTSD-GUBZILKMSA-N 0.000 description 3
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 3
- UTOQQOMEJDPDMX-ACZMJKKPSA-N Gln-Ser-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O UTOQQOMEJDPDMX-ACZMJKKPSA-N 0.000 description 3
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 3
- LJLPOZGRPLORTF-CIUDSAMLSA-N Glu-Asn-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LJLPOZGRPLORTF-CIUDSAMLSA-N 0.000 description 3
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 3
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 3
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 3
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 3
- PBFGQTGPSKWHJA-QEJZJMRPSA-N Glu-Asp-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PBFGQTGPSKWHJA-QEJZJMRPSA-N 0.000 description 3
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 3
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 3
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 3
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 3
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 3
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 3
- LPHGXOWFAXFCPX-KKUMJFAQSA-N Glu-Pro-Phe Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O LPHGXOWFAXFCPX-KKUMJFAQSA-N 0.000 description 3
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 3
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 3
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 3
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 3
- LJXWZPHEMJSNRC-KBPBESRZSA-N Gly-Gln-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LJXWZPHEMJSNRC-KBPBESRZSA-N 0.000 description 3
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 3
- ZKLYPEGLWFVRGF-IUCAKERBSA-N Gly-His-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZKLYPEGLWFVRGF-IUCAKERBSA-N 0.000 description 3
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 3
- JPAACTMBBBGAAR-HOTGVXAUSA-N Gly-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)CC(C)C)C(O)=O)=CNC2=C1 JPAACTMBBBGAAR-HOTGVXAUSA-N 0.000 description 3
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 3
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 3
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 3
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 3
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 3
- DPQIPEAHIYMUEJ-IHRRRGAJSA-N His-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N DPQIPEAHIYMUEJ-IHRRRGAJSA-N 0.000 description 3
- HYWZHNUGAYVEEW-KKUMJFAQSA-N His-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HYWZHNUGAYVEEW-KKUMJFAQSA-N 0.000 description 3
- PGXZHYYGOPKYKM-IHRRRGAJSA-N His-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CCCCN)C(=O)O PGXZHYYGOPKYKM-IHRRRGAJSA-N 0.000 description 3
- ZGGWRNBSBOHIGH-HVTMNAMFSA-N Ile-Gln-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZGGWRNBSBOHIGH-HVTMNAMFSA-N 0.000 description 3
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 3
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 3
- IALVDKNUFSTICJ-GMOBBJLQSA-N Ile-Met-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IALVDKNUFSTICJ-GMOBBJLQSA-N 0.000 description 3
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 3
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 3
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 3
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 3
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 3
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 3
- GPXFZVUVPCFTMG-AVGNSLFASA-N Leu-Arg-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C GPXFZVUVPCFTMG-AVGNSLFASA-N 0.000 description 3
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 3
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 3
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 3
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 3
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 3
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 3
- HMDDEJADNKQTBR-BZSNNMDCSA-N Leu-His-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMDDEJADNKQTBR-BZSNNMDCSA-N 0.000 description 3
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 3
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 3
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 3
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 3
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 3
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 3
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 3
- ALSRJRIWBNENFY-DCAQKATOSA-N Lys-Arg-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O ALSRJRIWBNENFY-DCAQKATOSA-N 0.000 description 3
- BRSGXFITDXFMFF-IHRRRGAJSA-N Lys-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N BRSGXFITDXFMFF-IHRRRGAJSA-N 0.000 description 3
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 3
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 3
- SQJSXOQXJYAVRV-SRVKXCTJSA-N Lys-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N SQJSXOQXJYAVRV-SRVKXCTJSA-N 0.000 description 3
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 3
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 3
- WAAZECNCPVGPIV-RHYQMDGZSA-N Lys-Thr-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O WAAZECNCPVGPIV-RHYQMDGZSA-N 0.000 description 3
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 3
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 3
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 3
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 3
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 3
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 3
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 3
- GMMLGMFBYCFCCX-KZVJFYERSA-N Met-Thr-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMMLGMFBYCFCCX-KZVJFYERSA-N 0.000 description 3
- PNHRPOWKRRJATF-IHRRRGAJSA-N Met-Tyr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 PNHRPOWKRRJATF-IHRRRGAJSA-N 0.000 description 3
- JACMWNXOOUYXCD-JYJNAYRXSA-N Met-Val-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JACMWNXOOUYXCD-JYJNAYRXSA-N 0.000 description 3
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 3
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 3
- 108010066427 N-valyltryptophan Proteins 0.000 description 3
- 108091034117 Oligonucleotide Proteins 0.000 description 3
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 3
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 3
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 3
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 3
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 3
- GXDPQJUBLBZKDY-IAVJCBSLSA-N Phe-Ile-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GXDPQJUBLBZKDY-IAVJCBSLSA-N 0.000 description 3
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 3
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 3
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 3
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 3
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 description 3
- ROOQMPCUFLDOSB-FHWLQOOXSA-N Phe-Phe-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ROOQMPCUFLDOSB-FHWLQOOXSA-N 0.000 description 3
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 3
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 3
- YRHRGNUAXGUPTO-PMVMPFDFSA-N Phe-Trp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCCCN)C(=O)O)N YRHRGNUAXGUPTO-PMVMPFDFSA-N 0.000 description 3
- OLHDPZMYUSBGDE-GUBZILKMSA-N Pro-Arg-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O OLHDPZMYUSBGDE-GUBZILKMSA-N 0.000 description 3
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 3
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 3
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 3
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 3
- NFLNBHLMLYALOO-DCAQKATOSA-N Pro-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 NFLNBHLMLYALOO-DCAQKATOSA-N 0.000 description 3
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 3
- VGVCNKSUVSZEIE-IHRRRGAJSA-N Pro-Phe-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O VGVCNKSUVSZEIE-IHRRRGAJSA-N 0.000 description 3
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 3
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 3
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 3
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 3
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 3
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 3
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 3
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 3
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 3
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 3
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 3
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 3
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 3
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 3
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 3
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 3
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 3
- QPPYAWVLAVXISR-DCAQKATOSA-N Ser-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QPPYAWVLAVXISR-DCAQKATOSA-N 0.000 description 3
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 3
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 3
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 3
- ATEQEHCGZKBEMU-GQGQLFGLSA-N Ser-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N ATEQEHCGZKBEMU-GQGQLFGLSA-N 0.000 description 3
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 3
- SGZVZUCRAVSPKQ-FXQIFTODSA-N Ser-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N SGZVZUCRAVSPKQ-FXQIFTODSA-N 0.000 description 3
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 3
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 3
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 3
- QNJZOAHSYPXTAB-VEVYYDQMSA-N Thr-Asn-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O QNJZOAHSYPXTAB-VEVYYDQMSA-N 0.000 description 3
- NOWXWJLVGTVJKM-PBCZWWQYSA-N Thr-Asp-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O NOWXWJLVGTVJKM-PBCZWWQYSA-N 0.000 description 3
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 3
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 3
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 3
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 3
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 3
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 3
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 3
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 3
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 3
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 3
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 3
- 108700019146 Transgenes Proteins 0.000 description 3
- ICNFHVUVCNWUAB-SZMVWBNQSA-N Trp-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ICNFHVUVCNWUAB-SZMVWBNQSA-N 0.000 description 3
- JLTQXEOXIJMCLZ-ZVZYQTTQSA-N Trp-Gln-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 JLTQXEOXIJMCLZ-ZVZYQTTQSA-N 0.000 description 3
- YVXIAOOYAKBAAI-SZMVWBNQSA-N Trp-Leu-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 YVXIAOOYAKBAAI-SZMVWBNQSA-N 0.000 description 3
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 3
- RERRMBXDSFMBQE-ZFWWWQNUSA-N Trp-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RERRMBXDSFMBQE-ZFWWWQNUSA-N 0.000 description 3
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 3
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 3
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 3
- PJWCWGXAVIVXQC-STECZYCISA-N Tyr-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PJWCWGXAVIVXQC-STECZYCISA-N 0.000 description 3
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 3
- FASACHWGQBNSRO-ZEWNOJEFSA-N Tyr-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FASACHWGQBNSRO-ZEWNOJEFSA-N 0.000 description 3
- ULUXAIYMVXLDQP-PMVMPFDFSA-N Tyr-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)[C@H](CC4=CC=C(C=C4)O)N ULUXAIYMVXLDQP-PMVMPFDFSA-N 0.000 description 3
- JQOMHZMWQHXALX-FHWLQOOXSA-N Tyr-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JQOMHZMWQHXALX-FHWLQOOXSA-N 0.000 description 3
- AGDDLOQMXUQPDY-BZSNNMDCSA-N Tyr-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O AGDDLOQMXUQPDY-BZSNNMDCSA-N 0.000 description 3
- IVXJODPZRWHCCR-JYJNAYRXSA-N Val-Arg-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IVXJODPZRWHCCR-JYJNAYRXSA-N 0.000 description 3
- NWDOPHYLSORNEX-QXEWZRGKSA-N Val-Asn-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N NWDOPHYLSORNEX-QXEWZRGKSA-N 0.000 description 3
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 3
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 3
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 3
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 3
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 3
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 3
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 3
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 3
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 3
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 3
- 239000013543 active substance Substances 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- 230000004071 biological effect Effects 0.000 description 3
- 229940019700 blood coagulation factors Drugs 0.000 description 3
- 238000011088 calibration curve Methods 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 108010060199 cysteinylproline Proteins 0.000 description 3
- 230000007812 deficiency Effects 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 210000002889 endothelial cell Anatomy 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- 108010015792 glycyllysine Proteins 0.000 description 3
- 208000002085 hemarthrosis Diseases 0.000 description 3
- 206010073071 hepatocellular carcinoma Diseases 0.000 description 3
- 231100000844 hepatocellular carcinoma Toxicity 0.000 description 3
- 108010053037 kyotorphin Proteins 0.000 description 3
- 108010034529 leucyl-lysine Proteins 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 108010068488 methionylphenylalanine Proteins 0.000 description 3
- 239000002245 particle Substances 0.000 description 3
- 108010015796 prolylisoleucine Proteins 0.000 description 3
- 230000004952 protein activity Effects 0.000 description 3
- 238000009256 replacement therapy Methods 0.000 description 3
- 230000028327 secretion Effects 0.000 description 3
- 108010048818 seryl-histidine Proteins 0.000 description 3
- 230000011664 signaling Effects 0.000 description 3
- 238000003860 storage Methods 0.000 description 3
- 108010084932 tryptophyl-proline Proteins 0.000 description 3
- 108010009962 valyltyrosine Proteins 0.000 description 3
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- IXTPACPAXIOCRG-ACZMJKKPSA-N Ala-Glu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N IXTPACPAXIOCRG-ACZMJKKPSA-N 0.000 description 2
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 2
- 108700028369 Alleles Proteins 0.000 description 2
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 2
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 2
- VIINVRPKMUZYOI-DCAQKATOSA-N Arg-Met-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIINVRPKMUZYOI-DCAQKATOSA-N 0.000 description 2
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 description 2
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 2
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 2
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 2
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 2
- QIRJQYQOIKBPBZ-IHRRRGAJSA-N Asn-Tyr-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QIRJQYQOIKBPBZ-IHRRRGAJSA-N 0.000 description 2
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 2
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 2
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 2
- RSMIHCFQDCVVBR-CIUDSAMLSA-N Asp-Gln-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RSMIHCFQDCVVBR-CIUDSAMLSA-N 0.000 description 2
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 2
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 2
- YQKYLDVPCOGIRB-SEKJGCFDSA-N Asp-Leu-Thr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YQKYLDVPCOGIRB-SEKJGCFDSA-N 0.000 description 2
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 2
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 2
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- SZQCDCKIGWQAQN-FXQIFTODSA-N Cys-Arg-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O SZQCDCKIGWQAQN-FXQIFTODSA-N 0.000 description 2
- XABFFGOGKOORCG-CIUDSAMLSA-N Cys-Asp-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XABFFGOGKOORCG-CIUDSAMLSA-N 0.000 description 2
- XZKJEOMFLDVXJG-KATARQTJSA-N Cys-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)N)O XZKJEOMFLDVXJG-KATARQTJSA-N 0.000 description 2
- CHRCKSPMGYDLIA-SRVKXCTJSA-N Cys-Phe-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O CHRCKSPMGYDLIA-SRVKXCTJSA-N 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 108010048049 Factor IXa Proteins 0.000 description 2
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 2
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 2
- ILKYYKRAULNYMS-JYJNAYRXSA-N Gln-Lys-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ILKYYKRAULNYMS-JYJNAYRXSA-N 0.000 description 2
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 2
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 2
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 2
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 2
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 2
- HMJULNMJWOZNFI-XHNCKOQMSA-N Glu-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N)C(=O)O HMJULNMJWOZNFI-XHNCKOQMSA-N 0.000 description 2
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 2
- XXGQRGQPGFYECI-WDSKDSINSA-N Gly-Cys-Glu Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(O)=O XXGQRGQPGFYECI-WDSKDSINSA-N 0.000 description 2
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 2
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 2
- OMOZPGCHVWOXHN-BQBZGAKWSA-N Gly-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)CN OMOZPGCHVWOXHN-BQBZGAKWSA-N 0.000 description 2
- UWQDKRIZSROAKS-FJXKBIBVSA-N Gly-Met-Thr Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWQDKRIZSROAKS-FJXKBIBVSA-N 0.000 description 2
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 2
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 2
- 206010018852 Haematoma Diseases 0.000 description 2
- LSQHWKPPOFDHHZ-YUMQZZPRSA-N His-Asp-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LSQHWKPPOFDHHZ-YUMQZZPRSA-N 0.000 description 2
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 2
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 2
- JATYGDHMDRAISQ-KKUMJFAQSA-N His-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O JATYGDHMDRAISQ-KKUMJFAQSA-N 0.000 description 2
- KDDKJKKQODQQBR-NHCYSSNCSA-N His-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N KDDKJKKQODQQBR-NHCYSSNCSA-N 0.000 description 2
- 108700039609 IRW peptide Proteins 0.000 description 2
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 2
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 2
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 2
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 2
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 2
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 2
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 2
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 2
- SNHYFFQZRFIRHO-CYDGBPFRSA-N Ile-Met-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N SNHYFFQZRFIRHO-CYDGBPFRSA-N 0.000 description 2
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 2
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 2
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 2
- BLFXHAFTNYZEQE-VKOGCVSHSA-N Ile-Trp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BLFXHAFTNYZEQE-VKOGCVSHSA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 2
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 2
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 2
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 2
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 2
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 2
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 2
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 2
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 2
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 2
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 2
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 2
- BVXXDMUMHMXFER-BPNCWPANSA-N Met-Ala-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVXXDMUMHMXFER-BPNCWPANSA-N 0.000 description 2
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 2
- OXHSZBRPUGNMKW-DCAQKATOSA-N Met-Gln-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OXHSZBRPUGNMKW-DCAQKATOSA-N 0.000 description 2
- AWOMRHGUWFBDNU-ZPFDUUQYSA-N Met-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N AWOMRHGUWFBDNU-ZPFDUUQYSA-N 0.000 description 2
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 2
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 2
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 2
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 2
- TXKWKTWYTIAZSV-KKUMJFAQSA-N Phe-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N TXKWKTWYTIAZSV-KKUMJFAQSA-N 0.000 description 2
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 2
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 2
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 2
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 2
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 2
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 2
- AJJDPGVVNPUZCR-RHYQMDGZSA-N Pro-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1)O AJJDPGVVNPUZCR-RHYQMDGZSA-N 0.000 description 2
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 2
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 2
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 2
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 2
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 2
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 2
- UQGAAZXSCGWMFU-UBHSHLNASA-N Ser-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N UQGAAZXSCGWMFU-UBHSHLNASA-N 0.000 description 2
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 2
- 101710172711 Structural protein Proteins 0.000 description 2
- LOHBIDZYHQQTDM-IXOXFDKPSA-N Thr-Cys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LOHBIDZYHQQTDM-IXOXFDKPSA-N 0.000 description 2
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 2
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 2
- JNKAYADBODLPMQ-HSHDSVGOSA-N Thr-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)=CNC2=C1 JNKAYADBODLPMQ-HSHDSVGOSA-N 0.000 description 2
- NIWAGRRZHCMPOY-GMVOTWDCSA-N Trp-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N NIWAGRRZHCMPOY-GMVOTWDCSA-N 0.000 description 2
- VIWQOOBRKCGSDK-RYQLBKOJSA-N Trp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O VIWQOOBRKCGSDK-RYQLBKOJSA-N 0.000 description 2
- GEGYPBOPIGNZIF-CWRNSKLLSA-N Trp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O GEGYPBOPIGNZIF-CWRNSKLLSA-N 0.000 description 2
- DVLHKUWLNKDINO-PMVMPFDFSA-N Trp-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DVLHKUWLNKDINO-PMVMPFDFSA-N 0.000 description 2
- AKXBNSZMYAOGLS-STQMWFEESA-N Tyr-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AKXBNSZMYAOGLS-STQMWFEESA-N 0.000 description 2
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 2
- JWGXUKHIKXZWNG-RYUDHWBXSA-N Tyr-Gly-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JWGXUKHIKXZWNG-RYUDHWBXSA-N 0.000 description 2
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 2
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 2
- IWZYXFRGWKEKBJ-GVXVVHGQSA-N Val-Gln-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IWZYXFRGWKEKBJ-GVXVVHGQSA-N 0.000 description 2
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 2
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 2
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 2
- LMVWCLDJNSBOEA-FKBYEOEOSA-N Val-Tyr-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N LMVWCLDJNSBOEA-FKBYEOEOSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 208000027418 Wounds and injury Diseases 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 2
- 108010070783 alanyltyrosine Proteins 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 230000006378 damage Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 108010009297 diglycyl-histidine Proteins 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 239000013024 dilution buffer Substances 0.000 description 2
- 238000001647 drug administration Methods 0.000 description 2
- 238000006911 enzymatic reaction Methods 0.000 description 2
- 210000002919 epithelial cell Anatomy 0.000 description 2
- 238000007429 general method Methods 0.000 description 2
- 210000004602 germ cell Anatomy 0.000 description 2
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 2
- 108010049041 glutamylalanine Proteins 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 238000011194 good manufacturing practice Methods 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 230000000415 inactivating effect Effects 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 208000014674 injury Diseases 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 238000010253 intravenous injection Methods 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 235000015097 nutrients Nutrition 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 150000003904 phospholipids Chemical class 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 239000000047 product Substances 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 238000007480 sanger sequencing Methods 0.000 description 2
- 210000004872 soft tissue Anatomy 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 230000002269 spontaneous effect Effects 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- 238000001356 surgical procedure Methods 0.000 description 2
- 208000024891 symptom Diseases 0.000 description 2
- 238000002560 therapeutic procedure Methods 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 230000005030 transcription termination Effects 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 210000003462 vein Anatomy 0.000 description 2
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 1
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- BHTBAVZSZCQZPT-GUBZILKMSA-N Ala-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N BHTBAVZSZCQZPT-GUBZILKMSA-N 0.000 description 1
- LTTLSZVJTDSACD-OWLDWWDNSA-N Ala-Thr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LTTLSZVJTDSACD-OWLDWWDNSA-N 0.000 description 1
- RIPMDCIXRYWXSH-KNXALSJPSA-N Ala-Trp-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N RIPMDCIXRYWXSH-KNXALSJPSA-N 0.000 description 1
- TVUFMYKTYXTRPY-HERUPUMHSA-N Ala-Trp-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O TVUFMYKTYXTRPY-HERUPUMHSA-N 0.000 description 1
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 1
- 241000219194 Arabidopsis Species 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- YUIGJDNAGKJLDO-JYJNAYRXSA-N Arg-Arg-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YUIGJDNAGKJLDO-JYJNAYRXSA-N 0.000 description 1
- RWWPBOUMKFBHAL-FXQIFTODSA-N Arg-Asn-Cys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O RWWPBOUMKFBHAL-FXQIFTODSA-N 0.000 description 1
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 1
- TTXYKSADPSNOIF-IHRRRGAJSA-N Arg-Asp-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O TTXYKSADPSNOIF-IHRRRGAJSA-N 0.000 description 1
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 1
- QBQVKUNBCAFXSV-ULQDDVLXSA-N Arg-Lys-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QBQVKUNBCAFXSV-ULQDDVLXSA-N 0.000 description 1
- NYDIVDKTULRINZ-AVGNSLFASA-N Arg-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NYDIVDKTULRINZ-AVGNSLFASA-N 0.000 description 1
- RATVAFHGEFAWDH-JYJNAYRXSA-N Arg-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCN=C(N)N)N RATVAFHGEFAWDH-JYJNAYRXSA-N 0.000 description 1
- AMIQZQAAYGYKOP-FXQIFTODSA-N Arg-Ser-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O AMIQZQAAYGYKOP-FXQIFTODSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 1
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 1
- VJIQPOJMISSUPO-BVSLBCMMSA-N Arg-Trp-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VJIQPOJMISSUPO-BVSLBCMMSA-N 0.000 description 1
- QJWLLRZTJFPCHA-STECZYCISA-N Arg-Tyr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QJWLLRZTJFPCHA-STECZYCISA-N 0.000 description 1
- 208000036487 Arthropathies Diseases 0.000 description 1
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 1
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 1
- HJRBIWRXULGMOA-ACZMJKKPSA-N Asn-Gln-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJRBIWRXULGMOA-ACZMJKKPSA-N 0.000 description 1
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 1
- OGMDXNFGPOPZTK-GUBZILKMSA-N Asn-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N OGMDXNFGPOPZTK-GUBZILKMSA-N 0.000 description 1
- ZKDGORKGHPCZOV-DCAQKATOSA-N Asn-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZKDGORKGHPCZOV-DCAQKATOSA-N 0.000 description 1
- RLHANKIRBONJBK-IHRRRGAJSA-N Asn-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N RLHANKIRBONJBK-IHRRRGAJSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 1
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 1
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 1
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 1
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 1
- UWMIZBCTVWVMFI-FXQIFTODSA-N Asp-Ala-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UWMIZBCTVWVMFI-FXQIFTODSA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 1
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 1
- ACEDJCOOPZFUBU-CIUDSAMLSA-N Asp-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N ACEDJCOOPZFUBU-CIUDSAMLSA-N 0.000 description 1
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 1
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 1
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- DKQCWCQRAMAFLN-UBHSHLNASA-N Asp-Trp-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O DKQCWCQRAMAFLN-UBHSHLNASA-N 0.000 description 1
- XQFLFQWOBXPMHW-NHCYSSNCSA-N Asp-Val-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XQFLFQWOBXPMHW-NHCYSSNCSA-N 0.000 description 1
- 108010089996 B-domain-deleted factor VIII Proteins 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 101100180402 Caenorhabditis elegans jun-1 gene Proteins 0.000 description 1
- BHPQYMZQTOCNFJ-UHFFFAOYSA-N Calcium cation Chemical compound [Ca+2] BHPQYMZQTOCNFJ-UHFFFAOYSA-N 0.000 description 1
- 241000701489 Cauliflower mosaic virus Species 0.000 description 1
- 206010008111 Cerebral haemorrhage Diseases 0.000 description 1
- 241000862448 Chlorocebus Species 0.000 description 1
- 208000017667 Chronic Disease Diseases 0.000 description 1
- 102100029117 Coagulation factor X Human genes 0.000 description 1
- 208000029767 Congenital, Hereditary, and Neonatal Diseases and Abnormalities Diseases 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- YUZPQIQWXLRFBW-ACZMJKKPSA-N Cys-Glu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O YUZPQIQWXLRFBW-ACZMJKKPSA-N 0.000 description 1
- ANPADMNVVOOYKW-DCAQKATOSA-N Cys-His-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ANPADMNVVOOYKW-DCAQKATOSA-N 0.000 description 1
- DIHCYBRLTVEPBW-SRVKXCTJSA-N Cys-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N DIHCYBRLTVEPBW-SRVKXCTJSA-N 0.000 description 1
- MBRWOKXNHTUJMB-CIUDSAMLSA-N Cys-Pro-Glu Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O MBRWOKXNHTUJMB-CIUDSAMLSA-N 0.000 description 1
- VCPHQVQGVSKDHY-FXQIFTODSA-N Cys-Ser-Met Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O VCPHQVQGVSKDHY-FXQIFTODSA-N 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 108010000437 Deamino Arginine Vasopressin Proteins 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 102000009123 Fibrin Human genes 0.000 description 1
- 108010073385 Fibrin Proteins 0.000 description 1
- BWGVNKXGVNDBDI-UHFFFAOYSA-N Fibrin monomer Chemical compound CNC(=O)CNC(=O)CN BWGVNKXGVNDBDI-UHFFFAOYSA-N 0.000 description 1
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 1
- ZFADFBPRMSBPOT-KKUMJFAQSA-N Gln-Arg-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZFADFBPRMSBPOT-KKUMJFAQSA-N 0.000 description 1
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 1
- CKNUKHBRCSMKMO-XHNCKOQMSA-N Gln-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O CKNUKHBRCSMKMO-XHNCKOQMSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 1
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 1
- GLEGHWQNGPMKHO-DCAQKATOSA-N Gln-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GLEGHWQNGPMKHO-DCAQKATOSA-N 0.000 description 1
- FYAULIGIFPPOAA-ZPFDUUQYSA-N Gln-Ile-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O FYAULIGIFPPOAA-ZPFDUUQYSA-N 0.000 description 1
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 1
- WHVLABLIJYGVEK-QEWYBTABSA-N Gln-Phe-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WHVLABLIJYGVEK-QEWYBTABSA-N 0.000 description 1
- JUUNNOLZGVYCJT-JYJNAYRXSA-N Gln-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JUUNNOLZGVYCJT-JYJNAYRXSA-N 0.000 description 1
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 1
- SYZZMPFLOLSMHL-XHNCKOQMSA-N Gln-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SYZZMPFLOLSMHL-XHNCKOQMSA-N 0.000 description 1
- QENSHQJGWGRPQS-QEJZJMRPSA-N Gln-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)N)C(O)=O)=CNC2=C1 QENSHQJGWGRPQS-QEJZJMRPSA-N 0.000 description 1
- VOUSELYGTNGEPB-NUMRIWBASA-N Gln-Thr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O VOUSELYGTNGEPB-NUMRIWBASA-N 0.000 description 1
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 1
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 1
- CTJRFALAOYAJBX-NWLDYVSISA-N Gln-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N)O CTJRFALAOYAJBX-NWLDYVSISA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- MLCPTRRNICEKIS-FXQIFTODSA-N Glu-Asn-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLCPTRRNICEKIS-FXQIFTODSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 1
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 1
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 1
- CAVMESABQIKFKT-IUCAKERBSA-N Glu-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N CAVMESABQIKFKT-IUCAKERBSA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- ZWMYUDZLXAQHCK-CIUDSAMLSA-N Glu-Met-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O ZWMYUDZLXAQHCK-CIUDSAMLSA-N 0.000 description 1
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 1
- SOEPMWQCTJITPZ-SRVKXCTJSA-N Glu-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N SOEPMWQCTJITPZ-SRVKXCTJSA-N 0.000 description 1
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- TWYFJOHWGCCRIR-DCAQKATOSA-N Glu-Pro-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYFJOHWGCCRIR-DCAQKATOSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- HGJREIGJLUQBTJ-SZMVWBNQSA-N Glu-Trp-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O HGJREIGJLUQBTJ-SZMVWBNQSA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- ZALGPUWUVHOGAE-GVXVVHGQSA-N Glu-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZALGPUWUVHOGAE-GVXVVHGQSA-N 0.000 description 1
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 1
- WJZLEENECIOOSA-WDSKDSINSA-N Gly-Asn-Gln Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)O WJZLEENECIOOSA-WDSKDSINSA-N 0.000 description 1
- JUGQPPOVWXSPKJ-RYUDHWBXSA-N Gly-Gln-Phe Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JUGQPPOVWXSPKJ-RYUDHWBXSA-N 0.000 description 1
- JLJLBWDKDRYOPA-RYUDHWBXSA-N Gly-Gln-Tyr Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JLJLBWDKDRYOPA-RYUDHWBXSA-N 0.000 description 1
- JUBDONGMHASUCN-IUCAKERBSA-N Gly-Glu-His Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O JUBDONGMHASUCN-IUCAKERBSA-N 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- HHSOPSCKAZKQHQ-PEXQALLHSA-N Gly-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN HHSOPSCKAZKQHQ-PEXQALLHSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 1
- SJLKKOZFHSJJAW-YUMQZZPRSA-N Gly-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN SJLKKOZFHSJJAW-YUMQZZPRSA-N 0.000 description 1
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 1
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 1
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 208000028782 Hereditary disease Diseases 0.000 description 1
- 229920000209 Hexadimethrine bromide Polymers 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 1
- HXKZJLWGSWQKEA-LSJOCFKGSA-N His-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 HXKZJLWGSWQKEA-LSJOCFKGSA-N 0.000 description 1
- HDXNWVLQSQFJOX-SRVKXCTJSA-N His-Arg-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HDXNWVLQSQFJOX-SRVKXCTJSA-N 0.000 description 1
- MWAJSVTZZOUOBU-IHRRRGAJSA-N His-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 MWAJSVTZZOUOBU-IHRRRGAJSA-N 0.000 description 1
- UZZXGLOJRZKYEL-DJFWLOJKSA-N His-Asn-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UZZXGLOJRZKYEL-DJFWLOJKSA-N 0.000 description 1
- HRGGKHFHRSFSDE-CIUDSAMLSA-N His-Asn-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N HRGGKHFHRSFSDE-CIUDSAMLSA-N 0.000 description 1
- LIEIYPBMQJLASB-SRVKXCTJSA-N His-Gln-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LIEIYPBMQJLASB-SRVKXCTJSA-N 0.000 description 1
- LCNNHVQNFNJLGK-AVGNSLFASA-N His-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N LCNNHVQNFNJLGK-AVGNSLFASA-N 0.000 description 1
- NNBWMLHQXBTIIT-HVTMNAMFSA-N His-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N NNBWMLHQXBTIIT-HVTMNAMFSA-N 0.000 description 1
- STWGDDDFLUFCCA-GVXVVHGQSA-N His-Glu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O STWGDDDFLUFCCA-GVXVVHGQSA-N 0.000 description 1
- PYNUBZSXKQKAHL-UWVGGRQHSA-N His-Gly-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O PYNUBZSXKQKAHL-UWVGGRQHSA-N 0.000 description 1
- UROVZOUMHNXPLZ-AVGNSLFASA-N His-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 UROVZOUMHNXPLZ-AVGNSLFASA-N 0.000 description 1
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 1
- YVCGJPIKRMGNPA-LSJOCFKGSA-N His-Met-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O YVCGJPIKRMGNPA-LSJOCFKGSA-N 0.000 description 1
- DEMIXZCKUXVEBO-BWAGICSOSA-N His-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O DEMIXZCKUXVEBO-BWAGICSOSA-N 0.000 description 1
- UWNUQPZUSRFIIN-JUKXBJQTSA-N His-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N UWNUQPZUSRFIIN-JUKXBJQTSA-N 0.000 description 1
- MRVZCDSYLJXKKX-ACRUOGEOSA-N His-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CN=CN3)N MRVZCDSYLJXKKX-ACRUOGEOSA-N 0.000 description 1
- QLBXWYXMLHAREM-PYJNHQTQSA-N His-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N QLBXWYXMLHAREM-PYJNHQTQSA-N 0.000 description 1
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 1
- GGXUJBKENKVYNV-ULQDDVLXSA-N His-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N GGXUJBKENKVYNV-ULQDDVLXSA-N 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- VCYVLFAWCJRXFT-HJPIBITLSA-N Ile-Cys-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N VCYVLFAWCJRXFT-HJPIBITLSA-N 0.000 description 1
- WNQKUUQIVDDAFA-ZPFDUUQYSA-N Ile-Gln-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N WNQKUUQIVDDAFA-ZPFDUUQYSA-N 0.000 description 1
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 1
- OEQKGSPBDVKYOC-ZKWXMUAHSA-N Ile-Gly-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OEQKGSPBDVKYOC-ZKWXMUAHSA-N 0.000 description 1
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 1
- CCYGNFBYUNHFSC-MGHWNKPDSA-N Ile-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CCYGNFBYUNHFSC-MGHWNKPDSA-N 0.000 description 1
- LNJLOZYNZFGJMM-DEQVHRJGSA-N Ile-His-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N LNJLOZYNZFGJMM-DEQVHRJGSA-N 0.000 description 1
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 1
- HUWYGQOISIJNMK-SIGLWIIPSA-N Ile-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HUWYGQOISIJNMK-SIGLWIIPSA-N 0.000 description 1
- MTONDYJJCIBZTK-PEDHHIEDSA-N Ile-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(=O)O)N MTONDYJJCIBZTK-PEDHHIEDSA-N 0.000 description 1
- XDUVMJCBYUKNFJ-MXAVVETBSA-N Ile-Lys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N XDUVMJCBYUKNFJ-MXAVVETBSA-N 0.000 description 1
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 1
- NNVXABCGXOLIEB-PYJNHQTQSA-N Ile-Met-His Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NNVXABCGXOLIEB-PYJNHQTQSA-N 0.000 description 1
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 1
- UOPBQSJRBONRON-STECZYCISA-N Ile-Met-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOPBQSJRBONRON-STECZYCISA-N 0.000 description 1
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 1
- NSPNUMNLZNOPAQ-SJWGOKEGSA-N Ile-Tyr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N NSPNUMNLZNOPAQ-SJWGOKEGSA-N 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 206010023201 Joint contracture Diseases 0.000 description 1
- 208000012659 Joint disease Diseases 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- 244000207740 Lemna minor Species 0.000 description 1
- 235000006439 Lemna minor Nutrition 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 1
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 1
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- QQXJROOJCMIHIV-AVGNSLFASA-N Leu-Val-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O QQXJROOJCMIHIV-AVGNSLFASA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 1
- YRWCPXOFBKTCFY-NUTKFTJISA-N Lys-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N YRWCPXOFBKTCFY-NUTKFTJISA-N 0.000 description 1
- HGZHSNBZDOLMLH-DCAQKATOSA-N Lys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N HGZHSNBZDOLMLH-DCAQKATOSA-N 0.000 description 1
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 1
- MLLKLNYPZRDIQG-GUBZILKMSA-N Lys-Cys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N MLLKLNYPZRDIQG-GUBZILKMSA-N 0.000 description 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 1
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 1
- GHOIOYHDDKXIDX-SZMVWBNQSA-N Lys-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 GHOIOYHDDKXIDX-SZMVWBNQSA-N 0.000 description 1
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 1
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 1
- FGMHXLULNHTPID-KKUMJFAQSA-N Lys-His-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CN=CN1 FGMHXLULNHTPID-KKUMJFAQSA-N 0.000 description 1
- GNLJXWBNLAIPEP-MELADBBJSA-N Lys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCCN)N)C(=O)O GNLJXWBNLAIPEP-MELADBBJSA-N 0.000 description 1
- JQSIGLHQNSZZRL-KKUMJFAQSA-N Lys-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N JQSIGLHQNSZZRL-KKUMJFAQSA-N 0.000 description 1
- BEGQVWUZFXLNHZ-IHPCNDPISA-N Lys-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 BEGQVWUZFXLNHZ-IHPCNDPISA-N 0.000 description 1
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 1
- SPNKGZFASINBMR-IHRRRGAJSA-N Lys-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N SPNKGZFASINBMR-IHRRRGAJSA-N 0.000 description 1
- KFSALEZVQJYHCE-AVGNSLFASA-N Lys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N KFSALEZVQJYHCE-AVGNSLFASA-N 0.000 description 1
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 1
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- BIWVMACFGZFIEB-VFAJRCTISA-N Lys-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCCN)N)O BIWVMACFGZFIEB-VFAJRCTISA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- 206010026865 Mass Diseases 0.000 description 1
- 208000024556 Mendelian disease Diseases 0.000 description 1
- MCNGIXXCMJAURZ-VEVYYDQMSA-N Met-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCSC)N)O MCNGIXXCMJAURZ-VEVYYDQMSA-N 0.000 description 1
- HLYIDXAXQIJYIG-CIUDSAMLSA-N Met-Gln-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HLYIDXAXQIJYIG-CIUDSAMLSA-N 0.000 description 1
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 1
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 1
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 1
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 1
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 1
- STLBOMUOQNIALW-BQBZGAKWSA-N Met-Gly-Cys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O STLBOMUOQNIALW-BQBZGAKWSA-N 0.000 description 1
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 1
- BKIFWLQFOOKUCA-DCAQKATOSA-N Met-His-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N BKIFWLQFOOKUCA-DCAQKATOSA-N 0.000 description 1
- AWGBEIYZPAXXSX-RWMBFGLXSA-N Met-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N AWGBEIYZPAXXSX-RWMBFGLXSA-N 0.000 description 1
- WTHGNAAQXISJHP-AVGNSLFASA-N Met-Lys-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WTHGNAAQXISJHP-AVGNSLFASA-N 0.000 description 1
- MIAZEQZXAFTCCG-UBHSHLNASA-N Met-Phe-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 MIAZEQZXAFTCCG-UBHSHLNASA-N 0.000 description 1
- FDGAMQVRGORBDV-GUBZILKMSA-N Met-Ser-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCSC FDGAMQVRGORBDV-GUBZILKMSA-N 0.000 description 1
- PVSPJQWHEIQTEH-JYJNAYRXSA-N Met-Val-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PVSPJQWHEIQTEH-JYJNAYRXSA-N 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- RUPXJRIDSUCQAN-PQNNUJSWSA-N N-[1,3-bis[3-[3-[5-[(2R,3R,4R,5R,6R)-3-acetamido-4,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxypentanoylamino]propylamino]-3-oxopropoxy]-2-[[3-[3-[5-[(2R,3R,4R,5R,6R)-3-acetamido-4,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxypentanoylamino]propylamino]-3-oxopropoxy]methyl]propan-2-yl]-12-[(2R,4R)-4-hydroxy-2-methylpyrrolidin-1-yl]-12-oxododecanamide Chemical compound C[C@@H]1C[C@@H](O)CN1C(=O)CCCCCCCCCCC(=O)NC(COCCC(=O)NCCCNC(=O)CCCCO[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1NC(C)=O)(COCCC(=O)NCCCNC(=O)CCCCO[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1NC(C)=O)COCCC(=O)NCCCNC(=O)CCCCO[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1NC(C)=O RUPXJRIDSUCQAN-PQNNUJSWSA-N 0.000 description 1
- 241000208125 Nicotiana Species 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 1
- HTKNPQZCMLBOTQ-XVSYOHENSA-N Phe-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O HTKNPQZCMLBOTQ-XVSYOHENSA-N 0.000 description 1
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 1
- UEXCHCYDPAIVDE-SRVKXCTJSA-N Phe-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEXCHCYDPAIVDE-SRVKXCTJSA-N 0.000 description 1
- QEPZQAPZKIPVDV-KKUMJFAQSA-N Phe-Cys-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N QEPZQAPZKIPVDV-KKUMJFAQSA-N 0.000 description 1
- VLZGUAUYZGQKPM-DRZSPHRISA-N Phe-Gln-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VLZGUAUYZGQKPM-DRZSPHRISA-N 0.000 description 1
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 1
- UNLYPPYNDXHGDG-IHRRRGAJSA-N Phe-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UNLYPPYNDXHGDG-IHRRRGAJSA-N 0.000 description 1
- RJYBHZVWJPUSLB-QEWYBTABSA-N Phe-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N RJYBHZVWJPUSLB-QEWYBTABSA-N 0.000 description 1
- PMKIMKUGCSVFSV-CQDKDKBSSA-N Phe-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N PMKIMKUGCSVFSV-CQDKDKBSSA-N 0.000 description 1
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 1
- MIICYIIBVYQNKE-QEWYBTABSA-N Phe-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N MIICYIIBVYQNKE-QEWYBTABSA-N 0.000 description 1
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 1
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 1
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 1
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 1
- FZBGMXYQPACKNC-HJWJTTGWSA-N Phe-Pro-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FZBGMXYQPACKNC-HJWJTTGWSA-N 0.000 description 1
- CKJACGQPCPMWIT-UFYCRDLUSA-N Phe-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CKJACGQPCPMWIT-UFYCRDLUSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- HBXAOEBRGLCLIW-AVGNSLFASA-N Phe-Ser-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HBXAOEBRGLCLIW-AVGNSLFASA-N 0.000 description 1
- MRWOVVNKSXXLRP-IHPCNDPISA-N Phe-Ser-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MRWOVVNKSXXLRP-IHPCNDPISA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 1
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 1
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 1
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 1
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 1
- 241001505332 Polyomavirus sp. Species 0.000 description 1
- 235000001855 Portulaca oleracea Nutrition 0.000 description 1
- 206010051077 Post procedural haemorrhage Diseases 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 1
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 1
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 1
- LUGOKRWYNMDGTD-FXQIFTODSA-N Pro-Cys-Asn Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O LUGOKRWYNMDGTD-FXQIFTODSA-N 0.000 description 1
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 1
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 1
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 1
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 1
- GBUNEGKQPSAMNK-QTKMDUPCSA-N Pro-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2)O GBUNEGKQPSAMNK-QTKMDUPCSA-N 0.000 description 1
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 1
- OFSZYRZOUMNCCU-BZSNNMDCSA-N Pro-Trp-Met Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCSC)C(O)=O)C(=O)[C@@H]1CCCN1 OFSZYRZOUMNCCU-BZSNNMDCSA-N 0.000 description 1
- QHSSUIHLAIWXEE-IHRRRGAJSA-N Pro-Tyr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O QHSSUIHLAIWXEE-IHRRRGAJSA-N 0.000 description 1
- BXHRXLMCYSZSIY-STECZYCISA-N Pro-Tyr-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O BXHRXLMCYSZSIY-STECZYCISA-N 0.000 description 1
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 1
- OOZJHTXCLJUODH-QXEWZRGKSA-N Pro-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 OOZJHTXCLJUODH-QXEWZRGKSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 241000125945 Protoparvovirus Species 0.000 description 1
- 238000012228 RNA interference-mediated gene silencing Methods 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 206010038980 Retroperitoneal haemorrhage Diseases 0.000 description 1
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- RFBKULCUBJAQFT-BIIVOSGPSA-N Ser-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CO)N)C(=O)O RFBKULCUBJAQFT-BIIVOSGPSA-N 0.000 description 1
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 1
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 1
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 1
- MLSQXWSRHURDMF-GARJFASQSA-N Ser-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N)C(=O)O MLSQXWSRHURDMF-GARJFASQSA-N 0.000 description 1
- JEHPKECJCALLRW-CUJWVEQBSA-N Ser-His-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEHPKECJCALLRW-CUJWVEQBSA-N 0.000 description 1
- ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 1
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- NIOYDASGXWLHEZ-CIUDSAMLSA-N Ser-Met-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOYDASGXWLHEZ-CIUDSAMLSA-N 0.000 description 1
- AXOHAHIUJHCLQR-IHRRRGAJSA-N Ser-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CO)N AXOHAHIUJHCLQR-IHRRRGAJSA-N 0.000 description 1
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- RTXKJFWHEBTABY-IHPCNDPISA-N Ser-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CO)N RTXKJFWHEBTABY-IHPCNDPISA-N 0.000 description 1
- BIWBTRRBHIEVAH-IHPCNDPISA-N Ser-Tyr-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BIWBTRRBHIEVAH-IHPCNDPISA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 1
- 208000026552 Severe hemophilia A Diseases 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 244000061456 Solanum tuberosum Species 0.000 description 1
- 235000002595 Solanum tuberosum Nutrition 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 1
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 1
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 1
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 1
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 1
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- ZBKDBZUTTXINIX-RWRJDSDZSA-N Thr-Ile-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZBKDBZUTTXINIX-RWRJDSDZSA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- XVHAUVJXBFGUPC-RPTUDFQQSA-N Thr-Tyr-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XVHAUVJXBFGUPC-RPTUDFQQSA-N 0.000 description 1
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 108090000190 Thrombin Proteins 0.000 description 1
- 241000723873 Tobacco mosaic virus Species 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- 244000098338 Triticum aestivum Species 0.000 description 1
- VZBWRZGNEPBRDE-HZUKXOBISA-N Trp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N VZBWRZGNEPBRDE-HZUKXOBISA-N 0.000 description 1
- UPOGHWJJZAZNSW-XIRDDKMYSA-N Trp-His-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O UPOGHWJJZAZNSW-XIRDDKMYSA-N 0.000 description 1
- KULBQAVOXHQLIY-HSCHXYMDSA-N Trp-Ile-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 KULBQAVOXHQLIY-HSCHXYMDSA-N 0.000 description 1
- ILDJYIDXESUBOE-HSCHXYMDSA-N Trp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ILDJYIDXESUBOE-HSCHXYMDSA-N 0.000 description 1
- NLWCSMOXNKBRLC-WDSOQIARSA-N Trp-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLWCSMOXNKBRLC-WDSOQIARSA-N 0.000 description 1
- GSCPHMSPGQSZJT-JYBASQMISA-N Trp-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O GSCPHMSPGQSZJT-JYBASQMISA-N 0.000 description 1
- ZPZNQAZHMCLTOA-PXDAIIFMSA-N Trp-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 ZPZNQAZHMCLTOA-PXDAIIFMSA-N 0.000 description 1
- UIRVSEPRMWDVEW-RNXOBYDBSA-N Trp-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N UIRVSEPRMWDVEW-RNXOBYDBSA-N 0.000 description 1
- GLNADSQYFUSGOU-GPTZEZBUSA-J Trypan blue Chemical compound [Na+].[Na+].[Na+].[Na+].C1=C(S([O-])(=O)=O)C=C2C=C(S([O-])(=O)=O)C(/N=N/C3=CC=C(C=C3C)C=3C=C(C(=CC=3)\N=N\C=3C(=CC4=CC(=CC(N)=C4C=3O)S([O-])(=O)=O)S([O-])(=O)=O)C)=C(O)C2=C1N GLNADSQYFUSGOU-GPTZEZBUSA-J 0.000 description 1
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 description 1
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 1
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 1
- HFJJDMOFTCQGEI-STECZYCISA-N Tyr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HFJJDMOFTCQGEI-STECZYCISA-N 0.000 description 1
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 1
- QSFJHIRIHOJRKS-ULQDDVLXSA-N Tyr-Leu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QSFJHIRIHOJRKS-ULQDDVLXSA-N 0.000 description 1
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- GZOCMHSZGGJBCX-ULQDDVLXSA-N Tyr-Lys-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O GZOCMHSZGGJBCX-ULQDDVLXSA-N 0.000 description 1
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 1
- QPBJXNYYQTUTDD-KKUMJFAQSA-N Tyr-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QPBJXNYYQTUTDD-KKUMJFAQSA-N 0.000 description 1
- PLXQRTXVLZUNMU-RNXOBYDBSA-N Tyr-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC4=CC=C(C=C4)O)N PLXQRTXVLZUNMU-RNXOBYDBSA-N 0.000 description 1
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 1
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 1
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 1
- LDKDSFQSEUOCOO-RPTUDFQQSA-N Tyr-Thr-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LDKDSFQSEUOCOO-RPTUDFQQSA-N 0.000 description 1
- LVILBTSHPTWDGE-PMVMPFDFSA-N Tyr-Trp-Lys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(O)=O)C1=CC=C(O)C=C1 LVILBTSHPTWDGE-PMVMPFDFSA-N 0.000 description 1
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 1
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 1
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 1
- CCEVJBJLPRNAFH-BVSLBCMMSA-N Tyr-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N CCEVJBJLPRNAFH-BVSLBCMMSA-N 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- YLHLNFUXDBOAGX-DCAQKATOSA-N Val-Cys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YLHLNFUXDBOAGX-DCAQKATOSA-N 0.000 description 1
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 1
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- AHHJARQXFFGOKF-NRPADANISA-N Val-Glu-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N AHHJARQXFFGOKF-NRPADANISA-N 0.000 description 1
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 1
- ZEBRMWPTJNHXAJ-JYJNAYRXSA-N Val-Phe-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O)N ZEBRMWPTJNHXAJ-JYJNAYRXSA-N 0.000 description 1
- BCBFMJYTNKDALA-UFYCRDLUSA-N Val-Phe-Phe Chemical compound N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O BCBFMJYTNKDALA-UFYCRDLUSA-N 0.000 description 1
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- NGXQOQNXSGOYOI-BQFCYCMXSA-N Val-Trp-Gln Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 NGXQOQNXSGOYOI-BQFCYCMXSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 230000005856 abnormality Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 230000001464 adherent effect Effects 0.000 description 1
- 239000000853 adhesive Substances 0.000 description 1
- 230000001070 adhesive effect Effects 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 150000001413 amino acids Chemical group 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 230000001857 anti-mycotic effect Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 239000002543 antimycotic Substances 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 239000003686 blood clotting factor concentrate Substances 0.000 description 1
- 230000023555 blood coagulation Effects 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- YYRMJZQKEFZXMX-UHFFFAOYSA-L calcium bis(dihydrogenphosphate) Chemical compound [Ca+2].OP(O)([O-])=O.OP(O)([O-])=O YYRMJZQKEFZXMX-UHFFFAOYSA-L 0.000 description 1
- 229910001424 calcium ion Inorganic materials 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 108090001015 cancer procoagulant Proteins 0.000 description 1
- 210000000234 capsid Anatomy 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 239000013553 cell monolayer Substances 0.000 description 1
- 230000003833 cell viability Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 239000003593 chromogenic compound Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 238000013377 clone selection method Methods 0.000 description 1
- 230000015271 coagulation Effects 0.000 description 1
- 238000005345 coagulation Methods 0.000 description 1
- 229940105756 coagulation factor x Drugs 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- 235000005822 corn Nutrition 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 229960004281 desmopressin Drugs 0.000 description 1
- NFLWUMRGJYTJIN-NXBWRCJVSA-N desmopressin Chemical compound C([C@H]1C(=O)N[C@H](C(N[C@@H](CC(N)=O)C(=O)N[C@@H](CSSCCC(=O)N[C@@H](CC=2C=CC(O)=CC=2)C(=O)N1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(N)=O)=O)CCC(=O)N)C1=CC=CC=C1 NFLWUMRGJYTJIN-NXBWRCJVSA-N 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 210000002257 embryonic structure Anatomy 0.000 description 1
- 229950006925 emicizumab Drugs 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 229940088598 enzyme Drugs 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 210000001723 extracellular space Anatomy 0.000 description 1
- 229950003499 fibrin Drugs 0.000 description 1
- 210000002950 fibroblast Anatomy 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 229950002735 fitusiran Drugs 0.000 description 1
- 230000037433 frameshift Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 230000009368 gene silencing by RNA Effects 0.000 description 1
- 230000023597 hemostasis Effects 0.000 description 1
- 239000002874 hemostatic agent Substances 0.000 description 1
- 239000000833 heterodimer Substances 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 238000001802 infusion Methods 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 210000002490 intestinal epithelial cell Anatomy 0.000 description 1
- 208000020658 intracerebral hemorrhage Diseases 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 210000003292 kidney cell Anatomy 0.000 description 1
- 238000011813 knockout mouse model Methods 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 210000005229 liver cell Anatomy 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 235000019691 monocalcium phosphate Nutrition 0.000 description 1
- 210000000663 muscle cell Anatomy 0.000 description 1
- 230000036963 noncompetitive effect Effects 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 238000001543 one-way ANOVA Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 230000000671 osmolytic effect Effects 0.000 description 1
- 239000002357 osmotic agent Substances 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 208000024335 physical disease Diseases 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 230000002265 prevention Effects 0.000 description 1
- 230000003449 preventive effect Effects 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- 238000003753 real-time PCR Methods 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 238000004153 renaturation Methods 0.000 description 1
- 101150066583 rep gene Proteins 0.000 description 1
- 230000000241 respiratory effect Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 239000000523 sample Substances 0.000 description 1
- 101150057323 sar gene Proteins 0.000 description 1
- 238000009097 single-agent therapy Methods 0.000 description 1
- 210000004989 spleen cell Anatomy 0.000 description 1
- 230000037436 splice-site mutation Effects 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000011272 standard treatment Methods 0.000 description 1
- 239000012089 stop solution Substances 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 229960004072 thrombin Drugs 0.000 description 1
- 230000001256 tonic effect Effects 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 238000005199 ultracentrifugation Methods 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 229940060306 valoctocogene roxaparvovec Drugs 0.000 description 1
- 239000011534 wash buffer Substances 0.000 description 1
Abstract
Description
Область техники, к которой относится изобретениеField of technology to which the invention relates
Настоящая заявка относится к области генетики, генной терапии и молекулярной биологии. Более конкретно, настоящее изобретение относится к нуклеиновой кислоте, которая кодирует слитый белок на основе FVIII-BDD (фактор свертывания крови VIII с делетированным В доменом) и гетерологичного сигнального пептида, экспрессионной кассете и вектору на ее основе, клетке-хозяину для получения слитого белка на основе FVIII-BDD и гетерологичного сигнального пептида, а также к различным применениям вышеуказанного вектора.This application relates to the fields of genetics, gene therapy and molecular biology. More specifically, the present invention relates to a nucleic acid that encodes a fusion protein based on FVIII-BDD (blood clotting factor VIII with B domain deleted) and a heterologous signal peptide, an expression cassette and a vector based thereon, a host cell for producing the fusion protein at based on FVIII-BDD and heterologous signal peptide, as well as various applications of the above vector.
Уровень техникиState of the art
Гемофилия - это рецессивное, Х-сцепленное, врожденное нарушение, ведущее к дефициту одного из белков, участвующих во вторичном гемостазе. Гемофилия А, или классическая гемофилия, - наиболее распространенная форма гемофилии; встречается у 1 из 5000 новорожденных мальчиков (Report on the annual global survey 2019, WFH https://www.wfh.org/en/our-work-research-data/annual-global-survey) и обусловлена дефицитом белка фактора свертывания VIII. Согласно данным Всероссийского общества гемофилии, в России насчитывается более 6,5 тыс.пациентов с гемофилией A (Report on the annual global survey 2019, WFH).Hemophilia is a recessive, X-linked, congenital disorder leading to a deficiency of one of the proteins involved in secondary hemostasis. Hemophilia A, or classical hemophilia, is the most common form of hemophilia; occurs in 1 in 5,000 newborn boys (Report on the annual global survey 2019, WFH https://www.wfh.org/en/our-work-research-data/annual-global-survey) and is caused by a deficiency of the coagulation factor VIII protein . According to the All-Russian Hemophilia Society, there are more than 6.5 thousand patients with hemophilia A in Russia (Report on the annual global survey 2019, WFH).
Фактор свертывания VIII (FVIII) - это белок размером 280 кДа, который секретируется в кровь преимущественно из синусоидальных эпителиальных клеток печени (Fahs SA, Hille МТ, Shi Q, Weiler H, Montgomery RR. A conditional knockout mouse model reveals endothelial cells as the principal and possibly exclusive source of plasma factor VIII/ Blood. 2014 Jun 12;123(24):3706-13. doi: 10.1182/blood-2014-02-555151. Epub 2014 Apr 4. PMID: 24705491 и Everett LA, Cleuren AC, Khoriaty RN, Ginsburg D. Murine coagulation factor VIII is synthesized in endothelial cells/ Blood. 2014 Jun 12;123(24):3697-705. doi: 10.1182/blood-2014-02-554501. Epub 2014 Apr 9. PMID: 24719406). Активированный FVIII циркулирует в организме в виде гетеродимера, состоящего из тяжелой (домены A1, А2, В) и легкой (домены A3, C1, С3) цепей, связанных друг с другом за счет нековалентных металл-зависимых взаимодействий. В результате процессинга FVIII в активированной форме белка присутствуют только домены А1-А3, C1, С2. Этот факт способствовал созданию рекомбинантного FVIII с удаленным В доменом (FVIII-BDD), который не уступает по своей активности полноразмерному FVIII (Pittman DD, Alderman EM, Tomkinson KN, Wang JH, Giles AR, Kaufman RJ. Biochemical, immunological, and in vivo functional characterization of B-domain-deleted factor VIII/ Blood. 1993 Jun 1; 81(11):2925-35. PMID: 8499631). В отличии от других белков каскада свертывания крови, которые преимущественно относятся к протеазам, FVIII - гликопротеин. Однако он играет критичную роль в формировании теназного комплекса, который необходим для образования активированного фактора свертывания X (FXa), первого члена окончательного общего пути коагуляции, что в итоге приводит к образованию сшитого фибрина.Coagulation factor VIII (FVIII) is a 280 kDa protein that is secreted into the blood primarily from sinusoidal epithelial cells of the liver (Fahs SA, Hille MT, Shi Q, Weiler H, Montgomery RR. A conditional knockout mouse model reveals endothelial cells as the principal and possibly exclusive source of plasma factor VIII/ Blood 2014 Jun 12;123(24):3706-13. doi: 10.1182/blood-2014-02-555151. Epub 2014 Apr 4. PMID: 24705491 and Everett LA, Cleuren AC , Khoriaty RN, Ginsburg D. Murine coagulation factor VIII is synthesized in endothelial cells/ Blood 2014 Jun 12;123(24):3697-705. doi: 10.1182/blood-2014-02-554501 Epub 2014 Apr 9. : 24719406). Activated FVIII circulates in the body as a heterodimer consisting of heavy (domains A1, A2, B) and light (domains A3, C1, C3) chains linked to each other through non-covalent metal-dependent interactions. As a result of FVIII processing, only domains A1-A3, C1, C2 are present in the activated form of the protein. This fact contributed to the creation of recombinant FVIII with the B domain deleted (FVIII-BDD), which is not inferior in its activity to full-length FVIII (Pittman DD, Alderman EM, Tomkinson KN, Wang JH, Giles AR, Kaufman RJ. Biochemical, immunological, and in vivo functional characterization of B-domain-deleted factor VIII/ Blood. 1993 Jun 1; 81(11):2925-35. Unlike other proteins in the blood coagulation cascade, which are predominantly proteases, FVIII is a glycoprotein. However, it plays a critical role in the formation of the tenase complex, which is required for the formation of activated coagulation factor X (FXa), the first member of the final common coagulation pathway, which ultimately leads to the formation of cross-linked fibrin.
Более чем в половине случаев к тяжелой форме гемофилии А приводят инверсии в 1 или 22 интроне гена FVIII (Habart D, Kalabova D, Novotny M, Vorlova Z. Thirty-four novel mutations detected in factor VIII gene by multiplex CSGE: modeling of 13 novel amino acid substitutions/J ThrombHaemost. 2003 Apr; 1(4):773-81. doi: 10.1046/j.l538-7836.2003.00149.x. PMID: 1287141). В других случаях нарушения в последовательности FVIII связаны с различными мутациями, в том числе антисмысловыми мутациями, сдвигами рамки считывания, мутациями в сплайсинг-сайтах, делециями и инсерциями.In more than half of the cases, severe hemophilia A is caused by inversions in intron 1 or 22 of the FVIII gene (Habart D, Kalabova D, Novotny M, Vorlova Z. Thirty-four novel mutations detected in factor VIII gene by multiplex CSGE: modeling of 13 novel amino acid substitutions/J ThrombHaemost. 2003 Apr; 1(4):773-81. doi: 10.1046/j.l538-7836.2003.00149.x. In other cases, abnormalities in the FVIII sequence are associated with various mutations, including antisense mutations, frameshifts, splice site mutations, deletions, and insertions.
Склонность к кровотечениям при гемофилии А взаимосвязана с определяемой активностью FVIII и классифицируется как легкая (0,05-0,40 ME/мл), умеренная (0,01-0,05 ME/мл) или тяжелая (<0,01 ME/мл). Пациенты с легкой формой гемофилии, как правило, испытывают аномальное кровотечение только в связи с оперативным вмешательством или полученными травмами. Напротив, у пациентов с умеренной формой гемофилии наблюдаются длительные реакции кровотечения на относительно незначительные травмы, а у пациентов с тяжелой формой заболевания часто возникают спонтанные кровотечения. Тяжелая форма гемофилии А проявляется спонтанными гемартрозом, гематомами мягких тканей, забрюшинными кровотечениями, внутримозговыми кровоизлияниями и отсроченными послеоперационными кровотечениями. Со временем осложнения от рецидивирующего гемартроза и гематом мягких тканей включают тяжелую артропатию, контрактуры суставов и псевдоопухоли, приводящие к хроническим заболеваниям. Доля пациентов с легкой, средней и тяжелой формами гемофилии А точно не установлена, но, согласно последним эпидемиологическим исследованиям, предполагается, что примерно 60% пациентов с гемофилией А имеют тяжелую форму (Report on the annual global survey 2019, WFH).The tendency to bleed in hemophilia A is related to detectable FVIII activity and is classified as mild (0.05-0.40 IU/ml), moderate (0.01-0.05 IU/ml) or severe (<0.01 IU/ml). ml). Patients with mild hemophilia typically experience abnormal bleeding only due to surgery or injury. In contrast, patients with moderate hemophilia experience prolonged bleeding reactions to relatively minor injuries, and patients with severe hemophilia often experience spontaneous bleeding. The severe form of hemophilia A is manifested by spontaneous hemarthrosis, soft tissue hematomas, retroperitoneal bleeding, intracerebral hemorrhage and delayed postoperative bleeding. Over time, complications from recurrent hemarthrosis and soft tissue hematomas include severe arthropathy, joint contractures, and pseudotumors leading to chronic disease. The proportion of patients with mild, moderate and severe forms of hemophilia A is not precisely established, but recent epidemiological studies suggest that approximately 60% of patients with hemophilia A have severe form (Report on the annual global survey 2019, WFH).
На сегодняшний день стандартная схема лечения пациентов, страдающих гемофилией А, заключается в пожизненной заместительной терапии в виде инъекций рекомбинантного FVIII (Report on the annual global survey 2019, WFH). Несмотря на успех терапии гемофилии, существуют серьезные проблемы подобного подхода. Профилактическая заместительная терапия гемофилии А предполагает внутривенные инъекции рекомбинантного FVIII каждые три дня в течение всей жизни пациента с тяжелой формой заболевания. Подобный формат лечения является очень дорогостоящим и не гарантирует отсутствия осложнений, связанных в первую очередь с гемартрозом. В некоторых случаях у пациентов вырабатываются ингибирующие антитела. Ингибиторные формы гемофилии А чаще наблюдаются у пациентов с тяжелой формой заболевания и требуют применения альтернативных подходов к лечению и профилактике заболевания (Eckhardt CL, van der Bom JG, van der Naald M, Peters M, Kamphuisen PW, Fijnvandraat K. Surgery and inhibitor development in hemophilia A: a systematic review/ J Thromb Haemost. 2011 Oct; 9(10): 1948-58. doi: 10.1111/j.1538-7836.2011.04467.x. PMID: 21838755).Today, the standard treatment regimen for patients suffering from hemophilia A is lifelong replacement therapy in the form of injections of recombinant FVIII (Report on the annual global survey 2019, WFH). Despite the success of hemophilia therapy, there are serious problems with this approach. Preventive replacement therapy for hemophilia A involves intravenous injections of recombinant FVIII every three days for the life of a patient with severe disease. This type of treatment is very expensive and does not guarantee the absence of complications associated primarily with hemarthrosis. In some cases, patients develop inhibitory antibodies. Inhibitory forms of hemophilia A are more often observed in patients with severe forms of the disease and require the use of alternative approaches to the treatment and prevention of the disease (Eckhardt CL, van der Bom JG, van der Naald M, Peters M, Kamphuisen PW, Fijnvandraat K. Surgery and inhibitor development in hemophilia A: a systematic review/ J Thromb Haemost. 2011 Oct; 9(10): 1948-58. doi: 10.1111/j.1538-7836.2011.04467.x.
Генная терапия гемофилии А посредством вирусных (экспрессионных) векторов на основе аденоассоциированных вирусов (AAV), кодирующих ген FVIII, показала впечатляющие результаты в серии доклинических и клинических исследований (Bunting S, Zhang L, Xie L, Bullens S, Mahimkar R, Fong S, Sandza K, Harmon D, Yates B, Handyside B, Sihn CR, Galicia N, Tsuruda L, O'Neill CA, Bagri A, Colosi P, Long S, Vehar G, Carter B. Gene Therapy with BMN 270 Results in Therapeutic Levels of FVIII in Mice and Primates and Normalization of Bleeding in Hemophilic Mice/ Mol Ther. 2018 Feb 7;26(2):496-509. doi: 10.1016/j.ymthe.2017.12.009. Epub 2017 Dec 14.PMID: 29292164 и Peyvandi F, Garagiola I. Clinical advances in gene therapy updates on clinical trials of gene therapy in haemophilia/ Haemophilia. 2019 Sep; 25(5):738-746. doi: 10.1111/hae.13816. Epub 2019 Jul 8.PMID: 31282050). В отличие от традиционных подходов к лечению гемофилии, генная терапия с использованием AAV позволяет поддерживать уровень экспрессии привнесенного FVIII на достаточном уровне в течение нескольких лет после однократного введения препарата пациентам (Long BR, Veron Р, Kuranda К, Hardet R, Mitchell N, Hayes GM, Wong WY, Lau K, Li M, Hock MB, Zoog SJ, Vettermann C, Mingozzi F, Schweighardt B. Early Phase Clinical Immunogenicity of Valoctocogene Roxaparvovec, an AAV5-Mediated Gene Therapy for Hemophilia A/ Mol Ther. 2021 Feb 3; 29(2):597-610. doi: 10.1016/j.ymthe.2020.12.008. Epub 2020 Dec 10. PMID: 33309883).Gene therapy for hemophilia A through adeno-associated virus (AAV)-based viral (expression) vectors encoding the FVIII gene has shown impressive results in a series of preclinical and clinical studies (Bunting S, Zhang L, Xie L, Bullens S, Mahimkar R, Fong S, Sandza K, Harmon D, Yates B, Handyside B, Sihn CR, Galicia N, Tsuruda L, O'Neill CA, Bagri A, Colosi P, Long S, Vehar G, Carter B. Gene Therapy with BMN 270 Results in Therapeutic Levels of FVIII in Mice and Primates and Normalization of Bleeding in Hemophilic Mice/ Mol Ther. 2018 Feb 7;26(2):496-509. and Peyvandi F, Garagiola I. Clinical advances in gene therapy updates on clinical trials of gene therapy in haemophilia/ Haemophilia 2019 Sep; doi: 10.1111/hae.13816 Epub 2019 Jul 8.PMID. : 31282050). In contrast to traditional approaches to the treatment of hemophilia, gene therapy using AAV allows the expression of introduced FVIII to be maintained at sufficient levels for several years after a single dose of the drug to patients (Long BR, Veron P, Kuranda K, Hardet R, Mitchell N, Hayes GM , Wong WY, Lau K, Li M, Hock MB, Zoog SJ, Vettermann C, Mingozzi F, Schweighardt B. Early Phase Clinical Immunogenicity of Valoctocogene Roxaparvovec, an AAV5-Mediated Gene Therapy for Hemophilia A/ Mol Ther 2021; 29(2):597-610. doi: 10.1016/j.ymthe.2020.12.008. Epub 2020 Dec 10. PMID: 33309883).
На данный момент, в мире нет ни одного зарегистрированного генотерапевтического препарата для лечения гемофилии А.At the moment, there is not a single registered gene therapy drug in the world for the treatment of hemophilia A.
Таким образом, является актуальным разработка генотерапевтического препарата для лечения гемофилии А, а также решений, которые позволят улучшить эффективность генотерапевтического препарата для лечения гемофилии А.Thus, it is relevant to develop a gene therapy drug for the treatment of hemophilia A, as well as solutions that will improve the effectiveness of a gene therapy drug for the treatment of hemophilia A.
Раскрытие сущности изобретенияDisclosure of the invention
Авторы изобретения неожиданно обнаружили, что использование нуклеиновой кислоты, кодирующейThe inventors have unexpectedly discovered that the use of a nucleic acid encoding
1) слитый белок на основе FVIII-BDD и сигнального пептида FIX (СП-FIX), который имеет аминокислотную последовательность SEQ ID NO: 7 или1) a fusion protein based on FVIII-BDD and signal peptide FIX (SP-FIX), which has the amino acid sequence SEQ ID NO: 7 or
2) слитый белок на основе FVIII-BDD и сигнального пептида Карра цепи иммуноглобулина G (СП-IgGK), который имеет аминокислотную последовательность SEQ ID NO: 8 или2) a fusion protein based on FVIII-BDD and immunoglobulin G Carr signal peptide (SP-IgGK), which has the amino acid sequence SEQ ID NO: 8 or
3) слитый белок на основе FVIII-BDD и сигнального пептида Lactalbumin (СП-Lactalbumin), который имеет аминокислотную последовательность SEQ ID NO: 9,3) a fusion protein based on FVIII-BDD and Lactalbumin signal peptide (SP-Lactalbumin), which has the amino acid sequence SEQ ID NO: 9,
приводит к увеличению уровня продукции и активности белка FVIII-BDD по сравнению с использованием нуклеиновой кислоты, кодирующей белок FVIII-BDD с природным сигнальным пептидом FVIII (дикий тип).results in an increase in the level of production and activity of the FVIII-BDD protein compared to the use of a nucleic acid encoding the FVIII-BDD protein with a natural FVIII signal peptide (wild type).
Определения и общие методыDefinitions and general methods
Если иное не определено в настоящем документе, научные и технические термины, используемые в связи с настоящим изобретением, будут иметь значения, которые обычно понятны специалистам в данной области.Unless otherwise defined herein, scientific and technical terms used in connection with the present invention will have meanings that are commonly understood by those skilled in the art.
Кроме того, если по контексту не требуется иное, термины в единственном числе включают в себя термины во множественном числе, и термины во множественном числе включают в себя термины в единственном числе. Как правило, используемая классификация и методы культивирования клеток, молекулярной биологии, иммунологии, микробиологии, генетики, аналитической химии, химии органического синтеза, медицинской и фармацевтической химии, а также гибридизации и химии белка и нуклеиновых кислот, описанные в настоящем документе, хорошо известны специалистам и широко применяются в данной области. Ферментативные реакции и способы очистки осуществляют в соответствии с инструкциями производителя, как это обычно осуществляется в данной области, или как описано в настоящем документе.In addition, unless the context otherwise requires, terms in the singular include plural terms, and plural terms include singular terms. In general, the classification and methods of cell culture, molecular biology, immunology, microbiology, genetics, analytical chemistry, organic synthesis chemistry, medicinal and pharmaceutical chemistry, and hybridization and protein and nucleic acid chemistry described herein are well known to those skilled in the art. widely used in this field. Enzymatic reactions and purification methods are carried out in accordance with the manufacturer's instructions, as is typically carried out in the art, or as described herein.
Определения «встречающийся в природе», «нативный» или «дикого типа» используют для описания объекта, который можно обнаружить в природе как отличающийся от получаемого искусственно. Например, белок или нуклеотидная последовательность, присутствующие в организме, в том числе в составе вируса, которые можно изолировать из источника в природе, и которые не модифицированы умышленно специалистом в лаборатории, являются встречающимися в природе.The terms “naturally occurring,” “native,” or “wild type” are used to describe an object that can be found in nature to be different from that produced artificially. For example, a protein or nucleotide sequence that is present in an organism, including a virus, that can be isolated from a source in nature, and that has not been intentionally modified by a person skilled in the laboratory, is naturally occurring.
В настоящем описании и в последующей формуле изобретения, если контекстом не предусмотрено иное, слова «включать» и «содержать» или их вариации, такие как «включает», «включающий», «содержит» или «содержащий», следует понимать как включение указанного целого или группы целых, но не исключение любого другого целого или группы целых.In the present specification and in the following claims, unless the context otherwise requires, the words “include” and “comprise” or variations thereof such as “includes”, “comprising”, “contains” or “comprising” are to be understood as including the specified whole or group of wholes, but not to the exclusion of any other whole or group of wholes.
Белок (Пептид)Protein (Peptide)
В настоящем описании термины «пептид», «полипептид» и «белок» используют взаимозаменяемо, и они относятся к соединению, состоящему из аминокислотных остатков, ковалентно связанных пептидными связями. Полипептиды включают природные пептиды, рекомбинантные пептиды, синтетические пептиды или их комбинацию.As used herein, the terms "peptide", "polypeptide" and "protein" are used interchangeably and refer to a compound consisting of amino acid residues covalently linked by peptide bonds. Polypeptides include natural peptides, recombinant peptides, synthetic peptides, or a combination thereof.
Молекулы нуклеиновых кислотNucleic acid molecules
Термины «нуклеиновая кислота», «нуклеиновая последовательность» или «нуклеиновокислотная последовательность», «полинуклеотид», «олигонуклеотид», «полинуклеотидная последовательность» и «нуклеотидная последовательность», которые используются равнозначно в данном описании, обозначают четкую последовательность нуклеотидов, модифицированных или не модифицированных, определяющую фрагмент или участок нуклеиновой кислоты, содержащую или не содержащую неприродные нуклеотиды и являющуюся либо двухцепочечной ДНК или РНК, либо одноцепочечной ДНК или РНК, либо продуктами транскрипции указанных ДНК.The terms "nucleic acid", "nucleic sequence" or "nucleic acid sequence", "polynucleotide", "oligonucleotide", "polynucleotide sequence" and "nucleotide sequence", as used interchangeably herein, refer to a distinct sequence of nucleotides, modified or unmodified , defining a fragment or region of nucleic acid, containing or not containing unnatural nucleotides and being either double-stranded DNA or RNA, or single-stranded DNA or RNA, or transcription products of these DNAs.
Как применяют в настоящем описании, полинуклеотиды включают, в качестве неограничивающих примеров, все последовательности нуклеиновой кислоты, получаемые любыми способами, доступными в этой области, включая, в качестве неограничивающих примеров, рекомбинантные способы, т.е. клонирование последовательностей нуклеиновой кислоты из рекомбинантной библиотеки или генома клетки, использование обычной технологии клонирования и ПЦР и т.п., и способами синтеза.As used herein, polynucleotides include, but are not limited to, all nucleic acid sequences produced by any methods available in the art, including, but not limited to, recombinant methods, i.e. cloning nucleic acid sequences from a recombinant library or cell genome, using conventional cloning and PCR technology, etc., and synthesis methods.
Здесь также следует упомянуть, что данное изобретение не относится к нуклеотидным последовательностям в их природной хромосомной среде, т.е. в природном состоянии. Последовательности данного изобретения были выделены и/или очищены, т.е. были взяты прямо или косвенно, например, путем копирования, при этом их среда была по меньшей мере частично модифицирована. Таким образом, также здесь следует подразумевать изолированные нуклеиновые кислоты, полученные путем генетической рекомбинации, например, с помощью принимающих клеток (клеток-хозяев), или полученные путем химического синтеза.It should also be mentioned here that this invention does not relate to nucleotide sequences in their natural chromosomal environment, i.e. in its natural state. The sequences of the present invention have been isolated and/or purified, i.e. were taken directly or indirectly, for example by copying, and their environment was at least partially modified. Thus, it should also be understood here as isolated nucleic acids obtained by genetic recombination, for example by host cells, or obtained by chemical synthesis.
Термин нуклеотидная последовательность охватывает его комплемент, если не указано иное. Таким образом, нуклеиновую кислоту, имеющую определенную последовательность следует понимать как охватывающие ее комплементарную цепь с ее комплементарной последовательностью.The term nucleotide sequence covers its complement unless otherwise specified. Thus, a nucleic acid having a specific sequence should be understood as comprising its complementary strand with its complementary sequence.
ВекторVector
Термин «вектор» при использовании в настоящем документе означает молекулу нуклеиновой кислоты, способную транспортировать другую нуклеиновую кислоту, с которой она соединена. Кроме того, термин «вектор» в данном настоящем документе означает рекомбинантную вирусную частицу, способную транспортировать нуклеиновую кислоту.The term “vector” as used herein means a nucleic acid molecule capable of transporting another nucleic acid to which it is linked. In addition, the term “vector” as used herein means a recombinant viral particle capable of transporting a nucleic acid.
Как применяют в настоящем описании, термин «экспрессия» определяют как транскрипцию и/или трансляцию конкретной нуклеотидной последовательности, запускаемую ее промотором.As used herein, the term “expression” is defined as the transcription and/or translation of a particular nucleotide sequence driven by its promoter.
ПрименениеApplication
«Генная терапия» представляет собой вставку генов в клетки и/или ткани субъекта для лечения заболевания, обычно, наследственных заболеваний, при этом дефектный мутантный аллель заменяется или дополняется функциональным аллелем.“Gene therapy” is the insertion of genes into the cells and/or tissues of a subject to treat a disease, usually an inherited disease, whereby a defective mutant allele is replaced or supplemented with a functional allele.
«Лечить», «лечение» и «терапия» относятся к методу смягчения или устранения биологического расстройства и/или по меньшей мере одного из сопутствующих ему симптомов.“Treat,” “cure,” and “therapy” refer to a method of alleviating or eliminating a biological disorder and/or at least one of its associated symptoms.
Термин «субъект», «пациент», «индивидуум» и т.п. используют в настоящем описании взаимозаменяемо, и они относятся к любому животному, которое поддается воздействию способами, представленными в настоящем описании. В конкретных неограничивающих вариантах осуществления субъект, пациент или индивидуум является человеком. Вышеупомянутый субъект может быть мужского или женского пола любого возраста.The term “subject”, “patient”, “individual”, etc. are used interchangeably herein and refer to any animal that can be treated by the methods presented herein. In specific non-limiting embodiments, the subject, patient, or individual is a human. The above subject may be male or female of any age.
«Терапевтически эффективным количеством» или «эффективным количеством» считается количество вводимого терапевтического агента, которое избавит в определенной степени от одного или нескольких симптомов заболевания, по поводу которого проводится лечение.A "therapeutically effective amount" or "effective amount" is considered to be the amount of a therapeutic agent administered that will relieve, to a certain extent, one or more symptoms of the disease being treated.
Подробное описание изобретенияDetailed Description of the Invention
Нуклеиновая кислотаNucleic acid
В одном из аспектов настоящее изобретение относится к выделенной нуклеиновой кислоте, которая кодирует слитый белок на основе FVIII-BDD (фактор свертывания крови VIII с делетированным В доменом) и гетерологичного сигнального пептида, который включает аминокислотную последовательность, выбранную из SEQ ID NO: 7, SEQ ID NO: 8 или SEQ ID NO: 9.In one aspect, the present invention provides an isolated nucleic acid that encodes a fusion protein of FVIII-BDD (blood clotting factor VIII with B domain deleted) and a heterologous signal peptide that includes an amino acid sequence selected from SEQ ID NO: 7, SEQ ID NO: 8 or SEQ ID NO: 9.
«Выделенная» молекула нуклеиновой кислоты представляет собой молекулу нуклеиновой кислоты, которая идентифицирована и отделена от по меньшей мере одной молекулы нуклеиновой кислоты-примеси. Выделенная молекула нуклеиновой кислоты отличается от той формы или набора, в которых она находится в естественных условиях. Таким образом, выделенная молекула нуклеиновой кислоты отличается от молекулы нуклеиновой кислоты, существующей в клетках в естественных условиях.An "isolated" nucleic acid molecule is a nucleic acid molecule that has been identified and separated from at least one contaminant nucleic acid molecule. The isolated nucleic acid molecule differs from the form or set in which it occurs naturally. Thus, the isolated nucleic acid molecule is different from the nucleic acid molecule that naturally exists in cells.
Сигнальный пептид обеспечивает транспорт белка интереса внутри клетки до целевых органелл или способствует секреции белка интереса в межклеточное пространство.The signal peptide ensures the transport of the protein of interest within the cell to target organelles or promotes the secretion of the protein of interest into the intercellular space.
В некоторых вариантах выделенная нуклеиновая кислота кодирует слитый белок на основе FVIII-BDD с аминокислотной последовательностью SEQ ID NO: 5 и сигнального пептида FIX с аминокислотной последовательностью SEQ ID NO: 2.In some embodiments, the isolated nucleic acid encodes a fusion protein based on FVIII-BDD with the amino acid sequence of SEQ ID NO: 5 and a FIX signal peptide with the amino acid sequence of SEQ ID NO: 2.
В некоторых вариантах выделенная нуклеиновая кислота кодирует слитый белок на основе FVIII-BDD с аминокислотной последовательностью SEQ ID NO: 5 и сигнального пептида Карра цепи иммуноглобулина G с аминокислотной последовательностью SEQ ID NO: 3.In some embodiments, the isolated nucleic acid encodes a fusion protein based on FVIII-BDD with the amino acid sequence of SEQ ID NO: 5 and an immunoglobulin G chain Carr signal peptide with the amino acid sequence of SEQ ID NO: 3.
В некоторых вариантах выделенная нуклеиновая кислота кодирует слитый белок на основе FVIII-BDD с аминокислотной последовательностью SEQ ID NO: 5 и сигнального пептида Lactalbumin с аминокислотной последовательностью SEQ ID NO: 4.In some embodiments, the isolated nucleic acid encodes a fusion protein based on FVIII-BDD with the amino acid sequence of SEQ ID NO: 5 and a Lactalbumin signal peptide with the amino acid sequence of SEQ ID NO: 4.
В некоторых вариантах выделенная нуклеиновая кислота кодирует слитый белок на основе FVIII-BDD и гетерологичного сигнального пептида, который имеет аминокислотную последовательность SEQ ID NO: 7. Данный слитый белок с аминокислотной последовательностью SEQ ID NO: 7 включает FVIII-BDD с аминокислотной последовательностью SEQ ID NO: 5 и сигнальный пептид FIX с аминокислотной последовательностью SEQ ID NO: 2.In some embodiments, the isolated nucleic acid encodes a fusion protein of FVIII-BDD and a heterologous signal peptide that has the amino acid sequence of SEQ ID NO: 7. The fusion protein of the amino acid sequence of SEQ ID NO: 7 includes FVIII-BDD of the amino acid sequence of SEQ ID NO: : 5 and signal peptide FIX with amino acid sequence SEQ ID NO: 2.
В некоторых вариантах выделенная нуклеиновая кислота кодирует слитый белок на основе FVIII-BDD и гетерологичного сигнального пептида, который имеет аминокислотную последовательность SEQ ID NO: 8. Данный слитый белок с аминокислотной последовательностью SEQ ID NO: 8 включает FVIII-BDD с аминокислотной последовательностью SEQ ID NO: 5 и сигнальный пептид Карра цепи иммуноглобулина G с аминокислотной последовательностью SEQ ID NO: 3.In some embodiments, the isolated nucleic acid encodes a fusion protein of FVIII-BDD and a heterologous signal peptide that has the amino acid sequence of SEQ ID NO: 8. The fusion protein of SEQ ID NO: 8 includes FVIII-BDD of the amino acid sequence of SEQ ID NO: 8. : 5 and the Carr signal peptide of the immunoglobulin G chain with the amino acid sequence SEQ ID NO: 3.
В некоторых вариантах выделенная нуклеиновая кислота кодирует слитый белок на основе FVIII-BDD и гетерологичного сигнального пептида, который имеет аминокислотную последовательность SEQ ID NO: 9. Данный слитый белок с аминокислотной последовательностью SEQ ID NO: 9 включает FVIII-BDD с аминокислотной последовательностью SEQ ID NO: 5 и сигнальный пептид Lactalbumin с аминокислотной последовательностью SEQ ID NO: 4.In some embodiments, the isolated nucleic acid encodes a fusion protein of FVIII-BDD and a heterologous signal peptide that has the amino acid sequence of SEQ ID NO: 9. The fusion protein of SEQ ID NO: 9 includes FVIII-BDD of the amino acid sequence of SEQ ID NO: : 5 and Lactalbumin signal peptide with amino acid sequence SEQ ID NO: 4.
В некоторых вариантах выделенная нуклеиновая кислота представляет собой нуклеотидную последовательность SEQ ID NO: 11. Данная нуклеиновая кислота кодирует слитый белок на основе FVIII-BDD и сигнального пептида FIX, который имеет аминокислотную последовательность SEQ ID NO: 7.In some embodiments, the isolated nucleic acid is the nucleotide sequence of SEQ ID NO: 11. The nucleic acid encodes a FVIII-BDD-FIX signal peptide fusion protein that has the amino acid sequence of SEQ ID NO: 7.
В некоторых вариантах выделенная нуклеиновая кислота представляет собой нуклеотидную последовательность SEQ ID NO: 12. Данная нуклеиновая кислота кодирует слитый белок на основе FVIII-BDD и сигнального пептида Карра цепи иммуноглобулина G, который имеет аминокислотную последовательность SEQ ID NO: 8.In some embodiments, the isolated nucleic acid is the nucleotide sequence of SEQ ID NO: 12. The nucleic acid encodes a fusion protein based on FVIII-BDD and the Carr signal peptide of the immunoglobulin G chain, which has the amino acid sequence of SEQ ID NO: 8.
В некоторых вариантах выделенная нуклеиновая кислота представляет собой нуклеотидную последовательность SEQ ID NO: 13. Данная нуклеиновая кислота кодирует слитый белок на основе FVIII-BDD и сигнального пептида Lactalbumin, который имеет аминокислотную последовательность SEQ ID NO: 9.In some embodiments, the isolated nucleic acid is the nucleotide sequence of SEQ ID NO: 13. The nucleic acid encodes a fusion protein of FVIII-BDD and the Lactalbumin signal peptide that has the amino acid sequence of SEQ ID NO: 9.
Экспрессионная кассета. Экспрессионный вектор.Expression cassette. Expression vector.
В одном из аспектов настоящее изобретение относится к экспрессионной кассете, которая включает вышеуказанную нуклеиновую кислоту, которая кодирует слитый белок на основе FVIII-BDD и гетерологичного сигнального пептида.In one aspect, the present invention provides an expression cassette that includes the above nucleic acid that encodes a fusion protein based on FVIII-BDD and a heterologous signal peptide.
Термин «кассета, которая экспрессирует» или «экспрессионная кассета» при использовании в данном документе, в частности, относится к фрагменту ДНК, который способен в соответствующей обстановке запускать экспрессию полинуклеотида, кодирующего представляющий интерес полипептид, последовательность которого включена в указанную экспрессионную кассету. При введении в клетку-хозяина экспрессионная кассета помимо прочего способна задействовать клеточные механизмы для транскрипции полинуклеотида, кодирующего представляющий интерес полипептид, в РНК, которая затем обычно дополнительно процессируется и, наконец, транслируется в представляющий интерес полипептид. Экспрессионная кассета может содержаться в экспрессионном векторе.The term “cassette that expresses” or “expression cassette” as used herein specifically refers to a fragment of DNA that is capable, under appropriate circumstances, of driving the expression of a polynucleotide encoding a polypeptide of interest, the sequence of which is included in the expression cassette. When introduced into a host cell, the expression cassette is, among other things, capable of engaging cellular machinery to transcribe the polynucleotide encoding the polypeptide of interest into RNA, which is then typically further processed and finally translated into the polypeptide of interest. The expression cassette may be contained in an expression vector.
Экспрессионная кассета по настоящему изобретению содержит в качестве элемента промотор. Термин «промотор», используемый в настоящем документе, в частности, относится к элементу ДНК, который способствует транскрипции полинуклеотида, с которым функционально связан промотор. Промотор может также составлять часть элемента «промотор/энхансер». Хотя физические границы между элементами «промотор» и «энхансер» не всегда ясны, термин «промотор» обычно относится к месту на молекуле нуклеиновой кислоты, с которым связывается РНК-полимераза и/или связанные с ней факторы, и с которого инициируется транскрипция. Энхансеры усиливают активность промотора во времени, а также пространственно. В данной области известно множество промоторов, которые транскрипционно активны в широком диапазоне типов клеток. Промоторы могут быть разделены на два класса: на тех, которые функционируют конститутивно, и тех, которые регулируются индукцией или снятием репрессии. Для экспрессии белка пригодны оба класса. Промоторы, которые используются для продукции высокого уровня полипептидов в эукариотических клетках и, в частности, в клетках млекопитающих, должны быть сильными и, предпочтительно, должны быть активными в широком диапазоне типов клеток. Сильные конститутивные промоторы, которые способны запускать экспрессию во многих типах клеток, хорошо известны в данной области и, поэтому, нет необходимости в их подробном описании в данном документе.The expression cassette of the present invention contains a promoter element. The term "promoter" as used herein specifically refers to a DNA element that promotes transcription of a polynucleotide to which the promoter is operably linked. A promoter may also form part of a promoter/enhancer element. Although the physical boundaries between promoter and enhancer elements are not always clear, the term promoter generally refers to the location on a nucleic acid molecule to which RNA polymerase and/or associated factors bind and from which transcription is initiated. Enhancers enhance promoter activity temporally as well as spatially. There are many promoters known in the art that are transcriptionally active in a wide range of cell types. Promoters can be divided into two classes: those that function constitutively and those that are regulated by induction or release of repression. Both classes are suitable for protein expression. Promoters that are used to produce high levels of polypeptides in eukaryotic cells and, in particular, mammalian cells must be strong and preferably must be active in a wide range of cell types. Potent constitutive promoters that are capable of driving expression in many cell types are well known in the art and therefore need not be described in detail herein.
Согласно одному варианту осуществления изобретения промотор HLP используется в экспрессионной кассете по настоящему изобретению.According to one embodiment of the invention, the HLP promoter is used in the expression cassette of the present invention.
В некоторых вариантах экспрессионная кассета включает следующие элементы в направлении от 5'-конца к 3'-концу:In some embodiments, the expression cassette includes the following elements in a 5' to 3' direction:
левый (первый) ITR (инвертированные концевые повторы);left (first) ITR (inverted terminal repeats);
промотор;promoter;
любая из вышеуказанных нуклеиновых кислот, которая кодирует слитый белок на основе FVIII-BDD и гетерологичного сигнального пептида;any of the above nucleic acids that encodes a fusion protein based on FVIII-BDD and a heterologous signal peptide;
сигнал полиаденилирования;polyadenylation signal;
правый (второй) ITR.right (second) ITR.
Вышеуказанные структурные элементы экспрессионной кассеты являются функционально связанными между собой.The above structural elements of the expression cassette are functionally related to each other.
В контексте настоящего описания термин «функционально связанный» относится к связи полинуклеотидных (или полипептидных) элементов в функциональную связь. Нуклеиновая кислота является «функционально связанной», если она находится в условиях функциональной связи с другой последовательностью нуклеиновой кислоты. Например, регуляторная последовательность транскрипции функционально связана с кодирующей последовательностью, если она влияет на транскрипцию указанной кодирующей последовательности. Термин «функционально связанный» означает, что связанные последовательности ДНК являются, как правило, непрерывными, и при необходимости соединения двух участков, кодирующих белок, являются также непрерывными и находятся в рамке считывания.As used herein, the term "operably linked" refers to the linking of polynucleotide (or polypeptide) elements into a functional link. A nucleic acid is "operably linked" if it is in a functional relationship with another nucleic acid sequence. For example, a transcription control sequence is operably linked to a coding sequence if it affects the transcription of said coding sequence. The term "operably linked" means that the linked DNA sequences are generally contiguous and, where appropriate, junctions of two protein-coding regions are also contiguous and in frame.
В некоторых вариантах экспрессионная кассета включает левый (первый) ITR с нуклеотидной последовательностью SEQ ID NO: 14.In some embodiments, the expression cassette includes a left (first) ITR with the nucleotide sequence of SEQ ID NO: 14.
В некоторых вариантах экспрессионная кассета включает промотор HLP с нуклеотидной последовательностью SEQ ID NO: 15.In some embodiments, the expression cassette includes an HLP promoter having the nucleotide sequence of SEQ ID NO: 15.
В некоторых вариантах экспрессионная кассета включает сигнал полиаденилирования с нуклеотидной последовательностью SEQ ID NO: 16.In some embodiments, the expression cassette includes a polyadenylation signal with the nucleotide sequence of SEQ ID NO: 16.
В некоторых вариантах экспрессионная кассета включает правый (второй) ITR с нуклеотидной последовательностью SEQ ID NO: 17.In some embodiments, the expression cassette includes a right (second) ITR with the nucleotide sequence of SEQ ID NO: 17.
В некоторых вариантах экспрессионная кассета включает следующие элементы в направлении от 5'-конца к 3'-концу:In some embodiments, the expression cassette includes the following elements in a 5' to 3' direction:
левый (первый) ITR (инвертированные концевые повторы) с нуклеотидной последовательностью SEQ ID NO: 14;left (first) ITR (inverted terminal repeats) with nucleotide sequence SEQ ID NO: 14;
промотор с нуклеотидной последовательностью SEQ ID NO: 15;promoter with nucleotide sequence SEQ ID NO: 15;
любая из вышеуказанных нуклеиновых кислот, которая кодирует слитый белок на основе FVIII-BDD и гетерологичного сигнального пептида;any of the above nucleic acids that encodes a fusion protein based on FVIII-BDD and a heterologous signal peptide;
сигнал полиаденилирования с нуклеотидной последовательностью SEQ ID NO: 16;polyadenylation signal with nucleotide sequence SEQ ID NO: 16;
правый (второй) ITR с нуклеотидной последовательностью SEQ ID NO: 17.right (second) ITR with nucleotide sequence SEQ ID NO: 17.
В одном из аспектов настоящее изобретение относится к экспрессионному вектору, который включает любую из вышеуказанных нуклеиновых кислот, которая кодирует слитый белок на основе FVIII-BDD и гетерологичного сигнального пептида, или любую из вышеуказанных экспрессионных кассет.In one aspect, the present invention provides an expression vector that includes any of the above nucleic acids that encodes a fusion protein based on FVIII-BDD and a heterologous signal peptide, or any of the above expression cassettes.
В некоторых вариантах осуществления изобретения вектор представляет собой плазмиду, т.е. кольцевую двухцепочечную часть ДНК, в которую могут быть вставлены дополнительные сегменты ДНК.In some embodiments, the vector is a plasmid, i.e. a circular, double-stranded piece of DNA into which additional DNA segments can be inserted.
В некоторых вариантах осуществления изобретения вектор представляет собой вирусный (экспрессионный) вектор, в котором дополнительные сегменты ДНК могут быть вставлены в вирусный геном.In some embodiments, the vector is a viral (expression) vector in which additional DNA segments can be inserted into the viral genome.
В некоторых вариантах осуществления изобретения векторы способны к автономной репликации в клетке-хозяине, в которую они введены (например, бактериальные векторы, имеющие бактериальный сайт инициации репликации и эписомные векторы). В других вариантах осуществления изобретения векторы (например, неэписомальные векторы) могут быть интегрированы в геном клетки-хозяина при введении в клетку-хозяина, и таким образом реплицируются вместе с геном хозяина. Более того, некоторые векторы способны направлять экспрессию генов, с которыми они функционально соединены. Такие векторы упоминаются в данном документе как «рекомбинантные экспрессирующие векторы» (или просто «экспрессирующие векторы» («вектор экспрессии» или «экспрессионный вектор»)).In some embodiments, the vectors are capable of autonomous replication in the host cell into which they are introduced (eg, bacterial vectors having a bacterial origin of replication and episomal vectors). In other embodiments, vectors (eg, non-episomal vectors) can be integrated into the host cell genome upon introduction into the host cell, and thus replicate along with the host gene. Moreover, some vectors are capable of directing the expression of genes to which they are operably linked. Such vectors are referred to herein as “recombinant expression vectors” (or simply “expression vectors” (“expression vector” or “expression vector”)).
Экспрессионные векторы включают плазмиды, ретровирусы, аденовирусы, аденоассоциированные вирусы (AAV), вирусы растений, такие как вирус мозаики цветной капусты, вирусы табачной мозаики, космиды, YAC, EBV и тому подобное. Молекулы ДНК могут быть вставлены в вектор таким образом, что последовательности, контролирующие транскрипцию и трансляцию в векторе, выполняют предусмотренную функцию регуляции транскрипции и трансляции ДНК. Экспрессионный вектор и последовательности контроля экспрессии могут быть выбраны таким образом, чтобы быть совместимыми с используемой экспрессирующей клеткой-хозяином. Молекулы ДНК могут быть введены в экспрессионный вектор стандартными способами (например, лигированием комплементарных сайтов рестрикции или лигированием тупых концов, если сайты рестрикции отсутствуют).Expression vectors include plasmids, retroviruses, adenoviruses, adeno-associated viruses (AAV), plant viruses such as cauliflower mosaic virus, tobacco mosaic virus, cosmid, YAC, EBV and the like. DNA molecules can be inserted into a vector such that the transcription and translation control sequences in the vector perform the intended function of regulating DNA transcription and translation. The expression vector and expression control sequences can be selected to be compatible with the expression host cell used. DNA molecules can be introduced into an expression vector by standard methods (eg, ligation of complementary restriction sites or blunt-end ligation if restriction sites are missing).
В некоторых вариантах экспрессионный вектор представляет собой рекомбинантный аденоассоциированный вирус (AAV).In some embodiments, the expression vector is a recombinant adeno-associated virus (AAV).
В некоторых вариантах AAV выбирают из группы, включающей следующие серотипы AAV: AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10 AAV11, AAV12, AAV13, AAV14, AAV15, AAV16, rAAV.rh8, rAAV.rhlO, rAAV.rh20, rAAV.rh39, rAAV.Rh74, rAAV.RHM4-1, AAV.hu37, rAAV.Anc80, rAAV.Anc80L65, rAAV.7m8, rAAV.PHP.B, rAAV2.5, rAAV2tYF, rAAV3B, rAAV.LK03, AAV.HSC1, AAV.HSC2, AAV.HSC3, AAV.HSC4, AAV.HSC5, AAV.HSC6, AAV.HSC7, AAV.HSC8, AAV.HSC9, AAV.HSC10, AAV.HSC11, AAV.HSC12, AAV.HSC13, AAV.HSC14, AAV.HSC15 или AAV.HSC16.In some embodiments, the AAV is selected from the group consisting of the following AAV serotypes: AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10 AAV11, AAV12, AAV13, AAV14, AAV15, AAV16, rAAV.rh8, rAAV. rhlO, rAAV.rh20, rAAV.rh39, rAAV.Rh74, rAAV.RHM4-1, AAV.hu37, rAAV.Anc80, rAAV.Anc80L65, rAAV.7m8, rAAV.PHP.B, rAAV2.5, rAAV2tYF, rAAV3B, rAAV.LK03, AAV.HSC1, AAV.HSC2, AAV.HSC3, AAV.HSC4, AAV.HSC5, AAV.HSC6, AAV.HSC7, AAV.HSC8, AAV.HSC9, AAV.HSC10, AAV.HSC11, AAV. HSC12, AAV.HSC13, AAV.HSC14, AAV.HSC15 or AAV.HSC16.
В некоторых вариантах осуществления изобретения вектор или кассета может включать последовательность контроля экспрессии. Термин «последовательность контроля экспрессии», используемый в данном описании, означает полинуклеотидные последовательности, которые необходимы для воздействия на экспрессию и процессинг кодирующих последовательностей, к которым они вставлены. Специалистам в этой области будет понятно, что дизайн экспрессионного вектора или кассеты, включая выбор последовательностей контроля экспрессии, может зависеть от таких факторов, как выбор типа клетки-хозяина для трансформации, требуемый уровень экспрессии белка, и т.д. Последовательности контроля экспрессии включают соответствующие последовательности инициации транскрипции, терминации, промотора и энхансера; эффективные сигналы процессинга РНК, такие как сплайсинг и сигналы полиаденилирования; последовательности, которые стабилизируют цитоплазматическую мРНК; последовательности, которые повышают эффективность трансляции (т.е. консенсусная последовательность Козака); последовательности, которые повышают стабильность белка; и, при желании, последовательности, которые усиливают секрецию белка. Характер таких контролирующих последовательностей различается в зависимости от организма-хозяина; в прокариотах такие контролирующие последовательности, как правило, включают промотор, сайт связывания рибосомы, а также последовательности терминации транскрипции; в эукариотах, как правило, такие контролирующие последовательности включают промоторы и последовательности терминации транскрипции. Предпочтительные последовательности контроля экспрессии для экспрессирующей клетки-хозяина млекопитающего включают вирусные элементы обеспечивающие высокий уровень экспрессии белков в клетках млекопитающих, таких как промоторы и/или энхансеры, полученные из ретровирусной LTR, цитомегаловируса (CMV) (например, CMV промотора/энхансера), обезьяньего вируса 40 (SV40) (например, SV40 промотора/энхансера), аденовируса, (например, большого позднего промотора аденовируса (AdMLP)), вирус полиомы, а также сильных промоторов млекопитающих, таких как TTR промотор, промотор нативных иммуноглобулинов, промотор актина, а также промотор HLP (hybrid liver-speciHc promotor). Последовательности контроля экспрессии включают, как минимум, все компоненты, наличие которых имеет важное значение для экспрессии и процессинга.In some embodiments, the vector or cassette may include an expression control sequence. The term "expression control sequence" as used herein refers to polynucleotide sequences that are necessary to influence the expression and processing of the coding sequences to which they are inserted. Those skilled in the art will appreciate that the design of an expression vector or cassette, including the choice of expression control sequences, may depend on factors such as the choice of host cell type for transformation, the desired level of protein expression, etc. Expression control sequences include the corresponding transcription initiation, termination, promoter and enhancer sequences; efficient RNA processing signals such as splicing and polyadenylation signals; sequences that stabilize cytoplasmic mRNA; sequences that increase translation efficiency (ie, the Kozak consensus sequence); sequences that increase protein stability; and, if desired, sequences that enhance protein secretion. The nature of such control sequences varies depending on the host organism; in prokaryotes, such control sequences typically include a promoter, a ribosome binding site, and transcription termination sequences; in eukaryotes, such control sequences typically include promoters and transcription termination sequences. Preferred expression control sequences for a mammalian host expression cell include viral elements that provide high level expression of proteins in mammalian cells, such as promoters and/or enhancers derived from a retroviral LTR, cytomegalovirus (CMV) (eg, CMV promoter/enhancer), simian virus 40 (SV40) (eg, SV40 promoter/enhancer), adenovirus (eg, adenovirus large late promoter (AdMLP)), polyoma virus, as well as strong mammalian promoters such as the TTR promoter, native immunoglobulin promoter, actin promoter, and HLP promoter (hybrid liver-speciHc promoter). Expression control sequences include, at a minimum, all components whose presence is essential for expression and processing.
В дополнение к вышеуказанным генам и последовательностям контроля экспрессии, рекомбинантные векторы экспрессии изобретения могут нести дополнительные последовательности, такие как последовательности, которые регулируют репликацию вектора в клетках-хозяевах (например, точки начала репликации) и гены селектируемого маркера. Ген селектируемого маркера облегчает селекцию клеток-хозяев, в которые был введен вектор или кассету.In addition to the above genes and expression control sequences, the recombinant expression vectors of the invention may carry additional sequences, such as sequences that regulate replication of the vector in host cells (eg, origins of replication) and selectable marker genes. The selectable marker gene facilitates selection of host cells into which the vector or cassette has been introduced.
В одном из вариантов настоящего изобретения экспрессионный вектор относится к вектору, содержащему одну или несколько интересующих полинуклеотидных последовательностей, интересующих генов или трансгенов, которые фланкированы парвовирусными или инвертированными концевыми повторяющимися последовательностями (ITR).In one embodiment of the present invention, an expression vector refers to a vector containing one or more polynucleotide sequences of interest, genes or transgenes of interest that are flanked by parvovirus or inverted terminal repeat (ITR) sequences.
Ни кассета, ни вектор по изобретению не содержит нуклеотидные последовательности генов, кодирующих неструктурные белки (Rep) и структурные белки (Сар) аденоассоциированного вируса.Neither the cassette nor the vector according to the invention contains nucleotide sequences of genes encoding non-structural proteins (Rep) and structural proteins (Cap) of the adeno-associated virus.
Клетка-хозяинHost cell
В одном из аспектов настоящее изобретение относится к клетке-хозяину для получения слитого белка на основе FVIII-BDD и гетерологичного сигнального пептида или для получения любого из вышеуказанных экспрессионных векторов, которая содержит любую из вышеуказанных нуклеиновых кислот, которая кодирует слитый белок на основе FVIII-BDD и гетерологичного сигнального пептида.In one aspect, the present invention relates to a host cell for producing a fusion protein based on FVIII-BDD and a heterologous signal peptide or for producing any of the above expression vectors, which contains any of the above nucleic acids that encodes a fusion protein based on FVIII-BDD and a heterologous signal peptide.
Термин «клетка-хозяин» при использовании в данном документе означает клетку, в которую введен рекомбинантный экспрессионный вектор или кассету по изобретению. Настоящее изобретение относится к клеткам-хозяевам, которые могут включать, например, вектор в соответствии с настоящим изобретением, описанным выше. Следует понимать, что «клетка-хозяин» означают не только конкретную заявленную клетку, но также и потомство такой клетки. Поскольку модификации могут проходить в последующих поколениях вследствие мутации или воздействий окружающей среды, такое потомство не может, на самом деле, быть идентичным родительской клетке, но такие клетки по-прежнему включены в объем термина «клетка-хозяин» при использовании в настоящем документе.The term “host cell” as used herein means a cell into which a recombinant expression vector or cassette of the invention is introduced. The present invention relates to host cells, which may include, for example, a vector in accordance with the present invention described above. It should be understood that "host cell" means not only the specific cell claimed, but also the progeny of such a cell. Since modifications may occur in subsequent generations due to mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but such cells are still included within the scope of the term “host cell” as used herein.
Экспрессионные векторы или кассеты по изобретению могут быть использованы для трансфекции клетки млекопитающего, клетки растения, бактериальной или дрожжевой клетки-хозяина. Трансфекция может происходить любым известным способом для введения полинуклеотидов в клетку хозяина. Способы введения гетерологичных полинуклеотидов в клетки млекопитающих хорошо известны в данной области и включают декстран опосредованную трансфекцию, трансфекцию комплексом нуклеиновой кислоты и позитивно заряженного полимера, трансфекцию преципитатом нуклеиновой кислоты и фосфата кальция, полибрен опосредованную трансфекцию, слияние протопластов, трансфекцию инкапсулированными в липосомы полинуклеотидами и прямую микроинъекцию ДНК в ядра. В дополнение молекулы нуклеиновых кислот могут быть введены в клетки млекопитающих вирусными (экспрессионными) векторами.Expression vectors or cassettes of the invention can be used to transfect a mammalian cell, plant cell, bacterial or yeast host cell. Transfection can occur by any known method for introducing polynucleotides into a host cell. Methods for introducing heterologous polynucleotides into mammalian cells are well known in the art and include dextran-mediated transfection, nucleic acid-positively charged polymer complex transfection, nucleic acid-calcium phosphate precipitate transfection, polybrene-mediated transfection, protoplast fusion, transfection with liposome-encapsulated polynucleotides, and direct microinjection. DNA into nuclei. In addition, nucleic acid molecules can be introduced into mammalian cells by viral (expression) vectors.
Клеточные линии млекопитающих, используемые в качестве хозяев для трансформации, хорошо известны в данной области и включают множество иммортализованных доступных клеточных линий. К ним относятся, например, клетки яичников китайского хомячка (СНО), NS0 клетки, клетки SP2, HEK-293Т клетки, 293 Фристайл клетки (Invitrogen), NIH-3T3 клетки, клетки HeLa, клетки почек хомячка (BHK), клетки почек африканских зеленых мартышек (COS), клетки гепатоцеллюлярной карциномы человека (например, Hep G2), А549 клетки, SK-HEP1, HUH7, Hep-RG и ряд других клеточных линий. Клеточные линии выбираются путем определения, какие клеточные линии имеют высокие уровни экспрессии и обеспечивают необходимые характеристики продуцируемого белка. Другими клеточными линиями, которые могут быть использованы, являются клеточные линии насекомых, такие как Sf9 или Sf21 клетки. Когда векторы рекомбинантной экспрессии по изобретению вводятся в клетки-хозяева млекопитающих слитый белок продуцируется путем культивирования клеток-хозяев в течение времени, достаточного для экспрессии слитого белка в клетках-хозяевах или, предпочтительнее, выделения слитого белка в питательную среду, в которой выращиваются клетки-хозяева. Слитый белок может быть выделен из питательной среды с использованием стандартных методов очистки белка. Клетки-хозяева растений, например, включают Nicotiana, Arabidopsis, ряску, кукурузу, пшеницу, картофель и т.д. Клетки бактерий хозяина включают виды Escherichia и Streptomyces. Дрожжевые клетки-хозяева включают Schizosaccharomyces pombe, Saccharomyces cerevisiae и Pichia pastoris.Mammalian cell lines used as hosts for transformation are well known in the art and include many immortalized cell lines available. These include, for example, Chinese hamster ovary (CHO) cells, NS0 cells, SP2 cells, HEK-293T cells, 293 Freestyle cells (Invitrogen), NIH-3T3 cells, HeLa cells, BHK cells, African kidney cells vervet monkeys (COS), human hepatocellular carcinoma cells (eg Hep G2), A549 cells, SK-HEP1, HUH7, Hep-RG and a number of other cell lines. Cell lines are selected by determining which cell lines have high levels of expression and provide the desired characteristics of the protein produced. Other cell lines that can be used are insect cell lines such as Sf9 or Sf21 cells. When the recombinant expression vectors of the invention are introduced into mammalian host cells, the fusion protein is produced by culturing the host cells for a time sufficient to express the fusion protein in the host cells or, preferably, releasing the fusion protein into a growth medium in which the host cells are grown . The fusion protein can be isolated from the culture medium using standard protein purification methods. Plant host cells, for example, include Nicotiana, Arabidopsis, duckweed, corn, wheat, potato, etc. Host bacterial cells include Escherichia and Streptomyces species. Yeast host cells include Schizosaccharomyces pombe, Saccharomyces cerevisiae and Pichia pastoris.
Вышеуказанная клетка-хозяин не относится к клетке-хозяину, полученной с использованием человеческих эмбрионов.The above host cell does not refer to a host cell obtained using human embryos.
Вышеуказанная клетка-хозяин не относится к клетке-хозяину, полученной с модификации генетической целостности клеток зародышевой линии человека.The above host cell does not refer to a host cell obtained by modifying the genetic integrity of human germline cells.
Фармацевтическая композицияPharmaceutical composition
В одном из аспектов настоящее изобретение относится к фармацевтической композиции для доставки гена FVIII-BDD в целевые клетки, который включает любой из вышеуказанных экспрессионных векторов или кассет.In one aspect, the present invention relates to a pharmaceutical composition for delivering the FVIII-BDD gene to target cells, which includes any of the above expression vectors or cassettes.
В некоторых вариантах фармацевтическая композиция для доставки гена FVIII-BDD в целевые клетки включает любой из вышеуказанных экспрессионных векторов или кассет в сочетании с одним или несколькими фармацевтически приемлемыми эксципиентами.In some embodiments, a pharmaceutical composition for delivering the FVIII-BDD gene to target cells includes any of the above expression vectors or cassettes in combination with one or more pharmaceutically acceptable excipients.
Действующее вещество в вышеуказанной композиции находится в эффективном количестве, например, в биологически эффективном количестве.The active substance in the above composition is in an effective amount, for example, in a biologically effective amount.
«Фармацевтическая композиция» обозначает композицию, включающую в себя любой из вышеуказанных экспрессионных векторов по изобретению и, по крайней мере, один из компонентов, выбранных из группы, состоящей из фармацевтически приемлемых и фармакологически совместимых эксипиентов, наполнителей, растворителей, разбавителей, носителей, вспомогательных средств, распределяющих средств или средств доставки."Pharmaceutical composition" means a composition comprising any of the above expression vectors of the invention and at least one of the components selected from the group consisting of pharmaceutically acceptable and pharmacologically compatible excipients, excipients, solvents, diluents, carriers, auxiliaries , distribution or delivery vehicles.
Фармацевтические композиции по настоящему изобретению и способы их изготовления будут бесспорно очевидными для специалистов в этой области. Производство фармацевтических композиций предпочтительно должно соответствовать требованиям GMP (надлежащей производственной практики).The pharmaceutical compositions of the present invention and methods for their manufacture will be readily apparent to those skilled in the art. The production of pharmaceutical compositions should preferably comply with GMP (Good Manufacturing Practices) requirements.
В некоторых вариантах осуществления фармацевтической композиции она может включать буферную композицию, тонические агенты (осмолитик или осмотический агент), стабилизаторы и/или солюбилизаторы.In some embodiments of the pharmaceutical composition, it may include a buffer composition, tonic agents (osmolytic or osmotic agent), stabilizers and/or solubilizers.
«Фармацевтически приемлемым» считается материал, который не имеет биологических или других противопоказаний, например, материал можно вводить субъекту без каких-либо нежелательных биологических эффектов. Таким образом, такие фармацевтические композиции можно использовать, например, для трансфекции клетки ех vivo или для введения in vivo любого из вышеуказанных экспрессионных векторов по изобретению непосредственно субъекту.A "pharmaceutically acceptable" material is one that has no biological or other contraindications, e.g., the material can be administered to a subject without causing any undesirable biological effects. Thus, such pharmaceutical compositions can be used, for example, for ex vivo transfection of a cell or for in vivo administration of any of the above expression vectors of the invention directly to a subject.
Термин «эксципиент» или «вспомогательное вещество» используется в данном документе для описания любого компонента, отличающегося от ранее описанных по данному изобретению. Это вещества неорганического или органического происхождения, используемые в процессе производства, изготовления лекарственных препаратов для придания им необходимых физико-химических свойств.The term "excipient" or "excipient" is used herein to describe any component other than those previously described in this invention. These are substances of inorganic or organic origin, used in the production process of medicines to give them the necessary physical and chemical properties.
Фармацевтическая композиция по изобретению является стабильной.The pharmaceutical composition according to the invention is stable.
Фармацевтическая композиция является «стабильной», если активный агент сохраняет свою физическую стабильность и/или химическую стабильность и/или биологическую активность в течение заявленного срока годности при температуре хранения, например, при 2-8°С. Предпочтительно, чтобы активный агент сохранял и физическую, и химическую стабильность, а также биологическую активность. Период хранения выбирается на основании результатов исследования стабильности при ускоренном и естественном хранении.A pharmaceutical composition is “stable” if the active agent retains its physical stability and/or chemical stability and/or biological activity through its stated shelf life at storage temperature, for example, 2-8°C. It is preferred that the active agent retains both physical and chemical stability as well as biological activity. The storage period is selected based on the results of stability studies during accelerated and natural storage.
В некоторых вариантах фармацевтическая композиция представляет собой раствор для внутривенного введения.In some embodiments, the pharmaceutical composition is a solution for intravenous administration.
В некоторых вариантах фармацевтическая композиция представляет собой концентрат для приготовления раствора для внутривенного введения.In some embodiments, the pharmaceutical composition is a concentrate for the preparation of a solution for intravenous administration.
ПрименениеApplication
В одном из аспектов настоящее изобретение относится к применению любого из вышеуказанных экспрессионных кассет или векторов или вышеуказанной композиции для доставки гена FVIII-BDD в целевые клетки.In one aspect, the present invention relates to the use of any of the above expression cassettes or vectors or the above composition for delivering the FVIII-BDD gene to target cells.
В одном из аспектов настоящее изобретение относится к применению любого из вышеуказанных экспрессионных кассет или векторов или вышеуказанной композиции для обеспечения белком FVIII-BDD субъекта, который имеет гемофилию А и/или не имеет функциональных копий гена FVIII.In one aspect, the present invention relates to the use of any of the above expression cassettes or vectors or the above composition for providing the FVIII-BDD protein to a subject who has hemophilia A and/or does not have functional copies of the FVIII gene.
Под отсутствием функциональных копий гена FVIII подразумеваются инактивирующие мутации или делеции во всех копиях гена FVIII в геноме, которые приводят к потере или дефекту функции гена FVIII.Absence of functional copies of the FVIII gene refers to inactivating mutations or deletions in all copies of the FVIII gene in the genome that result in loss or defective function of the FVIII gene.
В одном из аспектов настоящее изобретение относится к способу обеспечения белком FVIII-BDD субъекта с гемофилией А, который включает введение терапевтически эффективного количества любого из вышеуказанных экспрессионных векторов или вышеуказанной композиции в клетки субъекта, нуждающегося в этом.In one aspect, the present invention provides a method of providing FVIII-BDD protein to a subject with hemophilia A, which comprises administering a therapeutically effective amount of any of the above expression vectors or the above composition to cells of a subject in need thereof.
В одном из аспектов настоящее изобретение относится к способу доставки гена FVIII-BDD в целевые клетки субъекта с гемофилией А, который включает введение любого из вышеуказанных экспрессионных векторов или вышеуказанной композиции в клетки субъекта.In one aspect, the present invention relates to a method of delivering the FVIII-BDD gene to target cells of a subject with hemophilia A, which includes introducing any of the above expression vectors or the above composition into the cells of the subject.
Под субъектом, нуждающимся в доставке гена FVIII-BDD в целевые клетки, или субъектом, нуждающимся в обеспечении белком FVIII-BDD, подразумевается субъект, который имеет гемофилию А, или субъект, который имеет дефицит фактора свертывания крови FVIII, или субъект, который имеет инактивирующие мутации или делеции в гене FVIII, которые приводят к потере или дефекту функции гена FVIII.By a subject requiring delivery of the FVIII-BDD gene to target cells or a subject requiring provision of the FVIII-BDD protein is meant a subject who has hemophilia A, or a subject who has a deficiency of coagulation factor FVIII, or a subject who has inactivating mutations or deletions in the FVIII gene that result in loss or defective function of the FVIII gene.
В одном из аспектов настоящее изобретение относится к применению любого из вышеуказанных экспрессионных векторов или вышеуказанной композиции для лечения гемофилии А у субъекта, который имеет гемофилию А.In one aspect, the present invention relates to the use of any of the above expression vectors or the above composition for the treatment of hemophilia A in a subject who has hemophilia A.
В одном из аспектов настоящее изобретение относится к способу лечения гемофилии А у субъекта, который включает ведение терапевтически эффективного количества любого из вышеуказанных экспрессионных векторов или вышеуказанной композиции субъекту, который имеет гемофилию А.In one aspect, the present invention provides a method of treating hemophilia A in a subject, which includes administering a therapeutically effective amount of any of the above expression vectors or the above composition to a subject who has hemophilia A.
В некоторых вариантах гемофилия А представляет собой тяжелую форму гемофилии А (активность фактора VIII <1%) или среднетяжелую форму гемофилии А (активность фактора VIII 1-5%).In some embodiments, hemophilia A is a severe form of hemophilia A (factor VIII activity <1%) or a moderate form of hemophilia A (factor VIII activity 1-5%).
Примеры способов введения включают в себя местное применение, интраназальное, ингаляционное, чрезслизистое, трансдермальное, энтеральное (например, пероральное, ректальное), парентеральное (например, внутривенное, подкожное, внутрикожное, внутримышечное) введения, а также инъекции непосредственно в ткань или в орган.Examples of routes of administration include topical, intranasal, inhalational, transmucosal, transdermal, enteral (eg, oral, rectal), parenteral (eg, intravenous, subcutaneous, intradermal, intramuscular) administration, and injection directly into a tissue or organ.
В некоторых вариантах применения любой из вышеуказанных экспрессионных векторов или вышеуказанная композиция вводится субъекту как внутривенная инфузия.In some embodiments, any of the above expression vectors or the above composition is administered to a subject as an intravenous infusion.
Любой из вышеуказанных экспрессионных векторов вводят в организм в эффективном количестве. Любой из вышеуказанных экспрессионных векторов предпочтительно вводят в организм в биологически эффективном количестве. «Биологически эффективное» количество экспрессионного вектора представляет собой количество, которое достаточно, чтобы вызвать трансдукцию клеток и экспрессию последовательности нуклеиновой кислоты в клетке. Если экспрессионный вектор вводят в клетку in vivo, «биологически эффективное» количество экспрессионного вектора представляет собой количество, которое достаточно, чтобы вызвать трансдукцию клеток-мишеней и экспрессию последовательности нуклеиновой кислоты в клетке-мишени.Any of the above expression vectors is introduced into the body in an effective amount. Any of the above expression vectors is preferably administered to the body in a biologically effective amount. A "biologically effective" amount of an expression vector is an amount that is sufficient to cause cell transduction and expression of a nucleic acid sequence in the cell. If an expression vector is introduced into a cell in vivo, a "biologically effective" amount of the expression vector is an amount that is sufficient to cause transduction of the target cells and expression of the nucleic acid sequence in the target cell.
Дозировки любого из вышеуказанных экспрессионных векторов по данному изобретению будут зависеть от способа введения конкретного вектора, и их можно определять рутинными способами.The dosages of any of the above expression vectors of this invention will depend on the method of administration of the particular vector, and can be determined by routine methods.
Клетка для введения любого из вышеуказанных экспрессионных кассет или векторов по данному изобретению может быть клеткой любого типа, включая в себя без ограничения, эпителиальные клетки (например, эпителиальные клетки кожи, дыхательных путей и кишечника), печеночные клетки, мышечные клетки, клетки селезенки, фибробласты, эндотелиальные клетки и тому подобное.The cell for administering any of the above expression cassettes or vectors of this invention may be any cell type, including, but not limited to, epithelial cells (eg, skin, respiratory and intestinal epithelial cells), liver cells, muscle cells, spleen cells, fibroblasts , endothelial cells and the like.
Любой из вышеуказанных экспрессионных кассет или векторов по данному изобретению не используется для модификации генетической целостности клеток зародышевой линии человека.Any of the above expression cassettes or vectors of this invention are not used to modify the genetic integrity of human germline cells.
В некоторых вариантах применения любой из вышеуказанных экспрессионных векторов по данному изобретению, а также препарат на его основе применяют в формате монотерапии.In some embodiments, any of the above expression vectors according to this invention, as well as a drug based on it, are used in a monotherapy format.
В некоторых вариантах применения любой из вышеуказанных экспрессионных векторов по данному изобретению, а также препарат на его основе применяют в комбинации с заместительной терапией концентратами факторов свертывания, десмопрессином и/или ингибиторами фибринолиза.In some embodiments, any of the above expression vectors of this invention, as well as a drug based thereon, are used in combination with replacement therapy with clotting factor concentrates, desmopressin and/or fibrinolysis inhibitors.
В некоторых вариантах применения любой из вышеуказанных экспрессионных векторов по данному изобретению, а также препарат на его основе применяют в комбинации с моноклональным антителом (например, эмицизумаб).In some embodiments, any of the above expression vectors of this invention, as well as a drug based thereon, are used in combination with a monoclonal antibody (eg, emicizumab).
В некоторых вариантах применения любой из вышеуказанных экспрессионных векторов по данному изобретению, а также препарат на его основе применяют в комбинации с препаратами на базе технологии РНК-интерференции (например, фитусиран).In some applications, any of the above expression vectors of this invention, as well as a drug based on it, is used in combination with drugs based on RNA interference technology (for example, fitusiran).
В некоторых вариантах применения любой из вышеуказанных экспрессионных векторов по данному изобретению, а также препарат на его основе вводят субъекту однократно.In some embodiments, any of the above expression vectors of this invention, as well as a drug based thereon, is administered to a subject in a single dose.
В некоторых вариантах применения любой из вышеуказанных экспрессионных векторов по данному изобретению, а также препарат на его основе вводят субъекту многократно.In some embodiments, any of the above expression vectors of this invention, as well as a drug based thereon, are administered to a subject multiple times.
Краткое описание чертежейBrief description of drawings
Фигура 1 представляет собой график, который показывает увеличение уровня продукции белка FVIII-BDD в культуральную жидкость после трансфекции нуклеиновыми кислотами, кодирующими слитый белок на основе FVIII-BDD (фактор свертывания крови VIII с делетированным В доменом) и одного из гетерологичных сигнальных пептидов, по сравнению с нуклеиновой кислотой, кодирующей FVIII-BDD с сигнальным пептидом FVIII дикого типа.Figure 1 is a graph that shows the increase in the level of production of FVIII-BDD protein in the culture fluid after transfection with nucleic acids encoding a fusion protein based on FVIII-BDD (blood coagulation factor VIII with deleted B domain) and one of the heterologous signal peptides, compared with a nucleic acid encoding FVIII-BDD with a wild-type FVIII signal peptide.
1 - уровень белка FVIII-BDD после трансфекции клеток нуклеиновой кислотой, которая кодирует слитый белок на основе человеческого FVIII-BDD и природного сигнального пептида FVIII, который имеет аминокислотную последовательность SEQ ID NO: 6 (СП-FVIII-FVIII-BDD).1 - level of FVIII-BDD protein after transfection of cells with a nucleic acid that encodes a fusion protein based on human FVIII-BDD and natural FVIII signal peptide, which has the amino acid sequence SEQ ID NO: 6 (SP-FVIII-FVIII-BDD).
2 - уровень белка FVIII-BDD после трансфекции клеток нуклеиновой кислотой, которая кодирует слитый белок на основе FVIII-BDD и сигнального пептида FIX, который имеет аминокислотную последовательность SEQ ID NO: 7 (СП-FIX-FVIII-BDD).2 - level of FVIII-BDD protein after transfection of cells with a nucleic acid that encodes a fusion protein based on FVIII-BDD and the FIX signal peptide, which has the amino acid sequence SEQ ID NO: 7 (SP-FIX-FVIII-BDD).
3 - уровень белка FVIII-BDD после трансфекции клеток нуклеиновой кислотой, которая кодирует слитый белок на основе человеческого FVIII-BDD и сигнального пептида Lactalbumin, который имеет аминокислотную последовательность SEQ ID NO: 9 (СП-Lactalbumin-FVIII-BDD).3 - level of FVIII-BDD protein after transfection of cells with a nucleic acid that encodes a fusion protein based on human FVIII-BDD and the Lactalbumin signal peptide, which has the amino acid sequence SEQ ID NO: 9 (SP-Lactalbumin-FVIII-BDD).
4 - уровень белка FVIII-BDD после трансфекции клеток нуклеиновой кислотой, которая кодирует слитый белок на основе человеческого FVIII-BDD и сигнального пептида Карра цепь иммуноглобулина G, который имеет аминокислотную последовательность SEQ ID NO: 8 (СП-IgGK-FVIII-BDD).4 - level of FVIII-BDD protein after transfection of cells with a nucleic acid that encodes a fusion protein based on human FVIII-BDD and the Carr signal peptide immunoglobulin G chain, which has the amino acid sequence SEQ ID NO: 8 (SP-IgGK-FVIII-BDD).
Фигура 2 представляет собой график, который показывает увеличение уровня активности белка FVIII-BDD в культуральной жидкости после трансфекции нуклеиновыми кислотами, кодирующими слитый белок на основе FVIII-BDD (фактор свертывания крови VIII с делетированным В доменом) и одного из гетерологичных сигнальных пептидов, по сравнению с нуклеиновой кислотой, кодирующей FVIII-BDD с сигнальным пептидом FVIII дикого типа.Figure 2 is a graph that shows the increase in the level of FVIII-BDD protein activity in culture fluid after transfection with nucleic acids encoding a FVIII-BDD (blood clotting factor VIII with B domain deleted) fusion protein and one of the heterologous signal peptides, compared with a nucleic acid encoding FVIII-BDD with a wild-type FVIII signal peptide.
1 - уровень активности FVIII-BDD после трансфекции клеток нуклеиновой кислотой, которая кодирует слитый белок на основе человеческого FVIII-BDD и природного сигнального пептида FVIII, который имеет аминокислотную последовательность SEQ ID NO: 6 (СП-FVIII-FVIII-BDD).1 - level of FVIII-BDD activity after transfection of cells with a nucleic acid that encodes a fusion protein based on human FVIII-BDD and natural FVIII signal peptide, which has the amino acid sequence SEQ ID NO: 6 (SP-FVIII-FVIII-BDD).
2 - уровень активности FVIII-BDD после трансфекции клеток нуклеиновой кислотой, которая кодирует слитый белок на основе FVIII-BDD и сигнального пептида FIX, который имеет аминокислотную последовательность SEQ ID NO: 7 (СП-FIX-FVIII-BDD).2 - level of FVIII-BDD activity after transfection of cells with a nucleic acid that encodes a fusion protein based on FVIII-BDD and the FIX signal peptide, which has the amino acid sequence SEQ ID NO: 7 (SP-FIX-FVIII-BDD).
3 - уровень активности FVIII-BDD после трансфекции клеток нуклеиновой кислотой, которая кодирует слитый белок на основе человеческого FVIII-BDD и сигнального пептида Lactalbumin, который имеет аминокислотную последовательность SEQ ID NO: 9 (СП-Lactalbumin-FVIII-BDD).3 - level of FVIII-BDD activity after transfection of cells with a nucleic acid that encodes a fusion protein based on human FVIII-BDD and the Lactalbumin signal peptide, which has the amino acid sequence SEQ ID NO: 9 (SP-Lactalbumin-FVIII-BDD).
4 - уровень активности FVIII-BDD после трансфекции клеток нуклеиновой кислотой, которая кодирует слитый белок на основе человеческого FVIII-BDD и сигнального пептида Карра цепь иммуноглобулина G, который имеет аминокислотную последовательность SEQ ID NO: 8 (СП-IgGK-FVIII-BDD).4 - level of FVIII-BDD activity after transfection of cells with a nucleic acid that encodes a fusion protein based on human FVIII-BDD and the Carr chain signal peptide of immunoglobulin G, which has the amino acid sequence SEQ ID NO: 8 (SP-IgGK-FVIII-BDD).
Фигура 3 представляет собой график, который показывает увеличение уровня продукции белка FVIII-BDD при доставке in vitro нуклеиновых кислот в виде экспрессионного вектора на основе rAAV, содержащего нуклеиновую кислоту, кодирующую слитый белок на основе FVIII-BDD и одного из гетерологичных сигнальных пептидов, по сравнению с экспрессионным вектором на основе rAAV, содержащим нуклеиновую кислоту, кодирующую FVIII-BDD с сигнальным пептидом FVIII дикого типа.Figure 3 is a graph that shows the increase in the level of FVIII-BDD protein production when nucleic acids are delivered in vitro as an rAAV expression vector containing a nucleic acid encoding a FVIII-BDD fusion protein and one of the heterologous signal peptides, compared with an rAAV-based expression vector containing a nucleic acid encoding FVIII-BDD with a wild-type FVIII signal peptide.
1 - уровень белка FVIII-BDD после трансдукции клеток экспрессионными векторами, содержащими нуклеиновую кислоту, которая кодирует слитый белок на основе человеческого FVIII-BDD и природного сигнального пептида FVIII, который имеет аминокислотную последовательность SEQ ID NO: 6 (СП-FVIII-FVIII-BDD).1 - level of FVIII-BDD protein after transduction of cells with expression vectors containing a nucleic acid that encodes a fusion protein based on human FVIII-BDD and natural FVIII signal peptide, which has the amino acid sequence SEQ ID NO: 6 (SP-FVIII-FVIII-BDD ).
2 - уровень белка FVIII-BDD после трансдукции клеток экспрессионными векторами, содержащими нуклеиновую кислоту, которая кодирует слитый белок на основе FVIII-BDD и сигнального пептида FIX, который имеет аминокислотную последовательность SEQ ID NO: 7 (СП-FIX-FVIII-BDD).2 - level of FVIII-BDD protein after transduction of cells with expression vectors containing a nucleic acid that encodes a fusion protein based on FVIII-BDD and the FIX signal peptide, which has the amino acid sequence SEQ ID NO: 7 (SP-FIX-FVIII-BDD).
3 - уровень белка FVIII-BDD после трансдукции клеток экспрессионными векторами, содержащими нуклеиновую кислоту, которая кодирует слитый белок на основе человеческого FVIII-BDD и сигнального пептида Lactalbumin, который имеет аминокислотную последовательность SEQ ID NO: 9 (СП-Lactalbumin FVIII-BDD).3 - level of FVIII-BDD protein after transduction of cells with expression vectors containing a nucleic acid that encodes a fusion protein based on human FVIII-BDD and the Lactalbumin signal peptide, which has the amino acid sequence SEQ ID NO: 9 (SP-Lactalbumin FVIII-BDD).
Фигура 4 представляет собой график, который показывает увеличение уровня активности белка FVIII-BDD при доставке in vitro нуклеиновых кислот в виде экспрессионного вектора на основе rAAV, содержащего нуклеиновую кислоту, кодирующую слитый белок на основе FVIII-BDD и одного из гетерологичных сигнальных пептидов, по сравнению с экспрессионным вектором на основе rAAV, содержащим нуклеиновую кислоту, кодирующую FVIII-BDD с сигнальным пептидом FVIII дикого типа.Figure 4 is a graph that shows the increase in the level of FVIII-BDD protein activity when nucleic acids are delivered in vitro as an rAAV expression vector containing a nucleic acid encoding a FVIII-BDD fusion protein and one of the heterologous signal peptides, compared with an rAAV-based expression vector containing a nucleic acid encoding FVIII-BDD with a wild-type FVIII signal peptide.
1 - уровень активности FVIII-BDD после трансдукции клеток экспрессионными векторами, содержащими нуклеиновую кислоту, которая кодирует слитый белок на основе человеческого FVIII-BDD и природного сигнального пептида FVIII, который имеет аминокислотную последовательность SEQ ID NO: 6 (СП-FVIII-FVIII-BDD).1 - level of FVIII-BDD activity after transduction of cells with expression vectors containing a nucleic acid that encodes a fusion protein based on human FVIII-BDD and natural FVIII signal peptide, which has the amino acid sequence SEQ ID NO: 6 (SP-FVIII-FVIII-BDD ).
2 - уровень активности FVIII-BDD после трансдукции клеток экспрессионными векторами, содержащими нуклеиновую кислоту, которая кодирует слитый белок на основе FVIII-BDD и сигнального пептида FIX, который имеет аминокислотную последовательность SEQ ID NO: 7 (СП-FIX-FVIII-BDD).2 - level of FVIII-BDD activity after transduction of cells with expression vectors containing a nucleic acid that encodes a fusion protein based on FVIII-BDD and the FIX signal peptide, which has the amino acid sequence SEQ ID NO: 7 (SP-FIX-FVIII-BDD).
3 - уровень активности FVIII-BDD после трансдукции клеток экспрессионными векторами, содержащими нуклеиновую кислоту, которая кодирует слитый белок на основе человеческого FVIII-BDD и сигнального пептида Lactalbumin, который имеет аминокислотную последовательность SEQ ID NO: 9 (СП-Lactalbumin FVIII-BDD).3 - level of FVIII-BDD activity after transduction of cells with expression vectors containing a nucleic acid that encodes a fusion protein based on human FVIII-BDD and the Lactalbumin signal peptide, which has the amino acid sequence SEQ ID NO: 9 (SP-Lactalbumin FVIII-BDD).
Фигура 5 представляет собой график, который показывает увеличение уровня белка FVIII-BDD при доставке нуклеиновой кислоты, кодирующей слитый белок на основе FVIII-BDD и одного из гетерологичных сигнальных пептидов, in vivo мышам линии B6.129S-F8tm1Smoc (HemA) в виде экспрессионного вектора на основе rAAV.Figure 5 is a graph that shows the increase in FVIII-BDD protein levels when a nucleic acid encoding a fusion protein of FVIII-BDD and one of the heterologous signal peptides is delivered in vivo to B6.129S-F8tm1Smoc (HemA) mice as an expression vector based on rAAV.
1 - уровень белка FVIII-BDD в плазме крови животных после инъекции контрольный раствор без AAV (негативный контроль).1 - level of FVIII-BDD protein in the blood plasma of animals after injection of a control solution without AAV (negative control).
2 - уровень белка FVIII-BDD в плазме крови животных после инъекции экспрессионного вектора, содержащего нуклеиновую кислоту, которая кодирует слитый белок на основе FVIII-BDD и сигнального пептида FIX, который имеет аминокислотную последовательность SEQ ID NO: 7 (СП-FIX-FVIII-BDD).2 - level of FVIII-BDD protein in the blood plasma of animals after injection of an expression vector containing a nucleic acid that encodes a fusion protein based on FVIII-BDD and the FIX signal peptide, which has the amino acid sequence SEQ ID NO: 7 (SP-FIX-FVIII- BDD).
3 - уровень белка FVIII-BDD в плазме крови животных после инъекции экспрессионного вектора, содержащего нуклеиновую кислоту, которая кодирует слитый белок на основе человеческого FVIII-BDD и сигнального пептида Lactalbumin, который имеет аминокислотную последовательность SEQ ID NO: 9 (СП-Lactalbumin FVIII-BDD).3 - level of FVIII-BDD protein in the blood plasma of animals after injection of an expression vector containing a nucleic acid that encodes a fusion protein based on human FVIII-BDD and the Lactalbumin signal peptide, which has the amino acid sequence SEQ ID NO: 9 (SP-Lactalbumin FVIII- BDD).
ПримерыExamples
Для наилучшего понимания изобретения приводятся следующие примеры. Эти примеры приведены только в иллюстративных целях и не должны толковаться как ограничивающие сферу применения изобретения в любой форме.For a better understanding of the invention, the following examples are provided. These examples are provided for illustrative purposes only and should not be construed as limiting the scope of the invention in any form.
Все публикации, патенты и патентные заявки, указанные в этой спецификации включены в данный документ путем отсылки. Хотя вышеупомянутое изобретение было довольно подробно описано путем иллюстрации и примера в целях исключения двусмысленного толкования, специалистам в данной области на основе идей, раскрытых в данном изобретении, будет вполне понятно, что могут быть внесены определенные изменения и модификации без отклонения от сущности и объема прилагаемых вариантов осуществления изобретения.All publications, patents and patent applications referenced in this specification are incorporated herein by reference. Although the foregoing invention has been described in some detail by way of illustration and example in order to avoid ambiguity, those skilled in the art will appreciate from the teachings disclosed herein that certain changes and modifications may be made without departing from the spirit and scope of the appended embodiments. implementation of the invention.
Материалы и общие методыMaterials and general methods
Методы рекомбинантной ДНКRecombinant DNA Methods
Для манипуляций с ДНК использовали стандартные методы, описанные у Sambrook J. и др., Molecular cloning: A laboratory manual; Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989. Реагенты для молекулярной биологии использовали согласно инструкциям производителей. Вкратце, плазмидную ДНК нарабатывали для дальнейших манипуляций в клетках Е. coli, выращиваемых под селективным давлением с антибиотиками для того, чтобы плазмиды не терялись в клеточной популяции. Плазмидную ДНК выделяли из клеток коммерческими наборами, измеряли концентрацию и использовали для клонирования с помощью обработки эндонуклеазами рестрикции или методами ПЦР-амплификации. Фрагменты ДНК лигировали между собой с помощью лигаз и трансформировали в бактериальные клетки для отбора клонов и дальнейших наработок. Все полученные генетические конструкции подтверждали по паттернам рестрикции и полным секвенированием по Сэнгеру.For DNA manipulation, standard methods were used as described in Sambrook J. et al., Molecular cloning: A laboratory manual; Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989. Molecular biology reagents were used according to the manufacturers' instructions. Briefly, plasmid DNA was produced for further manipulation in E. coli cells grown under selective pressure with antibiotics to ensure that plasmids were not lost in the cell population. Plasmid DNA was isolated from cells using commercial kits, the concentration was measured, and used for cloning using restriction endonuclease treatment or PCR amplification methods. DNA fragments were ligated with each other using ligases and transformed into bacterial cells for clone selection and further development. All obtained genetic constructs were confirmed by restriction patterns and full Sanger sequencing.
Синтез геновGene synthesis
Требуемые сегменты генов получали из олигонуклеотидов, созданных путем химического синтеза. Генные фрагменты длиной от 300 до 1000 п.н., которые фланкированы уникальными сайтами рестрикции, собирали путем ренатурации олигонуклеотидов друг на друге с последующей ПЦР-амплификацией с крайних праймеров. В результате получали смесь фрагментов, включая нужный. Фрагменты клонировали по сайтам рестрикции в промежуточные векторы, после чего последовательности ДНК субклонированных фрагментов подтверждали путем секвенирования ДНК.The required gene segments were obtained from oligonucleotides created by chemical synthesis. Gene fragments ranging in length from 300 to 1000 bp, which are flanked by unique restriction sites, were assembled by renaturation of oligonucleotides on each other, followed by PCR amplification from extreme primers. As a result, a mixture of fragments was obtained, including the desired one. The fragments were cloned at restriction sites into intermediate vectors, after which the DNA sequences of the subcloned fragments were confirmed by DNA sequencing.
Определение последовательностей ДНКDetermination of DNA sequences
Последовательности ДНК определяли путем секвенирования по Сэнгеру. Анализ последовательностей ДНК и белков и обработку данных о последовательностях осуществляли в программе SnapGene 4.2 и выше для создания, картирования, анализа, аннотирования и иллюстрации последовательностей.DNA sequences were determined by Sanger sequencing. DNA and protein sequence analysis and sequence data processing were performed using SnapGene 4.2 and higher to generate, map, analyze, annotate, and illustrate sequences.
Культивирование клеточных культурCell culture cultivation
В экспериментах были использованы клеточные линии HEK293 (Human Embryonic Kidney clone 293), HUH7 (human hepatocellular carcinoma cell lines) и HepG2 (human hepatocellular carcinoma cell lines). Суспензионные клетки HEK293, используемые для наработки AAV, культивировались в стандартных условиях при 37°С и 5% CO2, на полной питательной среде без FBS и антибиотика. Адгезионные клетки HUH7 и HepG2, используемые для проверки эффективности препаратов AAV, культивировались в стандартных условиях при 37°С и 5% СО2, на полной питательной среде DMEM добавлением 10% FBS, антибиотика/антимикотика. Пересев клеток HUH7 и HepG2 осуществлялся при достижении 80-90% конфлюентности. Для диссоциации клеточного монослоя использовался TrypLE Select enzyme (10х). Жизнеспособность клеток оценивалась с помощью окраски Trypan Blue и одноразовых камер для подсчета клеток с помощью автоматического счетчика Countess II.The cell lines HEK293 (Human Embryonic Kidney clone 293), HUH7 (human hepatocellular carcinoma cell lines) and HepG2 (human hepatocellular carcinoma cell lines) were used in the experiments. Suspension HEK293 cells used for AAV production were cultured under standard conditions at 37°C and 5% CO 2 in complete nutrient medium without FBS and antibiotic. Adherent HUH7 and HepG2 cells, used to test the effectiveness of AAV drugs, were cultured under standard conditions at 37°C and 5% CO 2 , in complete DMEM nutrient medium supplemented with 10% FBS, an antibiotic/antimycotic. Reseeding of HUH7 and HepG2 cells was carried out when 80-90% confluency was achieved. TrypLE Select enzyme (10x) was used to dissociate the cell monolayer. Cell viability was assessed using Trypan Blue staining and disposable cell counting chambers using a Countess II automated counter.
Трансфекции клеточных культурCell culture transfections
Для оценки функциональности новых вариантов слитых белков при трансфекции использовались плазмиды, содержащие экспрессионную кассету для экспрессии различных вариантов трансгенов hFVIII-BDD. Клеточную линию HepG2 заранее засевали в лунки 12-луночных планшетов с плотностью 10000 кл/см2. Через сутки вносились плазмиды в одинаковой копийности в составе комплекса с Lipofectamine 3000. На 7 день после трансфекции определяли содержание белка FVIII-BDD и его активность в культуральной жидкости методом ИФА и хромогенного теста. Работы по оценке уровня и активности белка FVIII-BDD в культуральной жидкости проводились в 6 независимых экспериментах. Для негативного контроля были использованы интактные клетки HepG2.To evaluate the functionality of new fusion protein variants during transfection, plasmids containing an expression cassette for expressing various variants of hFVIII-BDD transgenes were used. The HepG2 cell line was pre-seeded into the wells of 12-well plates with a density of 10,000 cells/ cm2 . A day later, plasmids were introduced in the same copy number as part of a complex with Lipofectamine 3000. On the 7th day after transfection, the content of the FVIII-BDD protein and its activity in the culture liquid were determined by ELISA and chromogenic test. Work to assess the level and activity of the FVIII-BDD protein in the culture liquid was carried out in 6 independent experiments. Intact HepG2 cells were used as a negative control.
Сборка и очистка экспрессионных векторов на основе AAVAssembly and purification of AAV-based expression vectors
Для сборки экспрессионных векторов на основе AAV, содержащих кодон-оптимизированные варианты гена FVIII-BDD, использовали клетки-продуценты HEK293, которые трансфецировали 3-мя плазмидами:To assemble AAV-based expression vectors containing codon-optimized variants of the FVIII-BDD gene, HEK293 producer cells were used, which were transfected with 3 plasmids:
1) плазмиды, содержащие экспрессионную кассету AAV для экспрессии различных вариантов трансгенов hFVIII-BDD;1) plasmids containing an AAV expression cassette for the expression of various variants of hFVIII-BDD transgenes;
2) плазмида для экспрессии гена Сар серотипа AAV6 и гена Rep серотипа AAV2. Каждый ген с помощью альтернативных рамок считывания кодирует несколько белковых продуктов;2) plasmid for expression of the Sar gene of serotype AAV6 and the Rep gene of serotype AAV2. Each gene encodes multiple protein products using alternative reading frames;
3) плазмида для экспрессии генов аденовируса Ad2, необходимых для сборки и упаковки капсидов AAV.3) a plasmid for the expression of Ad2 adenovirus genes necessary for the assembly and packaging of AAV capsids.
Через 72 часа клетки лизировали и проводили очистку и концентрирование частиц с помощью методов фильтрации, хроматографии и ультрацентрифугирования. Титр частиц определяли с помощью количественной ПЦР с праймерами и пробой, специфичными к участку рекомбинантного вирусного генома, и выражали в виде количества копий вирусных геномов на 1 мл.After 72 hours, the cells were lysed and the particles were purified and concentrated using filtration, chromatography and ultracentrifugation methods. The particle titer was determined using quantitative PCR with primers and a probe specific to a region of the recombinant viral genome and expressed as the number of copies of viral genomes per 1 ml.
Трансдукция клеточных культурTransduction of cell cultures
Клеточную линию HUH7 заранее засевали в лунки 12-луночных планшетов с плотностью 10000 кл/см2. После прикрепления клеток к адгезивной подложке, вносились препараты AAV при MOI 500000 вг/кл. На 7 день после трансдукции определяли содержание белка FVIII-BDD и его активность в культуральной жидкости методом ИФА и хромогенным тестом. Работы по оценке уровня и активности белка FVIII-BDD в культуральной жидкости проводились в 6 независимых экспериментах. Для негативного контроля были использованы интактные клетки.The HUH7 cell line was pre-seeded into the wells of 12-well plates with a density of 10,000 cells/ cm2 . After cell attachment to the adhesive substrate, AAV preparations were applied at an MOI of 500,000 vg/cell. On day 7 after transduction, the content of the FVIII-BDD protein and its activity in the culture liquid was determined by ELISA and chromogenic test. Work to assess the level and activity of the FVIII-BDD protein in the culture liquid was carried out in 6 independent experiments. Intact cells were used for negative control.
Определение количества белка фактора свертывания крови VIII BDD методом ИФАDetermination of the amount of blood coagulation factor VIII BDD protein by ELISA
Оценка содержания белка фактора свертывания крови VIII-BDD в культуральной жидкости после трансфекции клеток линии HepG2 и трансдукции клеток линии HUH7, а также после инъекции животным (мыши) в плазме крови целевыми кандидатами проводилась «сэндвич»-методом неконкурентного твердофазного иммуноферментного анализа (ИФА). Вкратце, разведенные в буфере для разведений образцы, вносились в лунки 96-луночного планшета, сенсибилизированные первичными специфическими к фактору свертывания крови VIII-BDD антителами. В этот же планшет вносились стандарты для построения калибровочной кривой, контроли. Планшет инкубировался 1 час при температуре 37°С. Производили отмывку лунок планшета отмывочным буфером перед внесением биотинилированных антител, раствора конъюгата стрептавидин-пероксидазы и ТМВ. Вносился раствор со специфическими биотинилированными детектирующими антителами к фактору VIII-BDD и планшет инкубировался 30 минут при температуре 37°С. Далее к образовавшемуся комплексу добавлялся раствор конъюгата стрептавидин-пероксидазы и планшет инкубировался 30 минут при температуре 37°С. Для визуализации ферментативной реакции вносился раствор ТМВ. По достижении нужной степени интенсивности окрашивания во все лунки добавляется стоп-раствор для остановки реакции. Далее измерялась оптическая плотность растворов в лунках планшета. Концентрация фактора свертывания крови VIII-BDD в исследуемых образцах определялась по калибровочной кривой с учетом предварительного разведения образцов.Assessment of the protein content of blood coagulation factor VIII-BDD in the culture fluid after transfection of HepG2 cells and transduction of HUH7 cells, as well as after injection of target candidates into the blood plasma of animals (mice), was carried out using the “sandwich” method of non-competitive enzyme-linked immunosorbent assay (ELISA). Briefly, samples diluted in dilution buffer were added to wells of a 96-well plate coated with primary coagulation factor VIII-BDD-specific antibodies. Standards for constructing a calibration curve and controls were added to the same tablet. The plate was incubated for 1 hour at 37°C. The wells of the plate were washed with washing buffer before adding biotinylated antibodies, a solution of streptavidin-peroxidase conjugate and TMB. A solution with specific biotinylated detection antibodies to factor VIII-BDD was added and the plate was incubated for 30 minutes at 37°C. Next, a solution of streptavidin-peroxidase conjugate was added to the resulting complex and the plate was incubated for 30 minutes at 37°C. To visualize the enzymatic reaction, a TMB solution was added. Once the desired degree of color intensity is achieved, a stop solution is added to all wells to stop the reaction. Next, the optical density of the solutions in the wells of the plate was measured. The concentration of blood coagulation factor VIII-BDD in the studied samples was determined using a calibration curve, taking into account preliminary dilution of the samples.
Определение уровня активности белка фактора свертывания крови VIII BDD методом ИФАDetermination of the level of protein activity of blood coagulation factor VIII BDD by ELISA
Оценка активности белка фактора свертывания крови VIII в культуральной жидкости после трансфекции клеток линии HepG2 и трансдукции клеток линии HUH7 целевыми кандидатами проводилась при помощи хромогенного теста. Суть теста заключается в том, что в присутствии ионов кальция, фосфолипидов и фактора IXa фактор X переходит в активированную форму Ха, фактор VIII действует как кофактор в этой реакции, и скорость активации фактора X линейно связана с количество фактора VIII. Вкратце, разведенные в буфере для разведений образцы культуральной жидкости, стандарты для построения калибровочной кривой и контроли вносились в лунки 96-луночного планшета. Планшет инкубировался 3 минуты при температуре 37°С. Во все лунки планшета вносили раствор Factor reagent, содержащий фактор IXa, фактор X, тромбин, CaCl2 и фосфолипиды. Планшет инкубировался 4 минуты при температуре 37°С. Во все лунки планшета вносили раствор хромогенного субстрата S-2765+I-2581. Планшет инкубировался 7 минут при температуре 37°С. По достижению нужной степени интенсивности окрашивания во все лунки добавляли 20% раствор уксусной кислоты для остановки реакции. Далее измерялась оптическая плотность растворов в лунках планшета. Активность фактора свертывания крови VIII в исследуемых образцах определялась по калибровочной кривой с учетом предварительного разведения образцов.The activity of blood coagulation factor VIII protein in the culture fluid after transfection of HepG2 cells and transduction of HUH7 cells with target candidates was assessed using a chromogenic test. The essence of the test is that in the presence of calcium ions, phospholipids and factor IXa, factor X transforms into the activated form of Xa, factor VIII acts as a cofactor in this reaction, and the rate of activation of factor X is linearly related to the amount of factor VIII. Briefly, culture fluid samples, standard curve standards, and controls diluted in dilution buffer were added to the wells of a 96-well plate. The plate was incubated for 3 minutes at 37°C. A Factor reagent solution containing factor IXa, factor X, thrombin, CaCl 2 and phospholipids was added to all wells of the plate. The plate was incubated for 4 minutes at 37°C. A solution of the chromogenic substrate S-2765+I-2581 was added to all wells of the plate. The plate was incubated for 7 minutes at 37°C. Once the desired degree of color intensity was achieved, a 20% acetic acid solution was added to all wells to stop the reaction. Next, the optical density of the solutions in the wells of the plate was measured. The activity of blood coagulation factor VIII in the studied samples was determined using a calibration curve, taking into account the preliminary dilution of the samples.
Проведение in vivo исследования на лабораторных животныхConducting in vivo studies on laboratory animals
Для экспериментов были использованы мыши линии B6.129S-F8tm1Smoc (HemA), дефицитные по FVIII (самцы возрастом 6-8 недель). Препараты вводили животным однократно путем внутривенного введения в хвостовую вену. Группе животных отрицательного контроля вводился буферный раствор, не содержащий AAV. Отбор плазмы крови производился в день инъекции до введения препаратов, далее - на 14 и 56 дни после введения экспрессионных векторов.For the experiments, mice of the B6.129S-F8tm1Smoc (HemA) line deficient in FVIII (males 6-8 weeks old) were used. The drugs were administered to animals once by intravenous injection into the tail vein. A group of negative control animals was injected with a buffer solution that did not contain AAV. Blood plasma was collected on the day of injection before administration of the drugs, then on days 14 and 56 after administration of expression vectors.
Статистический анализ данныхStatistical data analysis
Результаты представлены в виде среднего значения±стандартное отклонение (SD), для сравнения результатов эксперимента использовали однофакторный дисперсионный анализ с поправкой на множественные попарные сравнения по методу Даннета (One-way ANOVA), и они были определены как статистически значимые.Results are presented as mean ± standard deviation (SD), and Dunnett's One-way ANOVA was used to compare experimental results and were determined to be statistically significant.
Пример 1Example 1
С целью повышения уровня секреции белка FVIII-BDD (фактор свертывания крови VIII с делетированным В доменом) последовательность SEQ ID NO: 6 (СП-FVIII-FVIII-BDD) была модифицирована путем замены последовательности сигнального пептида FVIII дикого типа на сигнальный пептид, соответствующий аминокислотной последовательности, указанной в SEQ ID NO: 2 (СП-FIX) или SEQ ID NO: 3 (СП-IgGK) или SEQ ID NO: 4 (СП-Lactalbumin), что привело к получению слитых белков, соответствующих аминокислотным последовательностям SEQ ID NO: 7 (СП-FIX-FVIII-BDD), SEQ ID NO: 8 (СП-IgGK-FVIII-BDD) и SEQ ID NO: 9 (СП-LactalbuminFVIII-BDD).In order to increase the level of secretion of the FVIII-BDD protein (coagulation factor VIII with the B domain deleted), the sequence SEQ ID NO: 6 (SP-FVIII-FVIII-BDD) was modified by replacing the signal peptide sequence of wild type FVIII with a signal peptide corresponding to the amino acid sequence specified in SEQ ID NO: 2 (SP-FIX) or SEQ ID NO: 3 (SP-IgGK) or SEQ ID NO: 4 (SP-Lactalbumin), resulting in fusion proteins corresponding to the amino acid sequences of SEQ ID NO : 7 (SP-FIX-FVIII-BDD), SEQ ID NO: 8 (SP-IgGK-FVIII-BDD) and SEQ ID NO: 9 (SP-LactalbuminFVIII-BDD).
Полученные нами нуклеиновые кислоты, кодирующие слитые белки на основе FVIII-BDD и одного из гетерологичных сигнальных пептидов, которые включают аминокислотную последовательность, выбранную из SEQ ID NO: 7, SEQ ID NO: 8 или SEQ ID NO: 9, были проверены при трансфекции клеток линии in vitro в составе экспрессионной кассеты, состоящей из левого (первого) ITR (инвертированные концевые повторы), соответствующего последовательности SEQ ID NO: 14, промотора HLP (SEQ ID NO: 15), гена интереса, сигнала полиаденилирования (SEQ ID NO: 16), правого (второго) ITR (SEQ ID NO: 17), где геном интереса является одна из последовательностей SEQ ID NO: 10-13. В качестве контроля использовалась нуклеиновая кислота, которая кодирует белок человеческого FVIII-BDD, включая природный сигнальный пептид FVIII-BDD, соответствующая последовательности SEQ ID NO 10 (СП-FVIII-FVIII-BDD).Our obtained nucleic acids encoding fusion proteins based on FVIII-BDD and one of the heterologous signal peptides, which include an amino acid sequence selected from SEQ ID NO: 7, SEQ ID NO: 8 or SEQ ID NO: 9, were tested during cell transfection line in vitro as part of an expression cassette consisting of a left (first) ITR (inverted terminal repeats) corresponding to the sequence SEQ ID NO: 14, an HLP promoter (SEQ ID NO: 15), a gene of interest, a polyadenylation signal (SEQ ID NO: 16 ), right (second) ITR (SEQ ID NO: 17), where the genome of interest is one of the sequences SEQ ID NO: 10-13. A nucleic acid that encodes the human FVIII-BDD protein, including the natural FVIII-BDD signal peptide corresponding to the sequence SEQ ID NO 10 (SP-FVIII-FVIII-BDD), was used as a control.
Использование всех нуклеиновых кислот SEQ ID NO: 11-13, кодирующих слитые белки на основе FVIII-BDD и одного из гетерологичных сигнальных пептидов с последовательностями SEQ ID NO: 7-9, привело к увеличению уровня продукции (Фигура 1) и активности (Фигура 2) белка FVIII-BDD по сравнению с использованием нуклеиновой кислоты SEQ ID NO: 10, кодирующей белок FVIII-BDD с природным сигнальным пептидом SEQ ID NO: 6 (СП-FVIII-FVIII-BDD). При этом неожиданное увеличение уровня продукции белка FVIII-BDD в полученной после трансфекции культуральной жидкости по сравнению с использованием нуклеиновой кислоты SEQ ID NO: 10 (СП-FVIII-FVIII-BDD) в 4,3 раза показала последовательность SEQ ID NO: 11 (СП-FIX-FVIII-BDD), в 1,3 раза показала последовательность SEQ ID NO: 12 (СП-IgGK-FVIII-BDD), и в 4,9 раз показала последовательность SEQ ID NO: 13 (СП-LactalbuminFVIII-BDD). Аналогичные данные наблюдались по активности FVIII-BDD в полученной после трансфекции культуральной жидкости: наблюдалось неожиданное увеличение при сравнении нуклеиновой кислоты SEQ ID NO: 10 (СП-FVIII-FVIII-BDD) с нуклеиновыми кислотами SEQ ID NO: 11 (СП-FIX-FVIII-BDD) (увеличение в 8,6 раз), с SEQ ID NO: 12 (СП-IgGK-FVIII-BDD) (увеличение в 1,9 раз) и с SEQ ID NO: 13 (СП-LactalbuminFVIII-BDD) (увеличение в 7,2 раза). Наблюдаемое соответствие между результатами по продукции и активности белка FVIII-BDD говорит о том, что замена природного сигнального пептида FVIII-BDD на сигнальный пептид, соответствующий аминокислотной последовательности указанной в SEQ ID NO: 2 (СП-FIX), или SEQ ID NO: 3 (СП-IgGK), или SEQ ID NO: 4 (СП-Lactalbumin) приводит к значительному увеличению продукции и активности белка FVIII-BDD, но при этом не влияет на функциональность белка FVIII-BDD.The use of all nucleic acids SEQ ID NO: 11-13, encoding fusion proteins based on FVIII-BDD and one of the heterologous signal peptides with the sequences SEQ ID NO: 7-9, led to increased levels of production (Figure 1) and activity (Figure 2 ) FVIII-BDD protein compared to the use of the nucleic acid SEQ ID NO: 10 encoding the FVIII-BDD protein with the natural signal peptide SEQ ID NO: 6 (SP-FVIII-FVIII-BDD). At the same time, an unexpected increase in the level of production of the FVIII-BDD protein in the culture liquid obtained after transfection compared to the use of the nucleic acid SEQ ID NO: 10 (SP-FVIII-FVIII-BDD) by 4.3 times was shown by the sequence SEQ ID NO: 11 (SP -FIX-FVIII-BDD), 1.3 times showed the sequence SEQ ID NO: 12 (SP-IgGK-FVIII-BDD), and 4.9 times showed the sequence SEQ ID NO: 13 (SP-LactalbuminFVIII-BDD) . Similar data were observed for FVIII-BDD activity in the resulting culture fluid after transfection: an unexpected increase was observed when comparing nucleic acid SEQ ID NO: 10 (SP-FVIII-FVIII-BDD) with nucleic acid SEQ ID NO: 11 (SP-FIX-FVIII -BDD) (8.6 times increase), with SEQ ID NO: 12 (SP-IgGK-FVIII-BDD) (1.9 times increase) and with SEQ ID NO: 13 (SP-LactalbuminFVIII-BDD) ( an increase of 7.2 times). The observed correspondence between the results for the production and activity of the FVIII-BDD protein suggests that the replacement of the natural signal peptide FVIII-BDD with a signal peptide corresponding to the amino acid sequence specified in SEQ ID NO: 2 (SP-FIX), or SEQ ID NO: 3 (SP-IgGK), or SEQ ID NO: 4 (SP-Lactalbumin) leads to a significant increase in the production and activity of the FVIII-BDD protein, but does not affect the functionality of the FVIII-BDD protein.
Таким образом, полученные нами нуклеиновые кислоты, кодирующие слитые белки на основе FVIII-BDD и одного из гетерологичных сигнальных пептидов, которые включают аминокислотную последовательность, выбранную из SEQ ID NO: 7, SEQ ID NO: 8 или SEQ ID NO: 9, способны приводить к экспрессии белка FVIII-BDD in vitro в клетке-хозяине и обладают высоким потенциалом для получения рекомбинантного белка FVIII-BDD в клетках-продуцентах для терапии Гемофилии А.Thus, the nucleic acids we have obtained encoding fusion proteins based on FVIII-BDD and one of the heterologous signal peptides, which include an amino acid sequence selected from SEQ ID NO: 7, SEQ ID NO: 8 or SEQ ID NO: 9, are capable of causing to the expression of the FVIII-BDD protein in vitro in the host cell and have a high potential for producing recombinant FVIII-BDD protein in producer cells for the treatment of Hemophilia A.
Пример 2Example 2
Были получены экспрессионные векторы на основе rAAV, содержащие нуклеиновые кислоты SEQ ID NO: 11-13, которые кодируют слитые белки на основе FVIII-BDD и гетерологичные сигнальные пептиды, которые включают аминокислотную последовательность, выбранную из SEQ ID NO: 7-9. Указанные экспрессионные вектора были проверены при трансдукции клеток линии HUH7 in vitro. В качестве контроля использовался экспрессионные вектор на основе rAAV, содержащий нуклеиновую кислоту SEQ ID NO: 10, которая кодирует белок человеческого FVIII-BDD, включая природный сигнальный пептид FVIII, соответствующая последовательности SEQ ID NO: 6 (СП-FVIII-FVIII-BDD).rAAV-based expression vectors were prepared containing nucleic acids SEQ ID NO: 11-13 that encode FVIII-BDD fusion proteins and heterologous signal peptides that include an amino acid sequence selected from SEQ ID NO: 7-9. These expression vectors were tested for transduction of HUH7 cells in vitro. As a control, we used an rAAV-based expression vector containing the nucleic acid SEQ ID NO: 10, which encodes the human FVIII-BDD protein, including the natural FVIII signal peptide corresponding to the sequence SEQ ID NO: 6 (SP-FVIII-FVIII-BDD).
Использование экспрессионных векторов, содержащих нуклеиновые кислоты SEQ ID NO: 11-13, привело к увеличению уровня продукции (Фигура 3) и активности (Фигура 4) белка FVIII-BDD по сравнению с использованием экспрессионного вектора, содержащего нуклеиновую кислоту SEQ ID NO: 10 (СП-FVIII-FVIII-BDD). При этом неожиданное увеличение уровня продукции белка FVIII-BDD в полученной после трансдукции культуральной жидкости по сравнению с использованием экспрессионного вектора с нуклеиновой кислотой SEQ ID NO: 10 (СП-FVIII-FVIII-BDD) в 2,7 раза показал экспрессионный вектор с последовательностью SEQ ID NO: 11 (СП-FIX-FVIII-BDD), в 2,4 раза показал экспрессионный вектор с последовательностью SEQ ID NO: 13 (СП-Lactalbumin-FVIII-BDD). Аналогичные данные наблюдались по активности FVIII-BDD в полученной после трансдукции культуральной жидкости. Наблюдалось неожиданное увеличение при сравнении экспрессионного вектора с нуклеиновой кислотой SEQ ID NO: 10 (СП-FVIII-FVIII-BDD) с экспрессионными векторами с нуклеиновыми кислотами SEQ ID NO: 11 (СП-FIX-FVIII-BDD) (увеличение в 3,9 раза) и с SEQ ID NO: 13 (СП-Lactalbumin-FVIII-BDD) (увеличение в 3,5 раза). Полученные результаты находятся в соответствии с данными, полученными при трансфекции клеток линии HepG2.The use of expression vectors containing the nucleic acids SEQ ID NO: 11-13 resulted in an increase in the level of production (Figure 3) and activity (Figure 4) of the FVIII-BDD protein compared to the use of an expression vector containing the nucleic acid SEQ ID NO: 10 ( SP-FVIII-FVIII-BDD). At the same time, an unexpected increase in the level of production of the FVIII-BDD protein in the culture liquid obtained after transduction compared to the use of an expression vector with the nucleic acid SEQ ID NO: 10 (SP-FVIII-FVIII-BDD) was shown by an expression vector with the sequence SEQ ID NO: 11 (SP-FIX-FVIII-BDD), showed 2.4 times the expression vector with the sequence SEQ ID NO: 13 (SP-Lactalbumin-FVIII-BDD). Similar data were observed for FVIII-BDD activity in the culture fluid obtained after transduction. An unexpected increase was observed when comparing the nucleic acid expression vector SEQ ID NO: 10 (SP-FVIII-FVIII-BDD) with the nucleic acid expression vectors SEQ ID NO: 11 (SP-FIX-FVIII-BDD) (3.9 increase times) and with SEQ ID NO: 13 (SP-Lactalbumin-FVIII-BDD) (3.5 times increase). The results obtained are in agreement with the data obtained by transfecting HepG2 cells.
Таким образом, разработанные экспрессионные вектора на основе rAAV, кодирующие слитые белки на основе FVIII-BDD и одного из гетерологичных сигнальных пептидов, которые включают аминокислотную последовательность, выбранную из SEQ ID NO: 7, SEQ ID NO: 8 или SEQ ID NO: 9, способны приводить к экспрессии белка FVIII-BDD; in vitro.Thus, designed rAAV-based expression vectors encoding fusion proteins based on FVIII-BDD and one of the heterologous signal peptides that include an amino acid sequence selected from SEQ ID NO: 7, SEQ ID NO: 8 or SEQ ID NO: 9, capable of leading to the expression of the FVIII-BDD protein; in vitro.
Пример 3Example 3
Для проведения in vivo исследований экспрессионных векторов на основе rAAV, содержащих нуклеиновую кислоту, выбранную из SEQ ID NO: 11 или SEQ ID NO: 13, которые кодируют слитые белки на основе FVIII-BDD и одного из гетерологичных сигнальных пептидов, которые включают аминокислотную последовательность, выбранную из SEQ ID NO: 7 или SEQ ID NO: 9, были использованы лабораторные мыши линии В6.129S-F8tm1Smoc (HemA), дефицитные по FVIII. В исследовании использовали дозу rAAV препаратов равную 6×1013 вг/кг. В качестве негативного контроля был использованы контрольный раствор без AAV. Препараты вводили животным однократно путем внутривенного гидродинамического введения в хвостовую вену. Отбор плазмы крови производился в день инъекции до введения препаратов, далее - на 14 и 56 дни после введения препаратов. В образцах плазмы крови определяли уровень белка фактора свертывания крови VIII-BDD методом ИФА, как было описано выше.To conduct in vivo studies of rAAV expression vectors containing a nucleic acid selected from SEQ ID NO: 11 or SEQ ID NO: 13 that encode fusion proteins based on FVIII-BDD and one of the heterologous signal peptides that include an amino acid sequence, selected from SEQ ID NO: 7 or SEQ ID NO: 9, laboratory mice of the B6.129S-F8tm1Smoc (HemA) line deficient in FVIII were used. The study used a dose of rAAV drugs equal to 6×10 13 vg/kg. A control solution without AAV was used as a negative control. The drugs were administered to animals once by intravenous hydrodynamic injection into the tail vein. Blood plasma was collected on the day of injection before drug administration, then on days 14 and 56 after drug administration. In blood plasma samples, the level of blood coagulation factor VIII-BDD protein was determined by ELISA, as described above.
В результате проведенных in vivo исследований (Фигура 5) было показано, что в случае использования обоих экспрессионных векторов на основе rAAV, содержащих нуклеиновую кислоту, выбранную из SEQ ID NO: 11 или SEQ ID NO: 13, наблюдается достоверное увеличение количества фактора VIII-BDD в плазме крови животных на 14 и на 56 дни (Фигура 5). В случае использования экспрессионного вектора, содержащего нуклеиновую кислоту SEQ ID NO: 11, в плазме крови животных на 14 и 56 дни после инъекции наблюдалась экспрессия FVIII-BDD на уровне 296 и 504 нг/мл, соответственно. В случае использования экспрессионного вектора, содержащего нуклеиновую кислоту SEQ ID NO: 13, в плазме крови животных на 14 и 56 дни после инъекции наблюдалась экспрессия FVIII-BDD на уровне 270 и 381 нг/мл, соответственно.As a result of in vivo studies (Figure 5), it was shown that when using both rAAV expression vectors containing a nucleic acid selected from SEQ ID NO: 11 or SEQ ID NO: 13, there was a significant increase in the amount of factor VIII-BDD in the blood plasma of animals on days 14 and 56 (Figure 5). When using an expression vector containing the nucleic acid SEQ ID NO: 11, FVIII-BDD expression was observed in the blood plasma of animals on days 14 and 56 after injection at a level of 296 and 504 ng/ml, respectively. When using an expression vector containing the nucleic acid SEQ ID NO: 13, FVIII-BDD expression was observed in the blood plasma of animals on days 14 and 56 after injection at a level of 270 and 381 ng/ml, respectively.
Таким образом, разработанные экспрессионные вектора на основе rAAV, содержащие нуклеиновую кислоту, выбранную из SEQ ID NO: 11 или SEQ ID NO: 13, которые кодируют слитые белки на основе FVIII-BDD и одного из гетерологичных сигнальных пептидов, которые включают аминокислотную последовательность, выбранную из SEQ ID NO: 7 или SEQ ID NO: 9, способны приводить к экспрессии белка FVIII-BDD in vivo и обладают высоким потенциалом для генной терапии Гемофилии А.Thus, rAAV-based expression vectors are designed containing a nucleic acid selected from SEQ ID NO: 11 or SEQ ID NO: 13 that encode fusion proteins based on FVIII-BDD and one of the heterologous signal peptides that include the amino acid sequence selected from SEQ ID NO: 7 or SEQ ID NO: 9, are capable of leading to expression of the FVIII-BDD protein in vivo and have high potential for gene therapy for Hemophilia A.
--->--->
Перечень последовательностей List of sequences
<110> АО "БИОКАД"<110> JSC "BIOCAD"
<120> Выделенная нуклеиновая кислота, которая кодирует слитый белок на основе<120> An isolated nucleic acid that encodes a fusion protein based on
FVIII-BDD и гетерологичного сигнального пептида, и ее применение FVIII-BDD and heterologous signal peptide and its application
<160> 17<160> 17
<170> BiSSAP 1.3.6<170> BiSSAP 1.3.6
<210> 1<210> 1
<211> 19<211> 19
<212> PRT<212>PRT
<213> Природная последовательность<213> Natural sequence
<220> <220>
<223> сигнальный пептид FVIII<223> FVIII signal peptide
<400> 1<400> 1
Met Gln Ile Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe Met Gln Ile Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe
1 5 10 15 1 5 10 15
Cys Phe Ser Cys Phe Ser
<210> 2<210> 2
<211> 28<211> 28
<212> PRT<212>PRT
<213> Природная последовательность<213> Natural sequence
<220> <220>
<223> сигнальный пептид FIX<223> signal peptide FIX
<400> 2<400> 2
Met Gln Arg Val Asn Met Ile Met Ala Glu Ser Pro Gly Leu Ile Thr Met Gln Arg Val Asn Met Ile Met Ala Glu Ser Pro Gly Leu Ile Thr
1 5 10 15 1 5 10 15
Ile Cys Leu Leu Gly Tyr Leu Leu Ser Ala Glu Cys Ile Cys Leu Leu Gly Tyr Leu Leu Ser Ala Glu Cys
20 25 20 25
<210> 3<210> 3
<211> 20<211> 20
<212> PRT<212>PRT
<213> Природная последовательность<213> Natural sequence
<220> <220>
<223> сигнальный пептид Kappa цепи иммуноглобулина G<223> Immunoglobulin G chain Kappa signal peptide
<400> 3<400> 3
Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro
1 5 10 15 1 5 10 15
Gly Ser Thr Gly Gly Ser Thr Gly
20 20
<210> 4<210> 4
<211> 19<211> 19
<212> PRT<212>PRT
<213> Природная последовательность<213> Natural sequence
<220> <220>
<223> сигнальный пептид Lactalbumin<223> signal peptide Lactalbumin
<400> 4<400> 4
Met Met Ser Phe Val Ser Leu Leu Leu Val Gly Ile Leu Phe His Ala Met Met Ser Phe Val Ser Leu Leu Leu Val Gly Ile Leu Phe His Ala
1 5 10 15 1 5 10 15
Thr Gln Ala Thr Gln Ala
<210> 5<210> 5
<211> 1438<211> 1438
<212> PRT<212>PRT
<213> Искусственная последовательность<213> Artificial sequence
<220> <220>
<223> белок FVIII-BDD (фактор свёртывания крови VIII c делетированным B<223> FVIII-BDD protein (blood clotting factor VIII with B deleted
доменом) domain)
<400> 5<400> 5
Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser Trp Asp Tyr Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser Trp Asp Tyr
1 5 10 15 1 5 10 15
Met Gln Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg Phe Pro Pro Met Gln Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg Phe Pro Pro
20 25 30 20 25 30
Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val Tyr Lys Lys Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val Tyr Lys Lys
35 40 45 35 40 45
Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn Ile Ala Lys Pro Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn Ile Ala Lys Pro
50 55 60 50 55 60
Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile Gln Ala Glu Val Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile Gln Ala Glu Val
65 70 75 80 65 70 75 80
Tyr Asp Thr Val Val Ile Thr Leu Lys Asn Met Ala Ser His Pro Val Tyr Asp Thr Val Val Ile Thr Leu Lys Asn Met Ala Ser His Pro Val
85 90 95 85 90 95
Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser Glu Gly Ala Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser Glu Gly Ala
100 105 110 100 105 110
Glu Tyr Asp Asp Gln Thr Ser Gln Arg Glu Lys Glu Asp Asp Lys Val Glu Tyr Asp Asp Gln Thr Ser Gln Arg Glu Lys Glu Asp Asp Lys Val
115 120 125 115 120 125
Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gln Val Leu Lys Glu Asn Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gln Val Leu Lys Glu Asn
130 135 140 130 135 140
Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser Tyr Leu Ser Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser Tyr Leu Ser
145 150 155 160 145 150 155 160
His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu Ile Gly Ala Leu His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu Ile Gly Ala Leu
165 170 175 165 170 175
Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr Gln Thr Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr Gln Thr Leu
180 185 190 180 185 190
His Lys Phe Ile Leu Leu Phe Ala Val Phe Asp Glu Gly Lys Ser Trp His Lys Phe Ile Leu Leu Phe Ala Val Phe Asp Glu Gly Lys Ser Trp
195 200 205 195 200 205
His Ser Glu Thr Lys Asn Ser Leu Met Gln Asp Arg Asp Ala Ala Ser His Ser Glu Thr Lys Asn Ser Leu Met Gln Asp Arg Asp Ala Ala Ser
210 215 220 210 215 220
Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr Val Asn Arg Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr Val Asn Arg
225 230 235 240 225 230 235 240
Ser Leu Pro Gly Leu Ile Gly Cys His Arg Lys Ser Val Tyr Trp His Ser Leu Pro Gly Leu Ile Gly Cys His Arg Lys Ser Val Tyr Trp His
245 250 255 245 250 255
Val Ile Gly Met Gly Thr Thr Pro Glu Val His Ser Ile Phe Leu Glu Val Ile Gly Met Gly Thr Thr Pro Glu Val His Ser Ile Phe Leu Glu
260 265 270 260 265 270
Gly His Thr Phe Leu Val Arg Asn His Arg Gln Ala Ser Leu Glu Ile Gly His Thr Phe Leu Val Arg Asn His Arg Gln Ala Ser Leu Glu Ile
275 280 285 275 280 285
Ser Pro Ile Thr Phe Leu Thr Ala Gln Thr Leu Leu Met Asp Leu Gly Ser Pro Ile Thr Phe Leu Thr Ala Gln Thr Leu Leu Met Asp Leu Gly
290 295 300 290 295 300
Gln Phe Leu Leu Phe Cys His Ile Ser Ser His Gln His Asp Gly Met Gln Phe Leu Leu Phe Cys His Ile Ser Ser His Gln His Asp Gly Met
305 310 315 320 305 310 315 320
Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro Gln Leu Arg Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro Gln Leu Arg
325 330 335 325 330 335
Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp Leu Thr Asp Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp Leu Thr Asp
340 345 350 340 345 350
Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser Pro Ser Phe Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser Pro Ser Phe
355 360 365 355 360 365
Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Thr Trp Val His Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Thr Trp Val His
370 375 380 370 375 380
Tyr Ile Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro Leu Val Leu Tyr Ile Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro Leu Val Leu
385 390 395 400 385 390 395 400
Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gln Tyr Leu Asn Asn Gly Pro Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gln Tyr Leu Asn Asn Gly Pro
405 410 415 405 410 415
Gln Arg Ile Gly Arg Lys Tyr Lys Lys Val Arg Phe Met Ala Tyr Thr Gln Arg Ile Gly Arg Lys Tyr Lys Lys Val Arg Phe Met Ala Tyr Thr
420 425 430 420 425 430
Asp Glu Thr Phe Lys Thr Arg Glu Ala Ile Gln His Glu Ser Gly Ile Asp Glu Thr Phe Lys Thr Arg Glu Ala Ile Gln His Glu Ser Gly Ile
435 440 445 435 440 445
Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu Leu Ile Ile Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu Leu Ile Ile
450 455 460 450 455 460
Phe Lys Asn Gln Ala Ser Arg Pro Tyr Asn Ile Tyr Pro His Gly Ile Phe Lys Asn Gln Ala Ser Arg Pro Tyr Asn Ile Tyr Pro His Gly Ile
465 470 475 480 465 470 475 480
Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys Gly Val Lys Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys Gly Val Lys
485 490 495 485 490 495
His Leu Lys Asp Phe Pro Ile Leu Pro Gly Glu Ile Phe Lys Tyr Lys His Leu Lys Asp Phe Pro Ile Leu Pro Gly Glu Ile Phe Lys Tyr Lys
500 505 510 500 505 510
Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp Pro Arg Cys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp Pro Arg Cys
515 520 525 515 520 525
Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg Asp Leu Ala Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg Asp Leu Ala
530 535 540 530 535 540
Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Glu Ser Val Asp Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Glu Ser Val Asp
545 550 555 560 545 550 555 560
Gln Arg Gly Asn Gln Ile Met Ser Asp Lys Arg Asn Val Ile Leu Phe Gln Arg Gly Asn Gln Ile Met Ser Asp Lys Arg Asn Val Ile Leu Phe
565 570 575 565 570 575
Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu Asn Ile Gln Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu Asn Ile Gln
580 585 590 580 585 590
Arg Phe Leu Pro Asn Pro Ala Gly Val Gln Leu Glu Asp Pro Glu Phe Arg Phe Leu Pro Asn Pro Ala Gly Val Gln Leu Glu Asp Pro Glu Phe
595 600 605 595 600 605
Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Val Phe Asp Ser Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Val Phe Asp Ser
610 615 620 610 615 620
Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp Tyr Ile Leu Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp Tyr Ile Leu
625 630 635 640 625 630 635 640
Ser Ile Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Phe Ser Gly Tyr Ser Ile Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Phe Ser Gly Tyr
645 650 655 645 650 655
Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr Leu Phe Pro Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr Leu Phe Pro
660 665 670 660 665 670
Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro Gly Leu Trp Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro Gly Leu Trp
675 680 685 675 680 685
Ile Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly Met Thr Ala Ile Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly Met Thr Ala
690 695 700 690 695 700
Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp Tyr Tyr Glu Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp Tyr Tyr Glu
705 710 715 720 705 710 715 720
Asp Ser Tyr Glu Asp Ile Ser Ala Tyr Leu Leu Ser Lys Asn Asn Ala Asp Ser Tyr Glu Asp Ile Ser Ala Tyr Leu Leu Ser Lys Asn Asn Ala
725 730 735 725 730 735
Ile Glu Pro Arg Ser Phe Ser Gln Asn Pro Pro Val Leu Lys Arg His Ile Glu Pro Arg Ser Phe Ser Gln Asn Pro Pro Val Leu Lys Arg His
740 745 750 740 745 750
Gln Arg Glu Ile Thr Arg Thr Thr Leu Gln Ser Asp Gln Glu Glu Ile Gln Arg Glu Ile Thr Arg Thr Thr Leu Gln Ser Asp Gln Glu Glu Ile
755 760 765 755 760 765
Asp Tyr Asp Asp Thr Ile Ser Val Glu Met Lys Lys Glu Asp Phe Asp Asp Tyr Asp Asp Thr Ile Ser Val Glu Met Lys Lys Glu Asp Phe Asp
770 775 780 770 775 780
Ile Tyr Asp Glu Asp Glu Asn Gln Ser Pro Arg Ser Phe Gln Lys Lys Ile Tyr Asp Glu Asp Glu Asn Gln Ser Pro Arg Ser Phe Gln Lys Lys
785 790 795 800 785 790 795 800
Thr Arg His Tyr Phe Ile Ala Ala Val Glu Arg Leu Trp Asp Tyr Gly Thr Arg His Tyr Phe Ile Ala Ala Val Glu Arg Leu Trp Asp Tyr Gly
805 810 815 805 810 815
Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gln Ser Gly Ser Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gln Ser Gly Ser
820 825 830 820 825 830
Val Pro Gln Phe Lys Lys Val Val Phe Gln Glu Phe Thr Asp Gly Ser Val Pro Gln Phe Lys Lys Val Val Phe Gln Glu Phe Thr Asp Gly Ser
835 840 845 835 840 845
Phe Thr Gln Pro Leu Tyr Arg Gly Glu Leu Asn Glu His Leu Gly Leu Phe Thr Gln Pro Leu Tyr Arg Gly Glu Leu Asn Glu His Leu Gly Leu
850 855 860 850 855 860
Leu Gly Pro Tyr Ile Arg Ala Glu Val Glu Asp Asn Ile Met Val Thr Leu Gly Pro Tyr Ile Arg Ala Glu Val Glu Asp Asn Ile Met Val Thr
865 870 875 880 865 870 875 880
Phe Arg Asn Gln Ala Ser Arg Pro Tyr Ser Phe Tyr Ser Ser Leu Ile Phe Arg Asn Gln Ala Ser Arg Pro Tyr Ser Phe Tyr Ser Ser Leu Ile
885 890 895 885 890 895
Ser Tyr Glu Glu Asp Gln Arg Gln Gly Ala Glu Pro Arg Lys Asn Phe Ser Tyr Glu Glu Asp Gln Arg Gln Gly Ala Glu Pro Arg Lys Asn Phe
900 905 910 900 905 910
Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Val Gln His His Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Val Gln His His
915 920 925 915 920 925
Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp Ala Tyr Phe Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp Ala Tyr Phe
930 935 940 930 935 940
Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly Leu Ile Gly Pro Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly Leu Ile Gly Pro
945 950 955 960 945 950 955 960
Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala His Gly Arg Gln Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala His Gly Arg Gln
965 970 975 965 970 975
Val Thr Val Gln Glu Phe Ala Leu Phe Phe Thr Ile Phe Asp Glu Thr Val Thr Val Gln Glu Phe Ala Leu Phe Phe Thr Ile Phe Asp Glu Thr
980 985 990 980 985 990
Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys Arg Ala Pro Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys Arg Ala Pro
995 1000 1005 995 1000 1005
Cys Asn Ile Gln Met Glu Asp Pro Thr Phe Lys Glu Asn Tyr Arg Phe Cys Asn Ile Gln Met Glu Asp Pro Thr Phe Lys Glu Asn Tyr Arg Phe
1010 1015 1020 1010 1015 1020
His Ala Ile Asn Gly Tyr Ile Met Asp Thr Leu Pro Gly Leu Val Met His Ala Ile Asn Gly Tyr Ile Met Asp Thr Leu Pro Gly Leu Val Met
1025 1030 1035 10401025 1030 1035 1040
Ala Gln Asp Gln Arg Ile Arg Trp Tyr Leu Leu Ser Met Gly Ser Asn Ala Gln Asp Gln Arg Ile Arg Trp Tyr Leu Leu Ser Met Gly Ser Asn
1045 1050 1055 1045 1050 1055
Glu Asn Ile His Ser Ile His Phe Ser Gly His Val Phe Thr Val Arg Glu Asn Ile His Ser Ile His Phe Ser Gly His Val Phe Thr Val Arg
1060 1065 1070 1060 1065 1070
Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr Pro Gly Val Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr Pro Gly Val
1075 1080 1085 1075 1080 1085
Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly Ile Trp Arg Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly Ile Trp Arg Val
1090 1095 1100 1090 1095 1100
Glu Cys Leu Ile Gly Glu His Leu His Ala Gly Met Ser Thr Leu Phe Glu Cys Leu Ile Gly Glu His Leu His Ala Gly Met Ser Thr Leu Phe
1105 1110 1115 11201105 1110 1115 1120
Leu Val Tyr Ser Asn Lys Cys Gln Thr Pro Leu Gly Met Ala Ser Gly Leu Val Tyr Ser Asn Lys Cys Gln Thr Pro Leu Gly Met Ala Ser Gly
1125 1130 1135 1125 1130 1135
His Ile Arg Asp Phe Gln Ile Thr Ala Ser Gly Gln Tyr Gly Gln Trp His Ile Arg Asp Phe Gln Ile Thr Ala Ser Gly Gln Tyr Gly Gln Trp
1140 1145 1150 1140 1145 1150
Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser Ile Asn Ala Trp Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser Ile Asn Ala Trp
1155 1160 1165 1155 1160 1165
Ser Thr Lys Glu Pro Phe Ser Trp Ile Lys Val Asp Leu Leu Ala Pro Ser Thr Lys Glu Pro Phe Ser Trp Ile Lys Val Asp Leu Leu Ala Pro
1170 1175 1180 1170 1175 1180
Met Ile Ile His Gly Ile Lys Thr Gln Gly Ala Arg Gln Lys Phe Ser Met Ile Ile His Gly Ile Lys Thr Gln Gly Ala Arg Gln Lys Phe Ser
1185 1190 1195 12001185 1190 1195 1200
Ser Leu Tyr Ile Ser Gln Phe Ile Ile Met Tyr Ser Leu Asp Gly Lys Ser Leu Tyr Ile Ser Gln Phe Ile Ile Met Tyr Ser Leu Asp Gly Lys
1205 1210 1215 1205 1210 1215
Lys Trp Gln Thr Tyr Arg Gly Asn Ser Thr Gly Thr Leu Met Val Phe Lys Trp Gln Thr Tyr Arg Gly Asn Ser Thr Gly Thr Leu Met Val Phe
1220 1225 1230 1220 1225 1230
Phe Gly Asn Val Asp Ser Ser Gly Ile Lys His Asn Ile Phe Asn Pro Phe Gly Asn Val Asp Ser Ser Gly Ile Lys His Asn Ile Phe Asn Pro
1235 1240 1245 1235 1240 1245
Pro Ile Ile Ala Arg Tyr Ile Arg Leu His Pro Thr His Tyr Ser Ile Pro Ile Ile Ala Arg Tyr Ile Arg Leu His Pro Thr His Tyr Ser Ile
1250 1255 1260 1250 1255 1260
Arg Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu Asn Ser Cys Arg Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu Asn Ser Cys
1265 1270 1275 12801265 1270 1275 1280
Ser Met Pro Leu Gly Met Glu Ser Lys Ala Ile Ser Asp Ala Gln Ile Ser Met Pro Leu Gly Met Glu Ser Lys Ala Ile Ser Asp Ala Gln Ile
1285 1290 1295 1285 1290 1295
Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp Ser Pro Ser Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp Ser Pro Ser
1300 1305 1310 1300 1305 1310
Lys Ala Arg Leu His Leu Gln Gly Arg Ser Asn Ala Trp Arg Pro Gln Lys Ala Arg Leu His Leu Gln Gly Arg Ser Asn Ala Trp Arg Pro Gln
1315 1320 1325 1315 1320 1325
Val Asn Asn Pro Lys Glu Trp Leu Gln Val Asp Phe Gln Lys Thr Met Val Asn Asn Pro Lys Glu Trp Leu Gln Val Asp Phe Gln Lys Thr Met
1330 1335 1340 1330 1335 1340
Lys Val Thr Gly Val Thr Thr Gln Gly Val Lys Ser Leu Leu Thr Ser Lys Val Thr Gly Val Thr Thr Gln Gly Val Lys Ser Leu Leu Thr Ser
1345 1350 1355 13601345 1350 1355 1360
Met Tyr Val Lys Glu Phe Leu Ile Ser Ser Ser Gln Asp Gly His Gln Met Tyr Val Lys Glu Phe Leu Ile Ser Ser Ser Gln Asp Gly His Gln
1365 1370 1375 1365 1370 1375
Trp Thr Leu Phe Phe Gln Asn Gly Lys Val Lys Val Phe Gln Gly Asn Trp Thr Leu Phe Phe Gln Asn Gly Lys Val Lys Val Phe Gln Gly Asn
1380 1385 1390 1380 1385 1390
Gln Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro Pro Leu Leu Gln Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro Pro Leu Leu
1395 1400 1405 1395 1400 1405
Thr Arg Tyr Leu Arg Ile His Pro Gln Ser Trp Val His Gln Ile Ala Thr Arg Tyr Leu Arg Ile His Pro Gln Ser Trp Val His Gln Ile Ala
1410 1415 1420 1410 1415 1420
Leu Arg Met Glu Val Leu Gly Cys Glu Ala Gln Asp Leu Tyr Leu Arg Met Glu Val Leu Gly Cys Glu Ala Gln Asp Leu Tyr
1425 1430 1435 1425 1430 1435
<210> 6<210> 6
<211> 1457<211> 1457
<212> PRT<212>PRT
<213> Искусственная последовательность<213> Artificial sequence
<220> <220>
<223> белок FVIII-BDD с сигнальным дикого типа (СП-FVIII-FVIII-BDD)<223> FVIII-BDD protein with wild-type signal (SP-FVIII-FVIII-BDD)
<400> 6<400> 6
Met Gln Ile Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe Met Gln Ile Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe
1 5 10 15 1 5 10 15
Cys Phe Ser Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser Cys Phe Ser Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser
20 25 30 20 25 30
Trp Asp Tyr Met Gln Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg Trp Asp Tyr Met Gln Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg
35 40 45 35 40 45
Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val
50 55 60 50 55 60
Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn Ile Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn Ile
65 70 75 80 65 70 75 80
Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile Gln Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile Gln
85 90 95 85 90 95
Ala Glu Val Tyr Asp Thr Val Val Ile Thr Leu Lys Asn Met Ala Ser Ala Glu Val Tyr Asp Thr Val Val Ile Thr Leu Lys Asn Met Ala Ser
100 105 110 100 105 110
His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser
115 120 125 115 120 125
Glu Gly Ala Glu Tyr Asp Asp Gln Thr Ser Gln Arg Glu Lys Glu Asp Glu Gly Ala Glu Tyr Asp Asp Gln Thr Ser Gln Arg Glu Lys Glu Asp
130 135 140 130 135 140
Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gln Val Leu Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gln Val Leu
145 150 155 160 145 150 155 160
Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser
165 170 175 165 170 175
Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu Ile Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu Ile
180 185 190 180 185 190
Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr
195 200 205 195 200 205
Gln Thr Leu His Lys Phe Ile Leu Leu Phe Ala Val Phe Asp Glu Gly Gln Thr Leu His Lys Phe Ile Leu Leu Phe Ala Val Phe Asp Glu Gly
210 215 220 210 215 220
Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gln Asp Arg Asp Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gln Asp Arg Asp
225 230 235 240 225 230 235 240
Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr
245 250 255 245 250 255
Val Asn Arg Ser Leu Pro Gly Leu Ile Gly Cys His Arg Lys Ser Val Val Asn Arg Ser Leu Pro Gly Leu Ile Gly Cys His Arg Lys Ser Val
260 265 270 260 265 270
Tyr Trp His Val Ile Gly Met Gly Thr Thr Pro Glu Val His Ser Ile Tyr Trp His Val Ile Gly Met Gly Thr Thr Pro Glu Val His Ser Ile
275 280 285 275 280 285
Phe Leu Glu Gly His Thr Phe Leu Val Arg Asn His Arg Gln Ala Ser Phe Leu Glu Gly His Thr Phe Leu Val Arg Asn His Arg Gln Ala Ser
290 295 300 290 295 300
Leu Glu Ile Ser Pro Ile Thr Phe Leu Thr Ala Gln Thr Leu Leu Met Leu Glu Ile Ser Pro Ile Thr Phe Leu Thr Ala Gln Thr Leu Leu Met
305 310 315 320 305 310 315 320
Asp Leu Gly Gln Phe Leu Leu Phe Cys His Ile Ser Ser His Gln His Asp Leu Gly Gln Phe Leu Leu Phe Cys His Ile Ser Ser His Gln His
325 330 335 325 330 335
Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro
340 345 350 340 345 350
Gln Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp Gln Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp
355 360 365 355 360 365
Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser
370 375 380 370 375 380
Pro Ser Phe Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Thr Pro Ser Phe Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Thr
385 390 395 400 385 390 395 400
Trp Val His Tyr Ile Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro Trp Val His Tyr Ile Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro
405 410 415 405 410 415
Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gln Tyr Leu Asn Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gln Tyr Leu Asn
420 425 430 420 425 430
Asn Gly Pro Gln Arg Ile Gly Arg Lys Tyr Lys Lys Val Arg Phe Met Asn Gly Pro Gln Arg Ile Gly Arg Lys Tyr Lys Lys Val Arg Phe Met
435 440 445 435 440 445
Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala Ile Gln His Glu Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala Ile Gln His Glu
450 455 460 450 455 460
Ser Gly Ile Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu Ser Gly Ile Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu
465 470 475 480 465 470 475 480
Leu Ile Ile Phe Lys Asn Gln Ala Ser Arg Pro Tyr Asn Ile Tyr Pro Leu Ile Ile Phe Lys Asn Gln Ala Ser Arg Pro Tyr Asn Ile Tyr Pro
485 490 495 485 490 495
His Gly Ile Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys His Gly Ile Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys
500 505 510 500 505 510
Gly Val Lys His Leu Lys Asp Phe Pro Ile Leu Pro Gly Glu Ile Phe Gly Val Lys His Leu Lys Asp Phe Pro Ile Leu Pro Gly Glu Ile Phe
515 520 525 515 520 525
Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp
530 535 540 530 535 540
Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg
545 550 555 560 545 550 555 560
Asp Leu Ala Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Glu Asp Leu Ala Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Glu
565 570 575 565 570 575
Ser Val Asp Gln Arg Gly Asn Gln Ile Met Ser Asp Lys Arg Asn Val Ser Val Asp Gln Arg Gly Asn Gln Ile Met Ser Asp Lys Arg Asn Val
580 585 590 580 585 590
Ile Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu Ile Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu
595 600 605 595 600 605
Asn Ile Gln Arg Phe Leu Pro Asn Pro Ala Gly Val Gln Leu Glu Asp Asn Ile Gln Arg Phe Leu Pro Asn Pro Ala Gly Val Gln Leu Glu Asp
610 615 620 610 615 620
Pro Glu Phe Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Val Pro Glu Phe Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Val
625 630 635 640 625 630 635 640
Phe Asp Ser Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp Phe Asp Ser Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp
645 650 655 645 650 655
Tyr Ile Leu Ser Ile Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Phe Tyr Ile Leu Ser Ile Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Phe
660 665 670 660 665 670
Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr
675 680 685 675 680 685
Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro
690 695 700 690 695 700
Gly Leu Trp Ile Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly Gly Leu Trp Ile Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly
705 710 715 720 705 710 715 720
Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp
725 730 735 725 730 735
Tyr Tyr Glu Asp Ser Tyr Glu Asp Ile Ser Ala Tyr Leu Leu Ser Lys Tyr Tyr Glu Asp Ser Tyr Glu Asp Ile Ser Ala Tyr Leu Leu Ser Lys
740 745 750 740 745 750
Asn Asn Ala Ile Glu Pro Arg Ser Phe Ser Gln Asn Pro Pro Val Leu Asn Asn Ala Ile Glu Pro Arg Ser Phe Ser Gln Asn Pro Pro Val Leu
755 760 765 755 760 765
Lys Arg His Gln Arg Glu Ile Thr Arg Thr Thr Leu Gln Ser Asp Gln Lys Arg His Gln Arg Glu Ile Thr Arg Thr Thr Leu Gln Ser Asp Gln
770 775 780 770 775 780
Glu Glu Ile Asp Tyr Asp Asp Thr Ile Ser Val Glu Met Lys Lys Glu Glu Glu Ile Asp Tyr Asp Asp Thr Ile Ser Val Glu Met Lys Lys Glu
785 790 795 800 785 790 795 800
Asp Phe Asp Ile Tyr Asp Glu Asp Glu Asn Gln Ser Pro Arg Ser Phe Asp Phe Asp Ile Tyr Asp Glu Asp Glu Asn Gln Ser Pro Arg Ser Phe
805 810 815 805 810 815
Gln Lys Lys Thr Arg His Tyr Phe Ile Ala Ala Val Glu Arg Leu Trp Gln Lys Lys Thr Arg His Tyr Phe Ile Ala Ala Val Glu Arg Leu Trp
820 825 830 820 825 830
Asp Tyr Gly Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gln Asp Tyr Gly Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gln
835 840 845 835 840 845
Ser Gly Ser Val Pro Gln Phe Lys Lys Val Val Phe Gln Glu Phe Thr Ser Gly Ser Val Pro Gln Phe Lys Lys Val Val Phe Gln Glu Phe Thr
850 855 860 850 855 860
Asp Gly Ser Phe Thr Gln Pro Leu Tyr Arg Gly Glu Leu Asn Glu His Asp Gly Ser Phe Thr Gln Pro Leu Tyr Arg Gly Glu Leu Asn Glu His
865 870 875 880 865 870 875 880
Leu Gly Leu Leu Gly Pro Tyr Ile Arg Ala Glu Val Glu Asp Asn Ile Leu Gly Leu Leu Gly Pro Tyr Ile Arg Ala Glu Val Glu Asp Asn Ile
885 890 895 885 890 895
Met Val Thr Phe Arg Asn Gln Ala Ser Arg Pro Tyr Ser Phe Tyr Ser Met Val Thr Phe Arg Asn Gln Ala Ser Arg Pro Tyr Ser Phe Tyr Ser
900 905 910 900 905 910
Ser Leu Ile Ser Tyr Glu Glu Asp Gln Arg Gln Gly Ala Glu Pro Arg Ser Leu Ile Ser Tyr Glu Glu Asp Gln Arg Gln Gly Ala Glu Pro Arg
915 920 925 915 920 925
Lys Asn Phe Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Val Lys Asn Phe Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Val
930 935 940 930 935 940
Gln His His Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp Gln His His Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp
945 950 955 960 945 950 955 960
Ala Tyr Phe Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly Leu Ala Tyr Phe Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly Leu
965 970 975 965 970 975
Ile Gly Pro Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala His Ile Gly Pro Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala His
980 985 990 980 985 990
Gly Arg Gln Val Thr Val Gln Glu Phe Ala Leu Phe Phe Thr Ile Phe Gly Arg Gln Val Thr Val Gln Glu Phe Ala Leu Phe Phe Thr Ile Phe
995 1000 1005 995 1000 1005
Asp Glu Thr Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys Asp Glu Thr Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys
1010 1015 1020 1010 1015 1020
Arg Ala Pro Cys Asn Ile Gln Met Glu Asp Pro Thr Phe Lys Glu Asn Arg Ala Pro Cys Asn Ile Gln Met Glu Asp Pro Thr Phe Lys Glu Asn
1025 1030 1035 10401025 1030 1035 1040
Tyr Arg Phe His Ala Ile Asn Gly Tyr Ile Met Asp Thr Leu Pro Gly Tyr Arg Phe His Ala Ile Asn Gly Tyr Ile Met Asp Thr Leu Pro Gly
1045 1050 1055 1045 1050 1055
Leu Val Met Ala Gln Asp Gln Arg Ile Arg Trp Tyr Leu Leu Ser Met Leu Val Met Ala Gln Asp Gln Arg Ile Arg Trp Tyr Leu Leu Ser Met
1060 1065 1070 1060 1065 1070
Gly Ser Asn Glu Asn Ile His Ser Ile His Phe Ser Gly His Val Phe Gly Ser Asn Glu Asn Ile His Ser Ile His Phe Ser Gly His Val Phe
1075 1080 1085 1075 1080 1085
Thr Val Arg Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr Thr Val Arg Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr
1090 1095 1100 1090 1095 1100
Pro Gly Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly Ile Pro Gly Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly Ile
1105 1110 1115 11201105 1110 1115 1120
Trp Arg Val Glu Cys Leu Ile Gly Glu His Leu His Ala Gly Met Ser Trp Arg Val Glu Cys Leu Ile Gly Gly His Leu His Ala Gly Met Ser
1125 1130 1135 1125 1130 1135
Thr Leu Phe Leu Val Tyr Ser Asn Lys Cys Gln Thr Pro Leu Gly Met Thr Leu Phe Leu Val Tyr Ser Asn Lys Cys Gln Thr Pro Leu Gly Met
1140 1145 1150 1140 1145 1150
Ala Ser Gly His Ile Arg Asp Phe Gln Ile Thr Ala Ser Gly Gln Tyr Ala Ser Gly His Ile Arg Asp Phe Gln Ile Thr Ala Ser Gly Gln Tyr
1155 1160 1165 1155 1160 1165
Gly Gln Trp Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser Ile Gly Gln Trp Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser Ile
1170 1175 1180 1170 1175 1180
Asn Ala Trp Ser Thr Lys Glu Pro Phe Ser Trp Ile Lys Val Asp Leu Asn Ala Trp Ser Thr Lys Glu Pro Phe Ser Trp Ile Lys Val Asp Leu
1185 1190 1195 12001185 1190 1195 1200
Leu Ala Pro Met Ile Ile His Gly Ile Lys Thr Gln Gly Ala Arg Gln Leu Ala Pro Met Ile Ile His Gly Ile Lys Thr Gln Gly Ala Arg Gln
1205 1210 1215 1205 1210 1215
Lys Phe Ser Ser Leu Tyr Ile Ser Gln Phe Ile Ile Met Tyr Ser Leu Lys Phe Ser Ser Leu Tyr Ile Ser Gln Phe Ile Ile Met Tyr Ser Leu
1220 1225 1230 1220 1225 1230
Asp Gly Lys Lys Trp Gln Thr Tyr Arg Gly Asn Ser Thr Gly Thr Leu Asp Gly Lys Lys Trp Gln Thr Tyr Arg Gly Asn Ser Thr Gly Thr Leu
1235 1240 1245 1235 1240 1245
Met Val Phe Phe Gly Asn Val Asp Ser Ser Gly Ile Lys His Asn Ile Met Val Phe Phe Gly Asn Val Asp Ser Ser Gly Ile Lys His Asn Ile
1250 1255 1260 1250 1255 1260
Phe Asn Pro Pro Ile Ile Ala Arg Tyr Ile Arg Leu His Pro Thr His Phe Asn Pro Pro Ile Ile Ala Arg Tyr Ile Arg Leu His Pro Thr His
1265 1270 1275 12801265 1270 1275 1280
Tyr Ser Ile Arg Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu Tyr Ser Ile Arg Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu
1285 1290 1295 1285 1290 1295
Asn Ser Cys Ser Met Pro Leu Gly Met Glu Ser Lys Ala Ile Ser Asp Asn Ser Cys Ser Met Pro Leu Gly Met Glu Ser Lys Ala Ile Ser Asp
1300 1305 1310 1300 1305 1310
Ala Gln Ile Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp Ala Gln Ile Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp
1315 1320 1325 1315 1320 1325
Ser Pro Ser Lys Ala Arg Leu His Leu Gln Gly Arg Ser Asn Ala Trp Ser Pro Ser Lys Ala Arg Leu His Leu Gln Gly Arg Ser Asn Ala Trp
1330 1335 1340 1330 1335 1340
Arg Pro Gln Val Asn Asn Pro Lys Glu Trp Leu Gln Val Asp Phe Gln Arg Pro Gln Val Asn Asn Pro Lys Glu Trp Leu Gln Val Asp Phe Gln
1345 1350 1355 13601345 1350 1355 1360
Lys Thr Met Lys Val Thr Gly Val Thr Thr Gln Gly Val Lys Ser Leu Lys Thr Met Lys Val Thr Gly Val Thr Thr Gln Gly Val Lys Ser Leu
1365 1370 1375 1365 1370 1375
Leu Thr Ser Met Tyr Val Lys Glu Phe Leu Ile Ser Ser Ser Gln Asp Leu Thr Ser Met Tyr Val Lys Glu Phe Leu Ile Ser Ser Ser Gln Asp
1380 1385 1390 1380 1385 1390
Gly His Gln Trp Thr Leu Phe Phe Gln Asn Gly Lys Val Lys Val Phe Gly His Gln Trp Thr Leu Phe Phe Gln Asn Gly Lys Val Lys Val Phe
1395 1400 1405 1395 1400 1405
Gln Gly Asn Gln Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro Gln Gly Asn Gln Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro
1410 1415 1420 1410 1415 1420
Pro Leu Leu Thr Arg Tyr Leu Arg Ile His Pro Gln Ser Trp Val His Pro Leu Leu Thr Arg Tyr Leu Arg Ile His Pro Gln Ser Trp Val His
1425 1430 1435 14401425 1430 1435 1440
Gln Ile Ala Leu Arg Met Glu Val Leu Gly Cys Glu Ala Gln Asp Leu Gln Ile Ala Leu Arg Met Glu Val Leu Gly Cys Glu Ala Gln Asp Leu
1445 1450 1455 1445 1450 1455
Tyr Tyr
<210> 7<210> 7
<211> 1466<211> 1466
<212> PRT<212>PRT
<213> Искусственная последовательность<213> Artificial sequence
<220> <220>
<223> слитый белок на основе человеческого FVIII-BDD и сигнального<223> fusion protein based on human FVIII-BDD and signaling
пептида FIX (СП-FIX-FVIII-BDD) peptide FIX (SP-FIX-FVIII-BDD)
<400> 7<400> 7
Met Gln Arg Val Asn Met Ile Met Ala Glu Ser Pro Gly Leu Ile Thr Met Gln Arg Val Asn Met Ile Met Ala Glu Ser Pro Gly Leu Ile Thr
1 5 10 15 1 5 10 15
Ile Cys Leu Leu Gly Tyr Leu Leu Ser Ala Glu Cys Ala Thr Arg Arg Ile Cys Leu Leu Gly Tyr Leu Leu Ser Ala Glu Cys Ala Thr Arg Arg
20 25 30 20 25 30
Tyr Tyr Leu Gly Ala Val Glu Leu Ser Trp Asp Tyr Met Gln Ser Asp Tyr Tyr Leu Gly Ala Val Glu Leu Ser Trp Asp Tyr Met Gln Ser Asp
35 40 45 35 40 45
Leu Gly Glu Leu Pro Val Asp Ala Arg Phe Pro Pro Arg Val Pro Lys Leu Gly Glu Leu Pro Val Asp Ala Arg Phe Pro Pro Arg Val Pro Lys
50 55 60 50 55 60
Ser Phe Pro Phe Asn Thr Ser Val Val Tyr Lys Lys Thr Leu Phe Val Ser Phe Pro Phe Asn Thr Ser Val Val Tyr Lys Lys Thr Leu Phe Val
65 70 75 80 65 70 75 80
Glu Phe Thr Asp His Leu Phe Asn Ile Ala Lys Pro Arg Pro Pro Trp Glu Phe Thr Asp His Leu Phe Asn Ile Ala Lys Pro Arg Pro Pro Trp
85 90 95 85 90 95
Met Gly Leu Leu Gly Pro Thr Ile Gln Ala Glu Val Tyr Asp Thr Val Met Gly Leu Leu Gly Pro Thr Ile Gln Ala Glu Val Tyr Asp Thr Val
100 105 110 100 105 110
Val Ile Thr Leu Lys Asn Met Ala Ser His Pro Val Ser Leu His Ala Val Ile Thr Leu Lys Asn Met Ala Ser His Pro Val Ser Leu His Ala
115 120 125 115 120 125
Val Gly Val Ser Tyr Trp Lys Ala Ser Glu Gly Ala Glu Tyr Asp Asp Val Gly Val Ser Tyr Trp Lys Ala Ser Glu Gly Ala Glu Tyr Asp Asp
130 135 140 130 135 140
Gln Thr Ser Gln Arg Glu Lys Glu Asp Asp Lys Val Phe Pro Gly Gly Gln Thr Ser Gln Arg Glu Lys Glu Asp Asp Lys Val Phe Pro Gly Gly
145 150 155 160 145 150 155 160
Ser His Thr Tyr Val Trp Gln Val Leu Lys Glu Asn Gly Pro Met Ala Ser His Thr Tyr Val Trp Gln Val Leu Lys Glu Asn Gly Pro Met Ala
165 170 175 165 170 175
Ser Asp Pro Leu Cys Leu Thr Tyr Ser Tyr Leu Ser His Val Asp Leu Ser Asp Pro Leu Cys Leu Thr Tyr Ser Tyr Leu Ser His Val Asp Leu
180 185 190 180 185 190
Val Lys Asp Leu Asn Ser Gly Leu Ile Gly Ala Leu Leu Val Cys Arg Val Lys Asp Leu Asn Ser Gly Leu Ile Gly Ala Leu Leu Val Cys Arg
195 200 205 195 200 205
Glu Gly Ser Leu Ala Lys Glu Lys Thr Gln Thr Leu His Lys Phe Ile Glu Gly Ser Leu Ala Lys Glu Lys Thr Gln Thr Leu His Lys Phe Ile
210 215 220 210 215 220
Leu Leu Phe Ala Val Phe Asp Glu Gly Lys Ser Trp His Ser Glu Thr Leu Leu Phe Ala Val Phe Asp Glu Gly Lys Ser Trp His Ser Glu Thr
225 230 235 240 225 230 235 240
Lys Asn Ser Leu Met Gln Asp Arg Asp Ala Ala Ser Ala Arg Ala Trp Lys Asn Ser Leu Met Gln Asp Arg Asp Ala Ala Ser Ala Arg Ala Trp
245 250 255 245 250 255
Pro Lys Met His Thr Val Asn Gly Tyr Val Asn Arg Ser Leu Pro Gly Pro Lys Met His Thr Val Asn Gly Tyr Val Asn Arg Ser Leu Pro Gly
260 265 270 260 265 270
Leu Ile Gly Cys His Arg Lys Ser Val Tyr Trp His Val Ile Gly Met Leu Ile Gly Cys His Arg Lys Ser Val Tyr Trp His Val Ile Gly Met
275 280 285 275 280 285
Gly Thr Thr Pro Glu Val His Ser Ile Phe Leu Glu Gly His Thr Phe Gly Thr Thr Pro Glu Val His Ser Ile Phe Leu Glu Gly His Thr Phe
290 295 300 290 295 300
Leu Val Arg Asn His Arg Gln Ala Ser Leu Glu Ile Ser Pro Ile Thr Leu Val Arg Asn His Arg Gln Ala Ser Leu Glu Ile Ser Pro Ile Thr
305 310 315 320 305 310 315 320
Phe Leu Thr Ala Gln Thr Leu Leu Met Asp Leu Gly Gln Phe Leu Leu Phe Leu Thr Ala Gln Thr Leu Leu Met Asp Leu Gly Gln Phe Leu Leu
325 330 335 325 330 335
Phe Cys His Ile Ser Ser His Gln His Asp Gly Met Glu Ala Tyr Val Phe Cys His Ile Ser Ser His Gln His Asp Gly Met Glu Ala Tyr Val
340 345 350 340 345 350
Lys Val Asp Ser Cys Pro Glu Glu Pro Gln Leu Arg Met Lys Asn Asn Lys Val Asp Ser Cys Pro Glu Glu Pro Gln Leu Arg Met Lys Asn Asn
355 360 365 355 360 365
Glu Glu Ala Glu Asp Tyr Asp Asp Asp Leu Thr Asp Ser Glu Met Asp Glu Glu Ala Glu Asp Tyr Asp Asp Asp Leu Thr Asp Ser Glu Met Asp
370 375 380 370 375 380
Val Val Arg Phe Asp Asp Asp Asn Ser Pro Ser Phe Ile Gln Ile Arg Val Val Arg Phe Asp Asp Asp Asn Ser Pro Ser Phe Ile Gln Ile Arg
385 390 395 400 385 390 395 400
Ser Val Ala Lys Lys His Pro Lys Thr Trp Val His Tyr Ile Ala Ala Ser Val Ala Lys Lys His Pro Lys Thr Trp Val His Tyr Ile Ala Ala
405 410 415 405 410 415
Glu Glu Glu Asp Trp Asp Tyr Ala Pro Leu Val Leu Ala Pro Asp Asp Glu Glu Glu Asp Trp Asp Tyr Ala Pro Leu Val Leu Ala Pro Asp Asp
420 425 430 420 425 430
Arg Ser Tyr Lys Ser Gln Tyr Leu Asn Asn Gly Pro Gln Arg Ile Gly Arg Ser Tyr Lys Ser Gln Tyr Leu Asn Asn Gly Pro Gln Arg Ile Gly
435 440 445 435 440 445
Arg Lys Tyr Lys Lys Val Arg Phe Met Ala Tyr Thr Asp Glu Thr Phe Arg Lys Tyr Lys Lys Val Arg Phe Met Ala Tyr Thr Asp Glu Thr Phe
450 455 460 450 455 460
Lys Thr Arg Glu Ala Ile Gln His Glu Ser Gly Ile Leu Gly Pro Leu Lys Thr Arg Glu Ala Ile Gln His Glu Ser Gly Ile Leu Gly Pro Leu
465 470 475 480 465 470 475 480
Leu Tyr Gly Glu Val Gly Asp Thr Leu Leu Ile Ile Phe Lys Asn Gln Leu Tyr Gly Glu Val Gly Asp Thr Leu Leu Ile Ile Phe Lys Asn Gln
485 490 495 485 490 495
Ala Ser Arg Pro Tyr Asn Ile Tyr Pro His Gly Ile Thr Asp Val Arg Ala Ser Arg Pro Tyr Asn Ile Tyr Pro His Gly Ile Thr Asp Val Arg
500 505 510 500 505 510
Pro Leu Tyr Ser Arg Arg Leu Pro Lys Gly Val Lys His Leu Lys Asp Pro Leu Tyr Ser Arg Arg Leu Pro Lys Gly Val Lys His Leu Lys Asp
515 520 525 515 520 525
Phe Pro Ile Leu Pro Gly Glu Ile Phe Lys Tyr Lys Trp Thr Val Thr Phe Pro Ile Leu Pro Gly Glu Ile Phe Lys Tyr Lys Trp Thr Val Thr
530 535 540 530 535 540
Val Glu Asp Gly Pro Thr Lys Ser Asp Pro Arg Cys Leu Thr Arg Tyr Val Glu Asp Gly Pro Thr Lys Ser Asp Pro Arg Cys Leu Thr Arg Tyr
545 550 555 560 545 550 555 560
Tyr Ser Ser Phe Val Asn Met Glu Arg Asp Leu Ala Ser Gly Leu Ile Tyr Ser Ser Phe Val Asn Met Glu Arg Asp Leu Ala Ser Gly Leu Ile
565 570 575 565 570 575
Gly Pro Leu Leu Ile Cys Tyr Lys Glu Ser Val Asp Gln Arg Gly Asn Gly Pro Leu Leu Ile Cys Tyr Lys Glu Ser Val Asp Gln Arg Gly Asn
580 585 590 580 585 590
Gln Ile Met Ser Asp Lys Arg Asn Val Ile Leu Phe Ser Val Phe Asp Gln Ile Met Ser Asp Lys Arg Asn Val Ile Leu Phe Ser Val Phe Asp
595 600 605 595 600 605
Glu Asn Arg Ser Trp Tyr Leu Thr Glu Asn Ile Gln Arg Phe Leu Pro Glu Asn Arg Ser Trp Tyr Leu Thr Glu Asn Ile Gln Arg Phe Leu Pro
610 615 620 610 615 620
Asn Pro Ala Gly Val Gln Leu Glu Asp Pro Glu Phe Gln Ala Ser Asn Asn Pro Ala Gly Val Gln Leu Glu Asp Pro Glu Phe Gln Ala Ser Asn
625 630 635 640 625 630 635 640
Ile Met His Ser Ile Asn Gly Tyr Val Phe Asp Ser Leu Gln Leu Ser Ile Met His Ser Ile Asn Gly Tyr Val Phe Asp Ser Leu Gln Leu Ser
645 650 655 645 650 655
Val Cys Leu His Glu Val Ala Tyr Trp Tyr Ile Leu Ser Ile Gly Ala Val Cys Leu His Glu Val Ala Tyr Trp Tyr Ile Leu Ser Ile Gly Ala
660 665 670 660 665 670
Gln Thr Asp Phe Leu Ser Val Phe Phe Ser Gly Tyr Thr Phe Lys His Gln Thr Asp Phe Leu Ser Val Phe Phe Ser Gly Tyr Thr Phe Lys His
675 680 685 675 680 685
Lys Met Val Tyr Glu Asp Thr Leu Thr Leu Phe Pro Phe Ser Gly Glu Lys Met Val Tyr Glu Asp Thr Leu Thr Leu Phe Pro Phe Ser Gly Glu
690 695 700 690 695 700
Thr Val Phe Met Ser Met Glu Asn Pro Gly Leu Trp Ile Leu Gly Cys Thr Val Phe Met Ser Met Glu Asn Pro Gly Leu Trp Ile Leu Gly Cys
705 710 715 720 705 710 715 720
His Asn Ser Asp Phe Arg Asn Arg Gly Met Thr Ala Leu Leu Lys Val His Asn Ser Asp Phe Arg Asn Arg Gly Met Thr Ala Leu Leu Lys Val
725 730 735 725 730 735
Ser Ser Cys Asp Lys Asn Thr Gly Asp Tyr Tyr Glu Asp Ser Tyr Glu Ser Ser Cys Asp Lys Asn Thr Gly Asp Tyr Tyr Glu Asp Ser Tyr Glu
740 745 750 740 745 750
Asp Ile Ser Ala Tyr Leu Leu Ser Lys Asn Asn Ala Ile Glu Pro Arg Asp Ile Ser Ala Tyr Leu Leu Ser Lys Asn Asn Ala Ile Glu Pro Arg
755 760 765 755 760 765
Ser Phe Ser Gln Asn Pro Pro Val Leu Lys Arg His Gln Arg Glu Ile Ser Phe Ser Gln Asn Pro Pro Val Leu Lys Arg His Gln Arg Glu Ile
770 775 780 770 775 780
Thr Arg Thr Thr Leu Gln Ser Asp Gln Glu Glu Ile Asp Tyr Asp Asp Thr Arg Thr Thr Leu Gln Ser Asp Gln Glu Glu Ile Asp Tyr Asp Asp
785 790 795 800 785 790 795 800
Thr Ile Ser Val Glu Met Lys Lys Glu Asp Phe Asp Ile Tyr Asp Glu Thr Ile Ser Val Glu Met Lys Lys Glu Asp Phe Asp Ile Tyr Asp Glu
805 810 815 805 810 815
Asp Glu Asn Gln Ser Pro Arg Ser Phe Gln Lys Lys Thr Arg His Tyr Asp Glu Asn Gln Ser Pro Arg Ser Phe Gln Lys Lys Thr Arg His Tyr
820 825 830 820 825 830
Phe Ile Ala Ala Val Glu Arg Leu Trp Asp Tyr Gly Met Ser Ser Ser Phe Ile Ala Ala Val Glu Arg Leu Trp Asp Tyr Gly Met Ser Ser Ser
835 840 845 835 840 845
Pro His Val Leu Arg Asn Arg Ala Gln Ser Gly Ser Val Pro Gln Phe Pro His Val Leu Arg Asn Arg Ala Gln Ser Gly Ser Val Pro Gln Phe
850 855 860 850 855 860
Lys Lys Val Val Phe Gln Glu Phe Thr Asp Gly Ser Phe Thr Gln Pro Lys Lys Val Val Phe Gln Glu Phe Thr Asp Gly Ser Phe Thr Gln Pro
865 870 875 880 865 870 875 880
Leu Tyr Arg Gly Glu Leu Asn Glu His Leu Gly Leu Leu Gly Pro Tyr Leu Tyr Arg Gly Glu Leu Asn Glu His Leu Gly Leu Leu Gly Pro Tyr
885 890 895 885 890 895
Ile Arg Ala Glu Val Glu Asp Asn Ile Met Val Thr Phe Arg Asn Gln Ile Arg Ala Glu Val Glu Asp Asn Ile Met Val Thr Phe Arg Asn Gln
900 905 910 900 905 910
Ala Ser Arg Pro Tyr Ser Phe Tyr Ser Ser Leu Ile Ser Tyr Glu Glu Ala Ser Arg Pro Tyr Ser Phe Tyr Ser Ser Leu Ile Ser Tyr Glu Glu
915 920 925 915 920 925
Asp Gln Arg Gln Gly Ala Glu Pro Arg Lys Asn Phe Val Lys Pro Asn Asp Gln Arg Gln Gly Ala Glu Pro Arg Lys Asn Phe Val Lys Pro Asn
930 935 940 930 935 940
Glu Thr Lys Thr Tyr Phe Trp Lys Val Gln His His Met Ala Pro Thr Glu Thr Lys Thr Tyr Phe Trp Lys Val Gln His His Met Ala Pro Thr
945 950 955 960 945 950 955 960
Lys Asp Glu Phe Asp Cys Lys Ala Trp Ala Tyr Phe Ser Asp Val Asp Lys Asp Glu Phe Asp Cys Lys Ala Trp Ala Tyr Phe Ser Asp Val Asp
965 970 975 965 970 975
Leu Glu Lys Asp Val His Ser Gly Leu Ile Gly Pro Leu Leu Val Cys Leu Glu Lys Asp Val His Ser Gly Leu Ile Gly Pro Leu Leu Val Cys
980 985 990 980 985 990
His Thr Asn Thr Leu Asn Pro Ala His Gly Arg Gln Val Thr Val Gln His Thr Asn Thr Leu Asn Pro Ala His Gly Arg Gln Val Thr Val Gln
995 1000 1005 995 1000 1005
Glu Phe Ala Leu Phe Phe Thr Ile Phe Asp Glu Thr Lys Ser Trp Tyr Glu Phe Ala Leu Phe Phe Thr Ile Phe Asp Glu Thr Lys Ser Trp Tyr
1010 1015 1020 1010 1015 1020
Phe Thr Glu Asn Met Glu Arg Asn Cys Arg Ala Pro Cys Asn Ile Gln Phe Thr Glu Asn Met Glu Arg Asn Cys Arg Ala Pro Cys Asn Ile Gln
1025 1030 1035 10401025 1030 1035 1040
Met Glu Asp Pro Thr Phe Lys Glu Asn Tyr Arg Phe His Ala Ile Asn Met Glu Asp Pro Thr Phe Lys Glu Asn Tyr Arg Phe His Ala Ile Asn
1045 1050 1055 1045 1050 1055
Gly Tyr Ile Met Asp Thr Leu Pro Gly Leu Val Met Ala Gln Asp Gln Gly Tyr Ile Met Asp Thr Leu Pro Gly Leu Val Met Ala Gln Asp Gln
1060 1065 1070 1060 1065 1070
Arg Ile Arg Trp Tyr Leu Leu Ser Met Gly Ser Asn Glu Asn Ile His Arg Ile Arg Trp Tyr Leu Leu Ser Met Gly Ser Asn Glu Asn Ile His
1075 1080 1085 1075 1080 1085
Ser Ile His Phe Ser Gly His Val Phe Thr Val Arg Lys Lys Glu Glu Ser Ile His Phe Ser Gly His Val Phe Thr Val Arg Lys Lys Glu Glu
1090 1095 1100 1090 1095 1100
Tyr Lys Met Ala Leu Tyr Asn Leu Tyr Pro Gly Val Phe Glu Thr Val Tyr Lys Met Ala Leu Tyr Asn Leu Tyr Pro Gly Val Phe Glu Thr Val
1105 1110 1115 11201105 1110 1115 1120
Glu Met Leu Pro Ser Lys Ala Gly Ile Trp Arg Val Glu Cys Leu Ile Glu Met Leu Pro Ser Lys Ala Gly Ile Trp Arg Val Glu Cys Leu Ile
1125 1130 1135 1125 1130 1135
Gly Glu His Leu His Ala Gly Met Ser Thr Leu Phe Leu Val Tyr Ser Gly Glu His Leu His Ala Gly Met Ser Thr Leu Phe Leu Val Tyr Ser
1140 1145 1150 1140 1145 1150
Asn Lys Cys Gln Thr Pro Leu Gly Met Ala Ser Gly His Ile Arg Asp Asn Lys Cys Gln Thr Pro Leu Gly Met Ala Ser Gly His Ile Arg Asp
1155 1160 1165 1155 1160 1165
Phe Gln Ile Thr Ala Ser Gly Gln Tyr Gly Gln Trp Ala Pro Lys Leu Phe Gln Ile Thr Ala Ser Gly Gln Tyr Gly Gln Trp Ala Pro Lys Leu
1170 1175 1180 1170 1175 1180
Ala Arg Leu His Tyr Ser Gly Ser Ile Asn Ala Trp Ser Thr Lys Glu Ala Arg Leu His Tyr Ser Gly Ser Ile Asn Ala Trp Ser Thr Lys Glu
1185 1190 1195 12001185 1190 1195 1200
Pro Phe Ser Trp Ile Lys Val Asp Leu Leu Ala Pro Met Ile Ile His Pro Phe Ser Trp Ile Lys Val Asp Leu Leu Ala Pro Met Ile Ile His
1205 1210 1215 1205 1210 1215
Gly Ile Lys Thr Gln Gly Ala Arg Gln Lys Phe Ser Ser Leu Tyr Ile Gly Ile Lys Thr Gln Gly Ala Arg Gln Lys Phe Ser Ser Leu Tyr Ile
1220 1225 1230 1220 1225 1230
Ser Gln Phe Ile Ile Met Tyr Ser Leu Asp Gly Lys Lys Trp Gln Thr Ser Gln Phe Ile Ile Met Tyr Ser Leu Asp Gly Lys Lys Trp Gln Thr
1235 1240 1245 1235 1240 1245
Tyr Arg Gly Asn Ser Thr Gly Thr Leu Met Val Phe Phe Gly Asn Val Tyr Arg Gly Asn Ser Thr Gly Thr Leu Met Val Phe Phe Gly Asn Val
1250 1255 1260 1250 1255 1260
Asp Ser Ser Gly Ile Lys His Asn Ile Phe Asn Pro Pro Ile Ile Ala Asp Ser Ser Gly Ile Lys His Asn Ile Phe Asn Pro Pro Ile Ile Ala
1265 1270 1275 12801265 1270 1275 1280
Arg Tyr Ile Arg Leu His Pro Thr His Tyr Ser Ile Arg Ser Thr Leu Arg Tyr Ile Arg Leu His Pro Thr His Tyr Ser Ile Arg Ser Thr Leu
1285 1290 1295 1285 1290 1295
Arg Met Glu Leu Met Gly Cys Asp Leu Asn Ser Cys Ser Met Pro Leu Arg Met Glu Leu Met Gly Cys Asp Leu Asn Ser Cys Ser Met Pro Leu
1300 1305 1310 1300 1305 1310
Gly Met Glu Ser Lys Ala Ile Ser Asp Ala Gln Ile Thr Ala Ser Ser Gly Met Glu Ser Lys Ala Ile Ser Asp Ala Gln Ile Thr Ala Ser Ser
1315 1320 1325 1315 1320 1325
Tyr Phe Thr Asn Met Phe Ala Thr Trp Ser Pro Ser Lys Ala Arg Leu Tyr Phe Thr Asn Met Phe Ala Thr Trp Ser Pro Ser Lys Ala Arg Leu
1330 1335 1340 1330 1335 1340
His Leu Gln Gly Arg Ser Asn Ala Trp Arg Pro Gln Val Asn Asn Pro His Leu Gln Gly Arg Ser Asn Ala Trp Arg Pro Gln Val Asn Asn Pro
1345 1350 1355 13601345 1350 1355 1360
Lys Glu Trp Leu Gln Val Asp Phe Gln Lys Thr Met Lys Val Thr Gly Lys Glu Trp Leu Gln Val Asp Phe Gln Lys Thr Met Lys Val Thr Gly
1365 1370 1375 1365 1370 1375
Val Thr Thr Gln Gly Val Lys Ser Leu Leu Thr Ser Met Tyr Val Lys Val Thr Thr Gln Gly Val Lys Ser Leu Leu Thr Ser Met Tyr Val Lys
1380 1385 1390 1380 1385 1390
Glu Phe Leu Ile Ser Ser Ser Gln Asp Gly His Gln Trp Thr Leu Phe Glu Phe Leu Ile Ser Ser Ser Gln Asp Gly His Gln Trp Thr Leu Phe
1395 1400 1405 1395 1400 1405
Phe Gln Asn Gly Lys Val Lys Val Phe Gln Gly Asn Gln Asp Ser Phe Phe Gln Asn Gly Lys Val Lys Val Phe Gln Gly Asn Gln Asp Ser Phe
1410 1415 1420 1410 1415 1420
Thr Pro Val Val Asn Ser Leu Asp Pro Pro Leu Leu Thr Arg Tyr Leu Thr Pro Val Val Asn Ser Leu Asp Pro Pro Leu Leu Thr Arg Tyr Leu
1425 1430 1435 14401425 1430 1435 1440
Arg Ile His Pro Gln Ser Trp Val His Gln Ile Ala Leu Arg Met Glu Arg Ile His Pro Gln Ser Trp Val His Gln Ile Ala Leu Arg Met Glu
1445 1450 1455 1445 1450 1455
Val Leu Gly Cys Glu Ala Gln Asp Leu Tyr Val Leu Gly Cys Glu Ala Gln Asp Leu Tyr
1460 1465 1460 1465
<210> 8<210> 8
<211> 1458<211> 1458
<212> PRT<212>PRT
<213> Искусственная последовательность<213> Artificial sequence
<220> <220>
<223> слитый белок на основе человеческого FVIII-BDD и сигнального<223> fusion protein based on human FVIII-BDD and signaling
пептида Kappa цепь иммуноглобулина G (СП-IgGK-FVIII-BDD) Kappa peptide immunoglobulin G chain (SP-IgGK-FVIII-BDD)
<400> 8<400> 8
Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro
1 5 10 15 1 5 10 15
Gly Ser Thr Gly Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Gly Ser Thr Gly Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu
20 25 30 20 25 30
Ser Trp Asp Tyr Met Gln Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Ser Trp Asp Tyr Met Gln Ser Asp Leu Gly Glu Leu Pro Val Asp Ala
35 40 45 35 40 45
Arg Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Arg Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val
50 55 60 50 55 60
Val Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn Val Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn
65 70 75 80 65 70 75 80
Ile Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile Ile Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile
85 90 95 85 90 95
Gln Ala Glu Val Tyr Asp Thr Val Val Ile Thr Leu Lys Asn Met Ala Gln Ala Glu Val Tyr Asp Thr Val Val Ile Thr Leu Lys Asn Met Ala
100 105 110 100 105 110
Ser His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala
115 120 125 115 120 125
Ser Glu Gly Ala Glu Tyr Asp Asp Gln Thr Ser Gln Arg Glu Lys Glu Ser Glu Gly Ala Glu Tyr Asp Asp Gln Thr Ser Gln Arg Glu Lys Glu
130 135 140 130 135 140
Asp Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gln Val Asp Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gln Val
145 150 155 160 145 150 155 160
Leu Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Leu Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr
165 170 175 165 170 175
Ser Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu Ser Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu
180 185 190 180 185 190
Ile Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Ile Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys
195 200 205 195 200 205
Thr Gln Thr Leu His Lys Phe Ile Leu Leu Phe Ala Val Phe Asp Glu Thr Gln Thr Leu His Lys Phe Ile Leu Leu Phe Ala Val Phe Asp Glu
210 215 220 210 215 220
Gly Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gln Asp Arg Gly Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gln Asp Arg
225 230 235 240 225 230 235 240
Asp Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Asp Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly
245 250 255 245 250 255
Tyr Val Asn Arg Ser Leu Pro Gly Leu Ile Gly Cys His Arg Lys Ser Tyr Val Asn Arg Ser Leu Pro Gly Leu Ile Gly Cys His Arg Lys Ser
260 265 270 260 265 270
Val Tyr Trp His Val Ile Gly Met Gly Thr Thr Pro Glu Val His Ser Val Tyr Trp His Val Ile Gly Met Gly Thr Thr Pro Glu Val His Ser
275 280 285 275 280 285
Ile Phe Leu Glu Gly His Thr Phe Leu Val Arg Asn His Arg Gln Ala Ile Phe Leu Glu Gly His Thr Phe Leu Val Arg Asn His Arg Gln Ala
290 295 300 290 295 300
Ser Leu Glu Ile Ser Pro Ile Thr Phe Leu Thr Ala Gln Thr Leu Leu Ser Leu Glu Ile Ser Pro Ile Thr Phe Leu Thr Ala Gln Thr Leu Leu
305 310 315 320 305 310 315 320
Met Asp Leu Gly Gln Phe Leu Leu Phe Cys His Ile Ser Ser His Gln Met Asp Leu Gly Gln Phe Leu Leu Phe Cys His Ile Ser Ser His Gln
325 330 335 325 330 335
His Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu His Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu
340 345 350 340 345 350
Pro Gln Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Pro Gln Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp
355 360 365 355 360 365
Asp Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Asp Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn
370 375 380 370 375 380
Ser Pro Ser Phe Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Ser Pro Ser Phe Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys
385 390 395 400 385 390 395 400
Thr Trp Val His Tyr Ile Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Thr Trp Val His Tyr Ile Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala
405 410 415 405 410 415
Pro Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gln Tyr Leu Pro Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gln Tyr Leu
420 425 430 420 425 430
Asn Asn Gly Pro Gln Arg Ile Gly Arg Lys Tyr Lys Lys Val Arg Phe Asn Asn Gly Pro Gln Arg Ile Gly Arg Lys Tyr Lys Lys Val Arg Phe
435 440 445 435 440 445
Met Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala Ile Gln His Met Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala Ile Gln His
450 455 460 450 455 460
Glu Ser Gly Ile Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Glu Ser Gly Ile Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr
465 470 475 480 465 470 475 480
Leu Leu Ile Ile Phe Lys Asn Gln Ala Ser Arg Pro Tyr Asn Ile Tyr Leu Leu Ile Ile Phe Lys Asn Gln Ala Ser Arg Pro Tyr Asn Ile Tyr
485 490 495 485 490 495
Pro His Gly Ile Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Pro His Gly Ile Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro
500 505 510 500 505 510
Lys Gly Val Lys His Leu Lys Asp Phe Pro Ile Leu Pro Gly Glu Ile Lys Gly Val Lys His Leu Lys Asp Phe Pro Ile Leu Pro Gly Glu Ile
515 520 525 515 520 525
Phe Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Phe Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser
530 535 540 530 535 540
Asp Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Asp Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu
545 550 555 560 545 550 555 560
Arg Asp Leu Ala Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Arg Asp Leu Ala Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys
565 570 575 565 570 575
Glu Ser Val Asp Gln Arg Gly Asn Gln Ile Met Ser Asp Lys Arg Asn Glu Ser Val Asp Gln Arg Gly Asn Gln Ile Met Ser Asp Lys Arg Asn
580 585 590 580 585 590
Val Ile Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Val Ile Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr
595 600 605 595 600 605
Glu Asn Ile Gln Arg Phe Leu Pro Asn Pro Ala Gly Val Gln Leu Glu Glu Asn Ile Gln Arg Phe Leu Pro Asn Pro Ala Gly Val Gln Leu Glu
610 615 620 610 615 620
Asp Pro Glu Phe Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Asp Pro Glu Phe Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr
625 630 635 640 625 630 635 640
Val Phe Asp Ser Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Val Phe Asp Ser Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr
645 650 655 645 650 655
Trp Tyr Ile Leu Ser Ile Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Trp Tyr Ile Leu Ser Ile Gly Ala Gln Thr Asp Phe Leu Ser Val Phe
660 665 670 660 665 670
Phe Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Phe Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu
675 680 685 675 680 685
Thr Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Thr Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn
690 695 700 690 695 700
Pro Gly Leu Trp Ile Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Pro Gly Leu Trp Ile Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg
705 710 715 720 705 710 715 720
Gly Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Gly Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly
725 730 735 725 730 735
Asp Tyr Tyr Glu Asp Ser Tyr Glu Asp Ile Ser Ala Tyr Leu Leu Ser Asp Tyr Tyr Glu Asp Ser Tyr Glu Asp Ile Ser Ala Tyr Leu Leu Ser
740 745 750 740 745 750
Lys Asn Asn Ala Ile Glu Pro Arg Ser Phe Ser Gln Asn Pro Pro Val Lys Asn Asn Ala Ile Glu Pro Arg Ser Phe Ser Gln Asn Pro Pro Val
755 760 765 755 760 765
Leu Lys Arg His Gln Arg Glu Ile Thr Arg Thr Thr Leu Gln Ser Asp Leu Lys Arg His Gln Arg Glu Ile Thr Arg Thr Thr Leu Gln Ser Asp
770 775 780 770 775 780
Gln Glu Glu Ile Asp Tyr Asp Asp Thr Ile Ser Val Glu Met Lys Lys Gln Glu Glu Ile Asp Tyr Asp Asp Thr Ile Ser Val Glu Met Lys Lys
785 790 795 800 785 790 795 800
Glu Asp Phe Asp Ile Tyr Asp Glu Asp Glu Asn Gln Ser Pro Arg Ser Glu Asp Phe Asp Ile Tyr Asp Glu Asp Glu Asn Gln Ser Pro Arg Ser
805 810 815 805 810 815
Phe Gln Lys Lys Thr Arg His Tyr Phe Ile Ala Ala Val Glu Arg Leu Phe Gln Lys Lys Thr Arg His Tyr Phe Ile Ala Ala Val Glu Arg Leu
820 825 830 820 825 830
Trp Asp Tyr Gly Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Trp Asp Tyr Gly Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala
835 840 845 835 840 845
Gln Ser Gly Ser Val Pro Gln Phe Lys Lys Val Val Phe Gln Glu Phe Gln Ser Gly Ser Val Pro Gln Phe Lys Lys Val Val Phe Gln Glu Phe
850 855 860 850 855 860
Thr Asp Gly Ser Phe Thr Gln Pro Leu Tyr Arg Gly Glu Leu Asn Glu Thr Asp Gly Ser Phe Thr Gln Pro Leu Tyr Arg Gly Glu Leu Asn Glu
865 870 875 880 865 870 875 880
His Leu Gly Leu Leu Gly Pro Tyr Ile Arg Ala Glu Val Glu Asp Asn His Leu Gly Leu Leu Gly Pro Tyr Ile Arg Ala Glu Val Glu Asp Asn
885 890 895 885 890 895
Ile Met Val Thr Phe Arg Asn Gln Ala Ser Arg Pro Tyr Ser Phe Tyr Ile Met Val Thr Phe Arg Asn Gln Ala Ser Arg Pro Tyr Ser Phe Tyr
900 905 910 900 905 910
Ser Ser Leu Ile Ser Tyr Glu Glu Asp Gln Arg Gln Gly Ala Glu Pro Ser Ser Leu Ile Ser Tyr Glu Glu Asp Gln Arg Gln Gly Ala Glu Pro
915 920 925 915 920 925
Arg Lys Asn Phe Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Arg Lys Asn Phe Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys
930 935 940 930 935 940
Val Gln His His Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Val Gln His His Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala
945 950 955 960 945 950 955 960
Trp Ala Tyr Phe Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly Trp Ala Tyr Phe Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly
965 970 975 965 970 975
Leu Ile Gly Pro Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala Leu Ile Gly Pro Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala
980 985 990 980 985 990
His Gly Arg Gln Val Thr Val Gln Glu Phe Ala Leu Phe Phe Thr Ile His Gly Arg Gln Val Thr Val Gln Glu Phe Ala Leu Phe Phe Thr Ile
995 1000 1005 995 1000 1005
Phe Asp Glu Thr Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Phe Asp Glu Thr Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn
1010 1015 1020 1010 1015 1020
Cys Arg Ala Pro Cys Asn Ile Gln Met Glu Asp Pro Thr Phe Lys Glu Cys Arg Ala Pro Cys Asn Ile Gln Met Glu Asp Pro Thr Phe Lys Glu
1025 1030 1035 10401025 1030 1035 1040
Asn Tyr Arg Phe His Ala Ile Asn Gly Tyr Ile Met Asp Thr Leu Pro Asn Tyr Arg Phe His Ala Ile Asn Gly Tyr Ile Met Asp Thr Leu Pro
1045 1050 1055 1045 1050 1055
Gly Leu Val Met Ala Gln Asp Gln Arg Ile Arg Trp Tyr Leu Leu Ser Gly Leu Val Met Ala Gln Asp Gln Arg Ile Arg Trp Tyr Leu Leu Ser
1060 1065 1070 1060 1065 1070
Met Gly Ser Asn Glu Asn Ile His Ser Ile His Phe Ser Gly His Val Met Gly Ser Asn Glu Asn Ile His Ser Ile His Phe Ser Gly His Val
1075 1080 1085 1075 1080 1085
Phe Thr Val Arg Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Phe Thr Val Arg Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu
1090 1095 1100 1090 1095 1100
Tyr Pro Gly Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly Tyr Pro Gly Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly
1105 1110 1115 11201105 1110 1115 1120
Ile Trp Arg Val Glu Cys Leu Ile Gly Glu His Leu His Ala Gly Met Ile Trp Arg Val Glu Cys Leu Ile Gly Glu His Leu His Ala Gly Met
1125 1130 1135 1125 1130 1135
Ser Thr Leu Phe Leu Val Tyr Ser Asn Lys Cys Gln Thr Pro Leu Gly Ser Thr Leu Phe Leu Val Tyr Ser Asn Lys Cys Gln Thr Pro Leu Gly
1140 1145 1150 1140 1145 1150
Met Ala Ser Gly His Ile Arg Asp Phe Gln Ile Thr Ala Ser Gly Gln Met Ala Ser Gly His Ile Arg Asp Phe Gln Ile Thr Ala Ser Gly Gln
1155 1160 1165 1155 1160 1165
Tyr Gly Gln Trp Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser Tyr Gly Gln Trp Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser
1170 1175 1180 1170 1175 1180
Ile Asn Ala Trp Ser Thr Lys Glu Pro Phe Ser Trp Ile Lys Val Asp Ile Asn Ala Trp Ser Thr Lys Glu Pro Phe Ser Trp Ile Lys Val Asp
1185 1190 1195 12001185 1190 1195 1200
Leu Leu Ala Pro Met Ile Ile His Gly Ile Lys Thr Gln Gly Ala Arg Leu Leu Ala Pro Met Ile Ile His Gly Ile Lys Thr Gln Gly Ala Arg
1205 1210 1215 1205 1210 1215
Gln Lys Phe Ser Ser Leu Tyr Ile Ser Gln Phe Ile Ile Met Tyr Ser Gln Lys Phe Ser Ser Leu Tyr Ile Ser Gln Phe Ile Ile Met Tyr Ser
1220 1225 1230 1220 1225 1230
Leu Asp Gly Lys Lys Trp Gln Thr Tyr Arg Gly Asn Ser Thr Gly Thr Leu Asp Gly Lys Lys Trp Gln Thr Tyr Arg Gly Asn Ser Thr Gly Thr
1235 1240 1245 1235 1240 1245
Leu Met Val Phe Phe Gly Asn Val Asp Ser Ser Gly Ile Lys His Asn Leu Met Val Phe Phe Gly Asn Val Asp Ser Ser Gly Ile Lys His Asn
1250 1255 1260 1250 1255 1260
Ile Phe Asn Pro Pro Ile Ile Ala Arg Tyr Ile Arg Leu His Pro Thr Ile Phe Asn Pro Pro Ile Ile Ala Arg Tyr Ile Arg Leu His Pro Thr
1265 1270 1275 12801265 1270 1275 1280
His Tyr Ser Ile Arg Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp His Tyr Ser Ile Arg Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp
1285 1290 1295 1285 1290 1295
Leu Asn Ser Cys Ser Met Pro Leu Gly Met Glu Ser Lys Ala Ile Ser Leu Asn Ser Cys Ser Met Pro Leu Gly Met Glu Ser Lys Ala Ile Ser
1300 1305 1310 1300 1305 1310
Asp Ala Gln Ile Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Asp Ala Gln Ile Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr
1315 1320 1325 1315 1320 1325
Trp Ser Pro Ser Lys Ala Arg Leu His Leu Gln Gly Arg Ser Asn Ala Trp Ser Pro Ser Lys Ala Arg Leu His Leu Gln Gly Arg Ser Asn Ala
1330 1335 1340 1330 1335 1340
Trp Arg Pro Gln Val Asn Asn Pro Lys Glu Trp Leu Gln Val Asp Phe Trp Arg Pro Gln Val Asn Asn Pro Lys Glu Trp Leu Gln Val Asp Phe
1345 1350 1355 13601345 1350 1355 1360
Gln Lys Thr Met Lys Val Thr Gly Val Thr Thr Gln Gly Val Lys Ser Gln Lys Thr Met Lys Val Thr Gly Val Thr Thr Gln Gly Val Lys Ser
1365 1370 1375 1365 1370 1375
Leu Leu Thr Ser Met Tyr Val Lys Glu Phe Leu Ile Ser Ser Ser Gln Leu Leu Thr Ser Met Tyr Val Lys Glu Phe Leu Ile Ser Ser Ser Gln
1380 1385 1390 1380 1385 1390
Asp Gly His Gln Trp Thr Leu Phe Phe Gln Asn Gly Lys Val Lys Val Asp Gly His Gln Trp Thr Leu Phe Phe Gln Asn Gly Lys Val Lys Val
1395 1400 1405 1395 1400 1405
Phe Gln Gly Asn Gln Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp Phe Gln Gly Asn Gln Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp
1410 1415 1420 1410 1415 1420
Pro Pro Leu Leu Thr Arg Tyr Leu Arg Ile His Pro Gln Ser Trp Val Pro Pro Leu Leu Thr Arg Tyr Leu Arg Ile His Pro Gln Ser Trp Val
1425 1430 1435 14401425 1430 1435 1440
His Gln Ile Ala Leu Arg Met Glu Val Leu Gly Cys Glu Ala Gln Asp His Gln Ile Ala Leu Arg Met Glu Val Leu Gly Cys Glu Ala Gln Asp
1445 1450 1455 1445 1450 1455
Leu Tyr Leu Tyr
<210> 9<210> 9
<211> 1457<211> 1457
<212> PRT<212>PRT
<213> Искусственная последовательность<213> Artificial sequence
<220> <220>
<223> слитый белок на основе человеческого FVIII-BDD и сигнального<223> fusion protein based on human FVIII-BDD and signaling
пептида Lactalbumin (СП-Lactalbumin-FVIII-BDD) Lactalbumin peptide (SP-Lactalbumin-FVIII-BDD)
<400> 9<400> 9
Met Met Ser Phe Val Ser Leu Leu Leu Val Gly Ile Leu Phe His Ala Met Met Ser Phe Val Ser Leu Leu Leu Val Gly Ile Leu Phe His Ala
1 5 10 15 1 5 10 15
Thr Gln Ala Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser Thr Gln Ala Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser
20 25 30 20 25 30
Trp Asp Tyr Met Gln Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg Trp Asp Tyr Met Gln Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg
35 40 45 35 40 45
Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val
50 55 60 50 55 60
Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn Ile Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn Ile
65 70 75 80 65 70 75 80
Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile Gln Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr Ile Gln
85 90 95 85 90 95
Ala Glu Val Tyr Asp Thr Val Val Ile Thr Leu Lys Asn Met Ala Ser Ala Glu Val Tyr Asp Thr Val Val Ile Thr Leu Lys Asn Met Ala Ser
100 105 110 100 105 110
His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser
115 120 125 115 120 125
Glu Gly Ala Glu Tyr Asp Asp Gln Thr Ser Gln Arg Glu Lys Glu Asp Glu Gly Ala Glu Tyr Asp Asp Gln Thr Ser Gln Arg Glu Lys Glu Asp
130 135 140 130 135 140
Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gln Val Leu Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gln Val Leu
145 150 155 160 145 150 155 160
Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser
165 170 175 165 170 175
Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu Ile Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu Ile
180 185 190 180 185 190
Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr
195 200 205 195 200 205
Gln Thr Leu His Lys Phe Ile Leu Leu Phe Ala Val Phe Asp Glu Gly Gln Thr Leu His Lys Phe Ile Leu Leu Phe Ala Val Phe Asp Glu Gly
210 215 220 210 215 220
Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gln Asp Arg Asp Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gln Asp Arg Asp
225 230 235 240 225 230 235 240
Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr
245 250 255 245 250 255
Val Asn Arg Ser Leu Pro Gly Leu Ile Gly Cys His Arg Lys Ser Val Val Asn Arg Ser Leu Pro Gly Leu Ile Gly Cys His Arg Lys Ser Val
260 265 270 260 265 270
Tyr Trp His Val Ile Gly Met Gly Thr Thr Pro Glu Val His Ser Ile Tyr Trp His Val Ile Gly Met Gly Thr Thr Pro Glu Val His Ser Ile
275 280 285 275 280 285
Phe Leu Glu Gly His Thr Phe Leu Val Arg Asn His Arg Gln Ala Ser Phe Leu Glu Gly His Thr Phe Leu Val Arg Asn His Arg Gln Ala Ser
290 295 300 290 295 300
Leu Glu Ile Ser Pro Ile Thr Phe Leu Thr Ala Gln Thr Leu Leu Met Leu Glu Ile Ser Pro Ile Thr Phe Leu Thr Ala Gln Thr Leu Leu Met
305 310 315 320 305 310 315 320
Asp Leu Gly Gln Phe Leu Leu Phe Cys His Ile Ser Ser His Gln His Asp Leu Gly Gln Phe Leu Leu Phe Cys His Ile Ser Ser His Gln His
325 330 335 325 330 335
Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro
340 345 350 340 345 350
Gln Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp Gln Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp
355 360 365 355 360 365
Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser
370 375 380 370 375 380
Pro Ser Phe Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Thr Pro Ser Phe Ile Gln Ile Arg Ser Val Ala Lys Lys His Pro Lys Thr
385 390 395 400 385 390 395 400
Trp Val His Tyr Ile Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro Trp Val His Tyr Ile Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro
405 410 415 405 410 415
Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gln Tyr Leu Asn Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gln Tyr Leu Asn
420 425 430 420 425 430
Asn Gly Pro Gln Arg Ile Gly Arg Lys Tyr Lys Lys Val Arg Phe Met Asn Gly Pro Gln Arg Ile Gly Arg Lys Tyr Lys Lys Val Arg Phe Met
435 440 445 435 440 445
Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala Ile Gln His Glu Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala Ile Gln His Glu
450 455 460 450 455 460
Ser Gly Ile Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu Ser Gly Ile Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu
465 470 475 480 465 470 475 480
Leu Ile Ile Phe Lys Asn Gln Ala Ser Arg Pro Tyr Asn Ile Tyr Pro Leu Ile Ile Phe Lys Asn Gln Ala Ser Arg Pro Tyr Asn Ile Tyr Pro
485 490 495 485 490 495
His Gly Ile Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys His Gly Ile Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys
500 505 510 500 505 510
Gly Val Lys His Leu Lys Asp Phe Pro Ile Leu Pro Gly Glu Ile Phe Gly Val Lys His Leu Lys Asp Phe Pro Ile Leu Pro Gly Glu Ile Phe
515 520 525 515 520 525
Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp
530 535 540 530 535 540
Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg
545 550 555 560 545 550 555 560
Asp Leu Ala Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Glu Asp Leu Ala Ser Gly Leu Ile Gly Pro Leu Leu Ile Cys Tyr Lys Glu
565 570 575 565 570 575
Ser Val Asp Gln Arg Gly Asn Gln Ile Met Ser Asp Lys Arg Asn Val Ser Val Asp Gln Arg Gly Asn Gln Ile Met Ser Asp Lys Arg Asn Val
580 585 590 580 585 590
Ile Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu Ile Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu
595 600 605 595 600 605
Asn Ile Gln Arg Phe Leu Pro Asn Pro Ala Gly Val Gln Leu Glu Asp Asn Ile Gln Arg Phe Leu Pro Asn Pro Ala Gly Val Gln Leu Glu Asp
610 615 620 610 615 620
Pro Glu Phe Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Val Pro Glu Phe Gln Ala Ser Asn Ile Met His Ser Ile Asn Gly Tyr Val
625 630 635 640 625 630 635 640
Phe Asp Ser Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp Phe Asp Ser Leu Gln Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp
645 650 655 645 650 655
Tyr Ile Leu Ser Ile Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Phe Tyr Ile Leu Ser Ile Gly Ala Gln Thr Asp Phe Leu Ser Val Phe Phe
660 665 670 660 665 670
Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr
675 680 685 675 680 685
Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro
690 695 700 690 695 700
Gly Leu Trp Ile Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly Gly Leu Trp Ile Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly
705 710 715 720 705 710 715 720
Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp
725 730 735 725 730 735
Tyr Tyr Glu Asp Ser Tyr Glu Asp Ile Ser Ala Tyr Leu Leu Ser Lys Tyr Tyr Glu Asp Ser Tyr Glu Asp Ile Ser Ala Tyr Leu Leu Ser Lys
740 745 750 740 745 750
Asn Asn Ala Ile Glu Pro Arg Ser Phe Ser Gln Asn Pro Pro Val Leu Asn Asn Ala Ile Glu Pro Arg Ser Phe Ser Gln Asn Pro Pro Val Leu
755 760 765 755 760 765
Lys Arg His Gln Arg Glu Ile Thr Arg Thr Thr Leu Gln Ser Asp Gln Lys Arg His Gln Arg Glu Ile Thr Arg Thr Thr Leu Gln Ser Asp Gln
770 775 780 770 775 780
Glu Glu Ile Asp Tyr Asp Asp Thr Ile Ser Val Glu Met Lys Lys Glu Glu Glu Ile Asp Tyr Asp Asp Thr Ile Ser Val Glu Met Lys Lys Glu
785 790 795 800 785 790 795 800
Asp Phe Asp Ile Tyr Asp Glu Asp Glu Asn Gln Ser Pro Arg Ser Phe Asp Phe Asp Ile Tyr Asp Glu Asp Glu Asn Gln Ser Pro Arg Ser Phe
805 810 815 805 810 815
Gln Lys Lys Thr Arg His Tyr Phe Ile Ala Ala Val Glu Arg Leu Trp Gln Lys Lys Thr Arg His Tyr Phe Ile Ala Ala Val Glu Arg Leu Trp
820 825 830 820 825 830
Asp Tyr Gly Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gln Asp Tyr Gly Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gln
835 840 845 835 840 845
Ser Gly Ser Val Pro Gln Phe Lys Lys Val Val Phe Gln Glu Phe Thr Ser Gly Ser Val Pro Gln Phe Lys Lys Val Val Phe Gln Glu Phe Thr
850 855 860 850 855 860
Asp Gly Ser Phe Thr Gln Pro Leu Tyr Arg Gly Glu Leu Asn Glu His Asp Gly Ser Phe Thr Gln Pro Leu Tyr Arg Gly Glu Leu Asn Glu His
865 870 875 880 865 870 875 880
Leu Gly Leu Leu Gly Pro Tyr Ile Arg Ala Glu Val Glu Asp Asn Ile Leu Gly Leu Leu Gly Pro Tyr Ile Arg Ala Glu Val Glu Asp Asn Ile
885 890 895 885 890 895
Met Val Thr Phe Arg Asn Gln Ala Ser Arg Pro Tyr Ser Phe Tyr Ser Met Val Thr Phe Arg Asn Gln Ala Ser Arg Pro Tyr Ser Phe Tyr Ser
900 905 910 900 905 910
Ser Leu Ile Ser Tyr Glu Glu Asp Gln Arg Gln Gly Ala Glu Pro Arg Ser Leu Ile Ser Tyr Glu Glu Asp Gln Arg Gln Gly Ala Glu Pro Arg
915 920 925 915 920 925
Lys Asn Phe Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Val Lys Asn Phe Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Val
930 935 940 930 935 940
Gln His His Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp Gln His His Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp
945 950 955 960 945 950 955 960
Ala Tyr Phe Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly Leu Ala Tyr Phe Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly Leu
965 970 975 965 970 975
Ile Gly Pro Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala His Ile Gly Pro Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala His
980 985 990 980 985 990
Gly Arg Gln Val Thr Val Gln Glu Phe Ala Leu Phe Phe Thr Ile Phe Gly Arg Gln Val Thr Val Gln Glu Phe Ala Leu Phe Phe Thr Ile Phe
995 1000 1005 995 1000 1005
Asp Glu Thr Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys Asp Glu Thr Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys
1010 1015 1020 1010 1015 1020
Arg Ala Pro Cys Asn Ile Gln Met Glu Asp Pro Thr Phe Lys Glu Asn Arg Ala Pro Cys Asn Ile Gln Met Glu Asp Pro Thr Phe Lys Glu Asn
1025 1030 1035 10401025 1030 1035 1040
Tyr Arg Phe His Ala Ile Asn Gly Tyr Ile Met Asp Thr Leu Pro Gly Tyr Arg Phe His Ala Ile Asn Gly Tyr Ile Met Asp Thr Leu Pro Gly
1045 1050 1055 1045 1050 1055
Leu Val Met Ala Gln Asp Gln Arg Ile Arg Trp Tyr Leu Leu Ser Met Leu Val Met Ala Gln Asp Gln Arg Ile Arg Trp Tyr Leu Leu Ser Met
1060 1065 1070 1060 1065 1070
Gly Ser Asn Glu Asn Ile His Ser Ile His Phe Ser Gly His Val Phe Gly Ser Asn Glu Asn Ile His Ser Ile His Phe Ser Gly His Val Phe
1075 1080 1085 1075 1080 1085
Thr Val Arg Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr Thr Val Arg Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr
1090 1095 1100 1090 1095 1100
Pro Gly Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly Ile Pro Gly Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly Ile
1105 1110 1115 11201105 1110 1115 1120
Trp Arg Val Glu Cys Leu Ile Gly Glu His Leu His Ala Gly Met Ser Trp Arg Val Glu Cys Leu Ile Gly Gly His Leu His Ala Gly Met Ser
1125 1130 1135 1125 1130 1135
Thr Leu Phe Leu Val Tyr Ser Asn Lys Cys Gln Thr Pro Leu Gly Met Thr Leu Phe Leu Val Tyr Ser Asn Lys Cys Gln Thr Pro Leu Gly Met
1140 1145 1150 1140 1145 1150
Ala Ser Gly His Ile Arg Asp Phe Gln Ile Thr Ala Ser Gly Gln Tyr Ala Ser Gly His Ile Arg Asp Phe Gln Ile Thr Ala Ser Gly Gln Tyr
1155 1160 1165 1155 1160 1165
Gly Gln Trp Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser Ile Gly Gln Trp Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser Ile
1170 1175 1180 1170 1175 1180
Asn Ala Trp Ser Thr Lys Glu Pro Phe Ser Trp Ile Lys Val Asp Leu Asn Ala Trp Ser Thr Lys Glu Pro Phe Ser Trp Ile Lys Val Asp Leu
1185 1190 1195 12001185 1190 1195 1200
Leu Ala Pro Met Ile Ile His Gly Ile Lys Thr Gln Gly Ala Arg Gln Leu Ala Pro Met Ile Ile His Gly Ile Lys Thr Gln Gly Ala Arg Gln
1205 1210 1215 1205 1210 1215
Lys Phe Ser Ser Leu Tyr Ile Ser Gln Phe Ile Ile Met Tyr Ser Leu Lys Phe Ser Ser Leu Tyr Ile Ser Gln Phe Ile Ile Met Tyr Ser Leu
1220 1225 1230 1220 1225 1230
Asp Gly Lys Lys Trp Gln Thr Tyr Arg Gly Asn Ser Thr Gly Thr Leu Asp Gly Lys Lys Trp Gln Thr Tyr Arg Gly Asn Ser Thr Gly Thr Leu
1235 1240 1245 1235 1240 1245
Met Val Phe Phe Gly Asn Val Asp Ser Ser Gly Ile Lys His Asn Ile Met Val Phe Phe Gly Asn Val Asp Ser Ser Gly Ile Lys His Asn Ile
1250 1255 1260 1250 1255 1260
Phe Asn Pro Pro Ile Ile Ala Arg Tyr Ile Arg Leu His Pro Thr His Phe Asn Pro Pro Ile Ile Ala Arg Tyr Ile Arg Leu His Pro Thr His
1265 1270 1275 12801265 1270 1275 1280
Tyr Ser Ile Arg Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu Tyr Ser Ile Arg Ser Thr Leu Arg Met Glu Leu Met Gly Cys Asp Leu
1285 1290 1295 1285 1290 1295
Asn Ser Cys Ser Met Pro Leu Gly Met Glu Ser Lys Ala Ile Ser Asp Asn Ser Cys Ser Met Pro Leu Gly Met Glu Ser Lys Ala Ile Ser Asp
1300 1305 1310 1300 1305 1310
Ala Gln Ile Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp Ala Gln Ile Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp
1315 1320 1325 1315 1320 1325
Ser Pro Ser Lys Ala Arg Leu His Leu Gln Gly Arg Ser Asn Ala Trp Ser Pro Ser Lys Ala Arg Leu His Leu Gln Gly Arg Ser Asn Ala Trp
1330 1335 1340 1330 1335 1340
Arg Pro Gln Val Asn Asn Pro Lys Glu Trp Leu Gln Val Asp Phe Gln Arg Pro Gln Val Asn Asn Pro Lys Glu Trp Leu Gln Val Asp Phe Gln
1345 1350 1355 13601345 1350 1355 1360
Lys Thr Met Lys Val Thr Gly Val Thr Thr Gln Gly Val Lys Ser Leu Lys Thr Met Lys Val Thr Gly Val Thr Thr Gln Gly Val Lys Ser Leu
1365 1370 1375 1365 1370 1375
Leu Thr Ser Met Tyr Val Lys Glu Phe Leu Ile Ser Ser Ser Gln Asp Leu Thr Ser Met Tyr Val Lys Glu Phe Leu Ile Ser Ser Ser Gln Asp
1380 1385 1390 1380 1385 1390
Gly His Gln Trp Thr Leu Phe Phe Gln Asn Gly Lys Val Lys Val Phe Gly His Gln Trp Thr Leu Phe Phe Gln Asn Gly Lys Val Lys Val Phe
1395 1400 1405 1395 1400 1405
Gln Gly Asn Gln Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro Gln Gly Asn Gln Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro
1410 1415 1420 1410 1415 1420
Pro Leu Leu Thr Arg Tyr Leu Arg Ile His Pro Gln Ser Trp Val His Pro Leu Leu Thr Arg Tyr Leu Arg Ile His Pro Gln Ser Trp Val His
1425 1430 1435 14401425 1430 1435 1440
Gln Ile Ala Leu Arg Met Glu Val Leu Gly Cys Glu Ala Gln Asp Leu Gln Ile Ala Leu Arg Met Glu Val Leu Gly Cys Glu Ala Gln Asp Leu
1445 1450 1455 1445 1450 1455
Tyr Tyr
<210> 10<210> 10
<211> 4374<211> 4374
<212> DNA<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220> <220>
<223> Нуклеиновая кислота, которая кодирует белок FVIII-BDD с<223> Nucleic acid that encodes the FVIII-BDD protein with
сигнальным дикого типа (СП-FVIII-FVIII-BDD) signal wild type (SP-FVIII-FVIII-BDD)
<400> 10<400> 10
atgcaaatag agctctccac ctgcttcttt ctgtgccttt tgcgattctg ctttagtgcc 60atgcaaatag agctctccac ctgcttcttt ctgtgccttt tgcgattctg ctttagtgcc 60
accagaagat actacctggg tgcagtggaa ctgtcatggg actatatgca aagtgatctc 120accagaagat actacctggg tgcagtggaa ctgtcatggg actatatgca aagtgatctc 120
ggtgagctgc ctgtggacgc aagatttcct cctagagtgc caaaatcttt tccattcaac 180ggtgagctgc ctgtggacgc aagatttcct cctagagtgc caaaatcttt tccattcaac 180
acctcagtcg tgtacaaaaa gactctgttt gtagaattca cggatcacct tttcaacatc 240acctcagtcg tgtacaaaaa gactctgttt gtagaattca cggatcacct tttcaacatc 240
gctaagccaa ggccaccctg gatgggtctg ctaggtccta ccatccaggc tgaggtttat 300gctaagccaa ggccaccctg gatgggtctg ctaggtccta ccatccaggc tgaggtttat 300
gatacagtgg tcattacact taagaacatg gcttcccatc ctgtcagtct tcatgctgtt 360gatacagtgg tcattacact taagaacatg gcttcccatc ctgtcagtct tcatgctgtt 360
ggtgtatcct actggaaagc ttctgaggga gctgaatatg atgatcagac cagtcaaagg 420ggtgtatcct actggaaagc ttctgaggga gctgaatatg atgatcagac cagtcaaagg 420
gagaaagaag atgataaagt cttccctggt ggaagccata catatgtctg gcaggtcctg 480gagaaagaag atgataaagt cttccctggt ggaagccata catatgtctg gcaggtcctg 480
aaagagaatg gtccaatggc ctctgaccca ctgtgcctta cctactcata tctttctcat 540aaagagaatg gtccaatggc ctctgaccca ctgtgcctta cctactcata tctttctcat 540
gtggacctgg taaaagactt gaattcaggc ctcattggag ccctactagt atgtagagaa 600gtggacctgg taaaagactt gaattcaggc ctcattggag ccctactagt atgtagagaa 600
gggagtctgg ccaaggaaaa gacacagacc ttgcacaaat ttatactact ttttgctgta 660gggagtctgg ccaaggaaaa gacacagacc ttgcacaaat ttatactact ttttgctgta 660
tttgatgaag ggaaaagttg gcactcagaa acaaagaact ccttgatgca ggatagggat 720tttgatgaag ggaaaagttg gcactcagaa acaaagaact ccttgatgca ggatagggat 720
gctgcatctg ctcgggcctg gcctaaaatg cacacagtca atggttatgt aaacaggtct 780gctgcatctg ctcggggcctg gcctaaaatg cacacagtca atggttatgt aaacaggtct 780
ctgccaggtc tgattggatg ccacaggaaa tcagtctatt ggcatgtgat tggaatgggc 840ctgccaggtc tgattggatg ccacaggaaa tcagtctatt ggcatgtgat tggaatgggc 840
accactcctg aagtgcactc aatattcctc gaaggtcaca catttcttgt gaggaaccat 900accactcctg aagtgcactc aatattcctc gaaggtcaca catttcttgt gaggaaccat 900
cgccaggcgt ccttggaaat ctcgccaata actttcctta ctgctcaaac actcttgatg 960cgccaggcgt ccttggaaat ctcgccaata actttcctta ctgctcaaac actcttgatg 960
gaccttggac agtttctact gttttgtcat atctcttccc accaacatga tggcatggaa 1020gaccttggac agtttctact gttttgtcat atctcttccc accaacatga tggcatggaa 1020
gcttatgtca aagtagacag ctgtccagag gaaccccaac tacgaatgaa aaataatgaa 1080gcttatgtca aagtagacag ctgtccagag gaaccccaac tacgaatgaa aaataatgaa 1080
gaagcggaag actatgatga tgatcttact gattctgaaa tggatgtggt caggtttgat 1140gaagcggaag actatgatga tgatcttact gattctgaaa tggatgtggt caggtttgat 1140
gatgacaact ctccttcctt tatccaaatt cgctcagttg ccaagaagca tcctaaaact 1200gatgacaact ctccttcctt tatccaaatt cgctcagttg ccaagaagca tcctaaaact 1200
tgggtacatt acattgctgc tgaagaggag gactgggact atgctccctt agtcctcgcc 1260tgggtacatt acattgctgc tgaagaggag gactgggact atgctccctt agtcctcgcc 1260
cccgatgaca gaagttataa aagtcaatat ttgaacaatg gccctcagcg gattggtagg 1320cccgatgaca gaagttataa aagtcaatat ttgaacaatg gccctcagcg gattggtagg 1320
aagtacaaaa aagtccgatt tatggcatac acagatgaaa cctttaagac tcgtgaagct 1380aagtacaaaa aagtccgatt tatggcatac acagatgaaa cctttaagac tcgtgaagct 1380
attcagcatg aatcaggaat cttgggacct ttactttatg gggaagttgg agacacactg 1440attcagcatg aatcaggaat cttgggacct ttactttatg gggaagttgg agacacactg 1440
ttgattatat ttaagaatca agcaagcaga ccatataaca tctaccctca cggaatcact 1500ttgattatat ttaagaatca agcaagcaga ccatataaca tctaccctca cggaatcact 1500
gatgtccgtc ctttgtattc aaggagatta ccaaaaggtg taaaacattt gaaggatttt 1560gatgtccgtc ctttgtattc aaggatta ccaaaaggtg taaaacattt gaaggatttt 1560
ccaattctgc caggagaaat attcaaatat aaatggacag tgactgtaga agatgggcca 1620ccaattctgc caggagaaat attcaaatat aaatggacag tgactgtaga agatgggcca 1620
actaaatcag atcctcggtg cctgacccgc tattactcta gtttcgttaa tatggagaga 1680actaaatcag atcctcggtg cctgacccgc tattactcta gtttcgttaa tatggagaga 1680
gatctagctt caggactcat tggccctctc ctcatctgct acaaagaatc tgtagatcaa 1740gatctagctt caggactcat tggccctctc ctcatctgct acaaagaatc tgtagatcaa 1740
agaggaaacc agataatgtc agacaagagg aatgtcatcc tgttttctgt atttgatgag 1800agaggaaacc agataatgtc agacaagagg aatgtcatcc tgttttctgt atttgatgag 1800
aaccgaagct ggtacctcac agagaatata caacgctttc tccccaatcc agctggagtg 1860aaccgaagct ggtacctcac agagaatata caacgctttc tccccaatcc agctggagtg 1860
cagcttgagg atccagagtt ccaagcctcc aacatcatgc acagcatcaa tggctatgtt 1920cagcttgagg atccagagtt ccaagcctcc aacatcatgc acagcatcaa tggctatgtt 1920
tttgatagtt tgcagttgtc agtttgtttg catgaggtgg catactggta cattctaagc 1980tttgatagtt tgcagttgtc agtttgtttg catgaggtgg catactggta cattctaagc 1980
attggagcac agactgactt cctttctgtc ttcttctctg gatatacctt caaacacaaa 2040attggagcac agactgactt cctttctgtc ttcttctctg gatatacctt caaacacaaa 2040
atggtctatg aagacacact caccctattc ccattctcag gagaaactgt cttcatgtcg 2100atggtctatg aagacacact caccctattc ccattctcag gagaaactgt cttcatgtcg 2100
atggaaaacc caggtctatg gattctgggg tgccacaact cagactttcg gaacagaggc 2160atggaaaacc caggtctatg gattctgggg tgccacaact cagactttcg gaacagaggc 2160
atgaccgcct tactgaaggt ttctagttgt gacaagaaca ctggtgatta ttacgaggac 2220atgaccgcct tactgaaggt ttctagttgt gacaagaaca ctggtgatta ttacgaggac 2220
agttatgaag atatttcagc atacttgctg agtaaaaaca atgccattga accaagaagc 2280agttatgaag atatttcagc atacttgctg agtaaaaaca atgccattga accaagaagc 2280
ttctctcaaa acccaccagt cttgaaacgc catcaacggg aaataactcg tactactctt 2340ttctctcaaa acccaccagt cttgaaacgc catcaacggg aaataactcg tactactctt 2340
cagtcagatc aagaggaaat tgactatgat gataccatat cagttgaaat gaagaaggaa 2400cagtcagatc aagaggaaat tgactatgat gataccatat cagttgaaat gaagaaggaa 2400
gattttgaca tttatgatga ggatgaaaat cagagccccc gcagctttca aaagaaaaca 2460gattttgaca tttatgatga ggatgaaaat cagagccccc gcagctttca aaagaaaaca 2460
cgacactatt ttattgctgc agtggagagg ctctgggatt atgggatgag tagctcccca 2520cgacactatt ttattgctgc agtggagagg ctctgggatt atgggatgag tagctcccca 2520
catgttctaa gaaacagggc tcagagtggc agtgtccctc agttcaagaa agttgttttc 2580catgttctaa gaaacagggc tcagagtggc agtgtccctc agttcaagaa agttgttttc 2580
caggaattta ctgatggctc ctttactcag cccttatacc gtggagaact aaatgaacat 2640caggaattta ctgatggctc ctttactcag cccttatacc gtggagaact aaatgaacat 2640
ttgggactcc tggggccata tataagagca gaagttgaag ataatatcat ggtaactttc 2700ttgggactcc tggggccata tataagagca gaagttgaag ataatatcat ggtaactttc 2700
agaaatcagg cctctcgtcc ctattccttc tattctagcc ttatttctta tgaggaagat 2760agaaatcagg cctctcgtcc ctattccttc tattctagcc ttatttctta tgaggaagat 2760
cagaggcaag gagcagaacc tagaaaaaac tttgtcaagc ctaatgaaac caaaacttac 2820cagaggcaag gagcagaacc tagaaaaaac tttgtcaagc ctaatgaaac caaaacttac 2820
ttttggaaag tgcaacatca tatggcaccc actaaagatg agtttgactg caaagcctgg 2880ttttggaaag tgcaacatca tatggcaccc actaaagatg agtttgactg caaagcctgg 2880
gcttatttct ctgatgttga cctggaaaaa gatgtgcact caggcctgat tggacccctt 2940gcttatttct ctgatgttga cctggaaaaa gatgtgcact caggcctgat tggacccctt 2940
ctggtctgcc acactaacac actgaaccct gctcatggga gacaagtgac agtacaggaa 3000ctggtctgcc acactaacac actgaaccct gctcatggga gacaagtgac agtacaggaa 3000
tttgctctgt ttttcaccat ctttgatgag accaaaagct ggtacttcac tgaaaatatg 3060tttgctctgt ttttcaccat ctttgatgag accaaaagct ggtacttcac tgaaaatatg 3060
gaaagaaact gcagggctcc ctgcaatatc cagatggaag atcccacttt taaagagaat 3120gaaagaaact gcagggctcc ctgcaatatc cagatggaag atcccacttt taaagagaat 3120
tatcgcttcc atgcaatcaa tggctacata atggatacac tacctggctt agtaatggct 3180tatcgcttcc atgcaatcaa tggctacata atggatacac tacctggctt agtaatggct 3180
caggatcaaa ggattcgatg gtatctgctc agcatgggca gcaatgaaaa catccattct 3240caggatcaaa ggattcgatg gtatctgctc agcatgggca gcaatgaaaa catccattct 3240
attcatttca gtggacatgt gttcactgta cgaaaaaaag aggagtataa aatggcactg 3300attcatttca gtggacatgt gttcactgta cgaaaaaaag aggagtataa aatggcactg 3300
tacaatctct atccaggtgt ttttgagaca gtggaaatgt taccatccaa agctggaatt 3360tacaatctct atccaggtgt ttttgagaca gtggaaatgt taccatccaa agctggaatt 3360
tggcgggtgg aatgccttat tggcgagcat ctacatgctg ggatgagcac actttttctg 3420tggcgggtgg aatgccttat tggcgagcat ctacatgctg ggatgagcac actttttctg 3420
gtgtacagca ataagtgtca gactcccctg ggaatggctt ctggacacat tagagatttt 3480gtgtacagca ataagtgtca gactcccctg ggaatggctt ctggacacat tagagatttt 3480
cagattacag cttcaggaca atatggacag tgggccccaa agctggccag acttcattat 3540cagattacag cttcaggaca atatggacag tgggccccaa agctggccag acttcattat 3540
tccggatcaa tcaatgcctg gagcaccaag gagccctttt cttggatcaa ggtggatctg 3600tccggatcaa tcaatgcctg gagcaccaag gagccctttt cttggatcaa ggtggatctg 3600
ttggcaccaa tgattattca cggcatcaag acccagggtg cccgtcagaa gttctccagc 3660ttggcaccaa tgattattca cggcatcaag acccagggtg cccgtcagaa gttctccagc 3660
ctctacatct ctcagtttat catcatgtat agtcttgatg ggaagaagtg gcagacttat 3720ctctacatct ctcagtttat catcatgtat agtcttgatg ggaagaagtg gcagacttat 3720
cgaggaaatt ccactggaac cttaatggtc ttctttggca atgtggattc atctgggata 3780cgaggaaatt ccactggaac cttaatggtc ttctttggca atgtggattc atctgggata 3780
aaacacaata tttttaaccc tccaattatt gctcgataca tccgtttgca cccaactcat 3840aaacacaata tttttaaccc tccaattatt gctcgataca tccgtttgca cccaactcat 3840
tatagcattc gcagcactct tcgcatggag ttgatgggct gtgatttaaa tagttgcagc 3900tatagcattc gcagcactct tcgcatggag ttgatgggct gtgatttaaa tagttgcagc 3900
atgccattgg gaatggagag taaagcaata tcagatgcac agattactgc ttcatcctac 3960atgccattgg gaatggagag taaagcaata tcagatgcac agattactgc ttcatcctac 3960
tttaccaata tgtttgccac ctggtctcct tcaaaagctc gacttcacct ccaagggagg 4020tttaccaata tgtttgccac ctggtctcct tcaaaagctc gacttcacct ccaagggagg 4020
agtaatgcct ggagacctca ggtgaataat ccaaaagagt ggctgcaagt ggacttccag 4080agtaatgcct ggagaacctca ggtgaataat ccaaaagagt ggctgcaagt ggacttccag 4080
aagacaatga aagtcacagg agtaactact cagggagtaa aatctctgct taccagcatg 4140aagacaatga aagtcacagg agtaactact cagggagtaa aatctctgct taccagcatg 4140
tatgtgaagg agttcctcat ctccagcagt caagatggcc atcagtggac tctctttttt 4200tatgtgaagg agttcctcat ctccagcagt caagatggcc atcagtggac tctctttttt 4200
cagaatggca aagtaaaggt ttttcaggga aatcaagact ccttcacacc tgtggtgaac 4260cagaatggca aagtaaaggt ttttcaggga aatcaagact ccttcacacc tgtggtgaac 4260
tctctagacc caccgttact gactcgctac cttcgaattc acccccagag ttgggtgcac 4320tctctagacc caccgttact gactcgctac cttcgaattc acccccagag ttgggtgcac 4320
cagattgccc tgaggatgga ggttctgggc tgcgaggcac aggacctcta ctga 4374cagattgccc tgaggatgga ggttctgggc tgcgaggcac aggacctcta ctga 4374
<210> 11<210> 11
<211> 4401<211> 4401
<212> DNA<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220> <220>
<223> Нуклеиновая кислота, которая кодирует слитый белок на основе<223> Nucleic acid that encodes a fusion protein based on
человеческого FVIII-BDD и сигнального пептида FIX human FVIII-BDD and FIX signal peptide
(СП-FIX-FVIII-BDD) (SP-FIX-FVIII-BDD)
<400> 11<400> 11
atgcagcgcg tgaacatgat catggcagaa tcaccaggcc tcatcaccat ctgcctttta 60atgcagcgcg tgaacatgat catggcagaa tcaccaggcc tcatcaccat ctgcctttta 60
ggatatctac tcagtgctga atgtgccacc agaagatact acctgggtgc agtggaactg 120ggatatctac tcagtgctga atgtgccacc agaagatact acctgggtgc agtggaactg 120
tcatgggact atatgcaaag tgatctcggt gagctgcctg tggacgcaag atttcctcct 180tcatgggact atatgcaaag tgatctcggt gagctgcctg tggacgcaag atttcctcct 180
agagtgccaa aatcttttcc attcaacacc tcagtcgtgt acaaaaagac tctgtttgta 240agagtgccaa aatcttttcc attcaacacc tcagtcgtgt acaaaaagac tctgtttgta 240
gaattcacgg atcacctttt caacatcgct aagccaaggc caccctggat gggtctgcta 300gaattcacgg atcacctttt caacatcgct aagccaaggc caccctggat gggtctgcta 300
ggtcctacca tccaggctga ggtttatgat acagtggtca ttacacttaa gaacatggct 360ggtcctacca tccaggctga ggtttatgat acagtggtca ttacacttaa gaacatggct 360
tcccatcctg tcagtcttca tgctgttggt gtatcctact ggaaagcttc tgagggagct 420tcccatcctg tcagtcttca tgctgttggt gtatcctact ggaaagcttc tgagggagct 420
gaatatgatg atcagaccag tcaaagggag aaagaagatg ataaagtctt ccctggtgga 480gaatatgatg atcagaccag tcaaagggag aaagaagatg ataaagtctt ccctggtgga 480
agccatacat atgtctggca ggtcctgaaa gagaatggtc caatggcctc tgacccactg 540agccatacat atgtctggca ggtcctgaaa gagaatggtc caatggcctc tgacccactg 540
tgccttacct actcatatct ttctcatgtg gacctggtaa aagacttgaa ttcaggcctc 600tgccttacct actcatatct ttctcatgtg gacctggtaa aagacttgaa ttcaggcctc 600
attggagccc tactagtatg tagagaaggg agtctggcca aggaaaagac acagaccttg 660attggagccc tactagtatg tagagaaggg agtctggcca aggaaaagac acagaccttg 660
cacaaattta tactactttt tgctgtattt gatgaaggga aaagttggca ctcagaaaca 720cacaaattta tactactttt tgctgtattt gatgaaggga aaagttggca ctcagaaaca 720
aagaactcct tgatgcagga tagggatgct gcatctgctc gggcctggcc taaaatgcac 780aagaactcct tgatgcagga tagggatgct gcatctgctc gggcctggcc taaaatgcac 780
acagtcaatg gttatgtaaa caggtctctg ccaggtctga ttggatgcca caggaaatca 840acagtcaatg gttatgtaaa caggtctctg ccaggtctga ttggatgcca caggaaatca 840
gtctattggc atgtgattgg aatgggcacc actcctgaag tgcactcaat attcctcgaa 900gtctattggc atgtgattgg aatgggcacc actcctgaag tgcactcaat attcctcgaa 900
ggtcacacat ttcttgtgag gaaccatcgc caggcgtcct tggaaatctc gccaataact 960ggtcacacat ttcttgtgag gaaccatcgc caggcgtcct tggaaatctc gccaataact 960
ttccttactg ctcaaacact cttgatggac cttggacagt ttctactgtt ttgtcatatc 1020ttccttactg ctcaaacact cttgatggac cttggacagt ttctactgtt ttgtcatatc 1020
tcttcccacc aacatgatgg catggaagct tatgtcaaag tagacagctg tccagaggaa 1080tcttcccacc aacatgatgg catggaagct tatgtcaaag tagacagctg tccagaggaa 1080
ccccaactac gaatgaaaaa taatgaagaa gcggaagact atgatgatga tcttactgat 1140ccccaactac gaatgaaaaa taatgaagaa gcggaagact atgatgatga tcttactgat 1140
tctgaaatgg atgtggtcag gtttgatgat gacaactctc cttcctttat ccaaattcgc 1200tctgaaatgg atgtggtcag gtttgatgat gacaactctc cttcctttat ccaaattcgc 1200
tcagttgcca agaagcatcc taaaacttgg gtacattaca ttgctgctga agaggaggac 1260tcagttgcca agaagcatcc taaaacttgg gtacattaca ttgctgctga agaggaggac 1260
tgggactatg ctcccttagt cctcgccccc gatgacagaa gttataaaag tcaatatttg 1320tgggactatg ctcccttagt cctcgccccc gatgacagaa gttataaaag tcaatatttg 1320
aacaatggcc ctcagcggat tggtaggaag tacaaaaaag tccgatttat ggcatacaca 1380aacaatggcc ctcagcggat tggtaggaag tacaaaaaag tccgatttat ggcatacaca 1380
gatgaaacct ttaagactcg tgaagctatt cagcatgaat caggaatctt gggaccttta 1440gatgaaacct ttaagactcg tgaagctatt cagcatgaat caggaatctt gggaccttta 1440
ctttatgggg aagttggaga cacactgttg attatattta agaatcaagc aagcagacca 1500ctttatgggg aagttggaga cacactgttg attatattta agaatcaagc aagcagacca 1500
tataacatct accctcacgg aatcactgat gtccgtcctt tgtattcaag gagattacca 1560tataacatct accctcacgg aatcactgat gtccgtcctt tgtattcaag gagattacca 1560
aaaggtgtaa aacatttgaa ggattttcca attctgccag gagaaatatt caaatataaa 1620aaaggtgtaa aacatttgaa ggattttcca attctgccag gagaaatatt caaatataaa 1620
tggacagtga ctgtagaaga tgggccaact aaatcagatc ctcggtgcct gacccgctat 1680tggacagtga ctgtagaaga tgggccaact aaatcagatc ctcggtgcct gacccgctat 1680
tactctagtt tcgttaatat ggagagagat ctagcttcag gactcattgg ccctctcctc 1740tactctagtt tcgttaatat ggagagagat ctagcttcag gactcattgg ccctctcctc 1740
atctgctaca aagaatctgt agatcaaaga ggaaaccaga taatgtcaga caagaggaat 1800atctgctaca aagaatctgt agatcaaaga ggaaaccaga taatgtcaga caagaggaat 1800
gtcatcctgt tttctgtatt tgatgagaac cgaagctggt acctcacaga gaatatacaa 1860gtcatcctgt tttctgtatt tgatgagaac cgaagctggt acctcacaga gaatatacaa 1860
cgctttctcc ccaatccagc tggagtgcag cttgaggatc cagagttcca agcctccaac 1920cgctttctcc ccaatccagc tggagtgcag cttgaggatc cagagttcca agcctccaac 1920
atcatgcaca gcatcaatgg ctatgttttt gatagtttgc agttgtcagt ttgtttgcat 1980atcatgcaca gcatcaatgg ctatgttttt gatagtttgc agttgtcagt ttgtttgcat 1980
gaggtggcat actggtacat tctaagcatt ggagcacaga ctgacttcct ttctgtcttc 2040gaggtggcat actggtacat tctaagcatt ggagcacaga ctgacttcct ttctgtcttc 2040
ttctctggat ataccttcaa acacaaaatg gtctatgaag acacactcac cctattccca 2100ttctctggat ataccttcaa acacaaaatg gtctatgaag acacactcac cctattccca 2100
ttctcaggag aaactgtctt catgtcgatg gaaaacccag gtctatggat tctggggtgc 2160ttctcaggag aaactgtctt catgtcgatg gaaaacccag gtctatggat tctggggtgc 2160
cacaactcag actttcggaa cagaggcatg accgccttac tgaaggtttc tagttgtgac 2220cacaactcag actttcggaa cagaggcatg accgccttac tgaaggtttc tagttgtgac 2220
aagaacactg gtgattatta cgaggacagt tatgaagata tttcagcata cttgctgagt 2280aagaacactg gtgattatta cgaggacagt tatgaagata tttcagcata cttgctgagt 2280
aaaaacaatg ccattgaacc aagaagcttc tctcaaaacc caccagtctt gaaacgccat 2340aaaaacaatg ccattgaacc aagaagcttc tctcaaaacc caccagtctt gaaacgccat 2340
caacgggaaa taactcgtac tactcttcag tcagatcaag aggaaattga ctatgatgat 2400caacgggaaa taactcgtac tactcttcag tcagatcaag aggaaattga ctatgatgat 2400
accatatcag ttgaaatgaa gaaggaagat tttgacattt atgatgagga tgaaaatcag 2460accatatcag ttgaaatgaa gaaggaagat tttgacattt atgatgagga tgaaaatcag 2460
agcccccgca gctttcaaaa gaaaacacga cactatttta ttgctgcagt ggagaggctc 2520agcccccgca gctttcaaaa gaaaacacga cactatttta ttgctgcagt ggagaggctc 2520
tgggattatg ggatgagtag ctccccacat gttctaagaa acagggctca gagtggcagt 2580tgggattatg ggatgagtag ctccccacat gttctaagaa acagggctca gagtggcagt 2580
gtccctcagt tcaagaaagt tgttttccag gaatttactg atggctcctt tactcagccc 2640gtccctcagt tcaagaaagt tgttttccag gaatttactg atggctcctt tactcagccc 2640
ttataccgtg gagaactaaa tgaacatttg ggactcctgg ggccatatat aagagcagaa 2700ttataccgtg gagaactaaa tgaacatttg ggactcctgg ggccatatat aagagcagaa 2700
gttgaagata atatcatggt aactttcaga aatcaggcct ctcgtcccta ttccttctat 2760gttgaagata atatcatggt aactttcaga aatcaggcct ctcgtcccta ttccttctat 2760
tctagcctta tttcttatga ggaagatcag aggcaaggag cagaacctag aaaaaacttt 2820tctagcctta tttcttatga ggaagatcag aggcaaggag cagaacctag aaaaaacttt 2820
gtcaagccta atgaaaccaa aacttacttt tggaaagtgc aacatcatat ggcacccact 2880gtcaagccta atgaaaccaa aacttacttt tggaaagtgc aacatcatat ggcacccact 2880
aaagatgagt ttgactgcaa agcctgggct tatttctctg atgttgacct ggaaaaagat 2940aaagatgagt ttgactgcaa agcctgggct tatttctctg atgttgacct ggaaaaagat 2940
gtgcactcag gcctgattgg accccttctg gtctgccaca ctaacacact gaaccctgct 3000gtgcactcag gcctgattgg accccttctg gtctgccaca ctaacacact gaaccctgct 3000
catgggagac aagtgacagt acaggaattt gctctgtttt tcaccatctt tgatgagacc 3060catggggagac aagtgacagt acaggaattt gctctgtttt tcaccatctt tgatgagacc 3060
aaaagctggt acttcactga aaatatggaa agaaactgca gggctccctg caatatccag 3120aaaagctggt acttcactga aaatatggaa agaaactgca gggctccctg caatatccag 3120
atggaagatc ccacttttaa agagaattat cgcttccatg caatcaatgg ctacataatg 3180atggaagatc ccacttttaa agagaattat cgcttccatg caatcaatgg ctacataatg 3180
gatacactac ctggcttagt aatggctcag gatcaaagga ttcgatggta tctgctcagc 3240gatacactac ctggcttagt aatggctcag gatcaaagga ttcgatggta tctgctcagc 3240
atgggcagca atgaaaacat ccattctatt catttcagtg gacatgtgtt cactgtacga 3300atgggcagca atgaaaacat ccattctatt catttcagtg gacatgtgtt cactgtacga 3300
aaaaaagagg agtataaaat ggcactgtac aatctctatc caggtgtttt tgagacagtg 3360aaaaaagagg agtataaaat ggcactgtac aatctctatc caggtgtttt tgagacagtg 3360
gaaatgttac catccaaagc tggaatttgg cgggtggaat gccttattgg cgagcatcta 3420gaaatgttac catccaaagc tggaatttgg cgggtggaat gccttattgg cgagcatcta 3420
catgctggga tgagcacact ttttctggtg tacagcaata agtgtcagac tcccctggga 3480catgctggga tgagcacact ttttctggtg tacagcaata agtgtcagac tcccctggga 3480
atggcttctg gacacattag agattttcag attacagctt caggacaata tggacagtgg 3540atggcttctg gacacattag agattttcag attacagctt caggacaata tggacagtgg 3540
gccccaaagc tggccagact tcattattcc ggatcaatca atgcctggag caccaaggag 3600gccccaaagc tggccagact tcattattcc ggatcaatca atgcctggag caccaaggag 3600
cccttttctt ggatcaaggt ggatctgttg gcaccaatga ttattcacgg catcaagacc 3660cccttttctt ggatcaaggt ggatctgttg gcaccaatga ttattcacgg catcaagacc 3660
cagggtgccc gtcagaagtt ctccagcctc tacatctctc agtttatcat catgtatagt 3720cagggtgccc gtcagaagtt ctccagcctc tacatctctc agtttatcat catgtatagt 3720
cttgatggga agaagtggca gacttatcga ggaaattcca ctggaacctt aatggtcttc 3780cttgatggga agaagtggca gacttatcga ggaaattcca ctggaacctt aatggtcttc 3780
tttggcaatg tggattcatc tgggataaaa cacaatattt ttaaccctcc aattattgct 3840tttggcaatg tggattcatc tgggataaaa cacaatattt ttaaccctcc aattattgct 3840
cgatacatcc gtttgcaccc aactcattat agcattcgca gcactcttcg catggagttg 3900cgatacatcc gtttgcaccc aactcattat agcattcgca gcactcttcg catggagttg 3900
atgggctgtg atttaaatag ttgcagcatg ccattgggaa tggagagtaa agcaatatca 3960atgggctgtg atttaaatag ttgcagcatg ccattgggaa tggagagtaa agcaatatca 3960
gatgcacaga ttactgcttc atcctacttt accaatatgt ttgccacctg gtctccttca 4020gatgcacaga ttactgcttc atcctacttt accaatatgt ttgccacctg gtctccttca 4020
aaagctcgac ttcacctcca agggaggagt aatgcctgga gacctcaggt gaataatcca 4080aaagctcgac ttcacctcca agggaggagt aatgcctgga gacctcaggt gaataatcca 4080
aaagagtggc tgcaagtgga cttccagaag acaatgaaag tcacaggagt aactactcag 4140aaagagtggc tgcaagtgga cttccagaag acaatgaaag tcacaggagt aactactcag 4140
ggagtaaaat ctctgcttac cagcatgtat gtgaaggagt tcctcatctc cagcagtcaa 4200ggagtaaaat ctctgcttac cagcatgtat gtgaaggagt tcctcatctc cagcagtcaa 4200
gatggccatc agtggactct cttttttcag aatggcaaag taaaggtttt tcagggaaat 4260gatggccatc agtggactct cttttttcag aatggcaaag taaaggtttt tcagggaaat 4260
caagactcct tcacacctgt ggtgaactct ctagacccac cgttactgac tcgctacctt 4320caagactcct tcacacctgt ggtgaactct ctagacccac cgttactgac tcgctacctt 4320
cgaattcacc cccagagttg ggtgcaccag attgccctga ggatggaggt tctgggctgc 4380cgaattcacc cccagagttg ggtgcaccag attgccctga ggatggaggt tctgggctgc 4380
gaggcacagg acctctactg a 4401gaggcacagg acctctactg a 4401
<210> 12<210> 12
<211> 4377<211> 4377
<212> DNA<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220> <220>
<223> Нуклеиновая кислота, которая кодирует слитый белок на основе<223> Nucleic acid that encodes a fusion protein based on
человеческого FVIII-BDD и сигнального пептида Kappa цепь human FVIII-BDD and Kappa chain signal peptide
иммуноглобулина G (СП-IgGK-FVIII-BDD) immunoglobulin G (SP-IgGK-FVIII-BDD)
<400> 12<400> 12
atggagaccg acaccctgct gctgtgggtg ctgctgctgt gggtgcccgg gtcgaccggt 60atggagaccg acaccctgct gctgtgggtg ctgctgctgt gggtgcccgg gtcgaccggt 60
gccaccagaa gatactacct gggtgcagtg gaactgtcat gggactatat gcaaagtgat 120gccaccagaa gatactacct gggtgcagtg gaactgtcat gggactatat gcaaagtgat 120
ctcggtgagc tgcctgtgga cgcaagattt cctcctagag tgccaaaatc ttttccattc 180ctcggtgagc tgcctgtgga cgcaagattt cctcctagag tgccaaaatc ttttccattc 180
aacacctcag tcgtgtacaa aaagactctg tttgtagaat tcacggatca ccttttcaac 240aacacctcag tcgtgtacaa aaagactctg tttgtagaat tcacggatca ccttttcaac 240
atcgctaagc caaggccacc ctggatgggt ctgctaggtc ctaccatcca ggctgaggtt 300atcgctaagc caaggccacc ctggatgggt ctgctaggtc ctaccatcca ggctgaggtt 300
tatgatacag tggtcattac acttaagaac atggcttccc atcctgtcag tcttcatgct 360tatgatacag tggtcattac acttaagaac atggcttccc atcctgtcag tcttcatgct 360
gttggtgtat cctactggaa agcttctgag ggagctgaat atgatgatca gaccagtcaa 420gttggtgtat cctactggaa agcttctgag ggagctgaat atgatgatca gaccagtcaa 420
agggagaaag aagatgataa agtcttccct ggtggaagcc atacatatgt ctggcaggtc 480agggaaaag aagatgataa agtcttccct ggtggaagcc atacatatgt ctggcaggtc 480
ctgaaagaga atggtccaat ggcctctgac ccactgtgcc ttacctactc atatctttct 540ctgaaagaga atggtccaat ggcctctgac ccactgtgcc ttacctactc atatctttct 540
catgtggacc tggtaaaaga cttgaattca ggcctcattg gagccctact agtatgtaga 600catgtggacc tggtaaaaga cttgaattca ggcctcattg gagccctact agtatgtaga 600
gaagggagtc tggccaagga aaagacacag accttgcaca aatttatact actttttgct 660gaagggagtc tggccaagga aaagacacag accttgcaca aatttatact actttttgct 660
gtatttgatg aagggaaaag ttggcactca gaaacaaaga actccttgat gcaggatagg 720gtatttgatg aagggaaaag ttggcactca gaaacaaaga actccttgat gcaggatagg 720
gatgctgcat ctgctcgggc ctggcctaaa atgcacacag tcaatggtta tgtaaacagg 780gatgctgcat ctgctcgggc ctggcctaaa atgcacacag tcaatggtta tgtaaacagg 780
tctctgccag gtctgattgg atgccacagg aaatcagtct attggcatgt gattggaatg 840tctctgccag gtctgattgg atgccacagg aaatcagtct attggcatgt gattggaatg 840
ggcaccactc ctgaagtgca ctcaatattc ctcgaaggtc acacatttct tgtgaggaac 900ggcaccactc ctgaagtgca ctcaatattc ctcgaaggtc acacatttct tgtgaggaac 900
catcgccagg cgtccttgga aatctcgcca ataactttcc ttactgctca aacactcttg 960catcgccagg cgtccttgga aatctcgcca ataactttcc ttactgctca aacactcttg 960
atggaccttg gacagtttct actgttttgt catatctctt cccaccaaca tgatggcatg 1020atggaccttg gacagtttct actgttttgt catatctctt cccaccaaca tgatggcatg 1020
gaagcttatg tcaaagtaga cagctgtcca gaggaacccc aactacgaat gaaaaataat 1080gaagcttatg tcaaagtaga cagctgtcca gaggaacccc aactacgaat gaaaaataat 1080
gaagaagcgg aagactatga tgatgatctt actgattctg aaatggatgt ggtcaggttt 1140gaagaagcgg aagactatga tgatgatctt actgattctg aaatggatgt ggtcaggttt 1140
gatgatgaca actctccttc ctttatccaa attcgctcag ttgccaagaa gcatcctaaa 1200gatgatgaca actctccttc ctttatccaa attcgctcag ttgccaagaa gcatcctaaa 1200
acttgggtac attacattgc tgctgaagag gaggactggg actatgctcc cttagtcctc 1260acttgggtac attacattgc tgctgaagag gaggactggg actatgctcc cttagtcctc 1260
gcccccgatg acagaagtta taaaagtcaa tatttgaaca atggccctca gcggattggt 1320gcccccgatg acagaagtta taaaagtcaa tatttgaaca atggccctca gcggattggt 1320
aggaagtaca aaaaagtccg atttatggca tacacagatg aaacctttaa gactcgtgaa 1380aggaagtaca aaaaagtccg atttatggca tacacagatg aaacctttaa gactcgtgaa 1380
gctattcagc atgaatcagg aatcttggga cctttacttt atggggaagt tggagacaca 1440gctattcagc atgaatcagg aatcttggga cctttacttt atggggaagt tggagacaca 1440
ctgttgatta tatttaagaa tcaagcaagc agaccatata acatctaccc tcacggaatc 1500ctgttgatta tatttaagaa tcaagcaagc agaccatata acatctaccc tcacggaatc 1500
actgatgtcc gtcctttgta ttcaaggaga ttaccaaaag gtgtaaaaca tttgaaggat 1560actgatgtcc gtcctttgta ttcaaggaga ttaccaaaag gtgtaaaaca tttgaaggat 1560
tttccaattc tgccaggaga aatattcaaa tataaatgga cagtgactgt agaagatggg 1620tttccaattc tgccaggaga aatattcaaa tataaatgga cagtgactgt agaagatggg 1620
ccaactaaat cagatcctcg gtgcctgacc cgctattact ctagtttcgt taatatggag 1680ccaactaaat cagatcctcg gtgcctgacc cgctattact ctagtttcgt taatatggag 1680
agagatctag cttcaggact cattggccct ctcctcatct gctacaaaga atctgtagat 1740agagatctag cttcaggact cattggccct ctcctcatct gctacaaaga atctgtagat 1740
caaagaggaa accagataat gtcagacaag aggaatgtca tcctgttttc tgtatttgat 1800caaagaggaa accagataat gtcagacaag aggaatgtca tcctgttttc tgtatttgat 1800
gagaaccgaa gctggtacct cacagagaat atacaacgct ttctccccaa tccagctgga 1860gagaaccgaa gctggtacct cacagagaat atacaacgct ttctccccaa tccagctgga 1860
gtgcagcttg aggatccaga gttccaagcc tccaacatca tgcacagcat caatggctat 1920gtgcagcttg aggatccaga gttccaagcc tccaacatca tgcacagcat caatggctat 1920
gtttttgata gtttgcagtt gtcagtttgt ttgcatgagg tggcatactg gtacattcta 1980gtttttgata gtttgcagtt gtcagtttgt ttgcatgagg tggcatactg gtacattcta 1980
agcattggag cacagactga cttcctttct gtcttcttct ctggatatac cttcaaacac 2040agcattggag cacagactga cttcctttct gtcttcttct ctggatatac cttcaaacac 2040
aaaatggtct atgaagacac actcacccta ttcccattct caggagaaac tgtcttcatg 2100aaaatggtct atgaagacac actcacccta ttcccattct caggagaaac tgtcttcatg 2100
tcgatggaaa acccaggtct atggattctg gggtgccaca actcagactt tcggaacaga 2160tcgatggaaa acccaggtct atggattctg gggtgccaca actcagactt tcggaacaga 2160
ggcatgaccg ccttactgaa ggtttctagt tgtgacaaga acactggtga ttattacgag 2220ggcatgaccg ccttactgaa ggtttctagt tgtgacaaga acactggtga ttattacgag 2220
gacagttatg aagatatttc agcatacttg ctgagtaaaa acaatgccat tgaaccaaga 2280gacagttatg aagatatttc agcatacttg ctgagtaaaa acaatgccat tgaaccaaga 2280
agcttctctc aaaacccacc agtcttgaaa cgccatcaac gggaaataac tcgtactact 2340agcttctctc aaaacccacc agtcttgaaa cgccatcaac gggaaataac tcgtactact 2340
cttcagtcag atcaagagga aattgactat gatgatacca tatcagttga aatgaagaag 2400cttcagtcag atcaagagga aattgactat gatgatacca tatcagttga aatgaagaag 2400
gaagattttg acatttatga tgaggatgaa aatcagagcc cccgcagctt tcaaaagaaa 2460gaagattttg acatttatga tgaggatgaa aatcagagcc cccgcagctt tcaaaagaaa 2460
acacgacact attttattgc tgcagtggag aggctctggg attatgggat gagtagctcc 2520acacgacact attttattgc tgcagtggag aggctctggg attatgggat gagtagctcc 2520
ccacatgttc taagaaacag ggctcagagt ggcagtgtcc ctcagttcaa gaaagttgtt 2580ccacatgttc taagaaacag ggctcagagt ggcagtgtcc ctcagttcaa gaaagttgtt 2580
ttccaggaat ttactgatgg ctcctttact cagcccttat accgtggaga actaaatgaa 2640ttccaggaat ttactgatgg ctcctttact cagcccttat accgtggaga actaaatgaa 2640
catttgggac tcctggggcc atatataaga gcagaagttg aagataatat catggtaact 2700catttgggac tcctggggcc atatataaga gcagaagttg aagataatat catggtaact 2700
ttcagaaatc aggcctctcg tccctattcc ttctattcta gccttatttc ttatgaggaa 2760ttcagaaatc aggcctctcg tccctattcc ttctattcta gccttatttc ttatgaggaa 2760
gatcagaggc aaggagcaga acctagaaaa aactttgtca agcctaatga aaccaaaact 2820gatcagaggc aaggagcaga acctagaaaa aactttgtca agcctaatga aaccaaaact 2820
tacttttgga aagtgcaaca tcatatggca cccactaaag atgagtttga ctgcaaagcc 2880tacttttgga aagtgcaaca tcatatggca cccactaaag atgagtttga ctgcaaagcc 2880
tgggcttatt tctctgatgt tgacctggaa aaagatgtgc actcaggcct gattggaccc 2940tgggcttatt tctctgatgt tgacctggaa aaagatgtgc actcaggcct gattggaccc 2940
cttctggtct gccacactaa cacactgaac cctgctcatg ggagacaagt gacagtacag 3000cttctggtct gccacactaa cacactgaac cctgctcatg ggagacaagt gacagtacag 3000
gaatttgctc tgtttttcac catctttgat gagaccaaaa gctggtactt cactgaaaat 3060gaatttgctc tgtttttcac catctttgat gagaccaaaa gctggtactt cactgaaaat 3060
atggaaagaa actgcagggc tccctgcaat atccagatgg aagatcccac ttttaaagag 3120atggaaagaa actgcagggc tccctgcaat atccagatgg aagatcccac ttttaaagag 3120
aattatcgct tccatgcaat caatggctac ataatggata cactacctgg cttagtaatg 3180aattatcgct tccatgcaat caatggctac ataatggata cactacctgg cttagtaatg 3180
gctcaggatc aaaggattcg atggtatctg ctcagcatgg gcagcaatga aaacatccat 3240gctcaggatc aaaggattcg atggtatctg ctcagcatgg gcagcaatga aaacatccat 3240
tctattcatt tcagtggaca tgtgttcact gtacgaaaaa aagaggagta taaaatggca 3300tctattcatt tcagtggaca tgtgttcact gtacgaaaaa aagaggagta taaaatggca 3300
ctgtacaatc tctatccagg tgtttttgag acagtggaaa tgttaccatc caaagctgga 3360ctgtacaatc tctatccagg tgtttttgag acagtggaaa tgttaccatc caaagctgga 3360
atttggcggg tggaatgcct tattggcgag catctacatg ctgggatgag cacacttttt 3420atttggcggg tggaatgcct tattggcgag catctacatg ctgggatgag cacacttttt 3420
ctggtgtaca gcaataagtg tcagactccc ctgggaatgg cttctggaca cattagagat 3480ctggtgtaca gcaataagtg tcagactccc ctgggaatgg cttctggaca cattagagat 3480
tttcagatta cagcttcagg acaatatgga cagtgggccc caaagctggc cagacttcat 3540tttcagatta cagcttcagg acaatatgga cagtgggccc caaagctggc cagacttcat 3540
tattccggat caatcaatgc ctggagcacc aaggagccct tttcttggat caaggtggat 3600tattccggat caatcaatgc ctggagcacc aaggagccct tttcttggat caaggtggat 3600
ctgttggcac caatgattat tcacggcatc aagacccagg gtgcccgtca gaagttctcc 3660ctgttggcac caatgattat tcacggcatc aagacccagg gtgcccgtca gaagttctcc 3660
agcctctaca tctctcagtt tatcatcatg tatagtcttg atgggaagaa gtggcagact 3720agcctctaca tctctcagtt tatcatcatg tatagtcttg atgggaagaa gtggcagact 3720
tatcgaggaa attccactgg aaccttaatg gtcttctttg gcaatgtgga ttcatctggg 3780tatcgaggaa attccactgg aaccttaatg gtcttctttg gcaatgtgga ttcatctggg 3780
ataaaacaca atatttttaa ccctccaatt attgctcgat acatccgttt gcacccaact 3840ataaaacaca atatttttaa ccctccaatt attgctcgat acatccgttt gcacccaact 3840
cattatagca ttcgcagcac tcttcgcatg gagttgatgg gctgtgattt aaatagttgc 3900cattatagca ttcgcagcac tcttcgcatg gagttgatgg gctgtgattt aaatagttgc 3900
agcatgccat tgggaatgga gagtaaagca atatcagatg cacagattac tgcttcatcc 3960agcatgccat tgggaatgga gagtaaagca atatcagatg cacagattac tgcttcatcc 3960
tactttacca atatgtttgc cacctggtct ccttcaaaag ctcgacttca cctccaaggg 4020tactttacca atatgtttgc cacctggtct ccttcaaaag ctcgacttca cctccaaggg 4020
aggagtaatg cctggagacc tcaggtgaat aatccaaaag agtggctgca agtggacttc 4080aggagtaatg cctggagacc tcaggtgaat aatccaaaag agtggctgca agtggacttc 4080
cagaagacaa tgaaagtcac aggagtaact actcagggag taaaatctct gcttaccagc 4140cagaagacaa tgaaagtcac aggagtaact actcagggag taaaatctct gcttaccagc 4140
atgtatgtga aggagttcct catctccagc agtcaagatg gccatcagtg gactctcttt 4200atgtatgtga aggagttcct catctccagc agtcaagatg gccatcagtg gactctcttt 4200
tttcagaatg gcaaagtaaa ggtttttcag ggaaatcaag actccttcac acctgtggtg 4260tttcagaatg gcaaagtaaa ggtttttcag ggaaatcaag actccttcac acctgtggtg 4260
aactctctag acccaccgtt actgactcgc taccttcgaa ttcaccccca gagttgggtg 4320aactctctag acccaccgtt actgactcgc taccttcgaa ttcaccccca gagttgggtg 4320
caccagattg ccctgaggat ggaggttctg ggctgcgagg cacaggacct ctactga 4377caccagattg ccctgaggat ggaggttctg ggctgcgagg cacaggacct ctactga 4377
<210> 13<210> 13
<211> 4374<211> 4374
<212> DNA<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220> <220>
<223> Нуклеиновая кислота, которая слитый белок на основе человеческого<223> Nucleic acid, which is a fusion protein based on human
FVIII-BDD и сигнального пептида Lactalbumin FVIII-BDD and Lactalbumin signal peptide
(СП-Lactalbumin-¬FVIII-BDD) (SP-Lactalbumin-¬FVIII-BDD)
<400> 13<400> 13
atgatgtcct ttgtctctct gctcctggtt ggcatcctgt tccatgccac ccaggccgcc 60atgatgtcct ttgtctctct gctcctggtt ggcatcctgt tccatgccac ccaggccgcc 60
accagaagat actacctggg tgcagtggaa ctgtcatggg actatatgca aagtgatctc 120accagaagat actacctggg tgcagtggaa ctgtcatggg actatatgca aagtgatctc 120
ggtgagctgc ctgtggacgc aagatttcct cctagagtgc caaaatcttt tccattcaac 180ggtgagctgc ctgtggacgc aagatttcct cctagagtgc caaaatcttt tccattcaac 180
acctcagtcg tgtacaaaaa gactctgttt gtagaattca cggatcacct tttcaacatc 240acctcagtcg tgtacaaaaa gactctgttt gtagaattca cggatcacct tttcaacatc 240
gctaagccaa ggccaccctg gatgggtctg ctaggtccta ccatccaggc tgaggtttat 300gctaagccaa ggccaccctg gatgggtctg ctaggtccta ccatccaggc tgaggtttat 300
gatacagtgg tcattacact taagaacatg gcttcccatc ctgtcagtct tcatgctgtt 360gatacagtgg tcattacact taagaacatg gcttcccatc ctgtcagtct tcatgctgtt 360
ggtgtatcct actggaaagc ttctgaggga gctgaatatg atgatcagac cagtcaaagg 420ggtgtatcct actggaaagc ttctgaggga gctgaatatg atgatcagac cagtcaaagg 420
gagaaagaag atgataaagt cttccctggt ggaagccata catatgtctg gcaggtcctg 480gagaaagaag atgataaagt cttccctggt ggaagccata catatgtctg gcaggtcctg 480
aaagagaatg gtccaatggc ctctgaccca ctgtgcctta cctactcata tctttctcat 540aaagagaatg gtccaatggc ctctgaccca ctgtgcctta cctactcata tctttctcat 540
gtggacctgg taaaagactt gaattcaggc ctcattggag ccctactagt atgtagagaa 600gtggacctgg taaaagactt gaattcaggc ctcattggag ccctactagt atgtagagaa 600
gggagtctgg ccaaggaaaa gacacagacc ttgcacaaat ttatactact ttttgctgta 660gggagtctgg ccaaggaaaa gacacagacc ttgcacaaat ttatactact ttttgctgta 660
tttgatgaag ggaaaagttg gcactcagaa acaaagaact ccttgatgca ggatagggat 720tttgatgaag ggaaaagttg gcactcagaa acaaagaact ccttgatgca ggatagggat 720
gctgcatctg ctcgggcctg gcctaaaatg cacacagtca atggttatgt aaacaggtct 780gctgcatctg ctcggggcctg gcctaaaatg cacacagtca atggttatgt aaacaggtct 780
ctgccaggtc tgattggatg ccacaggaaa tcagtctatt ggcatgtgat tggaatgggc 840ctgccaggtc tgattggatg ccacaggaaa tcagtctatt ggcatgtgat tggaatgggc 840
accactcctg aagtgcactc aatattcctc gaaggtcaca catttcttgt gaggaaccat 900accactcctg aagtgcactc aatattcctc gaaggtcaca catttcttgt gaggaaccat 900
cgccaggcgt ccttggaaat ctcgccaata actttcctta ctgctcaaac actcttgatg 960cgccaggcgt ccttggaaat ctcgccaata actttcctta ctgctcaaac actcttgatg 960
gaccttggac agtttctact gttttgtcat atctcttccc accaacatga tggcatggaa 1020gaccttggac agtttctact gttttgtcat atctcttccc accaacatga tggcatggaa 1020
gcttatgtca aagtagacag ctgtccagag gaaccccaac tacgaatgaa aaataatgaa 1080gcttatgtca aagtagacag ctgtccagag gaaccccaac tacgaatgaa aaataatgaa 1080
gaagcggaag actatgatga tgatcttact gattctgaaa tggatgtggt caggtttgat 1140gaagcggaag actatgatga tgatcttact gattctgaaa tggatgtggt caggtttgat 1140
gatgacaact ctccttcctt tatccaaatt cgctcagttg ccaagaagca tcctaaaact 1200gatgacaact ctccttcctt tatccaaatt cgctcagttg ccaagaagca tcctaaaact 1200
tgggtacatt acattgctgc tgaagaggag gactgggact atgctccctt agtcctcgcc 1260tgggtacatt acattgctgc tgaagaggag gactgggact atgctccctt agtcctcgcc 1260
cccgatgaca gaagttataa aagtcaatat ttgaacaatg gccctcagcg gattggtagg 1320cccgatgaca gaagttataa aagtcaatat ttgaacaatg gccctcagcg gattggtagg 1320
aagtacaaaa aagtccgatt tatggcatac acagatgaaa cctttaagac tcgtgaagct 1380aagtacaaaa aagtccgatt tatggcatac acagatgaaa cctttaagac tcgtgaagct 1380
attcagcatg aatcaggaat cttgggacct ttactttatg gggaagttgg agacacactg 1440attcagcatg aatcaggaat cttgggacct ttactttatg gggaagttgg agacacactg 1440
ttgattatat ttaagaatca agcaagcaga ccatataaca tctaccctca cggaatcact 1500ttgattatat ttaagaatca agcaagcaga ccatataaca tctaccctca cggaatcact 1500
gatgtccgtc ctttgtattc aaggagatta ccaaaaggtg taaaacattt gaaggatttt 1560gatgtccgtc ctttgtattc aaggatta ccaaaaggtg taaaacattt gaaggatttt 1560
ccaattctgc caggagaaat attcaaatat aaatggacag tgactgtaga agatgggcca 1620ccaattctgc caggagaaat attcaaatat aaatggacag tgactgtaga agatgggcca 1620
actaaatcag atcctcggtg cctgacccgc tattactcta gtttcgttaa tatggagaga 1680actaaatcag atcctcggtg cctgacccgc tattactcta gtttcgttaa tatggagaga 1680
gatctagctt caggactcat tggccctctc ctcatctgct acaaagaatc tgtagatcaa 1740gatctagctt caggactcat tggccctctc ctcatctgct acaaagaatc tgtagatcaa 1740
agaggaaacc agataatgtc agacaagagg aatgtcatcc tgttttctgt atttgatgag 1800agaggaaacc agataatgtc agacaagagg aatgtcatcc tgttttctgt atttgatgag 1800
aaccgaagct ggtacctcac agagaatata caacgctttc tccccaatcc agctggagtg 1860aaccgaagct ggtacctcac agagaatata caacgctttc tccccaatcc agctggagtg 1860
cagcttgagg atccagagtt ccaagcctcc aacatcatgc acagcatcaa tggctatgtt 1920cagcttgagg atccagagtt ccaagcctcc aacatcatgc acagcatcaa tggctatgtt 1920
tttgatagtt tgcagttgtc agtttgtttg catgaggtgg catactggta cattctaagc 1980tttgatagtt tgcagttgtc agtttgtttg catgaggtgg catactggta cattctaagc 1980
attggagcac agactgactt cctttctgtc ttcttctctg gatatacctt caaacacaaa 2040attggagcac agactgactt cctttctgtc ttcttctctg gatatacctt caaacacaaa 2040
atggtctatg aagacacact caccctattc ccattctcag gagaaactgt cttcatgtcg 2100atggtctatg aagacacact caccctattc ccattctcag gagaaactgt cttcatgtcg 2100
atggaaaacc caggtctatg gattctgggg tgccacaact cagactttcg gaacagaggc 2160atggaaaacc caggtctatg gattctgggg tgccacaact cagactttcg gaacagaggc 2160
atgaccgcct tactgaaggt ttctagttgt gacaagaaca ctggtgatta ttacgaggac 2220atgaccgcct tactgaaggt ttctagttgt gacaagaaca ctggtgatta ttacgaggac 2220
agttatgaag atatttcagc atacttgctg agtaaaaaca atgccattga accaagaagc 2280agttatgaag atatttcagc atacttgctg agtaaaaaca atgccattga accaagaagc 2280
ttctctcaaa acccaccagt cttgaaacgc catcaacggg aaataactcg tactactctt 2340ttctctcaaa acccaccagt cttgaaacgc catcaacggg aaataactcg tactactctt 2340
cagtcagatc aagaggaaat tgactatgat gataccatat cagttgaaat gaagaaggaa 2400cagtcagatc aagaggaaat tgactatgat gataccatat cagttgaaat gaagaaggaa 2400
gattttgaca tttatgatga ggatgaaaat cagagccccc gcagctttca aaagaaaaca 2460gattttgaca tttatgatga ggatgaaaat cagagccccc gcagctttca aaagaaaaca 2460
cgacactatt ttattgctgc agtggagagg ctctgggatt atgggatgag tagctcccca 2520cgacactatt ttattgctgc agtggagagg ctctgggatt atgggatgag tagctcccca 2520
catgttctaa gaaacagggc tcagagtggc agtgtccctc agttcaagaa agttgttttc 2580catgttctaa gaaacagggc tcagagtggc agtgtccctc agttcaagaa agttgttttc 2580
caggaattta ctgatggctc ctttactcag cccttatacc gtggagaact aaatgaacat 2640caggaattta ctgatggctc ctttactcag cccttatacc gtggagaact aaatgaacat 2640
ttgggactcc tggggccata tataagagca gaagttgaag ataatatcat ggtaactttc 2700ttgggactcc tggggccata tataagagca gaagttgaag ataatatcat ggtaactttc 2700
agaaatcagg cctctcgtcc ctattccttc tattctagcc ttatttctta tgaggaagat 2760agaaatcagg cctctcgtcc ctattccttc tattctagcc ttatttctta tgaggaagat 2760
cagaggcaag gagcagaacc tagaaaaaac tttgtcaagc ctaatgaaac caaaacttac 2820cagaggcaag gagcagaacc tagaaaaaac tttgtcaagc ctaatgaaac caaaacttac 2820
ttttggaaag tgcaacatca tatggcaccc actaaagatg agtttgactg caaagcctgg 2880ttttggaaag tgcaacatca tatggcaccc actaaagatg agtttgactg caaagcctgg 2880
gcttatttct ctgatgttga cctggaaaaa gatgtgcact caggcctgat tggacccctt 2940gcttatttct ctgatgttga cctggaaaaa gatgtgcact caggcctgat tggacccctt 2940
ctggtctgcc acactaacac actgaaccct gctcatggga gacaagtgac agtacaggaa 3000ctggtctgcc acactaacac actgaaccct gctcatggga gacaagtgac agtacaggaa 3000
tttgctctgt ttttcaccat ctttgatgag accaaaagct ggtacttcac tgaaaatatg 3060tttgctctgt ttttcaccat ctttgatgag accaaaagct ggtacttcac tgaaaatatg 3060
gaaagaaact gcagggctcc ctgcaatatc cagatggaag atcccacttt taaagagaat 3120gaaagaaact gcagggctcc ctgcaatatc cagatggaag atcccacttt taaagagaat 3120
tatcgcttcc atgcaatcaa tggctacata atggatacac tacctggctt agtaatggct 3180tatcgcttcc atgcaatcaa tggctacata atggatacac tacctggctt agtaatggct 3180
caggatcaaa ggattcgatg gtatctgctc agcatgggca gcaatgaaaa catccattct 3240caggatcaaa ggattcgatg gtatctgctc agcatgggca gcaatgaaaa catccattct 3240
attcatttca gtggacatgt gttcactgta cgaaaaaaag aggagtataa aatggcactg 3300attcatttca gtggacatgt gttcactgta cgaaaaaaag aggagtataa aatggcactg 3300
tacaatctct atccaggtgt ttttgagaca gtggaaatgt taccatccaa agctggaatt 3360tacaatctct atccaggtgt ttttgagaca gtggaaatgt taccatccaa agctggaatt 3360
tggcgggtgg aatgccttat tggcgagcat ctacatgctg ggatgagcac actttttctg 3420tggcgggtgg aatgccttat tggcgagcat ctacatgctg ggatgagcac actttttctg 3420
gtgtacagca ataagtgtca gactcccctg ggaatggctt ctggacacat tagagatttt 3480gtgtacagca ataagtgtca gactcccctg ggaatggctt ctggacacat tagagatttt 3480
cagattacag cttcaggaca atatggacag tgggccccaa agctggccag acttcattat 3540cagattacag cttcaggaca atatggacag tgggccccaa agctggccag acttcattat 3540
tccggatcaa tcaatgcctg gagcaccaag gagccctttt cttggatcaa ggtggatctg 3600tccggatcaa tcaatgcctg gagcaccaag gagccctttt cttggatcaa ggtggatctg 3600
ttggcaccaa tgattattca cggcatcaag acccagggtg cccgtcagaa gttctccagc 3660ttggcaccaa tgattattca cggcatcaag acccagggtg cccgtcagaa gttctccagc 3660
ctctacatct ctcagtttat catcatgtat agtcttgatg ggaagaagtg gcagacttat 3720ctctacatct ctcagtttat catcatgtat agtcttgatg ggaagaagtg gcagacttat 3720
cgaggaaatt ccactggaac cttaatggtc ttctttggca atgtggattc atctgggata 3780cgaggaaatt ccactggaac cttaatggtc ttctttggca atgtggattc atctgggata 3780
aaacacaata tttttaaccc tccaattatt gctcgataca tccgtttgca cccaactcat 3840aaacacaata tttttaaccc tccaattatt gctcgataca tccgtttgca cccaactcat 3840
tatagcattc gcagcactct tcgcatggag ttgatgggct gtgatttaaa tagttgcagc 3900tatagcattc gcagcactct tcgcatggag ttgatgggct gtgatttaaa tagttgcagc 3900
atgccattgg gaatggagag taaagcaata tcagatgcac agattactgc ttcatcctac 3960atgccattgg gaatggagag taaagcaata tcagatgcac agattactgc ttcatcctac 3960
tttaccaata tgtttgccac ctggtctcct tcaaaagctc gacttcacct ccaagggagg 4020tttaccaata tgtttgccac ctggtctcct tcaaaagctc gacttcacct ccaagggagg 4020
agtaatgcct ggagacctca ggtgaataat ccaaaagagt ggctgcaagt ggacttccag 4080agtaatgcct ggagaacctca ggtgaataat ccaaaagagt ggctgcaagt ggacttccag 4080
aagacaatga aagtcacagg agtaactact cagggagtaa aatctctgct taccagcatg 4140aagacaatga aagtcacagg agtaactact cagggagtaa aatctctgct taccagcatg 4140
tatgtgaagg agttcctcat ctccagcagt caagatggcc atcagtggac tctctttttt 4200tatgtgaagg agttcctcat ctccagcagt caagatggcc atcagtggac tctctttttt 4200
cagaatggca aagtaaaggt ttttcaggga aatcaagact ccttcacacc tgtggtgaac 4260cagaatggca aagtaaaggt ttttcaggga aatcaagact ccttcacacc tgtggtgaac 4260
tctctagacc caccgttact gactcgctac cttcgaattc acccccagag ttgggtgcac 4320tctctagacc caccgttact gactcgctac cttcgaattc acccccagag ttgggtgcac 4320
cagattgccc tgaggatgga ggttctgggc tgcgaggcac aggacctcta ctga 4374cagattgccc tgaggatgga ggttctgggc tgcgaggcac aggacctcta ctga 4374
<210> 14<210> 14
<211> 130<211> 130
<212> DNA<212> DNA
<213> Природная последовательность<213> Natural sequence
<220> <220>
<223> Левый (первый) ITR (инвертированные концевые повторы)<223> Left (first) ITR (inverted terminal repeats)
<400> 14<400> 14
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcgtcg ggcgaccttt 60cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcgtcg ggcgaccttt 60
ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact 120ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact 120
aggggttcct 130aggggttcct 130
<210> 15<210> 15
<211> 252<211> 252
<212> DNA<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220> <220>
<223> Промотор HLP<223> HLP promoter
<400> 15<400> 15
tgtttgctgc ttgcaatgtt tgcccatttt agggtggaca caggacgctg tggtttctga 60tgtttgctgc ttgcaatgtt tgcccatttt agggtggaca caggacgctg tggtttctga 60
gccagggggc gactcagatc ccagccagtg gacttagccc ctgtttgctc ctccgataac 120gccaggggggc gactcagatc ccagccagtg gacttagccc ctgtttgctc ctccgataac 120
tggggtgacc ttggttaata ttcaccagca gcctcccccg ttgcccctct ggatccactg 180tggggtgacc ttggttaata ttcaccagca gcctcccccg ttgcccctct ggatccactg 180
cttaaatacg gacgaggaca gggccctgtc tcctcagctt caggcaccac cactgacctg 240cttaaatacg gacgaggaca gggccctgtc tcctcagctt caggcaccac cactgacctg 240
ggacagtgaa tc 252ggacagtgaa tc 252
<210> 16<210> 16
<211> 60<211> 60
<212> DNA<212> DNA
<213> Искусственная последовательность<213> Artificial sequence
<220> <220>
<223> Сигнал полиаденилирования<223> Polyadenylation signal
<400> 16<400> 16
cgcgaataaa agatctttat tttcattaga tctgtgtgtt ggttttttgt gtgatgcagc 60cgcgaataaa agatctttat tttcattaga tctgtgtgtt ggttttttgt gtgatgcagc 60
<210> 17<210> 17
<211> 141<211> 141
<212> DNA<212> DNA
<213> Природная последовательность<213> Natural sequence
<220> <220>
<223> Правый ITR<223> Right ITR
<400> 17<400> 17
aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg ctcactgagg 60aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg ctcactgagg 60
ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc 120ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc 120
gagcgcgcag ctgcctgcag g 141gagcgcgcag ctgcctgcag g 141
<---<---
Claims (23)
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/RU2023/050093 WO2023211315A1 (en) | 2022-04-28 | 2023-04-18 | Isolated nucleic acid that encodes fusion protein based on fviii-bdd and on heterologous signal peptide |
TW112114536A TW202342751A (en) | 2022-04-28 | 2023-04-19 | Isolated nucleic acid that encodes fusion protein based on fviii-bdd and on heterologous signal peptide, and use thereof |
CN202310468986.4A CN117402901A (en) | 2022-04-28 | 2023-04-27 | Isolated nucleic acids encoding fusion proteins based on FVIII-BDD and heterologous signal peptides and uses thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
RU2022111695A RU2022111695A (en) | 2023-10-30 |
RU2818229C2 true RU2818229C2 (en) | 2024-04-26 |
Family
ID=
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2017136358A1 (en) * | 2016-02-01 | 2017-08-10 | Bioverativ Therapeutics Inc. | Optimized factor viii genes |
RU2647769C2 (en) * | 2015-12-30 | 2018-03-19 | Закрытое Акционерное Общество "Биокад" | Chemeric improved recombinant factor viii |
WO2021045541A1 (en) * | 2019-09-03 | 2021-03-11 | 재단법인 오송첨단의료산업진흥재단 | Method for ultra-rapidly selecting signal peptide to which individual barcode system for increasing protein productivity is introduced |
WO2021084276A2 (en) * | 2019-11-01 | 2021-05-06 | Freeline Therapeutics Limited | Factor viii construct |
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
RU2647769C2 (en) * | 2015-12-30 | 2018-03-19 | Закрытое Акционерное Общество "Биокад" | Chemeric improved recombinant factor viii |
WO2017136358A1 (en) * | 2016-02-01 | 2017-08-10 | Bioverativ Therapeutics Inc. | Optimized factor viii genes |
WO2021045541A1 (en) * | 2019-09-03 | 2021-03-11 | 재단법인 오송첨단의료산업진흥재단 | Method for ultra-rapidly selecting signal peptide to which individual barcode system for increasing protein productivity is introduced |
WO2021084276A2 (en) * | 2019-11-01 | 2021-05-06 | Freeline Therapeutics Limited | Factor viii construct |
Non-Patent Citations (1)
Title |
---|
JOSHUA I. SINER et al., Minimal modification in the factor VIII B-domain sequence ameliorates the murine hemophilia A phenotype, Blood, 2013, 121 (21), pp. 4396-4403. * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9512457B2 (en) | Serum-free stable transfection and production of recombinant human proteins in human cell lines | |
US5024939A (en) | Transient expression system for producing recombinant protein | |
Clark et al. | Structure of mouse major urinary protein genes: different splicing configurations in the 3′‐non‐coding region. | |
JP2023518395A (en) | Methods and compositions for directed genome editing | |
JPH11187875A (en) | Regulation of metabolic stability of protein | |
CN112575034B (en) | Product for treating hemophilia A and application thereof | |
AU2008205445A1 (en) | Method for producing gamma-carboxlated proteins | |
Kovnir et al. | A highly productive CHO cell line secreting human blood clotting factor IX | |
EP3013855B1 (en) | Mutant factor viii compositions and methods | |
CN113025659B (en) | Gene editing system for treating hemophilia A and application thereof | |
AU614497B2 (en) | Factor viii-c analogs | |
RU2818229C2 (en) | Recovered nucleic acid which codes fviii-bdd-based fusion protein and heterologous signal peptide, and use thereof | |
RU2808564C2 (en) | Codon-optimized nucleic acid that encodes b-domain-deleted factor viii protein and its use | |
CN114989307B (en) | Recombinant human blood coagulation factor VIII-Fc fusion protein and preparation method thereof | |
Shah-Mahoney et al. | Blocking the ends of transforming DNA enhances gene targeting in Dictyostelium | |
CN114032239A (en) | Tissue specific promoter and application thereof | |
BR102017004788B1 (en) | METHOD OF PRODUCTION OF FIBRINE GLUE OF RECOMBINANT ORIGIN | |
WO2014041500A2 (en) | Double mutant coagulation factor viii and methods thereof | |
WO2023211315A1 (en) | Isolated nucleic acid that encodes fusion protein based on fviii-bdd and on heterologous signal peptide | |
WO2023211316A1 (en) | Codon-optimized nucleic acid encoding the fviii-bdd | |
WO1996004397A1 (en) | Gene therapy using homologous recombination | |
US20210395714A1 (en) | Modified factor ix polypeptides | |
KR101119079B1 (en) | Method for producing human coagulation factor VIII | |
RU2500818C1 (en) | RECOMBINANT PLASMID DNA pAP227 CODING POLYPEPTIDE OF RECOMBINANT FACTOR VIII OF HUMAN BLOOD COAGULABILITY, LINE OF CELLS Cricetulus griseus CHO, 2H5-PRODUCER OF RECOMBINANT FACTOR VIII OF HUMAN BLOOD COAGULABILITY, AND METHOD FOR OBTAINING POLYPEPTIDE HAVING ACTIVITY OF RECOMBINANT FACTOR VIII | |
Shepelev et al. | Human Antithrombin III Minigene with an Optimized Splicing Pattern |