SI25289A - Combination of split orthogonal proteases with dimerization domains that enable the assembly
- Publication number
- SI25289A SI201600252A
- Authority
- SI
- Slovenia
- Prior art keywords
- leu
- gly
- ser
- lys
- glu
- Prior art date
Links
- 108091005804 Peptidases Proteins 0.000 title claims abstract description 207
- 239000004365 Protease Substances 0.000 title claims abstract description 206
- 102000035195 Peptidases Human genes 0.000 title claims abstract description 120
- 238000006471 dimerization reaction Methods 0.000 title claims abstract description 76
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 101
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 93
- 239000000126 substance Substances 0.000 claims abstract description 17
- 150000001413 amino acids Chemical class 0.000 claims abstract description 10
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 claims description 86
- 210000004027 cell Anatomy 0.000 claims description 81
- 230000000694 effects Effects 0.000 claims description 45
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 42
- 238000003776 cleavage reaction Methods 0.000 claims description 41
- 230000007017 scission Effects 0.000 claims description 41
- 230000008859 change Effects 0.000 claims description 15
- 241000723811 Soybean mosaic virus Species 0.000 claims description 14
- 102000004190 Enzymes Human genes 0.000 claims description 13
- 108090000790 Enzymes Proteins 0.000 claims description 13
- 230000004913 activation Effects 0.000 claims description 12
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 12
- 108010027179 Tacrolimus Binding Proteins Proteins 0.000 claims description 8
- 102000018679 Tacrolimus Binding Proteins Human genes 0.000 claims description 8
- 238000013518 transcription Methods 0.000 claims description 7
- 230000035897 transcription Effects 0.000 claims description 7
- 102100026280 Cryptochrome-2 Human genes 0.000 claims description 6
- 101000855613 Homo sapiens Cryptochrome-2 Proteins 0.000 claims description 6
- 238000002604 ultrasonography Methods 0.000 claims description 6
- 241000196324 Embryophyta Species 0.000 claims description 5
- 241000282414 Homo sapiens Species 0.000 claims description 5
- 230000001413 cellular effect Effects 0.000 claims description 5
- 210000005260 human cell Anatomy 0.000 claims description 5
- 239000002207 metabolite Substances 0.000 claims description 4
- 241001465754 Metazoa Species 0.000 claims description 3
- 239000005556 hormone Substances 0.000 claims description 3
- 229940088597 hormone Drugs 0.000 claims description 3
- 230000003834 intracellular effect Effects 0.000 claims description 3
- 210000002237 B-cell of pancreatic islet Anatomy 0.000 claims description 2
- 210000001744 T-lymphocyte Anatomy 0.000 claims description 2
- 239000000539 dimer Substances 0.000 claims description 2
- 230000028993 immune response Effects 0.000 claims description 2
- 210000000653 nervous system Anatomy 0.000 claims description 2
- 210000002569 neuron Anatomy 0.000 claims description 2
- 230000003204 osmotic effect Effects 0.000 claims description 2
- 230000028327 secretion Effects 0.000 claims description 2
- 230000001960 triggered effect Effects 0.000 claims description 2
- 241000894006 Bacteria Species 0.000 claims 1
- 241000233866 Fungi Species 0.000 claims 1
- 230000003213 activating effect Effects 0.000 claims 1
- 210000004102 animal cell Anatomy 0.000 claims 1
- 239000012634 fragment Substances 0.000 abstract description 32
- 230000001939 inductive effect Effects 0.000 abstract description 12
- 230000015572 biosynthetic process Effects 0.000 abstract description 11
- 230000001404 mediated effect Effects 0.000 abstract description 6
- 230000001225 therapeutic effect Effects 0.000 abstract description 5
- 235000019419 proteases Nutrition 0.000 description 81
- 241000282326 Felis catus Species 0.000 description 78
- 108010050848 glycylleucine Proteins 0.000 description 58
- 108020004414 DNA Proteins 0.000 description 55
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 53
- 108010047857 aspartylglycine Proteins 0.000 description 48
- 108060001084 Luciferase Proteins 0.000 description 45
- 239000005089 Luciferase Substances 0.000 description 44
- 108010057821 leucylproline Proteins 0.000 description 38
- 241000880493 Leptailurus serval Species 0.000 description 29
- 241000723792 Tobacco etch virus Species 0.000 description 28
- 108010037850 glycylvaline Proteins 0.000 description 28
- 108010049041 glutamylalanine Proteins 0.000 description 27
- 108010061238 threonyl-glycine Proteins 0.000 description 27
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 26
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 25
- 108010026333 seryl-proline Proteins 0.000 description 25
- 108010034529 leucyl-lysine Proteins 0.000 description 23
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 22
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 22
- 108010038633 aspartylglutamate Proteins 0.000 description 22
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 21
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 21
- 108010073969 valyllysine Proteins 0.000 description 21
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 20
- 241000136550 Sunflower mild mosaic virus Species 0.000 description 20
- 108010081551 glycylphenylalanine Proteins 0.000 description 20
- 108010009298 lysylglutamic acid Proteins 0.000 description 20
- 102000004196 processed proteins & peptides Human genes 0.000 description 20
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 19
- JUJCUYWRJMFJJF-AVGNSLFASA-N Pro-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 JUJCUYWRJMFJJF-AVGNSLFASA-N 0.000 description 19
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 19
- 108010040030 histidinoalanine Proteins 0.000 description 19
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 18
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 18
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 18
- 108010056582 methionylglutamic acid Proteins 0.000 description 18
- 108010082795 phenylalanyl-arginyl-arginine Proteins 0.000 description 18
- 108010051242 phenylalanylserine Proteins 0.000 description 18
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 17
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 17
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 17
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 17
- 108010087924 alanylproline Proteins 0.000 description 17
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 16
- 108010092854 aspartyllysine Proteins 0.000 description 16
- 108010068265 aspartyltyrosine Proteins 0.000 description 16
- 108010054155 lysyllysine Proteins 0.000 description 16
- 102000039446 nucleic acids Human genes 0.000 description 16
- 108020004707 nucleic acids Proteins 0.000 description 16
- 150000007523 nucleic acids Chemical class 0.000 description 16
- 108010031719 prolyl-serine Proteins 0.000 description 16
- 108010048818 seryl-histidine Proteins 0.000 description 16
- 108010071207 serylmethionine Proteins 0.000 description 16
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 15
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 15
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 15
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 15
- 108010079364 N-glycylalanine Proteins 0.000 description 15
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 15
- 230000006870 function Effects 0.000 description 15
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 15
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 15
- 108010003700 lysyl aspartic acid Proteins 0.000 description 15
- 108010068488 methionylphenylalanine Proteins 0.000 description 15
- 108010077112 prolyl-proline Proteins 0.000 description 15
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 15
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 14
- QBQVKUNBCAFXSV-ULQDDVLXSA-N Arg-Lys-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QBQVKUNBCAFXSV-ULQDDVLXSA-N 0.000 description 14
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 14
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 14
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 14
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 14
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 14
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 14
- 108010076818 TEV protease Proteins 0.000 description 14
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 14
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 14
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 14
- 125000004122 cyclic group Chemical group 0.000 description 14
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 14
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 14
- 108010025306 histidylleucine Proteins 0.000 description 14
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 14
- 239000013612 plasmid Substances 0.000 description 14
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 14
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 13
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 13
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 13
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 13
- AABIBDJHSKIMJK-FXQIFTODSA-N Ser-Ser-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O AABIBDJHSKIMJK-FXQIFTODSA-N 0.000 description 13
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 13
- 108010017391 lysylvaline Proteins 0.000 description 13
- 108010012581 phenylalanylglutamate Proteins 0.000 description 13
- ZAHRKKWIAAJSAO-UHFFFAOYSA-N rapamycin Natural products COCC(O)C(=C/C(C)C(=O)CC(OC(=O)C1CCCCN1C(=O)C(=O)C2(O)OC(CC(OC)C(=CC=CC=CC(C)CC(C)C(=O)C)C)CCC2C)C(C)CC3CCC(O)C(C3)OC)C ZAHRKKWIAAJSAO-UHFFFAOYSA-N 0.000 description 13
- 229960002930 sirolimus Drugs 0.000 description 13
- QFJCIRLUMZQUOT-HPLJOQBZSA-N sirolimus Chemical compound C1C[C@@H](O)[C@H](OC)C[C@@H]1C[C@@H](C)[C@H]1OC(=O)[C@@H]2CCCCN2C(=O)C(=O)[C@](O)(O2)[C@H](C)CC[C@H]2C[C@H](OC)/C(C)=C/C=C/C=C/[C@@H](C)C[C@@H](C)C(=O)[C@H](OC)[C@H](O)/C(C)=C/[C@@H](C)C(=O)C1 QFJCIRLUMZQUOT-HPLJOQBZSA-N 0.000 description 13
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 12
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 12
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 12
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 12
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 12
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 12
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 12
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 12
- GGXUJBKENKVYNV-ULQDDVLXSA-N His-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N GGXUJBKENKVYNV-ULQDDVLXSA-N 0.000 description 12
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 12
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 12
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 12
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 12
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 12
- CTBMEDOQJFGNMI-IHPCNDPISA-N Lys-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CCCCN)N CTBMEDOQJFGNMI-IHPCNDPISA-N 0.000 description 12
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 12
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 12
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 12
- DDDLIMCZFKOERC-SVSWQMSJSA-N Thr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N DDDLIMCZFKOERC-SVSWQMSJSA-N 0.000 description 12
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 12
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 12
- 108010047495 alanylglycine Proteins 0.000 description 12
- 230000027455 binding Effects 0.000 description 12
- 108010020688 glycylhistidine Proteins 0.000 description 12
- 108010015792 glycyllysine Proteins 0.000 description 12
- 108010036413 histidylglycine Proteins 0.000 description 12
- 108010092114 histidylphenylalanine Proteins 0.000 description 12
- 238000000034 method Methods 0.000 description 12
- 108010070643 prolylglutamic acid Proteins 0.000 description 12
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 11
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 11
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 11
- FIQKRDXFTANIEJ-ULQDDVLXSA-N Arg-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FIQKRDXFTANIEJ-ULQDDVLXSA-N 0.000 description 11
- IGFJVXOATGZTHD-UHFFFAOYSA-N Arg-Phe-His Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccccc1)C(=O)NC(Cc2c[nH]cn2)C(=O)O IGFJVXOATGZTHD-UHFFFAOYSA-N 0.000 description 11
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 11
- QCLHLXDWRKOHRR-GUBZILKMSA-N Asp-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N QCLHLXDWRKOHRR-GUBZILKMSA-N 0.000 description 11
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 11
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 11
- MHYHLWUGWUBUHF-GUBZILKMSA-N Cys-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N MHYHLWUGWUBUHF-GUBZILKMSA-N 0.000 description 11
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 11
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 11
- SFOXOSKVTLDEDM-HOTGVXAUSA-N Gly-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)CN)=CNC2=C1 SFOXOSKVTLDEDM-HOTGVXAUSA-N 0.000 description 11
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 11
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 11
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 11
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 11
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 11
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 11
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 11
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 11
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 11
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 11
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 11
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 11
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 11
- FENSZYFJQOFSQR-FIRPJDEBSA-N Phe-Phe-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FENSZYFJQOFSQR-FIRPJDEBSA-N 0.000 description 11
- DEZCWWXTRAKZKJ-UFYCRDLUSA-N Phe-Phe-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DEZCWWXTRAKZKJ-UFYCRDLUSA-N 0.000 description 11
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 11
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 11
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 11
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 11
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 11
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 11
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 11
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 11
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 11
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 11
- 229940088598 enzyme Drugs 0.000 description 11
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 11
- 108010027338 isoleucylcysteine Proteins 0.000 description 11
- 108010064235 lysylglycine Proteins 0.000 description 11
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 10
- FSXDWQGEWZQBPJ-HERUPUMHSA-N Ala-Trp-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FSXDWQGEWZQBPJ-HERUPUMHSA-N 0.000 description 10
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 10
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 10
- VRZDJJWOFXMFRO-ZFWWWQNUSA-N Arg-Gly-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O VRZDJJWOFXMFRO-ZFWWWQNUSA-N 0.000 description 10
- QHAJMRDEWNAIBQ-FXQIFTODSA-N Asp-Arg-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O QHAJMRDEWNAIBQ-FXQIFTODSA-N 0.000 description 10
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 10
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 10
- LBOLGUYQEPZSKM-YUMQZZPRSA-N Cys-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N LBOLGUYQEPZSKM-YUMQZZPRSA-N 0.000 description 10
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 10
- JZDHUJAFXGNDSB-WHFBIAKZSA-N Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O JZDHUJAFXGNDSB-WHFBIAKZSA-N 0.000 description 10
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 10
- 108010050006 Gly-Asp-Gly-Arg Proteins 0.000 description 10
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 10
- NQKRILCJYCASDV-QWRGUYRKSA-N His-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 NQKRILCJYCASDV-QWRGUYRKSA-N 0.000 description 10
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 10
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 10
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 10
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 10
- ORWTWZXGDBYVCP-BJDJZHNGSA-N Leu-Ile-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(C)C ORWTWZXGDBYVCP-BJDJZHNGSA-N 0.000 description 10
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 10
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 10
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 10
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 10
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 10
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 10
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 10
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 10
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 10
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 10
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 10
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 10
- ADHNYKZHPOEULM-BQBZGAKWSA-N Met-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O ADHNYKZHPOEULM-BQBZGAKWSA-N 0.000 description 10
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 10
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 10
- MPGJIHFJCXTVEX-KKUMJFAQSA-N Phe-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O MPGJIHFJCXTVEX-KKUMJFAQSA-N 0.000 description 10
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 10
- ACJULKNZOCRWEI-ULQDDVLXSA-N Phe-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O ACJULKNZOCRWEI-ULQDDVLXSA-N 0.000 description 10
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 10
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 10
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 10
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 10
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 10
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 10
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 10
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 10
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 10
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 10
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 10
- UBDDORVPVLEECX-FJXKBIBVSA-N Thr-Gly-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UBDDORVPVLEECX-FJXKBIBVSA-N 0.000 description 10
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 10
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 10
- 108091023040 Transcription factor Proteins 0.000 description 10
- 102000040945 Transcription factor Human genes 0.000 description 10
- KSVMDJJCYKIXTK-IGNZVWTISA-N Tyr-Ala-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 KSVMDJJCYKIXTK-IGNZVWTISA-N 0.000 description 10
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 10
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 10
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 10
- JPPXDMBGXJBTIB-ULQDDVLXSA-N Val-His-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N JPPXDMBGXJBTIB-ULQDDVLXSA-N 0.000 description 10
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 10
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 10
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 10
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 10
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 10
- 108010085325 histidylproline Proteins 0.000 description 10
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 10
- 229920001184 polypeptide Polymers 0.000 description 10
- 230000019491 signal transduction Effects 0.000 description 10
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 9
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 9
- HCIUUZGFTDTEGM-NAKRPEOUSA-N Arg-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HCIUUZGFTDTEGM-NAKRPEOUSA-N 0.000 description 9
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 9
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 9
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 9
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 9
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 9
- JBCLFWXMTIKCCB-VIFPVBQESA-N Gly-Phe Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-VIFPVBQESA-N 0.000 description 9
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 9
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 9
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 9
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 9
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 9
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 9
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 9
- SVXXJYJCRNKDDE-AVGNSLFASA-N Pro-Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CN=CN1 SVXXJYJCRNKDDE-AVGNSLFASA-N 0.000 description 9
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 9
- 102100038242 Replication initiator 1 Human genes 0.000 description 9
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 9
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 9
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 9
- 238000002474 experimental method Methods 0.000 description 9
- 108010078274 isoleucylvaline Proteins 0.000 description 9
- 239000003446 ligand Substances 0.000 description 9
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 9
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 8
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 8
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 8
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 8
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 8
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 8
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 8
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 8
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 8
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 8
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 8
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 8
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 8
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 8
- FJIRXKVEDFLLOQ-SRVKXCTJSA-N Asn-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N FJIRXKVEDFLLOQ-SRVKXCTJSA-N 0.000 description 8
- ZKDGORKGHPCZOV-DCAQKATOSA-N Asn-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZKDGORKGHPCZOV-DCAQKATOSA-N 0.000 description 8
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 description 8
- DOURAOODTFJRIC-CIUDSAMLSA-N Asn-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N DOURAOODTFJRIC-CIUDSAMLSA-N 0.000 description 8
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 8
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 8
- HBUJSDCLZCXXCW-YDHLFZDLSA-N Asn-Val-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HBUJSDCLZCXXCW-YDHLFZDLSA-N 0.000 description 8
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 8
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 8
- JOCQXVJCTCEFAZ-CIUDSAMLSA-N Asp-His-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O JOCQXVJCTCEFAZ-CIUDSAMLSA-N 0.000 description 8
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 8
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 8
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 8
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 8
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 8
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 8
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 8
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 8
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 8
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 8
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 8
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 8
- MVORZMQFXBLMHM-QWRGUYRKSA-N Gly-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 MVORZMQFXBLMHM-QWRGUYRKSA-N 0.000 description 8
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 8
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 8
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 8
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 8
- HQKADFMLECZIQJ-HVTMNAMFSA-N His-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N HQKADFMLECZIQJ-HVTMNAMFSA-N 0.000 description 8
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 8
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 8
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 8
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 8
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 8
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 8
- 108010065920 Insulin Lispro Proteins 0.000 description 8
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 8
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 8
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 8
- WCTCIIAGNMFYAO-DCAQKATOSA-N Leu-Cys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O WCTCIIAGNMFYAO-DCAQKATOSA-N 0.000 description 8
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 8
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 8
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 8
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 8
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 8
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 8
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 8
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 8
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 8
- JZMGVXLDOQOKAH-UWVGGRQHSA-N Lys-Gly-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O JZMGVXLDOQOKAH-UWVGGRQHSA-N 0.000 description 8
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 8
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 8
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 8
- KQBJYJXPZBNEIK-DCAQKATOSA-N Met-Glu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQBJYJXPZBNEIK-DCAQKATOSA-N 0.000 description 8
- RRIHXWPHQSXHAQ-XUXIUFHCSA-N Met-Ile-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O RRIHXWPHQSXHAQ-XUXIUFHCSA-N 0.000 description 8
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 8
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 8
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 8
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 8
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 8
- ZWJKVFAYPLPCQB-UNQGMJICSA-N Phe-Arg-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O ZWJKVFAYPLPCQB-UNQGMJICSA-N 0.000 description 8
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 8
- GXDPQJUBLBZKDY-IAVJCBSLSA-N Phe-Ile-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GXDPQJUBLBZKDY-IAVJCBSLSA-N 0.000 description 8
- ILGCZYGFYQLSDZ-KKUMJFAQSA-N Phe-Ser-His Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ILGCZYGFYQLSDZ-KKUMJFAQSA-N 0.000 description 8
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 8
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 8
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 8
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 8
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 8
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 8
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 8
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 8
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 8
- IOVBCLGAJJXOHK-SRVKXCTJSA-N Ser-His-His Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IOVBCLGAJJXOHK-SRVKXCTJSA-N 0.000 description 8
- VXYQOFXBIXKPCX-BQBZGAKWSA-N Ser-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N VXYQOFXBIXKPCX-BQBZGAKWSA-N 0.000 description 8
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 8
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 8
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 8
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 8
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 8
- HQVKQINPFOCIIV-BVSLBCMMSA-N Trp-Arg-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 HQVKQINPFOCIIV-BVSLBCMMSA-N 0.000 description 8
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 8
- IELISNUVHBKYBX-XDTLVQLUSA-N Tyr-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IELISNUVHBKYBX-XDTLVQLUSA-N 0.000 description 8
- DYEGCOJHFNJBKB-UFYCRDLUSA-N Tyr-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 DYEGCOJHFNJBKB-UFYCRDLUSA-N 0.000 description 8
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 8
- LRHBBGDMBLFYGL-FHWLQOOXSA-N Tyr-Phe-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LRHBBGDMBLFYGL-FHWLQOOXSA-N 0.000 description 8
- NAHUCETZGZZSEX-IHPCNDPISA-N Tyr-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N NAHUCETZGZZSEX-IHPCNDPISA-N 0.000 description 8
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 8
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 8
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 8
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 8
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 8
- 108010070944 alanylhistidine Proteins 0.000 description 8
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 8
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 8
- 230000004807 localization Effects 0.000 description 8
- 108010025488 pinealon Proteins 0.000 description 8
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 7
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 7
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 7
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 7
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 7
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 7
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 7
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 7
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 7
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 7
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 7
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 7
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 7
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 7
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 7
- IALVDKNUFSTICJ-GMOBBJLQSA-N Ile-Met-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IALVDKNUFSTICJ-GMOBBJLQSA-N 0.000 description 7
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 7
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 7
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 7
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 7
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 7
- AWGBEIYZPAXXSX-RWMBFGLXSA-N Met-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N AWGBEIYZPAXXSX-RWMBFGLXSA-N 0.000 description 7
- LQTGGXSOMDSWTQ-UNQGMJICSA-N Met-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCSC)N)O LQTGGXSOMDSWTQ-UNQGMJICSA-N 0.000 description 7
- WXXNVZMWHOLNRJ-AVGNSLFASA-N Met-Pro-Lys Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O WXXNVZMWHOLNRJ-AVGNSLFASA-N 0.000 description 7
- AOLKTFKKSSMRTA-WDSOQIARSA-N Met-Trp-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N AOLKTFKKSSMRTA-WDSOQIARSA-N 0.000 description 7
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 7
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 7
- NFDYGNFETJVMSE-BQBZGAKWSA-N Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CO NFDYGNFETJVMSE-BQBZGAKWSA-N 0.000 description 7
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 7
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 7
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 7
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 7
- DIHPMRTXPYMDJZ-KAOXEZKKSA-N Thr-Tyr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N)O DIHPMRTXPYMDJZ-KAOXEZKKSA-N 0.000 description 7
- PXQPYPMSLBQHJJ-WFBYXXMGSA-N Trp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N PXQPYPMSLBQHJJ-WFBYXXMGSA-N 0.000 description 7
- UXODSMTVPWXHBT-ULQDDVLXSA-N Val-Phe-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N UXODSMTVPWXHBT-ULQDDVLXSA-N 0.000 description 7
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 7
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 7
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 7
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 6
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 6
- LFFOJBOTZUWINF-ZANVPECISA-N Ala-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O)=CNC2=C1 LFFOJBOTZUWINF-ZANVPECISA-N 0.000 description 6
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 6
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 6
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 6
- FBLMOFHNVQBKRR-IHRRRGAJSA-N Arg-Asp-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FBLMOFHNVQBKRR-IHRRRGAJSA-N 0.000 description 6
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 6
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 6
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 6
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 6
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 6
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 6
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 6
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 6
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 6
- KFYPRIGJTICABD-XGEHTFHBSA-N Cys-Thr-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N)O KFYPRIGJTICABD-XGEHTFHBSA-N 0.000 description 6
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 6
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 6
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 6
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 6
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 6
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 6
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 6
- GWKBAXRZPLSWJS-QEJZJMRPSA-N Glu-Trp-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GWKBAXRZPLSWJS-QEJZJMRPSA-N 0.000 description 6
- QOOFKCCZZWTCEP-AVGNSLFASA-N Glu-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QOOFKCCZZWTCEP-AVGNSLFASA-N 0.000 description 6
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 6
- KCCNSVHJSMMGFS-NRPADANISA-N Glu-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KCCNSVHJSMMGFS-NRPADANISA-N 0.000 description 6
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 6
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 6
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 6
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 6
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 6
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 6
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 6
- FXTUGWXZTFMTIV-GJZGRUSLSA-N Gly-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN FXTUGWXZTFMTIV-GJZGRUSLSA-N 0.000 description 6
- OHOXVDFVRDGFND-YUMQZZPRSA-N His-Cys-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CS)C(=O)NCC(O)=O OHOXVDFVRDGFND-YUMQZZPRSA-N 0.000 description 6
- HAPWZEVRQYGLSG-IUCAKERBSA-N His-Gly-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O HAPWZEVRQYGLSG-IUCAKERBSA-N 0.000 description 6
- YXXKBPJEIYFGOD-MGHWNKPDSA-N His-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N YXXKBPJEIYFGOD-MGHWNKPDSA-N 0.000 description 6
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 6
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 6
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 6
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 6
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 6
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 6
- UWLHDGMRWXHFFY-HPCHECBXSA-N Ile-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1CCC[C@@H]1C(=O)O)N UWLHDGMRWXHFFY-HPCHECBXSA-N 0.000 description 6
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 6
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 6
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 6
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 6
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 6
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 6
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 6
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 6
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 6
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 6
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 6
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 6
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 6
- CENKQZWVYMLRAX-ULQDDVLXSA-N Lys-Phe-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CENKQZWVYMLRAX-ULQDDVLXSA-N 0.000 description 6
- MDXAULHWGWETHF-SRVKXCTJSA-N Met-Arg-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCNC(N)=N MDXAULHWGWETHF-SRVKXCTJSA-N 0.000 description 6
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 6
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 6
- WRXOPYNEKGZWAZ-FXQIFTODSA-N Met-Ser-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O WRXOPYNEKGZWAZ-FXQIFTODSA-N 0.000 description 6
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 6
- 101800004193 Peptide P3 Proteins 0.000 description 6
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 6
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 6
- 241000723784 Plum pox virus Species 0.000 description 6
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 6
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 6
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 6
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 6
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 6
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 6
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 6
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 6
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 6
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 6
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 6
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 6
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 6
- NXQAOORHSYJRGH-AAEUAGOBSA-N Trp-Gly-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 NXQAOORHSYJRGH-AAEUAGOBSA-N 0.000 description 6
- OTWIOROMZLNAQC-XIRDDKMYSA-N Trp-His-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OTWIOROMZLNAQC-XIRDDKMYSA-N 0.000 description 6
- BABINGWMZBWXIX-BPUTZDHNSA-N Trp-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BABINGWMZBWXIX-BPUTZDHNSA-N 0.000 description 6
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 6
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 6
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 6
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 6
- WBUOKGBHGDPYMH-GUBZILKMSA-N Val-Cys-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)C(C)C WBUOKGBHGDPYMH-GUBZILKMSA-N 0.000 description 6
- BZOSBRIDWSSTFN-AVGNSLFASA-N Val-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N BZOSBRIDWSSTFN-AVGNSLFASA-N 0.000 description 6
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 6
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 6
- 108010062796 arginyllysine Proteins 0.000 description 6
- 108010077245 asparaginyl-proline Proteins 0.000 description 6
- 108010054813 diprotin B Proteins 0.000 description 6
- 230000004927 fusion Effects 0.000 description 6
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 6
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 6
- 108010077515 glycylproline Proteins 0.000 description 6
- 230000006698 induction Effects 0.000 description 6
- 108010012058 leucyltyrosine Proteins 0.000 description 6
- 108010056787 lysyl-arginyl-glutamyl-glutamic acid Proteins 0.000 description 6
- 239000002773 nucleotide Substances 0.000 description 6
- 125000003729 nucleotide group Chemical group 0.000 description 6
- 108010084572 phenylalanyl-valine Proteins 0.000 description 6
- 108010024607 phenylalanylalanine Proteins 0.000 description 6
- 108010018625 phenylalanylarginine Proteins 0.000 description 6
- 108010015796 prolylisoleucine Proteins 0.000 description 6
- 108010029384 tryptophyl-histidine Proteins 0.000 description 6
- 108010078580 tyrosylleucine Proteins 0.000 description 6
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 5
- LCBSSOCDWUTQQV-SDDRHHMPSA-N Arg-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LCBSSOCDWUTQQV-SDDRHHMPSA-N 0.000 description 5
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 5
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 5
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 5
- JTEGHEWKBCTIAL-IXOXFDKPSA-N Cys-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N)O JTEGHEWKBCTIAL-IXOXFDKPSA-N 0.000 description 5
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 5
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 5
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 5
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 5
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 5
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 5
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 5
- JIUYRPFQJJRSJB-QWRGUYRKSA-N His-His-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JIUYRPFQJJRSJB-QWRGUYRKSA-N 0.000 description 5
- VDHOMPFVSABJKU-ULQDDVLXSA-N His-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N VDHOMPFVSABJKU-ULQDDVLXSA-N 0.000 description 5
- ZVKDCQVQTGYBQT-LSJOCFKGSA-N His-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O ZVKDCQVQTGYBQT-LSJOCFKGSA-N 0.000 description 5
- UWBDLNOCIDGPQE-GUBZILKMSA-N Ile-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN UWBDLNOCIDGPQE-GUBZILKMSA-N 0.000 description 5
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 5
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 5
- SQUTUWHAAWJYES-GUBZILKMSA-N Met-Asp-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SQUTUWHAAWJYES-GUBZILKMSA-N 0.000 description 5
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 5
- HOTVCUAVDQHUDB-UFYCRDLUSA-N Pro-Phe-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 HOTVCUAVDQHUDB-UFYCRDLUSA-N 0.000 description 5
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 5
- BKOKTRCZXRIQPX-ZLUOBGJFSA-N Ser-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N BKOKTRCZXRIQPX-ZLUOBGJFSA-N 0.000 description 5
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 5
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 5
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 5
- 241000723790 Tobacco vein mottling virus Species 0.000 description 5
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 5
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 5
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 5
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 5
- 108010005233 alanylglutamic acid Proteins 0.000 description 5
- 108010013835 arginine glutamate Proteins 0.000 description 5
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 5
- 230000001419 dependent effect Effects 0.000 description 5
- 239000000758 substrate Substances 0.000 description 5
- 238000001890 transfection Methods 0.000 description 5
- 108010020532 tyrosyl-proline Proteins 0.000 description 5
- OZRFYUJEXYKQDV-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-3-carboxypropanoyl)amino]-3-carboxypropanoyl]amino]-3-carboxypropanoyl]amino]butanedioic acid Chemical compound OC(=O)CC(N)C(=O)NC(CC(O)=O)C(=O)NC(CC(O)=O)C(=O)NC(CC(O)=O)C(O)=O OZRFYUJEXYKQDV-UHFFFAOYSA-N 0.000 description 4
- LVPCJMUBOHOZHE-UHFFFAOYSA-N 4-amino-2-[[2-[[2-[(2-amino-3-methylbutanoyl)amino]-3-methylpentanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-4-oxobutanoic acid Chemical compound CC(C)C(N)C(=O)NC(C(C)CC)C(=O)NC(C(=O)NC(CC(N)=O)C(O)=O)CC1=CN=CN1 LVPCJMUBOHOZHE-UHFFFAOYSA-N 0.000 description 4
- QEVHRUUCFGRFIF-UHFFFAOYSA-N 6,18-dimethoxy-17-[oxo-(3,4,5-trimethoxyphenyl)methoxy]-1,3,11,12,14,15,16,17,18,19,20,21-dodecahydroyohimban-19-carboxylic acid methyl ester Chemical compound C1C2CN3CCC(C4=CC=C(OC)C=C4N4)=C4C3CC2C(C(=O)OC)C(OC)C1OC(=O)C1=CC(OC)=C(OC)C(OC)=C1 QEVHRUUCFGRFIF-UHFFFAOYSA-N 0.000 description 4
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 4
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 4
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 4
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 4
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 4
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 4
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 4
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 4
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 4
- BUQICHWNXBIBOG-LMVFSUKVSA-N Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)N BUQICHWNXBIBOG-LMVFSUKVSA-N 0.000 description 4
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 4
- TVUFMYKTYXTRPY-HERUPUMHSA-N Ala-Trp-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O TVUFMYKTYXTRPY-HERUPUMHSA-N 0.000 description 4
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 4
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 4
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 4
- WESHVRNMNFMVBE-FXQIFTODSA-N Arg-Asn-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N WESHVRNMNFMVBE-FXQIFTODSA-N 0.000 description 4
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 4
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 4
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 4
- HPSVTWMFWCHKFN-GARJFASQSA-N Arg-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O HPSVTWMFWCHKFN-GARJFASQSA-N 0.000 description 4
- PCQXGEUALSFGIA-WDSOQIARSA-N Arg-His-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PCQXGEUALSFGIA-WDSOQIARSA-N 0.000 description 4
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 4
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 4
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 4
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 4
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 4
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 4
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 description 4
- JZDZLBJVYWIIQU-AVGNSLFASA-N Asn-Glu-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JZDZLBJVYWIIQU-AVGNSLFASA-N 0.000 description 4
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 4
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 4
- OWUCNXMFJRFOFI-BQBZGAKWSA-N Asn-Gly-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OWUCNXMFJRFOFI-BQBZGAKWSA-N 0.000 description 4
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 4
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 4
- YUUIAUXBNOHFRJ-IHRRRGAJSA-N Asn-Phe-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O YUUIAUXBNOHFRJ-IHRRRGAJSA-N 0.000 description 4
- FTNRWCPWDWRPAV-BZSNNMDCSA-N Asn-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTNRWCPWDWRPAV-BZSNNMDCSA-N 0.000 description 4
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 4
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 4
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 4
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 4
- KWBQPGIYEZKDEG-FSPLSTOPSA-N Asn-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O KWBQPGIYEZKDEG-FSPLSTOPSA-N 0.000 description 4
- SLHOOKXYTYAJGQ-XVYDVKMFSA-N Asp-Ala-His Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 SLHOOKXYTYAJGQ-XVYDVKMFSA-N 0.000 description 4
- HOQGTAIGQSDCHR-SRVKXCTJSA-N Asp-Asn-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HOQGTAIGQSDCHR-SRVKXCTJSA-N 0.000 description 4
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 4
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 4
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 4
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 4
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 4
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 description 4
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 4
- VWWAFGHMPWBKEP-GMOBBJLQSA-N Asp-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)O)N VWWAFGHMPWBKEP-GMOBBJLQSA-N 0.000 description 4
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 4
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 4
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 4
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 4
- AMRLSQGGERHDHJ-FXQIFTODSA-N Cys-Ala-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMRLSQGGERHDHJ-FXQIFTODSA-N 0.000 description 4
- KLLFLHBKSJAUMZ-ACZMJKKPSA-N Cys-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N KLLFLHBKSJAUMZ-ACZMJKKPSA-N 0.000 description 4
- MUZAUPFGPMMZSS-GUBZILKMSA-N Cys-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N MUZAUPFGPMMZSS-GUBZILKMSA-N 0.000 description 4
- CVLIHKBUPSFRQP-WHFBIAKZSA-N Cys-Gly-Ala Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C)C(O)=O CVLIHKBUPSFRQP-WHFBIAKZSA-N 0.000 description 4
- BLGNLNRBABWDST-CIUDSAMLSA-N Cys-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BLGNLNRBABWDST-CIUDSAMLSA-N 0.000 description 4
- CMYVIUWVYHOLRD-ZLUOBGJFSA-N Cys-Ser-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CMYVIUWVYHOLRD-ZLUOBGJFSA-N 0.000 description 4
- LKHMGNHQULEPFY-ACZMJKKPSA-N Cys-Ser-Glu Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O LKHMGNHQULEPFY-ACZMJKKPSA-N 0.000 description 4
- SRZZZTMJARUVPI-JBDRJPRFSA-N Cys-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N SRZZZTMJARUVPI-JBDRJPRFSA-N 0.000 description 4
- 101100533283 Dictyostelium discoideum serp gene Proteins 0.000 description 4
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 4
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 4
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 4
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 4
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 4
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 4
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 4
- JJSVALISDCNFCU-SZMVWBNQSA-N Glu-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JJSVALISDCNFCU-SZMVWBNQSA-N 0.000 description 4
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 4
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 4
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 4
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 4
- UQULNJAARAXSPO-ZCWPNWOLSA-N Glu-Thr-Thr-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UQULNJAARAXSPO-ZCWPNWOLSA-N 0.000 description 4
- JVZLZVJTIXVIHK-SXNHZJKMSA-N Glu-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N JVZLZVJTIXVIHK-SXNHZJKMSA-N 0.000 description 4
- MIWJDJAMMKHUAR-ZVZYQTTQSA-N Glu-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N MIWJDJAMMKHUAR-ZVZYQTTQSA-N 0.000 description 4
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 4
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 4
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 4
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 4
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 4
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 4
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 4
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 4
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 4
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 4
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 4
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 4
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 4
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 4
- MTBIKIMYHUWBRX-QWRGUYRKSA-N Gly-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN MTBIKIMYHUWBRX-QWRGUYRKSA-N 0.000 description 4
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 4
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 4
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 4
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 4
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 4
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 4
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 4
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 4
- MBSSHYPAEHPSGY-LSJOCFKGSA-N His-Ala-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O MBSSHYPAEHPSGY-LSJOCFKGSA-N 0.000 description 4
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 4
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 4
- LBQAHBIVXQSBIR-HVTMNAMFSA-N His-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LBQAHBIVXQSBIR-HVTMNAMFSA-N 0.000 description 4
- SKYULSWNBYAQMG-IHRRRGAJSA-N His-Leu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SKYULSWNBYAQMG-IHRRRGAJSA-N 0.000 description 4
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 4
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 4
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 4
- ZUELLZFHJUPFEC-PMVMPFDFSA-N His-Phe-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CN=CN1 ZUELLZFHJUPFEC-PMVMPFDFSA-N 0.000 description 4
- KAXZXLSXFWSNNZ-XVYDVKMFSA-N His-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KAXZXLSXFWSNNZ-XVYDVKMFSA-N 0.000 description 4
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 4
- VXZZUXWAOMWWJH-QTKMDUPCSA-N His-Thr-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VXZZUXWAOMWWJH-QTKMDUPCSA-N 0.000 description 4
- KECFCPNPPYCGBL-PMVMPFDFSA-N His-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CN=CN4)N KECFCPNPPYCGBL-PMVMPFDFSA-N 0.000 description 4
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 4
- UNDGQKWQNSTPPW-CYDGBPFRSA-N Ile-Arg-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N UNDGQKWQNSTPPW-CYDGBPFRSA-N 0.000 description 4
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 4
- LLHYWBGDMBGNHA-VGDYDELISA-N Ile-Cys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LLHYWBGDMBGNHA-VGDYDELISA-N 0.000 description 4
- AWTDTFXPVCTHAK-BJDJZHNGSA-N Ile-Cys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N AWTDTFXPVCTHAK-BJDJZHNGSA-N 0.000 description 4
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 4
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 4
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 4
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 4
- GTSAALPQZASLPW-KJYZGMDISA-N Ile-His-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N GTSAALPQZASLPW-KJYZGMDISA-N 0.000 description 4
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 4
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 4
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 4
- GLLAUPMJCGKPFY-BLMTYFJBSA-N Ile-Ile-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 GLLAUPMJCGKPFY-BLMTYFJBSA-N 0.000 description 4
- FFJQAEYLAQMGDL-MGHWNKPDSA-N Ile-Lys-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FFJQAEYLAQMGDL-MGHWNKPDSA-N 0.000 description 4
- MSASLZGZQAXVFP-PEDHHIEDSA-N Ile-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N MSASLZGZQAXVFP-PEDHHIEDSA-N 0.000 description 4
- KLJKJVXDHVUMMZ-KKPKCPPISA-N Ile-Phe-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N KLJKJVXDHVUMMZ-KKPKCPPISA-N 0.000 description 4
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 4
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 4
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 4
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 4
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 4
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 4
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 4
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 4
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 4
- XBCWOTOCBXXJDG-BZSNNMDCSA-N Leu-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XBCWOTOCBXXJDG-BZSNNMDCSA-N 0.000 description 4
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 4
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 4
- ZALAVHVPPOHAOL-XUXIUFHCSA-N Leu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N ZALAVHVPPOHAOL-XUXIUFHCSA-N 0.000 description 4
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 4
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 4
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 4
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 4
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 4
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 4
- JLYUZRKPDKHUTC-WDSOQIARSA-N Leu-Pro-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JLYUZRKPDKHUTC-WDSOQIARSA-N 0.000 description 4
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 4
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 4
- URJUVJDTPXCQFL-IHPCNDPISA-N Leu-Trp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N URJUVJDTPXCQFL-IHPCNDPISA-N 0.000 description 4
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 4
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 4
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 4
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 4
- ALSRJRIWBNENFY-DCAQKATOSA-N Lys-Arg-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O ALSRJRIWBNENFY-DCAQKATOSA-N 0.000 description 4
- JBRWKVANRYPCAF-XIRDDKMYSA-N Lys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N JBRWKVANRYPCAF-XIRDDKMYSA-N 0.000 description 4
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 4
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 4
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 4
- DYJOORGDQIGZAS-DCAQKATOSA-N Lys-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N DYJOORGDQIGZAS-DCAQKATOSA-N 0.000 description 4
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 4
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 4
- SEZADXQOJJTXPG-VFAJRCTISA-N Lys-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N)O SEZADXQOJJTXPG-VFAJRCTISA-N 0.000 description 4
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 4
- NQOQDINRVQCAKD-ULQDDVLXSA-N Lys-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N NQOQDINRVQCAKD-ULQDDVLXSA-N 0.000 description 4
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 4
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 4
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 4
- MVMNUCOHQGYYKB-PEDHHIEDSA-N Met-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCSC)N MVMNUCOHQGYYKB-PEDHHIEDSA-N 0.000 description 4
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 4
- IMTUWVJPCQPJEE-IUCAKERBSA-N Met-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN IMTUWVJPCQPJEE-IUCAKERBSA-N 0.000 description 4
- KRLKICLNEICJGV-STQMWFEESA-N Met-Phe-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 KRLKICLNEICJGV-STQMWFEESA-N 0.000 description 4
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 4
- SMVTWPOATVIXTN-NAKRPEOUSA-N Met-Ser-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SMVTWPOATVIXTN-NAKRPEOUSA-N 0.000 description 4
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 4
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 4
- MFDDVIJCQYOOES-GUBZILKMSA-N Met-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N MFDDVIJCQYOOES-GUBZILKMSA-N 0.000 description 4
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 4
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 4
- 108700026244 Open Reading Frames Proteins 0.000 description 4
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 4
- OXUMFAOVGFODPN-KKUMJFAQSA-N Phe-Asn-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OXUMFAOVGFODPN-KKUMJFAQSA-N 0.000 description 4
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 4
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 4
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 4
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 4
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 4
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 4
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 4
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 4
- RTUWVJVJSMOGPL-KKUMJFAQSA-N Phe-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RTUWVJVJSMOGPL-KKUMJFAQSA-N 0.000 description 4
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 description 4
- WKLMCMXFMQEKCX-SLFFLAALSA-N Phe-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O WKLMCMXFMQEKCX-SLFFLAALSA-N 0.000 description 4
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 4
- YRHRGNUAXGUPTO-PMVMPFDFSA-N Phe-Trp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCCCN)C(=O)O)N YRHRGNUAXGUPTO-PMVMPFDFSA-N 0.000 description 4
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 4
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 4
- OLTFZQIYCNOBLI-DCAQKATOSA-N Pro-Cys-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O OLTFZQIYCNOBLI-DCAQKATOSA-N 0.000 description 4
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 4
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 4
- FFSLAIOXRMOFIZ-GJZGRUSLSA-N Pro-Gly-Trp Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)O)C(=O)CNC(=O)[C@@H]1CCCN1 FFSLAIOXRMOFIZ-GJZGRUSLSA-N 0.000 description 4
- ZTMLZUNPFDGPKY-VKOGCVSHSA-N Pro-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ZTMLZUNPFDGPKY-VKOGCVSHSA-N 0.000 description 4
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 4
- SXMSEHDMNIUTSP-DCAQKATOSA-N Pro-Lys-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SXMSEHDMNIUTSP-DCAQKATOSA-N 0.000 description 4
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 4
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 4
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 4
- XNJVJEHDZPDPQL-BZSNNMDCSA-N Pro-Trp-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H]1CCCN1)C(O)=O XNJVJEHDZPDPQL-BZSNNMDCSA-N 0.000 description 4
- 108010079005 RDV peptide Proteins 0.000 description 4
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 4
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 4
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 4
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 4
- GRSLLFZTTLBOQX-CIUDSAMLSA-N Ser-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N GRSLLFZTTLBOQX-CIUDSAMLSA-N 0.000 description 4
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 4
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 4
- CXBFHZLODKPIJY-AAEUAGOBSA-N Ser-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N CXBFHZLODKPIJY-AAEUAGOBSA-N 0.000 description 4
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 4
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 4
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 4
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 4
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 4
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 4
- PBUXMVYWOSKHMF-WDSKDSINSA-N Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CO PBUXMVYWOSKHMF-WDSKDSINSA-N 0.000 description 4
- AXOHAHIUJHCLQR-IHRRRGAJSA-N Ser-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CO)N AXOHAHIUJHCLQR-IHRRRGAJSA-N 0.000 description 4
- MHVXPTAMDHLTHB-IHPCNDPISA-N Ser-Phe-Trp Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MHVXPTAMDHLTHB-IHPCNDPISA-N 0.000 description 4
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 4
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 4
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 4
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 4
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 4
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 4
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 4
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 4
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 4
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 4
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 4
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 4
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 4
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 4
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 4
- JMBRNXUOLJFURW-BEAPCOKYSA-N Thr-Phe-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N)O JMBRNXUOLJFURW-BEAPCOKYSA-N 0.000 description 4
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 4
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 4
- NBIIPOKZPUGATB-BWBBJGPYSA-N Thr-Ser-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O NBIIPOKZPUGATB-BWBBJGPYSA-N 0.000 description 4
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 4
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 4
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 4
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 4
- PXYJUECTGMGIDT-WDSOQIARSA-N Trp-Arg-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 PXYJUECTGMGIDT-WDSOQIARSA-N 0.000 description 4
- HJTYJQVRIQXMHM-XIRDDKMYSA-N Trp-Asp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N HJTYJQVRIQXMHM-XIRDDKMYSA-N 0.000 description 4
- NKUIXQOJUAEIET-AQZXSJQPSA-N Trp-Asp-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@H](O)C)C(O)=O)=CNC2=C1 NKUIXQOJUAEIET-AQZXSJQPSA-N 0.000 description 4
- MZDJYWGXAIEYEP-BPUTZDHNSA-N Trp-Cys-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MZDJYWGXAIEYEP-BPUTZDHNSA-N 0.000 description 4
- KDWZQYUTMJSYRJ-BHYGNILZSA-N Trp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O KDWZQYUTMJSYRJ-BHYGNILZSA-N 0.000 description 4
- PGPCENKYTLDIFM-SZMVWBNQSA-N Trp-His-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PGPCENKYTLDIFM-SZMVWBNQSA-N 0.000 description 4
- ILDJYIDXESUBOE-HSCHXYMDSA-N Trp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ILDJYIDXESUBOE-HSCHXYMDSA-N 0.000 description 4
- GWBWCGITOYODER-YTQUADARSA-N Trp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GWBWCGITOYODER-YTQUADARSA-N 0.000 description 4
- OWSRIUBVJOQHNY-IHPCNDPISA-N Trp-Lys-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N OWSRIUBVJOQHNY-IHPCNDPISA-N 0.000 description 4
- UUIYFDAWNBSWPG-IHPCNDPISA-N Trp-Lys-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N UUIYFDAWNBSWPG-IHPCNDPISA-N 0.000 description 4
- KCZGSXPFPNKGLE-WDSOQIARSA-N Trp-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N KCZGSXPFPNKGLE-WDSOQIARSA-N 0.000 description 4
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 4
- PMDWYLVWHRTJIW-STQMWFEESA-N Tyr-Gly-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PMDWYLVWHRTJIW-STQMWFEESA-N 0.000 description 4
- JJNXZIPLIXIGBX-HJPIBITLSA-N Tyr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JJNXZIPLIXIGBX-HJPIBITLSA-N 0.000 description 4
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 4
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 4
- VPEFOFYNHBWFNQ-UFYCRDLUSA-N Tyr-Pro-Tyr Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 VPEFOFYNHBWFNQ-UFYCRDLUSA-N 0.000 description 4
- LDKDSFQSEUOCOO-RPTUDFQQSA-N Tyr-Thr-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LDKDSFQSEUOCOO-RPTUDFQQSA-N 0.000 description 4
- DBMMKEHYWIZTPN-JYJNAYRXSA-N Val-Cys-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N DBMMKEHYWIZTPN-JYJNAYRXSA-N 0.000 description 4
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 4
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 4
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 4
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 4
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 4
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 4
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 4
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 4
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 4
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 4
- 230000009471 action Effects 0.000 description 4
- 108010041407 alanylaspartic acid Proteins 0.000 description 4
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 4
- 230000001908 autoinhibitory effect Effects 0.000 description 4
- 210000004899 c-terminal region Anatomy 0.000 description 4
- 230000036755 cellular response Effects 0.000 description 4
- 230000000295 complement effect Effects 0.000 description 4
- 108010079547 glutamylmethionine Proteins 0.000 description 4
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 4
- 108010084389 glycyltryptophan Proteins 0.000 description 4
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 108010085203 methionylmethionine Proteins 0.000 description 4
- 108010029020 prolylglycine Proteins 0.000 description 4
- 108010090894 prolylleucine Proteins 0.000 description 4
- 230000000638 stimulation Effects 0.000 description 4
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 4
- 239000013598 vector Substances 0.000 description 4
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 3
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 3
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 3
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 3
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 3
- JEPNLGMEZMCFEX-QSFUFRPTSA-N Ala-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N JEPNLGMEZMCFEX-QSFUFRPTSA-N 0.000 description 3
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 3
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 3
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 3
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 3
- BIOCIVSVEDFKDJ-GUBZILKMSA-N Arg-Arg-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O BIOCIVSVEDFKDJ-GUBZILKMSA-N 0.000 description 3
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 3
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 3
- HOIFSHOLNKQCSA-FXQIFTODSA-N Asn-Arg-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O HOIFSHOLNKQCSA-FXQIFTODSA-N 0.000 description 3
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 3
- IIFDPDVJAHQFSR-WHFBIAKZSA-N Asn-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O IIFDPDVJAHQFSR-WHFBIAKZSA-N 0.000 description 3
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 3
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 3
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 3
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 3
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 3
- ZNYKKCADEQAZKA-FXQIFTODSA-N Asn-Ser-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O ZNYKKCADEQAZKA-FXQIFTODSA-N 0.000 description 3
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 3
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 3
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 3
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 3
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 3
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 3
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 3
- 108010075254 C-Peptide Proteins 0.000 description 3
- 241001123946 Gaga Species 0.000 description 3
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 3
- HQOGXFLBAKJUMH-CIUDSAMLSA-N Glu-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N HQOGXFLBAKJUMH-CIUDSAMLSA-N 0.000 description 3
- PMSDOVISAARGAV-FHWLQOOXSA-N Glu-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 PMSDOVISAARGAV-FHWLQOOXSA-N 0.000 description 3
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 3
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 3
- FUESBOMYALLFNI-VKHMYHEASA-N Gly-Asn Chemical compound NCC(=O)N[C@H](C(O)=O)CC(N)=O FUESBOMYALLFNI-VKHMYHEASA-N 0.000 description 3
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 3
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 3
- DKEXFJVMVGETOO-LURJTMIESA-N Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CN DKEXFJVMVGETOO-LURJTMIESA-N 0.000 description 3
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 3
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 3
- OCPPBNKYGYSLOE-IUCAKERBSA-N Gly-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN OCPPBNKYGYSLOE-IUCAKERBSA-N 0.000 description 3
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 3
- AWHJQEYGWRKPHE-LSJOCFKGSA-N His-Ala-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AWHJQEYGWRKPHE-LSJOCFKGSA-N 0.000 description 3
- SOFSRBYHDINIRG-QTKMDUPCSA-N His-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N)O SOFSRBYHDINIRG-QTKMDUPCSA-N 0.000 description 3
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 3
- JBSLJUPMTYLLFH-MELADBBJSA-N His-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O JBSLJUPMTYLLFH-MELADBBJSA-N 0.000 description 3
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 3
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 3
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 3
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 3
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 3
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 3
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 3
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 3
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 3
- GGAPIOORBXHMNY-ULQDDVLXSA-N Lys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)O GGAPIOORBXHMNY-ULQDDVLXSA-N 0.000 description 3
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 3
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 3
- ATIPDCIQTUXABX-UWVGGRQHSA-N Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ATIPDCIQTUXABX-UWVGGRQHSA-N 0.000 description 3
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 3
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 3
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 3
- RXWPLVRJQNWXRQ-IHRRRGAJSA-N Met-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 RXWPLVRJQNWXRQ-IHRRRGAJSA-N 0.000 description 3
- KYJHWKAMFISDJE-RCWTZXSCSA-N Met-Thr-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCSC KYJHWKAMFISDJE-RCWTZXSCSA-N 0.000 description 3
- JHVNNUIQXOGAHI-KJEVXHAQSA-N Met-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N)O JHVNNUIQXOGAHI-KJEVXHAQSA-N 0.000 description 3
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 3
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 3
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 3
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 3
- VJLLEKDQJSMHRU-STQMWFEESA-N Phe-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O VJLLEKDQJSMHRU-STQMWFEESA-N 0.000 description 3
- GLJZDMZJHFXJQG-BZSNNMDCSA-N Phe-Ser-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLJZDMZJHFXJQG-BZSNNMDCSA-N 0.000 description 3
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 3
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 3
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 3
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 3
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 3
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 3
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 3
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 3
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 3
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 3
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 3
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 3
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 3
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 3
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 3
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 3
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 3
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 3
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 3
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 3
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 3
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 3
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 3
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 3
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 3
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 3
- FKAPNDWDLDWZNF-QEJZJMRPSA-N Trp-Asp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FKAPNDWDLDWZNF-QEJZJMRPSA-N 0.000 description 3
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 3
- OKDNSNWJEXAMSU-IRXDYDNUSA-N Tyr-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 OKDNSNWJEXAMSU-IRXDYDNUSA-N 0.000 description 3
- VBFVQTPETKJCQW-RPTUDFQQSA-N Tyr-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VBFVQTPETKJCQW-RPTUDFQQSA-N 0.000 description 3
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 3
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 3
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 3
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 3
- 230000004075 alteration Effects 0.000 description 3
- 210000000170 cell membrane Anatomy 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 108010060199 cysteinylproline Proteins 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 108020001507 fusion proteins Proteins 0.000 description 3
- 102000037865 fusion proteins Human genes 0.000 description 3
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 3
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 3
- 108010028295 histidylhistidine Proteins 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 230000017730 intein-mediated protein splicing Effects 0.000 description 3
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 3
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 230000004481 post-translational protein modification Effects 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 230000011664 signaling Effects 0.000 description 3
- 230000035882 stress Effects 0.000 description 3
- 238000012546 transfer Methods 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- 108010027345 wheylin-1 peptide Proteins 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 2
- XJFPXLWGZWAWRQ-UHFFFAOYSA-N 2-[[2-[[2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(O)=O XJFPXLWGZWAWRQ-UHFFFAOYSA-N 0.000 description 2
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 2
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 2
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 2
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 2
- ZFXQNADNEBRERM-BJDJZHNGSA-N Ala-Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ZFXQNADNEBRERM-BJDJZHNGSA-N 0.000 description 2
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 2
- XAGIMRPOEJSYER-CIUDSAMLSA-N Ala-Cys-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XAGIMRPOEJSYER-CIUDSAMLSA-N 0.000 description 2
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 2
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 2
- GRPHQEMIFDPKOE-HGNGGELXSA-N Ala-His-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GRPHQEMIFDPKOE-HGNGGELXSA-N 0.000 description 2
- QJABSQFUHKHTNP-SYWGBEHUSA-N Ala-Ile-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QJABSQFUHKHTNP-SYWGBEHUSA-N 0.000 description 2
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 2
- VEAPAYQQLSEKEM-GUBZILKMSA-N Ala-Met-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O VEAPAYQQLSEKEM-GUBZILKMSA-N 0.000 description 2
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 2
- USNSOPDIZILSJP-FXQIFTODSA-N Arg-Asn-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O USNSOPDIZILSJP-FXQIFTODSA-N 0.000 description 2
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 2
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 2
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 2
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 2
- RRGPUNYIPJXJBU-GUBZILKMSA-N Arg-Asp-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O RRGPUNYIPJXJBU-GUBZILKMSA-N 0.000 description 2
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 2
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 2
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 2
- FOQFHANLUJDQEE-GUBZILKMSA-N Arg-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CS)C(=O)O FOQFHANLUJDQEE-GUBZILKMSA-N 0.000 description 2
- LFAUVOXPCGJKTB-DCAQKATOSA-N Arg-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N LFAUVOXPCGJKTB-DCAQKATOSA-N 0.000 description 2
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 2
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 2
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 2
- WAEWODAAWLGLMK-OYDLWJJNSA-N Arg-Trp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WAEWODAAWLGLMK-OYDLWJJNSA-N 0.000 description 2
- QJWLLRZTJFPCHA-STECZYCISA-N Arg-Tyr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QJWLLRZTJFPCHA-STECZYCISA-N 0.000 description 2
- FMYQECOAIFGQGU-CYDGBPFRSA-N Arg-Val-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMYQECOAIFGQGU-CYDGBPFRSA-N 0.000 description 2
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 2
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 2
- GWNMUVANAWDZTI-YUMQZZPRSA-N Asn-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GWNMUVANAWDZTI-YUMQZZPRSA-N 0.000 description 2
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 2
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 2
- RZNAMKZJPBQWDJ-SRVKXCTJSA-N Asn-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N RZNAMKZJPBQWDJ-SRVKXCTJSA-N 0.000 description 2
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 2
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 2
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 2
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 2
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 2
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 2
- CPYHLXSGDBDULY-IHPCNDPISA-N Asn-Trp-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CPYHLXSGDBDULY-IHPCNDPISA-N 0.000 description 2
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 2
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 2
- WJHYGGVCWREQMO-GHCJXIJMSA-N Asp-Cys-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WJHYGGVCWREQMO-GHCJXIJMSA-N 0.000 description 2
- KVPHTGVUMJGMCX-BIIVOSGPSA-N Asp-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)C(=O)O KVPHTGVUMJGMCX-BIIVOSGPSA-N 0.000 description 2
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 2
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 2
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 2
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 2
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 2
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 2
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 2
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 2
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 2
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 2
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 2
- KACWACLNYLSVCA-VHWLVUOQSA-N Asp-Trp-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KACWACLNYLSVCA-VHWLVUOQSA-N 0.000 description 2
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 2
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 2
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 2
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 2
- 240000002791 Brassica napus Species 0.000 description 2
- 235000004977 Brassica sinapistrum Nutrition 0.000 description 2
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 2
- WDQXKVCQXRNOSI-GHCJXIJMSA-N Cys-Asp-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WDQXKVCQXRNOSI-GHCJXIJMSA-N 0.000 description 2
- RRJOQIBQVZDVCW-SRVKXCTJSA-N Cys-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N RRJOQIBQVZDVCW-SRVKXCTJSA-N 0.000 description 2
- CIVXDCMSSFGWAL-YUMQZZPRSA-N Cys-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N CIVXDCMSSFGWAL-YUMQZZPRSA-N 0.000 description 2
- MBRWOKXNHTUJMB-CIUDSAMLSA-N Cys-Pro-Glu Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O MBRWOKXNHTUJMB-CIUDSAMLSA-N 0.000 description 2
- KSMSFCBQBQPFAD-GUBZILKMSA-N Cys-Pro-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 KSMSFCBQBQPFAD-GUBZILKMSA-N 0.000 description 2
- BUAUGQJXGNRTQE-AAEUAGOBSA-N Cys-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N BUAUGQJXGNRTQE-AAEUAGOBSA-N 0.000 description 2
- ZFHXNNXMNLWKJH-HJPIBITLSA-N Cys-Tyr-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZFHXNNXMNLWKJH-HJPIBITLSA-N 0.000 description 2
- 238000007399 DNA isolation Methods 0.000 description 2
- 230000004568 DNA-binding Effects 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 102000003688 G-Protein-Coupled Receptors Human genes 0.000 description 2
- 108090000045 G-Protein-Coupled Receptors Proteins 0.000 description 2
- 241001200922 Gagata Species 0.000 description 2
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 2
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 2
- OBIHEDRRSMRKLU-ACZMJKKPSA-N Glu-Cys-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OBIHEDRRSMRKLU-ACZMJKKPSA-N 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 2
- LSPKYLAFTPBWIL-BYPYZUCNSA-N Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(O)=O LSPKYLAFTPBWIL-BYPYZUCNSA-N 0.000 description 2
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 2
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 2
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 2
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 2
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 2
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 2
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 2
- ZWMYUDZLXAQHCK-CIUDSAMLSA-N Glu-Met-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O ZWMYUDZLXAQHCK-CIUDSAMLSA-N 0.000 description 2
- XEKAJTCACGEBOK-KKUMJFAQSA-N Glu-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XEKAJTCACGEBOK-KKUMJFAQSA-N 0.000 description 2
- LPHGXOWFAXFCPX-KKUMJFAQSA-N Glu-Pro-Phe Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O LPHGXOWFAXFCPX-KKUMJFAQSA-N 0.000 description 2
- JPUNZXVHHRZMNL-XIRDDKMYSA-N Glu-Pro-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JPUNZXVHHRZMNL-XIRDDKMYSA-N 0.000 description 2
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 2
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 2
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 2
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 2
- JLXVRFDTDUGQEE-YFKPBYRVSA-N Gly-Arg Chemical compound NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N JLXVRFDTDUGQEE-YFKPBYRVSA-N 0.000 description 2
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 2
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 2
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 2
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 2
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 2
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 2
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 2
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 2
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 2
- DENRBIYENOKSEX-PEXQALLHSA-N Gly-Ile-His Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DENRBIYENOKSEX-PEXQALLHSA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 2
- BBTCXWTXOXUNFX-IUCAKERBSA-N Gly-Met-Arg Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O BBTCXWTXOXUNFX-IUCAKERBSA-N 0.000 description 2
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 2
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 2
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 2
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 2
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 2
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 2
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 2
- HVLSXIKZNLPZJJ-TXZCQADKSA-N HA peptide Chemical compound C([C@@H](C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HVLSXIKZNLPZJJ-TXZCQADKSA-N 0.000 description 2
- 108010068250 Herpes Simplex Virus Protein Vmw65 Proteins 0.000 description 2
- DFHVLUKTTVTCKY-PBCZWWQYSA-N His-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N)O DFHVLUKTTVTCKY-PBCZWWQYSA-N 0.000 description 2
- ZYDYEPDFFVCUBI-SRVKXCTJSA-N His-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZYDYEPDFFVCUBI-SRVKXCTJSA-N 0.000 description 2
- MMFKFJORZBJVNF-UWVGGRQHSA-N His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 MMFKFJORZBJVNF-UWVGGRQHSA-N 0.000 description 2
- LVWIJITYHRZHBO-IXOXFDKPSA-N His-Leu-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LVWIJITYHRZHBO-IXOXFDKPSA-N 0.000 description 2
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 2
- NBWATNYAUVSAEQ-ZEILLAHLSA-N His-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O NBWATNYAUVSAEQ-ZEILLAHLSA-N 0.000 description 2
- CSRRMQFXMBPSIL-SIXJUCDHSA-N His-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CN=CN3)N CSRRMQFXMBPSIL-SIXJUCDHSA-N 0.000 description 2
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 2
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 2
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 2
- JSZMKEYEVLDPDO-ACZMJKKPSA-N Ile-Cys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CS)C(O)=O JSZMKEYEVLDPDO-ACZMJKKPSA-N 0.000 description 2
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 2
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 2
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 2
- UASTVUQJMLZWGG-PEXQALLHSA-N Ile-His-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N UASTVUQJMLZWGG-PEXQALLHSA-N 0.000 description 2
- RQQCJTLBSJMVCR-DSYPUSFNSA-N Ile-Leu-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N RQQCJTLBSJMVCR-DSYPUSFNSA-N 0.000 description 2
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 2
- JTBFQNHKNRZJDS-SYWGBEHUSA-N Ile-Trp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C)C(=O)O)N JTBFQNHKNRZJDS-SYWGBEHUSA-N 0.000 description 2
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 2
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 2
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 2
- SWNRZNLXMXRCJC-VKOGCVSHSA-N Ile-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 SWNRZNLXMXRCJC-VKOGCVSHSA-N 0.000 description 2
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 2
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 2
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 2
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 2
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 2
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 2
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 2
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 2
- HGUUMQWGYCVPKG-DCAQKATOSA-N Leu-Pro-Cys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HGUUMQWGYCVPKG-DCAQKATOSA-N 0.000 description 2
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 2
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 2
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 2
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 2
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 2
- HOMFINRJHIIZNJ-HOCLYGCPSA-N Leu-Trp-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O HOMFINRJHIIZNJ-HOCLYGCPSA-N 0.000 description 2
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 2
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 2
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 2
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 2
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 2
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 2
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 2
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 2
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 2
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 2
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 2
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 2
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 2
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 2
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 2
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 2
- MTBLFIQZECOEBY-IHRRRGAJSA-N Lys-Met-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O MTBLFIQZECOEBY-IHRRRGAJSA-N 0.000 description 2
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 2
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 2
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 2
- CFOLERIRBUAYAD-HOCLYGCPSA-N Lys-Trp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O CFOLERIRBUAYAD-HOCLYGCPSA-N 0.000 description 2
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 2
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 2
- DNDVVILEHVMWIS-LPEHRKFASA-N Met-Asp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DNDVVILEHVMWIS-LPEHRKFASA-N 0.000 description 2
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 2
- YCUSPBPZVJDMII-YUMQZZPRSA-N Met-Gly-Glu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O YCUSPBPZVJDMII-YUMQZZPRSA-N 0.000 description 2
- HWROAFGWPQUPTE-OSUNSFLBSA-N Met-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCSC)N HWROAFGWPQUPTE-OSUNSFLBSA-N 0.000 description 2
- UFOWQBYMUILSRK-IHRRRGAJSA-N Met-Lys-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 UFOWQBYMUILSRK-IHRRRGAJSA-N 0.000 description 2
- CGUYGMFQZCYJSG-DCAQKATOSA-N Met-Lys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CGUYGMFQZCYJSG-DCAQKATOSA-N 0.000 description 2
- QTMIXEQWGNIPBL-JYJNAYRXSA-N Met-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N QTMIXEQWGNIPBL-JYJNAYRXSA-N 0.000 description 2
- FBLBCGLSRXBANI-KKUMJFAQSA-N Met-Phe-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FBLBCGLSRXBANI-KKUMJFAQSA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- 108010065395 Neuropep-1 Proteins 0.000 description 2
- 241000424623 Nostoc punctiforme Species 0.000 description 2
- 108091028043 Nucleic acid sequence Proteins 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- LZDIENNKWVXJMX-JYJNAYRXSA-N Phe-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CC=CC=C1 LZDIENNKWVXJMX-JYJNAYRXSA-N 0.000 description 2
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 2
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 2
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 2
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 2
- NRKNYPRRWXVELC-NQCBNZPSSA-N Phe-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=CC=C3)N NRKNYPRRWXVELC-NQCBNZPSSA-N 0.000 description 2
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 2
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 description 2
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 2
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 2
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 2
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 2
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 2
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 2
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 2
- YTGGLKWSVIRECD-JBACZVJFSA-N Phe-Trp-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 YTGGLKWSVIRECD-JBACZVJFSA-N 0.000 description 2
- BQMFWUKNOCJDNV-HJWJTTGWSA-N Phe-Val-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQMFWUKNOCJDNV-HJWJTTGWSA-N 0.000 description 2
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 2
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 2
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 2
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 2
- QVIZLAUEAMQKGS-GUBZILKMSA-N Pro-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 QVIZLAUEAMQKGS-GUBZILKMSA-N 0.000 description 2
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 2
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 2
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 2
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 2
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 2
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 2
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 2
- RVQDZELMXZRSSI-IUCAKERBSA-N Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 RVQDZELMXZRSSI-IUCAKERBSA-N 0.000 description 2
- ZZCJYPLMOPTZFC-SRVKXCTJSA-N Pro-Met-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O ZZCJYPLMOPTZFC-SRVKXCTJSA-N 0.000 description 2
- IWIANZLCJVYEFX-RYUDHWBXSA-N Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 IWIANZLCJVYEFX-RYUDHWBXSA-N 0.000 description 2
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 2
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 2
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 2
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 2
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 2
- NBDHWLZEMKSVHH-UVBJJODRSA-N Pro-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 NBDHWLZEMKSVHH-UVBJJODRSA-N 0.000 description 2
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 2
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 2
- HZWAHWQZPSXNCB-BPUTZDHNSA-N Ser-Arg-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HZWAHWQZPSXNCB-BPUTZDHNSA-N 0.000 description 2
- LTFSLKWFMWZEBD-IMJSIDKUSA-N Ser-Asn Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O LTFSLKWFMWZEBD-IMJSIDKUSA-N 0.000 description 2
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 2
- LAFKUZYWNCHOHT-WHFBIAKZSA-N Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O LAFKUZYWNCHOHT-WHFBIAKZSA-N 0.000 description 2
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 2
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 2
- MOQDPPUMFSMYOM-KKUMJFAQSA-N Ser-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N MOQDPPUMFSMYOM-KKUMJFAQSA-N 0.000 description 2
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 2
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 2
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 2
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 2
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 2
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 2
- JUTGONBTALQWMK-NAKRPEOUSA-N Ser-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N JUTGONBTALQWMK-NAKRPEOUSA-N 0.000 description 2
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 2
- ASGYVPAVFNDZMA-GUBZILKMSA-N Ser-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N ASGYVPAVFNDZMA-GUBZILKMSA-N 0.000 description 2
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 2
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 2
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 2
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 2
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 2
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 2
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 2
- UQGAAZXSCGWMFU-UBHSHLNASA-N Ser-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N UQGAAZXSCGWMFU-UBHSHLNASA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 2
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 2
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 2
- UQTNIFUCMBFWEJ-IWGUZYHVSA-N Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O UQTNIFUCMBFWEJ-IWGUZYHVSA-N 0.000 description 2
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 2
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 2
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 2
- PAOYNIKMYOGBMR-PBCZWWQYSA-N Thr-Asn-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PAOYNIKMYOGBMR-PBCZWWQYSA-N 0.000 description 2
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 2
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 2
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 2
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 2
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 2
- BCYUHPXBHCUYBA-CUJWVEQBSA-N Thr-Ser-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BCYUHPXBHCUYBA-CUJWVEQBSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 2
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 2
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 2
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 2
- VEENWOSZGWWKHW-SZZJOZGLSA-N Thr-Trp-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N)O VEENWOSZGWWKHW-SZZJOZGLSA-N 0.000 description 2
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 2
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 2
- LCPVBXOHXMBLFW-JSGCOSHPSA-N Trp-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)=CNC2=C1 LCPVBXOHXMBLFW-JSGCOSHPSA-N 0.000 description 2
- IJRXQJVGFBSKIV-ZFWWWQNUSA-N Trp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N IJRXQJVGFBSKIV-ZFWWWQNUSA-N 0.000 description 2
- PITVQFJBUFDJDD-XEGUGMAKSA-N Trp-Ile Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)=CNC2=C1 PITVQFJBUFDJDD-XEGUGMAKSA-N 0.000 description 2
- MKDXQPMIQPTTAW-SIXJUCDHSA-N Trp-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N MKDXQPMIQPTTAW-SIXJUCDHSA-N 0.000 description 2
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 2
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 2
- LTSIAOZUVISRAQ-QWRGUYRKSA-N Tyr-Gly-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O LTSIAOZUVISRAQ-QWRGUYRKSA-N 0.000 description 2
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 2
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 2
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 2
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 2
- HHFMNAVFGBYSAT-IGISWZIWSA-N Tyr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HHFMNAVFGBYSAT-IGISWZIWSA-N 0.000 description 2
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 2
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 2
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 2
- ZSXJENBJGRHKIG-UWVGGRQHSA-N Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZSXJENBJGRHKIG-UWVGGRQHSA-N 0.000 description 2
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 2
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 2
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 2
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 2
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 2
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 2
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 2
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 2
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 2
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 2
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 2
- ZEBRMWPTJNHXAJ-JYJNAYRXSA-N Val-Phe-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O)N ZEBRMWPTJNHXAJ-JYJNAYRXSA-N 0.000 description 2
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 2
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 2
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 2
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 2
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 2
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 2
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 2
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 2
- 239000011543 agarose gel Substances 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010013829 alpha subunit DNA polymerase III Proteins 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 210000004900 c-terminal fragment Anatomy 0.000 description 2
- 229910052791 calcium Inorganic materials 0.000 description 2
- 239000011575 calcium Substances 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 230000001276 controlling effect Effects 0.000 description 2
- 238000005520 cutting process Methods 0.000 description 2
- 108010069495 cysteinyltyrosine Proteins 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 208000035475 disorder Diseases 0.000 description 2
- 108091006047 fluorescent proteins Proteins 0.000 description 2
- 102000034287 fluorescent proteins Human genes 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 102000006240 membrane receptors Human genes 0.000 description 2
- 108020004084 membrane receptors Proteins 0.000 description 2
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 108020001580 protein domains Proteins 0.000 description 2
- 230000017854 proteolysis Effects 0.000 description 2
- 230000006337 proteolytic cleavage Effects 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 2
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 2
- 230000008054 signal transmission Effects 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 230000008685 targeting Effects 0.000 description 2
- 239000012096 transfection reagent Substances 0.000 description 2
- 230000001052 transient effect Effects 0.000 description 2
- 108010014563 tryptophyl-cysteinyl-serine Proteins 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- PIDRBUDUWHBYSR-UHFFFAOYSA-N 1-[2-[[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O PIDRBUDUWHBYSR-UHFFFAOYSA-N 0.000 description 1
- QMOQBVOBWVNSNO-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QMOQBVOBWVNSNO-UHFFFAOYSA-N 0.000 description 1
- QWTLUPDHBKBULE-UHFFFAOYSA-N 2-[[2-[[2-[[2-[[2-[[2-[[2-[[2-[[2-[(2-aminoacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetic acid Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QWTLUPDHBKBULE-UHFFFAOYSA-N 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- JQDFGZKKXBEANU-IMJSIDKUSA-N Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(O)=O JQDFGZKKXBEANU-IMJSIDKUSA-N 0.000 description 1
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 description 1
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- JNLDTVRGXMSYJC-UVBJJODRSA-N Ala-Pro-Trp Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JNLDTVRGXMSYJC-UVBJJODRSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- IDLBLNBDLCTPGC-HERUPUMHSA-N Ala-Trp-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CS)C(=O)O)N IDLBLNBDLCTPGC-HERUPUMHSA-N 0.000 description 1
- SFPRJVVDZNLUTG-OWLDWWDNSA-N Ala-Trp-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFPRJVVDZNLUTG-OWLDWWDNSA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- JVMKBJNSRZWDBO-FXQIFTODSA-N Arg-Cys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O JVMKBJNSRZWDBO-FXQIFTODSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 1
- HRCIIMCTUIAKQB-XGEHTFHBSA-N Arg-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O HRCIIMCTUIAKQB-XGEHTFHBSA-N 0.000 description 1
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 1
- FTMRPIVPSDVGCC-GUBZILKMSA-N Arg-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FTMRPIVPSDVGCC-GUBZILKMSA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 1
- SNAKIVFVLVUCKB-UHFFFAOYSA-N Asn-Glu-Ala-Lys Natural products NCCCCC(C(O)=O)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(N)CC(N)=O SNAKIVFVLVUCKB-UHFFFAOYSA-N 0.000 description 1
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- MFMJRYHVLLEMQM-DCAQKATOSA-N Asp-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N MFMJRYHVLLEMQM-DCAQKATOSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- 241000726103 Atta Species 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 1
- 102000000584 Calmodulin Human genes 0.000 description 1
- 108010041952 Calmodulin Proteins 0.000 description 1
- 108090000397 Caspase 3 Proteins 0.000 description 1
- 102000003952 Caspase 3 Human genes 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 1
- AEJSNWMRPXAKCW-WHFBIAKZSA-N Cys-Ala-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AEJSNWMRPXAKCW-WHFBIAKZSA-N 0.000 description 1
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 1
- ATPDEYTYWVMINF-ZLUOBGJFSA-N Cys-Cys-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O ATPDEYTYWVMINF-ZLUOBGJFSA-N 0.000 description 1
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 1
- DQGIAOGALAQBGK-BWBBJGPYSA-N Cys-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O DQGIAOGALAQBGK-BWBBJGPYSA-N 0.000 description 1
- NRVQLLDIJJEIIZ-VZFHVOOUSA-N Cys-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N)O NRVQLLDIJJEIIZ-VZFHVOOUSA-N 0.000 description 1
- 239000006144 Dulbecco's modified Eagle's medium Substances 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- QGDOOCIPHSSADO-STQMWFEESA-N Gly-Met-Phe Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGDOOCIPHSSADO-STQMWFEESA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- JBJNKUOMNZGQIM-PYJNHQTQSA-N His-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JBJNKUOMNZGQIM-PYJNHQTQSA-N 0.000 description 1
- RAVLQPXCMRCLKT-KBPBESRZSA-N His-Gly-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RAVLQPXCMRCLKT-KBPBESRZSA-N 0.000 description 1
- STOOMQFEJUVAKR-KKUMJFAQSA-N His-His-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 STOOMQFEJUVAKR-KKUMJFAQSA-N 0.000 description 1
- AKAPKBNIVNPIPO-KKUMJFAQSA-N His-His-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 AKAPKBNIVNPIPO-KKUMJFAQSA-N 0.000 description 1
- XMAUFHMAAVTODF-STQMWFEESA-N His-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XMAUFHMAAVTODF-STQMWFEESA-N 0.000 description 1
- LNDVNHOSZQPJGI-AVGNSLFASA-N His-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 LNDVNHOSZQPJGI-AVGNSLFASA-N 0.000 description 1
- AHEBIAHEZWQVHB-QTKMDUPCSA-N His-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O AHEBIAHEZWQVHB-QTKMDUPCSA-N 0.000 description 1
- PFTFEWHJSAXGED-ZKWXMUAHSA-N Ile-Cys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N PFTFEWHJSAXGED-ZKWXMUAHSA-N 0.000 description 1
- BBIXOODYWPFNDT-CIUDSAMLSA-N Ile-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O BBIXOODYWPFNDT-CIUDSAMLSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 1
- XGDCYUQSFDQISZ-BQBZGAKWSA-N Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O XGDCYUQSFDQISZ-BQBZGAKWSA-N 0.000 description 1
- 241000234435 Lilium Species 0.000 description 1
- 239000012097 Lipofectamine 2000 Substances 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- IPSDPDAOSAEWCN-RHYQMDGZSA-N Lys-Met-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IPSDPDAOSAEWCN-RHYQMDGZSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- ZOKVLMBYDSIDKG-CSMHCCOUSA-N Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ZOKVLMBYDSIDKG-CSMHCCOUSA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- SBSIKVMCCJUCBZ-GUBZILKMSA-N Met-Asn-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N SBSIKVMCCJUCBZ-GUBZILKMSA-N 0.000 description 1
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- 208000012902 Nervous system disease Diseases 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- GLUBLISJVJFHQS-VIFPVBQESA-N Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 GLUBLISJVJFHQS-VIFPVBQESA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- SFKOEHXABNPLRT-KBPBESRZSA-N Phe-His-Gly Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)NCC(O)=O SFKOEHXABNPLRT-KBPBESRZSA-N 0.000 description 1
- CKJACGQPCPMWIT-UFYCRDLUSA-N Phe-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CKJACGQPCPMWIT-UFYCRDLUSA-N 0.000 description 1
- 102000045595 Phosphoprotein Phosphatases Human genes 0.000 description 1
- 108700019535 Phosphoprotein Phosphatases Proteins 0.000 description 1
- 108010076039 Polyproteins Proteins 0.000 description 1
- 241000710078 Potyvirus Species 0.000 description 1
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 1
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 1
- XJROSHJRQTXWAE-XGEHTFHBSA-N Pro-Cys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XJROSHJRQTXWAE-XGEHTFHBSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- PEYNRYREGPAOAK-LSJOCFKGSA-N Pro-His-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 PEYNRYREGPAOAK-LSJOCFKGSA-N 0.000 description 1
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 1
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 241000242743 Renilla reniformis Species 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- MCFXDOUEIOSDKM-UHFFFAOYSA-N Ser Ala Trp Cys Chemical compound C1=CC=C2C(CC(NC(=O)C(NC(=O)C(N)CO)C)C(=O)NC(CS)C(O)=O)=CNC2=C1 MCFXDOUEIOSDKM-UHFFFAOYSA-N 0.000 description 1
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- MPPHJZYXDVDGOF-BWBBJGPYSA-N Ser-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CO MPPHJZYXDVDGOF-BWBBJGPYSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 1
- UZJDBCHMIQXLOQ-HEIBUPTGSA-N Thr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O UZJDBCHMIQXLOQ-HEIBUPTGSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- 108090000190 Thrombin Proteins 0.000 description 1
- VZBWRZGNEPBRDE-HZUKXOBISA-N Trp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N VZBWRZGNEPBRDE-HZUKXOBISA-N 0.000 description 1
- FBGDDUKYOBNZJL-WDSOQIARSA-N Trp-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N FBGDDUKYOBNZJL-WDSOQIARSA-N 0.000 description 1
- ZZDFLJFVSNQINX-HWHUXHBOSA-N Trp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O ZZDFLJFVSNQINX-HWHUXHBOSA-N 0.000 description 1
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 1
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 1
- ZHWZDZFWBXWPDW-GUBZILKMSA-N Val-Val-Cys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O ZHWZDZFWBXWPDW-GUBZILKMSA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 108020005202 Viral DNA Proteins 0.000 description 1
- 108700010756 Viral Polyproteins Proteins 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 229920001222 biopolymer Polymers 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 230000020411 cell activation Effects 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000005754 cellular signaling Effects 0.000 description 1
- 210000003850 cellular structure Anatomy 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 239000002158 endotoxin Substances 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 230000004992 fission Effects 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 229940084937 glyset Drugs 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- 238000005734 heterodimerization reaction Methods 0.000 description 1
- 230000003054 hormonal effect Effects 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 210000001144 hymen Anatomy 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 239000003999 initiator Substances 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 108020001756 ligand binding domains Proteins 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 210000004898 n-terminal fragment Anatomy 0.000 description 1
- 239000002858 neurotransmitter agent Substances 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 230000008723 osmotic stress Effects 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- MXHCPCSDRGLRER-UHFFFAOYSA-N pentaglycine Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(O)=O MXHCPCSDRGLRER-UHFFFAOYSA-N 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 230000004962 physiological condition Effects 0.000 description 1
- 230000006461 physiological response Effects 0.000 description 1
- 244000000003 plant pathogen Species 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 235000015277 pork Nutrition 0.000 description 1
- 230000006555 post-translational control Effects 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 244000144977 poultry Species 0.000 description 1
- 230000037452 priming Effects 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 230000001012 protector Effects 0.000 description 1
- 238000002331 protein detection Methods 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 230000002797 proteolythic effect Effects 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 230000037425 regulation of transcription Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000009712 regulation of translation Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- 229960004072 thrombin Drugs 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 108010045269 tryptophyltryptophan Proteins 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 230000034512 ubiquitination Effects 0.000 description 1
- 238000010798 ubiquitination Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K19/00—Hybrid peptides, i.e. peptides covalently bound to nucleic acids, or non-covalently bound protein-protein complexes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/48—Hydrolases (3) acting on peptide bonds (3.4)
- C12N9/50—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25)
- C12N9/503—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from viruses
- C12N9/506—Proteinases, e.g. Endopeptidases (3.4.21-3.4.25) derived from viruses derived from RNA viruses
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Virology (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- General Engineering & Computer Science (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biophysics (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Peptides Or Proteins (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
The invention relates to a combination of orthogonal split proteases that recognize a target sequence comprising at least 6 amino acids, linked to dimerization domains that enable assembly of the split proteases. At least two orthogonal proteases are prepared as split fragments fused to dimerization domains, where dimerization can be inducible by light, a chemical, or another input signal. The proteases cleave one or more target proteins that contain a target sequence for one or more of the orthogonal proteases and that act as downstream signal mediators, reporter proteins, or therapeutic proteins. With appropriately selected target proteins, protease-mediated logic circuits can be prepared. The invention also relates to cells that contain expressed split proteases for signal transduction.
Description
A combination of split orthogonal proteases with dimerization domains that enable assembly
Field of the Invention
The invention relates to a combination of two or more orthogonal split proteases with dimerization domains that enable assembly of the split proteases, and to cells that contain combinations of orthogonal split proteases and target proteins.
State of the Art
Activating selected cells at a defined time and place, and precisely controlling the cellular response to activation by one or more input signals, is an important technological problem. Activation can be achieved with chemical activators, a change in temperature or pH, electrodes, or light. Cells can also be activated by indirect or direct mechanical stimuli such as touch, shear forces, fluid flow, hypo- and hyperosmotic stress, and ultrasound. To control the cellular response to activation, synthetic biology often assembles artificial signalling cascades and logic circuits. These logic circuits are usually based on the regulation of transcription and translation of proteins, which is a relatively slow process because it requires transcription and translation, so the response is delayed by more than several tens of minutes. Faster signalling in cells proceeds through post-translational modifications of proteins (Olson & Tabor 2012), such as phosphorylation, ubiquitination, and proteolytic cleavage.
Proteolysis has already been described as a signal transduction mechanism, and three main ways of controlling protease activity are known: autoinhibitory control, the proximity sensor, and the split protease.
An autoinhibitory peptide is a short amino acid sequence that binds in the active site of a protease and thereby prevents binding of the target substrate. If such an autoinhibitory peptide is attached to the protease through a cleavage site for a second protease, the activity of the first protease becomes dependent on the activity of the second. In this way, for example, a TVMV protease that is activated upon cleavage by thrombin and an HCV protease that is activated upon cleavage by the TVMV protease have been prepared (Stein & Alexandrov 2014). With a suitable choice of linker peptides and inducible dimerization domains, a protease can also be prepared in which binding of the autoinhibitory peptide in the active site depends on the presence of a ligand. A TVMV protease that is either activated or deactivated in the presence of a short peptide ligand was prepared in this way (Stein & Alexandrov 2014).
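The dependency described above, in which one protease becomes active only after another protease has cleaved off its autoinhibitory peptide, can be illustrated with a small Boolean sketch. This is only an illustrative model under simplifying assumptions (all-or-nothing cleavage, no kinetics); the names `AutoinhibitedProtease`, `build_cascade`, and `propagate` are hypothetical and do not come from the cited work.

```python
from dataclasses import dataclass


@dataclass
class AutoinhibitedProtease:
    """Hypothetical model of a protease blocked by a tethered autoinhibitory peptide."""
    name: str
    released_by: str       # protease whose cleavage site sits in the tether
    inhibited: bool = True


def build_cascade() -> list[AutoinhibitedProtease]:
    """Thrombin -> TVMV -> HCV, as in the example attributed to Stein & Alexandrov 2014."""
    return [
        AutoinhibitedProtease(name="HCV", released_by="TVMV"),
        AutoinhibitedProtease(name="TVMV", released_by="thrombin"),
    ]


def propagate(inputs: set[str], proteases: list[AutoinhibitedProtease]) -> set[str]:
    """Return the names of all active proteases once no further cleavage can occur."""
    active = set(inputs)
    changed = True
    while changed:
        changed = False
        for protease in proteases:
            # Assumption: an active upstream protease always removes the inhibitor.
            if protease.inhibited and protease.released_by in active:
                protease.inhibited = False
                active.add(protease.name)
                changed = True
    return active


if __name__ == "__main__":
    print(sorted(propagate({"thrombin"}, build_cascade())))
    print(sorted(propagate(set(), build_cascade())))
```

In this toy model, supplying thrombin as the only input activates TVMV and then HCV, while an empty input leaves both proteases inhibited.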
The proximity sensor, by contrast, controls protease activity by controlling the colocalization of the protease and its substrate. For example, cleavage by the TEV protease was made dependent on an external signal by linking the protease to a membrane receptor (GPCR), while the reporter (luciferase) is bound to the receptor adaptor (arrestin), which binds to the membrane receptor only in the presence of an external ligand such as a neurotransmitter or hormone; the GPCR-bound protease then triggers proteolytic cleavage of the reporter (Eishingdrelo et al. 2011). Similarly, the autoinhibited HCV and TVMV proteases described above were coupled into a proximity sensor by adding the FKBP and FRB domains to them (Stein & Alexandrov 2014). These domains associate in the presence of rapamycin, which also brings the two proteases into close proximity. If one of the proteases contains, in the linker peptide between the enzyme and the autoinhibitory peptide, a target cleavage site for the other protease, their close proximity allows this autoinhibitory peptide to be cleaved off, which activates the system.
The third mode of ligand-induced protease activation, the split protease, has so far been described only for TEV protease (Wehr et al. 2006). The protease sequence was split into an N-terminal and a C-terminal fragment, which are attached to the rapamycin-binding domains FKBP and FRB. In the presence of rapamycin these domains associate and thereby bring together the split protease fragments, which only then can adopt the active form. By suitably arranging target cleavage sites for proteases on reporter proteins or other intermediary proteins, the activity of several different proteases can also be made to affect the reporter in different ways. So far, for example, control of genetic circuits has been reported using orthogonal proteases that cleave inhibitory domains on transcription factors, or that cleave off an inhibitory domain of a degron attached to the transcription factor and thereby expose the transcription factor to degradation (Fernandez-Rodriguez & Voigt 2016). Shekhawat and colleagues (Shekhawat et al. 2009) developed a reporter consisting of two fragments of split luciferase, each attached to a coiled coil; dimerization of these coiled coils joins the luciferase fragments into the active form. Each coiled coil additionally carries, attached through a protease cleavage site, an autoinhibitory peptide that prevents dimerization. Only when both autoinhibitory peptides are cleaved by the action of two proteases (TEV protease and caspase 3 were used) is dimerization enabled and an output signal detected. The reporter described thus acts as a logical AND function.
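The AND behavior of the autoinhibited coiled-coil reporter described above can be sketched as a small Boolean model. This is only an illustration: the function name is hypothetical, and the assignment of each protease to a particular coiled coil is an assumption made for the sketch, not something stated in the text.

```python
def and_gate_reporter(tev_active: bool, caspase3_active: bool) -> bool:
    """Return True when the split-luciferase reporter can reassemble."""
    # Each coiled coil carries one autoinhibitory peptide and stays blocked
    # until the matching protease cleaves that peptide off.
    # Assumption: TEV frees the first coil, caspase 3 frees the second.
    first_coil_free = tev_active
    second_coil_free = caspase3_active
    # Dimerization, and hence an output signal, needs both coils unblocked.
    return first_coil_free and second_coil_free


# Enumerate all four input combinations as a truth table.
truth_table = {
    (tev, casp): and_gate_reporter(tev, casp)
    for tev in (False, True)
    for casp in (False, True)
}
for inputs, output in truth_table.items():
    print(inputs, "->", output)
```

Only the input combination with both proteases active yields a signal in this model, which is what makes the reporter an AND gate.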
TEV protease belongs to the family of potyviral proteases. Potyviruses are plant pathogens whose genome is expressed as a single polyprotein, part of which is the nuclear inclusion domain (NIa), which cleaves the viral polyprotein into individual functional subunits. Besides TEV protease, the proteases of the viruses PPV, TVMV, SbMV, SuMMV and others, for example, are known and characterized from this family, as are the target sequences that these proteases cleave efficiently (J. M. Adams et al. 2005; M. J. Adams et al. 2005; Ghabrial et al. 1990; Tozser et al. 2005). Some of the listed proteases have already been shown to be mutually orthogonal (Fernandez-Rodriguez & Voigt 2016). Nevertheless, only TEV protease is widely used as a synthetic biology tool, and it is the only potyviral protease that has been prepared in split form so far.
Literature
Adams, J.M., Antoniw, F.J. & Fauquet, M.C., 2005. Molecular criteria for genus and species discrimination within the family Potyviridae. Archives of Virology, 150(3), pp.459-479.
Adams, M.J., Antoniw, J.F. & Beaudoin, F., 2005. Overview and analysis of the polyprotein cleavage sites in the family Potyviridae. Molecular Plant Pathology, 6(4), pp.471-487.
Brown, E.J. et al., 1994. A mammalian protein targeted by G1-arresting rapamycin-receptor complex. Nature, 369(6483), pp.756-758.
Eishingdrelo, H. et al., 2011. A cell-based protein-protein interaction method using a permuted luciferase reporter. Current Chemical Genomics, 5, pp.122-128.
Fernandez-Rodriguez, J. & Voigt, C.A., 2016. Post-translational control of genetic circuits using Potyvirus proteases. Nucleic Acids Research, 44(13), pp.6493-502.
Ghabrial, S.A. et al., 1990. Molecular genetic analyses of the soybean mosaic virus NIa proteinase. The Journal of General Virology, 71(9), pp.1921-7.
Kanno, A. et al., 2007. Cyclic luciferase for real-time sensing of caspase-3 activities in living mammals. Angewandte Chemie (International ed. in English), 46(40), pp.7595-7599.
Kennedy, M.J. et al., 2010. Rapid blue-light-mediated induction of protein interactions in living cells. Nature Methods, 7(12), pp.973-975.
Olson, E.J. & Tabor, J.J., 2012. Post-translational tools expand the scope of synthetic biology. Current Opinion in Chemical Biology, 16(3-4), pp.300-6.
Sabatini, D.M. et al., 1994. RAFT1: a mammalian protein that binds to FKBP12 in a rapamycin-dependent fashion and is homologous to yeast TORs. Cell, 78(1), pp.35-43.
Sabers, C.J. et al., 1995. Isolation of a protein target of the FKBP12-rapamycin complex in mammalian cells. The Journal of Biological Chemistry, 270(2), pp.815-822.
Shekhawat, S.S. et al., 2009. An Autoinhibited Coiled-Coil Design Strategy for Split-Protein Protease Sensors. Journal of the American Chemical Society, 131(42), pp.15284-15290.
Stein, V. & Alexandrov, K., 2014. Protease-based synthetic sensing and signal amplification. Proceedings of the National Academy of Sciences of the United States of America, 111(45), pp.15934-9.
Tozser, J. et al., 2005. Comparison of the substrate specificity of two potyvirus proteases. FEBS Journal, 272(2), pp.514-523.
Wehr, M.C. et al., 2006. Monitoring regulated protein-protein interactions using split TEV. Nature Methods, 3(12), pp.985-993.
Technical problem
For precise control and selectivity of biological systems it is desirable to combine several signals, with which different combinations of physiological conditions, cells or environmental signals can be selected. Natural cells do this through signaling pathways. The problem is to design a new orthogonal signaling pathway, meaning one that affects only selected cellular components in a precisely defined way.
In designing new orthogonal signaling pathways within human cells, we found that signaling through proteolysis offers a so far unexploited opportunity to construct a system for rapid signal transmission. For operation inside cells it would be best to select proteases that act on longer target sequences containing at least 6 amino acids, which makes it unlikely that they would damage proteins required for the functioning of the cell. At the same time, precisely defined target sites for cleavage by a selected protease, or combinations of such sites, can be inserted into target proteins.
In doing so we noted a considerable shortage of orthogonal proteases that act on well-defined, longer, distinct polypeptide sequences and whose activity could depend directly on an input signal.
More complex control over the cellular response to various external signals requires a larger number of signaling molecules and cascades that either act orthogonally to one another or complement one another and act as logic circuits. It is especially important that the activity of the transmitting molecules in the signaling cascade (proteases, in the case described above) is precisely controlled and orthogonal. The selected proteases must therefore: a) be inactive in the absence of the signal, b) allow simple reconstitution of activity in the presence of the signal, c) not affect endogenous processes in the cell, and d) be mutually orthogonal, that is, signal transmission through one part of the cascade must not affect signal transmission through another part of the cascade, except where this is deliberately designed.
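Requirement d) can be stated as a simple check on a cleavage matrix: each protease should cut its own recognition site and none of the others. The sketch below is a minimal illustration of that criterion. The function name, the data layout and the Boolean values are hypothetical placeholders, not measurements from this document.

```python
from typing import Dict

# cleaves[protease][site] is True when that protease cuts that site.
CleavageMatrix = Dict[str, Dict[str, bool]]


def is_orthogonal(cleaves: CleavageMatrix) -> bool:
    """True if every protease cuts its own site and no other site."""
    for protease, sites in cleaves.items():
        for site, is_cut in sites.items():
            # A site named after a protease is assumed to be its own target.
            if is_cut != (site == protease):
                return False
    return True


# Hypothetical placeholder values, chosen only to exercise the check.
example = {
    "TEV": {"TEV": True, "PPV": False},
    "PPV": {"TEV": False, "PPV": True},
}
cross_reactive = {
    "TEV": {"TEV": True, "PPV": True},  # TEV also cuts the PPV site
    "PPV": {"TEV": False, "PPV": True},
}
print(is_orthogonal(example), is_orthogonal(cross_reactive))
```

In practice the entries would come from thresholded reporter measurements rather than hand-set Booleans; the threshold choice is outside the scope of this sketch.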
The state of the art shows that current solutions cannot provide precise, fast, logically designed and selective control of biological systems at multiple levels, coupled to different or identical input signals.
Description of the solution to the problem
The invention solves the above-described problem of the lack of fast orthogonal signaling pathways based on post-translational modifications of proteins. This was achieved by introducing into cells a combination of at least two orthogonal split proteases linked to dimerization domains that enable assembly of the split proteases.
The invention relates to a combination of orthogonal split proteases that recognize a target sequence comprising at least 6 amino acids, linked to dimerization domains that enable assembly of the split proteases. At least two orthogonal proteases are prepared as split fragments linked to dimerization domains, where dimerization may be inducible by a light, chemical or other input signal. The proteases cleave one or more target proteins that contain a target sequence for one or more orthogonal proteases and act as further signal mediators, reporter proteins or therapeutic proteins. With suitably selected target proteins, protease-mediated logic circuits can be prepared.
The invention also relates to cells that contain a combination of orthogonal split proteases linked to dimerization domains, together with target proteins containing the corresponding cleavage sites, which the individual proteases recognize once assembled. A complementary pair of dimerization domains enables assembly of the individual split proteases. The complementary pair may assemble spontaneously or with the aid of an initiator, which may be intracellular or external, such as light, pH, temperature, mechanical stress or a chemical signal. The dimerization domains may differ for each type of split protease or be the same for all orthogonal proteases.
The reconstituted protease acts on a target protein that contains a cleavage site for the selected protease and may be an endogenous or transgenically introduced reporter or other signal mediator.
We selected the family of potyviral proteases, of which TEV protease is a member, and on the basis of sequence and functional similarity among the individual proteases of this family invented new split proteases. All described proteases of this family have a similar amino acid sequence, which can be expressed as two fragments of the split protein, linked through short flexible linker peptides to inducible dimerization domains. The separate protease fragments according to the invention form functionally inactive proteins. Upon addition of a ligand or upon another signal (e.g. light) that causes association of the dimerization domains, the split proteases are structurally and functionally reconstituted and cleave the next member of the signaling cascade, which contains the target peptide sequence for cleavage by the selected protease. The protein cleaved in this way may be a direct reporter (for example luciferase or a fluorescent protein) or the next signal mediator (for example a transcription factor, kinase, phosphatase or another protease); its key property is that its activity depends on cleavage.
A signaling system based on split proteases contains at least two split proteases. Each of the split proteases in the system may have its own target protein (mediator or reporter), or two or more proteases may act on the same target protein if it contains different recognition sites for cleavage. The concept of the invention is shown graphically in Figure 1.
Signal transmission through split proteases triggers or stops transcription of selected genes or secretion of selected molecules, which is useful for treating diseases of the nervous system, hormonal disorders, circulatory and metabolic disorders, and other diseases. The invention is useful for regulating the function of human, animal or plant cells, for transmitting signals to other cells, for secreting peptides, proteins and other molecules, and for detecting a combination of two or more chemical, light or mechanical signals.
Description of the figures
Figure 1: Schematic of an example signaling system with orthogonal split proteases linked to dimerization domains, which can be activated by an inducer that may be external or intrinsic to the cell. In the presence of the inducer, the inactive split protease assembles into an active protease and cleaves the target protein substrate, which leads to a change in the target protein; the target may be a reporter, a new protease, an inactive enzyme, etc. Each orthogonal split protease may have its own dimerization domains, so that different inducers can activate different orthogonal proteases and, through logic operations, lead to different cellular responses. The proteases cleave a target protein that leads to continuation of the signaling pathway (for example transcriptional regulation) or constitutes the output signal (for example a fluorescent protein). If the target protein contains recognition sites for cleavage by different proteases, protease-mediated signal transmission can act as a logic circuit.
Figure 2: Orthogonality of the selected proteases. The activity of each protease is shown as the activity of cyclic luciferase carrying the indicated protease cleavage site. The target cleavage sequences for each protease are shown.
Figure 3: Inducibility of the selected proteases. The activity of each protease is shown as the activity of cyclic luciferase carrying the corresponding cleavage site for the selected protease. A) Induction of split proteases with rapamycin. B) Induction of split TEV protease with blue light. C) Induction of split PPV protease with blue light.
Figure 4: Activation of orthogonal signaling pathways mediated by the TEV and PPV proteases. TEV protease cleaves a transcription factor, which increases expression of the mCitrine reporter. PPV protease activates the cyclic luciferase reporter by cleavage.
Figure 5: NOR logic operation mediated by the TEV and PPV proteases. A) Scheme of the NOR logic circuit. In the absence of signal (input signal 0 0), the target proteins form an active luciferase reporter (output signal 1) by dimerization through the coiled coils P3 and AP4. In the presence of either or both of the TEV and PPV proteases (input signals 1 0, 0 1 and 1 1), the reporter fragments are cleaved from the coiled coils and therefore no longer assemble into a functional reporter (output signal 0). B) Measured reporter activity of the NOR logic function upon coexpression of the TEV and PPV proteases. Addition of either protease, or of both proteases together, reduces reporter activity.
Figure 6: A NIMPLY B logic function mediated by split TEV protease with light induction and split PPV protease with chemical induction. A) Scheme of the A NIMPLY B logic circuit. The peptide P3mS prevents formation of the coiled coil between AP4 and P3 (input signal 0 0). Upon cleavage by TEV protease (TEVp), P3mS is cleaved off and luciferase is reconstituted (input signal 1 0). PPV protease cleaves the C-terminal luciferase fragment from peptide P3 and thereby prevents reconstitution of the enzyme (input signals 0 1 and 1 1). B) Measured reporter activity of the A NIMPLY B logic function for different input signals. Reporter activity is high only upon stimulation with light, which activates TEV protease, and not upon stimulation with rapamycin, which activates PPV protease, or with rapamycin and light together.
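The input/output behavior described for Figures 5 and 6 can be written out as truth tables. The following sketch is an idealized Boolean abstraction of the two circuits, with hypothetical function names; it treats protease activity as strictly on or off and ignores the graded reporter activity that is actually measured.

```python
def nor_reporter(tev: bool, ppv: bool) -> bool:
    """Figure 5: the reporter is active only when neither protease is present."""
    # Either protease cleaves a luciferase fragment off its coiled coil,
    # so the fragments can no longer be brought together.
    return not (tev or ppv)


def a_nimply_b_reporter(tev: bool, ppv: bool) -> bool:
    """Figure 6: the reporter is active only with TEV present and PPV absent."""
    # TEV cleaves off P3mS, which otherwise blocks AP4/P3 coiled-coil formation.
    inhibitor_removed = tev
    # PPV detaches the C-terminal luciferase fragment from P3.
    fragment_attached = not ppv
    return inhibitor_removed and fragment_attached


# Print both truth tables side by side: TEV, PPV, NOR, A NIMPLY B.
for tev in (False, True):
    for ppv in (False, True):
        print(int(tev), int(ppv),
              int(nor_reporter(tev, ppv)),
              int(a_nimply_b_reporter(tev, ppv)))
```

Modeling each cleavage event as its own Boolean keeps the mapping between the molecular mechanism and the logic function visible.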
Detailed description of the invention and embodiments
The invention relates to a combination of two or more orthogonal split proteases that recognize a target sequence comprising at least 6 amino acids, with dimerization domains that enable functional reconstitution of protease activity, and of target proteins bearing a target sequence recognized by the proteases.
The combination of orthogonal split proteases with dimerization domains according to the invention includes:
• at least two split proteases, each expressed as at least two fragments of the split protein, selected preferably from the NIa family of potyviral proteases, preferably the SuMMV, SbMV, PPV and TEV proteases; preferably, at least one of the split proteases consists of two proteins with the protein sequences SEQ ID 4 and SEQ ID 6, SEQ ID 10 and SEQ ID 12, SEQ ID 16 and SEQ ID 18, SEQ ID 22 and SEQ ID 24, or their homologs having at least 30% amino acid sequence similarity
• split proteases linked to dimerization domains that enable their assembly and that are: (i) spontaneously assembling domains and/or (ii) domains whose assembly is induced by an intracellular or an external physical, chemical or biological signal
• dimerization domains that are spontaneously assembling domains selected from domains that spontaneously form dimers, preferably dimeric proteins, coiled coils, or beta structures, preferably with orthogonal properties
• dimerization domains triggered by (i) an intracellular signal, such as a change in the concentration of a metabolite or second messenger or a change in enzyme activity, or by (ii) an external signal, such as a change in temperature, pH value, mechanical stress, a change in osmotic pressure, ultrasound, light, or a chemical or biological signal
• dimerization domains, preferably coiled coils, light-inducible domains, preferably CRY2 and CIBN, chemically inducible domains, preferably FKBP and FRB, or calcium-dependent domains, preferably calmodulin and M13, fused to the split protease fragments, which can form homodimers or heterodimers or, by self-excision, give rise to a covalent bond (inteins), and whose dimerization depends on an input signal, the input signal being a mechanical stimulus, preferably ultrasound or touch, a light signal, a chemical ligand, preferably rapamycin, or another physiologically relevant signal, preferably proteolytic cleavage
• dimerization domains that are spontaneously assembling domains selected from orthogonal coiled coils
• a target protein with a recognition sequence for the proteases that is a native, recombinant or artificially generated protein, the recognition sequence being at least 6 amino acid residues long
• at least one protein that contains at least one recognition site for cleavage by at least one of the split proteases and can form the next member of the signaling pathway,
pre or a therapeutic target, a therapeutic protein or peptide or a reporting protein, preferably luciferase, SEAP, or a fluorescence protein; • sufficiently long unstructured binding peptides between the individual protein domains that allow for the correct functional reconstitution of protease fragments; • optionally any number of copies of the localization signals , preferably a core localization signal, a signal sequence for transport to the ER or transmembrane domains, and optionally any marker sequences for protein detection or affinity chromatography (e.g. HIS and AU1 markers).
The invention relates to cells that contain the orthogonal signaling pathways described above, in which the signaling pathway produces a response that can be expressed in several ways, such as (i) release of endogenous cellular metabolites, peptides or proteins (e.g. hormones), (ii) release of exogenous metabolites, peptides or proteins, preferably therapeutic proteins, (iii) a change in the activity of endogenous or exogenous proteins (preferably enzymes) in the cell, or (iv) regulation of gene expression.
The invention relates to cells selected from bacterial or eukaryotic cells, plant cells or human cell lines; preferably, the invention relates to mammalian cells and human cell lines, for example neurons or other cells of the nervous system, T lymphocytes or other cells of the immune response, or pancreatic beta cells.
Definitions
The term "split protease" refers to two or more polypeptides ("fragments") derived from the sequence of a protease such that each of them is identical to a part of the protease. The fragments are enzymatically inactive on their own. Formation of a proteolytically active complex requires the protease fragments to interact with one another, which is referred to as "reconstitution". The protease fragments are selected so that they cannot reconstitute without the help of additional dimerization domains, expressed in fusion with the protease fragments as a chimeric protein. The term "chimeric protein" denotes a protein or polypeptide composed of sequences originating from unrelated proteins; it is produced by translation of a chimeric nucleic acid that combines the coding sequences for individual domains of unrelated proteins, assembled so that they form a single open reading frame.
The term "dimerization domains" refers to protein domains that bind to one another, either on their own or in the presence of a ligand, through covalent or non-covalent interactions. The term "homodimerization" refers to domains that bind to other domains of the same type, and the term "heterodimerization" refers to domains that bind to domains of a different type. The term "constitutive dimerization domains" refers to dimerization domains that bind to one another on their own, for example coiled coils, and the term "inducible dimerization domains" refers to dimerization domains that bind only in the presence of a dimerization signal.
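The distinction between constitutive and inducible dimerization domains can be illustrated with a minimal sketch. The class `DimerizationDomain` and the function `dimerizes` are hypothetical names introduced only for this illustration; the FKBP/FRB pair with rapamycin is taken from the description, while the boolean treatment of binding is a simplifying assumption and not a model of binding affinity.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class DimerizationDomain:
    """Illustrative record for a dimerization domain (hypothetical type)."""
    name: str
    partner: str            # name of the domain it binds; equal to `name` for homodimers
    signal: Optional[str]   # None = constitutive; otherwise the required dimerization signal


def dimerizes(a: DimerizationDomain, b: DimerizationDomain, signals_present: set) -> bool:
    """Return True if the two domains are partners and every required signal is present."""
    if a.partner != b.name or b.partner != a.name:
        return False
    required = {d.signal for d in (a, b) if d.signal is not None}
    return required <= signals_present


# Inducible heterodimerization pair named in the description.
fkbp = DimerizationDomain("FKBP", "FRB", "rapamycin")
frb = DimerizationDomain("FRB", "FKBP", "rapamycin")

print(dimerizes(fkbp, frb, set()))          # no ligand present
print(dimerizes(fkbp, frb, {"rapamycin"}))  # ligand present
```

In this sketch a constitutive domain is simply one whose `signal` is `None`, so it pairs with its partner regardless of which signals are present.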
The term "signal" refers to a measurable change within a cell or in its environment. The term "input signal" refers to a change that triggers a signaling pathway, and the term "output signal" refers to a change that arises as a result of a signaling pathway. The input signal may be part of an endogenous signaling pathway or may be caused exogenously and elicit a physiological response of the cell, for example a change in the concentration of a second messenger such as calcium, or it may be a dimerization signal that affects the dimerization of endogenous or exogenous dimerization domains.
The term "dimerization signal" refers to ligands, light and mechanical signals, changes in pH, post-translational modification of dimerization domains, a change in the concentration of a second messenger, or a change in transmembrane potential that cause dimerization domains to bind to one another when those domains have no intrinsic affinity for binding, that is, cannot bind on their own in the absence of the dimerization ligand or signal.
The term "mechanical signal", as used herein, refers to all kinds of mechanical disturbance or force acting on cells or on the cell membrane, such as ultrasound, touch, osmotic stress, or friction due to fluid flow. A mechanical signal can arise from the pressure of a solid object, a liquid or other cells on the cell membrane, from gravitational, centrifugal, shear or other direct forces, or from indirect forces that act on the cell membrane, such as osmosis, or stretching or shrinking due to temperature or other factors.
The term "ultrasound" denotes sound waves with a frequency between approximately 20 kHz and approximately 15 MHz, generated by a device in which the intensity or amplitude of the sound, the frequency (in combination with a suitable transducer) and the time course of the ultrasonic pulses are controlled.
The term "chemical signal", as used herein, refers to a change in the concentration of a ligand. The term "ligand", as used herein, refers to short biopolymers, organic or inorganic molecules, or ions that can bind to proteins, preferably to dimerization domains.
The term "light signal" refers to electromagnetic radiation in the visible range (wavelengths between 400 and 700 nm), the near-infrared range (wavelengths between 700 nm and 4 µm) or the ultraviolet range (wavelengths between 10 nm and 400 nm) that can cause conformational or chemical changes in biological molecules, preferably dimerization domains or their ligands.
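The three wavelength ranges in this definition can be expressed as a short lookup, shown here only as a sketch. The function name `classify_light_signal` is hypothetical, and the assignment of the shared boundary values (400 nm and 700 nm) to the visible range is an assumption, since the definition lets the ranges touch at those points.

```python
def classify_light_signal(wavelength_nm: float) -> str:
    """Map a wavelength in nanometres to the range named in the definition.

    Assumption: the boundaries 400 nm and 700 nm are counted as visible.
    """
    if 10 <= wavelength_nm < 400:
        return "ultraviolet"
    if 400 <= wavelength_nm <= 700:
        return "visible"
    if 700 < wavelength_nm <= 4000:   # 4 µm = 4000 nm
        return "near-infrared"
    return "outside the defined ranges"


for nm in (365, 450, 850, 5000):
    print(nm, classify_light_signal(nm))
```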
The term "signaling pathway", as used herein, refers to one or more events in a cell, which may comprise ligand binding, dimerization of proteins or peptides, influx of second messengers, a change in transmembrane potential, an enzyme-catalyzed reaction or other changes in the cell; these events are caused by an input signal, and their end product is an output signal.
The term "orthogonal", as used herein, refers to the property of signaling pathways that they operate separately, that is, the individual parts of one signaling pathway do not interact with parts of other endogenous or exogenous signaling pathways. The principal property of orthogonal signaling pathways is that the input signal of each pathway leads to its output signal independently of the presence or absence of other signaling pathways or of their input and output signals.
The term "orthogonal proteases" means two or more proteases that differ in the sequence of their target substrate, so that none of the orthogonal proteases cleaves the target substrate of another orthogonal protease.
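The definition of orthogonal proteases amounts to a pairwise check on cleavage data. The following sketch assumes such data are available as a mapping from each protease to the set of substrates it was observed to cleave. The function `cross_reactive_pairs` and the protease and substrate labels are hypothetical placeholders, not measurements from the description.

```python
def cross_reactive_pairs(cleaves: dict, target_of: dict) -> list:
    """List (protease, substrate) pairs where a protease cleaves another protease's target.

    cleaves:   protease name -> set of substrate names it cleaves
    target_of: protease name -> name of its own target substrate
    An empty result means the set of proteases is orthogonal in the defined sense.
    """
    pairs = []
    for protease, substrates in cleaves.items():
        for other, target in target_of.items():
            if other != protease and target in substrates:
                pairs.append((protease, target))
    return pairs


# Hypothetical placeholder data for two proteases P1 and P2.
target_of = {"P1": "site_1", "P2": "site_2"}
orthogonal_set = {"P1": {"site_1"}, "P2": {"site_2"}}
leaky_set = {"P1": {"site_1", "site_2"}, "P2": {"site_2"}}

print(cross_reactive_pairs(orthogonal_set, target_of))
print(cross_reactive_pairs(leaky_set, target_of))
```

Returning the offending pairs rather than a single boolean makes it easier to see which protease would have to be replaced to restore orthogonality.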
The term "inducible protease" refers to the property of a protease that its enzymatic activity changes significantly in the presence of an input signal, which may at the same time be the output signal of another signaling pathway.
The term "linker peptide" refers to amino acid sequences whose role is to separate the individual domains of a composite protein and to allow their correct spatial orientation. The role of the linker peptide in a fusion protein, whose inclusion is optional, may also be to introduce a cleavage site, a site for post-translational modification, a tag sequence or a localization signal.
The term "cleavage site" refers to an amino acid sequence to which a specific protease can bind and within which it cleaves the protein by hydrolysis of a peptide bond. The term "tag sequences" refers to amino acid sequences that are added to a protein to simplify its purification, isolation or detection. The term "localization signal" refers to an amino acid sequence that directs the protein to a particular location in the cell. Localization signals differ depending on the host organism in which the protein is expressed. The amino acid sequences of localization signals are well known to those skilled in the art, as is which signal sequence functions in which organism.
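As an illustration of the cleavage-site definition, the sketch below locates a recognition sequence in a protein string and splits the string at the scissile bond. The function `cleave` and the example substrate are hypothetical. The site ENLYFQS, cut between Q and S, is the commonly cited TEV protease recognition sequence and is used here as an assumption, since the description does not list the recognition sequences in this section.

```python
def cleave(protein: str, site: str, cut_offset: int) -> list:
    """Split `protein` at every occurrence of `site`.

    cut_offset is the number of residues of the site that stay on the
    N-terminal fragment (6 for ENLYFQ|S).
    """
    fragments = []
    start = 0
    pos = protein.find(site)
    while pos != -1:
        cut = pos + cut_offset
        fragments.append(protein[start:cut])
        start = cut
        pos = protein.find(site, pos + 1)
    fragments.append(protein[start:])
    return fragments


# Hypothetical substrate: short N-terminal stretch, TEV site, linker, His tag.
substrate = "MKTAENLYFQSGGHHHHHH"
print(cleave(substrate, "ENLYFQS", 6))
# A protein without the site is returned as a single, uncleaved fragment.
print(cleave("MKTAGGHHHHHH", "ENLYFQS", 6))
```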
The position of linker peptides, tag sequences, cleavage sites and localization signals is arbitrary, provided that it allows functional expression of the protein and preserves the functions for which these amino acid sequences were selected, as is known to those skilled in the art.
The term "cell", as used herein, refers to a eukaryotic cell, or to a cellular or multicellular organism (cell line) cultured as a unicellular entity, that has been used as a recipient of nucleic acid, and includes the progeny of the original cell that was genetically modified with the nucleic acid. The term refers primarily to cells of higher eukaryotic organisms, preferably vertebrates, preferably mammals. The term "cells" also refers to human cell lines and plant cells. The progeny of a single cell are, of course, not necessarily completely identical to the parent in morphology and in total DNA complement, owing to natural, accidental or deliberate mutations. A "genetically modified host cell" (also "recombinant host cell") is a host cell into which a nucleic acid has been introduced. A eukaryotic genetically modified host cell is produced by introducing a nucleic acid or recombinant nucleic acid into a suitable eukaryotic host cell. The invention further includes host cells and organisms that contain a nucleic acid according to the invention (transiently or stably) encoding the operons according to the invention. Suitable host cells are known in the art and include eukaryotic cells. It is known that a protein can be expressed in cells of the following organisms: humans, rodents, cattle, pigs, poultry, rabbits and the like. Host cells may be cultured cell lines, either primary or immortalized.
The term "nucleic acid", as used herein, refers to a polymeric form of nucleotides of any length, either ribonucleotides or deoxyribonucleotides, and is not limited to single-, double- or multi-stranded DNA or RNA, genomic DNA, cDNA, DNA-RNA hybrids, or polymers with a phosphorothioate backbone, composed of purine and pyrimidine bases or other natural, chemically or biochemically modified, non-natural or derivatized nucleotide bases.
The terms "polypeptide", "protein" and "peptide", as used herein, refer to a polymeric form of amino acids of any length. The term "functional polypeptide", as used herein, refers to a polypeptide of any length that exhibits any function, such as formation of a structure, targeting to a specific location, targeting of organelles, facilitation and triggering of chemical reactions, or binding to other functional polypeptides.
The term "heterologous", as used herein in the context of genetically modified host cells, refers to a polypeptide for which at least one of the following holds: (a) the polypeptide is foreign ("exogenous") to the host cell (it is not found in it in nature); (b) the polypeptide is naturally present ("endogenous") in the given host microorganism or host cell but is produced in the cell in unnatural amounts (more than expected, or more than found in nature), or it differs in nucleotide sequence from the endogenous nucleotide sequence such that the same protein as the endogenous one (having the same or a substantially similar amino acid sequence) is produced in the cell in unnatural amounts (more than expected, or more than found in nature).
The term "homologous", as used herein, refers to proteins or nucleic acids with a well-conserved amino acid or nucleotide sequence, preferably with at least 50% conservation and no less than 20% conservation, as determined by protein or nucleic acid sequence comparison techniques known to those skilled in the art. Homologous nucleic acids encode homologous proteins.
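The percentage thresholds in this definition presuppose some measure of sequence conservation. The description does not specify which one, so the sketch below uses one common convention as an assumption: percent identity over the aligned columns in which neither sequence has a gap. The functions `percent_identity` and `meets_homology_threshold` and the example strings are hypothetical, and the input sequences are assumed to be already aligned.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over columns where neither aligned sequence has a gap ('-')."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a == b:
            identical += 1
    return 100.0 * identical / compared if compared else 0.0


def meets_homology_threshold(identity: float, minimum: float = 20.0) -> bool:
    """Apply the lower bound from the definition (20%); 50% is the preferred level."""
    return identity >= minimum


# Hypothetical pre-aligned toy sequences.
pid = percent_identity("ACDEFG-HIK", "ACDQFGWHLK")
print(round(pid, 1), meets_homology_threshold(pid), meets_homology_threshold(pid, 50.0))
```

Other conventions, for example similarity scores based on a substitution matrix, would give different percentages for the same pair of sequences.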
The term "recombinant", as used herein, means that a particular nucleic acid (DNA or RNA) is the product of various combinations of cloning, restriction and/or ligation steps leading to a construct whose structural coding or non-coding sequences differ from the endogenous nucleic acids of the natural host system. In general, a DNA sequence encoding a structural coding sequence can be assembled from cDNA fragments, from short oligonucleotide linkers or from synthetic oligonucleotides, yielding a synthetic nucleic acid that can be expressed from a recombinant transcription unit in a cellular or cell-free transcription and translation system. Such a sequence can be used in the form of an open reading frame that is not interrupted by internal untranslated sequences, or introns, which are usually present in eukaryotic genes. Genomic DNA containing the relevant sequences can also be used to form a recombinant gene or transcription unit. Sequences of untranslated DNA may be present at the 5' or 3' end of the open reading frame, where such sequences do not interfere with manipulation or expression of the coding regions and may act as modulators of the production of the desired products through various mechanisms.
Vectors are introduced into host cells by conventional methods known in the art; these methods relate to transformation or transfection and include chemical delivery, electroporation, microinjection, DNA lipofection, cell sonication, gene bombardment, viral DNA delivery and others. The introduction of DNA may be transient or stable. Transient introduction refers to the introduction of DNA with a vector that does not integrate the DNA according to the invention into the genome of the cells. Stable introduction is achieved by integrating the DNA according to the invention into the genome of the host. The introduction of DNA according to the invention, in particular for the preparation of a host organism with stably integrated DNA according to the invention, can be monitored by the presence of markers. The DNA encoding the markers confers resistance to antibiotics or chemicals and may be included on the vector carrying the DNA according to the invention or on a separate vector.
The working examples described in more detail below are designed to describe the invention as well as possible. These descriptions are not intended to limit the scope of the invention or its applicability; they are intended solely to provide a better understanding of the invention and its use.
Working examples
Example 1: Preparation of cells containing a signaling pathway mediated by split proteases
Preparation of DNA constructs encoding orthogonal split proteases and their targets
To prepare the DNA constructs, the inventors used molecular biology methods such as chemical transformation of competent E. coli cells, isolation of plasmid DNA, polymerase chain reaction (PCR) amplification, reverse transcription PCR, joining of fragments by PCR, determination of nucleic acid concentration, agarose gel electrophoresis of DNA, isolation of DNA fragments from agarose gels, chemical synthesis of DNA, cutting of DNA with restriction enzymes, cutting of plasmid vectors, ligation of DNA fragments, and large-scale purification of plasmid DNA. The exact procedures for these experimental techniques and methods are well known to those skilled in the art and are described in molecular biology manuals.
Sterile working techniques, which are likewise well known to those skilled in the art, were used for all work. All plasmids, completed constructs and partial constructs were transformed into the bacterium Escherichia coli by chemical transformation. Plasmids for transfection into cell lines (animal, plant or human) were isolated using a DNA isolation kit that removes endotoxins. In the examples described, the proteases were selected from the proteases of the potyvirus NIa protein family, specifically the SuMMV, SbMV, PPV and TEV proteases. The recognition sequences for cleavage by each of these proteases are known to those skilled in the art. Based on the amino acid sequence similarity between these proteases and the already known split TEV protease, we prepared nucleotide sequences encoding split proteases in fusion with dimerization domains. In the examples described, the light-inducible domains CRY2 and CIBN (Kennedy et al. 2010) and the rapamycin-inducible domains FKBP and FRB (Sabatini et al. 1994; Brown et al. 1994; Sabers et al. 1995) were selected as dimerization domains.
The sequences of the split protease fragments according to the invention are listed in Table 1. On their own, these fragments do not form functional enzymes. The final genetic constructs of split proteases fused to dimerization domains, which enable reconstitution into a functional enzyme, are listed in Table 2. The remaining sequences used to demonstrate the invention are listed in Table 3. All operons were prepared by methods known to those skilled in the art. The operons were inserted into plasmids suitable for eukaryotic systems. The correctness of the nucleotide sequences was confirmed by sequencing and restriction analysis.
Table 1: List of the split protease sequences used to demonstrate the invention, and their recognition cleavage sites.
Table 2: Split proteases with dimerization domains used to demonstrate the invention.
Table 3: Fusion proteins and operons, other than the split proteases, used to demonstrate the invention.
Methods and techniques for culturing cells are well known to those skilled in the art and are therefore described only briefly, to illustrate the embodiment. HEK293T cells were cultured at 37 °C and 5% CO2 in DMEM supplemented with 10% FBS, which contains all the necessary nutrients and growth factors. When the culture reached an appropriate density, the cells were passaged into a new culture vessel and/or diluted. For experiments, cells were counted with a hemocytometer and seeded at 2.5 × 10⁴ cells per well in a 96-well microtiter plate 18-24 hours before transfection. The seeded plates were incubated at 37 °C and 5% CO2 until the cells were 50-70% confluent and ready for transfection. Transfection was performed according to the transfection reagent manufacturer's instructions (e.g. JetPEI, Lipofectamine 2000), scaled for the microtiter plate used.
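The seeding step above comes down to a short calculation. The sketch below is illustrative only: it assumes a standard hemocytometer in which each large counting square corresponds to 0.1 µL (hence the factor 10⁴ per mL), and the function names, the example counts and the dilution factor are hypothetical rather than taken from the description.

```python
def cells_per_ml(square_counts, dilution_factor=1.0):
    """Estimate cell concentration from hemocytometer counts.

    Assumes each counted large square holds 0.1 uL, so the mean count
    per square is multiplied by 1e4 to give cells per mL.
    """
    mean_count = sum(square_counts) / len(square_counts)
    return mean_count * dilution_factor * 1e4


def seeding_volume_ul(concentration_per_ml, cells_per_well=2.5e4):
    """Volume of cell suspension (uL) containing the target cells per well."""
    if concentration_per_ml <= 0:
        raise ValueError("concentration must be positive")
    return cells_per_well * 1000.0 / concentration_per_ml


if __name__ == "__main__":
    # Hypothetical counts from four large squares, sample diluted 1:2.
    conc = cells_per_ml([48, 52, 50, 50], dilution_factor=2)
    print(f"{conc:.2e} cells/mL, {seeding_volume_ul(conc):.1f} uL per well")
```

In practice the suspension would usually be diluted so that the target cell number sits in a convenient pipetting volume per well; that choice is left open here.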
Example 2: Activity of split proteases in mammalian cells upon induction with different signals, and orthogonality of cleavage by the individual proteases
To detect protease activity, we used a reporter system based on cyclic firefly luciferase Fluc (Kanno et al. 2007). The cyclic luciferase is constructed by moving amino acids 4-233 from the N-terminus to the C-terminus of the protein, with a linker peptide containing a recognition site for cleavage by one of the proteases TEV, PPV, SuMMV or SbMV inserted between the former C-terminus of the protein and the relocated sequence. The C-fragment of the DnaE intein from Nostoc punctiforme is attached to the N-terminus of the resulting sequence via a short linker peptide, and the N-fragment of the DnaE intein from N. punctiforme is attached to the C-terminus of the relocated luciferase sequence, followed by the PEST-CL1 rapid-degradation sequence, which is known to those skilled in the art. Luciferase prepared in this way forms inactive cyclic proteins that open up upon cleavage by the target protease and thereby gain catalytic activity.
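The rearrangement described above is a circular permutation, and the order of the parts can be sketched with plain strings. This is a toy illustration under stated assumptions: the sequence, the linker text and the function name are hypothetical placeholders, and residues preceding the relocated segment are simply omitted, which the description does not specify.

```python
def circularly_permute(seq, start, end, linker):
    """Move residues start..end (1-based, inclusive) to the C-terminus.

    The remainder of the sequence (residues after `end`) comes first,
    then the linker carrying the cleavage site, then the moved segment.
    Residues before `start` are dropped in this toy model (an assumption).
    """
    moved = seq[start - 1:end]
    remainder = seq[end:]
    return remainder + linker + moved


if __name__ == "__main__":
    toy = "ABCDEFGHIJ"  # placeholder, not a real luciferase sequence
    print(circularly_permute(toy, start=2, end=5, linker="-site-"))
```

For the reporter in the text, `start` and `end` would be 4 and 233, and the permuted sequence would additionally be flanked by the intein fragments and the degradation tag.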
HEK293T cells seeded in 96-well plates were transfected one day before the experiment with plasmids encoding one of the selected proteases, or encoding the two proteins that together form a split protease, and with plasmids encoding the reporter system described above and the Renilla reniformis luciferase Rluc (GenBank AF362545.1). Cells were stimulated either by adding rapamycin to a final concentration of 1 μM one day before the measurement, or with light at a wavelength of 455 nm in a purpose-built device 30 minutes before the measurement.
To analyze reporter protein activity, cells were lysed with buffer according to the manufacturer's instructions (Promega). Fluc activity was measured first, followed by Rluc activity. Rluc is expressed independently of the input signals, so its activity reflects the fraction of transfected cells, whereas Fluc activity reflects activation of the cyclic luciferase after protease cleavage. The Fluc/Rluc ratio (RLA, relative luciferase activity) therefore gives the signal from stimulated cells normalized to the transfected cells.
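The normalization just described is a per-well ratio. A minimal sketch follows, assuming raw luminescence readings are available as (Fluc, Rluc) pairs per well; the function names and the example numbers are hypothetical, and averaging over replicate wells is an added convenience rather than something stated in the text.

```python
from statistics import mean


def relative_luciferase_activity(fluc, rluc):
    """RLA for one well: Fluc signal normalized to the Rluc signal."""
    if rluc <= 0:
        raise ValueError("Rluc reading must be positive")
    return fluc / rluc


def mean_rla(wells):
    """Mean RLA over replicate wells given as (fluc, rluc) pairs."""
    return mean(relative_luciferase_activity(f, r) for f, r in wells)


if __name__ == "__main__":
    replicates = [(1200.0, 400.0), (500.0, 100.0)]  # hypothetical readings
    print(mean_rla(replicates))
```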
Results:
Figure 2 shows that each of the proteases TEV, PPV, SuMMV and SbMV efficiently cleaves the cyclic luciferase carrying its matching recognition site, whereas protease activity on cyclic luciferases carrying a non-matching recognition site is negligible. Full-length protease sequences were used in this experiment.
The TEV, PPV, SbMV and SuMMV proteases were each split into two fragments, which were fused to the ligand-binding domains FKBP and FRB. Figure 3A shows that the split TEV, PPV, SuMMV and SbMV proteases carrying the FKBP and FRB dimerization domains cleave the luciferase with the matching recognition site when rapamycin is added.
Fragments of the TEV and PPV proteases were also prepared as fusions with the proteins CRY2 and CIBN, which are known to dimerize upon stimulation with blue light. Figures 3B and 3C show that the split TEV and PPV proteases fused to the CRY2 and CIBN dimerization domains cleave the cyclic luciferase with the matching recognition site upon stimulation with blue light.
Example 3: Control of orthogonal signaling pathways with split proteases and different input signals
To detect the activity of two orthogonal signaling pathways, we prepared, in addition to the cyclic luciferase, a protease-cleavage-dependent transcription factor, TAL(N)-VP16-TEVs-KRAB. The transcription factor was constructed by adding, at the C-terminus of the TAL DNA-binding domain, which is well known to those skilled in the art, the sequence of the VP16 activation domain, followed by a linker peptide containing the recognition sequence for cleavage by TEV protease and the KRAB repression domain. The VP16 and KRAB domains are likewise well known to those skilled in the art. Upon cleavage, the transcription factor switches from repressing to activating gene transcription. To measure transcription factor activity, we used the gene for the yellow fluorescent protein mCitrine, inserted into a plasmid downstream of a minimal promoter with binding sites for the TAL DNA-binding domain.
HEK293T cells seeded in 96-well plates were transfected one day before the experiment with plasmids encoding the cyclic luciferase with the recognition site for cleavage by PPV protease, the transcription factor TAL(N)-VP16-TEVs-KRAB, mCitrine under the corresponding promoter, and Rluc luciferase, and, as indicated, with plasmids encoding TEV or PPV protease.
To analyze the activity of the mCitrine reporter, the medium was removed from the cells and fluorescence was measured on a SynergyMX plate reader (BioTek) with excitation at 512 nm and emission at 532 nm. The fluorescence of untransfected cells was subtracted from the measured value. To analyze the activity of the cyclic luciferase reporter, cells were lysed with buffer according to the manufacturer's instructions (Promega), and Fluc and Rluc activity was then measured as in the experiment described above.
Result:
Figure 4 shows that in the presence of TEV protease only the activity of the luciferase reporter increases, whereas in the presence of PPV protease only the activity of the fluorescent reporter increases. In the presence of both orthogonal proteases, the activity of both reporters increases. This confirms that the two signaling pathways are orthogonal.
Example 4: Logic functions with protease-mediated signal transmission
As examples of logic circuits with protease-mediated signal transmission, we developed the logic functions NOR and A NIMPLY B; the latter is equivalent to the compound logic function A AND NOT B.
For the NOR function (Figure 5A), we used a split luciferase reporter consisting of an N-terminal luciferase fragment linked via a TEV protease cleavage site to the peptide AP4, and a C-terminal luciferase fragment linked via a PPV protease cleavage site to the peptide P3. The peptides P3 and AP4 form a coiled coil, which enables reconstitution of the luciferase fragments into an active enzyme. Cleavage of this reporter by TEV protease separates the N-terminal luciferase fragment from the AP4 peptide and thereby prevents formation of the functional reporter enzyme. Similarly, cleavage by PPV protease separates the C-terminal luciferase fragment from the P3 peptide and thereby prevents formation of the functional reporter enzyme. The output signal (luciferase activity) can therefore be detected only in the absence of both input signals.
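The reporter logic in the preceding paragraph can be written as an idealized Boolean model. The sketch below assumes complete cleavage whenever a protease is present and no residual reconstitution afterwards; the function and variable names are hypothetical, and real measurements are graded rather than binary.

```python
from itertools import product


def nor_reporter(tev_present, ppv_present):
    """Idealized output of the nLuc-TEVs-AP4 / P3-PPVs-cLuc reporter pair."""
    # TEV cleavage detaches the N-terminal luciferase fragment from AP4.
    nluc_tethered_to_ap4 = not tev_present
    # PPV cleavage detaches the C-terminal luciferase fragment from P3.
    cluc_tethered_to_p3 = not ppv_present
    # Luciferase reassembles only if both fragments stay on the coiled coil.
    return nluc_tethered_to_ap4 and cluc_tethered_to_p3


if __name__ == "__main__":
    for tev, ppv in product([False, True], repeat=2):
        print(f"TEV={tev!s:5} PPV={ppv!s:5} -> signal={nor_reporter(tev, ppv)}")
```

Under these assumptions the model reproduces the NOR truth table: a signal appears only when neither protease is present.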
HEK293T cells seeded in 96-well plates were transfected one day before the experiment with plasmids encoding Rluc luciferase and the reporter proteins nLuc-TEVs-AP4 and P3-PPVs-cLuc, and, as indicated, with plasmids encoding TEV protease and PPV protease. The activity of the reporter proteins Fluc and Rluc was measured as described above.
Result:
Figure 5B shows that reporter activity decreased upon co-expression of either of the selected proteases. This is consistent with the expected behavior of the NOR function.
For the A NIMPLY B function (Figure 6A), we likewise used a split luciferase reporter, which in this case consists of an N-terminal luciferase fragment linked to the peptide AP4, which is in turn linked via a TEV protease cleavage site to the peptide P3mS. The system additionally contains a C-terminal luciferase fragment linked to the peptide P3 via a PPV protease cleavage site. The peptides AP4 and P3mS form a coiled coil, which prevents formation of a coiled coil between AP4 and P3 and thus reconstitution of the functional luciferase enzyme. When the reporter is cleaved by TEV protease, the coiled coil between AP4 and P3mS is destabilized, because the protease cleaves the linker peptide between them. Because the P3 peptide forms a more stable coiled coil with AP4, luciferase can now be reconstituted, but only in the absence of PPV protease, which separates the C-terminal luciferase fragment from the P3 peptide. The output signal (luciferase activity) can therefore be detected only when TEV protease cleaves the reporter and PPV protease does not.
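The same kind of idealized Boolean model can be written for this reporter. It is a sketch under the assumptions that cleavage is complete when a protease is present and that P3 displaces P3mS from AP4 only after the TEV site has been cut; all names are hypothetical.

```python
from itertools import product


def nimply_reporter(tev_present, ppv_present):
    """Idealized output of the nLuc-AP4-TEVs-P3mS / P3-PPVs-cLuc reporter pair."""
    # Until TEV cuts the linker, P3mS occupies AP4 within the same chain.
    ap4_available = tev_present
    # PPV cleavage detaches the C-terminal luciferase fragment from P3.
    cluc_tethered_to_p3 = not ppv_present
    # Reconstitution needs a free AP4 and a P3 still carrying cLuc.
    return ap4_available and cluc_tethered_to_p3


if __name__ == "__main__":
    for tev, ppv in product([False, True], repeat=2):
        print(f"TEV={tev!s:5} PPV={ppv!s:5} -> signal={nimply_reporter(tev, ppv)}")
```

With input A mapped to TEV activity and input B to PPV activity, the model returns a signal only for A AND NOT B.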
HEK293T cells seeded in 96-well plates were transfected one day before the experiment with plasmids encoding Rluc luciferase, the reporter proteins nLuc-AP4-TEVs-P3mS and P3-PPVs-cLuc, plasmids encoding the split TEV protease with the inducible domains CRY2 and CIBN, and plasmids encoding the split PPV protease with the inducible domains FKBP and FRB. Cells were stimulated by adding rapamycin to a final concentration of 1 μM or with light at a wavelength of 455 nm in a purpose-built device 15 minutes before the measurement. The activity of the reporter proteins Fluc and Rluc was measured as described above.
Result:
Figure 6B shows that increased reporter activity was observed only when the cells were stimulated with light, which activates the TEV protease, in the absence of rapamycin, which would otherwise activate the PPV protease. This is consistent with the expected behavior of the A NIMPLY B function, where the input signals A and B are light and rapamycin, respectively.
List of sequences
SEQUENCE LISTING
<110> Kemijski inštitut EN-FIST center odličnosti
<120> Combination of split orthogonal proteases with dimerization domains that enable assembly
<130> -
<160> 84
<170> PatentIn version 3.5
<210> 1
<211> 732
<212> DNA
<213> Tobacco etch virus
<220>
<221> CDS
<222> (1)..(732)
<400> 1
atg gga gaa agc ttg ttt aag gga cca cgt gat tac aac ccg ata tcg 48
Met Gly Glu Ser Leu Phe Lys Gly Pro Arg Asp Tyr Asn Pro Ile Ser
1 5 10 15
agc acc att tgt cac ttg acg aat gaa tct gat ggg cac aca aca tcg 96
Ser Thr Ile Cys His Leu Thr Asn Glu Ser Asp Gly His Thr Thr Ser
20 25 30
ttg tat ggt att gga ttt ggt ccc ttc atc att aca aac aag cac ttg 144
Leu Tyr Gly Ile Gly Phe Gly Pro Phe Ile Ile Thr Asn Lys His Leu
35 40 45
ttt aga aga aat aat gga aca ctg ttg gtc caa tca cta cat ggt gta 192
Phe Arg Arg Asn Asn Gly Thr Leu Leu Val Gln Ser Leu His Gly Val
50 55 60
ttc aag gtc aag aac acc acg act ttg caa caa cac ctc att gat ggg 240
Phe Lys Val Lys Asn Thr Thr Thr Leu Gln Gln His Leu Ile Asp Gly
65 70 75 80
agg gac atg ata att att cgc atg cct aag gat ttc cca cca ttt cct 288
Arg Asp Met Ile Ile Ile Arg Met Pro Lys Asp Phe Pro Pro Phe Pro
85 90 95
caa aag ctg aaa ttt aga gag cca caa agg gaa gag cgc ata tgt ctt 336
Gln Lys Leu Lys Phe Arg Glu Pro Gln Arg Glu Glu Arg Ile Cys Leu
100 105 110
gtg aca acc aac ttc caa act aag agc atg tct agc atg gtg tca gac 384
Val Thr Thr Asn Phe Gln Thr Lys Ser Met Ser Ser Met Val Ser Asp
115 120 125
acc agt tgc aca ttc cct tca tct gat ggc ata ttc tgg aag cat tgg 432
Thr Ser Cys Thr Phe Pro Ser Ser Asp Gly Ile Phe Trp Lys His Trp
130 135 140
att caa acc aag gat ggg cag tgt ggc agt cca tta gta tca act aga 480
Ile Gln Thr Lys Asp Gly Gln Cys Gly Ser Pro Leu Val Ser Thr Arg
145 150 155 160
gat ggg ttc att gtt ggt ata cac tca gca tcg aat ttc acc aac aca 528
Asp Gly Phe Ile Val Gly Ile His Ser Ala Ser Asn Phe Thr Asn Thr
165 170 175
aac aat tat ttc aca agc gtg ccg aaa aac ttc atg gaa ttg ttg aca 576
Asn Asn Tyr Phe Thr Ser Val Pro Lys Asn Phe Met Glu Leu Leu Thr
180 185 190
aat cag gag gcg cag cag tgg gtt agt ggt tgg cga tta aat gct gac 624
Asn Gln Glu Ala Gln Gln Trp Val Ser Gly Trp Arg Leu Asn Ala Asp
195 200 205
tca gta ttg tgg ggg ggc cat aaa gtt ttc atg agc aaa cct gaa gag 672
Ser Val Leu Trp Gly Gly His Lys Val Phe Met Ser Lys Pro Glu Glu
210 215 220
cct ttt cag cca gtt aag gaa gcg act caa ctc atg agt gaa ttg gtg 720
Pro Phe Gln Pro Val Lys Glu Ala Thr Gln Leu Met Ser Glu Leu Val
225 230 235 240
tac tcg caa taa 732
Tyr Ser Gln
<210> 2
<211> 243
<212> PRT
<213> Tobacco etch virus
<400> 2
Met Gly Glu Ser Leu Phe Lys Gly Pro Arg Asp Tyr Asn Pro Ile Ser
1 5 10 15
Ser Thr Ile Cys His Leu Thr Asn Glu Ser Asp Gly His Thr Thr Ser
20 25 30
Leu Tyr Gly Ile Gly Phe Gly Pro Phe Ile Ile Thr Asn Lys His Leu
35 40 45
Phe Arg Arg Asn Asn Gly Thr Leu Leu Val Gln Ser Leu His Gly Val
50 55 60
Phe Lys Val Lys Asn Thr Thr Thr Leu Gln Gln His Leu Ile Asp Gly
65 70 75 80
Arg Asp Met Ile Ile Ile Arg Met Pro Lys Asp Phe Pro Pro Phe Pro
85 90 95
Gln Lys Leu Lys Phe Arg Glu Pro Gln Arg Glu Glu Arg Ile Cys Leu
100 105 110
Val Thr Thr Asn Phe Gln Thr Lys Ser Met Ser Ser Met Val Ser Asp
115 120 125
Thr Ser Cys Thr Phe Pro Ser Ser Asp Gly Ile Phe Trp Lys His Trp
130 135 140
Ile Gln Thr Lys Asp Gly Gln Cys Gly Ser Pro Leu Val Ser Thr Arg
145 150 155 160
Asp Gly Phe Ile Val Gly Ile His Ser Ala Ser Asn Phe Thr Asn Thr
165 170 175
Asn Asn Tyr Phe Thr Ser Val Pro Lys Asn Phe Met Glu Leu Leu Thr
180 185 190
Asn Gln Glu Ala Gln Gln Trp Val Ser Gly Trp Arg Leu Asn Ala Asp
195 200 205
Ser Val Leu Trp Gly Gly His Lys Val Phe Met Ser Lys Pro Glu Glu
210 215 220
Pro Phe Gln Pro Val Lys Glu Ala Thr Gln Leu Met Ser Glu Leu Val
225 230 235 240
Tyr Ser Gln
<210> 3
<211> 354
<212> DNA
<213> Tobacco etch virus
<220>
<221> CDS
<222> (1)..(354)
<400> 3
gga gaa agc ttg ttt aag gga cca cgt gat tac aac ccg ata tcg agc 48
Gly Glu Ser Leu Phe Lys Gly Pro Arg Asp Tyr Asn Pro Ile Ser Ser
1 5 10 15
acc att tgt cac ttg acg aat gaa tct gat ggg cac aca aca tcg ttg 96
Thr Ile Cys His Leu Thr Asn Glu Ser Asp Gly His Thr Thr Ser Leu
20 25 30
tat ggt att gga ttt ggt ccc ttc atc att aca aac aag cac ttg ttt 144
Tyr Gly Ile Gly Phe Gly Pro Phe Ile Ile Thr Asn Lys His Leu Phe
35 40 45
aga aga aat aat gga aca ctg ttg gtc caa tca cta cat ggt gta ttc 192
Arg Arg Asn Asn Gly Thr Leu Leu Val Gln Ser Leu His Gly Val Phe
50 55 60
aag gtc aag aac acc acg act ttg caa caa cac ctc att gat ggg agg 240
Lys Val Lys Asn Thr Thr Thr Leu Gln Gln His Leu Ile Asp Gly Arg
65 70 75 80
gac atg ata att att cgc atg cct aag gat ttc cca cca ttt cct caa 288
Asp Met Ile Ile Ile Arg Met Pro Lys Asp Phe Pro Pro Phe Pro Gln
85 90 95
aag ctg aaa ttt aga gag cca caa agg gaa gag cgc ata tgt ctt gtg 336
Lys Leu Lys Phe Arg Glu Pro Gln Arg Glu Glu Arg Ile Cys Leu Val
100 105 110
aca acc aac ttc caa act 354
Thr Thr Asn Phe Gln Thr
115
<210> 4
<211> 118
<212> PRT
<213> Tobacco etch virus
<400> 4
Gly Glu Ser Leu Phe Lys Gly Pro Arg Asp Tyr Asn Pro Ile Ser Ser 1 5 10 15
Thr Ile Cys His Leu Thr Asn Glu Ser Asp Gly His Thr Thr Ser Leu 20 25 30
Tyr Gly Ile Gly Phe Gly Pro Phe Ile Ile Thr Asn Lys His Leu Phe 35 40 45
Arg Arg Asn Asn Gly Thr Leu Leu Val Gln Ser Leu His Gly Val Phe 50 55 60
Lys Val Lys Asn Thr Thr Thr Leu Gln Gln His Leu Ile Asp Gly Arg 65 70 75 80
Asp Met Ile Ile Ile Arg Met Pro Lys Asp Phe Pro Pro Phe Pro Gln 85 90 95
Lys Leu Lys Phe Arg Glu Pro Gln Arg Glu Glu Arg Ile Cys Leu Val 100 105 110
Thr Thr Asn Phe Gln Thr 115 <210> 5 <211> 372
<212> DNA <213> Tobacco etch virus <220> <221> CDS <222> (1)..(372) <400> 5 aag agc atg tct agc atg gtg tca gac acc agt tgc aca ttc cct tca 48
Lys Ser Met Ser Ser Met Val Ser Asp Thr Ser Cys Thr Phe Pro Ser 1 5 10 15 tct gat ggc ata ttc tgg aag cat tgg att caa acc aag gat ggg cag 96
Ser Asp Gly Ile Phe Trp Lys His Trp Ile Gln Thr Lys Asp Gly Gln 20 25 30 tgt ggc agt cca tta gta tca act aga gat ggg ttc att gtt ggt ata 144
Cys Gly Ser Pro Leu Val Ser Thr Arg Asp Gly Phe Ile Val Gly Ile 35 40 45 cac tca gca tcg aat ttc acc aac aca aac aat tat ttc aca agc gtg 192
His Ser Ala Ser Asn Phe Thr Asn Thr Asn Asn Tyr Phe Thr Ser Val 50 55 60 ccg aaa aac ttc atg gaa ttg ttg aca aat cag gag gcg cag cag tgg 240
Pro Lys Asn Phe Met Glu Leu Leu Thr Asn Gln Glu Ala Gln Gln Trp 65 70 75 80 gtt agt ggt tgg cga tta aat gct gac tca gta ttg tgg ggg ggc cat 288
Val Ser Gly Trp Arg Leu Asn Ala Asp Ser Val Leu Trp Gly Gly His 85 90 95 aaa gtt ttc atg agc aaa cct gaa gag cct ttt cag cca gtt aag gaa 336
Lys Val Phe Met Ser Lys Pro Glu Glu Pro Phe Gln Pro Val Lys Glu 100 105 110 gcg act caa ctc atg agt gaa ttg gtg tac tcg caa 372
Ala Thr Gln Leu Met Ser Glu Leu Val Tyr Ser Gln 115 120 <210> 6 <211> 124
<212> PRT <213> Tobacco etch virus <400> 6
Lys Ser Met Ser Ser Met Val Ser Asp Thr Ser Cys Thr Phe Pro Ser 1 5 10 15
Ser Asp Gly Ile Phe Trp Lys His Trp Ile Gln Thr Lys Asp Gly Gln 20 25 30
Cys Gly Ser Pro Leu Val Ser Thr Arg Asp Gly Phe Ile Val Gly Ile 35 40 45
His Ser Ala Ser Asn Phe Thr Asn Thr Asn Asn Tyr Phe Thr Ser Val 50 55 60
Pro Lys Asn Phe Met Glu Leu Leu Thr Asn Gln Glu Ala Gln Gln Trp 65 70 75 80
Val Ser Gly Trp Arg Leu Asn Ala Asp Ser Val Leu Trp Gly Gly His 85 90 95
Lys Val Phe Met Ser Lys Pro Glu Glu Pro Phe Gln Pro Val Lys Glu 100 105 110
Ala Thr Gln Leu Met Ser Glu Leu Val Tyr Ser Gln 115 120 <210> 7 <211> 735
<212> DNA <213> Plum pox virus <220> <221> CDS <222> (1)..(735) <400> 7 atg agc aag agc ctg ttc cgc ggc ctg cgc gac tac aac ccc atc gcc 48
Met Ser Lys Ser Leu Phe Arg Gly Leu Arg Asp Tyr Asn Pro Ile Ala 1 5 10 15 agc agc atc tgc cag ctg aac aac agc agc ggc gcc cgc cag agc gag 96
Ser Ser Ile Cys Gln Leu Asn Asn Ser Ser Gly Ala Arg Gln Ser Glu 20 25 30 atg ttc ggc ctg ggc ttc ggc ggc ctg atc gtg acc aac cag cac ctg 144
Met Phe Gly Leu Gly Phe Gly Gly Leu Ile Val Thr Asn Gln His Leu 35 40 45 ttc aag cgc aac gac ggc gag ctg acc atc cgc agc cac cac ggc gag 192
Phe Lys Arg Asn Asp Gly Glu Leu Thr Ile Arg Ser His His Gly Glu 50 55 60 ttc gtg gtg aag gac acc aag acc ctg aag ctg ctg ccc tgc aag ggc 240
Phe Val Val Lys Asp Thr Lys Thr Leu Lys Leu Leu Pro Cys Lys Gly 65 70 75 80 cgc gac atc gtg atc atc cgc ctg ccc aag gac ttc ccc ccc ttc ccc 288
Arg Asp Ile Val Ile Ile Arg Leu Pro Lys Asp Phe Pro Pro Phe Pro 85 90 95 aag cgc ctg caa ttc cgc acc ccc acc acc gag gac cgc gtg tgc ctg 336
Lys Arg Leu Gln Phe Arg Thr Pro Thr Thr Glu Asp Arg Val Cys Leu 100 105 110 atc ggc agc aac ttc cag acc aag agc atc agc agc acc atg agc gag 384
Ile Gly Ser Asn Phe Gln Thr Lys Ser Ile Ser Ser Thr Met Ser Glu 115 120 125 acc agc gcc acc tac ccc gtg gac aac agc cac ttc tgg aag cac tgg 432
Thr Ser Ala Thr Tyr Pro Val Asp Asn Ser His Phe Trp Lys His Trp 130 135 140 atc agc acc aag gac ggc cac tgc ggc ctg ccc atc gtg agc acc cgc 480
Ile Ser Thr Lys Asp Gly His Cys Gly Leu Pro Ile Val Ser Thr Arg 145 150 155 160 gac ggc agc atc ctg ggc ctg cac agc ctg gcc aac agc acc aac acc 528
Asp Gly Ser Ile Leu Gly Leu His Ser Leu Ala Asn Ser Thr Asn Thr 165 170 175 cag aac ttc tac gcc gcc ttc ccc gac aac ttc gag acc acc tac ctg 576
Gln Asn Phe Tyr Ala Ala Phe Pro Asp Asn Phe Glu Thr Thr Tyr Leu 180 185 190 agc aac cag gac aac gac aac tgg atc aag cag tgg cgc tac aac ccc 624
Ser Asn Gln Asp Asn Asp Asn Trp Ile Lys Gln Trp Arg Tyr Asn Pro 195 200 205 gac gag gtg tgc tgg ggc agc ctg caa ctg aag cgc gac atc ccc cag 672
Asp Glu Val Cys Trp Gly Ser Leu Gln Leu Lys Arg Asp Ile Pro Gln 210 215 220 agc ccc ttc acc atc tgc aag ctg ctg acc gac ctg gac ggc gag ttc 720
Ser Pro Phe Thr Ile Cys Lys Leu Leu Thr Asp Leu Asp Gly Glu Phe 225 230 235 240 gtg tac acc cag taa 735
Val Tyr Thr Gln <210> 8 <211> 244
<212> PRT <213> Plum pox virus <400> 8
Met Ser Lys Ser Leu Phe Arg Gly Leu Arg Asp Tyr Asn Pro Ile Ala 1 5 10 15
Ser Ser Ile Cys Gln Leu Asn Asn Ser Ser Gly Ala Arg Gln Ser Glu 20 25 30
Met Phe Gly Leu Gly Phe Gly Gly Leu Ile Val Thr Asn Gln His Leu 35 40 45
Phe Lys Arg Asn Asp Gly Glu Leu Thr Ile Arg Ser His His Gly Glu 50 55 60
Phe Val Val Lys Asp Thr Lys Thr Leu Lys Leu Leu Pro Cys Lys Gly 65 70 75 80
Arg Asp Ile Val Ile Ile Arg Leu Pro Lys Asp Phe Pro Pro Phe Pro 85 90 95
Lys Arg Leu Gln Phe Arg Thr Pro Thr Thr Glu Asp Arg Val Cys Leu 100 105 110
Ile Gly Ser Asn Phe Gln Thr Lys Ser Ile Ser Ser Thr Met Ser Glu 115 120 125
Thr Ser Ala Thr Tyr Pro Val Asp Asn Ser His Phe Trp Lys His Trp 130 135 140
Ile Ser Thr Lys Asp Gly His Cys Gly Leu Pro Ile Val Ser Thr Arg 145 150 155 160
Asp Gly Ser Ile Leu Gly Leu His Ser Leu Ala Asn Ser Thr Asn Thr 165 170 175
Gln Asn Phe Tyr Ala Ala Phe Pro Asp Asn Phe Glu Thr Thr Tyr Leu 180 185 190
Ser Asn Gln Asp Asn Asp Asn Trp Ile Lys Gln Trp Arg Tyr Asn Pro 195 200 205
Asp Glu Val Cys Trp Gly Ser Leu Gln Leu Lys Arg Asp Ile Pro Gln 210 215 220
Ser Pro Phe Thr Ile Cys Lys Leu Leu Thr Asp Leu Asp Gly Glu Phe 225 230 235 240
Val Tyr Thr Gln <210> 9 <211> 354
<212> DNA <213> Plum pox virus <220> <221> CDS <222> (1)..(354) <400> 9 agc aag agc ctg ttc cgc ggc ctg cgc gac tac aac ccc atc gcc agc 48
Ser Lys Ser Leu Phe Arg Gly Leu Arg Asp Tyr Asn Pro Ile Ala Ser 1 5 10 15 agc atc tgc cag ctg aac aac agc agc ggc gcc cgc cag agc gag atg 96
Ser Ile Cys Gln Leu Asn Asn Ser Ser Gly Ala Arg Gln Ser Glu Met 20 25 30 ttc ggc ctg ggc ttc ggc ggc ctg atc gtg acc aac cag cac ctg ttc 144
Phe Gly Leu Gly Phe Gly Gly Leu Ile Val Thr Asn Gln His Leu Phe 35 40 45 aag cgc aac gac ggc gag ctg acc atc cgc agc cac cac ggc gag ttc 192
Lys Arg Asn Asp Gly Glu Leu Thr Ile Arg Ser His His Gly Glu Phe 50 55 60 gtg gtg aag gac acc aag acc ctg aag ctg ctg ccc tgc aag ggc cgc 240
Val Val Lys Asp Thr Lys Thr Leu Lys Leu Leu Pro Cys Lys Gly Arg 65 70 75 80 gac atc gtg atc atc cgc ctg ccc aag gac ttc ccc ccc ttc ccc aag 288
Asp Ile Val Ile Ile Arg Leu Pro Lys Asp Phe Pro Pro Phe Pro Lys 85 90 95 cgc ctg caa ttc cgc acc ccc acc acc gag gac cgc gtg tgc ctg atc 336
Arg Leu Gln Phe Arg Thr Pro Thr Thr Glu Asp Arg Val Cys Leu Ile 100 105 110 ggc agc aac ttc cag acc 354
Gly Ser Asn Phe Gln Thr 115
<210> 10 <211> 118 <212> PRT <213> Plum pox virus <400> 10
Ser Lys Ser Leu Phe Arg Gly Leu Arg Asp Tyr Asn Pro Ile Ala Ser 1 5 10 15
Ser Ile Cys Gln Leu Asn Asn Ser Ser Gly Ala Arg Gln Ser Glu Met 20 25 30
Phe Gly Leu Gly Phe Gly Gly Leu Ile Val Thr Asn Gln His Leu Phe 35 40 45
Lys Arg Asn Asp Gly Glu Leu Thr Ile Arg Ser His His Gly Glu Phe 50 55 60
Val Val Lys Asp Thr Lys Thr Leu Lys Leu Leu Pro Cys Lys Gly Arg 65 70 75 80
Asp Ile Val Ile Ile Arg Leu Pro Lys Asp Phe Pro Pro Phe Pro Lys 85 90 95
Arg Leu Gln Phe Arg Thr Pro Thr Thr Glu Asp Arg Val Cys Leu Ile 100 105 110
Gly Ser Asn Phe Gln Thr 115 <210> 11 <211> 375
<212> DNA <213> Plum pox virus <220> <221> CDS <222> (1)..(375) <400> 11 aag agc atc agc agc acc atg agc gag acc agc gcc acc tac ccc gtg 48
Lys Ser Ile Ser Ser Thr Met Ser Glu Thr Ser Ala Thr Tyr Pro Val 1 5 10 15 gac aac agc cac ttc tgg aag cac tgg atc agc acc aag gac ggc cac 96
Asp Asn Ser His Phe Trp Lys His Trp Ile Ser Thr Lys Asp Gly His 20 25 30 tgc ggc ctg ccc atc gtg agc acc cgc gac ggc agc atc ctg ggc ctg 144
Cys Gly Leu Pro Ile Val Ser Thr Arg Asp Gly Ser Ile Leu Gly Leu 35 40 45 cac agc ctg gcc aac agc acc aac acc cag aac ttc tac gcc gcc ttc 192
His Ser Leu Ala Asn Ser Thr Asn Thr Gln Asn Phe Tyr Ala Ala Phe 50 55 60 ccc gac aac ttc gag acc acc tac ctg agc aac cag gac aac gac aac 240
Pro Asp Asn Phe Glu Thr Thr Tyr Leu Ser Asn Gln Asp Asn Asp Asn 65 70 75 80 tgg atc aag cag tgg cgc tac aac ccc gac gag gtg tgc tgg ggc agc 288
Trp Ile Lys Gln Trp Arg Tyr Asn Pro Asp Glu Val Cys Trp Gly Ser 85 90 95 ctg caa ctg aag cgc gac atc ccc cag agc ccc ttc acc atc tgc aag 336
Leu Gln Leu Lys Arg Asp Ile Pro Gln Ser Pro Phe Thr Ile Cys Lys 100 105 110 ctg ctg acc gac ctg gac ggc gag ttc gtg tac acc cag 375
Leu Leu Thr Asp Leu Asp Gly Glu Phe Val Tyr Thr Gln 115 120 125 <210> 12 <211> 125
<212> PRT <213> Plum pox virus <400> 12
Lys Ser Ile Ser Ser Thr Met Ser Glu Thr Ser Ala Thr Tyr Pro Val 1 5 10 15
Asp Asn Ser His Phe Trp Lys His Trp Ile Ser Thr Lys Asp Gly His 20 25 30
Cys Gly Leu Pro Ile Val Ser Thr Arg Asp Gly Ser Ile Leu Gly Leu 35 40 45
His Ser Leu Ala Asn Ser Thr Asn Thr Gln Asn Phe Tyr Ala Ala Phe 50 55 60
Pro Asp Asn Phe Glu Thr Thr Tyr Leu Ser Asn Gln Asp Asn Asp Asn 65 70 75 80
Trp Ile Lys Gln Trp Arg Tyr Asn Pro Asp Glu Val Cys Trp Gly Ser 85 90 95
Leu Gln Leu Lys Arg Asp Ile Pro Gln Ser Pro Phe Thr Ile Cys Lys 100 105 110
Leu Leu Thr Asp Leu Asp Gly Glu Phe Val Tyr Thr Gln 115 120 125 <210> 13
<211> 806 <212> DNA <213> Sunflower mild mosaic virus <220> <221> CDS <222> (1)..(804) <400> 13 atg gta acg gcc gcc agt gtg ctg gaa ttc gcg gcc gct cta gag cca 48
Met Val Thr Ala Ala Ser Val Leu Glu Phe Ala Ala Ala Leu Glu Pro 1 5 10 15 cca tgg gcg tga gcc tga gcc gcg gcg tgc gcg act aca acg cca tca 96
Pro Trp Ala Ala Ala Ala Ala Cys Ala Thr Thr Thr Pro Ser 20 25 30 gca gca tgg tgt gcc gcg tga cca acg aca gcg gca gca gca gca cca 144
Ala Ala Trp Cys Ala Ala Pro Thr Thr Ala Ala Ala Ala Ala Pro 35 40 45 cca tgt acg gca tcg gct acg gct gct aca tca tca cca aca agc acc 192
Pro Cys Thr Ala Ser Ala Thr Ala Ala Thr Ser Ser Pro Thr Ser Thr 50 55 60 tgt tcc gcg aga aca acg gcc gcc tgc tga tca cca gcc acc acg gcg 240
Cys Ser Ala Arg Thr Thr Ala Ala Cys Ser Pro Ala Thr Thr Ala 65 70 75 agt aca tct gca aga aca gcg cca gcc tga agc tga gcc tgg tgc ccg 288
Ser Thr Ser Ala Arg Thr Ala Pro Ala Ser Ala Trp Cys Pro 80 85 90 gcc gcg aca tgc tgc tga tcc gcc tgc cca agg act gcc ccc cct tcc 336
Ala Ala Thr Cys Cys Ser Ala Cys Pro Arg Thr Ala Pro Pro Ser 95 100 105 cca gca agc tga agt ttc gag agc cca cta gcg aag aaa agg ccg tgc 384
Pro Ala Ser Ser Phe Glu Ser Pro Leu Ala Lys Lys Arg Pro Cys 110 115 120 ttg tag tca caa act tcc agg aga agc atc ttt cca gca tgg tga gcg 432
Leu Ser Gln Thr Ser Arg Arg Ser Ile Phe Pro Ala Trp Ala 125 130 aga gca gct gcg tgg tgc agc gcg agg aca gcc cca tct ggc gcc act 480
Arg Ala Ala Ala Trp Cys Ser Ala Arg Thr Ala Pro Ser Gly Ala Thr 135 140 145 150 gga tca gca cca agg acg gcc act gcg gcg ccc cca tcg tga gca tcc 528
Gly Ser Ala Pro Arg Thr Ala Thr Ala Ala Pro Pro Ser Ala Ser 155 160 165 gcg acg gct aca tca tcg gca gcc act gcg gcg aga acc cca tga cca 576
Ala Thr Ala Thr Ser Ser Ala Ala Thr Ala Ala Arg Thr Pro Pro 170 175 180 gca act tct tca cca gca tcc cca agg act tcc aga acc tgc tga acg 624
Ala Thr Ser Ser Pro Ala Ser Pro Arg Thr Ser Arg Thr Cys Thr 185 190 195 gca agg agg cca acg agt ggg tga gcg gct gga agt aca aca tcg acg 672
Ala Arg Arg Pro Thr Ser Gly Ala Ala Gly Ser Thr Thr Ser Thr 200 205 210 ccg tgt gct ggg gcg gcc tga gcg tgg tga acg acg ccc cca gcg agc 720
Pro Cys Ala Gly Ala Ala Ala Trp Thr Thr Pro Pro Ala Ser 215 220 cct tca tca ccg cca agg tgg tga gcg ccc tgg aca ccg agg gca tca 768
Pro Ser Ser Pro Pro Arg Trp Ala Pro Trp Thr Pro Arg Ala Ser 225 230 235 agg tgc agt acc cat acg atg ttc cag att acg ctt aa 806
Arg Cys Ser Thr His Thr Met Phe Gln Ile Thr Leu 240 245 250 <210> 14 <211> 19
<212> PRT <213> Sunflower mild mosaic virus <400> 14
Met Val Thr Ala Ala Ser Val Leu Glu Phe Ala Ala Ala Leu Glu Pro 1 5 10 15
Pro Trp Ala <210> 15
<211> 16 <212> PRT <213> Sunflower mild mosaic virus <400> 15
Ala Ala Ala Cys Ala Thr Thr Thr Pro Ser Ala Ala Trp Cys Ala Ala 1 5 10 15 <210> 16 <211> 34
<212> PRT <213> Sunflower mild mosaic virus <400> 16
Pro Thr Thr Ala Ala Ala Ala Ala Pro Pro Cys Thr Ala Ser Ala Thr 1 5 10 15
Ala Ala Thr Ser Ser Pro Thr Ser Thr Cys Ser Ala Arg Thr Thr Ala 20 25 30
Ala Cys <210> 17 <211> 15
<212> PRT <213> Sunflower mild mosaic virus <400> 17
Ser Pro Ala Thr Thr Ala Ser Thr Ser Ala Arg Thr Ala Pro Ala 1 5 10 15 <210> 18 <211> 9
<212> PRT <213> Sunflower mild mosaic virus <400> 18
Ala Trp Cys Pro Ala Ala Thr Cys Cys 1 5 <210> 19 <211> 13
<212> PRT <213> Sunflower mild mosaic virus <400> 19
Ser Ala Cys Pro Arg Thr Ala Pro Pro Ser Pro Ala Ser 1 5 10 <210> 20 <211> 13
<212> PRT <213> Sunflower mild mosaic virus <400> 20
Ser Phe Glu Ser Pro Leu Ala Lys Lys Arg Pro Cys Leu 1 5 10
<210> 21 <211> 12 <212> PRT <213> Sunflower mild mosaic virus <400> 21
Ser Gln Thr Ser Arg Arg Ser Ile Phe Pro Ala Trp 1 5 10 <210> 22 <211> 30
<212> PRT <213> Sunflower mild mosaic virus <400> 22
Ala Arg Ala Ala Ala Trp Cys Ser Ala Arg Thr Ala Pro Ser Gly Ala 1 5 10 15
Thr Gly Ser Ala Pro Arg Thr Ala Thr Ala Ala Pro Pro Ser 20 25 30 <210> 23
<211> 16 <212> PRT <213> Sunflower mild mosaic virus <400> 23
Ala Ser Ala Thr Ala Thr Ser Ser Ala Ala Thr Ala Ala Arg Thr Pro 1 5 10 15 <210> 24 <211> 15
<212> PRT <213> Sunflower mild mosaic virus <400> 24
Pro Ala Thr Ser Ser Pro Ala Ser Pro Arg Thr Ser Arg Thr Cys 1 5 10 15 <210> 25
<211> 8 <212> PRT <213> Sunflower mild mosaic virus <400> 25
Thr Ala Arg Arg Pro Thr Ser Gly 1 5 <210> 26 <211> 14
<212> PRT <213> Sunflower mild mosaic virus <400> 26
Ala Ala Gly Ser Thr Thr Ser Thr Pro Cys Ala Gly Ala Ala 1 5 10 <210> 27 <211> 13
<212> PRT <213> Sunflower mild mosaic virus <400> 27
Thr Thr Pro Pro Ala Ser Pro Ser Ser Pro Pro Arg Trp 1 5 10 <210> 28 <211> 20
<212> PRT <213> Sunflower mild mosaic virus <400> 28
Ala Pro Trp Thr Pro Arg Ala Ser Arg Cys Ser Thr His Thr Met Phe 1 5 10 15
Gln Ile Thr Leu 20 <210> 29 <211> 354
<212> DNA <213> Sunflower mild mosaic virus <220> <221> CDS <222> (1)..(354) <400> 29 ggc gtg agc ctg agc cgc ggc gtg cgc gac tac aac gcc atc agc agc 48
Gly Val Ser Leu Ser Arg Gly Val Arg Asp Tyr Asn Ala Ile Ser Ser 1 5 10 15 atg gtg tgc cgc gtg acc aac gac agc ggc agc agc agc acc acc atg 96
Met Val Cys Arg Val Thr Asn Asp Ser Gly Ser Ser Ser Thr Thr Met 20 25 30 tac ggc atc ggc tac ggc tgc tac atc atc acc aac aag cac ctg ttc 144
Tyr Gly Ile Gly Tyr Gly Cys Tyr Ile Ile Thr Asn Lys His Leu Phe 35 40 45 cgc gag aac aac ggc cgc ctg ctg atc acc agc cac cac ggc gag tac 192
Arg Glu Asn Asn Gly Arg Leu Leu Ile Thr Ser His His Gly Glu Tyr 50 55 60 atc tgc aag aac agc gcc agc ctg aag ctg agc ctg gtg ccc ggc cgc 240
Ile Cys Lys Asn Ser Ala Ser Leu Lys Leu Ser Leu Val Pro Gly Arg 65 70 75 80 gac atg ctg ctg atc cgc ctg ccc aag gac tgc ccc ccc ttc ccc agc 288
Asp Met Leu Leu Ile Arg Leu Pro Lys Asp Cys Pro Pro Phe Pro Ser 85 90 95 aag ctg aag ttt cga gag ccc act agc gaa gaa aag gcc gtg ctt gta 336
Lys Leu Lys Phe Arg Glu Pro Thr Ser Glu Glu Lys Ala Val Leu Val 100 105 110 gtc aca aac ttc cag gag 354
Val Thr Asn Phe Gln Glu 115 <210> 30
<211> 118 <212> PRT <213> Sunflower mild mosaic virus <400> 30
Gly Val Ser Leu Ser Arg Gly Val Arg Asp Tyr Asn Ala Ile Ser Ser 1 5 10 15
Met Val Cys Arg Val Thr Asn Asp Ser Gly Ser Ser Ser Thr Thr Met 20 25 30
Tyr Gly Ile Gly Tyr Gly Cys Tyr Ile Ile Thr Asn Lys His Leu Phe 35 40 45
Arg Glu Asn Asn Gly Arg Leu Leu Ile Thr Ser His His Gly Glu Tyr 50 55 60
Ile Cys Lys Asn Ser Ala Ser Leu Lys Leu Ser Leu Val Pro Gly Arg 65 70 75 80
Asp Met Leu Leu Ile Arg Leu Pro Lys Asp Cys Pro Pro Phe Pro Ser 85 90 95
Lys Leu Lys Phe Arg Glu Pro Thr Ser Glu Glu Lys Ala Val Leu Val 100 105 110
Val Thr Asn Phe Gln Glu 115 <210> 31 <211> 369
<212> DNA <213> Sunflower mild mosaic virus <220> <221> CDS <222> (1)..(369) <400> 31 aag cat ctt tcc agc atg gtg agc gag agc agc tgc gtg gtg cag cgc 48
Lys His Leu Ser Ser Met Val Ser Glu Ser Ser Cys Val Val Gin Arg 15 10 15 gag gac agc ccc ate tgg cgc cac tgg ate agc acc aag gac ggc cac 96Lys His Leu Ser Ser Met Val Ser Glu Ser Ser Cys Val Val Gin Arg 15 10 15 gag gac agc ccc ate tgg cgc cac tgg ate agc acc aag gac ggc cac 96
Glu Asp Ser Pro Ile Trp Arg His Trp Ile Ser Thr Lys Asp Gly His 20 25 30 tgc ggc gcc ccc ate gtg agc ate cgc gac ggc tac ate ate ggc agc 144Glu Asp Ser Pro Ile Trp Arg His Trp Ile Ser Thr Lys Asp Gly His 20 25 30 tgc ggc gcc ccc ate gtg agc ate cgc gac ggc tac ate ate ggc agc 144
Cys Gly Ala Pro Ile Val Ser Ile Arg Asp Gly Tyr Ile Ile Gly Ser 35 40 45 cac tgc ggc gag aac ccc atg acc agc aac ttc ttc acc agc ate ccc 192Cys Gly Ala Pro Ile Val Ser Ile Arg Asp Gly Tyr Ile Ile Gly Ser 35 40 45 cac tgc ggc gag aac ccc atg acc agc aac ttc ttc acc agc ate ccc 192
His Cys Gly Glu Asn Pro Met Thr Ser Asn Phe Phe Thr Ser Ile Pro 50 55 60 aag gac ttc cag aac ctg ctg aac ggc aag gag gcc aac gag tgg gtg 240His Cys Gly Glu Asn Pro Met Thr Ser Asn Phe Phe Thr Ser Ile Pro 50 55 60 aag gac ttc cag aac ctg ctg aac ggc aag gag gcc aac gag tgg gtg 240
Lys Asp Phe Gin Asn Leu Leu Asn Gly Lys Glu Ala Asn Glu Trp Val 65 70 75 80 agc ggc tgg aag tac aac ate gac gcc gtg tgc tgg ggc ggc ctg agc 288Lys Asp Phe Gin Asn Leu Leu Asn Gly Lys Glu Ala Asn Glu Trp Val 65 70 75 80 agc ggc tgg aag tac aac ate gac gcc gtg tgc tgg ggc ggc ctg agc 288
Ser Gly Trp Lys Tyr Asn Ile Asp Ala Val Cys Trp Gly Gly Leu Ser 85 90 95 gtg gtg aac gac gcc ccc agc gag ccc ttc ate acc gcc aag gtg gtg 336Ser Gly Trp Lys Tyr Asn Ile Asp Ala Val Cys Trp Gly Gly Leu Ser 85 90 95 gtg gtg aac gac gcc ccc agc gag ccc ttc ate acc gcc aag gtg gtg 336
Val Val Asn Asp Ala Pro Ser Glu Pro Phe Ile Thr Ala Lys Val Val 100 105 110 agc gcc ctg gac acc gag ggc ate aag gtg cag 369Val Val Asn Asp Ala Pro Ser Glu Pro Phe Ile Thr Ala Lys Val Val 100 105 110 agc gcc ctg gac acc gag ggc ate aag gtg cag 369
Ser Ala Leu Asp Thr Glu Gly Ile Lys Val Gin 115 120 <210> 32 <211> 123Ser Ala Leu Asp Thr Glu Gly Ile Lys Val Gin 115 120 <210> 32 <211> 123
<212> PRT <213> Sunflower mild mosaic virus <400> 32
Lys His Leu Ser Ser Met Val Ser Glu Ser Ser Cys Val Val Gln Arg 1 5 10 15
Glu Asp Ser Pro Ile Trp Arg His Trp Ile Ser Thr Lys Asp Gly His 20 25 30
Cys Gly Ala Pro Ile Val Ser Ile Arg Asp Gly Tyr Ile Ile Gly Ser 35 40 45
His Cys Gly Glu Asn Pro Met Thr Ser Asn Phe Phe Thr Ser Ile Pro 50 55 60
Lys Asp Phe Gln Asn Leu Leu Asn Gly Lys Glu Ala Asn Glu Trp Val 65 70 75 80
Ser Gly Trp Lys Tyr Asn Ile Asp Ala Val Cys Trp Gly Gly Leu Ser 85 90 95
Val Val Asn Asp Ala Pro Ser Glu Pro Phe Ile Thr Ala Lys Val Val 100 105 110
Ser Ala Leu Asp Thr Glu Gly Ile Lys Val Gln 115 120
<210> 33 <211> 762
<212> DNA <213> Soybean mosaic virus <220> <221> CDS <222> (1)..(759) <400> 33
atg agc aag agc gtg tac aag ggc ctg cgc gac tac agc ggc atc agc 48
Met Ser Lys Ser Val Tyr Lys Gly Leu Arg Asp Tyr Ser Gly Ile Ser 1 5 10 15
acc ctg atc tgc cag ctg acc aac agc agc gac ggc cac aag gag acc 96
Thr Leu Ile Cys Gln Leu Thr Asn Ser Ser Asp Gly His Lys Glu Thr 20 25 30
atg ttc ggc gtg ggc tac ggc agc ttc atc atc acc aac ggc cac ctg 144
Met Phe Gly Val Gly Tyr Gly Ser Phe Ile Ile Thr Asn Gly His Leu 35 40 45
ttc cgc cgc aac aac ggc atg ctg acc gtg aag acc tgg cac ggc gag 192
Phe Arg Arg Asn Asn Gly Met Leu Thr Val Lys Thr Trp His Gly Glu 50 55 60
ttc gtg atc cac aac acc acc cag ctg aag atc cac ttc atc cag ggc 240
Phe Val Ile His Asn Thr Thr Gln Leu Lys Ile His Phe Ile Gln Gly 65 70 75 80
cgc gac gtg atc ctg atc cgc atg ccc aag gac ttc ccc ccc ttc ggc 288
Arg Asp Val Ile Leu Ile Arg Met Pro Lys Asp Phe Pro Pro Phe Gly 85 90 95
aag cgc aac ctg ttc cgc cag ccc aag cgc gag gag cgc gtg tgc atg 336
Lys Arg Asn Leu Phe Arg Gln Pro Lys Arg Glu Glu Arg Val Cys Met 100 105 110
gtg ggc acc aac ttc cag gag aag agc ctg cgc gcc acc gtg agc gag 384
Val Gly Thr Asn Phe Gln Glu Lys Ser Leu Arg Ala Thr Val Ser Glu 115 120 125
agc agc atg atc ctg ccc gag ggc aag ggc agc ttc tgg ata cac tgg 432
Ser Ser Met Ile Leu Pro Glu Gly Lys Gly Ser Phe Trp Ile His Trp 130 135 140
atc acc acc cag gac ggc ttc tgc ggc ctg ccc ctg gtg agc gtg aac 480
Ile Thr Thr Gln Asp Gly Phe Cys Gly Leu Pro Leu Val Ser Val Asn 145 150 155 160
gac ggc cac atc gtg ggc atc cac ggc ctg acc agc aac gac agc gag 528
Asp Gly His Ile Val Gly Ile His Gly Leu Thr Ser Asn Asp Ser Glu 165 170 175
aag aac ttc ttc gtg ccc ctg acc gac ggc ttc gag aag gag tac ctg 576
Lys Asn Phe Phe Val Pro Leu Thr Asp Gly Phe Glu Lys Glu Tyr Leu 180 185 190
gag aac gcc gac aac ctg agc tgg gac aag cac tgg ttc tgg gag ccc 624
Glu Asn Ala Asp Asn Leu Ser Trp Asp Lys His Trp Phe Trp Glu Pro 195 200 205
agc aag atc gcc tgg ggc agc ctg aac ctg gtg gag gag cag ccc aag 672
Ser Lys Ile Ala Trp Gly Ser Leu Asn Leu Val Glu Glu Gln Pro Lys 210 215 220
gag gag ttc aag atc agc aag ctg gtg agc gac ctg ttc ggc aac acc 720
Glu Glu Phe Lys Ile Ser Lys Leu Val Ser Asp Leu Phe Gly Asn Thr 225 230 235 240
gtg acc gtg cag tac cca tac gat gtt cca gat tac gct taa 762
Val Thr Val Gln Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 245 250
<210> 34 <211> 253
<212> PRT <213> Soybean mosaic virus <400> 34
Met Ser Lys Ser Val Tyr Lys Gly Leu Arg Asp Tyr Ser Gly Ile Ser 1 5 10 15
Thr Leu Ile Cys Gln Leu Thr Asn Ser Ser Asp Gly His Lys Glu Thr 20 25 30
Met Phe Gly Val Gly Tyr Gly Ser Phe Ile Ile Thr Asn Gly His Leu 35 40 45
Phe Arg Arg Asn Asn Gly Met Leu Thr Val Lys Thr Trp His Gly Glu 50 55 60
Phe Val Ile His Asn Thr Thr Gln Leu Lys Ile His Phe Ile Gln Gly 65 70 75 80
Arg Asp Val Ile Leu Ile Arg Met Pro Lys Asp Phe Pro Pro Phe Gly 85 90 95
Lys Arg Asn Leu Phe Arg Gln Pro Lys Arg Glu Glu Arg Val Cys Met 100 105 110
Val Gly Thr Asn Phe Gln Glu Lys Ser Leu Arg Ala Thr Val Ser Glu 115 120 125
Ser Ser Met Ile Leu Pro Glu Gly Lys Gly Ser Phe Trp Ile His Trp 130 135 140
Ile Thr Thr Gln Asp Gly Phe Cys Gly Leu Pro Leu Val Ser Val Asn 145 150 155 160
Asp Gly His Ile Val Gly Ile His Gly Leu Thr Ser Asn Asp Ser Glu 165 170 175
Lys Asn Phe Phe Val Pro Leu Thr Asp Gly Phe Glu Lys Glu Tyr Leu 180 185 190
Glu Asn Ala Asp Asn Leu Ser Trp Asp Lys His Trp Phe Trp Glu Pro 195 200 205
Ser Lys Ile Ala Trp Gly Ser Leu Asn Leu Val Glu Glu Gln Pro Lys 210 215 220
Glu Glu Phe Lys Ile Ser Lys Leu Val Ser Asp Leu Phe Gly Asn Thr 225 230 235 240
Val Thr Val Gln Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 245 250
<210> 35 <211> 354
<212> DNA <213> Soybean mosaic virus <220> <221> CDS <222> (1)..(354) <400> 35
agc aag agc gtg tac aag ggc ctg cgc gac tac agc ggc atc agc acc 48
Ser Lys Ser Val Tyr Lys Gly Leu Arg Asp Tyr Ser Gly Ile Ser Thr 1 5 10 15
ctg atc tgc cag ctg acc aac agc agc gac ggc cac aag gag acc atg 96
Leu Ile Cys Gln Leu Thr Asn Ser Ser Asp Gly His Lys Glu Thr Met 20 25 30
ttc ggc gtg ggc tac ggc agc ttc atc atc acc aac ggc cac ctg ttc 144
Phe Gly Val Gly Tyr Gly Ser Phe Ile Ile Thr Asn Gly His Leu Phe 35 40 45
cgc cgc aac aac ggc atg ctg acc gtg aag acc tgg cac ggc gag ttc 192
Arg Arg Asn Asn Gly Met Leu Thr Val Lys Thr Trp His Gly Glu Phe 50 55 60
gtg atc cac aac acc acc cag ctg aag atc cac ttc atc cag ggc cgc 240
Val Ile His Asn Thr Thr Gln Leu Lys Ile His Phe Ile Gln Gly Arg 65 70 75 80
gac gtg atc ctg atc cgc atg ccc aag gac ttc ccc ccc ttc ggc aag 288
Asp Val Ile Leu Ile Arg Met Pro Lys Asp Phe Pro Pro Phe Gly Lys 85 90 95
cgc aac ctg ttc cgc cag ccc aag cgc gag gag cgc gtg tgc atg gtg 336
Arg Asn Leu Phe Arg Gln Pro Lys Arg Glu Glu Arg Val Cys Met Val 100 105 110
ggc acc aac ttc cag gag 354
Gly Thr Asn Phe Gln Glu 115
<210> 36
<211> 118 <212> PRT <213> Soybean mosaic virus <400> 36
Ser Lys Ser Val Tyr Lys Gly Leu Arg Asp Tyr Ser Gly Ile Ser Thr 1 5 10 15
Leu Ile Cys Gln Leu Thr Asn Ser Ser Asp Gly His Lys Glu Thr Met 20 25 30
Phe Gly Val Gly Tyr Gly Ser Phe Ile Ile Thr Asn Gly His Leu Phe 35 40 45
Arg Arg Asn Asn Gly Met Leu Thr Val Lys Thr Trp His Gly Glu Phe 50 55 60
Val Ile His Asn Thr Thr Gln Leu Lys Ile His Phe Ile Gln Gly Arg 65 70 75 80
Asp Val Ile Leu Ile Arg Met Pro Lys Asp Phe Pro Pro Phe Gly Lys 85 90 95
Arg Asn Leu Phe Arg Gln Pro Lys Arg Glu Glu Arg Val Cys Met Val 100 105 110
Gly Thr Asn Phe Gln Glu 115
<210> 37 <211> 375
<212> DNA <213> Soybean mosaic virus <220> <221> CDS <222> (1)..(375) <400> 37
aag agc ctg cgc gcc acc gtg agc gag agc agc atg atc ctg ccc gag 48
Lys Ser Leu Arg Ala Thr Val Ser Glu Ser Ser Met Ile Leu Pro Glu 1 5 10 15
ggc aag ggc agc ttc tgg ata cac tgg atc acc acc cag gac ggc ttc 96
Gly Lys Gly Ser Phe Trp Ile His Trp Ile Thr Thr Gln Asp Gly Phe 20 25 30
tgc ggc ctg ccc ctg gtg agc gtg aac gac ggc cac atc gtg ggc atc 144
Cys Gly Leu Pro Leu Val Ser Val Asn Asp Gly His Ile Val Gly Ile 35 40 45
cac ggc ctg acc agc aac gac agc gag aag aac ttc ttc gtg ccc ctg 192
His Gly Leu Thr Ser Asn Asp Ser Glu Lys Asn Phe Phe Val Pro Leu 50 55 60
acc gac ggc ttc gag aag gag tac ctg gag aac gcc gac aac ctg agc 240
Thr Asp Gly Phe Glu Lys Glu Tyr Leu Glu Asn Ala Asp Asn Leu Ser 65 70 75 80
tgg gac aag cac tgg ttc tgg gag ccc agc aag atc gcc tgg ggc agc 288
Trp Asp Lys His Trp Phe Trp Glu Pro Ser Lys Ile Ala Trp Gly Ser 85 90 95
ctg aac ctg gtg gag gag cag ccc aag gag gag ttc aag atc agc aag 336
Leu Asn Leu Val Glu Glu Gln Pro Lys Glu Glu Phe Lys Ile Ser Lys 100 105 110
ctg gtg agc gac ctg ttc ggc aac acc gtg acc gtg cag 375
Leu Val Ser Asp Leu Phe Gly Asn Thr Val Thr Val Gln 115 120 125
<210> 38 <211> 125
<212> PRT <213> Soybean mosaic virus <400> 38
Lys Ser Leu Arg Ala Thr Val Ser Glu Ser Ser Met Ile Leu Pro Glu 1 5 10 15
Gly Lys Gly Ser Phe Trp Ile His Trp Ile Thr Thr Gln Asp Gly Phe 20 25 30
Cys Gly Leu Pro Leu Val Ser Val Asn Asp Gly His Ile Val Gly Ile 35 40 45
His Gly Leu Thr Ser Asn Asp Ser Glu Lys Asn Phe Phe Val Pro Leu 50 55 60
Thr Asp Gly Phe Glu Lys Glu Tyr Leu Glu Asn Ala Asp Asn Leu Ser 65 70 75 80
Trp Asp Lys His Trp Phe Trp Glu Pro Ser Lys Ile Ala Trp Gly Ser 85 90 95
Leu Asn Leu Val Glu Glu Gln Pro Lys Glu Glu Phe Lys Ile Ser Lys 100 105 110
Leu Val Ser Asp Leu Phe Gly Asn Thr Val Thr Val Gln 115 120 125
<210> 39 <211> 711
<212> DNA <213> Artificial Sequence <220> <223> - <220> <221> CDS <222> (1)..(708) <400> 39
atg gga gtg caa gtg gaa acc atc tcc ccg gga gac ggg cgc acc ttc 48
Met Gly Val Gln Val Glu Thr Ile Ser Pro Gly Asp Gly Arg Thr Phe 1 5 10 15
ccc aag cgc ggc cag acc tgc gtg gtg cac tac acc ggg atg ctt gaa 96
Pro Lys Arg Gly Gln Thr Cys Val Val His Tyr Thr Gly Met Leu Glu 20 25 30
gat gga aag aaa ttt gat tcc tcc cgg gac aga aac aag ccc ttt aag 144
Asp Gly Lys Lys Phe Asp Ser Ser Arg Asp Arg Asn Lys Pro Phe Lys 35 40 45
ttt atg cta ggc aag cag gag gtg atc cga ggc tgg gaa gaa ggg gtt 192
Phe Met Leu Gly Lys Gln Glu Val Ile Arg Gly Trp Glu Glu Gly Val 50 55 60
gcc cag atg agt gtg ggt cag aga gcc aaa ctg act ata tct cca gat 240
Ala Gln Met Ser Val Gly Gln Arg Ala Lys Leu Thr Ile Ser Pro Asp 65 70 75 80
tat gcc tat ggt gcc act ggg cac cca ggc atc atc cca cca cat gcc 288
Tyr Ala Tyr Gly Ala Thr Gly His Pro Gly Ile Ile Pro Pro His Ala 85 90 95
act ctc gtc ttc gat gtg gag ctt cta aaa ctg gaa gga tcc gga agt 336
Thr Leu Val Phe Asp Val Glu Leu Leu Lys Leu Glu Gly Ser Gly Ser 100 105 110
aag agc atg tct agc atg gtg tca gac acc agt tgc aca ttc cct tca 384
Lys Ser Met Ser Ser Met Val Ser Asp Thr Ser Cys Thr Phe Pro Ser 115 120 125
tct gat ggc ata ttc tgg aag cat tgg att caa acc aag gat ggg cag 432
Ser Asp Gly Ile Phe Trp Lys His Trp Ile Gln Thr Lys Asp Gly Gln 130 135 140
tgt ggc agt cca tta gta tca act aga gat ggg ttc att gtt ggt ata 480
Cys Gly Ser Pro Leu Val Ser Thr Arg Asp Gly Phe Ile Val Gly Ile 145 150 155 160
cac tca gca tcg aat ttc acc aac aca aac aat tat ttc aca agc gtg 528
His Ser Ala Ser Asn Phe Thr Asn Thr Asn Asn Tyr Phe Thr Ser Val 165 170 175
ccg aaa aac ttc atg gaa ttg ttg aca aat cag gag gcg cag cag tgg 576
Pro Lys Asn Phe Met Glu Leu Leu Thr Asn Gln Glu Ala Gln Gln Trp 180 185 190
gtt agt ggt tgg cga tta aat gct gac tca gta ttg tgg ggg ggc cat 624
Val Ser Gly Trp Arg Leu Asn Ala Asp Ser Val Leu Trp Gly Gly His 195 200 205
aaa gtt ttc atg agc aaa cct gaa gag cct ttt cag cca gtt aag gaa 672
Lys Val Phe Met Ser Lys Pro Glu Glu Pro Phe Gln Pro Val Lys Glu 210 215 220
gcg act caa ctc atg agt gaa ttg gtg tac tcg caa taa 711
Ala Thr Gln Leu Met Ser Glu Leu Val Tyr Ser Gln 225 230 235
<210> 40 <211> 236
<212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 40
Met Gly Val Gln Val Glu Thr Ile Ser Pro Gly Asp Gly Arg Thr Phe 1 5 10 15
Pro Lys Arg Gly Gln Thr Cys Val Val His Tyr Thr Gly Met Leu Glu 20 25 30
Asp Gly Lys Lys Phe Asp Ser Ser Arg Asp Arg Asn Lys Pro Phe Lys 35 40 45
Phe Met Leu Gly Lys Gln Glu Val Ile Arg Gly Trp Glu Glu Gly Val 50 55 60
Ala Gln Met Ser Val Gly Gln Arg Ala Lys Leu Thr Ile Ser Pro Asp 65 70 75 80
Tyr Ala Tyr Gly Ala Thr Gly His Pro Gly Ile Ile Pro Pro His Ala 85 90 95
Thr Leu Val Phe Asp Val Glu Leu Leu Lys Leu Glu Gly Ser Gly Ser 100 105 110
Lys Ser Met Ser Ser Met Val Ser Asp Thr Ser Cys Thr Phe Pro Ser 115 120 125
Ser Asp Gly Ile Phe Trp Lys His Trp Ile Gln Thr Lys Asp Gly Gln 130 135 140
Cys Gly Ser Pro Leu Val Ser Thr Arg Asp Gly Phe Ile Val Gly Ile 145 150 155 160
His Ser Ala Ser Asn Phe Thr Asn Thr Asn Asn Tyr Phe Thr Ser Val 165 170 175
Pro Lys Asn Phe Met Glu Leu Leu Thr Asn Gln Glu Ala Gln Gln Trp 180 185 190
Val Ser Gly Trp Arg Leu Asn Ala Asp Ser Val Leu Trp Gly Gly His 195 200 205
Lys Val Phe Met Ser Lys Pro Glu Glu Pro Phe Gln Pro Val Lys Glu 210 215 220
Ala Thr Gln Leu Met Ser Glu Leu Val Tyr Ser Gln 225 230 235
<210> 41 <211> 651
<212> DNA <213> Artificial Sequence <220> <223> - <220> <221> CDS <222> (1)..(648) <400> 41
atg atc ctc tgg cat gag atg tgg cat gaa ggc ctg gaa gag gca tct 48
Met Ile Leu Trp His Glu Met Trp His Glu Gly Leu Glu Glu Ala Ser 1 5 10 15
cgt ttg tac ttt ggg gaa agg aac gtg aaa ggc atg ttt gag gtg ctg 96
Arg Leu Tyr Phe Gly Glu Arg Asn Val Lys Gly Met Phe Glu Val Leu 20 25 30
gag ccc ttg cat gct atg atg gaa cgg ggc ccc cag act ctg aag gaa 144
Glu Pro Leu His Ala Met Met Glu Arg Gly Pro Gln Thr Leu Lys Glu 35 40 45
aca tcc ttt aat cag gcc tat ggt cga gat tta atg gag gcc caa gag 192
Thr Ser Phe Asn Gln Ala Tyr Gly Arg Asp Leu Met Glu Ala Gln Glu 50 55 60
tgg tgc agg aag tac atg aaa tca ggg aat gtc aag gac ctc ctc caa 240
Trp Cys Arg Lys Tyr Met Lys Ser Gly Asn Val Lys Asp Leu Leu Gln 65 70 75 80
gcc tgg gac ctc tat tat cat gtg ttc cga cga atc tca aag gga tcc 288
Ala Trp Asp Leu Tyr Tyr His Val Phe Arg Arg Ile Ser Lys Gly Ser 85 90 95
gga agt gga gaa agc ttg ttt aag gga cca cgt gat tac aac ccg ata 336
Gly Ser Gly Glu Ser Leu Phe Lys Gly Pro Arg Asp Tyr Asn Pro Ile 100 105 110
tcg agc acc att tgt cac ttg acg aat gaa tct gat ggg cac aca aca 384
Ser Ser Thr Ile Cys His Leu Thr Asn Glu Ser Asp Gly His Thr Thr 115 120 125
tcg ttg tat ggt att gga ttt ggt ccc ttc atc att aca aac aag cac 432
Ser Leu Tyr Gly Ile Gly Phe Gly Pro Phe Ile Ile Thr Asn Lys His 130 135 140
ttg ttt aga aga aat aat gga aca ctg ttg gtc caa tca cta cat ggt 480
Leu Phe Arg Arg Asn Asn Gly Thr Leu Leu Val Gln Ser Leu His Gly 145 150 155 160
gta ttc aag gtc aag aac acc acg act ttg caa caa cac ctc att gat 528
Val Phe Lys Val Lys Asn Thr Thr Thr Leu Gln Gln His Leu Ile Asp 165 170 175
ggg agg gac atg ata att att cgc atg cct aag gat ttc cca cca ttt 576
Gly Arg Asp Met Ile Ile Ile Arg Met Pro Lys Asp Phe Pro Pro Phe 180 185 190
cct caa aag ctg aaa ttt aga gag cca caa agg gaa gag cgc ata tgt 624
Pro Gln Lys Leu Lys Phe Arg Glu Pro Gln Arg Glu Glu Arg Ile Cys 195 200 205
ctt gtg aca acc aac ttc caa act taa 651
Leu Val Thr Thr Asn Phe Gln Thr 210 215
<210> 42
<211> 216 <212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 42
Met Ile Leu Trp His Glu Met Trp His Glu Gly Leu Glu Glu Ala Ser 1 5 10 15
Arg Leu Tyr Phe Gly Glu Arg Asn Val Lys Gly Met Phe Glu Val Leu 20 25 30
Glu Pro Leu His Ala Met Met Glu Arg Gly Pro Gln Thr Leu Lys Glu 35 40 45
Thr Ser Phe Asn Gln Ala Tyr Gly Arg Asp Leu Met Glu Ala Gln Glu 50 55 60
Trp Cys Arg Lys Tyr Met Lys Ser Gly Asn Val Lys Asp Leu Leu Gln 65 70 75 80
Ala Trp Asp Leu Tyr Tyr His Val Phe Arg Arg Ile Ser Lys Gly Ser 85 90 95
Gly Ser Gly Glu Ser Leu Phe Lys Gly Pro Arg Asp Tyr Asn Pro Ile 100 105 110
Ser Ser Thr Ile Cys His Leu Thr Asn Glu Ser Asp Gly His Thr Thr 115 120 125
Ser Leu Tyr Gly Ile Gly Phe Gly Pro Phe Ile Ile Thr Asn Lys His 130 135 140
Leu Phe Arg Arg Asn Asn Gly Thr Leu Leu Val Gln Ser Leu His Gly 145 150 155 160
Val Phe Lys Val Lys Asn Thr Thr Thr Leu Gln Gln His Leu Ile Asp 165 170 175
Gly Arg Asp Met Ile Ile Ile Arg Met Pro Lys Asp Phe Pro Pro Phe 180 185 190
Pro Gln Lys Leu Lys Phe Arg Glu Pro Gln Arg Glu Glu Arg Ile Cys 195 200 205
Leu Val Thr Thr Asn Phe Gln Thr 210 215
<210> 43 <211> 759
<212> DNA <213> Artificial Sequence <220> <223> - <220> <221> CDS <222> (1)..(756) <400> 43
atg gga gtg caa gtg gaa acc atc tcc ccg gga gac ggg cgc acc ttc 48
Met Gly Val Gln Val Glu Thr Ile Ser Pro Gly Asp Gly Arg Thr Phe 1 5 10 15
ccc aag cgc ggc cag acc tgc gtg gtg cac tac acc ggg atg ctt gaa 96
Pro Lys Arg Gly Gln Thr Cys Val Val His Tyr Thr Gly Met Leu Glu 20 25 30
gat gga aag aaa ttt gat tcc tcc cgg gac aga aac aag ccc ttt aag 144
Asp Gly Lys Lys Phe Asp Ser Ser Arg Asp Arg Asn Lys Pro Phe Lys 35 40 45
ttt atg cta ggc aag cag gag gtg atc cga ggc tgg gaa gaa ggg gtt 192
Phe Met Leu Gly Lys Gln Glu Val Ile Arg Gly Trp Glu Glu Gly Val 50 55 60
gcc cag atg agt gtg ggt cag aga gcc aaa ctg act ata tct cca gat 240
Ala Gln Met Ser Val Gly Gln Arg Ala Lys Leu Thr Ile Ser Pro Asp 65 70 75 80
tat gcc tat ggt gcc act ggg cac cca ggc atc atc cca cca cat gcc 288
Tyr Ala Tyr Gly Ala Thr Gly His Pro Gly Ile Ile Pro Pro His Ala 85 90 95
act ctc gtc ttc gat gtg gag ctt cta aaa ctg gaa gat act tat aga 336
Thr Leu Val Phe Asp Val Glu Leu Leu Lys Leu Glu Asp Thr Tyr Arg 100 105 110
tat att gat act tat aga tat att aag agc atg tct agc atg gtg tca 384
Tyr Ile Asp Thr Tyr Arg Tyr Ile Lys Ser Met Ser Ser Met Val Ser 115 120 125
gac acc agt tgc aca ttc cct tca tct gat ggc ata ttc tgg aag cat 432
Asp Thr Ser Cys Thr Phe Pro Ser Ser Asp Gly Ile Phe Trp Lys His 130 135 140
tgg att caa acc aag gat ggg cag tgt ggc agt cca tta gta tca act 480
Trp Ile Gln Thr Lys Asp Gly Gln Cys Gly Ser Pro Leu Val Ser Thr 145 150 155 160
aga gat ggg ttc att gtt ggt ata cac tca gca tcg aat ttc acc aac 528
Arg Asp Gly Phe Ile Val Gly Ile His Ser Ala Ser Asn Phe Thr Asn 165 170 175
aca aac aat tat ttc aca agc gtg ccg aaa aac ttc atg gaa ttg ttg 576
Thr Asn Asn Tyr Phe Thr Ser Val Pro Lys Asn Phe Met Glu Leu Leu 180 185 190 aca aat cag gag gcg cag cag tgg gtt agt ggt tgg ega tta aat get 624Thr Asn Asn Tyr Phe Thr Ser Val Pro Lys Asn Phe Met Glu Leu Leu 180 185 190 aca aat cag gag gcg cag cag tgg gtt agt ggt tgg ega tta aat get 624
Thr Asn Gin Glu Ala Gin Gin Trp Val Ser Gly Trp Arg Leu Asn Ala 195 200 205 gac tca gta ttg tgg ggg ggc cat aaa gtt ttc atg agc aaa cct gaa 672Thr Asn Gin Glu Ala Gin Gin Trp Val Ser Gly Trp Arg Leu Asn Ala 195 200 205 gac tca gta ttg tgg ggg ggc cat aaa gtt ttc atg agc aaa cct gaa 672
Asp Ser Val Leu Trp Gly Gly His Lys Val Phe Met Ser Lys Pro Glu 210 215 220 gag cct ttt cag cca gtt aag gaa gcg act caa ete atg agt gaa ttg 720Asp Ser Val Leu Trp Gly Gly His Lys Val Phe Met Ser Lys Pro Glu 210 215 220 gag cct ttt cag cca gtt aag gaa gcg act caa ete atg agt gaa ttg 720
Glu Pro Phe Gin Pro Val Lys Glu Ala Thr Gin Leu Met Ser Glu Leu 225 230 235 240 gtg tac teg caa gat cct aaa aag aag aga aag gta taa 759Glu Pro Phe Gin Pro Val Lys Glu Ala Thr Gin Leu Met Ser Glu Leu 225 230 235 240 gtg tac teg caa gat cct aaa aag aag aga aag gta taa 759
Val Tyr Ser Gln Asp Pro Lys Lys Lys Arg Lys Val 245 250 <210> 44 <211> 252
<212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 44
Met Gly Val Gln Val Glu Thr Ile Ser Pro Gly Asp Gly Arg Thr Phe 1 5 10 15
Pro Lys Arg Gly Gln Thr Cys Val Val His Tyr Thr Gly Met Leu Glu 20 25 30
Asp Gly Lys Lys Phe Asp Ser Ser Arg Asp Arg Asn Lys Pro Phe Lys 35 40 45
Phe Met Leu Gly Lys Gln Glu Val Ile Arg Gly Trp Glu Glu Gly Val 50 55 60
Ala Gln Met Ser Val Gly Gln Arg Ala Lys Leu Thr Ile Ser Pro Asp 65 70 75 80
Tyr Ala Tyr Gly Ala Thr Gly His Pro Gly Ile Ile Pro Pro His Ala 85 90 95
Thr Leu Val Phe Asp Val Glu Leu Leu Lys Leu Glu Asp Thr Tyr Arg 100 105 110
Tyr Ile Asp Thr Tyr Arg Tyr Ile Lys Ser Met Ser Ser Met Val Ser 115 120 125
Asp Thr Ser Cys Thr Phe Pro Ser Ser Asp Gly Ile Phe Trp Lys His 130 135 140
Trp Ile Gln Thr Lys Asp Gly Gln Cys Gly Ser Pro Leu Val Ser Thr 145 150 155 160
Arg Asp Gly Phe Ile Val Gly Ile His Ser Ala Ser Asn Phe Thr Asn 165 170 175
Thr Asn Asn Tyr Phe Thr Ser Val Pro Lys Asn Phe Met Glu Leu Leu 180 185 190
Thr Asn Gln Glu Ala Gln Gln Trp Val Ser Gly Trp Arg Leu Asn Ala 195 200 205
Asp Ser Val Leu Trp Gly Gly His Lys Val Phe Met Ser Lys Pro Glu 210 215 220
Glu Pro Phe Gln Pro Val Lys Glu Ala Thr Gln Leu Met Ser Glu Leu 225 230 235 240
Val Tyr Ser Gln Asp Pro Lys Lys Lys Arg Lys Val 245 250 <210> 45 <211> 699
<212> DNA <213> Artificial Sequence <220> <223> - <220> <221> CDS <222> (1)..(696) <400> 45 atg gac ccg aaa aag aag aga aag gta atc ctc tgg cat gag atg tgg 48
Met Asp Pro Lys Lys Lys Arg Lys Val Ile Leu Trp His Glu Met Trp 1 5 10 15 cat gaa ggc ctg gaa gag gca tct cgt ttg tac ttt ggg gaa agg aac 96
His Glu Gly Leu Glu Glu Ala Ser Arg Leu Tyr Phe Gly Glu Arg Asn 20 25 30 gtg aaa ggc atg ttt gag gtg ctg gag ccc ttg cat gct atg atg gaa 144
Val Lys Gly Met Phe Glu Val Leu Glu Pro Leu His Ala Met Met Glu 35 40 45 cgg ggc ccc cag act ctg aag gaa aca tcc ttt aat cag gcc tat ggt 192
Arg Gly Pro Gln Thr Leu Lys Glu Thr Ser Phe Asn Gln Ala Tyr Gly 50 55 60 cga gat tta atg gag gcc caa gag tgg tgc agg aag tac atg aaa tca 240
Arg Asp Leu Met Glu Ala Gln Glu Trp Cys Arg Lys Tyr Met Lys Ser 65 70 75 80 ggg aat gtc aag gac ctc ctc caa gcc tgg gac ctc tat tat cat gtg 288
Gly Asn Val Lys Asp Leu Leu Gln Ala Trp Asp Leu Tyr Tyr His Val 85 90 95 ttc cga cga atc tca aag gat act tat aga tat att gat act tat aga 336
Phe Arg Arg Ile Ser Lys Asp Thr Tyr Arg Tyr Ile Asp Thr Tyr Arg 100 105 110 tat att gga gaa agc ttg ttt aag gga cca cgt gat tac aac ccg ata 384
Tyr Ile Gly Glu Ser Leu Phe Lys Gly Pro Arg Asp Tyr Asn Pro Ile 115 120 125 tcg agc acc att tgt cac ttg acg aat gaa tct gat ggg cac aca aca 432
Ser Ser Thr Ile Cys His Leu Thr Asn Glu Ser Asp Gly His Thr Thr 130 135 140 tcg ttg tat ggt att gga ttt ggt ccc ttc atc att aca aac aag cac 480
Ser Leu Tyr Gly Ile Gly Phe Gly Pro Phe Ile Ile Thr Asn Lys His 145 150 155 160 ttg ttt aga aga aat aat gga aca ctg ttg gtc caa tca cta cat ggt 528
Leu Phe Arg Arg Asn Asn Gly Thr Leu Leu Val Gln Ser Leu His Gly 165 170 175 gta ttc aag gtc aag aac acc acg act ttg caa caa cac ctc att gat 576
Val Phe Lys Val Lys Asn Thr Thr Thr Leu Gln Gln His Leu Ile Asp 180 185 190 ggg agg gac atg ata att att cgc atg cct aag gat ttc cca cca ttt 624
Gly Arg Asp Met Ile Ile Ile Arg Met Pro Lys Asp Phe Pro Pro Phe 195 200 205 cct caa aag ctg aaa ttt aga gag cca caa agg gaa gag cgc ata tgt 672
Pro Gln Lys Leu Lys Phe Arg Glu Pro Gln Arg Glu Glu Arg Ile Cys 210 215 220 ctt gtg aca acc aac ttc caa act taa 699
Leu Val Thr Thr Asn Phe Gln Thr 225 230 <210> 46 <211> 232
<212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 46
Met Asp Pro Lys Lys Lys Arg Lys Val Ile Leu Trp His Glu Met Trp 1 5 10 15
His Glu Gly Leu Glu Glu Ala Ser Arg Leu Tyr Phe Gly Glu Arg Asn 20 25 30
Val Lys Gly Met Phe Glu Val Leu Glu Pro Leu His Ala Met Met Glu 35 40 45
Arg Gly Pro Gln Thr Leu Lys Glu Thr Ser Phe Asn Gln Ala Tyr Gly 50 55 60
Arg Asp Leu Met Glu Ala Gln Glu Trp Cys Arg Lys Tyr Met Lys Ser 65 70 75 80
Gly Asn Val Lys Asp Leu Leu Gln Ala Trp Asp Leu Tyr Tyr His Val 85 90 95
Phe Arg Arg Ile Ser Lys Asp Thr Tyr Arg Tyr Ile Asp Thr Tyr Arg 100 105 110
Tyr Ile Gly Glu Ser Leu Phe Lys Gly Pro Arg Asp Tyr Asn Pro Ile 115 120 125
Ser Ser Thr Ile Cys His Leu Thr Asn Glu Ser Asp Gly His Thr Thr 130 135 140
Ser Leu Tyr Gly Ile Gly Phe Gly Pro Phe Ile Ile Thr Asn Lys His 145 150 155 160
Leu Phe Arg Arg Asn Asn Gly Thr Leu Leu Val Gln Ser Leu His Gly 165 170 175
Val Phe Lys Val Lys Asn Thr Thr Thr Leu Gln Gln His Leu Ile Asp 180 185 190
Gly Arg Asp Met Ile Ile Ile Arg Met Pro Lys Asp Phe Pro Pro Phe 195 200 205
Pro Gln Lys Leu Lys Phe Arg Glu Pro Gln Arg Glu Glu Arg Ile Cys 210 215 220
Leu Val Thr Thr Asn Phe Gln Thr 225 230 <210> 47 <211> 714
<212> DNA <213> Artificial Sequence <220> <223> - <220> <221> CDS <222> (1)..(711) <400> 47 atg gga gtg caa gtg gaa acc atc tcc ccg gga gac ggg cgc acc ttc 48
Met Gly Val Gln Val Glu Thr Ile Ser Pro Gly Asp Gly Arg Thr Phe 1 5 10 15 ccc aag cgc ggc cag acc tgc gtg gtg cac tac acc ggg atg ctt gaa 96
Pro Lys Arg Gly Gln Thr Cys Val Val His Tyr Thr Gly Met Leu Glu 20 25 30 gat gga aag aaa ttt gat tcc tcc cgg gac aga aac aag ccc ttt aag 144
Asp Gly Lys Lys Phe Asp Ser Ser Arg Asp Arg Asn Lys Pro Phe Lys 35 40 45 ttt atg cta ggc aag cag gag gtg atc cga ggc tgg gaa gaa ggg gtt 192
Phe Met Leu Gly Lys Gln Glu Val Ile Arg Gly Trp Glu Glu Gly Val 50 55 60 gcc cag atg agt gtg ggt cag aga gcc aaa ctg act ata tct cca gat 240
Ala Gln Met Ser Val Gly Gln Arg Ala Lys Leu Thr Ile Ser Pro Asp 65 70 75 80 tat gcc tat ggt gcc act ggg cac cca ggc atc atc cca cca cat gcc 288
Tyr Ala Tyr Gly Ala Thr Gly His Pro Gly Ile Ile Pro Pro His Ala 85 90 95 act ctc gtc ttc gat gtg gag ctt cta aaa ctg gaa gga tcc gga agt 336
Thr Leu Val Phe Asp Val Glu Leu Leu Lys Leu Glu Gly Ser Gly Ser 100 105 110 aag agc atc agc agc acc atg agc gag acc agc gcc acc tac ccc gtg 384
Lys Ser Ile Ser Ser Thr Met Ser Glu Thr Ser Ala Thr Tyr Pro Val 115 120 125 gac aac agc cac ttc tgg aag cac tgg atc agc acc aag gac ggc cac 432
Asp Asn Ser His Phe Trp Lys His Trp Ile Ser Thr Lys Asp Gly His 130 135 140 tgc ggc ctg ccc atc gtg agc acc cgc gac ggc agc atc ctg ggc ctg 480
Cys Gly Leu Pro Ile Val Ser Thr Arg Asp Gly Ser Ile Leu Gly Leu 145 150 155 160 cac agc ctg gcc aac agc acc aac acc cag aac ttc tac gcc gcc ttc 528
His Ser Leu Ala Asn Ser Thr Asn Thr Gln Asn Phe Tyr Ala Ala Phe 165 170 175 ccc gac aac ttc gag acc acc tac ctg agc aac cag gac aac gac aac 576
Pro Asp Asn Phe Glu Thr Thr Tyr Leu Ser Asn Gln Asp Asn Asp Asn 180 185 190 tgg atc aag cag tgg cgc tac aac ccc gac gag gtg tgc tgg ggc agc 624
Trp Ile Lys Gln Trp Arg Tyr Asn Pro Asp Glu Val Cys Trp Gly Ser 195 200 205 ctg caa ctg aag cgc gac atc ccc cag agc ccc ttc acc atc tgc aag 672
Leu Gln Leu Lys Arg Asp Ile Pro Gln Ser Pro Phe Thr Ile Cys Lys 210 215 220 ctg ctg acc gac ctg gac ggc gag ttc gtg tac acc cag taa 714
Leu Leu Thr Asp Leu Asp Gly Glu Phe Val Tyr Thr Gln 225 230 235 <210> 48 <211> 237
<212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 48
Met Gly Val Gln Val Glu Thr Ile Ser Pro Gly Asp Gly Arg Thr Phe 1 5 10 15
Pro Lys Arg Gly Gln Thr Cys Val Val His Tyr Thr Gly Met Leu Glu 20 25 30
Asp Gly Lys Lys Phe Asp Ser Ser Arg Asp Arg Asn Lys Pro Phe Lys 35 40 45
Phe Met Leu Gly Lys Gln Glu Val Ile Arg Gly Trp Glu Glu Gly Val 50 55 60
Ala Gln Met Ser Val Gly Gln Arg Ala Lys Leu Thr Ile Ser Pro Asp 65 70 75 80
Tyr Ala Tyr Gly Ala Thr Gly His Pro Gly Ile Ile Pro Pro His Ala 85 90 95
Thr Leu Val Phe Asp Val Glu Leu Leu Lys Leu Glu Gly Ser Gly Ser 100 105 110
Lys Ser Ile Ser Ser Thr Met Ser Glu Thr Ser Ala Thr Tyr Pro Val 115 120 125
Asp Asn Ser His Phe Trp Lys His Trp Ile Ser Thr Lys Asp Gly His 130 135 140
Cys Gly Leu Pro Ile Val Ser Thr Arg Asp Gly Ser Ile Leu Gly Leu 145 150 155 160
His Ser Leu Ala Asn Ser Thr Asn Thr Gln Asn Phe Tyr Ala Ala Phe 165 170 175
Pro Asp Asn Phe Glu Thr Thr Tyr Leu Ser Asn Gln Asp Asn Asp Asn 180 185 190
Trp Ile Lys Gln Trp Arg Tyr Asn Pro Asp Glu Val Cys Trp Gly Ser 195 200 205
Leu Gln Leu Lys Arg Asp Ile Pro Gln Ser Pro Phe Thr Ile Cys Lys 210 215 220
Leu Leu Thr Asp Leu Asp Gly Glu Phe Val Tyr Thr Gln 225 230 235 <210> 49 <211> 651
<212> DNA <213> Artificial Sequence <220> <223> - <220> <221> CDS <222> (1)..(648) <400> 49 atg atc ctc tgg cat gag atg tgg cat gaa ggc ctg gaa gag gca tct 48
Met Ile Leu Trp His Glu Met Trp His Glu Gly Leu Glu Glu Ala Ser 1 5 10 15 cgt ttg tac ttt ggg gaa agg aac gtg aaa ggc atg ttt gag gtg ctg 96
Arg Leu Tyr Phe Gly Glu Arg Asn Val Lys Gly Met Phe Glu Val Leu 20 25 30 gag ccc ttg cat gct atg atg gaa cgg ggc ccc cag act ctg aag gaa 144
Glu Pro Leu His Ala Met Met Glu Arg Gly Pro Gln Thr Leu Lys Glu 35 40 45 aca tcc ttt aat cag gcc tat ggt cga gat tta atg gag gcc caa gag 192
Thr Ser Phe Asn Gln Ala Tyr Gly Arg Asp Leu Met Glu Ala Gln Glu 50 55 60 tgg tgc agg aag tac atg aaa tca ggg aat gtc aag gac ctc ctc caa 240
Trp Cys Arg Lys Tyr Met Lys Ser Gly Asn Val Lys Asp Leu Leu Gln 65 70 75 80 gcc tgg gac ctc tat tat cat gtg ttc cga cga atc tca aag gga tcc 288
Ala Trp Asp Leu Tyr Tyr His Val Phe Arg Arg Ile Ser Lys Gly Ser 85 90 95 gga agt agc aag agc ctg ttc cgc ggc ctg cgc gac tac aac ccc atc 336
Gly Ser Ser Lys Ser Leu Phe Arg Gly Leu Arg Asp Tyr Asn Pro Ile 100 105 110 gcc agc agc atc tgc cag ctg aac aac agc agc ggc gcc cgc cag agc 384
Ala Ser Ser Ile Cys Gln Leu Asn Asn Ser Ser Gly Ala Arg Gln Ser 115 120 125 gag atg ttc ggc ctg ggc ttc ggc ggc ctg atc gtg acc aac cag cac 432
Glu Met Phe Gly Leu Gly Phe Gly Gly Leu Ile Val Thr Asn Gln His 130 135 140 ctg ttc aag cgc aac gac ggc gag ctg acc atc cgc agc cac cac ggc 480
Leu Phe Lys Arg Asn Asp Gly Glu Leu Thr Ile Arg Ser His His Gly 145 150 155 160 gag ttc gtg gtg aag gac acc aag acc ctg aag ctg ctg ccc tgc aag 528
Glu Phe Val Val Lys Asp Thr Lys Thr Leu Lys Leu Leu Pro Cys Lys 165 170 175 ggc cgc gac atc gtg atc atc cgc ctg ccc aag gac ttc ccc ccc ttc 576
Gly Arg Asp Ile Val Ile Ile Arg Leu Pro Lys Asp Phe Pro Pro Phe 180 185 190 ccc aag cgc ctg caa ttc cgc acc ccc acc acc gag gac cgc gtg tgc 624
Pro Lys Arg Leu Gln Phe Arg Thr Pro Thr Thr Glu Asp Arg Val Cys 195 200 205 ctg atc ggc agc aac ttc cag acc taa 651
Leu Ile Gly Ser Asn Phe Gln Thr 210 215 <210> 50
<211> 216 <212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 50
Met Ile Leu Trp His Glu Met Trp His Glu Gly Leu Glu Glu Ala Ser 1 5 10 15
Arg Leu Tyr Phe Gly Glu Arg Asn Val Lys Gly Met Phe Glu Val Leu 20 25 30
Glu Pro Leu His Ala Met Met Glu Arg Gly Pro Gln Thr Leu Lys Glu 35 40 45
Thr Ser Phe Asn Gln Ala Tyr Gly Arg Asp Leu Met Glu Ala Gln Glu 50 55 60
Trp Cys Arg Lys Tyr Met Lys Ser Gly Asn Val Lys Asp Leu Leu Gln 65 70 75 80
Ala Trp Asp Leu Tyr Tyr His Val Phe Arg Arg Ile Ser Lys Gly Ser 85 90 95
Gly Ser Ser Lys Ser Leu Phe Arg Gly Leu Arg Asp Tyr Asn Pro Ile 100 105 110
Ala Ser Ser Ile Cys Gln Leu Asn Asn Ser Ser Gly Ala Arg Gln Ser 115 120 125
Glu Met Phe Gly Leu Gly Phe Gly Gly Leu Ile Val Thr Asn Gln His 130 135 140
Leu Phe Lys Arg Asn Asp Gly Glu Leu Thr Ile Arg Ser His His Gly 145 150 155 160
Glu Phe Val Val Lys Asp Thr Lys Thr Leu Lys Leu Leu Pro Cys Lys 165 170 175
Gly Arg Asp Ile Val Ile Ile Arg Leu Pro Lys Asp Phe Pro Pro Phe 180 185 190
Pro Lys Arg Leu Gln Phe Arg Thr Pro Thr Thr Glu Asp Arg Val Cys 195 200 205
Leu Ile Gly Ser Asn Phe Gln Thr 210 215 <210> 51 <211> 735
<212> DNA <213> Artificial Sequence <220> <223> - <220> <221> CDS <222> (1)..(732) <400> 51 atg gga gtg caa gtg gaa acc atc tcc ccg gga gac ggg cgc acc ttc 48
Met Gly Val Gln Val Glu Thr Ile Ser Pro Gly Asp Gly Arg Thr Phe 1 5 10 15 ccc aag cgc ggc cag acc tgc gtg gtg cac tac acc ggg atg ctt gaa 96
Pro Lys Arg Gly Gln Thr Cys Val Val His Tyr Thr Gly Met Leu Glu 20 25 30 gat gga aag aaa ttt gat tcc tcc cgg gac aga aac aag ccc ttt aag 144
Asp Gly Lys Lys Phe Asp Ser Ser Arg Asp Arg Asn Lys Pro Phe Lys 35 40 45 ttt atg cta ggc aag cag gag gtg atc cga ggc tgg gaa gaa ggg gtt 192
Phe Met Leu Gly Lys Gln Glu Val Ile Arg Gly Trp Glu Glu Gly Val 50 55 60 gcc cag atg agt gtg ggt cag aga gcc aaa ctg act ata tct cca gat 240
Ala Gln Met Ser Val Gly Gln Arg Ala Lys Leu Thr Ile Ser Pro Asp 65 70 75 80 tat gcc tat ggt gcc act ggg cac cca ggc atc atc cca cca cat gcc 288
Tyr Ala Tyr Gly Ala Thr Gly His Pro Gly Ile Ile Pro Pro His Ala 85 90 95 act ctc gtc ttc gat gtg gag ctt cta aaa ctg gaa gga tcc gga agt 336
Thr Leu Val Phe Asp Val Glu Leu Leu Lys Leu Glu Gly Ser Gly Ser 100 105 110 aag cat ctt tcc agc atg gtg agc gag agc agc tgc gtg gtg cag cgc 384
Lys His Leu Ser Ser Met Val Ser Glu Ser Ser Cys Val Val Gln Arg 115 120 125 gag gac agc ccc atc tgg cgc cac tgg atc agc acc aag gac ggc cac 432
Glu Asp Ser Pro Ile Trp Arg His Trp Ile Ser Thr Lys Asp Gly His 130 135 140 tgc ggc gcc ccc atc gtg agc atc cgc gac ggc tac atc atc ggc agc 480
Cys Gly Ala Pro Ile Val Ser Ile Arg Asp Gly Tyr Ile Ile Gly Ser 145 150 155 160 cac tgc ggc gag aac ccc atg acc agc aac ttc ttc acc agc atc ccc 528
His Cys Gly Glu Asn Pro Met Thr Ser Asn Phe Phe Thr Ser Ile Pro 165 170 175 aag gac ttc cag aac ctg ctg aac ggc aag gag gcc aac gag tgg gtg 576
Lys Asp Phe Gln Asn Leu Leu Asn Gly Lys Glu Ala Asn Glu Trp Val 180 185 190 agc ggc tgg aag tac aac atc gac gcc gtg tgc tgg ggc ggc ctg agc 624
Ser Gly Trp Lys Tyr Asn Ile Asp Ala Val Cys Trp Gly Gly Leu Ser 195 200 205 gtg gtg aac gac gcc ccc agc gag ccc ttc atc acc gcc aag gtg gtg 672
Val Val Asn Asp Ala Pro Ser Glu Pro Phe Ile Thr Ala Lys Val Val 210 215 220 agc gcc ctg gac acc gag ggc atc aag gtg cag tac cca tac gat gtt 720
Ser Ala Leu Asp Thr Glu Gly Ile Lys Val Gln Tyr Pro Tyr Asp Val 225 230 235 240 cca gat tac gct taa 735
Pro Asp Tyr Ala <210> 52 <211> 244
<212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 52
Met Gly Val Gln Val Glu Thr Ile Ser Pro Gly Asp Gly Arg Thr Phe 1 5 10 15
Pro Lys Arg Gly Gln Thr Cys Val Val His Tyr Thr Gly Met Leu Glu 20 25 30
Asp Gly Lys Lys Phe Asp Ser Ser Arg Asp Arg Asn Lys Pro Phe Lys 35 40 45
Phe Met Leu Gly Lys Gln Glu Val Ile Arg Gly Trp Glu Glu Gly Val 50 55 60
Ala Gln Met Ser Val Gly Gln Arg Ala Lys Leu Thr Ile Ser Pro Asp 65 70 75 80
Tyr Ala Tyr Gly Ala Thr Gly His Pro Gly Ile Ile Pro Pro His Ala 85 90 95
Thr Leu Val Phe Asp Val Glu Leu Leu Lys Leu Glu Gly Ser Gly Ser 100 105 110
Lys His Leu Ser Ser Met Val Ser Glu Ser Ser Cys Val Val Gln Arg 115 120 125
Glu Asp Ser Pro Ile Trp Arg His Trp Ile Ser Thr Lys Asp Gly His 130 135 140
Cys Gly Ala Pro Ile Val Ser Ile Arg Asp Gly Tyr Ile Ile Gly Ser 145 150 155 160
His Cys Gly Glu Asn Pro Met Thr Ser Asn Phe Phe Thr Ser Ile Pro 165 170 175
Lys Asp Phe Gln Asn Leu Leu Asn Gly Lys Glu Ala Asn Glu Trp Val 180 185 190
Ser Gly Trp Lys Tyr Asn Ile Asp Ala Val Cys Trp Gly Gly Leu Ser 195 200 205
Val Val Asn Asp Ala Pro Ser Glu Pro Phe Ile Thr Ala Lys Val Val 210 215 220
Ser Ala Leu Asp Thr Glu Gly Ile Lys Val Gln Tyr Pro Tyr Asp Val 225 230 235 240
Pro Asp Tyr Ala <210> 53Pro Asp Tyr Ala <210> 53
<211> 681 <212> DNA <213> Artificial Sequence <220> <223> - <220> <221> CDS <222> (1)..(678) <400> 53 atg gag cag aag ctg ate agc gag gag gat ctg ate ete tgg eat gag 48<211> 681 <212> DNA <213> Artificial Sequence <220> <223> - <220> <221> CDS <222> (1) .. (678) <400> 53 atg gag cag aag ctg ate agc gag gag gat ctg ate ete tgg eat gag 48
Met Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Ile Leu Trp His Glu 1 5 10 15 atg tgg cat gaa ggc ctg gaa gag gca tct cgt ttg tac ttt ggg gaa 96
Met Trp His Glu Gly Leu Glu Glu Ala Ser Arg Leu Tyr Phe Gly Glu 20 25 30 agg aac gtg aaa ggc atg ttt gag gtg ctg gag ccc ttg cat gct atg 144
Arg Asn Val Lys Gly Met Phe Glu Val Leu Glu Pro Leu His Ala Met 35 40 45 atg gaa cgg ggc ccc cag act ctg aag gaa aca tcc ttt aat cag gcc 192
Met Glu Arg Gly Pro Gln Thr Leu Lys Glu Thr Ser Phe Asn Gln Ala 50 55 60 tat ggt cga gat tta atg gag gcc caa gag tgg tgc agg aag tac atg 240
Tyr Gly Arg Asp Leu Met Glu Ala Gln Glu Trp Cys Arg Lys Tyr Met 65 70 75 80 aaa tca ggg aat gtc aag gac ctc ctc caa gcc tgg gac ctc tat tat 288
Lys Ser Gly Asn Val Lys Asp Leu Leu Gln Ala Trp Asp Leu Tyr Tyr 85 90 95 cat gtg ttc cga cga atc tca aag gga tcc gga agt ggc gtg agc ctg 336
His Val Phe Arg Arg Ile Ser Lys Gly Ser Gly Ser Gly Val Ser Leu 100 105 110 agc cgc ggc gtg cgc gac tac aac gcc atc agc agc atg gtg tgc cgc 384
Ser Arg Gly Val Arg Asp Tyr Asn Ala Ile Ser Ser Met Val Cys Arg 115 120 125 gtg acc aac gac agc ggc agc agc agc acc acc atg tac ggc atc ggc 432
Val Thr Asn Asp Ser Gly Ser Ser Ser Thr Thr Met Tyr Gly Ile Gly 130 135 140 tac ggc tgc tac atc atc acc aac aag cac ctg ttc cgc gag aac aac 480
Tyr Gly Cys Tyr Ile Ile Thr Asn Lys His Leu Phe Arg Glu Asn Asn 145 150 155 160 ggc cgc ctg ctg atc acc agc cac cac ggc gag tac atc tgc aag aac 528
Gly Arg Leu Leu Ile Thr Ser His His Gly Glu Tyr Ile Cys Lys Asn 165 170 175 agc gcc agc ctg aag ctg agc ctg gtg ccc ggc cgc gac atg ctg ctg 576
Ser Ala Ser Leu Lys Leu Ser Leu Val Pro Gly Arg Asp Met Leu Leu 180 185 190 atc cgc ctg ccc aag gac tgc ccc ccc ttc ccc agc aag ctg aag ttt 624
Ile Arg Leu Pro Lys Asp Cys Pro Pro Phe Pro Ser Lys Leu Lys Phe 195 200 205 cga gag ccc act agc gaa gaa aag gcc gtg ctt gta gtc aca aac ttc 672
Arg Glu Pro Thr Ser Glu Glu Lys Ala Val Leu Val Val Thr Asn Phe 210 215 220 cag gag taa 681
Gln Glu 225 <210> 54
<211> 226 <212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 54
Met Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Ile Leu Trp His Glu 1 5 10 15
Met Trp His Glu Gly Leu Glu Glu Ala Ser Arg Leu Tyr Phe Gly Glu 20 25 30
Arg Asn Val Lys Gly Met Phe Glu Val Leu Glu Pro Leu His Ala Met 35 40 45
Met Glu Arg Gly Pro Gln Thr Leu Lys Glu Thr Ser Phe Asn Gln Ala 50 55 60
Tyr Gly Arg Asp Leu Met Glu Ala Gln Glu Trp Cys Arg Lys Tyr Met 65 70 75 80
Lys Ser Gly Asn Val Lys Asp Leu Leu Gln Ala Trp Asp Leu Tyr Tyr 85 90 95
His Val Phe Arg Arg Ile Ser Lys Gly Ser Gly Ser Gly Val Ser Leu 100 105 110
Ser Arg Gly Val Arg Asp Tyr Asn Ala Ile Ser Ser Met Val Cys Arg 115 120 125
Val Thr Asn Asp Ser Gly Ser Ser Ser Thr Thr Met Tyr Gly Ile Gly 130 135 140
Tyr Gly Cys Tyr Ile Ile Thr Asn Lys His Leu Phe Arg Glu Asn Asn 145 150 155 160
Gly Arg Leu Leu Ile Thr Ser His His Gly Glu Tyr Ile Cys Lys Asn 165 170 175
Ser Ala Ser Leu Lys Leu Ser Leu Val Pro Gly Arg Asp Met Leu Leu 180 185 190
Ile Arg Leu Pro Lys Asp Cys Pro Pro Phe Pro Ser Lys Leu Lys Phe 195 200 205
Arg Glu Pro Thr Ser Glu Glu Lys Ala Val Leu Val Val Thr Asn Phe 210 215 220
Gln Glu 225 <210> 55 <211> 741
<212> DNA <213> Artificial Sequence <220> <223> - <220> <221> CDS <222> (1)..(738) <400> 55 atg gga gtg caa gtg gaa acc atc tcc ccg gga gac ggg cgc acc ttc 48
Met Gly Val Gln Val Glu Thr Ile Ser Pro Gly Asp Gly Arg Thr Phe 1 5 10 15 ccc aag cgc ggc cag acc tgc gtg gtg cac tac acc ggg atg ctt gaa 96
Pro Lys Arg Gly Gln Thr Cys Val Val His Tyr Thr Gly Met Leu Glu 20 25 30 gat gga aag aaa ttt gat tcc tcc cgg gac aga aac aag ccc ttt aag 144
Asp Gly Lys Lys Phe Asp Ser Ser Arg Asp Arg Asn Lys Pro Phe Lys 35 40 45 ttt atg cta ggc aag cag gag gtg atc cga ggc tgg gaa gaa ggg gtt 192
Phe Met Leu Gly Lys Gln Glu Val Ile Arg Gly Trp Glu Glu Gly Val 50 55 60 gcc cag atg agt gtg ggt cag aga gcc aaa ctg act ata tct cca gat 240
Ala Gln Met Ser Val Gly Gln Arg Ala Lys Leu Thr Ile Ser Pro Asp 65 70 75 80 tat gcc tat ggt gcc act ggg cac cca ggc atc atc cca cca cat gcc 288
Tyr Ala Tyr Gly Ala Thr Gly His Pro Gly Ile Ile Pro Pro His Ala 85 90 95 act ctc gtc ttc gat gtg gag ctt cta aaa ctg gaa gga tcc gga agt 336
Thr Leu Val Phe Asp Val Glu Leu Leu Lys Leu Glu Gly Ser Gly Ser 100 105 110 aag agc ctg cgc gcc acc gtg agc gag agc agc atg atc ctg ccc gag 384
Lys Ser Leu Arg Ala Thr Val Ser Glu Ser Ser Met Ile Leu Pro Glu 115 120 125 ggc aag ggc agc ttc tgg ata cac tgg atc acc acc cag gac ggc ttc 432
Gly Lys Gly Ser Phe Trp Ile His Trp Ile Thr Thr Gln Asp Gly Phe 130 135 140 tgc ggc ctg ccc ctg gtg agc gtg aac gac ggc cac atc gtg ggc atc 480
Cys Gly Leu Pro Leu Val Ser Val Asn Asp Gly His Ile Val Gly Ile 145 150 155 160 cac ggc ctg acc agc aac gac agc gag aag aac ttc ttc gtg ccc ctg 528
His Gly Leu Thr Ser Asn Asp Ser Glu Lys Asn Phe Phe Val Pro Leu 165 170 175 acc gac ggc ttc gag aag gag tac ctg gag aac gcc gac aac ctg agc 576
Thr Asp Gly Phe Glu Lys Glu Tyr Leu Glu Asn Ala Asp Asn Leu Ser 180 185 190 tgg gac aag cac tgg ttc tgg gag ccc agc aag atc gcc tgg ggc agc 624
Trp Asp Lys His Trp Phe Trp Glu Pro Ser Lys Ile Ala Trp Gly Ser 195 200 205 ctg aac ctg gtg gag gag cag ccc aag gag gag ttc aag atc agc aag 672
Leu Asn Leu Val Glu Glu Gln Pro Lys Glu Glu Phe Lys Ile Ser Lys 210 215 220 ctg gtg agc gac ctg ttc ggc aac acc gtg acc gtg cag tac cca tac 720
Leu Val Ser Asp Leu Phe Gly Asn Thr Val Thr Val Gln Tyr Pro Tyr 225 230 235 240 gat gtt cca gat tac gct taa 741
Asp Val Pro Asp Tyr Ala 245 <210> 56 <211> 246
<212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 56
Met Gly Val Gln Val Glu Thr Ile Ser Pro Gly Asp Gly Arg Thr Phe 1 5 10 15
Pro Lys Arg Gly Gln Thr Cys Val Val His Tyr Thr Gly Met Leu Glu 20 25 30
Asp Gly Lys Lys Phe Asp Ser Ser Arg Asp Arg Asn Lys Pro Phe Lys 35 40 45
Phe Met Leu Gly Lys Gln Glu Val Ile Arg Gly Trp Glu Glu Gly Val 50 55 60
Ala Gln Met Ser Val Gly Gln Arg Ala Lys Leu Thr Ile Ser Pro Asp 65 70 75 80
Tyr Ala Tyr Gly Ala Thr Gly His Pro Gly Ile Ile Pro Pro His Ala 85 90 95
Thr Leu Val Phe Asp Val Glu Leu Leu Lys Leu Glu Gly Ser Gly Ser 100 105 110
Lys Ser Leu Arg Ala Thr Val Ser Glu Ser Ser Met Ile Leu Pro Glu 115 120 125
Gly Lys Gly Ser Phe Trp Ile His Trp Ile Thr Thr Gln Asp Gly Phe 130 135 140
Cys Gly Leu Pro Leu Val Ser Val Asn Asp Gly His Ile Val Gly Ile 145 150 155 160
His Gly Leu Thr Ser Asn Asp Ser Glu Lys Asn Phe Phe Val Pro Leu 165 170 175
Thr Asp Gly Phe Glu Lys Glu Tyr Leu Glu Asn Ala Asp Asn Leu Ser 180 185 190
Trp Asp Lys His Trp Phe Trp Glu Pro Ser Lys Ile Ala Trp Gly Ser 195 200 205
Leu Asn Leu Val Glu Glu Gln Pro Lys Glu Glu Phe Lys Ile Ser Lys 210 215 220
Leu Val Ser Asp Leu Phe Gly Asn Thr Val Thr Val Gln Tyr Pro Tyr 225 230 235 240
Asp Val Pro Asp Tyr Ala 245 <210> 57
<211> 681 <212> DNA <213> Artificial Sequence <220> <223> - <220> <221> CDS <222> (1)..(678) <400> 57 atg gag cag aag ctg atc agc gag gag gat ctg atc ctc tgg cat gag 48
Met Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Ile Leu Trp His Glu 1 5 10 15 atg tgg cat gaa ggc ctg gaa gag gca tct cgt ttg tac ttt ggg gaa 96
Met Trp His Glu Gly Leu Glu Glu Ala Ser Arg Leu Tyr Phe Gly Glu 20 25 30 agg aac gtg aaa ggc atg ttt gag gtg ctg gag ccc ttg cat gct atg 144
Arg Asn Val Lys Gly Met Phe Glu Val Leu Glu Pro Leu His Ala Met 35 40 45 atg gaa cgg ggc ccc cag act ctg aag gaa aca tcc ttt aat cag gcc 192
Met Glu Arg Gly Pro Gln Thr Leu Lys Glu Thr Ser Phe Asn Gln Ala 50 55 60 tat ggt cga gat tta atg gag gcc caa gag tgg tgc agg aag tac atg 240
Tyr Gly Arg Asp Leu Met Glu Ala Gln Glu Trp Cys Arg Lys Tyr Met 65 70 75 80 aaa tca ggg aat gtc aag gac ctc ctc caa gcc tgg gac ctc tat tat 288
Lys Ser Gly Asn Val Lys Asp Leu Leu Gln Ala Trp Asp Leu Tyr Tyr 85 90 95 cat gtg ttc cga cga atc tca aag gga tcc gga agt agc aag agc gtg 336
His Val Phe Arg Arg Ile Ser Lys Gly Ser Gly Ser Ser Lys Ser Val 100 105 110 tac aag ggc ctg cgc gac tac agc ggc atc agc acc ctg atc tgc cag 384
Tyr Lys Gly Leu Arg Asp Tyr Ser Gly Ile Ser Thr Leu Ile Cys Gln 115 120 125 ctg acc aac agc agc gac ggc cac aag gag acc atg ttc ggc gtg ggc 432
Leu Thr Asn Ser Ser Asp Gly His Lys Glu Thr Met Phe Gly Val Gly 130 135 140 tac ggc agc ttc atc atc acc aac ggc cac ctg ttc cgc cgc aac aac 480
Tyr Gly Ser Phe Ile Ile Thr Asn Gly His Leu Phe Arg Arg Asn Asn 145 150 155 160 ggc atg ctg acc gtg aag acc tgg cac ggc gag ttc gtg atc cac aac 528
Gly Met Leu Thr Val Lys Thr Trp His Gly Glu Phe Val Ile His Asn 165 170 175 acc acc cag ctg aag atc cac ttc atc cag ggc cgc gac gtg atc ctg 576
Thr Thr Gln Leu Lys Ile His Phe Ile Gln Gly Arg Asp Val Ile Leu 180 185 190 atc cgc atg ccc aag gac ttc ccc ccc ttc ggc aag cgc aac ctg ttc 624
Ile Arg Met Pro Lys Asp Phe Pro Pro Phe Gly Lys Arg Asn Leu Phe 195 200 205 cgc cag ccc aag cgc gag gag cgc gtg tgc atg gtg ggc acc aac ttc 672
Arg Gln Pro Lys Arg Glu Glu Arg Val Cys Met Val Gly Thr Asn Phe 210 215 220 cag gag taa 681
Gln Glu 225 <210> 58
<211> 226 <212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 58
Met Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Ile Leu Trp His Glu 1 5 10 15
Met Trp His Glu Gly Leu Glu Glu Ala Ser Arg Leu Tyr Phe Gly Glu 20 25 30
Arg Asn Val Lys Gly Met Phe Glu Val Leu Glu Pro Leu His Ala Met 35 40 45
Met Glu Arg Gly Pro Gln Thr Leu Lys Glu Thr Ser Phe Asn Gln Ala 50 55 60
Tyr Gly Arg Asp Leu Met Glu Ala Gln Glu Trp Cys Arg Lys Tyr Met 65 70 75 80
Lys Ser Gly Asn Val Lys Asp Leu Leu Gln Ala Trp Asp Leu Tyr Tyr 85 90 95
His Val Phe Arg Arg Ile Ser Lys Gly Ser Gly Ser Ser Lys Ser Val 100 105 110
Tyr Lys Gly Leu Arg Asp Tyr Ser Gly Ile Ser Thr Leu Ile Cys Gln 115 120 125
Leu Thr Asn Ser Ser Asp Gly His Lys Glu Thr Met Phe Gly Val Gly 130 135 140
Tyr Gly Ser Phe Ile Ile Thr Asn Gly His Leu Phe Arg Arg Asn Asn 145 150 155 160
Gly Met Leu Thr Val Lys Thr Trp His Gly Glu Phe Val Ile His Asn 165 170 175
Thr Thr Gln Leu Lys Ile His Phe Ile Gln Gly Arg Asp Val Ile Leu 180 185 190
Ile Arg Met Pro Lys Asp Phe Pro Pro Phe Gly Lys Arg Asn Leu Phe 195 200 205
Arg Gln Pro Lys Arg Glu Glu Arg Val Cys Met Val Gly Thr Asn Phe 210 215 220
Gln Glu 225 <210> 59 <211> 1929
<212> DNA <213> Artificial Sequence <220> <223> - <220> <221> CDS <222> (1)..(1926) <400> 59 atg cac cac cac cac cac cac aag atg gac aaa aag act ata gtt tgg 48
Met His His His His His His Lys Met Asp Lys Lys Thr Ile Val Trp 1 5 10 15 ttt aga aga gac cta agg att gag gat aat cct gca tta gca gca gct 96
Phe Arg Arg Asp Leu Arg Ile Glu Asp Asn Pro Ala Leu Ala Ala Ala 20 25 30 gct cac gaa gga tct gtt ttt cct gtc ttc att tgg tgt cct gaa gaa 144
Ala His Glu Gly Ser Val Phe Pro Val Phe Ile Trp Cys Pro Glu Glu 35 40 45 gaa gga cag ttt tat cct gga aga gct tca aga tgg tgg atg aaa caa 192
Glu Gly Gln Phe Tyr Pro Gly Arg Ala Ser Arg Trp Trp Met Lys Gln 50 55 60 tca ctt gct cac tta tct caa tcc ttg aag gct ctt gga tct gac ctc 240
Ser Leu Ala His Leu Ser Gln Ser Leu Lys Ala Leu Gly Ser Asp Leu 65 70 75 80 act tta atc aaa acc cac aac acg att tca gcg atc ttg gat tgt atc 288
Thr Leu Ile Lys Thr His Asn Thr Ile Ser Ala Ile Leu Asp Cys Ile 85 90 95 cgc gtt acc ggt gct aca aaa gtc gtc ttt aac cac ctc tat gat cct 336
Arg Val Thr Gly Ala Thr Lys Val Val Phe Asn His Leu Tyr Asp Pro 100 105 110 gtt tcg tta gtt cgg gac cat acc gta aag gag aag ctg gtg gaa cgt 384
Val Ser Leu Val Arg Asp His Thr Val Lys Glu Lys Leu Val Glu Arg 115 120 125 ggg atc tct gtg caa agc tac aat gga gat cta ttg tat gaa ccg tgg 432
Gly Ile Ser Val Gln Ser Tyr Asn Gly Asp Leu Leu Tyr Glu Pro Trp 130 135 140 gag ata tac tgc gaa aag ggc aaa cct ttt acg agt ttc aat tct tac 480
Glu Ile Tyr Cys Glu Lys Gly Lys Pro Phe Thr Ser Phe Asn Ser Tyr 145 150 155 160 tgg aag aaa tgc tta gat atg tcg att gaa tcc gtt atg ctt cct cct 528
Trp Lys Lys Cys Leu Asp Met Ser Ile Glu Ser Val Met Leu Pro Pro 165 170 175 cct tgg cgg ttg atg cca ata act gca gcg gct gaa gcg att tgg gcg 576
Pro Trp Arg Leu Met Pro Ile Thr Ala Ala Ala Glu Ala Ile Trp Ala 180 185 190 tgt tcg att gaa gaa cta ggg ctg gag aat gag gcc gag aaa ccg agc 624
Cys Ser Ile Glu Glu Leu Gly Leu Glu Asn Glu Ala Glu Lys Pro Ser 195 200 205 aat gcg ttg tta act aga gct tgg tct cca gga tgg agc aat gct gat 672
Asn Ala Leu Leu Thr Arg Ala Trp Ser Pro Gly Trp Ser Asn Ala Asp 210 215 220 aag tta cta aat gag ttc atc gag aag cag ttg ata gat tat gca aag 720
Lys Leu Leu Asn Glu Phe Ile Glu Lys Gln Leu Ile Asp Tyr Ala Lys 225 230 235 240 aac agc aag aaa gtt gtt ggg aat tct act tca cta ctt tct ccg tat 768
Asn Ser Lys Lys Val Val Gly Asn Ser Thr Ser Leu Leu Ser Pro Tyr 245 250 255 ctc cat ttc ggg gaa ata agc gtc aga cac gtt ttc cag tgt gcc cgg 816
Leu His Phe Gly Glu Ile Ser Val Arg His Val Phe Gln Cys Ala Arg 260 265 270 atg aaa caa att ata tgg gca aga gat aag aac agt gaa gga gaa gaa 864
Met Lys Gln Ile Ile Trp Ala Arg Asp Lys Asn Ser Glu Gly Glu Glu 275 280 285 agt gca gat ctt ttt ctt agg gga atc ggt tta aga gag tat tct cgg 912
Ser Ala Asp Leu Phe Leu Arg Gly Ile Gly Leu Arg Glu Tyr Ser Arg 290 295 300 tat ata tgt ttc aac ttc ccg ttt act cac gag caa tcg ttg ttg agt 960
Tyr Ile Cys Phe Asn Phe Pro Phe Thr His Glu Gln Ser Leu Leu Ser 305 310 315 320 cat ctt cgg ttt ttc cct tgg gat gct gat gtt gat aag ttc aag gcc 1008
His Leu Arg Phe Phe Pro Trp Asp Ala Asp Val Asp Lys Phe Lys Ala 325 330 335 tgg aga caa ggc agg acc ggt tat ccg ttg gtg gat gcc gga atg aga 1056His Leu Arg Phe Phe Pro Trp Asp Ala Asp Val Asp Lys Phe Lys Ala 325 330 335 tgg aga caa ggc agg acc ggt tat ccg ttg gtg gat gcc gga atg aga 1056
Trp Arg Gin Gly Arg Thr Gly Tyr Pro Leu Val Asp Ala Gly Met Arg 340 345 350 gag ctt tgg get acc gga tgg atg cat aac aga ata aga gtg att gtt 1104Trp Arg Gin Gly Arg Thr Gly Tyr Pro Leu Val Asp Ala Gly Met Arg 340 345 350 gag ctt tgg get acc gga tgg atg cat aac aga ata aga gtg att gtt 1104
Glu Leu Trp Ala Thr Gly Trp Met His Asn Arg Ile Arg Val Ile Val 355 360 365 tca agc ttt get gtg aag ttt ctt ete ctt cca tgg aaa tgg gga atg 1152Glu Leu Trp Ala Thr Gly Trp Met His Asn Arg Ile Arg Val Ile Val 355 360 365 tca agt ttt get gtg aag ttt ctt ete ctt cga tgg aaa tgg gga atg 1152
Ser Ser Phe Ala Val Lys Phe Leu Leu Leu Pro Trp Lys Trp Gly Met 370 375 380 aag tat ttc tgg gat aca ctt ttg gat get gat ttg gaa tgt gac ate 1200Ser Ser Phe Ala Val Lys Phe Leu Leu Leu Pro Trp Lys Trp Gly Met 370 375 380 aag tat ttc tgg gat aca ctt ttg gat get gat ttg gaa tgt gac ate 1200
Lys Tyr Phe Trp Asp Thr Leu Leu Asp Ala Asp Leu Glu Cys Asp Ile 385 390 395 400 ctt ggc tgg cag tat ate tet ggg agt ate ccc gat ggc cac gag ctt 1248Lys Tyr Phe Trp Asp Thr Leu Leu Asp Ala Asp Leu Glu Cys Asp Ile 385 390 395 400 ctt ggc tgg cag tat ate tet ggg agt ate ccc gat ggc cac gag ctt 1248
Leu Gly Trp Gin Tyr Ile Ser Gly Ser Ile Pro Asp Gly His Glu Leu 405 410 415 gat ege ttg gac aat ccc gcg tta caa ggc gcc aaa tat gac cca gaa 1296Leu Gly Trp Gin Tyr Ile Ser Gly Ser Ile Pro Asp Gly His Glu Leu 405 410 415 gat ege ttg gac aat ccc gcg tta caa ggc gcc aaa tat gac cca gaa 1296
Asp Arg Leu Asp Asn Pro Ala Leu Gin Gly Ala Lys Tyr Asp Pro Glu 420 425 430 ggt gag tac ata agg caa tgg ctt ccc gag ctt gcg aga ttg cca act 1344Asp Arg Leu Asp Asn Pro Ala Leu Gin Gly Ala Lys Tyr Asp Pro Glu 420 425 430 ggt gag tac ata agg caa tgg ctt ccc gag ctt gcg aga ttg cca act 1344
Gly Glu Tyr Ile Arg Gin Trp Leu Pro Glu Leu Ala Arg Leu Pro Thr 435 440 445 gaa tgg ate cat cat cca tgg gac get cct tta acc gta ete aaa get 1392Gly Glu Tyr Ile Arg Gin Trp Leu Pro Glu Leu Ala Arg Leu Pro Thr 435 440 445 gaa tgg ate cat cat cca tgg gac get cct tta acc gta ete aaa get 1392
Glu Trp Ile His His Pro Trp Asp Ala Pro Leu Thr Val Leu Lys Ala 450 455 460 tet ggt gtg gaa ete gga aca aac tat gcg aaa ccc att gta gac ate 1440Glu Trp Ile His His Pro Trp Asp Ala Pro Leu Thr Val Leu Lys Ala 450 455 460 tet ggt gtg gaa ete gga aca aac tat gcg aaa ccc att gta gac ate 1440
Ser Gly Val Glu Leu Gly Thr Asn Tyr Ala Lys Pro Ile Val Asp Ile 465 470 475 480 gac aca get cgt gag cta cta get aaa get att tca aga acc cgt gaa 1488Ser Gly Val Glu Leu Gly Thr Asn Tyr Ala Lys Pro Ile Val Asp Ile 465 470 475 480 gac aca get cgt gag cta cta get aaa get att tca aga acc cgt gaa 1488
Asp Thr Ala Arg Glu Leu Leu Ala Lys Ala Ile Ser Arg Thr Arg Glu 485 490 495 gca cag ate atg ate gga gca gca ggc ggc tet ggc ggc ggc tcc gga 1536Asp Thr Ala Arg Glu Leu Leu Ala Lys Ala Ile Ser Arg Thr Arg Glu 485 490 495 gca cag ate atg ate gga gca gca ggc ggc tet ggc ggc ggc tcc gga 1536
Ala Gln Ile Met Ile Gly Ala Ala Gly Gly Ser Gly Gly Gly Ser Gly 500 505 510 ggc tct gag cag aag ctg atc agc gag gag gac ctg gga gaa agc ttg 1584
Gly Ser Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Gly Glu Ser Leu 515 520 525 ttt aag gga cca cgt gat tac aac ccg ata tcg agc acc att tgt cac 1632
Phe Lys Gly Pro Arg Asp Tyr Asn Pro Ile Ser Ser Thr Ile Cys His 530 535 540 ttg acg aat gaa tct gat ggg cac aca aca tcg ttg tat ggt att gga 1680
Leu Thr Asn Glu Ser Asp Gly His Thr Thr Ser Leu Tyr Gly Ile Gly 545 550 555 560 ttt ggt ccc ttc atc att aca aac aag cac ttg ttt aga aga aat aat 1728
Phe Gly Pro Phe Ile Ile Thr Asn Lys His Leu Phe Arg Arg Asn Asn 565 570 575 gga aca ctg ttg gtc caa tca cta cat ggt gta ttc aag gtc aag aac 1776
Gly Thr Leu Leu Val Gln Ser Leu His Gly Val Phe Lys Val Lys Asn 580 585 590 acc acg act ttg caa caa cac ctc att gat ggg agg gac atg ata att 1824
Thr Thr Thr Leu Gln Gln His Leu Ile Asp Gly Arg Asp Met Ile Ile 595 600 605 att cgc atg cct aag gat ttc cca cca ttt cct caa aag ctg aaa ttt 1872
Ile Arg Met Pro Lys Asp Phe Pro Pro Phe Pro Gln Lys Leu Lys Phe 610 615 620 aga gag cca caa agg gaa gag cgc ata tgt ctt gtg aca acc aac ttc 1920
Arg Glu Pro Gln Arg Glu Glu Arg Ile Cys Leu Val Thr Thr Asn Phe 625 630 635 640 caa act taa 1929
Gln Thr <210> 60 <211> 642
<212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 60
Met His His His His His His Lys Met Asp Lys Lys Thr Ile Val Trp 1 5 10 15
Phe Arg Arg Asp Leu Arg Ile Glu Asp Asn Pro Ala Leu Ala Ala Ala 20 25 30
Ala His Glu Gly Ser Val Phe Pro Val Phe Ile Trp Cys Pro Glu Glu 35 40 45
Glu Gly Gln Phe Tyr Pro Gly Arg Ala Ser Arg Trp Trp Met Lys Gln 50 55 60
Ser Leu Ala His Leu Ser Gln Ser Leu Lys Ala Leu Gly Ser Asp Leu 65 70 75 80
Thr Leu Ile Lys Thr His Asn Thr Ile Ser Ala Ile Leu Asp Cys Ile 85 90 95
Arg Val Thr Gly Ala Thr Lys Val Val Phe Asn His Leu Tyr Asp Pro 100 105 110
Val Ser Leu Val Arg Asp His Thr Val Lys Glu Lys Leu Val Glu Arg 115 120 125
Gly Ile Ser Val Gln Ser Tyr Asn Gly Asp Leu Leu Tyr Glu Pro Trp 130 135 140
Glu Ile Tyr Cys Glu Lys Gly Lys Pro Phe Thr Ser Phe Asn Ser Tyr 145 150 155 160
Trp Lys Lys Cys Leu Asp Met Ser Ile Glu Ser Val Met Leu Pro Pro 165 170 175
Pro Trp Arg Leu Met Pro Ile Thr Ala Ala Ala Glu Ala Ile Trp Ala 180 185 190
Cys Ser Ile Glu Glu Leu Gly Leu Glu Asn Glu Ala Glu Lys Pro Ser 195 200 205
Asn Ala Leu Leu Thr Arg Ala Trp Ser Pro Gly Trp Ser Asn Ala Asp 210 215 220
Lys Leu Leu Asn Glu Phe Ile Glu Lys Gln Leu Ile Asp Tyr Ala Lys 225 230 235 240
Asn Ser Lys Lys Val Val Gly Asn Ser Thr Ser Leu Leu Ser Pro Tyr 245 250 255
Leu His Phe Gly Glu Ile Ser Val Arg His Val Phe Gln Cys Ala Arg 260 265 270
Met Lys Gln Ile Ile Trp Ala Arg Asp Lys Asn Ser Glu Gly Glu Glu 275 280 285
Ser Ala Asp Leu Phe Leu Arg Gly Ile Gly Leu Arg Glu Tyr Ser Arg 290 295 300
Tyr Ile Cys Phe Asn Phe Pro Phe Thr His Glu Gln Ser Leu Leu Ser 305 310 315 320
His Leu Arg Phe Phe Pro Trp Asp Ala Asp Val Asp Lys Phe Lys Ala 325 330 335
Trp Arg Gln Gly Arg Thr Gly Tyr Pro Leu Val Asp Ala Gly Met Arg 340 345 350
Glu Leu Trp Ala Thr Gly Trp Met His Asn Arg Ile Arg Val Ile Val 355 360 365
Ser Ser Phe Ala Val Lys Phe Leu Leu Leu Pro Trp Lys Trp Gly Met 370 375 380
Lys Tyr Phe Trp Asp Thr Leu Leu Asp Ala Asp Leu Glu Cys Asp Ile 385 390 395 400
Leu Gly Trp Gln Tyr Ile Ser Gly Ser Ile Pro Asp Gly His Glu Leu 405 410 415
Asp Arg Leu Asp Asn Pro Ala Leu Gln Gly Ala Lys Tyr Asp Pro Glu 420 425 430
Gly Glu Tyr Ile Arg Gln Trp Leu Pro Glu Leu Ala Arg Leu Pro Thr 435 440 445
Glu Trp Ile His His Pro Trp Asp Ala Pro Leu Thr Val Leu Lys Ala 450 455 460
Ser Gly Val Glu Leu Gly Thr Asn Tyr Ala Lys Pro Ile Val Asp Ile 465 470 475 480
Asp Thr Ala Arg Glu Leu Leu Ala Lys Ala Ile Ser Arg Thr Arg Glu 485 490 495
Ala Gln Ile Met Ile Gly Ala Ala Gly Gly Ser Gly Gly Gly Ser Gly 500 505 510
Gly Ser Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Gly Glu Ser Leu 515 520 525
Phe Lys Gly Pro Arg Asp Tyr Asn Pro Ile Ser Ser Thr Ile Cys His 530 535 540
Leu Thr Asn Glu Ser Asp Gly His Thr Thr Ser Leu Tyr Gly Ile Gly 545 550 555 560
Phe Gly Pro Phe Ile Ile Thr Asn Lys His Leu Phe Arg Arg Asn Asn 565 570 575
Gly Thr Leu Leu Val Gln Ser Leu His Gly Val Phe Lys Val Lys Asn 580 585 590
Thr Thr Thr Leu Gln Gln His Leu Ile Asp Gly Arg Asp Met Ile Ile 595 600 605
Ile Arg Met Pro Lys Asp Phe Pro Pro Phe Pro Gln Lys Leu Lys Phe 610 615 620
Arg Glu Pro Gln Arg Glu Glu Arg Ile Cys Leu Val Thr Thr Asn Phe 625 630 635 640
Gln Thr <210> 61 <211> 960
<212> DNA <213> Artificial Sequence <220> <223> - <220> <221> CDS <222> (1)..(930) <400> 61 atg cac cac cac cac cac cac aat gga gct ata gga ggt gac ctt ttg 48
Met His His His His His His Asn Gly Ala Ile Gly Gly Asp Leu Leu 1 5 10 15 ctc aat ttt cct gac atg tcg gtc cta gag cgc caa agg gct cac ctc 96
Leu Asn Phe Pro Asp Met Ser Val Leu Glu Arg Gln Arg Ala His Leu 20 25 30 aag tac ctc aat ccc acc ttt gat tct cct ctc gcc ggc ttc ttt gcc 144
Lys Tyr Leu Asn Pro Thr Phe Asp Ser Pro Leu Ala Gly Phe Phe Ala 35 40 45 gat tct tca atg att acc ggc ggc gag atg gac agc tat ctt tcg act 192
Asp Ser Ser Met Ile Thr Gly Gly Glu Met Asp Ser Tyr Leu Ser Thr 50 55 60 gcc ggt ttg aat ctt ccg atg atg tac ggt gag acg acg gtg gaa ggt 240
Ala Gly Leu Asn Leu Pro Met Met Tyr Gly Glu Thr Thr Val Glu Gly 65 70 75 80 gat tca aga ctc tca att tcg ccg gaa acg acg ctt ggg act gga aat 288
Asp Ser Arg Leu Ser Ile Ser Pro Glu Thr Thr Leu Gly Thr Gly Asn 85 90 95 ttc aag gca gcg aag ttt gat aca gag act aag gat tgt aat gag gcg 336
Phe Lys Ala Ala Lys Phe Asp Thr Glu Thr Lys Asp Cys Asn Glu Ala 100 105 110 gcg aag aag atg acg atg aac aga gat gac cta gta gaa gaa gga gaa 384
Ala Lys Lys Met Thr Met Asn Arg Asp Asp Leu Val Glu Glu Gly Glu 115 120 125 gaa gag aag tcg aaa ata aca gag caa aac aat ggg agc aca aaa agc 432
Glu Glu Lys Ser Lys Ile Thr Glu Gln Asn Asn Gly Ser Thr Lys Ser 130 135 140 atc aag aag atg aaa cac aaa gcc aag aaa gaa gag aac aat ttc tct 480
Ile Lys Lys Met Lys His Lys Ala Lys Lys Glu Glu Asn Asn Phe Ser 145 150 155 160 aat gat tca tct aaa gtg acg aag gaa ttg gag aaa acg gat tat att 528
Asn Asp Ser Ser Lys Val Thr Lys Glu Leu Glu Lys Thr Asp Tyr Ile 165 170 175 ggc ggc tct ggc ggc ggc tcc gga ggc tct aag agc atg tct agc atg 576
Gly Gly Ser Gly Gly Gly Ser Gly Gly Ser Lys Ser Met Ser Ser Met 180 185 190 gtg tca gac acc agt tgc aca ttc cct tca tct gat ggc ata ttc tgg 624
Val Ser Asp Thr Ser Cys Thr Phe Pro Ser Ser Asp Gly Ile Phe Trp 195 200 205 aag cat tgg att caa acc aag gat ggg cag tgt ggc agt cca tta gta 672
Lys His Trp Ile Gln Thr Lys Asp Gly Gln Cys Gly Ser Pro Leu Val 210 215 220 tca act aga gat ggg ttc att gtt ggt ata cac tca gca tcg aat ttc 720
Ser Thr Arg Asp Gly Phe Ile Val Gly Ile His Ser Ala Ser Asn Phe 225 230 235 240 acc aac aca aac aat tat ttc aca agc gtg ccg aaa aac ttc atg gaa 768
Thr Asn Thr Asn Asn Tyr Phe Thr Ser Val Pro Lys Asn Phe Met Glu 245 250 255 ttg ttg aca aat cag gag gcg cag cag tgg gtt agt ggt tgg cga tta 816
Leu Leu Thr Asn Gln Glu Ala Gln Gln Trp Val Ser Gly Trp Arg Leu 260 265 270 aat gct gac tca gta ttg tgg ggg ggc cat aaa gtt ttc atg agc aaa 864
Asn Ala Asp Ser Val Leu Trp Gly Gly His Lys Val Phe Met Ser Lys 275 280 285 cct gaa gag cct ttt cag cca gtt aag gaa gcg act caa ctc atg agt 912
Pro Glu Glu Pro Phe Gln Pro Val Lys Glu Ala Thr Gln Leu Met Ser 290 295 300 gaa ttg gtg tac tcg caa tacccatacg atgttccaga ttacgcttaa 960
Glu Leu Val Tyr Ser Gln 305 310 <210> 62 <211> 310
<212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 62
Met His His His His His His Asn Gly Ala Ile Gly Gly Asp Leu Leu 1 5 10 15
Leu Asn Phe Pro Asp Met Ser Val Leu Glu Arg Gln Arg Ala His Leu 20 25 30
Lys Tyr Leu Asn Pro Thr Phe Asp Ser Pro Leu Ala Gly Phe Phe Ala 35 40 45
Asp Ser Ser Met Ile Thr Gly Gly Glu Met Asp Ser Tyr Leu Ser Thr 50 55 60
Ala Gly Leu Asn Leu Pro Met Met Tyr Gly Glu Thr Thr Val Glu Gly 65 70 75 80
Asp Ser Arg Leu Ser Ile Ser Pro Glu Thr Thr Leu Gly Thr Gly Asn 85 90 95
Phe Lys Ala Ala Lys Phe Asp Thr Glu Thr Lys Asp Cys Asn Glu Ala 100 105 110
Ala Lys Lys Met Thr Met Asn Arg Asp Asp Leu Val Glu Glu Gly Glu 115 120 125
Glu Glu Lys Ser Lys Ile Thr Glu Gln Asn Asn Gly Ser Thr Lys Ser 130 135 140
Ile Lys Lys Met Lys His Lys Ala Lys Lys Glu Glu Asn Asn Phe Ser 145 150 155 160
Asn Asp Ser Ser Lys Val Thr Lys Glu Leu Glu Lys Thr Asp Tyr Ile 165 170 175
Gly Gly Ser Gly Gly Gly Ser Gly Gly Ser Lys Ser Met Ser Ser Met 180 185 190
Val Ser Asp Thr Ser Cys Thr Phe Pro Ser Ser Asp Gly Ile Phe Trp 195 200 205
Lys His Trp Ile Gln Thr Lys Asp Gly Gln Cys Gly Ser Pro Leu Val 210 215 220
Ser Thr Arg Asp Gly Phe Ile Val Gly Ile His Ser Ala Ser Asn Phe 225 230 235 240
Thr Asn Thr Asn Asn Tyr Phe Thr Ser Val Pro Lys Asn Phe Met Glu 245 250 255
Leu Leu Thr Asn Gln Glu Ala Gln Gln Trp Val Ser Gly Trp Arg Leu 260 265 270
Asn Ala Asp Ser Val Leu Trp Gly Gly His Lys Val Phe Met Ser Lys 275 280 285
Pro Glu Glu Pro Phe Gln Pro Val Lys Glu Ala Thr Gln Leu Met Ser 290 295 300
Glu Leu Val Tyr Ser Gln 305 310 <210> 63 <211> 1938
<212> DNA <213> Artificial Sequence <220> <223> - <220> <221> CDS <222> (1)..(1935) <400> 63 atg cac cac cac cac cac cac ggc tct ggc aag atg gac aaa aag act 48
Met His His His His His His Gly Ser Gly Lys Met Asp Lys Lys Thr 1 5 10 15 ata gtt tgg ttt aga aga gac cta agg att gag gat aat cct gca tta 96
Ile Val Trp Phe Arg Arg Asp Leu Arg Ile Glu Asp Asn Pro Ala Leu 20 25 30 gca gca gct gct cac gaa gga tct gtt ttt cct gtc ttc att tgg tgt 144
Ala Ala Ala Ala His Glu Gly Ser Val Phe Pro Val Phe Ile Trp Cys 35 40 45 cct gaa gaa gaa gga cag ttt tat cct gga aga gct tca aga tgg tgg 192
Pro Glu Glu Glu Gly Gln Phe Tyr Pro Gly Arg Ala Ser Arg Trp Trp 50 55 60 atg aaa caa tca ctt gct cac tta tct caa tcc ttg aag gct ctt gga 240
Met Lys Gln Ser Leu Ala His Leu Ser Gln Ser Leu Lys Ala Leu Gly 65 70 75 80 tct gac ctc act tta atc aaa acc cac aac acg att tca gcg atc ttg 288
Ser Asp Leu Thr Leu Ile Lys Thr His Asn Thr Ile Ser Ala Ile Leu 85 90 95 gat tgt atc cgc gtt acc ggt gct aca aaa gtc gtc ttt aac cac ctc 336
Asp Cys Ile Arg Val Thr Gly Ala Thr Lys Val Val Phe Asn His Leu 100 105 110 tat gat cct gtt tcg tta gtt cgg gac cat acc gta aag gag aag ctg 384
Tyr Asp Pro Val Ser Leu Val Arg Asp His Thr Val Lys Glu Lys Leu 115 120 125 gtg gaa cgt ggg atc tct gtg caa agc tac aat gga gat cta ttg tat 432
Val Glu Arg Gly Ile Ser Val Gln Ser Tyr Asn Gly Asp Leu Leu Tyr 130 135 140 gaa ccg tgg gag ata tac tgc gaa aag ggc aaa cct ttt acg agt ttc 480
Glu Pro Trp Glu Ile Tyr Cys Glu Lys Gly Lys Pro Phe Thr Ser Phe 145 150 155 160 aat tct tac tgg aag aaa tgc tta gat atg tcg att gaa tcc gtt atg 528
Asn Ser Tyr Trp Lys Lys Cys Leu Asp Met Ser Ile Glu Ser Val Met 165 170 175 ctt cct cct cct tgg cgg ttg atg cca ata act gca gcg gct gaa gcg 576
Leu Pro Pro Pro Trp Arg Leu Met Pro Ile Thr Ala Ala Ala Glu Ala 180 185 190 att tgg gcg tgt tcg att gaa gaa cta ggg ctg gag aat gag gcc gag 624
Ile Trp Ala Cys Ser Ile Glu Glu Leu Gly Leu Glu Asn Glu Ala Glu 195 200 205 aaa ccg agc aat gcg ttg tta act aga gct tgg tct cca gga tgg agc 672
Lys Pro Ser Asn Ala Leu Leu Thr Arg Ala Trp Ser Pro Gly Trp Ser 210 215 220 aat gct gat aag tta cta aat gag ttc atc gag aag cag ttg ata gat 720
Asn Ala Asp Lys Leu Leu Asn Glu Phe Ile Glu Lys Gln Leu Ile Asp 225 230 235 240 tat gca aag aac agc aag aaa gtt gtt ggg aat tct act tca cta ctt 768
Tyr Ala Lys Asn Ser Lys Lys Val Val Gly Asn Ser Thr Ser Leu Leu 245 250 255 tct ccg tat ctc cat ttc ggg gaa ata agc gtc aga cac gtt ttc cag 816
Ser Pro Tyr Leu His Phe Gly Glu Ile Ser Val Arg His Val Phe Gln 260 265 270 tgt gcc cgg atg aaa caa att ata tgg gca aga gat aag aac agt gaa 864
Cys Ala Arg Met Lys Gln Ile Ile Trp Ala Arg Asp Lys Asn Ser Glu 275 280 285 gga gaa gaa agt gca gat ctt ttt ctt agg gga atc ggt tta aga gag 912
Gly Glu Glu Ser Ala Asp Leu Phe Leu Arg Gly Ile Gly Leu Arg Glu 290 295 300 tat tct cgg tat ata tgt ttc aac ttc ccg ttt act cac gag caa tcg 960
Tyr Ser Arg Tyr Ile Cys Phe Asn Phe Pro Phe Thr His Glu Gln Ser 305 310 315 320 ttg ttg agt cat ctt cgg ttt ttc cct tgg gat gct gat gtt gat aag 1008
Leu Leu Ser His Leu Arg Phe Phe Pro Trp Asp Ala Asp Val Asp Lys 325 330 335 ttc aag gcc tgg aga caa ggc agg acc ggt tat ccg ttg gtg gat gcc 1056
Phe Lys Ala Trp Arg Gln Gly Arg Thr Gly Tyr Pro Leu Val Asp Ala 340 345 350 gga atg aga gag ctt tgg gct acc gga tgg atg cat aac aga ata aga 1104
Gly Met Arg Glu Leu Trp Ala Thr Gly Trp Met His Asn Arg Ile Arg 355 360 365 gtg att gtt tca agc ttt gct gtg aag ttt ctt ctc ctt cca tgg aaa 1152
Val Ile Val Ser Ser Phe Ala Val Lys Phe Leu Leu Leu Pro Trp Lys 370 375 380 tgg gga atg aag tat ttc tgg gat aca ctt ttg gat gct gat ttg gaa 1200
Trp Gly Met Lys Tyr Phe Trp Asp Thr Leu Leu Asp Ala Asp Leu Glu 385 390 395 400 tgt gac atc ctt ggc tgg cag tat atc tct ggg agt atc ccc gat ggc 1248
Cys Asp Ile Leu Gly Trp Gln Tyr Ile Ser Gly Ser Ile Pro Asp Gly 405 410 415 cac gag ctt gat cgc ttg gac aat ccc gcg tta caa ggc gcc aaa tat 1296
His Glu Leu Asp Arg Leu Asp Asn Pro Ala Leu Gln Gly Ala Lys Tyr 420 425 430 gac cca gaa ggt gag tac ata agg caa tgg ctt ccc gag ctt gcg aga 1344
Asp Pro Glu Gly Glu Tyr Ile Arg Gln Trp Leu Pro Glu Leu Ala Arg 435 440 445 ttg cca act gaa tgg atc cat cat cca tgg gac gct cct tta acc gta 1392
Leu Pro Thr Glu Trp Ile His His Pro Trp Asp Ala Pro Leu Thr Val 450 455 460 ctc aaa gct tct ggt gtg gaa ctc gga aca aac tat gcg aaa ccc att 1440
Leu Lys Ala Ser Gly Val Glu Leu Gly Thr Asn Tyr Ala Lys Pro Ile 465 470 475 480 gta gac atc gac aca gct cgt gag cta cta gct aaa gct att tca aga 1488
Val Asp Ile Asp Thr Ala Arg Glu Leu Leu Ala Lys Ala Ile Ser Arg 485 490 495 acc cgt gaa gca cag atc atg atc gga gca gca ggc ggc tct ggc ggc 1536
Thr Arg Glu Ala Gln Ile Met Ile Gly Ala Ala Gly Gly Ser Gly Gly 500 505 510 ggc tcc gga ggc tct gag cag aag ctg atc agc gag gag gac ctg agc 1584
Gly Ser Gly Gly Ser Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Ser 515 520 525 aag agc ctg ttc cgc ggc ctg cgc gac tac aac ccc atc gcc agc agc 1632
Lys Ser Leu Phe Arg Gly Leu Arg Asp Tyr Asn Pro Ile Ala Ser Ser 530 535 540 atc tgc cag ctg aac aac agc agc ggc gcc cgc cag agc gag atg ttc 1680
Ile Cys Gln Leu Asn Asn Ser Ser Gly Ala Arg Gln Ser Glu Met Phe 545 550 555 560 ggc ctg ggc ttc ggc ggc ctg atc gtg acc aac cag cac ctg ttc aag 1728
Gly Leu Gly Phe Gly Gly Leu Ile Val Thr Asn Gln His Leu Phe Lys 565 570 575 cgc aac gac ggc gag ctg acc atc cgc agc cac cac ggc gag ttc gtg 1776
Arg Asn Asp Gly Glu Leu Thr Ile Arg Ser His His Gly Glu Phe Val 580 585 590 gtg aag gac acc aag acc ctg aag ctg ctg ccc tgc aag ggc cgc gac 1824
Val Lys Asp Thr Lys Thr Leu Lys Leu Leu Pro Cys Lys Gly Arg Asp 595 600 605 atc gtg atc atc cgc ctg ccc aag gac ttc ccc ccc ttc ccc aag cgc 1872
Ile Val Ile Ile Arg Leu Pro Lys Asp Phe Pro Pro Phe Pro Lys Arg 610 615 620 ctg caa ttc cgc acc ccc acc acc gag gac cgc gtg tgc ctg atc ggc 1920
Leu Gln Phe Arg Thr Pro Thr Thr Glu Asp Arg Val Cys Leu Ile Gly 625 630 635 640 agc aac ttc cag acc taa 1938
Ser Asn Phe Gln Thr 645 <210> 64 <211> 645
<212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 64
Met His His His His His His Gly Ser Gly Lys Met Asp Lys Lys Thr 1 5 10 15
Ile Val Trp Phe Arg Arg Asp Leu Arg Ile Glu Asp Asn Pro Ala Leu 20 25 30
Ala Ala Ala Ala His Glu Gly Ser Val Phe Pro Val Phe Ile Trp Cys 35 40 45
Pro Glu Glu Glu Gly Gln Phe Tyr Pro Gly Arg Ala Ser Arg Trp Trp 50 55 60
Met Lys Gln Ser Leu Ala His Leu Ser Gln Ser Leu Lys Ala Leu Gly 65 70 75 80
Ser Asp Leu Thr Leu Ile Lys Thr His Asn Thr Ile Ser Ala Ile Leu 85 90 95
Asp Cys Ile Arg Val Thr Gly Ala Thr Lys Val Val Phe Asn His Leu 100 105 110
Tyr Asp Pro Val Ser Leu Val Arg Asp His Thr Val Lys Glu Lys Leu 115 120 125
Val Glu Arg Gly Ile Ser Val Gln Ser Tyr Asn Gly Asp Leu Leu Tyr 130 135 140
Glu Pro Trp Glu Ile Tyr Cys Glu Lys Gly Lys Pro Phe Thr Ser Phe 145 150 155 160
Asn Ser Tyr Trp Lys Lys Cys Leu Asp Met Ser Ile Glu Ser Val Met 165 170 175
Leu Pro Pro Pro Trp Arg Leu Met Pro Ile Thr Ala Ala Ala Glu Ala 180 185 190
Ile Trp Ala Cys Ser Ile Glu Glu Leu Gly Leu Glu Asn Glu Ala Glu 195 200 205
Lys Pro Ser Asn Ala Leu Leu Thr Arg Ala Trp Ser Pro Gly Trp Ser 210 215 220
Asn Ala Asp Lys Leu Leu Asn Glu Phe Ile Glu Lys Gln Leu Ile Asp 225 230 235 240
Tyr Ala Lys Asn Ser Lys Lys Val Val Gly Asn Ser Thr Ser Leu Leu 245 250 255
Ser Pro Tyr Leu His Phe Gly Glu Ile Ser Val Arg His Val Phe Gln 260 265 270
Cys Ala Arg Met Lys Gln Ile Ile Trp Ala Arg Asp Lys Asn Ser Glu 275 280 285
Gly Glu Glu Ser Ala Asp Leu Phe Leu Arg Gly Ile Gly Leu Arg Glu 290 295 300
Tyr Ser Arg Tyr Ile Cys Phe Asn Phe Pro Phe Thr His Glu Gln Ser 305 310 315 320
Leu Leu Ser His Leu Arg Phe Phe Pro Trp Asp Ala Asp Val Asp Lys 325 330 335
Phe Lys Ala Trp Arg Gln Gly Arg Thr Gly Tyr Pro Leu Val Asp Ala 340 345 350
Gly Met Arg Glu Leu Trp Ala Thr Gly Trp Met His Asn Arg Ile Arg 355 360 365
Val Ile Val Ser Ser Phe Ala Val Lys Phe Leu Leu Leu Pro Trp Lys 370 375 380
Trp Gly Met Lys Tyr Phe Trp Asp Thr Leu Leu Asp Ala Asp Leu Glu 385 390 395 400
Cys Asp Ile Leu Gly Trp Gln Tyr Ile Ser Gly Ser Ile Pro Asp Gly 405 410 415
His Glu Leu Asp Arg Leu Asp Asn Pro Ala Leu Gln Gly Ala Lys Tyr 420 425 430
Asp Pro Glu Gly Glu Tyr Ile Arg Gln Trp Leu Pro Glu Leu Ala Arg 435 440 445
Leu Pro Thr Glu Trp Ile His His Pro Trp Asp Ala Pro Leu Thr Val 450 455 460
Leu Lys Ala Ser Gly Val Glu Leu Gly Thr Asn Tyr Ala Lys Pro Ile 465 470 475 480
Val Asp Ile Asp Thr Ala Arg Glu Leu Leu Ala Lys Ala Ile Ser Arg 485 490 495 fhr Arg Glu Ala Gin Ile Met Ile Gly Ala Ala Gly Gly Ser Gly Gly 500 505 510Val Asp Ile Asp Thr Ala Arg Glu Leu Leu Ala Lys Ala Ile Ser Arg 485 490 495 fhr Arg Glu Ala Gin Ile Met Ile Gly Ala Gly Gly Ser Gly Gly 500 505 510
Gly Ser Gly Gly Ser Glu Gin Lys Leu Ile Ser Glu Glu Asp Leu Ser 515 520 525Gly Ser Gly Gly Ser Glu Gin Lys Leu Ile Ser Glu Glu Asp Leu Ser 515 520 525
Lys Ser Leu Phe Arg Gly Leu Arg Asp Tyr Asn Pro Ile Ala Ser Ser 530 535 540Lys Ser Leu Phe Arg Gly Leu Arg Asp Tyr Asn Pro Ile Ala Ser Ser 530 535 540
Ile Cys Gin Leu Asn Asn Ser Ser Gly Ala Arg Gin Ser Glu Met Phe 545 550 555 560Ile Cys Gin Leu Asn Asn Ser Ser Gly Ala Arg Gin Ser Glu Met Phe 545 550 555 560
Gly Leu Gly Phe Gly Gly Leu Ile Val Thr Asn Gin His Leu Phe Lys 565 570 575Gly Leu Gly Ply Gly Gly Leu Ile Val Thr Asn Gin His Leu Phe Lys 565 570 575
Arg Asn Asp Gly Glu Leu Thr Ile Arg Ser His His Gly Glu Phe Val 580 585 590Arg Asn Asp Gly Glu Leu Thr Ile Arg Ser His His Gly Glu Phe Val 580 585 590
Val Lys Asp Thr Lys Thr Leu Lys Leu Leu Pro Cys Lys Gly Arg Asp 595 600 605Val Lys Asp Thr Lys Thr Leu Lys Leu Leu Pro Cys Lys Gly Arg Asp 595 600 605
Ile Val Ile Ile Arg Leu Pro Lys Asp Phe Pro Pro Phe Pro Lys Arg 610 615 620Ile Val Ile Ile Arg Leu Pro Lys Asp Phe Pro Pro Phe Pro Lys Arg 610 615 620
Leu Gin Phe Arg Thr Pro Thr Thr Glu Asp Arg Val Cys Leu Ile Gly 625 630 635 640Leu Gin Phe Arg Thr Pro Thr Thr Glu Asp Arg Val Cys Leu Ile Gly 625 630 635 640
Ser Asn Phe Gln Thr 645
<210> 65
<211> 951
<212> DNA
<213> Artificial Sequence
<220>
<223> -
<220>
<221> CDS
<222> (1)..(948)
<400> 65
atg gag cag aag ctg atc agc gag gag gat ctg aat gga gct ata gga 48
Met Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Asn Gly Ala Ile Gly 1 5 10 15
ggt gac ctt ttg ctc aat ttt cct gac atg tcg gtc cta gag cgc caa 96
Gly Asp Leu Leu Leu Asn Phe Pro Asp Met Ser Val Leu Glu Arg Gln 20 25 30
agg gct cac ctc aag tac ctc aat ccc acc ttt gat tct cct ctc gcc 144
Arg Ala His Leu Lys Tyr Leu Asn Pro Thr Phe Asp Ser Pro Leu Ala 35 40 45
ggc ttc ttt gcc gat tct tca atg att acc ggc ggc gag atg gac agc 192
Gly Phe Phe Ala Asp Ser Ser Met Ile Thr Gly Gly Glu Met Asp Ser 50 55 60
tat ctt tcg act gcc ggt ttg aat ctt ccg atg atg tac ggt gag acg 240
Tyr Leu Ser Thr Ala Gly Leu Asn Leu Pro Met Met Tyr Gly Glu Thr 65 70 75 80
acg gtg gaa ggt gat tca aga ctc tca att tcg ccg gaa acg acg ctt 288
Thr Val Glu Gly Asp Ser Arg Leu Ser Ile Ser Pro Glu Thr Thr Leu 85 90 95
ggg act gga aat ttc aag gca gcg aag ttt gat aca gag act aag gat 336
Gly Thr Gly Asn Phe Lys Ala Ala Lys Phe Asp Thr Glu Thr Lys Asp 100 105 110
tgt aat gag gcg gcg aag aag atg acg atg aac aga gat gac cta gta 384
Cys Asn Glu Ala Ala Lys Lys Met Thr Met Asn Arg Asp Asp Leu Val 115 120 125
gaa gaa gga gaa gaa gag aag tcg aaa ata aca gag caa aac aat ggg 432
Glu Glu Gly Glu Glu Glu Lys Ser Lys Ile Thr Glu Gln Asn Asn Gly 130 135 140
agc aca aaa agc atc aag aag atg aaa cac aaa gcc aag aaa gaa gag 480
Ser Thr Lys Ser Ile Lys Lys Met Lys His Lys Ala Lys Lys Glu Glu 145 150 155 160
aac aat ttc tct aat gat tca tct aaa gtg acg aag gaa ttg gag aaa 528
Asn Asn Phe Ser Asn Asp Ser Ser Lys Val Thr Lys Glu Leu Glu Lys 165 170 175
acg gat tat att ggc ggc tct ggc ggc gga tcc ggc tcc gga ggc aag 576
Thr Asp Tyr Ile Gly Gly Ser Gly Gly Gly Ser Gly Ser Gly Gly Lys 180 185 190
agc atc agc agc acc atg agc gag acc agc gcc acc tac ccc gtg gac 624
Ser Ile Ser Ser Thr Met Ser Glu Thr Ser Ala Thr Tyr Pro Val Asp 195 200 205
aac agc cac ttc tgg aag cac tgg atc agc acc aag gac ggc cac tgc 672
Asn Ser His Phe Trp Lys His Trp Ile Ser Thr Lys Asp Gly His Cys 210 215 220
ggc ctg ccc atc gtg agc acc cgc gac ggc agc atc ctg ggc ctg cac 720
Gly Leu Pro Ile Val Ser Thr Arg Asp Gly Ser Ile Leu Gly Leu His 225 230 235 240
agc ctg gcc aac agc acc aac acc cag aac ttc tac gcc gcc ttc ccc 768
Ser Leu Ala Asn Ser Thr Asn Thr Gln Asn Phe Tyr Ala Ala Phe Pro 245 250 255
gac aac ttc gag acc acc tac ctg agc aac cag gac aac gac aac tgg 816
Asp Asn Phe Glu Thr Thr Tyr Leu Ser Asn Gln Asp Asn Asp Asn Trp 260 265 270
atc aag cag tgg cgc tac aac ccc gac gag gtg tgc tgg ggc agc ctg 864
Ile Lys Gln Trp Arg Tyr Asn Pro Asp Glu Val Cys Trp Gly Ser Leu 275 280 285
caa ctg aag cgc gac atc ccc cag agc ccc ttc acc atc tgc aag ctg 912
Gln Leu Lys Arg Asp Ile Pro Gln Ser Pro Phe Thr Ile Cys Lys Leu 290 295 300
ctg acc gac ctg gac ggc gag ttc gtg tac acc cag taa 951
Leu Thr Asp Leu Asp Gly Glu Phe Val Tyr Thr Gln 305 310 315
<210> 66
<211> 316
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 66
Met Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Asn Gly Ala Ile Gly 1 5 10 15
Gly Asp Leu Leu Leu Asn Phe Pro Asp Met Ser Val Leu Glu Arg Gln 20 25 30
Arg Ala His Leu Lys Tyr Leu Asn Pro Thr Phe Asp Ser Pro Leu Ala 35 40 45
Gly Phe Phe Ala Asp Ser Ser Met Ile Thr Gly Gly Glu Met Asp Ser 50 55 60
Tyr Leu Ser Thr Ala Gly Leu Asn Leu Pro Met Met Tyr Gly Glu Thr 65 70 75 80
Thr Val Glu Gly Asp Ser Arg Leu Ser Ile Ser Pro Glu Thr Thr Leu 85 90 95
Gly Thr Gly Asn Phe Lys Ala Ala Lys Phe Asp Thr Glu Thr Lys Asp 100 105 110
Cys Asn Glu Ala Ala Lys Lys Met Thr Met Asn Arg Asp Asp Leu Val 115 120 125
Glu Glu Gly Glu Glu Glu Lys Ser Lys Ile Thr Glu Gln Asn Asn Gly 130 135 140
Ser Thr Lys Ser Ile Lys Lys Met Lys His Lys Ala Lys Lys Glu Glu 145 150 155 160
Asn Asn Phe Ser Asn Asp Ser Ser Lys Val Thr Lys Glu Leu Glu Lys 165 170 175
Thr Asp Tyr Ile Gly Gly Ser Gly Gly Gly Ser Gly Ser Gly Gly Lys 180 185 190
Ser Ile Ser Ser Thr Met Ser Glu Thr Ser Ala Thr Tyr Pro Val Asp 195 200 205
Asn Ser His Phe Trp Lys His Trp Ile Ser Thr Lys Asp Gly His Cys 210 215 220
Gly Leu Pro Ile Val Ser Thr Arg Asp Gly Ser Ile Leu Gly Leu His 225 230 235 240
Ser Leu Ala Asn Ser Thr Asn Thr Gln Asn Phe Tyr Ala Ala Phe Pro 245 250 255
Asp Asn Phe Glu Thr Thr Tyr Leu Ser Asn Gln Asp Asn Asp Asn Trp 260 265 270
Ile Lys Gln Trp Arg Tyr Asn Pro Asp Glu Val Cys Trp Gly Ser Leu 275 280 285
Gln Leu Lys Arg Asp Ile Pro Gln Ser Pro Phe Thr Ile Cys Lys Leu 290 295 300
Leu Thr Asp Leu Asp Gly Glu Phe Val Tyr Thr Gln 305 310 315
<210> 67
<211> 2304
<212> DNA
<213> Artificial Sequence
<220>
<223> -
<220>
<221> CDS
<222> (1)..(2301)
<400> 67
atg atc aaa ata gcc aca cgt aaa tat tta ggc aaa caa aat gtc tat 48
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr 1 5 10 15
gac att gga gtt gag cgc gac cat aat ttt gca ctc aaa aat ggc ttc 96
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe 20 25 30
ata gct tct aat tgt ttc aat gat act tat aga tat att gac acc gct 144
Ile Ala Ser Asn Cys Phe Asn Asp Thr Tyr Arg Tyr Ile Asp Thr Ala 35 40 45
atc ctc agc gtg gtg cca ttt cac cac ggc ttc ggc atg ttc acc acg 192
Ile Leu Ser Val Val Pro Phe His His Gly Phe Gly Met Phe Thr Thr 50 55 60
ctg ggc tac ttg atc tgc ggc ttt cgg gtc gtg ctc atg tac cgc ttc 240
Leu Gly Tyr Leu Ile Cys Gly Phe Arg Val Val Leu Met Tyr Arg Phe 65 70 75 80
gag gag gag cta ttc ttg cgc agc ttg caa gac tat aag att caa tct 288
Glu Glu Glu Leu Phe Leu Arg Ser Leu Gln Asp Tyr Lys Ile Gln Ser 85 90 95
gcc ctg ctg gtg ccc aca cta ttt agc ttc ttc gct aag agc act ctc 336
Ala Leu Leu Val Pro Thr Leu Phe Ser Phe Phe Ala Lys Ser Thr Leu 100 105 110
atc gac aag tac gac cta agc aac ttg cac gag atc gcc agc ggc ggg 384
Ile Asp Lys Tyr Asp Leu Ser Asn Leu His Glu Ile Ala Ser Gly Gly 115 120 125
gcg ccg ctc agc aag gag gta ggt gag gcc gtg gcc aaa cgc ttc cac 432
Ala Pro Leu Ser Lys Glu Val Gly Glu Ala Val Ala Lys Arg Phe His 130 135 140
cta cca ggc atc cgc cag ggc tac ggc ctg aca gaa aca acc agc gcc 480
Leu Pro Gly Ile Arg Gln Gly Tyr Gly Leu Thr Glu Thr Thr Ser Ala 145 150 155 160
att ctg atc acc ccc gaa ggg gac gac aag cct ggc gca gta ggc aag 528
Ile Leu Ile Thr Pro Glu Gly Asp Asp Lys Pro Gly Ala Val Gly Lys 165 170 175
gtg gtg ccc ttc ttc gag gct aag gtg gtg gac ttg gac acc ggt aag 576
Val Val Pro Phe Phe Glu Ala Lys Val Val Asp Leu Asp Thr Gly Lys 180 185 190 aca ctg ggt gtg aac cag ege ggc gag ctg tgc gtc cgt ggc ccc atg 624Val Val Pro Phe Phe Glu Ala Lys Val Val Asp Leu Asp Thr Gly Lys 180 185 190 aca ctg ggt gtg aac cag ege ggc gag ctg tgc gtc cgt ggc ccc atg 624
Thr Leu Gly Val Asn Gin Arg Gly Glu Leu Cys Val Arg Gly Pro Met 195 200 205 ate atg agc ggc tac gtt aac aac ccc gag get aca aac get ete ate 672Thr Leu Gly Val Asn Gin Arg Gly Glu Leu Cys Val Arg Gly Pro Met 195 200 205 ate atg agc ggc tac gtt aac aac ccc gag get aca aac get ete ate 672
Ile Met Ser Gly Tyr Val Asn Asn Pro Glu Ala Thr Asn Ala Leu Ile 210 215 220 gac aag gac ggc tgg ctg cac agc ggc gac ate gcc tac tgg gac gag 720Ile Met Ser Gly Tyr Val Asn Asn Pro Glu Ala Thr Asn Ala Leu Ile 210 215 220 gac aag gac ggg ctg cac agc ggc gac gac gac gac gac 720
Asp Lys Asp Gly Trp Leu His Ser Gly Asp Ile Ala Tyr Trp Asp Glu 225 230 235 240 gac gag cac ttc ttc ate gtg gac cgg ctg aag agc ctg ate aaa tac 768Asp Lys Asp Gly Trp Leu His Ser Gly Asp Ile Ala Tyr Trp Asp Glu 225 230 235 240 gac gag cac ttc ttc ate gtg gac cgg ctg aag agc ctg ate aaa tac 768
Asp Glu His Phe Phe Ile Val Asp Arg Leu Lys Ser Leu Ile Lys Tyr 245 250 255 aag ggc tac cag gta gcc cca gcc gaa ctg gag agc ate ctg ctg caa 816Asp Glu His Phe Phe Ile Val Asp Arg Leu Lys Ser Leu Ile Lys Tyr 245 250 255 aag ggc tac cag gta gcc cca gcc gaa ctg gag agc ate ctg ctg caa 816
Lys Gly Tyr Gin Val Ala Pro Ala Glu Leu Glu Ser Ile Leu Leu Gin 260 265 ' 270 cac ccc aac ate ttc gac gcc ggg gtc gcc ggc ctg ccc gac gac gat 864Lys Gly Tyr Gin Val Ala Pro Ala Glu Leu Glu Ser Ile Leu Leu Gin 260 265 '270 cac ccc aac ate tc gac gcc ggg gtc gcc gcc ctg ccc gac gac gat 864
His Pro Asn Ile Phe Asp Ala Gly Val Ala Gly Leu Pro Asp Asp Asp 275 280 285 gcc ggc gag ctg ccc gcc gca gtc gtc gtg ctg gaa cac ggt aaa acc 912His Pro Asn Ile Phe Asp Ala Gly Val Ala Gly Leu Pro Asp Asp Asp Asp Asp Asp Asp 275 280 285 gcc ggc gag ctg ccc gcc gtca
Ala Gly Glu Leu Pro Ala Ala Val Val Val Leu Glu His Gly Lys Thr 290 295 300 atg acc gag aag gag ate gtg gac tat gtg gcc agc cag gtt aca acc 960Ala Gly Glu Leu Pro Ala Val Val Val Leu Glu His Gly Lys Thr 290 295 300 atg acc gag aag gag ate gtg gac tat gtg gcc agc cag gtt aca acc 960
Met Thr Glu Lys Glu Ile Val Asp Tyr Val Ala Ser Gin Val Thr Thr 305 310 315 320 gcc aag aag ctg ege ggt ggt gtt gtg ttc gtg gac gag gtg cct aaa 1008Met Thr Glu Lys Glu Ile Val Asp Tyr Val Ala Ser Gin Val Thr Thr 305 310 315 320 gcc aag aag ctg ege ggt ggt gtt gtg ttc gtg gac gag gtg cct aaa 1008
Ala Lys Lys Leu Arg Gly Gly Val Val Phe Val Asp Glu Val Pro Lys 325 330 335 gga ctg acc ggc aag ttg gac gcc ege aag ate ege gag att ete att 1056Ala Lys Lys Leu Arg Gly Gly Val Val Phe Val Asp Glu Val Pro Lys 325 330 335 gga ctg acc ggc aag ttg gac gcc ege aag ate ege gag att ete att 1056
Gly Leu Thr Gly Lys Leu Asp Ala Arg Lys Ile Arg Glu Ile Leu Ile 340 345 350 aag gcc aag aag gga tcc gaa aat ete tat ttc cag agc ggc ggt gcc 1104Gly Leu Thr Gly Lys Leu Asp Ala Arg Lys Ile Arg Glu Ile Leu Ile 340 345 350 aag gcc aag aag gga tcc gaa aat ete tat ttc cag agc ggc ggt gcc 1104
Lys Ala Lys Lys Gly Ser Glu Asn Leu Tyr Phe Gin Ser Gly Gly Ala 355 360 365 aaa aac att aag aag ggc cca gcg cca ttc tac cca ete gaa gac ggg 1152Lys Ala Lys Lys Gly Ser Glu Asn Leu Tyr Phe Gin Ser Gly Gly Ala 355 360 365 aaa aac att aag aag ggc cca gcg cca ttc tac cca ete gaa gac ggg 1152
Lys Asn Ile Lys Lys Gly Pro Ala Pro Phe Tyr Pro Leu Glu Asp Gly 370 375 380 acc gcc ggc gag cag ctg cac aaa gcc atg aag ege tac gcc ctg gtg 1200Lys Asn Ile Lys Lys Gly Pro Ala Pro Phe Tyr Pro Leu Glu Asp Gly 370 375 380 acc gcc cg gag gag cag gag cag gag atg aag ege tac gcc ctg gtg 1200
Thr Ala Gly Glu Gin Leu His Lys Ala Met Lys Arg Tyr Ala Leu Val 385 390 395 400 ccc ggc acc ate gcc ttt acc gac gca cat ate gag gtg gac att acc 1248Thr Ala Gly Glu Gin Leu His Lys Ala Met Lys Arg Tyr Ala Leu Val 385 390 395 400 ccc ggc acc ate gcc ttt acc gac gca cat ate gag gtg gac att acc 1248
Pro Gly Thr Ile Ala Phe Thr Asp Ala His Ile Glu Val Asp Ile Thr 405 410 415 tac gcc gag tac ttc gag atg agc gtt cgg ctg gca gaa get atg aag 1296Pro Gly Thr Ile Ala Phe Thr Asp Ala His Ile Glu Val Asp Ile Thr 405 410 415 tac gcc gag tac ttc gag atg agc gtt cgg ctg gca gaa get atg aag 1296
Tyr Ala Glu Tyr Phe Glu Met Ser Val Arg Leu Ala Glu Ala Met Lys 420 425 430 ege tat ggg ctg aat aca aac cat cgg ate gtg gtg tgc agc gag aat 1344Tyr Ala Glu Tyr Phe Glu Met Ser Val Arg Leu Ala Glu Ala Met Lys 420 425 430 ege tat ggg ctg aat aca aac cat cgg ate gtg gtg tgc agc gag aat 1344
Arg Tyr Gly Leu Asn Thr Asn His Arg Ile Val Val Cys Ser Glu Asn 435 440 445 agc ttg cag ttc ttc atg ccc gtg ttg ggt gcc ctg ttc ate ggt gtg 1392Arg Tyr Gly Leu Asn Thr Asn His Arg Ile Val Val Cys Ser Glu Asn 435 440 445 agc ttg cag ttc ttc atg ccc gtg ttg ggt gcc ctg ttc ate ggt gtg 1392
Ser Leu Gin Phe Phe Met Pro Val Leu Gly Ala Leu Phe Ile Gly Val 450 455 460 get gtg gcc cca get aac gac ate tac aac gag ege gag ctg ctg aac 1440Ser Leu Gin Phe Phe Met Pro Val Leu Gly Ala Leu Phe Ile Gly Val 450 455 460 get gtg gcc cca get aac gac ate tac aac gag ege gag ctg ctg aac 1440
Ala Val Ala Pro Ala Asn Asp Ile Tyr Asn Glu Arg Glu Leu Leu Asn 465 470 475 480 agc atg ggc ate agc cag ccc acc gtc gta ttc gtg agc aag aaa ggg 1488Ala Val Ala Pro Ala Asn Asp Ile Tyr Asn Glu Arg Glu Leu Leu Asn 465 470 475 480 agc atg ggc ate agc cag ccc acc gtc gta ttc gtg agc aag aaa ggg 1488
Ser Met Gly Ile Ser Gin Pro Thr Val Val Phe Val Ser Lys Lys Gly 485 490 495 ctg caa aag ate ete aac gtg caa aag aag cta ccg ate ata caa aag 1536Ser Met Gly Ile Ser Gin Pro Thr Val Val Phe Val Ser Lys Lys Gly 485 490 495 ctg caa aag ate ete aac gtg caa aag aag cta ccg ate ata caa aag 1536
Leu Gin Lys Ile Leu Asn Val Gin Lys Lys Leu Pro Ile Ile Gin Lys 500 505 510 ate ate ate atg gat agc aag acc gac tac cag ggc ttc caa agc atg 1584Leu Gin Lys Ile Leu Asn Val Gin Lys Leu Pro Ile Ile Gin Lys 500 505 510 ate ate ate atg gat agc aag acc gac tac cag ggc ttc caa agc atg 1584
Ile Ile Ile Met Asp Ser Lys Thr Asp Tyr Gin Gly Phe Gin Ser Met 515 520 525 tac acc ttc gtg act tcc cat ttg cca ccc ggc ttc aac gag tac gac 1632Ile Ile Ile Met Asp Ser Lys Thr Asp Tyr Gin Gly Phe Gin Ser Met 515 520 525 tac acc ttc gtg act tcc cat ttg cca ccc ggc ttc aac gag tac gac 1632
Tyr Thr Phe Val Thr Ser His Leu Pro Pro Gly Phe Asn Glu Tyr Asp 530 535 540 ttc gtg ccc gag agc ttc gac cgg gac aaa acc ate gcc ctg ate atg 1680Tyr Thr Phe Val Thr Ser His Leu Pro Pro Gly Phe Asn Glu Tyr Asp 530 535 540 ttc gtg ccc gag agc ttc gac cgg gac aaa acc ate gcc ctg ate atg 1680
Phe Val Pro Glu Ser Phe Asp Arg Asp Lys Thr Ile Ala Leu Ile Met 545 550 555 560 aac agt agt ggc agt acc gga ttg ccc aag ggc gta gcc cta ccg cac 1728Phe Val Pro Glu Ser Phe Asp Arg Asp Lys Thr Ile Ala Leu Ile Met 545 550 555 560 aac agt agt ggc agt acc gga ttg ccc aag ggc gta gcc cta ccg cac 1728
Asn Ser Ser Gly Ser Thr Gly Leu Pro Lys Gly Val Ala Leu Pro His 565 570 575 ege acc get tgt gtc ega ttc agt cat gcc ege gac ccc ate ttc ggc 1776Asn Ser Ser Gly Ser Thr Gly Leu Pro Lys Gly Val Ala Leu Pro His 565 570 575 ege acc get tgt gtc ega ttc agt cat gcc ege gac ccc ate ttc ggc 1776
Arg Thr Ala Cys Val Arg Phe Ser His Ala Arg Asp Pro Ile Phe Gly 580 585 590 aac cag ate ate ccc gcc gag tac tgt tta agc tat gaa acg gaa ata 1824Arg Thr Ala Cys Val Arg Phe Ser His Ala Arg Asp Pro Ile Phe Gly 580 585 590 aac cag ate ate ccc gcc gag tac tgt tta agc tat gaa acg gaa ata 1824
Asn Gin Ile Ile Pro Ala Glu Tyr Cys Leu Ser Tyr Glu Thr Glu Ile 595 600 605 ttg aca gta gaa tat gga tta tta ccg att ggt aaa att gta gaa aag 1872Asn Gin Ile Ile Pro Ala Glu Tyr Cys Leu Ser Tyr Glu Thr Glu Ile 595 600 605 ttg aca gta gaa tat gga tta tta ccg att ggt aaa att gta gaa aag 1872
Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val Glu Lys 610 615 620 cgc ate gaa tgt act gtt tat agc gtt gat aat aat gga aat att tat 1920Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val Glu Lys 610 615 620 cgc ate gaa tgt act gtt tat agc gtt gat aat aat gga aat att tat 1920
Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly Asn Ile Tyr 625 630 635 640 aca caa cct gta gca caa tgg cac gat cgc gga gaa caa gag gtg ttt 1968Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly Asn Ile Tyr 625 630 635 640 aca caa cct gta gca caa tgg cac gat cgc gga gaa caa gag gtg ttt 1968
Thr Gin Pro Val Ala Gin Trp His Asp Arg Gly Glu Gin Glu Val Phe 645 650 655 gag tat tgt ttg gaa gat ggt tca ttg att cgg gca aca aaa gac cat 2016Thr Gin Pro Val Ala Gin Trp His Asp Arg Gly Glu Gin Glu Val Phe 645 650 655 gag tat tgt ttg gaa gat ggt tca ttg att cgg gca aca aaa gac cat 2016
Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp His 660 665 670 aag ttt atg act gtt gat ggt caa atg ttg eca att gat gaa ata ttt 2064Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp His 660 665 670 aag ttt atg act gtt gat ggt caa atg ttg eca att gat gaa ata ttt 2064
Lys Phe Met Thr Val Asp Gly Gin Met Leu Pro Ile Asp Glu Ile Phe 675 680 685 gaa cgt gaa ttg gat ttg atg cgg gtt gat aat ttg ccg aac ggc ggc 2112Lys Phe Met Thr Val Asp Gly Gin Met Leu Pro Ile Asp Glu Ile Phe 675 680 685 gaa cgt gaa ttg gat ttg atg cgg gtt gat aat ttg ccg aac ggc ggc 2112
Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro Asn Gly Gly 690 695 700 aag ate gcc gtc aat tet get tgc aag aac tgg ttc agt agc tta agc 2160Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro Asn Gly Gly 690 695 700 aag ate gcc gtc aat tet get tgc aag aac tgg agt agc tta agc 2160
Lys Ile Ala Val Asn Ser Ala Cys Lys Asn Trp Phe Ser Ser Leu Ser 705 710 715 720 cac ttt gtg ate cac ctt aac agc cac ggc ttc cct ccc gag gtg gag 2208Lys Ile Ala Val Asn Ser Ala Cys Lys Asn Trp Phe Ser Ser Leu Ser 705 710 715 720 cac ttt gtg ate cac ctt aac agc cac ggc ttc cct ccc gag gtg gag 2208
His Phe val Ile His Leu Asn Ser His Gly Phe Pro Pro Glu Val Glu 725 730 735 gag cag gcc gcc ggc acc ctg ccc atg agc tgc gcc cag gag agc ggc 2256His Phe wave Ile His Leu Asn Ser His Gly Phe Pro Pro Glu Val Glu 725 730 735 gag ccc gcc gcc ggc acc ctg ccc atg agc tgc gcc cag gag agc ggc 2256
Glu Gin Ala Ala Gly Thr Leu Pro Met Ser Cys Ala Gin Glu Ser Gly 740 745 750 atg gat aga cac cct get get tgc gcc agc gcc agg ate aac gtc taa 2304Glu Gin Ala Ala Gly Thr Leu Pro Met Ser Cys Ala Gin Glu Ser Gly 740 745 750 atg gat aga cac cct get get tgc gcc agc gcc agg ate aac gtc taa 2304
Met Asp Arg His Pro Ala Ala Cys Ala Ser Ala Arg Ile Asn Val 755 760 765
<210> 68
<211> 767
<212> PRT
<213> Artificial Sequence
<220>
<223> Synthetic Construct
<400> 68
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr 1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe 20 25 30
Ile Ala Ser Asn Cys Phe Asn Asp Thr Tyr Arg Tyr Ile Asp Thr Ala 35 40 45
Ile Leu Ser Val Val Pro Phe His His Gly Phe Gly Met Phe Thr Thr 50 55 60
Leu Gly Tyr Leu Ile Cys Gly Phe Arg Val Val Leu Met Tyr Arg Phe 65 70 75 80
Glu Glu Glu Leu Phe Leu Arg Ser Leu Gln Asp Tyr Lys Ile Gln Ser 85 90 95
Ala Leu Leu Val Pro Thr Leu Phe Ser Phe Phe Ala Lys Ser Thr Leu 100 105 110
Ile Asp Lys Tyr Asp Leu Ser Asn Leu His Glu Ile Ala Ser Gly Gly 115 120 125
Ala Pro Leu Ser Lys Glu Val Gly Glu Ala Val Ala Lys Arg Phe His 130 135 140
Leu Pro Gly Ile Arg Gln Gly Tyr Gly Leu Thr Glu Thr Thr Ser Ala 145 150 155 160
Ile Leu Ile Thr Pro Glu Gly Asp Asp Lys Pro Gly Ala Val Gly Lys 165 170 175
Val Val Pro Phe Phe Glu Ala Lys Val Val Asp Leu Asp Thr Gly Lys 180 185 190
Thr Leu Gly Val Asn Gln Arg Gly Glu Leu Cys Val Arg Gly Pro Met 195 200 205
Ile Met Ser Gly Tyr Val Asn Asn Pro Glu Ala Thr Asn Ala Leu Ile 210 215 220
Asp Lys Asp Gly Trp Leu His Ser Gly Asp Ile Ala Tyr Trp Asp Glu 225 230 235 240
Asp Glu His Phe Phe Ile Val Asp Arg Leu Lys Ser Leu Ile Lys Tyr 245 250 255
Lys Gly Tyr Gln Val Ala Pro Ala Glu Leu Glu Ser Ile Leu Leu Gln 260 265 270
His Pro Asn Ile Phe Asp Ala Gly Val Ala Gly Leu Pro Asp Asp Asp 275 280 285
Ala Gly Glu Leu Pro Ala Ala Val Val Val Leu Glu His Gly Lys Thr 290 295 300
Met Thr Glu Lys Glu Ile Val Asp Tyr Val Ala Ser Gln Val Thr Thr 305 310 315 320
Ala Lys Lys Leu Arg Gly Gly Val Val Phe Val Asp Glu Val Pro Lys 325 330 335
Gly Leu Thr Gly Lys Leu Asp Ala Arg Lys Ile Arg Glu Ile Leu Ile 340 345 350
Lys Ala Lys Lys Gly Ser Glu Asn Leu Tyr Phe Gln Ser Gly Gly Ala 355 360 365
Lys Asn Ile Lys Lys Gly Pro Ala Pro Phe Tyr Pro Leu Glu Asp Gly 370 375 380
Thr Ala Gly Glu Gln Leu His Lys Ala Met Lys Arg Tyr Ala Leu Val 385 390 395 400
Pro Gly Thr Ile Ala Phe Thr Asp Ala His Ile Glu Val Asp Ile Thr 405 410 415
Tyr Ala Glu Tyr Phe Glu Met Ser Val Arg Leu Ala Glu Ala Met Lys 420 425 430
Arg Tyr Gly Leu Asn Thr Asn His Arg Ile Val Val Cys Ser Glu Asn 435 440 445
Ser Leu Gln Phe Phe Met Pro Val Leu Gly Ala Leu Phe Ile Gly Val 450 455 460
Ala Val Ala Pro Ala Asn Asp Ile Tyr Asn Glu Arg Glu Leu Leu Asn 465 470 475 480
Ser Met Gly Ile Ser Gln Pro Thr Val Val Phe Val Ser Lys Lys Gly 485 490 495
Leu Gln Lys Ile Leu Asn Val Gln Lys Lys Leu Pro Ile Ile Gln Lys 500 505 510
Ile Ile Ile Met Asp Ser Lys Thr Asp Tyr Gln Gly Phe Gln Ser Met 515 520 525
Tyr Thr Phe Val Thr Ser His Leu Pro Pro Gly Phe Asn Glu Tyr Asp 530 535 540
Phe Val Pro Glu Ser Phe Asp Arg Asp Lys Thr Ile Ala Leu Ile Met 545 550 555 560
Asn Ser Ser Gly Ser Thr Gly Leu Pro Lys Gly Val Ala Leu Pro His 565 570 575
Arg Thr Ala Cys Val Arg Phe Ser His Ala Arg Asp Pro Ile Phe Gly 580 585 590
Asn Gln Ile Ile Pro Ala Glu Tyr Cys Leu Ser Tyr Glu Thr Glu Ile 595 600 605
Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile Val Glu Lys 610 615 620
Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly Asn Ile Tyr 625 630 635 640
Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln Glu Val Phe 645 650 655
Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr Lys Asp His 660 665 670
Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp Glu Ile Phe 675 680 685
Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro Asn Gly Gly 690 695 700
Lys Ile Ala Val Asn Ser Ala Cys Lys Asn Trp Phe Ser Ser Leu Ser 705 710 715 720
His Phe Val Ile His Leu Asn Ser His Gly Phe Pro Pro Glu Val Glu 725 730 735
Glu Gln Ala Ala Gly Thr Leu Pro Met Ser Cys Ala Gln Glu Ser Gly 740 745 750
Met Asp Arg His Pro Ala Ala Cys Ala Ser Ala Arg Ile Asn Val 755 760 765
<210> 69 <211> 2304
<212> DNA <213> Artificial Sequence <220> <223> - <220> <221> CDS <222> (1)..(1791) <400> 69
atg atc aaa ata gcc aca cgt aaa tat tta ggc aaa caa aat gtc tat 48
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr 1 5 10 15
gac att gga gtt gag cgc gac cat aat ttt gca ctc aaa aat ggc ttc 96
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe 20 25 30
ata gct tct aat tgt ttc aat gat act tat aga tat att gac acc gct 144
Ile Ala Ser Asn Cys Phe Asn Asp Thr Tyr Arg Tyr Ile Asp Thr Ala 35 40 45
atc ctc agc gtg gtg cca ttt cac cac ggc ttc ggc atg ttc acc acg 192
Ile Leu Ser Val Val Pro Phe His His Gly Phe Gly Met Phe Thr Thr 50 55 60
ctg ggc tac ttg atc tgc ggc ttt cgg gtc gtg ctc atg tac cgc ttc 240
Leu Gly Tyr Leu Ile Cys Gly Phe Arg Val Val Leu Met Tyr Arg Phe 65 70 75 80
gag gag gag cta ttc ttg cgc agc ttg caa gac tat aag att caa tct 288
Glu Glu Glu Leu Phe Leu Arg Ser Leu Gln Asp Tyr Lys Ile Gln Ser 85 90 95
gcc ctg ctg gtg ccc aca cta ttt agc ttc ttc gct aag agc act ctc 336
Ala Leu Leu Val Pro Thr Leu Phe Ser Phe Phe Ala Lys Ser Thr Leu 100 105 110
atc gac aag tac gac cta agc aac ttg cac gag atc gcc agc ggc ggg 384
Ile Asp Lys Tyr Asp Leu Ser Asn Leu His Glu Ile Ala Ser Gly Gly 115 120 125
gcg ccg ctc agc aag gag gta ggt gag gcc gtg gcc aaa cgc ttc cac 432
Ala Pro Leu Ser Lys Glu Val Gly Glu Ala Val Ala Lys Arg Phe His 130 135 140
cta cca ggc atc cgc cag ggc tac ggc ctg aca gaa aca acc agc gcc 480
Leu Pro Gly Ile Arg Gln Gly Tyr Gly Leu Thr Glu Thr Thr Ser Ala 145 150 155 160
att ctg atc acc ccc gaa ggg gac gac aag cct ggc gca gta ggc aag 528
Ile Leu Ile Thr Pro Glu Gly Asp Asp Lys Pro Gly Ala Val Gly Lys 165 170 175
gtg gtg ccc ttc ttc gag gct aag gtg gtg gac ttg gac acc ggt aag 576
Val Val Pro Phe Phe Glu Ala Lys Val Val Asp Leu Asp Thr Gly Lys 180 185 190
aca ctg ggt gtg aac cag cgc ggc gag ctg tgc gtc cgt ggc ccc atg 624
Thr Leu Gly Val Asn Gln Arg Gly Glu Leu Cys Val Arg Gly Pro Met 195 200 205
atc atg agc ggc tac gtt aac aac ccc gag gct aca aac gct ctc atc 672
Ile Met Ser Gly Tyr Val Asn Asn Pro Glu Ala Thr Asn Ala Leu Ile 210 215 220
gac aag gac ggc tgg ctg cac agc ggc gac atc gcc tac tgg gac gag 720
Asp Lys Asp Gly Trp Leu His Ser Gly Asp Ile Ala Tyr Trp Asp Glu 225 230 235 240
gac gag cac ttc ttc atc gtg gac cgg ctg aag agc ctg atc aaa tac 768
Asp Glu His Phe Phe Ile Val Asp Arg Leu Lys Ser Leu Ile Lys Tyr 245 250 255
aag ggc tac cag gta gcc cca gcc gaa ctg gag agc atc ctg ctg caa 816
Lys Gly Tyr Gln Val Ala Pro Ala Glu Leu Glu Ser Ile Leu Leu Gln 260 265 270
cac ccc aac atc ttc gac gcc ggg gtc gcc ggc ctg ccc gac gac gat 864
His Pro Asn Ile Phe Asp Ala Gly Val Ala Gly Leu Pro Asp Asp Asp 275 280 285
gcc ggc gag ctg ccc gcc gca gtc gtc gtg ctg gaa cac ggt aaa acc 912
Ala Gly Glu Leu Pro Ala Ala Val Val Val Leu Glu His Gly Lys Thr 290 295 300
atg acc gag aag gag atc gtg gac tat gtg gcc agc cag gtt aca acc 960
Met Thr Glu Lys Glu Ile Val Asp Tyr Val Ala Ser Gln Val Thr Thr 305 310 315 320
gcc aag aag ctg cgc ggt ggt gtt gtg ttc gtg gac gag gtg cct aaa 1008
Ala Lys Lys Leu Arg Gly Gly Val Val Phe Val Asp Glu Val Pro Lys 325 330 335
gga ctg acc ggc aag ttg gac gcc cgc aag atc cgc gag att ctc att 1056
Gly Leu Thr Gly Lys Leu Asp Ala Arg Lys Ile Arg Glu Ile Leu Ile 340 345 350
aag gcc aag aag gga tcc aac gtg gtg gtg cac cag gcc ggc ggt gcc 1104
Lys Ala Lys Lys Gly Ser Asn Val Val Val His Gln Ala Gly Gly Ala 355 360 365
aaa aac att aag aag ggc cca gcg cca ttc tac cca ctc gaa gac ggg 1152
Lys Asn Ile Lys Lys Gly Pro Ala Pro Phe Tyr Pro Leu Glu Asp Gly 370 375 380
acc gcc ggc gag cag ctg cac aaa gcc atg aag cgc tac gcc ctg gtg 1200
Thr Ala Gly Glu Gln Leu His Lys Ala Met Lys Arg Tyr Ala Leu Val 385 390 395 400
ccc ggc acc atc gcc ttt acc gac gca cat atc gag gtg gac att acc 1248
Pro Gly Thr Ile Ala Phe Thr Asp Ala His Ile Glu Val Asp Ile Thr 405 410 415
tac gcc gag tac ttc gag atg agc gtt cgg ctg gca gaa gct atg aag 1296
Tyr Ala Glu Tyr Phe Glu Met Ser Val Arg Leu Ala Glu Ala Met Lys 420 425 430
cgc tat ggg ctg aat aca aac cat cgg atc gtg gtg tgc agc gag aat 1344
Arg Tyr Gly Leu Asn Thr Asn His Arg Ile Val Val Cys Ser Glu Asn 435 440 445
agc ttg cag ttc ttc atg ccc gtg ttg ggt gcc ctg ttc atc ggt gtg 1392
Ser Leu Gln Phe Phe Met Pro Val Leu Gly Ala Leu Phe Ile Gly Val 450 455 460
gct gtg gcc cca gct aac gac atc tac aac gag cgc gag ctg ctg aac 1440
Ala Val Ala Pro Ala Asn Asp Ile Tyr Asn Glu Arg Glu Leu Leu Asn 465 470 475 480
agc atg ggc atc agc cag ccc acc gtc gta ttc gtg agc aag aaa ggg 1488
Ser Met Gly Ile Ser Gln Pro Thr Val Val Phe Val Ser Lys Lys Gly 485 490 495
ctg caa aag atc ctc aac gtg caa aag aag cta ccg atc ata caa aag 1536
Leu Gln Lys Ile Leu Asn Val Gln Lys Lys Leu Pro Ile Ile Gln Lys 500 505 510
atc atc atc atg gat agc aag acc gac tac cag ggc ttc caa agc atg 1584
Ile Ile Ile Met Asp Ser Lys Thr Asp Tyr Gln Gly Phe Gln Ser Met 515 520 525
tac acc ttc gtg act tcc cat ttg cca ccc ggc ttc aac gag tac gac 1632
Tyr Thr Phe Val Thr Ser His Leu Pro Pro Gly Phe Asn Glu Tyr Asp 530 535 540
ttc gtg ccc gag agc ttc gac cgg gac aaa acc atc gcc ctg atc atg 1680
Phe Val Pro Glu Ser Phe Asp Arg Asp Lys Thr Ile Ala Leu Ile Met 545 550 555 560
aac agt agt ggc agt acc gga ttg ccc aag ggc gta gcc cta ccg cac 1728
Asn Ser Ser Gly Ser Thr Gly Leu Pro Lys Gly Val Ala Leu Pro His 565 570 575
cgc acc gct tgt gtc cga ttc agt cat gcc cgc gac ccc atc ttc ggc 1776
Arg Thr Ala Cys Val Arg Phe Ser His Ala Arg Asp Pro Ile Phe Gly 580 585 590
aac cag atc atc ccc gccgagtact gtttaagcta tgaaacggaa atattgacag 1831
Asn Gln Ile Ile Pro 595
tagaatatgg attattaccg attggtaaaa ttgtagaaaa gcgcatcgaa tgtactgttt 1891
atagcgttga taataatgga aatatttata cacaacctgt agcacaatgg cacgatcgcg 1951
gagaacaaga ggtgtttgag tattgtttgg aagatggttc attgattcgg gcaacaaaag 2011
accataagtt tatgactgtt gatggtcaaa tgttgccaat tgatgaaata tttgaacgtg 2071
aattggattt gatgcgggtt gataatttgc cgaacggcgg caagatcgcc gtcaattctg 2131
cttgcaagaa ctggttcagt agcttaagcc actttgtgat ccaccttaac agccacggct 2191
tccctcccga ggtggaggag caggccgccg gcaccctgcc catgagctgc gcccaggaga 2251
gcggcatgga tagacaccct gctgcttgcg ccagcgccag gatcaacgtc taa 2304
<210> 70 <211> 597
<212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 70
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr 1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe 20 25 30
Ile Ala Ser Asn Cys Phe Asn Asp Thr Tyr Arg Tyr Ile Asp Thr Ala 35 40 45
Ile Leu Ser Val Val Pro Phe His His Gly Phe Gly Met Phe Thr Thr 50 55 60
Leu Gly Tyr Leu Ile Cys Gly Phe Arg Val Val Leu Met Tyr Arg Phe 65 70 75 80
Glu Glu Glu Leu Phe Leu Arg Ser Leu Gln Asp Tyr Lys Ile Gln Ser 85 90 95
Ala Leu Leu Val Pro Thr Leu Phe Ser Phe Phe Ala Lys Ser Thr Leu 100 105 110
Ile Asp Lys Tyr Asp Leu Ser Asn Leu His Glu Ile Ala Ser Gly Gly 115 120 125
Ala Pro Leu Ser Lys Glu Val Gly Glu Ala Val Ala Lys Arg Phe His 130 135 140
Leu Pro Gly Ile Arg Gln Gly Tyr Gly Leu Thr Glu Thr Thr Ser Ala 145 150 155 160
Ile Leu Ile Thr Pro Glu Gly Asp Asp Lys Pro Gly Ala Val Gly Lys 165 170 175
Val Val Pro Phe Phe Glu Ala Lys Val Val Asp Leu Asp Thr Gly Lys 180 185 190
Thr Leu Gly Val Asn Gln Arg Gly Glu Leu Cys Val Arg Gly Pro Met 195 200 205
Ile Met Ser Gly Tyr Val Asn Asn Pro Glu Ala Thr Asn Ala Leu Ile 210 215 220
Asp Lys Asp Gly Trp Leu His Ser Gly Asp Ile Ala Tyr Trp Asp Glu 225 230 235 240
Asp Glu His Phe Phe Ile Val Asp Arg Leu Lys Ser Leu Ile Lys Tyr 245 250 255
Lys Gly Tyr Gln Val Ala Pro Ala Glu Leu Glu Ser Ile Leu Leu Gln 260 265 270
His Pro Asn Ile Phe Asp Ala Gly Val Ala Gly Leu Pro Asp Asp Asp 275 280 285
Ala Gly Glu Leu Pro Ala Ala Val Val Val Leu Glu His Gly Lys Thr 290 295 300
Met Thr Glu Lys Glu Ile Val Asp Tyr Val Ala Ser Gln Val Thr Thr 305 310 315 320
Ala Lys Lys Leu Arg Gly Gly Val Val Phe Val Asp Glu Val Pro Lys 325 330 335
Gly Leu Thr Gly Lys Leu Asp Ala Arg Lys Ile Arg Glu Ile Leu Ile 340 345 350
Lys Ala Lys Lys Gly Ser Asn Val Val Val His Gln Ala Gly Gly Ala 355 360 365
Lys Asn Ile Lys Lys Gly Pro Ala Pro Phe Tyr Pro Leu Glu Asp Gly 370 375 380
Thr Ala Gly Glu Gln Leu His Lys Ala Met Lys Arg Tyr Ala Leu Val 385 390 395 400
Pro Gly Thr Ile Ala Phe Thr Asp Ala His Ile Glu Val Asp Ile Thr 405 410 415
Tyr Ala Glu Tyr Phe Glu Met Ser Val Arg Leu Ala Glu Ala Met Lys 420 425 430
Arg Tyr Gly Leu Asn Thr Asn His Arg Ile Val Val Cys Ser Glu Asn 435 440 445
Ser Leu Gln Phe Phe Met Pro Val Leu Gly Ala Leu Phe Ile Gly Val 450 455 460
Ala Val Ala Pro Ala Asn Asp Ile Tyr Asn Glu Arg Glu Leu Leu Asn 465 470 475 480
Ser Met Gly Ile Ser Gln Pro Thr Val Val Phe Val Ser Lys Lys Gly 485 490 495
Leu Gln Lys Ile Leu Asn Val Gln Lys Lys Leu Pro Ile Ile Gln Lys 500 505 510
Ile Ile Ile Met Asp Ser Lys Thr Asp Tyr Gln Gly Phe Gln Ser Met 515 520 525
Tyr Thr Phe Val Thr Ser His Leu Pro Pro Gly Phe Asn Glu Tyr Asp 530 535 540
Phe Val Pro Glu Ser Phe Asp Arg Asp Lys Thr Ile Ala Leu Ile Met 545 550 555 560
Asn Ser Ser Gly Ser Thr Gly Leu Pro Lys Gly Val Ala Leu Pro His 565 570 575
Arg Thr Ala Cys Val Arg Phe Ser His Ala Arg Asp Pro Ile Phe Gly 580 585 590
Asn Gln Ile Ile Pro 595
<210> 71 <211> 2313
<212> DNA <213> Artificial Sequence <220> <223> - <220> <221> CDS <222> (1)..(2310) <400> 71
atg atc aaa ata gcc aca cgt aaa tat tta ggc aaa caa aat gtc tat 48
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr 1 5 10 15
gac att gga gtt gag cgc gac cat aat ttt gca ctc aaa aat ggc ttc 96
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe 20 25 30
ata gct tct aat tgt ttc aat gat act tat aga tat att gac acc gct 144
Ile Ala Ser Asn Cys Phe Asn Asp Thr Tyr Arg Tyr Ile Asp Thr Ala 35 40 45
atc ctc agc gtg gtg cca ttt cac cac ggc ttc ggc atg ttc acc acg 192
Ile Leu Ser Val Val Pro Phe His His Gly Phe Gly Met Phe Thr Thr 50 55 60
ctg ggc tac ttg atc tgc ggc ttt cgg gtc gtg ctc atg tac cgc ttc 240
Leu Gly Tyr Leu Ile Cys Gly Phe Arg Val Val Leu Met Tyr Arg Phe 65 70 75 80
gag gag gag cta ttc ttg cgc agc ttg caa gac tat aag att caa tct 288
Glu Glu Glu Leu Phe Leu Arg Ser Leu Gln Asp Tyr Lys Ile Gln Ser 85 90 95
gcc ctg ctg gtg ccc aca cta ttt agc ttc ttc gct aag agc act ctc 336
Ala Leu Leu Val Pro Thr Leu Phe Ser Phe Phe Ala Lys Ser Thr Leu 100 105 110
atc gac aag tac gac cta agc aac ttg cac gag atc gcc agc ggc ggg 384
Ile Asp Lys Tyr Asp Leu Ser Asn Leu His Glu Ile Ala Ser Gly Gly 115 120 125
gcg ccg ctc agc aag gag gta ggt gag gcc gtg gcc aaa cgc ttc cac 432
Ala Pro Leu Ser Lys Glu Val Gly Glu Ala Val Ala Lys Arg Phe His 130 135 140
cta cca ggc atc cgc cag ggc tac ggc ctg aca gaa aca acc agc gcc 480
Leu Pro Gly Ile Arg Gln Gly Tyr Gly Leu Thr Glu Thr Thr Ser Ala 145 150 155 160
att ctg atc acc ccc gaa ggg gac gac aag cct ggc gca gta ggc aag 528
Ile Leu Ile Thr Pro Glu Gly Asp Asp Lys Pro Gly Ala Val Gly Lys 165 170 175
gtg gtg ccc ttc ttc gag gct aag gtg gtg gac ttg gac acc ggt aag 576
Val Val Pro Phe Phe Glu Ala Lys Val Val Asp Leu Asp Thr Gly Lys 180 185 190
aca ctg ggt gtg aac cag cgc ggc gag ctg tgc gtc cgt ggc ccc atg 624
Thr Leu Gly Val Asn Gln Arg Gly Glu Leu Cys Val Arg Gly Pro Met 195 200 205
atc atg agc ggc tac gtt aac aac ccc gag gct aca aac gct ctc atc 672
Ile Met Ser Gly Tyr Val Asn Asn Pro Glu Ala Thr Asn Ala Leu Ile 210 215 220
gac aag gac ggc tgg ctg cac agc ggc gac atc gcc tac tgg gac gag 720
Asp Lys Asp Gly Trp Leu His Ser Gly Asp Ile Ala Tyr Trp Asp Glu 225 230 235 240
gac gag cac ttc ttc atc gtg gac cgg ctg aag agc ctg atc aaa tac 768
Asp Glu His Phe Phe Ile Val Asp Arg Leu Lys Ser Leu Ile Lys Tyr 245 250 255
aag ggc tac cag gta gcc cca gcc gaa ctg gag agc atc ctg ctg caa 816
Lys Gly Tyr Gln Val Ala Pro Ala Glu Leu Glu Ser Ile Leu Leu Gln 260 265 270
cac ccc aac atc ttc gac gcc ggg gtc gcc ggc ctg ccc gac gac gat 864
His Pro Asn Ile Phe Asp Ala Gly Val Ala Gly Leu Pro Asp Asp Asp 275 280 285
gcc ggc gag ctg ccc gcc gca gtc gtc gtg ctg gaa cac ggt aaa acc 912
Ala Gly Glu Leu Pro Ala Ala Val Val Val Leu Glu His Gly Lys Thr 290 295 300
atg acc gag aag gag atc gtg gac tat gtg gcc agc cag gtt aca acc 960
Met Thr Glu Lys Glu Ile Val Asp Tyr Val Ala Ser Gln Val Thr Thr 305 310 315 320
gcc aag aag ctg cgc ggt ggt gtt gtg ttc gtg gac gag gtg cct aaa 1008
Ala Lys Lys Leu Arg Gly Gly Val Val Phe Val Asp Glu Val Pro Lys 325 330 335
gga ctg acc ggc aag ttg gac gcc cgc aag atc cgc gag att ctc att 1056
Gly Leu Thr Gly Lys Leu Asp Ala Arg Lys Ile Arg Glu Ile Leu Ile 340 345 350
aag gcc aag aag gga tcc gag tcc gtc agc ctg caa agc ggc tca ggt 1104
Lys Ala Lys Lys Gly Ser Glu Ser Val Ser Leu Gln Ser Gly Ser Gly 355 360 365
ggc ggt gcc aaa aac att aag aag ggc cca gcg cca ttc tac cca ctc 1152
Gly Gly Ala Lys Asn Ile Lys Lys Gly Pro Ala Pro Phe Tyr Pro Leu 370 375 380
gaa gac ggg acc gcc ggc gag cag ctg cac aaa gcc atg aag cgc tac 1200
Glu Asp Gly Thr Ala Gly Glu Gln Leu His Lys Ala Met Lys Arg Tyr 385 390 395 400
gcc ctg gtg ccc ggc acc atc gcc ttt acc gac gca cat atc gag gtg 1248
Ala Leu Val Pro Gly Thr Ile Ala Phe Thr Asp Ala His Ile Glu Val 405 410 415
gac att acc tac gcc gag tac ttc gag atg agc gtt cgg ctg gca gaa 1296
Asp Ile Thr Tyr Ala Glu Tyr Phe Glu Met Ser Val Arg Leu Ala Glu 420 425 430
gct atg aag cgc tat ggg ctg aat aca aac cat cgg atc gtg gtg tgc 1344
Ala Met Lys Arg Tyr Gly Leu Asn Thr Asn His Arg Ile Val Val Cys 435 440 445
agc gag aat agc ttg cag ttc ttc atg ccc gtg ttg ggt gcc ctg ttc 1392
Ser Glu Asn Ser Leu Gln Phe Phe Met Pro Val Leu Gly Ala Leu Phe 450 455 460
atc ggt gtg gct gtg gcc cca gct aac gac atc tac aac gag cgc gag 1440
Ile Gly Val Ala Val Ala Pro Ala Asn Asp Ile Tyr Asn Glu Arg Glu 465 470 475 480
ctg ctg aac agc atg ggc atc agc cag ccc acc gtc gta ttc gtg agc 1488
Leu Leu Asn Ser Met Gly Ile Ser Gln Pro Thr Val Val Phe Val Ser 485 490 495
aag aaa ggg ctg caa aag atc ctc aac gtg caa aag aag cta ccg atc 1536
Lys Lys Gly Leu Gln Lys Ile Leu Asn Val Gln Lys Lys Leu Pro Ile 500 505 510
ata caa aag atc atc atc atg gat agc aag acc gac tac cag ggc ttc 1584
Ile Gln Lys Ile Ile Ile Met Asp Ser Lys Thr Asp Tyr Gln Gly Phe 515 520 525
caa agc atg tac acc ttc gtg act tcc cat ttg cca ccc ggc ttc aac 1632
Gln Ser Met Tyr Thr Phe Val Thr Ser His Leu Pro Pro Gly Phe Asn 530 535 540
gag tac gac ttc gtg ccc gag agc ttc gac cgg gac aaa acc atc gcc 1680
Glu Tyr Asp Phe Val Pro Glu Ser Phe Asp Arg Asp Lys Thr Ile Ala 545 550 555 560
ctg atc atg aac agt agt ggc agt acc gga ttg ccc aag ggc gta gcc 1728
Leu Ile Met Asn Ser Ser Gly Ser Thr Gly Leu Pro Lys Gly Val Ala 565 570 575
cta ccg cac cgc acc gct tgt gtc cga ttc agt cat gcc cgc gac ccc 1776
Leu Pro His Arg Thr Ala Cys Val Arg Phe Ser His Ala Arg Asp Pro 580 585 590
atc ttc ggc aac cag atc atc ccc gcc gag tac tgt tta agc tat gaa 1824
Ile Phe Gly Asn Gln Ile Ile Pro Ala Glu Tyr Cys Leu Ser Tyr Glu 595 600 605
acg gaa ata ttg aca gta gaa tat gga tta tta ccg att ggt aaa att 1872
Thr Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile 610 615 620
gta gaa aag cgc atc gaa tgt act gtt tat agc gtt gat aat aat gga 1920
Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly 625 630 635 640
aat att tat aca caa cct gta gca caa tgg cac gat cgc gga gaa caa 1968
Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln 645 650 655
gag gtg ttt gag tat tgt ttg gaa gat ggt tca ttg att cgg gca aca 2016
Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr 660 665 670
aaa gac cat aag ttt atg act gtt gat ggt caa atg ttg cca att gat 2064
Lys Asp His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp 675 680 685
gaa ata ttt gaa cgt gaa ttg gat ttg atg cgg gtt gat aat ttg ccg 2112
Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro 690 695 700
aac ggc ggc aag atc gcc gtc aat tct gct tgc aag aac tgg ttc agt 2160
Asn Gly Gly Lys Ile Ala Val Asn Ser Ala Cys Lys Asn Trp Phe Ser 705 710 715 720
agc tta agc cac ttt gtg atc cac ctt aac agc cac ggc ttc cct ccc 2208
Ser Leu Ser His Phe Val Ile His Leu Asn Ser His Gly Phe Pro Pro 725 730 735
gag gtg gag gag cag gcc gcc ggc acc ctg ccc atg agc tgc gcc cag 2256
Glu Val Glu Glu Gln Ala Ala Gly Thr Leu Pro Met Ser Cys Ala Gln 740 745 750
gag agc ggc atg gat aga cac cct gct gct tgc gcc agc gcc agg atc 2304
Glu Ser Gly Met Asp Arg His Pro Ala Ala Cys Ala Ser Ala Arg Ile 755 760 765
aac gtc taa 2313
Asn Val 770 <210> 72 <211> 770Asn Val 770 <210> 72 <211> 770
<212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 72<212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 72
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr 1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe 20 25 30
Ile Ala Ser Asn Cys Phe Asn Asp Thr Tyr Arg Tyr Ile Asp Thr Ala 35 40 45
Ile Leu Ser Val Val Pro Phe His His Gly Phe Gly Met Phe Thr Thr 50 55 60
Leu Gly Tyr Leu Ile Cys Gly Phe Arg Val Val Leu Met Tyr Arg Phe 65 70 75 80
Glu Glu Glu Leu Phe Leu Arg Ser Leu Gln Asp Tyr Lys Ile Gln Ser 85 90 95
Ala Leu Leu Val Pro Thr Leu Phe Ser Phe Phe Ala Lys Ser Thr Leu 100 105 110
Ile Asp Lys Tyr Asp Leu Ser Asn Leu His Glu Ile Ala Ser Gly Gly 115 120 125
Ala Pro Leu Ser Lys Glu Val Gly Glu Ala Val Ala Lys Arg Phe His 130 135 140
Leu Pro Gly Ile Arg Gln Gly Tyr Gly Leu Thr Glu Thr Thr Ser Ala 145 150 155 160
Ile Leu Ile Thr Pro Glu Gly Asp Asp Lys Pro Gly Ala Val Gly Lys 165 170 175
Val Val Pro Phe Phe Glu Ala Lys Val Val Asp Leu Asp Thr Gly Lys 180 185 190
Thr Leu Gly Val Asn Gln Arg Gly Glu Leu Cys Val Arg Gly Pro Met 195 200 205
Ile Met Ser Gly Tyr Val Asn Asn Pro Glu Ala Thr Asn Ala Leu Ile 210 215 220
Asp Lys Asp Gly Trp Leu His Ser Gly Asp Ile Ala Tyr Trp Asp Glu 225 230 235 240
Asp Glu His Phe Phe Ile Val Asp Arg Leu Lys Ser Leu Ile Lys Tyr 245 250 255
Lys Gly Tyr Gln Val Ala Pro Ala Glu Leu Glu Ser Ile Leu Leu Gln 260 265 270
His Pro Asn Ile Phe Asp Ala Gly Val Ala Gly Leu Pro Asp Asp Asp 275 280 285
Ala Gly Glu Leu Pro Ala Ala Val Val Val Leu Glu His Gly Lys Thr 290 295 300
Met Thr Glu Lys Glu Ile Val Asp Tyr Val Ala Ser Gln Val Thr Thr 305 310 315 320
Ala Lys Lys Leu Arg Gly Gly Val Val Phe Val Asp Glu Val Pro Lys 325 330 335
Gly Leu Thr Gly Lys Leu Asp Ala Arg Lys Ile Arg Glu Ile Leu Ile 340 345 350
Lys Ala Lys Lys Gly Ser Glu Ser Val Ser Leu Gln Ser Gly Ser Gly 355 360 365
Gly Gly Ala Lys Asn Ile Lys Lys Gly Pro Ala Pro Phe Tyr Pro Leu 370 375 380
Glu Asp Gly Thr Ala Gly Glu Gln Leu His Lys Ala Met Lys Arg Tyr 385 390 395 400
Ala Leu Val Pro Gly Thr Ile Ala Phe Thr Asp Ala His Ile Glu Val 405 410 415
Asp Ile Thr Tyr Ala Glu Tyr Phe Glu Met Ser Val Arg Leu Ala Glu 420 425 430
Ala Met Lys Arg Tyr Gly Leu Asn Thr Asn His Arg Ile Val Val Cys 435 440 445
Ser Glu Asn Ser Leu Gln Phe Phe Met Pro Val Leu Gly Ala Leu Phe 450 455 460
Ile Gly Val Ala Val Ala Pro Ala Asn Asp Ile Tyr Asn Glu Arg Glu 465 470 475 480
Leu Leu Asn Ser Met Gly Ile Ser Gln Pro Thr Val Val Phe Val Ser 485 490 495
Lys Lys Gly Leu Gln Lys Ile Leu Asn Val Gln Lys Lys Leu Pro Ile 500 505 510
Ile Gln Lys Ile Ile Ile Met Asp Ser Lys Thr Asp Tyr Gln Gly Phe 515 520 525
Gln Ser Met Tyr Thr Phe Val Thr Ser His Leu Pro Pro Gly Phe Asn 530 535 540
Glu Tyr Asp Phe Val Pro Glu Ser Phe Asp Arg Asp Lys Thr Ile Ala 545 550 555 560
Leu Ile Met Asn Ser Ser Gly Ser Thr Gly Leu Pro Lys Gly Val Ala 565 570 575
Leu Pro His Arg Thr Ala Cys Val Arg Phe Ser His Ala Arg Asp Pro 580 585 590
Ile Phe Gly Asn Gln Ile Ile Pro Ala Glu Tyr Cys Leu Ser Tyr Glu 595 600 605
Thr Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile 610 615 620
Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly 625 630 635 640
Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln 645 650 655
Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr 660 665 670
Lys Asp His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp 675 680 685
Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro 690 695 700
Asn Gly Gly Lys Ile Ala Val Asn Ser Ala Cys Lys Asn Trp Phe Ser 705 710 715 720
Ser Leu Ser His Phe Val Ile His Leu Asn Ser His Gly Phe Pro Pro 725 730 735
Glu Val Glu Glu Gln Ala Ala Gly Thr Leu Pro Met Ser Cys Ala Gln 740 745 750
Glu Ser Gly Met Asp Arg His Pro Ala Ala Cys Ala Ser Ala Arg Ile 755 760 765
Asn Val 770 <210> 73 <211> 2313
<212> DNA <213> Artificial Sequence <220> <223> - <220> <221> CDS <222> (1)..(2310) <400> 73 atg atc aaa ata gcc aca cgt aaa tat tta ggc aaa caa aat gtc tat 48
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr 1 5 10 15 gac att gga gtt gag cgc gac cat aat ttt gca ctc aaa aat ggc ttc 96
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe 20 25 30 ata gct tct aat tgt ttc aat gat act tat aga tat att gac acc gct 144
Ile Ala Ser Asn Cys Phe Asn Asp Thr Tyr Arg Tyr Ile Asp Thr Ala 35 40 45 atc ctc agc gtg gtg cca ttt cac cac ggc ttc ggc atg ttc acc acg 192
Ile Leu Ser Val Val Pro Phe His His Gly Phe Gly Met Phe Thr Thr 50 55 60 ctg ggc tac ttg atc tgc ggc ttt cgg gtc gtg ctc atg tac cgc ttc 240
Leu Gly Tyr Leu Ile Cys Gly Phe Arg Val Val Leu Met Tyr Arg Phe 65 70 75 80 gag gag gag cta ttc ttg cgc agc ttg caa gac tat aag att caa tct 288
Glu Glu Glu Leu Phe Leu Arg Ser Leu Gln Asp Tyr Lys Ile Gln Ser 85 90 95 gcc ctg ctg gtg ccc aca cta ttt agc ttc ttc gct aag agc act ctc 336
Ala Leu Leu Val Pro Thr Leu Phe Ser Phe Phe Ala Lys Ser Thr Leu 100 105 110 atc gac aag tac gac cta agc aac ttg cac gag atc gcc agc ggc ggg 384
Ile Asp Lys Tyr Asp Leu Ser Asn Leu His Glu Ile Ala Ser Gly Gly 115 120 125 gcg ccg ctc agc aag gag gta ggt gag gcc gtg gcc aaa cgc ttc cac 432
Ala Pro Leu Ser Lys Glu Val Gly Glu Ala Val Ala Lys Arg Phe His 130 135 140 cta cca ggc atc cgc cag ggc tac ggc ctg aca gaa aca acc agc gcc 480
Leu Pro Gly Ile Arg Gln Gly Tyr Gly Leu Thr Glu Thr Thr Ser Ala 145 150 155 160 att ctg atc acc ccc gaa ggg gac gac aag cct ggc gca gta ggc aag 528
Ile Leu Ile Thr Pro Glu Gly Asp Asp Lys Pro Gly Ala Val Gly Lys 165 170 175 gtg gtg ccc ttc ttc gag gct aag gtg gtg gac ttg gac acc ggt aag 576
Val Val Pro Phe Phe Glu Ala Lys Val Val Asp Leu Asp Thr Gly Lys 180 185 190 aca ctg ggt gtg aac cag cgc ggc gag ctg tgc gtc cgt ggc ccc atg 624
Thr Leu Gly Val Asn Gln Arg Gly Glu Leu Cys Val Arg Gly Pro Met 195 200 205 atc atg agc ggc tac gtt aac aac ccc gag gct aca aac gct ctc atc 672
Ile Met Ser Gly Tyr Val Asn Asn Pro Glu Ala Thr Asn Ala Leu Ile 210 215 220 gac aag gac ggc tgg ctg cac agc ggc gac atc gcc tac tgg gac gag 720
Asp Lys Asp Gly Trp Leu His Ser Gly Asp Ile Ala Tyr Trp Asp Glu 225 230 235 240 gac gag cac ttc ttc atc gtg gac cgg ctg aag agc ctg atc aaa tac 768
Asp Glu His Phe Phe Ile Val Asp Arg Leu Lys Ser Leu Ile Lys Tyr 245 250 255 aag ggc tac cag gta gcc cca gcc gaa ctg gag agc atc ctg ctg caa 816
Lys Gly Tyr Gln Val Ala Pro Ala Glu Leu Glu Ser Ile Leu Leu Gln 260 265 270 cac ccc aac atc ttc gac gcc ggg gtc gcc ggc ctg ccc gac gac gat 864
His Pro Asn Ile Phe Asp Ala Gly Val Ala Gly Leu Pro Asp Asp Asp 275 280 285 gcc ggc gag ctg ccc gcc gca gtc gtc gtg ctg gaa cac ggt aaa acc 912
Ala Gly Glu Leu Pro Ala Ala Val Val Val Leu Glu His Gly Lys Thr 290 295 300 atg acc gag aag gag atc gtg gac tat gtg gcc agc cag gtt aca acc 960
Met Thr Glu Lys Glu Ile Val Asp Tyr Val Ala Ser Gln Val Thr Thr 305 310 315 320 gcc aag aag ctg cgc ggt ggt gtt gtg ttc gtg gac gag gtg cct aaa 1008
Ala Lys Lys Leu Arg Gly Gly Val Val Phe Val Asp Glu Val Pro Lys 325 330 335 gga ctg acc ggc aag ttg gac gcc cgc aag atc cgc gag att ctc att 1056
Gly Leu Thr Gly Lys Leu Asp Ala Arg Lys Ile Arg Glu Ile Leu Ile 340 345 350 aag gcc aag aag gga tcc gag gaa ata cac ctg caa agc ggc tca ggt 1104
Lys Ala Lys Lys Gly Ser Glu Glu Ile His Leu Gln Ser Gly Ser Gly 355 360 365 ggc ggt gcc aaa aac att aag aag ggc cca gcg cca ttc tac cca ctc 1152
Gly Gly Ala Lys Asn Ile Lys Lys Gly Pro Ala Pro Phe Tyr Pro Leu 370 375 380 gaa gac ggg acc gcc ggc gag cag ctg cac aaa gcc atg aag cgc tac 1200
Glu Asp Gly Thr Ala Gly Glu Gln Leu His Lys Ala Met Lys Arg Tyr 385 390 395 400 gcc ctg gtg ccc ggc acc atc gcc ttt acc gac gca cat atc gag gtg 1248
Ala Leu Val Pro Gly Thr Ile Ala Phe Thr Asp Ala His Ile Glu Val 405 410 415 gac att acc tac gcc gag tac ttc gag atg agc gtt cgg ctg gca gaa 1296
Asp Ile Thr Tyr Ala Glu Tyr Phe Glu Met Ser Val Arg Leu Ala Glu 420 425 430 gct atg aag cgc tat ggg ctg aat aca aac cat cgg atc gtg gtg tgc 1344
Ala Met Lys Arg Tyr Gly Leu Asn Thr Asn His Arg Ile Val Val Cys 435 440 445 agc gag aat agc ttg cag ttc ttc atg ccc gtg ttg ggt gcc ctg ttc 1392
Ser Glu Asn Ser Leu Gln Phe Phe Met Pro Val Leu Gly Ala Leu Phe 450 455 460 atc ggt gtg gct gtg gcc cca gct aac gac atc tac aac gag cgc gag 1440
Ile Gly Val Ala Val Ala Pro Ala Asn Asp Ile Tyr Asn Glu Arg Glu 465 470 475 480 ctg ctg aac agc atg ggc atc agc cag ccc acc gtc gta ttc gtg agc 1488
Leu Leu Asn Ser Met Gly Ile Ser Gln Pro Thr Val Val Phe Val Ser 485 490 495 aag aaa ggg ctg caa aag atc ctc aac gtg caa aag aag cta ccg atc 1536
Lys Lys Gly Leu Gln Lys Ile Leu Asn Val Gln Lys Lys Leu Pro Ile 500 505 510 ata caa aag atc atc atc atg gat agc aag acc gac tac cag ggc ttc 1584
Ile Gln Lys Ile Ile Ile Met Asp Ser Lys Thr Asp Tyr Gln Gly Phe 515 520 525 caa agc atg tac acc ttc gtg act tcc cat ttg cca ccc ggc ttc aac 1632
Gln Ser Met Tyr Thr Phe Val Thr Ser His Leu Pro Pro Gly Phe Asn 530 535 540 gag tac gac ttc gtg ccc gag agc ttc gac cgg gac aaa acc atc gcc 1680
Glu Tyr Asp Phe Val Pro Glu Ser Phe Asp Arg Asp Lys Thr Ile Ala 545 550 555 560 ctg atc atg aac agt agt ggc agt acc gga ttg ccc aag ggc gta gcc 1728
Leu Ile Met Asn Ser Ser Gly Ser Thr Gly Leu Pro Lys Gly Val Ala 565 570 575 cta ccg cac cgc acc gct tgt gtc cga ttc agt cat gcc cgc gac ccc 1776
Leu Pro His Arg Thr Ala Cys Val Arg Phe Ser His Ala Arg Asp Pro 580 585 590 atc ttc ggc aac cag atc atc ccc gcc gag tac tgt tta agc tat gaa 1824
Ile Phe Gly Asn Gln Ile Ile Pro Ala Glu Tyr Cys Leu Ser Tyr Glu 595 600 605 acg gaa ata ttg aca gta gaa tat gga tta tta ccg att ggt aaa att 1872
Thr Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile 610 615 620 gta gaa aag cgc atc gaa tgt act gtt tat agc gtt gat aat aat gga 1920
Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly 625 630 635 640 aat att tat aca caa cct gta gca caa tgg cac gat cgc gga gaa caa 1968
Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln 645 650 655 gag gtg ttt gag tat tgt ttg gaa gat ggt tca ttg att cgg gca aca 2016
Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr 660 665 670 aaa gac cat aag ttt atg act gtt gat ggt caa atg ttg cca att gat 2064
Lys Asp His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp 675 680 685 gaa ata ttt gaa cgt gaa ttg gat ttg atg cgg gtt gat aat ttg ccg 2112
Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro 690 695 700 aac ggc ggc aag atc gcc gtc aat tct gct tgc aag aac tgg ttc agt 2160
Asn Gly Gly Lys Ile Ala Val Asn Ser Ala Cys Lys Asn Trp Phe Ser 705 710 715 720 agc tta agc cac ttt gtg atc cac ctt aac agc cac ggc ttc cct ccc 2208
Ser Leu Ser His Phe Val Ile His Leu Asn Ser His Gly Phe Pro Pro 725 730 735 gag gtg gag gag cag gcc gcc ggc acc ctg ccc atg agc tgc gcc cag 2256
Glu Val Glu Glu Gln Ala Ala Gly Thr Leu Pro Met Ser Cys Ala Gln 740 745 750 gag agc ggc atg gat aga cac cct gct gct tgc gcc agc gcc agg atc 2304
Glu Ser Gly Met Asp Arg His Pro Ala Ala Cys Ala Ser Ala Arg Ile 755 760 765 aac gtc taa 2313
Asn Val 770 <210> 74 <211> 770
<212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 74
Met Ile Lys Ile Ala Thr Arg Lys Tyr Leu Gly Lys Gln Asn Val Tyr 1 5 10 15
Asp Ile Gly Val Glu Arg Asp His Asn Phe Ala Leu Lys Asn Gly Phe 20 25 30
Ile Ala Ser Asn Cys Phe Asn Asp Thr Tyr Arg Tyr Ile Asp Thr Ala 35 40 45
Ile Leu Ser Val Val Pro Phe His His Gly Phe Gly Met Phe Thr Thr 50 55 60
Leu Gly Tyr Leu Ile Cys Gly Phe Arg Val Val Leu Met Tyr Arg Phe 65 70 75 80
Glu Glu Glu Leu Phe Leu Arg Ser Leu Gln Asp Tyr Lys Ile Gln Ser 85 90 95
Ala Leu Leu Val Pro Thr Leu Phe Ser Phe Phe Ala Lys Ser Thr Leu 100 105 110
Ile Asp Lys Tyr Asp Leu Ser Asn Leu His Glu Ile Ala Ser Gly Gly 115 120 125
Ala Pro Leu Ser Lys Glu Val Gly Glu Ala Val Ala Lys Arg Phe His 130 135 140
Leu Pro Gly Ile Arg Gln Gly Tyr Gly Leu Thr Glu Thr Thr Ser Ala 145 150 155 160
Ile Leu Ile Thr Pro Glu Gly Asp Asp Lys Pro Gly Ala Val Gly Lys 165 170 175
Val Val Pro Phe Phe Glu Ala Lys Val Val Asp Leu Asp Thr Gly Lys 180 185 190
Thr Leu Gly Val Asn Gln Arg Gly Glu Leu Cys Val Arg Gly Pro Met 195 200 205
Ile Met Ser Gly Tyr Val Asn Asn Pro Glu Ala Thr Asn Ala Leu Ile 210 215 220
Asp Lys Asp Gly Trp Leu His Ser Gly Asp Ile Ala Tyr Trp Asp Glu 225 230 235 240
Asp Glu His Phe Phe Ile Val Asp Arg Leu Lys Ser Leu Ile Lys Tyr 245 250 255
Lys Gly Tyr Gln Val Ala Pro Ala Glu Leu Glu Ser Ile Leu Leu Gln 260 265 270
His Pro Asn Ile Phe Asp Ala Gly Val Ala Gly Leu Pro Asp Asp Asp 275 280 285
Ala Gly Glu Leu Pro Ala Ala Val Val Val Leu Glu His Gly Lys Thr 290 295 300
Met Thr Glu Lys Glu Ile Val Asp Tyr Val Ala Ser Gln Val Thr Thr 305 310 315 320
Ala Lys Lys Leu Arg Gly Gly Val Val Phe Val Asp Glu Val Pro Lys 325 330 335
Gly Leu Thr Gly Lys Leu Asp Ala Arg Lys Ile Arg Glu Ile Leu Ile 340 345 350
Lys Ala Lys Lys Gly Ser Glu Glu Ile His Leu Gln Ser Gly Ser Gly 355 360 365
Gly Gly Ala Lys Asn Ile Lys Lys Gly Pro Ala Pro Phe Tyr Pro Leu 370 375 380
Glu Asp Gly Thr Ala Gly Glu Gln Leu His Lys Ala Met Lys Arg Tyr 385 390 395 400
Ala Leu Val Pro Gly Thr Ile Ala Phe Thr Asp Ala His Ile Glu Val 405 410 415
Asp Ile Thr Tyr Ala Glu Tyr Phe Glu Met Ser Val Arg Leu Ala Glu 420 425 430
Ala Met Lys Arg Tyr Gly Leu Asn Thr Asn His Arg Ile Val Val Cys 435 440 445
Ser Glu Asn Ser Leu Gln Phe Phe Met Pro Val Leu Gly Ala Leu Phe 450 455 460
Ile Gly Val Ala Val Ala Pro Ala Asn Asp Ile Tyr Asn Glu Arg Glu 465 470 475 480
Leu Leu Asn Ser Met Gly Ile Ser Gln Pro Thr Val Val Phe Val Ser 485 490 495
Lys Lys Gly Leu Gln Lys Ile Leu Asn Val Gln Lys Lys Leu Pro Ile 500 505 510
Ile Gln Lys Ile Ile Ile Met Asp Ser Lys Thr Asp Tyr Gln Gly Phe 515 520 525
Gln Ser Met Tyr Thr Phe Val Thr Ser His Leu Pro Pro Gly Phe Asn 530 535 540
Glu Tyr Asp Phe Val Pro Glu Ser Phe Asp Arg Asp Lys Thr Ile Ala 545 550 555 560
Leu Ile Met Asn Ser Ser Gly Ser Thr Gly Leu Pro Lys Gly Val Ala 565 570 575
Leu Pro His Arg Thr Ala Cys Val Arg Phe Ser His Ala Arg Asp Pro 580 585 590
Ile Phe Gly Asn Gln Ile Ile Pro Ala Glu Tyr Cys Leu Ser Tyr Glu 595 600 605
Thr Glu Ile Leu Thr Val Glu Tyr Gly Leu Leu Pro Ile Gly Lys Ile 610 615 620
Val Glu Lys Arg Ile Glu Cys Thr Val Tyr Ser Val Asp Asn Asn Gly 625 630 635 640
Asn Ile Tyr Thr Gln Pro Val Ala Gln Trp His Asp Arg Gly Glu Gln 645 650 655
Glu Val Phe Glu Tyr Cys Leu Glu Asp Gly Ser Leu Ile Arg Ala Thr 660 665 670
Lys Asp His Lys Phe Met Thr Val Asp Gly Gln Met Leu Pro Ile Asp 675 680 685
Glu Ile Phe Glu Arg Glu Leu Asp Leu Met Arg Val Asp Asn Leu Pro 690 695 700
Asn Gly Gly Lys Ile Ala Val Asn Ser Ala Cys Lys Asn Trp Phe Ser 705 710 715 720
Ser Leu Ser His Phe Val Ile His Leu Asn Ser His Gly Phe Pro Pro 725 730 735
Glu Val Glu Glu Gln Ala Ala Gly Thr Leu Pro Met Ser Cys Ala Gln 740 745 750
Glu Ser Gly Met Asp Arg His Pro Ala Ala Cys Ala Ser Ala Arg Ile 755 760 765
Asn Val 770 <210> 75 <211> 1671Asn Val 770 <210> 75 <211> 1671
<212> DNA <213> Artificial Sequence <220> <223> - <220><212> DNA <213> Artificial Sequence <220> <223> - <220>
<221> CDS <222> (1)..(1668) <400> 75 atg gag cag aag ctg ate agc gag gag gac ctg ggt tet gga gaa gat 48<221> CDS <222> (1) .. (1668) <400> 75 atg gag cag aag ctg ate agc gag gag gac ctg ggt tet gga gaa gat 48
Met Glu Gin Lys Leu Ile Ser Glu Glu Asp Leu Gly Ser Gly Glu Asp 15 10 15 gcc aaa aac att aag aag ggc cca gcg cca ttc tac cca ete gaa gac 96Met Glu Gin Lys Leu Ile Ser Glu Glu Asp Leu Gly Ser Gly Glu Asp 15 10 15 gcc aaa aac att aag aag ggc cca gcg cca ttc tac cca ete gaa gac 96
Ala Lys Asn Ile Lys Lys Gly Pro Ala Pro Phe Tyr Pro Leu Glu Asp 20 25 30 ggg acc gcc ggc gag cag ctg cac aaa gcc atg aag ege tac gcc ctg 144Ala Lys Asn Ile Lys Lys Gly Pro Ala Pro Phe Tyr Pro Leu Glu Asp 20 25 30 ggg acc gcc gg cag ctg cac aaa gcc atg aag ege tac gcc ctg 144
Gly Thr Ala Gly Glu Gin Leu His Lys Ala Met Lys Arg Tyr Ala Leu 35 40 45 gtg ccc ggc acc ate gcc ttt acc gac gca cat ate gag gtg gac att 192Gly Thr Ala Gly Glu Gin Leu His Lys Ala Met Lys Arg Tyr Ala Leu 35 40 45 gtg ccc ggc acc ate gcc ttt acc gac gca cat ate gag gtg gac att 192
Val Pro Gly Thr Ile Ala Phe Thr Asp Ala His Ile Glu Val Asp Ile 50 55 60 acc tac gcc gag tac ttc gag atg agc gtt cgg ctg gca gaa gct atg 240
Thr Tyr Ala Glu Tyr Phe Glu Met Ser Val Arg Leu Ala Glu Ala Met 65 70 75 80 aag cgc tat ggg ctg aat aca aac cat cgg atc gtg gtg tgc agc gag 288
Lys Arg Tyr Gly Leu Asn Thr Asn His Arg Ile Val Val Cys Ser Glu 85 90 95 aat agc ttg cag ttc ttc atg ccc gtg ttg ggt gcc ctg ttc atc ggt 336
Asn Ser Leu Gln Phe Phe Met Pro Val Leu Gly Ala Leu Phe Ile Gly 100 105 110 gtg gct gtg gcc cca gct aac gac atc tac aac gag cgc gag ctg ctg 384
Val Ala Val Ala Pro Ala Asn Asp Ile Tyr Asn Glu Arg Glu Leu Leu 115 120 125 aac agc atg ggc atc agc cag ccc acc gtc gta ttc gtg agc aag aaa 432
Asn Ser Met Gly Ile Ser Gln Pro Thr Val Val Phe Val Ser Lys Lys 130 135 140 ggg ctg caa aag atc ctc aac gtg caa aag aag cta ccg atc ata caa 480
Gly Leu Gln Lys Ile Leu Asn Val Gln Lys Lys Leu Pro Ile Ile Gln 145 150 155 160 aag atc atc atc atg gat agc aag acc gac tac cag ggc ttc caa agc 528
Lys Ile Ile Ile Met Asp Ser Lys Thr Asp Tyr Gln Gly Phe Gln Ser 165 170 175 atg tac acc ttc gtg act tcc cat ttg cca ccc ggc ttc aac gag tac 576
Met Tyr Thr Phe Val Thr Ser His Leu Pro Pro Gly Phe Asn Glu Tyr 180 185 190 gac ttc gtg ccc gag agc ttc gac cgg gac aaa acc atc gcc ctg atc 624
Asp Phe Val Pro Glu Ser Phe Asp Arg Asp Lys Thr Ile Ala Leu Ile 195 200 205 atg aac agt agt ggc agt acc gga ttg ccc aag ggc gta gcc cta ccg 672
Met Asn Ser Ser Gly Ser Thr Gly Leu Pro Lys Gly Val Ala Leu Pro 210 215 220 cac cgc acc gct tgt gtc cga ttc agt cat gcc cgc gac ccc atc ttc 720
His Arg Thr Ala Cys Val Arg Phe Ser His Ala Arg Asp Pro Ile Phe 225 230 235 240 ggc aac cag atc atc ccc gac acc gct atc ctc agc gtg gtg cca ttt 768
Gly Asn Gln Ile Ile Pro Asp Thr Ala Ile Leu Ser Val Val Pro Phe 245 250 255 cac cac ggc ttc ggc atg ttc acc acg ctg ggc tac ttg atc tgc ggc 816
His His Gly Phe Gly Met Phe Thr Thr Leu Gly Tyr Leu Ile Cys Gly 260 265 270 ttt cgg gtc gtg ctc atg tac cgc ttc gag gag gag cta ttc ttg cgc 864
Phe Arg Val Val Leu Met Tyr Arg Phe Glu Glu Glu Leu Phe Leu Arg 275 280 285 agc ttg caa gac tat aag att caa tct gcc ctg ctg gtg ccc aca cta 912
Ser Leu Gln Asp Tyr Lys Ile Gln Ser Ala Leu Leu Val Pro Thr Leu 290 295 300 ttt agc ttc ttc gct aag agc act ctc atc gac aag tac gac cta agc 960
Phe Ser Phe Phe Ala Lys Ser Thr Leu Ile Asp Lys Tyr Asp Leu Ser 305 310 315 320 aac ttg cac gag atc gcc agc ggc ggg gcg ccg ctc agc aag gag gta 1008
Asn Leu His Glu Ile Ala Ser Gly Gly Ala Pro Leu Ser Lys Glu Val 325 330 335 ggt gag gcc gtg gcc aaa cgc ttc cac cta cca ggc atc cgc cag ggc 1056
Gly Glu Ala Val Ala Lys Arg Phe His Leu Pro Gly Ile Arg Gln Gly 340 345 350 tac ggc ctg aca gaa aca acc agc gcc att ctg atc acc ccc gaa ggg 1104
Tyr Gly Leu Thr Glu Thr Thr Ser Ala Ile Leu Ile Thr Pro Glu Gly 355 360 365 gac gac aag cct ggc gca gta ggc aag gtg gtg ccc ttc ttc gag gct 1152
Asp Asp Lys Pro Gly Ala Val Gly Lys Val Val Pro Phe Phe Glu Ala 370 375 380 aag gtg gtg gac ttg gac acc ggt aag aca ctg ggt gtg aac cag cgc 1200
Lys Val Val Asp Leu Asp Thr Gly Lys Thr Leu Gly Val Asn Gln Arg 385 390 395 400 ggc gag ctg tgc gtc cgt ggc ccc atg atc atg agc ggc tac gtt aac 1248
Gly Glu Leu Cys Val Arg Gly Pro Met Ile Met Ser Gly Tyr Val Asn 405 410 415 aac ccc gag gct aca aac gct ctc atc gac aag gac ggc tgg ctg cac 1296
Asn Pro Glu Ala Thr Asn Ala Leu Ile Asp Lys Asp Gly Trp Leu His 420 425 430 agc ggc gac atc gcc tac tgg gac gag gac gag cac ttc ttc atc gtg 1344
Ser Gly Asp Ile Ala Tyr Trp Asp Glu Asp Glu His Phe Phe Ile Val 435 440 445 gac cgg ctg aag agc ctg atc aaa tac aag ggc tac cag gta gcc cca 1392
Asp Arg Leu Lys Ser Leu Ile Lys Tyr Lys Gly Tyr Gln Val Ala Pro 450 455 460 gcc gaa ctg gag agc atc ctg ctg caa cac ccc aac atc ttc gac gcc 1440
Ala Glu Leu Glu Ser Ile Leu Leu Gln His Pro Asn Ile Phe Asp Ala 465 470 475 480 ggg gtc gcc ggc ctg ccc gac gac gat gcc ggc gag ctg ccc gcc gca 1488
Gly Val Ala Gly Leu Pro Asp Asp Asp Ala Gly Glu Leu Pro Ala Ala 485 490 495 gtc gtc gtg ctg gaa cac ggt aaa ggc ggc tct ggc ggc tct gaa aat 1536
Val Val Val Leu Glu His Gly Lys Gly Gly Ser Gly Gly Ser Glu Asn 500 505 510 ttg tat ttc cag agt ggc ggc tct ggc ggc tct tca ccc gag gac gag 1584
Leu Tyr Phe Gln Ser Gly Gly Ser Gly Gly Ser Ser Pro Glu Asp Glu 515 520 525 ctt gct gct aat gaa gag gag ttg cag caa aat gaa caa aag ttg gct 1632
Leu Ala Ala Asn Glu Glu Glu Leu Gln Gln Asn Glu Gln Lys Leu Ala 530 535 540 caa att aag caa aaa ctt caa gct atc aaa tac ggt taa 1671
Gln Ile Lys Gln Lys Leu Gln Ala Ile Lys Tyr Gly 545 550 555 <210> 76 <211> 556
<212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 76
Met Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Gly Ser Gly Glu Asp 1 5 10 15
Ala Lys Asn Ile Lys Lys Gly Pro Ala Pro Phe Tyr Pro Leu Glu Asp 20 25 30
Gly Thr Ala Gly Glu Gln Leu His Lys Ala Met Lys Arg Tyr Ala Leu 35 40 45
Val Pro Gly Thr Ile Ala Phe Thr Asp Ala His Ile Glu Val Asp Ile 50 55 60
Thr Tyr Ala Glu Tyr Phe Glu Met Ser Val Arg Leu Ala Glu Ala Met 65 70 75 80
Lys Arg Tyr Gly Leu Asn Thr Asn His Arg Ile Val Val Cys Ser Glu 85 90 95
Asn Ser Leu Gln Phe Phe Met Pro Val Leu Gly Ala Leu Phe Ile Gly 100 105 110
Val Ala Val Ala Pro Ala Asn Asp Ile Tyr Asn Glu Arg Glu Leu Leu 115 120 125
Asn Ser Met Gly Ile Ser Gln Pro Thr Val Val Phe Val Ser Lys Lys 130 135 140
Gly Leu Gln Lys Ile Leu Asn Val Gln Lys Lys Leu Pro Ile Ile Gln 145 150 155 160
Lys Ile Ile Ile Met Asp Ser Lys Thr Asp Tyr Gln Gly Phe Gln Ser 165 170 175
Met Tyr Thr Phe Val Thr Ser His Leu Pro Pro Gly Phe Asn Glu Tyr 180 185 190
Asp Phe Val Pro Glu Ser Phe Asp Arg Asp Lys Thr Ile Ala Leu Ile 195 200 205
Met Asn Ser Ser Gly Ser Thr Gly Leu Pro Lys Gly Val Ala Leu Pro 210 215 220
His Arg Thr Ala Cys Val Arg Phe Ser His Ala Arg Asp Pro Ile Phe 225 230 235 240
Gly Asn Gln Ile Ile Pro Asp Thr Ala Ile Leu Ser Val Val Pro Phe 245 250 255
His His Gly Phe Gly Met Phe Thr Thr Leu Gly Tyr Leu Ile Cys Gly 260 265 270
Phe Arg Val Val Leu Met Tyr Arg Phe Glu Glu Glu Leu Phe Leu Arg 275 280 285
Ser Leu Gln Asp Tyr Lys Ile Gln Ser Ala Leu Leu Val Pro Thr Leu 290 295 300
Phe Ser Phe Phe Ala Lys Ser Thr Leu Ile Asp Lys Tyr Asp Leu Ser 305 310 315 320
Asn Leu His Glu Ile Ala Ser Gly Gly Ala Pro Leu Ser Lys Glu Val 325 330 335
Gly Glu Ala Val Ala Lys Arg Phe His Leu Pro Gly Ile Arg Gln Gly 340 345 350
Tyr Gly Leu Thr Glu Thr Thr Ser Ala Ile Leu Ile Thr Pro Glu Gly 355 360 365
Asp Asp Lys Pro Gly Ala Val Gly Lys Val Val Pro Phe Phe Glu Ala 370 375 380
Lys Val Val Asp Leu Asp Thr Gly Lys Thr Leu Gly Val Asn Gln Arg 385 390 395 400
Gly Glu Leu Cys Val Arg Gly Pro Met Ile Met Ser Gly Tyr Val Asn 405 410 415
Asn Pro Glu Ala Thr Asn Ala Leu Ile Asp Lys Asp Gly Trp Leu His 420 425 430
Ser Gly Asp Ile Ala Tyr Trp Asp Glu Asp Glu His Phe Phe Ile Val 435 440 445
Asp Arg Leu Lys Ser Leu Ile Lys Tyr Lys Gly Tyr Gln Val Ala Pro 450 455 460
Ala Glu Leu Glu Ser Ile Leu Leu Gln His Pro Asn Ile Phe Asp Ala 465 470 475 480
Gly Val Ala Gly Leu Pro Asp Asp Asp Ala Gly Glu Leu Pro Ala Ala 485 490 495
Val Val Val Leu Glu His Gly Lys Gly Gly Ser Gly Gly Ser Glu Asn 500 505 510
Leu Tyr Phe Gln Ser Gly Gly Ser Gly Gly Ser Ser Pro Glu Asp Glu 515 520 525
Leu Ala Ala Asn Glu Glu Glu Leu Gln Gln Asn Glu Gln Lys Leu Ala 530 535 540
Gln Ile Lys Gln Lys Leu Gln Ala Ile Lys Tyr Gly 545 550 555 <210> 77
<211> 1800 <212> DNA <213> Artificial Sequence <220> <223> - <220> <221> CDS <222> (1)..(1797) <400> 77 atg gag cag aag ctg atc agc gag gag gac ctg ggt tct gga gaa gat 48
Met Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Gly Ser Gly Glu Asp 1 5 10 15 gcc aaa aac att aag aag ggc cca gcg cca ttc tac cca ctc gaa gac 96
Ala Lys Asn Ile Lys Lys Gly Pro Ala Pro Phe Tyr Pro Leu Glu Asp 20 25 30 ggg acc gcc ggc gag cag ctg cac aaa gcc atg aag cgc tac gcc ctg 144
Gly Thr Ala Gly Glu Gln Leu His Lys Ala Met Lys Arg Tyr Ala Leu 35 40 45 gtg ccc ggc acc atc gcc ttt acc gac gca cat atc gag gtg gac att 192
Val Pro Gly Thr Ile Ala Phe Thr Asp Ala His Ile Glu Val Asp Ile 50 55 60 acc tac gcc gag tac ttc gag atg agc gtt cgg ctg gca gaa gct atg 240
Thr Tyr Ala Glu Tyr Phe Glu Met Ser Val Arg Leu Ala Glu Ala Met 65 70 75 80 aag cgc tat ggg ctg aat aca aac cat cgg atc gtg gtg tgc agc gag 288
Lys Arg Tyr Gly Leu Asn Thr Asn His Arg Ile Val Val Cys Ser Glu 85 90 95 aat agc ttg cag ttc ttc atg ccc gtg ttg ggt gcc ctg ttc atc ggt 336
Asn Ser Leu Gln Phe Phe Met Pro Val Leu Gly Ala Leu Phe Ile Gly 100 105 110 gtg gct gtg gcc cca gct aac gac atc tac aac gag cgc gag ctg ctg 384
Val Ala Val Ala Pro Ala Asn Asp Ile Tyr Asn Glu Arg Glu Leu Leu 115 120 125 aac agc atg ggc atc agc cag ccc acc gtc gta ttc gtg agc aag aaa 432
Asn Ser Met Gly Ile Ser Gln Pro Thr Val Val Phe Val Ser Lys Lys 130 135 140 ggg ctg caa aag atc ctc aac gtg caa aag aag cta ccg atc ata caa 480
Gly Leu Gln Lys Ile Leu Asn Val Gln Lys Lys Leu Pro Ile Ile Gln 145 150 155 160 aag atc atc atc atg gat agc aag acc gac tac cag ggc ttc caa agc 528
Lys Ile Ile Ile Met Asp Ser Lys Thr Asp Tyr Gln Gly Phe Gln Ser 165 170 175 atg tac acc ttc gtg act tcc cat ttg cca ccc ggc ttc aac gag tac 576
Met Tyr Thr Phe Val Thr Ser His Leu Pro Pro Gly Phe Asn Glu Tyr 180 185 190 gac ttc gtg ccc gag agc ttc gac cgg gac aaa acc atc gcc ctg atc 624
Asp Phe Val Pro Glu Ser Phe Asp Arg Asp Lys Thr Ile Ala Leu Ile 195 200 205 atg aac agt agt ggc agt acc gga ttg ccc aag ggc gta gcc cta ccg 672
Met Asn Ser Ser Gly Ser Thr Gly Leu Pro Lys Gly Val Ala Leu Pro 210 215 220 cac cgc acc gct tgt gtc cga ttc agt cat gcc cgc gac ccc atc ttc 720
His Arg Thr Ala Cys Val Arg Phe Ser His Ala Arg Asp Pro Ile Phe 225 230 235 240 ggc aac cag atc atc ccc gac acc gct atc ctc agc gtg gtg cca ttt 768
Gly Asn Gln Ile Ile Pro Asp Thr Ala Ile Leu Ser Val Val Pro Phe 245 250 255 cac cac ggc ttc ggc atg ttc acc acg ctg ggc tac ttg atc tgc ggc 816
His His Gly Phe Gly Met Phe Thr Thr Leu Gly Tyr Leu Ile Cys Gly 260 265 270 ttt cgg gtc gtg ctc atg tac cgc ttc gag gag gag cta ttc ttg cgc 864
Phe Arg Val Val Leu Met Tyr Arg Phe Glu Glu Glu Leu Phe Leu Arg 275 280 285 agc ttg caa gac tat aag att caa tct gcc ctg ctg gtg ccc aca cta 912
Ser Leu Gln Asp Tyr Lys Ile Gln Ser Ala Leu Leu Val Pro Thr Leu 290 295 300 ttt agc ttc ttc gct aag agc act ctc atc gac aag tac gac cta agc 960
Phe Ser Phe Phe Ala Lys Ser Thr Leu Ile Asp Lys Tyr Asp Leu Ser 305 310 315 320 aac ttg cac gag atc gcc agc ggc ggg gcg ccg ctc agc aag gag gta 1008
Asn Leu His Glu Ile Ala Ser Gly Gly Ala Pro Leu Ser Lys Glu Val 325 330 335 ggt gag gcc gtg gcc aaa cgc ttc cac cta cca ggc atc cgc cag ggc 1056
Gly Glu Ala Val Ala Lys Arg Phe His Leu Pro Gly Ile Arg Gln Gly 340 345 350 tac ggc ctg aca gaa aca acc agc gcc att ctg atc acc ccc gaa ggg 1104
Tyr Gly Leu Thr Glu Thr Thr Ser Ala Ile Leu Ile Thr Pro Glu Gly 355 360 365 gac gac aag cct ggc gca gta ggc aag gtg gtg ccc ttc ttc gag gct 1152
Asp Asp Lys Pro Gly Ala Val Gly Lys Val Val Pro Phe Phe Glu Ala 370 375 380 aag gtg gtg gac ttg gac acc ggt aag aca ctg ggt gtg aac cag cgc 1200
Lys Val Val Asp Leu Asp Thr Gly Lys Thr Leu Gly Val Asn Gln Arg 385 390 395 400 ggc gag ctg tgc gtc cgt ggc ccc atg atc atg agc ggc tac gtt aac 1248
Gly Glu Leu Cys Val Arg Gly Pro Met Ile Met Ser Gly Tyr Val Asn 405 410 415 aac ccc gag gct aca aac gct ctc atc gac aag gac ggc tgg ctg cac 1296
Asn Pro Glu Ala Thr Asn Ala Leu Ile Asp Lys Asp Gly Trp Leu His 420 425 430 agc ggc gac atc gcc tac tgg gac gag gac gag cac ttc ttc atc gtg 1344
Ser Gly Asp Ile Ala Tyr Trp Asp Glu Asp Glu His Phe Phe Ile Val 435 440 445 gac cgg ctg aag agc ctg atc aaa tac aag ggc tac cag gta gcc cca 1392
Asp Arg Leu Lys Ser Leu Ile Lys Tyr Lys Gly Tyr Gln Val Ala Pro 450 455 460 gcc gaa ctg gag agc atc ctg ctg caa cac ccc aac atc ttc gac gcc 1440
Ala Glu Leu Glu Ser Ile Leu Leu Gln His Pro Asn Ile Phe Asp Ala 465 470 475 480 ggg gtc gcc ggc ctg ccc gac gac gat gcc ggc gag ctg ccc gcc gca 1488
Gly Val Ala Gly Leu Pro Asp Asp Asp Ala Gly Glu Leu Pro Ala Ala 485 490 495 gtc gtc gtg ctg gaa cac ggt aaa ggc ggc tct ggc ggc ggc tcc gga 1536
Val Val Val Leu Glu His Gly Lys Gly Gly Ser Gly Gly Gly Ser Gly 500 505 510 ggc tct tca ccc gag gac gag ctt gct gct aat gaa gag gag ttg cag 1584
Gly Ser Ser Pro Glu Asp Glu Leu Ala Ala Asn Glu Glu Glu Leu Gln 515 520 525 caa aat gaa caa aag ttg gct caa att aag caa aaa ctt caa gct atc 1632
Gln Asn Glu Gln Lys Leu Ala Gln Ile Lys Gln Lys Leu Gln Ala Ile 530 535 540 aaa tac ggt ggc ggc tct ggc ggc ggc gaa aat ttg tat ttc cag agt 1680
Lys Tyr Gly Gly Gly Ser Gly Gly Gly Glu Asn Leu Tyr Phe Gln Ser 545 550 555 560 ggc ggc tcc gga ggc tct tcc ccc gag gac gag ctc cag cag gcc gag 1728
Gly Gly Ser Gly Gly Ser Ser Pro Glu Asp Glu Leu Gln Gln Ala Glu 565 570 575 gag gag ctc tcc cag gcc gag cag aag aac tcc cag ctg aag gag aag 1776
Glu Glu Leu Ser Gln Ala Glu Gln Lys Asn Ser Gln Leu Lys Glu Lys 580 585 590 aac cag cag ctg aag tac ggc taa 1800
Asn Gln Gln Leu Lys Tyr Gly 595 <210> 78 <211> 599
<212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 78
Met Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Gly Ser Gly Glu Asp 1 5 10 15
Ala Lys Asn Ile Lys Lys Gly Pro Ala Pro Phe Tyr Pro Leu Glu Asp 20 25 30
Gly Thr Ala Gly Glu Gln Leu His Lys Ala Met Lys Arg Tyr Ala Leu 35 40 45
Val Pro Gly Thr Ile Ala Phe Thr Asp Ala His Ile Glu Val Asp Ile 50 55 60
Thr Tyr Ala Glu Tyr Phe Glu Met Ser Val Arg Leu Ala Glu Ala Met 65 70 75 80
Lys Arg Tyr Gly Leu Asn Thr Asn His Arg Ile Val Val Cys Ser Glu 85 90 95
Asn Ser Leu Gln Phe Phe Met Pro Val Leu Gly Ala Leu Phe Ile Gly 100 105 110
Val Ala Val Ala Pro Ala Asn Asp Ile Tyr Asn Glu Arg Glu Leu Leu 115 120 125
Asn Ser Met Gly Ile Ser Gln Pro Thr Val Val Phe Val Ser Lys Lys 130 135 140
Gly Leu Gln Lys Ile Leu Asn Val Gln Lys Lys Leu Pro Ile Ile Gln 145 150 155 160
Lys Ile Ile Ile Met Asp Ser Lys Thr Asp Tyr Gln Gly Phe Gln Ser 165 170 175
Met Tyr Thr Phe Val Thr Ser His Leu Pro Pro Gly Phe Asn Glu Tyr 180 185 190
Asp Phe Val Pro Glu Ser Phe Asp Arg Asp Lys Thr Ile Ala Leu Ile 195 200 205
Met Asn Ser Ser Gly Ser Thr Gly Leu Pro Lys Gly Val Ala Leu Pro 210 215 220
His Arg Thr Ala Cys Val Arg Phe Ser His Ala Arg Asp Pro Ile Phe 225 230 235 240
Gly Asn Gln Ile Ile Pro Asp Thr Ala Ile Leu Ser Val Val Pro Phe 245 250 255
His His Gly Phe Gly Met Phe Thr Thr Leu Gly Tyr Leu Ile Cys Gly 260 265 270
Phe Arg Val Val Leu Met Tyr Arg Phe Glu Glu Glu Leu Phe Leu Arg 275 280 285
Ser Leu Gln Asp Tyr Lys Ile Gln Ser Ala Leu Leu Val Pro Thr Leu 290 295 300
Phe Ser Phe Phe Ala Lys Ser Thr Leu Ile Asp Lys Tyr Asp Leu Ser 305 310 315 320
Asn Leu His Glu Ile Ala Ser Gly Gly Ala Pro Leu Ser Lys Glu Val 325 330 335
Gly Glu Ala Val Ala Lys Arg Phe His Leu Pro Gly Ile Arg Gln Gly 340 345 350
Tyr Gly Leu Thr Glu Thr Thr Ser Ala Ile Leu Ile Thr Pro Glu Gly 355 360 365
Asp Asp Lys Pro Gly Ala Val Gly Lys Val Val Pro Phe Phe Glu Ala 370 375 380
Lys Val Val Asp Leu Asp Thr Gly Lys Thr Leu Gly Val Asn Gln Arg 385 390 395 400
Gly Glu Leu Cys Val Arg Gly Pro Met Ile Met Ser Gly Tyr Val Asn 405 410 415
Asn Pro Glu Ala Thr Asn Ala Leu Ile Asp Lys Asp Gly Trp Leu His 420 425 430
Ser Gly Asp Ile Ala Tyr Trp Asp Glu Asp Glu His Phe Phe Ile Val 435 440 445
Asp Arg Leu Lys Ser Leu Ile Lys Tyr Lys Gly Tyr Gln Val Ala Pro 450 455 460
Ala Glu Leu Glu Ser Ile Leu Leu Gln His Pro Asn Ile Phe Asp Ala 465 470 475 480
Gly Val Ala Gly Leu Pro Asp Asp Asp Ala Gly Glu Leu Pro Ala Ala 485 490 495
Val Val Val Leu Glu His Gly Lys Gly Gly Ser Gly Gly Gly Ser Gly 500 505 510
Gly Ser Ser Pro Glu Asp Glu Leu Ala Ala Asn Glu Glu Glu Leu Gln 515 520 525
Gln Asn Glu Gln Lys Leu Ala Gln Ile Lys Gln Lys Leu Gln Ala Ile 530 535 540
Lys Tyr Gly Gly Gly Ser Gly Gly Gly Glu Asn Leu Tyr Phe Gln Ser 545 550 555 560
Gly Gly Ser Gly Gly Ser Ser Pro Glu Asp Glu Leu Gln Gln Ala Glu 565 570 575
Glu Glu Leu Ser Gln Ala Glu Gln Lys Asn Ser Gln Leu Lys Glu Lys 580 585 590
Asn Gln Gln Leu Lys Tyr Gly 595 <210> 79 <211> 381
<212> DNA <213> Artificial Sequence <220> <223> - <220> <221> CDS <222> (1)..(378) <400> 79 atg tcc ccg gaa gat gag atc cag caa ctg gaa gaa gaa atc gct cag 48
Met Ser Pro Glu Asp Glu Ile Gln Gln Leu Glu Glu Glu Ile Ala Gln 1 5 10 15 ctg gaa cag aaa aac gca gcg ctg aaa gag aaa aac cag gcg ctg aaa 96
Leu Glu Gln Lys Asn Ala Ala Leu Lys Glu Lys Asn Gln Ala Leu Lys 20 25 30 tac ggt ggc ggc tct ggc ggc tct aac gtg gtg gtg cac cag gcc ggc 144
Tyr Gly Gly Gly Ser Gly Gly Ser Asn Val Val Val His Gln Ala Gly 35 40 45 ggc tct ggc ggc tct acc atg acc gag aag gag atc gtg gac tat gtg 192
Gly Ser Gly Gly Ser Thr Met Thr Glu Lys Glu Ile Val Asp Tyr Val 50 55 60 gcc agc cag gtt aca acc gcc aag aag ctg cgc ggt ggt gtt gtg ttc 240
Ala Ser Gln Val Thr Thr Ala Lys Lys Leu Arg Gly Gly Val Val Phe 65 70 75 80 gtg gac gag gtg cct aaa gga ctg acc ggc aag ttg gac gcc cgc aag 288
Val Asp Glu Val Pro Lys Gly Leu Thr Gly Lys Leu Asp Ala Arg Lys 85 90 95 atc cgc gag att ctc att aag gcc aag aag ggc ggc aag atc gcc gtg 336
Ile Arg Glu Ile Leu Ile Lys Ala Lys Lys Gly Gly Lys Ile Ala Val 100 105 110 aat agt ggt tct gga tac cca tac gat gtt cca gat tac gct taa 381
Asn Ser Gly Ser Gly Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 115 120 125
<210> 80 <211> 126 <212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 80
Met Ser Pro Glu Asp Glu Ile Gln Gln Leu Glu Glu Glu Ile Ala Gln 1 5 10 15
Leu Glu Gln Lys Asn Ala Ala Leu Lys Glu Lys Asn Gln Ala Leu Lys 20 25 30
Tyr Gly Gly Gly Ser Gly Gly Ser Asn Val Val Val His Gln Ala Gly 35 40 45
Gly Ser Gly Gly Ser Thr Met Thr Glu Lys Glu Ile Val Asp Tyr Val 50 55 60
Ala Ser Gln Val Thr Thr Ala Lys Lys Leu Arg Gly Gly Val Val Phe 65 70 75 80
Val Asp Glu Val Pro Lys Gly Leu Thr Gly Lys Leu Asp Ala Arg Lys 85 90 95
Ile Arg Glu Ile Leu Ile Lys Ala Lys Lys Gly Gly Lys Ile Ala Val 100 105 110
Asn Ser Gly Ser Gly Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 115 120 125
<210> 81 <211> 1049 <212> DNA <213> Artificial Sequence <220> <223> - <220> <221> promoter <222> (17)..(294) <220> <221> CDS <222> (330)..(1046) <400> 81
atacggtggc accggtactc tatcaatgat agagttgctg gccggggtac cactctatca 60
atgatagagt ttgacacagc gcagggactc tatcaatgat agagtgcggc cgcatgtcac 120
gactctatca atgatagagt tcgcccgtac atccaaactc tatcaatgat agagttgcgg 180
cgtttggtcc tactctatca atgatagagt agtgggtgcc gccaaaggat cctgtacggg 240
ccagatatac gcgttgagat cttagagggt atataatgga agctcgactt ccagctcgag 300
ggcaatccgg tactgttggt aaagccacc atg gtg agc aag ggc gag gag ctg 353
Met Val Ser Lys Gly Glu Glu Leu 1 5
ttc acc ggg gtg gtg ccc atc ctg gtc gag ctg gac ggc gac gta aac 401
Phe Thr Gly Val Val Pro Ile Leu Val Glu Leu Asp Gly Asp Val Asn 10 15 20
ggc cac aag ttc agc gtg tcc ggc gag ggc gag ggc gat gcc acc tac 449
Gly His Lys Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr 25 30 35 40
ggc aag ctg acc ctg aag ttc atc tgc acc acc ggc aag ctg ccc gtg 497
Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr Thr Gly Lys Leu Pro Val 45 50 55
ccc tgg ccc acc ctc gtg acc acc ttc ggc tac ggc ctg atg tgc ttc 545
Pro Trp Pro Thr Leu Val Thr Thr Phe Gly Tyr Gly Leu Met Cys Phe 60 65 70
gcc cgc tac ccc gac cac atg aag cag cac gac ttc ttc aag tcc gcc 593
Ala Arg Tyr Pro Asp His Met Lys Gln His Asp Phe Phe Lys Ser Ala 75 80 85
atg ccc gaa ggc tac gtc cag gag cgc acc atc ttc ttc aag gac gac 641
Met Pro Glu Gly Tyr Val Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp 90 95 100
ggc aac tac aag acc cgc gcc gag gtg aag ttc gag ggc gac acc ctg 689
Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu 105 110 115 120
gtg aac cgc atc gag ctg aag ggc atc gac ttc aag gag gac ggc aac 737
Val Asn Arg Ile Glu Leu Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn 125 130 135
atc ctg ggg cac aag ctg gag tac aac tac aac agc cac aac gtc tat 785
Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr 140 145 150
atc atg gcc gac aag cag aag aac ggc atc aag gtg aac ttc aag atc 833
Ile Met Ala Asp Lys Gln Lys Asn Gly Ile Lys Val Asn Phe Lys Ile 155 160 165
cgc cac aac atc gag gac ggc agc gtg cag ctc gcc gac cac tac cag 881
Arg His Asn Ile Glu Asp Gly Ser Val Gln Leu Ala Asp His Tyr Gln 170 175 180
cag aac acc ccc atc ggc gac ggc ccc gtg ctg ctg ccc gac aac cac 929
Gln Asn Thr Pro Ile Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His 185 190 195 200
tac ctg agc tac cag tcc gcc ctg agc aaa gac ccc aac gag aag cgc 977
Tyr Leu Ser Tyr Gln Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg 205 210 215
gat cac atg gtc ctg ctg gag ttc gtg acc gcc gcc ggg atc act ctc 1025
Asp His Met Val Leu Leu Glu Phe Val Thr Ala Ala Gly Ile Thr Leu 220 225 230
ggc atg gac gag ctg tac aag taa 1049
Gly Met Asp Glu Leu Tyr Lys 235
<210> 82 <211> 239 <212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 82
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu 1 5 10 15
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 20 25 30
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile 35 40 45
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 50 55 60
Phe Gly Tyr Gly Leu Met Cys Phe Ala Arg Tyr Pro Asp His Met Lys 65 70 75 80
Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu 85 90 95
Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 100 105 110
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly 115 120 125
Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr 130 135 140
Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn 145 150 155 160
Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser 165 170 175
Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly 180 185 190
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gln Ser Ala Leu 195 200 205
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 210 215 220
Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys 225 230 235
<210> 83 <211> 2865 <212> DNA <213> Artificial Sequence <220> <223> - <220> <221> CDS <222> (1)..(2856) <400> 83
atg cac cac cac cac cac cac gac tac aaa gac cat gac ggt gat tat 48
Met His His His His His His Asp Tyr Lys Asp His Asp Gly Asp Tyr 1 5 10 15
aaa gat cat gac atc gat tac aag gat gac gat gac aag atg gcc ccc 96
Lys Asp His Asp Ile Asp Tyr Lys Asp Asp Asp Asp Lys Met Ala Pro 20 25 30
aag aag aag agg aag gtg ggc att cac cgc ggg gta cct atg gtg gac 144
Lys Lys Lys Arg Lys Val Gly Ile His Arg Gly Val Pro Met Val Asp 35 40 45
ttg agg aca ctc ggt tat tcg caa cag caa cag gag aaa atc aag cct 192
Leu Arg Thr Leu Gly Tyr Ser Gln Gln Gln Gln Glu Lys Ile Lys Pro 50 55 60
aag gtc agg agc acc gtc gcg caa cac cac gag gcg ctt gtg ggg cat 240
Lys Val Arg Ser Thr Val Ala Gln His His Glu Ala Leu Val Gly His 65 70 75 80
ggc ttc act cat gcg cat att gtc gcg ctt tca cag cac cct gcg gcg 288
Gly Phe Thr His Ala His Ile Val Ala Leu Ser Gln His Pro Ala Ala 85 90 95
ctt ggg acg gtg gct gtc aaa tac caa gat atg att gcg gcc ctg ccc 336
Leu Gly Thr Val Ala Val Lys Tyr Gln Asp Met Ile Ala Ala Leu Pro 100 105 110
gaa gcc acg cac gag gca att gta ggg gtc ggt aaa cag tgg tcg gga 384
Glu Ala Thr His Glu Ala Ile Val Gly Val Gly Lys Gln Trp Ser Gly 115 120 125
gcg cga gca ctt gag gcg ctg ctg act gtg gcg ggt gag ctt agg ggg 432
Ala Arg Ala Leu Glu Ala Leu Leu Thr Val Ala Gly Glu Leu Arg Gly 130 135 140
cct ccg ctc cag ctc gac acc ggg cag ctg ctg aag atc gcg aag aga 480
Pro Pro Leu Gln Leu Asp Thr Gly Gln Leu Leu Lys Ile Ala Lys Arg 145 150 155 160
ggg gga gta aca gcg gta gag gca gtg cac gcc tgg cgc aat gcg ctc 528
Gly Gly Val Thr Ala Val Glu Ala Val His Ala Trp Arg Asn Ala Leu 165 170 175
acc ggg gcc ccg ctg aat ctg aca ccg gaa cag gtt gtt gca att gca 576
Thr Gly Ala Pro Leu Asn Leu Thr Pro Glu Gln Val Val Ala Ile Ala 180 185 190
agc cat gat ggt ggt aaa cag gca ctg gaa acc gtt cag cgt ctg ctg 624
Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 195 200 205
ccg gtt ctg tgt cag gca cat ggt ctg acc cct gaa cag gtg gtg gcc 672
Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala 210 215 220
att gcc agt aat ggt ggt ggc aaa cag gcg tta gaa aca gtg cag cgc 720
Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 225 230 235 240
ctg ctg cct gtt tta tgc cag gcc cat ggc ctg aca cca gag cag gta 768
Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 245 250 255
gtg gcg att gcg agc aat att ggc ggt aaa caa gcc ctt gaa acc gtg 816
Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val 260 265 270
cag gca tta ctg ccg gtg ctg tgc caa gcg cac ggc ctg acc cca gaa 864
Gln Ala Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu 275 280 285
caa gtt gtt gcg atc gca tca aat ggt ggc ggt aag cag gct ttg gaa 912
Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu 290 295 300
acg gtg cag cgt tta ctg cca gtg tta tgt cag gcg cat ggt tta aca 960
Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 305 310 315 320
ccg gaa caa gtg gtg gca atc gcc tca cat gat ggc gga aaa caa gca 1008
Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala 325 330 335
tta gag act gtt caa cgc ctg ctg cca gtg ctt tgc cag gca cac ggc 1056
Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 340 345 350
tta aca cct gaa cag gtc gta gcg att gca tca aat att ggt gga aaa 1104
Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys 355 360 365
cag gcc ttg gag act gta cag gca ctg ctg cct gta ctg tgt caa gcc 1152
Gln Ala Leu Glu Thr Val Gln Ala Leu Leu Pro Val Leu Cys Gln Ala 370 375 380
cac gga tta acg cca gaa cag gtt gtg gct atc gcc agc aat tca ggc 1200
His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Ser Gly 385 390 395 400
ggt aaa cag gcg ctc gag aca gtt cag gcc ctg ctg ccg gtc tta tgt 1248
Gly Lys Gln Ala Leu Glu Thr Val Gln Ala Leu Leu Pro Val Leu Cys 405 410 415
caa gct cac gga ctg act ccc gag cag gtt gtc gcc att gcc tca aat 1296
Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn 420 425 430
ggc ggt ggt aaa caa gct ctc gaa acg gta cag aga ctg ctg ccc gtc 1344
Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 435 440 445
ctg tgc cag gcg cat gga ctt acg cct gag caa gtt gtg gca att gca 1392
Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala 450 455 460
tct aat aat ggg ggt aag caa gcg ctg gaa aca gtt caa cgc tta ctg 1440
Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 465 470 475 480
cct gtc ttg tgc caa gca cat ggt tta acc cca gag cag gtc gtt gct 1488
Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala 485 490 495
att gcc tct aac att gga ggc aaa cag gct ctt gaa act gtc cag gcc 1536
Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Ala 500 505 510
ctg tta cct gtt ctg tgc cag gct cac ggt ttg act cca gag caa gtg 1584
Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 515 520 525
gtt gca ata gcc agc aac ggt ggg ggt aaa caa gct tta gaa acc gtc 1632
Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val 530 535 540
caa cgt ctg tta cca gtg ctg tgt caa gct cat ggc ctt aca ccc gaa 1680
Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu 545 550 555 560
caa gta gtt gcc att gcg agt aac att ggt ggg aag caa gca ctt gaa 1728
Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu 565 570 575
acg gtt cag gcg ctg ctt cca gta tta tgc cag gcg cac gga ctc act 1776
Thr Val Gln Ala Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 580 585 590
cca gaa cag gta gta gca atc gca agt aat aac ggt ggg aaa caa gcg 1824
Pro Glu Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala 595 600 605
ttg gag aca gtc caa aga ctg ctt ccg gtt ctt tgc caa gcc cac ggt 1872
Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 610 615 620
ctt aca ccg cag cag gtt gta gct att gct agt aat att gga ggt cgt 1920
Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Arg 625 630 635 640
ccg gca ctg gaa agc att gtt gca cag ctg agc cgt cct gat ccg gca 1968
Pro Ala Leu Glu Ser Ile Val Ala Gln Leu Ser Arg Pro Asp Pro Ala 645 650 655
ctg gca gca ctg acc aat gat cat ctg gtt gca ctg gca tgt ctg ggt 2016
Leu Ala Ala Leu Thr Asn Asp His Leu Val Ala Leu Ala Cys Leu Gly 660 665 670
ggt cgt cct gcc ctg gat gca gtt aaa aaa ggt ctg ccg cat gct ccc 2064
Gly Arg Pro Ala Leu Asp Ala Val Lys Lys Gly Leu Pro His Ala Pro 675 680 685
gca ttg atc aaa aga acc aac cgg cgg att ccc gag aga act tcc cat 2112
Ala Leu Ile Lys Arg Thr Asn Arg Arg Ile Pro Glu Arg Thr Ser His 690 695 700
cga gtc gcg ggt tcc gat act tat aga tat att gat act tat aga tat 2160
Arg Val Ala Gly Ser Asp Thr Tyr Arg Tyr Ile Asp Thr Tyr Arg Tyr 705 710 715 720
att gaa ttc cca aag aag aaa cgg aag gtg cct aag aag aag aga aag 2208
Ile Glu Phe Pro Lys Lys Lys Arg Lys Val Pro Lys Lys Lys Arg Lys 725 730 735
gtt cct aaa aag aaa aga aaa gtc gac gcc cct ccg acc gat gtc agc 2256
Val Pro Lys Lys Lys Arg Lys Val Asp Ala Pro Pro Thr Asp Val Ser 740 745 750
ctg ggg gac gag ctc cac tta gac ggc gag gac gtg gcg atg gcg cat 2304
Leu Gly Asp Glu Leu His Leu Asp Gly Glu Asp Val Ala Met Ala His 755 760 765
gcc gac gcg cta gac gat ttc gat ctg gac atg ttg ggg gac ggg gat 2352
Ala Asp Ala Leu Asp Asp Phe Asp Leu Asp Met Leu Gly Asp Gly Asp 770 775 780
tcc ccg ggt ccg gga ttt acc ccc cac gac tcc gcc ccc tac ggc gct 2400
Ser Pro Gly Pro Gly Phe Thr Pro His Asp Ser Ala Pro Tyr Gly Ala 785 790 795 800
ctg gat atg gcc gac ttc gag ttt gag cag atg ttt acc gat gcc ctt 2448
Leu Asp Met Ala Asp Phe Glu Phe Glu Gln Met Phe Thr Asp Ala Leu 805 810 815
gga att gac gag tac ggt ggg ggc gaa aac ctc tac ttc cag agc ggc 2496
Gly Ile Asp Glu Tyr Gly Gly Gly Glu Asn Leu Tyr Phe Gln Ser Gly 820 825 830
ggc ggt ggt gct ttg tct cct cag cac tct gct gtc act caa gga agt 2544
Gly Gly Gly Ala Leu Ser Pro Gln His Ser Ala Val Thr Gln Gly Ser 835 840 845
atc atc aag aac aag gag ggc atg gat gct aag tca cta act gcc tgg 2592
Ile Ile Lys Asn Lys Glu Gly Met Asp Ala Lys Ser Leu Thr Ala Trp 850 855 860
tcc cgg aca ctg gtg acc ttc aag gat gta ttt gtg gac ttc acc agg 2640
Ser Arg Thr Leu Val Thr Phe Lys Asp Val Phe Val Asp Phe Thr Arg 865 870 875 880
gag gag tgg aag ctg ctg gac act gct cag cag atc gtg tac aga aat 2688
Glu Glu Trp Lys Leu Leu Asp Thr Ala Gln Gln Ile Val Tyr Arg Asn 885 890 895
gtg atg ctg gag aac tat aag aac ctg gtt tcc ttg ggt tat cag ctt 2736
Val Met Leu Glu Asn Tyr Lys Asn Leu Val Ser Leu Gly Tyr Gln Leu 900 905 910
act aag cca gat gtg atc ctc cgg ttg gag aag gga gaa gag ccc tgg 2784
Thr Lys Pro Asp Val Ile Leu Arg Leu Glu Lys Gly Glu Glu Pro Trp 915 920 925
ctg gtg gag aga gaa att cac caa gag acc cat cct gat tca gag act 2832
Leu Val Glu Arg Glu Ile His Gln Glu Thr His Pro Asp Ser Glu Thr 930 935 940
gca ttt gaa atc aaa tca tca gtt tctagataa 2865
Ala Phe Glu Ile Lys Ser Ser Val 945 950
<210> 84 <211> 952 <212> PRT <213> Artificial Sequence <220> <223> Synthetic Construct <400> 84
Met His His His His His His Asp Tyr Lys Asp His Asp Gly Asp Tyr 1 5 10 15
Lys Asp His Asp Ile Asp Tyr Lys Asp Asp Asp Asp Lys Met Ala Pro 20 25 30
Lys Lys Lys Arg Lys Val Gly Ile His Arg Gly Val Pro Met Val Asp 35 40 45
Leu Arg Thr Leu Gly Tyr Ser Gln Gln Gln Gln Glu Lys Ile Lys Pro 50 55 60
Lys Val Arg Ser Thr Val Ala Gln His His Glu Ala Leu Val Gly His 65 70 75 80
Gly Phe Thr His Ala His Ile Val Ala Leu Ser Gln His Pro Ala Ala 85 90 95
Leu Gly Thr Val Ala Val Lys Tyr Gln Asp Met Ile Ala Ala Leu Pro 100 105 110
Glu Ala Thr His Glu Ala Ile Val Gly Val Gly Lys Gln Trp Ser Gly 115 120 125
Ala Arg Ala Leu Glu Ala Leu Leu Thr Val Ala Gly Glu Leu Arg Gly 130 135 140
Pro Pro Leu Gln Leu Asp Thr Gly Gln Leu Leu Lys Ile Ala Lys Arg 145 150 155 160
Gly Gly Val Thr Ala Val Glu Ala Val His Ala Trp Arg Asn Ala Leu 165 170 175
Thr Gly Ala Pro Leu Asn Leu Thr Pro Glu Gln Val Val Ala Ile Ala 180 185 190
Ser His Asp Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 195 200 205
Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala 210 215 220
Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg 225 230 235 240
Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 245 250 255
Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val 260 265 270
Gln Ala Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu 275 280 285
Gln Val Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu 290 295 300
Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 305 310 315 320
Pro Glu Gln Val Val Ala Ile Ala Ser His Asp Gly Gly Lys Gln Ala 325 330 335
Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 340 345 350
Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys 355 360 365
Gln Ala Leu Glu Thr Val Gln Ala Leu Leu Pro Val Leu Cys Gln Ala 370 375 380
His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn Ser Gly 385 390 395 400
Gly Lys Gln Ala Leu Glu Thr Val Gln Ala Leu Leu Pro Val Leu Cys 405 410 415
Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala Ser Asn 420 425 430
Gly Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu Pro Val 435 440 445
Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala Ile Ala 450 455 460
Ser Asn Asn Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Arg Leu Leu 465 470 475 480
Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val Val Ala 485 490 495
Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu Thr Val Gln Ala 500 505 510
Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu Gln Val 515 520 525
Val Ala Ile Ala Ser Asn Gly Gly Gly Lys Gln Ala Leu Glu Thr Val 530 535 540
Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr Pro Glu 545 550 555 560
Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Lys Gln Ala Leu Glu 565 570 575
Thr Val Gln Ala Leu Leu Pro Val Leu Cys Gln Ala His Gly Leu Thr 580 585 590
Pro Glu Gln Val Val Ala Ile Ala Ser Asn Asn Gly Gly Lys Gln Ala 595 600 605
Leu Glu Thr Val Gln Arg Leu Leu Pro Val Leu Cys Gln Ala His Gly 610 615 620
Leu Thr Pro Gln Gln Val Val Ala Ile Ala Ser Asn Ile Gly Gly Arg 625 630 635 640
Pro Ala Leu Glu Ser Ile Val Ala Gln Leu Ser Arg Pro Asp Pro Ala 645 650 655
Leu Ala Ala Leu Thr Asn Asp His Leu Val Ala Leu Ala Cys Leu Gly 660 665 670
Gly Arg Pro Ala Leu Asp Ala Val Lys Lys Gly Leu Pro His Ala Pro 675 680 685
Ala Leu Ile Lys Arg Thr Asn Arg Arg Ile Pro Glu Arg Thr Ser His 690 695 700
Arg Val Ala Gly Ser Asp Thr Tyr Arg Tyr Ile Asp Thr Tyr Arg Tyr 705 710 715 720Arg Val Ala Gly Ser Asp Thr Tyr Arg Tyr Ile Asp Thr Tyr Arg Tyr 705 710 715 720
Ile Glu Phe Pro Lys Lys Lys Arg Lys Val Pro Lys Lys Lys Arg Lys 725 730 735Ile Glu Phe Pro Lys Lys Lys Arg Lys Lys Lys Lys Lys Lys Lys Lys Lys 725 730 735
Val Pro Lys Lys Lys Arg Lys Val Asp Ala Pro Pro Thr Asp Val Ser 740 745 750Val Pro Lys Lys Lys Arg Lys Val Asp Ala Pro Pro Thr Asp Val Ser 740 745 750
Leu Gly Asp Glu Leu His Leu Asp Gly Glu Asp Val Ala Met Ala His 755 760 765Leu Gly Asp Glu Leu His Leu Asp Gly Glu Asp Val Ala Met Ala His 755 760 765
Ala Asp Ala Leu Asp Asp Phe Asp Leu Asp Met Leu Gly Asp Gly Asp 770 775 780Ala Asp Ala Leu Asp Asp Asp Leu Asp Met Leu Gly Asp Gp Asp 770 775 780
Ser Pro Gly Pro Gly Phe Thr Pro His Asp Ser Ala Pro Tyr Gly Ala 785 790 795 800Ser Pro Gly Pro Gly Phe Thr Pro His Asp Ser Ala Pro Tyr Gly Ala 785 790 795 800
Leu Asp Met Ala Asp Phe Glu Phe Glu Gin Met Phe Thr Asp Ala Leu 805 810 815Leu Asp Met Ala Asp Phe Glu Phe Glu Gin Met Phe Thr Asp Ala Leu 805 810 815
Gly Ile Asp Glu Tyr Gly Gly Gly Glu Asn Leu Tyr Phe Gin Ser Gly 820 825 830Gly Ile Asp Glu Tyr Gly Gly Gly Gly Asn Leu Tyr Phe Gin Ser Gly 820 825 830
Gly Gly Gly Ala Leu Ser Pro Gin His Ser Ala Val Thr Gin Gly Ser 835 840 845Gly Gly Gly Ala Leu Ser Pro Gin His Ser Ala Val Thr Gin Gly Ser 835 840 845
Ile Ile Lys Asn Lys Glu Gly Met Asp Ala Lys Ser Leu Thr Ala Trp 850 855 860Ile Ile Lys Asn Lys Glu Gly Met Asp Ala Lys Ser Leu Thr Ala Trp 850 855 860
Ser Arg Thr Leu Val Thr Phe Lys Asp Val Phe Val Asp Phe Thr Arg 865 870 875 880Ser Arg Thr Leu Val Thr Phe Lys Asp Val Phe Val Asp Phe Thr Arg 865 870 875 880
Glu Glu Trp Lys Leu Leu Asp Thr Ala Gin Gin Ile Val Tyr Arg Asn 885 890 895Glu Glu Trp Lys Leu Leu Asp Thr Ala Gin Gin Ile Val Tyr Arg Asn 885 890 895
Val Met Leu Glu Asn Tyr Lys Asn Leu Val Ser Leu Gly Tyr Gin Leu 900 905 910Val Met Leu Glu Asn Tyr Lys Asn Leu Val Leu Gly Tyr Gin Leu 900 905 910
Thr Lys Pro Asp Val Ile Leu Arg Leu Glu Lys Gly Glu Glu Pro Trp 915 920 925Thr Lys Pro Asp Val Ile Leu Arg Leu Glu Lys Gly Glu Glu Pro Trp 915 920 925
Leu Val Glu Arg Glu Ile His Gin Glu Thr His Pro Asp Ser Glu Thr 930 935 940Leu Val Glu Arg Glu Ile His Gin Glu Thr His Pro Asp Ser Glu Thr 930 935 940
Ala Phe Glu Ile Lys Ser Ser Val 945 950Ala Phe Glu Ile Lys Ser Ser Val 945 950
Claims (15)
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SI201600252A SI25289A (en) | 2016-10-12 | 2016-10-12 | Combination of split orthogonal proteases with dimerization domains that enable the assembly |
PCT/IB2017/055902 WO2018069782A2 (en) | 2016-10-12 | 2017-09-27 | A combination of split orthogonal proteases with dimerization domains that allow for assembly |
EP17794772.8A EP3526325A2 (en) | 2016-10-12 | 2017-09-27 | A combination of split orthogonal proteases with dimerization domains that allow for assembly |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SI201600252A SI25289A (en) | 2016-10-12 | 2016-10-12 | Combination of split orthogonal proteases with dimerization domains that enable the assembly |
Publications (1)
Publication Number | Publication Date |
---|---|
SI25289A (en) | 2018-04-30 |
Family
ID=60268416
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SI201600252A SI25289A (en) | 2016-10-12 | 2016-10-12 | Combination of split orthogonal proteases with dimerization domains that enable the assembly |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP3526325A2 (en) |
SI (1) | SI25289A (en) |
WO (1) | WO2018069782A2 (en) |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3701015B1 (en) * | 2017-10-24 | 2022-04-06 | Ulisse Biomed S.P.A. | Amplification nanoswitch system based on split site-specific cleaving enzymes for the in vitro detection of target analytes and method for the detection of said target analytes |
US10899823B2 (en) | 2018-01-18 | 2021-01-26 | California Institute Of Technology | Programmable protein circuits in living cells |
US11965191B2 (en) | 2018-01-18 | 2024-04-23 | California Institute Of Technology | Programmable protein circuits in living cells |
US11453893B2 (en) | 2018-08-30 | 2022-09-27 | California Institute Of Technology | RNA-based delivery systems with levels of control |
EP3844180A4 (en) * | 2018-08-31 | 2022-07-20 | California Institute of Technology | Synthetic protein circuits detecting signal transducer activity |
WO2020146627A1 (en) | 2019-01-10 | 2020-07-16 | California Institute Of Technology | A synthetic system for tunable thresholding of protein signals |
EP3783104A1 (en) * | 2019-08-20 | 2021-02-24 | Kemijski Institut | Coiled-coil mediated tethering of crispr-cas and exonucleases for enhanced genome editing |
IT202000018064A1 (en) * | 2020-07-27 | 2022-01-27 | Univ Cattolica Del Sacro Cuore | DEVELOPMENT OF A NEW ENGINEERED TOBACCO ETCH VIRUS (TEV) PROTEASE THAT CAN BE ACTIVATED IN THE CYTOSOL OR SECRETORY PATHWAY |
CN112921053B (en) * | 2021-02-02 | 2023-04-14 | 汕头大学 | Dual-induction mCreER system capable of tracking cell differentiation and development and establishment and application thereof |
CN114591442B (en) * | 2022-03-01 | 2024-04-19 | 中国科学院深圳先进技术研究院 | Light-regulated protease tool and matched substrate thereof |
WO2024011146A1 (en) * | 2022-07-06 | 2024-01-11 | California Institute Of Technology | A synthetic protein-level neural network in mammalian cells |
US20240124913A1 (en) * | 2022-10-14 | 2024-04-18 | California Institute Of Technology | Protein-Based Signal Amplification |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9791436B2 (en) * | 2012-09-12 | 2017-10-17 | The University Of Queensland | Protease-based biosensor |
EP3044322A4 (en) * | 2013-09-12 | 2017-09-06 | The University Of Queensland | Bimolecular protease-based biosensor |
Filing Date | Country | Application | Publication | Status |
---|---|---|---|---|
2016-10-12 | SI | SI201600252A | SI25289A (en) | Not active: IP right cessation |
2017-09-27 | EP | EP17794772.8A | EP3526325A2 (en) | Not active: withdrawn |
2017-09-27 | WO | PCT/IB2017/055902 | WO2018069782A2 (en) | Unknown |
Also Published As
Publication number | Publication date |
---|---|
EP3526325A2 (en) | 2019-08-21 |
WO2018069782A2 (en) | 2018-04-19 |
WO2018069782A3 (en) | 2018-05-24 |
Legal Events
Code | Title | Description |
---|---|---|
OO00 | Grant of patent | Effective date: 2018-05-18 |
SP73 | Change of data on owner | Owner name: KEMIJSKI INSTITUT; SI. Effective date: 2019-11-07 |
KO00 | Lapse of patent | Effective date: 2021-08-10 |