WO2024004921A1 - ポリペプチド、オキシゲナーゼ及びこれらの応用 - Google Patents
ポリペプチド、オキシゲナーゼ及びこれらの応用 Download PDFInfo
- Publication number
- WO2024004921A1 WO2024004921A1 PCT/JP2023/023517 JP2023023517W WO2024004921A1 WO 2024004921 A1 WO2024004921 A1 WO 2024004921A1 JP 2023023517 W JP2023023517 W JP 2023023517W WO 2024004921 A1 WO2024004921 A1 WO 2024004921A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- leu
- ala
- glu
- lys
- asp
- Prior art date
Links
- 229920001184 polypeptide Polymers 0.000 title claims abstract description 108
- 102000004196 processed proteins & peptides Human genes 0.000 title claims abstract description 108
- 108090000765 processed proteins & peptides Proteins 0.000 title claims abstract description 108
- 102000004020 Oxygenases Human genes 0.000 title claims abstract description 45
- 108090000417 Oxygenases Proteins 0.000 title claims abstract description 45
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract description 95
- 125000000539 amino acid group Chemical group 0.000 claims abstract description 48
- 102200042353 rs1130335 Human genes 0.000 claims abstract description 18
- 102000004190 Enzymes Human genes 0.000 claims description 174
- 108090000790 Enzymes Proteins 0.000 claims description 172
- 238000004519 manufacturing process Methods 0.000 claims description 63
- 238000006467 substitution reaction Methods 0.000 claims description 63
- MHAJPDPJQMAIIY-UHFFFAOYSA-N Hydrogen peroxide Chemical compound OO MHAJPDPJQMAIIY-UHFFFAOYSA-N 0.000 claims description 62
- 235000014113 dietary fatty acids Nutrition 0.000 claims description 59
- 229930195729 fatty acid Natural products 0.000 claims description 59
- 239000000194 fatty acid Substances 0.000 claims description 59
- 150000004665 fatty acids Chemical class 0.000 claims description 59
- 238000000034 method Methods 0.000 claims description 58
- 235000020660 omega-3 fatty acid Nutrition 0.000 claims description 56
- 229940012843 omega-3 fatty acid Drugs 0.000 claims description 55
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 claims description 34
- 239000006014 omega-3 oil Substances 0.000 claims description 34
- 239000013598 vector Substances 0.000 claims description 33
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 27
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 claims description 26
- 230000003197 catalytic effect Effects 0.000 claims description 26
- 239000001301 oxygen Substances 0.000 claims description 26
- 229910052760 oxygen Inorganic materials 0.000 claims description 26
- 230000001172 regenerating effect Effects 0.000 claims description 24
- CYQFCXCEBYINGO-IAGOWNOFSA-N delta1-THC Chemical compound C1=C(C)CC[C@H]2C(C)(C)OC3=CC(CCCCC)=CC(O)=C3[C@@H]21 CYQFCXCEBYINGO-IAGOWNOFSA-N 0.000 claims description 21
- 239000003795 chemical substances by application Substances 0.000 claims description 19
- 238000006735 epoxidation reaction Methods 0.000 claims description 17
- 239000000203 mixture Substances 0.000 claims description 15
- 229910052700 potassium Inorganic materials 0.000 claims description 7
- 238000012258 culturing Methods 0.000 claims description 6
- 230000000717 retained effect Effects 0.000 claims description 6
- ACFIXJIJDZMPPO-NNYOXOHSSA-N NADPH Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](OP(O)(O)=O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 ACFIXJIJDZMPPO-NNYOXOHSSA-N 0.000 claims 1
- 235000021122 unsaturated fatty acids Nutrition 0.000 abstract description 15
- 150000004670 unsaturated fatty acids Chemical class 0.000 abstract description 15
- 229940088598 enzyme Drugs 0.000 description 165
- 241000282326 Felis catus Species 0.000 description 129
- 108020004414 DNA Proteins 0.000 description 62
- 238000006243 chemical reaction Methods 0.000 description 61
- 235000001014 amino acid Nutrition 0.000 description 56
- 239000000758 substrate Substances 0.000 description 49
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 42
- JAZBEHYOTPTENJ-JLNKQSITSA-N all-cis-5,8,11,14,17-icosapentaenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O JAZBEHYOTPTENJ-JLNKQSITSA-N 0.000 description 41
- JAZBEHYOTPTENJ-UHFFFAOYSA-N eicosapentaenoic acid Natural products CCC=CCC=CCC=CCC=CCC=CCCCC(O)=O JAZBEHYOTPTENJ-UHFFFAOYSA-N 0.000 description 41
- 235000020673 eicosapentaenoic acid Nutrition 0.000 description 41
- 229960005135 eicosapentaenoic acid Drugs 0.000 description 41
- 239000000243 solution Substances 0.000 description 38
- XJLXINKUBYWONI-DQQFMEOOSA-N [[(2r,3r,4r,5r)-5-(6-aminopurin-9-yl)-3-hydroxy-4-phosphonooxyoxolan-2-yl]methoxy-hydroxyphosphoryl] [(2s,3r,4s,5s)-5-(3-carbamoylpyridin-1-ium-1-yl)-3,4-dihydroxyoxolan-2-yl]methyl phosphate Chemical compound NC(=O)C1=CC=C[N+]([C@@H]2[C@H]([C@@H](O)[C@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP(O)(O)=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 XJLXINKUBYWONI-DQQFMEOOSA-N 0.000 description 35
- GPQVVJQEBXAKBJ-IXEYAUFLSA-N (5e,8e,11e,14e)-16-(3-ethyloxiran-2-yl)hexadeca-5,8,11,14-tetraenoic acid Chemical compound CCC1OC1C\C=C\C\C=C\C\C=C\C\C=C\CCCC(O)=O GPQVVJQEBXAKBJ-IXEYAUFLSA-N 0.000 description 34
- 230000000694 effects Effects 0.000 description 32
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 31
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 31
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 31
- 108090000623 proteins and genes Proteins 0.000 description 31
- MBMBGCFOFBJSGT-KUBAVDMBSA-N all-cis-docosa-4,7,10,13,16,19-hexaenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCC(O)=O MBMBGCFOFBJSGT-KUBAVDMBSA-N 0.000 description 30
- 108010049041 glutamylalanine Proteins 0.000 description 30
- 238000004128 high performance liquid chromatography Methods 0.000 description 30
- 210000004027 cell Anatomy 0.000 description 29
- MFORDNZDKAVNSR-SRVKXCTJSA-N Gln-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O MFORDNZDKAVNSR-SRVKXCTJSA-N 0.000 description 25
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 25
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 25
- 229940024606 amino acid Drugs 0.000 description 25
- 150000001413 amino acids Chemical class 0.000 description 25
- 108010038633 aspartylglutamate Proteins 0.000 description 25
- 108010050848 glycylleucine Proteins 0.000 description 25
- 108010009298 lysylglutamic acid Proteins 0.000 description 25
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 24
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 24
- 108010047495 alanylglycine Proteins 0.000 description 24
- 108010017391 lysylvaline Proteins 0.000 description 24
- 239000000047 product Substances 0.000 description 24
- 108010090894 prolylleucine Proteins 0.000 description 24
- 241000588724 Escherichia coli Species 0.000 description 19
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 19
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 19
- 108010092854 aspartyllysine Proteins 0.000 description 19
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 18
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 18
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 18
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 18
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 18
- 101150053185 P450 gene Proteins 0.000 description 18
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 18
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 18
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 18
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 18
- 108010054155 lysyllysine Proteins 0.000 description 18
- 108010084572 phenylalanyl-valine Proteins 0.000 description 18
- 108010029020 prolylglycine Proteins 0.000 description 18
- 108010080629 tryptophan-leucine Proteins 0.000 description 18
- YUFFSWGQGVEMMI-JLNKQSITSA-N (7Z,10Z,13Z,16Z,19Z)-docosapentaenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCCCCC(O)=O YUFFSWGQGVEMMI-JLNKQSITSA-N 0.000 description 17
- 235000021294 Docosapentaenoic acid Nutrition 0.000 description 17
- 239000013604 expression vector Substances 0.000 description 17
- 230000035772 mutation Effects 0.000 description 16
- 235000020669 docosahexaenoic acid Nutrition 0.000 description 15
- 229940090949 docosahexaenoic acid Drugs 0.000 description 15
- 238000011282 treatment Methods 0.000 description 15
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 14
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 13
- PJNSIUPOXFBHDM-GUBZILKMSA-N Ala-Arg-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O PJNSIUPOXFBHDM-GUBZILKMSA-N 0.000 description 13
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 13
- 108010040956 Ala-Asp-Glu-Leu Proteins 0.000 description 13
- RAAWHFXHAACDFT-FXQIFTODSA-N Ala-Met-Asn Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CC(N)=O)C(O)=O RAAWHFXHAACDFT-FXQIFTODSA-N 0.000 description 13
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 13
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 13
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 13
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 13
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 13
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 13
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 13
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 13
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 13
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 13
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 13
- UMHRCVCZUPBBQW-GARJFASQSA-N Glu-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UMHRCVCZUPBBQW-GARJFASQSA-N 0.000 description 13
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 13
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 13
- LBQAHBIVXQSBIR-HVTMNAMFSA-N His-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LBQAHBIVXQSBIR-HVTMNAMFSA-N 0.000 description 13
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 13
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 13
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 13
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 13
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 13
- NTEVEUCLFMWSND-SRVKXCTJSA-N Lys-Arg-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O NTEVEUCLFMWSND-SRVKXCTJSA-N 0.000 description 13
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 13
- XGZDDOKIHSYHTO-SZMVWBNQSA-N Lys-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 XGZDDOKIHSYHTO-SZMVWBNQSA-N 0.000 description 13
- ALTHVGNGGZZSAC-SRVKXCTJSA-N Met-Val-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N ALTHVGNGGZZSAC-SRVKXCTJSA-N 0.000 description 13
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 13
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 13
- UNLYPPYNDXHGDG-IHRRRGAJSA-N Phe-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UNLYPPYNDXHGDG-IHRRRGAJSA-N 0.000 description 13
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 13
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 13
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 13
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 13
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 13
- MPYZGXUYLNPSNF-NAZCDGGXSA-N Trp-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O MPYZGXUYLNPSNF-NAZCDGGXSA-N 0.000 description 13
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 13
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 13
- 108010093581 aspartyl-proline Proteins 0.000 description 13
- 150000001875 compounds Chemical class 0.000 description 13
- 108010016616 cysteinylglycine Proteins 0.000 description 13
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 13
- 108010034529 leucyl-lysine Proteins 0.000 description 13
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 13
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 13
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 13
- 108010031719 prolyl-serine Proteins 0.000 description 13
- 108010070643 prolylglutamic acid Proteins 0.000 description 13
- 239000000126 substance Substances 0.000 description 13
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 12
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 12
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 12
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 12
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 12
- IVKWMMGFLAMMKJ-XVYDVKMFSA-N Ala-His-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IVKWMMGFLAMMKJ-XVYDVKMFSA-N 0.000 description 12
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 12
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 12
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 12
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 12
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 12
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 12
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 12
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 12
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 12
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 12
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 12
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 12
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 12
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 12
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 12
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 12
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 12
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 12
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 12
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 12
- IVCOYUURLWQDJQ-LPEHRKFASA-N Gln-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O IVCOYUURLWQDJQ-LPEHRKFASA-N 0.000 description 12
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 12
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 12
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 12
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 12
- RSUVOPBMWMTVDI-XEGUGMAKSA-N Glu-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(O)=O)C)C(O)=O)=CNC2=C1 RSUVOPBMWMTVDI-XEGUGMAKSA-N 0.000 description 12
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 12
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 12
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 12
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 12
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 12
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 12
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 12
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 12
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 12
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 12
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 12
- BRQKGRLDDDQWQJ-MBLNEYKQSA-N His-Thr-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O BRQKGRLDDDQWQJ-MBLNEYKQSA-N 0.000 description 12
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 12
- WUEIUSDAECDLQO-NAKRPEOUSA-N Ile-Ala-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)O)N WUEIUSDAECDLQO-NAKRPEOUSA-N 0.000 description 12
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 12
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 12
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 12
- RKQAYOWLSFLJEE-SVSWQMSJSA-N Ile-Thr-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N RKQAYOWLSFLJEE-SVSWQMSJSA-N 0.000 description 12
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 12
- 241000880493 Leptailurus serval Species 0.000 description 12
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 12
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 12
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 12
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 12
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 12
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 12
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 12
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 12
- IBSGMIPRBMPMHE-IHRRRGAJSA-N Leu-Met-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O IBSGMIPRBMPMHE-IHRRRGAJSA-N 0.000 description 12
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 12
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 12
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 12
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 12
- PGBPWPTUOSCNLE-JYJNAYRXSA-N Lys-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N PGBPWPTUOSCNLE-JYJNAYRXSA-N 0.000 description 12
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 12
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 12
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 12
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 12
- MPCKIRSXNKACRF-GUBZILKMSA-N Met-Pro-Asn Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O MPCKIRSXNKACRF-GUBZILKMSA-N 0.000 description 12
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 12
- 108010065395 Neuropep-1 Proteins 0.000 description 12
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 12
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 12
- MMYUOSCXBJFUNV-QWRGUYRKSA-N Phe-Gly-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N MMYUOSCXBJFUNV-QWRGUYRKSA-N 0.000 description 12
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 12
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 12
- BAKAHWWRCCUDAF-IHRRRGAJSA-N Pro-His-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CN=CN1 BAKAHWWRCCUDAF-IHRRRGAJSA-N 0.000 description 12
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 12
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 12
- COAHUSQNSVFYBW-FXQIFTODSA-N Ser-Asn-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O COAHUSQNSVFYBW-FXQIFTODSA-N 0.000 description 12
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 12
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 12
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 12
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 12
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 12
- XUGYQLFEJYZOKQ-NGTWOADLSA-N Thr-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XUGYQLFEJYZOKQ-NGTWOADLSA-N 0.000 description 12
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 12
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 12
- IYHNBRUWVBIVJR-IHRRRGAJSA-N Tyr-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IYHNBRUWVBIVJR-IHRRRGAJSA-N 0.000 description 12
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 12
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 12
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 12
- XKVXSCHXGJOQND-ZOBUZTSGSA-N Val-Asp-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XKVXSCHXGJOQND-ZOBUZTSGSA-N 0.000 description 12
- DLYOEFGPYTZVSP-AEJSXWLSSA-N Val-Cys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N DLYOEFGPYTZVSP-AEJSXWLSSA-N 0.000 description 12
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 12
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 12
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 12
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 12
- 108010041407 alanylaspartic acid Proteins 0.000 description 12
- 108010005233 alanylglutamic acid Proteins 0.000 description 12
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 12
- 108010027371 asparaginyl-leucyl-prolyl-arginine Proteins 0.000 description 12
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 12
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 12
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 12
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 12
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 12
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 12
- 108010040030 histidinoalanine Proteins 0.000 description 12
- 108010025306 histidylleucine Proteins 0.000 description 12
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 12
- 108010003700 lysyl aspartic acid Proteins 0.000 description 12
- 108010005942 methionylglycine Proteins 0.000 description 12
- 239000013612 plasmid Substances 0.000 description 12
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 12
- 108700004896 tripeptide FEG Proteins 0.000 description 12
- 108010038745 tryptophylglycine Proteins 0.000 description 12
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 12
- 108010051110 tyrosyl-lysine Proteins 0.000 description 12
- 108010020532 tyrosyl-proline Proteins 0.000 description 12
- 108010073969 valyllysine Proteins 0.000 description 12
- 102000016938 Catalase Human genes 0.000 description 11
- 108010053835 Catalase Proteins 0.000 description 11
- 230000001580 bacterial effect Effects 0.000 description 11
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 10
- 241000894006 Bacteria Species 0.000 description 10
- 108010050375 Glucose 1-Dehydrogenase Proteins 0.000 description 10
- YZXBAPSDXZZRGB-DOFZRALJSA-N arachidonic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O YZXBAPSDXZZRGB-DOFZRALJSA-N 0.000 description 10
- 238000004364 calculation method Methods 0.000 description 10
- 239000002609 medium Substances 0.000 description 10
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 9
- 102000019197 Superoxide Dismutase Human genes 0.000 description 9
- 108010012715 Superoxide dismutase Proteins 0.000 description 9
- 238000009396 hybridization Methods 0.000 description 9
- 238000000329 molecular dynamics simulation Methods 0.000 description 9
- 238000000746 purification Methods 0.000 description 9
- 102220020518 rs397508403 Human genes 0.000 description 9
- 239000000523 sample Substances 0.000 description 9
- 241000194107 Bacillus megaterium Species 0.000 description 8
- 108020004705 Codon Proteins 0.000 description 8
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 8
- 229930027917 kanamycin Natural products 0.000 description 8
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 8
- 229960000318 kanamycin Drugs 0.000 description 8
- 229930182823 kanamycin A Natural products 0.000 description 8
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 8
- SCPRYBYMKVYVND-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-4-methylpentanoyl)pyrrolidine-2-carbonyl]amino]-4-methylpentanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(O)=O SCPRYBYMKVYVND-UHFFFAOYSA-N 0.000 description 7
- DAEFQZCYZKRTLR-ZLUOBGJFSA-N Ala-Cys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O DAEFQZCYZKRTLR-ZLUOBGJFSA-N 0.000 description 7
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 7
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 7
- RGQCNKIDEQJEBT-CQDKDKBSSA-N Ala-Leu-Tyr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 RGQCNKIDEQJEBT-CQDKDKBSSA-N 0.000 description 7
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 7
- DXQIQUIQYAGRCC-CIUDSAMLSA-N Arg-Asp-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)CN=C(N)N DXQIQUIQYAGRCC-CIUDSAMLSA-N 0.000 description 7
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 7
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 7
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 7
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 7
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 7
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 7
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 7
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 7
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 7
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 7
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 7
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 7
- MTBIKIMYHUWBRX-QWRGUYRKSA-N Gly-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN MTBIKIMYHUWBRX-QWRGUYRKSA-N 0.000 description 7
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 7
- MBSSHYPAEHPSGY-LSJOCFKGSA-N His-Ala-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O MBSSHYPAEHPSGY-LSJOCFKGSA-N 0.000 description 7
- UZZXGLOJRZKYEL-DJFWLOJKSA-N His-Asn-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UZZXGLOJRZKYEL-DJFWLOJKSA-N 0.000 description 7
- PQKCQZHAGILVIM-NKIYYHGXSA-N His-Glu-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O PQKCQZHAGILVIM-NKIYYHGXSA-N 0.000 description 7
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 7
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 7
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 7
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 7
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 7
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 7
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 7
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 7
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 7
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 7
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 7
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 7
- ONHCDMBHPQIPAI-YTQUADARSA-N Leu-Trp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N ONHCDMBHPQIPAI-YTQUADARSA-N 0.000 description 7
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 7
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 7
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 7
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 7
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 7
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 7
- IVCPHARVJUYDPA-FXQIFTODSA-N Met-Asn-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IVCPHARVJUYDPA-FXQIFTODSA-N 0.000 description 7
- QZPXMHVKPHJNTR-DCAQKATOSA-N Met-Leu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O QZPXMHVKPHJNTR-DCAQKATOSA-N 0.000 description 7
- HOZNVKDCKZPRER-XUXIUFHCSA-N Met-Lys-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HOZNVKDCKZPRER-XUXIUFHCSA-N 0.000 description 7
- KSIPKXNIQOWMIC-RCWTZXSCSA-N Met-Thr-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KSIPKXNIQOWMIC-RCWTZXSCSA-N 0.000 description 7
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 7
- 108091028043 Nucleic acid sequence Proteins 0.000 description 7
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 7
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 7
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 7
- GLUYKHMBGKQBHE-JYJNAYRXSA-N Phe-Val-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 GLUYKHMBGKQBHE-JYJNAYRXSA-N 0.000 description 7
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 7
- YTWNSIDWAFSEEI-RWMBFGLXSA-N Pro-His-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N3CCC[C@@H]3C(=O)O YTWNSIDWAFSEEI-RWMBFGLXSA-N 0.000 description 7
- QGMLKFGTGXWAHF-IHRRRGAJSA-N Ser-Arg-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGMLKFGTGXWAHF-IHRRRGAJSA-N 0.000 description 7
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 7
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 7
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 7
- UNURFMVMXLENAZ-KJEVXHAQSA-N Thr-Arg-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UNURFMVMXLENAZ-KJEVXHAQSA-N 0.000 description 7
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 7
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 7
- BYAKMYBZADCNMN-JYJNAYRXSA-N Tyr-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYAKMYBZADCNMN-JYJNAYRXSA-N 0.000 description 7
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 7
- FXVDGDZRYLFQKY-WPRPVWTQSA-N Val-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C FXVDGDZRYLFQKY-WPRPVWTQSA-N 0.000 description 7
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 7
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 7
- 108010070944 alanylhistidine Proteins 0.000 description 7
- 239000000872 buffer Substances 0.000 description 7
- 108010053037 kyotorphin Proteins 0.000 description 7
- 108010038320 lysylphenylalanine Proteins 0.000 description 7
- 108010085203 methionylmethionine Proteins 0.000 description 7
- 238000002360 preparation method Methods 0.000 description 7
- 102000004169 proteins and genes Human genes 0.000 description 7
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 6
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 6
- WCBVQNZTOKJWJS-ACZMJKKPSA-N Ala-Cys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WCBVQNZTOKJWJS-ACZMJKKPSA-N 0.000 description 6
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 6
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 6
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 6
- FAJIYNONGXEXAI-CQDKDKBSSA-N Ala-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 FAJIYNONGXEXAI-CQDKDKBSSA-N 0.000 description 6
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 6
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 6
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 6
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 6
- VEAPAYQQLSEKEM-GUBZILKMSA-N Ala-Met-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O VEAPAYQQLSEKEM-GUBZILKMSA-N 0.000 description 6
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 6
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 6
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 6
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 6
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 6
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 6
- HULHGJZIZXCPLD-FXQIFTODSA-N Arg-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HULHGJZIZXCPLD-FXQIFTODSA-N 0.000 description 6
- VYSRNGOMGHOJCK-GUBZILKMSA-N Arg-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N VYSRNGOMGHOJCK-GUBZILKMSA-N 0.000 description 6
- OTUQSEPIIVBYEM-IHRRRGAJSA-N Arg-Asn-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OTUQSEPIIVBYEM-IHRRRGAJSA-N 0.000 description 6
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 6
- QAXCZGMLVICQKS-SRVKXCTJSA-N Arg-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QAXCZGMLVICQKS-SRVKXCTJSA-N 0.000 description 6
- OGSQONVYSTZIJB-WDSOQIARSA-N Arg-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OGSQONVYSTZIJB-WDSOQIARSA-N 0.000 description 6
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 6
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 6
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 6
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 6
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 6
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 6
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 6
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 6
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 6
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 6
- CNBIWSCSSCAINS-UFYCRDLUSA-N Arg-Tyr-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNBIWSCSSCAINS-UFYCRDLUSA-N 0.000 description 6
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 6
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 6
- NXVGBGZQQFDUTM-XVYDVKMFSA-N Asn-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N NXVGBGZQQFDUTM-XVYDVKMFSA-N 0.000 description 6
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 6
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 6
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 6
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 6
- GFGUPLIETCNQGF-DCAQKATOSA-N Asn-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O GFGUPLIETCNQGF-DCAQKATOSA-N 0.000 description 6
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 6
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 6
- IPPFAOCLQSGHJV-WFBYXXMGSA-N Asn-Trp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O IPPFAOCLQSGHJV-WFBYXXMGSA-N 0.000 description 6
- QIRJQYQOIKBPBZ-IHRRRGAJSA-N Asn-Tyr-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QIRJQYQOIKBPBZ-IHRRRGAJSA-N 0.000 description 6
- UWMIZBCTVWVMFI-FXQIFTODSA-N Asp-Ala-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UWMIZBCTVWVMFI-FXQIFTODSA-N 0.000 description 6
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 6
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 6
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 6
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 6
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 6
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 6
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 6
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 6
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 6
- RKNIUWSZIAUEPK-PBCZWWQYSA-N Asp-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N)O RKNIUWSZIAUEPK-PBCZWWQYSA-N 0.000 description 6
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 6
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 6
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 6
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 6
- RRUWMFBLFLUZSI-LPEHRKFASA-N Asp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N RRUWMFBLFLUZSI-LPEHRKFASA-N 0.000 description 6
- LKVKODXGSAFOFY-VEVYYDQMSA-N Asp-Met-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKVKODXGSAFOFY-VEVYYDQMSA-N 0.000 description 6
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 6
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 6
- XQFLFQWOBXPMHW-NHCYSSNCSA-N Asp-Val-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XQFLFQWOBXPMHW-NHCYSSNCSA-N 0.000 description 6
- GYNUXDMCDILYIQ-QRTARXTBSA-N Asp-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N GYNUXDMCDILYIQ-QRTARXTBSA-N 0.000 description 6
- SDXQKJAWASHMIZ-CIUDSAMLSA-N Cys-Glu-Met Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O SDXQKJAWASHMIZ-CIUDSAMLSA-N 0.000 description 6
- PQHYZJPCYRDYNE-QWRGUYRKSA-N Cys-Gly-Phe Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PQHYZJPCYRDYNE-QWRGUYRKSA-N 0.000 description 6
- KZKBJEUWNMQTLV-XDTLVQLUSA-N Gln-Ala-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZKBJEUWNMQTLV-XDTLVQLUSA-N 0.000 description 6
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 6
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 6
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 6
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 6
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 6
- XWIBVSAEUCAAKF-GVXVVHGQSA-N Gln-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N XWIBVSAEUCAAKF-GVXVVHGQSA-N 0.000 description 6
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 6
- KLKYKPXITJBSNI-CIUDSAMLSA-N Gln-Met-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O KLKYKPXITJBSNI-CIUDSAMLSA-N 0.000 description 6
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 6
- UTOQQOMEJDPDMX-ACZMJKKPSA-N Gln-Ser-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O UTOQQOMEJDPDMX-ACZMJKKPSA-N 0.000 description 6
- SJMJMEWQMBJYPR-DZKIICNBSA-N Gln-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N SJMJMEWQMBJYPR-DZKIICNBSA-N 0.000 description 6
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 6
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 6
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 6
- UTKICHUQEQBDGC-ACZMJKKPSA-N Glu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UTKICHUQEQBDGC-ACZMJKKPSA-N 0.000 description 6
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 6
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 6
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 6
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 6
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 6
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 6
- CLROYXHHUZELFX-FXQIFTODSA-N Glu-Gln-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CLROYXHHUZELFX-FXQIFTODSA-N 0.000 description 6
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 6
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 6
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 6
- QYPKJXSMLMREKF-BPUTZDHNSA-N Glu-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N QYPKJXSMLMREKF-BPUTZDHNSA-N 0.000 description 6
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 6
- YDJOULGWHQRPEV-SRVKXCTJSA-N Glu-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N YDJOULGWHQRPEV-SRVKXCTJSA-N 0.000 description 6
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 6
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 6
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 6
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 6
- DWBBKNPKDHXIAC-SRVKXCTJSA-N Glu-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCC(O)=O DWBBKNPKDHXIAC-SRVKXCTJSA-N 0.000 description 6
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 6
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 6
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 6
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 6
- JDAYMLXPUJRSDJ-XIRDDKMYSA-N Glu-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 JDAYMLXPUJRSDJ-XIRDDKMYSA-N 0.000 description 6
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 6
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 6
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 6
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 6
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 6
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 6
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 6
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 6
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 6
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 6
- YZACQYVWLCQWBT-BQBZGAKWSA-N Gly-Cys-Arg Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YZACQYVWLCQWBT-BQBZGAKWSA-N 0.000 description 6
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 6
- ORXZVPZCPMKHNR-IUCAKERBSA-N Gly-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 ORXZVPZCPMKHNR-IUCAKERBSA-N 0.000 description 6
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 6
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 6
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 6
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 6
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 6
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 6
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 6
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 6
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 6
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 6
- KBBFOULZCHWGJX-KBPBESRZSA-N Gly-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN)O KBBFOULZCHWGJX-KBPBESRZSA-N 0.000 description 6
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 6
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 6
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 6
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 6
- FLUVGKKRRMLNPU-CQDKDKBSSA-N His-Ala-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FLUVGKKRRMLNPU-CQDKDKBSSA-N 0.000 description 6
- IIVZNQCUUMBBKF-GVXVVHGQSA-N His-Gln-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 IIVZNQCUUMBBKF-GVXVVHGQSA-N 0.000 description 6
- SDTPKSOWFXBACN-GUBZILKMSA-N His-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O SDTPKSOWFXBACN-GUBZILKMSA-N 0.000 description 6
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 6
- KHUFDBQXGLEIHC-BZSNNMDCSA-N His-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 KHUFDBQXGLEIHC-BZSNNMDCSA-N 0.000 description 6
- RLAOTFTXBFQJDV-KKUMJFAQSA-N His-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CN=CN1 RLAOTFTXBFQJDV-KKUMJFAQSA-N 0.000 description 6
- PBVQWNDMFFCPIZ-ULQDDVLXSA-N His-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 PBVQWNDMFFCPIZ-ULQDDVLXSA-N 0.000 description 6
- LNDVNHOSZQPJGI-AVGNSLFASA-N His-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 LNDVNHOSZQPJGI-AVGNSLFASA-N 0.000 description 6
- FBVHRDXSCYELMI-PBCZWWQYSA-N His-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O FBVHRDXSCYELMI-PBCZWWQYSA-N 0.000 description 6
- MCGOGXFMKHPMSQ-AVGNSLFASA-N His-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MCGOGXFMKHPMSQ-AVGNSLFASA-N 0.000 description 6
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 6
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 6
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 6
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 6
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 6
- SVZFKLBRCYCIIY-CYDGBPFRSA-N Ile-Pro-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVZFKLBRCYCIIY-CYDGBPFRSA-N 0.000 description 6
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 6
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 6
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 6
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 6
- 108010065920 Insulin Lispro Proteins 0.000 description 6
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 6
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 6
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 6
- QUAAUWNLWMLERT-IHRRRGAJSA-N Leu-Arg-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O QUAAUWNLWMLERT-IHRRRGAJSA-N 0.000 description 6
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 6
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 6
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 6
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 6
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 6
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 6
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 6
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 6
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 6
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 6
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 6
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 6
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 6
- ZALAVHVPPOHAOL-XUXIUFHCSA-N Leu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N ZALAVHVPPOHAOL-XUXIUFHCSA-N 0.000 description 6
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 6
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 6
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 6
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 6
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 6
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 6
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 6
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 6
- FPFOYSCDUWTZBF-IHPCNDPISA-N Leu-Trp-Leu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]([NH3+])CC(C)C)C(=O)N[C@@H](CC(C)C)C([O-])=O)=CNC2=C1 FPFOYSCDUWTZBF-IHPCNDPISA-N 0.000 description 6
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 6
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 6
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 6
- JBRWKVANRYPCAF-XIRDDKMYSA-N Lys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N JBRWKVANRYPCAF-XIRDDKMYSA-N 0.000 description 6
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 6
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 6
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 6
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 6
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 6
- ZMMDPRTXLAEMOD-BZSNNMDCSA-N Lys-His-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZMMDPRTXLAEMOD-BZSNNMDCSA-N 0.000 description 6
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 6
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 6
- SPNKGZFASINBMR-IHRRRGAJSA-N Lys-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N SPNKGZFASINBMR-IHRRRGAJSA-N 0.000 description 6
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 6
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 6
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 6
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 6
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 6
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 6
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 6
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 6
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 6
- 239000004472 Lysine Substances 0.000 description 6
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 6
- DJDFBVNNDAUPRW-GUBZILKMSA-N Met-Glu-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DJDFBVNNDAUPRW-GUBZILKMSA-N 0.000 description 6
- WXJXYMFUTRXRGO-UWVGGRQHSA-N Met-His-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 WXJXYMFUTRXRGO-UWVGGRQHSA-N 0.000 description 6
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 6
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 6
- ZRACLHJYVRBJFC-ULQDDVLXSA-N Met-Lys-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZRACLHJYVRBJFC-ULQDDVLXSA-N 0.000 description 6
- WUYLWZRHRLLEGB-AVGNSLFASA-N Met-Met-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O WUYLWZRHRLLEGB-AVGNSLFASA-N 0.000 description 6
- YLDSJJOGQNEQJK-AVGNSLFASA-N Met-Pro-Leu Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YLDSJJOGQNEQJK-AVGNSLFASA-N 0.000 description 6
- SQPZCTBSLIIMBL-BPUTZDHNSA-N Met-Trp-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N SQPZCTBSLIIMBL-BPUTZDHNSA-N 0.000 description 6
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 description 6
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 6
- 102000008109 Mixed Function Oxygenases Human genes 0.000 description 6
- 108010074633 Mixed Function Oxygenases Proteins 0.000 description 6
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 6
- 108010079364 N-glycylalanine Proteins 0.000 description 6
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 6
- 108091005804 Peptidases Proteins 0.000 description 6
- 102000003992 Peroxidases Human genes 0.000 description 6
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 6
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 6
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 6
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 6
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 6
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 6
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 6
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 6
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 6
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 6
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 6
- MMPBPRXOFJNCCN-ZEWNOJEFSA-N Phe-Tyr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MMPBPRXOFJNCCN-ZEWNOJEFSA-N 0.000 description 6
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 6
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 6
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 6
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 description 6
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 6
- CMOIIANLNNYUTP-SRVKXCTJSA-N Pro-Gln-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CMOIIANLNNYUTP-SRVKXCTJSA-N 0.000 description 6
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 6
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 6
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 6
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 6
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 6
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 6
- LGMBKOAPPTYKLC-JYJNAYRXSA-N Pro-Phe-Arg Chemical compound C([C@@H](C(=O)N[C@@H](CCCNC(=N)N)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 LGMBKOAPPTYKLC-JYJNAYRXSA-N 0.000 description 6
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 6
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 6
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 6
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 6
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 6
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 6
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 6
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 6
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 6
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 6
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 6
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 6
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 6
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 6
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 6
- QPPYAWVLAVXISR-DCAQKATOSA-N Ser-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QPPYAWVLAVXISR-DCAQKATOSA-N 0.000 description 6
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 6
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 6
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 6
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 6
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 6
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 6
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 6
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 6
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 6
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 6
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 6
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 6
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 6
- KLCCPYZXGXHAGS-QTKMDUPCSA-N Thr-His-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N)O KLCCPYZXGXHAGS-QTKMDUPCSA-N 0.000 description 6
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 6
- FIFDDJFLNVAVMS-RHYQMDGZSA-N Thr-Leu-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O FIFDDJFLNVAVMS-RHYQMDGZSA-N 0.000 description 6
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 6
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 6
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 6
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 6
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 6
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 6
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 6
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 6
- UUIYFDAWNBSWPG-IHPCNDPISA-N Trp-Lys-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N UUIYFDAWNBSWPG-IHPCNDPISA-N 0.000 description 6
- IKUMWSDCGQVGHC-UMPQAUOISA-N Trp-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O IKUMWSDCGQVGHC-UMPQAUOISA-N 0.000 description 6
- RNDWCRUOGGQDKN-UBHSHLNASA-N Trp-Ser-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RNDWCRUOGGQDKN-UBHSHLNASA-N 0.000 description 6
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 6
- UXUFNBVCPAWACG-SIUGBPQLSA-N Tyr-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N UXUFNBVCPAWACG-SIUGBPQLSA-N 0.000 description 6
- WZQZUVWEPMGIMM-JYJNAYRXSA-N Tyr-Gln-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WZQZUVWEPMGIMM-JYJNAYRXSA-N 0.000 description 6
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 6
- JJNXZIPLIXIGBX-HJPIBITLSA-N Tyr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JJNXZIPLIXIGBX-HJPIBITLSA-N 0.000 description 6
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 6
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 6
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 6
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 6
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 6
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 6
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 6
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 6
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 6
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 6
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 6
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 6
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 6
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 6
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 6
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 6
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 6
- SVFRYKBZHUGKLP-QXEWZRGKSA-N Val-Met-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVFRYKBZHUGKLP-QXEWZRGKSA-N 0.000 description 6
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 6
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 6
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 6
- IRAUYEAFPFPVND-UVBJJODRSA-N Val-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 IRAUYEAFPFPVND-UVBJJODRSA-N 0.000 description 6
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 6
- 108010011559 alanylphenylalanine Proteins 0.000 description 6
- 108010087924 alanylproline Proteins 0.000 description 6
- 108010013835 arginine glutamate Proteins 0.000 description 6
- 108010077245 asparaginyl-proline Proteins 0.000 description 6
- 108010047857 aspartylglycine Proteins 0.000 description 6
- 238000013461 design Methods 0.000 description 6
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 6
- 108010079547 glutamylmethionine Proteins 0.000 description 6
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 6
- 108010020688 glycylhistidine Proteins 0.000 description 6
- 108010015792 glycyllysine Proteins 0.000 description 6
- 108010081551 glycylphenylalanine Proteins 0.000 description 6
- 108010092114 histidylphenylalanine Proteins 0.000 description 6
- 108010027338 isoleucylcysteine Proteins 0.000 description 6
- 108010057821 leucylproline Proteins 0.000 description 6
- 108010012058 leucyltyrosine Proteins 0.000 description 6
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 6
- 108010064235 lysylglycine Proteins 0.000 description 6
- 108010056582 methionylglutamic acid Proteins 0.000 description 6
- 108010034507 methionyltryptophan Proteins 0.000 description 6
- 108010012581 phenylalanylglutamate Proteins 0.000 description 6
- 108010051242 phenylalanylserine Proteins 0.000 description 6
- 108010077112 prolyl-proline Proteins 0.000 description 6
- 108091008146 restriction endonucleases Proteins 0.000 description 6
- 108010048818 seryl-histidine Proteins 0.000 description 6
- 108010071207 serylmethionine Proteins 0.000 description 6
- 238000002741 site-directed mutagenesis Methods 0.000 description 6
- 108010061238 threonyl-glycine Proteins 0.000 description 6
- 108010003137 tyrosyltyrosine Proteins 0.000 description 6
- 108010000998 wheylin-2 peptide Proteins 0.000 description 6
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 5
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 5
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 5
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 5
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 5
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 5
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 5
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 5
- 239000002253 acid Substances 0.000 description 5
- 235000004279 alanine Nutrition 0.000 description 5
- 235000021342 arachidonic acid Nutrition 0.000 description 5
- 229940114079 arachidonic acid Drugs 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 5
- 101150090781 cyp102A1 gene Proteins 0.000 description 5
- 238000004520 electroporation Methods 0.000 description 5
- 230000002708 enhancing effect Effects 0.000 description 5
- 150000003278 haem Chemical class 0.000 description 5
- 230000001965 increasing effect Effects 0.000 description 5
- 239000012528 membrane Substances 0.000 description 5
- 239000002207 metabolite Substances 0.000 description 5
- 238000003032 molecular docking Methods 0.000 description 5
- 108040007629 peroxidase activity proteins Proteins 0.000 description 5
- 235000018102 proteins Nutrition 0.000 description 5
- 230000035484 reaction time Effects 0.000 description 5
- 235000019333 sodium laurylsulphate Nutrition 0.000 description 5
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 4
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 4
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 108700010070 Codon Usage Proteins 0.000 description 4
- 102000002004 Cytochrome P-450 Enzyme System Human genes 0.000 description 4
- 241000196324 Embryophyta Species 0.000 description 4
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 4
- XEEYBQQBJWHFJM-UHFFFAOYSA-N Iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 4
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 4
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 4
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 4
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 4
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 4
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 4
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 4
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 4
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 4
- 101710198130 NADPH-cytochrome P450 reductase Proteins 0.000 description 4
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 4
- 239000004365 Protease Substances 0.000 description 4
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 4
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 4
- 239000004473 Threonine Substances 0.000 description 4
- 238000009825 accumulation Methods 0.000 description 4
- DTOSIQBPPRVQHS-PDBXOOCHSA-N alpha-linolenic acid Chemical compound CC\C=C/C\C=C/C\C=C/CCCCCCCC(O)=O DTOSIQBPPRVQHS-PDBXOOCHSA-N 0.000 description 4
- 238000005119 centrifugation Methods 0.000 description 4
- OSASVXMJTNOKOY-UHFFFAOYSA-N chlorobutanol Chemical compound CC(C)(O)C(Cl)(Cl)Cl OSASVXMJTNOKOY-UHFFFAOYSA-N 0.000 description 4
- 239000012470 diluted sample Substances 0.000 description 4
- 235000013922 glutamic acid Nutrition 0.000 description 4
- 239000004220 glutamic acid Substances 0.000 description 4
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 4
- 238000009776 industrial production Methods 0.000 description 4
- 238000003780 insertion Methods 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 4
- 229960000310 isoleucine Drugs 0.000 description 4
- 238000002156 mixing Methods 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 230000001766 physiological effect Effects 0.000 description 4
- 239000003755 preservative agent Substances 0.000 description 4
- 235000019419 proteases Nutrition 0.000 description 4
- WDGFFVCWBZVLCE-UHFFFAOYSA-N purpurogallin Chemical compound C1=CC=C(O)C(=O)C2=C1C=C(O)C(O)=C2O WDGFFVCWBZVLCE-UHFFFAOYSA-N 0.000 description 4
- WQGWDDDVZFFDIG-UHFFFAOYSA-N pyrogallol Chemical compound OC1=CC=CC(O)=C1O WQGWDDDVZFFDIG-UHFFFAOYSA-N 0.000 description 4
- 235000008521 threonine Nutrition 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- 239000004474 valine Substances 0.000 description 4
- RGZIXZYRGZWDMI-QXBXTPPVSA-N 14,15-EpETE Chemical compound CC\C=C/CC1OC1C\C=C/C\C=C/C\C=C/CCCC(O)=O RGZIXZYRGZWDMI-QXBXTPPVSA-N 0.000 description 3
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 3
- 239000004475 Arginine Substances 0.000 description 3
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 3
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 239000004471 Glycine Substances 0.000 description 3
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 3
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 3
- 239000006142 Luria-Bertani Agar Substances 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 3
- DNIAPMSPPWPWGF-UHFFFAOYSA-N Propylene glycol Chemical compound CC(O)CO DNIAPMSPPWPWGF-UHFFFAOYSA-N 0.000 description 3
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 3
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 3
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 3
- 238000002835 absorbance Methods 0.000 description 3
- 230000002378 acidificating effect Effects 0.000 description 3
- 239000000654 additive Substances 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 210000001557 animal structure Anatomy 0.000 description 3
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 3
- 235000009582 asparagine Nutrition 0.000 description 3
- 229960001230 asparagine Drugs 0.000 description 3
- 235000003704 aspartic acid Nutrition 0.000 description 3
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 3
- 239000005515 coenzyme Substances 0.000 description 3
- 238000012790 confirmation Methods 0.000 description 3
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 3
- 235000018417 cysteine Nutrition 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 230000002255 enzymatic effect Effects 0.000 description 3
- 238000006911 enzymatic reaction Methods 0.000 description 3
- 239000008103 glucose Substances 0.000 description 3
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 3
- 235000004554 glutamine Nutrition 0.000 description 3
- 230000005764 inhibitory process Effects 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 229930182817 methionine Natural products 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 235000020777 polyunsaturated fatty acids Nutrition 0.000 description 3
- 230000006798 recombination Effects 0.000 description 3
- 238000011084 recovery Methods 0.000 description 3
- 235000004400 serine Nutrition 0.000 description 3
- 235000002374 tyrosine Nutrition 0.000 description 3
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- FPRKGXIOSIUDSE-SYACGTDESA-N (2z,4z,6z,8z)-docosa-2,4,6,8-tetraenoic acid Chemical compound CCCCCCCCCCCCC\C=C/C=C\C=C/C=C\C(O)=O FPRKGXIOSIUDSE-SYACGTDESA-N 0.000 description 2
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 2
- AVSRCHDOUZLGHP-UHFFFAOYSA-N 18-(3-ethyloxiran-2-yl)octadeca-2,4,6,8-tetraenoic acid Chemical compound CCC1OC1CCCCCCCCCC=CC=CC=CC=CC(O)=O AVSRCHDOUZLGHP-UHFFFAOYSA-N 0.000 description 2
- SQEKYOLLXDEHLF-LJFNSOSFSA-N 19-HEPE Chemical compound CC(O)\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O SQEKYOLLXDEHLF-LJFNSOSFSA-N 0.000 description 2
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 2
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 2
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 2
- CIWBSHSKHKDKBQ-JLAZNSOCSA-N Ascorbic acid Chemical compound OC[C@H](O)[C@H]1OC(=O)C(O)=C1O CIWBSHSKHKDKBQ-JLAZNSOCSA-N 0.000 description 2
- 241000193830 Bacillus <bacterium> Species 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- MYMOFIZGZYHOMD-UHFFFAOYSA-N Dioxygen Chemical compound O=O MYMOFIZGZYHOMD-UHFFFAOYSA-N 0.000 description 2
- 235000021292 Docosatetraenoic acid Nutrition 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 2
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 2
- 108090001060 Lipase Proteins 0.000 description 2
- 239000004367 Lipase Substances 0.000 description 2
- 102000004882 Lipase Human genes 0.000 description 2
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 239000012901 Milli-Q water Substances 0.000 description 2
- 108010014251 Muramidase Proteins 0.000 description 2
- 102000016943 Muramidase Human genes 0.000 description 2
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 2
- 239000004677 Nylon Substances 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- 102000035195 Peptidases Human genes 0.000 description 2
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 2
- LCTONWCANYUPML-UHFFFAOYSA-N Pyruvic acid Chemical compound CC(=O)C(O)=O LCTONWCANYUPML-UHFFFAOYSA-N 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 229930006000 Sucrose Natural products 0.000 description 2
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 2
- 238000010521 absorption reaction Methods 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- 238000007792 addition Methods 0.000 description 2
- 108090000637 alpha-Amylases Proteins 0.000 description 2
- 235000020661 alpha-linolenic acid Nutrition 0.000 description 2
- 210000004102 animal cell Anatomy 0.000 description 2
- 229960000686 benzalkonium chloride Drugs 0.000 description 2
- CADWTSSKOVRVJC-UHFFFAOYSA-N benzyl(dimethyl)azanium;chloride Chemical compound [Cl-].C[NH+](C)CC1=CC=CC=C1 CADWTSSKOVRVJC-UHFFFAOYSA-N 0.000 description 2
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 2
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 2
- 239000001506 calcium phosphate Substances 0.000 description 2
- 229910000389 calcium phosphate Inorganic materials 0.000 description 2
- 235000011010 calcium phosphates Nutrition 0.000 description 2
- 229910052799 carbon Inorganic materials 0.000 description 2
- 229960004926 chlorobutanol Drugs 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 235000021323 fish oil Nutrition 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 229910052742 iron Inorganic materials 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 239000008101 lactose Substances 0.000 description 2
- 229960004488 linolenic acid Drugs 0.000 description 2
- 235000019421 lipase Nutrition 0.000 description 2
- 238000001638 lipofection Methods 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 239000006166 lysate Substances 0.000 description 2
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 2
- 239000004325 lysozyme Substances 0.000 description 2
- 229960000274 lysozyme Drugs 0.000 description 2
- 235000010335 lysozyme Nutrition 0.000 description 2
- 230000002503 metabolic effect Effects 0.000 description 2
- LXCFILQKKLGQFO-UHFFFAOYSA-N methylparaben Chemical compound COC(=O)C1=CC=C(O)C=C1 LXCFILQKKLGQFO-UHFFFAOYSA-N 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 239000002773 nucleotide Substances 0.000 description 2
- 125000003729 nucleotide group Chemical group 0.000 description 2
- 229920001778 nylon Polymers 0.000 description 2
- 239000003921 oil Substances 0.000 description 2
- 235000019198 oils Nutrition 0.000 description 2
- 238000007254 oxidation reaction Methods 0.000 description 2
- 239000000546 pharmaceutical excipient Substances 0.000 description 2
- 235000021317 phosphate Nutrition 0.000 description 2
- 150000003013 phosphoric acid derivatives Chemical class 0.000 description 2
- 102000040430 polynucleotide Human genes 0.000 description 2
- 108091033319 polynucleotide Proteins 0.000 description 2
- 239000002157 polynucleotide Substances 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 235000019833 protease Nutrition 0.000 description 2
- 229940079877 pyrogallol Drugs 0.000 description 2
- 239000002994 raw material Substances 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 238000007480 sanger sequencing Methods 0.000 description 2
- 238000004088 simulation Methods 0.000 description 2
- 239000003381 stabilizer Substances 0.000 description 2
- 239000004094 surface-active agent Substances 0.000 description 2
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 2
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 description 1
- HPSWUFMMLKGKDS-DNKOKRCQSA-N (2e,4e,6e,8e,10e,12e)-tetracosa-2,4,6,8,10,12-hexaenoic acid Chemical compound CCCCCCCCCCC\C=C\C=C\C=C\C=C\C=C\C=C\C(O)=O HPSWUFMMLKGKDS-DNKOKRCQSA-N 0.000 description 1
- KXVFBCSUGDNXQF-DZDBOGACSA-N (2z,4z,6z,8z,10z)-tetracosa-2,4,6,8,10-pentaenoic acid Chemical compound CCCCCCCCCCCCC\C=C/C=C\C=C/C=C\C=C/C(O)=O KXVFBCSUGDNXQF-DZDBOGACSA-N 0.000 description 1
- OSXOPUBJJDUAOJ-MBYQGORISA-N (4Z,7Z,10Z,13Z,16Z)-19,20-epoxydocosapentaenoic acid Chemical compound CCC1OC1C\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCC(O)=O OSXOPUBJJDUAOJ-MBYQGORISA-N 0.000 description 1
- WURBVZBTWMNKQT-UHFFFAOYSA-N 1-(4-chlorophenoxy)-3,3-dimethyl-1-(1,2,4-triazol-1-yl)butan-2-one Chemical compound C1=NC=NN1C(C(=O)C(C)(C)C)OC1=CC=C(Cl)C=C1 WURBVZBTWMNKQT-UHFFFAOYSA-N 0.000 description 1
- GZCWLCBFPRFLKL-UHFFFAOYSA-N 1-prop-2-ynoxypropan-2-ol Chemical compound CC(O)COCC#C GZCWLCBFPRFLKL-UHFFFAOYSA-N 0.000 description 1
- 102100031126 6-phosphogluconolactonase Human genes 0.000 description 1
- 108010029731 6-phosphogluconolactonase Proteins 0.000 description 1
- 108010051457 Acid Phosphatase Proteins 0.000 description 1
- 102000013563 Acid Phosphatase Human genes 0.000 description 1
- 108091005508 Acid proteases Proteins 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 102000004400 Aminopeptidases Human genes 0.000 description 1
- 108090000915 Aminopeptidases Proteins 0.000 description 1
- 239000004382 Amylase Substances 0.000 description 1
- 108010065511 Amylases Proteins 0.000 description 1
- 102000013142 Amylases Human genes 0.000 description 1
- 240000003291 Armoracia rusticana Species 0.000 description 1
- 235000011330 Armoracia rusticana Nutrition 0.000 description 1
- 241000228212 Aspergillus Species 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 108090000145 Bacillolysin Proteins 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 108010077805 Bacterial Proteins Proteins 0.000 description 1
- 108091005658 Basic proteases Proteins 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- BHPQYMZQTOCNFJ-UHFFFAOYSA-N Calcium cation Chemical compound [Ca+2] BHPQYMZQTOCNFJ-UHFFFAOYSA-N 0.000 description 1
- 108010088986 Camphor 5-Monooxygenase Proteins 0.000 description 1
- 239000004215 Carbon black (E152) Substances 0.000 description 1
- 108010059892 Cellulase Proteins 0.000 description 1
- 241000195493 Cryptophyta Species 0.000 description 1
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 1
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 1
- 108010001539 D-lactate dehydrogenase Proteins 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 230000007018 DNA scission Effects 0.000 description 1
- 101710088194 Dehydrogenase Proteins 0.000 description 1
- 108010001682 Dextranase Proteins 0.000 description 1
- 239000004375 Dextrin Substances 0.000 description 1
- 229920001353 Dextrin Polymers 0.000 description 1
- 102100023319 Dihydrolipoyl dehydrogenase, mitochondrial Human genes 0.000 description 1
- 101000925662 Enterobacteria phage PRD1 Endolysin Proteins 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 108090000371 Esterases Proteins 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 229920001917 Ficoll Polymers 0.000 description 1
- OPGOLNDOMSBSCW-CLNHMMGSSA-N Fursultiamine hydrochloride Chemical compound Cl.C1CCOC1CSSC(\CCO)=C(/C)N(C=O)CC1=CN=C(C)N=C1N OPGOLNDOMSBSCW-CLNHMMGSSA-N 0.000 description 1
- 108010093031 Galactosidases Proteins 0.000 description 1
- 102000002464 Galactosidases Human genes 0.000 description 1
- 108010073178 Glucan 1,4-alpha-Glucosidase Proteins 0.000 description 1
- 102100022624 Glucoamylase Human genes 0.000 description 1
- 108010018962 Glucosephosphate Dehydrogenase Proteins 0.000 description 1
- 108010056771 Glucosidases Proteins 0.000 description 1
- 102000004366 Glucosidases Human genes 0.000 description 1
- 108010073324 Glutaminase Proteins 0.000 description 1
- 102000009127 Glutaminase Human genes 0.000 description 1
- 101001122938 Homo sapiens Lysosomal protective protein Proteins 0.000 description 1
- 102000012011 Isocitrate Dehydrogenase Human genes 0.000 description 1
- 108010075869 Isocitrate Dehydrogenase Proteins 0.000 description 1
- 238000012218 Kunkel's method Methods 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- 102000003820 Lipoxygenases Human genes 0.000 description 1
- 108090000128 Lipoxygenases Proteins 0.000 description 1
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 1
- 102100028524 Lysosomal protective protein Human genes 0.000 description 1
- FYYHWMGAXLPEAU-UHFFFAOYSA-N Magnesium Chemical compound [Mg] FYYHWMGAXLPEAU-UHFFFAOYSA-N 0.000 description 1
- 108010026217 Malate Dehydrogenase Proteins 0.000 description 1
- 102000013460 Malate Dehydrogenase Human genes 0.000 description 1
- 102100024295 Maltase-glucoamylase Human genes 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 102000035092 Neutral proteases Human genes 0.000 description 1
- 108091005507 Neutral proteases Proteins 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 101710163270 Nuclease Proteins 0.000 description 1
- KOAIJKRPXNHSHO-UHFFFAOYSA-N OC(CCCCCCCC=CC=CC=CC=CC=CC(=O)O)C Chemical compound OC(CCCCCCCC=CC=CC=CC=CC=CC(=O)O)C KOAIJKRPXNHSHO-UHFFFAOYSA-N 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- 235000004347 Perilla Nutrition 0.000 description 1
- 244000124853 Perilla frutescens Species 0.000 description 1
- 108700020962 Peroxidase Proteins 0.000 description 1
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 1
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 1
- 108010059820 Polygalacturonase Proteins 0.000 description 1
- ZLMJMSJWJFRBEC-UHFFFAOYSA-N Potassium Chemical compound [K] ZLMJMSJWJFRBEC-UHFFFAOYSA-N 0.000 description 1
- 102000004005 Prostaglandin-endoperoxide synthases Human genes 0.000 description 1
- 108090000459 Prostaglandin-endoperoxide synthases Proteins 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 241000589517 Pseudomonas aeruginosa Species 0.000 description 1
- 241000589776 Pseudomonas putida Species 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 235000019764 Soybean Meal Nutrition 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 108700026226 TATA Box Proteins 0.000 description 1
- 108060008539 Transglutaminase Proteins 0.000 description 1
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 102100033220 Xanthine oxidase Human genes 0.000 description 1
- 108010093894 Xanthine oxidase Proteins 0.000 description 1
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 1
- 150000001242 acetic acid derivatives Chemical class 0.000 description 1
- 239000004480 active ingredient Substances 0.000 description 1
- 238000005377 adsorption chromatography Methods 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 238000013019 agitation Methods 0.000 description 1
- 150000001294 alanine derivatives Chemical group 0.000 description 1
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 1
- HDTRYLNUVZCQOY-LIZSDCNHSA-N alpha,alpha-trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 description 1
- 102000004139 alpha-Amylases Human genes 0.000 description 1
- 102000005840 alpha-Galactosidase Human genes 0.000 description 1
- 108010030291 alpha-Galactosidase Proteins 0.000 description 1
- 108010028144 alpha-Glucosidases Proteins 0.000 description 1
- 229940024171 alpha-amylase Drugs 0.000 description 1
- 235000019418 amylase Nutrition 0.000 description 1
- -1 and if necessary Proteins 0.000 description 1
- 230000003266 anti-allergic effect Effects 0.000 description 1
- 230000003110 anti-inflammatory effect Effects 0.000 description 1
- 239000011668 ascorbic acid Substances 0.000 description 1
- 235000010323 ascorbic acid Nutrition 0.000 description 1
- 229960005070 ascorbic acid Drugs 0.000 description 1
- 238000000376 autoradiography Methods 0.000 description 1
- 108010019077 beta-Amylase Proteins 0.000 description 1
- 102000005936 beta-Galactosidase Human genes 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 108010047754 beta-Glucosidase Proteins 0.000 description 1
- 102000006995 beta-Glucosidase Human genes 0.000 description 1
- 239000011942 biocatalyst Substances 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 229940098773 bovine serum albumin Drugs 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 229910001424 calcium ion Inorganic materials 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 238000005251 capillar electrophoresis Methods 0.000 description 1
- 150000001722 carbon compounds Chemical class 0.000 description 1
- 150000004649 carbonic acid derivatives Chemical class 0.000 description 1
- 239000005018 casein Substances 0.000 description 1
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 1
- 235000021240 caseins Nutrition 0.000 description 1
- 239000003054 catalyst Substances 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 229940106157 cellulase Drugs 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 150000001860 citric acid derivatives Chemical class 0.000 description 1
- 238000004440 column chromatography Methods 0.000 description 1
- 230000002301 combined effect Effects 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- 230000009849 deactivation Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 235000019425 dextrin Nutrition 0.000 description 1
- 239000005546 dideoxynucleotide Substances 0.000 description 1
- 229910001882 dioxygen Inorganic materials 0.000 description 1
- 230000036267 drug metabolism Effects 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 108010093305 exopolygalacturonase Proteins 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000003925 fat Substances 0.000 description 1
- 235000019197 fats Nutrition 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 238000004108 freeze drying Methods 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 238000010359 gene isolation Methods 0.000 description 1
- 238000010362 genome editing Methods 0.000 description 1
- 239000008187 granular material Substances 0.000 description 1
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 1
- 229930195733 hydrocarbon Natural products 0.000 description 1
- 150000002430 hydrocarbons Chemical class 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 239000000944 linseed oil Substances 0.000 description 1
- 235000021388 linseed oil Nutrition 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- XIXADJRWDQXREU-UHFFFAOYSA-M lithium acetate Chemical compound [Li+].CC([O-])=O XIXADJRWDQXREU-UHFFFAOYSA-M 0.000 description 1
- 230000002934 lysing effect Effects 0.000 description 1
- 230000002101 lytic effect Effects 0.000 description 1
- 239000011777 magnesium Substances 0.000 description 1
- 229910052749 magnesium Inorganic materials 0.000 description 1
- WPBNNNQJVZRUHP-UHFFFAOYSA-L manganese(2+);methyl n-[[2-(methoxycarbonylcarbamothioylamino)phenyl]carbamothioyl]carbamate;n-[2-(sulfidocarbothioylamino)ethyl]carbamodithioate Chemical compound [Mn+2].[S-]C(=S)NCCNC([S-])=S.COC(=O)NC(=S)NC1=CC=CC=C1NC(=S)NC(=O)OC WPBNNNQJVZRUHP-UHFFFAOYSA-L 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000000691 measurement method Methods 0.000 description 1
- 235000013372 meat Nutrition 0.000 description 1
- 238000010297 mechanical methods and process Methods 0.000 description 1
- 229910021645 metal ion Inorganic materials 0.000 description 1
- 235000010270 methyl p-hydroxybenzoate Nutrition 0.000 description 1
- 239000004292 methyl p-hydroxybenzoate Substances 0.000 description 1
- 229960002216 methylparaben Drugs 0.000 description 1
- 235000013379 molasses Nutrition 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000005817 monooxygenase reaction Methods 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 229910017464 nitrogen compound Inorganic materials 0.000 description 1
- 150000002830 nitrogen compounds Chemical class 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 239000006072 paste Substances 0.000 description 1
- YIYBQIKDCADOSF-UHFFFAOYSA-N pent-2-enoic acid Chemical compound CCC=CC(O)=O YIYBQIKDCADOSF-UHFFFAOYSA-N 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 229960003742 phenol Drugs 0.000 description 1
- WVDDGKGOMKODPV-ZQBYOMGUSA-N phenyl(114C)methanol Chemical compound O[14CH2]C1=CC=CC=C1 WVDDGKGOMKODPV-ZQBYOMGUSA-N 0.000 description 1
- 239000002504 physiological saline solution Substances 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 1
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 1
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 1
- 239000011591 potassium Substances 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 229940107700 pyruvic acid Drugs 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 235000019515 salmon Nutrition 0.000 description 1
- 238000005185 salting out Methods 0.000 description 1
- 229930000044 secondary metabolite Natural products 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 239000002002 slurry Substances 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 239000000600 sorbitol Substances 0.000 description 1
- 239000004455 soybean meal Substances 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 238000001694 spray drying Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- JIWBIWFOSCKQMA-UHFFFAOYSA-N stearidonic acid Natural products CCC=CCC=CCC=CCC=CCCCCC(O)=O JIWBIWFOSCKQMA-UHFFFAOYSA-N 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 150000003467 sulfuric acid derivatives Chemical class 0.000 description 1
- 239000000375 suspending agent Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000010998 test method Methods 0.000 description 1
- 125000003831 tetrazolyl group Chemical group 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 102000003601 transglutaminase Human genes 0.000 description 1
- 229910021642 ultra pure water Inorganic materials 0.000 description 1
- 239000012498 ultrapure water Substances 0.000 description 1
- 238000002604 ultrasonography Methods 0.000 description 1
- 238000001291 vacuum drying Methods 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
- 239000011701 zinc Substances 0.000 description 1
- 229910052725 zinc Inorganic materials 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N5/00—Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
- C12N5/10—Cells modified by introduction of foreign genetic material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/64—Fats; Fatty oils; Ester-type waxes; Higher fatty acids, i.e. having at least seven carbon atoms in an unbroken chain bound to a carboxyl group; Oxidised oils or fats
- C12P7/6409—Fatty acids
Definitions
- the present invention relates to polypeptides, oxygenases, and their applications. Specifically, the present invention relates to oxygenases modified from cytochrome P450 monooxygenases and their applications.
- Cytochrome P450 monooxygenase (hereinafter also referred to as P450) is a monooxygenating enzyme possessed by a wide range of biological species ranging from bacteria to plants and mammals.
- P450 derived from bacteria is known to have higher catalytic activity than P450 derived from eukaryotes.
- P450cam derived from Pseudomonas aeruginosa and P450-BM3 derived from Bacillus megaterium, which is a type of bacterium belonging to the genus Bacillus, are known.
- P450 derived from bacteria has high catalytic activity to epoxidize (hydroxylate) substrates such as fatty acids.
- epoxidize substrates such as fatty acids.
- P450s are suitable as biocatalysts.
- wild-type P450 does not necessarily have high catalytic activity in epoxidizing industrially useful hydrocarbon substrates.
- various mutant P450 types have been proposed (eg, Patent Documents 1 to 3, Non-Patent Documents 1 to 4).
- Polyunsaturated fatty acids such as arachidonic acid, eicosapentaenoic acid, and docosahexaenoic acid are metabolized in vivo into various compounds and exhibit various physiological effects in each tissue.
- the cyclooxygenase system, lipoxygenase system, and P450 system are known as arachidonic acid metabolic systems, and arachidonic acid is metabolized into many compounds in these systems.
- the P450 system involves epoxidation of the double bonds of highly unsaturated fatty acids.
- polyunsaturated fatty acid metabolites produced by epoxidizing the ⁇ 3 position of highly unsaturated fatty acids are important substances for anti-inflammatory and anti-allergic effects in vivo (Non-Patent Document 5).
- Non-Patent Document 6 Even in vitro, bacterial-derived P450 catalyzes epoxidation at the ⁇ 3 position by reacting with highly unsaturated fatty acids (Non-Patent Document 6). On the other hand, a method of epoxidizing highly unsaturated fatty acids using a chemical oxidation reaction is also known. In this case, a method of reacting with hydrogen peroxide in the presence of lipase has been reported (Patent Document 4).
- an object of the present invention is to provide a modified oxygenase that specifically epoxidizes only the ⁇ 3 position of highly unsaturated fatty acids.
- the present inventors succeeded in epoxidizing only the ⁇ 3 position of polyunsaturated fatty acids by introducing a specific mutation at a predetermined position in the amino acid sequence of oxygenase.
- the present invention has the following configuration.
- a polypeptide shown in any of the following (1) to (3) (1) A polypeptide consisting of an amino acid sequence containing one or more amino acid substitutions selected from the group consisting of F87K/I/H, A330V, P25L, and T438M in the amino acid sequence shown in SEQ ID NO: 2; (2) In the amino acid sequence described in (1) above, one or several amino acid residues are substituted, added, inserted, or deleted, and has a catalytic activity to epoxidize the ⁇ 3 position of ⁇ 3 fatty acids.
- polypeptide according to [1] or [2], which is an oxygenase [4] An enzyme agent for epoxidizing the ⁇ 3-position of ⁇ 3 fatty acids, comprising the polypeptide according to any one of [1] to [3]. [5] The enzyme for epoxidizing the ⁇ 3 position according to [4], further comprising one or more enzymes selected from the group consisting of a NADPH regenerating enzyme, an enzyme that removes active oxygen, and an enzyme that removes hydrogen peroxide. agent.
- An expression cassette comprising the DNA according to [6].
- [8] A recombinant vector comprising the DNA according to [6].
- a method for producing oxygenase comprising the step of culturing the transformant according to [9].
- a method for epoxidizing the ⁇ 3 position comprising the step of causing the polypeptide according to any one of [1] to [3] or the enzyme agent for epoxidizing the ⁇ 3 position according to [4] or [5] to act on the ⁇ 3 fatty acid.
- Method for producing epoxidized fatty acids [12]
- An ⁇ 3 fatty acid comprising the step of causing the polypeptide according to any one of [1] to [3] or the enzyme agent for epoxidizing the ⁇ 3 position according to [4] or [5] to act on an ⁇ 3 fatty acid.
- Method for epoxidizing fatty acids [13] A composition containing an ⁇ 3-position epoxidized fatty acid obtained by the production method of [11].
- the present invention it is possible to obtain a modified oxygenase that specifically epoxidizes only the ⁇ 3 position of highly unsaturated fatty acids. Further, according to the present invention, there are provided a method for producing an ⁇ 3-epoxidized fatty acid in which only the ⁇ 3 position is specifically epoxidized, and an ⁇ 3-epoxidized fatty acid produced from the production method.
- FIG. 1 shows the detection of 17,18-EpETE (17,18-epoxyeicosatetraenoic acid) produced from the substrate EPA (eicosapentaenoic acid) when using mutant enzymes (F87K, F87I, F87H).
- FIG. 2 shows the results of HPLC analysis of an epoxidized compound produced from the substrate EPA (eicosapentaenoic acid).
- FIG. 3 shows the results of HPLC analysis of the epoxidized compound produced from the substrate EPA (eicosapentaenoic acid).
- FIG. 4 shows the results of HPLC analysis of the epoxidized compound produced from the substrate EPA (eicosapentaenoic acid).
- FIG. 2 shows the results of HPLC analysis of an epoxidized compound produced from the substrate EPA (eicosapentaenoic acid).
- FIG. 3 shows the results of HPLC analysis of the epoxidized compound produced from the substrate EPA (eicos
- FIG. 5 shows the results of HPLC analysis of the epoxidized compound produced from the substrate EPA (eicosapentaenoic acid).
- FIG. 6 shows the results of HPLC analysis of the epoxidized compound produced from the substrate DHA (docosahexaenoic acid).
- FIG. 7 shows the results of HPLC analysis of the epoxidized compound produced from the substrate DHA (docosahexaenoic acid).
- FIG. 8 shows the results of HPLC analysis of the epoxidized compound produced from the substrate DHA (docosahexaenoic acid).
- FIG. 9 shows the results of HPLC analysis of the epoxidized compound produced from the substrate DPA (docosapentaenoic acid).
- FIG. 10 shows the results of HPLC analysis of the epoxidized compound produced from the substrate DPA (docosapentaenoic acid).
- FIG. 11 shows the results of HPLC analysis of the epoxidized compound produced from the substrate DPA (docosapentaenoic acid).
- glycine (Gly) is G
- alanine (Ala) is A
- valine (Val) is V
- leucine (Leu) is L
- isoleucine (Ile) is I
- phenylalanine (Phe) is F
- tyrosine (Tyr) is Y
- tryptophan (Trp) is W
- serine (Ser) is S
- threonine (Thr) is T
- cysteine (Cys) is C
- methionine (Met) is M
- aspartic acid (Asp) is D
- glutamic acid (Glu) is E
- asparagine (Asn) is N
- glutamine (Gln) is Q
- lysine (Lys) is K
- arginine (Arg) is R
- histidine (His) is H
- proline (Pro) is P.
- nonpolar amino acids include alanine, valine, leucine, isoleucine, proline, methionine, phenylalanine, and tryptophan.
- Uncharged amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine and glutamine.
- Acidic amino acids include aspartic acid and glutamic acid.
- Basic amino acids include lysine, arginine and histidine.
- substitution refers not only to artificially introduced substitutions of amino acid residues, but also to cases where amino acid residue substitutions are naturally introduced, that is, when amino acid residues are originally different. This also includes cases where In this specification, substitution of amino acid residues may be artificial or natural substitutions, but artificial substitutions are preferred.
- polypeptide (oxygenase) The present invention relates to a polypeptide shown in any one of (1) to (3) below.
- P25L indicates that the amino acid residue P at position 25 in SEQ ID NO: 2 is substituted with L
- T438M indicates that the residue A is substituted with V. This shows that T, the amino acid residue at position 438 in No. 2, is substituted with M.
- polypeptides (1) to (3) above include not only polypeptides obtained by artificial substitution, but also polypeptides naturally substituted to have such amino acid sequences.
- amino acid substitutions are defined as a one-letter abbreviation representing the amino acid residue before the substitution, a number representing the position of the amino acid residue where the amino acid substitution occurs (position from the N-terminus in a specific amino acid sequence), and a number after the substitution. It is expressed as a combination of one-letter abbreviations representing the amino acid residues. For example, if phenylalanine at position 87 is substituted with lysine, it is expressed as "F87K.”
- the amino acid residue after substitution may be any one of a plurality of types, a one-letter abbreviation representing the amino acid residue after substitution is also written using the symbol "/".
- F87K/I/H means that phenylalanine at position 87 is substituted with lysine, isoleucine, or histidine.
- the symbol "-" is used when representing a combination (combination) of two or more substitutions.
- a combination of substitution of phenylalanine at position 87 with lysine and substitution of alanine at position 330 with valine is expressed as "F87K-A330V".
- the polypeptide shown in any one of (1) to (3) above has a catalytic activity that specifically epoxidizes the ⁇ 3 position of a highly unsaturated fatty acid ( ⁇ 3 fatty acid) than the polypeptide of SEQ ID NO: 2.
- the present inventors designed the enzyme by molecular reaction simulation of the enzyme-substrate reaction based on molecular dynamics (MD) calculations based on the three-dimensional structure information of P450, and created, cultured, and evaluated mutant recombinant P450 based on the design. As a result, we succeeded in obtaining a modified oxygenase (P450) with improved specificity of epoxidation at the ⁇ 3 position.
- MD molecular dynamics
- the polypeptide of SEQ ID NO: 2 is a cytochrome P450 monooxygenase derived from Bacillus megaterium.
- the polypeptide of the present invention is preferably a modified oxygenase, more preferably a modified monooxygenase, and even more preferably a modified monooxygenase derived from cytochrome P450 monooxygenase derived from Bacillus megaterium.
- monooxygenase is an enzyme that has a catalytic action to introduce one atom of oxygen into a compound using an oxygen molecule as a substrate, and uses heme, non-heme iron, or a metal ion as a cofactor.
- monooxygenase may further have other actions as long as the above action is the main activity.
- a polypeptide of the invention may be part of an oxygenase, and the polypeptide or oxygenase may be part of a larger protein (eg, a fusion protein).
- Sequences added in the fusion protein include, for example, sequences useful for purification such as multiple histidine residues, additional sequences that ensure stability during recombinant production, and the like.
- the polypeptide of the present invention consists of an amino acid sequence containing at least one or more amino acid substitutions selected from the group consisting of F87K/I/H, A330V, P25L, and T438M in the amino acid sequence shown in SEQ ID NO: 2. It is.
- the polypeptide of the present invention preferably contains at least an amino acid substitution of F87K, F87I or F87H, more preferably contains an amino acid substitution of F87K or F87I, and particularly preferably contains an amino acid substitution of F87K.
- the polypeptide of the present invention includes an amino acid substitution of F87K/I/H in the amino acid sequence shown in SEQ ID NO: 2, and an amino acid sequence containing one or more amino acid substitutions selected from the group consisting of A330V, P25L, and T438M. It is preferable that it consists of.
- a more preferred embodiment is a polypeptide consisting of the amino acid sequence shown in SEQ ID NO: 2, which includes an amino acid substitution of F87K and includes one or more amino acid substitutions selected from the group consisting of A330V, P25L, and T438M. .
- Specific embodiments include a polypeptide consisting of an amino acid sequence containing an amino acid substitution of F87K-A330V, a polypeptide consisting of an amino acid sequence containing an amino acid substitution of F87K-P25L, and an amino acid sequence of F87K-T438M in the amino acid sequence shown in SEQ ID NO: 2.
- a polypeptide consisting of an amino acid sequence containing a substitution a polypeptide consisting of an amino acid sequence containing an amino acid substitution of F87K-A330V-P25L, a polypeptide consisting of an amino acid sequence containing an amino acid substitution of F87K-A330V-T438M, a polypeptide consisting of an amino acid sequence containing an amino acid substitution of F87K-P25L-T438M, Examples include a polypeptide consisting of an amino acid sequence containing an amino acid substitution, and a polypeptide consisting of an amino acid sequence containing the amino acid substitution F87K-A330V-P25L-T438M.
- a polypeptide consisting of an amino acid sequence containing double amino acid substitutions at least in F87K and A330V in the amino acid sequence shown in SEQ ID NO: 2 is preferred.
- a polypeptide consisting of an amino acid sequence containing a triple amino acid substitution that also has a mutation in P25L or T438M is also preferable, particularly an amino acid sequence containing an amino acid substitution of F87K-A330V-T438M.
- a polypeptide consisting of this sequence is preferable because it tends to exhibit catalytic activity that specifically epoxidizes the ⁇ 3 position.
- amino acid residues other than the 87th, 330th, 25th, and 438th amino acid residues of SEQ ID NO: 2 are referred to as "arbitrary different sites.” It may also be written.
- an "arbitrary difference site” is a site where differences are allowed as long as they do not significantly affect the properties of the polypeptide.
- the catalytic activity for specifically epoxidizing the ⁇ 3 position of ⁇ 3 fatty acids is improved compared to the polypeptide consisting of the amino acid sequence shown in SEQ ID NO: 2.
- the polypeptide is referred to as a variant of the polypeptide (1) above.
- the polypeptides (2) and (3) above are different from the polypeptide (1) above. Further, it is preferable that the different polypeptide has substantially the same properties as the polypeptide of (1) above, although there is a difference in the amino acid sequence at any different site. Note that the fact that the properties of the polypeptides are substantially the same means that the catalytic activity for specifically epoxidizing the ⁇ 3 position of the ⁇ 3 fatty acid is equivalent.
- the amino acid modification introduced at any different site may be one type of modification (for example, only substitution) selected from substitution, addition, insertion, and deletion, or two types of modification may be selected from substitution, addition, insertion, and deletion.
- the above modifications (for example, substitutions and insertions) may be made.
- the number of amino acid differences at any difference site may be one or several. Examples of the number of differences include 1 to 80, 1 to 70, 1 to 60, 1 to 50, 1 to 40, 1 to 30, 1 to 20, 1 to 10, and 1 to 8. 1 to 7, 1 to 6, 1 to 5, 1 to 4, 1 to 3, or 1 to 2.
- sequence identity to the amino acid sequence described in (1) above is preferably 60% or more, more preferably 70% or more, and still more preferably 80% or more. It is preferably 90% or more, even more preferably 95% or more, particularly preferably 97% or more, and most preferably 99% or more.
- sequence identity with the amino acid sequence described in (1) above is the sequence identity calculated by comparing it with the amino acid sequence described in (1) above.
- sequence identity herein refers to BLASTPACKAGE [sgi32 bit edition, Version 2.0.12; available from National Center for Biotechnology Information (NCBI)] bl2seq program (Tatiana A. Tatsusova, Thomas L. Madden, FEMS Microbiol. Lett., Vol. 174, p247-250, 1999). The parameters are calculated by setting Gap insertion Cost value: 11 and Gap extension Cost value: 1.
- amino acid substitutions in polypeptides in (2) and (3) above include, for example, if the amino acid before substitution is a non-polar amino acid, substitution with another non-polar amino acid, and even if the amino acid before substitution is a non-charged amino acid. For example, substitution with another uncharged amino acid, substitution with another acidic amino acid if the amino acid before substitution is an acidic amino acid, and substitution with another basic amino acid if the amino acid before substitution is a basic amino acid. Can be mentioned.
- the 267th position (glutamic acid), the 268th position (threonine), the 264th position (alanine), and the 393rd position (phenylalanine) in the amino acid sequence shown in SEQ ID NO: 2. Since these amino acids play roles in electron transport and oxygen binding and are considered to be important residues for active catalysts, it is desirable not to introduce substitutions or deletions into these positions.
- catalytic activity for epoxidizing the ⁇ 3 position of the ⁇ 3 fatty acid refers to the ⁇ 3 position epoxidation that can be obtained by epoxidizing only the ⁇ 3 position of the ⁇ 3 fatty acid as a substrate. This can be evaluated by detecting converted fatty acids using column chromatography or the like.
- the production rate of ⁇ 3-position epoxidized fatty acids is increased compared to the case where a polypeptide consisting of the amino acid sequence shown in SEQ ID NO: 2 is used, “the ⁇ 3-position of ⁇ 3 fatty acids is It can be said that there is a catalytic activity that increases Furthermore, the production rate of ⁇ 3-epoxidized fatty acids when using the polypeptides (2) and (3) above is equivalent to the production rate of ⁇ 3-epoxidized fatty acids when using the polypeptide (1) above. The above is preferable.
- the production rate of ⁇ 3-position epoxidized fatty acid is 1 when using the polypeptide in (1) above, and when using the polypeptides in (2) and (3) above,
- the production rate of the ⁇ 3-position epoxidized fatty acid is preferably 0.8 to 1.2, more preferably 0.9 to 1.1, and even more preferably 0.95 to 1.05. preferable.
- the production rate of ⁇ 3-position epoxidized fatty acids refers to the mass of ⁇ 3-position epoxidized fatty acids ( molar concentration).
- the polypeptide (oxygenase) of the present invention has high substrate specificity and specifically epoxidizes the ⁇ 3 position of ⁇ 3 fatty acids. Specifically, when the production rate of epoxidized fatty acids at the ⁇ 3 position is 40% or more when the polypeptides (1) to (3) above are applied to eicosapentaenoic acid (EPA) as a substrate, It can be determined that the ⁇ 3 position of the fatty acid is specifically epoxidized. In addition, the production rate of ⁇ 3-position epoxidized fatty acids when the polypeptides (1) to (3) above are made to act on EPA is preferably 40% or more, and more preferably 50% or more.
- EPA eicosapentaenoic acid
- the production rate of the ⁇ 3-position epoxidized fatty acid when the polypeptides (1) to (3) above are allowed to act on EPA may be 100%.
- the present invention may also relate to an enzyme agent for epoxidizing the ⁇ 3 position of a highly unsaturated fatty acid ( ⁇ 3 fatty acid), which includes the above-mentioned polypeptide.
- the enzyme agent for epoxidizing the ⁇ 3 position contains the above-mentioned polypeptide as an active ingredient.
- the enzyme agent may consist of the polypeptide mentioned above.
- the content of the polypeptide in the enzyme preparation of the present invention is not particularly limited, and can be appropriately set within a range that exhibits the catalytic activity of epoxidizing the ⁇ 3 position of the ⁇ 3 fatty acid.
- the enzyme agent of the present invention may contain other components other than the above-mentioned polypeptide to the extent that the effects of the present invention are not affected.
- examples of other components include enzymes other than the above-mentioned polypeptide, additives, culture residues produced by the production method described below, and the like.
- the enzyme agent preferably contains an enzyme other than the above polypeptide, and when epoxidizing the ⁇ 3 position of the ⁇ 3 fatty acid, a concomitant enzyme may be used in addition to the above polypeptide.
- the enzyme agent contains one or more enzymes selected from the group consisting of NADPH regenerating enzymes, enzymes that remove active oxygen, and enzymes that remove hydrogen peroxide. It may be When epoxidizing the ⁇ 3 position of ⁇ 3 fatty acids, the reaction rate can be increased by using the above combined enzyme in addition to the above polypeptide, and the production rate of ⁇ 3 position epoxidized fatty acids can be increased. can.
- the NADPH regenerating enzyme is not particularly limited as long as it is an enzyme capable of converting the coenzyme NADP to NADPH.
- NADPH regenerating enzyme reactivates the enzyme by regenerating the coenzyme NADPH that monooxygenase consumes in the reaction. Therefore, when compared at the same reaction time, it has the effect of improving the conversion rate of the substrate.
- Types of NADPH regenerating enzymes include glucose dehydrogenase, alcohol dehydrogenase, D-lactate dehydrogenase, malate dehydrogenase, isocitrate dehydrogenase, and glucose 6-phosphate dehydrogenase.
- the NADPH regenerating enzyme one type of these NADPH regenerating enzymes may be used, or two or more types may be used in combination.
- NAD(P)-dependent glucose dehydrogenase glucose dehydrogenase derived from Bacillus megaterium.
- NADPH regenerating enzyme can be prepared by a known method.
- it when preparing glucose dehydrogenase derived from Bacillus megaterium, it can be easily prepared by culturing production bacteria and separating glucose dehydrogenase using known means, or by using genetic recombination technology.
- a commercially available product may be used as the NADPH regenerating enzyme.
- Commercially available NADPH regenerating enzymes include glucose dehydrogenase manufactured by Amano Enzyme Co., Ltd., and ADH, rD-LDH, rMDH, rICDH, and rG6PDH manufactured by Oriental Yeast Co., Ltd.
- Enzymes that remove active oxygen and hydrogen peroxide are particularly limited if they are enzymes that disproportionate active oxygen into hydrogen peroxide and molecular oxygen, or enzymes that can convert hydrogen peroxide into water. Not done. Enzymes that remove active oxygen and hydrogen peroxide have the effect of protecting enzymes by decomposing active oxygen and hydrogen peroxide produced by monooxygenase reactions. Therefore, it has the effect of preventing deactivation of monooxygenase and improving the conversion rate of substrate.
- Types of enzymes that remove active oxygen and hydrogen peroxide include catalase, peroxidase, and superoxide dismutase.
- One type of these enzymes that remove active oxygen and hydrogen peroxide may be used, or two or more types may be used in combination.
- catalase and superoxide dismutase are preferred from the viewpoint of further enhancing the effect.
- catalase, peroxidase, and superoxide dismutase derived from animal organs horseradish, Escherichia coli, and the Aspergillus genus can also be used.
- catalases, peroxidases, and superoxide dismutases animal organ-derived catalases and animal-derived superoxide dismutases are preferred from the viewpoint of further enhancing the effect.
- Enzymes that remove active oxygen and hydrogen peroxide can be prepared by known methods. For example, when preparing animal-derived catalase, it can be easily prepared by separating catalase from animal organs using known means, or by using genetic recombination technology. Furthermore, commercially available enzymes may be used to remove active oxygen and hydrogen peroxide. Commercially available enzymes that remove active oxygen and hydrogen peroxide include catalase manufactured by Fujifilm Wako, peroxidase manufactured by Fujifilm Wako, superoxide dismutase manufactured by Sigma-Aldrich, and superoxide dismutase manufactured by Toyobo.
- enzymes that the enzyme agent may contain include amylase ( ⁇ -amylase, ⁇ -amylase, glucoamylase), glucosidase ( ⁇ -glucosidase, ⁇ -glucosidase), galactosidase ( ⁇ -galactosidase, ⁇ -galactosidase), Protease (acid protease, neutral protease, alkaline protease), peptidase (leucine peptidase, aminopeptidase), lipase, esterase, cellulase, phosphatase (acid phosphatase, alkaline phosphatase), nuclease, deaminase, oxidase, dehydrogenase, glutaminase, pectinase, catalase , dextranase, transglutaminase, protein deamidase, pullulanase, peroxidase, superoxid
- additives include excipients, buffers, suspending agents, stabilizers, preservatives, preservatives, physiological saline, and the like.
- Excipients include starch, dextrin, maltose, trehalose, lactose, D-glucose, sorbitol, D-mannitol, white sugar, glycerol, and the like.
- Buffers include phosphates, citrates, acetates, and the like.
- Stabilizers include propylene glycol, ascorbic acid, and the like.
- Preservatives include phenol, benzalkonium chloride, benzyl alcohol, chlorobutanol, methylparaben, and the like.
- preservatives include ethanol, benzalkonium chloride, paraoxybenzoic acid, chlorobutanol, and the like. These additives may be used alone or in combination.
- Examples of the culture residue include components derived from the medium, contaminant proteins, bacterial body components, and the like.
- the form of the enzyme preparation of the present invention is not particularly limited, and examples include liquid form, solid form (powder, granules, etc.), immobilization on a carrier, and the like.
- the present invention relates to DNA shown in any of (1) to (3) below.
- the base sequence shown in SEQ ID NO:3 is the cDNA sequence of a gene encoding a polypeptide consisting of the amino acid sequence of the F87K variant shown in SEQ ID NO:4, and the base sequence shown in SEQ ID NO:5 is the cDNA sequence of the gene encoding the F87K and A330V variant shown in SEQ ID NO:6.
- cDNA sequence of a gene encoding a polypeptide consisting of the amino acid sequence of a heavy mutant the base sequence shown in SEQ ID NO: 7 encodes a polypeptide consisting of the amino acid sequence of the P25L, F87K, and A330V triple mutant shown in SEQ ID NO: 8
- the cDNA sequence of the gene, the nucleotide sequence shown in SEQ ID NO: 9, is the cDNA sequence of the gene encoding a polypeptide consisting of the amino acid sequence of the F87K, A330V, and T438M triple mutant shown in SEQ ID NO: 10.
- the homology of the DNA in (3) above to the DNA in (1) above is preferably 75% or more, more preferably 80% or more, even more preferably 85% or more, and 90% or more. More preferably, it is 95% or more, even more preferably 99% or more. Even if the DNA has a sequence different from the base sequence shown in SEQ ID NO: 3, 5, 7, or 9, it encodes a polypeptide that has a catalytic activity to specifically epoxidize the ⁇ 3 position of ⁇ 3 fatty acids. If so, it can be said to be equivalent to the base sequence shown in any of SEQ ID NOs: 3, 5, 7, and 9, and is included in the DNA of the present invention.
- DNA "homology” is calculated using publicly available or commercially available software with algorithms that compare a reference sequence to a query sequence. Specifically, BLAST, FASTA, GENETYX (manufactured by Software Development Co., Ltd.), etc. can be used, and these can be used by setting default parameters.
- a base sequence encoding a polypeptide in which one or several amino acid residues are substituted, added, inserted, or deleted at any divergent site of the amino acid sequence may also be used to specifically target the ⁇ 3 position of ⁇ 3 fatty acids.
- DNA is included in the present invention as long as it encodes a polypeptide having catalytic activity to epoxidize to.
- DNA that hybridizes under stringent conditions with DNA consisting of a base sequence complementary to the DNA consisting of the base sequence encoding the above-mentioned polypeptide also specifically epoxidizes the ⁇ 3 position of the ⁇ 3 fatty acid.
- it encodes a polypeptide having catalytic activity it is included in the DNA of the present invention.
- a stringent condition for example, the nylon membrane on which the DNA is immobilized is heated using 6x SSC (1x SSC is 8.76 g of sodium chloride and 4.41 g of sodium citrate dissolved in 1 liter of water).
- a DNA library is prepared by connecting DNA obtained from an appropriate gene source to a plasmid or phage vector according to a standard method.
- This library is introduced into a suitable host, the resulting transformant is cultured on a plate, the grown colonies or plaques are transferred to a nitrocellulose or nylon membrane, and the DNA is fixed to the membrane after denaturation treatment.
- This membrane is kept warm under the above stringent conditions in a solution having the above composition containing a probe previously labeled with 32P or the like, and hybridization is carried out.
- a polynucleotide encoding all or part of the amino acid sequence set forth in SEQ ID NO: 2 can be used.
- the DNA of the present invention can also be isolated from a microorganism that produces the above-mentioned predetermined polypeptide.
- a microorganism that produces the above-mentioned predetermined polypeptide.
- PCR or hybridization using primers or probes designed taking into account gene degeneracy based on known amino acid sequence information, or primers or probes designed based on known base sequence information.
- the DNA of interest can be isolated from the genome of the above-mentioned microorganism by the hybridization method.
- the DNA of the present invention includes various types of DNA derived from codon degeneracy. Artificial production of various types of DNA encoding the same amino acid sequence can be easily performed using known genetic engineering techniques. For example, in the production of a protein by genetic engineering, if the codons used in the original gene encoding the target protein are infrequently used in the host, the amount of protein expressed may be low. There is. In such cases, high expression of the target protein can be achieved by optimizing the codon usage frequency for the host without changing the encoded amino acid sequence.
- the total host optimal codon usage frequency for each codon may be adopted.
- the optimal codon is defined as the most frequently used codon among the codons corresponding to the same amino acid.
- the codon usage frequency is not particularly limited as long as it is optimized for the host, and examples of optimal codons for E. coli include the following.
- F phenylalanine (ttt), L: leucine (ctg), I: isoleucine (att), M: methionine (atg), V: valine (gtg), Y: tyrosine (tat), stop codon (taa), H: histidine (cat), Q: glutamine (cag), N: asparagine (aat), K: lysine (aaa), D: aspartic acid (gat), E: glutamic acid (gaa), S: serine (agc), P: Proline (ccg), T: threonine (acc), A: alanine (gcg), C: cysteine (tgc), W: tryptophan (tgg), R: arginine (cgc), G: glycine (ggc).
- Methods for introducing mutations into genes and artificially modifying amino acid sequences include known methods such as the Kunkel method and the Gapped duplex method, and mutation introduction kits using site-directed mutagenesis methods, such as the QuikChangeTM Site.
- Site-directed mutagenesis methods such as the QuikChangeTM Site.
- -Directed Mutagenesis Kit (Stratagene), GeneTailorTM Site-Directed Mutagenesis System (Invitrogen), TaKaRa Site-Directed Mutagenesis s System (Mutan-K, Mutan-Super Express Km, etc.: Takara Bio Inc.), etc.
- Kunkel method and the Gapped duplex method
- mutation introduction kits using site-directed mutagenesis methods such as the QuikChangeTM Site.
- Site-directed mutagenesis Kit (Stratagene), GeneTailorTM Site-Directed Mutagenesis System (Invitrogen), TaKaRa Site-Directed Mutagenesis s System (Mutan
- Confirmation of the DNA base sequence can be performed by sequencing using a conventional method. For example, it can be carried out by the dideoxynucleotide chain termination method (Sanger et al. (1977) Proc. Natl. Acad. Sci. USA 74:5463). Furthermore, the sequence can be analyzed using an appropriate DNA sequencer.
- DNA encoding the polypeptide of interest can be confirmed by comparing the determined base sequence with the base sequence set forth in SEQ ID NO: 1.
- the amino acid sequence deduced from the determined base sequence can be compared with the amino acid sequence set forth in SEQ ID NO: 2. It can also be confirmed by measuring the catalytic activity of the expressed polypeptide (catalytic activity that specifically epoxidizes the ⁇ 3 position of ⁇ 3 fatty acids).
- the present invention may relate to an expression cassette containing the above-mentioned DNA or a recombinant vector containing the above-mentioned DNA.
- the expression cassette or recombinant vector of the present invention can be obtained by linking a promoter and a terminator to the above-mentioned DNA.
- a recombinant vector can be obtained by inserting the expression cassette of the present invention or the DNA of the present invention into an expression vector.
- the expression cassette of the present invention or the recombinant vector of the present invention contains transcription elements such as an enhancer, CCAAT box, TATA box, SPI site, etc. as necessary, in addition to a promoter and a terminator, as control elements. It's okay. These control factors only need to be operably linked to the DNA of the present invention. Operably linked means that various control factors that regulate the DNA of the present invention and the DNA of the present invention are linked in a state in which they can operate in a host cell.
- the expression vector for constructing the recombinant vector of the present invention one constructed for genetic recombination from a phage, plasmid, or virus that can autonomously propagate within a host is suitable.
- Such expression vectors are known.
- commercially available expression vectors include pQE-based vectors (QIAGEN Co., Ltd.), pDR540, pRIT2T (GE Healthcare Bioscience Co., Ltd.), and pET-based vectors. (Merck Co., Ltd.), pHY300PLK, pBE-S (Takara Bio Co., Ltd.), and the like.
- Expression vectors may be used by selecting an appropriate combination with a host cell. For example, when using E.
- coli as a host cell, a combination of a pBAD vector and a DH5 ⁇ E. coli strain, a pET vector and a BL21 (DE3) vector, etc.
- Examples include a combination of E. coli strains, or a combination of pDR540 vector and JM109 E. coli strain.
- the DNA and the expression cassette or expression vector are joined using, for example, DNA ligase.
- the present invention may relate to a transformant having the above-mentioned DNA, the above-mentioned expression cassette, or the above-mentioned recombinant vector. That is, the transformant of the present invention is a transformant obtained by transforming a host with an expression cassette containing the above-mentioned DNA or a recombinant vector containing the above-mentioned DNA.
- the host used for producing the transformant is not particularly limited as long as it is capable of expressing the characteristics of the gene containing the DNA of the present invention, but any host that can introduce the gene and stably carry out the expression cassette or recombinant vector may be used. It is preferable that it can be maintained and autonomously propagated.
- Preferred examples of the host include bacteria belonging to the genus Escherichia such as Escherichia coli, the genus Bacillus such as Bacillus subtilis, the genus Pseudomonas such as Pseudomonas putida; yeast, etc.
- other cells such as animal cells, insect cells, and plants may also be used.
- the transformant of the present invention can be obtained by introducing an expression cassette containing the above-mentioned DNA or a recombinant vector containing the above-mentioned DNA into a host.
- the location where the DNA is introduced is not particularly limited as long as the gene of interest is expressed, and may be on a plasmid or on the genome.
- Specific methods for introducing the expression cassette or recombinant vector into a host include, for example, the recombinant vector method and the genome editing method.
- Conditions for introducing the expression cassette or recombinant vector into the host may be appropriately set depending on the type of host and the like. If the host is a bacterium, examples include a method using competent cells treated with calcium ions, an electroporation method, and the like.
- examples include electroporation, spheroplast method, and lithium acetate method.
- examples include electroporation method, calcium phosphate method, and lipofection method.
- examples include the calcium phosphate method, lipofection method, electroporation method, and the like.
- examples include electroporation method, Agrobacterium method, particle gun method, and PEG method.
- Confirmation of whether the expression cassette or recombinant vector has been integrated into the host can be performed by PCR method, Southern hybridization method, Northern hybridization method, etc.
- the genomic DNA, expression cassette or recombinant vector may be isolated and purified from the transformant. Isolation and purification of genomic DNA, expression cassettes, or recombinant vectors is performed, for example, when the host is a bacterium, by lysing the bacterium to obtain a lysate.
- a lytic enzyme such as lysozyme
- protease, other enzymes, and a surfactant such as sodium lauryl sulfate (SDS) are used in combination.
- SDS sodium lauryl sulfate
- Isolation and purification of DNA from a lysate can be carried out, for example, by appropriately combining protein removal treatment using phenol treatment or protease treatment, ribonuclease treatment, alcohol precipitation treatment, and commercially available kits.
- DNA cleavage can be performed according to a conventional method, for example, using restriction enzyme treatment.
- restriction enzyme for example, a type II restriction enzyme that acts on a specific nucleotide sequence is used.
- PCR when confirming whether the expression cassette or recombinant vector has been integrated into the host by PCR method, it is also possible to perform PCR using primers labeled with a fluorescent dye or the like in advance and detect the amplification product. Furthermore, a method of binding the amplification product to a solid phase such as a microplate and confirming the amplification product by fluorescence, enzyme reaction, etc. may also be adopted.
- the present invention relates to a method for producing oxygenase, which includes the step of culturing the transformant described above.
- the culture conditions for the step of culturing the transformant may be appropriately set in consideration of the nutritional and physiological properties of the transformant, but preferably liquid culture is used. Furthermore, in the case of industrial production, it is preferable to employ aerated agitation culture.
- the nutrient source of the medium those necessary for the growth of the transformant are used.
- the carbon source may be any carbon compound that can be assimilated, and examples thereof include glucose, sucrose, lactose, maltose, molasses, and pyruvic acid.
- the nitrogen source may be any nitrogen compound that can be assimilated, such as peptone, meat extract, yeast extract, casein hydrolyzate, and soybean meal alkaline extract.
- salts such as phosphates, carbonates, sulfates, magnesium, calcium, potassium, iron, manganese, and zinc, specific amino acids, and specific vitamins may be used as necessary. You may.
- the culture temperature is appropriately set within a range where the transformant can grow and the transformant produces the desired oxygenase.
- the culture temperature is preferably about 15 to 37°C. Cultivation can be completed at an appropriate time, taking into account the time when the maximum yield of oxygenase is reached, and the cultivation time is usually about 12 to 48 hours.
- the culture solution is subjected to a method such as centrifugation, and the culture supernatant and/or bacterial cells are collected.
- the bacterial cells are treated with mechanical methods such as ultrasound or French press, or with lytic enzymes such as lysozyme, and if necessary, by using enzymes such as protease or surfactants such as sodium lauryl sulfate (SDS). Solubilize to obtain a water-soluble fraction containing the desired oxygenase.
- SDS sodium lauryl sulfate
- the water-soluble fraction containing oxygenase obtained as described above may be subjected to purification treatment as it is, or may be subjected to purification treatment after concentrating the oxygenase in the water-soluble fraction.
- Concentration can be performed, for example, by vacuum concentration, membrane concentration, salting out treatment, fractional precipitation using a hydrophilic organic solvent (eg, methanol, ethanol, and acetone), and the like.
- the purification treatment can be performed by appropriately combining methods such as gel filtration, adsorption chromatography, ion exchange chromatography, and affinity chromatography.
- the purified predetermined oxygenase may be powdered by freeze-drying, vacuum drying, spray drying, or the like, if necessary.
- the present invention relates to a method for producing an ⁇ 3-position epoxidized fatty acid, which includes a step of allowing the above-mentioned polypeptide or the above-mentioned ⁇ 3-position epoxidation enzyme agent to act on a highly unsaturated fatty acid ( ⁇ 3-based fatty acid).
- a fatty acid in which only the ⁇ 3 position of the ⁇ 3 fatty acid is epoxidized ⁇ 3 position epoxidized fatty acid
- ⁇ 3 fatty acid by treating the ⁇ 3 fatty acid with the above-mentioned polypeptide (oxygenase), a fatty acid in which only the ⁇ 3 position of the ⁇ 3 fatty acid is epoxidized ( ⁇ 3-epoxidized fatty acid) can be obtained.
- the ⁇ 3-position epoxidized fatty acid obtained by the production method of the present invention is not particularly limited, but examples of the ⁇ 3-position epoxidized fatty acid include 19,20-epoxydocosapentaenoic acid (19,20-EpDPE), Examples include 19,20-epoxy docosatetraenoic acid (19,20-EpDTE) and 17,18-epoxyeicosatetraenoic acid (17,18-EpETE). Among these, the ⁇ 3-position epoxidized fatty acid is preferably 17,18-epoxyeicosatetraenoic acid (17,18-EpETE).
- omega-3 fatty acids include fish oil, flaxseed oil, perilla oil, fish oil and algae, docosahexaenoic acid extracted from plants, docosapentaenoic acid, eicosapentaenoic acid, arachidonic acid, alpha-linolenic acid, stearidonic acid, 11, Examples include 14,17-eicosatrienoic acid, tetracosapentaenoic acid, and tetracosahexaenoic acid.
- the omega-3 fatty acid is preferably at least one selected from the group consisting of docosahexaenoic acid, docosapentaenoic acid, eicosapentaenoic acid, arachidonic acid, and alpha-linolenic acid; It is more preferably at least one selected from the group consisting of pentaenoic acid and eicosapentaenoic acid, and even more preferably eicosapentaenoic acid.
- the ⁇ 3 fatty acids may be in the form of fats and oils containing these fatty acids.
- the properties of the omega-3 fatty acids are not particularly limited either, but liquid, slurry, and paste are preferred.
- the amount of polypeptide (oxygenase) used in the method for producing ⁇ 3-position epoxidized fatty acids of the present invention is not particularly limited, it is preferable that the oxygenase enzyme activity per mL of reaction solution is 0.0001 U or more. . From the viewpoint of further enhancing the effect, the oxygenase enzyme activity per mL of reaction solution is more preferably 0.0005 U or more, even more preferably 0.001 U or more, even more preferably 0.005 U or more. , 0.01U or more is particularly preferable.
- the upper limit of the oxygenase enzyme activity per mL of reaction solution is not particularly limited, but for example, it is preferably 10 U or less, more preferably 5 U or less, even more preferably 1 U or less, and even more preferably 0.5 U or less. It is even more preferable that the amount is 0.1 U or less.
- the enzymatic activity of oxygenase can be evaluated using the amount of decrease in the coenzyme NADPH as an index.
- the amount of NADPH can be quantified by measuring the absorbance of the reaction solution at 360 nm using an absorption photometer.
- EPA eicosapentaenoic acid
- HPLC high performance liquid chromatography
- enzymes other than the above-mentioned oxygenase may be used in combination.
- the enzyme used in combination is preferably one or more enzymes selected from the group consisting of NADPH regenerating enzymes, enzymes that remove active oxygen, and enzymes that remove hydrogen peroxide.
- the reaction rate can be increased by using the above-mentioned combined enzyme in addition to the above-mentioned oxygenase. can increase the production rate.
- the order in which each enzyme is acted upon that is, the order in which oxygenase and the combined enzyme are treated
- is simultaneous treatment is simultaneous treatment.
- NADPH regenerating enzyme Preferred embodiments of the NADPH regenerating enzyme, the enzyme that removes active oxygen, and the enzyme that removes hydrogen peroxide are as described in the item (enzyme agent) above.
- the amount of NADPH regenerating enzyme used in the method for producing ⁇ 3-position epoxidized fatty acids of the present invention is not particularly limited, but the enzyme activity per 1 mL of reaction solution is preferably 0.1 U or more. From the viewpoint of further enhancing the effect, the NADPH regenerating enzyme activity per mL of reaction solution is more preferably 0.5 U or more, even more preferably 1.0 U or more, and even more preferably 3.5 U or more. Preferably, it is particularly preferably 7.0U or more.
- the upper limit of NADPH regenerating enzyme activity per mL of reaction solution is not particularly limited, but for example, it is preferably 500 U or less, more preferably 100 U or less, even more preferably 50 U or less, and 20 U or less. is more preferable, it is even more preferable that it is 15 U or less, and it is particularly preferable that it is 10 U or less.
- the activity of NADPH regenerating enzyme is calculated by quantifying the amount of NADPH produced from the substrate NADP.
- NADPH can be quantified by measuring the absorbance of the reaction solution at 360 nm with an absorption photometer. 1 unit is defined as the generation of 1 ⁇ M NADPH (mM) per minute.
- the amount of enzyme for removing active oxygen and hydrogen peroxide used in the method for producing ⁇ 3-position epoxidized fatty acids of the present invention is not particularly limited, but the enzyme activity per 1 mL of reaction solution is preferably 0.1 U or more. . From the viewpoint of further increasing the effect, the activity of the enzyme that removes active oxygen and hydrogen peroxide per 1 mL of the reaction solution is more preferably 1 U or more, even more preferably 10 U or more, and even more preferably 25 U or more. is more preferable, and particularly preferably 45U or more.
- the upper limit of the activity of the enzyme that removes active oxygen and hydrogen peroxide per mL of reaction solution is not particularly limited, but for example, it is preferably 5000 U or less, more preferably 3000 U or less, and preferably 1000 U or less. More preferably, it is 100 U or less, even more preferably 75 U or less, and particularly preferably 60 U or less.
- the activity of catalase decomposes 1.0 ⁇ mol of H 2 O 2 , and during this time, the rate of decrease in the H 2 O 2 concentration is measured using absorbance at 240 nm. It is calculated as follows. 1 unit is defined as 1 ⁇ M H 2 O 2 (mM) consumed per minute. Furthermore, among enzyme activities that remove active oxygen and hydrogen peroxide, peroxidase activity is calculated by generating purpurogallin from pyrogallol and hydrogen peroxide and measuring the amount of color developed. 1 unit is the amount of enzyme that produces 1 mg of purpurogallin in 20 seconds using pyrogallol and hydrogen peroxide as main substrates.
- the activity of superoxide dismutase is calculated by measuring the amount of superoxide radicals produced by xanthine oxidase using a tetrazolium salt. 1 unit is defined as the amount of enzyme required to disproportion 50% of superoxide radicals.
- Reaction time for causing oxygenase (if necessary, one or more enzymes selected from the group consisting of NADPH regenerating enzymes, enzymes that remove active oxygen, and enzymes that remove hydrogen peroxide) to act on ⁇ 3 fatty acids.
- temperature, pH of the reaction solution, etc. are not particularly limited.
- the reaction temperature is, for example, preferably 20 to 50°C, more preferably 25 to 45°C, even more preferably 30 to 40°C.
- the pH of the reaction solution is, for example, preferably 5 to 10, more preferably 6 to 9.5, and even more preferably 7 to 9.
- the reaction time is, for example, preferably 10 minutes to 48 hours, more preferably 1 to 24 hours, even more preferably 2 to 12 hours, and particularly preferably 2 to 6 hours. Under the above reaction conditions, epoxidation of the ⁇ 3 position of the ⁇ 3 fatty acid can proceed more efficiently.
- These reaction conditions are appropriately selected depending on the target epoxidized fatty acid. Note that the optimal reaction conditions may be determined
- oxygenase (if necessary, one or more enzymes selected from the group consisting of an NADPH regenerating enzyme, an enzyme that removes active oxygen, and an enzyme that removes hydrogen peroxide) A step of deactivating the enzyme) may be provided.
- an ⁇ 3-position epoxidized fatty acid or a composition containing an ⁇ 3-position epoxidized fatty acid can be produced. That is, the present invention may relate to an ⁇ 3-epoxidized fatty acid or a composition containing an ⁇ 3-epoxidized fatty acid obtained by the above method for producing an ⁇ 3-epoxidized fatty acid.
- composition containing an ⁇ 3-position epoxidized fatty acid 80% by mass or more of the fatty acids contained is preferably an ⁇ 3-epoxidized fatty acid, more preferably 90% or more by mass is an ⁇ 3-epoxidized fatty acid, and 95% by mass It is more preferable that % or more of the fatty acids are epoxidized fatty acids at the ⁇ 3 position.
- the present invention relates to a method for epoxidizing ⁇ 3 fatty acids, which includes a step of allowing the above-mentioned polypeptide or the above-mentioned enzyme agent for epoxidizing the ⁇ 3 position to act on the ⁇ 3 fatty acids.
- the above-mentioned combined enzymes can also be used in the method for epoxidizing omega-3 fatty acids of the present invention.
- the reaction conditions in the method for epoxidizing ⁇ 3 fatty acids are the same as those in the step of applying polypeptide (oxygenase) to ⁇ 3 fatty acids in the method for producing epoxidized fatty acids at the ⁇ 3 position described above.
- Site-directed mutagenesis was performed on each of the seven amino acid residues that received a high score from the above MD calculation. Site-directed mutagenesis refers to the production of mutant enzymes in which selected amino acid residues are replaced with original amino acid residues (wild type) by other amino acid residues (19 types). According to the method described below, 133 types of mutant enzymes with amino acid substitutions were prepared, and a reaction between the enzyme and the substrate (EPA) was performed. The product after the reaction was analyzed by HPLC.
- mutant enzyme In addition to F87K, a double mutant enzyme was created in which an additional amino acid was substituted. Regarding the amino acid residues selected by the above MD calculation, 57 types of double mutant enzymes were produced. The reaction was carried out under the conditions of ⁇ Condition 1> in ⁇ EPA epoxidation reaction and high performance liquid chromatography (HPLC) analysis> described below, and the product was analyzed by HPLC. As a result, the amount of 17,18-EpETE produced was improved with multiple double mutant enzymes compared to the mutant enzyme (F87K). In particular, the mutant enzyme (F87K-A330V) produced the highest amount of 17,18-EpETE.
- mutant enzymes (3) As before, site-directed mutagenesis was performed on selected amino acid residues. Fifty-seven types of triple mutant enzymes were produced by PCR, with amino acid substitutions in addition to F87K and A330V. These triple mutant enzymes were reacted with the substrate EPA, and the products were analyzed by HPLC. As a result, the mutant enzyme (P25L-F87K-A330V) and mutant enzyme (F87K-A330V-T438M) had improved epoxidation position specificity and a high production rate of 17,18-EpETE from EPA ( Figure 4 ).
- mutant enzyme F87K-A330V-T438M
- mutant enzyme had improved epoxidation position specificity, and the production rate of 17,18-EpETE from EPA was 96% (lower panel of FIG. 4).
- mutant enzyme P25L-F87K-A330V
- EPA consumption was improved and product inhibition was resolved.
- the production rate of 17,18-EpETE was slightly lowered to 85% compared to the case without the combination (upper panel of FIG. 5).
- the peak production amount was reached at 3 hours, indicating that the enzyme reaction proceeded in a shorter time than when no combination was used (upper panel of FIG. 5).
- the production rate of 17,18-EpETE was 97%, which was further improved compared to the case without the combination (lower panel of FIG. 5).
- the peak production amount was at 5 hours, indicating that the enzyme reaction proceeded in a shorter time compared to when no combination was used (lower panel of Figure 5). From the above results, it was found that the combined effect improved the conversion rate of the substrate when compared at the same reaction time.
- the cells were cultured at 37°C for 18 hours to obtain a preculture solution.
- the preculture solution was inoculated into 1 mL of TB medium (manufactured by Invitrogen Co., Ltd.) supplemented with kanamycin (final concentration 50 ⁇ g/mL).
- the cells were cultured at 37° C. for 48 hours (IPTG (final concentration 0.1 mM) was added 24 hours after the start of culture), and the bacterial cells were collected by centrifugation.
- ⁇ Template plasmid> P450-BM3 expression vector: approximately 50 ng/ ⁇ L ⁇ PCR reaction composition> (total volume 25 ⁇ L) Prime STAR MAX DNA Polymerase (Takara Bio Inc.): 12.5 ⁇ L Template plasmid: 0.5 ⁇ L Each primer (primer combinations were as shown in Table 4): 0.5 ⁇ L x 2 Ultrapure water (Milli Q water): 11 ⁇ L
- ⁇ PCR conditions> Repeat 10 cycles of reaction at 98°C for 30 seconds, 98°C for 10 seconds, and 68°C for 8 minutes. Leave at 4°C for 10 minutes at 72°C.
- ⁇ In-fusion reaction> A reaction solution having the following composition was prepared (total volume: 10 ⁇ L) and heated at 50° C. for 15 minutes in a thermal cycler.
- E. coli BL21 DE3 (Nippon Gene)
- ⁇ Plasmid purification/sequence confirmation> The grown colonies were inoculated into LB medium (manufactured by Invitrogen Co., Ltd.) supplemented with kanamycin (final concentration 50 ⁇ g/mL). The cells were cultured at 37°C for 18 hours to obtain a bacterial cell culture solution. Bacterial cells were collected by centrifugation, and each plasmid was purified using a plasmid miniprep kit, NucleoSpin Plasmid EasyPure (Takara Bio Inc.). The DNA sequence of the gene was confirmed by Sanger sequencing. The DNA sequences of the mutant enzyme (F87K, F87I, F87H) genes were as shown in SEQ ID NOs: 3, 5, and 7, respectively.
- mutant recombinant P450-BM3 (F87K, F87H, F87I) 1 ⁇ L of the constructed mutant P450-BM3 expression vector was injected into E. coli. The mixture was mixed with 2 ⁇ L of E. coli BL21 (DE3) (Nippon Gene) and transformed. After recovery culture at 37°C for 20 minutes, the transformation solution was applied to an LB agar medium plate (manufactured by Invitrogen Co., Ltd.) supplemented with kanamycin (final concentration 50 ⁇ g/mL), and cultured at 37°C for 18 hours.
- E. coli BL21 DE3 (Nippon Gene)
- the grown colonies were inoculated into LB medium (manufactured by Invitrogen Co., Ltd.) supplemented with kanamycin (final concentration 50 ⁇ g/mL).
- the cells were cultured at 37°C for 18 hours to obtain a preculture solution.
- the preculture solution was inoculated into 1 mL of TB medium (manufactured by Invitrogen Co., Ltd.) supplemented with kanamycin (final concentration 50 ⁇ g/mL).
- the cells were cultured at 37° C. for 48 hours (IPTG (final concentration 0.1 mM) was added 24 hours after the start of culture), and the bacterial cells were collected by centrifugation.
- IPTG final concentration 0.1 mM
- Mutant recombinant P450-BM3 (F87K-A330V) was produced in the same manner as described above, except that the mutant P450-BM3 (F87K-A330V) expression vector was used instead of the mutant P450-BM3 expression vector. Bacterial cells expressing this were collected. The amino acid sequence of the obtained mutant recombinant P450-BM3 was as shown in SEQ ID NO: 10.
- mutant recombinant P450-BM3 (F87K-A330V-T438M) PCR primers for mutation introduction (Table 7) were designed, and the mutant P450-BM3 (F87K-A330V) expression vector was used as a template plasmid. Other than that, PCR etc. were performed under the same conditions as the above-mentioned method to obtain a mutant P450-BM3 (F87K-A330V-T438M) expression vector in which a triple mutation was introduced into the CYP102A1 gene.
- the DNA sequence of the gene was as shown in SEQ ID NO: 13.
- Mutant recombinant P450-BM3 (F87K- A330V-T438M) was collected.
- the amino acid sequence of the obtained mutant recombinant P450-BM3 was as shown in SEQ ID NO: 14.
- ⁇ Condition 2 Combined use with NADPH regenerating enzyme/hydrogen peroxide removing enzyme> 3.3mM substrate EPA (manufactured by Combi-Blocks, dissolved in ethanol), 0.16M glucose (manufactured by Fujifilm Wako Pure Chemical Industries, Ltd.), 7.7U/mL glucose dehydrogenase (NAD type) (Amano Enzyme Co., Ltd.) ), 0.1M Tris-hydrochloric acid buffer (pH 8.0) containing 50U/ml catalase (manufactured by Fujifilm Wako Pure Chemical Industries, Ltd.) and 3mM NADPH (manufactured by Oriental Yeast Co., Ltd.) was added to P450.
- EPA manufactured by Combi-Blocks, dissolved in ethanol
- 0.16M glucose manufactured by Fujifilm Wako Pure Chemical Industries, Ltd.
- 7.7U/mL glucose dehydrogenase NAD type
- HPLC analysis was performed on epoxidized metabolites generated from the substrate EPA.
- the conditions for HPLC analysis were as follows.
- the buffer and program were as follows.
- the peak areas were quantified and EPA, 17,18-EpETE (17,18-epoxyeicosatetraenoic acid), 14,15-EpETE (14,15-epoxyeicosatetraenoic acid), 19-HEPE (19 -hydroxyleicosapentaenoic acid).
- the peak positions of 17,18-EpETE and 14,15-EpETE were determined using respective standards (manufactured by Cayman Chemical).
- the peak of 19-HEPE was collected after HPLC analysis, and the structure was identified by NMR.
- the peak areas were quantified to determine the amounts of DHA, 19,20-EpDPE (19,20-epoxide cosapentaenoic acid), and 16,17-EpDPE (16,17-epoxide cosapentaenoic acid).
- 19,20-EpDPE (19,20-epoxide docosapentaenoic acid), 16,17-EpDPE (16,17-epoxide docosapentaenoic acid), acid) was produced.
- 19,20-EpDPE (19,20-epoxide docosapentaenoic acid) produced by epoxidizing the ⁇ 3 position of DHA has physiological effects similar to 17,18-EpETE and is a useful product.
- the mutant enzyme (F87K-A330V) was 90%
- the mutant enzyme (P25L-F87K-A330V) was 88%
- the mutant enzyme (F87K-A330V-T438M) was 96% ( Figures 6 to 8). This result suggested the possibility of industrial production of 19,20-EpDPE, which was impossible with the wild type.
- the peak area was quantified and used as the amount of DPA and 19,20-EpDTE (19,20-epoxide docosatetraenoic acid).
- 19,20-EpDTE (19,20-epoxy docosatetraenoic acid) was produced.
- 19,20-EpDTE (19,20-epoxide docosatetraenoic acid) produced by epoxidizing the ⁇ 3 position of DPA has physiological effects similar to 17,18-EpETE and is a useful product.
- the present invention it is possible to obtain a modified oxygenase that specifically epoxidizes only the ⁇ 3 position of highly unsaturated fatty acids. Furthermore, according to the present invention, it is possible to industrially produce ⁇ 3-epoxidized fatty acids in which only the ⁇ 3 position is specifically epoxidized.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Mycology (AREA)
- Medicinal Chemistry (AREA)
- Oil, Petroleum & Natural Gas (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Cell Biology (AREA)
- Enzymes And Modification Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
本発明は、高度不飽和脂肪酸のω3位のみを特異的にエポキシ化する改変型オキシゲナーゼを提供することを課題とする。本発明は、配列番号2に示すアミノ酸配列において、F87K/I/H、A330V、P25L及びT438Mよりなる群から選択される一又は二以上のアミノ酸置換を含むアミノ酸配列からなるポリペプチド、並びに、該ポリペプチドと実質的に同一のポリペプチドに関する。
Description
本発明は、ポリペプチド、オキシゲナーゼ及びこれらの応用に関する。具体的には、本発明は、シトクロムP450モノオキシゲナーゼを改変したオキシゲナーゼ及びその応用に関する。
シトクロムP450モノオキシゲナーゼ(以下、P450ともいう)は、細菌から植物や哺乳動物に至る広範囲の生物種が有する一酸素添加酵素である。細菌由来のP450は、真核生物由来のP450に比べて高い触媒活性を持つことが知られている。細菌由来のP450としては、緑膿菌(Pseudomonas aeruginosa)由来のP450camや、バチルス属細菌の一種であるBacillus megaterium由来のP450-BM3等が知られている。
細菌由来のP450は、脂肪酸等の基質をエポキシ化(水酸化)する高い触媒活性を備えている。また、水に対する親和性が比較的高いため、取得及び取扱いが容易である。こうした理由から、P450は、バイオ触媒として適している。しかし、野生型のP450は、工業的に有用な炭化水素系基質をエポキシ化するにあたっては、必ずしも触媒活性が高いわけでない。このため、種々の変異型P450が提案されている(例えば、特許文献1~3、非特許文献1~4)。
アラキドン酸、エイコサペンタエン酸、ドコサヘキサエン酸等の高度不飽和脂肪酸は、生体内で種々の化合物に代謝され、各組織で様々な生理作用を示す。例えば、アラキドン酸の代謝系としては、シクロオキシゲナーゼ系、リポキシゲナーゼ系、P450系が知られており、これらの系でアラキドン酸は数多くの化合物に代謝されている。これらの代謝系のうち、P450系では高度不飽和脂肪酸の二重結合のエポキシ化が行われている。特に、高度不飽和脂肪酸のω3位をエポキシ化することにより生成する高度不飽和脂肪酸代謝物は、生体内での抗炎症作用、抗アレルギー作用に重要な物質である(非特許文献5)。
細菌由来のP450は生体外においても、高度不飽和脂肪酸と反応することで、ω3位のエポキシ化を触媒する(非特許文献6)。一方、化学的な酸化反応を利用して、高度不飽和脂肪酸をエポキシ化する方法も知られている。この場合は、リパーゼの存在下、過酸化水素と反応させる手法が報告されている(特許文献4)。
Advanced Synthesis & Catalysis, 2006, 348.6: 763-772.
Drug metabolism and disposition, 2010, 38.5: 732-739.
Applied microbiology and biotechnology, 2011, 90.1: 147-157
International Journal of Molecular Sciences, 2021, 22.21: 11380
J Scientific reports, 2015, 5.1: 1-8.
Journal of Biological Chemistry, 1996, 271.37: 22663-22671.
上記の通り、化学的な酸化反応を利用して、高度不飽和脂肪酸をエポキシ化する方法が知られているが、この手法ではω6位、ω9位の二重結合を特異的にエポキシ化することはできるが、生理的に重要なω3位をエポキシ化した高度不飽和脂肪酸代謝物の製造することができない。
また、細菌由来のP450を用いて高度不飽和脂肪酸のω3位をエポキシ化する方法も知られている。しかしながら、本発明者らの検討おいて、野生型の細菌由来P450は反応位置特異性が低く、反応が進むに従ってω3位にとどまらず、それ以外の位置の二重結合のエポキシ化が進み、最終的には脂肪酸が分解されてしまうことが分かった。このように、細菌由来P450を用いた場合においても、生理的に重要なω3位のみを特異的にエポキシ化した高度不飽和脂肪酸代謝物を製造は困難であった。
そこで本発明は、高度不飽和脂肪酸のω3位のみを特異的にエポキシ化する改変型オキシゲナーゼを提供することを課題とする。
上記の課題を解決するために鋭意検討を行った結果、本発明者らは、オキシゲナーゼのアミノ酸配列において所定の位置に特定の変異を導入することで、高度不飽和脂肪酸のω3位のみがエポキシ化したω3位エポキシ化脂肪酸を生成する改変型オキシゲナーゼを得ることに成功した。
具体的に、本発明は、以下の構成を有する。
具体的に、本発明は、以下の構成を有する。
[1] 以下の(1)から(3)のいずれかに示すポリペプチド;
(1)配列番号2に示すアミノ酸配列において、F87K/I/H、A330V、P25L及びT438Mよりなる群から選択される一又は二以上のアミノ酸置換を含むアミノ酸配列からなるポリペプチド;
(2)上記(1)に記載のアミノ酸配列において、1個又は数個のアミノ酸残基が置換、付加、挿入又は欠失されてなり、ω3系脂肪酸のω3位をエポキシ化する触媒活性を有し、かつ、F87K/I/H、A330V、P25L及びT438Mよりなる群から選択される一又は二以上のアミノ酸置換は保持されているポリペプチド;
(3)上記(1)に記載のアミノ酸配列に対する配列同一性が90%以上であり、ω3系脂肪酸のω3位をエポキシ化する触媒活性を有し、かつ、F87K/I/H、A330V、P25L及びT438Mよりなる群から選択される一又は二以上のアミノ酸置換は保持されているポリペプチド。
(但し、F87K/I/Hとは、配列番号2における87位のアミノ酸残基であるFがK、IもしくはHに置換されていることを示し、A330Vとは、配列番号2における330位のアミノ酸残基であるAがVに置換されていることを示し、P25Lとは、配列番号2における25位のアミノ酸残基であるPがLに置換されていることを示し、T438Mとは、配列番号2における438位のアミノ酸残基であるTがMに置換されていることを示す。)
[2] (3)における配列同一性が95%以上である[1]に記載のポリペプチド。
[3] オキシゲナーゼである、[1]又は[2]に記載のポリペプチド。
[4] [1]~[3]のいずれかに記載のポリペプチドを含む、ω3系脂肪酸のω3位エポキシ化用酵素剤。
[5] NADPH再生酵素、活性酸素を除去する酵素及び過酸化水素を除去する酵素よりなる群から選択される一又は二以上の酵素をさらに含む、[4]に記載のω3位エポキシ化用酵素剤。
[6] 以下の(1)から(3)のいずれかに示すDNA;
(1)[1]に記載のポリペプチドをコードするDNA;
(2)配列番号3、5、7及び9のいずれかに示す塩基配列からなるDNA;
(3)配列番号3、5、7及び9のいずれかに示す塩基配列と等価の配列を含み、ω3系脂肪酸のω3位をエポキシ化する触媒活性を有するポリペプチドをコードするDNA。
[7] [6]に記載のDNAを含む発現カセット。
[8] [6]に記載のDNAを含む組換えベクター。
[9] [6]のDNA、[7]に記載の発現カセット又は[8]に記載の組換えベクターを有する形質転換体。
[10] [9]に記載の形質転換体を培養する工程を含む、オキシゲナーゼの製造方法。
[11] ω3系脂肪酸に、[1]~[3]のいずれかに記載のポリペプチドもしくは[4]又は[5]に記載のω3位エポキシ化用酵素剤を作用させる工程を含む、ω3位エポキシ化脂肪酸の製造方法。
[12] ω3系脂肪酸に、[1]~[3]のいずれかに記載のポリペプチドもしくは[4]又は[5]に記載のω3位エポキシ化用酵素剤を作用させる工程を含む、ω3系脂肪酸のエポキシ化方法。
[13] [11]の製造方法で得られる、ω3位エポキシ化脂肪酸を含む組成物。
(1)配列番号2に示すアミノ酸配列において、F87K/I/H、A330V、P25L及びT438Mよりなる群から選択される一又は二以上のアミノ酸置換を含むアミノ酸配列からなるポリペプチド;
(2)上記(1)に記載のアミノ酸配列において、1個又は数個のアミノ酸残基が置換、付加、挿入又は欠失されてなり、ω3系脂肪酸のω3位をエポキシ化する触媒活性を有し、かつ、F87K/I/H、A330V、P25L及びT438Mよりなる群から選択される一又は二以上のアミノ酸置換は保持されているポリペプチド;
(3)上記(1)に記載のアミノ酸配列に対する配列同一性が90%以上であり、ω3系脂肪酸のω3位をエポキシ化する触媒活性を有し、かつ、F87K/I/H、A330V、P25L及びT438Mよりなる群から選択される一又は二以上のアミノ酸置換は保持されているポリペプチド。
(但し、F87K/I/Hとは、配列番号2における87位のアミノ酸残基であるFがK、IもしくはHに置換されていることを示し、A330Vとは、配列番号2における330位のアミノ酸残基であるAがVに置換されていることを示し、P25Lとは、配列番号2における25位のアミノ酸残基であるPがLに置換されていることを示し、T438Mとは、配列番号2における438位のアミノ酸残基であるTがMに置換されていることを示す。)
[2] (3)における配列同一性が95%以上である[1]に記載のポリペプチド。
[3] オキシゲナーゼである、[1]又は[2]に記載のポリペプチド。
[4] [1]~[3]のいずれかに記載のポリペプチドを含む、ω3系脂肪酸のω3位エポキシ化用酵素剤。
[5] NADPH再生酵素、活性酸素を除去する酵素及び過酸化水素を除去する酵素よりなる群から選択される一又は二以上の酵素をさらに含む、[4]に記載のω3位エポキシ化用酵素剤。
[6] 以下の(1)から(3)のいずれかに示すDNA;
(1)[1]に記載のポリペプチドをコードするDNA;
(2)配列番号3、5、7及び9のいずれかに示す塩基配列からなるDNA;
(3)配列番号3、5、7及び9のいずれかに示す塩基配列と等価の配列を含み、ω3系脂肪酸のω3位をエポキシ化する触媒活性を有するポリペプチドをコードするDNA。
[7] [6]に記載のDNAを含む発現カセット。
[8] [6]に記載のDNAを含む組換えベクター。
[9] [6]のDNA、[7]に記載の発現カセット又は[8]に記載の組換えベクターを有する形質転換体。
[10] [9]に記載の形質転換体を培養する工程を含む、オキシゲナーゼの製造方法。
[11] ω3系脂肪酸に、[1]~[3]のいずれかに記載のポリペプチドもしくは[4]又は[5]に記載のω3位エポキシ化用酵素剤を作用させる工程を含む、ω3位エポキシ化脂肪酸の製造方法。
[12] ω3系脂肪酸に、[1]~[3]のいずれかに記載のポリペプチドもしくは[4]又は[5]に記載のω3位エポキシ化用酵素剤を作用させる工程を含む、ω3系脂肪酸のエポキシ化方法。
[13] [11]の製造方法で得られる、ω3位エポキシ化脂肪酸を含む組成物。
本発明によれば、高度不飽和脂肪酸のω3位のみを特異的にエポキシ化する改変型オキシゲナーゼを得ることができる。また、本発明によれば、ω3位のみを特異的にエポキシ化したω3位エポキシ化脂肪酸の製造方法や該製造方法から製造されるω3位エポキシ化脂肪酸が提供される。
以下において、本発明について詳細に説明する。以下に記載する構成要件の説明は、代表的な実施形態や具体例に基づいてなされることがあるが、本発明はそのような実施形態に限定されるものではない。
本明細書において、アミノ酸配列における20種類のアミノ酸残基は、一文字略記で表現することがある。その場合、グリシン(Gly)はG、アラニン(Ala)はA、バリン(Val)はV、ロイシン(Leu)はL、イソロイシン(Ile)はI、フェニルアラニン(Phe)はF、チロシン(Tyr)はY、トリプトファン(Trp)はW、セリン(Ser)はS、スレオニン(Thr)はT、システイン(Cys)はC、メチオニン(Met)はM、アスパラギン酸(Asp)はD、グルタミン酸(Glu)はE、アスパラギン(Asn)はN、グルタミン(Gln)はQ、リジン(Lys)はK、アルギニン(Arg)はR、ヒスチジン(His)はH、プロリン(Pro)はPである。また、本明細書において、表示するアミノ酸配列は、左端がN末端、右端がC末端である。
本明細書において、「非極性アミノ酸」には、アラニン、バリン、ロイシン、イソロイシン、プロリン、メチオニン、フェニルアラニン及びトリプトファンが含まれる。「非電荷アミノ酸」には、グリシン、セリン、スレオニン、システイン、チロシン、アスパラギン及びグルタミンが含まれる。「酸性アミノ酸」には、アスパラギン酸及びグルタミン酸が含まれる。「塩基性アミノ酸」には、リジン、アルギニン及びヒスチジンが含まれる。
本明細書において、「置換」とは、人為的にアミノ酸残基の置換を導入した場合のみならず、自然にアミノ酸残基の置換が導入された場合、すなわち、もともとアミノ酸残基が相違していた場合も含まれる。本明細書においては、アミノ酸残基の置換は、人為的な置換であってもよく、自然な置換であってもよいが、人為的な置換が好ましい。
(ポリペプチド(オキシゲナーゼ))
本発明は、以下の(1)から(3)のいずれかに示すポリペプチドに関する。
(1)配列番号2に示すアミノ酸配列において、F87K/I/H、A330V、P25L及びT438Mよりなる群から選択される一又は二以上のアミノ酸置換を含むアミノ酸配列からなるポリペプチド。
(2)上記(1)に記載のアミノ酸配列において、1個又は数個のアミノ酸残基が置換、付加、挿入又は欠失されてなり、ω3系脂肪酸のω3位をエポキシ化する触媒活性を有し、かつ、F87K/I/H、A330V、P25L及びT438Mよりなる群から選択される一又は二以上のアミノ酸置換は保持されているポリペプチド。
(3)上記(1)に記載のアミノ酸配列に対する配列同一性が90%以上であり、ω3系脂肪酸のω3位をエポキシ化する触媒活性を有し、かつ、F87K/I/H、A330V、P25L及びT438Mよりなる群から選択される一又は二以上のアミノ酸置換は保持されているポリペプチド。
但し、F87K/I/Hとは、配列番号2における87位のアミノ酸残基であるFがK、IもしくはHに置換されていることを示し、A330Vとは、配列番号2における330位のアミノ酸残基であるAがVに置換されていることを示し、P25Lとは、配列番号2における25位のアミノ酸残基であるPがLに置換されていることを示し、T438Mとは、配列番号2における438位のアミノ酸残基であるTがMに置換されていることを示す。
本発明は、以下の(1)から(3)のいずれかに示すポリペプチドに関する。
(1)配列番号2に示すアミノ酸配列において、F87K/I/H、A330V、P25L及びT438Mよりなる群から選択される一又は二以上のアミノ酸置換を含むアミノ酸配列からなるポリペプチド。
(2)上記(1)に記載のアミノ酸配列において、1個又は数個のアミノ酸残基が置換、付加、挿入又は欠失されてなり、ω3系脂肪酸のω3位をエポキシ化する触媒活性を有し、かつ、F87K/I/H、A330V、P25L及びT438Mよりなる群から選択される一又は二以上のアミノ酸置換は保持されているポリペプチド。
(3)上記(1)に記載のアミノ酸配列に対する配列同一性が90%以上であり、ω3系脂肪酸のω3位をエポキシ化する触媒活性を有し、かつ、F87K/I/H、A330V、P25L及びT438Mよりなる群から選択される一又は二以上のアミノ酸置換は保持されているポリペプチド。
但し、F87K/I/Hとは、配列番号2における87位のアミノ酸残基であるFがK、IもしくはHに置換されていることを示し、A330Vとは、配列番号2における330位のアミノ酸残基であるAがVに置換されていることを示し、P25Lとは、配列番号2における25位のアミノ酸残基であるPがLに置換されていることを示し、T438Mとは、配列番号2における438位のアミノ酸残基であるTがMに置換されていることを示す。
なお、上記(1)~(3)のポリペプチドには、人為的に置換して得られるポリペプチドのみならず、そのようなアミノ酸配列を有するように自然置換されたポリペプチドも含まれる。
本明細書においては、アミノ酸置換を、置換前のアミノ酸残基を表す一文字略記、アミノ酸置換が生じるアミノ酸残基の位置(特定のアミノ酸配列におけるN末端側からの位置)を表す数字、及び置換後のアミノ酸残基を表す一文字略記の組合せで表現する。例えば、87位のフェニルアラニンがリジンに置換されるのであれば「F87K」と表現される。また、置換後のアミノ酸残基が複数種のうちいずれかであってもよい場合には、置換後のアミノ酸残基を表す一文字略記を記号「/」を用いて併記する。例えば、F87K/I/Hとは87位のフェニルアラニンがリジン、イソロイシン又はヒスチジンに置換されていることを表す。
また、二つ以上の置換の組合せ(併用)を表す場合、記号「-」を用いる。例えば、87位のフェニルアラニンのリジンへの置換と、330位のアラニンのバリンへの置換の組合せであれば、「F87K-A330V」と表現される。
また、二つ以上の置換の組合せ(併用)を表す場合、記号「-」を用いる。例えば、87位のフェニルアラニンのリジンへの置換と、330位のアラニンのバリンへの置換の組合せであれば、「F87K-A330V」と表現される。
上記(1)から(3)のいずれかに示すポリペプチドは、配列番号2のポリペプチドよりも、高度不飽和脂肪酸(ω3系脂肪酸)のω3位を特異的にエポキシ化する触媒活性を有する。本発明者らは、P450の立体構造情報から分子動力学(MD)計算に基づく酵素-基質反応の分子反応シミュレーションによる酵素の設計、ならびに設計に基づく変異型組み換えP450の作製、培養、評価を行った結果、ω3位のエポキシ化の特異性を向上させた改変型オキシゲナーゼ(P450)を得ることに成功した。
ここで、配列番号2のポリペプチドは、Bacillus megaterium由来のシトクロムP450モノオキシゲナーゼである。本発明のポリペプチドは改変型オキシゲナーゼであることが好ましく、改変型モノオキシゲナーゼであることがより好ましく、Bacillus megaterium由来のシトクロムP450モノオキシゲナーゼに由来する改変型モノオキシゲナーゼであることがさらに好ましい。ここで、モノオキシゲナーゼは、酸素分子を基質として酸素1原子を化合物に導入する触媒作用を有し、かつ、ヘムや非ヘム鉄,もしくは金属イオンをコファクターとする酵素である。なお、モノオキシゲナーゼは、上記作用が主活性である限り、他の作用をさらに有していても良い。
本発明のポリペプチドはオキシゲナーゼの一部であってもよく、さらにポリペプチドもしくはオキシゲナーゼは、より大きいタンパク質(例えば融合タンパク質)の一部であってもよい。融合タンパク質において付加される配列としては、例えば、多重ヒスチジン残基のような精製に役立つ配列、組み換え生産の際の安定性を確保する付加配列等が挙げられる。
本発明のポリペプチドは、配列番号2に示すアミノ酸配列において、少なくとも、F87K/I/H、A330V、P25L及びT438Mよりなる群から選択される一又は二以上のアミノ酸置換を含むアミノ酸配列からなるものである。本発明のポリペプチドは、少なくとも、F87K、F87I又はF87Hのアミノ酸置換を含むことが好ましく、F87K又はF87Iのアミノ酸置換を含むことがより好ましく、F87Kのアミノ酸置換を含むことが特に好ましい。
本発明のポリペプチドは、配列番号2に示すアミノ酸配列において、F87K/I/Hのアミノ酸置換を含み、A330V、P25L及びT438Mよりなる群から選択される一又は二以上のアミノ酸置換を含むアミノ酸配列からなるものであることが好ましい。より好ましい態様は、配列番号2に示すアミノ酸配列において、F87Kのアミノ酸置換を含み、A330V、P25L及びT438Mよりなる群から選択される一又は二以上のアミノ酸置換を含むアミノ酸配列からなるポリペプチドである。具体的態様としては、配列番号2に示すアミノ酸配列において、F87K-A330Vのアミノ酸置換を含むアミノ酸配列からなるポリペプチド、F87K-P25Lのアミノ酸置換を含むアミノ酸配列からなるポリペプチド、F87K-T438Mのアミノ酸置換を含むアミノ酸配列からなるポリペプチド、F87K-A330V-P25Lのアミノ酸置換を含むアミノ酸配列からなるポリペプチド、F87K-A330V-T438Mのアミノ酸置換を含むアミノ酸配列からなるポリペプチド、F87K-P25L-T438Mのアミノ酸置換を含むアミノ酸配列からなるポリペプチド、F87K-A330V-P25L-T438Mのアミノ酸置換を含むアミノ酸配列からなるポリペプチドが挙げられる。中でも、配列番号2に示すアミノ酸配列において、少なくともF87KとA330Vに二重のアミノ酸置換を含むアミノ酸配列からなるポリペプチドが好ましい。さらに、F87KとA330Vの二重変異に加えて、さらに、P25LもしくはT438Mにも変異を有する三重のアミノ酸置換を含むアミノ酸配列からなるポリペプチドも好ましく、特にF87K-A330V-T438Mのアミノ酸置換を含むアミノ酸配列からなるポリペプチドはω3位を特異的にエポキシ化する触媒活性が発揮されやすいため好ましい。
本明細書においては、上記(2)及び(3)のポリペプチドにおいて、配列番号2の第87位、第330位、第25位及び第438位のアミノ酸残基以外を「任意相違部位」と表記することもある。本明細書において、「任意相違部位」とは、ポリペプチドの特性に大きく影響しない限り、相違が許容される部位である。そして、任意相違部位においてアミノ酸配列に相違がみられるものの、配列番号2に示すアミノ酸配列からなるポリペプチドに比してω3系脂肪酸のω3位を特異的にエポキシ化する触媒活性が向上しているポリペプチドを、上記(1)のポリペプチドの相違体という。すなわち、上記(2)及び(3)のポリペプチドは、上記(1)のポリペプチドの相違体である。また、ポリペプチドの相違体は、上記(1)のポリペプチドと比較して、任意相違部位においてアミノ酸配列に相違がみられるものの、当該ポリペプチドの特性が実質的に同一であることが好ましい。なお、ポリペプチドの特性が実質的に同一とは、ω3系脂肪酸のω3位を特異的にエポキシ化する触媒活性が同等であることをいう。
上記(2)のポリペプチドにおいて、任意相違部位に導入されるアミノ酸の改変は、置換、付加、挿入及び欠失から選択される1種類の改変(例えば置換のみ)であってもよく、2種以上の改変(例えば置換と挿入)であってもよい。上記(2)のポリペプチドにおいて、任意相違部位におけるアミノ酸の相違の数は、1個又は数個であればよい。相違の数としては、例えば1~80個、1~70個、1~60個、1~50個、1~40個、1~30個、1~20個、1~10個、1~8個、1~7個、1~6個、1~5個、1~4個、1~3個、もしくは、1~2個である。
上記(3)のポリペプチドにおいて、上記(1)に記載のアミノ酸配列に対する配列同一性が60%以上であることが好ましく、70%以上であることがより好ましく、80%以上であることがさらに好ましく、90%以上であることが一層好ましく、95%以上であることがより一層好ましく、97%以上であることが特に好ましく、99%以上であることが最も好ましい。ここで、上記(3)のポリペプチドにおいて、上記(1)に記載のアミノ酸配列に対する配列同一性とは、上記(1)に記載のアミノ酸配列と比較して算出される配列同一性である。本明細書における「配列同一性」は、BLASTPACKAGE[sgi32 bit edition,Version 2.0.12;available from National Center for Biotechnology Information(NCBI)]のbl2seq program(Tatiana A.Tatsusova,Thomas L.Madden,FEMS Microbiol.Lett.,Vol.174,p247-250,1999)により得られるアミノ酸配列の同一性の値である。パラメーターは、Gap insertion Cost value:11、Gap extension Cost value:1に設定して算定される。
上記(2)及び(3)のポリペプチドの任意相違部位において、アミノ酸置換が導入されている場合、導入されるアミノ酸置換の好適な一態様として保存的置換が挙げられる。上記(2)及び(3)のポリペプチドにおけるアミノ酸置換としては、例えば、置換前のアミノ酸が非極性アミノ酸であれば他の非極性アミノ酸への置換、置換前のアミノ酸が非荷電性アミノ酸であれば他の非荷電性アミノ酸への置換、置換前のアミノ酸が酸性アミノ酸であれば他の酸性アミノ酸への置換、及び置換前のアミノ酸が塩基性アミノ酸であれば他の塩基性アミノ酸への置換が挙げられる。
なお、上記(2)及び(3)のポリペプチドにおいて、配列番号2に示すアミノ酸配列における第267位(グルタミン酸)、第268位(トレオニン)、第264位(アラニン)、第393位(フェニルアラニン)のアミノ酸は、電子の伝達、酸素結合の役割を果たし、活性触媒に重要な残基と考えられることから、これらの部位には置換又は欠失を導入しないことが望ましい。
上記(2)及び(3)のポリペプチドにおいて、「ω3系脂肪酸のω3位をエポキシ化する触媒活性」は、基質となるω3系脂肪酸のω3位のみをエポキシ化することで得られるω3位エポキシ化脂肪酸をカラムクロマトグラフィーなどで検出することで評価することができる。具体的には、配列番号2に示すアミノ酸配列からなるポリペプチドを用いた場合と比して、ω3位エポキシ化脂肪酸の生成率が増加している場合に、「ω3系脂肪酸のω3位をエポキシ化する触媒活性」があると言える。また、上記(2)及び(3)のポリペプチドを用いた場合のω3位エポキシ化脂肪酸の生成率は、上記(1)のポリペプチドを用いた場合のω3位エポキシ化脂肪酸の生成率と同等以上であることが好ましく、例えば、上記(1)のポリペプチドを用いた場合のω3位エポキシ化脂肪酸の生成率を1とした場合、上記(2)及び(3)のポリペプチドを用いた場合のω3位エポキシ化脂肪酸の生成率は、0.8~1.2であることが好ましく、0.9~1.1であることがより好ましく、0.95~1.05であることがさらに好ましい。なお、ω3位エポキシ化脂肪酸の生成率とは、基質となるω3系脂肪酸にポリペプチド(オキシゲナーゼ)を作用させて得られる生成物の全質量(全モル濃度)に対するω3位エポキシ化脂肪酸の質量(モル濃度)の割合である。
本発明のポリペプチド(オキシゲナーゼ)の基質特異性は高く、ω3系脂肪酸のω3位を特異的にエポキシ化する。具体的には、エイコサペンタエン酸(EPA)を基質として上記(1)~(3)のポリペプチドを作用させたときのω3位エポキシ化脂肪酸の生成率が40%以上である場合に、ω3系脂肪酸のω3位を特異的にエポキシ化していると判定できる。なお、EPAに上記(1)~(3)のポリペプチドを作用させたときのω3位エポキシ化脂肪酸の生成率は、それぞれ40%以上であることが好ましく、50%以上であることがより好ましく、60%以上であることがさらに好ましく、65%以上であることが一層好ましく、70%以上であることがより一層好ましく、80%以上であることが特に好ましく、85%以上であることが最も好ましい。また、EPAに上記(1)~(3)のポリペプチドを作用させたときのω3位エポキシ化脂肪酸の生成率は100%であってもよい。
(酵素剤)
本発明は、上述したポリペプチドを含む、高度不飽和脂肪酸(ω3系脂肪酸)のω3位エポキシ化用酵素剤に関するものであってもよい。ω3位エポキシ化用酵素剤は、上述したポリペプチドを有効成分として含む。なお、酵素剤は、上述したポリペプチドからなるものであってもよい。
本発明は、上述したポリペプチドを含む、高度不飽和脂肪酸(ω3系脂肪酸)のω3位エポキシ化用酵素剤に関するものであってもよい。ω3位エポキシ化用酵素剤は、上述したポリペプチドを有効成分として含む。なお、酵素剤は、上述したポリペプチドからなるものであってもよい。
本発明の酵素剤における上記ポリペプチドの含有量は特に限定されるものではなく、ω3系脂肪酸のω3位をエポキシ化する触媒活性が発揮される範囲で適宜設定することができる。
本発明の酵素剤は、上記ポリペプチド以外に、本発明の効果に影響を与えない程度に、他の成分を含んでいてもよい。他の成分としては、上記ポリペプチド以外の他の酵素、添加剤、後述する製造方法で生じた培養残渣等が挙げられる。中でも、酵素剤は、上記ポリペプチド以外の他の酵素を含むことが好ましく、ω3系脂肪酸のω3位をエポキシ化する際には、上記ポリペプチドに加えて、併用酵素を用いてもよい。
具体的には、酵素剤には、上記ポリペプチドに加えて、NADPH再生酵素、活性酸素を除去する酵素及び過酸化水素を除去する酵素よりなる群から選択される一又は二以上の酵素が含まれていてもよい。ω3系脂肪酸のω3位をエポキシ化する際には、上記ポリペプチドに加えて、上記併用酵素を使用することにより、反応速度を高めることができ、ω3位エポキシ化脂肪酸の生成速度を高めることができる。
NADPH再生酵素としては、補酵素NADPをNADPHに変換可能な酵素であれば特に限定されない。NADPH再生酵素は、モノオキシゲナーゼが反応で消費する補酵素NADPHを再生することで、酵素の再活性化を実現する。そのため、同じ反応時間で比較すると基質の変換速度を向上させる効果がある。
NADPH再生酵素の種類としては、グルコースデヒドロゲナーゼ、アルコール脱水素酵素、D-乳酸脱水素酵素、りんご酸脱水素酵素、イソクエン酸脱水素酵素、グルコース6リン酸脱水素酵素が挙げられる。NADPH再生酵素としては、これらのNADPH再生酵素から1種を用いてもよいし、2種以上を組み合わせて用いてもよい。これらのNADPH再生酵素の中でも、効果をより一層高める観点から、好ましくはNAD(P)依存性グルコースデヒドロゲナーゼ(グルコースデヒドロゲナーゼ(NAD))が挙げられる。グルコースデヒドロゲナーゼの具体例としては、Bacillus megateriumに由来するグルコースデヒドロゲナーゼが挙げられる。
NADPH再生酵素は、公知の方法により調製することができる。一例として、Bacillus megaterium由来グルコースデヒドロゲナーゼを調製する場合、生産菌を培養し、グルコースデヒドロゲナーゼを公知の手段を用いて分離する方法、及び遺伝子組み換え技術を用いる方法等によって容易に調製することができる。また、NADPH再生酵素として、市販品を用いてもよい。市販品のNADPH再生酵素としては、天野エンザイム(株)製グルコースデヒドロゲナーゼ、オリエンタル酵母社製ADH、rD-LDH、rMDH、rICDH、rG6PDHが挙げられる。
活性酸素を除去する酵素や過酸化水素を除去する酵素としては、活性酸素を過酸化水素と分子状酸素に不均化する酵素、又は過酸化水素を水に変換可能な酵素であれば特に限定されない。活性酸素を除去する酵素や過酸化水素を除去する酵素は、モノオキシゲナーゼが反応で生成する活性酸素や過酸化水素を分解して酵素を保護する効果がある。そのため、モノオキシゲナーゼの失活を防止して基質の変換速度を向上させる効果がある。
活性酸素を除去する酵素や過酸化水素を除去する酵素の種類としては、カタラーゼ、ペルオキシダーゼ、スーパーオキシドディスムターゼが挙げられる。これらの活性酸素や過酸化水素を除去する酵素から1種を用いてもよいし、2種以上を組み合わせて用いてもよい。これらの活性酸素や過酸化水素を除去する酵素の中でも、効果をより一層高める観点から、好ましくはカタラーゼ、スーパーオキシドディスムターゼが挙げられる。
活性酸素を除去する酵素や過酸化水素を除去する酵素としては、動物臓器、西洋わさび、大腸菌、Aspergillus属に由来するカタラーゼ、ペルオキシダーゼ、スーパーオキシドディスムターゼを用いることもできる。これらのカタラーゼ、ペルオキシダーゼ、スーパーオキシドディスムターゼの中でも、効果をより一層高める観点から、好ましくは動物臓器由来カタラーゼ又は、動物由来スーパーオキシドディスムターゼが挙げられる。
活性酸素を除去する酵素や過酸化水素を除去する酵素は、公知の方法により調製することができる。例えば、動物由来カタラーゼを調製する場合、動物臓器からカタラーゼを公知の手段を用いて分離する方法や、遺伝子組み換え技術を用いる方法等によって容易に調製することができる。また、活性酸素や過酸化水素を除去する酵素として、市販品を用いてもよい。市販品の活性酸素や過酸化水素を除去する酵素としては、富士フイルム和光社製カタラーゼ、富士フイルム和光社製ペルオキシダーゼ、シグマ―アルドリッチ社製スーパーオキシドディスムターゼ、東洋紡社製スーパーオキシドディスムターゼが挙げられる。
酵素剤が含み得る他の酵素としては、その他に、アミラーゼ(α-アミラーゼ、β-アミラーゼ、グルコアミラーゼ)、グルコシダーゼ(α-グルコシダーゼ、β-グルコシダーゼ)、ガラクトシダーゼ(α-ガラクトシダーゼ、β-ガラクトシダーゼ)、プロテアーゼ(酸性プロテアーゼ、中性プロテアーゼ、アルカリプロテアーゼ)、ペプチダーゼ(ロイシンペプチダーゼ、アミノペプチダーゼ)、リパーゼ、エステラーゼ、セルラーゼ、ホスファターゼ(酸性ホスファターゼ、アルカリホスファターゼ)、ヌクレアーゼ、デアミナーゼ、オキシダーゼ、デヒドロゲナーゼ、グルタミナーゼ、ペクチナーゼ、カタラーゼ、デキストラナーゼ、トランスグルタミナーゼ、蛋白質脱アミド酵素、プルラナーゼ、ペルオキシダーゼ、スーパーオキシドディスムターゼ等が挙げられる。これらの他の酵素は、1種であってもよく、複数種であってもよい。
添加剤としては、賦形剤、緩衝剤、懸濁剤、安定剤、保存剤、防腐剤、生理食塩水等が挙げられる。賦形剤としては、デンプン、デキストリン、マルトース、トレハロース、乳糖、D-グルコース、ソルビトール、D-マンニトール、白糖、グリセロール等が挙げられる。緩衝剤としては、リン酸塩、クエン酸塩、酢酸塩等が挙げられる。安定剤としては、プロピレングリコール、アスコルビン酸等が挙げられる。保存剤としては、フェノール、塩化ベンザルコニウム、ベンジルアルコール、クロロブタノール、メチルパラベン等が挙げられる。防腐剤としては、エタノール、塩化ベンザルコニウム、パラオキシ安息香酸、クロロブタノール等が挙げられる。これらの添加剤は、1種であってもよく、複数種であってもよい。
培養残渣としては、培地由来の成分、夾雑タンパク質、菌体成分等が挙げられる。
本発明の酵素剤の形態としては特に限定されず、例えば、液状、固形状(粉末、顆粒等)、担体への固定等が挙げられる。
(DNA)
本発明は、以下の(1)から(3)のいずれかに示すDNAに関する。
(1)上述したポリペプチドをコードするDNA。
(2)配列番号3、5、7及び9のいずれかに示す塩基配列からなるDNA。
(3)配列番号3、5、7及び9のいずれかに示す塩基配列と等価の配列を含み、ω3系脂肪酸のω3位をエポキシ化する触媒活性を有するポリペプチドをコードするDNA。
本発明は、以下の(1)から(3)のいずれかに示すDNAに関する。
(1)上述したポリペプチドをコードするDNA。
(2)配列番号3、5、7及び9のいずれかに示す塩基配列からなるDNA。
(3)配列番号3、5、7及び9のいずれかに示す塩基配列と等価の配列を含み、ω3系脂肪酸のω3位をエポキシ化する触媒活性を有するポリペプチドをコードするDNA。
配列番号3に示される塩基配列は配列番号4に示すF87K変異体のアミノ酸配列からなるポリペプチドをコードする遺伝子のcDNA配列、配列番号5に示される塩基配列は配列番号6に示すF87K及びA330V二重変異体のアミノ酸配列からなるポリペプチドをコードする遺伝子のcDNA配列、配列番号7に示される塩基配列は配列番号8に示すP25L、F87K及びA330V三重変異体のアミノ酸配列からなるポリペプチドをコードする遺伝子のcDNA配列、配列番号9に示される塩基配列は配列番号10に示すF87K、A330V及びT438M三重変異体のアミノ酸配列からなるポリペプチドをコードする遺伝子のcDNA配列である。
上記(1)のDNAに対する上記(3)のDNAの相同性は、75%以上であることが好ましく、80%以上であることがより好ましく、85%以上であることがさらに好ましく、90%以上であることが一層好ましく、95%以上であることがより一層好ましく、99%以上であることが特に好ましい。配列番号3、5、7及び9のいずれかに示す塩基配列と異なる配列を有するDNAであっても、ω3系脂肪酸のω3位を特異的にエポキシ化する触媒活性を有するポリペプチドをコードするものであれば、配列番号3、5、7及び9のいずれかに示す塩基配列と等価であると言うことができ、本発明のDNAに含まれる。
DNAの「相同性」は、基準配列を照会配列として比較するアルゴリズムをもった公開又は市販されているソフトウェアを用いて計算される。具体的には、BLAST、FASTA、又はGENETYX(ソフトウエア開発株式会社製)等を用いることができ、これらはデフォルトパラメーターに設定して使用すればよい。
また、アミノ酸配列の任意相違部位において1個又は数個のアミノ酸残基が置換、付加、挿入又は欠失されてなるポリペプチドをコードする塩基配列も、それがω3系脂肪酸のω3位を特異的にエポキシ化する触媒活性を有するポリペプチドをコードする限り、本発明のDNAに含まれる。
さらに、上述したポリペプチドをコードする塩基配列からなるDNAと相補的な塩基配列からなるDNAとストリンジェントな条件下でハイブリダイズするDNAも、これがω3系脂肪酸のω3位を特異的にエポキシ化する触媒活性を有するポリペプチドをコードする限り、本発明のDNAに含まれる。ここで、ストリンジェントな条件としては、例えば、DNAを固定したナイロン膜を、6×SSC(1×SSCは、塩化ナトリウム8.76g、クエン酸ナトリウム4.41gを1リットルの水に溶かしたもの)、1%SDS、100μg/mLサケ精子DNA、0.1%ウシ血清アルブミン、0.1%ポリビニルピロリドン、0.1%フィコールを含む溶液(本明細書において、「%」は「w/v」を意味する。)中で65℃にて20時間プローブとともに保温してハイブリダイゼーションを行う条件を挙げることができる。しかし、この条件に限定されず、当業者であれば、このような緩衝液の塩濃度、温度等の条件に加えて、その他のプローブ濃度、プローブの長さ、反応時間等の諸条件を加味し、ハイブリダイゼーションの条件を設定することができる。ハイブリダイゼーション法の詳細な手順については、Molecular Cloning, A Laboratory Manual 2nd ed.(Cold Spring Harbor Laboratory Press (1989))等を参照することができる。
以下に、ハイブリダイゼーションにより本発明のDNAを得る方法の一例を示すが、本発明のDNAを得る方法は以下に限定されない。
まず、適当な遺伝子源から得たDNAを定法に従ってプラスミドやファージベクターに接続してDNAライブラリを作製する。このライブラリを適当な宿主に導入して得られる形質転換体をプレート上で培養し、生育したコロニー又はプラークをニトロセルロースやナイロンの膜に移しとり、変性処理の後にDNAを膜に固定する。この膜をあらかじめ32P等で標識したプローブを含む上記の組成の溶液中、上記のストリンジェントな条件で保温し、ハイブリダイゼーションを行う。プローブとしては、配列番号2に記載したアミノ酸配列の全部又は一部をコードするポリヌクレオチドを使用することができる。
まず、適当な遺伝子源から得たDNAを定法に従ってプラスミドやファージベクターに接続してDNAライブラリを作製する。このライブラリを適当な宿主に導入して得られる形質転換体をプレート上で培養し、生育したコロニー又はプラークをニトロセルロースやナイロンの膜に移しとり、変性処理の後にDNAを膜に固定する。この膜をあらかじめ32P等で標識したプローブを含む上記の組成の溶液中、上記のストリンジェントな条件で保温し、ハイブリダイゼーションを行う。プローブとしては、配列番号2に記載したアミノ酸配列の全部又は一部をコードするポリヌクレオチドを使用することができる。
ハイブリダイゼーションの終了後、非特異的に吸着したプローブを洗い流し、オートラジオグラフィ等によりプローブとハイブリッドを形成したクローンを同定する。この操作をハイブリッド形成クローンが単離できるまで繰り返す。最後に、得られたクローンの中から、目的の酵素活性を有するタンパク質をコードする遺伝子を選択する。遺伝子の単離は、アルカリ法等の公知のポリヌクレオチド抽出法により実施できる。
また、本発明のDNAは、上記所定のポリペプチドを産生する微生物から単離することもできる。例えば、Bacillus megateriumのゲノムDNAを鋳型として、既知のアミノ酸配列情報から遺伝子の縮重を考慮して設計したプライマー若しくはプローブ又は既知の塩基配列情報に基づいて設計したプライマー又はプローブを用いたPCR又はハイブリダイゼーション法により、上記微生物のゲノムから目的のDNAを単離することができる。
本発明のDNAには、コドンの縮重に由来する多種のDNAが包含される。同じアミノ酸配列をコードする多種のDNAを人為的に作製することは、公知の遺伝子工学的手法を用いて容易に行うことができる。例えば、遺伝子工学的なタンパク質の生産において、目的のタンパク質をコードする本来の遺伝子上で使用されているコドンが宿主中では使用頻度の低いものであった場合には、タンパク質の発現量が低いことがある。このような場合にはコードされているアミノ酸配列に変化を与えることなく、コドン利用頻度を宿主に最適化することにより、目的タンパク質の高発現を図ることができる。
コドン利用頻度を表す指標として、各コドンの宿主最適コドン利用頻度の総計を採択すればよい。最適コドンとは、同じアミノ酸に対応するコドンのうち利用頻度が最も高いコドンと定義される。コドン利用頻度は、宿主に最適化したものであれば特に限定されないが、例えば、大腸菌の最適コドンの一例として以下のものが挙げられる。
F:フェニルアラニン(ttt)、L:ロイシン(ctg)、I:イソロイシン(att)、M:メチオニン(atg)、V:バリン(gtg)、Y:チロシン(tat)、終止コドン(taa)、H:ヒスチジン(cat)、Q:グルタミン(cag)、N:アスパラギン(aat)、K:リジン(aaa)、D:アスパラギン酸(gat)、E:グルタミン酸(gaa)、S:セリン(agc)、P:プロリン(ccg)、T:スレオニン(acc)、A:アラニン(gcg)、C:システイン(tgc)、W:トリプトファン(tgg)、R:アルギニン(cgc)、G:グリシン(ggc)。
F:フェニルアラニン(ttt)、L:ロイシン(ctg)、I:イソロイシン(att)、M:メチオニン(atg)、V:バリン(gtg)、Y:チロシン(tat)、終止コドン(taa)、H:ヒスチジン(cat)、Q:グルタミン(cag)、N:アスパラギン(aat)、K:リジン(aaa)、D:アスパラギン酸(gat)、E:グルタミン酸(gaa)、S:セリン(agc)、P:プロリン(ccg)、T:スレオニン(acc)、A:アラニン(gcg)、C:システイン(tgc)、W:トリプトファン(tgg)、R:アルギニン(cgc)、G:グリシン(ggc)。
遺伝子に変異を導入し、人為的にアミノ酸配列を改変する方法としては、Kunkel法やGapped duplex法等の公知の手法、及び部位特異的突然変異誘発法を利用した変異導入キット、例えば、QuikChangeTM Site-Directed Mutagenesis Kit(ストラタジーン社)、GeneTailorTM Site-Directed Mutagenesis System(インビトロジェン社)、TaKaRa Site-Directed Mutagenesis System(Mutan-K,Mutan-Super Express Km等:タカラバイオ社)等を用いることができる。
DNAの塩基配列の確認は、慣用の方法により配列決定することにより行うことができる。例えば、ジデオキシヌクレオチドチェーンターミネーション法(Sanger et al.(1977) Proc.Natl.Acad.Sci.USA 74:5463)等により行うことができる。また、適当なDNAシークエンサーを利用して配列を解析することができる。
得られたDNAが目的のポリペプチドをコードするDNAであるかどうかの確認は、決定された塩基配列を配列番号1に記載の塩基配列と比較して行うことができる。あるいは決定された塩基配列より推定されるアミノ酸配列を配列番号2に記載のアミノ酸配列と比較して行うことができる。また、発現するポリペプチドの触媒活性(ω3系脂肪酸のω3位を特異的にエポキシ化する触媒活性)を測定することによっても確認することができる。
(発現カセット/組み換えベクター)
本発明は、上述したDNAを含む発現カセット、もしくは、上述したDNAを含む組み換えベクターに関するものであってもよい。本発明の発現カセットもしくは組み換えベクターは、上述したDNAにプロモーター及びターミネーターを連結することにより得ることができる。また、組み換えベクターは、発現用ベクターに本発明の発現カセットもしくは本発明のDNAを挿入することにより得ることができる。
本発明は、上述したDNAを含む発現カセット、もしくは、上述したDNAを含む組み換えベクターに関するものであってもよい。本発明の発現カセットもしくは組み換えベクターは、上述したDNAにプロモーター及びターミネーターを連結することにより得ることができる。また、組み換えベクターは、発現用ベクターに本発明の発現カセットもしくは本発明のDNAを挿入することにより得ることができる。
本発明の発現カセット、もしくは、本発明の組換えベクターには、制御因子として、プロモーター及びターミネーターに加えて、必要に応じてエンハンサー、CCAATボックス、TATAボックス、SPI部位等の転写要素が含まれていてもよい。これらの制御因子は、本発明のDNAに作動可能に連結されていればよい。作動可能に連結とは、本発明のDNAを調節する種々の制御因子と本発明のDNAが、宿主細胞中で作動し得る状態で連結されることをいう。
本発明の組換えベクターを構築する発現用ベクターとしては、宿主内で自律的に増殖し得るファージ、プラスミド又はウイルスから遺伝子組換え用として構築されたものが好適である。このような発現用ベクターは公知であり、例えば、商業的に入手可能な発現用ベクターとしては、pQE系ベクター(株式会社キアゲン)、pDR540、pRIT2T(GEヘルスケアバイオサイエンス株式会社)、pET系ベクター(メルク株式会社)、pHY300PLK、pBE-S(タカラバイオ株式会社)等が挙げられる。発現用ベクターは、宿主細胞との適切な組み合わせを選んで使用すればよく、例えば、大腸菌を宿主細胞とする場合には、pBAD系ベクターとDH5α大腸菌株の組み合わせ、pET系ベクターとBL21(DE3)大腸菌株の組み合わせ、又はpDR540ベクターとJM109大腸菌株の組み合わせ等が挙げられる。DNAと発現カセット又は発現用ベクターとの結合は、例えばDNAリガーゼを用いて行う。
(形質転換体)
本発明は、上述したDNA、上述した発現カセット又は上述した組み換えベクターを有する形質転換体に関するものであってもよい。すなわち、本発明の形質転換体は、上述したDNAを含む発現カセット、もしくは、上述したDNAを含む組み換えベクターにより宿主を形質転換して得られる形質転換体である。
本発明は、上述したDNA、上述した発現カセット又は上述した組み換えベクターを有する形質転換体に関するものであってもよい。すなわち、本発明の形質転換体は、上述したDNAを含む発現カセット、もしくは、上述したDNAを含む組み換えベクターにより宿主を形質転換して得られる形質転換体である。
形質転換体の製造に使用される宿主としては、本発明のDNAを含む遺伝子の形質を発現できるものであれば特に制限されないが、遺伝子の導入が可能で、発現カセット又は組換えベクターを安定に保持でき、かつ自律増殖可能なものであることが好ましい。宿主としては、例えば、大腸菌(Escherichia coli)等のエッシェリヒア属、バチルス・ズブチリス(Bacillus subtilis)等のバチルス属、シュードモナス・プチダ(Pseudomonas putida)等のシュードモナス属等に属する細菌;酵母等が好適な例として挙げられるが、その他、動物細胞、昆虫細胞、植物等であってもよい。
本発明の形質転換体は、宿主に上述したDNAを含む発現カセット、もしくは、上述したDNAを含む組み換えベクターを導入することによって得ることができる。当該DNAが導入される場所は、目的の遺伝子が発現される限り特に限定されず、プラスミド上であってもよいし、ゲノム上であってもよい。発現カセット又は組換えベクターを宿主に導入する具体的な方法としては、例えば、組換えベクター法、ゲノム編集法が挙げられる。宿主に発現カセット又は組換えベクターを導入する条件は、宿主の種類等に応じて適宜設定すればよい。宿主が細菌の場合であれば、例えば、カルシウムイオン処理によるコンピテントセルを用いる方法や、エレクトロポレーション法等が挙げられる。宿主が酵母の場合であれば、例えば、電気穿孔法(エレクトロポレーション法)、スフェロプラスト法、酢酸リチウム法等が挙げられる。宿主が動物細胞の場合であれば、例えば、エレクトロポレーション法、リン酸カルシウム法、リポフェクション法等が挙げられる。宿主が昆虫細胞の場合であれば、例えば、リン酸カルシウム法、リポフェクション法、エレクトロポレーション法等が挙げられる。宿主が植物胞の場合であれば、例えば、エレクトロポレーション法、アグロバクテリウム法、パーティクルガン法、PEG法等が挙げられる。
発現カセット又は組換えベクターが宿主に組み込まれたか否かの確認は、PCR法、サザンハイブリダイゼーション法、ノーザンハイブリダイゼーション法等により行うことができる。
PCR法によって発現カセット又は組換えベクターが宿主に組み込まれたか否かを確認する場合、例えば、形質転換体からゲノムDNA、発現カセット又は組換えベクターを分離・精製すればよい。ゲノムDNA、発現カセット又は組換えベクターの分離・精製は、例えば、宿主が細菌の場合、細菌を溶菌して溶菌物を得ることにより行われる。溶菌を行う場合、例えばリゾチームなどの溶菌酵素により処理が施され、必要に応じてプロテアーゼ、及び他の酵素並びにラウリル硫酸ナトリウム(SDS)等の界面活性剤が併用される。また、溶菌を行う場合、凍結融解及びフレンチプレス処理のような物理的破砕方法を組み合わせてもよい。溶菌物からのDNAの分離・精製は、例えば、フェノール処理やプロテアーゼ処理による除蛋白処理、リボヌクレアーゼ処理、アルコール沈殿処理並びに市販のキットを適宜組み合わせることにより行うことができる。
分離・精製したDNAを鋳型として、本発明のDNAに特異的なプライマーを設計してPCRを行う。PCRにより得られた増幅産物についてアガロースゲル電気泳動、ポリアクリルアミドゲル電気泳動、キャピラリー電気泳動等を行い、臭化エチジウム及びSYBR Green液等により染色し、そして増幅産物をバンドとして検出することにより、形質転換されたことを確認することができる。なお、DNAの切断は、常法に従い、例えば制限酵素処理を用いて行うことができる。制限酵素としては、例えば特定のヌクレオチド配列に作用するII型制限酵素を用いる。
また、PCR法によって発現カセット又は組換えベクターが宿主に組み込まれたか否かを確認する場合、予め蛍光色素等により標識したプライマーを用いてPCRを行い、増幅産物を検出することもできる。さらに、マイクロプレート等の固相に増幅産物を結合させ、蛍光及び酵素反応等により増幅産物を確認する方法も採用してもよい。
(オキシゲナーゼの製造方法)
本発明は、上述した形質転換体を培養する工程を含む、オキシゲナーゼの製造方法に関する。
本発明は、上述した形質転換体を培養する工程を含む、オキシゲナーゼの製造方法に関する。
形質転換体を培養する工程の培養条件は、上記形質転換体の栄養生理的性質を考慮して適宜設定すればよいが、好ましくは液体培養が挙げられる。また、工業的製造を行う場合であれば、通気攪拌培養を採用することが好ましい。培地の栄養源としては、形質転換体の生育に必要とされるものが使用される。炭素源としては、資化可能な炭素化合物であればよく、例えば、グルコース、スクロース、ラクトース、マルトース、糖蜜、ピルビン酸等が挙げられる。窒素源としては、資化可能な窒素化合物であればよく、例えば、ペプトン、肉エキス、酵母エキス、カゼイン加水分解物、大豆粕アルカリ抽出物が挙げられる。炭素源及び窒素源の他に、例えば、リン酸塩、炭酸塩、硫酸塩、マグネシウム、カルシウム、カリウム、鉄、マンガン及び亜鉛などの塩類、特定のアミノ酸並びに特定のビタミンなどを必要に応じて使用してもよい。
培養温度は、上記形質転換体が生育可能であり、かつ上記形質転換体が所定のオキシゲナーゼを産生する範囲で適宜設定する。例えば、培養温度は、15~37℃程度であることが好ましい。培養は、オキシゲナーゼが最高収量に達する時期を見計らって適当時期に完了すればよく、通常は培養時間が12~48時間程度となる。
形質転換体を培養する工程の後には、培養液を遠心分離などの方法に供し、培養上清及び/又は菌体を回収する。菌体には、超音波やフレンチプレスといった機械的方法又はリゾチーム等の溶菌酵素による処理を施し、必要に応じてプロテアーゼ等の酵素やラウリル硫酸ナトリウム(SDS)等の界面活性剤を使用することにより可溶化し、所定のオキシゲナーゼを含む水溶性画分を得る。また、適当な発現カセット又は発現用ベクターと宿主を選択することにより、発現したオキシゲナーゼを培養液中に分泌させることもできる。
上記のようにして得られたオキシゲナーゼを含む水溶性画分は、そのまま精製処理に供してもよいが、該水溶性画分中のオキシゲナーゼを濃縮した後に精製処理に供してもよい。濃縮は、例えば、減圧濃縮、膜濃縮、塩析処理、親水性有機溶媒(例えば、メタノール、エタノール及びアセトン)による分別沈殿法等により行うことができる。また、精製処理は、例えば、ゲルろ過、吸着クロマトグラフィー、イオン交換クロマトグラフィー、アフィニティクロマトグラフィー等の方法を適宜組み合わせることによって行うことができる。精製された所定のオキシゲナーゼは、必要に応じて、凍結乾燥、真空乾燥、スプレードライ等により粉末化してもよい。
(ω3位エポキシ化脂肪酸の製造方法)
本発明は、高度不飽和脂肪酸(ω3系脂肪酸)に、上述したポリペプチドもしくは上述したω3位エポキシ化用酵素剤を作用させる工程を含む、ω3位エポキシ化脂肪酸の製造方法に関する。上述したポリペプチド(オキシゲナーゼ)をω3系脂肪酸に作用させることで、ω3系脂肪酸のω3位のみがエポキシ化した脂肪酸(ω3位エポキシ化脂肪酸)を得ることができる。換言すれば、ω3系脂肪酸を上述したポリペプチド(オキシゲナーゼ)で処理することで、ω3系脂肪酸のω3位のみがエポキシ化した脂肪酸(ω3位エポキシ化脂肪酸)を得ることができる。
本発明は、高度不飽和脂肪酸(ω3系脂肪酸)に、上述したポリペプチドもしくは上述したω3位エポキシ化用酵素剤を作用させる工程を含む、ω3位エポキシ化脂肪酸の製造方法に関する。上述したポリペプチド(オキシゲナーゼ)をω3系脂肪酸に作用させることで、ω3系脂肪酸のω3位のみがエポキシ化した脂肪酸(ω3位エポキシ化脂肪酸)を得ることができる。換言すれば、ω3系脂肪酸を上述したポリペプチド(オキシゲナーゼ)で処理することで、ω3系脂肪酸のω3位のみがエポキシ化した脂肪酸(ω3位エポキシ化脂肪酸)を得ることができる。
本発明の製造方法で得られるω3位エポキシ化脂肪酸は特に限定されるものではないが、ω3位エポキシ化脂肪酸の例としては、19,20-エポキシドコサペンタエン酸(19,20-EpDPE)、19,20-エポキシドコサテトラエン酸(19,20-EpDTE)、17,18-エポキシエイコサテトラエン酸(17,18-EpETE)が挙げられる。中でも、ω3位エポキシ化脂肪酸は、17,18-エポキシエイコサテトラエン酸(17,18-EpETE)であることが好ましい。
本発明のω3位エポキシ化脂肪酸の製造方法の原料となるω3系脂肪酸の起源、性状等に特段の制約はない。ω3系脂肪酸の具体例としては、魚油、亜麻仁油、エゴマ油、魚油や藻類、植物から抽出されたドコサヘキサエン酸、ドコサペンタエン酸、エイコサペンタエン酸、アラキドン酸、αリノレン酸、ステアリドン酸、11,14,17-エイコサトリエン酸、テトラコサペンタエン酸、テトラコサヘキサエン酸等が挙げられる。ω3系脂肪酸としては、これらを単独で用いてもよく、2種以上を組み合わせて用いてもよい。これらの原料のなかでも、ω3系脂肪酸は、ドコサヘキサエン酸、ドコサペンタエン酸、エイコサペンタエン酸、アラキドン酸及びαリノレン酸よりなる群から選択される少なくとも1種であることが好ましく、ドコサヘキサエン酸、ドコサペンタエン酸及びエイコサペンタエン酸よりなる群から選択される少なくとも1種であることがより好ましく、エイコサペンタエン酸であることがさらに好ましい。また、ω3系脂肪酸は、これらの脂肪酸を含有する油脂の状態であってもよい。ω3系脂肪酸の性状も特に限定されないが、好ましくは、液状、スラリー状、ペースト状が挙げられる。
本発明のω3位エポキシ化脂肪酸の製造方法で使用するポリペプチド(オキシゲナーゼ)の量は特に限定されるものではないが、反応溶液1mL当たりのオキシゲナーゼ酵素活性は、0.0001U以上であることが好ましい。効果をより一層高める観点から、反応溶液1mL当たりのオキシゲナーゼ酵素活性は、0.0005U以上であることがより好ましく、0.001U以上であることがさらに好ましく、0.005U以上であることが一層好ましく、0.01U以上であることが特に好ましい。反応溶液1mL当たりのオキシゲナーゼ酵素活性の上限値は特に限定されないが、例えば10U以下であることが好ましく、5U以下であることがより好ましく、1U以下であることがさらに好ましく、0.5U以下であることが一層好ましく、0.1U以下であることがより一層好ましい。
オキシゲナーゼの酵素活性については、補酵素であるNADPHの減少量を指標にして評価可能である。NADPH量は、吸光光度計で反応溶液の吸光度360nmを測定することで定量可能である。また、オキシゲナーゼの酵素活性の測定方法としては、基質をエイコサペンタエン酸(EPA)として、オキシゲナーゼを作用させ、EPAのエポキシ化生成物を高速液体クロマトグラフィー(HPLC)を用いて定量する分析法がある。HPLCのチャート図から算出されるエリア面積から、基質の消費量、生成物の生成量を定量することが可能である。3.3mM基質EPAを基質に用いて35℃で1時間処理した場合に、1分間に17,18-EpETEを1mg生成する酵素量を1unit(1U)とする。
本発明のω3位エポキシ化脂肪酸の製造方法においては、上記オキシゲナーゼ以外の他の酵素を併用してもよい。併用する酵素は、NADPH再生酵素、活性酸素を除去する酵素及び過酸化水素を除去する酵素よりなる群から選択される一又は二以上の酵素であることが好ましい。ω3系脂肪酸のω3位をエポキシ化し、ω3位エポキシ化脂肪酸を製造する際には、上記オキシゲナーゼに加えて、上記併用酵素を使用することにより、反応速度を高めることができ、ω3位エポキシ化脂肪酸の生成速度を高めることができる。なお、各酵素を作用させる順序(すなわち、オキシゲナーゼと併用酵素の処理の順番)は同時処理とすることが好ましい。
NADPH再生酵素、活性酸素を除去する酵素及び過酸化水素を除去する酵素の好ましい態様は、上述した(酵素剤)の項目に記載のとおりである。
本発明のω3位エポキシ化脂肪酸の製造方法で使用するNADPH再生酵素の量は特に限定されないが、反応溶液1mL当たりの酵素活性は、0.1U以上であることが好ましい。効果をより一層高める観点から、反応溶液1mL当たりのNADPH再生酵素活性は、0.5U以上であることがより好ましく、1.0U以上であることがさらに好ましく、3.5U以上であることが一層好ましく、7.0U以上であることが特に好ましい。反応溶液1mL当たりのNADPH再生酵素活性の上限値は特に限定されないが、例えば500U以下であることが好ましく、100U以下であることがより好ましく、50U以下であることがさらに好ましく、20U以下であることが一層好ましく、15U以下であることがより一層好ましく、10U以下であることが特に好ましい。
NADPH再生酵素の活性については、基質のNADPから生成するNADPH量を定量することで算定される。NADPHは吸光光度計で反応溶液の吸光度360nmを測定することで、定量可能である。1分当たりに1μMのNADPH(mM)が生成することを1unitと定義する。
本発明のω3位エポキシ化脂肪酸の製造方法で使用する活性酸素や過酸化水素を除去する酵素の量は特に限定されないが、反応溶液1mL当たりの酵素活性は、0.1U以上であることが好ましい。効果をより一層高める観点から、反応溶液1mL当たりの活性酸素や過酸化水素を除去する酵素の活性は、1U以上であることがより好ましく、10U以上であることがさらに好ましく、25U以上であることが一層好ましく、45U以上であることが特に好ましい。反応溶液1mL当たりの活性酸素や過酸化水素を除去する酵素の活性の上限値は特に限定されないが、例えば5000U以下であることが好ましく、3000U以下であることがより好ましく、1000U以下であることがさらに好ましく、100U以下であることが一層好ましく、75U以下であることがより一層好ましく、60U以下であることが特に好ましい。
活性酸素や過酸化水素を除去する酵素活性のうち、カタラーゼの活性は1.0μモルのH2O2を分解し、その間に吸光度240nmで測定したH2O2濃度の減少率を測定することで算定される。1分当たりに1μMのH2O2(mM)が消費されることを1unitと定義する。また、活性酸素や過酸化水素を除去する酵素活性のうち、ペルオキシダーゼ活性は、ピロガロールと過酸化水素から、プルプロガリンを生成させてその発色量を測定することで算定される。1unitはピロガロールと過酸化水素を主基質にして20秒に1mgのプルプロガリンを生成する酵素量とする。活性酸素や過酸化水素を除去する酵素活性のうち、スーパーオキシドディスムターゼの活性は、テトラゾリウム塩を用いて、キサンチンオキシダーゼにより産生されたスーパーオキシドラジカル量を測定することで算定される。1unitは、スーパーオキシドラジカルの50%を不均化するために必要な酵素量と定義する。
ω3系脂肪酸にオキシゲナーゼ(必要に応じて、NADPH再生酵素、活性酸素を除去する酵素及び過酸化水素を除去する酵素よりなる群から選択される一又は二以上の酵素)を作用させるための反応時間、温度、反応溶液のpHなどは特に限定されるものではない。反応温度は、例えば20~50℃であることが好ましく、25~45℃であることがより好ましく、30~40℃であることがさらに好ましい。反応溶液のpHは、例えば5~10であることが好ましく、6~9.5であることがより好ましく、7~9であることがさらに好ましい。反応時間は、例えば10分~48時間であることが好ましく、1~24時間であることがより好ましく、2~12時間であることがさらに好ましく、2~6時間であることが特に好ましい。以上の反応条件によってω3系脂肪酸のω3位のエポキシ化をより効率よく進行させることができる。これらの反応条件は、目的とするエポキシ化脂肪酸に応じて適宜選択される。なお、最適な反応条件は予備実験を通して決定すればよい。
ω3系脂肪酸にオキシゲナーゼ等を作用させる工程の後には、オキシゲナーゼ(必要に応じて、NADPH再生酵素、活性酸素を除去する酵素及び過酸化水素を除去する酵素よりなる群から選択される一又は二以上の酵素)を失活させる工程を設けてもよい。
本発明のω3位エポキシ化脂肪酸の製造方法を用いることで、ω3位エポキシ化脂肪酸又はω3位エポキシ化脂肪酸含有組成物を製造することができる。すなわち、本発明は、上記ω3位エポキシ化脂肪酸の製造方法で得られるω3位エポキシ化脂肪酸もしくはω3位エポキシ化脂肪酸含有組成物に関するものであってもよい。ω3位エポキシ化脂肪酸含有組成物においては、含まれる脂肪酸の80質量%以上がω3位エポキシ化脂肪酸であることが好ましく、90%質量以上がω3位エポキシ化脂肪酸であることがより好ましく、95質量%以上がω3位エポキシ化脂肪酸であることがさらに好ましい。
(ω3系脂肪酸のエポキシ化方法)
本発明は、ω3系脂肪酸に、上述したポリペプチドもしくは上述したω3位エポキシ化用酵素剤を作用させる工程を含む、ω3系脂肪酸のエポキシ化方法に関する。上述したポリペプチド(オキシゲナーゼ)をω3系脂肪酸に作用させることで、ω3系脂肪酸のω3位のみをエポキシ化することができる。
本発明は、ω3系脂肪酸に、上述したポリペプチドもしくは上述したω3位エポキシ化用酵素剤を作用させる工程を含む、ω3系脂肪酸のエポキシ化方法に関する。上述したポリペプチド(オキシゲナーゼ)をω3系脂肪酸に作用させることで、ω3系脂肪酸のω3位のみをエポキシ化することができる。
本発明のω3系脂肪酸のエポキシ化方法においても上述した併用酵素を用いることができる。また、ω3系脂肪酸のエポキシ化方法における反応条件等は、上述したω3位エポキシ化脂肪酸の製造方法においてポリペプチド(オキシゲナーゼ)をω3系脂肪酸に作用工程における条件と同様である。
以下に実施例を挙げて本発明の特徴をさらに具体的に説明する。以下の実施例に示す材料、使用量、割合、処理内容、処理手順等は、本発明の趣旨を逸脱しない限り適宜変更することができる。したがって、本発明の範囲は以下に示す具体例により限定的に解釈されるべきものではない。
(試験方法及び評価)
1.ドッキング構造探索と酵素-基質反応に関わる重要アミノ酸残基の設計(1)
基質EPAに対するエポキシ化位置特異性を向上する目的で、酵素-基質の複合構造のドッキングモデル構築とMDシミュレーションを実行して、基質のω3位の反応に関わる重要なアミノ酸残基を予測した。P450-BM3(Bacillus megaterium由来のP450)の立体構造(PDB番号:1bu7)と基質EPAを用いて、酵素活性部位周辺の残基側鎖あるいは主鎖も含めた構造変化をシミュレーションする目的で、MDシミュレーションを複数パターン実行した。基質が酵素の活性部位に結合した、エネルギー的に安定なドッキングポーズをとることができる構造変化促進領域の設定について様々な条件を検討した。その結果、EPAがポケット内で動きやすくなる条件を見出した。さらに計算の収束性を確認するため、初期条件を変えた計算を2回実行した。その結果、2回の独立した結果が同じアミノ酸残基に収束した。EPAの17,18位がヘムとコンタクトする構造と、14,15位がヘムとコンタクトする構造を比較して、基質のω3位の反応に関わる重要なアミノ酸残基を予測した。その結果、2回の計算で共通してスコアの高いアミノ酸残基を選定した。
1.ドッキング構造探索と酵素-基質反応に関わる重要アミノ酸残基の設計(1)
基質EPAに対するエポキシ化位置特異性を向上する目的で、酵素-基質の複合構造のドッキングモデル構築とMDシミュレーションを実行して、基質のω3位の反応に関わる重要なアミノ酸残基を予測した。P450-BM3(Bacillus megaterium由来のP450)の立体構造(PDB番号:1bu7)と基質EPAを用いて、酵素活性部位周辺の残基側鎖あるいは主鎖も含めた構造変化をシミュレーションする目的で、MDシミュレーションを複数パターン実行した。基質が酵素の活性部位に結合した、エネルギー的に安定なドッキングポーズをとることができる構造変化促進領域の設定について様々な条件を検討した。その結果、EPAがポケット内で動きやすくなる条件を見出した。さらに計算の収束性を確認するため、初期条件を変えた計算を2回実行した。その結果、2回の独立した結果が同じアミノ酸残基に収束した。EPAの17,18位がヘムとコンタクトする構造と、14,15位がヘムとコンタクトする構造を比較して、基質のω3位の反応に関わる重要なアミノ酸残基を予測した。その結果、2回の計算で共通してスコアの高いアミノ酸残基を選定した。
2.変異酵素の活性測定(1)
上記のMD計算から高スコアとなった7アミノ酸残基について、それぞれの残基に部位特異的変異導入を行った。部位特異的変異導入とは、選定したアミノ酸残基について、本来のアミノ酸残基(野生型)をそれ以外のアミノ酸残基(19種類)に置換した変異酵素を作製することである。後述する方法に従って、アミノ酸を置換した変異酵素を133種類作製して、酵素と基質(EPA)の反応を行った。反応後の生成物をHPLCにより解析した。
上記のMD計算から高スコアとなった7アミノ酸残基について、それぞれの残基に部位特異的変異導入を行った。部位特異的変異導入とは、選定したアミノ酸残基について、本来のアミノ酸残基(野生型)をそれ以外のアミノ酸残基(19種類)に置換した変異酵素を作製することである。後述する方法に従って、アミノ酸を置換した変異酵素を133種類作製して、酵素と基質(EPA)の反応を行った。反応後の生成物をHPLCにより解析した。
HPLCチャート図の分析から、87番目のフェニルアラニンを置換した変異酵素で反応後の生成物の組成に大きな変化があると分かった。特に、変異酵素(F87K、F87I、F87H)を用いた場合に、副生成物の減少と目的生成物である17,18-EpETE(17,18-エポキシエイコサテトラエン酸)の蓄積が検出された(図1)。さらに、酵素-基質反応をタイムコースで解析した結果、変異酵素(F87K)は目的生成物である17,18-EpETEの蓄積が起こることが示された(図2下図)。
また、変異酵素(F87K、F87I、F87H)における17,18-EpETEの生成率と生成ピーク時間は以下の表のとおりであった。
3.ドッキング構造探索と酵素-基質反応に関わる重要アミノ酸残基の設計(2)
さらなる反応性の向上を目的とし、2回目のMD計算と設計を行った。変異酵素(F87K)は17,18-EpETEがある条件では、EPAに対するエポキシ化反応が低下し、生成物阻害が生じている可能性が考えられたため、アミノ酸変異F87Kを導入したP450-BM3の構造を用いて、生成物の17,18-EpETEが酵素から離れやすくなるよう設計を行った。具体的には、基質EPAには影響を与えないが、生成物17,18-EpETEがポケット内にいることを不安定化させるアミノ酸残基の選定を行った。2回の独立した計算の結果、3種類のアミノ酸残基を選定した
さらなる反応性の向上を目的とし、2回目のMD計算と設計を行った。変異酵素(F87K)は17,18-EpETEがある条件では、EPAに対するエポキシ化反応が低下し、生成物阻害が生じている可能性が考えられたため、アミノ酸変異F87Kを導入したP450-BM3の構造を用いて、生成物の17,18-EpETEが酵素から離れやすくなるよう設計を行った。具体的には、基質EPAには影響を与えないが、生成物17,18-EpETEがポケット内にいることを不安定化させるアミノ酸残基の選定を行った。2回の独立した計算の結果、3種類のアミノ酸残基を選定した
4.変異酵素の活性測定(2)
F87Kに加えてさらにアミノ酸を置換した二重変異酵素を作製した。上記のMD計算により選定したアミノ酸残基に関して、二重変異酵素を57種類作製した。後述する<EPAエポキシ化反応と高速液体クロマトグラフィー(HPLC)分析>の<条件1>の条件で反応を行い、生成物をHPLCで分析した。その結果、変異酵素(F87K)と比較して、複数の二重変異酵素で17,18-EpETEの生成量が向上した。特に、変異酵素(F87K-A330V)は、最も17,18-EpETEの生成量が多くなった。さらに詳細な解析を行い、野生型、変異酵素(F87K)、変異酵素(F87K-A330V)について、酵素-基質反応を比較した。17,18-EpETEの位置特異性を計算したところ、野生型ではEPAから17,18-EpETEへの生成率の最大値が44%であるのに対して、変異酵素(F87K)は89%、変異酵素(F87K-A330V)は93%となった(図2及び図3)。この結果から、野生型では不可能であった17,18-EpETEの工業的製造の可能性が示唆された。
F87Kに加えてさらにアミノ酸を置換した二重変異酵素を作製した。上記のMD計算により選定したアミノ酸残基に関して、二重変異酵素を57種類作製した。後述する<EPAエポキシ化反応と高速液体クロマトグラフィー(HPLC)分析>の<条件1>の条件で反応を行い、生成物をHPLCで分析した。その結果、変異酵素(F87K)と比較して、複数の二重変異酵素で17,18-EpETEの生成量が向上した。特に、変異酵素(F87K-A330V)は、最も17,18-EpETEの生成量が多くなった。さらに詳細な解析を行い、野生型、変異酵素(F87K)、変異酵素(F87K-A330V)について、酵素-基質反応を比較した。17,18-EpETEの位置特異性を計算したところ、野生型ではEPAから17,18-EpETEへの生成率の最大値が44%であるのに対して、変異酵素(F87K)は89%、変異酵素(F87K-A330V)は93%となった(図2及び図3)。この結果から、野生型では不可能であった17,18-EpETEの工業的製造の可能性が示唆された。
5.ドッキング構造探索と酵素-基質反応に関わる重要アミノ酸残基の設計(3)
変異酵素(F87K-A330V)の更なる位置特異性の向上、生成物阻害の解消を目的に3回目のMD計算を行った。アミノ酸変異F87K-A330Vを導入したP450-BM3の立体構造をモデリングで生成した。その構造のMD計算を行い、生成物である17,18-EpETEを維持しながら生成物阻害をする構造を不安定化する変異酵素を予測して、設計した。生成物(エポキシ化した基質)が安定的に結合しない構造を予測して、置換アミノ酸の設計を行った。その結果、3か所のアミノ酸残基が選定された。過去2回の設計では、活性中心であるヘムに近い領域に変異を入れていたのに対して、活性中心から遠い位置にあるアミノ酸残基に変異を導入した。
変異酵素(F87K-A330V)の更なる位置特異性の向上、生成物阻害の解消を目的に3回目のMD計算を行った。アミノ酸変異F87K-A330Vを導入したP450-BM3の立体構造をモデリングで生成した。その構造のMD計算を行い、生成物である17,18-EpETEを維持しながら生成物阻害をする構造を不安定化する変異酵素を予測して、設計した。生成物(エポキシ化した基質)が安定的に結合しない構造を予測して、置換アミノ酸の設計を行った。その結果、3か所のアミノ酸残基が選定された。過去2回の設計では、活性中心であるヘムに近い領域に変異を入れていたのに対して、活性中心から遠い位置にあるアミノ酸残基に変異を導入した。
6.変異酵素の活性測定(3)
これまでと同様に、選定したアミノ酸残基に対して部位特異的変異導入を行った。PCRによりF87K及びA330Vに加えてさらにアミノ酸を置換した三重変異酵素を57種類作製した。これらの三重変異酵素について基質EPAと反応を行い、生成物をHPLCで分析した。その結果、変異酵素(P25L-F87K-A330V)と変異酵素(F87K-A330V-T438M)はエポキシ化位置特異性が向上して、EPAから17,18-EpETEへの生成率が高かった(図4)。特に、変異酵素(F87K-A330V-T438M)はエポキシ化位置特異性が向上して、EPAから17,18-EpETEへの生成率が96%であった(図4下図)。また、変異酵素(P25L-F87K-A330V)では、EPAの消費量が改善されて生成物阻害の解消が見られた。
これまでと同様に、選定したアミノ酸残基に対して部位特異的変異導入を行った。PCRによりF87K及びA330Vに加えてさらにアミノ酸を置換した三重変異酵素を57種類作製した。これらの三重変異酵素について基質EPAと反応を行い、生成物をHPLCで分析した。その結果、変異酵素(P25L-F87K-A330V)と変異酵素(F87K-A330V-T438M)はエポキシ化位置特異性が向上して、EPAから17,18-EpETEへの生成率が高かった(図4)。特に、変異酵素(F87K-A330V-T438M)はエポキシ化位置特異性が向上して、EPAから17,18-EpETEへの生成率が96%であった(図4下図)。また、変異酵素(P25L-F87K-A330V)では、EPAの消費量が改善されて生成物阻害の解消が見られた。
7.NADPH再生酵素/過酸化水素除去酵素との併用
他の酵素との併用効果を評価するために、後述する<条件2:NADPH再生酵素/過酸化水素除去酵素との併用>の条件で、反応を行った。また、グルコースデヒドロゲナーゼの触媒作用に必須なグルコースを含む反応溶液を調製した。変異酵素(F87K)を発現した冷凍大腸菌菌体と変異酵素(F87K-A330V-T438M)を発現した冷凍大腸菌菌体を用いて、17,18-EpETEの生成率と生成量を評価した。その結果、変異酵素(F87K)においては、併用なしと比較して、17,18-EpETEの生成率は85%とやや低下した(図5上図)。しかし、生成量のピークは3時間となり、併用なしと比較して短い時間で酵素反応が進むと分かった(図5上図)。さらに、三重変異酵素(F87K-A330V-T438M)においては、併用なしと比較して、17,18-EpETEの生成率は97%となりさらに向上した(図5下図)。生成量のピークは5時間となり、併用なしと比較して短い時間で酵素反応が進むと分かった(図5下図)。以上の結果から、併用効果により同じ反応時間で比較すると基質の変換速度を向上させる効果が見られた。
他の酵素との併用効果を評価するために、後述する<条件2:NADPH再生酵素/過酸化水素除去酵素との併用>の条件で、反応を行った。また、グルコースデヒドロゲナーゼの触媒作用に必須なグルコースを含む反応溶液を調製した。変異酵素(F87K)を発現した冷凍大腸菌菌体と変異酵素(F87K-A330V-T438M)を発現した冷凍大腸菌菌体を用いて、17,18-EpETEの生成率と生成量を評価した。その結果、変異酵素(F87K)においては、併用なしと比較して、17,18-EpETEの生成率は85%とやや低下した(図5上図)。しかし、生成量のピークは3時間となり、併用なしと比較して短い時間で酵素反応が進むと分かった(図5上図)。さらに、三重変異酵素(F87K-A330V-T438M)においては、併用なしと比較して、17,18-EpETEの生成率は97%となりさらに向上した(図5下図)。生成量のピークは5時間となり、併用なしと比較して短い時間で酵素反応が進むと分かった(図5下図)。以上の結果から、併用効果により同じ反応時間で比較すると基質の変換速度を向上させる効果が見られた。
(変異酵素の作製方法)
(1)P450-BM3発現用ベクターの構築
発現プラスミドpET28a(メルク社)のマルチクローニングサイト、BamHI-EcoRIに、Bacillus megateriumのゲノムからPCRで増幅したCYP102A1遺伝子を挿入した。具体的には、PCR反応においては、制限酵素サイトを付加したプライマー(表2)と、Prime STAR MAX DNA Polymerase(タカラバイオ社)を用いた。そして、DNA Ligation Kit <Mighty Mix>(タカラバイオ社)により、PCRで増幅した遺伝子配列とベクターをライゲーションした。サンガーシーケンスにより遺伝子のDNA配列を確認した。遺伝子のDNA配列は配列番号1のとおりであった。
(1)P450-BM3発現用ベクターの構築
発現プラスミドpET28a(メルク社)のマルチクローニングサイト、BamHI-EcoRIに、Bacillus megateriumのゲノムからPCRで増幅したCYP102A1遺伝子を挿入した。具体的には、PCR反応においては、制限酵素サイトを付加したプライマー(表2)と、Prime STAR MAX DNA Polymerase(タカラバイオ社)を用いた。そして、DNA Ligation Kit <Mighty Mix>(タカラバイオ社)により、PCRで増幅した遺伝子配列とベクターをライゲーションした。サンガーシーケンスにより遺伝子のDNA配列を確認した。遺伝子のDNA配列は配列番号1のとおりであった。
(2)組み換えP450-BM3の調製
構築したP450-BM3発現用ベクター1μLをE.coli BL21(DE3)(ニッポンジーン社)2μLに混合し、形質転換を行った。37℃20分間の回復培養後に、形質転換溶液をカナマイシン(最終濃度50μg/mL)を添加したLB寒天培地プレート(インビトロジェン(株)製)に塗布して、37℃で18時間培養した。生育したコロニーを、カナマイシン(最終濃度50μg/mL)を添加したLB培地(インビトロジェン(株)製)に植菌した。37℃で18時間培養して前培養液とした。前培養液を、カナマイシン(最終濃度50μg/mL)を添加したTB培地(インビトロジェン(株)製)1mLに植菌した。37℃で48時間培養し(培養開始24時間の時点でIPTG(最終濃度0.1mM)を添加)、遠心分離で菌体を回収した。
構築したP450-BM3発現用ベクター1μLをE.coli BL21(DE3)(ニッポンジーン社)2μLに混合し、形質転換を行った。37℃20分間の回復培養後に、形質転換溶液をカナマイシン(最終濃度50μg/mL)を添加したLB寒天培地プレート(インビトロジェン(株)製)に塗布して、37℃で18時間培養した。生育したコロニーを、カナマイシン(最終濃度50μg/mL)を添加したLB培地(インビトロジェン(株)製)に植菌した。37℃で18時間培養して前培養液とした。前培養液を、カナマイシン(最終濃度50μg/mL)を添加したTB培地(インビトロジェン(株)製)1mLに植菌した。37℃で48時間培養し(培養開始24時間の時点でIPTG(最終濃度0.1mM)を添加)、遠心分離で菌体を回収した。
(3)変異型P450-BM3(F87K、F87H、F87I)発現用ベクターの構築
変異導入用PCRプライマー(表3)を設計し、以下の条件でPCRを行い、CYP102A1遺伝子に変異を導入した。
変異導入用PCRプライマー(表3)を設計し、以下の条件でPCRを行い、CYP102A1遺伝子に変異を導入した。
<鋳型プラスミド>
P450-BM3発現用ベクター:約50ng/μL
<PCR反応組成>(総量25μL)
Prime STAR MAX DNA Polymerase(タカラバイオ社):12.5μL
鋳型プラスミド:0.5μL
各プライマー(プライマーの組み合わせは表4のとおりとした):0.5μL×2
超純水(Milli Q水):11μL
P450-BM3発現用ベクター:約50ng/μL
<PCR反応組成>(総量25μL)
Prime STAR MAX DNA Polymerase(タカラバイオ社):12.5μL
鋳型プラスミド:0.5μL
各プライマー(プライマーの組み合わせは表4のとおりとした):0.5μL×2
超純水(Milli Q水):11μL
<PCR条件>
98℃で30秒
98℃で10秒、68℃で8分の反応を10サイクル繰り返し
72℃で10分
4℃で放置
98℃で30秒
98℃で10秒、68℃で8分の反応を10サイクル繰り返し
72℃で10分
4℃で放置
<制限酵素処理・精製>
PCR反応液 20μLに制限酵素DpnI(タカラバイオ(株)製)0.5μLを添加して37℃、3時間以上反応した。DNA精製キットNucleoSpin Gel and PCR Clean-up(タカラバイオ(株)製)でDNAを精製した。
PCR反応液 20μLに制限酵素DpnI(タカラバイオ(株)製)0.5μLを添加して37℃、3時間以上反応した。DNA精製キットNucleoSpin Gel and PCR Clean-up(タカラバイオ(株)製)でDNAを精製した。
<in-fusion反応>
以下の組成の反応液を調整し(総量10μL)、サーマルサイクラーで50℃15分加温した。
精製済みPCR溶液:2~5μL(50~200ng相当量)
Milli Q水:6~3μL
5×In-Fusion HD Enzyme Premix:2μL
5×In-Fusion HD Enzyme PremixはIn-Fusion(R) HD Cloning Kit(タカラバイオ社)に含まれる試薬である。
以下の組成の反応液を調整し(総量10μL)、サーマルサイクラーで50℃15分加温した。
精製済みPCR溶液:2~5μL(50~200ng相当量)
Milli Q水:6~3μL
5×In-Fusion HD Enzyme Premix:2μL
5×In-Fusion HD Enzyme PremixはIn-Fusion(R) HD Cloning Kit(タカラバイオ社)に含まれる試薬である。
<大腸菌形質転換>
in-fusion反応液2μLをE.coli BL21(DE3)(ニッポンジーン社)18μLに混合し、形質転換を行った。37℃20分間の回復培養後に、形質転換溶液をカナマイシン(最終濃度50μg/mL)を添加したLB寒天培地プレート(インビトロジェン(株)製)に塗布して、37℃で18時間培養した。
in-fusion反応液2μLをE.coli BL21(DE3)(ニッポンジーン社)18μLに混合し、形質転換を行った。37℃20分間の回復培養後に、形質転換溶液をカナマイシン(最終濃度50μg/mL)を添加したLB寒天培地プレート(インビトロジェン(株)製)に塗布して、37℃で18時間培養した。
<プラスミド精製・シーケンス確認>
生育したコロニーを、カナマイシン(最終濃度50μg/mL)を添加したLB培地(インビトロジェン(株)製)に植菌した。37℃で18時間培養して菌体培養液とした。遠心分離で菌体を回収して、プラスミドミニプレップキットNucleoSpin Plasmid EasyPure(タカラバイオ社)でそれぞれのプラスミドを精製した。サンガーシーケンスにより遺伝子のDNA配列を確認した。変異酵素(F87K、F87I、F87H)遺伝子のDNA配列はそれぞれ、配列番号3、5、7のとおりであった。
生育したコロニーを、カナマイシン(最終濃度50μg/mL)を添加したLB培地(インビトロジェン(株)製)に植菌した。37℃で18時間培養して菌体培養液とした。遠心分離で菌体を回収して、プラスミドミニプレップキットNucleoSpin Plasmid EasyPure(タカラバイオ社)でそれぞれのプラスミドを精製した。サンガーシーケンスにより遺伝子のDNA配列を確認した。変異酵素(F87K、F87I、F87H)遺伝子のDNA配列はそれぞれ、配列番号3、5、7のとおりであった。
(4)変異型組み換えP450-BM3(F87K、F87H、F87I)の調製
構築した変異型P450-BM3発現用ベクター1μLをE.coli BL21(DE3)(ニッポンジーン社)2μLに混合し、形質転換を行った。37℃20分間の回復培養後に、形質転換溶液をカナマイシン(最終濃度50μg/mL)を添加したLB寒天培地プレート(インビトロジェン(株)製)に塗布して、37℃で18時間培養した。生育したコロニーを、カナマイシン(最終濃度50μg/mL)を添加したLB培地(インビトロジェン(株)製)に植菌した。37℃で18時間培養して前培養液とした。前培養液を、カナマイシン(最終濃度50μg/mL)を添加したTB培地(インビトロジェン(株)製)1mLに植菌した。37℃で48時間培養し(培養開始24時間の時点でIPTG(最終濃度0.1mM)を添加)、遠心分離で菌体を回収した。得られた変異型組み換えP450-BM3(F87K、F87H、F87I)のアミノ酸配列は、それぞれ配列番号4、6、8のとおりであった。
構築した変異型P450-BM3発現用ベクター1μLをE.coli BL21(DE3)(ニッポンジーン社)2μLに混合し、形質転換を行った。37℃20分間の回復培養後に、形質転換溶液をカナマイシン(最終濃度50μg/mL)を添加したLB寒天培地プレート(インビトロジェン(株)製)に塗布して、37℃で18時間培養した。生育したコロニーを、カナマイシン(最終濃度50μg/mL)を添加したLB培地(インビトロジェン(株)製)に植菌した。37℃で18時間培養して前培養液とした。前培養液を、カナマイシン(最終濃度50μg/mL)を添加したTB培地(インビトロジェン(株)製)1mLに植菌した。37℃で48時間培養し(培養開始24時間の時点でIPTG(最終濃度0.1mM)を添加)、遠心分離で菌体を回収した。得られた変異型組み換えP450-BM3(F87K、F87H、F87I)のアミノ酸配列は、それぞれ配列番号4、6、8のとおりであった。
(5)二重変異型組み換えP450-BM3(F87K-A330V)の調製
変異導入用PCRプライマー(表5)を設計し、鋳型プラスミドとして変異型P450-BM3(F87K)発現用ベクターを用いた以外は上述した方法と同様の条件でPCR等を行い、CYP102A1遺伝子に二重の変異を導入された変異型P450-BM3(F87K-A330V)発現用ベクターを得た。遺伝子のDNA配列は配列番号9のとおりであった。変異型P450-BM3発現用ベクターに代えて、変異型P450-BM3(F87K-A330V)発現用ベクターを用いた以外は、上述した方法と同様の方法で変異型組み換えP450-BM3(F87K-A330V)を発現した菌体を回収した。得られた変異型組み換えP450-BM3のアミノ酸配列は、配列番号10のとおりであった。
変異導入用PCRプライマー(表5)を設計し、鋳型プラスミドとして変異型P450-BM3(F87K)発現用ベクターを用いた以外は上述した方法と同様の条件でPCR等を行い、CYP102A1遺伝子に二重の変異を導入された変異型P450-BM3(F87K-A330V)発現用ベクターを得た。遺伝子のDNA配列は配列番号9のとおりであった。変異型P450-BM3発現用ベクターに代えて、変異型P450-BM3(F87K-A330V)発現用ベクターを用いた以外は、上述した方法と同様の方法で変異型組み換えP450-BM3(F87K-A330V)を発現した菌体を回収した。得られた変異型組み換えP450-BM3のアミノ酸配列は、配列番号10のとおりであった。
(6)三重変異型組み換えP450-BM3(P25L-F87K-A330V)の調製
変異導入用PCRプライマー(表6)を設計し、鋳型プラスミドとして二重変異型P450-BM3(F87K-A330V)発現用ベクターを用いた以外は上述した方法と同様の条件でPCR等を行い、CYP102A1遺伝子に三重の変異を導入された変異型P450-BM3(P25L-F87K-A330V)発現用ベクターを得た。遺伝子のDNA配列は配列番号11のとおりであった。変異型P450-BM3発現用ベクターに代えて、変異型P450-BM3(P25L-F87K-A330V)発現用ベクターを用いた以外は、上述した方法と同様の方法で変異型組み換えP450-BM3(P25L-F87K-A330V)を発現した菌体を回収した。得られた変異型組み換えP450-BM3のアミノ酸配列は、配列番号12のとおりであった。
変異導入用PCRプライマー(表6)を設計し、鋳型プラスミドとして二重変異型P450-BM3(F87K-A330V)発現用ベクターを用いた以外は上述した方法と同様の条件でPCR等を行い、CYP102A1遺伝子に三重の変異を導入された変異型P450-BM3(P25L-F87K-A330V)発現用ベクターを得た。遺伝子のDNA配列は配列番号11のとおりであった。変異型P450-BM3発現用ベクターに代えて、変異型P450-BM3(P25L-F87K-A330V)発現用ベクターを用いた以外は、上述した方法と同様の方法で変異型組み換えP450-BM3(P25L-F87K-A330V)を発現した菌体を回収した。得られた変異型組み換えP450-BM3のアミノ酸配列は、配列番号12のとおりであった。
(7)変異型組み換えP450-BM3(F87K-A330V-T438M)の調製
変異導入用PCRプライマー(表7)を設計し、鋳型プラスミドとして変異型P450-BM3(F87K-A330V)発現用ベクターを用いた以外は上述した方法と同様の条件でPCR等を行い、CYP102A1遺伝子に3重の変異を導入された変異型P450-BM3(F87K-A330V-T438M)発現用ベクターを得た。遺伝子のDNA配列は配列番号13のとおりであった。変異型P450-BM3発現用ベクターに代えて、変異型P450-BM3(F87K-A330V-T438M)発現用ベクターを用いた以外は、上述した方法と同様の方法で変異型組み換えP450-BM3(F87K-A330V-T438M)を発現した菌体を回収した。得られた変異型組み換えP450-BM3のアミノ酸配列は、配列番号14のとおりであった。
変異導入用PCRプライマー(表7)を設計し、鋳型プラスミドとして変異型P450-BM3(F87K-A330V)発現用ベクターを用いた以外は上述した方法と同様の条件でPCR等を行い、CYP102A1遺伝子に3重の変異を導入された変異型P450-BM3(F87K-A330V-T438M)発現用ベクターを得た。遺伝子のDNA配列は配列番号13のとおりであった。変異型P450-BM3発現用ベクターに代えて、変異型P450-BM3(F87K-A330V-T438M)発現用ベクターを用いた以外は、上述した方法と同様の方法で変異型組み換えP450-BM3(F87K-A330V-T438M)を発現した菌体を回収した。得られた変異型組み換えP450-BM3のアミノ酸配列は、配列番号14のとおりであった。
(活性測定方法)
<EPAエポキシ化反応と高速液体クロマトグラフィー(HPLC)分析>
<条件1>
3.3mM基質EPA(Combi-Blocks社製・エタノールに溶解)及び3mMのNADPH(オリエンタル酵母工業(株)製)を含む0.1MTris-塩酸緩衝液(pH8.0)を、P450-BM3を発現した冷凍大腸菌菌体もしくは変異型P450-BM3を発現し冷凍大腸菌菌体に添加してよく混合した。マキシマイザーで混合しながら35℃で反応を行った。0、1、3、5、24時間経過時に反応液のサンプリングを行った。サンプリングした反応液を、100%エタノール(富士フイルム和光純薬工(株)製)で40倍に希釈した。希釈サンプルをバイアルに移して、HPLC分析を行った。
<EPAエポキシ化反応と高速液体クロマトグラフィー(HPLC)分析>
<条件1>
3.3mM基質EPA(Combi-Blocks社製・エタノールに溶解)及び3mMのNADPH(オリエンタル酵母工業(株)製)を含む0.1MTris-塩酸緩衝液(pH8.0)を、P450-BM3を発現した冷凍大腸菌菌体もしくは変異型P450-BM3を発現し冷凍大腸菌菌体に添加してよく混合した。マキシマイザーで混合しながら35℃で反応を行った。0、1、3、5、24時間経過時に反応液のサンプリングを行った。サンプリングした反応液を、100%エタノール(富士フイルム和光純薬工(株)製)で40倍に希釈した。希釈サンプルをバイアルに移して、HPLC分析を行った。
<条件2:NADPH再生酵素/過酸化水素除去酵素との併用>
3.3mM基質EPA(Combi-Blocks社製・エタノールに溶解)、0.16Mグルコース(富士フイルム和光純薬工(株)製)、7.7U/mLグルコースデヒドロゲナーゼ(NAD型)(天野エンザイム(株)製)、50U/mlカタラーゼ(富士フイルム和光純薬工(株)製)及び3mMのNADPH(オリエンタル酵母工業(株)製)を含む0.1MTris-塩酸緩衝液(pH8.0)を、P450-BM3を発現した冷凍大腸菌菌体もしくは変異型P450-BM3を発現した冷凍大腸菌菌体に添加してよく混合した。マキシマイザーで混合しながら35℃で反応を行った。0、1、3、5、24時間経過時に反応液のサンプリングを行った。サンプリングした反応液を、100%エタノール(富士フイルム和光純薬工(株)製)で400倍に希釈した。希釈サンプルはバイアルに移して、HPLC分析を行った。
3.3mM基質EPA(Combi-Blocks社製・エタノールに溶解)、0.16Mグルコース(富士フイルム和光純薬工(株)製)、7.7U/mLグルコースデヒドロゲナーゼ(NAD型)(天野エンザイム(株)製)、50U/mlカタラーゼ(富士フイルム和光純薬工(株)製)及び3mMのNADPH(オリエンタル酵母工業(株)製)を含む0.1MTris-塩酸緩衝液(pH8.0)を、P450-BM3を発現した冷凍大腸菌菌体もしくは変異型P450-BM3を発現した冷凍大腸菌菌体に添加してよく混合した。マキシマイザーで混合しながら35℃で反応を行った。0、1、3、5、24時間経過時に反応液のサンプリングを行った。サンプリングした反応液を、100%エタノール(富士フイルム和光純薬工(株)製)で400倍に希釈した。希釈サンプルはバイアルに移して、HPLC分析を行った。
基質EPAから生成するエポキシ化代謝物についてHPLC分析を行った。HPLC分析の条件は以下のとおりとした。
分析カラム:COSMOSIL 5C18 AR-II 250x4.6mm、5.0μm(ナカライテクス(株)製)
検出器:UV 205nm
流速:1.2ml/分
カラム温度:35℃
バッファーとプログラムは以下のとおりとした。
分析カラム:COSMOSIL 5C18 AR-II 250x4.6mm、5.0μm(ナカライテクス(株)製)
検出器:UV 205nm
流速:1.2ml/分
カラム温度:35℃
バッファーとプログラムは以下のとおりとした。
ピーク面積を定量して、EPA、17,18-EpETE(17,18-エポキシエイコサテトラエン酸)、14,15-EpETE(14,15-エポキシエイコサテトラエン酸)、19-HEPE(19-ヒドロキシルエイコサペンタエン酸)の量とした。17,18-EpETE、14,15-EpETEはそれぞれの標品を利用してピークの位置を決定した(Cayman Chemical社製)。19-HEPEはHPLC分析後のピークを分取して、NMRにより構造を特定した。なお、図2~4において「others」には、ω3とω6位の両方がエポキシ化された二次代謝物や、未同定の代謝物(基質の複数個所がエポキシ化されたもの、ω3とω6位以外の2重結合がエポキシ化されたもの、19-OH以外の二重結合ではない場所に水酸基が付加されたもの等)が含まれている。
なお、17,18-EpETEの生成率は、以下の式で算出した。
17,18-EpETEの生成量(%)=17,18-EpETEのエリア面積/(17,18-EpETEのエリア面積+14,15-EpETEのエリア面積+19-OHのエリア面積+その他のエリア面積)×100
なお、17,18-EpETEの生成率は、以下の式で算出した。
17,18-EpETEの生成量(%)=17,18-EpETEのエリア面積/(17,18-EpETEのエリア面積+14,15-EpETEのエリア面積+19-OHのエリア面積+その他のエリア面積)×100
<DHAエポキシ化反応と高速液体クロマトグラフィー(HPLC)分析>
<条件3>
3.3mM基質DHA(MedChemExpress社製・エタノールに溶解)及び3mMのNADPH(オリエンタル酵母工業(株)製)を含む0.1MTris-塩酸緩衝液(pH8.0)を、P450-BM3を発現した冷凍大腸菌菌体もしくは変異型P450-BM3を発現し冷凍大腸菌菌体に添加してよく混合した。マキシマイザーで混合しながら35℃で反応を行った。0、1、3、5、24時間経過時に反応液のサンプリングを行った。サンプリングした反応液を、100%エタノール(富士フイルム和光純薬工(株)製)で40倍に希釈した。希釈サンプルをバイアルに移して、HPLC分析を行った。なお、HPLC分析の条件は上記と同じ条件で行った。
<条件3>
3.3mM基質DHA(MedChemExpress社製・エタノールに溶解)及び3mMのNADPH(オリエンタル酵母工業(株)製)を含む0.1MTris-塩酸緩衝液(pH8.0)を、P450-BM3を発現した冷凍大腸菌菌体もしくは変異型P450-BM3を発現し冷凍大腸菌菌体に添加してよく混合した。マキシマイザーで混合しながら35℃で反応を行った。0、1、3、5、24時間経過時に反応液のサンプリングを行った。サンプリングした反応液を、100%エタノール(富士フイルム和光純薬工(株)製)で40倍に希釈した。希釈サンプルをバイアルに移して、HPLC分析を行った。なお、HPLC分析の条件は上記と同じ条件で行った。
ピーク面積を定量して、DHA、19,20-EpDPE(19,20-エポキシドコサペンタエン酸)、16,17-EpDPE(16,17-エポキシドコサペンタエン酸)の量とした。19,20-EpDPE、16,17-EpDPEはそれぞれの標品を利用してピークの位置を決定した(Cayman Chemical社製)。「その他」は、未同定ピークのエリア面積を合計した。
なお、19,20-EpDPEの生成率は、以下の式で算出した。
19,20-EpDPEの生成量(%)=19,20-EpDPEのエリア面積/(19,20-EpDPEのエリア面積+16,17-EpDPEのエリア面積+その他エリア面積の合計)×100
なお、19,20-EpDPEの生成率は、以下の式で算出した。
19,20-EpDPEの生成量(%)=19,20-EpDPEのエリア面積/(19,20-EpDPEのエリア面積+16,17-EpDPEのエリア面積+その他エリア面積の合計)×100
上記で作製した5種類の変異酵素と基質DHAと反応を行うことで、19,20-EpDPE(19,20-エポキシドコサペンタエン酸)、16,17-EpDPE(16,17-エポキシドコサペンタエン酸)が生成した。DHAのω3位をエポキシ化して生成する19,20-EpDPE(19,20-エポキシドコサペンタエン酸)は、生理的に17,18-EpETEと同様の効果があり、有用な生成物である。
野生型酵素(P450-BM3)と比較して、変異酵素(F87K)、変異酵素(F87K-A330V)、変異酵素(P25L-F87K-A330V)、変異酵素(F87K-A330V-T438M)による変換では、19,20-EpDPEの生成量の増加と蓄積が確認された(図6~8)。
また、19,20-EpDPEの位置特異性を計算したところ、野生型ではDHAから19,20-EpDPEへの生成率の最大値が40%であるのに対して、変異酵素(F87K)は84%、変異酵素(F87K-A330V)は90%、変異酵素(P25L-F87K-A330V)は88%、変異酵素(F87K-A330V-T438M)は96%となった(図6~8)。この結果から、野生型では不可能であった19,20-EpDPEの工業的製造の可能性が示唆された。
野生型酵素(P450-BM3)と比較して、変異酵素(F87K)、変異酵素(F87K-A330V)、変異酵素(P25L-F87K-A330V)、変異酵素(F87K-A330V-T438M)による変換では、19,20-EpDPEの生成量の増加と蓄積が確認された(図6~8)。
また、19,20-EpDPEの位置特異性を計算したところ、野生型ではDHAから19,20-EpDPEへの生成率の最大値が40%であるのに対して、変異酵素(F87K)は84%、変異酵素(F87K-A330V)は90%、変異酵素(P25L-F87K-A330V)は88%、変異酵素(F87K-A330V-T438M)は96%となった(図6~8)。この結果から、野生型では不可能であった19,20-EpDPEの工業的製造の可能性が示唆された。
<DPAエポキシ化反応と高速液体クロマトグラフィー(HPLC)分析>
<条件4>
3.3mM基質DPA(Cayman Chemical社製・エタノールに溶解)及び3mMのNADPH(オリエンタル酵母工業(株)製)を含む0.1MTris-塩酸緩衝液(pH8.0)を、P450-BM3を発現した冷凍大腸菌菌体もしくは変異型P450-BM3を発現し冷凍大腸菌菌体に添加してよく混合した。マキシマイザーで混合しながら35℃で反応を行った。0、1、3、5、24時間経過時に反応液のサンプリングを行った。サンプリングした反応液を、100%エタノール(富士フイルム和光純薬工(株)製)で40倍に希釈した。希釈サンプルをバイアルに移して、HPLC分析を行った。なお、HPLC分析の条件は上記と同じ条件で行った。
<条件4>
3.3mM基質DPA(Cayman Chemical社製・エタノールに溶解)及び3mMのNADPH(オリエンタル酵母工業(株)製)を含む0.1MTris-塩酸緩衝液(pH8.0)を、P450-BM3を発現した冷凍大腸菌菌体もしくは変異型P450-BM3を発現し冷凍大腸菌菌体に添加してよく混合した。マキシマイザーで混合しながら35℃で反応を行った。0、1、3、5、24時間経過時に反応液のサンプリングを行った。サンプリングした反応液を、100%エタノール(富士フイルム和光純薬工(株)製)で40倍に希釈した。希釈サンプルをバイアルに移して、HPLC分析を行った。なお、HPLC分析の条件は上記と同じ条件で行った。
ピーク面積を定量して、DPA、19,20-EpDTE(19,20-エポキシドコサテトラエン酸)の量とした。19,20-EpDTEは標品を利用してピークの位置を決定した(Cayman Chemical社製)。「その他」は、未同定ピークのエリア面積を合計した。
なお、19,20-EpDTEの生成率は、以下の式で算出した。
19,20-EpDTEの生成量(%)=19,20-EpDTEのエリア面積/(19,20-EpDTEのエリア面積+その他エリア面積の合計)×100
なお、19,20-EpDTEの生成率は、以下の式で算出した。
19,20-EpDTEの生成量(%)=19,20-EpDTEのエリア面積/(19,20-EpDTEのエリア面積+その他エリア面積の合計)×100
上記で作製した5種類の変異酵素と基質DPAと反応を行うことで、19,20-EpDTE(19,20-エポキシドコサテトラエン酸)が生成した。DPAのω3位をエポキシ化して生成する19,20-EpDTE(19,20-エポキシドコサテトラエン酸)は、生理的に17,18-EpETEと同様の効果があり、有用な生成物である。
野生型酵素(P450-BM3)と比較して、変異酵素(F87K)、変異酵素(F87K-A330V)、変異酵素(P25L-F87K-A330V)、変異酵素(F87K-A330V-T438M)において、19,20-EpDTEの生成量の増加と蓄積が確認された(図9~11)。
また、19,20-EpDTEの位置特異性を計算したところ、野生型ではDPAから19,20-EpDTEへの生成率の最大値が52%であるのに対して、変異酵素(F87K)は78%、変異酵素(F87K-A330V)は89%、変異酵素(P25L-F87K-A330V)は89%、変異酵素(F87K-A330V-T438M)は94%となった(図9~11)。この結果から、野生型では不可能であった19,20-EpDTEの工業的製造の可能性が示唆された。
野生型酵素(P450-BM3)と比較して、変異酵素(F87K)、変異酵素(F87K-A330V)、変異酵素(P25L-F87K-A330V)、変異酵素(F87K-A330V-T438M)において、19,20-EpDTEの生成量の増加と蓄積が確認された(図9~11)。
また、19,20-EpDTEの位置特異性を計算したところ、野生型ではDPAから19,20-EpDTEへの生成率の最大値が52%であるのに対して、変異酵素(F87K)は78%、変異酵素(F87K-A330V)は89%、変異酵素(P25L-F87K-A330V)は89%、変異酵素(F87K-A330V-T438M)は94%となった(図9~11)。この結果から、野生型では不可能であった19,20-EpDTEの工業的製造の可能性が示唆された。
本発明によれば、高度不飽和脂肪酸のω3位のみを特異的にエポキシ化する改変型オキシゲナーゼを得ることができる。また、本発明によれば、ω3位のみが特異的にエポキシ化されたω3位エポキシ化脂肪酸の工業的生産が可能となる。
配列番号1
aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa aat
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
tta ccg tta tta aac aca gat aaa ccg gtt caa gct ttg atg aaa att
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
gcg gat gaa tta gga gaa atc ttt aaa ttc gag gcg cct ggt cgt gta
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
acg cgc tac tta tca agt cag cgt cta att aaa gaa gca tgc gat gaa
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
tca cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt gat
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
ttt gca gga gac ggg tta ttt aca agc tgg acg cat gaa aaa aat tgg
Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn Trp
aaa aaa gcg cat aat atc tta ctt cca agc ttc agt cag cag gca atg
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
aaa ggc tat cat gcg atg atg gtc gat atc gcc gtg cag ctt gtt caa
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
aag tgg gag cgt cta aat gca gat gag cat att gaa gta ccg gaa gac
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac tat
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
cgc ttt aac agc ttt tac cga gat cag cct cat cca ttt att aca agt
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca aat
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
cca gac gac cca gct tat gat gaa aac aag cgc cag ttt caa gaa gat
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
atc aag gtg atg aac gac cta gta gat aaa att att gca gat cgc aaa
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
gca agc ggt gaa caa agc gat gat tta tta acg cat atg cta aac gga
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc tat
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt ctt
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
tta tca ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta caa
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
aaa gca gca gaa gaa gca gca cga gtt cta gta gat cct gtt cca agc
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac gaa
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
gcg ctg cgc tta tgg cca act gct cct gcg ttt tcc cta tat gca aaa
Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys
gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac gaa
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
cta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg gga
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt gcg
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg tgt
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
atc ggt cag cag ttc gct ctt cat gaa gca acg ctg gta ctt ggt atg
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
atg cta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg gat
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa gca
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
aaa tcg aaa aaa att ccg ctt ggc ggt att cct tca cct agc act gaa
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
cag tct gct aaa aaa gta cgc aaa aag gca gaa aac gct cat aat acg
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
ccg ctg ctt gtg cta tac ggt tca aat atg gga aca gct gaa gga acg
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
gcg cgt gat tta gca gat att gca atg agc aaa gga ttt gca ccg cag
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
gtc gca acg ctt gat tca cac gcc gga aat ctt ccg cgc gaa gga gct
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac gca
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
aag caa ttt gtc gac tgg tta gac caa gcg tct gct gat gaa gta aaa
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
ggc gtt cgc tac tcc gta ttt gga tgc ggc gat aaa aac tgg gct act
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
acg tat caa aaa gtg cct gct ttt atc gat gaa acg ctt gcc gct aaa
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
ggg gca gaa aac atc gct gac cgc ggt gaa gca gat gca agc gac gac
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac gta
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
gca gcc tac ttt aac ctc gac att gaa aac agt gaa gat aat aaa tct
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
act ctt tca ctt caa ttt gtc gac agc gcc gcg gat atg ccg ctt gcg
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
aaa atg cac ggt gcg ttt tca acg aac gtc gta gca agc aaa gaa ctt
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
caa cag cca ggc agt gca cga agc acg cga cat ctt gaa att gaa ctt
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
cca aaa gaa gct tct tat caa gaa gga gat cat tta ggt gtt att cct
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc cta
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
gat gca tca cag caa atc cgt ctg gaa gca gaa gaa gaa aaa tta gct
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
cat ttg cca ctc gct aaa aca gta tcc gta gaa gag ctt ctg caa tac
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg gct
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
gct aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gcc ttg ctt
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
gaa aag caa gcc tac aaa gaa caa gtg ctg gca aaa cgt tta aca atg
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc agc gaa
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
ttt atc gcc ctt ctg cca agc ata cgc ccg cgc tat tac tcg att tct
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
tca tca cct cgt gtc gat gaa aaa caa gca agc atc acg gtc agc gtt
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
gtc tca gga gaa gcg tgg agc gga tat gga gaa tat aaa gga att gcg
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
tcg aac tat ctt gcc gag ctg caa gaa gga gat acg att acg tgc ttt
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
att tcc aca ccg cag tca gaa ttt acg ctg cca aaa gac cct gaa acg
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
ccg ctt atc atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga ggc
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
ttt gtg cag gcg cgc aaa cag cta aaa gaa caa gga cag tca ctt gga
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
gaa gca cat tta tac ttc ggc tgc cgt tca cct cat gaa gac tat ctg
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
tat caa gaa gag ctt gaa aac gcc caa agc gaa ggc atc att acg ctt
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
cat acc gct ttt tct cgc atg cca aat cag ccg aaa aca tac gtt cag
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat caa
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
gga gcg cac ttc tat att tgc gga gac gga agc caa atg gca cct gcc
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
gtt gaa gca acg ctt atg aaa agc tat gct gac gtt cac caa gtg
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
agt gaa gca gac gct cgc tta tgg ctg cag cag cta gaa gaa aaa
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
ggc cga tac gca aaa gac gtg tgg gct ggg
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa aat
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
tta ccg tta tta aac aca gat aaa ccg gtt caa gct ttg atg aaa att
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
gcg gat gaa tta gga gaa atc ttt aaa ttc gag gcg cct ggt cgt gta
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
acg cgc tac tta tca agt cag cgt cta att aaa gaa gca tgc gat gaa
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
tca cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt gat
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
ttt gca gga gac ggg tta ttt aca agc tgg acg cat gaa aaa aat tgg
Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn Trp
aaa aaa gcg cat aat atc tta ctt cca agc ttc agt cag cag gca atg
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
aaa ggc tat cat gcg atg atg gtc gat atc gcc gtg cag ctt gtt caa
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
aag tgg gag cgt cta aat gca gat gag cat att gaa gta ccg gaa gac
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac tat
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
cgc ttt aac agc ttt tac cga gat cag cct cat cca ttt att aca agt
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca aat
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
cca gac gac cca gct tat gat gaa aac aag cgc cag ttt caa gaa gat
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
atc aag gtg atg aac gac cta gta gat aaa att att gca gat cgc aaa
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
gca agc ggt gaa caa agc gat gat tta tta acg cat atg cta aac gga
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc tat
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt ctt
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
tta tca ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta caa
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
aaa gca gca gaa gaa gca gca cga gtt cta gta gat cct gtt cca agc
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac gaa
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
gcg ctg cgc tta tgg cca act gct cct gcg ttt tcc cta tat gca aaa
Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys
gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac gaa
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
cta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg gga
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt gcg
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg tgt
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
atc ggt cag cag ttc gct ctt cat gaa gca acg ctg gta ctt ggt atg
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
atg cta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg gat
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa gca
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
aaa tcg aaa aaa att ccg ctt ggc ggt att cct tca cct agc act gaa
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
cag tct gct aaa aaa gta cgc aaa aag gca gaa aac gct cat aat acg
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
ccg ctg ctt gtg cta tac ggt tca aat atg gga aca gct gaa gga acg
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
gcg cgt gat tta gca gat att gca atg agc aaa gga ttt gca ccg cag
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
gtc gca acg ctt gat tca cac gcc gga aat ctt ccg cgc gaa gga gct
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac gca
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
aag caa ttt gtc gac tgg tta gac caa gcg tct gct gat gaa gta aaa
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
ggc gtt cgc tac tcc gta ttt gga tgc ggc gat aaa aac tgg gct act
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
acg tat caa aaa gtg cct gct ttt atc gat gaa acg ctt gcc gct aaa
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
ggg gca gaa aac atc gct gac cgc ggt gaa gca gat gca agc gac gac
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac gta
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
gca gcc tac ttt aac ctc gac att gaa aac agt gaa gat aat aaa tct
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
act ctt tca ctt caa ttt gtc gac agc gcc gcg gat atg ccg ctt gcg
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
aaa atg cac ggt gcg ttt tca acg aac gtc gta gca agc aaa gaa ctt
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
caa cag cca ggc agt gca cga agc acg cga cat ctt gaa att gaa ctt
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
cca aaa gaa gct tct tat caa gaa gga gat cat tta ggt gtt att cct
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc cta
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
gat gca tca cag caa atc cgt ctg gaa gca gaa gaa gaa aaa tta gct
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
cat ttg cca ctc gct aaa aca gta tcc gta gaa gag ctt ctg caa tac
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg gct
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
gct aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gcc ttg ctt
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
gaa aag caa gcc tac aaa gaa caa gtg ctg gca aaa cgt tta aca atg
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc agc gaa
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
ttt atc gcc ctt ctg cca agc ata cgc ccg cgc tat tac tcg att tct
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
tca tca cct cgt gtc gat gaa aaa caa gca agc atc acg gtc agc gtt
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
gtc tca gga gaa gcg tgg agc gga tat gga gaa tat aaa gga att gcg
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
tcg aac tat ctt gcc gag ctg caa gaa gga gat acg att acg tgc ttt
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
att tcc aca ccg cag tca gaa ttt acg ctg cca aaa gac cct gaa acg
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
ccg ctt atc atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga ggc
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
ttt gtg cag gcg cgc aaa cag cta aaa gaa caa gga cag tca ctt gga
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
gaa gca cat tta tac ttc ggc tgc cgt tca cct cat gaa gac tat ctg
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
tat caa gaa gag ctt gaa aac gcc caa agc gaa ggc atc att acg ctt
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
cat acc gct ttt tct cgc atg cca aat cag ccg aaa aca tac gtt cag
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat caa
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
gga gcg cac ttc tat att tgc gga gac gga agc caa atg gca cct gcc
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
gtt gaa gca acg ctt atg aaa agc tat gct gac gtt cac caa gtg
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
agt gaa gca gac gct cgc tta tgg ctg cag cag cta gaa gaa aaa
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
ggc cga tac gca aaa gac gtg tgg gct ggg
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
配列番号2
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn Trp
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn Trp
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
配列番号3
aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa aat
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
tta ccg tta tta aac aca gat aaa ccg gtt caa gct ttg atg aaa att
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
gcg gat gaa tta gga gaa atc ttt aaa ttc gag gcg cct ggt cgt gta
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
acg cgc tac tta tca agt cag cgt cta att aaa gaa gca tgc gat gaa
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
tca cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt gat
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
ttt gca gga gac ggg tta aag aca agc tgg acg cat gaa aaa aat tgg
Phe Ala Gly Asp Gly Leu Lys Thr Ser Trp Thr His Glu Lys Asn Trp
aaa aaa gcg cat aat atc tta ctt cca agc ttc agt cag cag gca atg
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
aaa ggc tat cat gcg atg atg gtc gat atc gcc gtg cag ctt gtt caa
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
aag tgg gag cgt cta aat gca gat gag cat att gaa gta ccg gaa gac
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac tat
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
cgc ttt aac agc ttt tac cga gat cag cct cat cca ttt att aca agt
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca aat
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
cca gac gac cca gct tat gat gaa aac aag cgc cag ttt caa gaa gat
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
atc aag gtg atg aac gac cta gta gat aaa att att gca gat cgc aaa
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
gca agc ggt gaa caa agc gat gat tta tta acg cat atg cta aac gga
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc tat
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt ctt
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
tta tca ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta caa
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
aaa gca gca gaa gaa gca gca cga gtt cta gta gat cct gtt cca agc
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac gaa
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
gcg ctg cgc tta tgg cca act gct cct gcg ttt tcc cta tat gca aaa
Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys
gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac gaa
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
cta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg gga
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt gcg
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg tgt
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
atc ggt cag cag ttc gct ctt cat gaa gca acg ctg gta ctt ggt atg
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
atg cta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg gat
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa gca
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
aaa tcg aaa aaa att ccg ctt ggc ggt att cct tca cct agc act gaa
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
cag tct gct aaa aaa gta cgc aaa aag gca gaa aac gct cat aat acg
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
ccg ctg ctt gtg cta tac ggt tca aat atg gga aca gct gaa gga acg
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
gcg cgt gat tta gca gat att gca atg agc aaa gga ttt gca ccg cag
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
gtc gca acg ctt gat tca cac gcc gga aat ctt ccg cgc gaa gga gct
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac gca
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
aag caa ttt gtc gac tgg tta gac caa gcg tct gct gat gaa gta aaa
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
ggc gtt cgc tac tcc gta ttt gga tgc ggc gat aaa aac tgg gct act
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
acg tat caa aaa gtg cct gct ttt atc gat gaa acg ctt gcc gct aaa
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
ggg gca gaa aac atc gct gac cgc ggt gaa gca gat gca agc gac gac
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac gta
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
gca gcc tac ttt aac ctc gac att gaa aac agt gaa gat aat aaa tct
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
act ctt tca ctt caa ttt gtc gac agc gcc gcg gat atg ccg ctt gcg
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
aaa atg cac ggt gcg ttt tca acg aac gtc gta gca agc aaa gaa ctt
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
caa cag cca ggc agt gca cga agc acg cga cat ctt gaa att gaa ctt
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
cca aaa gaa gct tct tat caa gaa gga gat cat tta ggt gtt att cct
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc cta
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
gat gca tca cag caa atc cgt ctg gaa gca gaa gaa gaa aaa tta gct
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
cat ttg cca ctc gct aaa aca gta tcc gta gaa gag ctt ctg caa tac
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg gct
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
gct aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gcc ttg ctt
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
gaa aag caa gcc tac aaa gaa caa gtg ctg gca aaa cgt tta aca atg
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc agc gaa
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
ttt atc gcc ctt ctg cca agc ata cgc ccg cgc tat tac tcg att tct
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
tca tca cct cgt gtc gat gaa aaa caa gca agc atc acg gtc agc gtt
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
gtc tca gga gaa gcg tgg agc gga tat gga gaa tat aaa gga att gcg
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
tcg aac tat ctt gcc gag ctg caa gaa gga gat acg att acg tgc ttt
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
att tcc aca ccg cag tca gaa ttt acg ctg cca aaa gac cct gaa acg
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
ccg ctt atc atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga ggc
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
ttt gtg cag gcg cgc aaa cag cta aaa gaa caa gga cag tca ctt gga
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
gaa gca cat tta tac ttc ggc tgc cgt tca cct cat gaa gac tat ctg
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
tat caa gaa gag ctt gaa aac gcc caa agc gaa ggc atc att acg ctt
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
cat acc gct ttt tct cgc atg cca aat cag ccg aaa aca tac gtt cag
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat caa
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
gga gcg cac ttc tat att tgc gga gac gga agc caa atg gca cct gcc
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
gtt gaa gca acg ctt atg aaa agc tat gct gac gtt cac caa gtg
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
agt gaa gca gac gct cgc tta tgg ctg cag cag cta gaa gaa aaa
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
ggc cga tac gca aaa gac gtg tgg gct ggg
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa aat
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
tta ccg tta tta aac aca gat aaa ccg gtt caa gct ttg atg aaa att
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
gcg gat gaa tta gga gaa atc ttt aaa ttc gag gcg cct ggt cgt gta
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
acg cgc tac tta tca agt cag cgt cta att aaa gaa gca tgc gat gaa
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
tca cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt gat
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
ttt gca gga gac ggg tta aag aca agc tgg acg cat gaa aaa aat tgg
Phe Ala Gly Asp Gly Leu Lys Thr Ser Trp Thr His Glu Lys Asn Trp
aaa aaa gcg cat aat atc tta ctt cca agc ttc agt cag cag gca atg
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
aaa ggc tat cat gcg atg atg gtc gat atc gcc gtg cag ctt gtt caa
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
aag tgg gag cgt cta aat gca gat gag cat att gaa gta ccg gaa gac
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac tat
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
cgc ttt aac agc ttt tac cga gat cag cct cat cca ttt att aca agt
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca aat
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
cca gac gac cca gct tat gat gaa aac aag cgc cag ttt caa gaa gat
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
atc aag gtg atg aac gac cta gta gat aaa att att gca gat cgc aaa
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
gca agc ggt gaa caa agc gat gat tta tta acg cat atg cta aac gga
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc tat
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt ctt
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
tta tca ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta caa
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
aaa gca gca gaa gaa gca gca cga gtt cta gta gat cct gtt cca agc
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac gaa
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
gcg ctg cgc tta tgg cca act gct cct gcg ttt tcc cta tat gca aaa
Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys
gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac gaa
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
cta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg gga
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt gcg
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg tgt
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
atc ggt cag cag ttc gct ctt cat gaa gca acg ctg gta ctt ggt atg
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
atg cta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg gat
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa gca
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
aaa tcg aaa aaa att ccg ctt ggc ggt att cct tca cct agc act gaa
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
cag tct gct aaa aaa gta cgc aaa aag gca gaa aac gct cat aat acg
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
ccg ctg ctt gtg cta tac ggt tca aat atg gga aca gct gaa gga acg
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
gcg cgt gat tta gca gat att gca atg agc aaa gga ttt gca ccg cag
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
gtc gca acg ctt gat tca cac gcc gga aat ctt ccg cgc gaa gga gct
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac gca
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
aag caa ttt gtc gac tgg tta gac caa gcg tct gct gat gaa gta aaa
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
ggc gtt cgc tac tcc gta ttt gga tgc ggc gat aaa aac tgg gct act
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
acg tat caa aaa gtg cct gct ttt atc gat gaa acg ctt gcc gct aaa
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
ggg gca gaa aac atc gct gac cgc ggt gaa gca gat gca agc gac gac
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac gta
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
gca gcc tac ttt aac ctc gac att gaa aac agt gaa gat aat aaa tct
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
act ctt tca ctt caa ttt gtc gac agc gcc gcg gat atg ccg ctt gcg
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
aaa atg cac ggt gcg ttt tca acg aac gtc gta gca agc aaa gaa ctt
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
caa cag cca ggc agt gca cga agc acg cga cat ctt gaa att gaa ctt
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
cca aaa gaa gct tct tat caa gaa gga gat cat tta ggt gtt att cct
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc cta
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
gat gca tca cag caa atc cgt ctg gaa gca gaa gaa gaa aaa tta gct
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
cat ttg cca ctc gct aaa aca gta tcc gta gaa gag ctt ctg caa tac
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg gct
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
gct aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gcc ttg ctt
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
gaa aag caa gcc tac aaa gaa caa gtg ctg gca aaa cgt tta aca atg
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc agc gaa
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
ttt atc gcc ctt ctg cca agc ata cgc ccg cgc tat tac tcg att tct
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
tca tca cct cgt gtc gat gaa aaa caa gca agc atc acg gtc agc gtt
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
gtc tca gga gaa gcg tgg agc gga tat gga gaa tat aaa gga att gcg
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
tcg aac tat ctt gcc gag ctg caa gaa gga gat acg att acg tgc ttt
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
att tcc aca ccg cag tca gaa ttt acg ctg cca aaa gac cct gaa acg
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
ccg ctt atc atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga ggc
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
ttt gtg cag gcg cgc aaa cag cta aaa gaa caa gga cag tca ctt gga
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
gaa gca cat tta tac ttc ggc tgc cgt tca cct cat gaa gac tat ctg
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
tat caa gaa gag ctt gaa aac gcc caa agc gaa ggc atc att acg ctt
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
cat acc gct ttt tct cgc atg cca aat cag ccg aaa aca tac gtt cag
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat caa
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
gga gcg cac ttc tat att tgc gga gac gga agc caa atg gca cct gcc
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
gtt gaa gca acg ctt atg aaa agc tat gct gac gtt cac caa gtg
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
agt gaa gca gac gct cgc tta tgg ctg cag cag cta gaa gaa aaa
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
ggc cga tac gca aaa gac gtg tgg gct ggg
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
配列番号4
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
Phe Ala Gly Asp Gly Leu Lys Thr Ser Trp Thr His Glu Lys Asn Trp
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
Phe Ala Gly Asp Gly Leu Lys Thr Ser Trp Thr His Glu Lys Asn Trp
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
配列番号5
aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa aat
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
tta ccg tta tta aac aca gat aaa ccg gtt caa gct ttg atg aaa att
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
gcg gat gaa tta gga gaa atc ttt aaa ttc gag gcg cct ggt cgt gta
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
acg cgc tac tta tca agt cag cgt cta att aaa gaa gca tgc gat gaa
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
tca cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt gat
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
ttt gca gga gac ggg tta cat aca agc tgg acg cat gaa aaa aat tgg
Phe Ala Gly Asp Gly Leu His Thr Ser Trp Thr His Glu Lys Asn Trp
aaa aaa gcg cat aat atc tta ctt cca agc ttc agt cag cag gca atg
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
aaa ggc tat cat gcg atg atg gtc gat atc gcc gtg cag ctt gtt caa
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
aag tgg gag cgt cta aat gca gat gag cat att gaa gta ccg gaa gac
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac tat
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
cgc ttt aac agc ttt tac cga gat cag cct cat cca ttt att aca agt
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca aat
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
cca gac gac cca gct tat gat gaa aac aag cgc cag ttt caa gaa gat
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
atc aag gtg atg aac gac cta gta gat aaa att att gca gat cgc aaa
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
gca agc ggt gaa caa agc gat gat tta tta acg cat atg cta aac gga
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc tat
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt ctt
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
tta tca ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta caa
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
aaa gca gca gaa gaa gca gca cga gtt cta gta gat cct gtt cca agc
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac gaa
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
gcg ctg cgc tta tgg cca act gct cct gcg ttt tcc cta tat gca aaa
Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys
gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac gaa
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
cta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg gga
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt gcg
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg tgt
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
atc ggt cag cag ttc gct ctt cat gaa gca acg ctg gta ctt ggt atg
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
atg cta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg gat
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa gca
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
aaa tcg aaa aaa att ccg ctt ggc ggt att cct tca cct agc act gaa
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
cag tct gct aaa aaa gta cgc aaa aag gca gaa aac gct cat aat acg
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
ccg ctg ctt gtg cta tac ggt tca aat atg gga aca gct gaa gga acg
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
gcg cgt gat tta gca gat att gca atg agc aaa gga ttt gca ccg cag
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
gtc gca acg ctt gat tca cac gcc gga aat ctt ccg cgc gaa gga gct
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac gca
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
aag caa ttt gtc gac tgg tta gac caa gcg tct gct gat gaa gta aaa
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
ggc gtt cgc tac tcc gta ttt gga tgc ggc gat aaa aac tgg gct act
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
acg tat caa aaa gtg cct gct ttt atc gat gaa acg ctt gcc gct aaa
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
ggg gca gaa aac atc gct gac cgc ggt gaa gca gat gca agc gac gac
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac gta
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
gca gcc tac ttt aac ctc gac att gaa aac agt gaa gat aat aaa tct
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
act ctt tca ctt caa ttt gtc gac agc gcc gcg gat atg ccg ctt gcg
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
aaa atg cac ggt gcg ttt tca acg aac gtc gta gca agc aaa gaa ctt
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
caa cag cca ggc agt gca cga agc acg cga cat ctt gaa att gaa ctt
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
cca aaa gaa gct tct tat caa gaa gga gat cat tta ggt gtt att cct
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc cta
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
gat gca tca cag caa atc cgt ctg gaa gca gaa gaa gaa aaa tta gct
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
cat ttg cca ctc gct aaa aca gta tcc gta gaa gag ctt ctg caa tac
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg gct
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
gct aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gcc ttg ctt
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
gaa aag caa gcc tac aaa gaa caa gtg ctg gca aaa cgt tta aca atg
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc agc gaa
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
ttt atc gcc ctt ctg cca agc ata cgc ccg cgc tat tac tcg att tct
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
tca tca cct cgt gtc gat gaa aaa caa gca agc atc acg gtc agc gtt
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
gtc tca gga gaa gcg tgg agc gga tat gga gaa tat aaa gga att gcg
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
tcg aac tat ctt gcc gag ctg caa gaa gga gat acg att acg tgc ttt
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
att tcc aca ccg cag tca gaa ttt acg ctg cca aaa gac cct gaa acg
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
ccg ctt atc atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga ggc
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
ttt gtg cag gcg cgc aaa cag cta aaa gaa caa gga cag tca ctt gga
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
gaa gca cat tta tac ttc ggc tgc cgt tca cct cat gaa gac tat ctg
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
tat caa gaa gag ctt gaa aac gcc caa agc gaa ggc atc att acg ctt
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
cat acc gct ttt tct cgc atg cca aat cag ccg aaa aca tac gtt cag
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat caa
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
gga gcg cac ttc tat att tgc gga gac gga agc caa atg gca cct gcc
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
gtt gaa gca acg ctt atg aaa agc tat gct gac gtt cac caa gtg
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
agt gaa gca gac gct cgc tta tgg ctg cag cag cta gaa gaa aaa
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
ggc cga tac gca aaa gac gtg tgg gct ggg
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa aat
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
tta ccg tta tta aac aca gat aaa ccg gtt caa gct ttg atg aaa att
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
gcg gat gaa tta gga gaa atc ttt aaa ttc gag gcg cct ggt cgt gta
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
acg cgc tac tta tca agt cag cgt cta att aaa gaa gca tgc gat gaa
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
tca cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt gat
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
ttt gca gga gac ggg tta cat aca agc tgg acg cat gaa aaa aat tgg
Phe Ala Gly Asp Gly Leu His Thr Ser Trp Thr His Glu Lys Asn Trp
aaa aaa gcg cat aat atc tta ctt cca agc ttc agt cag cag gca atg
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
aaa ggc tat cat gcg atg atg gtc gat atc gcc gtg cag ctt gtt caa
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
aag tgg gag cgt cta aat gca gat gag cat att gaa gta ccg gaa gac
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac tat
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
cgc ttt aac agc ttt tac cga gat cag cct cat cca ttt att aca agt
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca aat
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
cca gac gac cca gct tat gat gaa aac aag cgc cag ttt caa gaa gat
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
atc aag gtg atg aac gac cta gta gat aaa att att gca gat cgc aaa
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
gca agc ggt gaa caa agc gat gat tta tta acg cat atg cta aac gga
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc tat
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt ctt
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
tta tca ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta caa
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
aaa gca gca gaa gaa gca gca cga gtt cta gta gat cct gtt cca agc
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac gaa
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
gcg ctg cgc tta tgg cca act gct cct gcg ttt tcc cta tat gca aaa
Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys
gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac gaa
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
cta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg gga
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt gcg
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg tgt
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
atc ggt cag cag ttc gct ctt cat gaa gca acg ctg gta ctt ggt atg
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
atg cta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg gat
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa gca
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
aaa tcg aaa aaa att ccg ctt ggc ggt att cct tca cct agc act gaa
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
cag tct gct aaa aaa gta cgc aaa aag gca gaa aac gct cat aat acg
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
ccg ctg ctt gtg cta tac ggt tca aat atg gga aca gct gaa gga acg
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
gcg cgt gat tta gca gat att gca atg agc aaa gga ttt gca ccg cag
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
gtc gca acg ctt gat tca cac gcc gga aat ctt ccg cgc gaa gga gct
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac gca
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
aag caa ttt gtc gac tgg tta gac caa gcg tct gct gat gaa gta aaa
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
ggc gtt cgc tac tcc gta ttt gga tgc ggc gat aaa aac tgg gct act
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
acg tat caa aaa gtg cct gct ttt atc gat gaa acg ctt gcc gct aaa
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
ggg gca gaa aac atc gct gac cgc ggt gaa gca gat gca agc gac gac
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac gta
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
gca gcc tac ttt aac ctc gac att gaa aac agt gaa gat aat aaa tct
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
act ctt tca ctt caa ttt gtc gac agc gcc gcg gat atg ccg ctt gcg
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
aaa atg cac ggt gcg ttt tca acg aac gtc gta gca agc aaa gaa ctt
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
caa cag cca ggc agt gca cga agc acg cga cat ctt gaa att gaa ctt
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
cca aaa gaa gct tct tat caa gaa gga gat cat tta ggt gtt att cct
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc cta
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
gat gca tca cag caa atc cgt ctg gaa gca gaa gaa gaa aaa tta gct
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
cat ttg cca ctc gct aaa aca gta tcc gta gaa gag ctt ctg caa tac
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg gct
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
gct aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gcc ttg ctt
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
gaa aag caa gcc tac aaa gaa caa gtg ctg gca aaa cgt tta aca atg
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc agc gaa
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
ttt atc gcc ctt ctg cca agc ata cgc ccg cgc tat tac tcg att tct
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
tca tca cct cgt gtc gat gaa aaa caa gca agc atc acg gtc agc gtt
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
gtc tca gga gaa gcg tgg agc gga tat gga gaa tat aaa gga att gcg
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
tcg aac tat ctt gcc gag ctg caa gaa gga gat acg att acg tgc ttt
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
att tcc aca ccg cag tca gaa ttt acg ctg cca aaa gac cct gaa acg
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
ccg ctt atc atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga ggc
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
ttt gtg cag gcg cgc aaa cag cta aaa gaa caa gga cag tca ctt gga
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
gaa gca cat tta tac ttc ggc tgc cgt tca cct cat gaa gac tat ctg
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
tat caa gaa gag ctt gaa aac gcc caa agc gaa ggc atc att acg ctt
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
cat acc gct ttt tct cgc atg cca aat cag ccg aaa aca tac gtt cag
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat caa
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
gga gcg cac ttc tat att tgc gga gac gga agc caa atg gca cct gcc
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
gtt gaa gca acg ctt atg aaa agc tat gct gac gtt cac caa gtg
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
agt gaa gca gac gct cgc tta tgg ctg cag cag cta gaa gaa aaa
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
ggc cga tac gca aaa gac gtg tgg gct ggg
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
配列番号6
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
Phe Ala Gly Asp Gly Leu His Thr Ser Trp Thr His Glu Lys Asn Trp
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
Phe Ala Gly Asp Gly Leu His Thr Ser Trp Thr His Glu Lys Asn Trp
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
配列番号7
aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa aat
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
tta ccg tta tta aac aca gat aaa ccg gtt caa gct ttg atg aaa att
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
gcg gat gaa tta gga gaa atc ttt aaa ttc gag gcg cct ggt cgt gta
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
acg cgc tac tta tca agt cag cgt cta att aaa gaa gca tgc gat gaa
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
tca cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt gat
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
ttt gca gga gac ggg tta atc aca agc tgg acg cat gaa aaa aat tgg
Phe Ala Gly Asp Gly Leu Ile Thr Ser Trp Thr His Glu Lys Asn Trp
aaa aaa gcg cat aat atc tta ctt cca agc ttc agt cag cag gca atg
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
aaa ggc tat cat gcg atg atg gtc gat atc gcc gtg cag ctt gtt caa
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
aag tgg gag cgt cta aat gca gat gag cat att gaa gta ccg gaa gac
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac tat
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
cgc ttt aac agc ttt tac cga gat cag cct cat cca ttt att aca agt
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca aat
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
cca gac gac cca gct tat gat gaa aac aag cgc cag ttt caa gaa gat
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
atc aag gtg atg aac gac cta gta gat aaa att att gca gat cgc aaa
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
gca agc ggt gaa caa agc gat gat tta tta acg cat atg cta aac gga
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc tat
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt ctt
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
tta tca ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta caa
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
aaa gca gca gaa gaa gca gca cga gtt cta gta gat cct gtt cca agc
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac gaa
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
gcg ctg cgc tta tgg cca act gct cct gcg ttt tcc cta tat gca aaa
Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys
gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac gaa
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
cta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg gga
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt gcg
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg tgt
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
atc ggt cag cag ttc gct ctt cat gaa gca acg ctg gta ctt ggt atg
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
atg cta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg gat
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa gca
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
aaa tcg aaa aaa att ccg ctt ggc ggt att cct tca cct agc act gaa
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
cag tct gct aaa aaa gta cgc aaa aag gca gaa aac gct cat aat acg
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
ccg ctg ctt gtg cta tac ggt tca aat atg gga aca gct gaa gga acg
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
gcg cgt gat tta gca gat att gca atg agc aaa gga ttt gca ccg cag
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
gtc gca acg ctt gat tca cac gcc gga aat ctt ccg cgc gaa gga gct
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac gca
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
aag caa ttt gtc gac tgg tta gac caa gcg tct gct gat gaa gta aaa
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
ggc gtt cgc tac tcc gta ttt gga tgc ggc gat aaa aac tgg gct act
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
acg tat caa aaa gtg cct gct ttt atc gat gaa acg ctt gcc gct aaa
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
ggg gca gaa aac atc gct gac cgc ggt gaa gca gat gca agc gac gac
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac gta
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
gca gcc tac ttt aac ctc gac att gaa aac agt gaa gat aat aaa tct
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
act ctt tca ctt caa ttt gtc gac agc gcc gcg gat atg ccg ctt gcg
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
aaa atg cac ggt gcg ttt tca acg aac gtc gta gca agc aaa gaa ctt
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
caa cag cca ggc agt gca cga agc acg cga cat ctt gaa att gaa ctt
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
cca aaa gaa gct tct tat caa gaa gga gat cat tta ggt gtt att cct
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc cta
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
gat gca tca cag caa atc cgt ctg gaa gca gaa gaa gaa aaa tta gct
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
cat ttg cca ctc gct aaa aca gta tcc gta gaa gag ctt ctg caa tac
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg gct
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
gct aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gcc ttg ctt
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
gaa aag caa gcc tac aaa gaa caa gtg ctg gca aaa cgt tta aca atg
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc agc gaa
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
ttt atc gcc ctt ctg cca agc ata cgc ccg cgc tat tac tcg att tct
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
tca tca cct cgt gtc gat gaa aaa caa gca agc atc acg gtc agc gtt
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
gtc tca gga gaa gcg tgg agc gga tat gga gaa tat aaa gga att gcg
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
tcg aac tat ctt gcc gag ctg caa gaa gga gat acg att acg tgc ttt
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
att tcc aca ccg cag tca gaa ttt acg ctg cca aaa gac cct gaa acg
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
ccg ctt atc atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga ggc
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
ttt gtg cag gcg cgc aaa cag cta aaa gaa caa gga cag tca ctt gga
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
gaa gca cat tta tac ttc ggc tgc cgt tca cct cat gaa gac tat ctg
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
tat caa gaa gag ctt gaa aac gcc caa agc gaa ggc atc att acg ctt
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
cat acc gct ttt tct cgc atg cca aat cag ccg aaa aca tac gtt cag
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat caa
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
gga gcg cac ttc tat att tgc gga gac gga agc caa atg gca cct gcc
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
gtt gaa gca acg ctt atg aaa agc tat gct gac gtt cac caa gtg
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
agt gaa gca gac gct cgc tta tgg ctg cag cag cta gaa gaa aaa
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
ggc cga tac gca aaa gac gtg tgg gct ggg
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa aat
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
tta ccg tta tta aac aca gat aaa ccg gtt caa gct ttg atg aaa att
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
gcg gat gaa tta gga gaa atc ttt aaa ttc gag gcg cct ggt cgt gta
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
acg cgc tac tta tca agt cag cgt cta att aaa gaa gca tgc gat gaa
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
tca cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt gat
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
ttt gca gga gac ggg tta atc aca agc tgg acg cat gaa aaa aat tgg
Phe Ala Gly Asp Gly Leu Ile Thr Ser Trp Thr His Glu Lys Asn Trp
aaa aaa gcg cat aat atc tta ctt cca agc ttc agt cag cag gca atg
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
aaa ggc tat cat gcg atg atg gtc gat atc gcc gtg cag ctt gtt caa
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
aag tgg gag cgt cta aat gca gat gag cat att gaa gta ccg gaa gac
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac tat
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
cgc ttt aac agc ttt tac cga gat cag cct cat cca ttt att aca agt
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca aat
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
cca gac gac cca gct tat gat gaa aac aag cgc cag ttt caa gaa gat
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
atc aag gtg atg aac gac cta gta gat aaa att att gca gat cgc aaa
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
gca agc ggt gaa caa agc gat gat tta tta acg cat atg cta aac gga
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc tat
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt ctt
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
tta tca ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta caa
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
aaa gca gca gaa gaa gca gca cga gtt cta gta gat cct gtt cca agc
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac gaa
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
gcg ctg cgc tta tgg cca act gct cct gcg ttt tcc cta tat gca aaa
Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys
gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac gaa
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
cta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg gga
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt gcg
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg tgt
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
atc ggt cag cag ttc gct ctt cat gaa gca acg ctg gta ctt ggt atg
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
atg cta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg gat
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa gca
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
aaa tcg aaa aaa att ccg ctt ggc ggt att cct tca cct agc act gaa
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
cag tct gct aaa aaa gta cgc aaa aag gca gaa aac gct cat aat acg
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
ccg ctg ctt gtg cta tac ggt tca aat atg gga aca gct gaa gga acg
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
gcg cgt gat tta gca gat att gca atg agc aaa gga ttt gca ccg cag
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
gtc gca acg ctt gat tca cac gcc gga aat ctt ccg cgc gaa gga gct
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac gca
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
aag caa ttt gtc gac tgg tta gac caa gcg tct gct gat gaa gta aaa
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
ggc gtt cgc tac tcc gta ttt gga tgc ggc gat aaa aac tgg gct act
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
acg tat caa aaa gtg cct gct ttt atc gat gaa acg ctt gcc gct aaa
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
ggg gca gaa aac atc gct gac cgc ggt gaa gca gat gca agc gac gac
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac gta
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
gca gcc tac ttt aac ctc gac att gaa aac agt gaa gat aat aaa tct
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
act ctt tca ctt caa ttt gtc gac agc gcc gcg gat atg ccg ctt gcg
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
aaa atg cac ggt gcg ttt tca acg aac gtc gta gca agc aaa gaa ctt
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
caa cag cca ggc agt gca cga agc acg cga cat ctt gaa att gaa ctt
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
cca aaa gaa gct tct tat caa gaa gga gat cat tta ggt gtt att cct
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc cta
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
gat gca tca cag caa atc cgt ctg gaa gca gaa gaa gaa aaa tta gct
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
cat ttg cca ctc gct aaa aca gta tcc gta gaa gag ctt ctg caa tac
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg gct
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
gct aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gcc ttg ctt
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
gaa aag caa gcc tac aaa gaa caa gtg ctg gca aaa cgt tta aca atg
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc agc gaa
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
ttt atc gcc ctt ctg cca agc ata cgc ccg cgc tat tac tcg att tct
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
tca tca cct cgt gtc gat gaa aaa caa gca agc atc acg gtc agc gtt
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
gtc tca gga gaa gcg tgg agc gga tat gga gaa tat aaa gga att gcg
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
tcg aac tat ctt gcc gag ctg caa gaa gga gat acg att acg tgc ttt
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
att tcc aca ccg cag tca gaa ttt acg ctg cca aaa gac cct gaa acg
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
ccg ctt atc atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga ggc
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
ttt gtg cag gcg cgc aaa cag cta aaa gaa caa gga cag tca ctt gga
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
gaa gca cat tta tac ttc ggc tgc cgt tca cct cat gaa gac tat ctg
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
tat caa gaa gag ctt gaa aac gcc caa agc gaa ggc atc att acg ctt
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
cat acc gct ttt tct cgc atg cca aat cag ccg aaa aca tac gtt cag
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat caa
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
gga gcg cac ttc tat att tgc gga gac gga agc caa atg gca cct gcc
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
gtt gaa gca acg ctt atg aaa agc tat gct gac gtt cac caa gtg
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
agt gaa gca gac gct cgc tta tgg ctg cag cag cta gaa gaa aaa
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
ggc cga tac gca aaa gac gtg tgg gct ggg
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
配列番号8
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
Phe Ala Gly Asp Gly Leu Ile Thr Ser Trp Thr His Glu Lys Asn Trp
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
Phe Ala Gly Asp Gly Leu Ile Thr Ser Trp Thr His Glu Lys Asn Trp
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
配列番号9
aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa aat
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
tta ccg tta tta aac aca gat aaa ccg gtt caa gct ttg atg aaa att
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
gcg gat gaa tta gga gaa atc ttt aaa ttc gag gcg cct ggt cgt gta
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
acg cgc tac tta tca agt cag cgt cta att aaa gaa gca tgc gat gaa
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
tca cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt gat
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
ttt gca gga gac ggg tta aag aca agc tgg acg cat gaa aaa aat tgg
Phe Ala Gly Asp Gly Leu Lys Thr Ser Trp Thr His Glu Lys Asn Trp
aaa aaa gcg cat aat atc tta ctt cca agc ttc agt cag cag gca atg
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
aaa ggc tat cat gcg atg atg gtc gat atc gcc gtg cag ctt gtt caa
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
aag tgg gag cgt cta aat gca gat gag cat att gaa gta ccg gaa gac
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac tat
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
cgc ttt aac agc ttt tac cga gat cag cct cat cca ttt att aca agt
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca aat
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
cca gac gac cca gct tat gat gaa aac aag cgc cag ttt caa gaa gat
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
atc aag gtg atg aac gac cta gta gat aaa att att gca gat cgc aaa
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
gca agc ggt gaa caa agc gat gat tta tta acg cat atg cta aac gga
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc tat
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt ctt
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
tta tca ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta caa
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
aaa gca gca gaa gaa gca gca cga gtt cta gta gat cct gtt cca agc
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac gaa
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
gcg ctg cgc tta tgg cca act gct cct gtt ttt tcc cta tat gca aaa
Ala Leu Arg Leu Trp Pro Thr Ala Pro Val Phe Ser Leu Tyr Ala Lys
gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac gaa
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
cta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg gga
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt gcg
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg tgt
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
atc ggt cag cag ttc gct ctt cat gaa gca acg ctg gta ctt ggt atg
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
atg cta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg gat
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa gca
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
aaa tcg aaa aaa att ccg ctt ggc ggt att cct tca cct agc act gaa
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
cag tct gct aaa aaa gta cgc aaa aag gca gaa aac gct cat aat acg
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
ccg ctg ctt gtg cta tac ggt tca aat atg gga aca gct gaa gga acg
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
gcg cgt gat tta gca gat att gca atg agc aaa gga ttt gca ccg cag
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
gtc gca acg ctt gat tca cac gcc gga aat ctt ccg cgc gaa gga gct
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac gca
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
aag caa ttt gtc gac tgg tta gac caa gcg tct gct gat gaa gta aaa
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
ggc gtt cgc tac tcc gta ttt gga tgc ggc gat aaa aac tgg gct act
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
acg tat caa aaa gtg cct gct ttt atc gat gaa acg ctt gcc gct aaa
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
ggg gca gaa aac atc gct gac cgc ggt gaa gca gat gca agc gac gac
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac gta
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
gca gcc tac ttt aac ctc gac att gaa aac agt gaa gat aat aaa tct
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
act ctt tca ctt caa ttt gtc gac agc gcc gcg gat atg ccg ctt gcg
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
aaa atg cac ggt gcg ttt tca acg aac gtc gta gca agc aaa gaa ctt
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
caa cag cca ggc agt gca cga agc acg cga cat ctt gaa att gaa ctt
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
cca aaa gaa gct tct tat caa gaa gga gat cat tta ggt gtt att cct
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc cta
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
gat gca tca cag caa atc cgt ctg gaa gca gaa gaa gaa aaa tta gct
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
cat ttg cca ctc gct aaa aca gta tcc gta gaa gag ctt ctg caa tac
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg gct
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
gct aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gcc ttg ctt
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
gaa aag caa gcc tac aaa gaa caa gtg ctg gca aaa cgt tta aca atg
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc agc gaa
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
ttt atc gcc ctt ctg cca agc ata cgc ccg cgc tat tac tcg att tct
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
tca tca cct cgt gtc gat gaa aaa caa gca agc atc acg gtc agc gtt
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
gtc tca gga gaa gcg tgg agc gga tat gga gaa tat aaa gga att gcg
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
tcg aac tat ctt gcc gag ctg caa gaa gga gat acg att acg tgc ttt
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
att tcc aca ccg cag tca gaa ttt acg ctg cca aaa gac cct gaa acg
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
ccg ctt atc atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga ggc
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
ttt gtg cag gcg cgc aaa cag cta aaa gaa caa gga cag tca ctt gga
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
gaa gca cat tta tac ttc ggc tgc cgt tca cct cat gaa gac tat ctg
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
tat caa gaa gag ctt gaa aac gcc caa agc gaa ggc atc att acg ctt
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
cat acc gct ttt tct cgc atg cca aat cag ccg aaa aca tac gtt cag
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat caa
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
gga gcg cac ttc tat att tgc gga gac gga agc caa atg gca cct gcc
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
gtt gaa gca acg ctt atg aaa agc tat gct gac gtt cac caa gtg
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
agt gaa gca gac gct cgc tta tgg ctg cag cag cta gaa gaa aaa
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
ggc cga tac gca aaa gac gtg tgg gct ggg
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa aat
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
tta ccg tta tta aac aca gat aaa ccg gtt caa gct ttg atg aaa att
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
gcg gat gaa tta gga gaa atc ttt aaa ttc gag gcg cct ggt cgt gta
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
acg cgc tac tta tca agt cag cgt cta att aaa gaa gca tgc gat gaa
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
tca cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt gat
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
ttt gca gga gac ggg tta aag aca agc tgg acg cat gaa aaa aat tgg
Phe Ala Gly Asp Gly Leu Lys Thr Ser Trp Thr His Glu Lys Asn Trp
aaa aaa gcg cat aat atc tta ctt cca agc ttc agt cag cag gca atg
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
aaa ggc tat cat gcg atg atg gtc gat atc gcc gtg cag ctt gtt caa
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
aag tgg gag cgt cta aat gca gat gag cat att gaa gta ccg gaa gac
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac tat
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
cgc ttt aac agc ttt tac cga gat cag cct cat cca ttt att aca agt
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca aat
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
cca gac gac cca gct tat gat gaa aac aag cgc cag ttt caa gaa gat
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
atc aag gtg atg aac gac cta gta gat aaa att att gca gat cgc aaa
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
gca agc ggt gaa caa agc gat gat tta tta acg cat atg cta aac gga
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc tat
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt ctt
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
tta tca ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta caa
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
aaa gca gca gaa gaa gca gca cga gtt cta gta gat cct gtt cca agc
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac gaa
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
gcg ctg cgc tta tgg cca act gct cct gtt ttt tcc cta tat gca aaa
Ala Leu Arg Leu Trp Pro Thr Ala Pro Val Phe Ser Leu Tyr Ala Lys
gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac gaa
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
cta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg gga
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt gcg
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg tgt
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
atc ggt cag cag ttc gct ctt cat gaa gca acg ctg gta ctt ggt atg
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
atg cta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg gat
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa gca
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
aaa tcg aaa aaa att ccg ctt ggc ggt att cct tca cct agc act gaa
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
cag tct gct aaa aaa gta cgc aaa aag gca gaa aac gct cat aat acg
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
ccg ctg ctt gtg cta tac ggt tca aat atg gga aca gct gaa gga acg
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
gcg cgt gat tta gca gat att gca atg agc aaa gga ttt gca ccg cag
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
gtc gca acg ctt gat tca cac gcc gga aat ctt ccg cgc gaa gga gct
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac gca
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
aag caa ttt gtc gac tgg tta gac caa gcg tct gct gat gaa gta aaa
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
ggc gtt cgc tac tcc gta ttt gga tgc ggc gat aaa aac tgg gct act
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
acg tat caa aaa gtg cct gct ttt atc gat gaa acg ctt gcc gct aaa
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
ggg gca gaa aac atc gct gac cgc ggt gaa gca gat gca agc gac gac
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac gta
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
gca gcc tac ttt aac ctc gac att gaa aac agt gaa gat aat aaa tct
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
act ctt tca ctt caa ttt gtc gac agc gcc gcg gat atg ccg ctt gcg
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
aaa atg cac ggt gcg ttt tca acg aac gtc gta gca agc aaa gaa ctt
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
caa cag cca ggc agt gca cga agc acg cga cat ctt gaa att gaa ctt
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
cca aaa gaa gct tct tat caa gaa gga gat cat tta ggt gtt att cct
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc cta
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
gat gca tca cag caa atc cgt ctg gaa gca gaa gaa gaa aaa tta gct
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
cat ttg cca ctc gct aaa aca gta tcc gta gaa gag ctt ctg caa tac
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg gct
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
gct aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gcc ttg ctt
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
gaa aag caa gcc tac aaa gaa caa gtg ctg gca aaa cgt tta aca atg
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc agc gaa
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
ttt atc gcc ctt ctg cca agc ata cgc ccg cgc tat tac tcg att tct
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
tca tca cct cgt gtc gat gaa aaa caa gca agc atc acg gtc agc gtt
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
gtc tca gga gaa gcg tgg agc gga tat gga gaa tat aaa gga att gcg
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
tcg aac tat ctt gcc gag ctg caa gaa gga gat acg att acg tgc ttt
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
att tcc aca ccg cag tca gaa ttt acg ctg cca aaa gac cct gaa acg
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
ccg ctt atc atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga ggc
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
ttt gtg cag gcg cgc aaa cag cta aaa gaa caa gga cag tca ctt gga
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
gaa gca cat tta tac ttc ggc tgc cgt tca cct cat gaa gac tat ctg
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
tat caa gaa gag ctt gaa aac gcc caa agc gaa ggc atc att acg ctt
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
cat acc gct ttt tct cgc atg cca aat cag ccg aaa aca tac gtt cag
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat caa
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
gga gcg cac ttc tat att tgc gga gac gga agc caa atg gca cct gcc
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
gtt gaa gca acg ctt atg aaa agc tat gct gac gtt cac caa gtg
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
agt gaa gca gac gct cgc tta tgg ctg cag cag cta gaa gaa aaa
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
ggc cga tac gca aaa gac gtg tgg gct ggg
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
配列番号10
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
Phe Ala Gly Asp Gly Leu Lys Thr Ser Trp Thr His Glu Lys Asn Trp
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
Ala Leu Arg Leu Trp Pro Thr Ala Pro Val Phe Ser Leu Tyr Ala Lys
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
Phe Ala Gly Asp Gly Leu Lys Thr Ser Trp Thr His Glu Lys Asn Trp
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
Ala Leu Arg Leu Trp Pro Thr Ala Pro Val Phe Ser Leu Tyr Ala Lys
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
配列番号11
aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa aat
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
tta ccg tta tta aac aca gat aaa ctt gtt caa gct ttg atg aaa att
Leu Pro Leu Leu Asn Thr Asp Lys Leu Val Gln Ala Leu Met Lys Ile
gcg gat gaa tta gga gaa atc ttt aaa ttc gag gcg cct ggt cgt gta
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
acg cgc tac tta tca agt cag cgt cta att aaa gaa gca tgc gat gaa
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
tca cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt gat
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
ttt gca gga gac ggg tta aag aca agc tgg acg cat gaa aaa aat tgg
Phe Ala Gly Asp Gly Leu Lys Thr Ser Trp Thr His Glu Lys Asn Trp
aaa aaa gcg cat aat atc tta ctt cca agc ttc agt cag cag gca atg
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
aaa ggc tat cat gcg atg atg gtc gat atc gcc gtg cag ctt gtt caa
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
aag tgg gag cgt cta aat gca gat gag cat att gaa gta ccg gaa gac
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac tat
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
cgc ttt aac agc ttt tac cga gat cag cct cat cca ttt att aca agt
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca aat
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
cca gac gac cca gct tat gat gaa aac aag cgc cag ttt caa gaa gat
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
atc aag gtg atg aac gac cta gta gat aaa att att gca gat cgc aaa
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
gca agc ggt gaa caa agc gat gat tta tta acg cat atg cta aac gga
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc tat
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt ctt
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
tta tca ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta caa
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
aaa gca gca gaa gaa gca gca cga gtt cta gta gat cct gtt cca agc
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac gaa
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
gcg ctg cgc tta tgg cca act gct cct gtt ttt tcc cta tat gca aaa
Ala Leu Arg Leu Trp Pro Thr Ala Pro Val Phe Ser Leu Tyr Ala Lys
gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac gaa
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
cta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg gga
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt gcg
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg tgt
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
atc ggt cag cag ttc gct ctt cat gaa gca acg ctg gta ctt ggt atg
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
atg cta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg gat
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa gca
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
aaa tcg aaa aaa att ccg ctt ggc ggt att cct tca cct agc act gaa
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
cag tct gct aaa aaa gta cgc aaa aag gca gaa aac gct cat aat acg
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
ccg ctg ctt gtg cta tac ggt tca aat atg gga aca gct gaa gga acg
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
gcg cgt gat tta gca gat att gca atg agc aaa gga ttt gca ccg cag
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
gtc gca acg ctt gat tca cac gcc gga aat ctt ccg cgc gaa gga gct
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac gca
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
aag caa ttt gtc gac tgg tta gac caa gcg tct gct gat gaa gta aaa
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
ggc gtt cgc tac tcc gta ttt gga tgc ggc gat aaa aac tgg gct act
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
acg tat caa aaa gtg cct gct ttt atc gat gaa acg ctt gcc gct aaa
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
ggg gca gaa aac atc gct gac cgc ggt gaa gca gat gca agc gac gac
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac gta
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
gca gcc tac ttt aac ctc gac att gaa aac agt gaa gat aat aaa tct
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
act ctt tca ctt caa ttt gtc gac agc gcc gcg gat atg ccg ctt gcg
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
aaa atg cac ggt gcg ttt tca acg aac gtc gta gca agc aaa gaa ctt
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
caa cag cca ggc agt gca cga agc acg cga cat ctt gaa att gaa ctt
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
cca aaa gaa gct tct tat caa gaa gga gat cat tta ggt gtt att cct
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc cta
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
gat gca tca cag caa atc cgt ctg gaa gca gaa gaa gaa aaa tta gct
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
cat ttg cca ctc gct aaa aca gta tcc gta gaa gag ctt ctg caa tac
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg gct
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
gct aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gcc ttg ctt
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
gaa aag caa gcc tac aaa gaa caa gtg ctg gca aaa cgt tta aca atg
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc agc gaa
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
ttt atc gcc ctt ctg cca agc ata cgc ccg cgc tat tac tcg att tct
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
tca tca cct cgt gtc gat gaa aaa caa gca agc atc acg gtc agc gtt
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
gtc tca gga gaa gcg tgg agc gga tat gga gaa tat aaa gga att gcg
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
tcg aac tat ctt gcc gag ctg caa gaa gga gat acg att acg tgc ttt
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
att tcc aca ccg cag tca gaa ttt acg ctg cca aaa gac cct gaa acg
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
ccg ctt atc atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga ggc
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
ttt gtg cag gcg cgc aaa cag cta aaa gaa caa gga cag tca ctt gga
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
gaa gca cat tta tac ttc ggc tgc cgt tca cct cat gaa gac tat ctg
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
tat caa gaa gag ctt gaa aac gcc caa agc gaa ggc atc att acg ctt
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
cat acc gct ttt tct cgc atg cca aat cag ccg aaa aca tac gtt cag
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat caa
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
gga gcg cac ttc tat att tgc gga gac gga agc caa atg gca cct gcc
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
gtt gaa gca acg ctt atg aaa agc tat gct gac gtt cac caa gtg
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
agt gaa gca gac gct cgc tta tgg ctg cag cag cta gaa gaa aaa
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
ggc cga tac gca aaa gac gtg tgg gct ggg
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa aat
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
tta ccg tta tta aac aca gat aaa ctt gtt caa gct ttg atg aaa att
Leu Pro Leu Leu Asn Thr Asp Lys Leu Val Gln Ala Leu Met Lys Ile
gcg gat gaa tta gga gaa atc ttt aaa ttc gag gcg cct ggt cgt gta
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
acg cgc tac tta tca agt cag cgt cta att aaa gaa gca tgc gat gaa
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
tca cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt gat
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
ttt gca gga gac ggg tta aag aca agc tgg acg cat gaa aaa aat tgg
Phe Ala Gly Asp Gly Leu Lys Thr Ser Trp Thr His Glu Lys Asn Trp
aaa aaa gcg cat aat atc tta ctt cca agc ttc agt cag cag gca atg
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
aaa ggc tat cat gcg atg atg gtc gat atc gcc gtg cag ctt gtt caa
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
aag tgg gag cgt cta aat gca gat gag cat att gaa gta ccg gaa gac
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac tat
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
cgc ttt aac agc ttt tac cga gat cag cct cat cca ttt att aca agt
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca aat
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
cca gac gac cca gct tat gat gaa aac aag cgc cag ttt caa gaa gat
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
atc aag gtg atg aac gac cta gta gat aaa att att gca gat cgc aaa
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
gca agc ggt gaa caa agc gat gat tta tta acg cat atg cta aac gga
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc tat
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt ctt
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
tta tca ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta caa
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
aaa gca gca gaa gaa gca gca cga gtt cta gta gat cct gtt cca agc
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac gaa
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
gcg ctg cgc tta tgg cca act gct cct gtt ttt tcc cta tat gca aaa
Ala Leu Arg Leu Trp Pro Thr Ala Pro Val Phe Ser Leu Tyr Ala Lys
gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac gaa
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
cta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg gga
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt gcg
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg tgt
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
atc ggt cag cag ttc gct ctt cat gaa gca acg ctg gta ctt ggt atg
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
atg cta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg gat
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa gca
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
aaa tcg aaa aaa att ccg ctt ggc ggt att cct tca cct agc act gaa
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
cag tct gct aaa aaa gta cgc aaa aag gca gaa aac gct cat aat acg
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
ccg ctg ctt gtg cta tac ggt tca aat atg gga aca gct gaa gga acg
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
gcg cgt gat tta gca gat att gca atg agc aaa gga ttt gca ccg cag
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
gtc gca acg ctt gat tca cac gcc gga aat ctt ccg cgc gaa gga gct
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac gca
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
aag caa ttt gtc gac tgg tta gac caa gcg tct gct gat gaa gta aaa
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
ggc gtt cgc tac tcc gta ttt gga tgc ggc gat aaa aac tgg gct act
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
acg tat caa aaa gtg cct gct ttt atc gat gaa acg ctt gcc gct aaa
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
ggg gca gaa aac atc gct gac cgc ggt gaa gca gat gca agc gac gac
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac gta
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
gca gcc tac ttt aac ctc gac att gaa aac agt gaa gat aat aaa tct
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
act ctt tca ctt caa ttt gtc gac agc gcc gcg gat atg ccg ctt gcg
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
aaa atg cac ggt gcg ttt tca acg aac gtc gta gca agc aaa gaa ctt
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
caa cag cca ggc agt gca cga agc acg cga cat ctt gaa att gaa ctt
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
cca aaa gaa gct tct tat caa gaa gga gat cat tta ggt gtt att cct
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc cta
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
gat gca tca cag caa atc cgt ctg gaa gca gaa gaa gaa aaa tta gct
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
cat ttg cca ctc gct aaa aca gta tcc gta gaa gag ctt ctg caa tac
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg gct
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
gct aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gcc ttg ctt
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
gaa aag caa gcc tac aaa gaa caa gtg ctg gca aaa cgt tta aca atg
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc agc gaa
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
ttt atc gcc ctt ctg cca agc ata cgc ccg cgc tat tac tcg att tct
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
tca tca cct cgt gtc gat gaa aaa caa gca agc atc acg gtc agc gtt
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
gtc tca gga gaa gcg tgg agc gga tat gga gaa tat aaa gga att gcg
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
tcg aac tat ctt gcc gag ctg caa gaa gga gat acg att acg tgc ttt
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
att tcc aca ccg cag tca gaa ttt acg ctg cca aaa gac cct gaa acg
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
ccg ctt atc atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga ggc
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
ttt gtg cag gcg cgc aaa cag cta aaa gaa caa gga cag tca ctt gga
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
gaa gca cat tta tac ttc ggc tgc cgt tca cct cat gaa gac tat ctg
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
tat caa gaa gag ctt gaa aac gcc caa agc gaa ggc atc att acg ctt
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
cat acc gct ttt tct cgc atg cca aat cag ccg aaa aca tac gtt cag
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat caa
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
gga gcg cac ttc tat att tgc gga gac gga agc caa atg gca cct gcc
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
gtt gaa gca acg ctt atg aaa agc tat gct gac gtt cac caa gtg
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
agt gaa gca gac gct cgc tta tgg ctg cag cag cta gaa gaa aaa
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
ggc cga tac gca aaa gac gtg tgg gct ggg
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
配列番号12
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
Leu Pro Leu Leu Asn Thr Asp Lys Leu Val Gln Ala Leu Met Lys Ile
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
Phe Ala Gly Asp Gly Leu Lys Thr Ser Trp Thr His Glu Lys Asn Trp
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
Ala Leu Arg Leu Trp Pro Thr Ala Pro Val Phe Ser Leu Tyr Ala Lys
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
Leu Pro Leu Leu Asn Thr Asp Lys Leu Val Gln Ala Leu Met Lys Ile
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
Phe Ala Gly Asp Gly Leu Lys Thr Ser Trp Thr His Glu Lys Asn Trp
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
Ala Leu Arg Leu Trp Pro Thr Ala Pro Val Phe Ser Leu Tyr Ala Lys
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
配列番号13
aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa aat
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
tta ccg tta tta aac aca gat aaa ccg gtt caa gct ttg atg aaa att
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
gcg gat gaa tta gga gaa atc ttt aaa ttc gag gcg cct ggt cgt gta
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
acg cgc tac tta tca agt cag cgt cta att aaa gaa gca tgc gat gaa
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
tca cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt gat
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
ttt gca gga gac ggg tta aag aca agc tgg acg cat gaa aaa aat tgg
Phe Ala Gly Asp Gly Leu Lys Thr Ser Trp Thr His Glu Lys Asn Trp
aaa aaa gcg cat aat atc tta ctt cca agc ttc agt cag cag gca atg
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
aaa ggc tat cat gcg atg atg gtc gat atc gcc gtg cag ctt gtt caa
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
aag tgg gag cgt cta aat gca gat gag cat att gaa gta ccg gaa gac
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac tat
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
cgc ttt aac agc ttt tac cga gat cag cct cat cca ttt att aca agt
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca aat
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
cca gac gac cca gct tat gat gaa aac aag cgc cag ttt caa gaa gat
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
atc aag gtg atg aac gac cta gta gat aaa att att gca gat cgc aaa
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
gca agc ggt gaa caa agc gat gat tta tta acg cat atg cta aac gga
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc tat
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt ctt
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
tta tca ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta caa
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
aaa gca gca gaa gaa gca gca cga gtt cta gta gat cct gtt cca agc
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac gaa
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
gcg ctg cgc tta tgg cca act gct cct gtt ttt tcc cta tat gca aaa
Ala Leu Arg Leu Trp Pro Thr Ala Pro Val Phe Ser Leu Tyr Ala Lys
gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac gaa
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
cta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg gga
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt gcg
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg tgt
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
atc ggt cag cag ttc gct ctt cat gaa gca acg ctg gta ctt ggt atg
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
atg cta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg gat
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
att aaa gaa act tta atg tta aaa cct gaa ggc ttt gtg gta aaa gca
Ile Lys Glu Thr Leu Met Leu Lys Pro Glu Gly Phe Val Val Lys Ala
aaa tcg aaa aaa att ccg ctt ggc ggt att cct tca cct agc act gaa
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
cag tct gct aaa aaa gta cgc aaa aag gca gaa aac gct cat aat acg
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
ccg ctg ctt gtg cta tac ggt tca aat atg gga aca gct gaa gga acg
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
gcg cgt gat tta gca gat att gca atg agc aaa gga ttt gca ccg cag
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
gtc gca acg ctt gat tca cac gcc gga aat ctt ccg cgc gaa gga gct
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac gca
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
aag caa ttt gtc gac tgg tta gac caa gcg tct gct gat gaa gta aaa
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
ggc gtt cgc tac tcc gta ttt gga tgc ggc gat aaa aac tgg gct act
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
acg tat caa aaa gtg cct gct ttt atc gat gaa acg ctt gcc gct aaa
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
ggg gca gaa aac atc gct gac cgc ggt gaa gca gat gca agc gac gac
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac gta
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
gca gcc tac ttt aac ctc gac att gaa aac agt gaa gat aat aaa tct
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
act ctt tca ctt caa ttt gtc gac agc gcc gcg gat atg ccg ctt gcg
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
aaa atg cac ggt gcg ttt tca acg aac gtc gta gca agc aaa gaa ctt
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
caa cag cca ggc agt gca cga agc acg cga cat ctt gaa att gaa ctt
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
cca aaa gaa gct tct tat caa gaa gga gat cat tta ggt gtt att cct
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc cta
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
gat gca tca cag caa atc cgt ctg gaa gca gaa gaa gaa aaa tta gct
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
cat ttg cca ctc gct aaa aca gta tcc gta gaa gag ctt ctg caa tac
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg gct
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
gct aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gcc ttg ctt
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
gaa aag caa gcc tac aaa gaa caa gtg ctg gca aaa cgt tta aca atg
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc agc gaa
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
ttt atc gcc ctt ctg cca agc ata cgc ccg cgc tat tac tcg att tct
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
tca tca cct cgt gtc gat gaa aaa caa gca agc atc acg gtc agc gtt
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
gtc tca gga gaa gcg tgg agc gga tat gga gaa tat aaa gga att gcg
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
tcg aac tat ctt gcc gag ctg caa gaa gga gat acg att acg tgc ttt
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
att tcc aca ccg cag tca gaa ttt acg ctg cca aaa gac cct gaa acg
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
ccg ctt atc atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga ggc
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
ttt gtg cag gcg cgc aaa cag cta aaa gaa caa gga cag tca ctt gga
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
gaa gca cat tta tac ttc ggc tgc cgt tca cct cat gaa gac tat ctg
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
tat caa gaa gag ctt gaa aac gcc caa agc gaa ggc atc att acg ctt
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
cat acc gct ttt tct cgc atg cca aat cag ccg aaa aca tac gtt cag
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat caa
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
gga gcg cac ttc tat att tgc gga gac gga agc caa atg gca cct gcc
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
gtt gaa gca acg ctt atg aaa agc tat gct gac gtt cac caa gtg
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
agt gaa gca gac gct cgc tta tgg ctg cag cag cta gaa gaa aaa
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
ggc cga tac gca aaa gac gtg tgg gct ggg
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
aca att aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa aat
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
tta ccg tta tta aac aca gat aaa ccg gtt caa gct ttg atg aaa att
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
gcg gat gaa tta gga gaa atc ttt aaa ttc gag gcg cct ggt cgt gta
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
acg cgc tac tta tca agt cag cgt cta att aaa gaa gca tgc gat gaa
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
tca cgc ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt gat
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
ttt gca gga gac ggg tta aag aca agc tgg acg cat gaa aaa aat tgg
Phe Ala Gly Asp Gly Leu Lys Thr Ser Trp Thr His Glu Lys Asn Trp
aaa aaa gcg cat aat atc tta ctt cca agc ttc agt cag cag gca atg
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
aaa ggc tat cat gcg atg atg gtc gat atc gcc gtg cag ctt gtt caa
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
aag tgg gag cgt cta aat gca gat gag cat att gaa gta ccg gaa gac
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac tat
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
cgc ttt aac agc ttt tac cga gat cag cct cat cca ttt att aca agt
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
atg gtc cgt gca ctg gat gaa gca atg aac aag ctg cag cga gca aat
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
cca gac gac cca gct tat gat gaa aac aag cgc cag ttt caa gaa gat
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
atc aag gtg atg aac gac cta gta gat aaa att att gca gat cgc aaa
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
gca agc ggt gaa caa agc gat gat tta tta acg cat atg cta aac gga
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc tat
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
caa att att aca ttc tta att gcg gga cac gaa aca aca agt ggt ctt
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
tta tca ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta caa
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
aaa gca gca gaa gaa gca gca cga gtt cta gta gat cct gtt cca agc
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac gaa
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
gcg ctg cgc tta tgg cca act gct cct gtt ttt tcc cta tat gca aaa
Ala Leu Arg Leu Trp Pro Thr Ala Pro Val Phe Ser Leu Tyr Ala Lys
gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac gaa
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
cta atg gtt ctg att cct cag ctt cac cgt gat aaa aca att tgg gga
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt gcg
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg tgt
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
atc ggt cag cag ttc gct ctt cat gaa gca acg ctg gta ctt ggt atg
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
atg cta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg gat
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
att aaa gaa act tta atg tta aaa cct gaa ggc ttt gtg gta aaa gca
Ile Lys Glu Thr Leu Met Leu Lys Pro Glu Gly Phe Val Val Lys Ala
aaa tcg aaa aaa att ccg ctt ggc ggt att cct tca cct agc act gaa
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
cag tct gct aaa aaa gta cgc aaa aag gca gaa aac gct cat aat acg
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
ccg ctg ctt gtg cta tac ggt tca aat atg gga aca gct gaa gga acg
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
gcg cgt gat tta gca gat att gca atg agc aaa gga ttt gca ccg cag
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
gtc gca acg ctt gat tca cac gcc gga aat ctt ccg cgc gaa gga gct
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac gca
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
aag caa ttt gtc gac tgg tta gac caa gcg tct gct gat gaa gta aaa
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
ggc gtt cgc tac tcc gta ttt gga tgc ggc gat aaa aac tgg gct act
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
acg tat caa aaa gtg cct gct ttt atc gat gaa acg ctt gcc gct aaa
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
ggg gca gaa aac atc gct gac cgc ggt gaa gca gat gca agc gac gac
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac gta
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
gca gcc tac ttt aac ctc gac att gaa aac agt gaa gat aat aaa tct
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
act ctt tca ctt caa ttt gtc gac agc gcc gcg gat atg ccg ctt gcg
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
aaa atg cac ggt gcg ttt tca acg aac gtc gta gca agc aaa gaa ctt
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
caa cag cca ggc agt gca cga agc acg cga cat ctt gaa att gaa ctt
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
cca aaa gaa gct tct tat caa gaa gga gat cat tta ggt gtt att cct
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc cta
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
gat gca tca cag caa atc cgt ctg gaa gca gaa gaa gaa aaa tta gct
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
cat ttg cca ctc gct aaa aca gta tcc gta gaa gag ctt ctg caa tac
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg gct
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
gct aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gcc ttg ctt
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
gaa aag caa gcc tac aaa gaa caa gtg ctg gca aaa cgt tta aca atg
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc agc gaa
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
ttt atc gcc ctt ctg cca agc ata cgc ccg cgc tat tac tcg att tct
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
tca tca cct cgt gtc gat gaa aaa caa gca agc atc acg gtc agc gtt
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
gtc tca gga gaa gcg tgg agc gga tat gga gaa tat aaa gga att gcg
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
tcg aac tat ctt gcc gag ctg caa gaa gga gat acg att acg tgc ttt
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
att tcc aca ccg cag tca gaa ttt acg ctg cca aaa gac cct gaa acg
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
ccg ctt atc atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga ggc
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
ttt gtg cag gcg cgc aaa cag cta aaa gaa caa gga cag tca ctt gga
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
gaa gca cat tta tac ttc ggc tgc cgt tca cct cat gaa gac tat ctg
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
tat caa gaa gag ctt gaa aac gcc caa agc gaa ggc atc att acg ctt
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
cat acc gct ttt tct cgc atg cca aat cag ccg aaa aca tac gtt cag
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat caa
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
gga gcg cac ttc tat att tgc gga gac gga agc caa atg gca cct gcc
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
gtt gaa gca acg ctt atg aaa agc tat gct gac gtt cac caa gtg
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
agt gaa gca gac gct cgc tta tgg ctg cag cag cta gaa gaa aaa
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
ggc cga tac gca aaa gac gtg tgg gct ggg
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
配列番号14
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
Phe Ala Gly Asp Gly Leu Lys Thr Ser Trp Thr His Glu Lys Asn Trp
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
Ala Leu Arg Leu Trp Pro Thr Ala Pro Val Phe Ser Leu Tyr Ala Lys
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
Ile Lys Glu Thr Leu Met Leu Lys Pro Glu Gly Phe Val Val Lys Ala
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile
Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val
Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu
Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp
Phe Ala Gly Asp Gly Leu Lys Thr Ser Trp Thr His Glu Lys Asn Trp
Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met
Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln
Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp
Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr
Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser
Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp
Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys
Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr
Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser
Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu
Ala Leu Arg Leu Trp Pro Thr Ala Pro Val Phe Ser Leu Tyr Ala Lys
Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu
Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala
Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys
Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp
Ile Lys Glu Thr Leu Met Leu Lys Pro Glu Gly Phe Val Val Lys Ala
Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu
Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr
Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln
Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala
Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala
Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr
Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys
Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp
Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val
Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser
Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu
Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu
Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro
Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu
Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr
Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu
Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu
Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser
Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala
Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe
Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr
Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly
Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu
Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu
His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln
His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln
Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro Ala
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln Val
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu Lys
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
配列番号15
gggaccatgg gcagcagcca tcatca
gggaccatgg gcagcagcca tcatca
配列番号16
ggggaattct tacccagccc acacgtc
ggggaattct tacccagccc acacgtc
配列番号17
gcaggagacg ggttaaagac aagctggacg catgaaaaaa at
gcaggagacg ggttaaagac aagctggacg catgaaaaaa at
配列番号18
gcaggagacg ggttacatac aagctggacg catgaaaaaa at
gcaggagacg ggttacatac aagctggacg catgaaaaaa at
配列番号19
gcaggagacg ggttaatcac aagctggacg catgaaaaaa at
gcaggagacg ggttaatcac aagctggacg catgaaaaaa at
配列番号20
taacccgtct cctgcaaaat cacg
taacccgtct cctgcaaaat cacg
配列番号21
cgcttatggc caactgctcc tgttttttcc ctatatgcaa aaga
cgcttatggc caactgctcc tgttttttcc ctatatgcaa aaga
配列番号22
agttggccat aagcgcagcg
agttggccat aagcgcagcg
配列番23
ttaaacacag ataaacttgt tcaagctttg atgaaaattg c
ttaaacacag ataaacttgt tcaagctttg atgaaaattg c
配列番号24
tttatctgtg tttaataacg gtaaatt
tttatctgtg tttaataacg gtaaatt
配列番号25
gatattaaag aaactttaat gttaaaacct gaaggctttg tggta
gatattaaag aaactttaat gttaaaacct gaaggctttg tggta
配列番号26
agtttcttta atatccagct cgtagtt
agtttcttta atatccagct cgtagtt
Claims (16)
- 以下の(1)から(3)のいずれかに示すポリペプチド;
(1)配列番号2に示すアミノ酸配列において、F87K/I/H、A330V、P25L及びT438Mよりなる群から選択される一又は二以上のアミノ酸置換を含むアミノ酸配列からなるポリペプチド;
(2)上記(1)に記載のアミノ酸配列において、1個又は数個のアミノ酸残基が置換、付加、挿入又は欠失されてなり、ω3系脂肪酸のω3位をエポキシ化する触媒活性を有し、かつ、F87K/I/H、A330V、P25L及びT438Mよりなる群から選択される一又は二以上のアミノ酸置換は保持されているポリペプチド;
(3)上記(1)に記載のアミノ酸配列に対する配列同一性が90%以上であり、ω3系脂肪酸のω3位をエポキシ化する触媒活性を有し、かつ、F87K/I/H、A330V、P25L及びT438Mよりなる群から選択される一又は二以上のアミノ酸置換は保持されているポリペプチド。
(但し、F87K/I/Hとは、配列番号2における87位のアミノ酸残基であるFがK、IもしくはHに置換されていることを示し、A330Vとは、配列番号2における330位のアミノ酸残基であるAがVに置換されていることを示し、P25Lとは、配列番号2における25位のアミノ酸残基であるPがLに置換されていることを示し、T438Mとは、配列番号2における438位のアミノ酸残基であるTがMに置換されていることを示す。) - 前記(3)における配列同一性が95%以上である請求項1に記載のポリペプチド。
- オキシゲナーゼである、請求項1に記載のポリペプチド。
- 請求項1~3のいずれか1項に記載のポリペプチドを含む、ω3系脂肪酸のω3位エポキシ化用酵素剤。
- NADPH再生酵素、活性酸素を除去する酵素及び過酸化水素を除去する酵素よりなる群から選択される一又は二以上の酵素をさらに含む、請求項4に記載のω3位エポキシ化用酵素剤。
- 以下の(1)から(3)のいずれかに示すDNA;
(1)請求項1に記載のポリペプチドをコードするDNA;
(2)配列番号3、5、7及び9のいずれかに示す塩基配列からなるDNA;
(3)配列番号3、5、7及び9のいずれかに示す塩基配列と等価の配列を含み、ω3系脂肪酸のω3位をエポキシ化する触媒活性を有するポリペプチドをコードするDNA。 - 請求項6に記載のDNAを含む発現カセット。
- 請求項6に記載のDNAを含む組換えベクター。
- 請求項6のDNA、請求項7に記載の発現カセット又は請求項8に記載の組換えベクターを有する形質転換体。
- 請求項9に記載の形質転換体を培養する工程を含む、オキシゲナーゼの製造方法。
- ω3系脂肪酸に、請求項1~3のいずれか1項に記載のポリペプチドを作用させる工程を含む、ω3位エポキシ化脂肪酸の製造方法。
- ω3系脂肪酸に、請求項4に記載のω3位エポキシ化用酵素剤を作用させる工程を含む、ω3位エポキシ化脂肪酸の製造方法。
- ω3系脂肪酸に、請求項1~3のいずれか1項に記載のポリペプチドを作用させる工程を含む、ω3系脂肪酸のエポキシ化方法。
- ω3系脂肪酸に、請求項4に記載のω3位エポキシ化用酵素剤を作用させる工程を含む、ω3系脂肪酸のエポキシ化方法。
- 請求項11の製造方法で得られる、ω3位エポキシ化脂肪酸を含む組成物。
- 請求項12の製造方法で得られる、ω3位エポキシ化脂肪酸を含む組成物。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2022102307 | 2022-06-27 | ||
JP2022-102307 | 2022-06-27 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2024004921A1 true WO2024004921A1 (ja) | 2024-01-04 |
Family
ID=89383038
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2023/023517 WO2024004921A1 (ja) | 2022-06-27 | 2023-06-26 | ポリペプチド、オキシゲナーゼ及びこれらの応用 |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2024004921A1 (ja) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2003521889A (ja) * | 1999-07-27 | 2003-07-22 | ビーエーエスエフ アクチェンゲゼルシャフト | 新規のシトクロムp450モノオキシゲナーゼ及び有機化合物の酸化のためのその使用 |
WO2010070086A2 (de) * | 2008-12-18 | 2010-06-24 | Evonik Degussa Gmbh | Verfahren zur enzymatischen umsetzung von alkanen |
WO2012023254A1 (ja) * | 2010-08-19 | 2012-02-23 | 国立大学法人 東京大学 | オメガ3系脂肪酸由来の新規抗炎症性代謝物 |
JP2015534831A (ja) * | 2012-11-09 | 2015-12-07 | キャリスタ, インコーポレイテッド | 脂肪酸誘導体の生物学的生産のための組成物および方法 |
US20160355852A1 (en) * | 2014-02-18 | 2016-12-08 | The California Institute Of Technology | Methods and systems for sulfimidation or sulfoximidation of organic molecules |
-
2023
- 2023-06-26 WO PCT/JP2023/023517 patent/WO2024004921A1/ja unknown
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2003521889A (ja) * | 1999-07-27 | 2003-07-22 | ビーエーエスエフ アクチェンゲゼルシャフト | 新規のシトクロムp450モノオキシゲナーゼ及び有機化合物の酸化のためのその使用 |
WO2010070086A2 (de) * | 2008-12-18 | 2010-06-24 | Evonik Degussa Gmbh | Verfahren zur enzymatischen umsetzung von alkanen |
WO2012023254A1 (ja) * | 2010-08-19 | 2012-02-23 | 国立大学法人 東京大学 | オメガ3系脂肪酸由来の新規抗炎症性代謝物 |
JP2015534831A (ja) * | 2012-11-09 | 2015-12-07 | キャリスタ, インコーポレイテッド | 脂肪酸誘導体の生物学的生産のための組成物および方法 |
US20160355852A1 (en) * | 2014-02-18 | 2016-12-08 | The California Institute Of Technology | Methods and systems for sulfimidation or sulfoximidation of organic molecules |
Non-Patent Citations (1)
Title |
---|
OLIVER C F ET AL: "A single mutation in cytochrome P450 BM3 changes substrate orientation in a catalytic intermediate and the regiospecificity of hydroxylation.", BIOCHEMISTRY, vol. 36, 1 January 1997 (1997-01-01), pages 1567 - 1572, XP002131694, ISSN: 0006-2960, DOI: 10.1021/bi962826c * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Kniemeyer et al. | (S)-1-Phenylethanol dehydrogenase of Azoarcus sp. strain EbN1, an enzyme of anaerobic ethylbenzene catabolism | |
WO2012073987A1 (ja) | 大腸菌形質転換体、それを用いたフラビン結合型グルコースデヒドロゲナーゼの製造方法、および、変異型フラビン結合型グルコースデヒドロゲナーゼ | |
JP4836017B2 (ja) | 糸状菌由来グルコースデヒドロゲナーゼを組換え体で高発現するための方法 | |
US7575910B2 (en) | Method for producing L-fuculose and method for producing L-fucose | |
CN106754775A (zh) | 一种羰基还原酶突变体及其基因与应用 | |
Pisanelli et al. | Pyranose 2-oxidase from Phanerochaete chrysosporium—expression in E. coli and biochemical characterization | |
JP4107389B2 (ja) | グルコース脱水素酵素βサブユニット及びそれをコードするDNA | |
Hu et al. | Molecular evolution of Fome lignosus laccase by ethyl methane sulfonate-based random mutagenesis in vitro | |
WO2024004921A1 (ja) | ポリペプチド、オキシゲナーゼ及びこれらの応用 | |
CN115232799B (zh) | 一种苯乙烯单加氧酶突变体及其应用 | |
Sugano et al. | Role of H164 in a unique dye-decolorizing heme peroxidase DyP | |
JP6198997B1 (ja) | 新規グルコースデヒドロゲナーゼ | |
JP5207430B2 (ja) | 新規な酵素及びそれを含有する組成物 | |
JPWO2002044387A1 (ja) | 新規なフルクトシルアミノ酸オキシダーゼ | |
Itoh et al. | Cloning and biochemical characterization of Co2+-activated bromoperoxidase-esterase (perhydrolase) from Pseudomonas putida IF-3 strain | |
KR20040064238A (ko) | 옥시도리덕타제 | |
Yamashita et al. | Isolation, characterization, and molecular cloning of a thermostable xylitol oxidase from Streptomyces sp. IKD472 | |
Heckmann-Pohl et al. | Improvement of the fungal enzyme pyranose 2-oxidase using protein engineering | |
JP6053309B2 (ja) | セロビオースデヒドロゲナーゼ様グルコースデヒドロゲナーゼ | |
CN108624568B (zh) | 变异黄嘌呤脱氢酶及其应用 | |
JP5115708B2 (ja) | 新規過酸化水素生成型nadhオキシダーゼ及びそれをコードするdna | |
JP2006503559A (ja) | D−マンニトールを製造するための方法並びに微生物 | |
JPWO2005123921A1 (ja) | 新規グリセロール脱水素酵素、その遺伝子、及びその利用法 | |
KR102309783B1 (ko) | Cyp102a15 또는 cyp102a170을 포함하는 지방산의 히드록시화용 조성물 | |
JPH11155579A (ja) | フルクトシルアミノ酸オキシダーゼ遺伝子、新規な組み換え体dna及びフルクトシルアミノ酸オキシダーゼの製造法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23831363 Country of ref document: EP Kind code of ref document: A1 |