CN114934026B - 具有增加的连接效率的t4 dna连接酶变体 - Google Patents
具有增加的连接效率的t4 dna连接酶变体 Download PDFInfo
- Publication number
- CN114934026B CN114934026B CN202210557719.XA CN202210557719A CN114934026B CN 114934026 B CN114934026 B CN 114934026B CN 202210557719 A CN202210557719 A CN 202210557719A CN 114934026 B CN114934026 B CN 114934026B
- Authority
- CN
- China
- Prior art keywords
- lys
- leu
- glu
- gly
- ile
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 102000012410 DNA Ligases Human genes 0.000 title claims abstract description 95
- 108010061982 DNA Ligases Proteins 0.000 title claims abstract description 95
- 102000040430 polynucleotide Human genes 0.000 claims description 36
- 108091033319 polynucleotide Proteins 0.000 claims description 36
- 239000002157 polynucleotide Substances 0.000 claims description 36
- 239000013598 vector Substances 0.000 claims description 31
- 238000000034 method Methods 0.000 claims description 29
- 150000001413 amino acids Chemical group 0.000 claims description 16
- 239000000203 mixture Substances 0.000 claims description 10
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 claims description 4
- 239000011541 reaction mixture Substances 0.000 claims description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 claims description 4
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 claims description 3
- -1 serine amino acids Chemical class 0.000 claims description 3
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 claims description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 claims 2
- 239000004471 Glycine Substances 0.000 claims 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims 1
- 230000000694 effects Effects 0.000 abstract description 22
- 239000012634 fragment Substances 0.000 abstract description 17
- 238000006467 substitution reaction Methods 0.000 abstract description 10
- 210000004027 cell Anatomy 0.000 description 71
- 108010034529 leucyl-lysine Proteins 0.000 description 60
- 150000007523 nucleic acids Chemical group 0.000 description 44
- 108090000623 proteins and genes Proteins 0.000 description 43
- 102000004196 processed proteins & peptides Human genes 0.000 description 39
- 108090000765 processed proteins & peptides Proteins 0.000 description 39
- 229920001184 polypeptide Polymers 0.000 description 37
- 108020004414 DNA Proteins 0.000 description 36
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 27
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 27
- 108010050848 glycylleucine Proteins 0.000 description 25
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 24
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 24
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 24
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 24
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 24
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 24
- 108010062796 arginyllysine Proteins 0.000 description 24
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 24
- 108010003137 tyrosyltyrosine Proteins 0.000 description 24
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 23
- 108091026890 Coding region Proteins 0.000 description 20
- 102000039446 nucleic acids Human genes 0.000 description 20
- 108020004707 nucleic acids Proteins 0.000 description 20
- 108091028043 Nucleic acid sequence Proteins 0.000 description 19
- 239000013612 plasmid Substances 0.000 description 17
- 108010076504 Protein Sorting Signals Proteins 0.000 description 16
- 239000013604 expression vector Substances 0.000 description 16
- 230000014509 gene expression Effects 0.000 description 16
- 102000004169 proteins and genes Human genes 0.000 description 14
- 230000010076 replication Effects 0.000 description 14
- 125000003275 alpha amino acid group Chemical group 0.000 description 13
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 12
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 12
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 12
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 12
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 12
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 12
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 12
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 12
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 12
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 12
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 12
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 12
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 12
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 12
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 12
- WOPFJPHVBWKZJH-SRVKXCTJSA-N Arg-Arg-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O WOPFJPHVBWKZJH-SRVKXCTJSA-N 0.000 description 12
- BBYTXXRNSFUOOX-IHRRRGAJSA-N Arg-Cys-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BBYTXXRNSFUOOX-IHRRRGAJSA-N 0.000 description 12
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 12
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 12
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 12
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 12
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 12
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 12
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 12
- GURLOFOJBHRPJN-AAEUAGOBSA-N Asn-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GURLOFOJBHRPJN-AAEUAGOBSA-N 0.000 description 12
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 12
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 12
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 12
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 12
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 12
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 12
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 12
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 12
- ORYFTECKJZTNQP-DCAQKATOSA-N Cys-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N ORYFTECKJZTNQP-DCAQKATOSA-N 0.000 description 12
- 102000004190 Enzymes Human genes 0.000 description 12
- 108090000790 Enzymes Proteins 0.000 description 12
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 12
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 12
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 12
- FALJZCPMTGJOHX-SRVKXCTJSA-N Gln-Met-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O FALJZCPMTGJOHX-SRVKXCTJSA-N 0.000 description 12
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 12
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 12
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 12
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 12
- MLCPTRRNICEKIS-FXQIFTODSA-N Glu-Asn-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLCPTRRNICEKIS-FXQIFTODSA-N 0.000 description 12
- ZZIFPJZQHRJERU-WDSKDSINSA-N Glu-Cys-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZZIFPJZQHRJERU-WDSKDSINSA-N 0.000 description 12
- GYCPQVFKCPPRQB-GUBZILKMSA-N Glu-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N GYCPQVFKCPPRQB-GUBZILKMSA-N 0.000 description 12
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 12
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 12
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 12
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 12
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 12
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 12
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 12
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 12
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 12
- CQIIXEHDSZUSAG-QWRGUYRKSA-N Gly-His-His Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 CQIIXEHDSZUSAG-QWRGUYRKSA-N 0.000 description 12
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 12
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 12
- JPAACTMBBBGAAR-HOTGVXAUSA-N Gly-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)CC(C)C)C(O)=O)=CNC2=C1 JPAACTMBBBGAAR-HOTGVXAUSA-N 0.000 description 12
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 12
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 12
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 12
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 12
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 12
- STOOMQFEJUVAKR-KKUMJFAQSA-N His-His-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 STOOMQFEJUVAKR-KKUMJFAQSA-N 0.000 description 12
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 12
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 12
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 12
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 12
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 12
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 12
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 12
- 241000880493 Leptailurus serval Species 0.000 description 12
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 12
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 12
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 12
- WMTOVWLLDGQGCV-GUBZILKMSA-N Leu-Glu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WMTOVWLLDGQGCV-GUBZILKMSA-N 0.000 description 12
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 12
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 12
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 12
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 12
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 12
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 12
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 12
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 12
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 12
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 12
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 12
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 12
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 12
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 12
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 12
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 12
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 12
- IPSDPDAOSAEWCN-RHYQMDGZSA-N Lys-Met-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IPSDPDAOSAEWCN-RHYQMDGZSA-N 0.000 description 12
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 12
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 12
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 12
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 12
- AWMMBHDKERMOID-YTQUADARSA-N Lys-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCCCN)N)C(=O)O AWMMBHDKERMOID-YTQUADARSA-N 0.000 description 12
- IMDJSVBFQKDDEQ-MGHWNKPDSA-N Lys-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N IMDJSVBFQKDDEQ-MGHWNKPDSA-N 0.000 description 12
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 12
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 12
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 12
- OXIWIYOJVNOKOV-SRVKXCTJSA-N Met-Met-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCNC(N)=N OXIWIYOJVNOKOV-SRVKXCTJSA-N 0.000 description 12
- 108010066427 N-valyltryptophan Proteins 0.000 description 12
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 12
- MFQXSDWKUXTOPZ-DZKIICNBSA-N Phe-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N MFQXSDWKUXTOPZ-DZKIICNBSA-N 0.000 description 12
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 12
- ISYSEOWLRQKQEQ-JYJNAYRXSA-N Phe-His-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISYSEOWLRQKQEQ-JYJNAYRXSA-N 0.000 description 12
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 12
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 12
- LCUOTSLIVGSGAU-AVGNSLFASA-N Pro-His-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LCUOTSLIVGSGAU-AVGNSLFASA-N 0.000 description 12
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 12
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 12
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 12
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 12
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 12
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 12
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 12
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 12
- OFCKFBGRYHOKFP-IHPCNDPISA-N Trp-Asp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N OFCKFBGRYHOKFP-IHPCNDPISA-N 0.000 description 12
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 12
- JFDGVHXRCKEBAU-KKUMJFAQSA-N Tyr-Asp-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JFDGVHXRCKEBAU-KKUMJFAQSA-N 0.000 description 12
- WSFXJLFSJSXGMQ-MGHWNKPDSA-N Tyr-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WSFXJLFSJSXGMQ-MGHWNKPDSA-N 0.000 description 12
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 12
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 12
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 12
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 12
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 12
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 12
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 12
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 12
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 12
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 12
- AYHNXCJKBLYVOA-KSZLIROESA-N Val-Trp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N AYHNXCJKBLYVOA-KSZLIROESA-N 0.000 description 12
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 12
- JPBGMZDTPVGGMQ-ULQDDVLXSA-N Val-Tyr-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JPBGMZDTPVGGMQ-ULQDDVLXSA-N 0.000 description 12
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 12
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 12
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 12
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 12
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 12
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 12
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 12
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 12
- 108010038633 aspartylglutamate Proteins 0.000 description 12
- 108010092854 aspartyllysine Proteins 0.000 description 12
- 108010016616 cysteinylglycine Proteins 0.000 description 12
- 108010054813 diprotin B Proteins 0.000 description 12
- 229940088598 enzyme Drugs 0.000 description 12
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 12
- 108010087823 glycyltyrosine Proteins 0.000 description 12
- 108010037850 glycylvaline Proteins 0.000 description 12
- 108010028295 histidylhistidine Proteins 0.000 description 12
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 12
- 108010000761 leucylarginine Proteins 0.000 description 12
- 108010003700 lysyl aspartic acid Proteins 0.000 description 12
- 108010009298 lysylglutamic acid Proteins 0.000 description 12
- 108010038320 lysylphenylalanine Proteins 0.000 description 12
- 108010017391 lysylvaline Proteins 0.000 description 12
- 108010070643 prolylglutamic acid Proteins 0.000 description 12
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 12
- 108010051110 tyrosyl-lysine Proteins 0.000 description 12
- 108010073969 valyllysine Proteins 0.000 description 12
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 11
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 11
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 11
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 11
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 11
- HMRAQFJFTOLDKW-GUBZILKMSA-N Ser-His-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMRAQFJFTOLDKW-GUBZILKMSA-N 0.000 description 11
- BCSYBBMFGLHCOA-ACZMJKKPSA-N Cys-Glu-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BCSYBBMFGLHCOA-ACZMJKKPSA-N 0.000 description 10
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 10
- 230000001105 regulatory effect Effects 0.000 description 10
- 241000588724 Escherichia coli Species 0.000 description 9
- 239000002773 nucleotide Substances 0.000 description 9
- 125000003729 nucleotide group Chemical group 0.000 description 9
- 238000006243 chemical reaction Methods 0.000 description 8
- 230000002538 fungal effect Effects 0.000 description 8
- 240000006439 Aspergillus oryzae Species 0.000 description 7
- 235000002247 Aspergillus oryzae Nutrition 0.000 description 7
- 241000196324 Embryophyta Species 0.000 description 7
- 230000001580 bacterial effect Effects 0.000 description 7
- 239000000499 gel Substances 0.000 description 7
- 230000010354 integration Effects 0.000 description 7
- 230000035772 mutation Effects 0.000 description 7
- 239000004382 Amylase Substances 0.000 description 6
- 108010065511 Amylases Proteins 0.000 description 6
- 102000013142 Amylases Human genes 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- 102000003960 Ligases Human genes 0.000 description 6
- 108090000364 Ligases Proteins 0.000 description 6
- 235000019418 amylase Nutrition 0.000 description 6
- 239000000047 product Substances 0.000 description 6
- 102200111697 rs104894453 Human genes 0.000 description 6
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 5
- 241000351920 Aspergillus nidulans Species 0.000 description 5
- 241000228245 Aspergillus niger Species 0.000 description 5
- 101000757144 Aspergillus niger Glucoamylase Proteins 0.000 description 5
- 241000238631 Hexapoda Species 0.000 description 5
- 102000012288 Phosphopyruvate Hydratase Human genes 0.000 description 5
- 108010022181 Phosphopyruvate Hydratase Proteins 0.000 description 5
- 238000003780 insertion Methods 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 5
- 238000013518 transcription Methods 0.000 description 5
- 230000035897 transcription Effects 0.000 description 5
- 108091093088 Amplicon Proteins 0.000 description 4
- 244000063299 Bacillus subtilis Species 0.000 description 4
- 235000014469 Bacillus subtilis Nutrition 0.000 description 4
- 108010048241 acetamidase Proteins 0.000 description 4
- 125000001931 aliphatic group Chemical group 0.000 description 4
- 108090000637 alpha-Amylases Proteins 0.000 description 4
- 102000004139 alpha-Amylases Human genes 0.000 description 4
- 229940024171 alpha-amylase Drugs 0.000 description 4
- 230000003321 amplification Effects 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 238000003776 cleavage reaction Methods 0.000 description 4
- 238000012217 deletion Methods 0.000 description 4
- 230000037430 deletion Effects 0.000 description 4
- 238000010790 dilution Methods 0.000 description 4
- 239000012895 dilution Substances 0.000 description 4
- 238000001502 gel electrophoresis Methods 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 238000003199 nucleic acid amplification method Methods 0.000 description 4
- 230000008488 polyadenylation Effects 0.000 description 4
- 230000007017 scission Effects 0.000 description 4
- 238000013207 serial dilution Methods 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 239000000758 substrate Substances 0.000 description 4
- 108010037870 Anthranilate Synthase Proteins 0.000 description 3
- 102000004580 Aspartic Acid Proteases Human genes 0.000 description 3
- 108010017640 Aspartic Acid Proteases Proteins 0.000 description 3
- 241000701022 Cytomegalovirus Species 0.000 description 3
- 241000223221 Fusarium oxysporum Species 0.000 description 3
- 241000235403 Rhizomucor miehei Species 0.000 description 3
- IXKSXJFAGXLQOQ-XISFHERQSA-N WHWLQLKPGQPMY Chemical compound C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 IXKSXJFAGXLQOQ-XISFHERQSA-N 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 230000029087 digestion Effects 0.000 description 3
- 102000006602 glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 3
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 3
- 230000006801 homologous recombination Effects 0.000 description 3
- 238000002744 homologous recombination Methods 0.000 description 3
- 210000004962 mammalian cell Anatomy 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 230000007935 neutral effect Effects 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 238000003259 recombinant expression Methods 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- 241000701447 unidentified baculovirus Species 0.000 description 3
- OSJPPGNTCRNQQC-UWTATZPHSA-N 3-phospho-D-glyceric acid Chemical compound OC(=O)[C@H](O)COP(O)(O)=O OSJPPGNTCRNQQC-UWTATZPHSA-N 0.000 description 2
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 2
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 2
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 2
- 101100163849 Arabidopsis thaliana ARS1 gene Proteins 0.000 description 2
- 101000690713 Aspergillus niger Alpha-glucosidase Proteins 0.000 description 2
- 101900318521 Aspergillus oryzae Triosephosphate isomerase Proteins 0.000 description 2
- 241000193830 Bacillus <bacterium> Species 0.000 description 2
- 101000695691 Bacillus licheniformis Beta-lactamase Proteins 0.000 description 2
- 108010029675 Bacillus licheniformis alpha-amylase Proteins 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 241000701489 Cauliflower mosaic virus Species 0.000 description 2
- 102000010911 Enzyme Precursors Human genes 0.000 description 2
- 108010062466 Enzyme Precursors Proteins 0.000 description 2
- 241000701533 Escherichia virus T4 Species 0.000 description 2
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 2
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 2
- 241000193385 Geobacillus stearothermophilus Species 0.000 description 2
- 101100369308 Geobacillus stearothermophilus nprS gene Proteins 0.000 description 2
- 101100080316 Geobacillus stearothermophilus nprT gene Proteins 0.000 description 2
- 102100027612 Kallikrein-11 Human genes 0.000 description 2
- 241001468191 Lactobacillus kefiri Species 0.000 description 2
- 241000255777 Lepidoptera Species 0.000 description 2
- 241001467552 Mycobacterium bovis BCG Species 0.000 description 2
- 102000002508 Peptide Elongation Factors Human genes 0.000 description 2
- 108010068204 Peptide Elongation Factors Proteins 0.000 description 2
- 101000928111 Scheffersomyces stipitis (strain ATCC 58785 / CBS 6054 / NBRC 10063 / NRRL Y-11545) Alcohol dehydrogenase 1 Proteins 0.000 description 2
- 101100097319 Schizosaccharomyces pombe (strain 972 / ATCC 24843) ala1 gene Proteins 0.000 description 2
- 241000723873 Tobacco mosaic virus Species 0.000 description 2
- 101710152431 Trypsin-like protease Proteins 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 239000002671 adjuvant Substances 0.000 description 2
- 238000001042 affinity chromatography Methods 0.000 description 2
- 239000011543 agarose gel Substances 0.000 description 2
- 229960000723 ampicillin Drugs 0.000 description 2
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 125000003118 aryl group Chemical group 0.000 description 2
- 229960000190 bacillus calmette–guérin vaccine Drugs 0.000 description 2
- 238000002869 basic local alignment search tool Methods 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 239000007795 chemical reaction product Substances 0.000 description 2
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 2
- 238000004587 chromatography analysis Methods 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 125000004122 cyclic group Chemical group 0.000 description 2
- 125000000524 functional group Chemical group 0.000 description 2
- 108010061330 glucan 1,4-alpha-maltohydrolase Proteins 0.000 description 2
- 229910001385 heavy metal Inorganic materials 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 201000001441 melanoma Diseases 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- 150000008300 phosphoramidites Chemical class 0.000 description 2
- SCVFZCLFOSHCOH-UHFFFAOYSA-M potassium acetate Chemical compound [K+].CC([O-])=O SCVFZCLFOSHCOH-UHFFFAOYSA-M 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 101150054232 pyrG gene Proteins 0.000 description 2
- 230000008439 repair process Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 230000003248 secreting effect Effects 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- IGXNPQWXIRIGBF-KEOOTSPTSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 IGXNPQWXIRIGBF-KEOOTSPTSA-N 0.000 description 1
- CYNAPIVXKRLDER-LBPRGKRZSA-N (2s)-2-benzamido-3-(4-hydroxy-3-nitrophenyl)propanoic acid Chemical compound C([C@@H](C(=O)O)NC(=O)C=1C=CC=CC=1)C1=CC=C(O)C([N+]([O-])=O)=C1 CYNAPIVXKRLDER-LBPRGKRZSA-N 0.000 description 1
- ASWBNKHCZGQVJV-UHFFFAOYSA-N (3-hexadecanoyloxy-2-hydroxypropyl) 2-(trimethylazaniumyl)ethyl phosphate Chemical compound CCCCCCCCCCCCCCCC(=O)OCC(O)COP([O-])(=O)OCC[N+](C)(C)C ASWBNKHCZGQVJV-UHFFFAOYSA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- UFBJCMHMOXMLKC-UHFFFAOYSA-N 2,4-dinitrophenol Chemical compound OC1=CC=C([N+]([O-])=O)C=C1[N+]([O-])=O UFBJCMHMOXMLKC-UHFFFAOYSA-N 0.000 description 1
- IAJOBQBIJHVGMQ-UHFFFAOYSA-N 2-amino-4-[hydroxy(methyl)phosphoryl]butanoic acid Chemical compound CP(O)(=O)CCC(N)C(O)=O IAJOBQBIJHVGMQ-UHFFFAOYSA-N 0.000 description 1
- 101710163881 5,6-dihydroxyindole-2-carboxylic acid oxidase Proteins 0.000 description 1
- RZVAJINKPMORJF-UHFFFAOYSA-N Acetaminophen Chemical compound CC(=O)NC1=CC=C(O)C=C1 RZVAJINKPMORJF-UHFFFAOYSA-N 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102100034044 All-trans-retinol dehydrogenase [NAD(+)] ADH1B Human genes 0.000 description 1
- 101710193111 All-trans-retinol dehydrogenase [NAD(+)] ADH4 Proteins 0.000 description 1
- 241000024188 Andala Species 0.000 description 1
- 241000534414 Anotopterus nikparini Species 0.000 description 1
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 1
- 241000228212 Aspergillus Species 0.000 description 1
- 101000961203 Aspergillus awamori Glucoamylase Proteins 0.000 description 1
- 101900127796 Aspergillus oryzae Glucoamylase Proteins 0.000 description 1
- 108090000145 Bacillolysin Proteins 0.000 description 1
- 101000775727 Bacillus amyloliquefaciens Alpha-amylase Proteins 0.000 description 1
- 241000194108 Bacillus licheniformis Species 0.000 description 1
- 108010045681 Bacillus stearothermophilus neutral protease Proteins 0.000 description 1
- 101000755953 Bacillus subtilis (strain 168) Ribosome maturation factor RimP Proteins 0.000 description 1
- 101900040182 Bacillus subtilis Levansucrase Proteins 0.000 description 1
- 108091005658 Basic proteases Proteins 0.000 description 1
- 102100030981 Beta-alanine-activating enzyme Human genes 0.000 description 1
- 102100026189 Beta-galactosidase Human genes 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 108010059892 Cellulase Proteins 0.000 description 1
- 102100037633 Centrin-3 Human genes 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 241000723655 Cowpea mosaic virus Species 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- OCEHKDFAWQIBHH-FXQIFTODSA-N Cys-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N OCEHKDFAWQIBHH-FXQIFTODSA-N 0.000 description 1
- QNNYDGBKNFDYOD-UBHSHLNASA-N Cys-Trp-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N QNNYDGBKNFDYOD-UBHSHLNASA-N 0.000 description 1
- 102000018832 Cytochromes Human genes 0.000 description 1
- 108010052832 Cytochromes Proteins 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 101100342470 Dictyostelium discoideum pkbA gene Proteins 0.000 description 1
- 108090000204 Dipeptidase 1 Proteins 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 101100385973 Escherichia coli (strain K12) cycA gene Proteins 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 101150094690 GAL1 gene Proteins 0.000 description 1
- 101150108358 GLAA gene Proteins 0.000 description 1
- 102100028501 Galanin peptides Human genes 0.000 description 1
- 108010001498 Galectin 1 Proteins 0.000 description 1
- 102100021736 Galectin-1 Human genes 0.000 description 1
- 101100001650 Geobacillus stearothermophilus amyM gene Proteins 0.000 description 1
- 239000005561 Glufosinate Substances 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- 101150009006 HIS3 gene Proteins 0.000 description 1
- 101100295959 Halobacterium salinarum (strain ATCC 700922 / JCM 11081 / NRC-1) arcB gene Proteins 0.000 description 1
- 101100246753 Halobacterium salinarum (strain ATCC 700922 / JCM 11081 / NRC-1) pyrF gene Proteins 0.000 description 1
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 1
- 101000773364 Homo sapiens Beta-alanine-activating enzyme Proteins 0.000 description 1
- 101000880522 Homo sapiens Centrin-3 Proteins 0.000 description 1
- 101100121078 Homo sapiens GAL gene Proteins 0.000 description 1
- 241001480714 Humicola insolens Species 0.000 description 1
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- 108010059881 Lactase Proteins 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- 240000001929 Lactobacillus brevis Species 0.000 description 1
- 235000013957 Lactobacillus brevis Nutrition 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- 101001110310 Lentilactobacillus kefiri NADP-dependent (R)-specific alcohol dehydrogenase Proteins 0.000 description 1
- 108090001060 Lipase Proteins 0.000 description 1
- 102000004882 Lipase Human genes 0.000 description 1
- 239000004367 Lipase Substances 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- 101150068888 MET3 gene Proteins 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 102000006833 Multifunctional Enzymes Human genes 0.000 description 1
- 108010047290 Multifunctional Enzymes Proteins 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- 101100062121 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) cyc-1 gene Proteins 0.000 description 1
- 101100022915 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) cys-11 gene Proteins 0.000 description 1
- 108090000913 Nitrate Reductases Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 102000007981 Ornithine carbamoyltransferase Human genes 0.000 description 1
- 101710113020 Ornithine transcarbamylase, mitochondrial Proteins 0.000 description 1
- 102100037214 Orotidine 5'-phosphate decarboxylase Human genes 0.000 description 1
- 108010055012 Orotidine-5'-phosphate decarboxylase Proteins 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 206010034133 Pathogen resistance Diseases 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 101000968489 Rhizomucor miehei Lipase Proteins 0.000 description 1
- 101100394989 Rhodopseudomonas palustris (strain ATCC BAA-98 / CGA009) hisI gene Proteins 0.000 description 1
- 101900354623 Saccharomyces cerevisiae Galactokinase Proteins 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- 101100022918 Schizosaccharomyces pombe (strain 972 / ATCC 24843) sua1 gene Proteins 0.000 description 1
- CLKKNZQUQMZDGD-SRVKXCTJSA-N Ser-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CN=CN1 CLKKNZQUQMZDGD-SRVKXCTJSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- 244000061456 Solanum tuberosum Species 0.000 description 1
- 235000002595 Solanum tuberosum Nutrition 0.000 description 1
- 241000256248 Spodoptera Species 0.000 description 1
- 241000256251 Spodoptera frugiperda Species 0.000 description 1
- 101100309436 Streptococcus mutans serotype c (strain ATCC 700610 / UA159) ftf gene Proteins 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 241000187432 Streptomyces coelicolor Species 0.000 description 1
- 101100370749 Streptomyces coelicolor (strain ATCC BAA-471 / A3(2) / M145) trpC1 gene Proteins 0.000 description 1
- 241000187391 Streptomyces hygroscopicus Species 0.000 description 1
- 108090000787 Subtilisin Proteins 0.000 description 1
- QAOWNCQODCNURD-UHFFFAOYSA-L Sulfate Chemical compound [O-]S([O-])(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-L 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 101100157012 Thermoanaerobacterium saccharolyticum (strain DSM 8691 / JW/SL-YS485) xynB gene Proteins 0.000 description 1
- 241000223258 Thermomyces lanuginosus Species 0.000 description 1
- 241001313536 Thermothelomyces thermophila Species 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 102000005924 Triose-Phosphate Isomerase Human genes 0.000 description 1
- 108700015934 Triose-phosphate isomerases Proteins 0.000 description 1
- 101150050575 URA3 gene Proteins 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- 206010046865 Vaccinia virus infection Diseases 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 230000001594 aberrant effect Effects 0.000 description 1
- 102000005421 acetyltransferase Human genes 0.000 description 1
- 108020002494 acetyltransferase Proteins 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 239000013543 active substance Substances 0.000 description 1
- 108010045649 agarase Proteins 0.000 description 1
- 108010051873 alkaline protease Proteins 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 101150009206 aprE gene Proteins 0.000 description 1
- 101150008194 argB gene Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 101150103518 bar gene Proteins 0.000 description 1
- 230000037429 base substitution Effects 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 108010051210 beta-Fructofuranosidase Proteins 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 239000003139 biocide Substances 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- UDSAIICHUKSCKT-UHFFFAOYSA-N bromophenol blue Chemical compound C1=C(Br)C(O)=C(Br)C=C1C1(C=2C=C(Br)C(O)=C(Br)C=2)C2=CC=CC=C2S(=O)(=O)O1 UDSAIICHUKSCKT-UHFFFAOYSA-N 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 229940106157 cellulase Drugs 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 101150005799 dagA gene Proteins 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- JGBUYEVOKHLFID-UHFFFAOYSA-N gelred Chemical compound [I-].[I-].C=1C(N)=CC=C(C2=CC=C(N)C=C2[N+]=2CCCCCC(=O)NCCCOCCOCCOCCCNC(=O)CCCCC[N+]=3C4=CC(N)=CC=C4C4=CC=C(N)C=C4C=3C=3C=CC=CC=3)C=1C=2C1=CC=CC=C1 JGBUYEVOKHLFID-UHFFFAOYSA-N 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 108010002685 hygromycin-B kinase Proteins 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000000099 in vitro assay Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000005462 in vivo assay Methods 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 229910052500 inorganic mineral Inorganic materials 0.000 description 1
- 239000001573 invertase Substances 0.000 description 1
- 235000011073 invertase Nutrition 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 229940116108 lactase Drugs 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 235000019421 lipase Nutrition 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 101150039489 lysZ gene Proteins 0.000 description 1
- 230000002934 lysing effect Effects 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- UEGPKNKPLBYCNK-UHFFFAOYSA-L magnesium acetate Chemical compound [Mg+2].CC([O-])=O.CC([O-])=O UEGPKNKPLBYCNK-UHFFFAOYSA-L 0.000 description 1
- 239000011654 magnesium acetate Substances 0.000 description 1
- 229940069446 magnesium acetate Drugs 0.000 description 1
- 235000011285 magnesium acetate Nutrition 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 239000011707 mineral Substances 0.000 description 1
- 235000010755 mineral Nutrition 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 101150095344 niaD gene Proteins 0.000 description 1
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 1
- 101150105920 npr gene Proteins 0.000 description 1
- 101150017837 nprM gene Proteins 0.000 description 1
- 238000003499 nucleic acid array Methods 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 101150019841 penP gene Proteins 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 229920001983 poloxamer Polymers 0.000 description 1
- 229920000447 polyanionic polymer Polymers 0.000 description 1
- 229920005862 polyol Polymers 0.000 description 1
- 150000003077 polyols Chemical class 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 235000011056 potassium acetate Nutrition 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 235000019833 protease Nutrition 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- 101150108007 prs gene Proteins 0.000 description 1
- 101150086435 prs1 gene Proteins 0.000 description 1
- 101150070305 prsA gene Proteins 0.000 description 1
- 239000011535 reaction buffer Substances 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 1
- 238000005096 rolling process Methods 0.000 description 1
- 102220277134 rs776745497 Human genes 0.000 description 1
- 101150025220 sacB gene Proteins 0.000 description 1
- 238000005185 salting out Methods 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 238000010532 solid phase synthesis reaction Methods 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 239000012089 stop solution Substances 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 238000011282 treatment Methods 0.000 description 1
- PIEPQKCYPFFYMG-UHFFFAOYSA-N tris acetate Chemical compound CC(O)=O.OCC(N)(CO)CO PIEPQKCYPFFYMG-UHFFFAOYSA-N 0.000 description 1
- 101150016309 trpC gene Proteins 0.000 description 1
- 230000001810 trypsinlike Effects 0.000 description 1
- 238000005199 ultracentrifugation Methods 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 208000007089 vaccinia Diseases 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- 101150110790 xylB gene Proteins 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
- C12Q1/6853—Nucleic acid amplification reactions using modified primers or templates
- C12Q1/6855—Ligating adaptors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/93—Ligases (6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/26—Preparation of nitrogen-containing carbohydrates
- C12P19/28—N-glycosides
- C12P19/30—Nucleotides
- C12P19/34—Polynucleotides, e.g. nucleic acids, oligoribonucleotides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6806—Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
- C12Q1/686—Polymerase chain reaction [PCR]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
- C12Q1/6862—Ligase chain reaction [LCR]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y605/00—Ligases forming phosphoric ester bonds (6.5)
- C12Y605/01—Ligases forming phosphoric ester bonds (6.5) forming phosphoric ester bonds (6.5.1)
- C12Y605/01001—DNA ligase (ATP) (6.5.1.1)
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Analytical Chemistry (AREA)
- Biophysics (AREA)
- Immunology (AREA)
- Biomedical Technology (AREA)
- Medicinal Chemistry (AREA)
- Plant Pathology (AREA)
- General Chemical & Material Sciences (AREA)
- Enzymes And Modification Thereof (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
本发明包括一种突变型T4DNA连接酶或其生物活性片段,其具有比野生型T4DNA连接酶更强的活性。如发明内容中更完全地描述的,所述突变型T4DNA连接酶或其生物活性片段具有不同于所述野生型的一个或多个取代。
Description
序列表
本申请包含序列表,该序列表已以ASCII格式以电子方式提交,并通过引用在此整体并入。上述ASCII副本创建于2022年4月4日,名称为ABCL-T4HiAct_SL.txt,大小为78749字节。
背景技术
连接酶通常用于分子生物学,以用于在并置的5'磷酸和3'羟基末端的交叉处形成双螺旋核酸片段之间的磷酸二酯键。通过设计每个双链片段之间的互补突出端,连接可以是位置特异性连接和定向连接二者。这允许DNA或RNA片段特异性整合到更大的载体中,以满足分子生物学研究的需要。在可商购获得的连接酶中,T4 DNA连接酶是一种多功能的酶,其催化在粘性末端和平滑末端两者处连接双螺旋DNA或RNA的键,具有快速的连接速度,并且还经由连接修复带切口DNA中存在的错配。它可以连接DNA、RNA和RNA-DNA杂交体的粘性末端或平滑末端(但不连接单链核酸)。它还可以按比大肠杆菌DNA连接酶更高的效率连接平滑末端DNA。
因为T4 DNA连接酶是许多分子生物学方案的骨架并且被持续需要,因此由于它在分子生物学试剂市场中的重要性而使它为许多生命科学产品公司创造了很大份额的收入。增加T4 DNA连接酶的活性有益于T4 DNA连接酶本身的制造,从而允许减少产生大量最终产品所需的生产量。取决于使用者的目标,增加T4 DNA连接酶的活性还通过减少酶或底物完全连接所需的总时间来增加酶的有用性。
发明内容
本发明涉及与野生型连接酶相比表现出增强的连接活性的工程化T4 DNA连接酶突变体。以下T4 DNA连接酶突变体(在指示位置处具有取代,并且其中每个突变体的氨基酸序列是指定取代后的序列识别号,并且其中每个这种突变体的DNA序列是序列表中在每个突变体的氨基酸序列识别号之前的奇数序列识别号)被鉴定为具有这种增强的连接活性(其中每个都具有在C末端处添加的标签序列:GSGSSGHHHHHH(SEQ ID NO:25)):E89K(SEQID NO:4)、E271K(SEQ ID NO:6)、D340R(SEQ ID NO:8)、D371Q(SEQ ID NO:10)、D371R(SEQID NO:12)、E419K(SEQ ID NO:14)、E438K(SEQ ID NO:16)、E440R(SEQ ID NO:18)、E440W(SEQ ID NO:20)、D452R(SEQ ID NO:22)和K470E(SEQ ID NO:24)。本发明还包括具有以上突变中的至少一种的T4 DNA连接酶突变体氨基酸序列,但是其中T4 DNA连接酶突变体氨基酸序列的剩余部分仅具有保守取代,使得所述分子与序列表中的相应T4 DNA连接酶突变体氨基酸序列具有至少70%、80%、90%、95%、96%、97%、98%或99%的同一性(在下文中被称为“变体序列”)。
本发明还包括在以上突变体的每个氨基酸序列之前的DNA序列(即,分别为SEQ IDNO:),并且还包括前述DNA序列和其他简并核酸序列(统称为“简并核酸序列”),其编码(i)以上T4 DNA连接酶突变体中的每一个,和(ii)变体序列中的任一个的氨基酸序列。
本发明还包括包含任何简并核酸序列的载体;以及用任何此类载体或简并核酸序列转化并能够表达以上T4 DNA连接酶突变体氨基酸序列或变体序列中的任一个的细胞。
本发明还包括一种组合物或试剂盒,其包含以上T4 DNA连接酶突变体氨基酸序列或变体序列、简并核酸序列或包含此类简并核酸序列的载体中的任一个。本发明还包括一种扩增靶核酸的方法,其中在被设计为扩增靶核酸的反应混合物中采用以上T4 DNA连接酶突变体或变体序列中的任一个,并且将所述试剂混合物经受用于扩增所述靶核酸的条件。
与野生型相比,以上突变型T4 DNA连接酶突变体在扩增靶DNA序列方面在更低浓度下具有更强的活性,并且变体序列也预期具有这种更强的活性。
附图说明
图1A和1B示出了来自与T4 DNA连接酶突变体中的每一种相比野生型T4DNA连接酶(图1a的左下子图中的“WT”)的活性测定的一系列凝胶电泳结果。每个图1a和1b中三个最左侧子图的左边的1kb梯是凝胶带的参考组。每个凝胶中存在12个柱,使得从左到右,每个柱代表T4 DNA连接酶在1:2连续稀释后的浓度,并且其中在柱1中,700ng/μl的T4 DNA连接酶(或突变体)初始存在于溶液中。每个凝胶在稀释水平处都有一个箭头,其中显著量(上条带,右侧标记为“1”)的超螺旋质粒产物和限制性消化的线性化底物质粒(中条带,右侧标记为“2”)是明显的。最下面的条带(右侧标记为“3”)是由T4 DNA连接酶野生型或指示的突变体连接的开环质粒产物。
具体实施方式
术语“生物活性片段”是指T4 DNA连接酶突变体或变体序列的任何片段、衍生物、同源物或类似物,其具有生物分子特有的体内或体外活性;包括例如连接酶活性,或经由连接修复带切口DNA中存在的错配。在一些实施方式中,突变型T4 DNA连接酶的生物活性片段、衍生物、同源物或类似物在任何体内或体外测定中均具有突变型T4 DNA连接酶的任何程度的生物活性。
在一些实施方式中,生物活性片段可以任选地包括突变型T4 DNA连接酶的任何数量的连续氨基酸残基或变体序列。本发明还包括编码任何这种生物活性片段和/或简并核酸序列的多核苷酸。
生物活性片段可以来自转录后加工或可替代剪接RNA的翻译,或者可替代地可以通过工程化、批量合成或其他合适的操作产生。生物活性片段包括在天然或内源细胞中表达的片段,以及在表达系统(例如像细菌、酵母、植物、昆虫或哺乳动物细胞)中产生的片段。
如本文所用,短语“保守氨基酸取代”或“保守突变”是指一个氨基酸被另一个具有共同特性的氨基酸替换。定义单个氨基酸之间共同特性的功能性方法是分析同源生物体的相应蛋白质之间氨基酸变化的归一化频率(Schulz(1979)Principles of ProteinStructure[蛋白质结构原理],Springer-Verlag)。根据此类分析,氨基酸组可以定义为组内的氨基酸优先相互交换,因此在它们对整个蛋白质结构的影响方面彼此最相似(前述Schulz(1979))。以这种方式定义的氨基酸组的实例可以包括:“带电荷/极性组”,包括Glu、Asp、Asn、Gln、Lys、Arg和His;“芳族或环状组”,包括Pro、Phe、Tyr和Trp;以及“脂族组”,包括Gly、Ala、Val、Leu、Ile、Met、Ser、Thr和Cys。在每个组内,还可以鉴定亚组。例如,带电荷/极性氨基酸的组可以细分为多个亚组,包括:“带正电荷的亚组”,包括Lys、Arg和His;“带负电荷的亚组”,包括Glu和Asp;以及“极性亚组”,包括Asn和Gln。在另一个实例中,芳族或环状组可以细分为多个亚组,包括:“氮环亚组”,包括Pro、His和Trp;“苯基亚组”,包括Phe和Tyr。在另一个进一步的实例中,脂族组可以细分为多个亚组,包括:“大脂族非极性亚组”,包括Val、Leu和Ile;“脂族弱极性亚组”,包括Met、Ser、Thr、和Cys;以及“小残基亚组”,包括Gly和Ala。保守突变的实例包括上述亚组内氨基酸的氨基酸取代,诸如但不限于:Lys取代Arg,或反之,使得可以保持正电荷;Glu取代Asp,或反之,使得可以保持负电荷;Ser取代Thr,或反之,使得可以保持游离-OH;以及Gln取代Asn,或反之,使得可以保持游离-NH2。“保守变体”是包括一个或多个氨基酸的多肽,所述一个或多个氨基酸已被取代以用具有共同特性的氨基酸(例如,属于如上所述的相同氨基酸组或亚组)替换参考多肽(例如,其序列在出版物或序列数据库中公开或其序列已通过核酸测序确定的多肽)的一个或多个氨基酸。
当提及基因时,“突变体”意指相对于天然或野生型基因,所述基因具有至少一个碱基(核苷酸)改变、缺失或插入。突变(一个或多个核苷酸的改变、缺失和/或插入)可以在基因的编码区中,或者可以在内含子、3'UTR、5'UTR或启动子区域中。作为非限制性实例,突变基因可以是在启动子区域内具有插入的基因,所述插入可以增加或减少所述基因的表达;可以是具有缺失的基因,导致无功能蛋白、截短蛋白、显性阴性蛋白或无蛋白的产生;或者,可以是具有一个或多个点突变的基因,导致所编码蛋白质的氨基酸改变或导致基因转录物的异常剪接。
术语“本发明的突变型T4 DNA连接酶”和“突变型T4 DNA连接酶”在此具体实施方式部分中使用时,根据上下文,共同或单独地是指测试的并表现出增强的连接活性的突变型T4 DNA连接酶多肽,它们是:
E89K(SEQ ID NO:4)、E271K(SEQ ID NO:6)、D340R(SEQ ID NO:8)、D371Q(SEQ IDNO:10)、D371R(SEQ ID NO:12)、E419K(SEQ ID NO:14)、E438K(SEQ ID NO:16)、E440R(SEQID NO:18)、E440W(SEQ ID NO:20)、D452R(SEQ ID NO:22)和K470E(SEQ ID NO:24),其中每个具有标签序列:在C末端处添加的GSGSSGHHHHHH(SEQ ID NO:25)),和/或变体序列和/或简并核酸序列,如发明内容部分中所定义的那些术语。
“天然存在”或“野生型”是指在自然界中发现的形式。例如,天然存在或野生型多肽或多核苷酸序列是存在于生物体中的序列,其没有被人为操作有意修饰。
关于核酸或多肽序列的术语“同一性百分比”或“同源性”定义为在排列序列以获得最大同一性百分比并引入缺口(如果需要)以实现最大同源性百分比之后,候选序列中与已知多肽相同的核苷酸或氨基酸残基的百分比。N-末端或C-末端插入或缺失不应被解释为影响同源性。核苷酸或氨基酸序列水平上的同源性或同一性可以通过BLAST(基本局部比对搜索工具(Basic Local Alignment Search Tool))分析来确定,所述分析使用由程序blastp、blastn、blastx、tblastn和tblastx采用的算法(Altschul(1997),Nucleic AcidsRes[核酸研究].25,3389-3402和Karlin(1990),Proc.Natl.Acad.Sci.USA[美国国家科学院院刊]87,2264-2268),所述程序是为序列相似性搜索定制的。BLAST程序使用的方法是首先考虑查询序列和数据库序列之间的相似片段(有或没有缺口),然后评估被鉴定的所有匹配的统计显著性,最后仅总结满足预先选择的显著性阈值的那些匹配。关于序列数据库相似性搜索中基本问题的讨论,参见Altschul(1994),Nature Genetics[自然遗传学]6,119-129。直方图、描述、比对、期望(即,用于报告与数据库序列匹配的统计显著性阈值)、截止、矩阵和过滤器(低复杂度)的搜索参数可以是默认设置。blastp、blastx、tblastn和tblastx使用的默认评分矩阵是BLOSUM62矩阵(Henikoff(1992),Proc.Natl.Acad.Sci.USA[美国国家科学院院刊]89,10915-10919),推荐用于长度超过85个单位(核苷酸碱基或氨基酸)的查询序列。
在一些实施方式中,本发明涉及用于进行连接反应的方法(以及相关试剂盒、系统、设备和组合物),所述方法包括或由以下组成:在一种或多种核苷酸存在下,使突变型T4DNA连接酶或其生物活性片段与核酸模板接触,并使用突变型T4 DNA连接酶或其生物活性片段连接所述一种或多种核苷酸中的至少一种。
在一些实施方式中,所述方法可以包括将双链RNA或DNA多核苷酸链连接到环状分子中。在一些实施方式中,所述方法可以进一步包括通过使用传感器检测指示连接的信号。在一些实施方式中,传感器是ISFET。在一些实施方式中,传感器可以包括连接反应中的可检测标记或可检测试剂。
在一些实施方式中,本发明涉及用于进行核酸的滚环扩增(参见美国专利号5,714,320,通过引用并入)的方法(以及相关的试剂盒、系统、设备和组合物),所述方法使用突变型T4 DNA连接酶作为扩增过程的连接步骤中的酶。扩增包括在溶液中扩增核酸,以及在固体支持物(诸如存在于固体支持物表面上的核酸珠、流动池、核酸阵列或孔)上克隆扩增核酸。
制备突变型T4 DNA连接酶
本发明的突变型T4 DNA连接酶可以在任何合适的宿主系统中表达,所述宿主系统包括细菌、酵母、真菌、杆状病毒、植物或哺乳动物宿主细胞。对于细菌宿主细胞,用于指导本公开的核酸构建体的转录的合适启动子包括从以下获得的启动子:大肠杆菌(E.coli)乳糖操纵子、天蓝色链霉菌琼脂糖酶基因(dagA)、枯草芽孢杆菌果聚糖蔗糖酶基因(sacB)、地衣芽孢杆菌α-淀粉酶基因(amyL)、嗜热脂肪芽孢杆菌麦芽糖淀粉酶基因(amyM)、解淀粉芽孢杆菌α-淀粉酶基因(amyQ)、地衣芽孢杆菌青霉素酶基因(penP)、枯草芽孢杆菌xylA和xylB基因和原核β-内酰胺酶基因(Villa-Kamaroff等人,1978,Proc.Natl Acad.Sci.USA[美国国家科学院院刊]75:3727-3731),以及tac启动子(DeBoer等人,1983,Proc.NatlAcad.Sci.USA[美国国家科学院院刊]80:21-25)。
对于丝状真菌宿主细胞,用于指导本公开的核酸构建体的转录的合适启动子包括从以下酶的基因获得的启动子:米曲霉TAKA淀粉酶、米黑根毛霉天冬氨酸蛋白酶、黑曲霉中性α-淀粉酶、黑曲霉酸稳定性α-淀粉酶、黑曲霉或泡盛曲霉葡萄糖淀粉酶(glaA)、米黑根毛霉脂肪酶、米曲霉碱性蛋白酶、米曲霉磷酸丙糖异构酶、构巢曲霉乙酰胺酶和尖孢镰孢胰蛋白酶样蛋白酶(WO 96/00787),以及NA2-tpi启动子(来自黑曲霉中性α-淀粉酶和米曲霉磷酸丙糖异构酶的基因的启动子的杂交体)及其突变体、截短和杂交启动子。
在酵母宿主中,有用的启动子可以来自以下酶的基因:酿酒酵母(Saccharomycescerevisiae)烯醇化酶(ENO-1)、酿酒酵母半乳糖激酶(GAL1)、酿酒酵母乙醇脱氢酶/甘油醛-3-磷酸脱氢酶(ADH2/GAP)、以及酿酒酵母3-磷酸甘油酸激酶。酵母宿主细胞的其他有用的启动子由Romanos等人,1992,Yeast[酵母]8:423-488描述。
对于杆状病毒表达,来源于鳞翅目(蛾和蝴蝶)的昆虫细胞系,诸如草地贪夜蛾被用作宿主。基因表达受强启动子(例如,pPolh)的控制。
植物表达载体基于根癌土壤杆菌(Agrobacterium tumefaciens)的Ti质粒,或基于烟草花叶病毒(TMV)、马铃薯X病毒或豇豆花叶病毒。植物表达载体中常用的组成型启动子是花椰菜花叶病毒(CaMV)35S启动子。
对于哺乳动物表达,培养的哺乳动物细胞系诸如中国仓鼠卵巢(CHO)、COS,包括人细胞系诸如HEK和HeLa,可以用于产生突变型T4 DNA连接酶。哺乳动物表达载体的实例包括腺病毒载体、pSV和pCMV系列质粒载体、牛痘病毒和逆转录病毒载体以及杆状病毒。巨细胞病毒(CMV)和SV40的启动子通常用于哺乳动物表达载体中以驱动基因表达。非病毒启动子,诸如延伸因子(EF)-1启动子也是已知的。
用于表达的控制序列也可以是合适的转录终止子序列,即,由宿主细胞识别以终止转录的序列。终止子序列可操作地连接至编码所述多肽的核酸序列的3'末端。可以使用在所选择宿主细胞中具有功能性的任何终止子。
例如,用于丝状真菌宿主细胞的示例性转录终止子可以从以下酶的基因获得:米曲霉TAKA淀粉酶、黑曲霉葡糖淀粉酶、构巢曲霉邻氨基苯甲酸合酶、黑曲霉α-葡糖苷酶和尖孢镰孢胰蛋白酶样蛋白酶。
用于酵母宿主细胞的示例性终止子可以从以下酶的基因获得:酿酒酵母烯醇化酶、酿酒酵母细胞色素C(CYC1)、以及酿酒酵母甘油醛-3-磷酸脱氢酶。
用于昆虫、植物和哺乳动物宿主细胞的终止子也是熟知的。
控制序列也可以是合适的前导序列,即对由宿主细胞进行的翻译很重要的mRNA的非翻译区。前导序列可操作地连接至编码所述多肽的核酸序列的5'末端。可以使用在所选择宿主细胞中具有功能性的任何前导序列。用于丝状真菌宿主细胞的示例性前导序列从米曲霉TAKA淀粉酶和构巢曲霉磷酸丙糖异构酶的基因获得。用于酵母宿主细胞的合适的前导序列从以下酶的基因获得:酿酒酵母烯醇化酶(ENO-1)、酿酒酵母3-磷酸甘油酸激酶、酿酒酵母α-因子、以及酿酒酵母醇脱氢酶/甘油醛-3-磷酸脱氢酶(ADH2/GAP)。
控制序列还可以是多腺苷酸化序列,一种可操作地连接至所述核酸序列的3'末端并且当转录时由宿主细胞识别为将多腺苷酸残基添加至所转录的mRNA的信号的序列。在所选择宿主细胞中具有功能性的任何多腺苷酸化序列均可以用于本发明中。用于丝状真菌宿主细胞的示例性多腺苷酸化序列可以来自以下酶的基因:米曲霉TAKA淀粉酶、黑曲霉葡糖淀粉酶、构巢曲霉邻氨基苯甲酸合酶、尖孢镰孢胰蛋白酶样蛋白酶和黑曲霉α-葡糖苷酶。
控制序列还可以是编码与多肽的氨基末端连接的氨基酸序列并指导所编码多肽进入细胞的分泌途径的信号肽编码区。核酸序列的编码序列的5'端本身可以含有在翻译阅读框中与编码分泌的多肽的编码区的区段天然连接的信号肽编码区。可替代地,编码序列的5'端可以含有对编码序列是外源的信号肽编码区。在编码序列天然地不含有信号肽编码区的情况下,可能需要外源信号肽编码区。
可替代地,外源信号肽编码区可以单纯地替代天然信号肽编码区以便增强多肽的分泌。然而,可以使用指导已表达多肽进入所选择宿主细胞的分泌途径的任何信号肽编码区。
用于细菌宿主细胞的有效信号肽编码区是从以下酶的基因获得的信号肽编码区:芽孢杆菌NCIB 11837麦芽糖淀粉酶、嗜热脂肪芽孢杆菌α-淀粉酶、地衣芽孢杆菌枯草杆菌蛋白酶、地衣芽孢杆菌β-内酰胺酶、嗜热脂肪芽孢杆菌中性蛋白酶(nprT、nprS、nprM)、以及枯草芽孢杆菌prsA。进一步的信号肽由Simonen和Palva,1993,Microbiol Rev[微生物评论]57:109-137描述。
用于丝状真菌宿主细胞的有效信号肽编码区可以是从以下酶的基因获得的信号肽编码区:米曲霉TAKA淀粉酶、黑曲霉中性淀粉酶、黑曲霉葡糖淀粉酶、米黑根毛霉天冬氨酸蛋白酶、特异腐质霉纤维素酶、以及疏棉状腐质霉脂肪酶。
用于酵母宿主细胞的有用的信号肽可以来自酿酒酵母α-因子和酿酒酵母转化酶的基因。用于其他宿主细胞系统的信号肽也是熟知的。
控制序列还可以是编码位于多肽的氨基末端处的氨基酸序列的前肽编码区。所得的多肽被称为前体酶(proenzyme)或多肽原(或在一些情况下被称为酶原(zymogen))。多肽原通常是无活性的并且可以通过催化切割或自身催化切割来自多肽原的前肽而转化为成熟的活性多肽。前肽编码区可以从以下酶的基因获得:枯草芽孢杆菌碱性蛋白酶(aprE)、枯草芽孢杆菌中性蛋白酶(nprT)、酿酒酵母α-因子、米黑根毛霉天冬氨酸蛋白酶、以及嗜热毁丝霉乳糖酶(WO 95/33836)。
在信号肽和前肽区二者都存在于多肽的氨基末端处的情况下,所述前肽区位于紧邻多肽的氨基末端,并且所述信号肽区位于紧邻前肽区的氨基末端。
还可期望的是添加调节序列,其允许相对于宿主细胞的生长调节突变型T4DNA连接酶的表达。调节系统的实例是响应于化学或物理刺激(包括调节化合物的存在)而引起基因表达被开启或关闭的那些。在原核宿主细胞中,合适的调节序列包括lac、tac和trp操纵子系统。在酵母宿主细胞中,合适的调节系统包括例如,ADH2系统或GAL1系统。在丝状真菌中,合适的调节序列包括TAKAα淀粉酶启动子、黑曲霉葡糖淀粉酶启动子和米曲霉葡糖淀粉酶启动子。用于其他宿主细胞的调节系统也是熟知的。
调节序列的其他实例是允许基因扩增的那些序列。在真核系统中,这些包括在甲氨蝶呤存在下扩增的二氢叶酸还原酶基因以及用重金属扩增的金属硫蛋白基因。在这些情况下,编码本发明KRED多肽的核酸序列将与调节序列可操作地连接。
另一个实施方式包括重组表达载体,其包含编码工程化突变型T4 DNA连接酶或其变体的多核苷酸,和一个或多个表达调节区诸如启动子和终止子,以及复制起点,这取决于它们将被引入的宿主的类型。上述各种核酸和控制序列可以连接在一起以产生重组表达载体,所述重组表达载体可以包括一个或多个便利的限制位点以允许编码突变型T4 DNA连接酶的核酸序列在此类位点处的插入或取代。可替代地,可以通过将核酸序列或包含所述序列的核酸构建体插入用于表达的适当载体中来表达突变型T4 DNA连接酶的核酸序列。在产生表达载体时,编码序列位于载体中,使得编码序列与用于表达的适当控制序列可操作地连接。
重组表达载体可以是可以方便地经受重组DNA程序并且可以引起突变型T4 DNA连接酶多核苷酸序列表达的任何载体(例如,质粒或病毒)。载体的选择将通常取决于载体与待引入载体的宿主细胞的相容性。载体可以是直链质粒或闭合环状质粒。
表达载体可以是自主复制载体,即作为染色体外实体存在的载体,其复制不依赖于染色体复制,例如质粒、染色体外元件、微染色体或人工染色体。载体可以含有用于确保自我复制的任何手段。可替代地,载体可以是这样的载体,当它引入宿主细胞中时被整合到基因组中并与其中已整合了它的一个或多个染色体一起被复制。此外,可以使用单独的载体或质粒或共同含有待引入宿主细胞基因组的总DNA的两个或更多个载体或质粒,或可以使用转座子。
本文表达载体优选地含有一个或多个选择性标记,其允许容易地选择转化的细胞。选择性标记是一种基因,其产物提供了杀生物剂抗性或病毒抗性、对重金属抗性、对营养缺陷型的原营养等。细菌选择性标记的实例是来自枯草芽孢杆菌或地衣芽孢杆菌的dal基因,或赋予抗生素抗性(诸如氨苄青霉素、卡那霉素、氯霉素(实施例1)或四环素抗性)的标记。用于酵母宿主细胞的合适标记是ADE2、HIS3、LEU2、LYS2、MET3、TRP1、和URA3。用于丝状真菌宿主细胞中的选择性标记包括但不限于amdS(乙酰胺酶)、argB(鸟氨酸氨甲酰基转移酶)、bar(草丁膦乙酰转移酶)、hph(潮霉素磷酸转移酶)、niaD(硝酸还原酶)、pyrG(乳清酸核苷-5'-磷酸脱羧酶)、sC(硫酸腺苷酰基转移酶)、和trpC(邻氨基苯甲酸合酶)、以及其等效物。用于曲霉属细胞的实施方式包括构巢曲霉或米曲霉的amdS和pyrG基因,以及吸水链霉菌的bar基因。用于昆虫、植物和哺乳动物细胞的选择性标记也是熟知的。
本发明的表达载体优选地含有允许载体整合到宿主细胞的基因组中或载体在细胞中不依赖于基因组的自主复制的一个或多个元件。对于整合到宿主细胞基因组中,载体可以依赖于编码所述多肽的核酸序列或用于通过同源或非同源重组将载体整合到基因组中的所述载体的任何其他元件。
可替代地,表达载体可以含有另外的核酸序列,用于指导通过同源重组整合到宿主细胞的基因组中。另外的核酸序列使得载体能够整合到宿主细胞基因组中一个或多个染色体的一个或多个精确位置处。整合元件可以是与宿主细胞基因组内的靶序列同源的任何序列。此外,整合元件可以是非编码或编码核酸序列。另一方面,载体可以通过非同源重组整合到宿主细胞的基因组中。
为了自主复制,载体可以进一步包含复制起点,该复制起点使得载体能够在讨论中的宿主细胞中自主复制。细菌复制起点的实例是P15A ori,或允许在大肠杆菌中复制的质粒pBR322、pUC19、pACYC177(所述质粒具有P15A ori)或pACYC184的复制起点、以及允许在芽孢杆菌中复制的质粒pUB110、pE194、pTA1060或pAM31的复制起点。用于酵母宿主细胞中的复制起点的实例是2微米复制起点,ARS1,ARS4,ARS1与CEN3的组合,以及ARS4与CEN6的组合。复制起点可以是具有使其在宿主细胞中的功能对温度敏感的突变的复制起点(参见例如,Ehrlich,1978,Proc Natl Acad Sci.USA[美国国家科学院院刊]75:1433)。
可以将突变型T4 DNA连接酶的多于一个拷贝的核酸序列插入宿主细胞中,以增加基因产物的产生。通过将序列的至少一个另外的拷贝整合到宿主细胞基因组中或者通过包括与所述核酸序列一起的可扩增的选择性标记基因可以获得核酸序列的增加的拷贝数目,其中通过在适当的选择性试剂的存在下培养细胞可以选择含有选择性标记基因的经扩增的拷贝以及由此所述核酸序列的额外拷贝的细胞。
用于突变型T4 DNA连接酶多核苷酸的表达载体可商购获得。合适的商业表达载体包括来自圣路易斯的西格玛奥德里奇化学公司(Sigma-Aldrich Chemicals,St.LouisMo.)的p3xFLAGTM表达载体,其包括用于在哺乳动物宿主细胞中表达的CMV启动子和hGH聚腺苷酸化位点,以及用于在大肠杆菌中扩增的pBR322复制起点和氨苄青霉素抗性标记物。合适的其他表达载体是可从加利福尼亚州拉荷亚(LaJolla Calif.)的Stratagene公司商购获得的pBluescriptII SK(-)和pBK-CMV,以及来源于pBR322(Gibco BRL)、pUC(GibcoBRL)、pREP4、pCEP4(Invitrogen)或pPoly(Lathe等人,1987,Gene[基因]57:193-201)的质粒。
用于表达编码突变型T4 DNA连接酶的多核苷酸的合适宿主细胞是本领域熟知的,并且包括但不限于:细菌细胞,诸如大肠杆菌、克菲尔乳杆菌(Lactobacillus kefir)、短乳杆菌、微小乳杆菌、链霉菌和鼠伤寒沙门氏菌细胞;真菌细胞,诸如酵母细胞(例如,酿酒酵母或巴斯德毕赤酵母(ATCC登记号201178));昆虫细胞,诸如果蝇S2和夜蛾Sf9细胞;动物细胞,诸如CHO、COS、BHK、293和鲍斯黑色素瘤细胞(Bowes melanoma cell);以及植物细胞。用于上述宿主细胞的适当培养基和生长条件是本领域熟知的。
可以通过本领域已知的各种方法将用于表达突变型T4 DNA连接酶的多核苷酸引入细胞。这些技术包括电穿孔、生物射弹粒子轰击、脂质体介导的转染、氯化钙转染和原生质体融合。用于将多核苷酸引入细胞的各种方法是技术人员已知的。
可以根据已知的合成方法,通过标准固相方法来制备编码突变型T4 DNA连接酶的多核苷酸。在一些实施方式中,可以单独合成多达约100个碱基的片段,然后连接(例如,通过酶促或化学连接方法,或聚合酶介导的方法)以形成任何所希望的连续序列。例如,可以使用例如由Beaucage等人,1981,Tet Lett[四面体快报]22:1859-69描述的经典亚磷酰胺方法,或由Matthes等人,1984,EMBO J.[欧洲分子生物学学报]3:801-05描述的方法(例如,如其通常应用于自动合成方法中)而通过化学合成来制备多核苷酸。根据亚磷酰胺方法,例如在自动DNA合成仪中合成寡核苷酸,将其纯化、退火、连接并克隆到合适的载体中。另外,基本上任何核酸都可以从各种商业来源获得,所述商业来源诸如德克萨斯州米德兰的Midland Certified Reagent Company公司、加利福尼亚州拉蒙纳的Great American GeneCompany公司、伊利诺伊州芝加哥的ExpressGen公司和加利福尼亚州阿拉米达的OperonTechnologies公司。
使用任何一种或多种熟知的蛋白质纯化技术,包括溶菌酶处理、超声处理、过滤、盐析、超速离心和色谱法,可以从细胞和/或培养基中回收在宿主细胞中表达的工程化突变型T4 DNA连接酶。用于从细菌(诸如大肠杆菌)裂解和高效提取蛋白质的合适溶液可从圣路易斯的西格玛奥德里奇公司(Sigma-Aldrich)以商品名CelLytic B.TM.商购获得。
用于分离突变型T4 DNA连接酶的色谱技术包括反相色谱、高效液相色谱、离子交换色谱、凝胶电泳和亲和色谱。纯化条件将部分取决于诸如净电荷、疏水性、亲水性、分子量、分子形状的因素,并且对于本领域技术人员将是显而易见的。
在一些实施方式中,亲和技术可以用于分离突变型T4 DNA连接酶。对于亲和色谱纯化,可以使用特异性结合突变型T4 DNA连接酶的任何抗体。为了产生抗体,各种宿主动物(包括但不限于兔、小鼠、大鼠等)可以通过注射化合物进行免疫。所述化合物可以通过侧链官能团或附接至侧链官能团的接头附接至合适的载体,诸如BSA。各种佐剂可以用于增强免疫应答,这取决于宿主种类,包括但不限于弗氏(Freund's)(完全和不完全)、矿物凝胶诸如氢氧化铝、表面活性物质诸如溶血卵磷脂、复合多元醇(pluronic polyols)、聚阴离子、肽、油乳剂、钥孔戚血蓝蛋白(keyhole limpet hemocyanin)、二硝基酚,以及潜在有用的人佐剂诸如BCG(卡介苗)和短小棒状杆菌。
制备T4 DNA连接酶突变体的实例
通过常规PCR诱变生成T4 DNA连接酶突变体,其中引物被设计成包含所需的碱基取代,并且在PCR过程中,突变被并入到扩增子中,从而替换原始序列。所有的T4 DNA连接酶突变体和野生型具有添加的C末端6聚体His标签(SEQ ID NO:26),以便于纯化,前面是Ser和Gly残基的6聚体系列,如图所示。
PCR之后进行DpnI消化,其破坏甲基化模板(不包含取代),从而仅留下具有取代的未甲基化PCR扩增子。
然后将PCR扩增子直接转化到化学感受态大肠杆菌宿主细胞中,其中细菌用化学物质预处理,以使它们能够吸收并入扩增子的质粒。参见ThermoFisher Scientific,化学感受态细胞网页(提供用于生成化学感受态细胞的试剂盒)。
基于具有凝胶电泳的标准连接测定表征和选择由转化的大肠杆菌宿主细胞表达的突变型T4 DNA连接酶多肽。连接酶催化在双螺旋DNA的互补粘性末端或平滑末端的5'与3'末端之间形成磷酸二酯键,并且与不同的T4 DNA连接酶突变体的连接程度可以使用适当的DNA染料在琼脂糖凝胶上可视化。在这种情况下,使用GelRed(加利福尼亚州旧金山的Biotium公司)对凝胶进行染色,以便在UV光下进行可视化。基于其在降低的酶浓度下的连接活性检查每个T4 DNA连接酶突变体的性能,并且将所得的活性与类似稀释的野生型(“WT”)连接酶的活性进行比较,允许确定在相同的条件下与野生型相比哪些突变体显示增加的活性。
如下制备用于表征由转化的大肠杆菌宿主细胞表达的突变型T4 DNA连接酶多肽的连接底物。
所使用的DNA载体是pUC19(新英格兰生物实验室,目录号N3041S)。PUC19是2686个碱基对长的双链环。用具有偏移/不对称切割位点的BsaI-v2(新英格兰生物实验室,目录号R3733S)消化PUC19,其中在识别序列添加一个随机(N1)核苷酸之后切割5'链,并且在互补识别序列添加五个额外的随机(N5)核苷酸之后在3'链上发生切割。5'识别序列是GGTCTC。通过BsaI-/>v2对PUC19的切割位点指定为5'-GGTCTC(N1)/(N5)-3'。
将5μl浓度为1mg/ml的puC19与2.5μl 20000个单位/ml的BsaI-v2、5μl10XrCutSmartTM缓冲液(新英格兰生物实验室,目录号B6004S)(50mM乙酸钾、20mM Tris-乙酸盐、10mM乙酸镁、100μg/ml重组白蛋白)和35μl水组合。在将所有组分组合之后,将组合物在37℃下温育,以进行消化。在37℃下1小时后,将反应在80℃下温育20分钟,以使BsaI-/>v2热失活。然后将混合物用水稀释至浓度为10ng/μl。
实施例:连接测定和结果
如下进行连接程序。将每种T4 DNA连接酶(无论是野生型还是变体)用酶稀释剂(50%甘油,10mM tris-HCl)在连续稀释下进行稀释,使得每个样品获得12个不同的浓度。每个连续稀释的起始浓度为700ng/μ,在每个下一次稀释中稀释两倍,使得最终浓度为稀释之前样品浓度的50%,并且依此类推,总共12个1:2稀释。然后将4μl每种酶或系列稀释液添加到PCR板中。
对于酶,将16μl由2μl of 10X T4 DNA连接酶反应缓冲液(新英格兰生物实验室,目录号B0202A;包括50mM Tris-HCl、10mM MgCl2、1mM ATP、10mM DTT(二硫苏糖醇))、1μl10ng/μl用BsaI-v2消化的pUC19和13μl水组成的主混合物添加到每个反应中。这使得总体积/反应达到20μl。将反应在16℃下温育10分钟。在10分钟的温育期之后,反应接受2分钟的80℃热休克以停止任何进一步的活动。将6μl终止溶液液(120mM EDTA、30%甘油、50mMTris-HCl pH 8.0、0.0125%溴酚蓝、0.1%SDS和5x凝胶红核酸染色剂(加利福尼亚州弗里蒙特的Biotium公司))添加到每个反应中。
使用具有1.2%琼脂糖凝胶的凝胶电泳来对连接反应产物进行可视化。每个凝胶具有野生型T4 DNA连接酶样品系列和7个变体T4 DNA连接酶样品系列。将每个凝胶在180V下运行35分钟。
与野生型比较的结果在图1A和1B中示出。表现出增加的连接活性的鉴定的12个突变体如下:
E89K(SEQ ID NO:4)、E271K(SEQ ID NO:6)、D340R(SEQ ID NO:8)、D371Q(SEQ IDNO:10)、D371R(SEQ ID NO:12)、E419K(SEQ ID NO:14)、E438K(SEQ ID NO:16)、E440R(SEQID NO:18)、E440W(SEQ ID NO:20)、D452R(SEQ ID NO:22)和K470E(SEQ ID NO:24),其中每个具有标签序列:在C末端处添加的GSGSSGHHHHHH(SEQ ID NO:25),如图1A、1B所示。
比较活性结果在下表1中列出,显示每个T4 DNA连接酶突变体的泳道差异,其中每个大于WT的泳道差异被分配2倍的活性增加。例如,相对于WT的1个泳道活性改善被赋予2值(如通过在泳道中很少或没有明显的代表超螺旋质粒产物的上条带和很少或没有明显的代表限制性消化的线性底物质粒的中间条带),而相对于WT的2个泳道活性改善被赋予4值并依此类推。
表1:具有比WT更强的活性的T4 DNA连接酶变体
突变体 | 相对于WT的活性(倍数) |
E89K | 32 |
K271K | 8 |
D340R | 2 |
D371R | 8 |
D371Q | 4 |
E419K | 4 |
E438K | 8 |
E440R | 4 |
E440W | 4 |
K470E | 8 |
D452R | 2 |
SEQ ID NO:1 T4 DNA连接酶CH野生型DNA
SEQ ID NO:2 T4 DNA连接酶CH野生型蛋白质
SEQ ID NO:3 T4 DNA连接酶CH E89K DNA
SEQ ID NO:4 T4 DNA连接酶CH E89K蛋白质
SEQ ID NO:5 T4 DNA连接酶CH E271K DNA
SEQ ID NO:6 T4 DNA连接酶CH E271K蛋白质
SEQ ID NO:7 T4 DNA连接酶CH D340R DNA
SEQ ID NO:8 T4 DNA连接酶CH D340R蛋白质
SEQ ID NO:9 T4 DNA连接酶CH D371Q DNA
SEQ ID NO:10 T4 DNA连接酶CH D371Q蛋白质
SEQ ID NO:11 T4 DNA连接酶CH D371R DNA
SEQ ID NO:12 T4 DNA连接酶CH D371R蛋白质
SEQ ID NO:13 T4 DNA连接酶CH E419K DNA
SEQ ID NO:14 T4 DNA连接酶CH E419K蛋白质
SEQ ID NO:15 T4 DNA连接酶CH E438K DNA
SEQ ID NO:16 T4 DNA连接酶CH E438K蛋白质
SEQ ID NO:17 T4 DNA连接酶CH E440R DNA
SEQ ID NO:18 T4 DNA连接酶CH E440R蛋白质
SEQ ID NO:19 T4 DNA连接酶CH E440W DNA
SEQ ID NO:20 T4 DNA连接酶CH E440W蛋白质
SEQ ID NO:21 T4 DNA连接酶CH D452R DNA
SEQ ID NO:22 T4 DNA连接酶CH D452R蛋白质
SEQ ID NO:23 T4 DNA连接酶CH K470E DNA
SEQ ID NO:24 T4 DNA连接酶CH K470E蛋白质
本文所述的具体方法和组合物是优选实施方式的代表,并且是示例性的且不旨在限制本发明的范围。考虑到本说明书,本领域技术人员将想到其他目的、方面和实施方式,并且包含在由权利要求书的范围所限定的本发明的精神内。对于本领域技术人员来说,将显而易见的是,在不脱离本发明的范围和精神的情况下,可以对本文公开的发明进行各种替换和修改。可以在不存在本文没有明确公开为必要的任何一种或多种要素、或者任何一种或多种限制的情况下适当地实践本文说明性描述的本发明。因此,例如,在本文的每种情况下,在本发明的实施方式或实例中,术语“包含”、“包括”、“含有”等中的任一项都应该被广泛而无限制地阅读。本文中示例性描述的方法和过程可以不同的步骤顺序适当地实施,并且它们不必限于本文或权利要求中指示的步骤顺序。还应注意,除非上下文另外清楚地指出,否则如在本文以及在所附权利要求所用的,单数形式“一个/一种”以及“所述”包括复数指示物,并且复数包括单数形式。在任何情况下,都不能将本专利申请解释为限于本文具体公开的具体实例或实施方式或方法。在任何情况下,本专利申请都不应被解释为受专利商标局的任何审查员或任何其他官员或雇员所做的任何声明的限制,除非此声明由申请人在响应性书面材料中明确且无条件或专门保留地采用。
已经在本文中宽泛且概括地对本发明进行了描述。落在整个公开文本之内的更窄的种类和亚属的分类中的每一个也形成了本发明的一部分。已经采用的术语和表达被用作描述的术语且没有限制性,并且并非旨在使用此类术语和表达而将所示出和描述的特征的任何等效物或其部分排除在外,但是将认识到的是提出要求保护的本发明范围内的不同修改是可能的。因此,应当理解,虽然已经通过优选实施方式和任选特征具体地公开了本发明,但本领域技术人员可以采用本文所公开概念的修改和变更,包括但不限于变体序列,并且认为此类修改和变更在由所附权利要求限定的本发明范围内。
序列表
<110> 武汉爱博泰克生物科技有限公司
<120> 具有增加的连接效率的T4 DNA连接酶变体
<130> ABCL-T4HIACT
<140> 17/699,354
<141> 2022-03-21
<160> 26
<170> PatentIn version 3.5
<210> 1
<211> 1500
<212> DNA
<213> 未知的
<220>
<223> 未知的描述:
噬菌体T4序列
<400> 1
atgattctta aaattctgaa cgaaatagca tctattggtt caactaaaca gaagcaagca 60
attcttgaaa agaataaaga taatgaattg cttaaacgag tatatcgtct gacttattct 120
cgtgggttac agtattatat caagaaatgg cctaaacctg gtattgctac ccagagtttt 180
ggaatgttga ctcttaccga tatgcttgac ttcattgaat tcacattagc tactcggaaa 240
ttgactggaa atgcagcaat tgaggaatta actggatata tcaccgatgg taaaaaagat 300
gatgttgaag ttttgcgtcg agtgatgatg cgagaccttg aatgtggtgc ttcagtatct 360
attgcaaaca aagtttggcc aggtttaatt cctgaacaac ctcaaatgct cgcaagttct 420
tatgatgaaa aaggcattaa taagaatatc aaatttccag cctttgctca gttaaaagct 480
gatggagctc ggtgttttgc tgaagttaga ggtgatgaat tagatgatgt tcgtctttta 540
tcacgagctg gtaatgaata tctaggatta gatcttctta aggaagagtt aattaaaatg 600
accgctgaag cccgccagat tcatccagaa ggtgtgttga ttgatggcga attggtatac 660
catgagcaag ttaaaaagga gccagaaggc ctagattttc tttttgatgc ttatcctgaa 720
aacagtaaag ctaaagaatt cgccgaagta gctgaatcac gtactgcttc taatggaatc 780
gccaataaat ctttaaaggg aaccatttct gaaaaagaag cacaatgcat gaagtttcag 840
gtctgggatt atgtcccgtt ggtagaaata tacagtcttc ctgcatttcg tttgaaatat 900
gatgtacgtt tttctaaact agaacaaatg acatctggat atgataaagt aattttaatt 960
gaaaaccagg tagtaaataa cctagatgaa gctaaggtaa tttataaaaa gtatattgac 1020
caaggtcttg aaggtattat tctcaaaaat atcgatggat tatgggaaaa tgctcgttca 1080
aaaaatcttt ataaatttaa agaagtaatt gatgttgatt taaaaattgt aggaatttat 1140
cctcaccgta aagaccctac taaagcgggt ggatttattc ttgagtcaga gtgtggaaaa 1200
attaaggtaa atgctggttc aggcttaaaa gataaagccg gtgtaaaatc gcatgaactt 1260
gaccgtactc gcattatgga aaaccaaaat tattatattg gaaaaattct agagtgcgaa 1320
tgcaacggtt ggttaaaatc tgatggccgc actgattacg ttaaattatt tcttccgatt 1380
gcgattcgtt tacgtgaaga taaaactaaa gctaatacat tcgaagatgt atttggtgat 1440
tttcatgagg taactggtct aggttctggc agttcaggtc atcaccacca tcatcactaa 1500
<210> 2
<211> 499
<212> PRT
<213> 未知的
<220>
<223> 未知的描述:
噬菌体T4序列
<400> 2
Met Ile Leu Lys Ile Leu Asn Glu Ile Ala Ser Ile Gly Ser Thr Lys
1 5 10 15
Gln Lys Gln Ala Ile Leu Glu Lys Asn Lys Asp Asn Glu Leu Leu Lys
20 25 30
Arg Val Tyr Arg Leu Thr Tyr Ser Arg Gly Leu Gln Tyr Tyr Ile Lys
35 40 45
Lys Trp Pro Lys Pro Gly Ile Ala Thr Gln Ser Phe Gly Met Leu Thr
50 55 60
Leu Thr Asp Met Leu Asp Phe Ile Glu Phe Thr Leu Ala Thr Arg Lys
65 70 75 80
Leu Thr Gly Asn Ala Ala Ile Glu Glu Leu Thr Gly Tyr Ile Thr Asp
85 90 95
Gly Lys Lys Asp Asp Val Glu Val Leu Arg Arg Val Met Met Arg Asp
100 105 110
Leu Glu Cys Gly Ala Ser Val Ser Ile Ala Asn Lys Val Trp Pro Gly
115 120 125
Leu Ile Pro Glu Gln Pro Gln Met Leu Ala Ser Ser Tyr Asp Glu Lys
130 135 140
Gly Ile Asn Lys Asn Ile Lys Phe Pro Ala Phe Ala Gln Leu Lys Ala
145 150 155 160
Asp Gly Ala Arg Cys Phe Ala Glu Val Arg Gly Asp Glu Leu Asp Asp
165 170 175
Val Arg Leu Leu Ser Arg Ala Gly Asn Glu Tyr Leu Gly Leu Asp Leu
180 185 190
Leu Lys Glu Glu Leu Ile Lys Met Thr Ala Glu Ala Arg Gln Ile His
195 200 205
Pro Glu Gly Val Leu Ile Asp Gly Glu Leu Val Tyr His Glu Gln Val
210 215 220
Lys Lys Glu Pro Glu Gly Leu Asp Phe Leu Phe Asp Ala Tyr Pro Glu
225 230 235 240
Asn Ser Lys Ala Lys Glu Phe Ala Glu Val Ala Glu Ser Arg Thr Ala
245 250 255
Ser Asn Gly Ile Ala Asn Lys Ser Leu Lys Gly Thr Ile Ser Glu Lys
260 265 270
Glu Ala Gln Cys Met Lys Phe Gln Val Trp Asp Tyr Val Pro Leu Val
275 280 285
Glu Ile Tyr Ser Leu Pro Ala Phe Arg Leu Lys Tyr Asp Val Arg Phe
290 295 300
Ser Lys Leu Glu Gln Met Thr Ser Gly Tyr Asp Lys Val Ile Leu Ile
305 310 315 320
Glu Asn Gln Val Val Asn Asn Leu Asp Glu Ala Lys Val Ile Tyr Lys
325 330 335
Lys Tyr Ile Asp Gln Gly Leu Glu Gly Ile Ile Leu Lys Asn Ile Asp
340 345 350
Gly Leu Trp Glu Asn Ala Arg Ser Lys Asn Leu Tyr Lys Phe Lys Glu
355 360 365
Val Ile Asp Val Asp Leu Lys Ile Val Gly Ile Tyr Pro His Arg Lys
370 375 380
Asp Pro Thr Lys Ala Gly Gly Phe Ile Leu Glu Ser Glu Cys Gly Lys
385 390 395 400
Ile Lys Val Asn Ala Gly Ser Gly Leu Lys Asp Lys Ala Gly Val Lys
405 410 415
Ser His Glu Leu Asp Arg Thr Arg Ile Met Glu Asn Gln Asn Tyr Tyr
420 425 430
Ile Gly Lys Ile Leu Glu Cys Glu Cys Asn Gly Trp Leu Lys Ser Asp
435 440 445
Gly Arg Thr Asp Tyr Val Lys Leu Phe Leu Pro Ile Ala Ile Arg Leu
450 455 460
Arg Glu Asp Lys Thr Lys Ala Asn Thr Phe Glu Asp Val Phe Gly Asp
465 470 475 480
Phe His Glu Val Thr Gly Leu Gly Ser Gly Ser Ser Gly His His His
485 490 495
His His His
<210> 3
<211> 1500
<212> DNA
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多核苷酸
<400> 3
atgattctta aaattctgaa cgaaatagca tctattggtt caactaaaca gaagcaagca 60
attcttgaaa agaataaaga taatgaattg cttaaacgag tatatcgtct gacttattct 120
cgtgggttac agtattatat caagaaatgg cctaaacctg gtattgctac ccagagtttt 180
ggaatgttga ctcttaccga tatgcttgac ttcattgaat tcacattagc tactcggaaa 240
ttgactggaa atgcagcaat tgaaaaatta actggatata tcaccgatgg taaaaaagat 300
gatgttgaag ttttgcgtcg agtgatgatg cgagaccttg aatgtggtgc ttcagtatct 360
attgcaaaca aagtttggcc aggtttaatt cctgaacaac ctcaaatgct cgcaagttct 420
tatgatgaaa aaggcattaa taagaatatc aaatttccag cctttgctca gttaaaagct 480
gatggagctc ggtgttttgc tgaagttaga ggtgatgaat tagatgatgt tcgtctttta 540
tcacgagctg gtaatgaata tctaggatta gatcttctta aggaagagtt aattaaaatg 600
accgctgaag cccgccagat tcatccagaa ggtgtgttga ttgatggcga attggtatac 660
catgagcaag ttaaaaagga gccagaaggc ctagattttc tttttgatgc ttatcctgaa 720
aacagtaaag ctaaagaatt cgccgaagta gctgaatcac gtactgcttc taatggaatc 780
gccaataaat ctttaaaggg aaccatttct gaaaaagaag cacaatgcat gaagtttcag 840
gtctgggatt atgtcccgtt ggtagaaata tacagtcttc ctgcatttcg tttgaaatat 900
gatgtacgtt tttctaaact agaacaaatg acatctggat atgataaagt aattttaatt 960
gaaaaccagg tagtaaataa cctagatgaa gctaaggtaa tttataaaaa gtatattgac 1020
caaggtcttg aaggtattat tctcaaaaat atcgatggat tatgggaaaa tgctcgttca 1080
aaaaatcttt ataaatttaa agaagtaatt gatgttgatt taaaaattgt aggaatttat 1140
cctcaccgta aagaccctac taaagcgggt ggatttattc ttgagtcaga gtgtggaaaa 1200
attaaggtaa atgctggttc aggcttaaaa gataaagccg gtgtaaaatc gcatgaactt 1260
gaccgtactc gcattatgga aaaccaaaat tattatattg gaaaaattct agagtgcgaa 1320
tgcaacggtt ggttaaaatc tgatggccgc actgattacg ttaaattatt tcttccgatt 1380
gcgattcgtt tacgtgaaga taaaactaaa gctaatacat tcgaagatgt atttggtgat 1440
tttcatgagg taactggtct aggttctggc agttcaggtc atcaccacca tcatcactaa 1500
<210> 4
<211> 499
<212> PRT
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多肽
<400> 4
Met Ile Leu Lys Ile Leu Asn Glu Ile Ala Ser Ile Gly Ser Thr Lys
1 5 10 15
Gln Lys Gln Ala Ile Leu Glu Lys Asn Lys Asp Asn Glu Leu Leu Lys
20 25 30
Arg Val Tyr Arg Leu Thr Tyr Ser Arg Gly Leu Gln Tyr Tyr Ile Lys
35 40 45
Lys Trp Pro Lys Pro Gly Ile Ala Thr Gln Ser Phe Gly Met Leu Thr
50 55 60
Leu Thr Asp Met Leu Asp Phe Ile Glu Phe Thr Leu Ala Thr Arg Lys
65 70 75 80
Leu Thr Gly Asn Ala Ala Ile Glu Lys Leu Thr Gly Tyr Ile Thr Asp
85 90 95
Gly Lys Lys Asp Asp Val Glu Val Leu Arg Arg Val Met Met Arg Asp
100 105 110
Leu Glu Cys Gly Ala Ser Val Ser Ile Ala Asn Lys Val Trp Pro Gly
115 120 125
Leu Ile Pro Glu Gln Pro Gln Met Leu Ala Ser Ser Tyr Asp Glu Lys
130 135 140
Gly Ile Asn Lys Asn Ile Lys Phe Pro Ala Phe Ala Gln Leu Lys Ala
145 150 155 160
Asp Gly Ala Arg Cys Phe Ala Glu Val Arg Gly Asp Glu Leu Asp Asp
165 170 175
Val Arg Leu Leu Ser Arg Ala Gly Asn Glu Tyr Leu Gly Leu Asp Leu
180 185 190
Leu Lys Glu Glu Leu Ile Lys Met Thr Ala Glu Ala Arg Gln Ile His
195 200 205
Pro Glu Gly Val Leu Ile Asp Gly Glu Leu Val Tyr His Glu Gln Val
210 215 220
Lys Lys Glu Pro Glu Gly Leu Asp Phe Leu Phe Asp Ala Tyr Pro Glu
225 230 235 240
Asn Ser Lys Ala Lys Glu Phe Ala Glu Val Ala Glu Ser Arg Thr Ala
245 250 255
Ser Asn Gly Ile Ala Asn Lys Ser Leu Lys Gly Thr Ile Ser Glu Lys
260 265 270
Glu Ala Gln Cys Met Lys Phe Gln Val Trp Asp Tyr Val Pro Leu Val
275 280 285
Glu Ile Tyr Ser Leu Pro Ala Phe Arg Leu Lys Tyr Asp Val Arg Phe
290 295 300
Ser Lys Leu Glu Gln Met Thr Ser Gly Tyr Asp Lys Val Ile Leu Ile
305 310 315 320
Glu Asn Gln Val Val Asn Asn Leu Asp Glu Ala Lys Val Ile Tyr Lys
325 330 335
Lys Tyr Ile Asp Gln Gly Leu Glu Gly Ile Ile Leu Lys Asn Ile Asp
340 345 350
Gly Leu Trp Glu Asn Ala Arg Ser Lys Asn Leu Tyr Lys Phe Lys Glu
355 360 365
Val Ile Asp Val Asp Leu Lys Ile Val Gly Ile Tyr Pro His Arg Lys
370 375 380
Asp Pro Thr Lys Ala Gly Gly Phe Ile Leu Glu Ser Glu Cys Gly Lys
385 390 395 400
Ile Lys Val Asn Ala Gly Ser Gly Leu Lys Asp Lys Ala Gly Val Lys
405 410 415
Ser His Glu Leu Asp Arg Thr Arg Ile Met Glu Asn Gln Asn Tyr Tyr
420 425 430
Ile Gly Lys Ile Leu Glu Cys Glu Cys Asn Gly Trp Leu Lys Ser Asp
435 440 445
Gly Arg Thr Asp Tyr Val Lys Leu Phe Leu Pro Ile Ala Ile Arg Leu
450 455 460
Arg Glu Asp Lys Thr Lys Ala Asn Thr Phe Glu Asp Val Phe Gly Asp
465 470 475 480
Phe His Glu Val Thr Gly Leu Gly Ser Gly Ser Ser Gly His His His
485 490 495
His His His
<210> 5
<211> 1500
<212> DNA
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多核苷酸
<400> 5
atgattctta aaattctgaa cgaaatagca tctattggtt caactaaaca gaagcaagca 60
attcttgaaa agaataaaga taatgaattg cttaaacgag tatatcgtct gacttattct 120
cgtgggttac agtattatat caagaaatgg cctaaacctg gtattgctac ccagagtttt 180
ggaatgttga ctcttaccga tatgcttgac ttcattgaat tcacattagc tactcggaaa 240
ttgactggaa atgcagcaat tgaggaatta actggatata tcaccgatgg taaaaaagat 300
gatgttgaag ttttgcgtcg agtgatgatg cgagaccttg aatgtggtgc ttcagtatct 360
attgcaaaca aagtttggcc aggtttaatt cctgaacaac ctcaaatgct cgcaagttct 420
tatgatgaaa aaggcattaa taagaatatc aaatttccag cctttgctca gttaaaagct 480
gatggagctc ggtgttttgc tgaagttaga ggtgatgaat tagatgatgt tcgtctttta 540
tcacgagctg gtaatgaata tctaggatta gatcttctta aggaagagtt aattaaaatg 600
accgctgaag cccgccagat tcatccagaa ggtgtgttga ttgatggcga attggtatac 660
catgagcaag ttaaaaagga gccagaaggc ctagattttc tttttgatgc ttatcctgaa 720
aacagtaaag ctaaagaatt cgccgaagta gctgaatcac gtactgcttc taatggaatc 780
gccaataaat ctttaaaggg aaccatttct aaaaaagaag cacaatgcat gaagtttcag 840
gtctgggatt atgtcccgtt ggtagaaata tacagtcttc ctgcatttcg tttgaaatat 900
gatgtacgtt tttctaaact agaacaaatg acatctggat atgataaagt aattttaatt 960
gaaaaccagg tagtaaataa cctagatgaa gctaaggtaa tttataaaaa gtatattgac 1020
caaggtcttg aaggtattat tctcaaaaat atcgatggat tatgggaaaa tgctcgttca 1080
aaaaatcttt ataaatttaa agaagtaatt gatgttgatt taaaaattgt aggaatttat 1140
cctcaccgta aagaccctac taaagcgggt ggatttattc ttgagtcaga gtgtggaaaa 1200
attaaggtaa atgctggttc aggcttaaaa gataaagccg gtgtaaaatc gcatgaactt 1260
gaccgtactc gcattatgga aaaccaaaat tattatattg gaaaaattct agagtgcgaa 1320
tgcaacggtt ggttaaaatc tgatggccgc actgattacg ttaaattatt tcttccgatt 1380
gcgattcgtt tacgtgaaga taaaactaaa gctaatacat tcgaagatgt atttggtgat 1440
tttcatgagg taactggtct aggttctggc agttcaggtc atcaccacca tcatcactaa 1500
<210> 6
<211> 499
<212> PRT
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多肽
<400> 6
Met Ile Leu Lys Ile Leu Asn Glu Ile Ala Ser Ile Gly Ser Thr Lys
1 5 10 15
Gln Lys Gln Ala Ile Leu Glu Lys Asn Lys Asp Asn Glu Leu Leu Lys
20 25 30
Arg Val Tyr Arg Leu Thr Tyr Ser Arg Gly Leu Gln Tyr Tyr Ile Lys
35 40 45
Lys Trp Pro Lys Pro Gly Ile Ala Thr Gln Ser Phe Gly Met Leu Thr
50 55 60
Leu Thr Asp Met Leu Asp Phe Ile Glu Phe Thr Leu Ala Thr Arg Lys
65 70 75 80
Leu Thr Gly Asn Ala Ala Ile Glu Glu Leu Thr Gly Tyr Ile Thr Asp
85 90 95
Gly Lys Lys Asp Asp Val Glu Val Leu Arg Arg Val Met Met Arg Asp
100 105 110
Leu Glu Cys Gly Ala Ser Val Ser Ile Ala Asn Lys Val Trp Pro Gly
115 120 125
Leu Ile Pro Glu Gln Pro Gln Met Leu Ala Ser Ser Tyr Asp Glu Lys
130 135 140
Gly Ile Asn Lys Asn Ile Lys Phe Pro Ala Phe Ala Gln Leu Lys Ala
145 150 155 160
Asp Gly Ala Arg Cys Phe Ala Glu Val Arg Gly Asp Glu Leu Asp Asp
165 170 175
Val Arg Leu Leu Ser Arg Ala Gly Asn Glu Tyr Leu Gly Leu Asp Leu
180 185 190
Leu Lys Glu Glu Leu Ile Lys Met Thr Ala Glu Ala Arg Gln Ile His
195 200 205
Pro Glu Gly Val Leu Ile Asp Gly Glu Leu Val Tyr His Glu Gln Val
210 215 220
Lys Lys Glu Pro Glu Gly Leu Asp Phe Leu Phe Asp Ala Tyr Pro Glu
225 230 235 240
Asn Ser Lys Ala Lys Glu Phe Ala Glu Val Ala Glu Ser Arg Thr Ala
245 250 255
Ser Asn Gly Ile Ala Asn Lys Ser Leu Lys Gly Thr Ile Ser Lys Lys
260 265 270
Glu Ala Gln Cys Met Lys Phe Gln Val Trp Asp Tyr Val Pro Leu Val
275 280 285
Glu Ile Tyr Ser Leu Pro Ala Phe Arg Leu Lys Tyr Asp Val Arg Phe
290 295 300
Ser Lys Leu Glu Gln Met Thr Ser Gly Tyr Asp Lys Val Ile Leu Ile
305 310 315 320
Glu Asn Gln Val Val Asn Asn Leu Asp Glu Ala Lys Val Ile Tyr Lys
325 330 335
Lys Tyr Ile Asp Gln Gly Leu Glu Gly Ile Ile Leu Lys Asn Ile Asp
340 345 350
Gly Leu Trp Glu Asn Ala Arg Ser Lys Asn Leu Tyr Lys Phe Lys Glu
355 360 365
Val Ile Asp Val Asp Leu Lys Ile Val Gly Ile Tyr Pro His Arg Lys
370 375 380
Asp Pro Thr Lys Ala Gly Gly Phe Ile Leu Glu Ser Glu Cys Gly Lys
385 390 395 400
Ile Lys Val Asn Ala Gly Ser Gly Leu Lys Asp Lys Ala Gly Val Lys
405 410 415
Ser His Glu Leu Asp Arg Thr Arg Ile Met Glu Asn Gln Asn Tyr Tyr
420 425 430
Ile Gly Lys Ile Leu Glu Cys Glu Cys Asn Gly Trp Leu Lys Ser Asp
435 440 445
Gly Arg Thr Asp Tyr Val Lys Leu Phe Leu Pro Ile Ala Ile Arg Leu
450 455 460
Arg Glu Asp Lys Thr Lys Ala Asn Thr Phe Glu Asp Val Phe Gly Asp
465 470 475 480
Phe His Glu Val Thr Gly Leu Gly Ser Gly Ser Ser Gly His His His
485 490 495
His His His
<210> 7
<211> 1500
<212> DNA
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多核苷酸
<400> 7
atgattctta aaattctgaa cgaaatagca tctattggtt caactaaaca gaagcaagca 60
attcttgaaa agaataaaga taatgaattg cttaaacgag tatatcgtct gacttattct 120
cgtgggttac agtattatat caagaaatgg cctaaacctg gtattgctac ccagagtttt 180
ggaatgttga ctcttaccga tatgcttgac ttcattgaat tcacattagc tactcggaaa 240
ttgactggaa atgcagcaat tgaggaatta actggatata tcaccgatgg taaaaaagat 300
gatgttgaag ttttgcgtcg agtgatgatg cgagaccttg aatgtggtgc ttcagtatct 360
attgcaaaca aagtttggcc aggtttaatt cctgaacaac ctcaaatgct cgcaagttct 420
tatgatgaaa aaggcattaa taagaatatc aaatttccag cctttgctca gttaaaagct 480
gatggagctc ggtgttttgc tgaagttaga ggtgatgaat tagatgatgt tcgtctttta 540
tcacgagctg gtaatgaata tctaggatta gatcttctta aggaagagtt aattaaaatg 600
accgctgaag cccgccagat tcatccagaa ggtgtgttga ttgatggcga attggtatac 660
catgagcaag ttaaaaagga gccagaaggc ctagattttc tttttgatgc ttatcctgaa 720
aacagtaaag ctaaagaatt cgccgaagta gctgaatcac gtactgcttc taatggaatc 780
gccaataaat ctttaaaggg aaccatttct gaaaaagaag cacaatgcat gaagtttcag 840
gtctgggatt atgtcccgtt ggtagaaata tacagtcttc ctgcatttcg tttgaaatat 900
gatgtacgtt tttctaaact agaacaaatg acatctggat atgataaagt aattttaatt 960
gaaaaccagg tagtaaataa cctagatgaa gctaaggtaa tttataaaaa gtatattcgt 1020
caaggtcttg aaggtattat tctcaaaaat atcgatggat tatgggaaaa tgctcgttca 1080
aaaaatcttt ataaatttaa agaagtaatt gatgttgatt taaaaattgt aggaatttat 1140
cctcaccgta aagaccctac taaagcgggt ggatttattc ttgagtcaga gtgtggaaaa 1200
attaaggtaa atgctggttc aggcttaaaa gataaagccg gtgtaaaatc gcatgaactt 1260
gaccgtactc gcattatgga aaaccaaaat tattatattg gaaaaattct agagtgcgaa 1320
tgcaacggtt ggttaaaatc tgatggccgc actgattacg ttaaattatt tcttccgatt 1380
gcgattcgtt tacgtgaaga taaaactaaa gctaatacat tcgaagatgt atttggtgat 1440
tttcatgagg taactggtct aggttctggc agttcaggtc atcaccacca tcatcactaa 1500
<210> 8
<211> 499
<212> PRT
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多肽
<400> 8
Met Ile Leu Lys Ile Leu Asn Glu Ile Ala Ser Ile Gly Ser Thr Lys
1 5 10 15
Gln Lys Gln Ala Ile Leu Glu Lys Asn Lys Asp Asn Glu Leu Leu Lys
20 25 30
Arg Val Tyr Arg Leu Thr Tyr Ser Arg Gly Leu Gln Tyr Tyr Ile Lys
35 40 45
Lys Trp Pro Lys Pro Gly Ile Ala Thr Gln Ser Phe Gly Met Leu Thr
50 55 60
Leu Thr Asp Met Leu Asp Phe Ile Glu Phe Thr Leu Ala Thr Arg Lys
65 70 75 80
Leu Thr Gly Asn Ala Ala Ile Glu Glu Leu Thr Gly Tyr Ile Thr Asp
85 90 95
Gly Lys Lys Asp Asp Val Glu Val Leu Arg Arg Val Met Met Arg Asp
100 105 110
Leu Glu Cys Gly Ala Ser Val Ser Ile Ala Asn Lys Val Trp Pro Gly
115 120 125
Leu Ile Pro Glu Gln Pro Gln Met Leu Ala Ser Ser Tyr Asp Glu Lys
130 135 140
Gly Ile Asn Lys Asn Ile Lys Phe Pro Ala Phe Ala Gln Leu Lys Ala
145 150 155 160
Asp Gly Ala Arg Cys Phe Ala Glu Val Arg Gly Asp Glu Leu Asp Asp
165 170 175
Val Arg Leu Leu Ser Arg Ala Gly Asn Glu Tyr Leu Gly Leu Asp Leu
180 185 190
Leu Lys Glu Glu Leu Ile Lys Met Thr Ala Glu Ala Arg Gln Ile His
195 200 205
Pro Glu Gly Val Leu Ile Asp Gly Glu Leu Val Tyr His Glu Gln Val
210 215 220
Lys Lys Glu Pro Glu Gly Leu Asp Phe Leu Phe Asp Ala Tyr Pro Glu
225 230 235 240
Asn Ser Lys Ala Lys Glu Phe Ala Glu Val Ala Glu Ser Arg Thr Ala
245 250 255
Ser Asn Gly Ile Ala Asn Lys Ser Leu Lys Gly Thr Ile Ser Glu Lys
260 265 270
Glu Ala Gln Cys Met Lys Phe Gln Val Trp Asp Tyr Val Pro Leu Val
275 280 285
Glu Ile Tyr Ser Leu Pro Ala Phe Arg Leu Lys Tyr Asp Val Arg Phe
290 295 300
Ser Lys Leu Glu Gln Met Thr Ser Gly Tyr Asp Lys Val Ile Leu Ile
305 310 315 320
Glu Asn Gln Val Val Asn Asn Leu Asp Glu Ala Lys Val Ile Tyr Lys
325 330 335
Lys Tyr Ile Arg Gln Gly Leu Glu Gly Ile Ile Leu Lys Asn Ile Asp
340 345 350
Gly Leu Trp Glu Asn Ala Arg Ser Lys Asn Leu Tyr Lys Phe Lys Glu
355 360 365
Val Ile Asp Val Asp Leu Lys Ile Val Gly Ile Tyr Pro His Arg Lys
370 375 380
Asp Pro Thr Lys Ala Gly Gly Phe Ile Leu Glu Ser Glu Cys Gly Lys
385 390 395 400
Ile Lys Val Asn Ala Gly Ser Gly Leu Lys Asp Lys Ala Gly Val Lys
405 410 415
Ser His Glu Leu Asp Arg Thr Arg Ile Met Glu Asn Gln Asn Tyr Tyr
420 425 430
Ile Gly Lys Ile Leu Glu Cys Glu Cys Asn Gly Trp Leu Lys Ser Asp
435 440 445
Gly Arg Thr Asp Tyr Val Lys Leu Phe Leu Pro Ile Ala Ile Arg Leu
450 455 460
Arg Glu Asp Lys Thr Lys Ala Asn Thr Phe Glu Asp Val Phe Gly Asp
465 470 475 480
Phe His Glu Val Thr Gly Leu Gly Ser Gly Ser Ser Gly His His His
485 490 495
His His His
<210> 9
<211> 1499
<212> DNA
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多核苷酸
<400> 9
atgattctta aaattctgaa cgaaatagca tctattggtt caactaaaca gaagcaagca 60
attcttgaaa agaataaaga taatgaattg cttaaacgag tatatcgtct gacttattct 120
cgtgggttac agtattatat caagaaatgg cctaaacctg gtattgctac ccagagtttt 180
ggaatgttga ctcttaccga tatgcttgac ttcattgaat tcacattagc tactcggaaa 240
ttgactggaa atgcagcaat tgaggaatta actggatata tcaccgatgg taaaaaagat 300
gatgttgaag ttttgcgtcg agtgatgatg cgagaccttg aatgtggtgc ttcagtatct 360
attgcaaaca aagtttggcc aggtttaatt cctgaacaac ctcaaatgct cgcaagttct 420
tatgatgaaa aaggcattaa taagaatatc aaatttccag cctttgctca gttaaaagct 480
gatggagctc ggtgttttgc tgaagttaga ggtgatgaat tagatgatgt tcgtctttta 540
tcacgagctg gtaatgaata tctaggatta gatcttctta aggaagagtt aattaaaatg 600
accgctgaag cccgccagat tcatccagaa ggtgtgttga ttgatggcga attggtatac 660
catgagcaag ttaaaaagga gccagaaggc ctagattttc tttttgatgc ttatcctgaa 720
aacagtaaag ctaaagaatt cgccgaagta gctgaatcac gtactgcttc taatggaatc 780
gccaataaat ctttaaaggg aaccatttct gaaaaagaag cacaatgcat gaagtttcag 840
gtctgggatt atgtcccgtt ggtagaaata tacagtcttc ctgcatttcg tttgaaatat 900
gatgtacgtt tttctaaact agaacaaatg acatctggat atgataaagt aattttaatt 960
gaaaaccagg tagtaaataa cctagatgaa gctaaggtaa tttataaaaa gtatattgac 1020
caaggtcttg aaggtattat tctcaaaaat atcgatggat tatgggaaaa tgctcgttca 1080
aaaaatcttt ataaatttaa agaagtaatt caattgattt aaaaattgta ggaatttatc 1140
ctcaccgtaa agaccctact aaagcgggtg gatttattct tgagtcagag tgtggaaaaa 1200
ttaaggtaaa tgctggttca ggcttaaaag ataaagccgg tgtaaaatcg catgaacttg 1260
accgtactcg cattatggaa aaccaaaatt attatattgg aaaaattcta gagtgcgaat 1320
gcaacggttg gttaaaatct gatggccgca ctgattacgt taaattattt cttccgattg 1380
cgattcgttt acgtgaagat aaaactaaag ctaatacatt cgaagatgta tttggtgatt 1440
ttcatgaggt aactggtcta ggttctggca gttcaggtca tcaccaccat catcactaa 1499
<210> 10
<211> 499
<212> PRT
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多肽
<400> 10
Met Ile Leu Lys Ile Leu Asn Glu Ile Ala Ser Ile Gly Ser Thr Lys
1 5 10 15
Gln Lys Gln Ala Ile Leu Glu Lys Asn Lys Asp Asn Glu Leu Leu Lys
20 25 30
Arg Val Tyr Arg Leu Thr Tyr Ser Arg Gly Leu Gln Tyr Tyr Ile Lys
35 40 45
Lys Trp Pro Lys Pro Gly Ile Ala Thr Gln Ser Phe Gly Met Leu Thr
50 55 60
Leu Thr Asp Met Leu Asp Phe Ile Glu Phe Thr Leu Ala Thr Arg Lys
65 70 75 80
Leu Thr Gly Asn Ala Ala Ile Glu Glu Leu Thr Gly Tyr Ile Thr Asp
85 90 95
Gly Lys Lys Asp Asp Val Glu Val Leu Arg Arg Val Met Met Arg Asp
100 105 110
Leu Glu Cys Gly Ala Ser Val Ser Ile Ala Asn Lys Val Trp Pro Gly
115 120 125
Leu Ile Pro Glu Gln Pro Gln Met Leu Ala Ser Ser Tyr Asp Glu Lys
130 135 140
Gly Ile Asn Lys Asn Ile Lys Phe Pro Ala Phe Ala Gln Leu Lys Ala
145 150 155 160
Asp Gly Ala Arg Cys Phe Ala Glu Val Arg Gly Asp Glu Leu Asp Asp
165 170 175
Val Arg Leu Leu Ser Arg Ala Gly Asn Glu Tyr Leu Gly Leu Asp Leu
180 185 190
Leu Lys Glu Glu Leu Ile Lys Met Thr Ala Glu Ala Arg Gln Ile His
195 200 205
Pro Glu Gly Val Leu Ile Asp Gly Glu Leu Val Tyr His Glu Gln Val
210 215 220
Lys Lys Glu Pro Glu Gly Leu Asp Phe Leu Phe Asp Ala Tyr Pro Glu
225 230 235 240
Asn Ser Lys Ala Lys Glu Phe Ala Glu Val Ala Glu Ser Arg Thr Ala
245 250 255
Ser Asn Gly Ile Ala Asn Lys Ser Leu Lys Gly Thr Ile Ser Glu Lys
260 265 270
Glu Ala Gln Cys Met Lys Phe Gln Val Trp Asp Tyr Val Pro Leu Val
275 280 285
Glu Ile Tyr Ser Leu Pro Ala Phe Arg Leu Lys Tyr Asp Val Arg Phe
290 295 300
Ser Lys Leu Glu Gln Met Thr Ser Gly Tyr Asp Lys Val Ile Leu Ile
305 310 315 320
Glu Asn Gln Val Val Asn Asn Leu Asp Glu Ala Lys Val Ile Tyr Lys
325 330 335
Lys Tyr Ile Asp Gln Gly Leu Glu Gly Ile Ile Leu Lys Asn Ile Asp
340 345 350
Gly Leu Trp Glu Asn Ala Arg Ser Lys Asn Leu Tyr Lys Phe Lys Glu
355 360 365
Val Ile Gln Val Asp Leu Lys Ile Val Gly Ile Tyr Pro His Arg Lys
370 375 380
Asp Pro Thr Lys Ala Gly Gly Phe Ile Leu Glu Ser Glu Cys Gly Lys
385 390 395 400
Ile Lys Val Asn Ala Gly Ser Gly Leu Lys Asp Lys Ala Gly Val Lys
405 410 415
Ser His Glu Leu Asp Arg Thr Arg Ile Met Glu Asn Gln Asn Tyr Tyr
420 425 430
Ile Gly Lys Ile Leu Glu Cys Glu Cys Asn Gly Trp Leu Lys Ser Asp
435 440 445
Gly Arg Thr Asp Tyr Val Lys Leu Phe Leu Pro Ile Ala Ile Arg Leu
450 455 460
Arg Glu Asp Lys Thr Lys Ala Asn Thr Phe Glu Asp Val Phe Gly Asp
465 470 475 480
Phe His Glu Val Thr Gly Leu Gly Ser Gly Ser Ser Gly His His His
485 490 495
His His His
<210> 11
<211> 1500
<212> DNA
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多核苷酸
<400> 11
atgattctta aaattctgaa cgaaatagca tctattggtt caactaaaca gaagcaagca 60
attcttgaaa agaataaaga taatgaattg cttaaacgag tatatcgtct gacttattct 120
cgtgggttac agtattatat caagaaatgg cctaaacctg gtattgctac ccagagtttt 180
ggaatgttga ctcttaccga tatgcttgac ttcattgaat tcacattagc tactcggaaa 240
ttgactggaa atgcagcaat tgaggaatta actggatata tcaccgatgg taaaaaagat 300
gatgttgaag ttttgcgtcg agtgatgatg cgagaccttg aatgtggtgc ttcagtatct 360
attgcaaaca aagtttggcc aggtttaatt cctgaacaac ctcaaatgct cgcaagttct 420
tatgatgaaa aaggcattaa taagaatatc aaatttccag cctttgctca gttaaaagct 480
gatggagctc ggtgttttgc tgaagttaga ggtgatgaat tagatgatgt tcgtctttta 540
tcacgagctg gtaatgaata tctaggatta gatcttctta aggaagagtt aattaaaatg 600
accgctgaag cccgccagat tcatccagaa ggtgtgttga ttgatggcga attggtatac 660
catgagcaag ttaaaaagga gccagaaggc ctagattttc tttttgatgc ttatcctgaa 720
aacagtaaag ctaaagaatt cgccgaagta gctgaatcac gtactgcttc taatggaatc 780
gccaataaat ctttaaaggg aaccatttct gaaaaagaag cacaatgcat gaagtttcag 840
gtctgggatt atgtcccgtt ggtagaaata tacagtcttc ctgcatttcg tttgaaatat 900
gatgtacgtt tttctaaact agaacaaatg acatctggat atgataaagt aattttaatt 960
gaaaaccagg tagtaaataa cctagatgaa gctaaggtaa tttataaaaa gtatattgac 1020
caaggtcttg aaggtattat tctcaaaaat atcgatggat tatgggaaaa tgctcgttca 1080
aaaaatcttt ataaatttaa agaagtaatt cgtgttgatt taaaaattgt aggaatttat 1140
cctcaccgta aagaccctac taaagcgggt ggatttattc ttgagtcaga gtgtggaaaa 1200
attaaggtaa atgctggttc aggcttaaaa gataaagccg gtgtaaaatc gcatgaactt 1260
gaccgtactc gcattatgga aaaccaaaat tattatattg gaaaaattct agagtgcgaa 1320
tgcaacggtt ggttaaaatc tgatggccgc actgattacg ttaaattatt tcttccgatt 1380
gcgattcgtt tacgtgaaga taaaactaaa gctaatacat tcgaagatgt atttggtgat 1440
tttcatgagg taactggtct aggttctggc agttcaggtc atcaccacca tcatcactaa 1500
<210> 12
<211> 499
<212> PRT
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多肽
<400> 12
Met Ile Leu Lys Ile Leu Asn Glu Ile Ala Ser Ile Gly Ser Thr Lys
1 5 10 15
Gln Lys Gln Ala Ile Leu Glu Lys Asn Lys Asp Asn Glu Leu Leu Lys
20 25 30
Arg Val Tyr Arg Leu Thr Tyr Ser Arg Gly Leu Gln Tyr Tyr Ile Lys
35 40 45
Lys Trp Pro Lys Pro Gly Ile Ala Thr Gln Ser Phe Gly Met Leu Thr
50 55 60
Leu Thr Asp Met Leu Asp Phe Ile Glu Phe Thr Leu Ala Thr Arg Lys
65 70 75 80
Leu Thr Gly Asn Ala Ala Ile Glu Glu Leu Thr Gly Tyr Ile Thr Asp
85 90 95
Gly Lys Lys Asp Asp Val Glu Val Leu Arg Arg Val Met Met Arg Asp
100 105 110
Leu Glu Cys Gly Ala Ser Val Ser Ile Ala Asn Lys Val Trp Pro Gly
115 120 125
Leu Ile Pro Glu Gln Pro Gln Met Leu Ala Ser Ser Tyr Asp Glu Lys
130 135 140
Gly Ile Asn Lys Asn Ile Lys Phe Pro Ala Phe Ala Gln Leu Lys Ala
145 150 155 160
Asp Gly Ala Arg Cys Phe Ala Glu Val Arg Gly Asp Glu Leu Asp Asp
165 170 175
Val Arg Leu Leu Ser Arg Ala Gly Asn Glu Tyr Leu Gly Leu Asp Leu
180 185 190
Leu Lys Glu Glu Leu Ile Lys Met Thr Ala Glu Ala Arg Gln Ile His
195 200 205
Pro Glu Gly Val Leu Ile Asp Gly Glu Leu Val Tyr His Glu Gln Val
210 215 220
Lys Lys Glu Pro Glu Gly Leu Asp Phe Leu Phe Asp Ala Tyr Pro Glu
225 230 235 240
Asn Ser Lys Ala Lys Glu Phe Ala Glu Val Ala Glu Ser Arg Thr Ala
245 250 255
Ser Asn Gly Ile Ala Asn Lys Ser Leu Lys Gly Thr Ile Ser Glu Lys
260 265 270
Glu Ala Gln Cys Met Lys Phe Gln Val Trp Asp Tyr Val Pro Leu Val
275 280 285
Glu Ile Tyr Ser Leu Pro Ala Phe Arg Leu Lys Tyr Asp Val Arg Phe
290 295 300
Ser Lys Leu Glu Gln Met Thr Ser Gly Tyr Asp Lys Val Ile Leu Ile
305 310 315 320
Glu Asn Gln Val Val Asn Asn Leu Asp Glu Ala Lys Val Ile Tyr Lys
325 330 335
Lys Tyr Ile Asp Gln Gly Leu Glu Gly Ile Ile Leu Lys Asn Ile Asp
340 345 350
Gly Leu Trp Glu Asn Ala Arg Ser Lys Asn Leu Tyr Lys Phe Lys Glu
355 360 365
Val Ile Arg Val Asp Leu Lys Ile Val Gly Ile Tyr Pro His Arg Lys
370 375 380
Asp Pro Thr Lys Ala Gly Gly Phe Ile Leu Glu Ser Glu Cys Gly Lys
385 390 395 400
Ile Lys Val Asn Ala Gly Ser Gly Leu Lys Asp Lys Ala Gly Val Lys
405 410 415
Ser His Glu Leu Asp Arg Thr Arg Ile Met Glu Asn Gln Asn Tyr Tyr
420 425 430
Ile Gly Lys Ile Leu Glu Cys Glu Cys Asn Gly Trp Leu Lys Ser Asp
435 440 445
Gly Arg Thr Asp Tyr Val Lys Leu Phe Leu Pro Ile Ala Ile Arg Leu
450 455 460
Arg Glu Asp Lys Thr Lys Ala Asn Thr Phe Glu Asp Val Phe Gly Asp
465 470 475 480
Phe His Glu Val Thr Gly Leu Gly Ser Gly Ser Ser Gly His His His
485 490 495
His His His
<210> 13
<211> 1500
<212> DNA
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多核苷酸
<400> 13
atgattctta aaattctgaa cgaaatagca tctattggtt caactaaaca gaagcaagca 60
attcttgaaa agaataaaga taatgaattg cttaaacgag tatatcgtct gacttattct 120
cgtgggttac agtattatat caagaaatgg cctaaacctg gtattgctac ccagagtttt 180
ggaatgttga ctcttaccga tatgcttgac ttcattgaat tcacattagc tactcggaaa 240
ttgactggaa atgcagcaat tgaggaatta actggatata tcaccgatgg taaaaaagat 300
gatgttgaag ttttgcgtcg agtgatgatg cgagaccttg aatgtggtgc ttcagtatct 360
attgcaaaca aagtttggcc aggtttaatt cctgaacaac ctcaaatgct cgcaagttct 420
tatgatgaaa aaggcattaa taagaatatc aaatttccag cctttgctca gttaaaagct 480
gatggagctc ggtgttttgc tgaagttaga ggtgatgaat tagatgatgt tcgtctttta 540
tcacgagctg gtaatgaata tctaggatta gatcttctta aggaagagtt aattaaaatg 600
accgctgaag cccgccagat tcatccagaa ggtgtgttga ttgatggcga attggtatac 660
catgagcaag ttaaaaagga gccagaaggc ctagattttc tttttgatgc ttatcctgaa 720
aacagtaaag ctaaagaatt cgccgaagta gctgaatcac gtactgcttc taatggaatc 780
gccaataaat ctttaaaggg aaccatttct gaaaaagaag cacaatgcat gaagtttcag 840
gtctgggatt atgtcccgtt ggtagaaata tacagtcttc ctgcatttcg tttgaaatat 900
gatgtacgtt tttctaaact agaacaaatg acatctggat atgataaagt aattttaatt 960
gaaaaccagg tagtaaataa cctagatgaa gctaaggtaa tttataaaaa gtatattgac 1020
caaggtcttg aaggtattat tctcaaaaat atcgatggat tatgggaaaa tgctcgttca 1080
aaaaatcttt ataaatttaa agaagtaatt gatgttgatt taaaaattgt aggaatttat 1140
cctcaccgta aagaccctac taaagcgggt ggatttattc ttgagtcaga gtgtggaaaa 1200
attaaggtaa atgctggttc aggcttaaaa gataaagccg gtgtaaaatc gcataaactt 1260
gaccgtactc gcattatgga aaaccaaaat tattatattg gaaaaattct agagtgcgaa 1320
tgcaacggtt ggttaaaatc tgatggccgc actgattacg ttaaattatt tcttccgatt 1380
gcgattcgtt tacgtgaaga taaaactaaa gctaatacat tcgaagatgt atttggtgat 1440
tttcatgagg taactggtct aggttctggc agttcaggtc atcaccacca tcatcactaa 1500
<210> 14
<211> 499
<212> PRT
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多肽
<400> 14
Met Ile Leu Lys Ile Leu Asn Glu Ile Ala Ser Ile Gly Ser Thr Lys
1 5 10 15
Gln Lys Gln Ala Ile Leu Glu Lys Asn Lys Asp Asn Glu Leu Leu Lys
20 25 30
Arg Val Tyr Arg Leu Thr Tyr Ser Arg Gly Leu Gln Tyr Tyr Ile Lys
35 40 45
Lys Trp Pro Lys Pro Gly Ile Ala Thr Gln Ser Phe Gly Met Leu Thr
50 55 60
Leu Thr Asp Met Leu Asp Phe Ile Glu Phe Thr Leu Ala Thr Arg Lys
65 70 75 80
Leu Thr Gly Asn Ala Ala Ile Glu Glu Leu Thr Gly Tyr Ile Thr Asp
85 90 95
Gly Lys Lys Asp Asp Val Glu Val Leu Arg Arg Val Met Met Arg Asp
100 105 110
Leu Glu Cys Gly Ala Ser Val Ser Ile Ala Asn Lys Val Trp Pro Gly
115 120 125
Leu Ile Pro Glu Gln Pro Gln Met Leu Ala Ser Ser Tyr Asp Glu Lys
130 135 140
Gly Ile Asn Lys Asn Ile Lys Phe Pro Ala Phe Ala Gln Leu Lys Ala
145 150 155 160
Asp Gly Ala Arg Cys Phe Ala Glu Val Arg Gly Asp Glu Leu Asp Asp
165 170 175
Val Arg Leu Leu Ser Arg Ala Gly Asn Glu Tyr Leu Gly Leu Asp Leu
180 185 190
Leu Lys Glu Glu Leu Ile Lys Met Thr Ala Glu Ala Arg Gln Ile His
195 200 205
Pro Glu Gly Val Leu Ile Asp Gly Glu Leu Val Tyr His Glu Gln Val
210 215 220
Lys Lys Glu Pro Glu Gly Leu Asp Phe Leu Phe Asp Ala Tyr Pro Glu
225 230 235 240
Asn Ser Lys Ala Lys Glu Phe Ala Glu Val Ala Glu Ser Arg Thr Ala
245 250 255
Ser Asn Gly Ile Ala Asn Lys Ser Leu Lys Gly Thr Ile Ser Glu Lys
260 265 270
Glu Ala Gln Cys Met Lys Phe Gln Val Trp Asp Tyr Val Pro Leu Val
275 280 285
Glu Ile Tyr Ser Leu Pro Ala Phe Arg Leu Lys Tyr Asp Val Arg Phe
290 295 300
Ser Lys Leu Glu Gln Met Thr Ser Gly Tyr Asp Lys Val Ile Leu Ile
305 310 315 320
Glu Asn Gln Val Val Asn Asn Leu Asp Glu Ala Lys Val Ile Tyr Lys
325 330 335
Lys Tyr Ile Asp Gln Gly Leu Glu Gly Ile Ile Leu Lys Asn Ile Asp
340 345 350
Gly Leu Trp Glu Asn Ala Arg Ser Lys Asn Leu Tyr Lys Phe Lys Glu
355 360 365
Val Ile Asp Val Asp Leu Lys Ile Val Gly Ile Tyr Pro His Arg Lys
370 375 380
Asp Pro Thr Lys Ala Gly Gly Phe Ile Leu Glu Ser Glu Cys Gly Lys
385 390 395 400
Ile Lys Val Asn Ala Gly Ser Gly Leu Lys Asp Lys Ala Gly Val Lys
405 410 415
Ser His Lys Leu Asp Arg Thr Arg Ile Met Glu Asn Gln Asn Tyr Tyr
420 425 430
Ile Gly Lys Ile Leu Glu Cys Glu Cys Asn Gly Trp Leu Lys Ser Asp
435 440 445
Gly Arg Thr Asp Tyr Val Lys Leu Phe Leu Pro Ile Ala Ile Arg Leu
450 455 460
Arg Glu Asp Lys Thr Lys Ala Asn Thr Phe Glu Asp Val Phe Gly Asp
465 470 475 480
Phe His Glu Val Thr Gly Leu Gly Ser Gly Ser Ser Gly His His His
485 490 495
His His His
<210> 15
<211> 1500
<212> DNA
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多核苷酸
<400> 15
atgattctta aaattctgaa cgaaatagca tctattggtt caactaaaca gaagcaagca 60
attcttgaaa agaataaaga taatgaattg cttaaacgag tatatcgtct gacttattct 120
cgtgggttac agtattatat caagaaatgg cctaaacctg gtattgctac ccagagtttt 180
ggaatgttga ctcttaccga tatgcttgac ttcattgaat tcacattagc tactcggaaa 240
ttgactggaa atgcagcaat tgaggaatta actggatata tcaccgatgg taaaaaagat 300
gatgttgaag ttttgcgtcg agtgatgatg cgagaccttg aatgtggtgc ttcagtatct 360
attgcaaaca aagtttggcc aggtttaatt cctgaacaac ctcaaatgct cgcaagttct 420
tatgatgaaa aaggcattaa taagaatatc aaatttccag cctttgctca gttaaaagct 480
gatggagctc ggtgttttgc tgaagttaga ggtgatgaat tagatgatgt tcgtctttta 540
tcacgagctg gtaatgaata tctaggatta gatcttctta aggaagagtt aattaaaatg 600
accgctgaag cccgccagat tcatccagaa ggtgtgttga ttgatggcga attggtatac 660
catgagcaag ttaaaaagga gccagaaggc ctagattttc tttttgatgc ttatcctgaa 720
aacagtaaag ctaaagaatt cgccgaagta gctgaatcac gtactgcttc taatggaatc 780
gccaataaat ctttaaaggg aaccatttct gaaaaagaag cacaatgcat gaagtttcag 840
gtctgggatt atgtcccgtt ggtagaaata tacagtcttc ctgcatttcg tttgaaatat 900
gatgtacgtt tttctaaact agaacaaatg acatctggat atgataaagt aattttaatt 960
gaaaaccagg tagtaaataa cctagatgaa gctaaggtaa tttataaaaa gtatattgac 1020
caaggtcttg aaggtattat tctcaaaaat atcgatggat tatgggaaaa tgctcgttca 1080
aaaaatcttt ataaatttaa agaagtaatt gatgttgatt taaaaattgt aggaatttat 1140
cctcaccgta aagaccctac taaagcgggt ggatttattc ttgagtcaga gtgtggaaaa 1200
attaaggtaa atgctggttc aggcttaaaa gataaagccg gtgtaaaatc gcatgaactt 1260
gaccgtactc gcattatgga aaaccaaaat tattatattg gaaaaattct aaaatgcgaa 1320
tgcaacggtt ggttaaaatc tgatggccgc actgattacg ttaaattatt tcttccgatt 1380
gcgattcgtt tacgtgaaga taaaactaaa gctaatacat tcgaagatgt atttggtgat 1440
tttcatgagg taactggtct aggttctggc agttcaggtc atcaccacca tcatcactaa 1500
<210> 16
<211> 499
<212> PRT
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多肽
<400> 16
Met Ile Leu Lys Ile Leu Asn Glu Ile Ala Ser Ile Gly Ser Thr Lys
1 5 10 15
Gln Lys Gln Ala Ile Leu Glu Lys Asn Lys Asp Asn Glu Leu Leu Lys
20 25 30
Arg Val Tyr Arg Leu Thr Tyr Ser Arg Gly Leu Gln Tyr Tyr Ile Lys
35 40 45
Lys Trp Pro Lys Pro Gly Ile Ala Thr Gln Ser Phe Gly Met Leu Thr
50 55 60
Leu Thr Asp Met Leu Asp Phe Ile Glu Phe Thr Leu Ala Thr Arg Lys
65 70 75 80
Leu Thr Gly Asn Ala Ala Ile Glu Glu Leu Thr Gly Tyr Ile Thr Asp
85 90 95
Gly Lys Lys Asp Asp Val Glu Val Leu Arg Arg Val Met Met Arg Asp
100 105 110
Leu Glu Cys Gly Ala Ser Val Ser Ile Ala Asn Lys Val Trp Pro Gly
115 120 125
Leu Ile Pro Glu Gln Pro Gln Met Leu Ala Ser Ser Tyr Asp Glu Lys
130 135 140
Gly Ile Asn Lys Asn Ile Lys Phe Pro Ala Phe Ala Gln Leu Lys Ala
145 150 155 160
Asp Gly Ala Arg Cys Phe Ala Glu Val Arg Gly Asp Glu Leu Asp Asp
165 170 175
Val Arg Leu Leu Ser Arg Ala Gly Asn Glu Tyr Leu Gly Leu Asp Leu
180 185 190
Leu Lys Glu Glu Leu Ile Lys Met Thr Ala Glu Ala Arg Gln Ile His
195 200 205
Pro Glu Gly Val Leu Ile Asp Gly Glu Leu Val Tyr His Glu Gln Val
210 215 220
Lys Lys Glu Pro Glu Gly Leu Asp Phe Leu Phe Asp Ala Tyr Pro Glu
225 230 235 240
Asn Ser Lys Ala Lys Glu Phe Ala Glu Val Ala Glu Ser Arg Thr Ala
245 250 255
Ser Asn Gly Ile Ala Asn Lys Ser Leu Lys Gly Thr Ile Ser Glu Lys
260 265 270
Glu Ala Gln Cys Met Lys Phe Gln Val Trp Asp Tyr Val Pro Leu Val
275 280 285
Glu Ile Tyr Ser Leu Pro Ala Phe Arg Leu Lys Tyr Asp Val Arg Phe
290 295 300
Ser Lys Leu Glu Gln Met Thr Ser Gly Tyr Asp Lys Val Ile Leu Ile
305 310 315 320
Glu Asn Gln Val Val Asn Asn Leu Asp Glu Ala Lys Val Ile Tyr Lys
325 330 335
Lys Tyr Ile Asp Gln Gly Leu Glu Gly Ile Ile Leu Lys Asn Ile Asp
340 345 350
Gly Leu Trp Glu Asn Ala Arg Ser Lys Asn Leu Tyr Lys Phe Lys Glu
355 360 365
Val Ile Asp Val Asp Leu Lys Ile Val Gly Ile Tyr Pro His Arg Lys
370 375 380
Asp Pro Thr Lys Ala Gly Gly Phe Ile Leu Glu Ser Glu Cys Gly Lys
385 390 395 400
Ile Lys Val Asn Ala Gly Ser Gly Leu Lys Asp Lys Ala Gly Val Lys
405 410 415
Ser His Glu Leu Asp Arg Thr Arg Ile Met Glu Asn Gln Asn Tyr Tyr
420 425 430
Ile Gly Lys Ile Leu Lys Cys Glu Cys Asn Gly Trp Leu Lys Ser Asp
435 440 445
Gly Arg Thr Asp Tyr Val Lys Leu Phe Leu Pro Ile Ala Ile Arg Leu
450 455 460
Arg Glu Asp Lys Thr Lys Ala Asn Thr Phe Glu Asp Val Phe Gly Asp
465 470 475 480
Phe His Glu Val Thr Gly Leu Gly Ser Gly Ser Ser Gly His His His
485 490 495
His His His
<210> 17
<211> 1500
<212> DNA
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多核苷酸
<400> 17
atgattctta aaattctgaa cgaaatagca tctattggtt caactaaaca gaagcaagca 60
attcttgaaa agaataaaga taatgaattg cttaaacgag tatatcgtct gacttattct 120
cgtgggttac agtattatat caagaaatgg cctaaacctg gtattgctac ccagagtttt 180
ggaatgttga ctcttaccga tatgcttgac ttcattgaat tcacattagc tactcggaaa 240
ttgactggaa atgcagcaat tgaggaatta actggatata tcaccgatgg taaaaaagat 300
gatgttgaag ttttgcgtcg agtgatgatg cgagaccttg aatgtggtgc ttcagtatct 360
attgcaaaca aagtttggcc aggtttaatt cctgaacaac ctcaaatgct cgcaagttct 420
tatgatgaaa aaggcattaa taagaatatc aaatttccag cctttgctca gttaaaagct 480
gatggagctc ggtgttttgc tgaagttaga ggtgatgaat tagatgatgt tcgtctttta 540
tcacgagctg gtaatgaata tctaggatta gatcttctta aggaagagtt aattaaaatg 600
accgctgaag cccgccagat tcatccagaa ggtgtgttga ttgatggcga attggtatac 660
catgagcaag ttaaaaagga gccagaaggc ctagattttc tttttgatgc ttatcctgaa 720
aacagtaaag ctaaagaatt cgccgaagta gctgaatcac gtactgcttc taatggaatc 780
gccaataaat ctttaaaggg aaccatttct gaaaaagaag cacaatgcat gaagtttcag 840
gtctgggatt atgtcccgtt ggtagaaata tacagtcttc ctgcatttcg tttgaaatat 900
gatgtacgtt tttctaaact agaacaaatg acatctggat atgataaagt aattttaatt 960
gaaaaccagg tagtaaataa cctagatgaa gctaaggtaa tttataaaaa gtatattgac 1020
caaggtcttg aaggtattat tctcaaaaat atcgatggat tatgggaaaa tgctcgttca 1080
aaaaatcttt ataaatttaa agaagtaatt gatgttgatt taaaaattgt aggaatttat 1140
cctcaccgta aagaccctac taaagcgggt ggatttattc ttgagtcaga gtgtggaaaa 1200
attaaggtaa atgctggttc aggcttaaaa gataaagccg gtgtaaaatc gcatgaactt 1260
gaccgtactc gcattatgga aaaccaaaat tattatattg gaaaaattct agagtgccgt 1320
tgcaacggtt ggttaaaatc tgatggccgc actgattacg ttaaattatt tcttccgatt 1380
gcgattcgtt tacgtgaaga taaaactaaa gctaatacat tcgaagatgt atttggtgat 1440
tttcatgagg taactggtct aggttctggc agttcaggtc atcaccacca tcatcactaa 1500
<210> 18
<211> 499
<212> PRT
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多肽
<400> 18
Met Ile Leu Lys Ile Leu Asn Glu Ile Ala Ser Ile Gly Ser Thr Lys
1 5 10 15
Gln Lys Gln Ala Ile Leu Glu Lys Asn Lys Asp Asn Glu Leu Leu Lys
20 25 30
Arg Val Tyr Arg Leu Thr Tyr Ser Arg Gly Leu Gln Tyr Tyr Ile Lys
35 40 45
Lys Trp Pro Lys Pro Gly Ile Ala Thr Gln Ser Phe Gly Met Leu Thr
50 55 60
Leu Thr Asp Met Leu Asp Phe Ile Glu Phe Thr Leu Ala Thr Arg Lys
65 70 75 80
Leu Thr Gly Asn Ala Ala Ile Glu Glu Leu Thr Gly Tyr Ile Thr Asp
85 90 95
Gly Lys Lys Asp Asp Val Glu Val Leu Arg Arg Val Met Met Arg Asp
100 105 110
Leu Glu Cys Gly Ala Ser Val Ser Ile Ala Asn Lys Val Trp Pro Gly
115 120 125
Leu Ile Pro Glu Gln Pro Gln Met Leu Ala Ser Ser Tyr Asp Glu Lys
130 135 140
Gly Ile Asn Lys Asn Ile Lys Phe Pro Ala Phe Ala Gln Leu Lys Ala
145 150 155 160
Asp Gly Ala Arg Cys Phe Ala Glu Val Arg Gly Asp Glu Leu Asp Asp
165 170 175
Val Arg Leu Leu Ser Arg Ala Gly Asn Glu Tyr Leu Gly Leu Asp Leu
180 185 190
Leu Lys Glu Glu Leu Ile Lys Met Thr Ala Glu Ala Arg Gln Ile His
195 200 205
Pro Glu Gly Val Leu Ile Asp Gly Glu Leu Val Tyr His Glu Gln Val
210 215 220
Lys Lys Glu Pro Glu Gly Leu Asp Phe Leu Phe Asp Ala Tyr Pro Glu
225 230 235 240
Asn Ser Lys Ala Lys Glu Phe Ala Glu Val Ala Glu Ser Arg Thr Ala
245 250 255
Ser Asn Gly Ile Ala Asn Lys Ser Leu Lys Gly Thr Ile Ser Glu Lys
260 265 270
Glu Ala Gln Cys Met Lys Phe Gln Val Trp Asp Tyr Val Pro Leu Val
275 280 285
Glu Ile Tyr Ser Leu Pro Ala Phe Arg Leu Lys Tyr Asp Val Arg Phe
290 295 300
Ser Lys Leu Glu Gln Met Thr Ser Gly Tyr Asp Lys Val Ile Leu Ile
305 310 315 320
Glu Asn Gln Val Val Asn Asn Leu Asp Glu Ala Lys Val Ile Tyr Lys
325 330 335
Lys Tyr Ile Asp Gln Gly Leu Glu Gly Ile Ile Leu Lys Asn Ile Asp
340 345 350
Gly Leu Trp Glu Asn Ala Arg Ser Lys Asn Leu Tyr Lys Phe Lys Glu
355 360 365
Val Ile Asp Val Asp Leu Lys Ile Val Gly Ile Tyr Pro His Arg Lys
370 375 380
Asp Pro Thr Lys Ala Gly Gly Phe Ile Leu Glu Ser Glu Cys Gly Lys
385 390 395 400
Ile Lys Val Asn Ala Gly Ser Gly Leu Lys Asp Lys Ala Gly Val Lys
405 410 415
Ser His Glu Leu Asp Arg Thr Arg Ile Met Glu Asn Gln Asn Tyr Tyr
420 425 430
Ile Gly Lys Ile Leu Glu Cys Arg Cys Asn Gly Trp Leu Lys Ser Asp
435 440 445
Gly Arg Thr Asp Tyr Val Lys Leu Phe Leu Pro Ile Ala Ile Arg Leu
450 455 460
Arg Glu Asp Lys Thr Lys Ala Asn Thr Phe Glu Asp Val Phe Gly Asp
465 470 475 480
Phe His Glu Val Thr Gly Leu Gly Ser Gly Ser Ser Gly His His His
485 490 495
His His His
<210> 19
<211> 1500
<212> DNA
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多核苷酸
<400> 19
atgattctta aaattctgaa cgaaatagca tctattggtt caactaaaca gaagcaagca 60
attcttgaaa agaataaaga taatgaattg cttaaacgag tatatcgtct gacttattct 120
cgtgggttac agtattatat caagaaatgg cctaaacctg gtattgctac ccagagtttt 180
ggaatgttga ctcttaccga tatgcttgac ttcattgaat tcacattagc tactcggaaa 240
ttgactggaa atgcagcaat tgaggaatta actggatata tcaccgatgg taaaaaagat 300
gatgttgaag ttttgcgtcg agtgatgatg cgagaccttg aatgtggtgc ttcagtatct 360
attgcaaaca aagtttggcc aggtttaatt cctgaacaac ctcaaatgct cgcaagttct 420
tatgatgaaa aaggcattaa taagaatatc aaatttccag cctttgctca gttaaaagct 480
gatggagctc ggtgttttgc tgaagttaga ggtgatgaat tagatgatgt tcgtctttta 540
tcacgagctg gtaatgaata tctaggatta gatcttctta aggaagagtt aattaaaatg 600
accgctgaag cccgccagat tcatccagaa ggtgtgttga ttgatggcga attggtatac 660
catgagcaag ttaaaaagga gccagaaggc ctagattttc tttttgatgc ttatcctgaa 720
aacagtaaag ctaaagaatt cgccgaagta gctgaatcac gtactgcttc taatggaatc 780
gccaataaat ctttaaaggg aaccatttct gaaaaagaag cacaatgcat gaagtttcag 840
gtctgggatt atgtcccgtt ggtagaaata tacagtcttc ctgcatttcg tttgaaatat 900
gatgtacgtt tttctaaact agaacaaatg acatctggat atgataaagt aattttaatt 960
gaaaaccagg tagtaaataa cctagatgaa gctaaggtaa tttataaaaa gtatattgac 1020
caaggtcttg aaggtattat tctcaaaaat atcgatggat tatgggaaaa tgctcgttca 1080
aaaaatcttt ataaatttaa agaagtaatt gatgttgatt taaaaattgt aggaatttat 1140
cctcaccgta aagaccctac taaagcgggt ggatttattc ttgagtcaga gtgtggaaaa 1200
attaaggtaa atgctggttc aggcttaaaa gataaagccg gtgtaaaatc gcatgaactt 1260
gaccgtactc gcattatgga aaaccaaaat tattatattg gaaaaattct agagtgctgg 1320
tgcaacggtt ggttaaaatc tgatggccgc actgattacg ttaaattatt tcttccgatt 1380
gcgattcgtt tacgtgaaga taaaactaaa gctaatacat tcgaagatgt atttggtgat 1440
tttcatgagg taactggtct aggttctggc agttcaggtc atcaccacca tcatcactaa 1500
<210> 20
<211> 499
<212> PRT
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多肽
<400> 20
Met Ile Leu Lys Ile Leu Asn Glu Ile Ala Ser Ile Gly Ser Thr Lys
1 5 10 15
Gln Lys Gln Ala Ile Leu Glu Lys Asn Lys Asp Asn Glu Leu Leu Lys
20 25 30
Arg Val Tyr Arg Leu Thr Tyr Ser Arg Gly Leu Gln Tyr Tyr Ile Lys
35 40 45
Lys Trp Pro Lys Pro Gly Ile Ala Thr Gln Ser Phe Gly Met Leu Thr
50 55 60
Leu Thr Asp Met Leu Asp Phe Ile Glu Phe Thr Leu Ala Thr Arg Lys
65 70 75 80
Leu Thr Gly Asn Ala Ala Ile Glu Glu Leu Thr Gly Tyr Ile Thr Asp
85 90 95
Gly Lys Lys Asp Asp Val Glu Val Leu Arg Arg Val Met Met Arg Asp
100 105 110
Leu Glu Cys Gly Ala Ser Val Ser Ile Ala Asn Lys Val Trp Pro Gly
115 120 125
Leu Ile Pro Glu Gln Pro Gln Met Leu Ala Ser Ser Tyr Asp Glu Lys
130 135 140
Gly Ile Asn Lys Asn Ile Lys Phe Pro Ala Phe Ala Gln Leu Lys Ala
145 150 155 160
Asp Gly Ala Arg Cys Phe Ala Glu Val Arg Gly Asp Glu Leu Asp Asp
165 170 175
Val Arg Leu Leu Ser Arg Ala Gly Asn Glu Tyr Leu Gly Leu Asp Leu
180 185 190
Leu Lys Glu Glu Leu Ile Lys Met Thr Ala Glu Ala Arg Gln Ile His
195 200 205
Pro Glu Gly Val Leu Ile Asp Gly Glu Leu Val Tyr His Glu Gln Val
210 215 220
Lys Lys Glu Pro Glu Gly Leu Asp Phe Leu Phe Asp Ala Tyr Pro Glu
225 230 235 240
Asn Ser Lys Ala Lys Glu Phe Ala Glu Val Ala Glu Ser Arg Thr Ala
245 250 255
Ser Asn Gly Ile Ala Asn Lys Ser Leu Lys Gly Thr Ile Ser Glu Lys
260 265 270
Glu Ala Gln Cys Met Lys Phe Gln Val Trp Asp Tyr Val Pro Leu Val
275 280 285
Glu Ile Tyr Ser Leu Pro Ala Phe Arg Leu Lys Tyr Asp Val Arg Phe
290 295 300
Ser Lys Leu Glu Gln Met Thr Ser Gly Tyr Asp Lys Val Ile Leu Ile
305 310 315 320
Glu Asn Gln Val Val Asn Asn Leu Asp Glu Ala Lys Val Ile Tyr Lys
325 330 335
Lys Tyr Ile Asp Gln Gly Leu Glu Gly Ile Ile Leu Lys Asn Ile Asp
340 345 350
Gly Leu Trp Glu Asn Ala Arg Ser Lys Asn Leu Tyr Lys Phe Lys Glu
355 360 365
Val Ile Asp Val Asp Leu Lys Ile Val Gly Ile Tyr Pro His Arg Lys
370 375 380
Asp Pro Thr Lys Ala Gly Gly Phe Ile Leu Glu Ser Glu Cys Gly Lys
385 390 395 400
Ile Lys Val Asn Ala Gly Ser Gly Leu Lys Asp Lys Ala Gly Val Lys
405 410 415
Ser His Glu Leu Asp Arg Thr Arg Ile Met Glu Asn Gln Asn Tyr Tyr
420 425 430
Ile Gly Lys Ile Leu Glu Cys Trp Cys Asn Gly Trp Leu Lys Ser Asp
435 440 445
Gly Arg Thr Asp Tyr Val Lys Leu Phe Leu Pro Ile Ala Ile Arg Leu
450 455 460
Arg Glu Asp Lys Thr Lys Ala Asn Thr Phe Glu Asp Val Phe Gly Asp
465 470 475 480
Phe His Glu Val Thr Gly Leu Gly Ser Gly Ser Ser Gly His His His
485 490 495
His His His
<210> 21
<211> 1500
<212> DNA
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多核苷酸
<400> 21
atgattctta aaattctgaa cgaaatagca tctattggtt caactaaaca gaagcaagca 60
attcttgaaa agaataaaga taatgaattg cttaaacgag tatatcgtct gacttattct 120
cgtgggttac agtattatat caagaaatgg cctaaacctg gtattgctac ccagagtttt 180
ggaatgttga ctcttaccga tatgcttgac ttcattgaat tcacattagc tactcggaaa 240
ttgactggaa atgcagcaat tgaggaatta actggatata tcaccgatgg taaaaaagat 300
gatgttgaag ttttgcgtcg agtgatgatg cgagaccttg aatgtggtgc ttcagtatct 360
attgcaaaca aagtttggcc aggtttaatt cctgaacaac ctcaaatgct cgcaagttct 420
tatgatgaaa aaggcattaa taagaatatc aaatttccag cctttgctca gttaaaagct 480
gatggagctc ggtgttttgc tgaagttaga ggtgatgaat tagatgatgt tcgtctttta 540
tcacgagctg gtaatgaata tctaggatta gatcttctta aggaagagtt aattaaaatg 600
accgctgaag cccgccagat tcatccagaa ggtgtgttga ttgatggcga attggtatac 660
catgagcaag ttaaaaagga gccagaaggc ctagattttc tttttgatgc ttatcctgaa 720
aacagtaaag ctaaagaatt cgccgaagta gctgaatcac gtactgcttc taatggaatc 780
gccaataaat ctttaaaggg aaccatttct gaaaaagaag cacaatgcat gaagtttcag 840
gtctgggatt atgtcccgtt ggtagaaata tacagtcttc ctgcatttcg tttgaaatat 900
gatgtacgtt tttctaaact agaacaaatg acatctggat atgataaagt aattttaatt 960
gaaaaccagg tagtaaataa cctagatgaa gctaaggtaa tttataaaaa gtatattgac 1020
caaggtcttg aaggtattat tctcaaaaat atcgatggat tatgggaaaa tgctcgttca 1080
aaaaatcttt ataaatttaa agaagtaatt gatgttgatt taaaaattgt aggaatttat 1140
cctcaccgta aagaccctac taaagcgggt ggatttattc ttgagtcaga gtgtggaaaa 1200
attaaggtaa atgctggttc aggcttaaaa gataaagccg gtgtaaaatc gcatgaactt 1260
gaccgtactc gcattatgga aaaccaaaat tattatattg gaaaaattct agagtgcgaa 1320
tgcaacggtt ggttaaaatc tgatggccgc actcgttacg ttaaattatt tcttccgatt 1380
gcgattcgtt tacgtgaaga taaaactaaa gctaatacat tcgaagatgt atttggtgat 1440
tttcatgagg taactggtct aggttctggc agttcaggtc atcaccacca tcatcactaa 1500
<210> 22
<211> 499
<212> PRT
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多肽
<400> 22
Met Ile Leu Lys Ile Leu Asn Glu Ile Ala Ser Ile Gly Ser Thr Lys
1 5 10 15
Gln Lys Gln Ala Ile Leu Glu Lys Asn Lys Asp Asn Glu Leu Leu Lys
20 25 30
Arg Val Tyr Arg Leu Thr Tyr Ser Arg Gly Leu Gln Tyr Tyr Ile Lys
35 40 45
Lys Trp Pro Lys Pro Gly Ile Ala Thr Gln Ser Phe Gly Met Leu Thr
50 55 60
Leu Thr Asp Met Leu Asp Phe Ile Glu Phe Thr Leu Ala Thr Arg Lys
65 70 75 80
Leu Thr Gly Asn Ala Ala Ile Glu Glu Leu Thr Gly Tyr Ile Thr Asp
85 90 95
Gly Lys Lys Asp Asp Val Glu Val Leu Arg Arg Val Met Met Arg Asp
100 105 110
Leu Glu Cys Gly Ala Ser Val Ser Ile Ala Asn Lys Val Trp Pro Gly
115 120 125
Leu Ile Pro Glu Gln Pro Gln Met Leu Ala Ser Ser Tyr Asp Glu Lys
130 135 140
Gly Ile Asn Lys Asn Ile Lys Phe Pro Ala Phe Ala Gln Leu Lys Ala
145 150 155 160
Asp Gly Ala Arg Cys Phe Ala Glu Val Arg Gly Asp Glu Leu Asp Asp
165 170 175
Val Arg Leu Leu Ser Arg Ala Gly Asn Glu Tyr Leu Gly Leu Asp Leu
180 185 190
Leu Lys Glu Glu Leu Ile Lys Met Thr Ala Glu Ala Arg Gln Ile His
195 200 205
Pro Glu Gly Val Leu Ile Asp Gly Glu Leu Val Tyr His Glu Gln Val
210 215 220
Lys Lys Glu Pro Glu Gly Leu Asp Phe Leu Phe Asp Ala Tyr Pro Glu
225 230 235 240
Asn Ser Lys Ala Lys Glu Phe Ala Glu Val Ala Glu Ser Arg Thr Ala
245 250 255
Ser Asn Gly Ile Ala Asn Lys Ser Leu Lys Gly Thr Ile Ser Glu Lys
260 265 270
Glu Ala Gln Cys Met Lys Phe Gln Val Trp Asp Tyr Val Pro Leu Val
275 280 285
Glu Ile Tyr Ser Leu Pro Ala Phe Arg Leu Lys Tyr Asp Val Arg Phe
290 295 300
Ser Lys Leu Glu Gln Met Thr Ser Gly Tyr Asp Lys Val Ile Leu Ile
305 310 315 320
Glu Asn Gln Val Val Asn Asn Leu Asp Glu Ala Lys Val Ile Tyr Lys
325 330 335
Lys Tyr Ile Asp Gln Gly Leu Glu Gly Ile Ile Leu Lys Asn Ile Asp
340 345 350
Gly Leu Trp Glu Asn Ala Arg Ser Lys Asn Leu Tyr Lys Phe Lys Glu
355 360 365
Val Ile Asp Val Asp Leu Lys Ile Val Gly Ile Tyr Pro His Arg Lys
370 375 380
Asp Pro Thr Lys Ala Gly Gly Phe Ile Leu Glu Ser Glu Cys Gly Lys
385 390 395 400
Ile Lys Val Asn Ala Gly Ser Gly Leu Lys Asp Lys Ala Gly Val Lys
405 410 415
Ser His Glu Leu Asp Arg Thr Arg Ile Met Glu Asn Gln Asn Tyr Tyr
420 425 430
Ile Gly Lys Ile Leu Glu Cys Glu Cys Asn Gly Trp Leu Lys Ser Asp
435 440 445
Gly Arg Thr Arg Tyr Val Lys Leu Phe Leu Pro Ile Ala Ile Arg Leu
450 455 460
Arg Glu Asp Lys Thr Lys Ala Asn Thr Phe Glu Asp Val Phe Gly Asp
465 470 475 480
Phe His Glu Val Thr Gly Leu Gly Ser Gly Ser Ser Gly His His His
485 490 495
His His His
<210> 23
<211> 1500
<212> DNA
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多核苷酸
<400> 23
atgattctta aaattctgaa cgaaatagca tctattggtt caactaaaca gaagcaagca 60
attcttgaaa agaataaaga taatgaattg cttaaacgag tatatcgtct gacttattct 120
cgtgggttac agtattatat caagaaatgg cctaaacctg gtattgctac ccagagtttt 180
ggaatgttga ctcttaccga tatgcttgac ttcattgaat tcacattagc tactcggaaa 240
ttgactggaa atgcagcaat tgaggaatta actggatata tcaccgatgg taaaaaagat 300
gatgttgaag ttttgcgtcg agtgatgatg cgagaccttg aatgtggtgc ttcagtatct 360
attgcaaaca aagtttggcc aggtttaatt cctgaacaac ctcaaatgct cgcaagttct 420
tatgatgaaa aaggcattaa taagaatatc aaatttccag cctttgctca gttaaaagct 480
gatggagctc ggtgttttgc tgaagttaga ggtgatgaat tagatgatgt tcgtctttta 540
tcacgagctg gtaatgaata tctaggatta gatcttctta aggaagagtt aattaaaatg 600
accgctgaag cccgccagat tcatccagaa ggtgtgttga ttgatggcga attggtatac 660
catgagcaag ttaaaaagga gccagaaggc ctagattttc tttttgatgc ttatcctgaa 720
aacagtaaag ctaaagaatt cgccgaagta gctgaatcac gtactgcttc taatggaatc 780
gccaataaat ctttaaaggg aaccatttct gaaaaagaag cacaatgcat gaagtttcag 840
gtctgggatt atgtcccgtt ggtagaaata tacagtcttc ctgcatttcg tttgaaatat 900
gatgtacgtt tttctaaact agaacaaatg acatctggat atgataaagt aattttaatt 960
gaaaaccagg tagtaaataa cctagatgaa gctaaggtaa tttataaaaa gtatattgac 1020
caaggtcttg aaggtattat tctcaaaaat atcgatggat tatgggaaaa tgctcgttca 1080
aaaaatcttt ataaatttaa agaagtaatt gatgttgatt taaaaattgt aggaatttat 1140
cctcaccgta aagaccctac taaagcgggt ggatttattc ttgagtcaga gtgtggaaaa 1200
attaaggtaa atgctggttc aggcttaaaa gataaagccg gtgtaaaatc gcatgaactt 1260
gaccgtactc gcattatgga aaaccaaaat tattatattg gaaaaattct agagtgcgaa 1320
tgcaacggtt ggttaaaatc tgatggccgc actgattacg ttaaattatt tcttccgatt 1380
gcgattcgtt tacgtgaaga taaaactgaa gctaatacat tcgaagatgt atttggtgat 1440
tttcatgagg taactggtct aggttctggc agttcaggtc atcaccacca tcatcactaa 1500
<210> 24
<211> 499
<212> PRT
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的多肽
<400> 24
Met Ile Leu Lys Ile Leu Asn Glu Ile Ala Ser Ile Gly Ser Thr Lys
1 5 10 15
Gln Lys Gln Ala Ile Leu Glu Lys Asn Lys Asp Asn Glu Leu Leu Lys
20 25 30
Arg Val Tyr Arg Leu Thr Tyr Ser Arg Gly Leu Gln Tyr Tyr Ile Lys
35 40 45
Lys Trp Pro Lys Pro Gly Ile Ala Thr Gln Ser Phe Gly Met Leu Thr
50 55 60
Leu Thr Asp Met Leu Asp Phe Ile Glu Phe Thr Leu Ala Thr Arg Lys
65 70 75 80
Leu Thr Gly Asn Ala Ala Ile Glu Glu Leu Thr Gly Tyr Ile Thr Asp
85 90 95
Gly Lys Lys Asp Asp Val Glu Val Leu Arg Arg Val Met Met Arg Asp
100 105 110
Leu Glu Cys Gly Ala Ser Val Ser Ile Ala Asn Lys Val Trp Pro Gly
115 120 125
Leu Ile Pro Glu Gln Pro Gln Met Leu Ala Ser Ser Tyr Asp Glu Lys
130 135 140
Gly Ile Asn Lys Asn Ile Lys Phe Pro Ala Phe Ala Gln Leu Lys Ala
145 150 155 160
Asp Gly Ala Arg Cys Phe Ala Glu Val Arg Gly Asp Glu Leu Asp Asp
165 170 175
Val Arg Leu Leu Ser Arg Ala Gly Asn Glu Tyr Leu Gly Leu Asp Leu
180 185 190
Leu Lys Glu Glu Leu Ile Lys Met Thr Ala Glu Ala Arg Gln Ile His
195 200 205
Pro Glu Gly Val Leu Ile Asp Gly Glu Leu Val Tyr His Glu Gln Val
210 215 220
Lys Lys Glu Pro Glu Gly Leu Asp Phe Leu Phe Asp Ala Tyr Pro Glu
225 230 235 240
Asn Ser Lys Ala Lys Glu Phe Ala Glu Val Ala Glu Ser Arg Thr Ala
245 250 255
Ser Asn Gly Ile Ala Asn Lys Ser Leu Lys Gly Thr Ile Ser Glu Lys
260 265 270
Glu Ala Gln Cys Met Lys Phe Gln Val Trp Asp Tyr Val Pro Leu Val
275 280 285
Glu Ile Tyr Ser Leu Pro Ala Phe Arg Leu Lys Tyr Asp Val Arg Phe
290 295 300
Ser Lys Leu Glu Gln Met Thr Ser Gly Tyr Asp Lys Val Ile Leu Ile
305 310 315 320
Glu Asn Gln Val Val Asn Asn Leu Asp Glu Ala Lys Val Ile Tyr Lys
325 330 335
Lys Tyr Ile Asp Gln Gly Leu Glu Gly Ile Ile Leu Lys Asn Ile Asp
340 345 350
Gly Leu Trp Glu Asn Ala Arg Ser Lys Asn Leu Tyr Lys Phe Lys Glu
355 360 365
Val Ile Asp Val Asp Leu Lys Ile Val Gly Ile Tyr Pro His Arg Lys
370 375 380
Asp Pro Thr Lys Ala Gly Gly Phe Ile Leu Glu Ser Glu Cys Gly Lys
385 390 395 400
Ile Lys Val Asn Ala Gly Ser Gly Leu Lys Asp Lys Ala Gly Val Lys
405 410 415
Ser His Glu Leu Asp Arg Thr Arg Ile Met Glu Asn Gln Asn Tyr Tyr
420 425 430
Ile Gly Lys Ile Leu Glu Cys Glu Cys Asn Gly Trp Leu Lys Ser Asp
435 440 445
Gly Arg Thr Asp Tyr Val Lys Leu Phe Leu Pro Ile Ala Ile Arg Leu
450 455 460
Arg Glu Asp Lys Thr Glu Ala Asn Thr Phe Glu Asp Val Phe Gly Asp
465 470 475 480
Phe His Glu Val Thr Gly Leu Gly Ser Gly Ser Ser Gly His His His
485 490 495
His His His
<210> 25
<211> 12
<212> PRT
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的肽
<400> 25
Gly Ser Gly Ser Ser Gly His His His His His His
1 5 10
<210> 26
<211> 6
<212> PRT
<213> 人工序列
<220>
<223> 人工序列的描述: 合成的6x组氨酸标签
<400> 26
His His His His His His
1 5
Claims (7)
1.一种突变型T4 DNA连接酶,其氨基酸序列如SEQ ID NO:6所示,且不包括在其C末端处的6-元组氨酸标签和6个紧接在前的甘氨酸和丝氨酸氨基酸。
2.一种多核苷酸,其编码根据权利要求1所述的突变型T4 DNA连接酶。
3.一种载体,其含有根据权利要求2所述的多核苷酸。
4.一种细胞,其含有根据权利要求2所述的多核苷酸。
5.一种在不同多核苷酸之间或通过连接多核苷酸的5'末端和3'末端来进行多核苷酸连接以产生环状多核苷酸的方法,其中所述多核苷酸具有平滑末端或粘性末端,所述方法包括:
提供连接混合物,其包含待连接的多核苷酸和根据权利要求1所述的突变型T4 DNA连接酶;以及
将所述连接混合物置于发生连接的温度下。
6.一种在不同多核苷酸之间或通过连接多核苷酸的5'末端和3'末端来进行多核苷酸连接以产生环状多核苷酸的方法,其中所述多核苷酸具有平滑末端或粘性末端,所述方法包括:
提供连接反应混合物,其包含缓冲剂、待连接的多核苷酸和根据权利要求1所述的突变型T4 DNA连接酶;以及
将所述连接反应混合物置于适于连接的温度条件下。
7.根据权利要求6所述的方法,其中所述连接反应混合物包括Tris-HCl、MgCl2、ATP、二硫苏糖醇和水。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/699,354 US20230295707A1 (en) | 2022-03-21 | 2022-03-21 | T4 DNA Ligase Variants with Increased Ligation Efficiency |
US17/699,354 | 2022-03-21 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114934026A CN114934026A (zh) | 2022-08-23 |
CN114934026B true CN114934026B (zh) | 2023-09-26 |
Family
ID=82865606
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210557719.XA Active CN114934026B (zh) | 2022-03-21 | 2022-05-19 | 具有增加的连接效率的t4 dna连接酶变体 |
Country Status (2)
Country | Link |
---|---|
US (1) | US20230295707A1 (zh) |
CN (1) | CN114934026B (zh) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117247911A (zh) * | 2023-08-22 | 2023-12-19 | 武汉爱博泰克生物科技有限公司 | 大肠杆菌dna连接酶突变体及其在毕赤酵母中表达纯化方法 |
Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5955363A (en) * | 1990-01-03 | 1999-09-21 | Promega Corporation | Vector for in vitro mutagenesis and use thereof |
CA2430503A1 (en) * | 2000-12-01 | 2002-06-06 | Cornell Research Foundation, Inc. | Detection of nucleic acid differences using combined endonuclease cleavage and ligation reactions |
CN102597006A (zh) * | 2009-09-16 | 2012-07-18 | 梅西大学 | 融合多肽以及其应用 |
CN103764820A (zh) * | 2011-06-08 | 2014-04-30 | 国际生物分子和细胞研究所 | 序列特异性工程改造的核糖核酸酶h和用于确定dna-rna杂交物结合蛋白序列偏好的方法 |
CN106222185A (zh) * | 2006-08-04 | 2016-12-14 | 维莱尼姆公司 | 葡聚糖酶、编码它们的核酸及制备和使用它们的方法 |
CN108779442A (zh) * | 2016-02-08 | 2018-11-09 | 瑞尔基因公司 | 多种连接酶的组合物、系统以及方法 |
CN110914415A (zh) * | 2017-05-08 | 2020-03-24 | 科德克希思公司 | 工程化连接酶变体 |
CN111328287A (zh) * | 2017-07-04 | 2020-06-23 | 库瑞瓦格股份公司 | 新型核酸分子 |
CN111479923A (zh) * | 2017-12-19 | 2020-07-31 | 葛兰素史克知识产权开发有限公司 | 用于生产寡核苷酸的新型方法 |
CN114045274A (zh) * | 2021-10-09 | 2022-02-15 | 武汉爱博泰克生物科技有限公司 | 热稳定的逆转录酶突变体 |
CN114717209A (zh) * | 2022-02-18 | 2022-07-08 | 武汉爱博泰克生物科技有限公司 | 具有增加的耐盐性的t4 dna连接酶变体 |
CN114854699A (zh) * | 2022-02-22 | 2022-08-05 | 武汉爱博泰克生物科技有限公司 | 具有提高的热稳定性的t4 dna连接酶变体 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2647692A3 (en) * | 2008-11-11 | 2014-01-22 | The Procter and Gamble Company | Compositions and methods comprising serine protease variants |
GB201612011D0 (en) * | 2016-07-11 | 2016-08-24 | Glaxosmithkline Ip Dev Ltd | Novel processes for the production of oligonucleotides |
US10837009B1 (en) * | 2017-12-22 | 2020-11-17 | New England Biolabs, Inc. | DNA ligase variants |
-
2022
- 2022-03-21 US US17/699,354 patent/US20230295707A1/en active Pending
- 2022-05-19 CN CN202210557719.XA patent/CN114934026B/zh active Active
Patent Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5955363A (en) * | 1990-01-03 | 1999-09-21 | Promega Corporation | Vector for in vitro mutagenesis and use thereof |
CA2430503A1 (en) * | 2000-12-01 | 2002-06-06 | Cornell Research Foundation, Inc. | Detection of nucleic acid differences using combined endonuclease cleavage and ligation reactions |
CN106222185A (zh) * | 2006-08-04 | 2016-12-14 | 维莱尼姆公司 | 葡聚糖酶、编码它们的核酸及制备和使用它们的方法 |
CN102597006A (zh) * | 2009-09-16 | 2012-07-18 | 梅西大学 | 融合多肽以及其应用 |
CN103764820A (zh) * | 2011-06-08 | 2014-04-30 | 国际生物分子和细胞研究所 | 序列特异性工程改造的核糖核酸酶h和用于确定dna-rna杂交物结合蛋白序列偏好的方法 |
CN108779442A (zh) * | 2016-02-08 | 2018-11-09 | 瑞尔基因公司 | 多种连接酶的组合物、系统以及方法 |
CN110914415A (zh) * | 2017-05-08 | 2020-03-24 | 科德克希思公司 | 工程化连接酶变体 |
CN111328287A (zh) * | 2017-07-04 | 2020-06-23 | 库瑞瓦格股份公司 | 新型核酸分子 |
CN111479923A (zh) * | 2017-12-19 | 2020-07-31 | 葛兰素史克知识产权开发有限公司 | 用于生产寡核苷酸的新型方法 |
CN114045274A (zh) * | 2021-10-09 | 2022-02-15 | 武汉爱博泰克生物科技有限公司 | 热稳定的逆转录酶突变体 |
CN114717209A (zh) * | 2022-02-18 | 2022-07-08 | 武汉爱博泰克生物科技有限公司 | 具有增加的耐盐性的t4 dna连接酶变体 |
CN114854699A (zh) * | 2022-02-22 | 2022-08-05 | 武汉爱博泰克生物科技有限公司 | 具有提高的热稳定性的t4 dna连接酶变体 |
Non-Patent Citations (7)
Title |
---|
"Bacterial expression vector pPLc28LIG8,complete sequence";De Schamphelaire,W.等;《Genbank Database》;20170206;Accession No.LT726946.1 * |
"DNA ligase[Escherichia phage T4]";Miller,E.S等;《Genbank Database》;20211111;Accession No.NP_049813.1 * |
"LncRNA SChLAP1和E3泛素连接酶TRIM22在胶质母细胞瘤中的作用与机制研究";季剑雄;《中国博士学位论文全文数据库(电子期刊)》;20180830;第1-200页 * |
"Proliferating cell nuclear antigen restores the enzymatic activity of a DNA ligase I deficient in DNA binding";Trasvina-Arenas Carlos H等;《FEBS》;20170505;第7卷(第5期);第659-674页 * |
"Transformation of Escherichia coli Increases 260-fold upon Inactivation of T4 DNA Ligase";Michelsen,B.K等;《Analytical Biochemistry》;19950228;第225卷(第1期);第659-674页 * |
"核酸错配对DNA连接酶反应的单位点核酸变异检测的影响";李书亚;《中国优秀硕士学位论文全文数据库(电子期刊)》;20190827;第1-95页 * |
"额外的错配碱基提高T4DNA连接酶等位特异性连接";李书亚等;《中国生物化学与分子生物学报》;20180820(第8期);第854-860页 * |
Also Published As
Publication number | Publication date |
---|---|
US20230295707A1 (en) | 2023-09-21 |
CN114934026A (zh) | 2022-08-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN114717209B (zh) | 具有增加的耐盐性的t4 dna连接酶变体 | |
US11821010B2 (en) | Mutant Taq polymerase for faster amplification | |
US11807874B2 (en) | P450-BM3 variants with improved activity | |
US8569029B2 (en) | DNase expression in recombinant host cells | |
KR20200003903A (ko) | 조작된 리가제 변이체 | |
US11168312B2 (en) | Mutant taq polymerase for amplification in increased salt concentration or body fluids | |
CN114934026B (zh) | 具有增加的连接效率的t4 dna连接酶变体 | |
CN114854699B (zh) | 具有提高的热稳定性的t4 dna连接酶变体 | |
US20230212550A1 (en) | T4 DNA Ligase Variants with Increased Ligation Efficiency | |
CN115044569B (zh) | 一种Taq DNA聚合酶突变体及其应用 | |
CN117965497A (zh) | 双标记的粘质沙雷氏菌核酸酶以及使用其分解核酸的方法 | |
CN117778348A (zh) | 一种具有逆转录酶活性的Pfu DNA聚合酶突变体及其应用 | |
US20230094503A1 (en) | Mutant Taq Polymerase for Increased Salt Concentration or Body Fluids | |
US11060075B2 (en) | Engineered DNA polymerase variants | |
US20230183790A1 (en) | Recombinant reverse transcriptase variants | |
WO2024059581A2 (en) | Engineered dna polymerase variants |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |