CN1918296A - 来自伯氏致病杆菌的毒素复合物蛋白和基因 - Google Patents
来自伯氏致病杆菌的毒素复合物蛋白和基因 Download PDFInfo
- Publication number
- CN1918296A CN1918296A CNA2004800420040A CN200480042004A CN1918296A CN 1918296 A CN1918296 A CN 1918296A CN A2004800420040 A CNA2004800420040 A CN A2004800420040A CN 200480042004 A CN200480042004 A CN 200480042004A CN 1918296 A CN1918296 A CN 1918296A
- Authority
- CN
- China
- Prior art keywords
- leu
- ala
- thr
- ser
- gly
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 260
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 133
- 239000003053 toxin Substances 0.000 title abstract description 157
- 231100000765 toxin Toxicity 0.000 title abstract description 155
- 241000238631 Hexapoda Species 0.000 claims description 129
- 238000000034 method Methods 0.000 claims description 51
- 230000001580 bacterial effect Effects 0.000 claims description 34
- 108091033319 polynucleotide Proteins 0.000 claims description 25
- 102000040430 polynucleotide Human genes 0.000 claims description 25
- 239000002157 polynucleotide Substances 0.000 claims description 25
- 238000009396 hybridization Methods 0.000 claims description 12
- 230000000295 complement effect Effects 0.000 claims description 10
- 102000039446 nucleic acids Human genes 0.000 claims description 5
- 108020004707 nucleic acids Proteins 0.000 claims description 5
- 150000007523 nucleic acids Chemical class 0.000 claims description 5
- 230000007888 toxin activity Effects 0.000 claims description 4
- 108020004711 Nucleic Acid Probes Proteins 0.000 claims description 3
- 239000002853 nucleic acid probe Substances 0.000 claims description 3
- 238000004113 cell culture Methods 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 241000607757 Xenorhabdus Species 0.000 abstract description 59
- 241000123579 Xenorhabdus bovienii Species 0.000 abstract description 2
- 108700012359 toxins Proteins 0.000 description 168
- 241000196324 Embryophyta Species 0.000 description 124
- 235000018102 proteins Nutrition 0.000 description 116
- 108020004414 DNA Proteins 0.000 description 76
- 241000894006 Bacteria Species 0.000 description 69
- 210000004027 cell Anatomy 0.000 description 67
- 239000000523 sample Substances 0.000 description 47
- 239000012634 fragment Substances 0.000 description 38
- 150000001413 amino acids Chemical group 0.000 description 36
- 108020004705 Codon Proteins 0.000 description 34
- 238000012360 testing method Methods 0.000 description 34
- 230000000694 effects Effects 0.000 description 31
- 235000001014 amino acid Nutrition 0.000 description 28
- 239000013612 plasmid Substances 0.000 description 26
- 229940024606 amino acid Drugs 0.000 description 24
- 239000002773 nucleotide Substances 0.000 description 23
- 238000006243 chemical reaction Methods 0.000 description 22
- 239000006166 lysate Substances 0.000 description 22
- 241000588724 Escherichia coli Species 0.000 description 21
- 240000008042 Zea mays Species 0.000 description 21
- 125000003729 nucleotide group Chemical group 0.000 description 21
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 20
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 20
- 235000005822 corn Nutrition 0.000 description 20
- 238000005516 engineering process Methods 0.000 description 20
- 230000014509 gene expression Effects 0.000 description 20
- 230000000749 insecticidal effect Effects 0.000 description 20
- 241000193830 Bacillus <bacterium> Species 0.000 description 19
- 108091028043 Nucleic acid sequence Proteins 0.000 description 19
- 239000002585 base Substances 0.000 description 18
- 239000000047 product Substances 0.000 description 18
- 239000007853 buffer solution Substances 0.000 description 17
- 239000004927 clay Substances 0.000 description 17
- 108010057821 leucylproline Proteins 0.000 description 17
- 239000013598 vector Substances 0.000 description 17
- 241000589158 Agrobacterium Species 0.000 description 15
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 15
- 239000000203 mixture Substances 0.000 description 15
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 14
- 244000005700 microbiome Species 0.000 description 14
- 239000000575 pesticide Substances 0.000 description 14
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 13
- 238000002360 preparation method Methods 0.000 description 13
- 238000005406 washing Methods 0.000 description 13
- 239000000499 gel Substances 0.000 description 12
- 108020004999 messenger RNA Proteins 0.000 description 12
- 239000000243 solution Substances 0.000 description 12
- 241000894007 species Species 0.000 description 12
- 210000001519 tissue Anatomy 0.000 description 12
- 241000255967 Helicoverpa zea Species 0.000 description 11
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 11
- 208000037065 Subacute sclerosing leukoencephalitis Diseases 0.000 description 11
- 206010042297 Subacute sclerosing panencephalitis Diseases 0.000 description 11
- 235000013305 food Nutrition 0.000 description 11
- 231100000419 toxicity Toxicity 0.000 description 11
- 230000001988 toxicity Effects 0.000 description 11
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 10
- 238000004458 analytical method Methods 0.000 description 10
- 108010008355 arginyl-glutamine Proteins 0.000 description 10
- 230000008859 change Effects 0.000 description 10
- 108010050848 glycylleucine Proteins 0.000 description 10
- 239000001963 growth medium Substances 0.000 description 10
- 238000012216 screening Methods 0.000 description 10
- 239000006228 supernatant Substances 0.000 description 10
- 102000004190 Enzymes Human genes 0.000 description 9
- 108090000790 Enzymes Proteins 0.000 description 9
- 241000255777 Lepidoptera Species 0.000 description 9
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 9
- 241000254099 Melolontha melolontha Species 0.000 description 9
- 241000700605 Viruses Species 0.000 description 9
- 229940088598 enzyme Drugs 0.000 description 9
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 9
- 108010034529 leucyl-lysine Proteins 0.000 description 9
- 108090000765 processed proteins & peptides Proteins 0.000 description 9
- 108010031719 prolyl-serine Proteins 0.000 description 9
- 230000001131 transforming effect Effects 0.000 description 9
- 241000193388 Bacillus thuringiensis Species 0.000 description 8
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 8
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 8
- 241000607735 Xenorhabdus nematophila Species 0.000 description 8
- 108010047495 alanylglycine Proteins 0.000 description 8
- 229940097012 bacillus thuringiensis Drugs 0.000 description 8
- 239000003795 chemical substances by application Substances 0.000 description 8
- 238000013461 design Methods 0.000 description 8
- 230000029087 digestion Effects 0.000 description 8
- 230000005714 functional activity Effects 0.000 description 8
- 231100000614 poison Toxicity 0.000 description 8
- 238000001556 precipitation Methods 0.000 description 8
- 230000001105 regulatory effect Effects 0.000 description 8
- 238000013519 translation Methods 0.000 description 8
- 241000880493 Leptailurus serval Species 0.000 description 7
- 241000244206 Nematoda Species 0.000 description 7
- 108091005804 Peptidases Proteins 0.000 description 7
- 239000004365 Protease Substances 0.000 description 7
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 7
- 239000011324 bead Substances 0.000 description 7
- 108010049041 glutamylalanine Proteins 0.000 description 7
- 230000000361 pesticidal effect Effects 0.000 description 7
- 239000002574 poison Substances 0.000 description 7
- 230000008569 process Effects 0.000 description 7
- 235000019419 proteases Nutrition 0.000 description 7
- 239000011780 sodium chloride Substances 0.000 description 7
- 239000002689 soil Substances 0.000 description 7
- 238000012546 transfer Methods 0.000 description 7
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 7
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 6
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 6
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 6
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 6
- 108010041407 alanylaspartic acid Proteins 0.000 description 6
- 108010044940 alanylglutamine Proteins 0.000 description 6
- 230000003321 amplification Effects 0.000 description 6
- 108010047857 aspartylglycine Proteins 0.000 description 6
- 108010092854 aspartyllysine Proteins 0.000 description 6
- 150000001875 compounds Chemical class 0.000 description 6
- 230000008034 disappearance Effects 0.000 description 6
- 230000002068 genetic effect Effects 0.000 description 6
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 6
- 230000012010 growth Effects 0.000 description 6
- 108010064235 lysylglycine Proteins 0.000 description 6
- 238000007726 management method Methods 0.000 description 6
- 239000003550 marker Substances 0.000 description 6
- 238000003199 nucleic acid amplification method Methods 0.000 description 6
- 238000005457 optimization Methods 0.000 description 6
- 108091008146 restriction endonucleases Proteins 0.000 description 6
- 238000000926 separation method Methods 0.000 description 6
- 230000009466 transformation Effects 0.000 description 6
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 5
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 5
- 241000721621 Myzus persicae Species 0.000 description 5
- 206010058667 Oral toxicity Diseases 0.000 description 5
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 5
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 5
- 108010005233 alanylglutamic acid Proteins 0.000 description 5
- 108010087924 alanylproline Proteins 0.000 description 5
- 108010077245 asparaginyl-proline Proteins 0.000 description 5
- 108010093581 aspartyl-proline Proteins 0.000 description 5
- 239000013613 expression plasmid Substances 0.000 description 5
- 239000000284 extract Substances 0.000 description 5
- 108010079547 glutamylmethionine Proteins 0.000 description 5
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 5
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 5
- 108010087823 glycyltyrosine Proteins 0.000 description 5
- 239000002917 insecticide Substances 0.000 description 5
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 5
- 239000007788 liquid Substances 0.000 description 5
- 239000000463 material Substances 0.000 description 5
- 238000002156 mixing Methods 0.000 description 5
- 231100000418 oral toxicity Toxicity 0.000 description 5
- JMANVNJQNLATNU-UHFFFAOYSA-N oxalonitrile Chemical compound N#CC#N JMANVNJQNLATNU-UHFFFAOYSA-N 0.000 description 5
- 239000008363 phosphate buffer Substances 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 239000000126 substance Substances 0.000 description 5
- 239000005418 vegetable material Substances 0.000 description 5
- 241000589156 Agrobacterium rhizogenes Species 0.000 description 4
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 4
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 4
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 4
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 4
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 4
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 4
- 108700010070 Codon Usage Proteins 0.000 description 4
- 241000254173 Coleoptera Species 0.000 description 4
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 4
- 102000053187 Glucuronidase Human genes 0.000 description 4
- 108010060309 Glucuronidase Proteins 0.000 description 4
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 4
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 4
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 4
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 4
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 4
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 4
- 108010065920 Insulin Lispro Proteins 0.000 description 4
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 4
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 4
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 4
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 4
- 241000589180 Rhizobium Species 0.000 description 4
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 4
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 4
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 4
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 4
- LKEKWDJCJSPXNI-IRIUXVKKSA-N Thr-Glu-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LKEKWDJCJSPXNI-IRIUXVKKSA-N 0.000 description 4
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 4
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 4
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 4
- 241000607479 Yersinia pestis Species 0.000 description 4
- 239000002253 acid Substances 0.000 description 4
- 239000000654 additive Substances 0.000 description 4
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 4
- 101150099105 alien gene Proteins 0.000 description 4
- 108010062796 arginyllysine Proteins 0.000 description 4
- 230000003115 biocidal effect Effects 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 4
- 229960005091 chloramphenicol Drugs 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 238000003967 crop rotation Methods 0.000 description 4
- 230000034994 death Effects 0.000 description 4
- 229960001484 edetic acid Drugs 0.000 description 4
- 238000001962 electrophoresis Methods 0.000 description 4
- 238000004520 electroporation Methods 0.000 description 4
- 238000013467 fragmentation Methods 0.000 description 4
- 238000006062 fragmentation reaction Methods 0.000 description 4
- 239000011521 glass Substances 0.000 description 4
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 4
- 235000011187 glycerol Nutrition 0.000 description 4
- 108010015792 glycyllysine Proteins 0.000 description 4
- 108010037850 glycylvaline Proteins 0.000 description 4
- 108010036413 histidylglycine Proteins 0.000 description 4
- 108010092114 histidylphenylalanine Proteins 0.000 description 4
- 238000011534 incubation Methods 0.000 description 4
- 238000003780 insertion Methods 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- 230000010354 integration Effects 0.000 description 4
- 238000005342 ion exchange Methods 0.000 description 4
- 229930027917 kanamycin Natural products 0.000 description 4
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical class O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 4
- 238000012423 maintenance Methods 0.000 description 4
- 239000011159 matrix material Substances 0.000 description 4
- 238000001840 matrix-assisted laser desorption--ionisation time-of-flight mass spectrometry Methods 0.000 description 4
- 239000002609 medium Substances 0.000 description 4
- 230000000813 microbial effect Effects 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 108010012581 phenylalanylglutamate Proteins 0.000 description 4
- 102000004196 processed proteins & peptides Human genes 0.000 description 4
- 230000009467 reduction Effects 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 4
- 238000006467 substitution reaction Methods 0.000 description 4
- 108010061238 threonyl-glycine Proteins 0.000 description 4
- 108010003137 tyrosyltyrosine Proteins 0.000 description 4
- 101150085703 vir gene Proteins 0.000 description 4
- 229920001817 Agar Polymers 0.000 description 3
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 3
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 3
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 3
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 3
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 3
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 3
- 241000588986 Alcaligenes Species 0.000 description 3
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 3
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 3
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 3
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 3
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 3
- 241000223651 Aureobasidium Species 0.000 description 3
- 241000589151 Azotobacter Species 0.000 description 3
- 108010077805 Bacterial Proteins Proteins 0.000 description 3
- UHOVQNZJYSORNB-UHFFFAOYSA-N Benzene Chemical compound C1=CC=CC=C1 UHOVQNZJYSORNB-UHFFFAOYSA-N 0.000 description 3
- 102000016938 Catalase Human genes 0.000 description 3
- 108010053835 Catalase Proteins 0.000 description 3
- 229920000742 Cotton Polymers 0.000 description 3
- 241001337994 Cryptococcus <scale insect> Species 0.000 description 3
- 241000255925 Diptera Species 0.000 description 3
- 238000002965 ELISA Methods 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 3
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 3
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 3
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 3
- UTOQQOMEJDPDMX-ACZMJKKPSA-N Gln-Ser-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O UTOQQOMEJDPDMX-ACZMJKKPSA-N 0.000 description 3
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 3
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 3
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 3
- LERGJIVJIIODPZ-ZANVPECISA-N Gly-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)C)C(O)=O)=CNC2=C1 LERGJIVJIIODPZ-ZANVPECISA-N 0.000 description 3
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 3
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 3
- 206010020649 Hyperkeratosis Diseases 0.000 description 3
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 3
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 3
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 3
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 3
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 3
- 241000235649 Kluyveromyces Species 0.000 description 3
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 3
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 3
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 3
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 3
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 3
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 3
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 3
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 3
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 3
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 3
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 3
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 3
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 3
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 3
- 108700026244 Open Reading Frames Proteins 0.000 description 3
- 229910019142 PO4 Inorganic materials 0.000 description 3
- 241000193418 Paenibacillus larvae Species 0.000 description 3
- 108091093037 Peptide nucleic acid Proteins 0.000 description 3
- 241001148062 Photorhabdus Species 0.000 description 3
- 241000254101 Popillia japonica Species 0.000 description 3
- 241000589540 Pseudomonas fluorescens Species 0.000 description 3
- 238000012300 Sequence Analysis Methods 0.000 description 3
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 3
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 3
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 3
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 3
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 3
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 3
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 3
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 3
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 3
- 241000607720 Serratia Species 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 3
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 3
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 3
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 3
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 3
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 3
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 3
- OJCISMMNNUNNJA-BZSNNMDCSA-N Tyr-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 OJCISMMNNUNNJA-BZSNNMDCSA-N 0.000 description 3
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 3
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 3
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 3
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 3
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 3
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 3
- 230000000996 additive effect Effects 0.000 description 3
- 239000008272 agar Substances 0.000 description 3
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 3
- 239000003513 alkali Substances 0.000 description 3
- 229960000723 ampicillin Drugs 0.000 description 3
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 3
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 3
- 108010038633 aspartylglutamate Proteins 0.000 description 3
- 108010068265 aspartyltyrosine Proteins 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- 238000004422 calculation algorithm Methods 0.000 description 3
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 3
- 238000005520 cutting process Methods 0.000 description 3
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 3
- 230000002255 enzymatic effect Effects 0.000 description 3
- 238000011156 evaluation Methods 0.000 description 3
- 238000012239 gene modification Methods 0.000 description 3
- 230000005017 genetic modification Effects 0.000 description 3
- 235000013617 genetically modified food Nutrition 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- 238000010438 heat treatment Methods 0.000 description 3
- 108010025306 histidylleucine Proteins 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 208000015181 infectious disease Diseases 0.000 description 3
- 238000002347 injection Methods 0.000 description 3
- 239000007924 injection Substances 0.000 description 3
- 108010078274 isoleucylvaline Proteins 0.000 description 3
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 3
- 231100000518 lethal Toxicity 0.000 description 3
- 230000001665 lethal effect Effects 0.000 description 3
- 239000007791 liquid phase Substances 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 108010038320 lysylphenylalanine Proteins 0.000 description 3
- 108010017391 lysylvaline Proteins 0.000 description 3
- 238000002844 melting Methods 0.000 description 3
- 230000008018 melting Effects 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 238000002703 mutagenesis Methods 0.000 description 3
- 231100000350 mutagenesis Toxicity 0.000 description 3
- 230000007935 neutral effect Effects 0.000 description 3
- 239000002751 oligonucleotide probe Substances 0.000 description 3
- 230000008520 organization Effects 0.000 description 3
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 3
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 3
- 239000010452 phosphate Substances 0.000 description 3
- 229920001184 polypeptide Polymers 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 239000000376 reactant Substances 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 108010071207 serylmethionine Proteins 0.000 description 3
- 238000001228 spectrum Methods 0.000 description 3
- 101150032575 tcdA gene Proteins 0.000 description 3
- 231100000331 toxic Toxicity 0.000 description 3
- 230000002588 toxic effect Effects 0.000 description 3
- 108010080629 tryptophan-leucine Proteins 0.000 description 3
- 108010051110 tyrosyl-lysine Proteins 0.000 description 3
- 241000701447 unidentified baculovirus Species 0.000 description 3
- 108010000998 wheylin-2 peptide Proteins 0.000 description 3
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- 108020004465 16S ribosomal RNA Proteins 0.000 description 2
- ZPLCXHWYPWVJDL-UHFFFAOYSA-N 4-[(4-hydroxyphenyl)methyl]-1,3-oxazolidin-2-one Chemical compound C1=CC(O)=CC=C1CC1NC(=O)OC1 ZPLCXHWYPWVJDL-UHFFFAOYSA-N 0.000 description 2
- 244000235858 Acetobacter xylinum Species 0.000 description 2
- 235000002837 Acetobacter xylinum Nutrition 0.000 description 2
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 2
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 2
- HGRBNYQIMKTUNT-XVYDVKMFSA-N Ala-Asn-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HGRBNYQIMKTUNT-XVYDVKMFSA-N 0.000 description 2
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 2
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 2
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 2
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 2
- JPGBXANAQYHTLA-DRZSPHRISA-N Ala-Gln-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JPGBXANAQYHTLA-DRZSPHRISA-N 0.000 description 2
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 2
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 2
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 2
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 2
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 2
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 2
- DCUCOIYYUBILPS-GUBZILKMSA-N Ala-Leu-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DCUCOIYYUBILPS-GUBZILKMSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 2
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 2
- VYMJAWXRWHJIMS-LKTVYLICSA-N Ala-Tyr-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VYMJAWXRWHJIMS-LKTVYLICSA-N 0.000 description 2
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 2
- PTVGLOCPAVYPFG-CIUDSAMLSA-N Arg-Gln-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PTVGLOCPAVYPFG-CIUDSAMLSA-N 0.000 description 2
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 2
- OCDJOVKIUJVUMO-SRVKXCTJSA-N Arg-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N OCDJOVKIUJVUMO-SRVKXCTJSA-N 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 2
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 2
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 2
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 2
- FMYQECOAIFGQGU-CYDGBPFRSA-N Arg-Val-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMYQECOAIFGQGU-CYDGBPFRSA-N 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- 241000186063 Arthrobacter Species 0.000 description 2
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 2
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 2
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 2
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 2
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 2
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 2
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 2
- SNAKIVFVLVUCKB-UHFFFAOYSA-N Asn-Glu-Ala-Lys Natural products NCCCCC(C(O)=O)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(N)CC(N)=O SNAKIVFVLVUCKB-UHFFFAOYSA-N 0.000 description 2
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 2
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 2
- MLJZMGIXXMTEPO-UBHSHLNASA-N Asn-Trp-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O MLJZMGIXXMTEPO-UBHSHLNASA-N 0.000 description 2
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 2
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 2
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 2
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 2
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 2
- ILJQISGMGXRZQQ-IHRRRGAJSA-N Asp-Arg-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ILJQISGMGXRZQQ-IHRRRGAJSA-N 0.000 description 2
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 2
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 2
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 2
- SJLDOGLMVPHPLZ-IHRRRGAJSA-N Asp-Met-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SJLDOGLMVPHPLZ-IHRRRGAJSA-N 0.000 description 2
- LKVKODXGSAFOFY-VEVYYDQMSA-N Asp-Met-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKVKODXGSAFOFY-VEVYYDQMSA-N 0.000 description 2
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 2
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 2
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 2
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 2
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 2
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 2
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 2
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 2
- XFTWUNOVBCHBJR-UHFFFAOYSA-N Aspergillomarasmine A Chemical compound OC(=O)C(N)CNC(C(O)=O)CNC(C(O)=O)CC(O)=O XFTWUNOVBCHBJR-UHFFFAOYSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 235000016068 Berberis vulgaris Nutrition 0.000 description 2
- 241000335053 Beta vulgaris Species 0.000 description 2
- 102400000739 Corticotropin Human genes 0.000 description 2
- 101800000414 Corticotropin Proteins 0.000 description 2
- 102000012410 DNA Ligases Human genes 0.000 description 2
- 108010061982 DNA Ligases Proteins 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 2
- 241000588698 Erwinia Species 0.000 description 2
- 101100437498 Escherichia coli (strain K12) uidA gene Proteins 0.000 description 2
- PLUBXMRUUVWRLT-UHFFFAOYSA-N Ethyl methanesulfonate Chemical compound CCOS(C)(=O)=O PLUBXMRUUVWRLT-UHFFFAOYSA-N 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 2
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 2
- GMGKDVVBSVVKCT-NUMRIWBASA-N Gln-Asn-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GMGKDVVBSVVKCT-NUMRIWBASA-N 0.000 description 2
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 2
- RBWKVOSARCFSQQ-FXQIFTODSA-N Gln-Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O RBWKVOSARCFSQQ-FXQIFTODSA-N 0.000 description 2
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 2
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 2
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 2
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 2
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 2
- PIUPHASDUFSHTF-CIUDSAMLSA-N Gln-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O PIUPHASDUFSHTF-CIUDSAMLSA-N 0.000 description 2
- YJSCHRBERYWPQL-DCAQKATOSA-N Gln-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N YJSCHRBERYWPQL-DCAQKATOSA-N 0.000 description 2
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 2
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 2
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 2
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 2
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 2
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 2
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 2
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 2
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 2
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 2
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 2
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 2
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 2
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 2
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 2
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 2
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 2
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 2
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 2
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 2
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 2
- VAXIVIPMCTYSHI-YUMQZZPRSA-N Gly-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN VAXIVIPMCTYSHI-YUMQZZPRSA-N 0.000 description 2
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 2
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 2
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 2
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- 241000256244 Heliothis virescens Species 0.000 description 2
- WCNXUTNLSRWWQN-DCAQKATOSA-N His-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WCNXUTNLSRWWQN-DCAQKATOSA-N 0.000 description 2
- 101000963974 Hydrophis stokesii Alpha-elapitoxin-Ast2b Proteins 0.000 description 2
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 2
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 2
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 2
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 2
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 2
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 2
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 2
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 2
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 2
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 2
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 2
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 2
- HQLSBZFLOUHQJK-STECZYCISA-N Ile-Tyr-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HQLSBZFLOUHQJK-STECZYCISA-N 0.000 description 2
- 229930010555 Inosine Natural products 0.000 description 2
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 2
- 241000588748 Klebsiella Species 0.000 description 2
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 2
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 2
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 2
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 2
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 2
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 2
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 2
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 2
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 2
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 2
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 2
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 2
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 2
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 2
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 2
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 2
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 2
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 2
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 2
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 2
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 2
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 2
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 2
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 2
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 2
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 2
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 2
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 2
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 2
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 2
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 2
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 2
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 2
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 2
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 2
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 2
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 2
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 2
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 2
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 2
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 2
- 241000255908 Manduca sexta Species 0.000 description 2
- MCNGIXXCMJAURZ-VEVYYDQMSA-N Met-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCSC)N)O MCNGIXXCMJAURZ-VEVYYDQMSA-N 0.000 description 2
- NCVJJAJVWILAGI-SRVKXCTJSA-N Met-Gln-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NCVJJAJVWILAGI-SRVKXCTJSA-N 0.000 description 2
- MXEASDMFHUKOGE-ULQDDVLXSA-N Met-His-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MXEASDMFHUKOGE-ULQDDVLXSA-N 0.000 description 2
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 2
- 241000409991 Mythimna separata Species 0.000 description 2
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 229910002651 NO3 Inorganic materials 0.000 description 2
- 241001443590 Naganishia albida Species 0.000 description 2
- 241000033319 Naganishia diffluens Species 0.000 description 2
- 101000964025 Naja naja Long neurotoxin 3 Proteins 0.000 description 2
- 101000822778 Naja naja Long neurotoxin 4 Proteins 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 244000061176 Nicotiana tabacum Species 0.000 description 2
- NHNBFGGVMKEFGY-UHFFFAOYSA-N Nitrate Chemical compound [O-][N+]([O-])=O NHNBFGGVMKEFGY-UHFFFAOYSA-N 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- 241000179039 Paenibacillus Species 0.000 description 2
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 2
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 2
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 2
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 2
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 2
- FZBGMXYQPACKNC-HJWJTTGWSA-N Phe-Pro-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FZBGMXYQPACKNC-HJWJTTGWSA-N 0.000 description 2
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 2
- BAONJAHBAUDJKA-BZSNNMDCSA-N Phe-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 BAONJAHBAUDJKA-BZSNNMDCSA-N 0.000 description 2
- 241000255969 Pieris brassicae Species 0.000 description 2
- 108020005089 Plant RNA Proteins 0.000 description 2
- 229920001213 Polysorbate 20 Polymers 0.000 description 2
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 2
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 2
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 2
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 2
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 2
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 2
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 2
- FISHYTLIMUYTQY-GUBZILKMSA-N Pro-Gln-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 FISHYTLIMUYTQY-GUBZILKMSA-N 0.000 description 2
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 2
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 2
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 2
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 2
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 2
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 2
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 2
- BVRBCQBUNGAWFP-KKUMJFAQSA-N Pro-Tyr-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O BVRBCQBUNGAWFP-KKUMJFAQSA-N 0.000 description 2
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 2
- SHTKRJHDMNSKRM-ULQDDVLXSA-N Pro-Tyr-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O SHTKRJHDMNSKRM-ULQDDVLXSA-N 0.000 description 2
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 2
- 241000169446 Promethis Species 0.000 description 2
- 241000589516 Pseudomonas Species 0.000 description 2
- 241000589615 Pseudomonas syringae Species 0.000 description 2
- 241000191043 Rhodobacter sphaeroides Species 0.000 description 2
- 241000190932 Rhodopseudomonas Species 0.000 description 2
- 241000223252 Rhodotorula Species 0.000 description 2
- 241000223254 Rhodotorula mucilaginosa Species 0.000 description 2
- 241000242583 Scyphozoa Species 0.000 description 2
- QGMLKFGTGXWAHF-IHRRRGAJSA-N Ser-Arg-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGMLKFGTGXWAHF-IHRRRGAJSA-N 0.000 description 2
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 2
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 2
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 2
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 2
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 2
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 2
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 2
- 241000607715 Serratia marcescens Species 0.000 description 2
- 101710084578 Short neurotoxin 1 Proteins 0.000 description 2
- 241000222068 Sporobolomyces <Sporidiobolaceae> Species 0.000 description 2
- 241000187747 Streptomyces Species 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-L Sulfate Chemical compound [O-]S([O-])(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-L 0.000 description 2
- 241001514658 Symmetrospora marina Species 0.000 description 2
- 108700005078 Synthetic Genes Proteins 0.000 description 2
- 241000589500 Thermus aquaticus Species 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 2
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 2
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 2
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 2
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 2
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 2
- VGNLMPBYWWNQFS-ZEILLAHLSA-N Thr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O VGNLMPBYWWNQFS-ZEILLAHLSA-N 0.000 description 2
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 2
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 2
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- 101710182223 Toxin B Proteins 0.000 description 2
- 101710182532 Toxin a Proteins 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- PNKDNKGMEHJTJQ-BPUTZDHNSA-N Trp-Arg-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N PNKDNKGMEHJTJQ-BPUTZDHNSA-N 0.000 description 2
- LHHDBONOFZDWMW-AAEUAGOBSA-N Trp-Asp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LHHDBONOFZDWMW-AAEUAGOBSA-N 0.000 description 2
- RRVUOLRWIZXBRQ-IHPCNDPISA-N Trp-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RRVUOLRWIZXBRQ-IHPCNDPISA-N 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- SDNVRAKIJVKAGS-LKTVYLICSA-N Tyr-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N SDNVRAKIJVKAGS-LKTVYLICSA-N 0.000 description 2
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 2
- XMNDQSYABVWZRK-BZSNNMDCSA-N Tyr-Asn-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XMNDQSYABVWZRK-BZSNNMDCSA-N 0.000 description 2
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 2
- NMKJPMCEKQHRPD-IRXDYDNUSA-N Tyr-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NMKJPMCEKQHRPD-IRXDYDNUSA-N 0.000 description 2
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 2
- YKBUNNNRNZZUID-UFYCRDLUSA-N Tyr-Val-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YKBUNNNRNZZUID-UFYCRDLUSA-N 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 2
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 2
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 2
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 2
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 2
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 2
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 2
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 2
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 2
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 2
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 2
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 2
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 2
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 241000082085 Verticillium <Phyllachorales> Species 0.000 description 2
- 241000589634 Xanthomonas Species 0.000 description 2
- 241000589636 Xanthomonas campestris Species 0.000 description 2
- 235000004279 alanine Nutrition 0.000 description 2
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 230000000692 anti-sense effect Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 2
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 2
- 235000009582 asparagine Nutrition 0.000 description 2
- 229960001230 asparagine Drugs 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- 230000000975 bioactive effect Effects 0.000 description 2
- 230000029918 bioluminescence Effects 0.000 description 2
- 238000005415 bioluminescence Methods 0.000 description 2
- 238000009395 breeding Methods 0.000 description 2
- 230000001488 breeding effect Effects 0.000 description 2
- 229940041514 candida albicans extract Drugs 0.000 description 2
- 230000006037 cell lysis Effects 0.000 description 2
- 235000013339 cereals Nutrition 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 230000001276 controlling effect Effects 0.000 description 2
- IDLFZVILOHSSID-OVLDLUHVSA-N corticotropin Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)NC(=O)[C@@H](N)CO)C1=CC=C(O)C=C1 IDLFZVILOHSSID-OVLDLUHVSA-N 0.000 description 2
- 229960000258 corticotropin Drugs 0.000 description 2
- 230000001186 cumulative effect Effects 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- 235000018417 cysteine Nutrition 0.000 description 2
- 230000001472 cytotoxic effect Effects 0.000 description 2
- 230000006378 damage Effects 0.000 description 2
- 230000009849 deactivation Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 238000001035 drying Methods 0.000 description 2
- 241001493065 dsRNA viruses Species 0.000 description 2
- 238000010828 elution Methods 0.000 description 2
- 230000000967 entomopathogenic effect Effects 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 230000002349 favourable effect Effects 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- BRZYSWJRSDMWLG-CAXSIQPQSA-N geneticin Natural products O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](C(C)O)O2)N)[C@@H](N)C[C@H]1N BRZYSWJRSDMWLG-CAXSIQPQSA-N 0.000 description 2
- 235000013922 glutamic acid Nutrition 0.000 description 2
- 239000004220 glutamic acid Substances 0.000 description 2
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 239000005090 green fluorescent protein Substances 0.000 description 2
- 230000009036 growth inhibition Effects 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 231100000086 high toxicity Toxicity 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 238000002744 homologous recombination Methods 0.000 description 2
- 230000006801 homologous recombination Effects 0.000 description 2
- 229940088597 hormone Drugs 0.000 description 2
- 239000005556 hormone Substances 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 230000001965 increasing effect Effects 0.000 description 2
- 238000011081 inoculation Methods 0.000 description 2
- 229960003786 inosine Drugs 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 210000000936 intestine Anatomy 0.000 description 2
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- 210000001161 mammalian embryo Anatomy 0.000 description 2
- 230000010534 mechanism of action Effects 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 230000002503 metabolic effect Effects 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 239000011259 mixed solution Substances 0.000 description 2
- 235000015097 nutrients Nutrition 0.000 description 2
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 2
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 2
- 238000004321 preservation Methods 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 230000018883 protein targeting Effects 0.000 description 2
- 230000008929 regeneration Effects 0.000 description 2
- 238000011069 regeneration method Methods 0.000 description 2
- 239000012744 reinforcing agent Substances 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 230000028327 secretion Effects 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 230000035939 shock Effects 0.000 description 2
- 238000005507 spraying Methods 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- 239000011550 stock solution Substances 0.000 description 2
- 239000004094 surface-active agent Substances 0.000 description 2
- 230000004083 survival effect Effects 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 2
- 239000003440 toxic substance Substances 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 230000009261 transgenic effect Effects 0.000 description 2
- 108700004896 tripeptide FEG Proteins 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 108010038745 tryptophylglycine Proteins 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 2
- 108010078580 tyrosylleucine Proteins 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- 239000013603 viral vector Substances 0.000 description 2
- 230000003612 virological effect Effects 0.000 description 2
- 239000012138 yeast extract Substances 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 1
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- PKOHVHWNGUHYRE-ZFWWWQNUSA-N (2s)-1-[2-[[(2s)-2-amino-3-(1h-indol-3-yl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)NCC(=O)N1CCC[C@H]1C(O)=O PKOHVHWNGUHYRE-ZFWWWQNUSA-N 0.000 description 1
- VKYBWTVHQHFSJL-ZATYTLRZSA-N (2s)-2,5-diamino-5-oxopentanoic acid;(2s)-pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1.OC(=O)[C@@H](N)CCC(N)=O VKYBWTVHQHFSJL-ZATYTLRZSA-N 0.000 description 1
- YUXKOWPNKJSTPQ-AXWWPMSFSA-N (2s,3r)-2-amino-3-hydroxybutanoic acid;(2s)-2-amino-3-hydroxypropanoic acid Chemical compound OC[C@H](N)C(O)=O.C[C@@H](O)[C@H](N)C(O)=O YUXKOWPNKJSTPQ-AXWWPMSFSA-N 0.000 description 1
- NMNYMRMXUPRAKF-TYHJCQIPSA-N (2s,3s)-2-amino-3-methylpentanoic acid;(2s)-2,6-diaminohexanoic acid Chemical compound CC[C@H](C)[C@H](N)C(O)=O.NCCCC[C@H](N)C(O)=O NMNYMRMXUPRAKF-TYHJCQIPSA-N 0.000 description 1
- HGHOBRRUMWJWCU-FXQIFTODSA-N (4s)-4-[[(2s)-2-aminopropanoyl]amino]-5-[[(2s)-3-carboxy-1-(carboxymethylamino)-1-oxopropan-2-yl]amino]-5-oxopentanoic acid Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O HGHOBRRUMWJWCU-FXQIFTODSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- IAJOBQBIJHVGMQ-UHFFFAOYSA-N 2-amino-4-[hydroxy(methyl)phosphoryl]butanoic acid Chemical compound CP(O)(=O)CCC(N)C(O)=O IAJOBQBIJHVGMQ-UHFFFAOYSA-N 0.000 description 1
- AXAVXPMQTGXXJZ-UHFFFAOYSA-N 2-aminoacetic acid;2-amino-2-(hydroxymethyl)propane-1,3-diol Chemical compound NCC(O)=O.OCC(N)(CO)CO AXAVXPMQTGXXJZ-UHFFFAOYSA-N 0.000 description 1
- WTLKTXIHIHFSGU-UHFFFAOYSA-N 2-nitrosoguanidine Chemical compound NC(N)=NN=O WTLKTXIHIHFSGU-UHFFFAOYSA-N 0.000 description 1
- UPMXNNIRAGDFEH-UHFFFAOYSA-N 3,5-dibromo-4-hydroxybenzonitrile Chemical compound OC1=C(Br)C=C(C#N)C=C1Br UPMXNNIRAGDFEH-UHFFFAOYSA-N 0.000 description 1
- CAAMSDWKXXPUJR-UHFFFAOYSA-N 3,5-dihydro-4H-imidazol-4-one Chemical compound O=C1CNC=N1 CAAMSDWKXXPUJR-UHFFFAOYSA-N 0.000 description 1
- QFVHZQCOUORWEI-UHFFFAOYSA-N 4-[(4-anilino-5-sulfonaphthalen-1-yl)diazenyl]-5-hydroxynaphthalene-2,7-disulfonic acid Chemical compound C=12C(O)=CC(S(O)(=O)=O)=CC2=CC(S(O)(=O)=O)=CC=1N=NC(C1=CC=CC(=C11)S(O)(=O)=O)=CC=C1NC1=CC=CC=C1 QFVHZQCOUORWEI-UHFFFAOYSA-N 0.000 description 1
- 241000238876 Acari Species 0.000 description 1
- 241000589220 Acetobacter Species 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 241000242764 Aequorea victoria Species 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 1
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- PJNSIUPOXFBHDM-GUBZILKMSA-N Ala-Arg-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O PJNSIUPOXFBHDM-GUBZILKMSA-N 0.000 description 1
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 1
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 1
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- FOWHQTWRLFTELJ-FXQIFTODSA-N Ala-Asp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N FOWHQTWRLFTELJ-FXQIFTODSA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- NFDVJAKFMXHJEQ-HERUPUMHSA-N Ala-Asp-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NFDVJAKFMXHJEQ-HERUPUMHSA-N 0.000 description 1
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 1
- PWYFCPCBOYMOGB-LKTVYLICSA-N Ala-Gln-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N PWYFCPCBOYMOGB-LKTVYLICSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 1
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 1
- JEPNLGMEZMCFEX-QSFUFRPTSA-N Ala-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N JEPNLGMEZMCFEX-QSFUFRPTSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 1
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 1
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 1
- FVNAUOZKIPAYNA-BPNCWPANSA-N Ala-Met-Tyr Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FVNAUOZKIPAYNA-BPNCWPANSA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- RNHKOQHGYMTHFR-UBHSHLNASA-N Ala-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 RNHKOQHGYMTHFR-UBHSHLNASA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 1
- SGFBVLBKDSXGAP-GKCIPKSASA-N Ala-Phe-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N SGFBVLBKDSXGAP-GKCIPKSASA-N 0.000 description 1
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 1
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 1
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 1
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- IEAUDUOCWNPZBR-LKTVYLICSA-N Ala-Trp-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IEAUDUOCWNPZBR-LKTVYLICSA-N 0.000 description 1
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 1
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 1
- DEAGTWNKODHUIY-MRFFXTKBSA-N Ala-Tyr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DEAGTWNKODHUIY-MRFFXTKBSA-N 0.000 description 1
- SSQHYGLFYWZWDV-UVBJJODRSA-N Ala-Val-Trp Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O SSQHYGLFYWZWDV-UVBJJODRSA-N 0.000 description 1
- 101900318283 Alfalfa mosaic virus Capsid protein Proteins 0.000 description 1
- 101100437895 Alternaria brassicicola bsc3 gene Proteins 0.000 description 1
- 101100165658 Alternaria brassicicola bsc5 gene Proteins 0.000 description 1
- 244000153158 Ammi visnaga Species 0.000 description 1
- 235000010585 Ammi visnaga Nutrition 0.000 description 1
- 102400000344 Angiotensin-1 Human genes 0.000 description 1
- 101800000734 Angiotensin-1 Proteins 0.000 description 1
- 206010002660 Anoxia Diseases 0.000 description 1
- 241000976983 Anoxia Species 0.000 description 1
- 241000254175 Anthonomus grandis Species 0.000 description 1
- 241000256844 Apis mellifera Species 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- XPSGESXVBSQZPL-SRVKXCTJSA-N Arg-Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XPSGESXVBSQZPL-SRVKXCTJSA-N 0.000 description 1
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 1
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 1
- BHSYMWWMVRPCPA-CYDGBPFRSA-N Arg-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N BHSYMWWMVRPCPA-CYDGBPFRSA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- JTKLCCFLSLCCST-SZMVWBNQSA-N Arg-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JTKLCCFLSLCCST-SZMVWBNQSA-N 0.000 description 1
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 1
- USNSOPDIZILSJP-FXQIFTODSA-N Arg-Asn-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O USNSOPDIZILSJP-FXQIFTODSA-N 0.000 description 1
- DPNHSNLIULPOBH-GUBZILKMSA-N Arg-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DPNHSNLIULPOBH-GUBZILKMSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 1
- QIWYWCYNUMJBTC-CIUDSAMLSA-N Arg-Cys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIWYWCYNUMJBTC-CIUDSAMLSA-N 0.000 description 1
- IGULQRCJLQQPSM-DCAQKATOSA-N Arg-Cys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IGULQRCJLQQPSM-DCAQKATOSA-N 0.000 description 1
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 1
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 1
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- QEHMMRSQJMOYNO-DCAQKATOSA-N Arg-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QEHMMRSQJMOYNO-DCAQKATOSA-N 0.000 description 1
- PZVMBNFTBWQWQL-DCAQKATOSA-N Arg-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PZVMBNFTBWQWQL-DCAQKATOSA-N 0.000 description 1
- OFIYLHVAAJYRBC-HJWJTTGWSA-N Arg-Ile-Phe Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O OFIYLHVAAJYRBC-HJWJTTGWSA-N 0.000 description 1
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 1
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 1
- NIUDXSFNLBIWOB-DCAQKATOSA-N Arg-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NIUDXSFNLBIWOB-DCAQKATOSA-N 0.000 description 1
- IIAXFBUTKIDDIP-ULQDDVLXSA-N Arg-Leu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IIAXFBUTKIDDIP-ULQDDVLXSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 1
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 1
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 1
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 1
- NIELFHOLFTUZME-HJWJTTGWSA-N Arg-Phe-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NIELFHOLFTUZME-HJWJTTGWSA-N 0.000 description 1
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 1
- XFXZKCRBBOVJKS-BVSLBCMMSA-N Arg-Phe-Trp Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 XFXZKCRBBOVJKS-BVSLBCMMSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 1
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 1
- SYFHFLGAROUHNT-VEVYYDQMSA-N Arg-Thr-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SYFHFLGAROUHNT-VEVYYDQMSA-N 0.000 description 1
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 1
- DRDWXKWUSIKKOB-PJODQICGSA-N Arg-Trp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O DRDWXKWUSIKKOB-PJODQICGSA-N 0.000 description 1
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 1
- PJOPLXOCKACMLK-KKUMJFAQSA-N Arg-Tyr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PJOPLXOCKACMLK-KKUMJFAQSA-N 0.000 description 1
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 1
- CNBIWSCSSCAINS-UFYCRDLUSA-N Arg-Tyr-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNBIWSCSSCAINS-UFYCRDLUSA-N 0.000 description 1
- KEZVOBAKAXHMOF-GUBZILKMSA-N Arg-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N KEZVOBAKAXHMOF-GUBZILKMSA-N 0.000 description 1
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 1
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- NTXNUXPCNRDMAF-WFBYXXMGSA-N Asn-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC(N)=O)C)C(O)=O)=CNC2=C1 NTXNUXPCNRDMAF-WFBYXXMGSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 1
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 1
- PIWWUBYJNONVTJ-ZLUOBGJFSA-N Asn-Asp-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N PIWWUBYJNONVTJ-ZLUOBGJFSA-N 0.000 description 1
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 1
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 1
- WQSCVMQDZYTFQU-FXQIFTODSA-N Asn-Cys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WQSCVMQDZYTFQU-FXQIFTODSA-N 0.000 description 1
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 1
- HJRBIWRXULGMOA-ACZMJKKPSA-N Asn-Gln-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJRBIWRXULGMOA-ACZMJKKPSA-N 0.000 description 1
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 1
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 1
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 1
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- OWUCNXMFJRFOFI-BQBZGAKWSA-N Asn-Gly-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OWUCNXMFJRFOFI-BQBZGAKWSA-N 0.000 description 1
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 1
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 1
- GOKCTAJWRPSCHP-VHWLVUOQSA-N Asn-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)N)N GOKCTAJWRPSCHP-VHWLVUOQSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 1
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 1
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 1
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 1
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 1
- VITDJIPIJZAVGC-VEVYYDQMSA-N Asn-Met-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VITDJIPIJZAVGC-VEVYYDQMSA-N 0.000 description 1
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 1
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 1
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 1
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 1
- DAYDURRBMDCCFL-AAEUAGOBSA-N Asn-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N DAYDURRBMDCCFL-AAEUAGOBSA-N 0.000 description 1
- SKQTXVZTCGSRJS-SRVKXCTJSA-N Asn-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O SKQTXVZTCGSRJS-SRVKXCTJSA-N 0.000 description 1
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 1
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 1
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 1
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 1
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 1
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
- DBWYWXNMZZYIRY-LPEHRKFASA-N Asp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O DBWYWXNMZZYIRY-LPEHRKFASA-N 0.000 description 1
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 1
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 1
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 1
- WLKVEEODTPQPLI-ACZMJKKPSA-N Asp-Gln-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WLKVEEODTPQPLI-ACZMJKKPSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 1
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 1
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- CRNKLABLTICXDV-GUBZILKMSA-N Asp-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N CRNKLABLTICXDV-GUBZILKMSA-N 0.000 description 1
- ICZWAZVKLACMKR-CIUDSAMLSA-N Asp-His-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 ICZWAZVKLACMKR-CIUDSAMLSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- AKKUDRZKFZWPBH-SRVKXCTJSA-N Asp-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N AKKUDRZKFZWPBH-SRVKXCTJSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 1
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 1
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- XAPPCWUWHNWCPQ-PBCZWWQYSA-N Asp-Thr-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XAPPCWUWHNWCPQ-PBCZWWQYSA-N 0.000 description 1
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- ZVYYMCXVPZEAPU-CWRNSKLLSA-N Asp-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZVYYMCXVPZEAPU-CWRNSKLLSA-N 0.000 description 1
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 1
- 108010083946 Asp-Tyr-Leu-Lys Proteins 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 1
- 241000589149 Azotobacter vinelandii Species 0.000 description 1
- 108700003918 Bacillus Thuringiensis insecticidal crystal Proteins 0.000 description 1
- 101100032924 Bacillus subtilis (strain 168) radA gene Proteins 0.000 description 1
- 108010023063 Bacto-peptone Proteins 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 108010006654 Bleomycin Proteins 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 108010003379 Brassica napus oleosin protein Proteins 0.000 description 1
- 239000005489 Bromoxynil Substances 0.000 description 1
- 101150078024 CRY2 gene Proteins 0.000 description 1
- 101100465058 Caenorhabditis elegans prk-2 gene Proteins 0.000 description 1
- 108090000565 Capsid Proteins Proteins 0.000 description 1
- 101710132601 Capsid protein Proteins 0.000 description 1
- BVKZGUZCCUSVTD-UHFFFAOYSA-L Carbonate Chemical compound [O-]C([O-])=O BVKZGUZCCUSVTD-UHFFFAOYSA-L 0.000 description 1
- 241000701489 Cauliflower mosaic virus Species 0.000 description 1
- 102100023321 Ceruloplasmin Human genes 0.000 description 1
- LRBQNJMCXXYXIU-PPKXGCFTSA-N Chinese gallotannin Chemical compound OC1=C(O)C(O)=CC(C(=O)OC=2C(=C(O)C=C(C=2)C(=O)OC[C@@H]2[C@H]([C@H](OC(=O)C=3C=C(OC(=O)C=4C=C(O)C(O)=C(O)C=4)C(O)=C(O)C=3)[C@@H](OC(=O)C=3C=C(OC(=O)C=4C=C(O)C(O)=C(O)C=4)C(O)=C(O)C=3)[C@H](OC(=O)C=3C=C(OC(=O)C=4C=C(O)C(O)=C(O)C=4)C(O)=C(O)C=3)O2)OC(=O)C=2C=C(OC(=O)C=3C=C(O)C(O)=C(O)C=3)C(O)=C(O)C=2)O)=C1 LRBQNJMCXXYXIU-PPKXGCFTSA-N 0.000 description 1
- 101710094648 Coat protein Proteins 0.000 description 1
- 108091033380 Coding strand Proteins 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 239000000055 Corticotropin-Releasing Hormone Substances 0.000 description 1
- 241000195493 Cryptophyta Species 0.000 description 1
- 101710151559 Crystal protein Proteins 0.000 description 1
- 241000252867 Cupriavidus metallidurans Species 0.000 description 1
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 1
- RFHGRMMADHHQSA-KBIXCLLPSA-N Cys-Gln-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RFHGRMMADHHQSA-KBIXCLLPSA-N 0.000 description 1
- SFRQEQGPRTVDPO-NRPADANISA-N Cys-Gln-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O SFRQEQGPRTVDPO-NRPADANISA-N 0.000 description 1
- PQHYZJPCYRDYNE-QWRGUYRKSA-N Cys-Gly-Phe Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PQHYZJPCYRDYNE-QWRGUYRKSA-N 0.000 description 1
- CMYVIUWVYHOLRD-ZLUOBGJFSA-N Cys-Ser-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CMYVIUWVYHOLRD-ZLUOBGJFSA-N 0.000 description 1
- LKHMGNHQULEPFY-ACZMJKKPSA-N Cys-Ser-Glu Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O LKHMGNHQULEPFY-ACZMJKKPSA-N 0.000 description 1
- SAEVTQWAYDPXMU-KATARQTJSA-N Cys-Thr-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O SAEVTQWAYDPXMU-KATARQTJSA-N 0.000 description 1
- DXSBGVKEPHDOTD-UBHSHLNASA-N Cys-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N DXSBGVKEPHDOTD-UBHSHLNASA-N 0.000 description 1
- 108010066133 D-octopine dehydrogenase Proteins 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 238000007399 DNA isolation Methods 0.000 description 1
- 239000003298 DNA probe Substances 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- NDUPDOJHUQKPAG-UHFFFAOYSA-N Dalapon Chemical compound CC(Cl)(Cl)C(O)=O NDUPDOJHUQKPAG-UHFFFAOYSA-N 0.000 description 1
- 241000289763 Dasygaster padockina Species 0.000 description 1
- 241000238557 Decapoda Species 0.000 description 1
- 208000034423 Delivery Diseases 0.000 description 1
- 241000489973 Diabrotica undecimpunctata Species 0.000 description 1
- 241000489976 Diabrotica undecimpunctata howardi Species 0.000 description 1
- 101100492392 Didymella fabae pksAC gene Proteins 0.000 description 1
- 241001303048 Ditta Species 0.000 description 1
- 102000002322 Egg Proteins Human genes 0.000 description 1
- 108010000912 Egg Proteins Proteins 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 108010067770 Endopeptidase K Proteins 0.000 description 1
- 241000305071 Enterobacterales Species 0.000 description 1
- 241000588921 Enterobacteriaceae Species 0.000 description 1
- 108090000371 Esterases Proteins 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- 108050001049 Extracellular proteins Proteins 0.000 description 1
- 210000000712 G cell Anatomy 0.000 description 1
- 108010010803 Gelatin Proteins 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 1
- KZKBJEUWNMQTLV-XDTLVQLUSA-N Gln-Ala-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZKBJEUWNMQTLV-XDTLVQLUSA-N 0.000 description 1
- MQANCSUBSBJNLU-KKUMJFAQSA-N Gln-Arg-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQANCSUBSBJNLU-KKUMJFAQSA-N 0.000 description 1
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 1
- CKNUKHBRCSMKMO-XHNCKOQMSA-N Gln-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O CKNUKHBRCSMKMO-XHNCKOQMSA-N 0.000 description 1
- RMOCFPBLHAOTDU-ACZMJKKPSA-N Gln-Asn-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RMOCFPBLHAOTDU-ACZMJKKPSA-N 0.000 description 1
- ULXXDWZMMSQBDC-ACZMJKKPSA-N Gln-Asp-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ULXXDWZMMSQBDC-ACZMJKKPSA-N 0.000 description 1
- DHNWZLGBTPUTQQ-QEJZJMRPSA-N Gln-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N DHNWZLGBTPUTQQ-QEJZJMRPSA-N 0.000 description 1
- OFPWCBGRYAOLMU-AVGNSLFASA-N Gln-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OFPWCBGRYAOLMU-AVGNSLFASA-N 0.000 description 1
- OIIIRRTWYLCQNW-ACZMJKKPSA-N Gln-Cys-Asn Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O OIIIRRTWYLCQNW-ACZMJKKPSA-N 0.000 description 1
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 1
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 1
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 1
- WVUZERSNWGUKJY-BPUTZDHNSA-N Gln-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N WVUZERSNWGUKJY-BPUTZDHNSA-N 0.000 description 1
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 1
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 1
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 1
- YXQCLIVLWCKCRS-RYUDHWBXSA-N Gln-Gly-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N)O YXQCLIVLWCKCRS-RYUDHWBXSA-N 0.000 description 1
- BVELAHPZLYLZDJ-HGNGGELXSA-N Gln-His-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O BVELAHPZLYLZDJ-HGNGGELXSA-N 0.000 description 1
- KQOPMGBHNQBCEL-HVTMNAMFSA-N Gln-His-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KQOPMGBHNQBCEL-HVTMNAMFSA-N 0.000 description 1
- IWUFOVSLWADEJC-AVGNSLFASA-N Gln-His-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IWUFOVSLWADEJC-AVGNSLFASA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 1
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 1
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 1
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 1
- XZLLTYBONVKGLO-SDDRHHMPSA-N Gln-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XZLLTYBONVKGLO-SDDRHHMPSA-N 0.000 description 1
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 1
- QMVCEWKHIUHTSD-GUBZILKMSA-N Gln-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QMVCEWKHIUHTSD-GUBZILKMSA-N 0.000 description 1
- HHRAEXBUNGTOGZ-IHRRRGAJSA-N Gln-Phe-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O HHRAEXBUNGTOGZ-IHRRRGAJSA-N 0.000 description 1
- ZVQZXPADLZIQFF-FHWLQOOXSA-N Gln-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 ZVQZXPADLZIQFF-FHWLQOOXSA-N 0.000 description 1
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 1
- FGWRYRAVBVOHIB-XIRDDKMYSA-N Gln-Pro-Trp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O FGWRYRAVBVOHIB-XIRDDKMYSA-N 0.000 description 1
- DCWNCMRZIZSZBL-KKUMJFAQSA-N Gln-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O DCWNCMRZIZSZBL-KKUMJFAQSA-N 0.000 description 1
- RWQCWSGOOOEGPB-FXQIFTODSA-N Gln-Ser-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O RWQCWSGOOOEGPB-FXQIFTODSA-N 0.000 description 1
- SXFPZRRVWSUYII-KBIXCLLPSA-N Gln-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N SXFPZRRVWSUYII-KBIXCLLPSA-N 0.000 description 1
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 1
- GHAXJVNBAKGWEJ-AVGNSLFASA-N Gln-Ser-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GHAXJVNBAKGWEJ-AVGNSLFASA-N 0.000 description 1
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 1
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 1
- DITJVHONFRJKJW-BPUTZDHNSA-N Gln-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DITJVHONFRJKJW-BPUTZDHNSA-N 0.000 description 1
- RGNMNWULPAYDAH-JSGCOSHPSA-N Gln-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N RGNMNWULPAYDAH-JSGCOSHPSA-N 0.000 description 1
- BJVBMSTUUWGZKX-JYJNAYRXSA-N Gln-Tyr-His Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BJVBMSTUUWGZKX-JYJNAYRXSA-N 0.000 description 1
- AKDOUBMVLRCHBD-SIUGBPQLSA-N Gln-Tyr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AKDOUBMVLRCHBD-SIUGBPQLSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- 108010044091 Globulins Proteins 0.000 description 1
- 102000006395 Globulins Human genes 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- OJGLIOXAKGFFDW-SRVKXCTJSA-N Glu-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N OJGLIOXAKGFFDW-SRVKXCTJSA-N 0.000 description 1
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 1
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- QBLCUWAGTGRXAY-UHFFFAOYSA-N Glu-Glu-Tyr-Tyr Chemical compound C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(NC(=O)C(CCC(O)=O)NC(=O)C(CCC(O)=O)N)CC1=CC=C(O)C=C1 QBLCUWAGTGRXAY-UHFFFAOYSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 1
- BKRQSECBKKCCKW-HVTMNAMFSA-N Glu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N BKRQSECBKKCCKW-HVTMNAMFSA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 1
- UMHRCVCZUPBBQW-GARJFASQSA-N Glu-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UMHRCVCZUPBBQW-GARJFASQSA-N 0.000 description 1
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- ZAPFAWQHBOHWLL-GUBZILKMSA-N Glu-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N ZAPFAWQHBOHWLL-GUBZILKMSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- BPCLDCNZBUYGOD-BPUTZDHNSA-N Glu-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 BPCLDCNZBUYGOD-BPUTZDHNSA-N 0.000 description 1
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 1
- HGJREIGJLUQBTJ-SZMVWBNQSA-N Glu-Trp-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O HGJREIGJLUQBTJ-SZMVWBNQSA-N 0.000 description 1
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 1
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 239000005561 Glufosinate Substances 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 1
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 1
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 1
- BULIVUZUDBHKKZ-WDSKDSINSA-N Gly-Gln-Asn Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BULIVUZUDBHKKZ-WDSKDSINSA-N 0.000 description 1
- JMQFHZWESBGPFC-WDSKDSINSA-N Gly-Gln-Asp Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JMQFHZWESBGPFC-WDSKDSINSA-N 0.000 description 1
- XLFHCWHXKSFVIB-BQBZGAKWSA-N Gly-Gln-Gln Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLFHCWHXKSFVIB-BQBZGAKWSA-N 0.000 description 1
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 1
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 1
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 1
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 1
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 1
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- HFXJIZNEXNIZIJ-BQBZGAKWSA-N Gly-Glu-Gln Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFXJIZNEXNIZIJ-BQBZGAKWSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- JUBDONGMHASUCN-IUCAKERBSA-N Gly-Glu-His Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O JUBDONGMHASUCN-IUCAKERBSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- JNGJGFMFXREJNF-KBPBESRZSA-N Gly-Glu-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JNGJGFMFXREJNF-KBPBESRZSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 1
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 1
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 1
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- ZOTGXWMKUFSKEU-QXEWZRGKSA-N Gly-Ile-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O ZOTGXWMKUFSKEU-QXEWZRGKSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 1
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 1
- ICUTTWWCDIIIEE-BQBZGAKWSA-N Gly-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN ICUTTWWCDIIIEE-BQBZGAKWSA-N 0.000 description 1
- UWQDKRIZSROAKS-FJXKBIBVSA-N Gly-Met-Thr Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWQDKRIZSROAKS-FJXKBIBVSA-N 0.000 description 1
- MDKCBHZLQJZOCJ-STQMWFEESA-N Gly-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)CN MDKCBHZLQJZOCJ-STQMWFEESA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- NZOAFWHVAFJERA-OALUTQOASA-N Gly-Phe-Trp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NZOAFWHVAFJERA-OALUTQOASA-N 0.000 description 1
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
- RZEDHGORCKRINR-STQMWFEESA-N Gly-Trp-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN RZEDHGORCKRINR-STQMWFEESA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- 108700037728 Glycine max beta-conglycinin Proteins 0.000 description 1
- 239000005562 Glyphosate Substances 0.000 description 1
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 1
- 241000190714 Gymnosporangium clavipes Species 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- 241001465746 Heterorhabditidae Species 0.000 description 1
- 241000500097 Heterorhabditis indica Species 0.000 description 1
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 1
- FLUVGKKRRMLNPU-CQDKDKBSSA-N His-Ala-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FLUVGKKRRMLNPU-CQDKDKBSSA-N 0.000 description 1
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 1
- CJGDTAHEMXLRMB-ULQDDVLXSA-N His-Arg-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CJGDTAHEMXLRMB-ULQDDVLXSA-N 0.000 description 1
- MWWOPNQSBXEUHO-ULQDDVLXSA-N His-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 MWWOPNQSBXEUHO-ULQDDVLXSA-N 0.000 description 1
- JWTKVPMQCCRPQY-SRVKXCTJSA-N His-Asn-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JWTKVPMQCCRPQY-SRVKXCTJSA-N 0.000 description 1
- LYSVCKOXIDKEEL-SRVKXCTJSA-N His-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LYSVCKOXIDKEEL-SRVKXCTJSA-N 0.000 description 1
- VOEGKUNRHYKYSU-XVYDVKMFSA-N His-Asp-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O VOEGKUNRHYKYSU-XVYDVKMFSA-N 0.000 description 1
- NJZGEXYLSFGPHG-GUBZILKMSA-N His-Gln-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N NJZGEXYLSFGPHG-GUBZILKMSA-N 0.000 description 1
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 1
- JJHWJUYYTWYXPL-PYJNHQTQSA-N His-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CN=CN1 JJHWJUYYTWYXPL-PYJNHQTQSA-N 0.000 description 1
- MFQVZYSPCIZFMR-MGHWNKPDSA-N His-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N MFQVZYSPCIZFMR-MGHWNKPDSA-N 0.000 description 1
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- KQJBFMJFUXAYPK-AVGNSLFASA-N His-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N KQJBFMJFUXAYPK-AVGNSLFASA-N 0.000 description 1
- XMAUFHMAAVTODF-STQMWFEESA-N His-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XMAUFHMAAVTODF-STQMWFEESA-N 0.000 description 1
- YXXKBPJEIYFGOD-MGHWNKPDSA-N His-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N YXXKBPJEIYFGOD-MGHWNKPDSA-N 0.000 description 1
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 1
- OWYIDJCNRWRSJY-QTKMDUPCSA-N His-Pro-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O OWYIDJCNRWRSJY-QTKMDUPCSA-N 0.000 description 1
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 1
- FWWJVUFXUQOEDM-WDSOQIARSA-N His-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N FWWJVUFXUQOEDM-WDSOQIARSA-N 0.000 description 1
- DLTCGJZBNFOWFL-LKTVYLICSA-N His-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N DLTCGJZBNFOWFL-LKTVYLICSA-N 0.000 description 1
- YKUAGFAXQRYUQW-KKUMJFAQSA-N His-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O YKUAGFAXQRYUQW-KKUMJFAQSA-N 0.000 description 1
- 206010021143 Hypoxia Diseases 0.000 description 1
- YPWHUFAAMNHMGS-QSFUFRPTSA-N Ile-Ala-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YPWHUFAAMNHMGS-QSFUFRPTSA-N 0.000 description 1
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 1
- ASCFJMSGKUIRDU-ZPFDUUQYSA-N Ile-Arg-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O ASCFJMSGKUIRDU-ZPFDUUQYSA-N 0.000 description 1
- LVQDUPQUJZWKSU-PYJNHQTQSA-N Ile-Arg-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LVQDUPQUJZWKSU-PYJNHQTQSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- CWJQMCPYXNVMBS-STECZYCISA-N Ile-Arg-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N CWJQMCPYXNVMBS-STECZYCISA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 1
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 1
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 1
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 1
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 1
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 1
- LDRALPZEVHVXEK-KBIXCLLPSA-N Ile-Cys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N LDRALPZEVHVXEK-KBIXCLLPSA-N 0.000 description 1
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 1
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 1
- WNQKUUQIVDDAFA-ZPFDUUQYSA-N Ile-Gln-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N WNQKUUQIVDDAFA-ZPFDUUQYSA-N 0.000 description 1
- BALLIXFZYSECCF-QEWYBTABSA-N Ile-Gln-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N BALLIXFZYSECCF-QEWYBTABSA-N 0.000 description 1
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- JXMSHKFPDIUYGS-SIUGBPQLSA-N Ile-Glu-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N JXMSHKFPDIUYGS-SIUGBPQLSA-N 0.000 description 1
- OEQKGSPBDVKYOC-ZKWXMUAHSA-N Ile-Gly-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OEQKGSPBDVKYOC-ZKWXMUAHSA-N 0.000 description 1
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 1
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- DBXXASNNDTXOLU-MXAVVETBSA-N Ile-Leu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DBXXASNNDTXOLU-MXAVVETBSA-N 0.000 description 1
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 1
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 1
- FFJQAEYLAQMGDL-MGHWNKPDSA-N Ile-Lys-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FFJQAEYLAQMGDL-MGHWNKPDSA-N 0.000 description 1
- WSSGUVAKYCQSCT-XUXIUFHCSA-N Ile-Met-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)O)N WSSGUVAKYCQSCT-XUXIUFHCSA-N 0.000 description 1
- BKPPWVSPSIUXHZ-OSUNSFLBSA-N Ile-Met-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N BKPPWVSPSIUXHZ-OSUNSFLBSA-N 0.000 description 1
- VZSDQFZFTCVEGF-ZEWNOJEFSA-N Ile-Phe-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O VZSDQFZFTCVEGF-ZEWNOJEFSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 1
- JDCQDJVYUXNCGF-SPOWBLRKSA-N Ile-Ser-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JDCQDJVYUXNCGF-SPOWBLRKSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 1
- JTBFQNHKNRZJDS-SYWGBEHUSA-N Ile-Trp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C)C(=O)O)N JTBFQNHKNRZJDS-SYWGBEHUSA-N 0.000 description 1
- WKSHBPRUIRGWRZ-KCTSRDHCSA-N Ile-Trp-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N WKSHBPRUIRGWRZ-KCTSRDHCSA-N 0.000 description 1
- DTPGSUQHUMELQB-GVARAGBVSA-N Ile-Tyr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 DTPGSUQHUMELQB-GVARAGBVSA-N 0.000 description 1
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- JSLIXOUMAOUGBN-JUKXBJQTSA-N Ile-Tyr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JSLIXOUMAOUGBN-JUKXBJQTSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- NSPNUMNLZNOPAQ-SJWGOKEGSA-N Ile-Tyr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N NSPNUMNLZNOPAQ-SJWGOKEGSA-N 0.000 description 1
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 1
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 1
- SWNRZNLXMXRCJC-VKOGCVSHSA-N Ile-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 SWNRZNLXMXRCJC-VKOGCVSHSA-N 0.000 description 1
- 206010061217 Infestation Diseases 0.000 description 1
- OWYWGLHRNBIFJP-UHFFFAOYSA-N Ipazine Chemical compound CCN(CC)C1=NC(Cl)=NC(NC(C)C)=N1 OWYWGLHRNBIFJP-UHFFFAOYSA-N 0.000 description 1
- 240000007049 Juglans regia Species 0.000 description 1
- 235000009496 Juglans regia Nutrition 0.000 description 1
- 108010025815 Kanamycin Kinase Proteins 0.000 description 1
- 208000007976 Ketosis Diseases 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- FBOZXECLQNJBKD-ZDUSSCGKSA-N L-methotrexate Chemical compound C=1N=C2N=C(N)N=C(N)C2=NC=1CN(C)C1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 FBOZXECLQNJBKD-ZDUSSCGKSA-N 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 1
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 1
- GBDMISNMNXVTNV-XIRDDKMYSA-N Leu-Asp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GBDMISNMNXVTNV-XIRDDKMYSA-N 0.000 description 1
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- TVEOVCYCYGKVPP-HSCHXYMDSA-N Leu-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N TVEOVCYCYGKVPP-HSCHXYMDSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- VVQJGYPTIYOFBR-IHRRRGAJSA-N Leu-Lys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N VVQJGYPTIYOFBR-IHRRRGAJSA-N 0.000 description 1
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 1
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 1
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 1
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 1
- IBSGMIPRBMPMHE-IHRRRGAJSA-N Leu-Met-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O IBSGMIPRBMPMHE-IHRRRGAJSA-N 0.000 description 1
- MJTOYIHCKVQICL-ULQDDVLXSA-N Leu-Met-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MJTOYIHCKVQICL-ULQDDVLXSA-N 0.000 description 1
- LQUIENKUVKPNIC-ULQDDVLXSA-N Leu-Met-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LQUIENKUVKPNIC-ULQDDVLXSA-N 0.000 description 1
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 1
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 1
- VTJUNIYRYIAIHF-IUCAKERBSA-N Leu-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O VTJUNIYRYIAIHF-IUCAKERBSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 1
- LXGSOEPHQJONMG-PMVMPFDFSA-N Leu-Trp-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N LXGSOEPHQJONMG-PMVMPFDFSA-N 0.000 description 1
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 1
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 1
- MSFITIBEMPWCBD-ULQDDVLXSA-N Leu-Val-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MSFITIBEMPWCBD-ULQDDVLXSA-N 0.000 description 1
- 241000192132 Leuconostoc Species 0.000 description 1
- 241000209510 Liliopsida Species 0.000 description 1
- 108010071324 Livagen Proteins 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000006142 Luria-Bertani Agar Substances 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 1
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 1
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 1
- PXHCFKXNSBJSTQ-KKUMJFAQSA-N Lys-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)O PXHCFKXNSBJSTQ-KKUMJFAQSA-N 0.000 description 1
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 1
- HIIZIQUUHIXUJY-GUBZILKMSA-N Lys-Asp-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HIIZIQUUHIXUJY-GUBZILKMSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 1
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 1
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 1
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 1
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 1
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- BEGQVWUZFXLNHZ-IHPCNDPISA-N Lys-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 BEGQVWUZFXLNHZ-IHPCNDPISA-N 0.000 description 1
- DAHQKYYIXPBESV-UWVGGRQHSA-N Lys-Met-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O DAHQKYYIXPBESV-UWVGGRQHSA-N 0.000 description 1
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 1
- UDXSLGLHFUBRRM-OEAJRASXSA-N Lys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCCN)N)O UDXSLGLHFUBRRM-OEAJRASXSA-N 0.000 description 1
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 1
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 1
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 1
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 1
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- WAAZECNCPVGPIV-RHYQMDGZSA-N Lys-Thr-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O WAAZECNCPVGPIV-RHYQMDGZSA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 1
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 description 1
- XBAJINCXDBTJRH-WDSOQIARSA-N Lys-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N XBAJINCXDBTJRH-WDSOQIARSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 101000819644 Lysinibacillus sphaericus UPF0309 protein in nagA 3'region Proteins 0.000 description 1
- 101710125418 Major capsid protein Proteins 0.000 description 1
- KUQWVNFMZLHAPA-CIUDSAMLSA-N Met-Ala-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O KUQWVNFMZLHAPA-CIUDSAMLSA-N 0.000 description 1
- BVXXDMUMHMXFER-BPNCWPANSA-N Met-Ala-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVXXDMUMHMXFER-BPNCWPANSA-N 0.000 description 1
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 1
- ZEDVFJPQNNBMST-CYDGBPFRSA-N Met-Arg-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZEDVFJPQNNBMST-CYDGBPFRSA-N 0.000 description 1
- IVCPHARVJUYDPA-FXQIFTODSA-N Met-Asn-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IVCPHARVJUYDPA-FXQIFTODSA-N 0.000 description 1
- QXEVZBXTDTVPCP-GMOBBJLQSA-N Met-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCSC)N QXEVZBXTDTVPCP-GMOBBJLQSA-N 0.000 description 1
- OHMKUHXCDSCOMT-QXEWZRGKSA-N Met-Asn-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHMKUHXCDSCOMT-QXEWZRGKSA-N 0.000 description 1
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 1
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 1
- MYKLINMAGAIRPJ-CIUDSAMLSA-N Met-Gln-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MYKLINMAGAIRPJ-CIUDSAMLSA-N 0.000 description 1
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 1
- RNAGAJXCSPDPRK-KKUMJFAQSA-N Met-Glu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 RNAGAJXCSPDPRK-KKUMJFAQSA-N 0.000 description 1
- DGNZGCQSVGGYJS-BQBZGAKWSA-N Met-Gly-Asp Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O DGNZGCQSVGGYJS-BQBZGAKWSA-N 0.000 description 1
- YCUSPBPZVJDMII-YUMQZZPRSA-N Met-Gly-Glu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O YCUSPBPZVJDMII-YUMQZZPRSA-N 0.000 description 1
- BMHIFARYXOJDLD-WPRPVWTQSA-N Met-Gly-Val Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O BMHIFARYXOJDLD-WPRPVWTQSA-N 0.000 description 1
- JHDNAOVJJQSMMM-GMOBBJLQSA-N Met-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N JHDNAOVJJQSMMM-GMOBBJLQSA-N 0.000 description 1
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 1
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 1
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 1
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 1
- MSSJHBAKDDIRMJ-SRVKXCTJSA-N Met-Lys-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MSSJHBAKDDIRMJ-SRVKXCTJSA-N 0.000 description 1
- AXHNAGAYRGCDLG-UWVGGRQHSA-N Met-Lys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AXHNAGAYRGCDLG-UWVGGRQHSA-N 0.000 description 1
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 1
- ILKCLLLOGPDNIP-RCWTZXSCSA-N Met-Met-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ILKCLLLOGPDNIP-RCWTZXSCSA-N 0.000 description 1
- ZACMJPCWVSLCNS-JYJNAYRXSA-N Met-Phe-Met Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCSC)C(O)=O)CC1=CC=CC=C1 ZACMJPCWVSLCNS-JYJNAYRXSA-N 0.000 description 1
- QEDGNYFHLXXIDC-DCAQKATOSA-N Met-Pro-Gln Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O QEDGNYFHLXXIDC-DCAQKATOSA-N 0.000 description 1
- QLESZRANMSYLCZ-CYDGBPFRSA-N Met-Pro-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QLESZRANMSYLCZ-CYDGBPFRSA-N 0.000 description 1
- YLDSJJOGQNEQJK-AVGNSLFASA-N Met-Pro-Leu Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YLDSJJOGQNEQJK-AVGNSLFASA-N 0.000 description 1
- XIGAHPDZLAYQOS-SRVKXCTJSA-N Met-Pro-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 XIGAHPDZLAYQOS-SRVKXCTJSA-N 0.000 description 1
- SOAYQFDWEIWPPR-IHRRRGAJSA-N Met-Ser-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SOAYQFDWEIWPPR-IHRRRGAJSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- RMLWDZINJUDMEB-IHRRRGAJSA-N Met-Tyr-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RMLWDZINJUDMEB-IHRRRGAJSA-N 0.000 description 1
- LIIXIZKVWNYQHB-STECZYCISA-N Met-Tyr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LIIXIZKVWNYQHB-STECZYCISA-N 0.000 description 1
- ANCPZNHGZUCSSC-ULQDDVLXSA-N Met-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=C(O)C=C1 ANCPZNHGZUCSSC-ULQDDVLXSA-N 0.000 description 1
- JACMWNXOOUYXCD-JYJNAYRXSA-N Met-Val-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JACMWNXOOUYXCD-JYJNAYRXSA-N 0.000 description 1
- PVSPJQWHEIQTEH-JYJNAYRXSA-N Met-Val-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PVSPJQWHEIQTEH-JYJNAYRXSA-N 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- YFYIWIZSIVZILB-UHFFFAOYSA-N N.[P] Chemical compound N.[P] YFYIWIZSIVZILB-UHFFFAOYSA-N 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- 101710141454 Nucleoprotein Proteins 0.000 description 1
- 101150073872 ORF3 gene Proteins 0.000 description 1
- RCEAADKTGXTDOA-UHFFFAOYSA-N OS(O)(=O)=O.CCCCCCCCCCCC[Na] Chemical compound OS(O)(=O)=O.CCCCCCCCCCCC[Na] RCEAADKTGXTDOA-UHFFFAOYSA-N 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 241001310335 Paenibacillus lentimorbus Species 0.000 description 1
- 241001310339 Paenibacillus popilliae Species 0.000 description 1
- 108020002230 Pancreatic Ribonuclease Proteins 0.000 description 1
- 102000005891 Pancreatic ribonuclease Human genes 0.000 description 1
- 239000001888 Peptone Substances 0.000 description 1
- 108010080698 Peptones Proteins 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 1
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 1
- LBSARGIQACMGDF-WBAXXEDZSA-N Phe-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 LBSARGIQACMGDF-WBAXXEDZSA-N 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 1
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 1
- HTTYNOXBBOWZTB-SRVKXCTJSA-N Phe-Asn-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HTTYNOXBBOWZTB-SRVKXCTJSA-N 0.000 description 1
- VUYCNYVLKACHPA-KKUMJFAQSA-N Phe-Asp-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VUYCNYVLKACHPA-KKUMJFAQSA-N 0.000 description 1
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 1
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 1
- GDBOREPXIRKSEQ-FHWLQOOXSA-N Phe-Gln-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GDBOREPXIRKSEQ-FHWLQOOXSA-N 0.000 description 1
- CDQCFGOQNYOICK-IHRRRGAJSA-N Phe-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDQCFGOQNYOICK-IHRRRGAJSA-N 0.000 description 1
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 1
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 1
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 1
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 1
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 1
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 1
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 1
- TXKWKTWYTIAZSV-KKUMJFAQSA-N Phe-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N TXKWKTWYTIAZSV-KKUMJFAQSA-N 0.000 description 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 1
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 1
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 1
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 1
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 1
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 1
- YTGGLKWSVIRECD-JBACZVJFSA-N Phe-Trp-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 YTGGLKWSVIRECD-JBACZVJFSA-N 0.000 description 1
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 1
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 1
- KIQUCMUULDXTAZ-HJOGWXRNSA-N Phe-Tyr-Tyr Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O KIQUCMUULDXTAZ-HJOGWXRNSA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- 101100226891 Phomopsis amygdali PaP450-1 gene Proteins 0.000 description 1
- 101100226893 Phomopsis amygdali PaP450-2 gene Proteins 0.000 description 1
- 241000254064 Photinus pyralis Species 0.000 description 1
- 241001123094 Photorhabdus asymbiotica Species 0.000 description 1
- 241001148064 Photorhabdus luminescens Species 0.000 description 1
- 241000694891 Photorhabdus luminescens subsp. akhurstii Species 0.000 description 1
- 241001246813 Photorhabdus luminescens subsp. laumondii Species 0.000 description 1
- 241000694892 Photorhabdus luminescens subsp. luminescens Species 0.000 description 1
- 241001216646 Photorhabdus temperata Species 0.000 description 1
- 241000694906 Photorhabdus temperata subsp. temperata Species 0.000 description 1
- 241000500437 Plutella xylostella Species 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 1
- HPXVFFIIGOAQRV-DCAQKATOSA-N Pro-Arg-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O HPXVFFIIGOAQRV-DCAQKATOSA-N 0.000 description 1
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 1
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 1
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 1
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 1
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 1
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- JRQCDSNPRNGWRG-AVGNSLFASA-N Pro-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2 JRQCDSNPRNGWRG-AVGNSLFASA-N 0.000 description 1
- SOACYAXADBWDDT-CYDGBPFRSA-N Pro-Ile-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SOACYAXADBWDDT-CYDGBPFRSA-N 0.000 description 1
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 1
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 1
- CFVRJNZJQHDQPP-CYDGBPFRSA-N Pro-Ile-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 CFVRJNZJQHDQPP-CYDGBPFRSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 1
- WCNVGGZRTNHOOS-ULQDDVLXSA-N Pro-Lys-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O WCNVGGZRTNHOOS-ULQDDVLXSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 1
- QCMYJBKTMIWZAP-AVGNSLFASA-N Pro-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 QCMYJBKTMIWZAP-AVGNSLFASA-N 0.000 description 1
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 1
- YYARMJSFDLIDFS-FKBYEOEOSA-N Pro-Phe-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O YYARMJSFDLIDFS-FKBYEOEOSA-N 0.000 description 1
- HOTVCUAVDQHUDB-UFYCRDLUSA-N Pro-Phe-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 HOTVCUAVDQHUDB-UFYCRDLUSA-N 0.000 description 1
- SVXXJYJCRNKDDE-AVGNSLFASA-N Pro-Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CN=CN1 SVXXJYJCRNKDDE-AVGNSLFASA-N 0.000 description 1
- QAAYIXYLEMRULP-SRVKXCTJSA-N Pro-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 QAAYIXYLEMRULP-SRVKXCTJSA-N 0.000 description 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 1
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 1
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- YIPFBJGBRCJJJD-FHWLQOOXSA-N Pro-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 YIPFBJGBRCJJJD-FHWLQOOXSA-N 0.000 description 1
- LEBTWGWVUVJNTA-FKBYEOEOSA-N Pro-Trp-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC4=CC=CC=C4)C(=O)O LEBTWGWVUVJNTA-FKBYEOEOSA-N 0.000 description 1
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 1
- QDDJNKWPTJHROJ-UFYCRDLUSA-N Pro-Tyr-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 QDDJNKWPTJHROJ-UFYCRDLUSA-N 0.000 description 1
- WWXNZNWZNZPDIF-SRVKXCTJSA-N Pro-Val-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 WWXNZNWZNZPDIF-SRVKXCTJSA-N 0.000 description 1
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- 101710083689 Probable capsid protein Proteins 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 229940124158 Protease/peptidase inhibitor Drugs 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 108091034057 RNA (poly(A)) Proteins 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- 101000702488 Rattus norvegicus High affinity cationic amino acid transporter 1 Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 241000223253 Rhodotorula glutinis Species 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- 241000235342 Saccharomycetes Species 0.000 description 1
- 241001479507 Senecio odorus Species 0.000 description 1
- 229920002684 Sepharose Polymers 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- JJKSSJVYOVRJMZ-FXQIFTODSA-N Ser-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)CN=C(N)N JJKSSJVYOVRJMZ-FXQIFTODSA-N 0.000 description 1
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 1
- HZWAHWQZPSXNCB-BPUTZDHNSA-N Ser-Arg-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HZWAHWQZPSXNCB-BPUTZDHNSA-N 0.000 description 1
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 1
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 1
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 1
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- KMWFXJCGRXBQAC-CIUDSAMLSA-N Ser-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N KMWFXJCGRXBQAC-CIUDSAMLSA-N 0.000 description 1
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- BRIZMMZEYSAKJX-QEJZJMRPSA-N Ser-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N BRIZMMZEYSAKJX-QEJZJMRPSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 1
- MOQDPPUMFSMYOM-KKUMJFAQSA-N Ser-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N MOQDPPUMFSMYOM-KKUMJFAQSA-N 0.000 description 1
- ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- CJINPXGSKSZQNE-KBIXCLLPSA-N Ser-Ile-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O CJINPXGSKSZQNE-KBIXCLLPSA-N 0.000 description 1
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- QPPYAWVLAVXISR-DCAQKATOSA-N Ser-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QPPYAWVLAVXISR-DCAQKATOSA-N 0.000 description 1
- OVQZAFXWIWNYKA-GUBZILKMSA-N Ser-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CO)N OVQZAFXWIWNYKA-GUBZILKMSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 1
- AABIBDJHSKIMJK-FXQIFTODSA-N Ser-Ser-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O AABIBDJHSKIMJK-FXQIFTODSA-N 0.000 description 1
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- UYLKOSODXYSWMQ-XGEHTFHBSA-N Ser-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N)O UYLKOSODXYSWMQ-XGEHTFHBSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 1
- FRPNVPKQVFHSQY-BPUTZDHNSA-N Ser-Trp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FRPNVPKQVFHSQY-BPUTZDHNSA-N 0.000 description 1
- NERYDXBVARJIQS-JYBASQMISA-N Ser-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N)O NERYDXBVARJIQS-JYBASQMISA-N 0.000 description 1
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 1
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 1
- HXPNJVLVHKABMJ-KKUMJFAQSA-N Ser-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N)O HXPNJVLVHKABMJ-KKUMJFAQSA-N 0.000 description 1
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000147799 Serratia entomophila Species 0.000 description 1
- 102000039471 Small Nuclear RNA Human genes 0.000 description 1
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 1
- 241000256247 Spodoptera exigua Species 0.000 description 1
- 241000123675 Sporobolomyces roseus Species 0.000 description 1
- 241001480223 Steinernema carpocapsae Species 0.000 description 1
- 241001465745 Steinernematidae Species 0.000 description 1
- 101000901034 Streptococcus gordonii Accessory Sec system protein Asp2 Proteins 0.000 description 1
- 101000870438 Streptococcus gordonii UDP-N-acetylglucosamine-peptide N-acetylglucosaminyltransferase stabilizing protein GtfB Proteins 0.000 description 1
- 101100038645 Streptomyces griseus rppA gene Proteins 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- 229940100389 Sulfonylurea Drugs 0.000 description 1
- 239000012505 Superdex™ Substances 0.000 description 1
- 108700026226 TATA Box Proteins 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 1
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- LHUBVKCLOVALIA-HJGDQZAQSA-N Thr-Arg-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LHUBVKCLOVALIA-HJGDQZAQSA-N 0.000 description 1
- PKXHGEXFMIZSER-QTKMDUPCSA-N Thr-Arg-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PKXHGEXFMIZSER-QTKMDUPCSA-N 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- UTSWGQNAQRIHAI-UNQGMJICSA-N Thr-Arg-Phe Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UTSWGQNAQRIHAI-UNQGMJICSA-N 0.000 description 1
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 1
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 1
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 1
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 1
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 1
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 1
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- GUZGCDIZVGODML-NKIYYHGXSA-N Thr-Gln-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O GUZGCDIZVGODML-NKIYYHGXSA-N 0.000 description 1
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 1
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 1
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 1
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 1
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 1
- UYTYTDMCDBPDSC-URLPEUOOSA-N Thr-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N UYTYTDMCDBPDSC-URLPEUOOSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 1
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 1
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 1
- BDYBHQWMHYDRKJ-UNQGMJICSA-N Thr-Phe-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O)N)O BDYBHQWMHYDRKJ-UNQGMJICSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 1
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 1
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- GQPQJNMVELPZNQ-GBALPHGKSA-N Thr-Ser-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O GQPQJNMVELPZNQ-GBALPHGKSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 1
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 1
- IJKNKFJZOJCKRR-GBALPHGKSA-N Thr-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 IJKNKFJZOJCKRR-GBALPHGKSA-N 0.000 description 1
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 244000288561 Torulaspora delbrueckii Species 0.000 description 1
- 235000014681 Torulaspora delbrueckii Nutrition 0.000 description 1
- 241001495125 Torulaspora pretoriensis Species 0.000 description 1
- 102000008579 Transposases Human genes 0.000 description 1
- 108010020764 Transposases Proteins 0.000 description 1
- DTQVDTLACAAQTR-UHFFFAOYSA-N Trifluoroacetic acid Chemical class OC(=O)C(F)(F)F DTQVDTLACAAQTR-UHFFFAOYSA-N 0.000 description 1
- 239000007984 Tris EDTA buffer Substances 0.000 description 1
- 239000013504 Triton X-100 Substances 0.000 description 1
- 229920004890 Triton X-100 Polymers 0.000 description 1
- CXUFDWZBHKUGKK-CABZTGNLSA-N Trp-Ala-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O)=CNC2=C1 CXUFDWZBHKUGKK-CABZTGNLSA-N 0.000 description 1
- SCQBNMKLZVCXNX-ZFWWWQNUSA-N Trp-Arg-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N SCQBNMKLZVCXNX-ZFWWWQNUSA-N 0.000 description 1
- PXYJUECTGMGIDT-WDSOQIARSA-N Trp-Arg-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 PXYJUECTGMGIDT-WDSOQIARSA-N 0.000 description 1
- HQVKQINPFOCIIV-BVSLBCMMSA-N Trp-Arg-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 HQVKQINPFOCIIV-BVSLBCMMSA-N 0.000 description 1
- RNFZZCMCRDFNAE-WFBYXXMGSA-N Trp-Asn-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O RNFZZCMCRDFNAE-WFBYXXMGSA-N 0.000 description 1
- GUWJWCHZNGDKBG-UBHSHLNASA-N Trp-Asn-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N GUWJWCHZNGDKBG-UBHSHLNASA-N 0.000 description 1
- UKINEYBQXPMOJO-UBHSHLNASA-N Trp-Asn-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N UKINEYBQXPMOJO-UBHSHLNASA-N 0.000 description 1
- BXKWZPXTTSCOMX-AQZXSJQPSA-N Trp-Asn-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXKWZPXTTSCOMX-AQZXSJQPSA-N 0.000 description 1
- NKUIXQOJUAEIET-AQZXSJQPSA-N Trp-Asp-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@H](O)C)C(O)=O)=CNC2=C1 NKUIXQOJUAEIET-AQZXSJQPSA-N 0.000 description 1
- XZLHHHYSWIYXHD-XIRDDKMYSA-N Trp-Gln-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XZLHHHYSWIYXHD-XIRDDKMYSA-N 0.000 description 1
- VTHNLRXALGUDBS-BPUTZDHNSA-N Trp-Gln-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VTHNLRXALGUDBS-BPUTZDHNSA-N 0.000 description 1
- DXHHCIYKHRKBOC-BHYGNILZSA-N Trp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O DXHHCIYKHRKBOC-BHYGNILZSA-N 0.000 description 1
- AWYXDHQQFPZJNE-QEJZJMRPSA-N Trp-Gln-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N AWYXDHQQFPZJNE-QEJZJMRPSA-N 0.000 description 1
- GWQUSADRQCTMHN-NWLDYVSISA-N Trp-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O GWQUSADRQCTMHN-NWLDYVSISA-N 0.000 description 1
- ZJKZLNAECPIUTL-JBACZVJFSA-N Trp-Gln-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 ZJKZLNAECPIUTL-JBACZVJFSA-N 0.000 description 1
- WCTYCXZYBNKEIV-SXNHZJKMSA-N Trp-Glu-Ile Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)=CNC2=C1 WCTYCXZYBNKEIV-SXNHZJKMSA-N 0.000 description 1
- YXONONCLMLHWJX-SZMVWBNQSA-N Trp-Glu-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 YXONONCLMLHWJX-SZMVWBNQSA-N 0.000 description 1
- FEZASNVQLJQBHW-CABZTGNLSA-N Trp-Gly-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O)=CNC2=C1 FEZASNVQLJQBHW-CABZTGNLSA-N 0.000 description 1
- CXPJPTFWKXNDKV-NUTKFTJISA-N Trp-Leu-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CXPJPTFWKXNDKV-NUTKFTJISA-N 0.000 description 1
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 1
- YVXIAOOYAKBAAI-SZMVWBNQSA-N Trp-Leu-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 YVXIAOOYAKBAAI-SZMVWBNQSA-N 0.000 description 1
- GWBWCGITOYODER-YTQUADARSA-N Trp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GWBWCGITOYODER-YTQUADARSA-N 0.000 description 1
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 1
- OSYOKZZRVGUDMO-HSCHXYMDSA-N Trp-Lys-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OSYOKZZRVGUDMO-HSCHXYMDSA-N 0.000 description 1
- WKQNLTQSCYXKQK-VFAJRCTISA-N Trp-Lys-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WKQNLTQSCYXKQK-VFAJRCTISA-N 0.000 description 1
- KOVPHHXMHLFWPL-BPUTZDHNSA-N Trp-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CC(=O)N)C(=O)O KOVPHHXMHLFWPL-BPUTZDHNSA-N 0.000 description 1
- UJGDFQRPYGJBEH-AAEUAGOBSA-N Trp-Ser-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N UJGDFQRPYGJBEH-AAEUAGOBSA-N 0.000 description 1
- PKZIWSHDJYIPRH-JBACZVJFSA-N Trp-Tyr-Gln Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKZIWSHDJYIPRH-JBACZVJFSA-N 0.000 description 1
- ZPZNQAZHMCLTOA-PXDAIIFMSA-N Trp-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 ZPZNQAZHMCLTOA-PXDAIIFMSA-N 0.000 description 1
- NMOIRIIIUVELLY-WDSOQIARSA-N Trp-Val-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)C(C)C)=CNC2=C1 NMOIRIIIUVELLY-WDSOQIARSA-N 0.000 description 1
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 1
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 1
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 1
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 1
- HSVPZJLMPLMPOX-BPNCWPANSA-N Tyr-Arg-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O HSVPZJLMPLMPOX-BPNCWPANSA-N 0.000 description 1
- AKXBNSZMYAOGLS-STQMWFEESA-N Tyr-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AKXBNSZMYAOGLS-STQMWFEESA-N 0.000 description 1
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 1
- CRWOSTCODDFEKZ-HRCADAONSA-N Tyr-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CRWOSTCODDFEKZ-HRCADAONSA-N 0.000 description 1
- IIJWXEUNETVJPV-IHRRRGAJSA-N Tyr-Arg-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N)O IIJWXEUNETVJPV-IHRRRGAJSA-N 0.000 description 1
- SGFIXFAHVWJKTD-KJEVXHAQSA-N Tyr-Arg-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SGFIXFAHVWJKTD-KJEVXHAQSA-N 0.000 description 1
- DYEGCOJHFNJBKB-UFYCRDLUSA-N Tyr-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 DYEGCOJHFNJBKB-UFYCRDLUSA-N 0.000 description 1
- OEVJGIHPQOXYFE-SRVKXCTJSA-N Tyr-Asn-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OEVJGIHPQOXYFE-SRVKXCTJSA-N 0.000 description 1
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- BARBHMSSVWPKPZ-IHRRRGAJSA-N Tyr-Asp-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BARBHMSSVWPKPZ-IHRRRGAJSA-N 0.000 description 1
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 1
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 1
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 1
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 1
- WPVGRKLNHJJCEN-BZSNNMDCSA-N Tyr-Asp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WPVGRKLNHJJCEN-BZSNNMDCSA-N 0.000 description 1
- NRFTYDWKWGJLAR-MELADBBJSA-N Tyr-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O NRFTYDWKWGJLAR-MELADBBJSA-N 0.000 description 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 1
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 1
- FGJWNBBFAUHBEP-IHPCNDPISA-N Tyr-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N FGJWNBBFAUHBEP-IHPCNDPISA-N 0.000 description 1
- GHUNBABNQPIETG-MELADBBJSA-N Tyr-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O GHUNBABNQPIETG-MELADBBJSA-N 0.000 description 1
- ARPONUQDNWLXOZ-KKUMJFAQSA-N Tyr-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ARPONUQDNWLXOZ-KKUMJFAQSA-N 0.000 description 1
- CRHFOYCJGVJPLE-AVGNSLFASA-N Tyr-Gln-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CRHFOYCJGVJPLE-AVGNSLFASA-N 0.000 description 1
- IYHNBRUWVBIVJR-IHRRRGAJSA-N Tyr-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IYHNBRUWVBIVJR-IHRRRGAJSA-N 0.000 description 1
- RYSNTWVRSLCAJZ-RYUDHWBXSA-N Tyr-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RYSNTWVRSLCAJZ-RYUDHWBXSA-N 0.000 description 1
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 1
- PDKILSUYSUGCAO-JBACZVJFSA-N Tyr-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC3=CC=C(C=C3)O)N PDKILSUYSUGCAO-JBACZVJFSA-N 0.000 description 1
- FJKXUIJOMUWCDD-FHWLQOOXSA-N Tyr-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N)O FJKXUIJOMUWCDD-FHWLQOOXSA-N 0.000 description 1
- WAPFQMXRSDEGOE-IHRRRGAJSA-N Tyr-Glu-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O WAPFQMXRSDEGOE-IHRRRGAJSA-N 0.000 description 1
- OSMTVLSRTQDWHJ-JBACZVJFSA-N Tyr-Glu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 OSMTVLSRTQDWHJ-JBACZVJFSA-N 0.000 description 1
- FIRUOPRJKCBLST-KKUMJFAQSA-N Tyr-His-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O FIRUOPRJKCBLST-KKUMJFAQSA-N 0.000 description 1
- SFSZDJHNAICYSD-PMVMPFDFSA-N Tyr-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CC4=CC=C(C=C4)O)N SFSZDJHNAICYSD-PMVMPFDFSA-N 0.000 description 1
- OSXNCKRGMSHWSQ-ACRUOGEOSA-N Tyr-His-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSXNCKRGMSHWSQ-ACRUOGEOSA-N 0.000 description 1
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 1
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 1
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 1
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 1
- FDKDGFGTHGJKNV-FHWLQOOXSA-N Tyr-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FDKDGFGTHGJKNV-FHWLQOOXSA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 1
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 1
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 description 1
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 1
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 1
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 1
- XYBNMHRFAUKPAW-IHRRRGAJSA-N Tyr-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XYBNMHRFAUKPAW-IHRRRGAJSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- MDXLPNRXCFOBTL-BZSNNMDCSA-N Tyr-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MDXLPNRXCFOBTL-BZSNNMDCSA-N 0.000 description 1
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 1
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 1
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 1
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 1
- KLQPIEVIKOQRAW-IZPVPAKOSA-N Tyr-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KLQPIEVIKOQRAW-IZPVPAKOSA-N 0.000 description 1
- JHDZONWZTCKTJR-KJEVXHAQSA-N Tyr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JHDZONWZTCKTJR-KJEVXHAQSA-N 0.000 description 1
- HMPMGPISLMLHSI-JBACZVJFSA-N Tyr-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N HMPMGPISLMLHSI-JBACZVJFSA-N 0.000 description 1
- BXJQKVDPRMLGKN-PMVMPFDFSA-N Tyr-Trp-Leu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(O)=O)C1=CC=C(O)C=C1 BXJQKVDPRMLGKN-PMVMPFDFSA-N 0.000 description 1
- RZAGEHHVNYESNR-RNXOBYDBSA-N Tyr-Trp-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RZAGEHHVNYESNR-RNXOBYDBSA-N 0.000 description 1
- DJSYPCWZPNHQQE-FHWLQOOXSA-N Tyr-Tyr-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=C(O)C=C1 DJSYPCWZPNHQQE-FHWLQOOXSA-N 0.000 description 1
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 1
- AFWXOGHZEKARFH-ACRUOGEOSA-N Tyr-Tyr-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=C(O)C=C1 AFWXOGHZEKARFH-ACRUOGEOSA-N 0.000 description 1
- KRXFXDCNKLANCP-CXTHYWKRSA-N Tyr-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 KRXFXDCNKLANCP-CXTHYWKRSA-N 0.000 description 1
- FZADUTOCSFDBRV-RNXOBYDBSA-N Tyr-Tyr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 FZADUTOCSFDBRV-RNXOBYDBSA-N 0.000 description 1
- PQPWEALFTLKSEB-DZKIICNBSA-N Tyr-Val-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PQPWEALFTLKSEB-DZKIICNBSA-N 0.000 description 1
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 1
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 1
- LABUITCFCAABSV-UHFFFAOYSA-N Val-Ala-Tyr Natural products CC(C)C(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-UHFFFAOYSA-N 0.000 description 1
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 1
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 1
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 1
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 1
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 1
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- MLADEWAIYAPAAU-IHRRRGAJSA-N Val-Lys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MLADEWAIYAPAAU-IHRRRGAJSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- SVFRYKBZHUGKLP-QXEWZRGKSA-N Val-Met-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVFRYKBZHUGKLP-QXEWZRGKSA-N 0.000 description 1
- OJOMXGVLFKYDKP-QXEWZRGKSA-N Val-Met-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OJOMXGVLFKYDKP-QXEWZRGKSA-N 0.000 description 1
- MGVYZTPLGXPVQB-CYDGBPFRSA-N Val-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MGVYZTPLGXPVQB-CYDGBPFRSA-N 0.000 description 1
- UEPLNXPLHJUYPT-AVGNSLFASA-N Val-Met-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O UEPLNXPLHJUYPT-AVGNSLFASA-N 0.000 description 1
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 1
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 1
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 1
- JPBGMZDTPVGGMQ-ULQDDVLXSA-N Val-Tyr-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JPBGMZDTPVGGMQ-ULQDDVLXSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 1
- YKZVPMUGEJXEOR-JYJNAYRXSA-N Val-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N YKZVPMUGEJXEOR-JYJNAYRXSA-N 0.000 description 1
- 101000645119 Vibrio campbellii (strain ATCC BAA-1116 / BB120) Nucleotide-binding protein VIBHAR_03667 Proteins 0.000 description 1
- 241000123583 Xenorhabdus beddingii Species 0.000 description 1
- 241000123581 Xenorhabdus poinarii Species 0.000 description 1
- 235000007244 Zea mays Nutrition 0.000 description 1
- 229920002494 Zein Polymers 0.000 description 1
- HYJODZUSLXOFNC-UHFFFAOYSA-N [S].[Cl] Chemical compound [S].[Cl] HYJODZUSLXOFNC-UHFFFAOYSA-N 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 239000004480 active ingredient Substances 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 239000000443 aerosol Substances 0.000 description 1
- 238000012271 agricultural production Methods 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010084217 alanyl-glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010056243 alanylalanine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 1
- 229910052782 aluminium Inorganic materials 0.000 description 1
- 229940095054 ammoniac Drugs 0.000 description 1
- ORWYRWWVDCYOMK-HBZPZAIKSA-N angiotensin I Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O)C(C)C)C1=CC=C(O)C=C1 ORWYRWWVDCYOMK-HBZPZAIKSA-N 0.000 description 1
- 230000007953 anoxia Effects 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 244000052616 bacterial pathogen Species 0.000 description 1
- 241000702540 bacterium W14 Species 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- 238000004166 bioassay Methods 0.000 description 1
- 229960001561 bleomycin Drugs 0.000 description 1
- OYVAGSVQBOHSSS-UAPAGMARSA-O bleomycin A2 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC=C(N=1)C=1SC=C(N=1)C(=O)NCCC[S+](C)C)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C OYVAGSVQBOHSSS-UAPAGMARSA-O 0.000 description 1
- 210000000601 blood cell Anatomy 0.000 description 1
- 229940098773 bovine serum albumin Drugs 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000011088 calibration curve Methods 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 230000010307 cell transformation Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 231100000481 chemical toxicant Toxicity 0.000 description 1
- BKHZIBWEHPHYAI-UHFFFAOYSA-N chloroform;3-methylbutan-1-ol Chemical compound ClC(Cl)Cl.CC(C)CCO BKHZIBWEHPHYAI-UHFFFAOYSA-N 0.000 description 1
- YTRQFSDWAXHJCC-UHFFFAOYSA-N chloroform;phenol Chemical compound ClC(Cl)Cl.OC1=CC=CC=C1 YTRQFSDWAXHJCC-UHFFFAOYSA-N 0.000 description 1
- 230000001684 chronic effect Effects 0.000 description 1
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical class OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 1
- 238000005352 clarification Methods 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 210000002777 columnar cell Anatomy 0.000 description 1
- 230000002301 combined effect Effects 0.000 description 1
- 238000010835 comparative analysis Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 239000013068 control sample Substances 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 229940089639 cornsilk Drugs 0.000 description 1
- 210000002251 corpora allata Anatomy 0.000 description 1
- 238000005336 cracking Methods 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 238000002425 crystallisation Methods 0.000 description 1
- 230000008025 crystallization Effects 0.000 description 1
- 230000001351 cycling effect Effects 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 239000003398 denaturant Substances 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 230000030609 dephosphorylation Effects 0.000 description 1
- 238000006209 dephosphorylation reaction Methods 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 230000001066 destructive effect Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 238000007599 discharging Methods 0.000 description 1
- CEYULKASIQJZGP-UHFFFAOYSA-L disodium;2-(carboxymethyl)-2-hydroxybutanedioate Chemical compound [Na+].[Na+].[O-]C(=O)CC(O)(C(=O)O)CC([O-])=O CEYULKASIQJZGP-UHFFFAOYSA-L 0.000 description 1
- 239000002270 dispersing agent Substances 0.000 description 1
- 239000012153 distilled water Substances 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 239000002552 dosage form Substances 0.000 description 1
- 238000010410 dusting Methods 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 238000004043 dyeing Methods 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 239000003480 eluent Substances 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 230000001804 emulsifying effect Effects 0.000 description 1
- 238000005538 encapsulation Methods 0.000 description 1
- 239000002158 endotoxin Substances 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 210000000981 epithelium Anatomy 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 244000037666 field crops Species 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000004088 foaming agent Substances 0.000 description 1
- 239000011888 foil Substances 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 238000007710 freezing Methods 0.000 description 1
- 230000008014 freezing Effects 0.000 description 1
- 235000012055 fruits and vegetables Nutrition 0.000 description 1
- 239000008273 gelatin Substances 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- XDDAORKBJWWYJS-UHFFFAOYSA-N glyphosate Chemical compound OC(=O)CNCP(O)(O)=O XDDAORKBJWWYJS-UHFFFAOYSA-N 0.000 description 1
- 229940097068 glyphosate Drugs 0.000 description 1
- 210000002175 goblet cell Anatomy 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 239000008187 granular material Substances 0.000 description 1
- 208000037824 growth disorder Diseases 0.000 description 1
- PJJJBBJSCAKJQF-UHFFFAOYSA-N guanidinium chloride Chemical class [Cl-].NC(N)=[NH2+] PJJJBBJSCAKJQF-UHFFFAOYSA-N 0.000 description 1
- 230000002363 herbicidal effect Effects 0.000 description 1
- 239000004009 herbicide Substances 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 1
- 239000010903 husk Substances 0.000 description 1
- 230000036571 hydration Effects 0.000 description 1
- 238000006703 hydration reaction Methods 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 238000001114 immunoprecipitation Methods 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 230000001976 improved effect Effects 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 210000003000 inclusion body Anatomy 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 229910052500 inorganic mineral Inorganic materials 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 210000004347 intestinal mucosa Anatomy 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 230000009545 invasion Effects 0.000 description 1
- 238000010829 isocratic elution Methods 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- 230000002147 killing effect Effects 0.000 description 1
- 210000003127 knee Anatomy 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- 230000001418 larval effect Effects 0.000 description 1
- 230000002045 lasting effect Effects 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 108010083942 mannopine synthase Proteins 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000001819 mass spectrum Methods 0.000 description 1
- 238000000816 matrix-assisted laser desorption--ionisation Methods 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010090114 methionyl-tyrosyl-lysine Proteins 0.000 description 1
- 229960000485 methotrexate Drugs 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 238000007479 molecular analysis Methods 0.000 description 1
- 229940057059 monascus purpureus Drugs 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 239000000618 nitrogen fertilizer Substances 0.000 description 1
- 231100001160 nonlethal Toxicity 0.000 description 1
- 108010058731 nopaline synthase Proteins 0.000 description 1
- 230000001473 noxious effect Effects 0.000 description 1
- 238000001821 nucleic acid purification Methods 0.000 description 1
- 230000000474 nursing effect Effects 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 210000004681 ovum Anatomy 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 1
- 235000019319 peptone Nutrition 0.000 description 1
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 229910052615 phyllosilicate Inorganic materials 0.000 description 1
- 239000000049 pigment Substances 0.000 description 1
- 238000003976 plant breeding Methods 0.000 description 1
- 230000008654 plant damage Effects 0.000 description 1
- 239000003375 plant hormone Substances 0.000 description 1
- 230000007096 poisonous effect Effects 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 239000011546 protein dye Substances 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 239000012460 protein solution Substances 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000008521 reorganization Effects 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 230000032537 response to toxin Effects 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 239000012266 salt solution Substances 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 239000012047 saturated solution Substances 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000004062 sedimentation Methods 0.000 description 1
- 230000011218 segmentation Effects 0.000 description 1
- 239000006152 selective media Substances 0.000 description 1
- 238000012772 sequence design Methods 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 208000007056 sickle cell anemia Diseases 0.000 description 1
- 230000037432 silent mutation Effects 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 108091029842 small nuclear ribonucleic acid Proteins 0.000 description 1
- 238000010512 small scale reaction Methods 0.000 description 1
- 239000001632 sodium acetate Substances 0.000 description 1
- 235000017281 sodium acetate Nutrition 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000001488 sodium phosphate Substances 0.000 description 1
- 229910000162 sodium phosphate Inorganic materials 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000011877 solvent mixture Substances 0.000 description 1
- 108010005652 splenotritin Proteins 0.000 description 1
- 239000007921 spray Substances 0.000 description 1
- 239000003381 stabilizer Substances 0.000 description 1
- 229910001220 stainless steel Inorganic materials 0.000 description 1
- 239000010935 stainless steel Substances 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 239000000375 suspending agent Substances 0.000 description 1
- 230000008961 swelling Effects 0.000 description 1
- 230000031068 symbiosis, encompassing mutualism through parasitism Effects 0.000 description 1
- 230000002195 synergetic effect Effects 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 101150078747 tcaA gene Proteins 0.000 description 1
- 101150089436 tcdB gene Proteins 0.000 description 1
- 238000009210 therapy by ultrasound Methods 0.000 description 1
- DPJRMOMPQZCRJU-UHFFFAOYSA-M thiamine hydrochloride Chemical compound Cl.[Cl-].CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N DPJRMOMPQZCRJU-UHFFFAOYSA-M 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 238000004448 titration Methods 0.000 description 1
- 231100000167 toxic agent Toxicity 0.000 description 1
- 238000012549 training Methods 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- YWBFPKPWMSWWEA-UHFFFAOYSA-O triazolopyrimidine Chemical compound BrC1=CC=CC(C=2N=C3N=CN[N+]3=C(NCC=3C=CN=CC=3)C=2)=C1 YWBFPKPWMSWWEA-UHFFFAOYSA-O 0.000 description 1
- RYFMWSXOAZQYPI-UHFFFAOYSA-K trisodium phosphate Chemical compound [Na+].[Na+].[Na+].[O-]P([O-])([O-])=O RYFMWSXOAZQYPI-UHFFFAOYSA-K 0.000 description 1
- 108010045269 tryptophyltryptophan Proteins 0.000 description 1
- 108010029599 tyrosyl-glutamyl-tryptophan Proteins 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 241000701366 unidentified nuclear polyhedrosis viruses Species 0.000 description 1
- 101150112970 up gene Proteins 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 235000013311 vegetables Nutrition 0.000 description 1
- 235000020234 walnut Nutrition 0.000 description 1
- 239000004563 wettable powder Substances 0.000 description 1
- 239000001231 zea mays silk Substances 0.000 description 1
- 229940093612 zein Drugs 0.000 description 1
- 239000005019 zein Substances 0.000 description 1
- AFVLVVWMAFSXCK-UHFFFAOYSA-N α-cyano-4-hydroxycinnamic acid Chemical compound OC(=O)C(C#N)=CC1=CC=C(O)C=C1 AFVLVVWMAFSXCK-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8271—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
- C12N15/8279—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance
- C12N15/8286—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance for insect resistance
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01N—PRESERVATION OF BODIES OF HUMANS OR ANIMALS OR PLANTS OR PARTS THEREOF; BIOCIDES, e.g. AS DISINFECTANTS, AS PESTICIDES OR AS HERBICIDES; PEST REPELLANTS OR ATTRACTANTS; PLANT GROWTH REGULATORS
- A01N63/00—Biocides, pest repellants or attractants, or plant growth regulators containing microorganisms, viruses, microbial fungi, animals or substances produced by, or obtained from, microorganisms, viruses, microbial fungi or animals, e.g. enzymes or fermentates
- A01N63/50—Isolated enzymes; Isolated proteins
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07H—SUGARS; DERIVATIVES THEREOF; NUCLEOSIDES; NUCLEOTIDES; NUCLEIC ACIDS
- C07H21/00—Compounds containing two or more mononucleotide units having separate phosphate or polyphosphate groups linked by saccharide radicals of nucleoside groups, e.g. nucleic acids
- C07H21/04—Compounds containing two or more mononucleotide units having separate phosphate or polyphosphate groups linked by saccharide radicals of nucleoside groups, e.g. nucleic acids with deoxyribosyl as saccharide radical
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/24—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Enterobacteriaceae (F), e.g. Citrobacter, Serratia, Proteus, Providencia, Morganella, Yersinia
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8271—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
- C12N15/8279—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A40/00—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
- Y02A40/10—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
- Y02A40/146—Genetically Modified [GMO] plants, e.g. transgenic plants
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S530/00—Chemistry: natural resins or derivatives; peptides or proteins; lignins or reaction products thereof
- Y10S530/82—Proteins from microorganisms
- Y10S530/825—Bacteria
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- General Health & Medical Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Plant Pathology (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Pest Control & Pesticides (AREA)
- Insects & Arthropods (AREA)
- Medicinal Chemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- Agronomy & Crop Science (AREA)
- Virology (AREA)
- Dentistry (AREA)
- Environmental Sciences (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Agricultural Chemicals And Associated Chemicals (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
Abstract
本发明涉及新的致病杆菌毒素复合物(TC)蛋白和编码这些蛋白质的基因。更具体的,本发明涉及得自伯氏致病杆菌ILM104菌株的TC基因和蛋白。
Description
技术领域
昆虫和其他害虫引起的作物损失和控制这些害虫的花费使农民每年损失数十亿美元。由虫害引起的损失在农业生产领域包括作物产量下降、作物质量降低和收获成本提高。虫害对于蔬菜和水果种植者、观赏花卉生产者和家庭园丁和户主也是一种负担。
栽培方法(如轮作和使用高水平的氮肥)部分减轻了由农业害虫引起的问题。然而,农田利用的多种需要限制了轮作的使用。此外,一些昆虫的越冬性特征在一些地区破坏了轮作。
因此,有效水平的控制最依赖合成化学杀虫剂。然而,使用合成化学杀虫剂具有若干缺点。例如,使用这些化学品可能对许多益虫产生不利影响。靶昆虫也已发展出了对一些化学杀虫剂的抗性。另外,雨和杀虫剂使用装置校准不正确可能导致很差的控制。使用杀虫剂经常引起环境问题,如不正确使用时的土壤和供水污染,残留物还可能残留于经处理的水果和蔬菜中。操作一些杀虫剂还可能对使用者造成危害。对使用杀虫剂的新的严格限制和一些有效杀虫剂的消失可能限制了对控制具有破坏性且代价高昂的害虫的有效选择。
用生物杀虫剂取代合成化学杀虫剂或这些物质的组合可以降低环境中有毒化学品的水平。现在成功使用的一些生物杀虫剂来自土壤微生物苏云金芽孢杆菌(Bacillus thuringiensis,(B.t.))。大多数B.t.菌株不表现杀虫活性,但一些B.t.菌株产生对害虫(如昆虫)具有高毒性并且其毒活性具有特异性的蛋白质。已经分离了编码δ-内毒素蛋白的基因。其他芽孢杆菌物种也产生杀虫蛋白质。
Hofte和Whiteley将B.t.晶体蛋白分为四个大类(Hofte,H.,H.R.Whiteley[1989]Mcirobiological Reviews 52(2):242-255)。这些类别为CryI(鳞翅目特异性)、CryII(鳞翅目和双翅目特异性)、CryIII(鞘翅目特异性)和CryIV(双翅目特异性)。也已报道发现了对其他害虫具有特异毒性的菌株。例如,已经提出以CryV和CryVI命名一类线虫特异性的毒素基因。
Hofte和Whiteley于1989年对晶体蛋白的命名和分类方案是基于推定的氨基酸序列和毒素的活性谱。该系统适于覆盖分为五个主要类别的14种不同类型的毒素基因。随着越来越多的编码具有多种杀虫活性谱的基因被发现,1989年的命名方案已变得难以实行。因此,采用了修订的命名方案,该命名方案只基于氨基酸同一性(Crickmore等1998,Microbiology andMolecular Biology Reviews 62:807-813)。
已经产生并批准使用了基于重组DNA的B.t.产物。另外,使用遗传工程技术完善了多种将这些毒素递送到农业环境的方法。它们包括使用用毒素基因遗传改造以获得昆虫抗性的植物和使用稳定的完整微生物细胞作为毒素递送载体。因此,分离的芽孢杆菌毒素开始具有商业价值。
最初将B.t.蛋白毒素配制成可喷雾的昆虫控制剂。B.t.技术相对较新的应用是分离编码这些毒素的基因并转化植物。接着转基因植物产生毒素,从而提供昆虫控制。参阅Mycogen Corporation的U.S.专利No.5,380,831、5,567,600、5,567,862。转基因B.t.植物相当高效,预测在一些作物和地区中可得到高度应用。
农业上成功的使用芽孢杆菌(及其他生物)杀虫蛋白还存在一些障碍。某些昆虫可以耐受芽孢杆菌毒素的效力。现已证明如棉子象鼻虫、切根虫和美洲棉铃虫(Helicoverpa zea)等昆虫以及多数物种的成虫对许多B.t.δ-内毒素不具有显著的敏感性。
另一个潜在的障碍是昆虫发展出对B.t.毒素的抗性。B.t.植物广泛应用的可能已经引起一些担忧,即可能比传统的喷雾应用更快的出现抗性治理问题。尽管在实验室中已经选择了大量昆虫对B.t.毒素的抗性,但只有菱形斑蛾(Plutella xylostella)已证明在大田环境中具有抗性(Ferre,J.和Van Rie,J.,Annu.Rev.Entomol.47:501-533,2002)。
B.t.转基因植物技术中的抗性治理策略已经引起巨大的关注。已经提出了若干策略来保持有效使用苏云金芽孢杆菌毒素的能力。这些策略包括如带有保护区(refuge)的高剂量、(如天然细菌中)交替使用或共同使用不同的毒素(McGaughey等(1998),″B.t.Resistance Management,″NatureBiotechnol.16:144-146)。
因此,仍然非常需要发展可以在植物中表达以有效控制多种昆虫的其他基因。除了继续尝试发现新的B.t.毒素以外(由于已经发现了大量B.t.毒素而变得越来越困难),还非常需要发现产生可用于转基因植物策略的毒素的(不同于B.t.的)其他细菌来源。
最近从光杆状菌(Photorhabdus)/致病杆菌(Xenorhabdus)组细菌中克隆杀虫毒素基因的尝试提出了苏云金芽孢杆菌毒素的潜在替代物。致病杆菌属分类学定义为肠杆菌科成员,尽管其具有该科的某些非典型特征。例如,该属菌株一般为硝酸盐还原阴性和过氧化氢酶阴性。最近刚刚将致病杆菌细分产生第二个属——光杆状菌,其含有三个物种,光杆状菌asymbiotica、光杆状菌temperata和发光光杆状菌(P.luminescens)。发光光杆状菌有三个公认的亚种,发光光杆状菌akhurstii亚种、发光光杆状菌laumondii亚种和发光光杆状菌luminescens亚种(典型种)。(Fischer-LeSaux,M.,Viallard,V.,Brunel,B.,Normand,P.,Boemare,N.E.TitlePolyphasic classification of the genus Photorhabdus and proposal of newtaxa:P.luminescens subsp.luminescens subsp.nov.,P.luminescens subsp.akhurstii subsp.nov.,P.luminescens subsp.laumondii subsp.nov.,P.temperata sp.nov.,P.temperata subsp.temperata subsp.nov.and P.asymbiotica sp.nov.Int.J.Syst.Bacteriol.49;1645-1656,(1999))。这一区分基于本领域技术人员容易鉴定的若干区别特征。这些区别包括:DNA-DNA表征研究、过氧化氢酶活性表型存在(光杆状菌)或不存在(致病杆菌)、生物发光存在(光杆状菌)或不存在(致病杆菌)、宿主线虫的科(致病杆菌发现于斯氏线虫(Steinernematidae)而光杆状菌发现于异小杆线虫(Heterorhabditidae))以及胞内脂肪酸比较分析(Janse等1990,Lett.Appl.Microbiol.10,131-135、Suzuki等1990,J.Gen.Appl.Microbiol.,36,393-401)。另外,最近着眼于16S rRNA基因的序列(Rainey等,1995,Int.J.Syst.Bacteril.,45,379-381)和限制性(Brunel等,1997,App.Environ.Micro.,63,574-580)分析的分子研究也支持这两个属的划分。
致病杆菌的预计特征为:革兰氏染色阴性杆菌、白色到黄色/褐色菌落着色、存在包涵体、无过氧化氢酶、不能还原硝酸盐、无生物发光、能从培养基中摄取染料、阳性明胶水解、在肠杆菌选择性培养基上生长、生长温度低于37℃、在缺氧条件下存活以及活动性。
目前,致病杆菌属细菌包括四个公认的物种,嗜线虫致病杆菌(Xenorhabdus nematophilus)、波氏致病杆菌(Xenorhabdus poinarii)、伯氏致病杆菌(Xenorhabdus bovienii)和贝氏致病杆菌(Xenorhabdusbeddingii)(Brunel等,1997,App.Environ.Micro.,63,574-580)。在文献中已描述了多种相关菌株(如Akhurst和Boemare 1988J.Gen.Microbiol.,134,1835-1845、Boemare等1993 Int.J.Syst.Bacteriol.43,249-255页;Putz等1990,App.Environ.Microbiol.,56,181-186、Brunel等1997,App.Environ.Micro.,63,574-580、Rainey等1995,Int.J.Syst.Bacteriol.,45,379-381)。
光杆状菌和致病杆菌种是与土壤线虫共生的昆虫致病性革兰氏阴性细菌。这些细菌发现于侵袭并杀死昆虫的昆虫致病性线虫的肠中。当线虫侵袭昆虫宿主时,细菌释放到昆虫血腔(开放式循环系统)中,细菌和线虫都进行数轮复制,昆虫宿主一般死亡。可以脱离其线虫宿主培养这些细菌。对于这些细菌更详细的讨论可参阅Forst和Nealson,60 Microbiol.Rev.1(1996),21-43页。不幸的是,如在许多文章中所报道的,该细菌只在注射到昆虫幼虫时具有杀虫活性,而在经口递送时不表现生物活性。
致病杆菌和光杆状菌细菌向培养基中分泌多种物质。与光杆状菌昆虫毒性相关的多种因子的综述参阅R.H.ffrench-Constant等66 AEM No.8,3310-3329页(2000年8月)。
有效地开发线虫或其细菌共生生物的杀虫特性是困难的。因此,需要具有经口活性的来自光杆状菌/致病杆菌细菌的蛋白质剂,从而其产生的产物可以配制成喷雾杀虫剂,或者可以分离编码所述蛋白质剂的基因并用于产生转基因植物。
从发光光杆状菌和嗜线虫致病杆菌克隆编码杀虫毒素的基因已经取得了显著的进展。首先检查了来自发光光杆状菌的毒素复合物编码基因。参阅WO 98/08932。最近从嗜线虫致病杆菌克隆了平行基因。Morgan等,Applied and Environmental Microbiology 2001,67:20062-69。WO95/00647涉及致病杆菌蛋白毒素控制昆虫的用途,但未识别经口活性毒素。WO 98/08388涉及来自致病杆菌的经口施用杀虫剂。U.S.专利No.6,048,838涉及得自致病杆菌物种和菌株的具有经口活性的蛋白毒素/毒素复合物。
已经在光杆状菌属中鉴定了四种不同的毒素复合物(TC)——Tca、Tcb、Tcc和Tcd。这些毒素复合物每种在天然琼脂糖凝胶上均解离为单体或二聚体类型,但在变性凝胶上的解离显示每个复合物由25-280kDa之间的一系列蛋白质组成。图1中展示了来自光杆状菌的编码典型TC的ORF以及蛋白酶切割位点(竖箭头)。还可参阅R.H.ffrench-Constant和Bowen,57 Cell.Mol.Life Sci.828-833(2000)。
用DNA探针和针对毒素产生的单克隆和/或多克隆抗体筛选了发光光杆状菌的基因组文库。克隆了四个tc基因座:tca、tcb、tcc和tcd。tca基因座是转录自同一DNA链的三个开放读码框(ORF)tcaA、tcaB和tcaC的推定操纵子,带有反方向转录的较小的末端ORF(tcaZ)。tcc基因座还含有推定从同方向转录的三个ORF(tccA、tccB和tccC)。tcb基因座是单独的大ORF(tcbA),而tcd基因座由两个ORF(tcdA和tcdB)组成,tcbA和tcdA各约为7.5kb,编码大昆虫毒素。TcdB与TcaC具有一定水平的同源性。已经确定许多这些基因产物被蛋白酶切割。例如TcbA和TcdA都被切割成名为i、ii和iii的三个片段(如TcbAi、TcbAii和TcbAiii)。tca和tcc ORF的产物也被切割。参阅图1。还可参阅R.H.ffrench-Constant和D.J.Bowen,Current Opinions in Microbiology,1999,12:284-288。
Tca毒素复合物的生物测定显示,在经口给予(每平方厘米人工食物875ng LD50)时,它们对1龄番茄天蛾幼虫(Manduca Sexta)具有高毒性。R.H.ffrench-Constant和Bowen 1999。进食被低至40ng/cm2的Tca剂量所抑制。考虑到所预计Tca的高分子量,发光光杆状菌毒素在摩尔基础上具有高活性,且似乎需要相对较少的分子便可发挥毒效应。R.H.ffrench-Constant和D.J.Bowen,Current Opinions in Microbiology,1999,12:284-288。
四个基因座均未与GenBank中已知功能的任何序列显示出总体相似性。序列相似性区域提示这些蛋白质(TcaC和TccA)可能通过攻击昆虫血细胞来克服昆虫免疫系统。R.H.ffrench-Constant和D.J.Bowen,Current Opinions in Microbiology,1999,12:284-288。
互相进行比较时,TcaB、TcbA和TcdA在紧靠其预计的蛋白酶切割位点周围均显示出氨基酸保守性(~50%同一性)。三种不同Tc蛋白之间的这一保守性提示它们可能由相同或相似的蛋白酶进行加工。TcbA和TcdA总体上也具有~50%的同一性以及相似的羧基和氨基端预测切割模式。因此认为这些蛋白可能彼此是同源物。另外,TcbA和TcdA相似的大尺寸和两种毒素似乎均在昆虫肠中发挥作用这一事实可以提示相似的作用模式。R.H.ffrench-Constant和D.J.Bowen,Current Opinions inMicrobiology,1999,12:284-288。
缺失/敲除研究提示tca和tcd基因座的产物决定了对鳞翅目主要的经口毒性。缺失tca或tcd基因大幅降低对烟草天蛾的经口活性。即,tca和tcd基因座的产物本身为经口鳞翅目毒素,其组合效应对分泌经口活性的贡献最大。R.H.ffrench-Constant和D.J.Bowen,57Cell.Mol.Life.Sci.831(2000)。有趣的是,单独缺失tcb或tcc基因座也降低了死亡率,提示在不同基因产物间可能存在复杂的相互作用。因此,tca基因座的产物可能增强了tcd产物的毒性。或者,tcd产物可能调控tca产物(和其他复合物(如果可能))的毒性。注意到上文涉及针对单一昆虫物种的经口毒性,tcb或tcc基因座可能产生对其他昆虫群更具活性的毒素(或通过直接注射进昆虫血腔——由细菌在体内分泌时的正常递送途径——发挥活性)。R.H.ffrench-Constant和Bowen,Current Opinions in Microbiology,1999,12:284-288。
昆虫中肠上皮含有柱状(结构)和杯形(分泌)细胞。烟草天蛾摄食tca产物引起柱状细胞顶端肿胀和大细胞质囊泡起泡,导致最终将囊泡中的细胞核挤压进肠腔。杯状细胞也以同一方式受到明显影响。在经口递送或注射后,tca的产物在昆虫中肠发挥作用。R.H.french-Constant和D.J.Bowen,Current Opinions in Microbiology,1999,12:284-288。纯化的tca产物已显示出对烟草天蛾的经口毒性(875ng/cm2的LD50)。R.H.french-Constant和D.J.Bowen,57 Cell.Mol. Life Sci.828-833(2000)。
WO 99/42589和U.S.专利No.6,281,413公开了来自发光光杆状菌的TC样ORF。WO 00/30453和WO 00/42855公开了来自致病杆菌的TC样蛋白。WO 99/03328和WO 99/54472(和U.S.专利No.6,174,860和6,277,823)涉及来自致病杆菌和光杆状菌的其他毒素。
尽管目前尚不了解TC相互间确切的分子相互作用及其作用机制,但是已经知道,例如光杆状菌的Tca毒素复合物对烟草天蛾具有毒性。另外,已知一些TC蛋白具有“独立的”杀虫活性,而其他TC蛋白已知可促进或增强独立毒素的活性。已知单独是TcdA蛋白对烟草天蛾具有活性,但是可以与TcdB和TccC共同使用(与TcdA缀合)来大幅增强TcdA的活性。TcbA是来自光杆状菌的另一主要独立毒素。TcdB与TccC样蛋白也可大幅增强该毒素(TcbA)的活性。
光杆状菌TC蛋白 | 光杆状菌菌株W14命名 | 同源性 |
TcaA | 毒素C | TccA |
TcaB | TccB | |
TcaC | TcdB | |
Tcb | 毒素B |
TccA | 毒素D | TcdA N端 |
TccB | TcdA C端 | |
TccC | ||
TcdA | 毒素A | TccA+TccB |
TcdB | TcaC |
一些光杆状菌TC蛋白与其他光杆状菌TC蛋白具有一定水平的序列同源性。如上文所指出,TccA与TcdA的N端具有一定水平的同源性,而TccB与TcdA的C端具有一定水平的同源性。另外,TcdA约为280kDa,而TccA与TccB结合后一起与TcdA约为相同的大小。尽管在SCR上TccA和TccB活性远低于TcdA,仍然将来自光杆状菌W14菌株的TccA和TccB称为“毒素D”。上文还标明了“毒素A”(TcdA)、“毒素B”(Tcb或TcbA)和“毒素C”(TcaA和TcaB)。
另外,TcaA与TccA(并类似的与TcdA的N端)具有一定水平的同源性。另外,TcaB与TccB(并类似的与TcdA的N端)具有一定水平的同源性。TcdB与TcaC具有显著水平的相似性。
最近在嗜线虫致病杆菌中克隆的尝试似乎也已鉴定了与发光光杆状菌tc基因座具有同源性的新杀虫毒素基因。参阅如WO 98/08388和Morgan等Applied and Enviromental Mcirobiology 2001,67:20062-69。在R.H.和D.J.Bowen Current Opinions in Micriobiology,1999,12:284-288中,直接筛选粘粒克隆对另一鳞翅目欧洲粉蝶(Pieris brassicae)的经口毒性。测序了一个经口毒性粘粒克隆。该粘粒中的序列分析提示存在与光杆状菌tc基因具有相似性的五个不同的ORF;orf2和orf5与tcbA和tcdA具有一定水平的序列相关性,而orf1与tccB相似,orf3与tccC相似,而orf4与tcaC相似。重要的是,大量这些预测的ORF还共有发光光杆状菌中证明的推定切割位点,提示活性毒素也可能被蛋白水解加工。
存在五种来自致病杆菌的典型TC蛋白:XptA1、XptA2、XptB1、XptC1和XptD1。XptA1为“独立”毒素。XptA2为来自致病杆菌的另一个具有独立毒素活性的TC蛋白。XptB1和XptC1为能增强一种(或两种)XptA毒素活性的致病杆菌增强剂。XptD1与TccB具有一定水平的同源性。
已知XptC1与TcaC具有一定程度的相似性。已知致病杆菌的XptA2蛋白与TcdA蛋白具有一定程度的相似性。XptB1与TccC具有一定水平的相似性。
以这些毒性基因可能的来源来看,在这两种不同细菌中发现具有某些相似性的编码毒素的基因座是很有意义的。嗜线虫致病杆菌粘粒似乎也含有转座酶样序列,其存在可能提示这些基因座能在不同细菌菌株或物种中横向转移。这些转移事件的范围还可以解释两种不同细菌中tc操纵子明显不同的基因组组织。另外,仅有嗜线虫致病杆菌和发光光杆状菌的子集对烟草天蛾表现毒性,提示不同的菌株缺乏tc基因或它们带有不同的tc基因compliment。对两种菌株的详细分析和这些细菌物种之中(和之间)的毒素系统发生可能帮助阐明毒素基因可能的来源以及它们如何在不同的细菌种群中维持。R.H.ffrench-Constant和Bowen,Current Opinions inMicrobiology,1999,12:284-288。
近来已从其他昆虫相关细菌(如昆虫病原体嗜虫沙雷氏菌(Serratiaentomophila))中描述了TC蛋白和基因。Waterfield等TRENDS inMicrobiology,第9卷,No.4,2001年4月。
总而言之,来自发光光杆状菌和嗜线虫致病杆菌的毒素复合物蛋白似乎与先前鉴定的细菌毒素同源性很低,应该能够提供苏云金芽孢杆菌毒素的有用替代物。尽管它们与其他经口活性毒素在昆虫中肠具有相似的毒效应,其精确的作用模式仍不清楚。未来的工作将可能阐明其作用机制。
类芽孢杆菌(Paenibacillus)属细菌通过其独特的rRNA和表型特征区别于其他细菌(C.Ash等(1993),″Molecular identification group 3bacilli(Ash,Farrow,Wallbanks and Collins)using a PCR probe test:Proposal for the creation of a new genus Paenibacillus”Antonie VanLeeuwenhoek 64:253-260)。已知该属中的一些物种对蜜蜂(幼虫类芽孢杆菌(Paenibacillus larvae))和金龟幼虫(日本金龟类芽孢杆菌(P.popilliae)和患病类芽孢杆菌(P.lentimorbus))具有致病性。幼虫类芽孢杆菌、日本金龟类芽孢杆菌和患病类芽孢杆菌认为是金龟乳样病相关的专性昆虫病原体(A.Balows等编辑,《The Procaryotes》,第二版,第2卷,Springer-Verlag,New York,NY中D.P.Stahly等(1992),″The genusBacillus:insect pathogens,″1697-1745页)。
已在日本金龟类芽孢杆菌和患病类芽孢杆菌菌株中鉴定了晶体蛋白Cry18。Cry18具有金龟和幼虫毒性,并与Cry2蛋白具有约40%同一性(Zhang等,1997;Harrison等,2000)。
最近已在类芽孢杆菌中发现了TC蛋白和鳞翅目毒Cry蛋白。参阅提交于2002年6月28日的U.S.Serial No.60/392,633(Bintrim等)。
尽管发现一些致病杆菌TC蛋白“对应于”(具有相似的功能和一定水平的序列同源性)一些光杆状菌TC蛋白,但“对应的”蛋白彼此仅共有约40%(近似值)的序列同一性。新近从类芽孢杆菌中发现的TC蛋白也是如此(这些蛋白及其发现为共同未决的U.S.Serial No.60/392,633的主题)。
出于对昆虫发展出对给定杀虫毒素的抗性的考虑以及其他考虑(其中一些在上文已有讨论),需要继续发现新的杀虫毒素和可用于控制昆虫的其他蛋白质。
发明概述
本发明涉及新的致病杆菌毒素复合物(TC)蛋白质及编码这些蛋白质的基因。更具体地,本发明涉及得自伯氏致病杆菌ILM104菌株的TC蛋白和基因。
附图简述
图1显示来自光杆状菌的TC操纵子。
序列简述
SEQ ID NO:1为天然xptB1xb编码区(4521个碱基)。
SEQ ID NO:2为SEQ ID NO:1编码的天然XptB1xb蛋白(1506个氨基酸)。
SEQ ID NO:3为天然xptC1xb编码区(2889个碱基)。
SEQ ID NO:4为SEQ ID NO:3编码的天然XptC1xb蛋白(962个氨基酸)。
SEQ ID NO:5为天然xptA1xb编码区(局部)(3822个碱基)。
SEQ ID NO:6为SEQ ID NO:5(局部)编码的天然XptA1xb蛋白(1273个氨基酸)。
SEQ ID NO:7为含有天然xptB1xb编码区的表达质粒pDAB6031的Xba I至Xho I片段(4595个碱基),其中40到4557碱基编码SEQ ID NO:2的蛋白质。
SEQ ID NO:9为含有天然xptB1xb和天然xptC1xb编码区的表达质粒pDAB6033的Xba I至Xho I片段(7508个碱基),其中40到4557碱基编码SEQ ID NO:2的蛋白质,4601到7486碱基编码SEQ ID NO:4的蛋白质。
SEQ ID NO:10为名为xptA1xb的新A类基因的全长编码序列。
SEQ ID NO:11为SEQ ID NO:10的读码框编码的蛋白质(XptA1xb)。
发明详述
本发明涉及新致病杆菌毒素复合物(TC)蛋白质及编码这些蛋白质的基因。更具体地,本发明涉及可得自伯氏致病杆菌ILM104菌株的TC基因和蛋白。
存在三种主要的TC蛋白类型。在本文中,A类蛋白(“A蛋白”)为独立毒素。天然A类蛋白约为280kDa。B类蛋白(“B蛋白”)和C类蛋白(“C蛋白”)增强A类蛋白的毒性。在本文中,天然B蛋白约为170kDa,天然C蛋白约为112kDa。A类蛋白的实例为TcbA、TcdA、XptA1和XptA2。B类蛋白的实例为TcaC、TcdB、XptB1xb和XptC1wi。C类蛋白的实例为TccC、XptC1xb和XptB1wi。
先前显示(U.S.专利No.6,048,838)致病杆菌ILM104菌株(NRRLB-30021,保藏于1998年4月30日)产生对鞘翅目、鳞翅目、双翅目和螨目昆虫的成员具有经口杀虫活性的胞外蛋白。本文公开了可得自ILM104菌株的两种特异性TC增强剂和一种TC毒素(及它们的编码基因)。
可以将本发明的多核苷酸插入植物基因组,以使植物产生该多核苷酸所编码的蛋白质。从而使以产生(和含有)该蛋白质的植物组织为食的昆虫与蛋白质接触,并以这种方式受到控制。可以以这种方式(即在植物中表达)或其他方式使用TC蛋白基因控制昆虫和其他类似的害虫。优选地,产生表达本发明基因的植物,以使植物细胞中产生(优选存在)一种或多种本发明的蛋白质。可以构建植物以共表达本发明的基因,以使得到的蛋白质促进或增强如XptA1和/或XptA2TC蛋白毒素。
对昆虫或其他害虫施用本发明蛋白质的其他方法为本领域所熟知。另外,本发明蛋白质不限于彼此共同使用,如本领域所共知,它们可以与其他蛋白质(如B.t.毒素)分开(或组合)使用。
蛋白质和毒素
本发明提供易于施用的功能性蛋白质。本发明还提供递送杀虫毒素的方法,所述杀虫毒素对多个目的昆虫(优选鳞翅目昆虫)具有功能活性和效力。在本文中“功能活性”(或“针对……的活性”)指蛋白质毒素(单独或与其他蛋白质组合)发挥经口活性昆虫控制剂的功能、蛋白质(单独或与其他蛋白质组合)具有毒效应或能够破坏或终止昆虫生长和/或进食,引起或不引起昆虫死亡。当昆虫与有效剂量的本发明“毒素”接触时,结果一般为昆虫死亡、抑制昆虫的生长和/或繁殖和/或阻止昆虫进食对昆虫发生作用的毒素来源(优选转基因植物),其中所述毒素通过转基因植物表达、制备的蛋白质组合物、可喷雾蛋白质组合物、毒饵基质或其他递送系统进行递送。还可以单独或共同使用本发明的功能性蛋白质以增强或提高一种或多种其他毒素的活性。本文所使用的术语“有毒的”、“毒性”或“毒素”旨在表示对象“毒素”具有如本文定义的“功能活性”。
实现功能活性优选但不必须完全致死进食昆虫。如果昆虫逃避毒素或停止进食,该逃避在一些应用中将是有用的,尽管该效应是非致死的或者致死性是延后的或间接的。例如,如果需要昆虫抗性的转基因植物,阻碍昆虫进食该植物与对昆虫的致死毒性是同样有效的,因为最终目的是避免昆虫引起的植物损害。
还有许多将毒素掺入昆虫食物的其他方法。例如,可以如本文所述通过用蛋白质溶液喷雾食物将毒蛋白掺入幼虫食物。另外,可以将纯化的蛋白质遗传改造进本来无害的细菌中,然后在培养基中培养所述细菌,之后用于食物来源或使其存在于需要根除昆虫的地区的土壤中。也可以直接将蛋白质遗传改造进昆虫食物来源中。例如,许多昆虫幼虫的主要食物来源为植物材料。因此可以将编码毒素的基因转移到植物材料,以使所述植物材料表达目的毒素。
将功能活性转移到植物或细菌系统一般需要将编码毒素氨基酸序列的核苷酸序列整合进蛋白质表达载体中,所述载体适合于该载体将滞留的宿主。如本文所述,获得编码功能活性蛋白质的核酸序列的一种方法为使用从毒素氨基酸序列推导的信息,从产生该毒素的细菌物种中分离天然遗传物质。可以优化天然序列以在植物中表达,例如下文更详细的讨论。也可以基于蛋白质序列设计优化的多核苷酸。
本发明提供具有有利的杀虫活性的新毒素类别。表征这些毒素类别和编码它们的多核苷酸的一种方法是通过在一系列指定条件下与示例核苷酸序列(其互补物和/或来自任一链的探针)杂交的能力和/或通过使用来自示例序列的引物的PCR对其扩增的能力来定义多核苷酸。
存在多种获得本发明杀虫毒素的方法。例如,可以使用本文公开和要求的杀虫毒素抗体从蛋白质混合物中鉴定和分离其他毒素。具体地,可以针对毒素中最恒定和与其他毒素区别最大的毒素部分产生抗体。然后可以通过免疫沉淀、酶联免疫吸附测定(ELISA)或Western印迹,使用这些抗体特异性鉴定具有特征活性的毒素。可以使用标准程序便利地制备针对本发明毒素或等价毒素或这些毒素的片段的抗体。这些抗体作为一方面包含于本发明中。本发明的毒素可得自多种来源/来源微生物。
本领域技术人员容易理解,可以从多种来源获得本发明的毒素(和基因)。“来自”或“得自”对象隔离群指该毒素(或相似的毒素)可得自伯氏致病杆菌ILM104菌株或其他一些来源,如其他细菌菌株或植物。例如,本领域技术人员容易理解,在细菌基因和毒素公开后就可以设计植物以产生该毒素。可以使用本文公开的多核苷酸和/或氨基酸序列制备抗体制剂、核酸探针(DNA和RNA)等,并用于从其他(天然)来源筛选和回收其他毒素基因。
多核苷酸和探针
本发明还提供编码本发明毒素的核苷酸序列。本发明还提供鉴定和表征编码杀虫毒素的基因的方法。在一个实施方案中,本发明提供可作为杂交探针和/或PCR技术的引物使用的独有核苷酸序列。引物产生可用于鉴定、表征和/或分离特定毒素基因的特征性基因片段。本发明的核苷酸序列编码与先前已描述毒素不同的毒素。
本发明的多核苷酸序列可用于形成完整“基因”,以在所需宿主细胞中编码蛋白质或多肽。例如,本领域技术人员容易了解,可以如本领域所熟知的,在目的宿主中将本发明的多核苷酸适当地放置于启动子的控制之下。
如本领域技术人员所知,DNA一般以双链形式存在。在这种排列下,一条链与另一条链互补,反之亦然。当DNA在(例如)植物中复制时产生了额外的互补DNA链。本领域中经常使用“编码链”指代与反义链结合的链。mRNA转录自DNA的“反义”链。“有义”或“编码”链具有一连串密码子(密码子是可以作为三残基单位阅读以指定特定氨基酸的三个核苷酸),所述密码子可以作为开放读码框(ORF)阅读以形成目的蛋白质或多肽。为在体内产生蛋白质,一般将DNA链转录成用作蛋白质模板的mRNA互补链。因此,本发明包括附带的序列列表中所示的示例多核苷酸和/或等价物(包括互补链)的用途。与示例DNA功能性等价的RNA和PNA(肽核酸)也包含于本发明中。
在本发明的一个实施方案中,可以在使微生物高度增殖的条件下培养细菌隔离群。在处理微生物以提供单链基因组核酸后,可以用本发明的引物与DNA接触并进行PCR扩增。毒素编码基因的特征性片段将被该操作扩增,从而鉴定毒素编码基因的存在。
本发明的其他方面包括使用本文公开的方法和核苷酸序列鉴定的基因和隔离群。所鉴定的基因编码对害虫具有活性的毒素。
可以通过如使用寡核苷酸探针鉴定和获得本发明的毒素和基因。这些探针为可检测的核苷酸序列,所述核苷酸序列由于适当的标记或如国际申请No.WO 93/16094所述使其具有固有的荧光性而可被检测。探针(和本发明的多核苷酸)可以是DNA、RNA或PNA。除了腺嘌呤(A)、胞嘧啶(C)、鸟嘌呤(G)、胸腺嘧啶(T)和尿嘧啶(U,RNA分子中)之外,本发明的合成探针(和多核苷酸)还可以含有次黄嘌呤核苷(能与全部四种碱基配对的中性碱基,有时用于在合成探针中代替全部四种碱基的混合物)。因此,当本文中提到合成的简并寡核苷酸时一般使用“n”,“n”可以是G、A、T、C或次黄嘌呤核苷。本文使用的多义密码子与提交本申请的标准IUPAC命名约定一致(如R为A或G、Y为C或T等)。
如本领域所熟知,如果探针分子与核酸样品杂交,可以合理的假设探针和样品具有显著同源性/相似性/同一性。优选地,首先进行多核苷酸杂交,之后使用本领域所熟知的技术在低、中或高严格条件下进行洗涤,如Keller,G.H.,M.M.Manak(1987)《DNA Probes》,Press,New York,NY,169-170页中所述。例如,如文中所述可以通过首先在室温下用2×SSC(标准柠檬酸盐水)/0.1%SDS(十二烷基硫酸钠)洗涤15分钟来得到低严格条件。一般进行两次洗涤。可以通过降低盐浓度和/或通过提高温度得到较高的严格度。例如,在上述洗涤之后可以接着在室温下两次用0.1×SSC/0.1%SDS洗涤15分钟,然后在55℃用0.1×SSC/0.1%SDS接着洗涤30分钟。如本领域技术人员所知,这些温度可以与本文公开的其他杂交和洗涤程序一起使用(例如可以用SSPE代替SSC作为盐使用)。可以通过向445ml水中加入50ml 20×SSC和5ml 10%SDS制备2×SSC/0.1%SDS。可以通过组合NaCl(175.3g/0.150M)、柠檬酸钠(88.2g/0.015M)和1升水并接着用10N NaOH调整pH至7.0来制备20×SSC。可以通过将10g SDS溶解到50ml高压灭菌水中、稀释到100ml来制备10%SDS并分成等分试样%。
检测探针提供了以已知方式测定是否保持了杂交的方法。这种探针分析提供了鉴定本发明的毒素编码基因的快速方法。可以使用DNA合成仪和标准程序合成本发明用作探针的核苷酸片段。这些核苷酸片段还可以用作PCR引物以扩增本发明的基因。
可以用分子的杂交特征定义本发明的多核苷酸。因此本发明包括与本文示例的多核苷酸杂交的多核苷酸(和/或其互补物、优选其完整互补物)。
本文使用的“严格”杂交条件指得到与本应用中所使用条件相同或大致相同程度杂交特异性的条件。具体地,通过标准方法进行固定在Southern印迹上的DNA与32P标记的基因特异性探针的杂交(参阅如Maniatis,T.,E.F.Fritsch,J.Sambrook[1982]《Molecular Cloning:ALaboratory Manual》,Cold Spring Harbor Laboratory,Cold SpringHarbor,NY)。一般在允许检测靶序列的条件下进行杂交和其后的洗涤。对于双链DNA基因探针,在6×SSPE、5×Denhardt溶液、0.1%SDS、0.1mg/ml变性DNA中在DNA杂交体解链温度(Tm)以下20-25℃进行过夜杂交。按下式描述解链温度(Beltz,G.A.,K.A.Jacobs,T.H.Eickbush,P.T.Cherbas和F.C.Kafatos[1983]《Methods of Enzymology》,R.Wu,L.和K.Moldave[编辑]Academic Press,New York 100:266-285):
Tm=81.5℃+16.6Log[Na+]+0.41(G+C%)-0.61(甲酰胺%)-600/双链体长度(碱基对)
一般如下述进行洗涤:
(1)在1×SSPE、0.1%SDS中室温15分钟洗涤两次(低严格洗涤)。
(2)在0.2×SSPE、0.1%SDS中在Tm-20℃15分钟洗涤一次(中度严格洗涤)。
对于寡核苷酸探针,在6×SSPE、5×Denhardt溶液、0.1%SDS、0.1mg/ml变性DNA中在杂交体解链温度(Tm)以下10-20℃进行过夜杂交。按下式确定寡核苷酸探针的Tm:
Tm(℃)=2(T/A碱基对数)+4(G/C碱基对数)
(Suggs,S.V.,T.Miyake,E.H.Kawashime,M.J.Johnson,K.Itakura和R.B.Wallace[1981]ICN-UCLA Symp.Dev.Biol.UsingPurified Genes,D.D.Brown[编辑],Academic Press,New York,23:683-693)。
一般如下述进行洗涤:
(1)1×SSPE、0.1%SDS中室温15分钟洗涤两次(低严格洗涤)。
(2)在0.2×SSPE、0.1%SDS中在Tm-20℃15分钟洗涤一次(中度严格洗涤)。
一般可以改变盐和/或温度来改变严格度。对于长度>70碱基左右的标记DNA片段,可以使用以下条件:
低:1或2×SSPE,室温
低:1或2×SSPE,42℃
中:0.2×或1×SSPE,65℃
高:0.1×SSPE,65℃
双链体的形成和稳定依赖于杂交体两条链间的显著互补性,且如上文所提到的,某种程度的错配是允许的。因此,本发明的探针序列包括所述序列的突变(单链和双链)、缺失、插入及其组合,其中所突变、插入和缺失允许与目的靶多核苷酸形成稳定的杂交体。可以以多种方法在给定的多核苷酸序列中产生突变、插入和缺失,这些方法为本领域普通技术人员所共知。其他方法可能在将来逐渐被了解。
PCR技术
聚合酶链式反应(PCR)是核苷酸序列的引发式的重复性酶合成。该方法为本领域技术人员所熟知并普遍使用(参阅Mullis,U.S.专利No.4,683,195、4,683,202和4,800,159;Saiki,Randall K.,Stephen Scharf,FredFaloona,Kary B.Mullis,Glenn T.Horn,Henry A.Erlich,NormanArnheim[1985]″Enzymatic Amplification of Genomic Sequences andRestriction Site Analysis for Diagnosis of Sickle Cell Anemia,″Science 230:1350-1354)。PCR是基于目的DNA片段的酶扩增,所述DNA片段两侧为与靶序列相反链杂交的两个寡核苷酸引物。引物在3’末端彼此指向。模板热变性、引物与其互补序列退火和用DNA聚合酶延伸退火引物的重复循环引起PCR引物5’末端所定义片段的延伸。每一引物的延伸产物可作为其他引物的模板,因此每个循环基本上倍增前一循环中产生的DNA片段数量。这引起特定靶片段的指数积累,可在几小时内多达数百万倍。通过使用热稳定的DNA聚合酶(如分离自嗜热水生菌(Thermus aquaticus)的Taq聚合酶)可以自动完成扩增过程。可以使用的其他酶为本领域技术人员所共知。
本发明的DNA序列可以用作PCR扩增的引物。在PCR中,引物和模板间某种程度的错配是可以接受的。因此,示例引物的突变、缺失和插入(特别是在5’末端加入核苷酸)在本发明的范围内。可以通过本领域普通技术人员已知的方法在给定的引物中产生突变、插入和缺失。
基因和毒素的修饰
本发明的有用基因和毒素不仅包括具体示例的全长序列,还包括这些序列的部分、区段和/或片段(包括与全长分子相比内部和/或末端缺失)、其变体、突变体、嵌合体和融合物。只要保留本文具体示例的氨基酸的特征性杀虫/功能活性,本发明的蛋白质可以具有替代的氨基酸。“变体”基因含有编码具有与示例蛋白质等价杀虫活性的相同毒素或等价毒素的核苷酸序列。术语“变体蛋白”和“等价毒素”指对靶害虫具有相同或基本相同的生物/功能活性和与示例毒素等价的序列的毒素。本文所使用的“等价”序列指含有提高或不对杀虫活性产生不利影响的氨基酸替代、缺失、添加或插入的序列。保留了杀虫活性的片段也包含于本发明中。保留了如示例毒素相应片段的相同或相似功能或“毒素活性”的片段和其他等价物在本发明的范围内。可以为了各种目的(如(不严重/显著降低毒素功能活性地)提高(或降低)蛋白质的蛋白酶稳定性)产生变化,如氨基酸替代或添加。
可以使用本文提供的教导,从野生型或重组细菌和/或来自其他野生型或重组生物获得/产生等价毒素和/或编码这些等价毒素的基因。可以使用其他芽孢杆菌、类芽孢杆菌、光杆状菌和致病杆菌物种作为来源隔离群。
例如,可以使用产生点突变的标准技术便利地构建基因的变异。另外,如U.S.专利No.5,605,793描述了在随机破碎后使用DNA重组装产生额外的分子多样性的方法。可以使用变体基因产生变体蛋白质,可以使用重组宿主产生变体蛋白质。使用这些“基因改组”技术可以构建含有本文示例的任何序列中任何5、10或20个连续残基(氨基酸或核苷酸)的等价基因或蛋白质。如本领域技术人员所了解的,可以调整基因改组技术以获得具有如与任何示例序列(或其互补物(完整互补物))中(相同大小)的片段相对应的3、4、5、6、7、8、9、10、11、12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30、31、32、33、34、35、36、37、38、39、40、41、42、43、44、45、46、47、48、49、50、51、52、53、54、55、56、57、58、59、60、61、62、63、64、65、66、67、68、69、70、71、72、73、74、75、76、77、78、79、80、81、82、83、84、85、86、87、88、89、90、91、92、93、94、95、96、97、98、99、100、101、102、103、104、105、106、107、108、109、110、111、112、113、114、115、116、117、118、119、120、121、122、123、124、125、126、127、128、129、130、131、132、133、134、135、136、137、138、139、140、141、142、143、144、145、146、147、148、149、150、151、152、153、154、155、156、157、158、159、160、161、162、163、164、165、166、167、168、169、170、171、172、173、174、175、176、177、178、179、180、181、183、186、187、188、189、190、191、192、193、194、195、196、197、198、199、200、201、202、203、204、205、206、207、208、209、210、211、212、213、214、215、216、219、220、221、222、223、224、226、227、228、229、230、231、232、235、236、237、238、239、240、242、243、244、245、246、247、248、249、250、251、253、254、255、256、258、259、260、261、262、263、264、265、266、268、269、270、271、272、273、274、275、276、277、278、279、282、284、285、286、287、288、289、290、291、292、293、294、295、296、297、298、299、300、301、302、303、304、305、306、307、308、309、310、311、312、313、314、315、316、317、318、319、320、323、324、325、326、327、328、329、330、331、332、333、334、335、336、337、338、339、340、341、342、343、344、345、346、347、348、349、350、351、352、353、354、355、356、357、358、359、360、361、362、363、366、367、368、369、370、371、372、373、374、375、376、377、380、381、382、383、384、385、386、387、388、389、390、391、392、393、394、395、396、397、398、399、400、401、402、403、404、405、406、407、408、409、410、411、412、413、414、415、416、417、418、419、420、421、422、423、424、425、426、427、428、429、430、431、432、433、434、435、436、437、438、439、440、441、442、443、444、445、446、447、448、449、450、452、453、454、455、456、457、458、459、460、461、462、463、464、465、466、467、468、469、470、471、472、473、474、475、476、477、478、479、480、481、482、483、484、485、486、487、488、489、490、491、492、493、494、495、496、497、498、499或500个连续残基(氨基酸或核苷酸)的等价物。类似大小的片段特别是保守区的片段也可以作为探针和/或引物。
可以根据标准程序使用市售的外切核酸酶或内切核酸酶产生全长基因的片段。例如,可以使用酶如(Bal31)或定向诱变从这些基因的末端系统地切除核苷酸。还可以使用多种限制性酶获得编码活性片段的基因。可以使用蛋白酶直接获得这些毒素的活性片段。
如本文所公开的,可以截短毒素而仍保留功能活性,这也在本发明的范围内。“截短的毒素”指毒素蛋白的部分可被切除,并在切除后仍表现活性。可以通过昆虫中肠内外的蛋白酶进行切割。另外,可以使用分子生物学技术产生有效切割的蛋白质,其中通过用限制性内切核酸酶消化或本领域技术人员可使用的其他技术去除编码所述毒素的DNA碱基。截短后,可以在异源系统(如大肠杆菌、杆状病毒、基于植物的病毒系统、酵母等)中表达所述蛋白质,然后置于本文公开的昆虫实验中测定活性。如本领域所熟知,可以成功地产生小于完整的全长序列而保留功能活性的截短毒素。本领域熟知,可以以截短(核心毒素)形式使用B.t.毒素。参阅如Adang等,Gene 36:289-300(1985),″Characterized full-length and truncatedplasmid clones of the crystal protein of Bacillus thuringiensis subspkurstaki HD-73 and their toxicity to sexta.″。还有其他保留杀虫活性的截短蛋白的实例,包括昆虫保幼激素酯酶(Regents of the University ofCalifornia的U.S.专利No.5,674,485)。本文所使用的术语“毒素”还旨在包括功能活性的截短物。
本发明的某些毒素已在本文中具体示例。这些毒素仅是本发明毒素的范例,显然本发明包括具有与示例毒素相同或相似杀虫活性的变体或等价毒素(和编码等价毒素的核苷酸序列)。等价毒素与示例毒素具有氨基酸相似性(和/或同源性)。氨基酸同一性一般高于60%,优选高于75%,更优选高于80%,甚至更优选高于90%并可以高于95%。还可以通过更具体的同一性和/或相似性范围定义本发明优选的多核苷酸和蛋白质。例如,与本文示例的序列相比,同一性和/或相似性可以为41、42、43、44、45、46、47、48、49、50、51、52、53、54、55、56、57、58、59、60、61、62、63、64、65、66、67、68、69、70、71、72、73、74、75、76、77、78、79、80、82、83、84、86、87、88、90、91、92、93、94、95、96、97、98或99%。
除非另有说明,使用如Karlin和Altschul(1993),Proc.Natl.Acad.Sci.USA 90:5873-5877中所改良的Karlin和Altschul(1990),Proc.Natl.Acad.Sci.USA 87:2264-2268的算法测定本文所使用的两个核酸的序列同一性和/或相似性百分比。这样的算法整合在Altschul等(1990),J.Mol.Biol.215:402-410的NBLAST和XBLAST程序中。使用NBLAST程序进行BLAST核苷酸搜索,评分=100、字宽=12。可以如Altschul等(1997),Nucl.Acids Res.25:3389-3402中所述使用Gapped BLAST。使用BLAST和Gapped BLAST程序时,使用程序(NBLAST和XBLAST)各自的默认参数。参阅NCBI/NIH网址。使用上文背景部分中Crickmore等所述的方法和算法计算分数。使用默认参数用Vector NTI Suite 8(InforMax,Inc.,North Bethesda,MD,U.S.A)中的AlignX函数得到用于比较目的的缺口比对。它们是:缺口打开罚分15、缺口延伸罚分6.66和缺口分离罚分范围8。
氨基酸同源性/相似性/同一性在决定生物活性的毒素关键区域中是最高的或者与最终决定生物活性的三维构型决定相关。考虑到这一点,某些氨基酸替换是可以接受的并预计是可承受的。例如,这些替换可以在对活性非关键的蛋白质区域中。可以使用蛋白质晶体结构分析和基于软件的蛋白质结构模拟来鉴定可(使用定向诱变、改组等)进行修饰以确实改变特性和/或提高蛋白质功能性的蛋白质区域。
还可以改变蛋白质的多种特性和三维特征而不对蛋白质的毒素活性/功能性产生不利影响。可以预计保守性氨基酸替代是可接受的/不对分子的三维构型产生不利影响。氨基酸可按以下分类:非极性、极性不带电荷、碱性和酸性。只要替代不损害化合物的生物活性,则保守性替代在本发明的范围内,在所述保守性氨基酸替代中,一类氨基酸被同一类型的另一氨基酸替换。
表I提供属于每一类的氨基酸实例列表
表I | |
氨基酸类别极性极性不带电荷酸性碱性 | 氨基酸实例丙氨酸、缬氨酸、亮氨酸、异亮氨酸、脯氨酸、甲硫氨酸、苯丙氨酸、色氨酸甘氨酸、丝氨酸、苏氨酸、半胱氨酸、酪氨酸、天冬酰胺、谷氨酰胺天冬氨酸、谷氨酸赖氨酸、精氨酸、组氨酸 |
在一些情况下,还可以进行非保守性替代。关键因素是这些替代不能明显降低毒素的功能/生物活性。
本文提到的“分离的”多核苷酸和/或“纯化的”毒素是指它们不伴随有与其天然共存的其他分子。因此,提到“分离的”和/或“纯化的”表示与本文所述的“人为”相关。例如,放入植物进行表达的本发明细菌毒素“基因”是“分离的多核苷酸”。同样,本文示例的植物所产生的致病杆菌蛋白质是“分离的蛋白质”。
由于遗传密码的简并性/冗余性,多种不同的DNA序列都可以编码本文公开的氨基酸序列。产生编码相同或基本相同毒素的替代DNA序列在受本领域训练的技术人员的能力范围内。这些变体DNA序列在本发明的范围内。
用于植物表达的序列优化
为了在植物中得到异源基因的高表达,可能优选重新设计所述基因以使其在植物细胞(的胞质)中更有效的表达。玉米就是这样的一种植物,其中在转化前重新设计异源基因以提高其在所述植物中的表达水平可能是优选的。因此,在设计编码细菌毒素的基因中额外的步骤是重新改造异源基因以得到最优表达。
在玉米中进行表达时重新改造细菌毒素的一个原因是由于天然基因的非最优G+C含量。例如,许多天然细菌基因极低的G+C含量(和因此高A+T含量的倾向)引起产生模拟或复制已知为高度富含A+T的植物控制序列的序列。在引入植物的基因中存在一些富含A+T的序列(如一般见于基因启动子中的TATA盒区)可能引起基因的异常转录。另一方面,转录的mRNA中其他调节序列(如多腺苷酸化信号序列(AAUAAA)或与前mRNA剪接相关的核内小RNA的互补序列)的存在可能导致RNA的不稳定性。因此,设计用于玉米表达的编码细菌毒素的基因(更优选称为植物优化基因)的一个目的是产生具有较高G+C含量(优选与编码代谢酶的玉米基因接近的G+C含量)的DNA序列。设计编码细菌毒素的植物优化基因的另一个目的是产生其序列修饰不阻碍翻译的DNA序列。
下表(表II)展示了玉米中的G+C含量有多高。对于表II中的数据,基因的编码区出自GenBank(第71次发布)条目,并用Mac Vector.TM.(IBI,New Haven,Conn.)程序计算碱基组成。计算中忽略了内含子序列。
由于遗传密码的冗余性/简并性(即一些氨基酸由多于一个密码子指定)提供的可塑性,基因组在不同生物或不同纲的生物中的进化引起了对冗余密码子不同的使用。在蛋白质编码区的平均碱基组成中反映了这种“密码子偏好”。例如,具有相对低G+C含量的生物使用冗余密码子的第三个位置上为A或T的密码子,而具有较高G+C含量的使用第三个位置上为G或C的密码子。mRNA中少见密码子的存在被认为可降低该mRNA的绝对翻译率,特别是对应于少见密码子的载荷tRNA的相对丰度低时。这一事实可扩展为:对于多个少见密码子,单个少见密码子对翻译的降低将至少是累加的。因此,具有相对高含量少见密码子的mRNA将相应的具有低翻译率。该翻译率将进而反映为低水平的编码蛋白质。
在设计用于玉米(或其他植物,如棉花或大豆)表达的编码细菌毒素的基因时已经测定了所述植物的密码子偏好。玉米的密码子偏好是该植物用于编码其蛋白质的密码子统计分布,表III显示了优选的密码子使用。测定偏好之后,测定目的基因中密码子的频率百分比。应该测定植物优选的主要密码子以及优选密码子的第二和第三选择。其后,反向翻译目的细菌毒素的氨基酸序列,以使得到的核酸序列与待异源表达的天然基因编码完全相同的蛋白质。使用密码子偏好信息设计新的DNA序列,使其对应于所需植物最优选的密码子。然后分析新序列中可能由修饰产生的限制性酶切位点。通过用第二或第三选择的优选密码子取代该密码子来进一步修饰鉴定出的位点。序列中可能影响目的基因转录或翻译的其他位点为外显子-内含子接合处(5’或3’)、多腺苷酸加成信号或RNA聚合酶终止信号。进一步分析和修饰序列,以降低TA或GC双联体的频率。除了双联体之外,具有多于约四个相同碱基的G或C序列嵌段也能影响序列的转录。因此,也通过用下一优选的密码子选择取代首选或次选的密码子来修饰这些嵌段。
表II | ||
玉米基因蛋白质编码区蛋白质分类注a | G+C含量汇编G+C百分比范围 | 平均G+C百分比注b |
代谢酶(76)结构蛋白(18)调节蛋白(5)未表征蛋白质(9)所有蛋白质(108) | 44.4-75.348.6-70.557.2-68.841.5-70.344.4-75.3 | 59.0(.+-.8.0)63.6(.+-.6.7)62.0(.+-.4.9)64.3(.+-.7.2)60.8(.+-.5.2) |
注a:每一类基因的数目在括号内给出。
注b:标准差在括号内给出。
注c:平均值计算中忽略了组合组平均值。
编码细菌毒素的植物优化基因优选含有约63%的首选密码子、约22%到约37%之间的次选密码子和约15%到约0%之间的三选密码子,其中总百分比为100%。最优选植物优化基因含有约63%的首选密码子、至少约22%的次选密码子、约7.5%的三选密码子和约7.5%的四选密码子,其中总百分比为100%。设计用于玉米表达的基因所优选的密码子使用示于表III。上述方法使本领域技术人员能够修饰就具体植物而言是外来的基因,从而使基因在植物中优化表达。PCT申请WO 97/13402中进一步说明了该方法。
为了设计编码细菌毒素的植物优化基因,使用建立自密码子偏好表的非冗余遗传密码将所述蛋白质的氨基酸序列反向翻译成DNA序列,所述偏好表整理自具体植物的基因序列,并示于表II。进一步修饰得到的在密码子使用上完全同质的DNA序列,以产生除了具有高度密码子多样性以外还含有根据策略放置的限制性酶识别位点、所需的碱基组成并缺少可能干扰基因转录或产物mRNA翻译的序列的DNA序列。
表III | |
在玉米中表达的蛋白质优选的氨基酸密码子 | |
氨基酸 | 密码子* |
丙氨酸半胱氨酸天冬氨酸 | GCC/GCGTGC/TGTGAC/GAT |
谷氨酸苯丙氨酸甘氨酸组氨酸异亮氨酸赖氨酸亮氨酸甲硫氨酸天冬酰胺脯氨酸谷氨酰胺精氨酸丝氨酸苏氨酸缬氨酸色氨酸酪氨酸终止 | GAG/GAATTC/TTTGGC/GGGCAC/CATATC/ATTAAG/AAACTG/CTCATGAAC/AATCCG/CCACAG/CAAAGG/CGCAGC/TCCACC/ACGGTG/GTCTGGTAC/TATTGA/TAG |
*玉米的首选和次选密码子
因此,可以使用与本发明的毒素/基因功能性等价的合成基因转化宿主(包括植物)。关于产生合成基因的其他指导可见如U.S.专利No.5,380,831。
在一些情况下,特别是在植物中进行表达时,使用表达截短蛋白的截短基因可能是有利的。例如上文背景技术部分讨论过的Hofte等,1989讨论了B.t.毒素的原毒素和核心毒素。优选的截短基因一般编码全长毒素的40%、41%、42%、43%、44%、45%、46%、47%、48%、49%、50%、51%、52%、53%、54%、56%、57%、58%、59%、60%、61%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%、或99%。
转基因宿主
可以将本发明的毒素编码基因引入多种微生物或植物宿主中。在优选的实施方案中,使用转基因植物细胞和植物。优选的植物(和植物细胞)为谷类、玉米和棉花。
在优选的实施方案中,毒素基因的表达直接或间接引起杀虫蛋白在细胞内的产生和保持。植物可以以这种方式得到昆虫抗性。当转基因/重组/转化的/转染的宿主细胞(或其内含物)被害虫消化时,害虫将消化毒素。这是使害虫与毒素接触优选的方式。其结果为控制(杀死或致病)害虫。也可以以类似的方式控制吸吮害虫。另外,当存在靶害虫时,可以使用合适的微生物宿主,例如假单胞菌如荧光假单胞菌,微生物可以在其中增殖并被靶害虫消化。可以在延长毒素活性并使细胞稳定的条件下处理携带毒素基因的微生物。然后可以将保留有毒活性的经处理细胞应用于靶害虫的环境中。
当通过合适的载体将毒素基因引入微生物宿主并将活状态的所述宿主应用于环境时,应该使用某些宿主微生物。微生物宿主选自已知占据一种或多种目的作物“植物圈”(叶面、叶圈、根际和/或根面)的微生物。选择这些微生物以使其能够在具体环境(作物或其他昆虫生境)中成功地与野生型微生物竞争,提供表达多肽杀虫剂的基因稳定的保持和表达,并理想地为杀虫剂提供针对环境降解和失活的提高的保护。
已知大量微生物栖息于多种重要作物的叶面(植物叶子的表面)和/或根际(植物根周围的土壤)中。这些微生物包括细菌、藻类和真菌。特别感兴趣的是微生物,例如细菌,如假单胞菌属、欧文氏菌属(Erwinia)、沙雷氏菌属(Serratia)、克雷伯氏菌属(Klebsiella)、黄单胞菌属(Xanthomonas)、链霉菌属(Streptomyces)、根瘤菌属(Rhizobium)、红假单胞菌属(Rhodopseudomonas)、Methylophilius属、农杆菌属(Agrobacterium)、醋酸菌属(Acetobacter)、乳酸菌属(Lactobacillus)、节杆菌属(Arthrobacter)、固氮菌属(Azotobacter)、白联球菌属(Leuconostoc)和产碱菌属(Alcaligenes);真菌特别是酵母如酵母菌(Saccharomyces)、隐球菌属(Cryptococcus)、克鲁维酵母菌属(Kluyveromyces)、掷孢酵母属(Sporobolomyces)、红酵母属(Rhodotorula)和短柄霉属(Aureobasidium)。特别感兴趣的是植物圈细菌物种,如丁香假单胞菌(Pseudomonas syringae)、荧光假单胞菌(Pseudomonas fluorescens)、粘质沙雷氏菌(Serratia marcescens)、木醋杆菌(Acetobacter xylinum)、根瘤农杆菌(Agrobacterium tumefaciens)、球形红假单胞菌(Rhodopseudomonas spheroides)、野油菜黄单胞菌(Xanthomonas campestris)、苜蓿根瘤菌(Rhizobium melioti)、真养产碱菌(Alcaligenes entrophus)和棕色固氮菌(Azotobacter vinlandii);植物圈酵母物种如深红酵母(Rhodotorula rubra)、胶红酵母(R.glutinis)、海滨红酵母(R.marina)、橙红酵母(R.aurantiaca)、白色隐球菌(Cryptococcus albidus)、液化隐球菌(C.diffluens)、劳伦梯氏隐球菌(C.lurentii)、Saccharomyces rosei、S.pretoriensis、酿酒酵母(S.cerevisiae)、掷孢酵母(Sporobolomyces roseus)、S.odorus、Kluyveromycesveronae和Aureobasidium pollulans。感兴趣的还有色素微生物。
插入基因以形成转基因宿主
本发明的一个方面是用表达本发明蛋白质的本发明多核苷酸转化/转染植物、植物细胞和其他宿主细胞。以这种方式转化的细胞可以获得对靶害虫攻击的抗性。
可以使用多种方法在允许稳定保持和表达基因的条件下将编码杀虫剂蛋白的基因引入靶宿主。这些方法为本领域技术人员所熟知,并描述于如美国专利No.5,135,867。
例如,大量克隆载体可用于将外来基因插入到高等植物中,所述载体含有大肠杆菌复制系统和允许选择转化的细胞的标记物。载体包括如pBR322、pUC系列、M13mp系列、pACYC184等。因此,可以在合适的限制性位点将编码毒素的序列插入载体。得到的质粒用于转化进大肠杆菌中。在合适的营养培养基上培养大肠杆菌细胞,然后收获并裂解。回收质粒。作为分析方法,一般进行序列分析、限制性分析、电泳和其他生物化学-分子生物学方法。每次操作之后,可以切开所使用的DNA序列并与下一个DNA序列连接。每个质粒序列都可以克隆到同一个或其他质粒中。取决于将所需基因插入到植物中的方法,其他DNA序列可能是必要的。例如,如果使用Ti或Ri质粒转化植物细胞,则必须连接Ti或Ri质粒T-DNA序列的至少右侧边界(常为右侧和左侧边界)作为待插入基因的侧翼区域。T-DNA用于转化植物细胞的用途已在EP 120 516、Hoekema(1985)在《The Binary Plant Vector System》,Offset-durkkerij Kanters B.V.,Alblasserdam,第5章中、Fraley等,Crit.Rev.Plant Sci.4:1-46和An等(1985)EMBO J 4:277-287中有深入的研究和详细的描述。
大量技术可用于将DNA插入植物宿主细胞。这些技术包括使用根瘤农杆菌或发根农杆菌(Agrobacterium rhizogenes)作为转化剂用T-DNA转化、融合、注射、生物射弹(微粒轰击)或电穿孔以及其他可能的方法。如果用农杆菌进行转化,必须将待插入的DNA克隆进特殊的质粒中,即中间载体或二元载体中。由于与T-DNA中序列同源的序列,中间载体可以通过同源重组整合进Ti或Ri质粒。Ti或Ri质粒还含有转移T-DNA所必需的vir区。中间载体不能在农杆菌中自我复制。可以通过辅助质粒(缀合)将中间载体转移到根瘤农杆菌中。二元载体在大肠杆菌和农杆菌中都可以自我复制。它们含有选择标记物基因和接头或多聚接头,两侧为右和左T-DNA边界区。它们可以直接转化进农杆菌中(Holsters等,[1978]Mol.Gen.Gent.163:181-187)。作为宿主的农杆菌含有携带vir区的质粒。vir区是将T-DNA转移进植物细胞所必需的。还可以含有额外的T-DNA。将这样转化的细菌用于转化植物细胞。可以有利地用根瘤农杆菌或发根农杆菌培养植物外植体,以将DNA转移进植物细胞。然后可以在合适的培养基中从感染的植物材料(如叶碎片、茎节段、根以及原生质体或悬浮培养的细胞)再生完整植物,所述培养基可以含有用于选择的抗生素或杀虫剂。然后可以测试这样得到的植物中插入DNA的存在。在注射和电穿孔的情况下质粒没有特殊的要求。可以使用普通质粒,如pUC衍生物。
转化细胞以通常的方式在植物中生长。它们可以形成生殖细胞并将转化的性状传递到子代植物。可以以正常方式培养这些植物并与具有同一转化遗传因子或其他遗传因子的植物杂交。得到的杂合个体具有相应的表型特性。
在本发明优选的一些实施方案中,从插入植物基因组的转录单位表达编码细菌毒素的基因。优选地,所述转录单位是重组载体,所述重组载体能够稳定整合进植物基因组,并使得可以选择表达编码蛋白质的mRNA的转化植物株系。
一但整合进基因组,则插入DNA在其中是相对稳定的(并不再释放出来)。它一般包含赋予转化植物细胞对杀虫剂或抗生素(如卡那霉素、G418、博来霉素、潮霉素或氯霉素等)的抗性的选择标记物。因此单独使用的标记物应允许从不含有插入DNA的细胞中选择转化细胞。目的基因在植物中优选由组成型或诱导型启动子表达。一但表达之后,mRNA翻译成蛋白质,从而将目的氨基酸掺入蛋白质中。在植物中表达的编码毒素的基因可以在组成型启动子、组织特异性启动子或诱导型启动子的控制下。
存在若干将外来重组载体引入植物细胞和获得稳定保持和表达引入基因的方法。这些技术包括将包裹在微粒上的遗传物质直接引入细胞(Cornell的U.S.专利No.4,945,050和DowElanco,现在的DowAgroSciences,LLC的U.S.专利No.5,141,131)。此外,可以使用农杆菌技术转化植物,参阅University of Toledo的U.S.专利No.5,177,010;TexasA&M的5,104,310;欧洲专利申请0131624B1;Schilperoot的欧洲专利申请120516、159418B1和176112;Schilperoot的U.S.专利No.5,149,645、5,469,976、5,464,763和4,940,838和4,693,976;Max Planck的欧洲专利申请116718、290799、320500;Japan Tobacco的欧洲专利申请604662和627752和U.S.专利No.5,591,616;Ciba Geigy,现在的Novartis的欧洲专利申请0267159和0292435和U.S.专利No.5,231,019;Calgene的U.S.专利No.5,463,174和4,762,785和Agracetus的U.S.专利No.5,004,863和5,159,135。其他转化技术包括颈须技术。参阅Zeneca的U.S.专利No.5,302,523和5,464,765。也可以使用电穿孔技术转化植物。参阅Boyce ThompsonInstitute的WO 87/06614;Dekalb的U.S.专利No.5,472,869和5,384,253和Plant Genetic Systems的WO 92/09696和WO 93/21335。另外,也可以用病毒载体产生表达目的蛋白质的转基因植物。例如,可以用MycogenPlant Science和Ciba-Giegy,现在的Novartis的U.S.专利No.5,569,597以及Biosource的U.S.专利No.5,589,367和5,316,931中所述的方法用病毒载体转化单子叶植物。
如前述,将DNA构建体引入植物宿主的方法对于本发明不是关键的。任何提供有效转化的方法都可以使用。例如,本文描述了多种植物细胞转化的方法,包括使用Ti或Ri质粒等进行农杆菌介导的转化。在许多情况下,需要用T-DNA边界与用于转化的载体的一侧或两侧(更具体地为右侧)相连。尽管T-DNA边界也可见用于其他转化模式,但在使用根瘤农杆菌或发根农杆菌作为转化模式时特别有用。在将农杆菌用于植物细胞转化时,可以使用能引入宿主以与宿主中存在的T-DNA或Ti或Ri质粒同源重组的质粒。可以通过电穿孔、三亲杂交和本领域技术人员已知的用于转化革兰氏阴性菌的其他技术进行载体导入。载体转化进农杆菌宿主的方式对于本发明不是关键的。含有用于重组的T-DNA的Ti或Ri质粒可以能够或不能引起根肿大成瘤(gall formation),只要所述宿主中存在vir基因,这对本发明就不是关键的。
将农杆菌用于转化时,在一些情况下将T-DNA边界内的表达载体插入广谱载体如pRK2或其衍生物中,如本文引入作为参考文献的Ditta等,PNAS USA(1980)77:7347-7351和EPO 0 120 515中所述。表达构建体和T-DNA内包括一个或多个如本文所述允许选择转化农杆菌和转化植物细胞的标记物。使用的具体标记物对于本发明并不重要,优选的标记物取决于所使用的宿主和构建体。
为了用农杆菌转化植物细胞,可以将外植体与转化农杆菌组合并孵育足够的时间以允许其转化。转化后,通过用适当抗生素的选择杀死农杆菌,并用适当的选择培养基培养植物细胞。一但形成愈伤组织,可以根据植物组织培养和植物再生领域已知的方法通过使用适当的植物激素促进芽形成。然而,愈伤组织中间期并不总是必要的。芽形成以后,可以将所述植物细胞转移到促进根形成的培养基,从而完成植物再生。可以培养植物产生种子,所述种子可以用于建立将来的世代。不考虑转化技术,优选将编码细菌毒素的基因整合进基因转移载体中,通过在载体中包含植物启动子调节元件以及3’非翻译转录终止区(如Nos等)使所述转移载体适于在植物细胞中表达该基因。
除了用于转化植物的大量技术之外,与外来基因接触的组织类型也可以是多样的。这些组织包括但不仅限于胚胎发生组织、I、II、和III型愈伤组织、下胚轴、分生组织、根组织、用于韧皮部表达的组织等。使用本文所述的适当技术几乎可以在去分化中转化几乎所有的植物组织。
如上文所述,需要时可以使用多种选择标记物。具体标记物的选择由技术人员决定,但是任何以下的选择标记物以及本文未列出的能发挥选择标记物功能的其他基因都可以使用。这些选择标记物包括但不仅限于编码对抗生素卡那霉素、新霉素和G418的抗性的转座子Tn5氨基糖苷磷酸转移酶基因(Aph II)以及编码对草甘磷、潮霉素、氨甲喋呤、草铵磷(双丙氨磷)、咪唑啉酮、磺酰脲和三唑嘧啶除草剂(如氯磺隆、溴草腈、茅草枯等)抗性或耐性的基因。
除了选择标记物以外,可能需要使用报道基因。在一些情况下,报道基因可以与或不与选择标记物一起使用。报道基因一般是在受体生物或组织中不存在的基因,一般编码引起一些表型变化或酶特性的蛋白质。K.Wising等.Ann.Rev.Genetics,22,421(1998)中提供了这些基因的实例。优选的报道基因包括大肠杆菌uidA基因座的β-葡糖醛酸糖苷酶(GUS)、来自大肠杆菌Tn9的氯霉素乙酰转移酶基因、来自生物发光水母维多利亚多管水母(Aequorea victoria)的绿色荧光蛋白和来自萤火虫Photinuspyralis的荧光素酶基因。接着可以在所述基因引入受体细胞后的适当时间实施检测报道基因表达的试验。优选的这些方法使用如Jefferson等(1987Biochem.Soc.Trans.15,17-19)所述的大肠杆菌uidA基因座编码β-葡糖醛酸糖苷酶(GUS)的基因鉴定转化细胞。
除了植物启动子调节元件以外,可以在植物细胞中有效地使用来自多种来源的启动子调节元件表达外源基因。例如,可以使用细菌来源的启动子调节元件,如章鱼碱合酶启动子、胭脂碱合酶启动子、甘露碱合酶启动子;病毒来源的启动子,如花椰菜花叶病毒(35S和19S)、35T(再改造的35S启动子,参阅U.S.专利No.6,166,302,特别是实施例7E)等。植物启动子调节元件包括但不仅限于核酮醣-1,6-二磷酸羧化酶(RUBP)小亚基(ssu)、β-伴大豆球蛋白(conglycinin)启动子、β-菜豆素启动子、ADH启动子、热休克启动子和组织特异性启动子。还可以存在其他元件,如基质附着区、支架附着区、内含子、增强子、多腺苷酸化序列等,从而提高转录效率或DNA整合。尽管这些元件可能通过影响转录、mRNA稳定性等提供较好的DNA表达或功能,但是它们对DNA功能可能是或不是必需的。这些元件可以如所需包含在DNA中,以得到转化DNA在植物中的优化表达。典型元件包括但不仅限于Adh-内含子1、Adh-内含子6、苜蓿花叶病毒外壳蛋白前导序列、玉米线条病毒外壳蛋白前导序列以及其他本领域技术人员可用的元件。还可以使用组成型启动子调节元件从而指导在所有细胞类型和所有时间的持续基因表达(如肌动蛋白、泛素、CaMV 35S等)。组织特异性启动子负责在特定细胞或组织类型(如叶或种子)中的基因表达(如玉米蛋白、油质蛋白、油菜籽蛋白、ACP、球蛋白等),这些也可以使用。
启动子调节元件也可以在植物发育的某些阶段有活性以及在植物组织和器官中有活性。这些的实例包括但不仅限于花粉特异性、胚胎特异性、玉米穗丝特异性、棉花纤维特异性、根特异性、种子胚乳特异性启动子调节元件等。在某些情况下可能需要使用诱导型启动子调节元件,其负责应答于特定信号(如物理刺激(热休克基因)、光(RUBP羧化酶)、激素(Em)、代谢物、化学品和胁迫)的基因表达。还可以使用在植物中发挥功能的其他所需的转录和翻译元件。本领域已知大量植物特异性基因转移载体。
可以使用标准分子生物学技术克隆和测序本文所述的毒素。其他信息可见于本文引入作为参考文献的Sambrook,J.,Fritsch,E.F.和Maniatis,T.(1989),《Molecular Cloning,A Laboratory Manual》,Cold SpringHarbor Press。
抗性治理
随着杀虫蛋白在转基因植物中日益增加的商业使用,一个需要考虑的因素是抗性治理。即,大量公司在其产品中使用苏云金芽孢杆菌毒素,并存在对于昆虫发展出B.t.毒素抗性的担心。昆虫抗性治理的一种策略是将致病杆菌、光杆状菌等产生的TC毒素与如B.t.晶体毒素(来自芽孢杆菌菌株的可溶性杀虫蛋白,参阅如WO 98/18932和WO 99/57282)或其他昆虫毒素组合。组合物可以配制成可喷雾剂应用或者可以是分子组合物。可以用产生两种或更多不同昆虫毒素的细菌基因转化植物(参阅如Gould,38Bioscience 26-33(1988)和U.S.专利No.5,500,365;类似的,欧洲专利申请0 400 246 A1和U.S.专利5,866,784、5,908,970和6,172,281也描述了用两种B.t.晶体蛋白转化植物)。产生含有多于一种昆虫抗性基因的转基因植物的另一种方法是首先产生两种植物,每种植物含有一种昆虫抗性基因。然后可以使用常规植物育种技术杂交这些植物以产生含有多于一种昆虫抗性基因的植物。因此,除非另有说明,本文中使用的“含有多核苷酸”这一说法显然指至少一种(并可以是相邻或不相邻的多种)多核苷酸。
制剂和其他递送系统
可以在土壤中应用配制的含有受试者隔离群的孢子和/或结晶的毒饵粒颗或含有可得自本文公开隔离群的基因的重组微生物。也可以在轮作周期的后段将配制的产物用作种子包衣或根处理或全植物处理。通过与多种惰性材料如无机矿物质(层状硅酸盐、碳酸盐、硫酸盐、磷酸盐等)或植物材料(粉状玉米芯、稻壳、胡桃壳等)混合作为可湿的散剂、颗粒剂或粉剂进行细胞的植物和土壤处理。制剂可以包括涂布器粘着佐剂、稳定剂、其他杀虫添加剂或表面活性剂。液体制剂可以是水基的或非水基的,并可以泡沫剂、凝胶剂、悬液剂、乳化浓缩液等进行应用。成分可以包括流变剂、表面活性剂、乳化剂、分散剂或多聚体。
本领域技术人员会理解,取决于具体制剂的性质(不管其是否为浓缩剂或直接使用),杀虫浓度可以广泛的变化。可以存在以重量计至少1%的杀虫剂,并可以为以重量计100%。干制剂含有以重量计从约1-95%的杀虫剂,而液体制剂一般在液相中含有以重量计约1-60%的固体。制剂一般含有从约102到104个细胞/mg。以每公顷约50mg(液体或干的)到1kg施用这些制剂。
可以通过喷雾、撒粉、洒水等将制剂应用于害虫的环境(如土壤和叶子)。
另一种递送方案是将毒素遗传物质整合入杆状病毒载体。杆状病毒感染特定的昆虫宿主,包括毒素需要靶向的宿主。可以将带有毒素表达载体的侵染性杆状病毒引入昆虫侵扰的地区,从而使感染的昆虫中毒或染毒。
已知昆虫病毒或杆状病毒可感染某些昆虫并产生不利影响。病毒对于昆虫的影响是慢性的,病毒并不立即终止昆虫进食。因此,病毒不被看成是最佳的害虫控制剂。然而,将毒素基因组合进杆状病毒载体可以提供传递毒素的有效方法。此外,由于不同的杆状病毒对不同的昆虫具有特异性,因此可以使用特定的毒素选择性地靶向具体造成损害的害虫。对于毒素基因特别有用的载体是核型多角体病毒。已经描述了用这种病毒转移载体,并且目前是将外来基因转移到昆虫的首选载体。可以以经口传递的形式构建病毒-毒素基因重组体。杆状病毒一般通过中肠肠粘膜感染昆虫受者。插入到病毒外壳蛋白强启动子后面的毒素基因将表达并迅速杀死感染昆虫。
除了本发明蛋白毒素的昆虫病毒或杆状病毒或转基因植物递送系统之外,还可以用苏云金芽孢杆菌包胶技术将蛋白质包入胶囊,例如但不仅限于本文引入为参考的U.S.专利No.4,695,455、4,695,642、4,861,595。本发明蛋白毒素的另一递送系统为将蛋白质配制到毒饵基质中,然后在昆虫投饵地点于地上或地下使用所述毒饵。这些技术的实例包括但不仅限于本文引入为参考的PCT专利申请WO 93/23998。
基于植物RNA病毒的系统也可用于表达细菌毒素。为此,可以将编码毒素的基因插入感染目的宿主植物的适当植物病毒的外壳启动子区。然后可以表达毒素从而为植物提供对昆虫损害的保护。基于植物RNA病毒的系统描述于Mycogen Plant Sciences,Inc.的U.S.专利No.5,500,360和Biosource Genetics Corp的U.S.专利No.5,316,931和5,589,367。
除了产生转基因植物以外,还有其他可能需要再改造细菌基因的递送系统。例如,可以通过将能作为食物来源吸引昆虫的分子与毒素融合来构建蛋白毒素。在实验室纯化之后,可以将这些带有“内置”毒饵的毒剂包装在标准昆虫诱捕盒内。
突变体
可以通过本领域熟知的方法产生本发明伯氏致病杆菌隔离群的突变体。例如,可以通过用甲基磺酸乙酯(EMS)诱变隔离群得到不产孢子的突变体。可以通过本领域熟知的程序使用紫外光和亚硝基胍产生突变体。
本文提到或引用的所有专利、专利申请、临时申请和出版物整体引入为参考,引用程度不与本说明书的明确教导相冲突。
以下为说明本发明操作实践的实施例。这些实施例不应理解为限制性的。除非另有说明,所有的百分比均以重量计,所有的溶剂混合物比例以体积计。
实施例1-概述
使用粘粒互补筛选实现基因的鉴定和分离,所述基因编码加强或协同昆虫活性蛋白光杆状菌TcdA和致病杆菌XptA2wi活性的因子。使用来自伯氏致病杆菌粘粒基因组文库的大肠杆菌单克隆(ILM104菌株)产生粗细胞提取物,将所述提取物与纯化毒素混合并进行生物测定。将裂解液与纯化的光杆状菌毒素TcaA一起针对南方玉米食根虫幼虫(Diabroticaundecimpunctata howardi)进行测试。同样,还将裂解液与纯化的致病杆菌XptA2wi蛋白混合并针对烟草蚜虫(Heliothis virescens)或玉米earworm(美洲棉铃虫)幼虫进行测试。如果裂解液加纯化毒素的组合比任一组分单独的活性更高,则将粘粒裂解液评分为阳性。
两次测试(96孔形式的)首轮筛选样品并与对照比较杀虫活性进行评分。再培养阳性样品并在第二轮筛选中进行测试。对首轮和第二轮筛选中鉴定为阳性的粘粒进行第三轮筛选。在第三轮筛选中使用较大的培养体积(见下文),在128孔形式的生物测定中测试生物活性。
亚克隆在这一筛选中鉴定为具有加强活性的粘粒之一的DNA。测定保留活性的单个亚克隆的DNA序列,其显示含有两个开放读码框,命名为xptB1xb和xptC1xb。将这些编码区亚克隆进pET质粒并在大肠杆菌中进行表达。当TcaA或XptA2wi中任一蛋白质与共表达XptB1xb和XptC1xb的裂解液混合时,可见昆虫活性的显著提高。在与纯化的TcdA或XptA2wi混合时,仅含有XptB1xb或仅含有XptC1xb的裂解液具有最低的效力。
实施例2-昆虫生物测定方法学
使用96孔微孔板(Becton Dickinson and Company,Franklin Lakes,NJ)或特别设计用于昆虫生物测定的128孔盘(C-D International,Pitman,NJ)中的人工饲料进行昆虫生物测定。在96孔微孔板中进行的生物测定使用两种鳞翅目物种的卵:玉米earworm(美洲棉铃虫(Boddie))和烟草蚜虫(Heliothis virescens(F.))。在128孔盘中进行的生物测定使用新生幼虫。在这一形式中测试的鳞翅目物种包括玉米earworm、烟草蚜虫和甜菜粘虫(Spodoptera exigua(Hubner))。在这一生物测定形式中还测试了一种鞘翅目物种:南方玉米食根虫(Diabrotica undecimpunctata howardii(Barber))。
这些生物测定中记录的数据包括处理昆虫总数、死亡昆虫数、生长障碍昆虫数和存活昆虫的重量。对于报道生长抑制的情况,按以下进行计算:
生长抑制%=[1-(处理中昆虫平均重量/载体中昆虫的平均重量-仅对照)]×100
实施例3-粘粒文库构建
在28℃下,将通过16S RNA序列测定(Midi Labs,Newark,DE)确定为代表伯氏致病杆菌的致病杆菌ILM104菌株在含有0.0025%溴麝香草酚蓝的2%蛋白胨#3(下文中称为PP3)琼脂中培养72小时。选择吸收溴麝香草酚蓝的单菌落并用其接种到含有175mL PP3的500mL三角瓶中。以150rpm摇动摇瓶并在28℃下孵育约24小时。以2400×g离心50mL这种培养物以沉淀细胞。去除上清液并将细胞沉淀冻于-20℃直至将其融化用于细胞总DNA分离。
用Genomic DNA纯化试剂盒(Qiagen Inc.,Valencia,CA)分离细胞总DNA。通过涡旋将冷冻的细菌细胞重悬于含有11μL Qiagen RNase A溶液(100mg/mL)的11mL缓冲液B1(50mM Tris/HCI,pH 8.0;50mMEDTA,pH 8.0;0.5%Tween 20;0.5%Triton X-100)中。将300μL溶菌酶储备溶液(100mg/mL;Sigma Chemical Co.,St.Louis,MO)和500μL蛋白酶K储备溶液(50mg/mL;Sigma Chemical Co.)加入悬液中。涡旋混合悬液并在37℃孵育30分钟。在细菌裂解液中加入4mL缓冲液B2(3M盐酸胍;20%Tween 20)并将管轻柔颠倒混合溶液。将细菌裂解液在50℃孵育30分钟,然后按照生产商的说明书(Qiagen Genomic DNAHandbook)用Qiagen Genomic-tip 500/G吸头分离细胞总DNA。将得到的纯化DNA溶于500μL TE缓冲液(10mM Tris/HCl pH 8.0;1mM EDTApH 8.0)中并储存于4℃。
使用基于Ausubel等(《Current Protocols in Molecular Biology》,JohnWiley and Sons,New York,NY)中3.1.3节的操作从细胞总DNA产生Sau3A I局部消化物。进行小规模反应(80μL反应体积中40μg细胞总DNA)以测定产生最高浓度的大小为25-50Kb(千碱基对)的部分消化DNA片段的酶与细胞总DNA的正确比例。将反应物在65℃加热15分钟以使Sau3A I酶失活,并在0.3%琼脂糖凝胶上电泳反应物的等分试样以测定所需大小范围内的部分消化DNA片段的相对丰度。一但观察到酶与细胞总DNA的最佳比例,将反应体积扩大以得到足量的可用作粘粒文库构建中插入片段DNA的部分消化的细胞总DNA。典型的扩大反应包括在800μL总体积的1×React 4缓冲液(由Gibco BRL以10×提供)中将400μg伯氏致病杆菌细胞总DNA与9单位Sau3A I(Gibco BRL,Gaithersburg,MD)在37℃孵育15分钟。将反应物在65℃加热20分钟使酶失活。在粘粒文库构建过程中,为了使插入片段DNA与其他插入片段DNA之间的连接最小化,通过在1.2mL总体积的1×SAP缓冲液(生产商以10×提供)中与20单位的虾碱性磷酸酶(Boehringer Mannheim,Mannheim,德国)在37℃孵育2小时使部分消化的致病杆菌细胞总DNA去磷酸化。将去磷酸化的插入片段DNA与等体积用缓冲液平衡的酚-氯仿溶液(50∶50,体积/体积)混合并轻柔颠倒混合。以14000×g离心15分钟后,取出液相并轻柔颠倒与等体积的氯仿-异戊醇溶液(24∶1,体积/体积)混合。通过在14000×g离心15分钟再次使相分离。将液相移到新管中并加入0.1体积的3M乙酸钠(pH 5.2)。加入两倍体积用冰预冷的100%乙醇,轻柔颠倒混合溶液并置于-70℃过夜。以14000×g离心20分钟沉淀析出的DNA,将DNA沉淀重悬于50μL双蒸水并储存于-20℃。
使用如生产商所推荐制备的SuperCos 1载体(Stratagene,La Jolla,CA)构建粘粒文库。使用3∶1比例的部分消化的插入片段和载体在1×T4DNA连接酶缓冲液(生产商以10×提供)中与20单位的T4DNA连接酶(New England BioLabs Inc.,Beverly,MA)在16℃孵育过夜,将插入DNA连接进SuperCos I DNA的BamH I位点。用Gigapack Gold III GoldPackaging Extract(Stratagene)包装连接混合物并如生产商所推荐用大肠杆菌XLl-Blue MR菌株细胞滴定重组噬菌体。将重组噬菌体和宿主细胞培养物的等分试样(20-40μL)涂到含有氨苄青霉素(100mg/L;SigmaChemical Co.)的LB琼脂(10g/L细菌用蛋白胨、10g/L NaCl、5g/L细菌用酵母提取物、15g/L细菌用琼脂;Difco Laboratories)上,并在37℃孵育过夜。为构建用于长期保存的粘粒文库主平皿,用无菌的木制牙签挑取单菌落并接种到含有添加了100ml/mL氨苄青霉素或50mg/mL卡那霉素(Sigma Chemical Co.)的100-1000μL Terrific Broth(TB培养基:12g/L细菌用蛋白胨、24g/L细菌用酵母提取物、0.4%(体积/体积)甘油、17mM KH2PO4、72mM K2HPO4)的无菌96孔板的单个孔中,并在37℃静置孵育过夜。为从主平皿产生复制平皿,用96孔微孔板复制器(V &P Scientific,Inc.,San Diego,CA)接种含有添加了100mg/L氨苄青霉素的100-1000μL LB培养液的无菌96孔微孔板。将复制平皿在37℃静置孵育过夜。对于主平皿和复制平皿,均在平皿中加入等体积(100-1000μL)过滤灭菌的TB:甘油或LB:甘油,并用多通道移液器将培养物和甘油溶液混匀。用Biomek Seal and Sample铝箔盖(Beckman Instruments,Inc.,Fullerton,CA)封口并置于-70℃保存。
通过用NucleoSpin Nucleic Acid Purification Kit(CLONTECHLaboratories,Inc.,Palo Alto,CA)分离粘粒DNA并用20单位的限制性内切酶EcoR I(New England BioLabs)在37℃消化1小时回收的DNA来评估选定的重组粘粒的平均插入片段大小。将限制性DNA在1.0%琼脂糖凝胶上电泳。用0.5%溴化乙啶(New England BioLabs)染色后用紫外光使DNA片段显影,通过与1Kb梯度DNA比较来估计片段的相对大小。所构建粘粒文库的平均插入片段大小范围在30Kb-45Kb。
实施例4-互补筛选:培养物生长条件
对于首轮和第二轮互补筛选,在深96孔板中含有100μg/mL氨苄青霉素的2mL TB培养基中将粘粒文库的大肠杆菌单菌落在28℃下培养48小时。对于第三轮互补筛选,在28℃下将含有粘粒的大肠杆菌在100mL含有50μg/mL卡那霉素、100mM葡萄糖的TB培养基中以200-250rpm振荡培养24-48小时。
实施例5-互补筛选:裂解液制备
对于首轮和第二轮筛选,含有文库细胞的两份重复的2mL深孔板在Eppendorf 5810R离心机中以4000rpm(2250xg)离心5分钟。重悬两份重复的沉淀并组合进共计250μL LB中。将上清液加入含有3-4mm 0.1mm直径玻璃珠的1.2mL Costar管(Fisher Scientific)中。然后将管在KlecoTM 4-94 Pulverizer玻珠研磨机(Garcia Manufacturing,Visalia,CA)中以最高速摇动3分钟。在Eppendorf 5810R离心机中以2500rpm离心样品3分钟,并在新的96孔板中加入200μL得到的上清液。在昆虫生物测定前,在该大肠杆菌细胞裂解物中加入50μL适当纯化的毒素(TcdA或XptA2wi)或10mm磷酸缓冲液(作为阴性对照)。
通过在50mL锥形管中以3000xg离心从100mL培养物中制备第三轮筛选的裂解液。将沉淀重悬于LB培养基中至约40 OD600单位/mL(Shimadzu UV160U分光光度计(Kyoto,JP))。然后将细胞分装入含有3-4mm 0.1mm直径玻璃珠的96孔1.2mL Costar管中,在KlecoTM 4-94Pulverizer玻珠研磨机中以最高速摇动3分钟,然后在Eppendorf 5810R离心机中以2500rpm离心3分钟。将每一样品的上清液倒入一个管中并加入纯化的毒素。在昆虫生物测定前,加入TcdA(终浓度50ng/cm2)或XptA2wi(终浓度250ng/cm2)或10mm磷酸缓冲液。
实施例6-互补筛选:活性粘粒片段的亚克隆
上述的活性筛选在鉴定产生提高TcdA和XptA2wi活性的提取物的粘粒时是有效的。选择一种粘粒(命名为5H4)用于进一步研究。按照生产商的说明书使用Promega(Madison,WI)的Wizard Plus Midipreps DNAPurification System从含有5H4粘粒的细胞中分离DNA。按照生产商的说明书使用Roche Applied Science(Indianapolis,IN)的限制性内切酶表征DNA。使用标准分子生物学技术在0.7%琼脂糖凝胶上电泳消化产物并用溴化乙锭显影。使用标准分子生物学技术(Sambrook和Russell,《Molecular Cloning:A Laboratory Manual》,第三版,Cold SpringHarbor Laboratory Press,2001),通过连接进载体pBCKS+(Stratagene)的BamH I位点从Xenorhabdus ILM104粘粒5H4中亚克隆约12Kb的BgIII片段。按照供应商的说明书将连接物转化进大肠杆菌DH5α亚克隆高效细胞(Stratagene)中。得到的质粒称为pDAB6026。
质粒pDAB6026显示编码与TcdA和XptA2wi的昆虫毒活性协同的活性。将含有质粒pDAB6026或pBCKS+载体对照的大肠杆菌细胞接种进200mL含有氯霉素(50μg/mL)和75μM IPTG(异丙基-β-D-硫代半乳糖苷)的LB中并在28℃以180rpm振荡培养两天。然后将细胞在3500×g离心10分钟。将沉淀重悬于5mL Butterfield′s磷酸盐溶液(Fisher Scientific)中并转移至含有1.5mL 0.1mm直径玻璃珠(Biospec,Bartlesville,OK,目录号1107901)的50mL锥形管中。在冰上冷却细胞-玻璃珠混合物,然后用Branson Sonifier 250(Danbury CT)用2mm探针以~20输出通过两次45秒脉冲以超声处理法裂解细胞,脉冲间冷却完全。将上清液转移至2mL微量离心管中并在16000×g离心5分钟。然后将上清液转移至15mL管中并测定蛋白质浓度。用H2O以1∶5稀释Bio-Rad Protein Dye AssayReagent并向10μL每一样品的1∶10稀释液中加入1mL,并以5、10、15、20和25μg/ml的浓度加入牛血清白蛋白(BSA)中。然后将样品在分光光度计上读数,在Shimadzu UV160U分光光度计(Kyoto,JP)上测量波长595处的吸光度。然后根据BSA标准曲线计算每一样品中含有的蛋白质量并用磷酸盐缓冲液调整至3-6mg/mL。在昆虫喂养生物测定中的测试之前,向500μL大肠杆菌裂解液中加入600纳克XptA2wi毒素蛋白。pDAB6026和XptA2wi的组合显示具有有效活性(表IV)。
表IV.2个鳞翅目物种对单独的pDAB6026裂解液和与纯化的XptA2wi蛋白一起的应答。 | |||||||||
处理 | 烟草蚜虫 | 玉米earworm | |||||||
死亡 | 发育障碍 | 总计 | 重量 | 死亡 | 发育障碍 | 总计 | 重量 | ||
1 | pBC | 0 | 0 | 8 | 674 | 0 | 0 | 8 | 352 |
2 | pBC+XptA2wi | 0 | 0 | 8 | 538 | 0 | 0 | 8 | 423 |
3 | pDAB6026 | 0 | 0 | 8 | 539 | 0 | 0 | 8 | 519 |
4 | pDAB6026+XptA2wi | 0 | 8 | 8 | 18 | 8 | --- | 8 | --- |
实施例7-xptB1xb和xptC1xb基因的发现、设计和测试
将质粒pDAB6026的DNA送至Seq Wright DNA Sequencing(Houston,TX)测定DNA序列。发现两个很大的完整开放读码框(ORF)。第一个(作为SEQ ID NO:1公开)与属于“B”类的已知毒素复合物基因具有显著的相似性。因此将该ORF称为xptB1xb,其编码SEQ ID NO:2公开的蛋白质。第二个ORF(SEQ ID NO:3)编码与毒素复合物“C”蛋白具有同源性的蛋白质(SEQ ID NO:4),因此称为xptC1xb。还发现了局部开放读码框(SEQ ID NO:5),其与“A”类毒素复合物基因具有显著同源性。该局部ORF编码SEQ ID NO:6的蛋白质序列。
(使用聚合酶链式反应;PCR)通过在编码区5’和3’添加限制性位点并提供核糖体结合序列和最佳翻译终止信号来构建两个完整的基因xptB1xb和xptC1xb以得到高水平重组表达。另外,向编码区的5’端引入沉默突变(氨基酸序列无变化)以减少可能的mRNA二级结构,从而提高翻译。策略是扩增/改造基因的5’和3’端的片段,使用“剪接重叠延伸”反应加入末端片段,然后通过限制性位点加入开放读码框的非扩增中心部分。该方法使DNA序列中可能由PCR引入的变化最小化。将设计的编码区以分开的编码区(SEQ ID NO:7和SEQ ID NO:8)或双顺反子操纵子(SEQID NO:9)克隆进pET表达质粒(Novagen,Madison,WI)。表达质粒的名称示于表V。
表V.含有克隆进pET载体的不同编码区的表达质粒 | |
质粒名称pDAB6031pDAB6032pDAB6033 | 改造来用于表达的编码区如SEQ ID NO:7的xptB1xb如SEQ ID NO:8的xptC1xb如SEQ ID NO:9的xptB1xb+xptC1xb |
用pET(对照)载体或质粒pDAB6031、pDAB6032或pDAB6033的DNA转化新制大肠杆菌T7表达菌株BL21(DE3)StarTM(Stratagene,LaJolla,CA)感受态细胞,并接种进250mL含有50μg/mL氯霉素和75μMIPTG的LB中。在28℃ 180rpm振荡培养24小时后,将细胞在5500×g离心10分钟。将沉淀重悬于5mL磷酸盐溶液中并转移至含有1.5mL0.1mm直径玻璃珠的50mL锥形管中,然后在“恒定”和如上文所述30设置下两次以45秒脉冲超声处理。样品在3000×g离心15分钟,并将上清液转移至2mL微量离心管中,在14000rpm离心5分钟,然后将上清液转移至15mL管中。如上文所述测定蛋白质浓度并用磷酸缓冲液将裂解液调整至5mg/mL。每一裂解液三个样品一组进行昆虫生物测定。对于第一个样品,加入磷酸缓冲液代替纯化毒素;对于第二个样品,加入足够的TcdA蛋白质以在昆虫生物测定孔中提供50ng/cm2的剂量,而对于第三个样品,加入足量的XptA2wi以在昆虫生物测定孔中提供250ng/cm2的剂量。
生物测定的结果示于表VI。未补充低水平添加的TcdA或XptA2wi蛋白质的对照样品(例如来自载体、pDAB6031、pDAB6032和pDAB6033的样品)对昆虫几乎没有影响。同样,含有低水平TcdA或XptA2wi与pDAB6031或pDAB6032裂解液的样品具有最小影响。相反,在包含低水平TcdA或XptA2wi以及pDAB6033裂解液的样品观测到显著活性。
表VI.鞘翅目和鳞翅目物种对大肠杆菌裂解液和纯化蛋白质的应答。应答用百分比死亡率/生长抑制百分比来表示。 | |||||
样品 | 昆虫物种 | ||||
测试的裂解液 | 南方玉米食根虫 | 玉米earworm | 烟草蚜虫 | 甜菜粘虫 | |
载体 | 0/0 | 8/0 | 0/0 | 31/0 | |
pDAB6031 | XptB1xb | 0/0 | 0/0 | 0/0 | 31/33 |
pDAB6032 | XptC1xb | 0/0 | 4/11 | 0/2 | 13/15 |
pDAB6033 | XptB1xb+XptC1xb | 0/0 | 0/0 | 0/6 | 13/38 |
载体+TcdA | 4/0 | 4/3 | 0/6 | 25/22 | |
pDAB6031+TcdA | XptB1xb+TcdA | 0/0 | 0/0 | 0/5 | 13/34 |
pDAB6032+TcdA | XptC1xb+TcdA | 0/0 | 0/2 | 0/14 | 6/25 |
pDAB6033+TcdA | XptB1xb+XptC1xb+TcdA | 25/68 | 4/14 | 4/0 | 31/48 |
载体 | 0/0 | 0/79 | 0/9 | 31/0 | |
pDAB6031+XptA2wi | XptB1xb+XptA2wi | 0/0 | 4/75 | 8/22 | 25/36 |
pDAB6032+XptA2wi | XptC1xb+XptA2wi | 0/0 | 0/71 | 0/22 | 6/14 |
pDAB6033+XptA2wi | XptB1xb+XptC1xb+XptA2wi | 0/0 | 83/100 | 29/98 | 81/100 |
实施例8-鉴定、纯化和表征伯氏致病杆菌ILM104菌株的XptB1xb和XptC1xb蛋白
本实施例涉及含pDAB6033的大肠杆菌裂解液的生物测定驱动分级,其引起通过MALDI-TOF鉴定两种共纯化蛋白(XptB1xb和XptC1xb)。含有这两种蛋白质的峰有效地赋予TcdA和XptA2wi活性。
活性级分基于其协同或赋予活性的能力进行鉴定,所述活性为针对南方玉米食根虫的TcdA或针对玉米earworm的XptA2wi的活性。以实施例2中描述的128孔形式进行所有的生物测定。
将三升来自含有pDAB6033的转基因大肠杆菌培养物的培养液离心(10,000×g 20分钟),并将细胞移至150mL含有0.1M DTT(二硫苏糖醇)和1M EDTA(乙二胺四乙酸)的25mM Tris-HCl,pH 7.8(T缓冲液)中。按照制造商的说明书加入通用蛋白酶抑制剂混合物(SigmaChemical目录号P2714)。通过对每一小份重悬细胞在最大功率下置于冰上超声处理30秒来裂解细胞。重复该过程三次,然后将破碎的细胞以48400×g在4℃下离心1小时。收集裂解液并加入硫酸氨至80%饱和。通过以48400×g离心15分钟收集沉淀的蛋白质,悬于T缓冲液并用2升T缓冲液透析过夜(换液一次)。通过离心(48400×g 10分钟)澄清悬浮的蛋白质,然后上样到Q Sepharose XL离子交换柱(1.6cm直径×10cm长)。用2倍柱床体积的T缓冲液洗涤柱,然后用含300mM NaCl的T缓冲液分批不连续梯度洗脱。混和用300mM NaCl洗脱的蛋白质并加入(NH4)2SO4至终浓度为1M,上样到苯琼脂糖疏水作用柱(1.0cm直径×10cm长)上。用2倍柱床体积含有1.25M(NH4)2SO4的25mM T缓冲液洗涤后,在10倍柱床体积中以T缓冲液中从625mM到0mM(NH4)2SO4的线性梯度和额外的4倍柱床体积T缓冲液等度洗脱来洗脱蛋白质。目的蛋白质在40-60mS/cm之间的导率洗脱。混和这些样品并用T缓冲液在4℃透析过夜。然后将透析过的蛋白质样品上样到用T缓冲液平衡的MonoQ离子交换柱(1.0cm直径×10cm长)。在15倍柱床体积中,以从0到400mM NaCl的线性梯度洗脱蛋白质。从22-24mS/cm导率下洗脱的蛋白质级分中检测到两个活性峰(峰1和峰2)。峰1和峰2加强活性的实例示于表VII。接着对峰1和峰2进行纯化和分析。
将峰浓缩至1.0mL并使用由50mM磷酸钠、100mM NaCl、0.05%吐温20和10%甘油组成的pH7.8的缓冲液以1.0cm/分钟上样到Superdex200(1.6cm直径×60cm长)大小排阻柱上。两种情况下,蛋白质都以对应于约300kDa分子量的单个主峰被洗脱。稀释蛋白质主峰,通过上样到用T缓冲液平衡的Mono Q(0.5cm直径×5cm长)离子交换柱并在15倍柱床体积中使用150-300mM的NaCl梯度洗脱来进一步纯化。
通过SDS-PAGE分析该离子交换纯化所得的级分。将蛋白质级分(20μL)加入5X浓缩的Laemmli缓冲液(Ausubel等,《Current Protocols inMolecular Biology》,John Wiley and Sons,Inc.,New York,NY第10节)(5μL)中,90℃加热3分钟、离心并将上清液上样到SDS电泳缓冲液中的4-20%聚丙烯酰胺Tris-甘氨酸凝胶上。用160V 90分钟将蛋白质分开,通过考马斯蓝染色显影,然后用含有5%水合甲醇加7%乙酸的溶液脱色。用Bio-Rad Fluoro-S Multi Imager成像并分析凝胶。来自峰1和峰2的凝胶都包含两条主要条带,一条迁移到~170kDa而另一条迁移到~80kDa。来自峰1的凝胶包含以约18、33和50kDa迁移的三个另外的蛋白质。回顾分析显示~170kDa和~80kDa条带在纯化的最初阶段含量丰富并在各步逐渐富集。
使用MALDI-TOF分析确定2个主要条带的身份。从pDAB6033裂解液高度富集级分的SDS凝胶上切出~170kDa和~80kDa的条带,放入经硅化处理的Eppendorf微量离心管中并用含50%乙氰的12.5mMNH4HCO3脱色。在Speed-Vac(Savant Instruments,Holbrook,NY)中干燥样品并在37℃用测序级胰蛋白酶(Roche Diagnostics,Indianapolis,IN)消化过夜(约16小时)。短暂离心沉淀凝胶碎片后,将含有肽的上清液转移至新管中并在Speed-Vac中干燥。然后将肽悬浮于6μL 0.1%三氟乙酸(TFA)中,用C18ZipTip树脂(Millipore,Bedford,MA)吸收并用75%乙氰/0.1%TFA洗脱。如下文所述分析洗脱液。
在Voyager DE-STR MALDI-TOF质谱仪(PerSeptive Biosystems,Framingham,MA)上使用MALDI-TOF质谱法分析提取的肽以产生肽质量指纹(PMF)。以0.5μL样品与0.5μL介质按1∶1的比例在平板上混和使用干燥微滴点样技术(晾干)将上文得到的样品点样在MALDI不锈钢平板上。基质为α-氰基-4-羟基肉桂酸在含有0.1%TFA的50%乙氰中的饱和溶液。使用血管紧张素I、促肾上腺皮质激素(ACTH,片段1-17、18-39、7-38)溶液进行外部校准。使用m/z 2163.05的自溶胰蛋白酶峰进行内部校准。所有的质谱以具有延迟提取的阳离子反射器模式收集。设备使用337nm氮激光(用于解吸/离子化事件)和3.0米反射器飞行时间管。将获得的谱去同位素并产生PMF表用于数据库搜索。使用基于Web的搜索引擎Mascot(MatrixSciences,UK)进行数据库搜索。质量限制(mass tolerance)设定为0.15Da并在搜索中选择无修饰。分析提取自~170kDa的样品确认为XptB1xb。分析取自~80kDa的样品确认了XptC1xb。尽管根据基因序列(SEQ ID NO:3)计算的XptC1xb蛋白预计分子量为108kDa,但是提取的蛋白质在SDS/PAGE中的电泳速度预计明显更快。代表完整肽序列的肽片段的存在说明提取的蛋白质是全长的。
表VII:来自pDAB6033的纯化峰1和峰2的生物活性。 | ||||
玉米earworm | 南方玉米食根虫 | |||
样品 | 死亡 | 发育障碍 | 死亡 | 发育障碍 |
峰1 | ||||
0 | 0 | 0 | 0 | 0 |
125 | 2 | 6 | 4 | 2 |
峰2 | ||||
0 | 1 | 0 | 0 | 0 |
125 | 0 | 8 | 5 | 3 |
样品列的值代表应用于饲料的峰1或峰2XptB1xb/XptC1xb的浓度(ng/cm2)。对于针对玉米earworm的生物测定,生物测定中包括了250ng/cm2的XptA1xi。对于针对南方玉米食根虫的生物测定,生物测定中包括了100ng/cm2的TcdA。每种样品使用总计八只幼虫。
实施例9-完整测序来自伯氏致病杆菌ILM104菌株的新“A”类毒素复合物基因
在实施例6中,鉴定5H4粘粒编码提高A类蛋白TcdA和XptA2活性的蛋白质。昆虫生物测定显示该粘粒的亚克隆(质粒pDAB6026)编码协同活性。pDAB6026的DNA序列分析鉴定了三个开放读码框。第一个(以SEQ ID NO:1公开)与属于“B”类的已知毒素复合物基因具有相似性。该ORF因此称为xptB1xb,编码如SEQ ID NO:2公开的XptB1xb蛋白。第二个ORF(SEQ ID NO:3)编码与毒素复合物“C”蛋白具有同源性的蛋白质(XptC1xb,SEQ ID NO:4),因此称为xptC1xb。显示这两个读码框负责TcdA和XptA2协同或增强的活性(实施例7和实施例8)。还发现与“A”类毒素复合物基因具有显著同源性的局部开放读码框(SEQID NO:5)。该局部ORF编码SEQ ID NO:6公开的蛋白质序列。
通过分析5H4粘粒的完整DNA序列确定该新A类基因的全长DNA序列和推定的编码蛋白质序列。如实施例6所述制备粘粒DNA并送至LarkTechnologies(Houston,TX)进行全DNA序列测定。测定了新A类基因的DNA编码序列(称为xptA1xb)并以SEQ ID NO:10公开。该读码框(xptA1xb)编码的蛋白质以SEQ ID NO:11公开。
序列表
<110>美国陶氏益农公司
<120>来自伯氏致病杆菌的毒素复合物蛋白和基因
<130>DAS-114XC1
<150>US 60/534,893
<151>2004-01-07
<160>11
<170>PatentIn版本3.2
<210>1
<211>4521
<212>DNA
<213>伯氏致病杆菌
<400>1
atgaaacaag attcacagga catgacagta acacagctgt ccctgcccaa agggggcggt 60
gcgatcagtg gcatgggtga cactatcagc aatgcagggc cggatgggat ggcttcgctt 120
tccgtgcctt tgcctatctc tgccggtcgg gggggcgcac cgaatttatc cctgaactac 180
agtagcggag caggaaacgg gtcatttggt attggctggc aatccagtac catggctatc 240
agccgtcgta ctcaacatgg cgtaccgcaa tatcacggcg aagatacttt tttatgtccg 300
atgggagaag tgatggcggt tgccgtcaat cagagcgggc aacccgatgt gcgtaaaacc 360
gataaactat taggcgggca actgcctgtt acttataccg ttacgcgtca tcagcccaga 420
aatattcagc acttcagcaa acttgaatac tggcagcccc caacggatgt ggaaaccacg 480
cctttttggt taatgtattc acccgatgga caaattcaca ttttcggaaa aactgagcag 540
gctcagatcg ctaacccggc agaggtttca cagattgccc aatggctttt ggaagaaacc 600
gtaacaccag cgggagaaca catttattac cagtatcggg cagaagacga tatcggttgt 660
gatgacagcg aaaaaaatgc ccaccctaat gccagtgctc aacgttattt gactcaggtg 720
aactacggca atattacacc tgaatccagc ctgcttgtgc tgaagaatac gccaccggcg 780
gataacgaat ggctattcca tttggttttt gattatggtg aacgagcgca ggaaataaac 840
acggttcctc ctttcaaagc accttcaaac aactggaaaa tacggccaga ccgtttctcc 900
cgctttgaat atggttttga ggtgcgaacc cgccgcctgt gtcaacaaat tctgatgttc 960
catcgcctga aatcccttgc aggagaacag attgacggag aagaaatccc tgccttggtt 1020
gcccgtctgc ttctcagtta tgacctgaac gacagcgtga caacccttac cgccattcgg 1080
caaatggcgt atgaaactga cgcaacctta atcgctttac cgccactgga gtttgactat 1140
cagccctttg aggcaaaagt cacgcagaaa tggcaggaaa tgcctcaatt ggccggattg 1200
aatgcccaac aaccttacca actcgtcgat ctctatggtg aaggtatctc cggcatcttg 1260
tatcaggaca gacccggagc atggtggtat caggcaccga tccgtcagaa aaacgttgaa 1320
gatattaacg ctgtcaccta tagcccaata aaccccttac ctaagatccc cagccagcag 1380
gacagagcaa cgttgatgga tatcgacggt gatggacatc tggattgggt gatcgctggc 1440
gcaggtattc aggggcggta cagtatgcag ccgaatggag agtggacaca ctttattccc 1500
atttctgcac tgccaacaga atattttcat ccacaggcac aactggcgga tctggtgggg 1560
gccgggttat ctgatttagc gctgattggc cccagaagtg tgcgtttata tgccaacgac 1620
cgaggaaact ggaaagcggg tattaatgtt atgccacctg atggtgtgaa tttgccgata 1680
tttggtggtg atgccagcag tctggtcgca ttttctgaca tgttgggatc gggacagcag 1740
catttggtgg aaattgccgc tcagagcgtc aaatgctggc cgaatctagg acatggccgt 1800
tttggtgcgg ctattttgct gccggggttt agccagccga atggaacatt caatgctaac 1860
caagtttttc tggcagatat cgatggttcc ggcaccgccg acatcatcta tgcacacagt 1920
acgtatctgg atatttacct gaacgaaagc ggcaaccgtt tcagtgcacc cgttcggctt 1980
aatttgccgg aaggggtgat gtttgacaat acctgtcagt tacaggtgtc ggatattcaa 2040
ggattgggcg ctgccagcat tgtactgacc gtacctcata tgacaccgcg ccattggcgt 2100
tatgatttta ctcacaataa accttggctg ctcaatgtca tcaacaacaa tcgtggcgca 2160
gaaaccacgt tgttttaccg tagttctgcc caattctggc tggatgaaaa aagtcagatc 2220
gaagagctgg gaaaatttgc agcgagttat ctgcctttcc ccatacattt gttgtggcgc 2280
aatgaggcgc tggatgaaat tactggtaat cgactgacta aggtcatgaa ttatgcccac 2340
ggtgcatggg atggcagaga gagagaattt tgcggatttg gccgtgtaac gcaaattgat 2400
accgacgaat ttgccaaggg aaccacagag aaagcgccgg atgaaaatat ctatccttcc 2460
cgtagcataa gctggtttgc cacgggttta ccagaagtgg attctcaact tccggcagaa 2520
tactggcgtg gtgacgatca ggcatttgcc ggctttacac cgcgcttcac tcgttatgaa 2580
aaaggtaatg cggggcaaga ggggcaggat accccgatta aagaaccgac cgaaacagaa 2640
gcgtattggc ttaaccgcgc catgaaaggc caattactgc gcagtgaagt ctatggtgac 2700
gacaaaacag aaaaagctaa aattccgtac accgtcacag aagctcgctg tcaggtcaga 2760
ttaattccca gcaatgacga agccgcgccg tcgtcttgga cgtcgatcat tgaaaaccgc 2820
agttatcact atgagcgtat cgtcgtcgat ccgagttgca aacaacaggt cgtgctcaag 2880
gcggatgaat atggcttccc actggcaaaa gtagatatcg cctatccacg gcgcaataaa 2940
ccggcacaga acccttatcc ggattcgtta ccggatactc tgttcgccga tagctatgac 3000
gaccagcaaa aacagttata tctgacaaaa cagcagcaga gctattacca cctgacccag 3060
caggatgatt gggttctggg tttgacggat agccgataca gcgaagttta tcattatgcg 3120
caaactgacg ctcaaagtga catccccaag gcagggctga tattggaaga cctgctgaaa 3180
gttgacggcc tgataggtaa agacaagact tttatctatt tagggcagca gcgagtggct 3240
tatgtgggag gagatgcaga aaaaccgaca cgtcaggtgc gggtggctta tacagaaacc 3300
gctgcttttg atgacaatgc gctgcacgcc tttgatggcg tgattgcccc tgatgaactg 3360
acgcaacagt tgctggcggg tggatacctg ctcgtgccgc agatttctga tgtggcaggc 3420
agtagtgaaa aggtatgggt agctcggcag ggatacaccg aatacggcag tgctgctcaa 3480
ttctaccggc cactcatcca gcgcaaaagc ttgctgaccg gaaaatatac ccttagttgg 3540
gatacgcact attgtgtggt ggtaaaaacc gaagatggtg cgggaatgac cacgcaagcg 3600
aagtacgatt accgcttcct gcttccggcg caattgacag atatcaatga caaccagcac 3660
atcgtgacat ttaatgcatt ggggcaggtg acttccagcc gtttctgggg cacagaaaat 3720
ggcaaaataa gcggttactc gacgccggag agtaaaccgt tcacagtacc cgataccgtc 3780
gaaaaagccc ttgccttgca accgacgatc ccggtttcac agtgcaacat ttatgtgccg 3840
gatagttgga tgcggcttct gccccaacag tctctgactg gccagctaaa agagggggaa 3900
actttgtgga acgcattaca ccgggcgggt gtagtaacgg aagacggttt gatctgtgaa 3960
ctggcctatc gtcgttggat caaacgtcag gcaacgtctt caatgatggc cgtgacatta 4020
cagcaaatct tggctcagac tccacgacaa cctccgcatg ccatgacgat cacgacagat 4080
cgttatgaca gcgattctca gcagcaactt cggcagtcga tagtattgag tgatggtttt 4140
ggtcgcgtat tgcaaagcgc ccagcgtcat gaagcaggag aggcatggca gcgtgcagaa 4200
gatggttctt tggttgtcga taataccggt aaacccgttg ttgctaatac cacaacgcgc 4260
tgggcagtat ccggtcgcac agaatacgac ggcaaagggc aggcgatcag agcttacctg 4320
ccttattatc tcaatgattg gcgctatgtc agtgatgaca gcgcccggga tgacctgtac 4380
gccgataccc atttttacga tcctctgggg cgtgaatatc aggtaaaaac cgcgaaagga 4440
ttttggcgtg aaaacatgtt tatgccgtgg tttgtcgtca atgaagatga aaatgacaca 4500
gcagcacgtt taacatctta a 4521
<210>2
<211>1506
<212>PRT
<213>伯氏致病杆菌
<400>2
Met Lys Gln Asp Ser Gln Asp Met Thr Val Thr Gln Leu Set Leu Pro
1 5 10 15
Lys Gly Gly Gly Ala Ile Ser Gly Met Gly Asp Thr Ile Ser Asn Ala
20 25 30
Gly Pro Asp Gly Met Ala Ser Leu Ser Val Pro Leu Pro Ile Ser Ala
35 40 45
Gly Arg Gly Gly Ala Pro Asn Leu Ser Leu Asn Tyr Ser Ser Gly Ala
50 55 60
Gly Asn Gly Ser Phc Gly Ile Gly Trp Gln Ser Ser Thr Met Ala Ile
65 70 75 80
Ser Arg Arg Thr Gln His Gly Val Pro Gln Tyr His Gly Glu Asp Thr
85 90 95
Phe Leu Cys Pro Met Gly Glu Val Met Ala Val Ala Val Asn Gln Ser
100 105 110
Gly Gln Pro Asp Val Arg Lys Thr Asp Lys Leu Leu Gly Gly Gln Leu
115 120 125
Pro Val Thr Tyr Thr Val Thr Arg His Gln Pro Arg Asn Ile Gln His
130 135 140
Phe Ser Lys Leu Glu Tyr Trp Gln Pro Pro Thr Asp Val Glu Thr Thr
145 150 155 160
Pro Phe Trp Leu Met Tyr Ser Pro Asp Gly Gln Ile His Ile Phe Gly
165 170 175
Lys Thr Glu Gln Ala Gln Ile Ala Asn Pro Ala Glu Val Ser Gln Ile
180 185 190
Ala Gln Trp Leu Leu Glu Glu Thr Val Thr Pro Ala Gly Glu His Ile
195 200 205
Tyr Tyr Gln Tyr Arg Ala Glu Asp Asp Ile Gly Cys Asp Asp Ser Glu
210 215 220
Lys Asn Ala His Pro Asn Ala Ser Ala Gln Arg Tyr Leu Thr Gln Val
225 230 235 240
Asn Tyr Gly Asn Ile Thr Pro Glu Ser Ser Leu Leu Val Leu Lys Asn
245 250 255
Thr Pro Pro Ala Asp Asn Glu Trp Leu Phe His Leu Val Phe Asp Tyr
260 265 270
Gly Glu Arg Ala Gln Glu Ile Asn Thr Val Pro Pro Phe Lys Ala Pro
275 280 285
Ser Asn Asn Trp Lys Ile Arg Pro Asp Arg Phe Ser Arg Phe Glu Tyr
290 295 300
Gly Phe Glu Val Arg Thr Arg Arg Leu Cys Gln Gln Ile Leu Met Phe
305 310 315 320
His Arg Leu Lys Ser Leu Ala Gly Glu Gln Ile Asp Gly Glu Glu Ile
325 330 335
Pro Ala Leu Val Ala Arg Leu Leu Leu Ser Tyr Asp Leu Asn Asp Ser
340 345 350
Val Thr Thr Leu Thr Ala Ile Arg Gln Met Ala Tyr Glu Thr Asp Ala
355 360 365
Thr Leu Ile Ala Leu Pro Pro Leu Glu Phe Asp Tyr Gln Pro Phe Glu
370 375 380
Ala Lys Val Thr Gln Lys Trp Gln Glu Met Pro Gln Leu Ala Gly Leu
385 390 395 400
Asn Ala Gln Gln Pro Tyr Gln Leu Val Asp Leu Tyr Gly Glu Gly Ile
405 410 415
Ser Gly Ile Leu Tyr Gln Asp Arg Pro Gly Ala Trp Trp Tyr Gln Ala
420 425 430
Pro Ile Arg Gln Lys Asn Val Glu Asp Ile Asn Ala Val Thr Tyr Ser
435 440 445
Pro Ile Asn Pro Leu Pro Lys Ile Pro Ser Gln Gln Asp Arg Ala Thr
450 455 460
Leu Met Asp Ile Asp Gly Asp Gly His Leu Asp Trp Val Ile Ala Gly
465 470 475 480
Ala Gly Ile Gln Gly Arg Tyr Ser Met Gln Pro Asn Gly Glu Trp Thr
485 490 495
His Phe Ile Pro Ile Ser Ala Leu Pro Thr Glu Tyr Phe His Pro Gln
500 505 510
Ala Gln Leu Ala Asp Leu Val Gly Ala Gly Leu Ser Asp Leu Ala Leu
515 520 525
Ile Gly Pro Arg Ser Val Arg Leu Tyr Ala Asn Asp Arg Gly Asn Trp
530 535 540
Lys Ala Gly Ile Asn Val Met Pro Pro Asp Gly Val Asn Leu Pro Ile
545 550 555 560
Phe Gly Gly Asp Ala Ser Ser Leu Val Ala Phe Ser Asp Met Leu Gly
565 570 575
Ser Gly Gln Gln His Leu Val Glu Ile Ala Ala Gln Ser Val Lys Cys
580 585 590
Trp Pro Asn Leu Gly His Gly Arg Phe Gly Ala Ala Ile Leu Leu Pro
595 600 605
Gly Phe Ser Gln Pro Asn Gly Thr Phe Asn Ala Asn Gln Val Phe Leu
610 615 620
Ala Asp Ile Asp Gly Ser Gly Thr Ala Asp Ile Ile Tyr Ala His Ser
625 630 635 640
Thr Tyr Leu Asp Ile Tyr Leu Asn Glu Ser Gly Asn Arg Phe Ser Ala
645 650 655
Pro Val Arg Leu Asn Leu Pro Glu Gly Val Met Phc Asp Asn Thr Cys
660 665 670
Gln Leu Gln Val Ser Asp Ile Gln Gly Leu Gly Ala Ala Ser Ile Val
675 680 685
Leu Thr Val Pro His Met Thr Pro Arg His Trp Arg Tyr Asp Phe Thr
690 695 700
His Asn Lys Pro Trp Leu Leu Asn Val Ile Asn Asn Asn Arg Gly Ala
705 710 715 720
Glu Thr Thr Leu Phe Tyr Arg Ser Ser Ala Gln Phe Trp Leu Asp Glu
725 730 735
Lys Ser Gln Ile Glu Glu Leu Gly Lys Phe Ala Ala Ser Tyr Leu Pro
740 745 750
Phe Pro Ile His Leu Leu Trp Arg Asn Glu Ala Leu Asp Glu Ile Thr
755 760 765
Gly Asn Arg Leu Thr Lys Val Met Asn Tyr Ala His Gly Ala Trp Asp
770 775 780
Gly Arg Glu Arg Glu Phe Cys Gly Phe Gly Arg Val Thr Gln Ile Asp
785 790 795 800
Thr Asp Glu Phe Ala Lys Gly Thr Thr Glu Lys Ala Pro Asp Glu Asn
805 810 815
Ile Tyr Pro Ser Arg Ser Ile Ser Trp Phe Ala Thr Gly Leu Pro Glu
820 825 830
Val Asp Ser Gln Leu Pro Ala Glu Tyr Trp Arg Gly Asp Asp Gln Ala
835 840 845
Phe Ala Gly Phe Thr Pro Arg Phe Thr Arg Tyr Glu Lys Gly Asn Ala
850 855 860
Gly Gln Glu Gly Gln Asp Thr Pro Ile Lys Glu Pro Thr Glu Thr Glu
865 870 875 880
Ala Tyr Trp Leu Asn Arg Ala Met Lys Gly Gln Leu Leu Arg Ser Glu
885 890 895
Val Tyr Gly Asp Asp Lys Thr Glu Lys Ala Lys Ile Pro Tyr Thr Val
900 905 910
Thr Glu Ala Arg Cys Gln Val Arg Leu Ile Pro Ser Asn Asp Glu Ala
915 920 925
Ala Pro Ser Ser Trp Thr Ser Ile Ile Glu Asn Arg Ser Tyr His Tyr
930 935 940
Glu Arg Ile Val Val Asp Pro Ser Cys Lys Gln Gln Val Val Leu Lys
945 950 955 960
Ala Asp Glu Tyr Gly Phe Pro Leu Ala Lys Val Asp Ile Ala Tyr Pro
965 970 975
Arg Arg Asn Lys Pro Ala Gln Asn Pro Tyr Pro Asp Ser Leu Pro Asp
980 985 990
Thr Leu Phe Ala Asp Ser Tyr Asp Asp Gln Gln Lys Gln Leu Tyr Leu
995 1000 1005
Thr Lys Gln Gln Gln Ser Tyr Tyr His Leu Thr Gln Gln Asp Asp
1010 1015 1020
Trp Val Leu Gly Leu Thr Asp Ser Arg Tyr Ser Glu Val Tyr His
1025 1030 1035
Tyr Ala Gln Thr Asp Ala Gln Ser Asp Ile Pro Lys Ala Gly Leu
1040 1045 1050
Ile Leu Glu Asp Leu Leu Lys Val Asp Gly Leu Ile Gly Lys Asp
1055 1060 1065
Lys Thr Phe Ile Tyr Leu Gly Gln Gln Arg Val Ala Tyr Val Gly
1070 1075 1080
Gly Asp Ala Glu Lys Pro Thr Arg Gln Val Arg Val Ala Tyr Thr
1085 1090 1095
Glu Thr Ala ALa Phe Asp Asp Asn Ala Leu His Ala Phe Asp Gly
1100 1105 1110
Val Ile Ala Pro Asp Glu Leu Thr Gln Gln Leu Leu Ala Gly Gly
1115 1120 1125
Tyr Leu Leu Val Pro Gln Ile Ser Asp Val Ala Gly Ser Ser Glu
1130 1135 1140
Lys Val Trp Val Ala Arg Gln Gly Tyr Thr Glu Tyr Gly Ser Ala
1145 1150 1155
Ala Gln Phe Tyr Arg Pro Leu Ile Gln Arg Lys Ser Leu Leu Thr
1160 1165 1170
Gly Lys Tyr Thr Leu Ser Trp Asp Thr His Tyr Cys Val Val Val
1175 1180 1185
Lys Thr Glu Asp Gly Ala Gly Met Thr Thr Gln Ala Lys Tyr Asp
1190 1195 1200
Tyr Arg Phe Leu Leu Pro Ala Gln Leu Thr Asp Ile Asn Asp Asn
1205 1210 1215
Gln His Ile Val Thr Phe Asn Ala Leu Gly Gln Val Thr Ser Ser
1220 1225 1230
Arg Phe Trp Gly Thr Glu Asn Gly Lys Ile Ser Gly Tyr Ser Thr
1235 1240 1245
Pro Glu Ser Lys Pro Phe Thr Val Pro Asp Thr Val Glu Lys Ala
1250 1255 1260
Leu Ala Leu Gln Pro Thr Ile Pro Val Ser Gln Cys Asn Ile Tyr
1265 1270 1275
Val Pro Asp Ser Trp Met Arg Leu Leu Pro Gln Gln Ser Leu Thr
1280 1285 1290
Gly Gln Leu Lys Glu Gly Glu Thr Leu Trp Asn Ala Leu His Arg
1295 1300 1305
Ala Gly Val Val Thr Glu Asp Gly Leu Ile Cys Glu Leu Ala Tyr
1310 1315 1320
Arg Arg Trp Ile Lys Arg Gln Ala Thr Ser Ser Met Met Ala Val
1325 1330 1335
Thr Leu Gln Gln Ile Leu Ala Gln Thr Pro Arg Gln Pro Pro His
1340 1345 1350
Ala Met Thr Ile Thr Thr Asp Arg Tyr Asp Ser Asp Ser Gln Gln
1355 1360 1365
Gln Leu Arg Gln Ser Ile Val Leu Ser Asp Gly Phe Gly Arg Val
1370 1375 1380
Leu Gln Ser Ala Gln Arg His Glu Ala Gly Glu Ala Trp Gln Arg
1385 1390 1395
Ala Glu Asp Gly Ser Leu Val Val Asp Asn Thr Gly Lys Pro Val
1400 1405 1410
Val Ala Asn Thr Thr Thr Arg Trp Ala Val Ser Gly Arg Thr Glu
1415 1420 1425
Tyr Asp Gly Lys Gly Gln Ala Ile Arg Ala Tyr Leu Pro Tyr Tyr
1430 1435 1440
Leu Asn Asp Trp Arg Tyr Val Ser Asp Asp Ser Ala Arg Asp Asp
1445 1450 1455
Leu Tyr Ala Asp Thr His Phe Tyr Asp Pro Leu Gly Arg Glu Tyr
1460 1465 1470
Gln Val Lys Thr Ala Lys Gly Phe Trp Arg Glu Asn Met Phe Met
1475 1480 1485
Pro Trp Phe Val Val Asn Glu Asp Glu Asn Asp Thr Ala Ala Arg
1490 1495 1500
Leu Thr Ser
1505
<210>3
<211>2889
<212>DNA
<213>伯氏致病杆菌
<400>3
atgaatgttt ttaatccaac tttatatgcc ggtacaccga ctgtcaccgt catggacaat 60
cgagggctgt cagtgcggga tattgcttat caccgtacaa cagcaggaga gcaggctgac 120
actcgcatca cccgccatca atacagtccc cataattttt taatcgagag cattgatcca 180
cgcctttttg atttgcaatc tcagagcacc ataaaaccta atttcaccta ctgtcctgcc 240
ttgaagggtg atgtcctacg gacagagagt gtggatgccg gacaaactgt cattttgagt 300
gacatcgaag gtcgtccgtt actgaatatc agtgcgatgg gtgtcgtcaa acactggcaa 360
tatgaagaga gtacattgcc ggggcgcttg ctcgctgtca gtgaacggaa gaatgaggct 420
tcaacacccc aaattattga acggtttatt tggtcgggaa atagcccatc agaaaaagat 480
cacaatttgg cgggaaaata tcttcgtcat tatgataccg ccggattaaa ccagcttaat 540
gctgtgtctc tgaccagcgt ggatctctca caatcccgtc agttattgca ggatgatgtc 600
acagcagatt ggagcggaag tgacgaatcc cagtggaaga cgcgactgag taacgacata 660
ttcacaaccg aaatcaccgc tgatgcggtt ggcaatttct tgactcagaa tgatgccaaa 720
agcaaccagc aacgattgtc ctatgatgtg gcagggcagt taaaggcaag ctggctgacg 780
ataaaaggcc agaatgagca ggtgatagtt aactccctga cttactccgc cgcagggcag 840
aaactgcgtg aagagcaggg taacggcgtt gtcactgaat actcctatga agcacaaacc 900
tggcgtttga taggtgtaac ggcttaccgt cagtcagata aaaaaagatt gcaggatctt 960
gtctataact atgatccggt cggtaatctc ctgaatattc gcaataatgc agaggcaacc 1020
cgtttctggc gtaatcagat agtagaacca gagaaccact atgcttatga ctcgctttat 1080
caactcatca gtgctagtgg tcgagaaatc gccagtatcg gtcagcaggg cagccggctg 1140
cctgtaccga ttattcctct tcctgccaat gacgatgttt atactcgcta cacccgcaca 1200
tatcactatg atcgcggtgg aaatctctgc cagatccggc attgcgctcc tgctacagat 1260
aataagtaca ccacaaagat caccgtatcg aatcgtagta atcgtgcagt atgggatacc 1320
ttgaccacag atcccgccaa agtggatacc ctgtttgatc atggagggca tcaacttcaa 1380
ctccagtcag gccagacttt atgttggaac tatcggggtg aactacagca aataacaaag 1440
atacagcgtg acgaaaaacc cgcagataaa gagcggtatc gctatggtgt tggggctgcg 1500
cgggtcgtga aaatcagcac acagcaggcg gggggaagca gccatgtgca gcgtgttgtt 1560
tatctgccgg ggttggaact acgcacaact cagcatgatg cgacattaat cgaagactta 1620
caggtgatta tcatgggtga agcaggacgt gctcaggtac gcgtacttca ttgggaaata 1680
ccaccaccgg ataatcttaa caatgactca ctgcgttaca gctacgatag tttgatgggt 1740
tccagtcagc ttgaattgga tggagcaggg cagattatta cgcaggaaga atactacccc 1800
tatggaggta cagcaatatg ggcggcaaga aaccagaccg aagccaatta caaaaccatt 1860
cgctactccg gcaaagagcg tgatgcgacg gggctttatt actacgggca ccgttattat 1920
cagccgtggc tagggcgctg gttgagcgca gatcccgccg gaaccgtgga cggactgaat 1980
ctatatcgaa tggtgaggaa taacccgatt acttaccggg atgcagatgg gcttgcgccg 2040
ataggcgata agatcagcga agggatttat gagcctgagt tgcgagttgg tcttgaacga 2100
gatgacccaa atgtcagaga ttatgaccgg gtttatcctg atacggccaa gacagagatg 2160
atcgaagcaa ctgcgaccac aattgctccc agtcaaatgt tatcggcgca tgcttttgca 2220
tctgtaccta tattgacaga tttgtttaat cctcaaacag caaggctttc tcaaaagaca 2280
acggatattg tattaaacac acaaggtgga ggcgatttaa tctttactgg catgaatatt 2340
aaaggtaagg gaaaagaatt taatgcatta aaaatcgttg atacttatgg cggagaaatg 2400
cctgatagca aaaccgctat ttcagcatat tggcttccgc aaggtgggta tactgatatt 2460
ccgatacatc cgactggaat acaaaagtat ttgtttacgc ctgcgtttag tggttgcact 2520
ctggcagtag ataagcttaa cgaaaataca ttacgggcgt atcacgtcga aggaagtaag 2580
gaagatgctc aatataataa tttagcagtt gcagcgcacg gagagggttt ggtcatggct 2640
atggaatttc ctgactatgg atttcataca gacaaaacag ggcaaagact aaggaacaca 2700
cagggatttg cgtttatgtc ctacaatcaa tcccagaaaa aatgggaaat tcattatcaa 2760
aggcaagcat tgacatcaaa caccggtatc atgaatgtta gtgctaaaaa caagattcga 2820
ttgaatgccc ccagtcatgt aaaaaatagc tcaatcaaag gaactgaaat aatgacgaca 2880
catttttaa 2889
<210>4
<211>962
<212>PRT
<213>伯氏致病杆菌
<400>4
Met Asn Val Phe Asn Pro Thr Leu Tyr Ala Gly Thr Pro Thr Val Thr
1 5 10 15
Val Met Asp Asn Arg Gly Leu Ser Val Arg Asp Ile Ala Tyr His Arg
20 25 30
Thr Thr Ala Gly Glu Gln Ala Asp Thr Arg Ile Thr Arg His Gln Tyr
35 40 45
Ser Pro His Asn Phe Leu Ile Glu Ser Ile Asp Pro Arg Leu Phe Asp
50 55 60
Leu Gln Ser Gln Ser Thr Ile Lys Pro Asn Phe Thr Tyr Cys Pro Ala
65 70 75 80
Leu Lys Gly Asp Val Leu Arg Thr Glu Ser Val Asp Ala Gly Gln Thr
85 90 95
Val Ile Leu Ser Asp Ile Glu Gly Arg Pro Leu Leu Asn Ile Ser Ala
100 105 110
Met Gly Val Val Lys His Trp Gln Tyr Glu Glu Ser Thr Leu Pro Gly
115 120 125
Arg Leu Leu Ala Val Ser Glu Arg Lys Asn Glu Ala Ser Thr Pro Gln
130 135 140
Ile Ile Glu Arg Phe Ile Trp Ser Gly Asn Ser Pro Ser Glu Lys Asp
145 150 155 160
His Asn Leu Ala Gly Lys Tyr Leu Arg His Tyr Asp Thr Ala Gly Leu
165 170 175
Asn Gln Leu Asn Ala Val Ser Leu Thr Ser Val Asp Leu Ser Gln Ser
180 185 190
Arg Gln Leu Leu Gln Asp Asp Val Thr Ala Asp Trp Ser Gly Ser Asp
195 200 205
Glu Ser Gln Trp Lys Thr Arg Leu Ser Asn Asp Ile Phe Thr Thr Glu
210 215 220
Ile Thr Ala Asp Ala Val Gly Asn Phe Leu Thr Gln Asn Asp Ala Lys
225 230 235 240
Ser Asn Gln Gln Arg Leu Ser Tyr Asp Val Ala Gly Gln Leu Lys Ala
245 250 255
Ser Trp Leu Thr Ile Lys Gly Gln Asn Glu Gln Val Ile Val Asn Ser
260 265 270
Leu Thr Tyr Ser Ala Ala Gly Gln Lys Leu Arg Glu Glu Gln Gly Asn
275 280 285
Gly Val Val Thr Glu Tyr Ser Tyr Glu Ala Gln Thr Trp Arg Leu Ile
290 295 300
Gly Val Thr Ala Tyr Arg Gln Ser Asp Lys Lys Arg Leu Gln Asp Leu
305 310 315 320
Val Tyr Asn Tyr Asp Pro Val Gly Asn Leu Leu Asn Ile Arg Asn Asn
325 330 335
Ala Glu Ala Thr Arg Phe Trp Arg Asn Gln Ile Val Glu Pro Glu Asn
340 345 350
His Tyr Ala Tyr Asp Ser Leu Tyr Gln Leu Ile Ser Ala Ser Gly Arg
355 360 365
Glu Ile Ala Ser Ile Gly Gln Gln Gly Ser Arg Leu Pro Val Pro Ile
370 375 380
Ile Pro Leu Pro Ala Asn Asp Asp Val Tyr Thr Arg Tyr Thr Arg Thr
385 390 395 400
Tyr His Tyr Asp Arg Gly Gly Asn Leu Cys Gln Ile Arg His Cys Ala
405 410 415
Pro Ala Thr Asp Asn Lys Tyr Thr Thr Lys Ile Thr Val Ser Asn Arg
420 425 430
Ser Asn Arg Ala Val Trp Asp Thr Leu Thr Thr Asp Pro Ala Lys Val
435 440 445
Asp Thr Leu Phe Asp His Gly Gly His Gln Leu Gln Leu Gln Ser Gly
450 455 460
Gln Thr Leu Cys Trp Asn Tyr Arg Gly Glu Leu Gln Gln Ile Thr Lys
465 470 475 480
Ile Gln Arg Asp Glu Lys Pro Ala Asp Lys Glu Arg Tyr Arg Tyr Gly
485 490 495
Val Gly Ala Ala Arg Val Val Lys Ile Ser Thr Gln Gln Ala Gly Gly
500 505 510
Ser Ser His Val Gln Arg Val Val Tyr Leu Pro Gly Leu Glu Leu Arg
515 520 525
Thr Thr Gln His Asp Ala Thr Leu Ile Glu Asp Leu Gln Val Ile Ile
530 535 540
Met Gly Glu Ala Gly Arg Ala Gln Val Arg Val Leu His Trp Glu Ile
545 550 555 560
Pro Pro Pro Asp Asn Leu Asn Asn Asp Ser Leu Arg Tyr Ser Tyr Asp
565 570 575
Ser Leu Met Gly Ser Ser Gln Leu Glu Leu Asp Gly Ala Gly Gln Ile
580 585 590
Ile Thr Gln Glu Glu Tyr Tyr Pro Tyr Gly Gly Thr Ala Ile Trp Ala
595 600 605
Ala Arg Asn Gln Thr Glu Ala Asn Tyr Lys Thr Ile Arg Tyr Ser Gly
610 615 620
Lys Glu Arg Asp Ala Thr Gly Leu Tyr Tyr Tyr Gly His Arg Tyr Tyr
625 630 635 640
Gln Pro Trp Leu Gly Arg Trp Leu Ser Ala Asp Pro Ala Gly Thr Val
645 650 655
Asp Gly Leu Asn Leu Tyr Arg Met Val Arg Asn Asn Pro Ile Thr Tyr
660 665 670
Arg Asp Ala Asp Gly Leu Ala Pro Ile Gly Asp Lys Ile Ser Glu Gly
675 680 685
Ile Tyr Glu Pro Glu Leu Arg Val Gly Leu Glu Arg Asp Asp Pro Asn
690 695 700
Val Arg Asp Tyr Asp Arg Val Tyr Pro Asp Thr Ala Lys Thr Glu Met
705 710 715 720
Ile Glu Ala Thr Ala Thr Thr Ile Ala Pro Ser Gln Met Leu Ser Ala
725 730 735
His Ala Phe Ala Ser Val Pro Ile Leu Thr Asp Leu Phe Asn Pro Gln
740 745 750
Thr Ala Arg Leu Ser Gln Lys Thr Thr Asp Ile Val Leu Asn Thr Gln
755 760 765
Gly Gly Gly Asp Leu Ile Phe Thr Gly Met Asn Ile Lys Gly Lys Gly
770 775 780
Lys Glu Phe Asn Ala Leu Lys Ile Val Asp Thr Tyr Gly Gly Glu Met
785 790 795 800
Pro Asp Ser Lys Thr Ala Ile Ser Ala Tyr Trp Leu Pro Gln Gly Gly
805 810 815
Tyr Thr Asp Ile Pro Ile His Pro Thr Gly Ile Gln Lys Tyr Leu Phe
820 825 830
Thr Pro Ala Phe Ser Gly Cys Thr Leu Ala Val Asp Lys Leu Asn Glu
835 840 845
Asn Thr Leu Arg Ala Tyr His Val Glu Gly Ser Lys Glu Asp Ala Gln
850 855 860
Tyr Asn Asn Leu Ala Val Ala Ala His Gly Glu Gly Leu Val Met Ala
865 870 875 880
Met Glu Phe Pro Asp Tyr Gly Phe His Thr Asp Lys Thr Gly Gln Arg
885 890 895
Leu Arg Asn Thr Gln Gly Phe Ala Phe Met Ser Tyr Asn Gln Ser Gln
900 905 910
Lys Lys Trp Glu Ile His Tyr Gln Arg Gln Ala Leu Thr Ser Asn Thr
915 920 925
Gly Ile Met Asn Val Ser Ala Lys Asn Lys Ile Arg Leu Asn Ala Pro
930 935 940
Ser His Val Lys Asn Ser Ser Ile Lys Gly Thr Glu Ile Met Thr Thr
945 950 955 960
His Phe
<210>5
<211>3822
<212>DNA
<213>伯氏致病杆菌
<400>5
gatctcacaa agaccaaaac aaaggggatg tatatttatt ctgacatgac cacaaaagtc 60
atgattgata gtgagatcaa aaattatcaa aacagcgttt accgagagtt cgatacattg 120
acccagcgac ggttaaataa ccgttatgcg gcgaattatg attatccgtc ttccgttgca 180
gtcagtagtg gttacgagtg gggcgattac tctctgagta tggtttatga cagtaaaatt 240
gcttccattg ctaccgtcgg aactacttca tcagagatca aattgaaaat cgatgccgac 300
ctgcgggtaa tttataacgg ggttgaaggc aggcagcgtc accaatgtgc tctgatgcaa 360
aaatttggtc agttaggtga taaatttatt gtttacgaag acctgaaaat tgacagagag 420
aatcagagtg caggcaataa caatctcttt tatcccgttt atcaatacag tggcaatgtc 480
agtaaattgt caaaagggcg tttattagtt tatagagaaa gttcatcatc ttatgtcaag 540
gcagatattg ggccagggca cgatccgctc attaatgaaa atgctcaaaa accttatggt 600
tatgttgaag acagcaaaaa tgaccccgcc gcgctgaaaa ataacatgac actgacggat 660
aacgcgggta tttctacgaa ggtcgcatca ccaagagata tcgatactgc tgtaacgccg 720
gcaaatatca cgattaaggc cagtgcaggc agcagtaaac ctgtagagtt taacgctggc 780
acatctgtca taaatctgcc caacaacaac ttggaagaaa tgatctataa cttccatgat 840
atggaattca ctatcccact gacagaattt aaggacaacc aagtcgaggt ggagatagtg 900
ctgaccggga aaacggatga tggccgggtt ctgggaagtg aaacctttaa ttttaccgtt 960
acacagaaaa ttctgaatga acagtcaggg ttgctgacgc tcaatactgc tgcgtctaaa 1020
gcccaatatc tgcaatgggg gccttaccgt acccgcatta ataccttatt tgccagaaat 1080
atggtggaac gggcagaaac gggcattgat accctgctga caatggatac ccaacaactg 1140
cctgaaccta aaatgggaga tgggggatat attagtgtca ccttacccaa atatgatcca 1200
gataagcatg gtagtaccag aaacgccgcg gtcacacttt atcaggaaaa agatggtgta 1260
gactcaacaa cgcattacgg cttctgggac gggtcgttaa cagatgcaga acaaaccatc 1320
aaactgttta ttccattaac tagcacgaaa gaacctttct ataacacgat tgattttcca 1380
tcttcgataa gtgacgggct tcaagttttt ctaaaaagcg ctaaggaagg tttgctggcc 1440
ggaaccttaa aaacagcgtt tactccatct gaggataaga aggccaatat tgtcttcacg 1500
gaatataccc ctgtttcggg tacgccaccc atgaaggttg aactgctgtc caaatattat 1560
gatcagccga tggattttaa cggcgccaac tccctctact tctgggaatt gttctattac 1620
agcccgatgc tggtagcgca gcgcttgttg caggaacaaa attttgatga agccaatcat 1680
tggctgaaat atgtttacag ccctgagggc tatatcgtca aaggtgagat tgcgccgtat 1740
cattggaatt gccggccact ggaagaagat acttcgtgga actctaaccc gctggattcc 1800
acagaccccg atgccgtcgc ccaagatgat ccgatgcact ataaagtttc taccttcatg 1860
cggatgctcg atctgctgat tgcccgtggc gacaaggctt accgccagct tgagcgggat 1920
actttgaatg aagccaagct ctggtatata caggcactga atctattggg ggatgagcag 1980
tttgtggcgc tggatggcaa ctggtctgaa cccacgttgg aaaccgcagc ggataagacg 2040
gtggaacagg attatcagca tgcgctgatg ttaattcgcc tggtacagcc cgccgaatat 2100
accgctaact cactgaccaa cctatttttg cctcaacaaa atgacaaact gaatggctac 2160
tggcaaacat tgaagcagcg cttgtataac ctgcgtcata acctcaccat tgatggcctg 2220
ccgctgtcac tgcctattta cgccaaacct gccgatccta aagccttgtt gagtgcggcg 2280
gtgaatgctt cccagggagg cacggatctg ccaaatccgg aaatgccact tcatcgtttc 2340
cccatcatgt tggataacgc gaagagcata gtcagtcaac tcattcagtt tggttctacc 2400
ttacagggga tcattgaacg tcaggatgca gaagcgctca acgaattgct gcaaaatcaa 2460
gcgcgtgaac tgacgctgat cagcattcag atgcagaata aaacgctgga agaattggat 2520
gcggaaaaag aagtactgaa acaatcccga ctaggggcgc aatcacgctt tgacagctat 2580
agcaagctgt acgatgaaaa catcaacgat ggcgaaaaaa ctgctatgga tttgcgtact 2640
gctgccagca cgataagtac tgccctggaa gccgctaaat tggcagaggc cggtgccgat 2700
atgttcccaa atatcttcgg tcttgctggt ggtggcagcc gatggggggc tatccctggc 2760
gcacttgctt ctgtgatggg ctttaccgcc ggcacactca atacgaaagc cgaacgaacc 2820
acacagtctg aaatttaccg ccgccgccgt caggagtggg aaattcagcg caccaatgca 2880
gatcatgaag ttaagcaaat tgacgctcaa ttgaaatcac tggaaatccg gcgtgaagcg 2940
gcagacatgc agaaaaccta tctggaaacc cagcaggctc agacacaggc acaattggaa 3000
ttcctgcaac gtaaattcag taacagagcg ttgtacaact ggatgcgggg tcgtctggcc 3060
gccatttact tccagttcta tgatcttgcc acctctcgtt gcctgatggc acagcaagcc 3120
taccagtggg aaaccaatga tacagcagcc agctttatca aatcgggggc atggcaggga 3180
acctatgctg gcctgctcgc cggcgagtct ctgatactga accttgtcca gatggaagat 3240
gccttcatga aaaaagatga acgggcattg gaaatcacgc gtaccgtttc gttggctgag 3300
gtttaccgtt ctctgcctga tgccgataaa ttcatacttc ctgacgcagt tgctgattta 3360
ttgaactccc cggggaaatc attcgggaaa gatcagaaca cactaaaaat tgagacgaat 3420
caactggaag catccgtaaa tctgtctggt ctcaacattt ggggagatta cccggaacaa 3480
cttggcgcgg ctcgtcgcat caaacaagtg agtgtttccc tgcctgcctt gcttggaccg 3540
tatcaggatg tacaggccat cttgagctat agcggtgaca tgaagggcat tcccaaaggt 3600
tgcagtgcta tcgcggtatc caatggcatg aatgacagcg ggcaattcca gttggatttc 3660
aatgacacca aatacctgcc atttgaaggg atcaatattc cgaaagataa agatcaaagt 3720
gcactggtgc tgagtttccc caacgcggac gctaaacaga aaacgatgtt gctcagtttg 3780
agcgacatca ttctgcacat tcgctacacc attcgcaaat aa 3822
<210>6
<211>1273
<212>PRT
<213>伯氏致病杆菌
<400>6
Asp Leu Thr Lys Thr Lys Thr Lys Gly Met Tyr Ile Tyr Ser Asp Met
1 5 10 15
Thr Thr Lys Val Met Ile Asp Ser Glu Ile Lys Asn Tyr Gln Asn Ser
20 25 30
Val Tyr Arg Glu Phe Asp Thr Leu Thr Gln Arg Arg Leu Asn Asn Arg
35 40 45
Tyr Ala Ala Asn Tyr Asp Tyr Pro Ser Ser Val Ala Val Ser Ser Gly
50 55 60
Tyr Glu Trp Gly Asp Tyr Ser Leu Ser Met Val Tyr Asp Ser Lys Ile
65 70 75 80
Ala Ser Ile Ala Thr Val Gly Thr Thr Ser Ser Glu Ile Lys Leu Lys
85 90 95
Ile Asp Ala Asp Leu Arg Val Ile Tyr Asn Gly Val Glu Gly Arg Gln
100 105 110
Arg His Gln Cys Ala Leu Met Gln Lys Phe Gly Gln Leu Gly Asp Lys
115 120 125
Phe Ile Val Tyr Glu Asp Leu Lys Ile Asp Arg Glu Asn Gln Ser Ala
130 135 140
Gly Asn Asn Asn Leu Phe Tyr Pro Val Tyr Gln Tyr Ser Gly Asn Val
145 150 155 160
Ser Lys Leu Ser Lys Gly Arg Leu Leu Val Tyr Arg Glu Ser Ser Ser
165 170 175
Ser Tyr Val Lys Ala Asp Ile Gly Pro Gly His Asp Pro Leu Ile Asn
180 185 190
Glu Asn Ala Gln Lys Pro Tyr Gly Tyr Val Glu Asp Ser Lys Asn Asp
195 200 205
Pro Ala Ala Leu Lys Asn Asn Met Thr Leu Thr Asp Asn Ala Gly Ile
210 215 220
Ser Thr Lys Val Ala Ser Pro Arg Asp Ile Asp Thr Ala Val Thr Pro
225 230 235 240
Ala Asn Ile Thr Ile Lys Ala Ser Ala Gly Ser Ser Lys Pro Val Glu
245 250 255
Phe Asn Ala Gly Thr Ser Val Ile Asn Leu Pro Asn Asn Asn Leu Glu
260 265 270
Glu Met Ile Tyr Asn Phe His Asp Met Glu Phe Thr Ile Pro Leu Thr
275 280 285
Glu Phe Lys Asp Asn Gln Val Glu Val Glu Ile Val Leu Thr Gly Lys
290 295 300
Thr Asp Asp Gly Arg Val Leu Gly Ser Glu Thr Phe Asn Phe Thr Val
305 310 315 320
Thr Gln Lys Ile Leu Asn Glu Gln Ser Gly Leu Leu Thr Leu Asn Thr
325 330 335
Ala Ala Ser Lys Ala Gln Tyr Leu Gln Trp Gly Pro Tyr Arg Thr Arg
340 345 350
Ile Asn Thr Leu Phe Ala Arg Asn Met Val Glu Arg Ala Glu Thr Gly
355 360 365
Ile Asp Thr Leu Leu Thr Met Asp Thr Gln Gln Leu Pro Glu Pro Lys
370 375 380
Met Gly Asp Gly Gly Tyr Ile Ser Val Thr Leu Pro Lys Tyr Asp Pro
385 390 395 400
Asp Lys His Gly Ser Thr Arg Asn Ala Ala Val Thr Leu Tyr Gln Glu
405 410 415
Lys Asp Gly Val Asp Ser Thr Thr His Tyr Gly Phe Trp Asp Gly Ser
420 425 430
Leu Thr Asp Ala Glu Gln Thr Ile Lys Leu Phe Ile Pro Leu Thr Ser
435 440 445
Thr Lys Glu Pro Phe Tyr Asn Thr Ile Asp Phe Pro Ser Ser Ile Ser
450 455 460
Asp Gly Leu Gln Val Phe Leu Lys Ser Ala Lys Glu Gly Leu Leu Ala
465 470 475 480
Gly Thr Leu Lys Thr Ala Phe Thr Pro Ser Glu Asp Lys Lys Ala Asn
485 490 495
Ile Val Phe Thr Glu Tyr Thr Pro Val Ser Gly Thr Pro Pro Met Lys
500 505 510
Val Glu Leu Leu Ser Lys Tyr Tyr Asp Gln Pro Met Asp Phe Asn Gly
515 520 525
Ala Asn Ser Leu Tyr Phe Trp Glu Leu Phe Tyr Tyr Ser Pro Met Leu
530 535 540
Val Ala Gln Arg Leu Leu Gln Glu Gln Asn Phe Asp Glu Ala Asn His
545 550 555 560
Trp Leu Lys Tyr Val Tyr Ser Pro Glu Gly Tyr Ile Val Lys Gly Glu
565 570 575
Ile Ala Pro Tyr His Trp Asn Cys Arg Pro Leu Glu Glu Asp Thr Ser
580 585 590
Trp Asn Ser Asn Pro Leu Asp Ser Thr Asp Pro Asp Ala Val Ala Gln
595 600 605
Asp Asp Pro Met His Tyr Lys Val Ser Thr Phe Met Arg Met Leu Asp
610 615 620
Leu Leu Ile Ala Arg Gly Asp Lys Ala Tyr Arg Gln Leu Glu Arg Asp
625 630 635 640
Thr Leu Asn Glu Ala Lys Leu Trp Tyr Ile Gln Ala Leu Asn Leu Leu
645 650 655
Gly Asp Glu Gln Phe Val Ala Leu Asp Gly Asn Trp Ser Glu Pro Thr
660 665 670
Leu Glu Thr Ala Ala Asp Lys Thr Val Glu Gln Asp Tyr Gln His Ala
675 680 685
Leu Met Leu Ile Arg Leu Val Gln Pro Ala Glu Tyr Thr Ala Asn Ser
690 695 700
Leu Thr Asn Leu Phe Leu Pro Gln Gln Asn Asp Lys Leu Asn Gly Tyr
705 710 715 720
Trp Gln Thr Leu Lys Gln Arg Leu Tyr Asn Leu Arg His Asn Leu Thr
725 730 735
Ile Asp Gly Leu Pro Leu Ser Leu Pro Ile Tyr Ala Lys Pro Ala Asp
740 745 750
Pro Lys Ala Leu Lcu Ser Ala Ala Val Asn Ala Ser Gln Gly Gly Thr
755 760 765
Asp Leu Pro Asn Pro Glu Met Pro Leu His Arg Phe Pro Ile Met Leu
770 775 780
Asp Asn Ala Lys Ser Ile Val Ser Gln Leu Ile Gln Phe Gly Scr Thr
785 790 795 800
Leu Gln Gly Ile Ile Glu Arg Gln Asp Ala Glu Ala Leu Asn Glu Leu
805 810 815
Leu Gln Asn Gln Ala Arg Glu Leu Thr Leu Ile Ser Ile Gln Met Gln
820 825 830
Asn Lys Thr Leu Glu Glu Leu Asp Ala Glu Lys Glu Val Leu Lys Gln
835 840 845
Ser Arg Leu Gly Ala Gln Ser Arg Phe Asp Ser Tyr Ser Lys Leu Tyr
850 855 860
Asp Glu Asn Ile Asn Asp Gly Glu Lys Thr Ala Met Asp Leu Arg Thr
865 870 875 880
Ala Ala Ser Thr Ile Ser Thr Ala Leu Glu Ala Ala Lys Leu Ala Glu
885 890 895
Ala Gly Ala Asp Met Phe Pro Asn Ile Phe Gly Leu Ala Gly Gly Gly
900 905 910
Ser Arg Trp Gly Ala Ile Pro Gly Ala Leu Ala Ser Val Met Gly Phe
915 920 925
Thr Ala Gly Thr Leu Asn Thr Lys Ala Glu Arg Thr Thr Gln Ser Glu
930 935 940
Ile Tyr Arg Arg Arg Arg Gln Glu Trp Glu Ile Gln Arg Thr Asn Ala
945 950 955 960
Asp His Glu Val Lys Gln Ile Asp Ala Gln Leu Lys Ser Leu Glu Ile
965 970 975
Arg Arg Glu Ala Ala Asp Met Gln Lys Thr Tyr Leu Glu Thr Gln Gln
980 985 990
Ala Gln Thr Gln Ala Gln Leu Glu Phe Leu Gln Arg Lys Phe Ser Asn
995 1000 1005
Arg Ala Leu Tyr Asn Trp Met Arg Gly Arg Leu Ala Ala Ile Tyr
1010 1015 1020
Phe Gln Phe Tyr Asp Leu Ala Thr Ser Arg Cys Leu Met Ala Gln
1025 1030 1035
Gln Ala Tyr Gln Trp Glu Thr Asn Asp Thr Ala Ala Ser Phe Ile
1040 1045 1050
Lys Ser Gly Ala Trp Gln Gly Thr Tyr Ala Gly Leu Leu Ala Gly
1055 1060 1065
Glu Ser Leu Ile Leu Asn Leu Val Gln Met Glu Asp Ala Phe Met
1070 1075 1080
Lys Lys Asp Glu Arg Ala Leu Glu Ile Thr Arg Thr Val Ser Leu
1085 1090 1095
Ala Glu Val Tyr Arg Ser Leu Pro Asp Ala Asp Lys Phe Ile Leu
1100 1105 1110
Pro Asp Ala Val Ala Asp Leu Leu Asn Ser Pro Gly Lys Ser Phe
1115 1120 1125
Gly Lys Asp Gln Asn Thr Leu Lys Ile Glu Thr Asn Gln Leu Glu
1130 1135 1140
Ala Ser Val Asn Leu Ser Gly Leu Asn Ile Trp Gly Asp Tyr Pro
1145 1150 1155
Glu Gln Leu Gly Ala Ala Arg Arg Ile Lys Gln Val Ser Val Ser
1160 1165 1170
Leu Pro Ala Leu Leu Gly Pro Tyr Gln Asp Val Gln Ala Ile Leu
1175 1180 1185
Ser Tyr Ser Gly Asp Met Lys Gly Ile Pro Lys Gly Cys Ser Ala
1190 1195 1200
Ile Ala Val Ser Asn Gly Met Asn Asp Ser Gly Gln Phe Gln Leu
1205 1210 1215
Asp Phe Asn Asp Thr Lys Tyr Leu Pro Phe Glu Gly Ile Asn Ile
1220 1225 1230
Pro Lys Asp Lys Asp Gln Ser Ala Leu Val Leu Ser Phe Pro Asn
1235 1240 1245
Ala Asp Ala Lys Gln Lys Thr Met Leu Leu Ser Leu Ser Asp Ile
1250 1255 1260
Ile Leu His Ile Arg Tyr Thr Ile Arg Lys
1265 1270
<210>7
<211>4595
<212>DNA
<213>伯氏致病杆菌
<400>7
tctagaacta gtgtcgacta aagaagaagg agatatacca tgaaacaaga cagccaggac 60
atgacagtaa cacagctgtc cctgcccaaa gggggcggtg cgatcagtgg catgggtgac 120
actatcagca atgcagggcc ggatgggatg gcttcgcttt ccgtgccttt gcctatctct 180
gccggtcggg ggggcgcacc gaatttatcc ctgaactaca gtagcggagc aggaaacggg 240
tcatttggta ttggctggca atccagtacc atggctatca gccgtcgtac tcaacatggc 300
gtaccgcaat atcacggcga agatactttt ttatgtccga tgggagaagt gatggcggtt 360
gccgtcaatc agagcgggca acccgatgtg cgtaaaaccg ataaactatt aggcgggcaa 420
ctgcctgtta cttataccgt tacgcgtcat cagcccagaa atattcagca cttcagcaaa 480
cttgaatact ggcagccccc aacggatgtg gaaaccacgc ctttttggtt aatgtattca 540
cccgatggac aaattcacat tttcggaaaa actgagcagg ctcagatcgc taacccggca 600
gaggtttcac agattgccca atggcttttg gaagaaaccg taacaccagc gggagaacac 660
atttattacc agtatcgggc agaagacgat atcggttgtg atgacagcga aaaaaatgcc 720
caccctaatg ccagtgctca acgttatttg actcaggtga actacggcaa tattacacct 780
gaatccagcc tgcttgtgct gaagaatacg ccaccggcgg ataacgaatg gctattccat 840
ttggtttttg attatggtga acgagcgcag gaaataaaca cggttcctcc tttcaaagca 900
ccttcaaaca actggaaaat acggccagac cgtttctccc gctttgaata tggttttgag 960
gtgcgaaccc gccgcctgtg tcaacaaatt ctgatgttcc atcgcctgaa atcccttgca 1020
ggagaacaga ttgacggaga agaaatccct gccttggttg cccgtctgct tctcagttat 1080
gacctgaacg acagcgtgac aacccttacc gccattcggc aaatggcgta tgaaactgac 1140
gcaaccttaa tcgctttacc gccactggag tttgactatc agccctttga ggcaaaagtc 1200
acgcagaaat ggcaggaaat gcctcaattg gccggattga atgcccaaca accttaccaa 1260
ctcgtcgatc tctatggtga aggtatctcc ggcatcttgt atcaggacag acccggagca 1320
tggtggtatc aggcaccgat ccgtcagaaa aacgttgaag atattaacgc tgtcacctat 1380
agcccaataa accccttacc taagatcccc agccagcagg acagagcaac gttgatggat 1440
atcgacggtg atggacatct ggattgggtg atcgctggcg caggtattca ggggcggtac 1500
agtatgcagc cgaatggaga gtggacacac tttattccca tttctgcact gccaacagaa 1560
tattttcatc cacaggcaca actggcggat ctggtggggg ccgggttatc tgatttagcg 1620
ctgattggcc ccagaagtgt gcgtttatat gccaacgacc gaggaaactg gaaagcgggt 1680
attaatgtta tgccacctga tggtgtgaat ttgccgatat ttggtggtga tgccagcagt 1740
ctggtcgcat tttctgacat gttgggatcg ggacagcagc atttggtgga aattgccgct 1800
cagagcgtca aatgctggcc gaatctagga catggccgtt ttggtgcggc tattttgctg 1860
ccggggttta gccagccgaa tggaacattc aatgctaacc aagtttttct ggcagatatc 1920
gatggttccg gcaccgccga catcatctat gcacacagta cgtatctgga tatttacctg 1980
aacgaaagcg gcaaccgttt cagtgcaccc gttcggctta atttgccgga aggggtgatg 2040
tttgacaata cctgtcagtt acaggtgtcg gatattcaag gattgggcgc tgccagcatt 2100
gtactgaccg tacctcatat gacaccgcgc cattggcgtt atgattttac tcacaataaa 2160
ccttggctgc tcaatgtcat caacaacaat cgtggcgcag aaaccacgtt gttttaccgt 2220
agttctgccc aattctggct ggatgaaaaa agtcagatcg aagagctggg aaaatttgca 2280
gcgagttatc tgcctttccc catacatttg ttgtggcgca atgaggcgct ggatgaaatt 2340
actggtaatc gactgactaa ggtcatgaat tatgcccacg gtgcatggga tggcagagag 2400
agagaatttt gcggatttgg ccgtgtaacg caaattgata ccgacgaatt tgccaaggga 2460
accacagaga aagcgccgga tgaaaatatc tatccttccc gtagcataag ctggtttgcc 2520
acgggtttac cagaagtgga ttctcaactt ccggcagaat actggcgtgg tgacgatcag 2580
gcatttgccg gctttacacc gcgcttcact cgttatgaaa aaggtaatgc ggggcaagag 2640
gggcaggata ccccgattaa agaaccgacc gaaacagaag cgtattggct taaccgcgcc 2700
atgaaaggcc aattactgcg cagtgaagtc tatggtgacg acaaaacaga aaaagctaaa 2760
attccgtaca ccgtcacaga agctcgctgt caggtcagat taattcccag caatgacgaa 2820
gccgcgccgt cgtcttggac gtcgatcatt gaaaaccgca gttatcacta tgagcgtatc 2880
gtcgtcgatc cgagttgcaa acaacaggtc gtgctcaagg cggatgaata tggcttccca 2940
ctggcaaaag tagatatcgc ctatccacgg cgcaataaac cggcacagaa cccttatccg 3000
gattcgttac cggatactct gttcgccgat agctatgacg accagcaaaa acagttatat 3060
ctgacaaaac agcagcagag ctattaccac ctgacccagc aggatgattg ggttctgggt 3120
ttgacggata gccgatacag cgaagtttat cattatgcgc aaactgacgc tcaaagtgac 3180
atccccaagg cagggctgat attggaagac ctgctgaaag ttgacggcct gataggtaaa 3240
gacaagactt ttatctattt agggcagcag cgagtggctt atgtgggagg agatgcagaa 3300
aaaccgacac gtcaggtgcg ggtggcttat acagaaaccg ctgcttttga tgacaatgcg 3360
ctgcacgcct ttgatggcgt gattgcccct gatgaactga cgcaacagtt gctggcgggt 3420
ggatacctgc tcgtgccgca gatttctgat gtggcaggca gtagtgaaaa ggtatgggta 3480
gctcggcagg gatacaccga atacggcagt gctgctcaat tctaccggcc actcatccag 3540
cgcaaaagct tgctgaccgg aaaatatacc cttagttggg atacgcacta ttgtgtggtg 3600
gtaaaaaccg aagatggtgc gggaatgacc acgcaagcga agtacgatta ccgcttcctg 3660
cttccggcgc aattgacaga tatcaatgac aaccagcaca tcgtgacatt taatgcattg 3720
gggcaggtga cttccagccg tttctggggc acagaaaatg gcaaaataag cggttactcg 3780
acgccggaga gtaaaccgtt cacagtaccc gataccgtcg aaaaagccct tgccttgcaa 3840
ccgacgatcc cggtttcaca gtgcaacatt tatgtgccgg atagttggat gcggcttctg 3900
ccccaacagt ctctgactgg ccagctaaaa gagggggaaa ctttgtggaa cgcattacac 3960
cgggcgggtg tagtaacgga agacggtttg atctgtgaac tggcctatcg tcgttggatc 4020
aaacgtcagg caacgtcttc aatgatggcc gtgacattac agcaaatctt ggctcagact 4080
ccacgacaac ctccgcatgc catgacgatc acgacagatc gttatgacag cgattctcag 4140
cagcaacttc ggcagtcgat agtattgagt gatggttttg gtcgcgtatt gcaaagcgcc 4200
cagcgtcatg aagcaggaga ggcatggcag cgtgcagaag atggttcttt ggttgtcgat 4260
aataccggta aacccgttgt tgctaatacc acaacgcgct gggcagtatc cggtcgcaca 4320
gaatacgacg gcaaagggca ggcgatcaga gcttacctgc cttattatct caatgattgg 4380
cgctatgtca gtgatgacag cgcccgggat gacctgtacg ccgataccca tttttacgat 4440
cctctggggc gtgaatatca ggtaaaaacc gcgaaaggat tttggcgtga aaacatgttt 4500
atgccgtggt ttgtcgtcaa tgaagatgaa aatgacacag cagcacgttt aacatcttaa 4560
ttaatgcggc cgcaggcctc tgtaagactc tcgag 4595
<210>8
<211>2947
<212>DNA
<213>伯氏致病杆菌
<400>8
tctagaacta gtaggcctta aagaagagag agatatacca tgaatgtttt taatccaact 60
ttatatgccg gtacaccgac tgtcaccgtc atggacaatc gagggctgtc agtgcgggat 120
attgcttatc accgtacaac agcaggagag caggctgaca ctcgcatcac ccgccatcaa 180
tacagtcccc ataatttttt aatcgagagc attgatccac gcctttttga tttgcaatct 240
cagagcacca taaaacctaa tttcacctac tgtcctgcct tgaagggtga tgtcctacgg 300
acagagagtg tggatgccgg acaaactgtc attttgagtg acatcgaagg tcgtccgtta 360
ctgaatatca gtgcgatggg tgtcgtcaaa cactggcaat atgaagagag tacattgccg 420
gggcgcttgc tcgctgtcag tgaacggaag aatgaggctt caacacccca aattattgaa 480
cggtttattt ggtcgggaaa tagcccatca gaaaaagatc acaatttggc gggaaaatat 540
cttcgtcatt atgataccgc cggattaaac cagcttaatg ctgtgtctct gaccagcgtg 600
gatctctcac aatcccgtca gttattgcag gatgatgtca cagcagattg gagcggaagt 660
gacgaatccc agtggaagac gcgactgagt aacgacatat tcacaaccga aatcaccgct 720
gatgcggttg gcaatttctt gactcagaat gatgccaaaa gcaaccagca acgattgtcc 780
tatgatgtgg cagggcagtt aaaggcaagc tggctgacga taaaaggcca gaatgagcag 840
gtgatagtta actccctgac ttactccgcc gcagggcaga aactgcgtga agagcagggt 900
aacggcgttg tcactgaata ctcctatgaa gcacaaacct ggcgtttgat aggtgtaacg 960
gcttaccgtc agtcagataa aaaaagattg caggatcttg tctataacta tgatccggtc 1020
ggtaatctcc tgaatattcg caataatgca gaggcaaccc gtttctggcg taatcagata 1080
gtagaaccag agaaccacta tgcttatgac tcgctttatc aactcatcag tgctagtggt 1140
cgagaaatcg ccagtatcgg tcagcagggc agccggctgc ctgtaccgat tattcctctt 1200
cctgccaatg acgatgttta tactcgctac acccgcacat atcactatga tcgcggtgga 1260
aatctctgcc agatccggca ttgcgctcct gctacagata ataagtacac cacaaagatc 1320
accgtatcga atcgtagtaa tcgtgcagta tgggatacct tgaccacaga tcccgccaaa 1380
gtggataccc tgtttgatca tggagggcat caacttcaac tccagtcagg ccagacttta 1440
tgttggaact atcggggtga actacagcaa ataacaaaga tacagcgtga cgaaaaaccc 1500
gcagataaag agcggtatcg ctatggtgtt ggggctgcgc gggtcgtgaa aatcagcaca 1560
cagcaggcgg ggggaagcag ccatgtgcag cgtgttgttt atctgccggg gttggaacta 1620
cgcacaactc agcatgatgc gacattaatc gaagacttac aggtgattat catgggtgaa 1680
gcaggacgtg ctcaggtacg cgtacttcat tgggaaatac caccaccgga taatcttaac 1740
aatgactcac tgcgttacag ctacgatagt ttgatgggtt ccagtcagct tgaattggat 1800
ggagcagggc agattattac gcaggaagaa tactacccct atggaggtac agcaatatgg 1860
gcggcaagaa accagaccga agccaattac aaaaccattc gctactccgg caaagagcgt 1920
gatgcgacgg ggctttatta ctacgggcac cgttattatc agccgtggct agggcgctgg 1980
ttgagcgcag atcccgccgg aaccgtggac ggactgaatc tatatcgaat ggtgaggaat 2040
aacccgatta cttaccggga tgcagatggg cttgcgccga taggcgataa gatcagcgaa 2100
gggatttatg agcctgagtt gcgagttggt cttgaacgag atgacccaaa tgtcagagat 2160
tatgaccggg tttatcctga tacggccaag acagagatga tcgaagcaac tgcgaccaca 2220
attgctccca gtcaaatgtt atcggcgcat gcttttgcat ctgtacctat attgacagat 2280
ttgtttaatc ctcaaacagc aaggctttct caaaagacaa cggatattgt attaaacaca 2340
caaggtggag gcgatttaat ctttactggc atgaatatta aaggtaaggg aaaagaattt 2400
aatgcattaa aaatcgttga tacttatggc ggagaaatgc ctgatagcaa aaccgctatt 2460
tcagcatatt ggcttccgca aggtgggtat actgatattc cgatacatcc gactggaata 2520
caaaagtatt tgtttacgcc tgcgtttagt ggttgcactc tggcagtaga taagcttaac 2580
gaaaatacat tacgggcgta tcacgtcgaa ggaagtaagg aagatgctca atataataat 2640
ttagcagttg cagcgcacgg agagggtttg gtcatggcta tggaatttcc tgactatgga 2700
tttcatacag acaaaacagg gcaaagacta aggaacacac agggatttgc gtttatgtcc 2760
tacaatcaat cccagaaaaa atgggaaatt cattatcaaa ggcaagcatt gacatcaaac 2820
accggtatca tgaatgttag tgctaaaaac aagattcgat tgaatgcccc cagtcatgta 2880
aaaaatagct caatcaaagg aactgaaata atgacgacac atttttaatt aatgcggccg 2940
cctcgag 2947
<210>9
<211>7508
<212>DNA
<213>伯氏致病杆菌
<400>9
tctagaacta gtgtcgacta aagaagaagg agatatacca tgaaacaaga cagccaggac 60
atgacagtaa cacagctgtc cctgcccaaa gggggcggtg cgatcagtgg catgggtgac 120
actatcagca atgcagggcc ggatgggatg gcttcgcttt ccgtgccttt gcctatctct 180
gccggtcggg ggggcgcacc gaatttatcc ctgaactaca gtagcggagc aggaaacggg 240
tcatttggta ttggctggca atccagtacc atggctatca gccgtcgtac tcaacatggc 300
gtaccgcaat atcacggcga agatactttt ttatgtccga tgggagaagt gatggcggtt 360
gccgtcaatc agagcgggca acccgatgtg cgtaaaaccg ataaactatt aggcgggcaa 420
ctgcctgtta cttataccgt tacgcgtcat cagcccagaa atattcagca cttcagcaaa 480
cttgaatact ggcagccccc aacggatgtg gaaaccacgc ctttttggtt aatgtattca 540
cccgatggac aaattcacat tttcggaaaa actgagcagg ctcagatcgc taacccggca 600
gaggtttcac agattgccca atggcttttg gaagaaaccg taacaccagc gggagaacac 660
atttattacc agtatcgggc agaagacgat atcggttgtg atgacagcga aaaaaatgcc 720
caccctaatg ccagtgctca acgttatttg actcaggtga actacggcaa tattacacct 780
gaatccagcc tgcttgtgct gaagaatacg ccaccggcgg ataacgaatg gctattccat 840
ttggtttttg attatggtga acgagcgcag gaaataaaca cggttcctcc tttcaaagca 900
ccttcaaaca actggaaaat acggccagac cgtttctccc gctttgaata tggttttgag 960
gtgcgaaccc gccgcctgtg tcaacaaatt ctgatgttcc atcgcctgaa atcccttgca 1020
ggagaacaga ttgacggaga agaaatccct gccttggttg cccgtctgct tctcagttat 1080
gacctgaacg acagcgtgac aacccttacc gccattcggc aaatggcgta tgaaactgac 1140
gcaaccttaa tcgctttacc gccactggag tttgactatc agccctttga ggcaaaagtc 1200
acgcagaaat ggcaggaaat gcctcaattg gccggattga atgcccaaca accttaccaa 1260
ctcgtcgatc tctatggtga aggtatctcc ggcatcttgt atcaggacag acccggagca 1320
tggtggtatc aggcaccgat ccgtcagaaa aacgttgaag atattaacgc tgtcacctat 1380
agcccaataa accccttacc taagatcccc agccagcagg acagagcaac gttgatggat 1440
atcgacggtg atggacatct ggattgggtg atcgctggcg caggtattca ggggcggtac 1500
agtatgcagc cgaatggaga gtggacacac tttattccca tttctgcact gccaacagaa 1560
tattttcatc cacaggcaca actggcggat ctggtggggg ccgggttatc tgatttagcg 1620
ctgattggcc ccagaagtgt gcgtttatat gccaacgacc gaggaaactg gaaagcgggt 1680
attaatgtta tgccacctga tggtgtgaat ttgccgatat ttggtggtga tgccagcagt 1740
ctggtcgcat tttctgacat gttgggatcg ggacagcagc atttggtgga aattgccgct 1800
cagagcgtca aatgctggcc gaatctagga catggccgtt ttggtgcggc tattttgctg 1860
ccggggttta gccagccgaa tggaacattc aatgctaacc aagtttttct ggcagatatc 1920
gatggttccg gcaccgccga catcatctat gcacacagta cgtatctgga tatttacctg 1980
aacgaaagcg gcaaccgttt cagtgcaccc gttcggctta atttgccgga aggggtgatg 2040
tttgacaata cctgtcagtt acaggtgtcg gatattcaag gattgggcgc tgccagcatt 2100
gtactgaccg tacctcatat gacaccgcgc cattggcgtt atgattttac tcacaataaa 2160
ccttggctgc tcaatgtcat caacaacaat cgtggcgcag aaaccacgtt gttttaccgt 2220
agttctgccc aattctggct ggatgaaaaa agtcagatcg aagagctggg aaaatttgca 2280
gcgagttatc tgcctttccc catacatttg ttgtggcgca atgaggcgct ggatgaaatt 2340
actggtaatc gactgactaa ggtcatgaat tatgcccacg gtgcatggga tggcagagag 2400
agagaatttt gcggatttgg ccgtgtaacg caaattgata ccgacgaatt tgccaaggga 2460
accacagaga aagcgccgga tgaaaatatc tatccttccc gtagcataag ctggtttgcc 2520
acgggtttac cagaagtgga ttctcaactt ccggcagaat actggcgtgg tgacgatcag 2580
gcatttgccg gctttacacc gcgcttcact cgttatgaaa aaggtaatgc ggggcaagag 2640
gggcaggata ccccgattaa agaaccgacc gaaacagaag cgtattggct taaccgcgcc 2700
atgaaaggcc aattactgcg cagtgaagtc tatggtgacg acaaaacaga aaaagctaaa 2760
attccgtaca ccgtcacaga agctcgctgt caggtcagat taattcccag caatgacgaa 2820
gccgcgccgt cgtcttggac gtcgatcatt gaaaaccgca gttatcacta tgagcgtatc 2880
gtcgtcgatc cgagttgcaa acaacaggtc gtgctcaagg cggatgaata tggcttccca 2940
ctggcaaaag tagatatcgc ctatccacgg cgcaataaac cggcacagaa cccttatccg 3000
gattcgttac cggatactct gttcgccgat agctatgacg accagcaaaa acagttatat 3060
ctgacaaaac agcagcagag ctattaccac ctgacccagc aggatgattg ggttctgggt 3120
ttgacggata gccgatacag cgaagtttat cattatgcgc aaactgacgc tcaaagtgac 3180
atccccaagg cagggctgat attggaagac ctgctgaaag ttgacggcct gataggtaaa 3240
gacaagactt ttatctattt agggcagcag cgagtggctt atgtgggagg agatgcagaa 3300
aaaccgacac gtcaggtgcg ggtggcttat acagaaaccg ctgcttttga tgacaatgcg 3360
ctgcacgcct ttgatggcgt gattgcccct gatgaactga cgcaacagtt gctggcgggt 3420
ggatacctgc tcgtgccgca gatttctgat gtggcaggca gtagtgaaaa ggtatgggta 3480
gctcggcagg gatacaccga atacggcagt gctgctcaat tctaccggcc actcatccag 3540
cgcaaaagct tgctgaccgg aaaatatacc cttagttggg atacgcacta ttgtgtggtg 3600
gtaaaaaccg aagatggtgc gggaatgacc acgcaagcga agtacgatta ccgcttcctg 3660
cttccggcgc aattgacaga tateaatgac aaccagcaca tcgtgacatt taatgcattg 3720
gggcaggtga cttccagccg tttctggggc acagaaaatg gcaaaataag cggttactcg 3780
acgccggaga gtaaaccgtt cacagtaccc gataccgtcg aaaaagccct tgccttgcaa 3840
ccgacgatcc cggtttcaca gtgcaacatt tatgtgccgg atagttggat gcggcttctg 3900
ccccaacagt ctctgactgg ccagctaaaa gagggggaaa ctttgtggaa cgcattacac 3960
cgggcgggtg tagtaacgga agacggtttg atctgtgaac tggcctatcg tcgttggatc 4020
aaacgtcagg caacgtcttc aatgatggcc gtgacattac agcaaatctt ggctcagact 4080
ccacgacaac ctccgcatgc catgacgatc acgacagatc gttatgacag cgattctcag 4140
cagcaacttc ggcagtcgat agtattgagt gatggttttg gtcgcgtatt gcaaagcgcc 4200
cagcgtcatg aagcaggaga ggcatggcag cgtgcagaag atggttcttt ggttgtcgat 4260
aataccggta aacccgttgt tgctaatacc acaacgcgct gggcagtatc cggtcgcaca 4320
gaatacgacg gcaaagggca ggcgatcaga gcttacctgc cttattatct caatgattgg 4380
cgctatgtca gtgatgacag cgcccgggat gacctgtacg ccgataccca tttttacgat 4440
cctctggggc gtgaatatca ggtaaaaacc gcgaaaggat tttggcgtga aaacatgttt 4500
atgccgtggt ttgtcgtcaa tgaagatgaa aatgacacag cagcacgttt aacatcttaa 4560
ttaatgcggc cgcaggcctt aaagaagaga gagatatacc atgaatgttt ttaatccaac 4620
tttatatgcc ggtacaccga ctgtcaccgt catggacaat cgagggctgt cagtgcggga 4680
tattgcttat caccgtacaa cagcaggaga gcaggctgac actcgcatca cccgccatca 4740
atacagtccc cataattttt taatcgagag cattgatcca cgcctttttg atttgcaatc 4800
tcagagcacc ataaaaccta atttcaccta ctgtcctgcc ttgaagggtg atgtcctacg 4860
gacagagagt gtggatgccg gacaaactgt cattttgagt gacatcgaag gtcgtccgtt 4920
actgaatatc agtgcgatgg gtgtcgtcaa acactggcaa tatgaagaga gtacattgcc 4980
ggggcgcttg ctcgctgtca gtgaacggaa gaatgaggct tcaacacccc aaattattga 5040
acggtttatt tggtcgggaa atagcccatc agaaaaagat cacaatttgg cgggaaaata 5100
tcttcgtcat tatgataccg ccggattaaa ccagcttaat gctgtgtctc tgaccagcgt 5160
ggatctctca caatcccgtc agttattgca ggatgatgtc acagcagatt ggagcggaag 5220
tgacgaatcc cagtggaaga cgcgactgag taacgacata ttcacaaccg aaatcaccgc 5280
tgatgcggtt ggcaatttct tgactcagaa tgatgccaaa agcaaccagc aacgattgtc 5340
ctatgatgtg gcagggcagt taaaggcaag ctggctgacg ataaaaggcc agaatgagca 5400
ggtgatagtt aactccctga cttactccgc cgcagggcag aaactgcgtg aagagcaggg 5460
taacggcgtt gtcactgaat actcctatga agcacaaacc tggcgtttga taggtgtaac 5520
ggcttaccgt cagtcagata aaaaaagatt gcaggatctt gtctataact atgatccggt 5580
cggtaatctc ctgaatattc gcaataatgc agaggcaacc cgtttctggc gtaatcagat 5640
agtagaacca gagaaccact atgcttatga ctcgctttat caactcatca gtgctagtgg 5700
tcgagaaatc gccagtatcg gtcagcaggg cagccggctg cctgtaccga ttattcctct 5760
tcctgccaat gacgatgttt atactcgcta cacccgcaca tatcactatg atcgcggtgg 5820
aaatctctgc cagatccggc attgcgctcc tgctacagat aataagtaca ccacaaagat 5880
caccgtatcg aatcgtagta atcgtgcagt atgggatacc ttgaccacag atcccgccaa 5940
agtggatacc ctgtttgatc atggagggca tcaacttcaa ctccagtcag gccagacttt 6000
atgttggaac tatcggggtg aactacagca aataacaaag atacagcgtg acgaaaaacc 6060
cgcagataaa gagcggtatc gctatggtgt tggggctgcg cgggtcgtga aaatcagcac 6120
acagcaggcg gggggaagca gccatgtgca gcgtgttgtt tatctgccgg ggttggaact 6180
acgcacaact cagcatgatg cgacattaat cgaagactta caggtgatta tcatgggtga 6240
agcaggacgt gctcaggtac gcgtacttca ttgggaaata ccaccaccgg ataatcttaa 6300
caatgactca ctgcgttaca gctacgatag tttgatgggt tccagtcagc ttgaattgga 6360
tggagcaggg cagattatta cgcaggaaga atactacccc tatggaggta cagcaatatg 6420
ggcggcaaga aaccagaccg aagccaatta caaaaccatt cgctactccg gcaaagagcg 6480
tgatgcgacg gggctttatt actacgggca ccgttattat cagccgtggc tagggcgctg 6540
gttgagcgca gatcccgccg gaaccgtgga cggactgaat ctatatcgaa tggtgaggaa 6600
taacccgatt acttaccggg atgcagatgg gcttgcgccg ataggcgata agatcagcga 6660
agggatttat gagcctgagt tgcgagttgg tcttgaacga gatgacccaa atgtcagaga 6720
ttatgaccgg gtttatcctg atacggccaa gacagagatg atcgaagcaa ctgcgaccac 6780
aattgctccc agtcaaatgt tatcggcgca tgcttttgca tctgtaccta tattgacaga 6840
tttgtttaat cctcaaacag caaggctttc tcaaaagaca acggatattg tattaaacac 6900
acaaggtgga ggcgatttaa tctttactgg catgaatatt aaaggtaagg gaaaagaatt 6960
taatgcatta aaaatcgttg atacttatgg cggagaaatg cctgatagca aaaccgctat 7020
ttcagcatat tggcttccgc aaggtgggta tactgatatt ccgatacatc cgactggaat 7080
acaaaagtat ttgtttacgc ctgcgtttag tggttgcact ctggcagtag ataagcttaa 7140
cgaaaataca ttacgggcgt atcacgtcga aggaagtaag gaagatgctc aatataataa 7200
tttagcagtt gcagcgcacg gagagggttt ggtcatggct atggaatttc ctgactatgg 7260
atttcataca gacaaaacag ggcaaagact aaggaacaca cagggatttg cgtttatgtc 7320
ctacaatcaa tcccagaaaa aatgggaaat tcattatcaa aggcaagcat tgacatcaaa 7380
caccggtatc atgaatgtta gtgctaaaaa caagattcga ttgaatgccc ccagtcatgt 7440
aaaaaatagc tcaatcaaag gaactgaaat aatgacgaca catttttaat taatgcggcc 7500
gcctcgag 7508
<210>10
<211>7605
<212>DNA
<213>伯氏致病杆菌
<400>10
atgtataata cagaaaatat attaattagg cttaataggg aaagttccca ggaaccgatg 60
acattggctc atattatgcc aatctcattt tcagcattca ggaaagaagt caaagatacg 120
ctgaattggg gagaaagcca tcacctgtat ctagccgcca agaaagccga aaaagaaaac 180
aggatttttg aagcacgttt attatcccgc gccaatccgc agttaagggg ggctgttcgt 240
ctcggcattc agcaactctc gcagcggcaa agttacgata cgctatttgg cggtcggtcg 300
ggtaagtatg tattacccgg ctctgtcgcc tctatgttct caccggcagc ctacctgaca 360
gagctgtatc gggaatccag acacctgcat tcagaatcgt ccatctacca tcttgataag 420
cgtcgccctg atttgcaaag cataatgttg acgcaggaaa accaagatca aacactctcc 480
acacttgaac tatcaaatga cattctcttt gatggcataa aaaataagaa aaaactcaac 540
aaaaatgaag atgtactgaa aatgttgtct gattggcgtc tgagtggaaa tacgccttac 600
catcaaccct ttgaaaccct atctaacatt gtctcccagc ttgatcctca gctcagtcag 660
gttagtcagt cgccaaaagt gattggttta ttgtccccgg tcagcctatt ggggatatca 720
agtcaaattt caccagaact gtataaaatc ctgacggaag aaattacggc tgagaatgcg 780
caagacatgt ataagaaaaa tttcggtgac ttgccgattt ctgcactgtc taatcctaac 840
tatttgatga aatattatga tattgatgca gatactctcc gtgctgtaat gggtatctat 900
ggatcaggcc aaaacgatga tgaacccgca ttcatcagcg atcaggccat agtgacttac 960
cttgatgata aaaattcttt cgttacttac ctgattactc gcaccaaagg cgagacttat 1020
gactggcagg ttaattttat cgaagctatt cccacaaaag atggcaaatt aaaatattgg 1080
tataatttta aagctccggc ttccagtgca gtttccacca aaatttcgct gaatgggcag 1140
actatcttcg acagacctga ttggctgccg gagctcaata agacttattc agatatcgtt 1200
gatttcccca gtgatgttga tagaaaaaaa tttactctga aattcgaaag agcagcctct 1260
ggcagtggag gtagttttaa tacggatgcg acattctcaa ttgaaacggt attacctcaa 1320
ctctttttcc tcaaattgaa taaagttatt cgcctttaca aaaaaaccgg tatcacgctg 1380
gaacaaattg aaactgctgt ggattcagat aatgcccaac aacaaataac cgaaacaatt 1440
ctgaaaaaga tattttatac aacctactat attaataggt attatttgag tttcaatgat 1500
gcactggtgt tatgtaatac cgcaatatct cagcacagct ataatgatca gccttctcat 1560
tttgacctta tttttaataa cccgccattg aatggaaact attaccaatt gggcggggat 1620
aaaattcaag ttgatccaga tcaggcagat tatgaacaat ataatcaacg gcgtgaaatg 1680
ctcaagcacg cgttgaaagt taatgacagt gaattattca cactatctaa gattctggat 1740
caagaaaata cgtcaggtat cgacaataac cttgctacgg atttatctgc gctgtaccgc 1800
gtacgaatgc ttgcttacat tcaccaactt tctatcaatg aattggctat cctgctaaaa 1860
ctctcgccat atgctgaaga gtcttttaac aaaatcagta cggaaaagtt aatcgaagtc 1920
attgaatatc tttacagtat cacccagtgg ttacagacac agaaaatcag cgtttatacc 1980
ctgtatatga tgacgaccac aacctacagt acagttttat cacccgatat taacaatctg 2040
atcgagacgc tacgggcggg aatgcagaac aaaaccgtac cagacgatcc acttcaactt 2100
atcaagacct tggcaccctt cattgcagcg gcactgaaac tttcttcggc atttgtggct 2160
gagtcgatcc tgatatggat caacaagatc aaacccaatg gcatggatgt cgccgccttc 2220
tggaaatcca ttgagtctac aaaaaatccg atagaaccga acagcatggt attttgtcag 2280
gtgctggggc agttggcatt gatttattta gccacgcaac taacggaaaa tgctctgaat 2340
ctggcggtga caactaaagt gattatcggt cactccggca gcatcgatca tctgggcaaa 2400
gatactgaga cggtgagaca gcttagccgt tttgcgggat ggtgtaattc actgggcagc 2460
aatacagaca cagtactgac agctctgcaa agtaacaact tggatagcac tattctggcg 2520
agtgccatga ggatggatga gaggctgctt tcaaccgcca gtgaacaggc taaccttaat 2580
aaacaggttg cagaaaaaga taagtatgca gattggccag aaatagacag tgttctgcaa 2640
tggctagcag tggccaatgt gatgaaaacc agcccgaata agattaatgc tcttctgcaa 2700
ttggactatc tgaaagatca gaatactaca gaagtttctt acgaaacatg gagccaatcg 2760
gcggatatac tggcggctgg gctgaataat aatcaatcag atattctgaa acaagcctta 2820
gaggaagaag ccagtgccgc attaagccaa tattacatcc gtgaagttgt ggatagcgcg 2880
gctgaggtga tagatagaaa tgatctgtat ggttacctgc tgatagataa tcaaatctcc 2940
gcacaggtcg aaacgacacg gctggctgag gccattgcca gtatccagca atatatcaac 3000
cgtgcattga atggccgtga gagtacccct gccaccgatg tcatgacagg ccagttttat 3060
caggattggg atcgttataa caaacgctac agcacatggg cgggtgtttc cacgctggtt 3120
tactatcctg aaaactatat cgatccgacc atgcgtatcg gtcagaccca catgatggat 3180
gaattgctgc aatccgtcag ccagagtcaa ctcagtgttg ataccgttga agatgcgttt 3240
aaaacctatc tgacccgctt tgaacaaatt gccaacctga ctgtcgtcag tggctatcat 3300
gataatgtga acatttcaca agggaacagt taccttgtcg gtaaagggga aacggatgcc 3360
aaccaatatt attggcgcaa actggatcac agcaaatccc gtcagggcaa gattgccgcc 3420
aatgcgtgga gtgaatgggc aaaaattgac agcccggtca atccctatca gggcttaatt 3480
aagccggtta tctataaatc ccgcctgtat attgtctggc tggaaaaacg ggtgattact 3540
gtttcagaaa gcaaagacgg cgcaataaca tcgaaagata tcattaaata tgaaatcaaa 3600
atcgcccata tcagacatga tggcacatgg aatacgccta tcacgttaga tgtcagcgat 3660
atcttcagcg catataacga tacagacctg gccaatctgg ctatgtattg ctctgaatat 3720
acgggagaaa gtaccttact cttattactg tatgtcaaac aggctgatac ggcgggaaac 3780
aaagatctca caaagaccaa aacaaagggg atgtatattt attctgacat gaccacaaaa 3840
gtcatgattg atagtgagat caaaaattat caaaacagcg tttaccgaga gttcgataca 3900
ttgacccagc gacggttaaa taaccgttat gcggcgaatt atgattatcc gtcttccgtt 3960
gcagtcagta gtggttacga gtggggcgat tactctctga gtatggttta tgacagtaaa 4020
attgcttcca ttgctaccgt cggaactact tcatcagaga tcaaattgaa aatcgatgcc 4080
gacctgcggg taatttataa cggggttgaa ggcaggcagc gtcaccaatg tgctctgatg 4140
caaaaatttg gtcagttagg tgataaattt attgtttacg aagacctgaa aattgacaga 4200
gagaatcaga gtgcaggcaa taacaatctc ttttatcccg tttatcaata cagtggcaat 4260
gtcagtaaat tgtcaaaagg gcgtttatta gtttatagag aaagttcatc atcttatgtc 4320
aaggcagata ttgggccagg gcacgatccg ctcattaatg aaaatgctca aaaaccttat 4380
ggttatgttg aagacagcaa aaatgacccc gccgcgctga aaaataacat gacactgacg 4440
gataacgcgg gtatttctac gaaggtcgca tcaccaagag atatcgatac tgctgtaacg 4500
ccggcaaata tcacgattaa ggccagtgca ggcagcagta aacctgtaga gtttaacgct 4560
ggcacatctg tcataaatct gcccaacaac aacttggaag aaatgatcta taacttccat 4620
gatatggaat tcactatccc actgacagaa tttaaggaca accaagtcga ggtggagata 4680
gtgctgaccg ggaaaacgga tgatggccgg gttctgggaa gtgaaacctt taattttacc 4740
gttacacaga aaattctgaa tgaacagtca gggttgctga cgctcaatac tgctgcgtct 4800
aaagcccaat atctgcaatg ggggccttac cgtacccgca ttaatacctt atttgccaga 4860
aatatggtgg aacgggcaga aacgggcatt gataccctgc tgacaatgga tacccaacaa 4920
ctgcctgaac ctaaaatggg agatggggga tatattagtg tcaccttacc caaatatgat 4980
ccagataagc atggtagtac cagaaacgcc gcggtcacac tttatcagga aaaagatggt 5040
gtagactcaa caacgcatta cggcttctgg gacgggtcgt taacagatgc agaacaaacc 5100
atcaaactgt ttattccatt aactagcacg aaagaacctt tctataacac gattgatttt 5160
ccatcttcga taagtgacgg gcttcaagtt tttctaaaaa gcgctaagga aggtttgctg 5220
gccggaacct taaaaacagc gtttactcca tctgaggata agaaggccaa tattgtcttc 5280
acggaatata cccctgtttc gggtacgcca cccatgaagg ttgaactgct gtccaaatat 5340
tatgatcagc cgatggattt taacggcgcc aactccctct acttctggga attgttctat 5400
tacagcccga tgctggtagc gcagcgcttg ttgcaggaac aaaattttga tgaagccaat 5460
cattggctga aatatgttta cagccctgag ggctatatcg tcaaaggtga gattgcgccg 5520
tatcattgga attgccggcc actggaagaa gatacttcgt ggaactctaa cccgctggat 5580
tccacagacc ccgatgccgt cgcccaagat gatccgatgc actataaagt ttctaccttc 5640
atgcggatgc tcgatctgct gattgcccgt ggcgacaagg cttaccgcca gcttgagcgg 5700
gatactttga atgaagccaa gctctggtat atacaggcac tgaatctatt gggggatgag 5760
cagtttgtgg cgctggatgg caactggtct gaacccacgt tggaaaccgc agcggataag 5820
acggtggaac aggattatca gcatgcgctg atgttaattc gcctggtaca gcccgccgaa 5880
tataccgcta actcactgac caacctattt ttgcctcaac aaaatgacaa actgaatggc 5940
tactggcaaa cattgaagca gcgcttgtat aacctgcgtc ataacctcac cattgatggc 6000
ctgccgctgt cactgcctat ttacgccaaa cctgccgatc ctaaagcctt gttgagtgcg 6060
gcggtgaatg cttcccaggg aggcacggat ctgccaaatc cggaaatgcc acttcatcgt 6120
ttccccatca tgttggataa cgcgaagagc atagtcagtc aactcattca gtttggttct 6180
accttacagg ggatcattga acgtcaggat gcagaagcgc tcaacgaatt gctgcaaaat 6240
caagcgcgtg aactgacgct gatcagcatt cagatgcaga ataaaacgct ggaagaattg 6300
gatgcggaaa aagaagtact gaaacaatcc cgactagggg cgcaatcacg ctttgacagc 6360
tatagcaagc tgtacgatga aaacatcaac gatggcgaaa aaactgctat ggatttgcgt 6420
actgctgcca gcacgataag tactgccctg gaagccgcta aattggcaga ggccggtgcc 6480
gatatgttcc caaatatctt cggtcttgct ggtggtggca gccgatgggg ggctatccct 6540
ggcgcacttg cttctgtgat gggctttacc gccggcacac tcaatacgaa agccgaacga 6600
accacacagt ctgaaattta ccgccgccgc cgtcaggagt gggaaattca gcgcaccaat 6660
gcagatcatg aagttaagca aattgacgct caattgaaat cactggaaat ccggcgtgaa 6720
gcggcagaca tgcagaaaac ctatctggaa acccagcagg ctcagacaca ggcacaattg 6780
gaattcctgc aacgtaaatt cagtaacaga gcgttgtaca actggatgcg gggtcgtctg 6840
gccgccattt acttccagtt ctatgatctt gccacctctc gttgcctgat ggcacagcaa 6900
gcctaccagt gggaaaccaa tgatacagca gccagcttta tcaaatcggg ggcatggcag 6960
ggaacctatg ctggcctgct cgccggcgag tctctgatac tgaaccttgt ccagatggaa 7020
gatgccttca tgaaaaaaga tgaacgggca ttggaaatca cgcgtaccgt ttcgttggct 7080
gaggtttacc gttctctgcc tgatgccgat aaattcatac ttcctgacgc agttgctgat 7140
ttattgaact ccccggggaa atcattcggg aaagatcaga acacactaaa aattgagacg 7200
aatcaactgg aagcatccgt aaatctgtct ggtctcaaca tttggggaga ttacccggaa 7260
caacttggcg cggctcgtcg catcaaacaa gtgagtgttt ccctgcctgc cttgcttgga 7320
ccgtatcagg atgtacaggc catcttgagc tatagcggtg acatgaaggg cattcccaaa 7380
ggttgcagtg ctatcgcggt atccaatggc atgaatgaca gcgggcaatt ccagttggat 7440
ttcaatgaca ccaaatacct gccatttgaa gggatcaata ttccgaaaga taaagatcaa 7500
agtgcactgg tgctgagttt ccccaacgcg gacgctaaac agaaaacgat gttgctcagt 7560
ttgagcgaca tcattctgca cattcgctac accattcgca aataa 7605
<210>11
<211>2534
<212>PRT
<213>伯氏致病杆菌
<400>11
Met Tyr Asn Thr Glu Asn Ile Leu Ile Arg Leu Asn Arg Glu Ser Ser
1 5 10 15
Gln Glu Pro Met Thr Leu Ala His Ile Met Pro Ile Ser Phe Ser Ala
20 25 30
Phe Arg Lys Glu Val Lys Asp Thr Leu Asn Trp Gly Glu Ser His His
35 40 45
Leu Tyr Leu Ala Ala Lys Lys Ala Glu Lys Glu Asn Arg Ile Phe Glu
50 55 60
Ala Arg Leu Leu Ser Arg Ala Asn Pro Gln Leu Arg Gly Ala Val Arg
65 70 75 80
Leu Gly Ile Gln Gln Leu Ser Gln Arg Gln Ser Tyr Asp Thr Leu Phe
85 90 95
Gly Gly Arg Ser Gly Lys Tyr Val Leu Pro Gly Ser Val Ala Ser Met
100 105 110
Phe Ser Pro Ala Ala Tyr Leu Thr Glu Leu Tyr Arg Glu Ser Arg His
115 120 125
Leu His Ser Glu Ser Ser Ile Tyr His Leu Asp Lys Arg Arg Pro Asp
130 135 140
Leu Gln Ser Ile Met Leu Thr Gln Glu Asn Gln Asp Gln Thr Leu Ser
145 150 155 160
Thr Leu Glu Leu Ser Asn Asp Ile Leu Phe Asp Gly Ile Lys Asn Lys
165 170 175
Lys Lys Leu Asn Lys Asn Glu Asp Val Leu Lys Met Leu Ser Asp Trp
180 185 190
Arg Leu Ser Gly Asn Thr Pro Tyr His Gln Pro Phe Glu Thr Leu Ser
195 200 205
Asn Ile Val Ser Gln Leu Asp Pro Gln Leu Ser Gln Val Ser Gln Ser
210 215 220
Pro Lys Val Ile Gly Leu Leu Ser Pro Val Ser Leu Leu Gly Ile Ser
225 230 235 240
Ser Gln Ile Ser Pro Glu Leu Tyr Lys Ile Leu Thr Glu Glu Ile Thr
245 250 255
Ala Glu Asn Ala Gln Asp Met Tyr Lys Lys Asn Phe Gly Asp Leu Pro
260 265 270
Ile Ser Ala Leu Ser Asn Pro Asn Tyr Leu Met Lys Tyr Tyr Asp Ile
275 280 285
Asp Ala Asp Thr Leu Arg Ala Val Met Gly Ile Tyr Gly Ser Gly Gln
290 295 300
Asn Asp Asp Glu Pro Ala Phe Ile Ser Asp Gln Ala Ile Val Thr Tyr
305 310 315 320
Leu Asp Asp Lys Asn Ser Phe Val Thr Tyr Leu Ile Thr Arg Thr Lys
325 330 335
Gly Glu Thr Tyr Asp Trp Gln Val Asn Phe Ile Glu Ala Ile Pro Thr
340 345 350
Lys Asp Gly Lys Leu Lys Tyr Trp Tyr Asn Phe Lys Ala Pro Ala Ser
355 360 365
Ser Ala Val Ser Thr Lys Ile Ser Leu Asn Gly Gln Thr Ile Phe Asp
370 375 380
Arg Pro Asp Trp Leu Pro Glu Leu Asn Lys Thr Tyr Ser Asp Ile Val
385 390 395 400
Asp Phe Pro Ser Asp Val Asp Arg Lys Lys Phe Thr Leu Lys Phe Glu
405 410 415
Arg Ala Ala Ser Gly Ser Gly Gly Ser Phe Asn Thr Asp Ala Thr Phe
420 425 430
Ser Ile Glu Thr Val Leu Pro Gln Leu Phe Phe Leu Lys Leu Asn Lys
435 440 445
Val Ile Arg Leu Tyr Lys Lys Thr Gly Ile Thr Leu Glu Gln Ile Glu
450 455 460
Thr Ala Val Asp Ser Asp Asn Ala Gln Gln Gln Ile Thr Glu Thr Ile
465 470 475 480
Leu Lys Lys Ile Phe Tyr Thr Thr Tyr Tyr Ile Asn Arg Tyr Tyr Leu
485 490 495
Ser Phe Asn Asp Ala Leu Val Leu Cys Asn Thr Ala Ile Ser Gln His
500 505 510
Ser Tyr Asn Asp Gln Pro Ser His Phe Asp Leu Ile Phe Asn Asn Pro
515 520 525
Pro Leu Asn Gly Asn Tyr Tyr Gln Leu Gly Gly Asp Lys Ile Gln Val
530 535 540
Asp Pro Asp Gln Ala Asp Tyr Glu Gln Tyr Asn Gln Arg Arg Glu Met
545 550 555 560
Leu Lys His Ala Leu Lys Val Asn Asp Ser Glu Leu Phe Thr Leu Ser
565 570 575
Lys Ile Leu Asp Gln Glu Asn Thr Ser Gly Ile Asp Asn Asn Leu Ala
580 585 590
Thr Asp Leu Ser Ala Leu Tyr Arg Val Arg Met Leu Ala Tyr Ile His
595 600 605
Gln Leu Ser Ile Asn Glu Leu Ala Ile Leu Leu Lys Leu Ser Pro Tyr
610 615 620
Ala Glu Glu Ser Phe Asn Lys Ile Ser Thr Glu Lys Leu Ile Glu Val
625 630 635 640
Ile Glu Tyr Leu Tyr Ser Ile Thr Gln Trp Leu Gln Thr Gln Lys Ile
645 650 655
Ser Val Tyr Thr Leu Tyr Met Met Thr Thr Thr Thr Tyr Ser Thr Val
660 665 670
Leu Ser Pro Asp Ile Asn Asn Leu Ile Glu Thr Leu Arg Ala Gly Met
675 680 685
Gln Asn Lys Thr Val Pro Asp Asp Pro Leu Gln Leu Tle Lys Thr Leu
690 695 700
Ala Pro Phe Ile Ala Ala Ala Leu Lys Leu Ser Ser Ala Phe Val Ala
705 710 715 720
Glu Ser Ile Leu Ile Trp Ile Asn Lys Ile Lys Pro Asn Gly Met Asp
725 730 735
Val Ala Ala Phe Trp Lys Ser Ile Glu Ser Thr Lys Asn Pro Ile Glu
740 745 750
Pro Asn Ser Met Val Phe Cys Gln Val Leu Gly Gln Leu Ala Leu Ile
755 760 765
Tyr Leu Ala Thr Gln Leu Thr Glu Asn Ala Leu Asn Leu Ala Val Thr
770 775 780
Thr Lys Val Ile Ile Gly His Ser Gly Ser Ile Asp His Leu Gly Lys
785 790 795 800
Asp Thr Glu Thr Val Arg Gln Leu Ser Arg Phe Ala Gly Trp Cys Asn
805 810 815
Ser Leu Gly Ser Asn Thr Asp Thr Val Leu Thr Ala Leu Gln Ser Asn
820 825 830
Asn Leu Asp Ser Thr Ile Leu Ala Ser Ala Met Arg Met Asp Glu Arg
835 840 845
Leu Leu Ser Thr Ala Ser Glu Gln Ala Asn Leu Asn Lys Gln Val Ala
850 855 860
Glu Lys Asp Lys Tyr Ala Asp Trp Pro Glu Ile Asp Ser Val Leu Gln
865 870 875 880
Trp Leu Ala Val Ala Asn Val Met Lys Thr Ser Pro Asn Lys Ile Asn
885 890 895
Ala Leu Leu Gln Leu Asp Tyr Leu Lys Asp Gln Asn Thr Thr Glu Val
900 905 910
Ser Tyr Glu Thr Trp Ser Gln Ser Ala Asp Ile Leu Ala Ala Gly Leu
915 920 925
Asn Asn Asn Gln Ser Asp Ile Leu Lys Gln Ala Leu Glu Glu Glu Ala
930 935 940
Ser Ala Ala Leu Ser Gln Tyr Tyr Ile Arg Glu Val Val Asp Ser Ala
945 950 955 960
Ala Glu Val Ile Asp Arg Asn Asp Leu Tyr Gly Tyr Leu Leu Ile Asp
965 970 975
Asn Gln Ile Ser Ala Gln Val Glu Thr Thr Arg Leu Ala Glu Ala Ile
980 985 990
Ala Ser Ile Gln Gln Tyr Ile Asn Arg Ala Leu Asn Gly Arg Glu Ser
995 1000 1005
Thr Pro Ala Thr Asp Val Met Thr Gly Gln Phe Tyr Gln Asp Trp
1010 1015 1020
Asp Arg Tyr Asn Lys Arg Tyr Ser Thr Trp Ala Gly Val Ser Thr
1025 1030 1035
Leu Val Tyr Tyr Pro Glu Asn Tyr Ile Asp Pro Thr Met Arg Ile
1040 1045 1050
Gly Gln Thr His Met Met Asp Glu Leu Leu Gln Ser Val Ser Gln
1055 1060 1065
Ser Gln Leu Ser Val Asp Thr Val Glu Asp Ala Phe Lys Thr Tyr
1070 1075 1080
Leu Thr Arg Phe Glu Gln Ile Ala Asn Leu Thr Val Val Ser Gly
1085 1090 1095
Tyr His Asp Asn Val Asn Ile Ser Gln Gly Asn Ser Tyr Leu Val
1100 1105 1110
Gly Lys Gly Glu Thr Asp Ala Asn Gln Tyr Tyr Trp Arg Lys Leu
1115 1120 1125
Asp His Ser Lys Ser Arg Gln Gly Lys Ile Ala Ala Asn Ala Trp
1130 1135 1140
Ser Glu Trp Ala Lys Ile Asp Ser Pro Val Asn Pro Tyr Gln Gly
1145 1150 1155
Leu Ile Lys Pro Val Ile Tyr Lys Ser Arg Leu Tyr Ile Val Trp
1160 1165 1170
Leu Glu Lys Arg Val Ile Thr Val Ser Glu Ser Lys Asp Gly Ala
1175 1180 1185
Ile Thr Ser Lys Asp Ile Ile Lys Tyr Glu Ile Lys Ile Ala His
1190 1195 1200
Ile Arg His Asp Gly Thr Trp Asn Thr Pro Ile Thr Leu Asp Val
1205 1210 1215
Ser Asp Ile Phe Ser Ala Tyr Asn Asp Thr Asp Leu Ala Asn Leu
1220 1225 1230
Ala Met Tyr Cys Ser Glu Tyr Thr Gly Glu Ser Thr Leu Leu Leu
1235 1240 1245
Leu Leu Tyr Val Lys Gln Ala Asp Thr Ala Gly Asn Lys Asp Leu
1250 1255 1260
Thr Lys Thr Lys Thr Lys Gly Met Tyr Ile Tyr Ser Asp Met Thr
1265 1270 1275
Thr Lys Val Met Ile Asp Ser Glu Ile Lys Asn Tyr Gln Asn Ser
1280 1285 1290
Val Tyr Arg Glu Phe Asp Thr Leu Thr Gln Arg Arg Leu Asn Asn
1295 1300 1305
Arg Tyr Ala Ala Asn Tyr Asp Tyr Pro Ser Ser Val Ala Val Ser
1310 1315 1320
Ser Gly Tyr Glu Trp Gly Asp Tyr Ser Leu Ser Met Val Tyr Asp
1325 1330 1335
Ser Lys Ile Ala Ser Ile Ala Thr Val Gly Thr Thr Ser Ser Glu
1340 1345 1350
Ile Lys Leu Lys Ile Asp Ala Asp Leu Arg Val Ile Tyr Asn Gly
1355 1360 1365
Val Glu Gly Arg Gln Arg His Gln Cys Ala Leu Met Gln Lys Phe
1370 1375 1380
Gly Gln Leu Gly Asp Lys Phe Ile Val Tyr Glu Asp Leu Lys Ile
1385 1390 1395
Asp Arg Glu Asn Gln Ser Ala Gly Asn Asn Asn Leu Phe Tyr Pro
1400 1405 1410
Val Tyr Gln Tyr Ser Gly Asn Val Ser Lys Leu Ser Lys Gly Arg
1415 1420 1425
Leu Leu Val Tyr Arg Glu Ser Ser Ser Ser Tyr Val Lys Ala Asp
1430 1435 1440
Ile Gly Pro Gly His Asp Pro Leu Ile Asn Glu Asn Ala Gln Lys
1445 1450 1455
Pro Tyr Gly Tyr Val Glu Asp Ser Lys Asn Asp Pro Ala Ala Leu
1460 1465 1470
Lys Asn Asn Met Thr Leu Thr Asp Asn Ala Gly Ile Ser Thr Lys
1475 1480 1485
Val Ala Ser Pro Arg Asp Ile Asp Thr Ala Val Thr Pro Ala Asn
1490 1495 1500
Ile Thr Ile Lys Ala Ser Ala Gly Ser Ser Lys Pro Val Glu Phe
1505 1510 1515
Asn Ala Gly Thr Ser Val Ile Asn Leu Pro Asn Asn Asn Leu Glu
1520 1525 1530
Glu Met Ile Tyr Asn Phe His Asp Met Glu Phe Thr Ile Pro Leu
1535 1540 1545
Thr Glu Phe Lys Asp Asn Gln Val Glu Val Glu Ile Val Leu Thr
1550 1555 1560
Gly Lys Thr Asp Asp Gly Arg Val Leu Gly Ser Glu Thr Phe Asn
1565 1570 1575
Phe Thr Val Thr Gln Lys Ile Leu Asn Glu Gln Ser Gly Leu Leu
1580 1585 1590
Thr Leu Asn Thr Ala Ala Ser Lys Ala Gln Tyr Leu Gln Trp Gly
1595 1600 1605
Pro Tyr Arg Thr Arg Ile Asn Thr Leu Phe Ala Arg Asn Met Val
1610 1615 1620
Glu Arg Ala Glu Thr Gly Ile Asp Thr Leu Leu Thr Met Asp Thr
1625 1630 1635
Gln Gln Leu Pro Glu Pro Lys Met Gly Asp Gly Gly Tyr Ile Ser
1640 1645 1650
Val Thr Leu Pro Lys Tyr Asp Pro Asp Lys His Gly Ser Thr Arg
1655 1660 1665
Asn Ala Ala Val Thr Leu Tyr Gln Glu Lys Asp Gly Val Asp Ser
1670 1675 1680
Thr Thr His Tyr Gly Phe Trp Asp Gly Ser Leu Thr Asp Ala Glu
1685 1690 1695
Gln Thr Ile Lys Leu Phe Ile Pro Leu Thr Ser Thr Lys Glu Pro
1700 1705 1710
Phe Tyr Asn Thr Tle Asp Phe Pro Ser Ser Ile Ser Asp Gly Leu
1715 1720 1725
Gln Val Phe Leu Lys Ser Ala Lys Glu Gly Leu Leu Ala Gly Thr
1730 1735 1740
Leu Lys Thr Ala Phe Thr Pro Ser Glu Asp Lys Lys Ala Asn Ile
1745 1750 1755
Val Phe Thr Glu Tyr Thr Pro Val Ser Gly Thr Pro Pro Met Lys
1760 1765 1770
Val Glu Leu Leu Ser Lys Tyr Tyr Asp Gln Pro Met Asp Phe Asn
1775 1780 1785
Gly Ala Asn Ser Leu Tyr Phe Trp Glu Leu Phe Tyr Tyr Ser Pro
1790 1795 1800
Met Leu Val Ala Gln Arg Leu Leu Gln Glu Gln Asn Phe Asp Glu
1805 1810 1815
Ala Asn His Trp Leu Lys Tyr Val Tyr Ser Pro Glu Gly Tyr Ile
L820 1825 1830
Val Lys Gly Glu Ile Ala Pro Tyr His Trp Asn Cys Arg Pro Leu
1835 1840 1845
Glu Glu Asp Thr Ser Trp Asn Ser Asn Pro Leu Asp Ser Thr Asp
1850 1855 1860
Pro Asp Ala Val Ala Gln Asp Asp Pro Met His Tyr Lys Val Ser
1865 1870 1875
Thr Phe Met Arg Met Leu Asp Leu Leu Ile Ala Arg Gly Asp Lys
1880 1885 1890
Ala Tyr Arg Gln Leu Glu Arg Asp Thr Leu Asn Glu Ala Lys Leu
1895 1900 1905
Trp Tyr Ile Gln Ala Leu Asn Leu Leu Gly Asp Glu Gln Phe Val
1910 1915 1920
Ala Leu Asp Gly Asn Trp Ser Glu Pro Thr Leu Glu Thr Ala Ala
1925 1930 1935
Asp Lys Thr Val Glu Gln Asp Tyr Gln His Ala Leu Met Leu Ile
1940 1945 1950
Arg Leu Val Gln Pro Ala Glu Tyr Thr Ala Asn Ser Leu Thr Asn
1955 1960 1965
Leu Phe Leu Pro Gln Gln Asn Asp Lys Leu Asn Gly Tyr Trp Gln
1970 1975 1980
Thr Leu Lys Gln Arg Leu Tyr Asn Leu Arg His Asn Leu Thr Ile
1985 1990 1995
Asp Gly Leu Pro Leu Ser Leu Pro Ile Tyr Ala Lys Pro Ala Asp
2000 2005 2010
Pro Lys Ala Leu Leu Ser Ala Ala Val Asn Ala Ser Gln Gly Gly
2015 2020 2025
Thr Asp Leu Pro Asn Pro Glu Met Pro Leu His Arg Phe Pro Ile
2030 2035 2040
Met Leu Asp Asn Ala Lys Ser Ile Val Ser Gln Leu Ile Gln Phe
2045 2050 2055
Gly Ser Thr Leu Gln Gly Ile Ile Glu Arg Gln Asp Ala Glu Ala
2060 2065 2070
Leu Asn Glu Leu Leu Gln Asn Gln Ala Arg Glu Leu Thr Leu Ile
2075 2080 2085
Ser Ile Gln Met Gln Asn Lys Thr Leu Glu Glu Leu Asp Ala Glu
2090 2095 2100
Lys Glu Val Leu Lys Gln Ser Arg Leu Gly Ala Gln Ser Arg Phe
2105 2110 2115
Asp Ser Tyr Ser Lys Leu Tyr Asp Glu Asn Ile Asn Asp Gly Glu
2120 2125 2130
Lys Thr Ala Met Asp Leu Arg Thr Ala Ala Ser Thr Ile Ser Thr
2135 2140 2145
Ala Leu Glu Ala Ala Lys Leu Ala Glu Ala Gly Ala Asp Met Phe
2150 2155 2160
Pro Asn Ile Phe Gly Leu Ala Gly Gly Gly Ser Arg Trp Gly Ala
2165 2170 2175
Ile Pro Gly Ala Leu Ala Ser Val Met Gly Phe Thr Ala Gly Thr
2180 2185 2190
Leu Asn Thr Lys Ala Glu Arg Thr Thr Gln Ser Glu Ile Tyr Arg
2195 2200 2205
Arg Arg Arg Gln Glu Trp Glu Ile Gln Arg Thr Asn Ala Asp His
2210 2215 2220
Glu Val Lys Gln Ile Asp Ala Gln Leu Lys Ser Leu Glu Ile Arg
2225 2230 2235
Arg Glu Ala Ala Asp Met Gln Lys Thr Tyr Leu Glu Thr Gln Gln
2240 2245 2250
Ala Gln Thr Gln Ala Gln Leu Glu Phe Leu Gln Arg Lys Phe Ser
2255 2260 2265
Asn Arg Ala Leu Tyr Asn Trp Met Arg Gly Arg Leu Ala Ala Ile
2270 2275 2280
Tyr Phe Gln Phe Tyr Asp Leu Ala Thr Ser Arg Cys Leu Met Ala
2285 2290 2295
Gln Gln Ala Tyr Gln Trp Glu Thr Asn Asp Thr Ala Ala Ser Phe
2300 2305 2310
Ile Lys Ser Gly Ala Trp Gln Gly Thr Tyr Ala Gly Leu Leu Ala
2315 2320 2325
Gly Glu Ser Leu Ile Leu Asn Leu Val Gln Met Glu Asp Ala Phe
2330 2335 2340
Met Lys Lys Asp Glu Arg Ala Leu Glu Ile Thr Arg Thr Val Ser
2345 2350 2355
Leu Ala Glu Val Tyr Arg Ser Leu Pro Asp Ala Asp Lys Phe Ile
2360 2365 2370
Leu Pro Asp Ala Val Ala Asp Leu Leu Asn Ser Pro Gly Lys Ser
2375 2380 2385
Phe Gly Lys Asp Gln Asn Thr Leu Lys Ile Glu Thr Asn Gln Leu
2390 2395 2400
Glu Ala Ser Val Asn Leu Ser Gly Leu Asn Ile Trp Gly Asp Tyr
2405 2410 2415
Pro Glu Gln Leu Gly Ala Ala Arg Arg Ile Lys Gln Val Ser Val
2420 2425 2430
Ser Leu Pro Ala Leu Leu Gly Pro Tyr Gln Asp Val Gln Ala Ile
2435 2440 2445
Leu Ser Tyr Ser Gly Asp Met Lys Gly Ile Pro Lys Gly Cys Ser
2450 2455 2460
Ala Ile Ala Val Ser Asn Gly Met Asn Asp Ser Gly Gln Phe Gln
2465 2470 2475
Leu Asp Phe Asn Asp Thr Lys Tyr Leu Pro Phe Glu Gly Ile Asn
2480 2485 2490
Ile Pro Lys Asp Lys Asp Gln Ser Ala Leu Val Leu Ser Phe Pro
2495 2500 2505
Asn Ala Asp Ala Lys Gln Lys Thr Met Leu Leu Ser Leu Ser Asp
2510 2515 2520
Ile Ile Leu His Ile Arg Tyr Thr Ile Arg Lys
2525 2530
Claims (14)
1.具有针对害虫的毒素活性的分离蛋白质,其中编码所述蛋白质的核酸分子与选自SEQ ID NO:1、SEQ ID NO:3和SEQ ID NO:5的核酸探针的完全互补物在严格条件下杂交。
2.权利要求1的蛋白质,其中所述核酸分子与包含SEQ ID NO:10的核酸探针杂交。
3.权利要求1的蛋白质,其中所述蛋白质包含选自SEQ ID NO:2、SEQ ID NO:4和SEQ ID NO:6的氨基酸序列。
4.权利要求3的蛋白质,其包含SEQ ID NO:11。
5.分离的多核苷酸,其编码权利要求1的蛋白质。
6.含有权利要求5的多核苷酸的植物细胞。
7.含有权利要求6的细胞的转基因植物。
8.权利要求6的细胞,其中所述细胞在种子中。
9.纯化的细菌细胞培养物,所述细胞含有权利要求5的多核苷酸。
10.抑制昆虫的方法,其中所述方法包括提供权利要求1的蛋白质供所述昆虫摄食。
11.权利要求10的方法,其中所述蛋白质由植物产生并存在于植物中。
12.使用权利要求1的蛋白质产生的抗体。
13.权利要求12的抗体,其中所述抗体为多克隆抗体。
14.权利要求12的抗体,其中所述抗体为单克隆抗体。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US53489304P | 2004-01-07 | 2004-01-07 | |
US60/534,893 | 2004-01-07 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN1918296A true CN1918296A (zh) | 2007-02-21 |
Family
ID=34794332
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2004800420040A Pending CN1918296A (zh) | 2004-01-07 | 2004-12-23 | 来自伯氏致病杆菌的毒素复合物蛋白和基因 |
Country Status (9)
Country | Link |
---|---|
US (3) | US7285632B2 (zh) |
EP (1) | EP1706491A4 (zh) |
CN (1) | CN1918296A (zh) |
AR (1) | AR047374A1 (zh) |
AU (1) | AU2004313474A1 (zh) |
BR (1) | BRPI0418372A (zh) |
CA (1) | CA2551126A1 (zh) |
MX (1) | MXPA06007784A (zh) |
WO (1) | WO2005067491A2 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116875499A (zh) * | 2023-07-06 | 2023-10-13 | 吉林省农业科学院 | 一种昆虫病原线虫共生菌及其代谢产物的应用 |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7629444B1 (en) | 2004-06-15 | 2009-12-08 | Monsanto Technology Llc | Nucleotide and amino acid sequences from Xenorhabdus bovienii strain Xs85831 and uses thereof |
AR052064A1 (es) * | 2004-12-22 | 2007-02-28 | Dow Agrosciences Llc | Complejo de toxina de xenorhabdus |
DE102009025119B4 (de) | 2009-06-11 | 2011-07-21 | Leibnitz-Institut für Meereswissenschaften, 24148 | Antitumorale, antibiotische und insektizide Cyclodepsipeptide, deren Herstellung und Verwendungen |
EP2618668B1 (en) | 2010-09-20 | 2016-09-14 | Wisconsin Alumni Research Foundation | Mosquitocidal xenorhabdus, lipopeptide and methods |
GB2505448A (en) * | 2012-08-30 | 2014-03-05 | Univ Jw Goethe Frankfurt Main | Anti-infective peptides comprising a C terminal amine group |
CN114302648A (zh) * | 2019-06-05 | 2022-04-08 | 先正达农作物保护股份公司 | 对灰翅夜蛾属的控制 |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6943282B1 (en) * | 1983-09-26 | 2005-09-13 | Mycogen Plant Science, Inc. | Insect resistant plants |
US6281413B1 (en) * | 1998-02-20 | 2001-08-28 | Syngenta Participations Ag | Insecticidal toxins from Photorhabdus luminescens and nucleic acid sequences coding therefor |
WO1999042589A2 (en) * | 1998-02-20 | 1999-08-26 | Novartis Pharma Ag. | Insecticidal toxins from photorhabdus |
US7491698B2 (en) | 2003-01-21 | 2009-02-17 | Dow Agrosciences Llc | Mixing and matching TC proteins for pest control |
CA2558656A1 (en) | 2004-03-02 | 2005-09-15 | Dow Agrosciences Llc | Insecticidal toxin complex fusion proteins |
-
2004
- 2004-12-23 CN CNA2004800420040A patent/CN1918296A/zh active Pending
- 2004-12-23 CA CA002551126A patent/CA2551126A1/en not_active Abandoned
- 2004-12-23 EP EP04815116A patent/EP1706491A4/en not_active Withdrawn
- 2004-12-23 MX MXPA06007784A patent/MXPA06007784A/es not_active Application Discontinuation
- 2004-12-23 BR BRPI0418372-0A patent/BRPI0418372A/pt not_active IP Right Cessation
- 2004-12-23 AU AU2004313474A patent/AU2004313474A1/en not_active Abandoned
- 2004-12-23 WO PCT/US2004/043000 patent/WO2005067491A2/en active Application Filing
- 2004-12-23 US US11/020,848 patent/US7285632B2/en active Active
-
2005
- 2005-01-07 AR ARP050100059A patent/AR047374A1/es unknown
-
2007
- 2007-09-18 US US11/857,073 patent/US7585944B2/en not_active Expired - Fee Related
-
2009
- 2009-09-03 US US12/553,662 patent/US7795395B2/en not_active Expired - Fee Related
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116875499A (zh) * | 2023-07-06 | 2023-10-13 | 吉林省农业科学院 | 一种昆虫病原线虫共生菌及其代谢产物的应用 |
CN116875499B (zh) * | 2023-07-06 | 2024-04-09 | 吉林省农业科学院 | 一种昆虫病原线虫共生菌及其代谢产物的应用 |
Also Published As
Publication number | Publication date |
---|---|
US20100004177A1 (en) | 2010-01-07 |
WO2005067491A3 (en) | 2006-05-18 |
CA2551126A1 (en) | 2005-07-28 |
US7795395B2 (en) | 2010-09-14 |
US20080058248A1 (en) | 2008-03-06 |
EP1706491A2 (en) | 2006-10-04 |
US7285632B2 (en) | 2007-10-23 |
US20050155104A1 (en) | 2005-07-14 |
WO2005067491A2 (en) | 2005-07-28 |
MXPA06007784A (es) | 2008-02-13 |
US7585944B2 (en) | 2009-09-08 |
BRPI0418372A (pt) | 2007-05-22 |
AR047374A1 (es) | 2006-01-18 |
AU2004313474A1 (en) | 2005-07-28 |
EP1706491A4 (en) | 2007-02-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1751125A (zh) | 混合并匹配tc蛋白质用于病虫害防治 | |
CN1234867C (zh) | 苏云金芽孢杆菌杀昆虫蛋白质 | |
CN1255539C (zh) | 新杀虫蛋白质和菌株 | |
CN1849397A (zh) | 苏云金芽孢杆菌分泌的杀虫蛋白及其应用 | |
CN1245513C (zh) | 对鞘翅类昆虫和栉头蚤属种昆虫有毒的苏云金芽孢杆菌CryET29组合物 | |
CN112175094B (zh) | 对鳞翅目害虫具有毒性或抑制性的嵌合杀昆虫蛋白质 | |
CN1527663A (zh) | 新的杀虫毒素 | |
CN1119877A (zh) | 新杀虫蛋白和菌株 | |
CN1314911A (zh) | 用dna改组优化害虫抗性基因 | |
CN1761753A (zh) | δ-内毒素基因及其使用方法 | |
CN1332800A (zh) | 转化植物以表达苏云金芽孢杆菌δ-内毒素的方法 | |
CN1256712A (zh) | 植物病害控制 | |
CN1390259A (zh) | 对鳞翅目昆虫有活性的苏云金芽孢杆菌δ内毒素组合物及其使用方法 | |
CN1334874A (zh) | Cry3B杀虫蛋白在植物中的提高表达 | |
CN1414973A (zh) | 苏云金芽孢杆菌(bacillus thur_ingiensis)的杀虫蛋白 | |
CN1708588A (zh) | Cot102杀虫棉花 | |
CN1073717A (zh) | 编码杀虫剂蛋白的基因,用该基因转化的禾本科植物及其制备方法 | |
CN1296482C (zh) | 杀虫毒素和编码这些毒素的核苷酸序列 | |
CN1642977A (zh) | 新的苏云金芽孢杆菌杀昆虫蛋白质 | |
CN1360632A (zh) | 鞘翅目毒性多肽组合物和抗虫转基因植物 | |
CN1942582A (zh) | 新的苏云金芽孢杆菌晶体多肽、多核苷酸及其组合物 | |
CN1646006A (zh) | 新的vip3毒素和应用的方法 | |
CN1408023A (zh) | 杀虫蛋白 | |
CN1163146C (zh) | 控制昆虫虫害的方法 | |
CN1625562A (zh) | CRY1EA和CRY1CA的嵌合型δ-内毒素蛋白 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Open date: 20070221 |