CN112469813A - 用于产生甲基丙烯酸酯的方法 - Google Patents
用于产生甲基丙烯酸酯的方法 Download PDFInfo
- Publication number
- CN112469813A CN112469813A CN201980049273.6A CN201980049273A CN112469813A CN 112469813 A CN112469813 A CN 112469813A CN 201980049273 A CN201980049273 A CN 201980049273A CN 112469813 A CN112469813 A CN 112469813A
- Authority
- CN
- China
- Prior art keywords
- leu
- ala
- val
- gly
- glu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000004519 manufacturing process Methods 0.000 title claims abstract description 26
- 125000005397 methacrylic acid ester group Chemical group 0.000 title description 4
- 244000005700 microbiome Species 0.000 claims abstract description 216
- 230000035772 mutation Effects 0.000 claims description 172
- 150000007523 nucleic acids Chemical class 0.000 claims description 169
- 241000588724 Escherichia coli Species 0.000 claims description 133
- 101150079343 acrR gene Proteins 0.000 claims description 120
- 108090000623 proteins and genes Proteins 0.000 claims description 114
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 105
- 238000000034 method Methods 0.000 claims description 85
- 101150059149 rob gene Proteins 0.000 claims description 81
- 102000004169 proteins and genes Human genes 0.000 claims description 68
- CERQOIWHTDAKMF-UHFFFAOYSA-N Methacrylic acid Chemical compound CC(=C)C(O)=O CERQOIWHTDAKMF-UHFFFAOYSA-N 0.000 claims description 64
- 150000001413 amino acids Chemical class 0.000 claims description 53
- 101150114554 marR gene Proteins 0.000 claims description 52
- 102000039446 nucleic acids Human genes 0.000 claims description 45
- 108020004707 nucleic acids Proteins 0.000 claims description 45
- 150000002148 esters Chemical class 0.000 claims description 44
- 238000006467 substitution reaction Methods 0.000 claims description 43
- 241000186216 Corynebacterium Species 0.000 claims description 36
- 101150100021 soxR gene Proteins 0.000 claims description 36
- 101100078107 Bacillus sp. (strain B-0618) soxA gene Proteins 0.000 claims description 35
- 101100247001 Mus musculus Qsox1 gene Proteins 0.000 claims description 35
- 230000004568 DNA-binding Effects 0.000 claims description 33
- 238000000855 fermentation Methods 0.000 claims description 32
- 230000004151 fermentation Effects 0.000 claims description 32
- 230000001580 bacterial effect Effects 0.000 claims description 26
- 241000894006 Bacteria Species 0.000 claims description 25
- 230000011506 response to oxidative stress Effects 0.000 claims description 24
- 230000036457 multidrug resistance Effects 0.000 claims description 23
- 102000004190 Enzymes Human genes 0.000 claims description 22
- 108090000790 Enzymes Proteins 0.000 claims description 22
- CERQOIWHTDAKMF-UHFFFAOYSA-M Methacrylate Chemical compound CC(=C)C([O-])=O CERQOIWHTDAKMF-UHFFFAOYSA-M 0.000 claims description 20
- 231100000221 frame shift mutation induction Toxicity 0.000 claims description 20
- 230000037433 frameshift Effects 0.000 claims description 20
- 241000193403 Clostridium Species 0.000 claims description 18
- 241000588807 Bordetella Species 0.000 claims description 16
- 241000588722 Escherichia Species 0.000 claims description 16
- 230000008569 process Effects 0.000 claims description 16
- 238000012217 deletion Methods 0.000 claims description 14
- 230000037430 deletion Effects 0.000 claims description 14
- 108020001756 ligand binding domains Proteins 0.000 claims description 11
- 241000607142 Salmonella Species 0.000 claims description 10
- 210000004899 c-terminal region Anatomy 0.000 claims description 10
- 241000193830 Bacillus <bacterium> Species 0.000 claims description 9
- 241000606161 Chlamydia Species 0.000 claims description 9
- 241000589180 Rhizobium Species 0.000 claims description 9
- 102000008300 Mutant Proteins Human genes 0.000 claims description 8
- 108010021466 Mutant Proteins Proteins 0.000 claims description 8
- 241000589516 Pseudomonas Species 0.000 claims description 8
- 241000588698 Erwinia Species 0.000 claims description 7
- 241000588748 Klebsiella Species 0.000 claims description 7
- 241000607720 Serratia Species 0.000 claims description 7
- 235000003869 genetically modified organism Nutrition 0.000 claims description 7
- 239000013598 vector Substances 0.000 claims description 7
- 241000589220 Acetobacter Species 0.000 claims description 6
- 241000589291 Acinetobacter Species 0.000 claims description 6
- 241000589158 Agrobacterium Species 0.000 claims description 6
- 241001147780 Alicyclobacillus Species 0.000 claims description 6
- 241000589151 Azotobacter Species 0.000 claims description 6
- 241000186146 Brevibacterium Species 0.000 claims description 6
- 241000588914 Enterobacter Species 0.000 claims description 6
- 241000588921 Enterobacteriaceae Species 0.000 claims description 6
- 241000202974 Methanobacterium Species 0.000 claims description 6
- 241000203353 Methanococcus Species 0.000 claims description 6
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 claims description 6
- 241001467578 Microbacterium Species 0.000 claims description 6
- 241000192041 Micrococcus Species 0.000 claims description 6
- 241000588621 Moraxella Species 0.000 claims description 6
- 241000588771 Morganella <proteobacterium> Species 0.000 claims description 6
- 241000186359 Mycobacterium Species 0.000 claims description 6
- 241000204031 Mycoplasma Species 0.000 claims description 6
- 241000588653 Neisseria Species 0.000 claims description 6
- 241000520272 Pantoea Species 0.000 claims description 6
- 241000606860 Pasteurella Species 0.000 claims description 6
- 241000605894 Porphyromonas Species 0.000 claims description 6
- 241000589886 Treponema Species 0.000 claims description 6
- 239000007788 liquid Substances 0.000 claims description 6
- 241000191992 Peptostreptococcus Species 0.000 claims description 5
- 241000194017 Streptococcus Species 0.000 claims description 5
- 241000607734 Yersinia <bacteria> Species 0.000 claims description 5
- 238000012258 culturing Methods 0.000 claims description 5
- 241000606125 Bacteroides Species 0.000 claims description 4
- 241000606660 Bartonella Species 0.000 claims description 4
- 241000589562 Brucella Species 0.000 claims description 4
- 241001453380 Burkholderia Species 0.000 claims description 4
- 241000589876 Campylobacter Species 0.000 claims description 4
- 241000606701 Rickettsia Species 0.000 claims description 4
- 241000607768 Shigella Species 0.000 claims description 4
- 241000191940 Staphylococcus Species 0.000 claims description 4
- 241000589968 Borrelia Species 0.000 claims description 3
- 101000658547 Escherichia coli (strain K12) Type I restriction enzyme EcoKI endonuclease subunit Proteins 0.000 claims description 3
- 101000658543 Escherichia coli Type I restriction enzyme EcoAI endonuclease subunit Proteins 0.000 claims description 3
- 101000658546 Escherichia coli Type I restriction enzyme EcoEI endonuclease subunit Proteins 0.000 claims description 3
- 101000658530 Escherichia coli Type I restriction enzyme EcoR124II endonuclease subunit Proteins 0.000 claims description 3
- 101000658540 Escherichia coli Type I restriction enzyme EcoprrI endonuclease subunit Proteins 0.000 claims description 3
- 101000658545 Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) Type I restriction enyme HindI endonuclease subunit Proteins 0.000 claims description 3
- 208000016604 Lyme disease Diseases 0.000 claims description 3
- 101000658548 Methanocaldococcus jannaschii (strain ATCC 43067 / DSM 2661 / JAL-1 / JCM 10045 / NBRC 100440) Putative type I restriction enzyme MjaIXP endonuclease subunit Proteins 0.000 claims description 3
- 101000658542 Methanocaldococcus jannaschii (strain ATCC 43067 / DSM 2661 / JAL-1 / JCM 10045 / NBRC 100440) Putative type I restriction enzyme MjaVIIIP endonuclease subunit Proteins 0.000 claims description 3
- 101000658529 Methanocaldococcus jannaschii (strain ATCC 43067 / DSM 2661 / JAL-1 / JCM 10045 / NBRC 100440) Putative type I restriction enzyme MjaVIIP endonuclease subunit Proteins 0.000 claims description 3
- 241000605861 Prevotella Species 0.000 claims description 3
- 101001042773 Staphylococcus aureus (strain COL) Type I restriction enzyme SauCOLORF180P endonuclease subunit Proteins 0.000 claims description 3
- 101000838760 Staphylococcus aureus (strain MRSA252) Type I restriction enzyme SauMRSORF196P endonuclease subunit Proteins 0.000 claims description 3
- 101000838761 Staphylococcus aureus (strain MSSA476) Type I restriction enzyme SauMSSORF170P endonuclease subunit Proteins 0.000 claims description 3
- 101000838758 Staphylococcus aureus (strain MW2) Type I restriction enzyme SauMW2ORF169P endonuclease subunit Proteins 0.000 claims description 3
- 101001042566 Staphylococcus aureus (strain Mu50 / ATCC 700699) Type I restriction enzyme SauMu50ORF195P endonuclease subunit Proteins 0.000 claims description 3
- 101000838763 Staphylococcus aureus (strain N315) Type I restriction enzyme SauN315I endonuclease subunit Proteins 0.000 claims description 3
- 101000838759 Staphylococcus epidermidis (strain ATCC 35984 / RP62A) Type I restriction enzyme SepRPIP endonuclease subunit Proteins 0.000 claims description 3
- 101000838756 Staphylococcus saprophyticus subsp. saprophyticus (strain ATCC 15305 / DSM 20229 / NCIMB 8711 / NCTC 7292 / S-41) Type I restriction enzyme SsaAORF53P endonuclease subunit Proteins 0.000 claims description 3
- 230000006696 biosynthetic metabolic pathway Effects 0.000 claims description 2
- 235000003846 Ricinus Nutrition 0.000 claims 5
- 241000322381 Ricinus <louse> Species 0.000 claims 5
- 241000194033 Enterococcus Species 0.000 claims 4
- 241000589601 Francisella Species 0.000 claims 4
- 241000207202 Gardnerella Species 0.000 claims 4
- 241000606790 Haemophilus Species 0.000 claims 4
- 241000589989 Helicobacter Species 0.000 claims 4
- 241000607598 Vibrio Species 0.000 claims 4
- 241000605314 Ehrlichia Species 0.000 claims 3
- 241000209020 Cornus Species 0.000 claims 2
- 241001445332 Coxiella <snail> Species 0.000 claims 2
- 241001528480 Cupriavidus Species 0.000 claims 2
- 241000033356 Hymenobacter Species 0.000 claims 2
- 241000588769 Proteus <enterobacteria> Species 0.000 claims 2
- 241000122971 Stenotrophomonas Species 0.000 claims 2
- 241000604961 Wolbachia Species 0.000 claims 2
- 241000221787 Erysiphe Species 0.000 claims 1
- 241000605909 Fusobacterium Species 0.000 claims 1
- 241000605233 Hydrogenobacter Species 0.000 claims 1
- 241000206591 Peptococcus Species 0.000 claims 1
- 150000002734 metacrylic acid derivatives Chemical class 0.000 abstract description 27
- SOGAXMICEFXMKE-UHFFFAOYSA-N Butylmethacrylate Chemical group CCCCOC(=O)C(C)=C SOGAXMICEFXMKE-UHFFFAOYSA-N 0.000 description 285
- 239000002609 medium Substances 0.000 description 67
- 235000018102 proteins Nutrition 0.000 description 56
- 108020004414 DNA Proteins 0.000 description 38
- 230000012010 growth Effects 0.000 description 37
- 235000001014 amino acid Nutrition 0.000 description 34
- 102200073741 rs121909602 Human genes 0.000 description 24
- -1 packaging Substances 0.000 description 21
- 102000004316 Oxidoreductases Human genes 0.000 description 20
- 108090000854 Oxidoreductases Proteins 0.000 description 20
- 230000003044 adaptive effect Effects 0.000 description 20
- VVQNEPGJFQJSBK-UHFFFAOYSA-N Methyl methacrylate Chemical compound COC(=O)C(C)=C VVQNEPGJFQJSBK-UHFFFAOYSA-N 0.000 description 18
- 108010050848 glycylleucine Proteins 0.000 description 18
- 125000003275 alpha amino acid group Chemical group 0.000 description 17
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 15
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 15
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 15
- 229910052799 carbon Inorganic materials 0.000 description 15
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 14
- 239000000306 component Substances 0.000 description 14
- 238000010790 dilution Methods 0.000 description 14
- 239000012895 dilution Substances 0.000 description 14
- 230000014509 gene expression Effects 0.000 description 14
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 14
- 230000004048 modification Effects 0.000 description 14
- 238000012986 modification Methods 0.000 description 14
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 13
- 101100145480 Prochlorococcus marinus (strain SARG / CCMP1375 / SS120) rpoC2 gene Proteins 0.000 description 13
- 108091028113 Trans-activating crRNA Proteins 0.000 description 13
- 238000002474 experimental method Methods 0.000 description 13
- AEWHYWSPVRZHCT-NDZSKPAWSA-N isobutyryl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C(C)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 AEWHYWSPVRZHCT-NDZSKPAWSA-N 0.000 description 13
- 108010064235 lysylglycine Proteins 0.000 description 13
- 101150109946 rpo1C gene Proteins 0.000 description 13
- 101150042391 rpoC gene Proteins 0.000 description 13
- 101150103066 rpoC1 gene Proteins 0.000 description 13
- 108091033409 CRISPR Proteins 0.000 description 12
- 101710163270 Nuclease Proteins 0.000 description 12
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 12
- 102220317537 rs1553646295 Human genes 0.000 description 12
- 238000013518 transcription Methods 0.000 description 12
- 230000035897 transcription Effects 0.000 description 12
- 210000004027 cell Anatomy 0.000 description 11
- 230000005782 double-strand break Effects 0.000 description 11
- 230000000694 effects Effects 0.000 description 11
- 238000010362 genome editing Methods 0.000 description 11
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 11
- 239000002028 Biomass Substances 0.000 description 10
- 101710088194 Dehydrogenase Proteins 0.000 description 10
- LRHPLDYGYMQRHN-UHFFFAOYSA-N N-Butanol Chemical compound CCCCO LRHPLDYGYMQRHN-UHFFFAOYSA-N 0.000 description 10
- 108010073062 Transcription Activator-Like Effectors Proteins 0.000 description 10
- 229910003460 diamond Inorganic materials 0.000 description 10
- 239000010432 diamond Substances 0.000 description 10
- 239000001963 growth medium Substances 0.000 description 10
- 102220198431 rs1057520004 Human genes 0.000 description 10
- 108010061238 threonyl-glycine Proteins 0.000 description 10
- 238000012546 transfer Methods 0.000 description 10
- QHKABHOOEWYVLI-UHFFFAOYSA-N 3-methyl-2-oxobutanoic acid Chemical compound CC(C)C(=O)C(O)=O QHKABHOOEWYVLI-UHFFFAOYSA-N 0.000 description 9
- 241000196324 Embryophyta Species 0.000 description 9
- 101100394050 Escherichia coli (strain K12) gyrB gene Proteins 0.000 description 9
- 101150004068 acrB gene Proteins 0.000 description 9
- 108010034529 leucyl-lysine Proteins 0.000 description 9
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 8
- 108010046716 3-Methyl-2-Oxobutanoate Dehydrogenase (Lipoamide) Proteins 0.000 description 8
- 241000219195 Arabidopsis thaliana Species 0.000 description 8
- 108010079426 Electron-Transferring Flavoproteins Proteins 0.000 description 8
- 102000012737 Electron-Transferring Flavoproteins Human genes 0.000 description 8
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 8
- 102000003960 Ligases Human genes 0.000 description 8
- 108090000364 Ligases Proteins 0.000 description 8
- 108010079364 N-glycylalanine Proteins 0.000 description 8
- 108010044940 alanylglutamine Proteins 0.000 description 8
- 108010047857 aspartylglycine Proteins 0.000 description 8
- 239000008103 glucose Substances 0.000 description 8
- 108010078144 glutaminyl-glycine Proteins 0.000 description 8
- 108010015792 glycyllysine Proteins 0.000 description 8
- 108010037850 glycylvaline Proteins 0.000 description 8
- 108010057821 leucylproline Proteins 0.000 description 8
- 101150091444 ompR gene Proteins 0.000 description 8
- 239000002023 wood Substances 0.000 description 8
- 101150075472 ycf27 gene Proteins 0.000 description 8
- 241000282414 Homo sapiens Species 0.000 description 7
- 101710192343 NADPH:adrenodoxin oxidoreductase, mitochondrial Proteins 0.000 description 7
- 102100036777 NADPH:adrenodoxin oxidoreductase, mitochondrial Human genes 0.000 description 7
- 101710104207 Probable NADPH:adrenodoxin oxidoreductase, mitochondrial Proteins 0.000 description 7
- 108010005233 alanylglutamic acid Proteins 0.000 description 7
- 108010047495 alanylglycine Proteins 0.000 description 7
- 108010008355 arginyl-glutamine Proteins 0.000 description 7
- 108010077245 asparaginyl-proline Proteins 0.000 description 7
- 108010038633 aspartylglutamate Proteins 0.000 description 7
- 230000027455 binding Effects 0.000 description 7
- 108010083294 ethanol acyltransferase Proteins 0.000 description 7
- 230000006870 function Effects 0.000 description 7
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 7
- 238000011081 inoculation Methods 0.000 description 7
- KQNPFQTWMSNSAP-UHFFFAOYSA-N isobutyric acid Chemical compound CC(C)C(O)=O KQNPFQTWMSNSAP-UHFFFAOYSA-N 0.000 description 7
- 108010056582 methionylglutamic acid Proteins 0.000 description 7
- 229910052757 nitrogen Inorganic materials 0.000 description 7
- 239000003921 oil Substances 0.000 description 7
- 235000019198 oils Nutrition 0.000 description 7
- 235000004252 protein component Nutrition 0.000 description 7
- 239000000126 substance Substances 0.000 description 7
- 108010073969 valyllysine Proteins 0.000 description 7
- 102220583765 Cellular tumor antigen p53_A70T_mutation Human genes 0.000 description 6
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 6
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 6
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 6
- 101100038261 Methanococcus vannielii (strain ATCC 35089 / DSM 1224 / JCM 13029 / OCM 148 / SB) rpo2C gene Proteins 0.000 description 6
- 241000589776 Pseudomonas putida Species 0.000 description 6
- 108010092854 aspartyllysine Proteins 0.000 description 6
- 102220370480 c.209C>T Human genes 0.000 description 6
- 102220355573 c.59G>T Human genes 0.000 description 6
- 150000001875 compounds Chemical class 0.000 description 6
- 230000002068 genetic effect Effects 0.000 description 6
- 108010049041 glutamylalanine Proteins 0.000 description 6
- 238000003780 insertion Methods 0.000 description 6
- 108010054155 lysyllysine Proteins 0.000 description 6
- 230000000813 microbial effect Effects 0.000 description 6
- 108010070643 prolylglutamic acid Proteins 0.000 description 6
- 230000001105 regulatory effect Effects 0.000 description 6
- 101150085857 rpo2 gene Proteins 0.000 description 6
- 101150090202 rpoB gene Proteins 0.000 description 6
- 102200031660 rs730880032 Human genes 0.000 description 6
- YGSDEFSMJLZEOE-UHFFFAOYSA-N salicylic acid Chemical compound OC(=O)C1=CC=CC=C1O YGSDEFSMJLZEOE-UHFFFAOYSA-N 0.000 description 6
- 108010014720 2-methylacyl-CoA dehydrogenase Proteins 0.000 description 5
- 102000004539 Acyl-CoA Oxidase Human genes 0.000 description 5
- 108020001558 Acyl-CoA oxidase Proteins 0.000 description 5
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 5
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 5
- 241000220225 Malus Species 0.000 description 5
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 5
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 5
- 241000191043 Rhodobacter sphaeroides Species 0.000 description 5
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 5
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 5
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 5
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 5
- 108010062796 arginyllysine Proteins 0.000 description 5
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 5
- 108010089804 glycyl-threonine Proteins 0.000 description 5
- 239000000710 homodimer Substances 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- 108010000761 leucylarginine Proteins 0.000 description 5
- 108010003700 lysyl aspartic acid Proteins 0.000 description 5
- 108010009298 lysylglutamic acid Proteins 0.000 description 5
- 108010017391 lysylvaline Proteins 0.000 description 5
- UMXXGDJOCQSQBV-UHFFFAOYSA-N n-ethyl-n-(triethoxysilylmethyl)ethanamine Chemical compound CCO[Si](OCC)(OCC)CN(CC)CC UMXXGDJOCQSQBV-UHFFFAOYSA-N 0.000 description 5
- 125000003729 nucleotide group Chemical group 0.000 description 5
- 235000015097 nutrients Nutrition 0.000 description 5
- 230000003287 optical effect Effects 0.000 description 5
- 108010012581 phenylalanylglutamate Proteins 0.000 description 5
- 108010079317 prolyl-tyrosine Proteins 0.000 description 5
- 238000012163 sequencing technique Methods 0.000 description 5
- 241000894007 species Species 0.000 description 5
- 231100000331 toxic Toxicity 0.000 description 5
- 230000002588 toxic effect Effects 0.000 description 5
- 230000032258 transport Effects 0.000 description 5
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 5
- NFNSOHPYYIZPLS-UHFFFAOYSA-N 2-methylpropanoyl phosphate Chemical compound CC(C)C(=O)OP(O)(O)=O NFNSOHPYYIZPLS-UHFFFAOYSA-N 0.000 description 4
- RUMACXVDVNRZJZ-UHFFFAOYSA-N 2-methylpropyl 2-methylprop-2-enoate Chemical compound CC(C)COC(=O)C(C)=C RUMACXVDVNRZJZ-UHFFFAOYSA-N 0.000 description 4
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 4
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 4
- 244000063299 Bacillus subtilis Species 0.000 description 4
- 235000014469 Bacillus subtilis Nutrition 0.000 description 4
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 4
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 4
- 241000219112 Cucumis Species 0.000 description 4
- 101710170195 Electron transfer flavoprotein-ubiquinone oxidoreductase Proteins 0.000 description 4
- 102100031804 Electron transfer flavoprotein-ubiquinone oxidoreductase, mitochondrial Human genes 0.000 description 4
- 101710099445 Electron transfer flavoprotein-ubiquinone oxidoreductase, mitochondrial Proteins 0.000 description 4
- 108010069915 Electron-transferring-flavoprotein dehydrogenase Proteins 0.000 description 4
- 241001013691 Escherichia coli BW25113 Species 0.000 description 4
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 4
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 4
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 4
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 4
- 108010065920 Insulin Lispro Proteins 0.000 description 4
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 4
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 4
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 4
- 101100278084 Nostoc sp. (strain PCC 7120 / SAG 25.82 / UTEX 2576) dnaK1 gene Proteins 0.000 description 4
- 108091000080 Phosphotransferase Proteins 0.000 description 4
- 101710130023 Probable electron transfer flavoprotein-ubiquinone oxidoreductase, mitochondrial Proteins 0.000 description 4
- 241000589517 Pseudomonas aeruginosa Species 0.000 description 4
- 238000002123 RNA extraction Methods 0.000 description 4
- 101100117145 Synechocystis sp. (strain PCC 6803 / Kazusa) dnaK2 gene Proteins 0.000 description 4
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 4
- 239000012190 activator Substances 0.000 description 4
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 4
- 108010070944 alanylhistidine Proteins 0.000 description 4
- 108010011559 alanylphenylalanine Proteins 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 108010013835 arginine glutamate Proteins 0.000 description 4
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 4
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 4
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 4
- 108010068265 aspartyltyrosine Proteins 0.000 description 4
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- 101150052825 dnaK gene Proteins 0.000 description 4
- 239000003925 fat Substances 0.000 description 4
- 235000019197 fats Nutrition 0.000 description 4
- 239000007789 gas Substances 0.000 description 4
- 108010010147 glycylglutamine Proteins 0.000 description 4
- 108010087823 glycyltyrosine Proteins 0.000 description 4
- 238000002744 homologous recombination Methods 0.000 description 4
- 230000006801 homologous recombination Effects 0.000 description 4
- 101150077793 ilvH gene Proteins 0.000 description 4
- 101150060643 ilvN gene Proteins 0.000 description 4
- 239000002773 nucleotide Substances 0.000 description 4
- 101150069445 otnK gene Proteins 0.000 description 4
- 230000003647 oxidation Effects 0.000 description 4
- 238000007254 oxidation reaction Methods 0.000 description 4
- 230000037361 pathway Effects 0.000 description 4
- 108010051242 phenylalanylserine Proteins 0.000 description 4
- 102000020233 phosphotransferase Human genes 0.000 description 4
- VCMMXZQDRFWYSE-UHFFFAOYSA-N plumbagin Chemical compound C1=CC=C2C(=O)C(C)=CC(=O)C2=C1O VCMMXZQDRFWYSE-UHFFFAOYSA-N 0.000 description 4
- 229920003229 poly(methyl methacrylate) Polymers 0.000 description 4
- 239000004926 polymethyl methacrylate Substances 0.000 description 4
- 108090000765 processed proteins & peptides Proteins 0.000 description 4
- 239000000047 product Substances 0.000 description 4
- 108010031719 prolyl-serine Proteins 0.000 description 4
- 108010090894 prolylleucine Proteins 0.000 description 4
- 108010053725 prolylvaline Proteins 0.000 description 4
- NHARPDSAXCBDDR-UHFFFAOYSA-N propyl 2-methylprop-2-enoate Chemical compound CCCOC(=O)C(C)=C NHARPDSAXCBDDR-UHFFFAOYSA-N 0.000 description 4
- 102220195216 rs768288444 Human genes 0.000 description 4
- 108010026333 seryl-proline Proteins 0.000 description 4
- 125000006850 spacer group Chemical group 0.000 description 4
- 230000002103 transcriptional effect Effects 0.000 description 4
- 239000002699 waste material Substances 0.000 description 4
- 101150096464 yohJ gene Proteins 0.000 description 4
- WDQMWEYDKDCEHT-UHFFFAOYSA-N 2-ethylhexyl 2-methylprop-2-enoate Chemical compound CCCCC(CC)COC(=O)C(C)=C WDQMWEYDKDCEHT-UHFFFAOYSA-N 0.000 description 3
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 3
- 102100035709 Acetyl-coenzyme A synthetase, cytoplasmic Human genes 0.000 description 3
- 102000057234 Acyl transferases Human genes 0.000 description 3
- 108700016155 Acyl transferases Proteins 0.000 description 3
- 102000002735 Acyl-CoA Dehydrogenase Human genes 0.000 description 3
- 108010001058 Acyl-CoA Dehydrogenase Proteins 0.000 description 3
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 3
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 3
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 3
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 3
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 3
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 3
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 3
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 3
- FUKFQILQFQKHLE-DCAQKATOSA-N Ala-Lys-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O FUKFQILQFQKHLE-DCAQKATOSA-N 0.000 description 3
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 3
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 3
- 101100161759 Arabidopsis thaliana ACX4 gene Proteins 0.000 description 3
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 3
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 3
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 3
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 3
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 3
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 3
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 3
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 3
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 3
- 108010089895 Branched-chain-fatty-acid kinase Proteins 0.000 description 3
- 108700024126 Butyrate kinases Proteins 0.000 description 3
- 241000219173 Carica Species 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- 241000186226 Corynebacterium glutamicum Species 0.000 description 3
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 3
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 3
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 3
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 3
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 3
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 3
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 3
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 3
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 3
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 3
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 3
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 3
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 3
- 102100021646 Isobutyryl-CoA dehydrogenase, mitochondrial Human genes 0.000 description 3
- 108010013792 Isovaleryl-CoA Dehydrogenase Proteins 0.000 description 3
- 102100025392 Isovaleryl-CoA dehydrogenase, mitochondrial Human genes 0.000 description 3
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 3
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 3
- 241000880493 Leptailurus serval Species 0.000 description 3
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 3
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 3
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 3
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 3
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 3
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 3
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 3
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 3
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 3
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 3
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 3
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 3
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 3
- 235000011430 Malus pumila Nutrition 0.000 description 3
- 235000015103 Malus silvestris Nutrition 0.000 description 3
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 3
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 3
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 3
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 3
- VZUNGTLZRAYYDE-UHFFFAOYSA-N N-methyl-N'-nitro-N-nitrosoguanidine Chemical compound O=NN(C)C(=N)N[N+]([O-])=O VZUNGTLZRAYYDE-UHFFFAOYSA-N 0.000 description 3
- 229910019142 PO4 Inorganic materials 0.000 description 3
- 241000589597 Paracoccus denitrificans Species 0.000 description 3
- 108700023175 Phosphate acetyltransferases Proteins 0.000 description 3
- 241000205160 Pyrococcus Species 0.000 description 3
- 108010079005 RDV peptide Proteins 0.000 description 3
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 3
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 3
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 3
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 3
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 3
- 229920002472 Starch Polymers 0.000 description 3
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 3
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 3
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 3
- 102000004357 Transferases Human genes 0.000 description 3
- 108090000992 Transferases Proteins 0.000 description 3
- 108700019146 Transgenes Proteins 0.000 description 3
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 3
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 3
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 3
- 150000001252 acrylic acid derivatives Chemical class 0.000 description 3
- 230000009471 action Effects 0.000 description 3
- 230000004913 activation Effects 0.000 description 3
- 239000000654 additive Substances 0.000 description 3
- 108010041407 alanylaspartic acid Proteins 0.000 description 3
- 108010087924 alanylproline Proteins 0.000 description 3
- 125000005907 alkyl ester group Chemical group 0.000 description 3
- 108010068380 arginylarginine Proteins 0.000 description 3
- 108010093581 aspartyl-proline Proteins 0.000 description 3
- 230000010261 cell growth Effects 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 108010054813 diprotin B Proteins 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 239000010794 food waste Substances 0.000 description 3
- 239000012737 fresh medium Substances 0.000 description 3
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- LNCPIMCVTKXXOY-UHFFFAOYSA-N hexyl 2-methylprop-2-enoate Chemical compound CCCCCCOC(=O)C(C)=C LNCPIMCVTKXXOY-UHFFFAOYSA-N 0.000 description 3
- 238000002955 isolation Methods 0.000 description 3
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 3
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 3
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 3
- 230000007762 localization of cell Effects 0.000 description 3
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 108010085203 methionylmethionine Proteins 0.000 description 3
- 108091027963 non-coding RNA Proteins 0.000 description 3
- 102000042567 non-coding RNA Human genes 0.000 description 3
- FJKROLUGYXJWQN-UHFFFAOYSA-N papa-hydroxy-benzoic acid Natural products OC(=O)C1=CC=C(O)C=C1 FJKROLUGYXJWQN-UHFFFAOYSA-N 0.000 description 3
- 229920001184 polypeptide Polymers 0.000 description 3
- 102000004196 processed proteins & peptides Human genes 0.000 description 3
- 108010015796 prolylisoleucine Proteins 0.000 description 3
- BDERNNFJNOPAEC-UHFFFAOYSA-N propan-1-ol Chemical compound CCCO BDERNNFJNOPAEC-UHFFFAOYSA-N 0.000 description 3
- BOQSSGDQNWEFSX-UHFFFAOYSA-N propan-2-yl 2-methylprop-2-enoate Chemical compound CC(C)OC(=O)C(C)=C BOQSSGDQNWEFSX-UHFFFAOYSA-N 0.000 description 3
- 229960004889 salicylic acid Drugs 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 3
- 235000019698 starch Nutrition 0.000 description 3
- 239000008107 starch Substances 0.000 description 3
- 235000000346 sugar Nutrition 0.000 description 3
- 239000011573 trace mineral Substances 0.000 description 3
- 235000013619 trace mineral Nutrition 0.000 description 3
- 229920002818 (Hydroxyethyl)methacrylate Polymers 0.000 description 2
- KBPLFHHGFOOTCA-UHFFFAOYSA-N 1-Octanol Chemical compound CCCCCCCCO KBPLFHHGFOOTCA-UHFFFAOYSA-N 0.000 description 2
- BBMCTIGTTCKYKF-UHFFFAOYSA-N 1-heptanol Chemical compound CCCCCCCO BBMCTIGTTCKYKF-UHFFFAOYSA-N 0.000 description 2
- MWFMGBPGAXYFAR-UHFFFAOYSA-N 2-hydroxy-2-methylpropanenitrile Chemical compound CC(C)(O)C#N MWFMGBPGAXYFAR-UHFFFAOYSA-N 0.000 description 2
- GNSFRPWPOGYVLO-UHFFFAOYSA-N 3-hydroxypropyl 2-methylprop-2-enoate Chemical compound CC(=C)C(=O)OCCCO GNSFRPWPOGYVLO-UHFFFAOYSA-N 0.000 description 2
- ULYIFEQRRINMJQ-UHFFFAOYSA-N 3-methylbutyl 2-methylprop-2-enoate Chemical compound CC(C)CCOC(=O)C(C)=C ULYIFEQRRINMJQ-UHFFFAOYSA-N 0.000 description 2
- ARSRBNBHOADGJU-UHFFFAOYSA-N 7,12-dimethyltetraphene Chemical compound C1=CC2=CC=CC=C2C2=C1C(C)=C(C=CC=C1)C1=C2C ARSRBNBHOADGJU-UHFFFAOYSA-N 0.000 description 2
- RUZXDTHZHJTTRO-UHFFFAOYSA-N 7-amino-4h-1,4-benzoxazin-3-one Chemical compound N1C(=O)COC2=CC(N)=CC=C21 RUZXDTHZHJTTRO-UHFFFAOYSA-N 0.000 description 2
- 108010049926 Acetate-CoA ligase Proteins 0.000 description 2
- 241000219068 Actinidia Species 0.000 description 2
- 241000948980 Actinobacillus succinogenes Species 0.000 description 2
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 2
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 2
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 2
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 2
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 2
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 2
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 2
- CSAHOYQKNHGDHX-ACZMJKKPSA-N Ala-Gln-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CSAHOYQKNHGDHX-ACZMJKKPSA-N 0.000 description 2
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 2
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 2
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 2
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 2
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 2
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 2
- BTBUEVAGZCKULD-XPUUQOCRSA-N Ala-Gly-His Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CN=CN1 BTBUEVAGZCKULD-XPUUQOCRSA-N 0.000 description 2
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 2
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 2
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 2
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 2
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 2
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 2
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 2
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 2
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 2
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 2
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 2
- 235000003840 Amygdalus nana Nutrition 0.000 description 2
- 244000296825 Amygdalus nana Species 0.000 description 2
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 2
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 2
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 2
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 2
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 2
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 2
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 2
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 2
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 2
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 2
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 2
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 2
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 2
- FAEFJTCTNZTPHX-ACZMJKKPSA-N Asn-Gln-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FAEFJTCTNZTPHX-ACZMJKKPSA-N 0.000 description 2
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 2
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 2
- NYGILGUOUOXGMJ-YUMQZZPRSA-N Asn-Lys-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O NYGILGUOUOXGMJ-YUMQZZPRSA-N 0.000 description 2
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 2
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 2
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 2
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 2
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 2
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 2
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 2
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 2
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 2
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 2
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 2
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 2
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 description 2
- RRUWMFBLFLUZSI-LPEHRKFASA-N Asp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N RRUWMFBLFLUZSI-LPEHRKFASA-N 0.000 description 2
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 2
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 2
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 2
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 2
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 2
- 241000228212 Aspergillus Species 0.000 description 2
- 241000228245 Aspergillus niger Species 0.000 description 2
- 241001465318 Aspergillus terreus Species 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- 108010068197 Butyryl-CoA Dehydrogenase Proteins 0.000 description 2
- 241000222178 Candida tropicalis Species 0.000 description 2
- 235000009467 Carica papaya Nutrition 0.000 description 2
- 241000219172 Caricaceae Species 0.000 description 2
- 241000193454 Clostridium beijerinckii Species 0.000 description 2
- 241000186570 Clostridium kluyveri Species 0.000 description 2
- 241000193469 Clostridium pasteurianum Species 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 241001337994 Cryptococcus <scale insect> Species 0.000 description 2
- 235000015510 Cucumis melo subsp melo Nutrition 0.000 description 2
- 235000010071 Cucumis prophetarum Nutrition 0.000 description 2
- 241000219104 Cucurbitaceae Species 0.000 description 2
- 241001234009 Cucurbitales Species 0.000 description 2
- WAJDEKCJRKGRPG-CIUDSAMLSA-N Cys-His-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N WAJDEKCJRKGRPG-CIUDSAMLSA-N 0.000 description 2
- 102100037458 Dephospho-CoA kinase Human genes 0.000 description 2
- 241000208421 Ericaceae Species 0.000 description 2
- 241000134884 Ericales Species 0.000 description 2
- 101100126129 Escherichia coli (strain K12) insE7 gene Proteins 0.000 description 2
- 101100052728 Escherichia coli (strain K12) raiA gene Proteins 0.000 description 2
- 101100544652 Escherichia coli (strain K12) ynfM gene Proteins 0.000 description 2
- 101100126130 Escherichia coli O111:H- insE gene Proteins 0.000 description 2
- VGGSQFUCUMXWEO-UHFFFAOYSA-N Ethene Chemical compound C=C VGGSQFUCUMXWEO-UHFFFAOYSA-N 0.000 description 2
- PLUBXMRUUVWRLT-UHFFFAOYSA-N Ethyl methanesulfonate Chemical compound CCOS(C)(=O)=O PLUBXMRUUVWRLT-UHFFFAOYSA-N 0.000 description 2
- 239000005977 Ethylene Substances 0.000 description 2
- 108030007102 Formate kinases Proteins 0.000 description 2
- 235000016623 Fragaria vesca Nutrition 0.000 description 2
- 240000009088 Fragaria x ananassa Species 0.000 description 2
- 235000011363 Fragaria x ananassa Nutrition 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 2
- XFKUFUJECJUQTQ-CIUDSAMLSA-N Gln-Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XFKUFUJECJUQTQ-CIUDSAMLSA-N 0.000 description 2
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 2
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 2
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 2
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 2
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 2
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 2
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 2
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 2
- WEAVZFWWIPIANL-SRVKXCTJSA-N Gln-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N WEAVZFWWIPIANL-SRVKXCTJSA-N 0.000 description 2
- DYVMTEWCGAVKSE-HJGDQZAQSA-N Gln-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O DYVMTEWCGAVKSE-HJGDQZAQSA-N 0.000 description 2
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 2
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 2
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 2
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 2
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 2
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 2
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 2
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 2
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 2
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 2
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 2
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 2
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 2
- 241000589232 Gluconobacter oxydans Species 0.000 description 2
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 2
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 2
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 2
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 2
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 2
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 2
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 2
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 2
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 2
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 2
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 2
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 2
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 2
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 2
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 2
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 2
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 2
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 2
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 2
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 2
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 2
- 102000029812 HNH nuclease Human genes 0.000 description 2
- 108060003760 HNH nuclease Proteins 0.000 description 2
- ZUPVLBAXUUGKKN-VHSXEESVSA-N His-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CN=CN2)N)C(=O)O ZUPVLBAXUUGKKN-VHSXEESVSA-N 0.000 description 2
- LBQAHBIVXQSBIR-HVTMNAMFSA-N His-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LBQAHBIVXQSBIR-HVTMNAMFSA-N 0.000 description 2
- SOYCWSKCUVDLMC-AVGNSLFASA-N His-Pro-Arg Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CCCNC(=N)N)C(=O)O SOYCWSKCUVDLMC-AVGNSLFASA-N 0.000 description 2
- WOBHKFSMXKNTIM-UHFFFAOYSA-N Hydroxyethyl methacrylate Chemical compound CC(=C)C(=O)OCCO WOBHKFSMXKNTIM-UHFFFAOYSA-N 0.000 description 2
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 2
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 2
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 2
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 2
- QHGBCRCMBCWMBJ-UHFFFAOYSA-N Ile-Glu-Ala-Lys Natural products CCC(C)C(N)C(=O)NC(CCC(O)=O)C(=O)NC(C)C(=O)NC(C(O)=O)CCCCN QHGBCRCMBCWMBJ-UHFFFAOYSA-N 0.000 description 2
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 2
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 2
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 2
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 2
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 2
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 2
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 2
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 2
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 2
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 2
- ZMOIGGHUSNHCAB-UHFFFAOYSA-N Isoplumbagin Natural products C1=CC(O)=C2C(=O)C(C)=CC(=O)C2=C1 ZMOIGGHUSNHCAB-UHFFFAOYSA-N 0.000 description 2
- 241000588749 Klebsiella oxytoca Species 0.000 description 2
- 241000235649 Kluyveromyces Species 0.000 description 2
- 235000014663 Kluyveromyces fragilis Nutrition 0.000 description 2
- 241001138401 Kluyveromyces lactis Species 0.000 description 2
- 241000235058 Komagataella pastoris Species 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- 240000006024 Lactobacillus plantarum Species 0.000 description 2
- 235000013965 Lactobacillus plantarum Nutrition 0.000 description 2
- 241000218195 Lauraceae Species 0.000 description 2
- 241000218194 Laurales Species 0.000 description 2
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 2
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 2
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 2
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 2
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 2
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 2
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 2
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 2
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 2
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 2
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 2
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 2
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 2
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 2
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 2
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 2
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 2
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 2
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 2
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 2
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 2
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 2
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 2
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- 239000006142 Luria-Bertani Agar Substances 0.000 description 2
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 2
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 2
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 2
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 2
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 2
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 2
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 2
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 2
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 2
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 2
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 2
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 2
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 2
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 2
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 2
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 2
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 2
- MJVAVZPDRWSRRC-UHFFFAOYSA-N Menadione Chemical compound C1=CC=C2C(=O)C(C)=CC(=O)C2=C1 MJVAVZPDRWSRRC-UHFFFAOYSA-N 0.000 description 2
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 2
- TUSOIZOVPJCMFC-FXQIFTODSA-N Met-Asp-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O TUSOIZOVPJCMFC-FXQIFTODSA-N 0.000 description 2
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 2
- RRIHXWPHQSXHAQ-XUXIUFHCSA-N Met-Ile-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O RRIHXWPHQSXHAQ-XUXIUFHCSA-N 0.000 description 2
- BQHLZUMZOXUWNU-DCAQKATOSA-N Met-Pro-Glu Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BQHLZUMZOXUWNU-DCAQKATOSA-N 0.000 description 2
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- RJUFJBKOKNCXHH-UHFFFAOYSA-N Methyl propionate Chemical compound CCC(=O)OC RJUFJBKOKNCXHH-UHFFFAOYSA-N 0.000 description 2
- 240000003433 Miscanthus floridulus Species 0.000 description 2
- 241000234295 Musa Species 0.000 description 2
- 241000234615 Musaceae Species 0.000 description 2
- 101100023384 Mycolicibacterium smegmatis (strain ATCC 700084 / mc(2)155) mimG gene Proteins 0.000 description 2
- FUSGACRLAFQQRL-UHFFFAOYSA-N N-Ethyl-N-nitrosourea Chemical compound CCN(N=O)C(N)=O FUSGACRLAFQQRL-UHFFFAOYSA-N 0.000 description 2
- ZRKWMRDKSOPRRS-UHFFFAOYSA-N N-Methyl-N-nitrosourea Chemical compound O=NN(C)C(N)=O ZRKWMRDKSOPRRS-UHFFFAOYSA-N 0.000 description 2
- AMQJEAYHLZJPGS-UHFFFAOYSA-N N-Pentanol Chemical compound CCCCCO AMQJEAYHLZJPGS-UHFFFAOYSA-N 0.000 description 2
- 108010066427 N-valyltryptophan Proteins 0.000 description 2
- 108010047562 NGR peptide Proteins 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 101100220445 Nocardia farcinica (strain IFM 10152) groEL2 gene Proteins 0.000 description 2
- 108091005461 Nucleic proteins Proteins 0.000 description 2
- 244000025272 Persea americana Species 0.000 description 2
- 235000008673 Persea americana Nutrition 0.000 description 2
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 2
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 2
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 2
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 2
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 2
- ZKSLXIGKRJMALF-MGHWNKPDSA-N Phe-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N ZKSLXIGKRJMALF-MGHWNKPDSA-N 0.000 description 2
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 2
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 2
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 2
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 2
- 241000235648 Pichia Species 0.000 description 2
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 2
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 2
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 2
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 2
- OOZJHTXCLJUODH-QXEWZRGKSA-N Pro-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 OOZJHTXCLJUODH-QXEWZRGKSA-N 0.000 description 2
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 2
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 2
- ATUOYWHBWRKTHZ-UHFFFAOYSA-N Propane Chemical compound CCC ATUOYWHBWRKTHZ-UHFFFAOYSA-N 0.000 description 2
- 108090001084 Propionate kinases Proteins 0.000 description 2
- 241000192142 Proteobacteria Species 0.000 description 2
- 235000011432 Prunus Nutrition 0.000 description 2
- 241001646398 Pseudomonas chlororaphis Species 0.000 description 2
- 235000014443 Pyrus communis Nutrition 0.000 description 2
- 240000001987 Pyrus communis Species 0.000 description 2
- 108010003201 RGH 0205 Proteins 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 102000006382 Ribonucleases Human genes 0.000 description 2
- 108010083644 Ribonucleases Proteins 0.000 description 2
- 235000004789 Rosa xanthina Nutrition 0.000 description 2
- 241000220222 Rosaceae Species 0.000 description 2
- 241000220221 Rosales Species 0.000 description 2
- 241000235070 Saccharomyces Species 0.000 description 2
- 244000253911 Saccharomyces fragilis Species 0.000 description 2
- 235000018368 Saccharomyces fragilis Nutrition 0.000 description 2
- 241000235346 Schizosaccharomyces Species 0.000 description 2
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 2
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 2
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 2
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 2
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 2
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 2
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 2
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- 241000589970 Spirochaetales Species 0.000 description 2
- 101100398785 Streptococcus agalactiae serotype V (strain ATCC BAA-611 / 2603 V/R) ldhD gene Proteins 0.000 description 2
- 241000193996 Streptococcus pyogenes Species 0.000 description 2
- 241000187432 Streptomyces coelicolor Species 0.000 description 2
- 241000282898 Sus scrofa Species 0.000 description 2
- 238000010459 TALEN Methods 0.000 description 2
- DKGAVHZHDRPRBM-UHFFFAOYSA-N Tert-Butanol Chemical compound CC(C)(C)O DKGAVHZHDRPRBM-UHFFFAOYSA-N 0.000 description 2
- 241000204666 Thermotoga maritima Species 0.000 description 2
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 2
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 2
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 2
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 2
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 2
- TZJSEJOXAIWOST-RHYQMDGZSA-N Thr-Lys-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N TZJSEJOXAIWOST-RHYQMDGZSA-N 0.000 description 2
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 2
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 2
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 2
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 2
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 2
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 2
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 2
- 108091023040 Transcription factor Proteins 0.000 description 2
- 102000040945 Transcription factor Human genes 0.000 description 2
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 2
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 2
- RYSNTWVRSLCAJZ-RYUDHWBXSA-N Tyr-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RYSNTWVRSLCAJZ-RYUDHWBXSA-N 0.000 description 2
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 2
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 2
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 2
- WOAQYWUEUYMVGK-ULQDDVLXSA-N Tyr-Lys-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOAQYWUEUYMVGK-ULQDDVLXSA-N 0.000 description 2
- NUQZCPSZHGIYTA-HKUYNNGSSA-N Tyr-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N NUQZCPSZHGIYTA-HKUYNNGSSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 2
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 2
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 2
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 2
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 2
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 2
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 2
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 2
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 2
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 2
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 2
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 2
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 2
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 2
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 2
- BCBFMJYTNKDALA-UFYCRDLUSA-N Val-Phe-Phe Chemical compound N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O BCBFMJYTNKDALA-UFYCRDLUSA-N 0.000 description 2
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 2
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 2
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 2
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 2
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 2
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 2
- 240000004922 Vigna radiata Species 0.000 description 2
- 241000589634 Xanthomonas Species 0.000 description 2
- 240000008042 Zea mays Species 0.000 description 2
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 2
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 2
- 108010017070 Zinc Finger Nucleases Proteins 0.000 description 2
- 241000234675 Zingiberales Species 0.000 description 2
- 101100386830 Zymomonas mobilis subsp. mobilis (strain ATCC 31821 / ZM4 / CP4) ddh gene Proteins 0.000 description 2
- FJJCIZWZNKZHII-UHFFFAOYSA-N [4,6-bis(cyanoamino)-1,3,5-triazin-2-yl]cyanamide Chemical compound N#CNC1=NC(NC#N)=NC(NC#N)=N1 FJJCIZWZNKZHII-UHFFFAOYSA-N 0.000 description 2
- BGLGAKMTYHWWKW-UHFFFAOYSA-N acridine yellow Chemical compound [H+].[Cl-].CC1=C(N)C=C2N=C(C=C(C(C)=C3)N)C3=CC2=C1 BGLGAKMTYHWWKW-UHFFFAOYSA-N 0.000 description 2
- 230000000996 additive effect Effects 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 150000001299 aldehydes Chemical class 0.000 description 2
- 125000001931 aliphatic group Chemical group 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 2
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 2
- 108010091818 arginyl-glycyl-aspartyl-valine Proteins 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 2
- 108010036533 arginylvaline Proteins 0.000 description 2
- 150000004945 aromatic hydrocarbons Chemical class 0.000 description 2
- 125000003118 aryl group Chemical group 0.000 description 2
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 2
- 238000002869 basic local alignment search tool Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- ALPCEXCHMFUSAN-UHFFFAOYSA-N beta-Dihydroplumbagin Natural products C1=CC=C2C(=O)C(C)CC(=O)C2=C1O ALPCEXCHMFUSAN-UHFFFAOYSA-N 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 239000002639 bone cement Substances 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- BTANRVKWQNVYAZ-UHFFFAOYSA-N butan-2-ol Chemical compound CCC(C)O BTANRVKWQNVYAZ-UHFFFAOYSA-N 0.000 description 2
- 235000014121 butter Nutrition 0.000 description 2
- 108010005026 butyryl-CoA synthetase Proteins 0.000 description 2
- 150000001720 carbohydrates Chemical class 0.000 description 2
- 235000014633 carbohydrates Nutrition 0.000 description 2
- 239000001569 carbon dioxide Substances 0.000 description 2
- 229910002092 carbon dioxide Inorganic materials 0.000 description 2
- 238000012512 characterization method Methods 0.000 description 2
- 239000002962 chemical mutagen Substances 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 230000001276 controlling effect Effects 0.000 description 2
- 235000005822 corn Nutrition 0.000 description 2
- 125000004122 cyclic group Chemical group 0.000 description 2
- OIWOHHBRDFKZNC-UHFFFAOYSA-N cyclohexyl 2-methylprop-2-enoate Chemical compound CC(=C)C(=O)OC1CCCCC1 OIWOHHBRDFKZNC-UHFFFAOYSA-N 0.000 description 2
- GTBGXKPAKVYEKJ-UHFFFAOYSA-N decyl 2-methylprop-2-enoate Chemical compound CCCCCCCCCCOC(=O)C(C)=C GTBGXKPAKVYEKJ-UHFFFAOYSA-N 0.000 description 2
- 108010049285 dephospho-CoA kinase Proteins 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- GMSCBRSQMRDRCD-UHFFFAOYSA-N dodecyl 2-methylprop-2-enoate Chemical compound CCCCCCCCCCCCOC(=O)C(C)=C GMSCBRSQMRDRCD-UHFFFAOYSA-N 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- SUPCQIBBMFXVTL-UHFFFAOYSA-N ethyl 2-methylprop-2-enoate Chemical compound CCOC(=O)C(C)=C SUPCQIBBMFXVTL-UHFFFAOYSA-N 0.000 description 2
- 238000001125 extrusion Methods 0.000 description 2
- 235000013312 flour Nutrition 0.000 description 2
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 2
- 238000012239 gene modification Methods 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 108010081551 glycylphenylalanine Proteins 0.000 description 2
- 101150077981 groEL gene Proteins 0.000 description 2
- 101150028210 groEL1 gene Proteins 0.000 description 2
- 230000007773 growth pattern Effects 0.000 description 2
- ZSIAUFGUXNUGDI-UHFFFAOYSA-N hexan-1-ol Chemical compound CCCCCCO ZSIAUFGUXNUGDI-UHFFFAOYSA-N 0.000 description 2
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 101150022325 ibpA gene Proteins 0.000 description 2
- 101150077063 ibpB gene Proteins 0.000 description 2
- 230000002779 inactivation Effects 0.000 description 2
- 239000002440 industrial waste Substances 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- ZXEKIIBDNHEJCQ-UHFFFAOYSA-N isobutanol Chemical compound CC(C)CO ZXEKIIBDNHEJCQ-UHFFFAOYSA-N 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 2
- 229940031154 kluyveromyces marxianus Drugs 0.000 description 2
- 229940072205 lactobacillus plantarum Drugs 0.000 description 2
- 101150026107 ldh1 gene Proteins 0.000 description 2
- 101150041530 ldha gene Proteins 0.000 description 2
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- 229920005610 lignin Polymers 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 229960004961 mechlorethamine Drugs 0.000 description 2
- HAWPXGHAZFHHAD-UHFFFAOYSA-N mechlorethamine Chemical class ClCCN(C)CCCl HAWPXGHAZFHHAD-UHFFFAOYSA-N 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- NPALUEYCDZWBOV-NDZSKPAWSA-N methacrylyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C(=C)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 NPALUEYCDZWBOV-NDZSKPAWSA-N 0.000 description 2
- VNWKTOKETHGBQD-UHFFFAOYSA-N methane Chemical compound C VNWKTOKETHGBQD-UHFFFAOYSA-N 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- MBABOKRGFJTBAE-UHFFFAOYSA-N methyl methanesulfonate Chemical compound COS(C)(=O)=O MBABOKRGFJTBAE-UHFFFAOYSA-N 0.000 description 2
- 229940017219 methyl propionate Drugs 0.000 description 2
- 231100000350 mutagenesis Toxicity 0.000 description 2
- 238000002703 mutagenesis Methods 0.000 description 2
- 239000003471 mutagenic agent Substances 0.000 description 2
- 231100000707 mutagenic chemical Toxicity 0.000 description 2
- 230000003505 mutagenic effect Effects 0.000 description 2
- UMFJAHHVKNCGLG-UHFFFAOYSA-N n-Nitrosodimethylamine Chemical compound CN(C)N=O UMFJAHHVKNCGLG-UHFFFAOYSA-N 0.000 description 2
- 230000006780 non-homologous end joining Effects 0.000 description 2
- 125000004043 oxo group Chemical group O=* 0.000 description 2
- 239000001301 oxygen Substances 0.000 description 2
- 229910052760 oxygen Inorganic materials 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 2
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 2
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 101150028857 phoP gene Proteins 0.000 description 2
- 235000021317 phosphate Nutrition 0.000 description 2
- 150000003013 phosphoric acid derivatives Chemical class 0.000 description 2
- 108010025488 pinealon Proteins 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 239000004033 plastic Substances 0.000 description 2
- 229920003023 plastic Polymers 0.000 description 2
- 102000040430 polynucleotide Human genes 0.000 description 2
- 108091033319 polynucleotide Proteins 0.000 description 2
- 239000002157 polynucleotide Substances 0.000 description 2
- 239000010867 poultry litter Substances 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 235000014774 prunus Nutrition 0.000 description 2
- 101150080370 pspA gene Proteins 0.000 description 2
- 101150019395 pspB gene Proteins 0.000 description 2
- 101150076091 pspC gene Proteins 0.000 description 2
- JETDZFFCRPFPDH-UHFFFAOYSA-N quinacrine mustard dihydrochloride Chemical compound [H+].[H+].[Cl-].[Cl-].C1=C(Cl)C=CC2=C(NC(C)CCCN(CCCl)CCCl)C3=CC(OC)=CC=C3N=C21 JETDZFFCRPFPDH-UHFFFAOYSA-N 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 102200058228 rs180177152 Human genes 0.000 description 2
- 102220007531 rs387906616 Human genes 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 101150033650 soxS gene Proteins 0.000 description 2
- 239000004458 spent grain Substances 0.000 description 2
- 150000008163 sugars Chemical class 0.000 description 2
- 150000003467 sulfuric acid derivatives Chemical class 0.000 description 2
- 230000004083 survival effect Effects 0.000 description 2
- 230000008685 targeting Effects 0.000 description 2
- 239000004753 textile Substances 0.000 description 2
- 238000005809 transesterification reaction Methods 0.000 description 2
- 150000003626 triacylglycerols Chemical class 0.000 description 2
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 101150073958 ybfA gene Proteins 0.000 description 2
- 101150030791 yfiA gene Proteins 0.000 description 2
- 101150060672 yibT gene Proteins 0.000 description 2
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 1
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- MBGQSVGJHSCMFS-BYPYZUCNSA-N (2s)-3-methyl-2-nitrosobutanoic acid Chemical compound CC(C)[C@H](N=O)C(O)=O MBGQSVGJHSCMFS-BYPYZUCNSA-N 0.000 description 1
- 229910019614 (NH4)6 Mo7 O24.4H2 O Inorganic materials 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- UFBJCMHMOXMLKC-UHFFFAOYSA-N 2,4-dinitrophenol Chemical compound OC1=CC=C([N+]([O-])=O)C=C1[N+]([O-])=O UFBJCMHMOXMLKC-UHFFFAOYSA-N 0.000 description 1
- YOIZTLBZAMFVPK-UHFFFAOYSA-N 2-(3-ethoxy-4-hydroxyphenyl)-2-hydroxyacetic acid Chemical compound CCOC1=CC(C(O)C(O)=O)=CC=C1O YOIZTLBZAMFVPK-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- MWBWWFOAEOYUST-UHFFFAOYSA-N 2-aminopurine Chemical compound NC1=NC=C2N=CNC2=N1 MWBWWFOAEOYUST-UHFFFAOYSA-N 0.000 description 1
- LNCCBHFAHILMCT-UHFFFAOYSA-N 2-n,4-n,6-n-triethyl-1,3,5-triazine-2,4,6-triamine Chemical compound CCNC1=NC(NCC)=NC(NCC)=N1 LNCCBHFAHILMCT-UHFFFAOYSA-N 0.000 description 1
- KKAJSJJFBSOMGS-UHFFFAOYSA-N 3,6-diamino-10-methylacridinium chloride Chemical compound [Cl-].C1=C(N)C=C2[N+](C)=C(C=C(N)C=C3)C3=CC2=C1 KKAJSJJFBSOMGS-UHFFFAOYSA-N 0.000 description 1
- HEGWNIMGIDYRAU-UHFFFAOYSA-N 3-hexyl-2,4-dioxabicyclo[1.1.0]butane Chemical compound O1C2OC21CCCCCC HEGWNIMGIDYRAU-UHFFFAOYSA-N 0.000 description 1
- 108010044087 AS-I toxin Proteins 0.000 description 1
- RZVAJINKPMORJF-UHFFFAOYSA-N Acetaminophen Chemical compound CC(=O)NC1=CC=C(O)C=C1 RZVAJINKPMORJF-UHFFFAOYSA-N 0.000 description 1
- 108010092060 Acetate kinase Proteins 0.000 description 1
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 235000009434 Actinidia chinensis Nutrition 0.000 description 1
- 244000298697 Actinidia deliciosa Species 0.000 description 1
- 235000009436 Actinidia deliciosa Nutrition 0.000 description 1
- 241000219066 Actinidiaceae Species 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 101710103516 Acyl-coenzyme A oxidase 4 Proteins 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- WRDANSJTFOHBPI-FXQIFTODSA-N Ala-Arg-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N WRDANSJTFOHBPI-FXQIFTODSA-N 0.000 description 1
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 1
- JYEBJTDTPNKQJG-FXQIFTODSA-N Ala-Asn-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N JYEBJTDTPNKQJG-FXQIFTODSA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- MBWYUTNBYSSUIQ-HERUPUMHSA-N Ala-Asn-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N MBWYUTNBYSSUIQ-HERUPUMHSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 1
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 1
- CRWFEKLFPVRPBV-CIUDSAMLSA-N Ala-Gln-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CRWFEKLFPVRPBV-CIUDSAMLSA-N 0.000 description 1
- ZDYNWWQXFRUOEO-XDTLVQLUSA-N Ala-Gln-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDYNWWQXFRUOEO-XDTLVQLUSA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 1
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 1
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 1
- FOHXUHGZZKETFI-JBDRJPRFSA-N Ala-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N FOHXUHGZZKETFI-JBDRJPRFSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- XCZXVTHYGSMQGH-NAKRPEOUSA-N Ala-Ile-Met Chemical compound C[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C([O-])=O XCZXVTHYGSMQGH-NAKRPEOUSA-N 0.000 description 1
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 1
- QPBSRMDNJOTFAL-AICCOOGYSA-N Ala-Leu-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QPBSRMDNJOTFAL-AICCOOGYSA-N 0.000 description 1
- OPZJWMJPCNNZNT-DCAQKATOSA-N Ala-Leu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N OPZJWMJPCNNZNT-DCAQKATOSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- OMFMCIVBKCEMAK-CYDGBPFRSA-N Ala-Leu-Val-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O OMFMCIVBKCEMAK-CYDGBPFRSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- JWUZOJXDJDEQEM-ZLIFDBKOSA-N Ala-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 JWUZOJXDJDEQEM-ZLIFDBKOSA-N 0.000 description 1
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 1
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 1
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 1
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 1
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 1
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 1
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 1
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 1
- LFFOJBOTZUWINF-ZANVPECISA-N Ala-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O)=CNC2=C1 LFFOJBOTZUWINF-ZANVPECISA-N 0.000 description 1
- YXXPVUOMPSZURS-ZLIFDBKOSA-N Ala-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 YXXPVUOMPSZURS-ZLIFDBKOSA-N 0.000 description 1
- AOAKQKVICDWCLB-UWJYBYFXSA-N Ala-Tyr-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AOAKQKVICDWCLB-UWJYBYFXSA-N 0.000 description 1
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 1
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- 241000722954 Anaerobiospirillum succiniciproducens Species 0.000 description 1
- 101000801180 Arabidopsis thaliana Acyl-coenzyme A oxidase 4, peroxisomal Proteins 0.000 description 1
- 101100438273 Arabidopsis thaliana CAN1 gene Proteins 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- PEFFAAKJGBZBKL-NAKRPEOUSA-N Arg-Ala-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PEFFAAKJGBZBKL-NAKRPEOUSA-N 0.000 description 1
- YYOVLDPHIJAOSY-DCAQKATOSA-N Arg-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N YYOVLDPHIJAOSY-DCAQKATOSA-N 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- KGSJCPBERYUXCN-BPNCWPANSA-N Arg-Ala-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KGSJCPBERYUXCN-BPNCWPANSA-N 0.000 description 1
- QEKBCDODJBBWHV-GUBZILKMSA-N Arg-Arg-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O QEKBCDODJBBWHV-GUBZILKMSA-N 0.000 description 1
- XEPSCVXTCUUHDT-AVGNSLFASA-N Arg-Arg-Leu Natural products CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N XEPSCVXTCUUHDT-AVGNSLFASA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- HJWQFFYRVFEWRM-SRVKXCTJSA-N Arg-Arg-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O HJWQFFYRVFEWRM-SRVKXCTJSA-N 0.000 description 1
- PVSNBTCXCQIXSE-JYJNAYRXSA-N Arg-Arg-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PVSNBTCXCQIXSE-JYJNAYRXSA-N 0.000 description 1
- WESHVRNMNFMVBE-FXQIFTODSA-N Arg-Asn-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N WESHVRNMNFMVBE-FXQIFTODSA-N 0.000 description 1
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 1
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 description 1
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 1
- DXQIQUIQYAGRCC-CIUDSAMLSA-N Arg-Asp-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)CN=C(N)N DXQIQUIQYAGRCC-CIUDSAMLSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 1
- BJNUAWGXPSHQMJ-DCAQKATOSA-N Arg-Gln-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O BJNUAWGXPSHQMJ-DCAQKATOSA-N 0.000 description 1
- LLZXKVAAEWBUPB-KKUMJFAQSA-N Arg-Gln-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLZXKVAAEWBUPB-KKUMJFAQSA-N 0.000 description 1
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- QAXCZGMLVICQKS-SRVKXCTJSA-N Arg-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QAXCZGMLVICQKS-SRVKXCTJSA-N 0.000 description 1
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 1
- HPSVTWMFWCHKFN-GARJFASQSA-N Arg-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O HPSVTWMFWCHKFN-GARJFASQSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 1
- MSILNNHVVMMTHZ-UWVGGRQHSA-N Arg-His-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 MSILNNHVVMMTHZ-UWVGGRQHSA-N 0.000 description 1
- MMGCRPZQZWTZTA-IHRRRGAJSA-N Arg-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N MMGCRPZQZWTZTA-IHRRRGAJSA-N 0.000 description 1
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 1
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 1
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 1
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 1
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- PYZPXCZNQSEHDT-GUBZILKMSA-N Arg-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PYZPXCZNQSEHDT-GUBZILKMSA-N 0.000 description 1
- VVJTWSRNMJNDPN-IUCAKERBSA-N Arg-Met-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O VVJTWSRNMJNDPN-IUCAKERBSA-N 0.000 description 1
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 1
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 1
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 1
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 1
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 1
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 1
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 1
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 1
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- LYJXHXGPWDTLKW-HJGDQZAQSA-N Arg-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O LYJXHXGPWDTLKW-HJGDQZAQSA-N 0.000 description 1
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- QUBKBPZGMZWOKQ-SZMVWBNQSA-N Arg-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QUBKBPZGMZWOKQ-SZMVWBNQSA-N 0.000 description 1
- YHZQOSXDTFRZKU-WDSOQIARSA-N Arg-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 YHZQOSXDTFRZKU-WDSOQIARSA-N 0.000 description 1
- AOJYORNRFWWEIV-IHRRRGAJSA-N Arg-Tyr-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 AOJYORNRFWWEIV-IHRRRGAJSA-N 0.000 description 1
- QJWLLRZTJFPCHA-STECZYCISA-N Arg-Tyr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QJWLLRZTJFPCHA-STECZYCISA-N 0.000 description 1
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 1
- CNBIWSCSSCAINS-UFYCRDLUSA-N Arg-Tyr-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNBIWSCSSCAINS-UFYCRDLUSA-N 0.000 description 1
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 1
- KEZVOBAKAXHMOF-GUBZILKMSA-N Arg-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N KEZVOBAKAXHMOF-GUBZILKMSA-N 0.000 description 1
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 1
- FTMRPIVPSDVGCC-GUBZILKMSA-N Arg-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FTMRPIVPSDVGCC-GUBZILKMSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- 241000186063 Arthrobacter Species 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- ORXCYAFUCSTQGY-FXQIFTODSA-N Asn-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N ORXCYAFUCSTQGY-FXQIFTODSA-N 0.000 description 1
- NUHQMYUWLUSRJX-BIIVOSGPSA-N Asn-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N NUHQMYUWLUSRJX-BIIVOSGPSA-N 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- NTXNUXPCNRDMAF-WFBYXXMGSA-N Asn-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC(N)=O)C)C(O)=O)=CNC2=C1 NTXNUXPCNRDMAF-WFBYXXMGSA-N 0.000 description 1
- HOIFSHOLNKQCSA-FXQIFTODSA-N Asn-Arg-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O HOIFSHOLNKQCSA-FXQIFTODSA-N 0.000 description 1
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 1
- GOVUDFOGXOONFT-VEVYYDQMSA-N Asn-Arg-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GOVUDFOGXOONFT-VEVYYDQMSA-N 0.000 description 1
- HAJWYALLJIATCX-FXQIFTODSA-N Asn-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N HAJWYALLJIATCX-FXQIFTODSA-N 0.000 description 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 1
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 1
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 1
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 1
- XWFPGQVLOVGSLU-CIUDSAMLSA-N Asn-Gln-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XWFPGQVLOVGSLU-CIUDSAMLSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 1
- IHUJUZBUOFTIOB-QEJZJMRPSA-N Asn-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N IHUJUZBUOFTIOB-QEJZJMRPSA-N 0.000 description 1
- BKDDABUWNKGZCK-XHNCKOQMSA-N Asn-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O BKDDABUWNKGZCK-XHNCKOQMSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 1
- MOHUTCNYQLMARY-GUBZILKMSA-N Asn-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MOHUTCNYQLMARY-GUBZILKMSA-N 0.000 description 1
- SGAUXNZEFIEAAI-GARJFASQSA-N Asn-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)C(=O)O SGAUXNZEFIEAAI-GARJFASQSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 1
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 1
- UHGUKCOQUNPSKK-CIUDSAMLSA-N Asn-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N UHGUKCOQUNPSKK-CIUDSAMLSA-N 0.000 description 1
- UBGGJTMETLEXJD-DCAQKATOSA-N Asn-Leu-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O UBGGJTMETLEXJD-DCAQKATOSA-N 0.000 description 1
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- KNENKKKUYGEZIO-FXQIFTODSA-N Asn-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N KNENKKKUYGEZIO-FXQIFTODSA-N 0.000 description 1
- HMUKKNAMNSXDBB-CIUDSAMLSA-N Asn-Met-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMUKKNAMNSXDBB-CIUDSAMLSA-N 0.000 description 1
- MYVBTYXSWILFCG-BQBZGAKWSA-N Asn-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N MYVBTYXSWILFCG-BQBZGAKWSA-N 0.000 description 1
- KSGAFDTYQPKUAP-GMOBBJLQSA-N Asn-Met-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KSGAFDTYQPKUAP-GMOBBJLQSA-N 0.000 description 1
- QGABLMITFKUQDF-DCAQKATOSA-N Asn-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGABLMITFKUQDF-DCAQKATOSA-N 0.000 description 1
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 1
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 1
- UWFOMGUWGPRVBW-GUBZILKMSA-N Asn-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N UWFOMGUWGPRVBW-GUBZILKMSA-N 0.000 description 1
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 1
- KYQJHBWHRASMKG-ZLUOBGJFSA-N Asn-Ser-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O KYQJHBWHRASMKG-ZLUOBGJFSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- GOPFMQJUQDLUFW-LKXGYXEUSA-N Asn-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O GOPFMQJUQDLUFW-LKXGYXEUSA-N 0.000 description 1
- RDLYUKRPEJERMM-XIRDDKMYSA-N Asn-Trp-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O RDLYUKRPEJERMM-XIRDDKMYSA-N 0.000 description 1
- QIRJQYQOIKBPBZ-IHRRRGAJSA-N Asn-Tyr-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QIRJQYQOIKBPBZ-IHRRRGAJSA-N 0.000 description 1
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 1
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 1
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- ZVTDYGWRRPMFCL-WFBYXXMGSA-N Asp-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N ZVTDYGWRRPMFCL-WFBYXXMGSA-N 0.000 description 1
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 1
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 1
- DBWYWXNMZZYIRY-LPEHRKFASA-N Asp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O DBWYWXNMZZYIRY-LPEHRKFASA-N 0.000 description 1
- NYLBGYLHBDFRHL-VEVYYDQMSA-N Asp-Arg-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NYLBGYLHBDFRHL-VEVYYDQMSA-N 0.000 description 1
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 1
- ZCKYZTGLXIEOKS-CIUDSAMLSA-N Asp-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N ZCKYZTGLXIEOKS-CIUDSAMLSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- VHWNKSJHQFZJTH-FXQIFTODSA-N Asp-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N VHWNKSJHQFZJTH-FXQIFTODSA-N 0.000 description 1
- VZNOVQKGJQJOCS-SRVKXCTJSA-N Asp-Asp-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VZNOVQKGJQJOCS-SRVKXCTJSA-N 0.000 description 1
- KVPHTGVUMJGMCX-BIIVOSGPSA-N Asp-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)C(=O)O KVPHTGVUMJGMCX-BIIVOSGPSA-N 0.000 description 1
- PJERDVUTUDZPGX-ZKWXMUAHSA-N Asp-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC(O)=O PJERDVUTUDZPGX-ZKWXMUAHSA-N 0.000 description 1
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 1
- WLKVEEODTPQPLI-ACZMJKKPSA-N Asp-Gln-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WLKVEEODTPQPLI-ACZMJKKPSA-N 0.000 description 1
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 1
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 1
- LTXGDRFJRZSZAV-CIUDSAMLSA-N Asp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N LTXGDRFJRZSZAV-CIUDSAMLSA-N 0.000 description 1
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 1
- JHFNSBBHKSZXKB-VKHMYHEASA-N Asp-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(O)=O JHFNSBBHKSZXKB-VKHMYHEASA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- SAKCBXNPWDRWPE-BQBZGAKWSA-N Asp-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N SAKCBXNPWDRWPE-BQBZGAKWSA-N 0.000 description 1
- VWWAFGHMPWBKEP-GMOBBJLQSA-N Asp-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)O)N VWWAFGHMPWBKEP-GMOBBJLQSA-N 0.000 description 1
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 1
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- QTIZKMMLNUMHHU-DCAQKATOSA-N Asp-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QTIZKMMLNUMHHU-DCAQKATOSA-N 0.000 description 1
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 1
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 1
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 1
- LLRJPYJQNBMOOO-QEJZJMRPSA-N Asp-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N LLRJPYJQNBMOOO-QEJZJMRPSA-N 0.000 description 1
- LEYKQPDPZJIRTA-AQZXSJQPSA-N Asp-Trp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LEYKQPDPZJIRTA-AQZXSJQPSA-N 0.000 description 1
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 1
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 1
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 1
- 241001474374 Blennius Species 0.000 description 1
- 241000219193 Brassicaceae Species 0.000 description 1
- 241000218980 Brassicales Species 0.000 description 1
- COVZYZSDYWQREU-UHFFFAOYSA-N Busulfan Chemical compound CS(=O)(=O)OCCCCOS(C)(=O)=O COVZYZSDYWQREU-UHFFFAOYSA-N 0.000 description 1
- 238000010354 CRISPR gene editing Methods 0.000 description 1
- 238000010453 CRISPR/Cas method Methods 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 241000218236 Cannabis Species 0.000 description 1
- 235000014649 Carica monoica Nutrition 0.000 description 1
- 241000283153 Cetacea Species 0.000 description 1
- 229920002101 Chitin Polymers 0.000 description 1
- 241001112696 Clostridia Species 0.000 description 1
- 241000423302 Clostridium acetobutylicum ATCC 824 Species 0.000 description 1
- 241000193171 Clostridium butyricum Species 0.000 description 1
- 241000252203 Clupea harengus Species 0.000 description 1
- ACTIUHUUMQJHFO-UHFFFAOYSA-N Coenzym Q10 Natural products COC1=C(OC)C(=O)C(CC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)C)=C(C)C1=O ACTIUHUUMQJHFO-UHFFFAOYSA-N 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 241000186031 Corynebacteriaceae Species 0.000 description 1
- 241001485655 Corynebacterium glutamicum ATCC 13032 Species 0.000 description 1
- CMSMOCZEIVJLDB-UHFFFAOYSA-N Cyclophosphamide Chemical compound ClCCN(CCCl)P1(=O)NCCCO1 CMSMOCZEIVJLDB-UHFFFAOYSA-N 0.000 description 1
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 1
- AMRLSQGGERHDHJ-FXQIFTODSA-N Cys-Ala-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMRLSQGGERHDHJ-FXQIFTODSA-N 0.000 description 1
- QFMCHXSGIZPBKG-ZLUOBGJFSA-N Cys-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N QFMCHXSGIZPBKG-ZLUOBGJFSA-N 0.000 description 1
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 1
- URDUGPGPLNXXES-WHFBIAKZSA-N Cys-Gly-Cys Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O URDUGPGPLNXXES-WHFBIAKZSA-N 0.000 description 1
- GCDLPNRHPWBKJJ-WDSKDSINSA-N Cys-Gly-Glu Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GCDLPNRHPWBKJJ-WDSKDSINSA-N 0.000 description 1
- LBOLGUYQEPZSKM-YUMQZZPRSA-N Cys-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N LBOLGUYQEPZSKM-YUMQZZPRSA-N 0.000 description 1
- OXOQBEVULIBOSH-ZDLURKLDSA-N Cys-Gly-Thr Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O OXOQBEVULIBOSH-ZDLURKLDSA-N 0.000 description 1
- WZZGXXNRSZIQFC-VGDYDELISA-N Cys-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N WZZGXXNRSZIQFC-VGDYDELISA-N 0.000 description 1
- HAYVLBZZBDCKRA-SRVKXCTJSA-N Cys-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N HAYVLBZZBDCKRA-SRVKXCTJSA-N 0.000 description 1
- DVIHGGUODLILFN-GHCJXIJMSA-N Cys-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DVIHGGUODLILFN-GHCJXIJMSA-N 0.000 description 1
- OTXLNICGSXPGQF-KBIXCLLPSA-N Cys-Ile-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTXLNICGSXPGQF-KBIXCLLPSA-N 0.000 description 1
- ZMWOJVAXTOUHAP-ZKWXMUAHSA-N Cys-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N ZMWOJVAXTOUHAP-ZKWXMUAHSA-N 0.000 description 1
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 1
- GDNWBSFSHJVXKL-GUBZILKMSA-N Cys-Lys-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O GDNWBSFSHJVXKL-GUBZILKMSA-N 0.000 description 1
- MKVKKORBPTUSNX-LPEHRKFASA-N Cys-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N MKVKKORBPTUSNX-LPEHRKFASA-N 0.000 description 1
- RWVBNRYBHAGYSG-GUBZILKMSA-N Cys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)N RWVBNRYBHAGYSG-GUBZILKMSA-N 0.000 description 1
- GFMJUESGWILPEN-MELADBBJSA-N Cys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CS)N)C(=O)O GFMJUESGWILPEN-MELADBBJSA-N 0.000 description 1
- TXGDWPBLUFQODU-XGEHTFHBSA-N Cys-Pro-Thr Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O TXGDWPBLUFQODU-XGEHTFHBSA-N 0.000 description 1
- XBELMDARIGXDKY-GUBZILKMSA-N Cys-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CS)N XBELMDARIGXDKY-GUBZILKMSA-N 0.000 description 1
- RJPKQCFHEPPTGL-ZLUOBGJFSA-N Cys-Ser-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RJPKQCFHEPPTGL-ZLUOBGJFSA-N 0.000 description 1
- DQGIAOGALAQBGK-BWBBJGPYSA-N Cys-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O DQGIAOGALAQBGK-BWBBJGPYSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 1
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 1
- 108020005199 Dehydrogenases Proteins 0.000 description 1
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 1
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- ZFIVKAOQEXOYFY-UHFFFAOYSA-N Diepoxybutane Chemical compound C1OC1C1OC1 ZFIVKAOQEXOYFY-UHFFFAOYSA-N 0.000 description 1
- 102000028526 Dihydrolipoamide Dehydrogenase Human genes 0.000 description 1
- 108010028127 Dihydrolipoamide Dehydrogenase Proteins 0.000 description 1
- 239000003109 Disodium ethylene diamine tetraacetate Substances 0.000 description 1
- 101100378121 Drosophila melanogaster nAChRalpha1 gene Proteins 0.000 description 1
- OVBJJZOQPCKUOR-UHFFFAOYSA-L EDTA disodium salt dihydrate Chemical compound O.O.[Na+].[Na+].[O-]C(=O)C[NH+](CC([O-])=O)CC[NH+](CC([O-])=O)CC([O-])=O OVBJJZOQPCKUOR-UHFFFAOYSA-L 0.000 description 1
- 241000305071 Enterobacterales Species 0.000 description 1
- 241000660147 Escherichia coli str. K-12 substr. MG1655 Species 0.000 description 1
- OTMSDBZUPAUEDD-UHFFFAOYSA-N Ethane Chemical compound CC OTMSDBZUPAUEDD-UHFFFAOYSA-N 0.000 description 1
- IAYPIBMASNFSPL-UHFFFAOYSA-N Ethylene oxide Chemical compound C1CO1 IAYPIBMASNFSPL-UHFFFAOYSA-N 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 108020005637 FAD dependent oxidoreductase Proteins 0.000 description 1
- 102000007384 FAD-dependent oxidoreductase Human genes 0.000 description 1
- 108010074122 Ferredoxins Proteins 0.000 description 1
- MBMLMWLHJBBADN-UHFFFAOYSA-N Ferrous sulfide Chemical compound [Fe]=S MBMLMWLHJBBADN-UHFFFAOYSA-N 0.000 description 1
- 241000220223 Fragaria Species 0.000 description 1
- 108091092584 GDNA Proteins 0.000 description 1
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 1
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 1
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 1
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 1
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 1
- KJRXLVZYJJLUCV-DCAQKATOSA-N Gln-Arg-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KJRXLVZYJJLUCV-DCAQKATOSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- KWLMLNHADZIJIS-CIUDSAMLSA-N Gln-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N KWLMLNHADZIJIS-CIUDSAMLSA-N 0.000 description 1
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 1
- SOIAHPSKKUYREP-CIUDSAMLSA-N Gln-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N SOIAHPSKKUYREP-CIUDSAMLSA-N 0.000 description 1
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 1
- UICOTGULOUGGLC-NUMRIWBASA-N Gln-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UICOTGULOUGGLC-NUMRIWBASA-N 0.000 description 1
- ALUBSZXSNSPDQV-WDSKDSINSA-N Gln-Cys-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ALUBSZXSNSPDQV-WDSKDSINSA-N 0.000 description 1
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- NPTGGVQJYRSMCM-GLLZPBPUSA-N Gln-Gln-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPTGGVQJYRSMCM-GLLZPBPUSA-N 0.000 description 1
- BLOXULLYFRGYKZ-GUBZILKMSA-N Gln-Glu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BLOXULLYFRGYKZ-GUBZILKMSA-N 0.000 description 1
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- DDNIZQDYXDENIT-FXQIFTODSA-N Gln-Glu-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N DDNIZQDYXDENIT-FXQIFTODSA-N 0.000 description 1
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 1
- LFIVHGMKWFGUGK-IHRRRGAJSA-N Gln-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LFIVHGMKWFGUGK-IHRRRGAJSA-N 0.000 description 1
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 1
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 1
- BVELAHPZLYLZDJ-HGNGGELXSA-N Gln-His-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O BVELAHPZLYLZDJ-HGNGGELXSA-N 0.000 description 1
- KQOPMGBHNQBCEL-HVTMNAMFSA-N Gln-His-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KQOPMGBHNQBCEL-HVTMNAMFSA-N 0.000 description 1
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 description 1
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 1
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 1
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 1
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- ILKYYKRAULNYMS-JYJNAYRXSA-N Gln-Lys-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ILKYYKRAULNYMS-JYJNAYRXSA-N 0.000 description 1
- KLKYKPXITJBSNI-CIUDSAMLSA-N Gln-Met-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O KLKYKPXITJBSNI-CIUDSAMLSA-N 0.000 description 1
- XBWGJWXGUNSZAT-CIUDSAMLSA-N Gln-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N XBWGJWXGUNSZAT-CIUDSAMLSA-N 0.000 description 1
- QBEWLBKBGXVVPD-RYUDHWBXSA-N Gln-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N QBEWLBKBGXVVPD-RYUDHWBXSA-N 0.000 description 1
- JUUNNOLZGVYCJT-JYJNAYRXSA-N Gln-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JUUNNOLZGVYCJT-JYJNAYRXSA-N 0.000 description 1
- DBNLXHGDGBUCDV-KKUMJFAQSA-N Gln-Phe-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DBNLXHGDGBUCDV-KKUMJFAQSA-N 0.000 description 1
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 1
- OREPWMPAUWIIAM-ZPFDUUQYSA-N Gln-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N OREPWMPAUWIIAM-ZPFDUUQYSA-N 0.000 description 1
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 1
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 1
- UXXIVIQGOODKQC-NUMRIWBASA-N Gln-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UXXIVIQGOODKQC-NUMRIWBASA-N 0.000 description 1
- RBSKVTZUFMIWFU-XEGUGMAKSA-N Gln-Trp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O RBSKVTZUFMIWFU-XEGUGMAKSA-N 0.000 description 1
- WPJDPEOQUIXXOY-AVGNSLFASA-N Gln-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WPJDPEOQUIXXOY-AVGNSLFASA-N 0.000 description 1
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 1
- QXQDADBVIBLBHN-FHWLQOOXSA-N Gln-Tyr-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QXQDADBVIBLBHN-FHWLQOOXSA-N 0.000 description 1
- UQKVUFGUSVYJMQ-IRIUXVKKSA-N Gln-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N)O UQKVUFGUSVYJMQ-IRIUXVKKSA-N 0.000 description 1
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- ZFBBMCKQSNJZSN-AUTRQRHGSA-N Gln-Val-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFBBMCKQSNJZSN-AUTRQRHGSA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 1
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- GCYFUZJHAXJKKE-KKUMJFAQSA-N Glu-Arg-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GCYFUZJHAXJKKE-KKUMJFAQSA-N 0.000 description 1
- MLCPTRRNICEKIS-FXQIFTODSA-N Glu-Asn-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLCPTRRNICEKIS-FXQIFTODSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 1
- ZJICFHQSPWFBKP-AVGNSLFASA-N Glu-Asn-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZJICFHQSPWFBKP-AVGNSLFASA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- LSTFYPOGBGFIPP-FXQIFTODSA-N Glu-Cys-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O LSTFYPOGBGFIPP-FXQIFTODSA-N 0.000 description 1
- XMVLTPMCUJTJQP-FXQIFTODSA-N Glu-Gln-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N XMVLTPMCUJTJQP-FXQIFTODSA-N 0.000 description 1
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- XOFYVODYSNKPDK-AVGNSLFASA-N Glu-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOFYVODYSNKPDK-AVGNSLFASA-N 0.000 description 1
- NJPQBTJSYCKCNS-HVTMNAMFSA-N Glu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N NJPQBTJSYCKCNS-HVTMNAMFSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 1
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 1
- BKRQSECBKKCCKW-HVTMNAMFSA-N Glu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N BKRQSECBKKCCKW-HVTMNAMFSA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- ZCFNZTVIDMLUQC-SXNHZJKMSA-N Glu-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZCFNZTVIDMLUQC-SXNHZJKMSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- DWBBKNPKDHXIAC-SRVKXCTJSA-N Glu-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCC(O)=O DWBBKNPKDHXIAC-SRVKXCTJSA-N 0.000 description 1
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 1
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 1
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 1
- AOCARQDSFTWWFT-DCAQKATOSA-N Glu-Met-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AOCARQDSFTWWFT-DCAQKATOSA-N 0.000 description 1
- SOEPMWQCTJITPZ-SRVKXCTJSA-N Glu-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N SOEPMWQCTJITPZ-SRVKXCTJSA-N 0.000 description 1
- XEKAJTCACGEBOK-KKUMJFAQSA-N Glu-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XEKAJTCACGEBOK-KKUMJFAQSA-N 0.000 description 1
- ZTVGZOIBLRPQNR-KKUMJFAQSA-N Glu-Met-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZTVGZOIBLRPQNR-KKUMJFAQSA-N 0.000 description 1
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 1
- JDUKCSSHWNIQQZ-IHRRRGAJSA-N Glu-Phe-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JDUKCSSHWNIQQZ-IHRRRGAJSA-N 0.000 description 1
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 1
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 1
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 1
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 1
- ZKONLKQGTNVAPR-DCAQKATOSA-N Glu-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N ZKONLKQGTNVAPR-DCAQKATOSA-N 0.000 description 1
- LPHGXOWFAXFCPX-KKUMJFAQSA-N Glu-Pro-Phe Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O LPHGXOWFAXFCPX-KKUMJFAQSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- WXONSNSSBYQGNN-AVGNSLFASA-N Glu-Ser-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WXONSNSSBYQGNN-AVGNSLFASA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- DDXZHOHEABQXSE-NKIYYHGXSA-N Glu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O DDXZHOHEABQXSE-NKIYYHGXSA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 1
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- YOTHMZZSJKKEHZ-SZMVWBNQSA-N Glu-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CCC(O)=O)=CNC2=C1 YOTHMZZSJKKEHZ-SZMVWBNQSA-N 0.000 description 1
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 1
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- 241001524188 Glutamicibacter nicotianae Species 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 1
- PYUCNHJQQVSPGN-BQBZGAKWSA-N Gly-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)CN=C(N)N PYUCNHJQQVSPGN-BQBZGAKWSA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- MXXXVOYFNVJHMA-IUCAKERBSA-N Gly-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN MXXXVOYFNVJHMA-IUCAKERBSA-N 0.000 description 1
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 1
- WJZLEENECIOOSA-WDSKDSINSA-N Gly-Asn-Gln Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)O WJZLEENECIOOSA-WDSKDSINSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 1
- BULIVUZUDBHKKZ-WDSKDSINSA-N Gly-Gln-Asn Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BULIVUZUDBHKKZ-WDSKDSINSA-N 0.000 description 1
- JMQFHZWESBGPFC-WDSKDSINSA-N Gly-Gln-Asp Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JMQFHZWESBGPFC-WDSKDSINSA-N 0.000 description 1
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 1
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 1
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 1
- JUGQPPOVWXSPKJ-RYUDHWBXSA-N Gly-Gln-Phe Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JUGQPPOVWXSPKJ-RYUDHWBXSA-N 0.000 description 1
- JLJLBWDKDRYOPA-RYUDHWBXSA-N Gly-Gln-Tyr Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JLJLBWDKDRYOPA-RYUDHWBXSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- HHSOPSCKAZKQHQ-PEXQALLHSA-N Gly-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN HHSOPSCKAZKQHQ-PEXQALLHSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 1
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- OJNZVYSGVYLQIN-BQBZGAKWSA-N Gly-Met-Asp Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O OJNZVYSGVYLQIN-BQBZGAKWSA-N 0.000 description 1
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 1
- OMOZPGCHVWOXHN-BQBZGAKWSA-N Gly-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)CN OMOZPGCHVWOXHN-BQBZGAKWSA-N 0.000 description 1
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 1
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 1
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 1
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 1
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- IHDKKJVBLGXLEL-STQMWFEESA-N Gly-Tyr-Met Chemical compound CSCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)CN)C(O)=O IHDKKJVBLGXLEL-STQMWFEESA-N 0.000 description 1
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- DKJWUIYLMLUBDX-XPUUQOCRSA-N Gly-Val-Cys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O DKJWUIYLMLUBDX-XPUUQOCRSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- FULZDMOZUZKGQU-ONGXEEELSA-N Gly-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN FULZDMOZUZKGQU-ONGXEEELSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- 229920002527 Glycogen Polymers 0.000 description 1
- 108020005004 Guide RNA Proteins 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- HTZKFIYQMHJWSQ-INTQDDNPSA-N His-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HTZKFIYQMHJWSQ-INTQDDNPSA-N 0.000 description 1
- HDXNWVLQSQFJOX-SRVKXCTJSA-N His-Arg-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HDXNWVLQSQFJOX-SRVKXCTJSA-N 0.000 description 1
- LYSVCKOXIDKEEL-SRVKXCTJSA-N His-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 LYSVCKOXIDKEEL-SRVKXCTJSA-N 0.000 description 1
- MVADCDSCFTXCBT-CIUDSAMLSA-N His-Asp-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MVADCDSCFTXCBT-CIUDSAMLSA-N 0.000 description 1
- WOAMZMXCLBBQKW-KKUMJFAQSA-N His-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC2=CN=CN2)N)O WOAMZMXCLBBQKW-KKUMJFAQSA-N 0.000 description 1
- FYVHHKMHFPMBBG-GUBZILKMSA-N His-Gln-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FYVHHKMHFPMBBG-GUBZILKMSA-N 0.000 description 1
- BQFGKVYHKCNEMF-DCAQKATOSA-N His-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 BQFGKVYHKCNEMF-DCAQKATOSA-N 0.000 description 1
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 1
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 1
- JCOSMKPAOYDKRO-AVGNSLFASA-N His-Glu-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N JCOSMKPAOYDKRO-AVGNSLFASA-N 0.000 description 1
- ZYDYEPDFFVCUBI-SRVKXCTJSA-N His-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZYDYEPDFFVCUBI-SRVKXCTJSA-N 0.000 description 1
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 1
- HAPWZEVRQYGLSG-IUCAKERBSA-N His-Gly-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O HAPWZEVRQYGLSG-IUCAKERBSA-N 0.000 description 1
- QPSCMXDWVKWVOW-BZSNNMDCSA-N His-His-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QPSCMXDWVKWVOW-BZSNNMDCSA-N 0.000 description 1
- JJHWJUYYTWYXPL-PYJNHQTQSA-N His-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CN=CN1 JJHWJUYYTWYXPL-PYJNHQTQSA-N 0.000 description 1
- WZBLRQQCDYYRTD-SIXJUCDHSA-N His-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N WZBLRQQCDYYRTD-SIXJUCDHSA-N 0.000 description 1
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- ULRFSEJGSHYLQI-YESZJQIVSA-N His-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ULRFSEJGSHYLQI-YESZJQIVSA-N 0.000 description 1
- VDHOMPFVSABJKU-ULQDDVLXSA-N His-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N VDHOMPFVSABJKU-ULQDDVLXSA-N 0.000 description 1
- OWYIDJCNRWRSJY-QTKMDUPCSA-N His-Pro-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O OWYIDJCNRWRSJY-QTKMDUPCSA-N 0.000 description 1
- FLXCRBXJRJSDHX-AVGNSLFASA-N His-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O FLXCRBXJRJSDHX-AVGNSLFASA-N 0.000 description 1
- DGLAHESNTJWGDO-SRVKXCTJSA-N His-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DGLAHESNTJWGDO-SRVKXCTJSA-N 0.000 description 1
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 1
- FRDFAWHTPDKRHG-ULQDDVLXSA-N His-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 FRDFAWHTPDKRHG-ULQDDVLXSA-N 0.000 description 1
- PZUZIHRPOVVHOT-KBPBESRZSA-N His-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CN=CN1 PZUZIHRPOVVHOT-KBPBESRZSA-N 0.000 description 1
- HIJIJPFILYPTFR-ACRUOGEOSA-N His-Tyr-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HIJIJPFILYPTFR-ACRUOGEOSA-N 0.000 description 1
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 1
- 235000008694 Humulus lupulus Nutrition 0.000 description 1
- PWGOWIIEVDAYTC-UHFFFAOYSA-N ICR-170 Chemical compound Cl.Cl.C1=C(OC)C=C2C(NCCCN(CCCl)CC)=C(C=CC(Cl)=C3)C3=NC2=C1 PWGOWIIEVDAYTC-UHFFFAOYSA-N 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- WUEIUSDAECDLQO-NAKRPEOUSA-N Ile-Ala-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)O)N WUEIUSDAECDLQO-NAKRPEOUSA-N 0.000 description 1
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 1
- DXUJSRIVSWEOAG-NAKRPEOUSA-N Ile-Arg-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N DXUJSRIVSWEOAG-NAKRPEOUSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 1
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 1
- PJLLMGWWINYQPB-PEFMBERDSA-N Ile-Asn-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PJLLMGWWINYQPB-PEFMBERDSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- NCSIQAFSIPHVAN-IUKAMOBKSA-N Ile-Asn-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NCSIQAFSIPHVAN-IUKAMOBKSA-N 0.000 description 1
- LEDRIAHEWDJRMF-CFMVVWHZSA-N Ile-Asn-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LEDRIAHEWDJRMF-CFMVVWHZSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- UDLAWRKOVFDKFL-PEFMBERDSA-N Ile-Asp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UDLAWRKOVFDKFL-PEFMBERDSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- JQLFYZMEXFNRFS-DJFWLOJKSA-N Ile-Asp-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N JQLFYZMEXFNRFS-DJFWLOJKSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 1
- LDRALPZEVHVXEK-KBIXCLLPSA-N Ile-Cys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N LDRALPZEVHVXEK-KBIXCLLPSA-N 0.000 description 1
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 1
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 1
- OVPYIUNCVSOVNF-KQXIARHKSA-N Ile-Gln-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N OVPYIUNCVSOVNF-KQXIARHKSA-N 0.000 description 1
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 1
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 1
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 1
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 1
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 1
- LWWILHPVAKKLQS-QXEWZRGKSA-N Ile-Gly-Met Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N LWWILHPVAKKLQS-QXEWZRGKSA-N 0.000 description 1
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 1
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 1
- YKLOMBNBQUTJDT-HVTMNAMFSA-N Ile-His-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YKLOMBNBQUTJDT-HVTMNAMFSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 1
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 1
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 1
- GLLAUPMJCGKPFY-BLMTYFJBSA-N Ile-Ile-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 GLLAUPMJCGKPFY-BLMTYFJBSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- NUKXXNFEUZGPRO-BJDJZHNGSA-N Ile-Leu-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NUKXXNFEUZGPRO-BJDJZHNGSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 1
- RCMNUBZKIIJCOI-ZPFDUUQYSA-N Ile-Met-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RCMNUBZKIIJCOI-ZPFDUUQYSA-N 0.000 description 1
- AYLAAGNJNVZDPY-CYDGBPFRSA-N Ile-Met-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N AYLAAGNJNVZDPY-CYDGBPFRSA-N 0.000 description 1
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- USXAYNCLFSUSBA-MGHWNKPDSA-N Ile-Phe-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N USXAYNCLFSUSBA-MGHWNKPDSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 1
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 1
- XOZOSAUOGRPCES-STECZYCISA-N Ile-Pro-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XOZOSAUOGRPCES-STECZYCISA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- JJQQGCMKLOEGAV-OSUNSFLBSA-N Ile-Thr-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)O)N JJQQGCMKLOEGAV-OSUNSFLBSA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- JTBFQNHKNRZJDS-SYWGBEHUSA-N Ile-Trp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C)C(=O)O)N JTBFQNHKNRZJDS-SYWGBEHUSA-N 0.000 description 1
- RTSQPLLOYSGMKM-DSYPUSFNSA-N Ile-Trp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N RTSQPLLOYSGMKM-DSYPUSFNSA-N 0.000 description 1
- CRYJOCSSSACEAA-VKOGCVSHSA-N Ile-Trp-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N CRYJOCSSSACEAA-VKOGCVSHSA-N 0.000 description 1
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- NSPNUMNLZNOPAQ-SJWGOKEGSA-N Ile-Tyr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N NSPNUMNLZNOPAQ-SJWGOKEGSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 241000194036 Lactococcus Species 0.000 description 1
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 1
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 1
- AXZGZMGRBDQTEY-SRVKXCTJSA-N Leu-Gln-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O AXZGZMGRBDQTEY-SRVKXCTJSA-N 0.000 description 1
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 1
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 1
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 1
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 1
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 1
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 1
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- TVEOVCYCYGKVPP-HSCHXYMDSA-N Leu-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N TVEOVCYCYGKVPP-HSCHXYMDSA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 1
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 1
- AUNMOHYWTAPQLA-XUXIUFHCSA-N Leu-Met-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AUNMOHYWTAPQLA-XUXIUFHCSA-N 0.000 description 1
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 1
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 1
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 1
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 1
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 1
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 1
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 1
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- 102000015930 Lon proteases Human genes 0.000 description 1
- 239000006137 Luria-Bertani broth Substances 0.000 description 1
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 1
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- NTEVEUCLFMWSND-SRVKXCTJSA-N Lys-Arg-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O NTEVEUCLFMWSND-SRVKXCTJSA-N 0.000 description 1
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 1
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 1
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- ZAENPHCEQXALHO-GUBZILKMSA-N Lys-Cys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZAENPHCEQXALHO-GUBZILKMSA-N 0.000 description 1
- RLZDUFRBMQNYIJ-YUMQZZPRSA-N Lys-Cys-Gly Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N RLZDUFRBMQNYIJ-YUMQZZPRSA-N 0.000 description 1
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 1
- PGBPWPTUOSCNLE-JYJNAYRXSA-N Lys-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N PGBPWPTUOSCNLE-JYJNAYRXSA-N 0.000 description 1
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- ZASPELYMPSACER-HOCLYGCPSA-N Lys-Gly-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZASPELYMPSACER-HOCLYGCPSA-N 0.000 description 1
- KKFVKBWCXXLKIK-AVGNSLFASA-N Lys-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCCN)N KKFVKBWCXXLKIK-AVGNSLFASA-N 0.000 description 1
- GNLJXWBNLAIPEP-MELADBBJSA-N Lys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCCN)N)C(=O)O GNLJXWBNLAIPEP-MELADBBJSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 1
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 1
- MTBLFIQZECOEBY-IHRRRGAJSA-N Lys-Met-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O MTBLFIQZECOEBY-IHRRRGAJSA-N 0.000 description 1
- MTBBHUKKPWKXBT-ULQDDVLXSA-N Lys-Met-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MTBBHUKKPWKXBT-ULQDDVLXSA-N 0.000 description 1
- XFOAWKDQMRMCDN-ULQDDVLXSA-N Lys-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)CC1=CC=CC=C1 XFOAWKDQMRMCDN-ULQDDVLXSA-N 0.000 description 1
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 1
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 1
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 1
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- LKDXINHHSWFFJC-SRVKXCTJSA-N Lys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N LKDXINHHSWFFJC-SRVKXCTJSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- DYJOORGDQIGZAS-DCAQKATOSA-N Lys-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N DYJOORGDQIGZAS-DCAQKATOSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- RYOLKFYZBHMYFW-WDSOQIARSA-N Lys-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 RYOLKFYZBHMYFW-WDSOQIARSA-N 0.000 description 1
- ZVZRQKJOQQAFCF-ULQDDVLXSA-N Lys-Tyr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZVZRQKJOQQAFCF-ULQDDVLXSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 1
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 1
- 108700041567 MDR Genes Proteins 0.000 description 1
- 239000005913 Maltodextrin Substances 0.000 description 1
- 229920002774 Maltodextrin Polymers 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 241001293415 Mannheimia Species 0.000 description 1
- MVQGZYIOMXAFQG-GUBZILKMSA-N Met-Ala-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N MVQGZYIOMXAFQG-GUBZILKMSA-N 0.000 description 1
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- VTKPSXWRUGCOAC-GUBZILKMSA-N Met-Ala-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCSC VTKPSXWRUGCOAC-GUBZILKMSA-N 0.000 description 1
- JQEBITVYKUCBMC-SRVKXCTJSA-N Met-Arg-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JQEBITVYKUCBMC-SRVKXCTJSA-N 0.000 description 1
- DSWOTZCVCBEPOU-IUCAKERBSA-N Met-Arg-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCNC(N)=N DSWOTZCVCBEPOU-IUCAKERBSA-N 0.000 description 1
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 1
- NKDSBBBPGIVWEI-RCWTZXSCSA-N Met-Arg-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NKDSBBBPGIVWEI-RCWTZXSCSA-N 0.000 description 1
- NSGXXVIHCIAISP-CIUDSAMLSA-N Met-Asn-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O NSGXXVIHCIAISP-CIUDSAMLSA-N 0.000 description 1
- QXEVZBXTDTVPCP-GMOBBJLQSA-N Met-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCSC)N QXEVZBXTDTVPCP-GMOBBJLQSA-N 0.000 description 1
- YNOVBMBQSQTLFM-DCAQKATOSA-N Met-Asn-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O YNOVBMBQSQTLFM-DCAQKATOSA-N 0.000 description 1
- ZMYHJISLFYTQGK-FXQIFTODSA-N Met-Asp-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMYHJISLFYTQGK-FXQIFTODSA-N 0.000 description 1
- JQECLVNLAZGHRQ-CIUDSAMLSA-N Met-Asp-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O JQECLVNLAZGHRQ-CIUDSAMLSA-N 0.000 description 1
- XMMWDTUFTZMQFD-GMOBBJLQSA-N Met-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC XMMWDTUFTZMQFD-GMOBBJLQSA-N 0.000 description 1
- FVKRQMQQFGBXHV-QXEWZRGKSA-N Met-Asp-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FVKRQMQQFGBXHV-QXEWZRGKSA-N 0.000 description 1
- MYKLINMAGAIRPJ-CIUDSAMLSA-N Met-Gln-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MYKLINMAGAIRPJ-CIUDSAMLSA-N 0.000 description 1
- FWTBMGAKKPSTBT-GUBZILKMSA-N Met-Gln-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FWTBMGAKKPSTBT-GUBZILKMSA-N 0.000 description 1
- UOENBSHXYCHSAU-YUMQZZPRSA-N Met-Gln-Gly Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UOENBSHXYCHSAU-YUMQZZPRSA-N 0.000 description 1
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 1
- GPVLSVCBKUCEBI-KKUMJFAQSA-N Met-Gln-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GPVLSVCBKUCEBI-KKUMJFAQSA-N 0.000 description 1
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 description 1
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 1
- OGAZPKJHHZPYFK-GARJFASQSA-N Met-Glu-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGAZPKJHHZPYFK-GARJFASQSA-N 0.000 description 1
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 1
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 1
- JACAKCWAOHKQBV-UWVGGRQHSA-N Met-Gly-Lys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN JACAKCWAOHKQBV-UWVGGRQHSA-N 0.000 description 1
- CUICVBQQHMKBRJ-LSJOCFKGSA-N Met-His-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O CUICVBQQHMKBRJ-LSJOCFKGSA-N 0.000 description 1
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 1
- HZLSUXCMSIBCRV-RVMXOQNASA-N Met-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N HZLSUXCMSIBCRV-RVMXOQNASA-N 0.000 description 1
- FWAHLGXNBLWIKB-NAKRPEOUSA-N Met-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCSC FWAHLGXNBLWIKB-NAKRPEOUSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- QZPXMHVKPHJNTR-DCAQKATOSA-N Met-Leu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O QZPXMHVKPHJNTR-DCAQKATOSA-N 0.000 description 1
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 1
- ZIIMORLEZLVRIP-SRVKXCTJSA-N Met-Leu-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZIIMORLEZLVRIP-SRVKXCTJSA-N 0.000 description 1
- LBNFTWKGISQVEE-AVGNSLFASA-N Met-Leu-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCSC LBNFTWKGISQVEE-AVGNSLFASA-N 0.000 description 1
- AWGBEIYZPAXXSX-RWMBFGLXSA-N Met-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N AWGBEIYZPAXXSX-RWMBFGLXSA-N 0.000 description 1
- YYEIFXZOBZVDPH-DCAQKATOSA-N Met-Lys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O YYEIFXZOBZVDPH-DCAQKATOSA-N 0.000 description 1
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 1
- CGUYGMFQZCYJSG-DCAQKATOSA-N Met-Lys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CGUYGMFQZCYJSG-DCAQKATOSA-N 0.000 description 1
- WTHGNAAQXISJHP-AVGNSLFASA-N Met-Lys-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WTHGNAAQXISJHP-AVGNSLFASA-N 0.000 description 1
- FMYLZGQFKPHXHI-GUBZILKMSA-N Met-Met-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O FMYLZGQFKPHXHI-GUBZILKMSA-N 0.000 description 1
- OXIWIYOJVNOKOV-SRVKXCTJSA-N Met-Met-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCNC(N)=N OXIWIYOJVNOKOV-SRVKXCTJSA-N 0.000 description 1
- XOFDBXYPKZUAAM-GUBZILKMSA-N Met-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N XOFDBXYPKZUAAM-GUBZILKMSA-N 0.000 description 1
- CNAGWYQWQDMUGC-IHRRRGAJSA-N Met-Phe-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CNAGWYQWQDMUGC-IHRRRGAJSA-N 0.000 description 1
- FBLBCGLSRXBANI-KKUMJFAQSA-N Met-Phe-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FBLBCGLSRXBANI-KKUMJFAQSA-N 0.000 description 1
- MPCKIRSXNKACRF-GUBZILKMSA-N Met-Pro-Asn Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O MPCKIRSXNKACRF-GUBZILKMSA-N 0.000 description 1
- QLESZRANMSYLCZ-CYDGBPFRSA-N Met-Pro-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QLESZRANMSYLCZ-CYDGBPFRSA-N 0.000 description 1
- FNYBIOGBMWFQRJ-SRVKXCTJSA-N Met-Pro-Met Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N FNYBIOGBMWFQRJ-SRVKXCTJSA-N 0.000 description 1
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 1
- KSIPKXNIQOWMIC-RCWTZXSCSA-N Met-Thr-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KSIPKXNIQOWMIC-RCWTZXSCSA-N 0.000 description 1
- KYJHWKAMFISDJE-RCWTZXSCSA-N Met-Thr-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCSC KYJHWKAMFISDJE-RCWTZXSCSA-N 0.000 description 1
- JHVNNUIQXOGAHI-KJEVXHAQSA-N Met-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N)O JHVNNUIQXOGAHI-KJEVXHAQSA-N 0.000 description 1
- VWFHWJGVLVZVIS-QXEWZRGKSA-N Met-Val-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O VWFHWJGVLVZVIS-QXEWZRGKSA-N 0.000 description 1
- KPVLLNDCBYXKNV-CYDGBPFRSA-N Met-Val-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KPVLLNDCBYXKNV-CYDGBPFRSA-N 0.000 description 1
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 1
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 1
- PVSPJQWHEIQTEH-JYJNAYRXSA-N Met-Val-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PVSPJQWHEIQTEH-JYJNAYRXSA-N 0.000 description 1
- 101000842985 Metallosphaera sedula (strain ATCC 51363 / DSM 5348 / JCM 9185 / NBRC 15509 / TH2) 4-hydroxybutyrate-CoA ligase 1 Proteins 0.000 description 1
- 101000842982 Metallosphaera sedula (strain ATCC 51363 / DSM 5348 / JCM 9185 / NBRC 15509 / TH2) 4-hydroxybutyrate-CoA ligase 2 Proteins 0.000 description 1
- 241001302042 Methanothermobacter thermautotrophicus Species 0.000 description 1
- 235000016357 Mirtillo rosso Nutrition 0.000 description 1
- 240000005561 Musa balbisiana Species 0.000 description 1
- 235000018290 Musa x paradisiaca Nutrition 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 108020004485 Nonsense Codon Proteins 0.000 description 1
- 208000012868 Overgrowth Diseases 0.000 description 1
- 241001236817 Paecilomyces <Clavicipitaceae> Species 0.000 description 1
- 235000019482 Palm oil Nutrition 0.000 description 1
- 241001520808 Panicum virgatum Species 0.000 description 1
- 241001057811 Paracoccus <mealybug> Species 0.000 description 1
- 241000218196 Persea Species 0.000 description 1
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- HXSUFWQYLPKEHF-IHRRRGAJSA-N Phe-Asn-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HXSUFWQYLPKEHF-IHRRRGAJSA-N 0.000 description 1
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 1
- MFQXSDWKUXTOPZ-DZKIICNBSA-N Phe-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N MFQXSDWKUXTOPZ-DZKIICNBSA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 1
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 1
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 1
- OYQBFWWQSVIHBN-FHWLQOOXSA-N Phe-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OYQBFWWQSVIHBN-FHWLQOOXSA-N 0.000 description 1
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 1
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 1
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 1
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 1
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 1
- OVJMCXAPGFDGMG-HKUYNNGSSA-N Phe-Gly-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OVJMCXAPGFDGMG-HKUYNNGSSA-N 0.000 description 1
- RVRRHFPCEOVRKQ-KKUMJFAQSA-N Phe-His-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RVRRHFPCEOVRKQ-KKUMJFAQSA-N 0.000 description 1
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 1
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 1
- MIICYIIBVYQNKE-QEWYBTABSA-N Phe-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N MIICYIIBVYQNKE-QEWYBTABSA-N 0.000 description 1
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- KPEIBEPEUAZWNS-ULQDDVLXSA-N Phe-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KPEIBEPEUAZWNS-ULQDDVLXSA-N 0.000 description 1
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 1
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 1
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 1
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 1
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 1
- SRILZRSXIKRGBF-HRCADAONSA-N Phe-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N SRILZRSXIKRGBF-HRCADAONSA-N 0.000 description 1
- FENSZYFJQOFSQR-FIRPJDEBSA-N Phe-Phe-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FENSZYFJQOFSQR-FIRPJDEBSA-N 0.000 description 1
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 1
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 1
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 1
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 1
- AGTHXWTYCLLYMC-FHWLQOOXSA-N Phe-Tyr-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 AGTHXWTYCLLYMC-FHWLQOOXSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- 108700024327 Phosphate butyryltransferases Proteins 0.000 description 1
- 235000014676 Phragmites communis Nutrition 0.000 description 1
- 241000209504 Poaceae Species 0.000 description 1
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 1
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 1
- SMCHPSMKAFIERP-FXQIFTODSA-N Pro-Asn-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 SMCHPSMKAFIERP-FXQIFTODSA-N 0.000 description 1
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 1
- AHXPYZRZRMQOAU-QXEWZRGKSA-N Pro-Asn-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1)C(O)=O AHXPYZRZRMQOAU-QXEWZRGKSA-N 0.000 description 1
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- QVIZLAUEAMQKGS-GUBZILKMSA-N Pro-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 QVIZLAUEAMQKGS-GUBZILKMSA-N 0.000 description 1
- OZAPWFHRPINHND-GUBZILKMSA-N Pro-Cys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OZAPWFHRPINHND-GUBZILKMSA-N 0.000 description 1
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- ZTVCLZLGHZXLOT-ULQDDVLXSA-N Pro-Glu-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O ZTVCLZLGHZXLOT-ULQDDVLXSA-N 0.000 description 1
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 1
- BWCZJGJKOFUUCN-ZPFDUUQYSA-N Pro-Ile-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O BWCZJGJKOFUUCN-ZPFDUUQYSA-N 0.000 description 1
- KWMUAKQOVYCQJQ-ZPFDUUQYSA-N Pro-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 KWMUAKQOVYCQJQ-ZPFDUUQYSA-N 0.000 description 1
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 1
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 1
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 1
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 1
- SXMSEHDMNIUTSP-DCAQKATOSA-N Pro-Lys-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SXMSEHDMNIUTSP-DCAQKATOSA-N 0.000 description 1
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 1
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 1
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 1
- NTXFLJULRHQMDC-GUBZILKMSA-N Pro-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 NTXFLJULRHQMDC-GUBZILKMSA-N 0.000 description 1
- QGLFRQCECIWXFA-RCWTZXSCSA-N Pro-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1)O QGLFRQCECIWXFA-RCWTZXSCSA-N 0.000 description 1
- VGVCNKSUVSZEIE-IHRRRGAJSA-N Pro-Phe-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O VGVCNKSUVSZEIE-IHRRRGAJSA-N 0.000 description 1
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 1
- SWRNSCMUXRLHCR-ULQDDVLXSA-N Pro-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 SWRNSCMUXRLHCR-ULQDDVLXSA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 1
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 1
- FZXSYIPVAFVYBH-KKUMJFAQSA-N Pro-Tyr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O FZXSYIPVAFVYBH-KKUMJFAQSA-N 0.000 description 1
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 1
- DYJTXTCEXMCPBF-UFYCRDLUSA-N Pro-Tyr-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O DYJTXTCEXMCPBF-UFYCRDLUSA-N 0.000 description 1
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 1
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 1
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 1
- 108010023294 Protease La Proteins 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 235000011158 Prunus mume Nutrition 0.000 description 1
- 244000018795 Prunus mume Species 0.000 description 1
- 241000168053 Pseudomonas denitrificans (nomen rejiciendum) Species 0.000 description 1
- 241000589540 Pseudomonas fluorescens Species 0.000 description 1
- 241001467519 Pyrococcus sp. Species 0.000 description 1
- 241000220324 Pyrus Species 0.000 description 1
- 235000019484 Rapeseed oil Nutrition 0.000 description 1
- 241000589126 Rhizobium phaseoli Species 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- RZUOXAKGNHXZTB-GUBZILKMSA-N Ser-Arg-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O RZUOXAKGNHXZTB-GUBZILKMSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- HZWAHWQZPSXNCB-BPUTZDHNSA-N Ser-Arg-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HZWAHWQZPSXNCB-BPUTZDHNSA-N 0.000 description 1
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 1
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- KNCJWSPMTFFJII-ZLUOBGJFSA-N Ser-Cys-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KNCJWSPMTFFJII-ZLUOBGJFSA-N 0.000 description 1
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 1
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 1
- YMAWDPHQVABADW-CIUDSAMLSA-N Ser-Gln-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O YMAWDPHQVABADW-CIUDSAMLSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- VDVYTKZBMFADQH-AVGNSLFASA-N Ser-Gln-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VDVYTKZBMFADQH-AVGNSLFASA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- UGHCUDLCCVVIJR-VGDYDELISA-N Ser-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N UGHCUDLCCVVIJR-VGDYDELISA-N 0.000 description 1
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 1
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 1
- CJINPXGSKSZQNE-KBIXCLLPSA-N Ser-Ile-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O CJINPXGSKSZQNE-KBIXCLLPSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 1
- YMDNFPNTIPQMJP-NAKRPEOUSA-N Ser-Ile-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O YMDNFPNTIPQMJP-NAKRPEOUSA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- OCWWJBZQXGYQCA-DCAQKATOSA-N Ser-Lys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O OCWWJBZQXGYQCA-DCAQKATOSA-N 0.000 description 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 1
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 1
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- QNBVFKZSSRYNFX-CUJWVEQBSA-N Ser-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N)O QNBVFKZSSRYNFX-CUJWVEQBSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 1
- 102100035766 Short/branched chain specific acyl-CoA dehydrogenase, mitochondrial Human genes 0.000 description 1
- 240000003768 Solanum lycopersicum Species 0.000 description 1
- 244000057717 Streptococcus lactis Species 0.000 description 1
- 235000014897 Streptococcus lactis Nutrition 0.000 description 1
- 108091027544 Subgenomic mRNA Proteins 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 235000019486 Sunflower oil Nutrition 0.000 description 1
- 241000205180 Thermococcus litoralis Species 0.000 description 1
- 241000245949 Thermococcus profundus Species 0.000 description 1
- 241001495444 Thermococcus sp. Species 0.000 description 1
- 241000223257 Thermomyces Species 0.000 description 1
- 241000589499 Thermus thermophilus Species 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 1
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 1
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 1
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 1
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 1
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 1
- IRKWVRSEQFTGGV-VEVYYDQMSA-N Thr-Asn-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IRKWVRSEQFTGGV-VEVYYDQMSA-N 0.000 description 1
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- APIQKJYZDWVOCE-VEVYYDQMSA-N Thr-Asp-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O APIQKJYZDWVOCE-VEVYYDQMSA-N 0.000 description 1
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 1
- ZUUDNCOCILSYAM-KKHAAJSZSA-N Thr-Asp-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZUUDNCOCILSYAM-KKHAAJSZSA-N 0.000 description 1
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 1
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 1
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- AYCQVUUPIJHJTA-IXOXFDKPSA-N Thr-His-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O AYCQVUUPIJHJTA-IXOXFDKPSA-N 0.000 description 1
- FKIGTIXHSRNKJU-IXOXFDKPSA-N Thr-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CN=CN1 FKIGTIXHSRNKJU-IXOXFDKPSA-N 0.000 description 1
- KRGDDWVBBDLPSJ-CUJWVEQBSA-N Thr-His-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O KRGDDWVBBDLPSJ-CUJWVEQBSA-N 0.000 description 1
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 1
- ZXIHABSKUITPTN-IXOXFDKPSA-N Thr-Lys-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O ZXIHABSKUITPTN-IXOXFDKPSA-N 0.000 description 1
- WFAUDCSNCWJJAA-KXNHARMFSA-N Thr-Lys-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(O)=O WFAUDCSNCWJJAA-KXNHARMFSA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- CGCMNOIQVAXYMA-UNQGMJICSA-N Thr-Met-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CGCMNOIQVAXYMA-UNQGMJICSA-N 0.000 description 1
- SIEZEMFJLYRUMK-YTWAJWBKSA-N Thr-Met-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N)O SIEZEMFJLYRUMK-YTWAJWBKSA-N 0.000 description 1
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- YGZWVPBHYABGLT-KJEVXHAQSA-N Thr-Pro-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YGZWVPBHYABGLT-KJEVXHAQSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- VGNLMPBYWWNQFS-ZEILLAHLSA-N Thr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O VGNLMPBYWWNQFS-ZEILLAHLSA-N 0.000 description 1
- OMRWDMWXRWTQIU-YJRXYDGGSA-N Thr-Tyr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N)O OMRWDMWXRWTQIU-YJRXYDGGSA-N 0.000 description 1
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 1
- CJEHCEOXPLASCK-MEYUZBJRSA-N Thr-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=C(O)C=C1 CJEHCEOXPLASCK-MEYUZBJRSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 1
- CURFABYITJVKEW-QTKMDUPCSA-N Thr-Val-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O CURFABYITJVKEW-QTKMDUPCSA-N 0.000 description 1
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- BTAJAOWZCWOHBU-HSHDSVGOSA-N Thr-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)C(C)C)C(O)=O)=CNC2=C1 BTAJAOWZCWOHBU-HSHDSVGOSA-N 0.000 description 1
- OETOOJXFNSEYHQ-WFBYXXMGSA-N Trp-Ala-Asp Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 OETOOJXFNSEYHQ-WFBYXXMGSA-N 0.000 description 1
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 1
- BIJDDZBDSJLWJY-PJODQICGSA-N Trp-Ala-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O BIJDDZBDSJLWJY-PJODQICGSA-N 0.000 description 1
- MHNHRNHJMXAVHZ-AAEUAGOBSA-N Trp-Asn-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N MHNHRNHJMXAVHZ-AAEUAGOBSA-N 0.000 description 1
- BXKWZPXTTSCOMX-AQZXSJQPSA-N Trp-Asn-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXKWZPXTTSCOMX-AQZXSJQPSA-N 0.000 description 1
- KVMZNMYZCKORIG-UBHSHLNASA-N Trp-Cys-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KVMZNMYZCKORIG-UBHSHLNASA-N 0.000 description 1
- OBWQLWYNNZPWGX-QEJZJMRPSA-N Trp-Gln-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OBWQLWYNNZPWGX-QEJZJMRPSA-N 0.000 description 1
- ZJKZLNAECPIUTL-JBACZVJFSA-N Trp-Gln-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 ZJKZLNAECPIUTL-JBACZVJFSA-N 0.000 description 1
- OTWIOROMZLNAQC-XIRDDKMYSA-N Trp-His-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OTWIOROMZLNAQC-XIRDDKMYSA-N 0.000 description 1
- RXEQOXHCHQJMSO-IHPCNDPISA-N Trp-His-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O RXEQOXHCHQJMSO-IHPCNDPISA-N 0.000 description 1
- KULBQAVOXHQLIY-HSCHXYMDSA-N Trp-Ile-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 KULBQAVOXHQLIY-HSCHXYMDSA-N 0.000 description 1
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 1
- YVXIAOOYAKBAAI-SZMVWBNQSA-N Trp-Leu-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 YVXIAOOYAKBAAI-SZMVWBNQSA-N 0.000 description 1
- RIKLKPANMFNREP-FDARSICLSA-N Trp-Met-Ile Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)=CNC2=C1 RIKLKPANMFNREP-FDARSICLSA-N 0.000 description 1
- SUEGAFMNTXXNLR-WFBYXXMGSA-N Trp-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O SUEGAFMNTXXNLR-WFBYXXMGSA-N 0.000 description 1
- GBEAUNVBIMLWIB-IHPCNDPISA-N Trp-Ser-Phe Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 GBEAUNVBIMLWIB-IHPCNDPISA-N 0.000 description 1
- ARKBYVBCEOWRNR-UBHSHLNASA-N Trp-Ser-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O ARKBYVBCEOWRNR-UBHSHLNASA-N 0.000 description 1
- XKTWZYNTLXITCY-QRTARXTBSA-N Trp-Val-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 XKTWZYNTLXITCY-QRTARXTBSA-N 0.000 description 1
- SWSUXOKZKQRADK-FDARSICLSA-N Trp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SWSUXOKZKQRADK-FDARSICLSA-N 0.000 description 1
- 108010069584 Type III Secretion Systems Proteins 0.000 description 1
- NSOMQRHZMJMZIE-GVARAGBVSA-N Tyr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NSOMQRHZMJMZIE-GVARAGBVSA-N 0.000 description 1
- KSVMDJJCYKIXTK-IGNZVWTISA-N Tyr-Ala-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 KSVMDJJCYKIXTK-IGNZVWTISA-N 0.000 description 1
- WTXQBCCKXIKKHB-JYJNAYRXSA-N Tyr-Arg-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WTXQBCCKXIKKHB-JYJNAYRXSA-N 0.000 description 1
- KDGFPPHLXCEQRN-STECZYCISA-N Tyr-Arg-Ile Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDGFPPHLXCEQRN-STECZYCISA-N 0.000 description 1
- DYEGCOJHFNJBKB-UFYCRDLUSA-N Tyr-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 DYEGCOJHFNJBKB-UFYCRDLUSA-N 0.000 description 1
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 1
- XMNDQSYABVWZRK-BZSNNMDCSA-N Tyr-Asn-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XMNDQSYABVWZRK-BZSNNMDCSA-N 0.000 description 1
- BARBHMSSVWPKPZ-IHRRRGAJSA-N Tyr-Asp-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BARBHMSSVWPKPZ-IHRRRGAJSA-N 0.000 description 1
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 1
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 1
- QHEGAOPHISYNDF-XDTLVQLUSA-N Tyr-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHEGAOPHISYNDF-XDTLVQLUSA-N 0.000 description 1
- FXYOYUMPUJONGW-FHWLQOOXSA-N Tyr-Gln-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 FXYOYUMPUJONGW-FHWLQOOXSA-N 0.000 description 1
- XQYHLZNPOTXRMQ-KKUMJFAQSA-N Tyr-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XQYHLZNPOTXRMQ-KKUMJFAQSA-N 0.000 description 1
- IWRMTNJCCMEBEX-AVGNSLFASA-N Tyr-Glu-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O IWRMTNJCCMEBEX-AVGNSLFASA-N 0.000 description 1
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 1
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- AXWBYOVVDRBOGU-SIUGBPQLSA-N Tyr-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AXWBYOVVDRBOGU-SIUGBPQLSA-N 0.000 description 1
- OHOVFPKXPZODHS-SJWGOKEGSA-N Tyr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OHOVFPKXPZODHS-SJWGOKEGSA-N 0.000 description 1
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 1
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- OFHKXNKJXURPSY-ULQDDVLXSA-N Tyr-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O OFHKXNKJXURPSY-ULQDDVLXSA-N 0.000 description 1
- OGPKMBOPMDTEDM-IHRRRGAJSA-N Tyr-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N OGPKMBOPMDTEDM-IHRRRGAJSA-N 0.000 description 1
- BGFCXQXETBDEHP-BZSNNMDCSA-N Tyr-Phe-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O BGFCXQXETBDEHP-BZSNNMDCSA-N 0.000 description 1
- FASACHWGQBNSRO-ZEWNOJEFSA-N Tyr-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FASACHWGQBNSRO-ZEWNOJEFSA-N 0.000 description 1
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 1
- ARMNWLJYHCOSHE-KKUMJFAQSA-N Tyr-Pro-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O ARMNWLJYHCOSHE-KKUMJFAQSA-N 0.000 description 1
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 1
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 1
- KWKJGBHDYJOVCR-SRVKXCTJSA-N Tyr-Ser-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O KWKJGBHDYJOVCR-SRVKXCTJSA-N 0.000 description 1
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 1
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 1
- RIVVDNTUSRVTQT-IRIUXVKKSA-N Tyr-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O RIVVDNTUSRVTQT-IRIUXVKKSA-N 0.000 description 1
- JHDZONWZTCKTJR-KJEVXHAQSA-N Tyr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JHDZONWZTCKTJR-KJEVXHAQSA-N 0.000 description 1
- ULUXAIYMVXLDQP-PMVMPFDFSA-N Tyr-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)[C@H](CC4=CC=C(C=C4)O)N ULUXAIYMVXLDQP-PMVMPFDFSA-N 0.000 description 1
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 1
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 1
- SMUWZUSWMWVOSL-JYJNAYRXSA-N Tyr-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SMUWZUSWMWVOSL-JYJNAYRXSA-N 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- 235000012511 Vaccinium Nutrition 0.000 description 1
- 241000736767 Vaccinium Species 0.000 description 1
- 240000000851 Vaccinium corymbosum Species 0.000 description 1
- 235000003095 Vaccinium corymbosum Nutrition 0.000 description 1
- 235000017537 Vaccinium myrtillus Nutrition 0.000 description 1
- 235000017606 Vaccinium vitis idaea Nutrition 0.000 description 1
- 244000077923 Vaccinium vitis idaea Species 0.000 description 1
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 1
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- COSLEEOIYRPTHD-YDHLFZDLSA-N Val-Asp-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 COSLEEOIYRPTHD-YDHLFZDLSA-N 0.000 description 1
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 1
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 1
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 1
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 1
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 1
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 1
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- WDIWOIRFNMLNKO-ULQDDVLXSA-N Val-Leu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WDIWOIRFNMLNKO-ULQDDVLXSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- RSGHLMMKXJGCMK-JYJNAYRXSA-N Val-Met-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N RSGHLMMKXJGCMK-JYJNAYRXSA-N 0.000 description 1
- QPPZEDOTPZOSEC-RCWTZXSCSA-N Val-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N)O QPPZEDOTPZOSEC-RCWTZXSCSA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 1
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 1
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 1
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 1
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 1
- GTACFKZDQFTVAI-STECZYCISA-N Val-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 GTACFKZDQFTVAI-STECZYCISA-N 0.000 description 1
- PDASTHRLDFOZMG-JYJNAYRXSA-N Val-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 PDASTHRLDFOZMG-JYJNAYRXSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- WHNSHJJNWNSTSU-BZSNNMDCSA-N Val-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 WHNSHJJNWNSTSU-BZSNNMDCSA-N 0.000 description 1
- 235000006582 Vigna radiata Nutrition 0.000 description 1
- 235000010721 Vigna radiata var radiata Nutrition 0.000 description 1
- 241000588902 Zymomonas mobilis Species 0.000 description 1
- IAXXETNIOYFMLW-COPLHBTASA-N [(1s,3s,4s)-4,7,7-trimethyl-3-bicyclo[2.2.1]heptanyl] 2-methylprop-2-enoate Chemical compound C1C[C@]2(C)[C@@H](OC(=O)C(=C)C)C[C@H]1C2(C)C IAXXETNIOYFMLW-COPLHBTASA-N 0.000 description 1
- UGXQOOQUZRUVSS-ZZXKWVIFSA-N [5-[3,5-dihydroxy-2-(1,3,4-trihydroxy-5-oxopentan-2-yl)oxyoxan-4-yl]oxy-3,4-dihydroxyoxolan-2-yl]methyl (e)-3-(4-hydroxyphenyl)prop-2-enoate Chemical compound OC1C(OC(CO)C(O)C(O)C=O)OCC(O)C1OC1C(O)C(O)C(COC(=O)\C=C\C=2C=CC(O)=CC=2)O1 UGXQOOQUZRUVSS-ZZXKWVIFSA-N 0.000 description 1
- 241000029538 [Mannheimia] succiniciproducens Species 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 101150066706 acrA gene Proteins 0.000 description 1
- 229940023020 acriflavine Drugs 0.000 description 1
- 125000002015 acyclic group Chemical group 0.000 description 1
- 210000005006 adaptive immune system Anatomy 0.000 description 1
- 239000000853 adhesive Substances 0.000 description 1
- 230000001070 adhesive effect Effects 0.000 description 1
- 238000005273 aeration Methods 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 239000003570 air Substances 0.000 description 1
- 108010028939 alanyl-alanyl-lysyl-alanine Proteins 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 125000005024 alkenyl aryl group Chemical group 0.000 description 1
- 125000003342 alkenyl group Chemical group 0.000 description 1
- 125000002877 alkyl aryl group Chemical group 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- 125000000266 alpha-aminoacyl group Chemical group 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 229910021529 ammonia Inorganic materials 0.000 description 1
- 150000003863 ammonium salts Chemical class 0.000 description 1
- 230000001195 anabolic effect Effects 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 229940121375 antifungal agent Drugs 0.000 description 1
- 239000003963 antioxidant agent Substances 0.000 description 1
- 235000006708 antioxidants Nutrition 0.000 description 1
- 229920000617 arabinoxylan Polymers 0.000 description 1
- 108010080488 arginyl-arginyl-leucine Proteins 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010094001 arginyl-tryptophyl-arginine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 150000007860 aryl ester derivatives Chemical class 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N aspartic acid group Chemical group N[C@@H](CC(=O)O)C(=O)O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- FIXLYHHVMHXSCP-UHFFFAOYSA-H azane;dihydroxy(dioxo)molybdenum;trioxomolybdenum;tetrahydrate Chemical compound N.N.N.N.N.N.O.O.O.O.O=[Mo](=O)=O.O=[Mo](=O)=O.O=[Mo](=O)=O.O=[Mo](=O)=O.O[Mo](O)(=O)=O.O[Mo](O)(=O)=O.O[Mo](O)(=O)=O FIXLYHHVMHXSCP-UHFFFAOYSA-H 0.000 description 1
- GUBGYTABKSRVRQ-QUYVBRFLSA-N beta-maltose Chemical compound OC[C@H]1O[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@H](O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@@H]1O GUBGYTABKSRVRQ-QUYVBRFLSA-N 0.000 description 1
- 239000011942 biocatalyst Substances 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 235000021014 blueberries Nutrition 0.000 description 1
- 210000000988 bone and bone Anatomy 0.000 description 1
- 101150111656 bssR gene Proteins 0.000 description 1
- 230000005587 bubbling Effects 0.000 description 1
- 229960002092 busulfan Drugs 0.000 description 1
- 239000001273 butane Substances 0.000 description 1
- 235000013844 butane Nutrition 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 229940041514 candida albicans extract Drugs 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 230000006315 carbonylation Effects 0.000 description 1
- 238000005810 carbonylation reaction Methods 0.000 description 1
- 238000005266 casting Methods 0.000 description 1
- 239000004359 castor oil Substances 0.000 description 1
- 235000019438 castor oil Nutrition 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 230000030833 cell death Effects 0.000 description 1
- 230000019522 cellular metabolic process Effects 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000001311 chemical methods and process Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- JCKYGMPEJWAADB-UHFFFAOYSA-N chlorambucil Chemical compound OC(=O)CCCC1=CC=C(N(CCCl)CCCl)C=C1 JCKYGMPEJWAADB-UHFFFAOYSA-N 0.000 description 1
- 229960004630 chlorambucil Drugs 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 235000019868 cocoa butter Nutrition 0.000 description 1
- 229940110456 cocoa butter Drugs 0.000 description 1
- 239000003240 coconut oil Substances 0.000 description 1
- 235000019864 coconut oil Nutrition 0.000 description 1
- 235000017471 coenzyme Q10 Nutrition 0.000 description 1
- ACTIUHUUMQJHFO-UPTCCGCDSA-N coenzyme Q10 Chemical compound COC1=C(OC)C(=O)C(C\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CCC=C(C)C)=C(C)C1=O ACTIUHUUMQJHFO-UPTCCGCDSA-N 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- JZCCFEFSEZPSOG-UHFFFAOYSA-L copper(II) sulfate pentahydrate Chemical compound O.O.O.O.O.[Cu+2].[O-]S([O-])(=O)=O JZCCFEFSEZPSOG-UHFFFAOYSA-L 0.000 description 1
- 229960004397 cyclophosphamide Drugs 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000012350 deep sequencing Methods 0.000 description 1
- 230000007123 defense Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 235000013367 dietary fats Nutrition 0.000 description 1
- 235000014113 dietary fatty acids Nutrition 0.000 description 1
- DENRZWYUOJLTMF-UHFFFAOYSA-N diethyl sulfate Chemical compound CCOS(=O)(=O)OCC DENRZWYUOJLTMF-UHFFFAOYSA-N 0.000 description 1
- 229940008406 diethyl sulfate Drugs 0.000 description 1
- 238000006471 dimerization reaction Methods 0.000 description 1
- 150000004844 dioxiranes Chemical class 0.000 description 1
- 235000019301 disodium ethylene diamine tetraacetate Nutrition 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 239000003480 eluent Substances 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 125000004185 ester group Chemical group 0.000 description 1
- 230000032050 esterification Effects 0.000 description 1
- 238000005886 esterification reaction Methods 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 229930195729 fatty acid Natural products 0.000 description 1
- 239000000194 fatty acid Substances 0.000 description 1
- 150000004665 fatty acids Chemical class 0.000 description 1
- 210000003608 fece Anatomy 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 235000019688 fish Nutrition 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 239000002803 fossil fuel Substances 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 230000008014 freezing Effects 0.000 description 1
- 238000007710 freezing Methods 0.000 description 1
- 235000011389 fruit/vegetable juice Nutrition 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 238000010363 gene targeting Methods 0.000 description 1
- 102000034356 gene-regulatory proteins Human genes 0.000 description 1
- 108091006104 gene-regulatory proteins Proteins 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 239000010520 ghee Substances 0.000 description 1
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 1
- ZEMPKEQAKRGZGQ-XOQCFJPHSA-N glycerol triricinoleate Natural products CCCCCC[C@@H](O)CC=CCCCCCCCC(=O)OC[C@@H](COC(=O)CCCCCCCC=CC[C@@H](O)CCCCCC)OC(=O)CCCCCCCC=CC[C@H](O)CCCCCC ZEMPKEQAKRGZGQ-XOQCFJPHSA-N 0.000 description 1
- 229940096919 glycogen Drugs 0.000 description 1
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010038983 glycyl-histidyl-lysine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010017446 glycyl-prolyl-arginyl-proline Proteins 0.000 description 1
- 239000000383 hazardous chemical Substances 0.000 description 1
- 210000004536 heart mitochondria Anatomy 0.000 description 1
- MDNFYIAABKQDML-UHFFFAOYSA-N heptyl 2-methylprop-2-enoate Chemical compound CCCCCCCOC(=O)C(C)=C MDNFYIAABKQDML-UHFFFAOYSA-N 0.000 description 1
- 235000019514 herring Nutrition 0.000 description 1
- GNOIPBMMFNIUFM-UHFFFAOYSA-N hexamethylphosphoric triamide Chemical compound CN(C)P(=O)(N(C)C)N(C)C GNOIPBMMFNIUFM-UHFFFAOYSA-N 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010050343 histidyl-alanyl-glutamine Proteins 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 125000002768 hydroxyalkyl group Chemical group 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- SURQXAFEQWPFPV-UHFFFAOYSA-L iron(2+) sulfate heptahydrate Chemical compound O.O.O.O.O.O.O.[Fe+2].[O-]S([O-])(=O)=O SURQXAFEQWPFPV-UHFFFAOYSA-L 0.000 description 1
- BKWBIMSGEOYWCJ-UHFFFAOYSA-L iron;iron(2+);sulfanide Chemical compound [SH-].[SH-].[Fe].[Fe+2] BKWBIMSGEOYWCJ-UHFFFAOYSA-L 0.000 description 1
- 229940119545 isobornyl methacrylate Drugs 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 239000010806 kitchen waste Substances 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 239000000944 linseed oil Substances 0.000 description 1
- 235000021388 linseed oil Nutrition 0.000 description 1
- FCCDDURTIIUXBY-UHFFFAOYSA-N lipoamide Chemical compound NC(=O)CCCCC1CCSS1 FCCDDURTIIUXBY-UHFFFAOYSA-N 0.000 description 1
- 239000010871 livestock manure Substances 0.000 description 1
- 239000000314 lubricant Substances 0.000 description 1
- 108010076718 lysyl-glutamyl-tryptophan Proteins 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 159000000003 magnesium salts Chemical class 0.000 description 1
- WRUGWIBCXHJTDG-UHFFFAOYSA-L magnesium sulfate heptahydrate Chemical compound O.O.O.O.O.O.O.[Mg+2].[O-]S([O-])(=O)=O WRUGWIBCXHJTDG-UHFFFAOYSA-L 0.000 description 1
- 229940035034 maltodextrin Drugs 0.000 description 1
- 101150008274 marA gene Proteins 0.000 description 1
- 101150069102 marB gene Proteins 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 239000012533 medium component Substances 0.000 description 1
- 229960001924 melphalan Drugs 0.000 description 1
- SGDBTWWWUNNDEQ-LBPRGKRZSA-N melphalan Chemical compound OC(=O)[C@@H](N)CC1=CC=C(N(CCCl)CCCl)C=C1 SGDBTWWWUNNDEQ-LBPRGKRZSA-N 0.000 description 1
- 230000037353 metabolic pathway Effects 0.000 description 1
- 230000031864 metaphase Effects 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- 108010090114 methionyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 235000013379 molasses Nutrition 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 238000000465 moulding Methods 0.000 description 1
- IJDNQMDRQITEOD-UHFFFAOYSA-N n-butane Chemical compound CCCC IJDNQMDRQITEOD-UHFFFAOYSA-N 0.000 description 1
- OFBQJSOFQDEBGM-UHFFFAOYSA-N n-pentane Natural products CCCCC OFBQJSOFQDEBGM-UHFFFAOYSA-N 0.000 description 1
- 230000032965 negative regulation of cell volume Effects 0.000 description 1
- 150000002823 nitrates Chemical class 0.000 description 1
- QJGQUHMNIGDVPM-UHFFFAOYSA-N nitrogen group Chemical group [N] QJGQUHMNIGDVPM-UHFFFAOYSA-N 0.000 description 1
- 230000001293 nucleolytic effect Effects 0.000 description 1
- NZIDBRBFGPQCRY-UHFFFAOYSA-N octyl 2-methylprop-2-enoate Chemical compound CCCCCCCCOC(=O)C(C)=C NZIDBRBFGPQCRY-UHFFFAOYSA-N 0.000 description 1
- 235000014593 oils and fats Nutrition 0.000 description 1
- 239000004006 olive oil Substances 0.000 description 1
- 235000008390 olive oil Nutrition 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000036542 oxidative stress Effects 0.000 description 1
- 230000020477 pH reduction Effects 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 239000003973 paint Substances 0.000 description 1
- 239000002540 palm oil Substances 0.000 description 1
- 239000001814 pectin Substances 0.000 description 1
- 229920001277 pectin Polymers 0.000 description 1
- 235000010987 pectin Nutrition 0.000 description 1
- 108010054300 peroxisomal acyl-CoA oxidase Proteins 0.000 description 1
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- CPTBDICYNRMXFX-UHFFFAOYSA-N procarbazine Chemical compound CNNCC1=CC=C(C(=O)NC(C)C)C=C1 CPTBDICYNRMXFX-UHFFFAOYSA-N 0.000 description 1
- 229960000624 procarbazine Drugs 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- FBCQUCJYYPMKRO-UHFFFAOYSA-N prop-2-enyl 2-methylprop-2-enoate Chemical compound CC(=C)C(=O)OCC=C FBCQUCJYYPMKRO-UHFFFAOYSA-N 0.000 description 1
- 239000001294 propane Substances 0.000 description 1
- 235000013849 propane Nutrition 0.000 description 1
- 108010033058 propionate - CoA ligase Proteins 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 230000004853 protein function Effects 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 101150013718 pspD gene Proteins 0.000 description 1
- 239000005297 pyrex Substances 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- NPCOQXAVBJJZBQ-UHFFFAOYSA-N reduced coenzyme Q9 Natural products COC1=C(O)C(C)=C(CC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)C)C(O)=C1OC NPCOQXAVBJJZBQ-UHFFFAOYSA-N 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000027756 respiratory electron transport chain Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 239000010801 sewage sludge Substances 0.000 description 1
- 239000004460 silage Substances 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 230000000392 somatic effect Effects 0.000 description 1
- 239000003549 soybean oil Substances 0.000 description 1
- 235000012424 soybean oil Nutrition 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 239000011550 stock solution Substances 0.000 description 1
- 239000010907 stover Substances 0.000 description 1
- 239000010902 straw Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 239000002600 sunflower oil Substances 0.000 description 1
- 238000001356 surgical procedure Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- SJMYWORNLPSJQO-UHFFFAOYSA-N tert-butyl 2-methylprop-2-enoate Chemical compound CC(=C)C(=O)OC(C)(C)C SJMYWORNLPSJQO-UHFFFAOYSA-N 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000010257 thawing Methods 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 230000003614 tolerogenic effect Effects 0.000 description 1
- 108091006106 transcriptional activators Proteins 0.000 description 1
- 108091008023 transcriptional regulators Proteins 0.000 description 1
- 238000011222 transcriptome analysis Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 239000010875 treated wood Substances 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 229940035936 ubiquinone Drugs 0.000 description 1
- 241001148471 unidentified anaerobic bacterium Species 0.000 description 1
- 239000010876 untreated wood Substances 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 235000015112 vegetable and seed oil Nutrition 0.000 description 1
- 239000008158 vegetable oil Substances 0.000 description 1
- 235000013311 vegetables Nutrition 0.000 description 1
- OGWKCGZFUXNPDA-XQKSVPLYSA-N vincristine Chemical compound C([N@]1C[C@@H](C[C@]2(C(=O)OC)C=3C(=CC4=C([C@]56[C@H]([C@@]([C@H](OC(C)=O)[C@]7(CC)C=CCN([C@H]67)CC5)(O)C(=O)OC)N4C=O)C=3)OC)C[C@@](C1)(O)CC)CC1=C2NC2=CC=CC=C12 OGWKCGZFUXNPDA-XQKSVPLYSA-N 0.000 description 1
- 229960004528 vincristine Drugs 0.000 description 1
- OGWKCGZFUXNPDA-UHFFFAOYSA-N vincristine Natural products C1C(CC)(O)CC(CC2(C(=O)OC)C=3C(=CC4=C(C56C(C(C(OC(C)=O)C7(CC)C=CCN(C67)CC5)(O)C(=O)OC)N4C=O)C=3)OC)CN1CCC1=C2NC2=CC=CC=C12 OGWKCGZFUXNPDA-UHFFFAOYSA-N 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 235000012711 vitamin K3 Nutrition 0.000 description 1
- 239000011652 vitamin K3 Substances 0.000 description 1
- 229940041603 vitamin k 3 Drugs 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
- 239000012138 yeast extract Substances 0.000 description 1
- NWONKYPBYAMBJT-UHFFFAOYSA-L zinc sulfate Chemical compound [Zn+2].[O-]S([O-])(=O)=O NWONKYPBYAMBJT-UHFFFAOYSA-L 0.000 description 1
- 229910000368 zinc sulfate Inorganic materials 0.000 description 1
- 239000011686 zinc sulphate Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/40—Preparation of oxygen-containing organic compounds containing a carboxyl group including Peroxycarboxylic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/62—Carboxylic acid esters
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/24—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Enterobacteriaceae (F), e.g. Citrobacter, Serratia, Proteus, Providencia, Morganella, Yersinia
- C07K14/245—Escherichia (G)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N1/00—Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
- C12N1/38—Chemical stimulation of growth or activity by addition of chemical compounds which are not essential growth factors; Stimulation of growth by removal of a chemical compound
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/01—Preparation of mutants without inserting foreign genetic material therein; Screening processes therefor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- General Chemical & Material Sciences (AREA)
- Medicinal Chemistry (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Tropical Medicine & Parasitology (AREA)
- Virology (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
本发明涉及用于使用甲基丙烯酸酯耐受性微生物产生甲基丙烯酸酯的方法。
Description
引言
甲基丙烯酸甲酯(methyl methacrylate,MMA)是化学工业中的重要单体。MMA的主要用途是生产多种应用的塑料;然而,MMA也可以用于在骨科手术中使用的骨水泥(bonecement)。最重要的聚合应用是聚甲基丙烯酸甲酯(PMMA)的铸造、模塑或挤出(extrusion),以生产高光学透明度塑料。PMMA的全球消耗估计为每年约210万吨。
虽然最广泛使用的甲基丙烯酸酯是MMA,但是也可以使用甲基丙烯酸或MMA分别通过直接酯化或通过酯交换产生其他甲基丙烯酸酯。实例是2-甲基丙-2-烯酸丁酯(BMA,具有化学式C8H14O2的甲基丙烯酸酯)、甲基丙烯酸异丁酯(iBMA)、甲基丙烯酸乙酯(EMA)和甲基丙烯酸2-乙基己酯(2EHMA)。BMA由MMA与丁醇进行酯交换产生。所有这些酯和PMMA都具有许多的应用,即在制造纺织品、涂料和粘合剂、包装、润滑剂、汽车设备、LCD屏幕、医疗设备和家用物品中的应用。
MMA目前仅通过化学手段产生,并且目前用于产生MMA的方法包括丙酮氰醇(ACH)途径和从多种C2-C4前体开始的其他途径。用于产生MMA的最成功的方法之一是‘Alpha方法’,其中MMA通过丙酸甲酯这种酯与甲醛的无水反应获得。在Alpha方法中,丙酸甲酯通过乙烯的羰基化产生。这种乙烯原料来源于化石燃料。与MMA产生中常用的其他方法相比,Alpha方法提供了许多优势。这些优势包括危险化学物质使用减少、高得多的产物选择性以及降低的对原油来源的原料的依赖。然而,原料的定价与汽油的成本相关。因此,开发克服这些缺陷的用于产生MMA的替代方法将是期望的。
可以使用微生物经由发酵而不是通过化学合成产生高价值的化学品。重组DNA技术和这样的微生物的合成代谢工程已经允许朝着产生特定化学品来重建代谢途径。最近正在开发朝着丙烯酸酯的生物产生的若干可持续途径。这些方法通常集中于经由微生物发酵从可再生原料产生丙烯酸酯。例如,WO2016/185211描述了用于使用重组微生物产生甲基丙烯酸和/或其衍生物的方法。
然而,在一定浓度,丙烯酸酯诸如MMA和BMA在发酵期间的积累通常对生物催化剂是有毒性的,抑制细胞生长和/或导致细胞死亡。特别地,甲基丙烯酸高级烷基酯(诸如甲基丙烯酸丁酯(BMA))比甲基丙烯酸低级烷基酯(诸如MMA)有更多毒性。因此,这可能是使用微生物(特别是工业大规模)生物产生甲基丙烯酸烷基酯的限制因素。
已知,细菌能够适应环境刺激,特别地通过适应控制基因表达的调节网络适应环境刺激。存在若干种这样的网络,包括氧化应激响应和多药耐药系统。
鉴于甲基丙烯酸烷基酯对微生物的毒性作用,以及鉴于与目前用于产生MMA的化学方法相关的缺点,提供具有改进的对甲基丙烯酸酯的耐受性的微生物用于在产生甲基丙烯酸烷基酯的方法中使用将是有利的。
因此,本发明的目的是消除或减轻这些问题中的一个或更多个,提供用于产生甲基丙烯酸酯,特别地BMA和/或MMA的改进方法,并且提供具有改进的对甲基丙烯酸酯的耐受性的微生物。
概述
如实施例中进一步描述的,本发明人采用了多种策略来选择在培养物中暴露于甲基丙烯酸酯后具有获得的对甲基丙烯酸酯的耐受性的微生物。令人惊讶地,本发明人能够分离在通常对野生型细菌是有毒性的浓度的相分离的甲基丙烯酸酯存在下存活的细菌菌株。分离耐受性菌株并测序,以鉴定与野生型相比赋予耐受性的遗传突变。还进一步表征了耐受性菌株。还开发了敲入和敲除菌株,以进一步评估与观察到的耐受性相关的突变。本发明人发现,赋予耐受性的突变驻留于编码氧化应激响应和细菌多药耐药系统等的组分和调节物的某些基因中。本发明人还发现,某些突变的组合可以增强耐受性作用。因此,所鉴定的突变可以用于通过遗传操纵野生型细菌以引入所述突变来赋予细菌耐受性。
因此,本发明人令人惊讶地鉴定、分离和表征了具有比野生型细菌更大的对甲基丙烯酸酯的耐受性并因此可以用于产生这样的化合物的方法中的细菌菌株。因此,本发明人已经证明氧化应激响应和细菌多药耐药系统及其调节物中的突变在介导对甲基丙烯酸酯的耐受性中起着至关重要的作用。特别地,本发明人已经表明吖啶黄抗性调节物(Acriflavine resistance regulator)(acrR)的突变体赋予对甲基丙烯酸酯的耐受性,特别是在与选自marR、soxR或rob的全局调节物之一进行组合时赋予对甲基丙烯酸酯的耐受性。本发明人还鉴定了氧化应激响应和细菌多药耐药系统的组分和调节物,其有助于观察到的耐受性,因此使得对微生物进行遗传修饰以赋予增加的对甲基丙烯酸酯的耐受性的方法成为可能。因此,具有增加的对甲基丙烯酸酯的耐受性的遗传修饰的微生物也在本发明的范围内。本发明人还开发了用于使用具有增加的对甲基丙烯酸酯的耐受性的遗传修饰的微生物产生甲基丙烯酸酯的方法。
附图
在以下非限制性附图中进一步描述本发明。
图1. 20%(v/v)BMA的不存在[■]或存在[●]对大肠杆菌生长的影响。使大肠杆菌MG1655(A)、大肠杆菌MG1655 soxR(R20H)(B)、大肠杆菌MG1655 soxR(R20H)acrR(V29G)(C)、大肠杆菌MG1655 soxR(R20H)acrR(T32fs)(D)在250mL摇瓶中在MSX培养基中在37℃和250rpm摇动生长。示出了两次重复的平均值,并且误差线是标准偏差。
图2.在指数中期期间添加20%(v/v)BMA对大肠杆菌菌株生长的影响。示出了(A)大肠杆菌MG1655、(B)大肠杆菌MG1655 soxR(R20H)、(C)大肠杆菌MG1655 soxR(R20H)acrR(V29G)和(D)大肠杆菌MG1655soxR(R20H)acrR(T32fs)的生长。使所有菌株在250mL摇瓶中在MSX培养基中在37℃和250rpm生长。在达到约0.3的OD600nm后,添加BMA或H2O。在不同的时间间隔获取样品,以确定OD600nm。示出了三次重复的平均值,并且误差线是标准偏差。
图3.接种后立即添加的20%(v/v)BMA的不存在[●]或存在[▲]对单个缺失体生长的影响。使大肠杆菌BW25115ΔsoxR(A)和大肠杆菌BW25115ΔacrR(B)在30mL小瓶中在MSX培养基中在37℃和250rpm摇动生长。在0h、8h和24h时获取样品,以确定OD600nm。示出了两次重复的平均值,并且误差线是标准偏差。
图4.接种后立即添加的20%(v/v)BMA的不存在[●]或存在[▲]对单突变菌株生长的影响。使大肠杆菌MG1655 acrR(V29G)(A)和大肠杆菌MG1655 acrR(T32fs)(B)在30mL小瓶中在MSX培养基中在37℃和250rpm摇动生长。在0h、8h和24h时获取样品,以确定OD600nm。示出了两次重复的平均值,并且误差线是标准偏差。
图5.a)低浓度的BMA的影响。使用具有50mL培养基的250mL锥形瓶使大肠杆菌BW25113在具有0%(□)、0.01%(◇)、0.05%(Δ)和0.1%(x)v/v的BMA时在M9基本培养基中在37℃和200RPM生长。b)高浓度的BMA的影响。使用具有50mL培养基的250mL锥形瓶使大肠杆菌BW25113在具有0%(□)、0.5%(◇)、1.0%(Δ)、5.0%(○)、10.0%(x)和20.0%(+)v/v的BMA时在M9基本培养基中生长。c)大肠杆菌对BMA的固有耐受性的测试。使用具有50mL培养基的250mL锥形瓶使大肠杆菌BW 25113在具有0%(□)、0.1%(x)、0.5%(*)、传代培养后0.1%(◇)、传代培养后0.5%(Δ)v/v的BMA时在M9基本培养基中在37℃和200RPM生长(一式三份)。d)ADE-1。随着BMA浓度的顺序增加,连续分批培养中的适应性进化。大肠杆菌在具有10mL培养基的50mL管(在3个平行管中,管1(实线)、管2(点线)和管3(虚线))中在具有10g L-1葡萄糖和BMA的M9基本培养基中在37℃和200RPM生长。使用最佳生长培养物作为3个管中各自的起始培养物,在每次连续转移中以0.1%BMA(x)、0.5%BMA(□)、1%BMA(◇)、5%(Δ)、10%(○)和20%(+)增加BMA浓度。e)ADE-3。连续分批培养中的适应性进化,每个BMA浓度中有5个连续培养。大肠杆菌在具有10mL培养基的50mL管(在3个平行管中,管1(实线)、管2(点线)和管3(虚线))中在具有10g L-1葡萄糖和BMA的M9基本培养基中在37℃和200RPM生长。BMA浓度从0.1%v/v(x)增加至0.5%v/v(□)、1%v/v(◇)、5%v/v(Δ)、10%v/v(*)和20%v/v(○),使用每个单独的管作为随后转移的起始培养物。f)ADE-3。连续分批培养中的适应性进化,1个连续培养在0.1%v/v BMA,2个连续转移在10%v/v BMA,并且45个连续转移在20%v/v BMA。大肠杆菌在具有10mL培养基的50mL管(在3个平行管中,管1(实线)、管2(点线)和管3(虚线))中在具有10g L-1葡萄糖和BMA的M9基本培养基中在37℃和200RPM生长。BMA浓度从0.1%v/v(x)增加至10%v/v(□)和20%v/v(◇),使用最佳生长培养物作为3个管中各自的起始培养物。g)ADE-4,恒化器中BMA耐受大肠杆菌的适应性进化和选择。微型生物反应器(55mL工作体积)中的大肠杆菌的恒化器培养物在具有1g L-1葡萄糖的M9基本培养基中在37℃和~0.3L h-1的通气速率生长,其中BMA浓度(◆)从0逐渐增加至20%v/v,并且稀释速率(υ)从0变化至0.55h-1。细胞浓度(●)以细胞干重g L-1报告。h)使用具有50mL培养基的250mL锥形瓶使来自ADE-1和ADE-2的分离株在具有20%v/v BMA的M9基本培养基中在37℃和200RPM生长(一式三份)。分离株2(Δ)、3(◇)、5(*)、6(-)、7(○)和没有BMA的野生型(─点线)。i)使用具有50mL培养基的250mL锥形瓶使来自ADE-3的分离株在具有20%v/v BMA的M9基本培养基中在37℃和200RPM生长(一式三份)。分离株14(□)、15(◇)、16(Δ)、17(x)、18(○)和没有BMA的野生型(─点线)。j)使用具有50mL培养基的250mL锥形瓶使来自ADE-4的分离株以0.33h-1的稀释速率在具有20%v/v BMA的M9基本培养基中在37℃和200RPM生长(一式三份)。分离株8(□)、9(◇)、10(Δ)和没有BMA的野生型(─点线)。k)使用具有50mL培养基的250mL锥形瓶使来自ADE-4的分离株以0.46h-1的稀释速率在具有20%v/v BMA的M9基本培养基中在37℃和200RPM生长(一式三份)。分离株19(□)、20(◇)、21(Δ)和没有BMA的野生型(─点线)。
图6.使用具有50mL培养基的250mL锥形瓶使来自适应性进化研究的分离株以0.55h-1的稀释速率在具有20%v/v BMA的M9基本培养基中在37℃和200RPM生长(一式三份)。分离株22(◇)、23(○)和没有BMA的野生型(─点线)。
详述
现在将进一步描述本发明。在以下段落中,更详细地定义了本发明的不同方面。除非另有明确相反指示,否则如此定义的每个方面可以与任何其他一个方面或更多个方面组合。特别地,被指示为优选或有利的任何特征可以与被指示为优选或有利的任何其他一个特征或更多个特征组合。
除非另有指示,否则本发明的实践采用以下的常规技术:微生物学、组织培养、分子生物学、化学、生物化学和重组DNA技术,所述常规技术在本领域技术内。这样的技术在文献中被充分地说明,例如,Green和Sambrook:Molecular Cloning:A Laboratory Manual,第4版,2014.
用于产生甲基丙烯酸酯、赋予微生物对甲基丙烯酸酯的耐受性和使甲基丙烯酸酯耐受性微生物生长的方法
在第一方面,本发明涉及一种用于产生甲基丙烯酸酯的方法,该方法包括
a)在发酵培养基中提供与野生型微生物相比具有增加的对甲基丙烯酸C3-C12酯的耐受性的遗传修饰的微生物,和
b)使微生物在产生甲基丙烯酸C3-C12酯的条件下生长。
在第二方面,本发明涉及一种用于使微生物在甲基丙烯酸酯的存在下生长或维持的方法,所述方法包括在产生甲基丙烯酸C3-C12酯的条件下,在发酵培养基中提供与野生型微生物相比具有增加的对甲基丙烯酸C3-C12酯的耐受性的遗传修饰的微生物。
本发明还涉及一种用于赋予或增加微生物对甲基丙烯酸酯的耐受性的方法,所述方法包括将突变引入编码氧化应激响应和细菌多药耐药系统的蛋白组分和/或调节物的核酸中。
如本文使用的,术语“耐受性(tolerance)”或“抗性(resistance)”是指微生物在甲基丙烯酸酯存在下,特别地在甲基丙烯酸C3-C12酯存在下存活的能力。因此,在一种实施方案中,可以将耐受性微生物维持在对野生型微生物有毒性的浓度的甲基丙烯酸酯下。例如,与野生型微生物相比,在甲基丙烯酸丁酯的存在下,微生物可以存活且生长至少约2.5小时更长,优选地至少约5小时更长,更优选地至少约10小时更长,最优选地至少约20小时更长。微生物的存活和生长可以通过本领域已知的任何合适的方法来确定。优选地,微生物的存活和生长可以通过测量微生物的光密度来确定,更优选地,微生物的光密度在约600nm的波长处测量(OD600)。例如,与野生型微生物相比,在甲基丙烯酸丁酯的存在下,本发明的微生物可以具有至少0.5的OD600持续2.5小时更长,优选地持续至少约5小时更长,更优选地持续至少约10小时更长,最优选地持续至少约20小时更长。在另一个实例中,OD从0.01-0.1增加到0.15至3.0指示生物体正在生长。合适地,OD600在合适的发酵培养基中测量。本领域技术人员会理解,微生物的光密度可以在不同的波长处测量,这取决于使用的发酵培养基。
在另一种实施方案中,耐受性微生物能够在引起野生型微生物死亡或生长停滞的浓度的甲基丙烯酸酯下生长。
术语“耐受性(tolerance)”或“抗性(resistance)”和“耐受性(tolerant)”或“抗性(resistant)”在本文可互换使用。与不包含一个或更多个遗传修饰的野生型微生物相比,如本文描述的遗传修饰的生物体的特征在于增加的对甲基丙烯酸酯,特别是甲基丙烯酸C3-C12酯的“耐受性”或“抗性”。
在本发明的方面的一种实施方案中,微生物在液体培养基中在约37℃生长时,对至少10%v/v至30%v/v,例如10%v/v、11%v/v、12%v/v、13%v/v、14%v/v、15%v/v、16%v/v、17%v/v、18%v/v、19%v/v、20%v/v的甲基丙烯酸C3-C12酯耐受。因此,根据多种实施方案,微生物可以维持在在至少10%v/v至30%v/v,例如10%v/v、11%v/v、12%v/v、13%v/v、14%v/v、15%v/v、16%v/v、17%v/v、18%v/v、19%v/v、20%v/v的浓度的甲基丙烯酸C3-C12酯的存在下。
术语“增加”、“改进”或“增强”或“增加的”、“改进的”、“增强的”或“增加的”在本文可互换使用。
如本文使用的术语微生物是指一个或更多个原核细胞或真核细胞。在一种实施方案中,微生物是原核细胞。在一种实施方案中,微生物是细菌。
在一种实施方案中,修饰的微生物在调节氧化应激响应和细菌多药耐药系统或形成氧化应激响应和细菌多药耐药系统的一部分的一种或更多种蛋白(和编码这样的蛋白的核酸)中具有突变,所述一种或更多种蛋白包括例如外排泵的成员或调节物,例如在大肠杆菌中鉴定的AcrAB TolC外排泵复合物和AraC家族成员。氧化应激响应和细菌多药耐药系统的组分和调节物已在大肠杆菌中得到表征,并且在其他生物体中,包括在许多革兰氏阴性物种中发现了同源复合物。
在一种实施方案中,微生物是革兰氏阴性细菌。在一种实施方案中,革兰氏阴性细菌属于肠杆菌科(Enterobacteriaceae)。
在一种实施方案中,在本发明范围内的合适细菌的实例包括属于变形菌门(proteobacteria)的以下属的肠杆菌:埃希氏菌属(Escherichia)、肠杆菌属(Enterobacter)、泛菌属(Pantoea)、克雷伯氏菌属(Klebsiella)、沙雷氏菌属(Serratia)、欧文氏菌属(Erwinia)、沙门氏菌属(Salmonella)、摩根氏菌属(Morganella)等,属于以下属的所谓棒状杆菌:短杆菌属(Brevibacterium)、棒状杆菌属(Corynebacterium)或微杆菌属(Microbacterium),以及属于以下属的细菌:脂环酸芽孢杆菌属(Alicyclobacillus)、芽孢杆菌属(Bacillus)、氢杆菌属(Hydrogenobacter)、甲烷球菌属(Methanococcus)、醋杆菌属(Acetobacter)、不动杆菌属(Acinetobacter)、土壤杆菌属(Agrobacterium)、固氮根瘤菌属(Axorhizobium)、固氮菌属(Azotobacter)、无形体属(Anaplasma)、拟杆菌属(Bacteroides)、巴尔通体属(Bartonella)、鲍特菌属(Bordetella)、疏螺旋体属(Borrelia)、布鲁氏菌属(Brucella)、伯克霍尔德菌属(Burkholderia)、荚膜杆菌属(Calymmatobacterium)、弯曲菌属(Campylobacter)、衣原体属(Chlamydia)、嗜衣原体属(Chlamydophila)、梭菌属(Clostridium)、柯克斯体属(Coxiella)、贪铜菌属(Cupriavidus)、埃立克体属(Ehrlichia)、肠球菌属(Enterococcus)、弗朗西斯氏菌属(Francisella)、梭杆菌属(Fusobacterium)、加德纳菌属(Gardnerella)、嗜血杆菌属(Haemophilus)、螺杆菌属(Helicobacter)、克雷伯氏菌属、甲烷杆菌属(Methanobacterium)、微球菌属(Micrococcus)、莫拉菌属(Moraxella)、分支杆菌属(Mycobacterium)、支原体属(Mycoplasma)、奈瑟菌属(Neisseria)、巴斯德菌属(Pasteurella)、消化链球菌属(Peptostreptococcus)、卟啉单胞菌属(Porphyromonas)、普氏菌属(Prevotella)、假单胞菌属(Pseudomonas)、根瘤菌属(Rhizobium)、立克次氏体属(Rickettsia)、罗沙利马体属(Rochalimaea)、罗氏菌属(Rothia)、志贺菌属(Shigella)、葡萄球菌属(Staphylococcus)、寡养单胞菌属(Stenotrophomonas)、链球菌属(Streptococcus)、密螺旋体属(Treponema)、弧菌属(Vibrio)、沃尔巴克氏体属(Wolbachia)、耶尔森菌属(Yersinia)等。
示例性细菌包括选自以下的物种:大肠杆菌(Escherichia coli)、产酸克雷伯氏菌(Klebsiella oxytoca)、产琥珀酸厌氧螺菌(Anaerobiospirillumsucciniciproducens)、产琥珀酸放线杆菌(Actinobacillus succinogenes)、产琥珀酸曼氏杆菌(Mannheimia succiniciproducens)、菜豆根瘤菌(Rhizobium etli)、枯草芽孢杆菌(Bacillus subtilis)、谷氨酸棒状杆菌(Corynebacterium glutamicum)、氧化葡糖杆菌(Gluconobacter oxydans)、运动发酵单胞菌(Zymomonas mobilis)、乳酸乳球菌(Lactococcus lactis)、植物乳杆菌(Lactobacillus plantarum)、天蓝色链霉菌(Streptomyces coelicolor)、丙酮丁醇梭菌(Clostridium acetobutylicum)、荧光假单胞菌(Pseudomonas fluorescens)、嗜热氢杆菌(Hydrogenobacter thermophilus)、詹氏甲烷球菌(Methanococcus jannaschii)和恶臭假单胞菌(Pseudomonas putida)。
优选地,该细菌属于以下属:埃希氏菌属、棒状杆菌属或假单胞菌属。优选地,细菌是大肠杆菌、谷氨酸棒状杆菌、荧光假单胞菌或恶臭假单胞菌。
示例性酵母或真菌包括属于以下的属的那些:酵母属(Saccharomyces)、裂殖酵母属(Schizosaccharomyces)、假丝酵母属(Candida)、克鲁维酵母属(Kluyveromyces)、曲霉属(Aspergillus)、毕赤酵母属(Pichia)、隐球菌属(Crytpococcus)等。示例性酵母或真菌物种包括选自以下的那些:酿酒酵母(Saccharomyces cerevisiae)、粟酒裂殖酵母(Schizosaccharomyces pombe)、乳酸克鲁维酵母(Kluyveromyces lactis)、马克斯克鲁维酵母(Kluyveromyces marxianus)、土曲霉(Aspergillus terreus)、黑曲霉(Aspergillusniger)、巴斯德毕赤酵母(Pichia pastoris)等。
在一种实施方案中,合适的微生物选自埃希氏菌属、欧文氏菌属、普鲁威登菌菌属(Providencia)和沙雷氏菌属。在埃希氏菌属中,可以使用大肠杆菌物种。示例性菌株包括大肠杆菌B、大肠杆菌C、大肠杆菌W等。在一种实施方案中,微生物是大肠杆菌,例如商购可得的和/或完全表征的大肠杆菌菌株,诸如K-12MG1655、BW25113或W3110。
虽然本文中与细菌细胞相关的方面和实施方案通常是指根据其在大肠杆菌中的命名的基因或蛋白,但对于另一个科或物种的细菌细胞,鉴定其他科或物种中的对应基因或蛋白,例如同源物、种间同源物和/或种内同源物,是在本领域技术水平内的,通常通过鉴定与大肠杆菌序列具有中等(通常≥30%)或高(通常≥50%)同一性的序列,优选地考虑由基因组中的基因和/或基因的基因座表达的蛋白的功能。下文的表1a和1b列出了由每个特定基因编码的蛋白的功能、其在大肠杆菌BW25113基因组中的基因座和编码序列的SEQ ID号。本文提供了野生型核酸及编码的蛋白的序列。
因此,本领域技术人员会理解,本发明的多个方面扩展至其他微生物中的突变体蛋白和核酸序列,优选地细菌例如大肠杆菌菌株中的突变体蛋白和核酸序列。因此,还提及是大肠杆菌蛋白和核酸的同源物的氧化应激响应和细菌多药耐药系统的调节物和组分及其调节物。如本文使用的,核酸序列或氨基酸序列的同源物与野生型核酸或氨基酸序列具有至少25%、26%、27%、28%、29%、30%、31%、32%、33%、34%、35%、36%、37%、38%、39%、40%、41%、42%、43%、44%、45%、46%、47%、48%、49%、50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、61%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%的总体序列同一性。
“同一性”是测量序列的相似性或相关性的序列特性。如本公开内容中使用的术语“序列同一性”或“同一性”意指本公开内容的多肽序列与所讨论的序列进行(同源)比对后成对相同残基相对于这两个序列中较长序列中的残基数量的百分比。通过将相同氨基酸残基的数量除以残基总数量,并且将乘积乘以100来测量序列同一性。
术语“同源性”在本文中以其通常的含义使用,并且包括在本公开内容的多肽的线性氨基酸序列中的等同位置处的相同氨基酸以及被认为是保守取代(例如,谷氨酸残基被天冬氨酸残基替代)的氨基酸。
合适的同源物或种间同源物可以使用数据库诸如本领域技术人员已知的NCBI和Paint ensemble和比对程序通过序列比较和保守结构域的鉴定来进行鉴定。例如,序列同源性或序列同一性的百分比可以在本文使用BLASTP程序来确定。
可以使用合适的计算机程序诸如代表基本局部比对搜索工具(Basic LocalAlignment Search Tool)的BLAST2.0或ClustalW或适合生成序列比对的任何其他合适程序进行比对。因此,野生型核酸或氨基酸序列可以用作“主题序列”或“参考序列”,而突变体核酸的氨基酸序列或者与本文描述的野生型不同的氨基酸序列用作“查询序列”。术语“参考序列”和“野生型序列”在本文可互换使用。
术语“遗传修饰”或“遗传工程”广泛地指对微生物的基因组或核酸的操纵。同样,术语“遗传工程化”或“遗传修饰的”是指包含与野生型微生物的基因组或核酸不同的基因组或核酸的微生物。例如,可以使用遗传修饰的方法操纵基因组或核酸,所述遗传修饰的方法诸如异源基因表达、基因或启动子插入或缺失、核酸突变、改变的基因表达或失活、酶工程、随机诱变方法、基因改组或密码子优化。在一种实施方案中,“遗传工程化”或“遗传修饰的”微生物是指包含与野生型微生物不同的基因组或核酸的分离的菌株,例如分离的细菌菌株。
为了本发明的目的,“突变体”或“遗传修饰的”微生物是与天然存在的野生型(WT)微生物相比已经被改变的微生物。
如本文使用的,“突变的”是指与野生型微生物相比,在本发明的多方面的微生物中被修饰的核酸或蛋白。核酸序列或氨基酸序列中的突变可以是一个或更多个残基的缺失、插入或取代。核酸序列中的突变可以导致错义突变,导致蛋白中单个氨基酸的取代。可选地,核酸序列中的突变可以引入提前终止密码子(premature stop codon),产生截短的蛋白或后续氨基酸序列中的改变。敲除突变是消除蛋白功能的突变。
“突变体”或“遗传修饰的”微生物从其自然环境中分离。在一种实施方案中,“突变体”或“遗传修饰的”微生物不存在于自然界中。
如本文使用的,词语“核酸”、“核酸序列”、“核苷酸”、“核酸分子”或“多核苷酸”意在包括DNA分子(例如,cDNA或基因组DNA)、RNA分子(例如,mRNA)、天然存在的、突变的、合成的DNA或RNA分子,以及使用核苷酸类似物产生的DNA或RNA的类似物。它可以是单链或双链的。这样的核酸或多核苷酸包括但不限于结构基因的编码序列、反义序列和不编码mRNA或蛋白产物的非编码调节序列。这些术语也包括基因。术语“基因”或“基因序列”广泛用于指与生物功能相关的DNA核酸。因此,基因可以包括如基因组序列中的内含子和外显子,或者可以仅包括如cDNA中的编码序列,和/或可以包括与调节序列组合的cDNA。
术语“肽”、“多肽”和“蛋白”在本文可互换使用,并且是指通过肽键连接在一起的任何长度的聚合形式的氨基酸。
为了本发明的目的,“转基因的”、“转基因”或“重组体”意指,关于例如核酸序列、包含核酸序列的表达盒、基因构建体或载体,或用本文描述的核酸序列、表达盒或载体转化的微生物,编码可用于本发明的方法的蛋白的核酸序列不位于其天然遗传环境中或者已通过重组方法修饰。天然遗传环境被理解为意指原始微生物中的天然基因组的或染色体的基因座。
术语“异源”核酸序列或基因构建体指通常不可能天然存在于生物体中的基因构建体的核酸序列。
根据本发明的多个方面,突变体或修饰的基因中的突变可以驻留在编码氧化应激响应和细菌多药耐药系统的蛋白组分或调节物及其调节物的基因中,例如编码AcrAB TolC外排泵的组分或调节物或AraC蛋白家族的成员的基因中。例如,根据本发明的多个方面的突变体或修饰的基因中的突变可以驻留在编码转录调节物的AraC家族蛋白的基因中。特别地,突变可以驻留在产生突变体SoxR蛋白的soxR核酸序列、产生突变体AcrR蛋白的acrR核酸序列、产生突变体Rob蛋白的rob核酸序列和/或产生突变体MarR蛋白的marR核酸序列或其组合中。特别地,本发明人已经表明吖啶黄抗性调节物(acrR)的突变体特别地在与选自marR、soxR或rob的全局调节物之一组合时赋予对甲基丙烯酸酯的耐受性。
氧化还原敏感转录激活物(soxR)
在一种实施方案中,遗传修饰的微生物在soxR核酸序列中包含突变。因此,与野生型soxR核酸序列相比,遗传修饰的微生物包含突变体soxR核酸序列。突变体soxR核酸序列编码突变体SoxR蛋白。
soxR编码MerR家族的调节物。在氧化应激信号不存在下,SoxR同二聚体与soxS的启动子区域结合,并且防止增强的转录。当SoxR簇被氧化时,SoxR成为soxS转录的激活物。
大肠杆菌soxR核酸序列示于SEQ ID NO.1中。这编码如SEQ ID NO.2中示出的蛋白。17-kDa SoxR蛋白由154个氨基酸残基组成,并且形成同二聚体,其包含[2Fe-2S]簇。它具有DNA结合结构域(残基1-80)、二聚化螺旋(残基81-118)和Fe-S簇结合结构域(残基119-154)。SoxR激活通过其Fe-S簇的氧化来介导,导致增强的soxS转录。
在一种实施方案中,野生型soxR核酸序列包括SEQ ID NO.1或其同源物。
soxR核酸序列或SoxR氨基酸序列的同源物与分别由SEQ ID NO.1和SEQ ID NO.2代表的野生型核酸或氨基酸序列具有至少25%、26%、27%、28%、29%、30%、31%、32%、33%、34%、35%、36%、37%、38%、39%、40%、41%、42%、43%、44%、45%、46%、47%、48%、49%、50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、61%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%的总体序列同一性。优选地,总体序列同一性是85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%。使用本领域已知的全局比对算法,诸如GAP程序(GCG Wisconsin Package,Accelrys)中的Needleman Wunsch算法确定总体序列同一性。
在一种实施方案中,SoxR中的突变促进氧化和/或DNA结合,从而增加SoxR调节子(包括SoxS)的转录。
在一种实施方案中,微生物包含编码突变体SoxR蛋白的突变体soxR核酸序列,该突变体SoxR蛋白在DNA结合结构域(残基1-80)或在FE-S簇结构域(残基119-154)中具有突变。
在一种实施方案中,SoxR蛋白中的突变选自以下之一:参考SEQ ID NO.2,R20被另一种氨基酸的取代、R20被另一种氨基酸的取代、残基146的缺失(核酸残基435、436、437的缺失)或残基139处的截短。在一种实施方案中,R20被H取代。在一种实施方案中,R20被L取代。在除了大肠杆菌之外的微生物中的SoxR同源物中的等同位置处的修饰也在本发明的范围内。
在一种实施方案中,突变不是敲除突变。在一种实施方案中,突变是功能突变的获得。改变的蛋白能够赋予对甲基丙烯酸酯的耐受性。
吖啶黄抗性调节物(acrR)
在一种实施方案中,遗传修饰的微生物在acrR核酸序列中包含突变。因此,与野生型acrR序列相比,遗传修饰的微生物包含突变体acrR核酸序列。突变体acrR核酸序列编码突变体AcrR蛋白。
AcrR调节与AcrAB-TolC多药外排泵相关的acrRAB基因的表达。AcrR同二聚体通过与acrAB操纵子区域结合充当acrAB操纵子的阻遏物,并且当激活物结合到其C末端配体结合结构域时,AcrR同二聚体被释放,允许acrAB的转录。acrAB操纵子编码AcrA和AcrB,其与TolC合作形成主要的多药外排泵复合物AcrAB-TolC。
AcrR是二聚双结构域分子,具有完全螺旋结构,类似于转录调节物TetR家族的成员。在大肠杆菌中,acrR基因编码215个氨基酸的AcrR蛋白,该蛋白包含N末端DNA结合结构域(残基7-51)和C末端配体结合结构域(残基55-204)。
在一种实施方案中,野生型acrR核酸序列包含SEQ ID NO.3或其编码SEQ ID NO.4的同源物。
acrR核酸序列或AcrR氨基酸序列的同源物与分别由SEQ ID NO.3和SEQ ID NO.4代表的野生型核酸或氨基酸序列具有至少25%、26%、27%、28%、29%、30%、31%、32%、33%、34%、35%、36%、37%、38%、39%、40%、41%、42%、43%、44%、45%、46%、47%、48%、49%、50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、61%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%的总体序列同一性。优选地,总体序列同一性是85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%。总体序列同一性使用本领域已知的全局比对算法,诸如GAP程序(GCG Wisconsin Package,Accelrys)中的Needleman Wunsch算法来确定。
在一种实施方案中,微生物包含编码突变体AcrR蛋白的突变体acrR核酸序列,该突变体AcrR蛋白在DNA结合结构域(残基7-51)或在配体结合结构域(残基55-204)中具有突变。
在一种实施方案中,AcrR中的突变选自以下之一:参考SEQ ID NO.4,V29被另一种氨基酸的取代、Y49处的移码突变、A191处的移码突变或T32处的移码突变。在一种实施方案中,V29被G取代。在除了大肠杆菌之外的微生物中的AcrR同源物中的等同位置处的修饰也在本发明的范围内。
在一种实施方案中,突变不是敲除突变。在一种实施方案中,突变是功能突变的获得,即产生与野生型相比功能改变的功能蛋白。改变的蛋白能够赋予对甲基丙烯酸酯的耐受性。
右起点结合(rob)
在一种实施方案中,遗传修饰的微生物在rob核酸序列中包含突变。因此,与野生型rob序列相比,遗传修饰的微生物包含突变体rob核酸序列。突变体rob核酸序列编码突变体Rob蛋白。
大肠杆菌中的Rob蛋白由289个氨基酸组成,具有N末端DNA结合结构域(残基1-120)和C末端结构域(残基121-189)。C末端结构域被认为是防止其被Lon蛋白酶降解及其活化-失活机制所必需的。
在一种实施方案中,野生型rob核酸序列包含SEQ ID NO.5或其同源物,所述同源物编码具有SEQ ID No.6的蛋白。
rob核酸序列或Rob氨基酸序列的同源物与分别由SEQ ID NO.5和SEQ ID NO.6代表的野生型核酸或氨基酸序列具有至少25%、26%、27%、28%、29%、30%、31%、32%、33%、34%、35%、36%、37%、38%、39%、40%、41%、42%、43%、44%、45%、46%、47%、48%、49%、50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、61%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%的总体序列同一性。优选地,总体序列同一性是85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%。总体序列同一性使用本领域已知的全局比对算法,诸如GAP程序(GCG Wisconsin Package,Accelrys)中的Needleman Wunsch算法来确定。
在BMA耐受性菌株中发现的突变引起氨基酸残基70和156处的氨基酸取代,这些氨基酸残基远离Rob的DNA结合位点,但可能影响参与自我隔离(self-sequestration)的蛋白-蛋白相互作用或激活物的特异性。这样的突变增加游离/活性Rob的量,或增加Rob对BMA的亲和力以允许激活,或增加由BMA的激活。然后增加的Rob活性导致在其调节控制下增加的多药耐药基因的表达,这可能有助于获得对BMA的耐受性。
在一种实施方案中,微生物包含编码突变体Rob蛋白的突变体rob核酸序列,该突变体Rob蛋白在N-末端DNA结合结构域(残基1-120)和C-末端结构域(残基121-189)中具有突变。
在一种实施方案中,Rob中的突变选自以下之一:参考SEQ ID NO.6,在Rob中A70被另一种氨基酸的取代、或R156被另一种氨基酸的取代。在一种实施方案中,A70被V或T取代。在一种实施方案中,R156被H取代。在除了大肠杆菌之外的微生物中的Rob同源物中的等同位置处的修饰也在本发明的范围内。
多重抗生素抗性蛋白(marR)
在一种实施方案中,遗传修饰的微生物在marR核酸序列中包含突变。因此,与野生型marR序列相比,遗传修饰的微生物包含突变体marR核酸序列。突变体marR核酸序列编码突变体MarR蛋白。
在大肠杆菌中,marR基因的蛋白产物MarR由144个氨基酸残基组成,并且包含N-末端结构域、C-末端结构域、DNA结合位点(残基55-100)和水杨酸结合位点(残基70-86)。MarR作为同二聚体,通过与marAB操纵子的启动子区域结合而充当marAB操纵子的阻遏物。MarR的DNA结合/阻遏物活性被小分子诸如水杨酸、白花丹素(plumbagin)、2,4-二硝基酚和甲萘醌的存在和结合所抑制,这允许marA、marB和marR的转录。
在一种实施方案中,野生型marR核酸序列包含SEQ ID NO.7或其编码具有SEQ IDNo.8的蛋白的同源物。
marR核酸序列或MarR氨基酸序列的同源物与分别由SEQ ID NO.7和SEQ ID NO.8代表的野生型核酸或氨基酸序列具有至少25%、26%、27%、28%、29%、30%、31%、32%、33%、34%、35%、36%、37%、38%、39%、40%、41%、42%、43%、44%、45%、46%、47%、48%、49%、50%、51%、52%、53%、54%、55%、56%、57%、58%、59%、60%、61%、62%、63%、64%、65%、66%、67%、68%、69%、70%、71%、72%、73%、74%、75%、76%、77%、78%、79%、80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%的总体序列同一性。优选地,总体序列同一性是85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%。总体序列同一性使用本领域已知的全局比对算法,诸如GAP程序(GCG Wisconsin Package,Accelrys)中的Needleman Wunsch算法来确定。
在一种实施方案中,微生物包含编码突变体MarR蛋白的突变体marR核酸序列,该突变体MarR蛋白在DNA结合和水杨酸结合结构域(残基55-100)中具有突变。例如,突变可能导致DNA结合亲和力降低和/或对作为激活物的BMA的亲和力增强。
在一种实施方案中,MarR中的突变是参考SEQ ID NO.8,V84被另一种氨基酸的取代。在一种实施方案中,V84被G取代。在除了大肠杆菌之外的微生物中的MarR同源物中的等同位置处的修饰也在本发明的范围内。
在一种实施方案中,微生物具有在编码突变体AcrR蛋白(例如在DNA结合结构域(残基7-51)或在配体结合结构域(残基55-204)中具有突变的蛋白)的acrR核酸序列中的突变,与soxR、rob或marR中任一种的核酸序列中的突变的组合。例如,微生物可以在acrR和soxR;acrR和rob或acrR和marR核酸序列中包含突变。另外的突变,即在其他基因中的突变,可能存在或可能不存在。
在一种实施方案中,微生物可以包含在编码突变体AcrR蛋白的acrR核酸序列中的突变与soxR中的突变的组合,所述突变体AcrR蛋白在DNA结合结构域(残基7-51)或配体结合结构域(残基55-204)中具有突变,所述acrR核酸序列中的突变包括以下之一:参考SEQID NO:4,V29被另一种氨基酸的取代、Y49处的移码突变、A191处的移码突变或T32处的移码突变,所述soxR中的突变选自以下任一种:参考SEQ ID NO.2,R20被另一种氨基酸的取代、R20被另一种氨基酸的取代、残基146的缺失(核酸残基435、436、437的缺失)或残基139处的截短。在一种实施方案中,R20被H取代。在一种实施方案中,R20被L取代。例如,微生物可以包含含有以下突变组合之一的核酸序列:
·acrR(V29)和SoxR(R20L)
·acrR(Y49fs)和SoxR(R20L)
·acrR(A191fs)和SoxR(R20L)
·acrR(T32fs)和SoxR(R20L)
·acrR(V29)和SoxR(R20H)
·acrR(Y49fs)和SoxR(R20H)
·acrR(A191fs)和SoxR(R20H)
·acrR(T32fs)和SoxR(R20H)
·acrR(V29)和SoxR(146del)
·acrR(Y49fs)和SoxR(146del)
·acrR(A191fs)和SoxR(146del)
·acrR(T32fs)和SoxR(146del)
·acrR(V29)和SoxR(Leu139X)
·acrR(Y49fs)和SoxR(Leu139X)
·acrR(A191fs)和SoxR(Leu139X)
·acrR(T32fs)和SoxR(Leu139X)
在一种实施方案中,微生物可以包含在编码突变体AcrR蛋白的acrR核酸序列中的突变与marR中的突变的组合,所述突变体AcrR蛋白在DNA结合结构域(残基7-51)或配体结合结构域(残基55-204)中具有突变,所述acrR核酸序列中的突变包括以下之一:参考SEQID NO:4,V29被另一种氨基酸的取代、Y49处的移码突变、A191处的移码突变或T32处的移码突变,所述marR中的突变选自:参考SEQ ID NO.8,V84被另一种氨基酸的取代。在一种实施方案中,V84被G取代。例如,微生物可以包含含有以下突变组合之一的核酸序列:
·acrR(V29)和marR(V84G)
·acrR(Y49fs)和marR(V84G)
·acrR(A191fs)和marR(V84G)
·acrR(T32fs)和marR(V84G)
在一种实施方案中,微生物可以包含在编码突变体AcrR蛋白的acrR核酸序列中的突变与rob中的突变的组合,所述突变体AcrR蛋白在DNA结合结构域(残基7-51)或配体结合结构域(残基55-204)中具有突变,所述acrR核酸序列中的突变包括以下之一:参考SEQ IDNO:4,V29被另一种氨基酸的取代、Y49处的移码突变、A191处的移码突变或T32处的移码突变,所述rob中的突变选自:参考SEQ ID NO.6,Rob中A70被另一种氨基酸的取代,或R156被另一种氨基酸的取代。在一种实施方案中,A70被V或T取代。在一种实施方案中,R156被H取代。例如,微生物可以包含含有以下突变组合之一的核酸序列:
·acrR(V29)和rob(156H)
·acrR(Y49fs)和rob(156H)
·acrR(A191fs)和rob(156H)
·acrR(T32fs)和rob(156H)
·acrR(V29)和rob(A70T)
·acrR(Y49fs)和rob(A70T)
·acrR(A191fs)和rob(A70T)
·acrR(T32fs)和rob(A70T)
·acrR(V29)和rob(A70V)
·acrR(Y49fs)和rob(A70V)
·acrR(A191fs)和rob(A70V)
·acrR(T32fs)和rob(A70V)
对于以上实施方案,参考是针对某些突变作出的。如本文解释的,这些是指提及的基因/蛋白的大肠杆菌蛋白序列中的位置(蛋白序列分别参见SEQ ID No 2、4、6和8)。
在一种实施方案中,微生物可以包含以上描述的两种或更多种突变体核酸序列的组合。例如,微生物可以在选自soxR、rob、acrR和marR核酸序列的至少两种核酸序列或其同源物中具有突变。因此,微生物可以在soxR和rob核酸序列、soxR和acrR核酸序列、soxR和marR核酸序列、rob和acrR核酸序列、rob和marR核酸序列或者acrR和marR核酸序列或其同源物中包含突变。
在另一种实施方案中,微生物可以在选自soxR、rob、acrR和marR核酸序列的至少两种核酸序列或其同源物中包含突变。因此,微生物可以在soxR、acrR和rob核酸序列或其同源物中包含突变。因此,微生物可以在soxR、acrR和marR核酸序列或其同源物中包含突变。因此,微生物可以在rob、acrR和marR核酸序列或其同源物中包含突变。
在另一种实施方案中,微生物可以在选自soxR、rob、acrR和marR核酸序列的四种核酸序列或其同源物中包含突变。
在一种实施方案中,微生物可以包含选自表1a中示出的那些的一种或更多种遗传突变。
表1a赋予增加的对甲基丙烯酸酯的耐受性的突变
在一种实施方案中,微生物还包含例如在编码氧化应激响应或多药耐药系统的蛋白组分的基因中的一个或更多个另外的突变。在一种实施方案中,突变处于rpo核酸序列中,例如rpoB(SEQ ID NO 25)或rpoC(SEQ ID NO.27)、ompR(SEQ ID NO 29)、acrB(SEQ IDNO 9)、yohJ(SEQ ID NO 11)、torY(SEQ ID NO 13)、ipxM(SEQ ID NO 15)、dnaK(SEQ ID NO17)、grol(SEQ ID NO 19)、ilvN(SEQ ID NO 21)、phop(SEQ ID NO 31)、ygbK(SEQ ID NO23)或其同源物。在另一种实施方案中,不存在对氧化应激响应或多药耐药系统的蛋白组分或调节物的另外的修饰。
在一种实施方案中,一个或更多个另外的突变选自表1b中列出的遗传突变。
表1b
在一种实施方案中,微生物选自具有如以下列出的一个或更多个遗传突变的微生物:
-rob(R156H)rpoC(L361R)ilvN(C41Y)ygbK(A294E)IpxM(168_185del);
-rob(R156H)i/vN(C41Y)phoP(L11F)acrB(V448L);
-soxR(Leu139X)580116(G>T);
-ssoxR(A146 del));
-rob(A70V);
-rob(R156H)rpoB(T1037P)torY(A87T)acrR(49Yfs);
-rob(A70T)yohJ(L109R)dnaK(V377G)927777(C>T)acrR(A191fs);
-marR(V84G)rpoC(R1075C)ompR(R15S)acrB(T379I);
-marr(V84G)rpoC(R1075C)ompR(R15S);
-marr(V84G)rpoC(R1075C)rpoC(A787V)ompR(R15S)acrB(V901I);
-rob(R156H)rpoB(T1037P)groL(P279L)acrR(49Yfs)1197659(C>A)或
-soxR(R20L)rpoC(r1075C)2133236(T>A)3915915(T>G)。
如本文解释的,可以在野生型生物体中引入上文示出的突变,以赋予对甲基丙烯酸酯的耐受性。本文描述的核酸可以通过一个或更多个核苷酸的插入、取代或缺失来突变。
用于靶基因操纵(包括失活或敲除)的技术是本领域熟知的。这些技术包括使用靶向感兴趣基因并且允许整合允许转基因在特定位点处的整合的载体的基因靶向。靶向构建体被工程化为与靶基因重组,这通过将来自基因本身的序列掺入到构建体中来实现。然后重组发生在基因中该序列的区域中,产生外来序列的插入以破坏基因。随着其序列的中断,改变的基因被翻译成非功能蛋白,如果它被翻译的话。其他技术包括如下文描述的基因组编辑(靶向基因组工程)。也可以通过将微生物暴露于诱变剂来引入突变。诱变剂可以是快中子辐射或化学诱变剂,例如选自以下非限制性清单的化学诱变剂:甲磺酸乙酯(EMS)、甲磺酸甲酯(MMS)、N-乙基-N-亚硝基脲(ENU)、三乙基密胺(triethylmelamine)(1′EΜ)、N-甲基-N-亚硝基脲(MNU)、丙卡巴肼、苯丁酸氮芥、环磷酰胺、硫酸二乙酯、丙烯酰胺单体、美法仑、氮芥(nitrogen mustard)、长春新碱、二甲基亚硝胺、N-甲基-N'-硝基-亚硝基胍(MNNG)、亚硝基胍、2-氨基嘌呤、7,12二甲基-苯并蒽(DMBA)、环氧乙烷、六甲基磷酰胺、白消安(bisulfan)、二环氧烷烃(diepoxyalkanes)(二环氧辛烷(DEO)、二环氧丁烷(BEB)等)、2-甲氧基-6-氯-9[3-(乙基-2-氯乙基)氨基丙基氨基]吖啶二盐酸盐(ICR-170)或甲醛。
最近,基因组编辑技术已经作为常规诱变方法(诸如物理和化学诱变)或使用转基因在植物中的表达来产生具有改进的表型(在农业中是重要的)的突变体植物的方法的替代方法出现。这些技术采用序列特异性核酸酶(SSN),包括锌指核酸酶(ZFN)、转录激活物样效应物核酸酶(TALEN)和RNA指导的核酸酶Cas9(CRISPR/Cas9),所述序列特异性核酸酶(SSN)产生靶向的DNA双链断裂(DSB),然后靶向的DNA双链断裂(DSB)主要通过易错非同源末端连接(NHEJ)或高保真同源重组(HR)进行修复。靶向基因组修饰或靶向基因组编辑是使用靶向的DNA双链断裂(DSB)通过同源重组(HR)介导的重组事件刺激基因组编辑的基因组工程技术。为了经由位点特异性DNA DSB的引入实现有效的基因组编辑,可以使用四大类可定制的DNA结合蛋白:来源于微生物可移动遗传元件的兆核酸酶(meganuclease)、基于真核转录因子的ZF核酸酶、来自黄单胞菌属(Xanthomonas)细菌的转录激活物样效应物(TALE),和来自II型细菌适应性免疫系统CRISPR(成簇规律间隔短回文重复序列)的RNA指导的DNA内切核酸酶Cas9。兆核酸酶、ZF和TALE蛋白都通过蛋白-DNA相互作用识别特异性DNA序列。尽管兆核酸酶使其核酸酶结构域和DNA结合结构域成为一体,但ZF和TALE蛋白由分别靶向3个或1个核苷酸(nt)的DNA的个体模块组成。ZF和TALE可以以期望的组合组装,并且附接至Fokl的核酸酶结构域,以将溶核活性导向特定的基因组基因座。经由细菌III型分泌系统递送到宿主细胞后,TAL效应物进入核,与宿主基因启动子中的效应物特异性序列结合并激活转录。它们的靶向特异性由串联的33-35个氨基酸重复序列的中央结构域决定。这之后是20个氨基酸的单个截短重复序列。大多数经检查的天然存在的TAL效应物具有12个和27个之间的完全重复序列。
这些重复序列彼此仅有两个相邻的氨基酸(其重复序列可变双残基(RVD))差异。RVD决定TAL效应物识别哪种单核苷酸:一种RVD对应于一种核苷酸,其中四种最常见的RVD各自优先与四种碱基中的一种缔合。天然存在的识别位点前面统一有TAL效应物活性所需的T。TAL效应物可以与Fokl核酸酶的催化结构域融合以产生TAL效应物核酸酶(TALEN),TAL效应物核酸酶(TALEN)在体内产生靶向的DNA双链断裂(DSB)用于基因组编辑。该技术在基因组编辑中的使用在本领域中被充分描述,例如在US 8,440,431、US 8,440,432和US 8,450,471中被充分描述。定制的质粒,可以与Golden Gate克隆方法一起使用,以组装多个DNA片段。Golden Gate方法使用IIS型限制性内切核酸酶,该IIS型限制性内切核酸酶在其识别位点外裂解,以产生独特的4bp突出端(overhang)。由于正确的组装消除了酶识别位点,因此通过在同一反应混合物中消化和连接来加速克隆。定制TALEN或TAL效应物构建体的组装包括两个步骤:(i)将重复序列模块组装成1-10个重复序列的中间阵列,以及(ii)将中间阵列连接成骨架,以形成最终的构建体。
根据本发明的多个方面,可以使用的另一种基因组编辑方法是CRISPR。该技术在基因组编辑中的使用在本领域中被良好描述,例如在US 8,697,359和本文引用的参考文献中被良好描述。简言之,CRISPR是参与防御入侵噬菌体和质粒的微生物核酸酶系统。微生物宿主中的CRISPR基因座包含CRISPR相关(Cas)基因以及能够编程CRISPR介导的核酸裂解的特异性的非编码RNA元件(sgRNA)的组合。三种类型(I-III)的CRISPR系统已在许多的细菌宿主中被鉴定。每个CRISPR基因座的一个关键特征是存在由非重复序列的短链段(间隔物)间隔开的重复序列(同向重复序列)的阵列。非编码CRISPR阵列在同向重复序列中被转录和裂解成包含个体间隔物序列的短crRNA,其将Cas核酸酶导向靶位点(前间区(protospacer))。II型CRISPR是最良好表征的系统之一,并且以四个连续步骤进行靶向的DNA双链断裂。第一,两个非编码RNA前crRNA阵列和tracrRNA,由CRISPR基因座转录。第二,tracrRNA与前crRNA的重复序列区域杂交,并且介导将前crRNA加工成包含个体间隔物序列的成熟crRNA。第三,成熟的crRNA:tracrRNA复合物经由crRNA上的间隔物和邻近前间区相邻基序(PAM)的靶DNA上的前间区之间的Watson-Crick碱基配对将Cas9导向靶DNA,这是靶识别的附加要求。最后,Cas9介导靶DNA的裂解,以在前间区内产生双链断裂。
因此,Cas9是II型CRISPR-Cas系统的标志蛋白,并且是一种大单体DNA核酸酶,通过两个非编码RNA:CRIPSR RNA(crRNA)和反式激活crRNA(tracrRNA)的复合物指导至与PAM(前间区相邻基序)序列基序相邻的DNA靶序列。Cas9蛋白包含与RuvC和HNH核酸酶同源的两个核酸酶结构域。HNH核酸酶结构域裂解互补DNA链,而RuvC样结构域裂解非互补链,结果,在靶DNA中引入了平端切割(blunt cut)。Cas9与sgRNA一起异源表达可以将位点特异性双链断裂(DSB)引入来自多种生物体的活细胞的基因组DNA中。对于在真核生物体中的应用,已经使用Cas9的密码子优化形式,Cas9最初来自细菌酿脓链球菌(Streptococcuspyogenes)。
单指导RNA(sgRNA)是CRISPR/Cas系统的第二组分,与Cas9核酸酶形成复合物。sgRNA是合成的RNA嵌合体,通过将crRNA与tracrRNA融合而产生。位于其5’末端的sgRNA指导序列赋予DNA靶特异性。因此,通过修饰指导序列,可能产生具有不同靶特异性的sgRNA。指导序列的典型长度通常是20bp。
可选地,全局转录机器工程(global transcription machinery engineering,gTME)可以为菌株改进提供替代且有用的方法。
在一种实施方案中,通过用BMA施加选择压力从微生物群体中选择耐受性突变体来分离突变体。已知,微生物群体包含以低频率(<106中的1个)自发出现的突变体,并且施加选择压力引起最适突变体的过度生长。例如,通过逐渐增加BMA的浓度或通过从开始添加高浓度(10%-30%)的BMA来施加选择压力。选择压力可以包括(a)分批培养,直至观察到生长;(b)2次或更多次转移的连续分批培养;(c)恒化器培养物随着递增的BMA浓度和递增的稀释速率的生长;(4)pH-auxostat或恒浊器培养物随着递增的BMA浓度的生长。
甲基丙烯酸C3-C12酯
术语甲基丙烯酸C3-C12酯通常意指包含C3-C12烷基、羟烷基、烯基、烷基芳基或烯基芳基基团的甲基丙烯酸酯,包括其结构异构体。C3-C12基团可以是环状的、无环的或部分环状的、直链的或支链的、脂肪族的、芳族的或部分芳族/脂肪族的。优选地,本发明的甲基丙烯酸C3-C12酯可以包括例如甲基丙烯酸正丙酯、甲基丙烯酸异丙酯、甲基丙烯酸异丁酯、甲基丙烯酸正丁酯、甲基丙烯酸叔丁酯、甲基丙烯酸异戊酯、甲基丙烯酸己酯、甲基丙烯酸环己酯、甲基丙烯酸2-乙基己酯、甲基丙烯酸癸酯、甲基丙烯酸十二烷基酯、甲基丙烯酸羟乙酯、甲基丙烯酸羟丙酯、甲基丙烯酸异冰片酯、甲基丙烯酸烯丙酯或甲基丙烯酸肉桂酯。
优选地,甲基丙烯酸C3-C12酯是甲基丙烯酸C3-C12烷基酯,更优选地甲基丙烯酸C3-C8烷基酯,例如甲基丙烯酸正丙酯、甲基丙烯酸异丙酯、甲基丙烯酸异丁酯、甲基丙烯酸正丁酯、甲基丙烯酸异戊酯、甲基丙烯酸己酯、甲基丙烯酸环己酯、甲基丙烯酸庚酯、甲基丙烯酸辛酯、甲基丙烯酸2-乙基己酯、甲基丙烯酸癸酯或甲基丙烯酸十二烷基酯。
优选地,甲基丙烯酸羟烷基酯是甲基丙烯酸羟乙酯或甲基丙烯酸羟丙酯。
在实施方案中,甲基丙烯酸C3-C12酯是甲基丙烯酸C3-C12烯基芳基酯,例如甲基丙烯酸肉桂酯。
更优选地,甲基丙烯酸C3-C12烷基酯是甲基丙烯酸C3-C6烷基酯,诸如甲基丙烯酸丙酯、甲基丙烯酸丁酯或甲基丙烯酸己酯,包括例如其结构异构体。更优选地,甲基丙烯酸C3-C6烷基酯是甲基丙烯酸丙酯或甲基丙烯酸丁酯,特别地甲基丙烯酸异丙酯或甲基丙烯酸正丁酯。
在一种实施方案中,甲基丙烯酸C3-C12酯是甲基丙烯酸丁酯(BMA)。
为了产生MMA,本文描述的方法可以包括另外的步骤:c)从发酵培养基中取出甲基丙烯酸C3-C12酯,并将所取出的甲基丙烯酸C3-C12酯与甲醇进行酯交换,以产生甲基丙烯酸甲酯。
在本发明的方面和实施方案中,微生物可以被遗传修饰以产生比野生型更多的甲基丙烯酸C3-C12酯。因此,微生物具有与本文描述的野生型相比使其对甲基丙烯酸酯更耐受的并且与野生型相比被修饰以增加甲基丙烯酸C3-C12酯的产量的突变,例如,微生物携带表达甲基丙烯酸C3-C12酯生物合成途径的酶的转基因构建体。
因此,如本文描述的用于产生甲基丙烯酸酯的方法可以包括a)提供与野生型相比具有增加的对甲基丙烯酸C3-C12酯的耐受性并且被进一步修饰以与野生型相比在发酵培养基中增加甲基丙烯酸C3-C12酯产量的遗传修饰的微生物,和
b)使微生物在产生甲基丙烯酸C3-C12酯的条件下生长。
与野生型微生物相比,增强甲基丙烯酸C3-C12酯的产生可以包括通过使用本领域已知的多种遗传工程技术对现有的细胞代谢过程、核酸和/或蛋白进行修饰。增强甲基丙烯酸C3-C12酯的产生还可以包括修饰一种或更多种微生物以在一种或更多种微生物中表达一个或更多个异源基因。这些可以包括编码从碳基原料至甲基丙烯酸C3-C12酯的期望途径的酶的基因,或者可以包括直接或间接促进这样的途径中的酶的功能和表达的其他辅助基因。因此,在一种实施方案中,可以修饰微生物以增强甲基丙烯酸C3-C12酯的产生。可以在微生物中表达使得其被修饰以产生甲基丙烯酸C3-C12酯,优选地甲基丙烯酸C3-C12烷基酯的一个或更多个基因,包括编码以下任何酶的基因。异源基因可以通过用包含异源基因的载体转化微生物来表达。
在实施方案中,微生物可以表达能够将异丁酰CoA转化为甲基丙烯酰CoA的一种或更多种酶,例如氧化酶、脱氢酶或氧化还原酶。
氧化酶可以是EC编号1.3.x.x下的作用于CH-CH键的氧化酶,更优选地EC编号EC1.3.3.x下的使用氧作为电子受体作用于CH-CH键的氧化酶。仍更优选地,氧化酶是合适地在EC编号EC 1.3.3.6下的酰基CoA氧化酶。更优选地,酰基CoA氧化酶选自以下酶中任一种:来自拟南芥(Arabidopsis thaliana)的ACX4、来自烟草节杆菌(Arthrobacternicotianae)的短链酰基CoA氧化酶、来自绿豆(Vigna radiata)的过氧化物酶体酰基CoA氧化酶、来自假丝酵母属物种(Candida sp.)的酰基CoA氧化酶和来自热带假丝酵母(Candidatropicalis)的酰基CoA氧化酶4。最优选地,酰基CoA氧化酶是来自拟南芥的ACX4。
氧化还原酶可以是EC组编号1.X.X.X下的氧化还原酶。优选地,氧化还原酶是合适地在EC组1.3.X.X下的作用于电子供体的CH-CH基团的氧化还原酶。更优选地,作用于供体的CH-CH基团的氧化还原酶是FAD依赖性氧化还原酶,仍更优选地,氧化还原酶是EC组1.3.8.X下的CoA脱氢酶。仍更优选地,氧化还原酶是合适地在EC组1.3.8.1下的短链酰基CoA脱氢酶、合适地在EC组1.3.8.4组下的异戊酰CoA脱氢酶、合适地在EC组1.3.8.5下的2-甲基支链酰基CoA脱氢酶或合适地在EC组1.3.8.-下的酰基CoA脱氢酶,诸如异丁酰CoA脱氢酶。最优选地,氧化还原酶选自以下酶中的任一种:来自恶臭假单胞菌的短/支链酰基CoA脱氢酶、来自智人(Homo sapiens)的异丁酰CoA脱氢酶和来自拟南芥的异戊酰CoA脱氢酶。
CoA脱氢酶通常需要相关的电子传递系统来将底物的氧化与泛醌的还原偶联,然后泛醌被再生。这样的电子传递系统由电子转移黄素蛋白(ETF)和电子转移黄素蛋白泛醌氧化还原酶(ETFQO)组成。ETF必须与酰基CoA脱氢酶和ETFQO二者相容。因此,在使用酰基CoA脱氢酶的实施方案中,优选地采用以下再生系统之一:
·表达对异丁酰CoA具有活性的内源CoA脱氢酶及其相关的电子传递系统(诸如在例如恶臭假单胞菌的情况下)的宿主微生物;
·表达异源CoA脱氢酶伴有来自与该异源CoA脱氢酶相同生物体的电子传递系统的蛋白的宿主微生物。例如,来自智人、恶臭假单胞菌、脱氮副球菌(Paracoccusdenitrificans)或来自拟南芥的CoA脱氢酶和电子传递系统组分,都在大肠杆菌(或另一种宿主生物)中表达;或者
·表达异源CoA脱氢酶伴有也来自不同微生物的电子传递系统组分的宿主微生物,其中这些组分彼此相容并与CoA脱氢酶相容。例如,来自智人的CoA脱氢酶与野猪(Susscrofa)的电子转移黄素蛋白相容,继而与来自类球红细菌(Rhodobacter sphaeroides)的电子转移黄素蛋白泛醌氧化还原酶相容。可选地,由于拟南芥(A.thaliana)的ETF-泛醌氧化还原酶与类球红细菌(R.sphaeroides)的ETF-泛醌氧化还原酶具有良好的序列同源性,拟南芥的异戊酰CoA脱氢酶和ETF可以与来自类球红细菌的ETF-泛醌氧化还原酶形成功能系统,用于异丁酰CoA的氧化。最后,由于脱氮副球菌(P.Denitrificans))ETF与人类和猪ETF的相似性,预测来自脱氮副球菌的ETF和ETF-泛醌氧化还原酶与来自另一种来源的异丁酰CoA脱氢酶,诸如来自智人(H.sapiens)的异丁酰CoA脱氢酶或来自不同生物体的同源物相容。
在实施方案中,微生物可以表达能够将甲基丙烯酰CoA转化为甲基丙烯酸C3-C12酯的一种或更多种酶,例如醇酰基转移酶。
优选地,醇酰基转移酶在醇,更优选地C3-C12醇,最优选地C3-C8醇的存在下起作用,仍更优选在丙醇或丁醇,诸如正丙醇、异丙醇、正丁醇、异丁醇、仲丁醇、叔丁醇、戊醇、己醇、庚醇或辛醇的存在下起作用。最优选地,醇酰基转移酶在异丙醇或正丁醇的存在下起作用。
本文的术语醇意指具有羟基基团(-OH基团)并且能够与甲基丙烯酸酯形成酯基团的物质。
优选地,醇酰基转移酶来源于植物起源,更优选地,植物属于选自由以下组成的组的任何目:姜目(Zingiberales)、蔷薇目(Rosales)、杜鹃花目(Ericales)、葫芦目(Cucurbitales)、十字花目(Brassicales)和樟目(Laurales);更优选地,植物属于选自由以下组成的组的任何科:芭蕉科(Musaceae)、蔷薇科(Rosaceae)、杜鹃花科(Ericaceae)、猕猴桃科(Actinidiaceae)、葫芦科(Cucurbitaceae)、番木瓜科(Caricaceae)和樟科(Lauraceae);仍更优选地,植物属于选自由以下组成的组的属:芭蕉属(Musa)、草莓属(Fragaria)、苹果属(Malus)、李属(Prunus)、梨属(Pyrus)、越橘属(Vaccinium)、猕猴桃属(Actinidia)、黄瓜属(Cucumis)、番木瓜属(Carica)和鳄梨属(Persea);仍更优选地,植物是选自由以下组成的组的任何一种:香蕉、草莓、苹果、梅花(Prunus mume)、西洋梨(Pyruscommunis)、蓝莓、猕猴桃、甜瓜、番木瓜和鳄梨组成的组的任何一种。最优选地,醇酰基转移酶来源于水果起源,诸如苹果、甜瓜或番茄起源,合适地苹果起源。
在本发明的实施方案中,微生物可以表达用于将异丁酰CoA转化为甲基丙烯酸C3-C12酯的氧化酶和醇酰基转移酶。上文概述了合适的氧化酶和醇酰基转移酶。如果氧化酶是来自拟南芥的ACX4,则是特别优选的。
在实施方案中,微生物可以表达能够将2-酮异戊酸转化为异丁酰CoA的一种或更多种酶。在实施方案中,一种或更多种酶可以是酶复合物,诸如支链酮酸脱氢酶酶复合物,支链酮酸脱氢酶酶复合物由α亚基组分、硫辛酰胺酰基转移酶组分和硫辛酰胺脱氢酶组分组成。最优选地,脱氢酶选自以下酶中的任何一种:来自恶臭假单胞菌(P.putida)的支链酮酸脱氢酶(BCKD)、来自枯草芽孢杆菌的BCKD、来自铜绿假单胞菌的BCKD、来自拟南芥的BCKD、来自天蓝色链霉菌的BCKD和来自嗜热栖热菌(Thermus thermophilus)的BCKD。
可选地,可以通过以下催化2-酮异戊酸转化为异丁酰CoA:合适地在EC组1.X.X.X下的氧化还原酶,优选地合适地在EC组1.2.X.X下的作用于供体的醛或氧代基团的氧化还原酶,更优选地合适地在EC组1.2.7.X下的使用铁-硫蛋白作为电子受体作用于供体的醛或氧代基团的氧化还原酶,最优选地合适地在EC组编号1.2.7.7下的2-酮异戊酸铁氧还蛋白还原酶(也称为酮缬氨酸铁氧还蛋白氧化还原酶),是由α、β、γ和δ亚基组成的四聚体。这样的酶的实例是来自强烈火球菌(Pyrococcus furiosis)的2-酮异戊酸铁氧还蛋白还原酶;来自火球菌属物种(Pyrococcus sp.)的2-酮异戊酸铁氧还蛋白还原酶;来自热球菌属物种(Thermococcus sp.)的2-酮异戊酸铁氧还蛋白还原酶;来自Thermococcus litoralis的2-酮异戊酸铁氧还蛋白还原酶;来自Thermococcus profundus的2-酮异戊酸铁氧还蛋白还原酶和来自热自养甲烷杆菌(Methanobacterium thermoautotrophicum)的2-酮异戊酸铁氧还蛋白还原酶。
可选地,微生物可以表达能够将异丁酸转化为异丁酰CoA的一种或更多种酶,例如连接酶。连接酶合适地在EC组编号6.X.X.X下,优选地在EC组6.2.X.X下的碳-硫键形成连接酶,更优选地在EC组6.2.1.X下的酸-硫醇形成连接酶,更优选地GDP形成、ADP形成或AMP形成连接酶,诸如合适地在EC组6.2.1.1下的AMP形成乙酸-CoA连接酶,合适地在EC组6.2.1.2下的丁酸-CoA连接酶,合适地在EC组6.2.1.10下的羧酸-CoA连接酶,合适地在EC组6.2.1.13下的ADP形成乙酸-CoA连接酶,合适地在EC组6.2.1.17下的丙酸-CoA连接酶或在EC组6.2.1.-下的酸-硫醇连接酶。最优选地,连接酶选自以下酶中任一种:来自绿针假单胞菌(Pseudomonas chlororaphis)的AcsA、来自拟青霉(Paecilomyces varioti)的丁酰-CoA合成酶、来自牛心脏线粒体的丁酰-CoA合成酶。
可选地,微生物可以表达能够将异丁酸(isobutyrate)转化为异丁酰CoA的一种或更多种酶。例如,异丁酸可以通过激酶转化为异丁酰-磷酸酯,并且异丁酰-磷酸酯可以通过转移酶转化为异丁酰-CoA。
优选地,异丁酸通过以下转化为异丁酰基-磷酸酯:合适地在EC组编号EC 2.X.X.X下的激酶,优选地在EC 2.7.X.X下的激酶,更优选地在EC组编号EC 2.7.2.X下的激酶,最优选地合适地在EC组2.7.2.1下的乙酸激酶,在EC 2.7.2.6下的甲酸激酶,在EC 2.7.2.7下的丁酸激酶,在EC 2.7.2.14下的支链脂肪酸激酶或在EC 2.7.2.15下的丙酸激酶。最优选地,激酶选自以下酶的任何一种:来自螺旋体(Spirochete)MA-2的支链脂肪酸激酶、来自丁酸梭菌(C.butyricum)的丁酸激酶。
优选地,异丁酰-磷酸酯通过以下转化为异丁酰-CoA:通过在EC组编号2.X.X.X下的转移酶的作用,更优选地通过在EC组编号2.3.X.X下的酰基转移酶的作用,仍更优选地通过在EC组编号2.3.1.X下的转移除氨基-酰基基团以外的基团的酰基转移酶的作用。仍更优选地分别在EC组编号2.3.1.8和2.3.1.19下的磷酸乙酰转移酶或磷酸丁酰基转移酶。更优选地,转移酶是来自丙酮丁醇梭菌(Clostridium acetobutylicum)ATCC824的磷酸丁酰基转移酶或来自枯草芽孢杆菌、谷氨酸棒杆菌(Corynebacterium glutamicum)ATCC13032、海栖热袍菌(Thermotoga maritima)和克氏梭菌(Clostridium kluyveri)的磷酸乙酰转移酶。这些酶的其他来源包括其他厌氧细菌,特别地,梭菌属物种,诸如巴氏梭菌(Clostridium pasteurianum)或拜氏梭菌(Clostridium beijerinckii)。
微生物可以表达能够将异丁酸转化为异丁酰CoA的一种或更多种酶,例如合酶,优选地异丁酰CoA合酶,最优选地来自绿针假单胞菌(P.chloraphis)B23的异丁酰CoA合酶(AcsA)。
使用的合适的修饰和构建体还在WO2016/185211中公开,该文献通过引用并入本文。
发酵培养基和发酵
在另一方面,本发明涉及一种发酵培养基,该发酵培养基包含与野生型微生物相比具有增加的对甲基丙烯酸酯的耐受性的分离的遗传修饰的微生物。具体增加的耐受性的遗传修饰的微生物包含如以上描述的氧化应激响应或多药耐药系统的蛋白组分或调节物的修饰。
在一种实施方案中,在一种或更多种微生物将产生甲基丙烯酸C3-C12酯的条件下将所述微生物提供在发酵培养基中。
本发明的多个方面和实施方案包括将野生型或遗传修饰的微生物在所述发酵培养基中培养。合适地,培养(culturing)或培养(cultivation)需要碳基原料,微生物可以在该原料上获得能量并生长。因此,优选地,在碳基原料上培养一种或更多种微生物。
发酵培养基可以是围绕一种或更多种微生物的周围培养基。优选地,碳基原料存在于培养基中,任选地溶解或悬浮在培养基中,鼓入通过(bubbled through)培养基和/或与培养基混合。因此,优选地,培养基包含一种或更多种微生物和碳基原料连同任何缓冲剂和盐。
发酵培养基可以是适于微生物需要的商购可得的任何培养基。发酵培养基合适地包含碳基原料和氮源,以及一种或更多种微生物的生长和/或形成甲基丙烯酸C3-C12酯所需的另外的化合物。
本领域已知的合适的碳基原料的实例包括葡萄糖、麦芽糖、麦芽糖糊精、蔗糖、水解淀粉、淀粉、木质素、芳烃(aromatics)、合成气或其组分、甲烷、乙烷、丙烷、丁烷、糖蜜和油、二氧化碳。优选地,碳基原料来源于生物质。还可以使用混合物以及废弃物,诸如城市废弃物、食品废弃物和来自食品加工、林业或农业的木质纤维素废弃物。
本领域已知的合适氮源的实例包括大豆粉、玉米浆、酵母提取物、氨、铵盐、硝酸盐、尿素、氮气或其他含氮源。
一种或更多种微生物的生长可能需要(并且因此可能存在于发酵培养基中)的另外的化合物的实例包括抗生素、抗真菌剂、抗氧化剂、缓冲剂、磷酸盐(phosphate)、硫酸盐(sulphate)、镁盐、微量元素和/或维生素。
可以以不同类别的微生物之间,即真菌、酵母和细菌之间不同的量添加一种或更多种微生物的生长和/或产生甲基丙烯酸C3-C12酯所需的另外的化合物,如磷酸盐、硫酸盐或微量元素。此外,待添加的另外的化合物的量可以由何种途径用于形成甲基丙烯酸C3-C12酯来决定。
待添加至培养基中的碳基原料和氮源的量可以根据一种或更多种微生物的需要和/或微生物培养的时长而变化。培养基中碳基原料与氮源的比例可以显著不同。
通常,微生物的生长所需的每种发酵培养基组分的量通过测量营养物上的生长量(growth yield)确定,并且根据培养过程中使用的碳基原料的量来进一步评估,因为形成的生物质的量主要由使用的碳基原料的量和在任何给料方案期间施加的营养物限制来决定。
在碳基原料来源于生物质的实施方案中,生物质优选地包含大量碳水化合物。特别优选的是作为C5或C6糖、碳基气体或芳烃的来源,优选地作为C5或C6糖的来源,更优选地作为葡萄糖的来源的碳水化合物,诸如但不限于淀粉、木质素、纤维素、糖原、阿拉伯木聚糖、壳多糖或果胶。
可选地,生物质可以包含大量脂肪,特别优选的是作为甘油和脂肪酸的来源的脂肪或油,具体地甘油三酯。合适的甘油三酯包括可容易从植物或动物来源获得的任何油或脂肪。这样的油和脂肪的实例包括棕榈油、亚麻籽油、菜籽油、猪油、黄油、鲱鱼油、椰子油、植物油、向日葵油、蓖麻油、大豆油、橄榄油、可可脂、酥油(ghee)、鲸脂(blubber)等。
生物质可以由一种或更多种不同的生物质来源组成。合适的生物质来源的实例包括原生木材(virgin wood)、能源作物、农业残余物、食品弃废物、城市废弃物和工业废弃物或副产品。
原生木材生物质来源可以包括但不限于木屑、树皮、残枝(brash)、原木、锯屑、木屑颗粒或木块。
能源作物生物质来源可以包括但不限于短轮伐期矮林或森林、非木本草类诸如芒属植物(miscanthus)、大麻柳枝稷、芦苇或黑麦、农业作物诸如糖、淀粉或油料作物、或水生植物诸如微藻或大型藻类;和杂草。
农业残余物可以包括但不限于皮壳、秸秆、玉米秸秆、面粉、谷物、家禽褥草(poultry litter)、粪肥、泥浆、合成气或青贮饲料。
食品废弃物可以包括但不限于外皮/皮、壳、皮壳、核、果仁/果核(stones)、动物或鱼的不可食用部分、来自果汁和油提取的肉质部分、来自酿造的废糟(spent grain)或酒花、家庭厨房废弃物、猪油或油或脂肪。
工业废弃物可以包括但不限于未处理的木材(包括颗粒)、处理的木材、页岩气、包括MDF/OSD的木质复合材料、木质层压板、纸浆/碎料/废弃物、包括纤维/纱线/流出物(effluent)的纺织品或污水污泥。
微生物可以按分批、重复分批、补料分批、重复补料分批或连续培养(恒化器、恒浊器或auxostat)方法来培养。
培养方法优选地以工业规模进行。工业规模的方法被理解为包括在体积规模≥0.01m3、优选地≥0.1m3、优选地≥0.5m3、优选地≥5m3、优选地≥10m3、更优选地≥25m3、更优选地≥50m3、更优选地≥100m3、最优选地≥200m3的一个或更多个发酵罐中的培养方法。
在实施方案中,培养在生物反应器中进行。生物反应器通常被理解为意指对微生物进行工业培养的容器。生物反应器可以是任何尺寸、数目和形式,并且可以包括用于提供营养物、用于生长的另外的化合物、新鲜培养基、碳基原料、气体添加剂(诸如但不限于空气、氮气、氧气或二氧化碳)的入口。生物反应器还可以包括用于取出大量培养基以从发酵培养基中收集甲基丙烯酸C3-C12酯的出口。生物反应器还可以具有用于对培养物进行取样的出口。生物反应器可以具有用于测量和控制pH的系统。这种pH控制系统用于控制在pH-auxostat培养中的培养基添加。在一些实施方案中,可以使用培养瓶培养。
生物反应器通常可以被配置成例如,通过搅拌、摇动、振荡、倒置、将气体鼓入通过培养物等来混合发酵培养基。可选地,一些连续培养物不需要混合,例如使用塞流系统的微反应器系统。生物反应器是本领域常见且熟知的,并且实例可以见于标准教科书中,诸如“Biotechnology:A Textbook of Industrial Microbiology,第二版(1989)作者:WulfCruegar和Annelise Crueger,由Thomas D.Brock Sinauer Associates,Inc.,Sunderland,MA翻译。本发明还涉及包含如本文描述的遗传修饰的生物体的培养基。此外,本发明涉及包含含有如本文描述的遗传修饰的生物体的发酵培养基或培养基的容器。此外,本发明涉及包含含有如本文描述的遗传修饰的生物体的发酵培养基或培养基的试剂盒。
突变体生物体及其用途
在另一方面,本发明还涉及用于产生甲基丙烯酸酯(例如甲基丙烯酸C3-C12酯)的甲基丙烯酸酯耐受的(特别地甲基丙烯酸C3-C12酯耐受的)分离的遗传修饰的微生物。此外,本发明涉及甲基丙烯酸酯耐受的(特别地甲基丙烯酸C3-C12酯耐受的)分离的遗传修饰的微生物在产生甲基丙烯酸酯(例如甲基丙烯酸C3-C12酯)中的用途。遗传修饰的微生物可以是如以上详细描述的。
在另外的方面,本发明涉及甲基丙烯酸酯耐受的(特别地甲基丙烯酸C3-C12酯耐受的)分离的遗传修饰的微生物,所述微生物在marR(SEQ ID NO.7)中包含突变,其中参考SEQID NO.8,所述突变体marR核酸编码具有V84被另一种氨基酸例如G取代的蛋白。在另外的方面,本发明涉及甲基丙烯酸酯耐受的(特别地甲基丙烯酸C3-C12酯耐受的)分离的遗传修饰的微生物,所述微生物在rob(SEQ ID NO.5)中包含突变,其中参考SEQ ID NO.6,所述突变体Rob核酸编码具有Rob中A70被另一种氨基酸取代、R156被另一种氨基酸取代的蛋白。在另外的方面,本发明涉及甲基丙烯酸酯耐受的(特别地甲基丙烯酸C3-C12酯耐受的)分离的遗传修饰的微生物,所述微生物在acrR(SEQ ID NO.3)中包含突变,其中参考SEQ ID NO:4,所述突变体acrR核酸编码具有T32处的移码突变的蛋白。在另外的方面,本发明涉及甲基丙烯酸酯耐受的(特别地甲基丙烯酸C3-C12酯耐受的)分离的遗传修饰的微生物,所述微生物在soxR(SEQ ID NO.1)中包含突变,其中参考SEQ ID NO.2,所述突变体soxR核酸编码具有R20被另一种氨基酸取代、残基146的缺失或残基139处的截短的蛋白。
在另外的方面,本发明涉及甲基丙烯酸酯耐受的(特别地甲基丙烯酸C3-C12酯耐受的)分离的遗传修饰的微生物,所述微生物在选自以下的两种或更多种核酸中包含突变:soxR、acrR、marR和/或rob核酸或其同源物,其中所述核酸编码突变体蛋白。
在另外的方面,本发明涉及甲基丙烯酸酯耐受的(特别地甲基丙烯酸C3-C12酯耐受的)分离的遗传修饰的微生物,所述微生物包含acrR中的突变和在选自以下的核酸中的突变:soxR、marR和/或rob核酸或其同源物,其中所述核酸编码突变体蛋白。
根据本发明的多个方面,微生物如以上描述,并且在一种实施方案中,微生物是大肠杆菌。
在一种实施方案中,突变体soxR核酸序列编码在DNA结合结构域(残基1-80)或在FE-S簇结构域(残基119-154)中具有突变的突变体SoxR蛋白。在一种实施方案中,SoxR中的突变选自以下之一:参考SEQ ID NO.2,R20被另一种氨基酸的取代、R20被另一种氨基酸的取代、残基146的缺失或残基139处的截短。在一种实施方案中,R20被H取代。在一种实施方案中,R20被L取代。在除大肠杆菌之外的微生物中的SoxR同源物中的等同位置处的修饰也在本发明的范围内。
在一种实施方案中,acrR核酸序列编码在DNA结合结构域(残基7-51)或在配体结合结构域(残基55-204)中具有突变的突变体AcrR蛋白。在一种实施方案中,AcrR中的突变选自以下之一:参考SEQ ID NO:3,V29被另一种氨基酸的取代、T32处的移码突变、Y49处的移码突变、A191处发生的移码突变、残基146的缺失或残基139处的截短。在一种实施方案中,V29被G取代。在除大肠杆菌之外的微生物中的AerR同源物中的等同位置处的修饰也在本发明的范围内。
在一种实施方案中,突变体rob核酸序列编码在N-末端DNA结合结构域(残基1-120)和C-末端结构域(残基121-189)中具有突变的突变体Rob蛋白。在一种实施方案中,Rob中的突变选自以下之一:参考SEQ ID NO.6,在Rob中A70被另一种氨基酸的取代、或R156被另一种氨基酸的取代。在一种实施方案中,A70被V或T取代。在一种实施方案中,R156被H取代。在除大肠杆菌之外的微生物中的Rob同源物中的等同位置处的修饰也在本发明的范围内。
在一种实施方案中,突变体marR核酸序列编码在DNA结合结构域(残基55-100)中具有突变的突变体MarR蛋白。在一种实施方案中,MarR中的突变是参考SEQ ID NO.8,V84被另一种氨基酸取代。在一种实施方案中,V84被G取代。在除大肠杆菌之外的微生物中的MarR同源物中的等同位置处的修饰也在本发明的范围内。
在一种实施方案中,存在如表1a中示出的两个或更多个突变。在一种实施方案中,微生物选自具有如以下列出的遗传突变的微生物:
-rob(R156H)rpoC(L361R)ilvN(C41Y)ygbK(A294E)IpxM(168_185del);
-rob(R156H)i/vN(C41Y)phoP(L11F)acrB(V448L);
-soxR(Leu139X)580116(G>T);
-soxR(A146del);
-rob(A70V);
-rob(R156H)rpoB(T1037P)torY(A87T)acrR(49Yfs);
-rob(A70T)yohJ(L109R)dnaK(V377G)927777(C>T)acrR(A191fs);
-marR(V84G)rpoC(R1075C)ompR(R15S)acrB(T3791);
-marR(V84G)rpoC(R1075C)ompR(R15S);
-marR(V84G)rpoC(R1075C)rpoC(A787V)ompR(R15S)acrB(V901I);
-rob(R156H)rpoB(T1037P)groL(P279L)acrR(49Yfs)1197659(C>A)或
-soxR(R20L)moC(r1075C)2133236(T>A)3915915(T>G)。
在一种实施方案中,soxR中的突变与acrR中的突变组合。如实施例中展示的,本发明人惊讶地发现,将两个基因中的突变组合导致累加的耐受性效应。在一种实施方案中,soxR中的突变与marR中的突变组合。在一种实施方案中,soxR中的突变与rob中的突变组合。
在一种实施方案中,微生物还包含例如在编码氧化应激响应的蛋白组分或调节物的基因中的一个或更多个另外的突变。在一种实施方案中,突变处于rpo核酸序列中,例如,rpoB(SEQ ID NO 25)或rpoC(SEQ ID NO 27)、ompR(SEQ ID NO 29)、acrB(SEQ ID NO 9)、yohJ(SEQ ID NO 11)、torY(SEQ ID NO 13)、ipxM(SEQ ID NO 15)、dnaK(SEQ ID NO 17)、grol(SEQ ID NO 19)、ilvN(SEQ ID NO 21)、phop(SEQ ID NO 31)、ygbK(SEQ ID NO 23)。在另一种实施方案中,不存在对氧化应激响应或多药耐药系统的蛋白组分或调节物的另外的修饰。
在一种实施方案中,acrR中的突变与marR、rob或soxR之一中的一个或更多个突变组合。这是基于令人惊讶的发现,即发现acrR中的突变与marR、rob或soxR组合(参见实施例2)。
在一种实施方案中,acrR中的突变与marR中的一个或更多个突变组合。在一种实施方案中,acrR中的突变与rob中的一个或更多个突变组合。在一种实施方案中,一个或更多个另外的突变选自表1b中列出的突变。
用于制造突变体生物体的方法
本发明还涉及一种用于分离甲基丙烯酸酯耐受性微生物的方法,该方法包括:
a)在发酵培养基中提供微生物
b)使微生物与甲基丙烯酸酯接触;和
c)分离步骤(b)的存活微生物
其中所述存活微生物在约37℃在液体培养基中生长时耐受至少20%v/v的甲基丙烯酸酯。
微生物选自如以上描述的微生物,诸如大肠杆菌。在一种实施方案中,该方法采用适应性进化方法,该适应性进化方法包括将生物体在含有依次递增浓度的甲基丙烯酸酯例如BMA的培养基中培养。例如,在第一个步骤中,使微生物与0.1%浓度的甲基丙烯酸酯接触。在随后的步骤中,将浓度从0.1%逐步增加至例如0.5%、1%、5%、10%和20%,并且将微生物暴露于该浓度一定时间段。在另一种实施方案中,将浓度逐步增加至10%和20%。使微生物与每种浓度的甲基丙烯酸酯接触可以是约(0.1h至144小时)1次至45次。在另一种实施方案中,培养温度可以在该方法期间改变。培养温度可以维持在4℃至50℃。在另一种实施方案中,该方法包括将一种培养物与BMA一起孵育,直至生长发生,然后分离突变体。
例如,适应性进化可以通过以下方式进行:大肠杆菌在恒化器中的适应性进化可以在约0.33h-1的起始稀释速率建立。然后BMA浓度以逐步方式以恒定的稀释速率从0%v/v逐渐增加至20%v/v。在达到约20%v/v BMA的稳定细胞浓度后,培养物的稀释速率可以在约0.33h-1和约0.55h-1之间调节,其中BMA浓度保持恒定在约20%v/v。另外的适应性进化实验通过以下实现:逐步将温度从37℃增加至44℃,同时将BMA浓度维持在约20%v/v,并且初始稀释速率约为0.41h-1。使用pH-auxostat的适应性进化可以使用pH反馈控制系统实现,在该系统中,仅当pH变得低于设定值时,营养物、碱和其他添加剂才会开始流入。生物反应器中培养基的酸化指示培养物中细胞的生长,以其速率控制营养物的添加,并且从而控制稀释速率。稀释速率调节为与施加的条件下培养物的生长速率匹配,并且允许自动选择,其中生长快速的菌株保留在生物反应器中并且生长较慢的菌株被清洗掉。大肠杆菌在pH-auxostat中的进化实验可以在生物反应器中最初没有添加BMA的情况下开始。BMA浓度可以从0逐渐增加至0.1%v/v、0.5%v/v、1.0%v/v、5.0%v/v、10%v/v和20%v/v。
微生物在如以上描述的合适的发酵培养基中在约37℃生长,例如以约200RPM生长。
甲基丙烯酸酯是如本文别处描述的。
本发明还涉及通过以上描述的方法分离的微生物。
本说明书中提及的所有文件,包括基因和蛋白数据库中所有对SEQ ID NO的提及,均通过引用以其整体并入本文。除非另有指定,否则序列形式是形式1。
在本文中使用的“和/或”被认为是两个指定的特征或组分中的每一个与或不与另一个一起的具体公开。例如“A和/或B”被认为是(i)A、(ii)B和(iii)A和B中的每一个的具体公开,如同每一个在本文中单独列出一样。除非上下文另有指示,否则上文列出的特征的描述和定义不限于本发明的任何特定方面或实施方案,并且同样适用于所描述的所有方面和实施方案。
在以下非限制性实施例中进一步描述本发明。
实施例
实施例1:BMA耐受性突变体菌株的分离和表征
在这些实验中使用大肠杆菌K12MG1655。这是常见的实验室菌株,并且该菌株的完整基因组序列是可得的(NCBI参考序列:NC_000913.3)。该菌株可以例如从大肠杆菌遗传保藏中心(the Coli Genetic Stock Center)(CGSC)(菌株7636)或ATCC获得。
我们分离出对BMA耐受的大肠杆菌MG1655变体。使大肠杆菌MG1655在含有20%(v/v)BMA的MSX培养基中在30mL瓶中在37℃、250rpm生长72小时。72h后观察生长。将来自该培养物的样品划线到LB琼脂板上,并且在37℃孵育过夜。然后分离菌落,并且使用IlluminaNGS技术根据制造商的指导进行全基因组深度测序分析,以表征突变体。分析了与MG1655参考基因组(登记号NC_00913.1)相比的变体的Fastq输出文件。测序分析鉴定出以下突变:
大肠杆菌MG1655 soxR(R20H)
大肠杆菌MG1655 soxR(R20H)acrR(V29G)
大肠杆菌MG1655 soxR(R20H)acrR(T32fs)
在随后的筛选实验中,将如以上描述地分离的野生型大肠杆菌MG1655和突变体(大肠杆菌MG1655 soxR(R20H)、大肠杆菌MG1655 soxR(R20H)acrR(V29G)、大肠杆菌MG1655soxR(R20H)acrR(T32fs))在限定培养基(MSX)中在不存在或存在接种后立即添加的20%(v/v)BMA的情况下生长(图1)。培养物在250mL摇瓶中在37℃和250rpm生长。WT菌株不能够在20%(v/v)BMA的存在下生长,但所有突变体能够在20%(v/v)BMA的存在下生长,证明突变赋予对BMA的耐受性。我们还观察到双突变体大肠杆菌MG1655 soxR(R20H)acrR(V29G)和大肠杆菌MG1655 soxR(R20H)acrR(T32fs)能够比具有soxR突变的单突变体生长得更好,表明soxR和acrR中的突变的组合提供加性效应并增强耐受表型。
然后大肠杆菌MG1655、大肠杆菌MG1655 soxR(R20H)、大肠杆菌MG1655 soxR(R20H)acrR(V29G)和大肠杆菌MG1655 soxR(R20H)acrR(T32fs)在具有指数中期期间添加的20%(v/v)BMA的MSX培养基中生长(图2)。使培养物在250mL摇瓶中在37℃和250rpm生长。这些结果再次表明,与单独的soxR(R20H)突变相比,突变的基因acrR(V29G)和acrR(T32fs)与soxR(R20H)组合改善了细胞的适应性并且赋予增强的对BMA的耐受性。
然后我们产生了其中分别使用标准方案敲除soxR和acrR的菌株。使敲除菌株大肠杆菌BW25113ΔsoxR和大肠杆菌BW25113ΔacrR在不存在或存在接种后立即添加的20%(v/v)BMA的情况下生长(图3)。培养物在MSX培养基中在30mL瓶中在37℃和250rpm摇动生长。尽管敲除菌株能够在没有BMA的培养基中生长,但敲除菌株都不能够在BMA的存在下生长。这表明acrR和soxR功能的丧失并不赋予BMA耐受性。因此,soxR(R20H)编码能够调节其他基因的转录的功能性soxR。同样,突变的基因acrR(V29G)和acrR(T32fs)可以编码能够赋予BMA耐受性的功能蛋白。
制备具有单突变的菌株以了解acrR(V29G)和acrR(T32fs)本身是否可以赋予BMA耐受性并且因此编码功能蛋白。然后使大肠杆菌MG1655 acrR(V29G)和大肠杆菌MG1655acrR(T32fs)在具有或不具有接种后立即添加的20%(v/v)BMA的情况下生长(图4)。培养物在MSX培养基中在30mL瓶中在37℃和250rpm摇动生长。大肠杆菌MG1655 acrR(V29G)和大肠杆菌MG1655 acrR(T32fs)都能够在BMA的存在下生长。这表明这些突变编码能够充当转录因子、赋予对BMA的耐受性的功能性AcrR。
总之,结果表明,与野生型大肠杆菌相比,soxR或acrR中的突变赋予对BMA的耐受性,并且包含soxR或acrR二者中的突变的双突变体显示出增强的耐受性。
实施例2:通过适应性进化对BMA耐受突变体菌株的分离和表征
我们使用适应性进化方法鉴定BMA耐受性菌株。总之,适应性进化包括使微生物菌株在期望的选择压力的影响下扩展繁殖(extended propagation)。具有增强的生长速率的突变体由于增加的耐受性偶尔会随时间在群体中出现并扩增。因此,通过对这些突变体分离株的重复分离、鉴定和测序,获得增强的菌株。
实验使用大肠杆菌菌株BW25113。这是常见的实验室菌株,并且该菌株的完整基因组序列是可得的(NCBI参考序列:NZ_CP009273.1,参见Grenier等人:Complete GenomeSequence of Escherichia coli BW25113,Genome Announc.2014Sep-Oct;2(5):e01038-14)。该菌株可以例如从大肠杆菌遗传保藏中心(CGSC)(菌株7636)或ATCC获得。
首先通过使微生物在0.01%至20.0%v/v的浓度范围的BMA的存在下生长来研究BMA对其生长的影响(图5a和图5b)。大肠杆菌在0.01%v/v的BMA浓度的生长(图5.1),与其在BMA不存在时的生长非常相似。然而,随着BMA浓度进一步增加至0.05%v/v和0.1%v/v,观察到更长的滞后期和更低的生长速率(图5a和表2.1)。因此,高于0.05%v/v的浓度的BMA抑制大肠杆菌的生长。
随着BMA浓度进一步增加至20%v/v(图5b),在孵育24-36小时后才观察到细胞生长,并且生长不一致。
表2.1大肠杆菌在不同BMA浓度的生长动力学参数
将在具有0.1%v/v和0.5%v/v BMA时生长的培养物传代培养(图5.3),以了解大肠杆菌的BMA耐受性极限。从在具有0.1%v/v BMA时生长的培养物中取出等分试样,转移至新鲜培养基中,并且在相同条件下生长。传代培养物群体的生长模式与先前的培养物非常相似(图5c),表明在0.1%v/v BMA观察到的耐受性可能是由于大肠杆菌的固有耐受性。在另一方面,从在0.5%v/v BMA生长的培养物中取出的传代培养物在接种仅12小时后显示出显著的生长,这比在相同条件下生长的原始培养物(progenitor culture)要早得多(图5c)。原始培养物和传代培养物的生长模式差异表明大肠杆菌在传代培养物中的生长不是由于固有的耐受性,而是在暴露于0.5%v/v BMA后获得的。
传代培养物实验的结果表明大肠杆菌对BMA的固有耐受性是至少0.1%v/v,但不多于0.5%v/v。
使用多种适应性进化实验来产生BMA耐受性大肠杆菌。在第一实验(ADE-1;图5d)中,使三种平行培养物在包含对于每次连续转移具有从0.1%分别增加至0.5%、1.0%、5.0%、10.0%和20%v/v的递增BMA浓度的10mL M9基本培养基的50mL管中生长。使用来自先前培养物的0.15mL等分试样,并且转移至具有较高BMA浓度的新鲜培养基中。最佳生长培养物用作下一次连续转移的起始培养物。
在第二适应性进化实验(ADE-2;图5e)中,在增加BMA浓度前,允许细胞在相同BMA浓度经历五(5)次连续转移。在该实验中,三种培养物各自用作在单独的管中连续转移的起始培养物。
第三适应性进化实验(ADE-3;图5f)通过使大肠杆菌在具有0.1%v/v BMA时生长一次,然后在具有10%v/v BMA时生长两次,并且最后在具有20%v/v BMA时生长45次来进行。
第四适应性进化实验使用恒化器培养物实现(ADE-4;图5g)。BMA浓度以逐步方式以恒定的稀释速率从0%v/v逐渐增加至20%v/v。在达到在20%v/v BMA的稳定细胞浓度后,培养物的稀释速率在0.33h-1和0.55h-1之间调整,BMA浓度保持恒定在20%v/v。
在适应性进化实验后,将存活培养物的等分试样铺板到LB琼脂中,并且挑取一定数目的(至少6个)个体菌落并在液体培养基中再生长,以保存每种分离的菌株的储备培养物。为了评估和比较BMA如何影响每种分离的菌株的生长,确定它们在20%v/v BMA存在下的生长动力学。用于确定菌株生长动力学的标准生长条件是在配备有suba-密封件的250mL锥形瓶中,在37℃和200rpm使用摇瓶培养箱,在含有10g L-1葡萄糖和20%v/v BMA的50mLM9培养基中的~0.025的起始光密度(O.D.)。通过测量其O.D.持续36小时来监测生长,并且在接种后的第一个13-18小时、24小时和36小时每小时获取样品。分离的菌株的细胞生长动力学参数的总结列于表2.2中。分离的菌株的生长曲线还示于图5中。
表2.2从多种分批和连续培养物在20%v/v BMA的适应性进化中分离的BMA耐受性菌株的生长动力学参数。
菌株来源:a=图5.2,b=ADE-1,c=ADE-2,d=ADE-3,e=ADE-4。f至g取自不同稀释速率(h-1)的ADE-4,其中f=0.46,并且g=0.54。NCBI参考序列:NZ_CP009273.1
参考亲本菌株大肠杆菌BW25113分析和比较从ADE-1、ADE-2、ADE-3和ADE-4产生的多种BMA耐受性菌株的基因组DNA序列(表2.3、表2.4和表2.55)。使用具有作为洗脱液的Tris-HCl(10mM;pH=7.5)的GeneEluteTM细菌基因组DNA试剂盒从选择的分离株在LB肉汤中的过夜培养物制备用于DNA测序的基因组DNA样品。使用MiSeq v2 150PE分析DNA序列,以产生至少11M+11M个读段/运行。使用Cutadapt版本1.12,为了质量,使用30的阈值和Nextera XT试剂盒的衔接子序列(SEQ ID NO.33:CTGTCTCTTATA),在3’末端处将读段修剪至36的最小长度。基因组比对和变体判定(variant calling)通过Snippy流水线(pipeline)进行,以鉴定分离的菌株和亲本菌株/参考基因组之间的基因组DNA序列差异(大肠杆菌菌株BW25113,组装ASM75055v1,来自Ensembl的注释形式34)。使用Snippy版本3,最小10个读段覆盖每个位置,并且0.9作为与参考必须有差异的读段的最小分数。
表2.3编码基因表达调节蛋白的来自多种分批培养物适应性进化的BMA耐受分离株的基因组DNA中的改变的总结。
注:细胞定位;Cyt-细胞质,Per-周质,IM-内膜,Uk-未知
表2.4编码非调节功能蛋白的来自多种分批培养物适应性进化的BMA耐受分离株的基因组DNA中的改变的总结。
注:细胞定位;Cyt-细胞质,Per-周质,IM-内膜,Uk-未知
表2.5来自多种分批培养物适应性进化的BMA耐受分离株的基因组DNA中的在非编码区中的改变的总结。
注:细胞定位;Cyt-细胞质,Per-周质,IM-内膜,Uk-未知,插入/缺失-插入或缺失
如从表2.3和表2.4可以推断的,每种菌株在acrR中都具有突变。此外,每个菌株还在soxR、marR或rob中具有突变。
实施例3–转录组数据
使用标准方法进行转录组分析以表征突变体微生物中的转录活性。
RNA提取方案
材料:
··RNeasy保护细菌微型试剂盒(50)-Qiagen 74524
··无RNA酶的DNA酶组(50)-Qiagen 79254
··RNA酶ZAP-用于去除RNA酶的清理剂-Sigma-Aldrich R2020-250ML
··无RNA酶eppenford和15ml falcon管
··无RNA酶吸头
在无RNA酶“环境”中进行所有的提取,用RNA酶ZAP清理所有事物,一直使用手套并经常更换手套。
1.MSX培养基
制备1L:
Vishniac微量元素
将EDTA二钠盐(50g)与水(800ml)组合,通过添加KOH团粒(pellet)(每次2-3个)溶解。按以下顺序添加化学物质;ZnSO4(2.2g)、CaCl2(5.54g)、MnCl2.4H2O(5.06g)、FeSO4.7H2O(5g)、(NH4)6Mo7O24.4H2O(1.1g)、CuSO4.5H2O(1.57g)和CoCl2.6H2O(1.61g)。
使用1M KOH调节至pH 6,使用水补足至1L,并且在4℃储存直至使用。
MSA
·水(700ml)中的KH2PO4(6g)和Vishniac微量元素(2ml)
使用1M KOH调节至pH 7。用水补足至760ml。
MSB
·水(200ml)中的NH4Cl(3g)和MgSO4.7H2O(0.4g)。
将MSA和MSB灭菌,然后混合并添加40mL的12.5%葡萄糖的储备溶液。制备总计1L的MSX。
2.样品制备
1)将菌株直接从冷冻储备物(cryostock)划线到新鲜MSX琼脂(15g/L)板上,并且在37℃孵育过夜
2)分离单个菌落,并且使用50ml摇瓶在MSX中接种预培养物。在37℃和250rpm孵育过夜。
3)使用具有24/29颈连接的250mL pyrex摇瓶,将培养物接种在MSX中至0.05的初始OD600。在37℃和250rpm孵育。进行4次生物学重复(3次重复用于测序,加一次备份)-每种菌株4个烧瓶。
4)用无菌Suba密封件密封摇瓶。为了监测OD,在首先用70%乙醇擦拭Suba密封件后,使用无菌注射器和针头获取样品。监测OD。
5)当OD达到0.3时,收集用于RNA提取的第一样品(“前”样品)到预冷的falcon中。将样品保持在冰上。
6)紧接着,添加20%(v/v)BMA。再孵育培养物。
7)根据RNeasy试剂盒方案快速处理样品,直至方案1的步骤6。始终将样品保持在冰上,并且在-80℃储存,直至进一步使用,如果可能,在液氮中快速冷冻,然后储存。
8)第一次样品收集/BMA添加后一小时,收集用于RNA提取的第二样品(“后”样品)。按照前面解释的进行。
9)继续监测OD,直至培养物达到24h。
总之,测试的每种菌株具有6个样品送去测序,在添加前收集3个,并且在添加后一小时收集3个,加上前和后的1个“备份”,以防提取期间出现问题(总计8个样品)。除了添加了BMA的WT菌株之外,在达到0.3的OD后一小时还从没有BMA的WT培养物中收集样品,作为对照。
3.RNA提取
根据RNeasy试剂盒说明进行。方案1和7-针对在基本培养基上生长的革兰氏阴性细菌-加上附录B“使用无RNA酶的DNA酶组进行柱上DNA酶消化”。
将最后的样品分成3个等分试样。仅使用一个等分试样检查浓度和是否存在gDNA污染。不将该等分试样用于测序,因为解冻和冷冻样品可能使RNA降解。
使用TapeStation检查RNA浓度和RIN数量。
将双突变体和野生型在BMA存在和不存在下的转录相对于野生型在BMA不存在下的水平进行比较。为免生疑问,表格中的水平是在BMA暴露下的菌株中的水平相对于不存在添加的BMA的野生型中的水平的比率。在BMA不存在下acrA和acrB的转录水平增强,然而这是非常可变的。对于野生型菌株,暴露于BMA实际上降低了全局调节物的表达水平。这表明主要的多药耐药泵在这些菌株中强过表达。结果示于下文表2.6中。
表2.6 acrAB基因表达
BMA强烈增强或降低了一些关键基因的转录水平。相对于不具有BMA暴露的BW25113,BMA引起的“前27项”增强示于表2.7中。为了进行比较,将未暴露于BMA的相同基因示于表2.8中。将双突变体和野生型在BMA存在和不存在下的转录相对于野生型在BMA不存在下的水平进行比较。为免生疑问,表格中的水平是在BMA暴露下的菌株中的水平相对于不存在添加的BMA的野生型中的水平的比率。
表2.7:与野生型和6个其他菌株比较的双突变体在具有添加的BMA时的转录组数据。
表2.8-与野生型和6个其他菌株比较的双突变体在不具有添加的BMA时的转录组数据。
在未供给BMA的情况下,不存在转录水平的一致趋势。在BMA存在下,ibpA、ibpB、ynfM、yibT、pspA、bssR、ybfA、raiA、ldhA、pspB、pspC和pspD在菌株中均增强大于×10,但在野生型菌株中均降低。在BMA不存在下,在菌株中ibpA、ibpB、ynfM、yibT、pspA、bssR、ybfA、raiA、ldhA、pspB和pspC没有一个被增强。这些基因的功能列于表2.9中。
表2.9–当未供给BMA时,未被增强的基因的功能。
序列表
<110> 璐彩特国际英国有限公司
<120> 用于产生甲基丙烯酸酯的方法
<130> P32701WO1
<150> GB1808424.4
<151> 2018-05-23
<160> 33
<170> PatentIn version 3.5
<210> 1
<211> 465
<212> DNA
<213> 大肠杆菌(Escherichia coli)
<400> 1
atggaaaaga aattaccccg cattaaagcg ctgctaaccc ccggcgaagt ggcgaaacgc 60
agcggtgtgg cggtatcggc gctgcatttc tatgaaagta aagggttgat taccagtatc 120
cgtaacagcg gcaatcagcg gcgatataaa cgtgatgtgt tgcgatatgt tgcaattatc 180
aaaattgctc agcgtattgg cattccgctg gcgaccattg gtgaagcgtt tggcgtgttg 240
cccgaagggc atacgttaag tgcgaaagag tggaaacagc tttcgtccca atggcgagaa 300
gagttggatc ggcgcattca taccttagtg gcgctgcgtg acgaactgga cggatgtatt 360
ggttgtggct gcctttcgcg cagtgattgc ccgttgcgta acccgggcga ccgcttagga 420
gaagaaggta ccggcgcacg cttgctggaa gatgaacaaa actaa 465
<210> 2
<211> 154
<212> PRT
<213> 大肠杆菌(Escherichia coli)
<400> 2
Met Glu Lys Lys Leu Pro Arg Ile Lys Ala Leu Leu Thr Pro Gly Glu
1 5 10 15
Val Ala Lys Arg Ser Gly Val Ala Val Ser Ala Leu His Phe Tyr Glu
20 25 30
Ser Lys Gly Leu Ile Thr Ser Ile Arg Asn Ser Gly Asn Gln Arg Arg
35 40 45
Tyr Lys Arg Asp Val Leu Arg Tyr Val Ala Ile Ile Lys Ile Ala Gln
50 55 60
Arg Ile Gly Ile Pro Leu Ala Thr Ile Gly Glu Ala Phe Gly Val Leu
65 70 75 80
Pro Glu Gly His Thr Leu Ser Ala Lys Glu Trp Lys Gln Leu Ser Ser
85 90 95
Gln Trp Arg Glu Glu Leu Asp Arg Arg Ile His Thr Leu Val Ala Leu
100 105 110
Arg Asp Glu Leu Asp Gly Cys Ile Gly Cys Gly Cys Leu Ser Arg Ser
115 120 125
Asp Cys Pro Leu Arg Asn Pro Gly Asp Arg Leu Gly Glu Glu Gly Thr
130 135 140
Gly Ala Arg Leu Leu Glu Asp Glu Gln Asn
145 150
<210> 3
<211> 648
<212> DNA
<213> 大肠杆菌(Escherichia coli)
<400> 3
atggcacgaa aaaccaaaca agaagcgcaa gaaacgcgcc aacacatcct cgatgtggct 60
ctacgtcttt tctcacagca gggggtatca tccacctcgc tgggcgagat tgcaaaagca 120
gctggcgtta cgcgcggtgc aatctactgg cattttaaag acaagtcgga tttgttcagt 180
gagatctggg aactgtcaga atccaatatt ggtgaactag agcttgagta tcaggcaaaa 240
ttccctggcg atccactctc agtattaaga gagatattaa ttcatgttct tgaatccacg 300
gtgacagaag aacggcgtcg attattgatg gagattatat tccacaaatg cgaatttgtc 360
ggagaaatgg ctgttgtgca acaggcacaa cgtaatctct gtctggaaag ttatgaccgt 420
atagaacaaa cgttaaaaca ttgtattgaa gcgaaaatgt tgcctgcgga tttaatgacg 480
cgtcgcgcag caattattat gcgcggctat atttccggcc tgatggaaaa ctggctcttt 540
gccccgcaat cttttgatct taaaaaagaa gcccgcgatt acgttgccat cttactggag 600
atgtatctcc tgtgccccac gcttcgtaat cctgccacta acgaataa 648
<210> 4
<211> 215
<212> PRT
<213> 大肠杆菌(Escherichia coli)
<400> 4
Met Ala Arg Lys Thr Lys Gln Glu Ala Gln Glu Thr Arg Gln His Ile
1 5 10 15
Leu Asp Val Ala Leu Arg Leu Phe Ser Gln Gln Gly Val Ser Ser Thr
20 25 30
Ser Leu Gly Glu Ile Ala Lys Ala Ala Gly Val Thr Arg Gly Ala Ile
35 40 45
Tyr Trp His Phe Lys Asp Lys Ser Asp Leu Phe Ser Glu Ile Trp Glu
50 55 60
Leu Ser Glu Ser Asn Ile Gly Glu Leu Glu Leu Glu Tyr Gln Ala Lys
65 70 75 80
Phe Pro Gly Asp Pro Leu Ser Val Leu Arg Glu Ile Leu Ile His Val
85 90 95
Leu Glu Ser Thr Val Thr Glu Glu Arg Arg Arg Leu Leu Met Glu Ile
100 105 110
Ile Phe His Lys Cys Glu Phe Val Gly Glu Met Ala Val Val Gln Gln
115 120 125
Ala Gln Arg Asn Leu Cys Leu Glu Ser Tyr Asp Arg Ile Glu Gln Thr
130 135 140
Leu Lys His Cys Ile Glu Ala Lys Met Leu Pro Ala Asp Leu Met Thr
145 150 155 160
Arg Arg Ala Ala Ile Ile Met Arg Gly Tyr Ile Ser Gly Leu Met Glu
165 170 175
Asn Trp Leu Phe Ala Pro Gln Ser Phe Asp Leu Lys Lys Glu Ala Arg
180 185 190
Asp Tyr Val Ala Ile Leu Leu Glu Met Tyr Leu Leu Cys Pro Thr Leu
195 200 205
Arg Asn Pro Ala Thr Asn Glu
210 215
<210> 5
<211> 867
<212> DNA
<213> 大肠杆菌(Escherichia coli)
<400> 5
acgacggatc ggaatcagca gttcacagcg tagattaatt gggcgatctc ccgctttggc 60
atcttctgcc gggtagtatc gctcaatatc ctgaccttta cggcgcgtca ggttgagcat 120
tggcatgcac gttccgtata ccgtcaggat aaactcctgc acgccggttc ccagaccttc 180
ataggtaaac atcacatatt cgccgccctg cagcatcacc ggatgccccg tcagtacata 240
gccatctgcc tgatcctggg ctaacgcggt ggtatagaat acctcttgct cgtcgtcttt 300
atcctgactc ggacgcgttt cattcaggcc gtagagcacc ggcggaatgg tcggcgcgtt 360
gccgagaaaa tcgtgccaga actgataacg catttcatgg cggaaatcag agatttgctc 420
cagcgaacag gagtagctct gggtaacacc aatcagcggc gtatcttcca gggtgacaaa 480
tttgtgctct ggcatagtga attcacccag gcgtagcggc gggcgaatac caaaggcgct 540
ccattcagga gaacggcggt aaagtgcagg agtctgggca aactgcttct tgaatgcgcg 600
ggtaaatgtc tgttgagagt cgaagcggta ttgcagcgcg atgtccagaa tcggacgcgc 660
agtcaggcgt agtgcgaccg ccgatttcga caaacgacga gcacgaatat acgcgccaat 720
agcatggcca gtgacatctt taaacattct ctgtaagtgc cacttggaat aacctgcttt 780
cgccgctaca ttgtcgagcg acaggggctg atccagatga ccttccagcc agattaaaag 840
gtcgcgaata atgccggcct gatccat 867
<210> 6
<211> 289
<212> PRT
<213> 大肠杆菌(Escherichia coli)
<400> 6
Met Asp Gln Ala Gly Ile Ile Arg Asp Leu Leu Ile Trp Leu Glu Gly
1 5 10 15
His Leu Asp Gln Pro Leu Ser Leu Asp Asn Val Ala Ala Lys Ala Gly
20 25 30
Tyr Ser Lys Trp His Leu Gln Arg Met Phe Lys Asp Val Thr Gly His
35 40 45
Ala Ile Gly Ala Tyr Ile Arg Ala Arg Arg Leu Ser Lys Ser Ala Val
50 55 60
Ala Leu Arg Leu Thr Ala Arg Pro Ile Leu Asp Ile Ala Leu Gln Tyr
65 70 75 80
Arg Phe Asp Ser Gln Gln Thr Phe Thr Arg Ala Phe Lys Lys Gln Phe
85 90 95
Ala Gln Thr Pro Ala Leu Tyr Arg Arg Ser Pro Glu Trp Ser Ala Phe
100 105 110
Gly Ile Arg Pro Pro Leu Arg Leu Gly Glu Phe Thr Met Pro Glu His
115 120 125
Lys Phe Val Thr Leu Glu Asp Thr Pro Leu Ile Gly Val Thr Gln Ser
130 135 140
Tyr Ser Cys Ser Leu Glu Gln Ile Ser Asp Phe Arg His Glu Met Arg
145 150 155 160
Tyr Gln Phe Trp His Asp Phe Leu Gly Asn Ala Pro Thr Ile Pro Pro
165 170 175
Val Leu Tyr Gly Leu Asn Glu Thr Arg Pro Ser Gln Asp Lys Asp Asp
180 185 190
Glu Gln Glu Val Phe Tyr Thr Thr Ala Leu Ala Gln Asp Gln Ala Asp
195 200 205
Gly Tyr Val Leu Thr Gly His Pro Val Met Leu Gln Gly Gly Glu Tyr
210 215 220
Val Met Phe Thr Tyr Glu Gly Leu Gly Thr Gly Val Gln Glu Phe Ile
225 230 235 240
Leu Thr Val Tyr Gly Thr Cys Met Pro Met Leu Asn Leu Thr Arg Arg
245 250 255
Lys Gly Gln Asp Ile Glu Arg Tyr Tyr Pro Ala Glu Asp Ala Lys Ala
260 265 270
Gly Asp Arg Pro Ile Asn Leu Arg Cys Glu Leu Leu Ile Pro Ile Arg
275 280 285
Arg
<210> 7
<211> 432
<212> DNA
<213> 大肠杆菌(Escherichia coli)
<400> 7
gtgaaaagta ccagcgatct gttcaatgaa attattccat tgggtcgctt aatccatatg 60
gttaatcaga agaaagatcg cctgcttaac gagtatctgt ctccgctgga tattaccgcg 120
gcacagttta aggtgctctg ctctatccgc tgcgcggcgt gtattactcc ggttgaactg 180
aaaaaggtat tgtcggtcga cctgggagca ctgacccgta tgctggatcg cctggtctgt 240
aaaggctggg tggaaaggtt gccgaacccg aatgacaagc gcggcgtact ggtaaaactt 300
accaccggcg gcgcggcaat atgtgaacaa tgccatcaat tagttggcca ggacctgcac 360
caagaattaa caaaaaacct gacggcggac gaagtggcaa cacttgagta tttgcttaag 420
aaagtcctgc cg 432
<210> 8
<211> 144
<212> PRT
<213> 大肠杆菌(Escherichia coli)
<400> 8
Met Lys Ser Thr Ser Asp Leu Phe Asn Glu Ile Ile Pro Leu Gly Arg
1 5 10 15
Leu Ile His Met Val Asn Gln Lys Lys Asp Arg Leu Leu Asn Glu Tyr
20 25 30
Leu Ser Pro Leu Asp Ile Thr Ala Ala Gln Phe Lys Val Leu Cys Ser
35 40 45
Ile Arg Cys Ala Ala Cys Ile Thr Pro Val Glu Leu Lys Lys Val Leu
50 55 60
Ser Val Asp Leu Gly Ala Leu Thr Arg Met Leu Asp Arg Leu Val Cys
65 70 75 80
Lys Gly Trp Val Glu Arg Leu Pro Asn Pro Asn Asp Lys Arg Gly Val
85 90 95
Leu Val Lys Leu Thr Thr Gly Gly Ala Ala Ile Cys Glu Gln Cys His
100 105 110
Gln Leu Val Gly Gln Asp Leu His Gln Glu Leu Thr Lys Asn Leu Thr
115 120 125
Ala Asp Glu Val Ala Thr Leu Glu Tyr Leu Leu Lys Lys Val Leu Pro
130 135 140
<210> 9
<211> 3150
<212> DNA
<213> 大肠杆菌(Escherichia coli)
<400> 9
atgcctaatt tctttatcga tcgcccgatt tttgcgtggg tgatcgccat tatcatcatg 60
ttggcagggg ggctggcgat cctcaaactg ccggtggcgc aatatcctac gattgcaccg 120
ccggcagtaa cgatctccgc ctcctacccc ggcgctgatg cgaaaacagt gcaggacacg 180
gtgacacagg ttatcgaaca gaatatgaac ggtatcgata acctgatgta catgtcctct 240
aacagtgact ccacgggtac cgtgcagatc accctgacct ttgagtctgg tactgatgcg 300
gatatcgcgc aggttcaggt gcagaacaaa ctgcagctgg cgatgccgtt gctgccgcaa 360
gaagttcagc agcaaggggt gagcgttgag aaatcatcca gcagcttcct gatggttgtc 420
ggcgttatca acaccgatgg caccatgacg caggaggata tctccgacta cgtggcggcg 480
aatatgaaag atgccatcag ccgtacgtcg ggcgtgggtg atgttcagtt gttcggttca 540
cagtacgcga tgcgtatctg gatgaacccg aatgagctga acaaattcca gctaacgccg 600
gttgatgtca ttaccgccat caaagcgcag aacgcccagg ttgcggcggg tcagctcggt 660
ggtacgccgc cggtgaaagg ccaacagctt aacgcctcta ttattgctca gacgcgtctg 720
acctctactg aagagttcgg caaaatcctg ctgaaagtga atcaggatgg ttcccgcgtg 780
ctgctgcgtg acgtcgcgaa gattgagctg ggtggtgaga actacgacat catcgcagag 840
tttaacggcc aaccggcttc cggtctgggg atcaagctgg cgaccggtgc aaacgcgctg 900
gataccgctg cggcaatccg tgctgaactg gcgaagatgg aaccgttctt cccgtcgggt 960
ctgaaaattg tttacccata cgacaccacg ccgttcgtga aaatctctat tcacgaagtg 1020
gttaaaacgc tggtcgaagc gatcatcctc gtgttcctgg ttatgtatct gttcctgcag 1080
aacttccgcg cgacgttgat tccgaccatt gccgtaccgg tggtattgct cgggaccttt 1140
gccgtccttg ccgcctttgg cttctcgata aacacgctaa caatgttcgg gatggtgctc 1200
gccatcggcc tgttggtgga tgacgccatc gttgtggtag aaaacgttga gcgtgttatg 1260
gcggaagaag gtttgccgcc aaaagaagct acccgtaagt cgatggggca gattcagggc 1320
gctctggtcg gtatcgcgat ggtactgtcg gcggtattcg taccgatggc cttctttggc 1380
ggttctactg gtgctatcta tcgtcagttc tctattacca ttgtttcagc aatggcgctg 1440
tcggtactgg tggcgttgat cctgactcca gctctttgtg ccaccatgct gaaaccgatt 1500
gccaaaggcg atcacgggga aggtaaaaaa ggcttcttcg gctggtttaa ccgcatgttc 1560
gagaagagca cgcaccacta caccgacagc gtaggcggta ttctgcgcag tacggggcgt 1620
tacctggtgc tgtatctgat catcgtggtc ggcatggcct atctgttcgt gcgtctgcca 1680
agctccttct tgccagatga ggaccagggc gtgtttatga ccatggttca gctgccagca 1740
ggtgcaacgc aggaacgtac acagaaagtg ctcaatgagg taacgcatta ctatctgacc 1800
aaagaaaaga acaacgttga gtcggtgttc gccgttaacg gcttcggctt tgcgggacgt 1860
ggtcagaata ccggtattgc gttcgtttcc ttgaaggact gggccgatcg tccgggcgaa 1920
gaaaacaaag ttgaagcgat taccatgcgt gcaacacgcg ctttctcgca aatcaaagat 1980
gcgatggttt tcgcctttaa cctgcccgca atcgtggaac tgggtactgc aaccggcttt 2040
gactttgagc tgattgacca ggctggcctt ggtcacgaaa aactgactca ggcgcgtaac 2100
cagttgcttg cagaagcagc gaagcaccct gatatgttga ccagcgtacg tccaaacggt 2160
ctggaagata ccccgcagtt taagattgat atcgaccagg aaaaagcgca ggcgctgggt 2220
gtttctatca acgacattaa caccactctg ggcgctgcat ggggcggcag ctatgtgaac 2280
gactttatcg accgcggtcg tgtgaagaaa gtttatgtca tgtcagaagc gaaataccgt 2340
atgctgccgg atgatatcgg cgactggtat gttcgtgctg ctgatggtca gatggtgcca 2400
ttctcggcgt tctcctcttc tcgttgggag tacggttcgc cgcgtctgga acgttacaac 2460
ggcctgccat ccatggaaat cttaggccag gcggcaccgg gtaaaagtac cggtgaagca 2520
atggagctga tggaacaact ggcgagcaaa ctgcctaccg gtgttggcta tgactggacg 2580
gggatgtcct atcaggaacg tctctccggc aaccaggcac cttcactgta cgcgatttcg 2640
ttgattgtcg tgttcctgtg tctggcggcg ctgtacgaga gctggtcgat tccgttctcc 2700
gttatgctgg tcgttccgct gggggttatc ggtgcgttgc tggctgccac cttccgtggc 2760
ctgaccaatg acgtttactt ccaggtaggc ctgctcacaa ccattgggtt gtcggcgaag 2820
aacgcgatcc ttatcgtcga attcgccaaa gacttgatgg ataaagaagg taaaggtctg 2880
attgaagcga cgcttgatgc ggtgcggatg cgtttacgtc cgatcctgat gacctcgctg 2940
gcgtttatcc tcggcgttat gccgctggtt atcagtactg gtgctggttc cggcgcgcag 3000
aacgcagtag gtaccggtgt aatgggcggg atggtgaccg caacggtact ggcaatcttc 3060
ttcgttccgg tattctttgt ggtggttcgc cgccgcttta gccgcaagaa tgaagatatc 3120
gagcacagcc atactgtcga tcatcattga 3150
<210> 10
<211> 1049
<212> PRT
<213> 大肠杆菌(Escherichia coli)
<400> 10
Met Pro Asn Phe Phe Ile Asp Arg Pro Ile Phe Ala Trp Val Ile Ala
1 5 10 15
Ile Ile Ile Met Leu Ala Gly Gly Leu Ala Ile Leu Lys Leu Pro Val
20 25 30
Ala Gln Tyr Pro Thr Ile Ala Pro Pro Ala Val Thr Ile Ser Ala Ser
35 40 45
Tyr Pro Gly Ala Asp Ala Lys Thr Val Gln Asp Thr Val Thr Gln Val
50 55 60
Ile Glu Gln Asn Met Asn Gly Ile Asp Asn Leu Met Tyr Met Ser Ser
65 70 75 80
Asn Ser Asp Ser Thr Gly Thr Val Gln Ile Thr Leu Thr Phe Glu Ser
85 90 95
Gly Thr Asp Ala Asp Ile Ala Gln Val Gln Val Gln Asn Lys Leu Gln
100 105 110
Leu Ala Met Pro Leu Leu Pro Gln Glu Val Gln Gln Gln Gly Val Ser
115 120 125
Val Glu Lys Ser Ser Ser Ser Phe Leu Met Val Val Gly Val Ile Asn
130 135 140
Thr Asp Gly Thr Met Thr Gln Glu Asp Ile Ser Asp Tyr Val Ala Ala
145 150 155 160
Asn Met Lys Asp Ala Ile Ser Arg Thr Ser Gly Val Gly Asp Val Gln
165 170 175
Leu Phe Gly Ser Gln Tyr Ala Met Arg Ile Trp Met Asn Pro Asn Glu
180 185 190
Leu Asn Lys Phe Gln Leu Thr Pro Val Asp Val Ile Thr Ala Ile Lys
195 200 205
Ala Gln Asn Ala Gln Val Ala Ala Gly Gln Leu Gly Gly Thr Pro Pro
210 215 220
Val Lys Gly Gln Gln Leu Asn Ala Ser Ile Ile Ala Gln Thr Arg Leu
225 230 235 240
Thr Ser Thr Glu Glu Phe Gly Lys Ile Leu Leu Lys Val Asn Gln Asp
245 250 255
Gly Ser Arg Val Leu Leu Arg Asp Val Ala Lys Ile Glu Leu Gly Gly
260 265 270
Glu Asn Tyr Asp Ile Ile Ala Glu Phe Asn Gly Gln Pro Ala Ser Gly
275 280 285
Leu Gly Ile Lys Leu Ala Thr Gly Ala Asn Ala Leu Asp Thr Ala Ala
290 295 300
Ala Ile Arg Ala Glu Leu Ala Lys Met Glu Pro Phe Phe Pro Ser Gly
305 310 315 320
Leu Lys Ile Val Tyr Pro Tyr Asp Thr Thr Pro Phe Val Lys Ile Ser
325 330 335
Ile His Glu Val Val Lys Thr Leu Val Glu Ala Ile Ile Leu Val Phe
340 345 350
Leu Val Met Tyr Leu Phe Leu Gln Asn Phe Arg Ala Thr Leu Ile Pro
355 360 365
Thr Ile Ala Val Pro Val Val Leu Leu Gly Thr Phe Ala Val Leu Ala
370 375 380
Ala Phe Gly Phe Ser Ile Asn Thr Leu Thr Met Phe Gly Met Val Leu
385 390 395 400
Ala Ile Gly Leu Leu Val Asp Asp Ala Ile Val Val Val Glu Asn Val
405 410 415
Glu Arg Val Met Ala Glu Glu Gly Leu Pro Pro Lys Glu Ala Thr Arg
420 425 430
Lys Ser Met Gly Gln Ile Gln Gly Ala Leu Val Gly Ile Ala Met Val
435 440 445
Leu Ser Ala Val Phe Val Pro Met Ala Phe Phe Gly Gly Ser Thr Gly
450 455 460
Ala Ile Tyr Arg Gln Phe Ser Ile Thr Ile Val Ser Ala Met Ala Leu
465 470 475 480
Ser Val Leu Val Ala Leu Ile Leu Thr Pro Ala Leu Cys Ala Thr Met
485 490 495
Leu Lys Pro Ile Ala Lys Gly Asp His Gly Glu Gly Lys Lys Gly Phe
500 505 510
Phe Gly Trp Phe Asn Arg Met Phe Glu Lys Ser Thr His His Tyr Thr
515 520 525
Asp Ser Val Gly Gly Ile Leu Arg Ser Thr Gly Arg Tyr Leu Val Leu
530 535 540
Tyr Leu Ile Ile Val Val Gly Met Ala Tyr Leu Phe Val Arg Leu Pro
545 550 555 560
Ser Ser Phe Leu Pro Asp Glu Asp Gln Gly Val Phe Met Thr Met Val
565 570 575
Gln Leu Pro Ala Gly Ala Thr Gln Glu Arg Thr Gln Lys Val Leu Asn
580 585 590
Glu Val Thr His Tyr Tyr Leu Thr Lys Glu Lys Asn Asn Val Glu Ser
595 600 605
Val Phe Ala Val Asn Gly Phe Gly Phe Ala Gly Arg Gly Gln Asn Thr
610 615 620
Gly Ile Ala Phe Val Ser Leu Lys Asp Trp Ala Asp Arg Pro Gly Glu
625 630 635 640
Glu Asn Lys Val Glu Ala Ile Thr Met Arg Ala Thr Arg Ala Phe Ser
645 650 655
Gln Ile Lys Asp Ala Met Val Phe Ala Phe Asn Leu Pro Ala Ile Val
660 665 670
Glu Leu Gly Thr Ala Thr Gly Phe Asp Phe Glu Leu Ile Asp Gln Ala
675 680 685
Gly Leu Gly His Glu Lys Leu Thr Gln Ala Arg Asn Gln Leu Leu Ala
690 695 700
Glu Ala Ala Lys His Pro Asp Met Leu Thr Ser Val Arg Pro Asn Gly
705 710 715 720
Leu Glu Asp Thr Pro Gln Phe Lys Ile Asp Ile Asp Gln Glu Lys Ala
725 730 735
Gln Ala Leu Gly Val Ser Ile Asn Asp Ile Asn Thr Thr Leu Gly Ala
740 745 750
Ala Trp Gly Gly Ser Tyr Val Asn Asp Phe Ile Asp Arg Gly Arg Val
755 760 765
Lys Lys Val Tyr Val Met Ser Glu Ala Lys Tyr Arg Met Leu Pro Asp
770 775 780
Asp Ile Gly Asp Trp Tyr Val Arg Ala Ala Asp Gly Gln Met Val Pro
785 790 795 800
Phe Ser Ala Phe Ser Ser Ser Arg Trp Glu Tyr Gly Ser Pro Arg Leu
805 810 815
Glu Arg Tyr Asn Gly Leu Pro Ser Met Glu Ile Leu Gly Gln Ala Ala
820 825 830
Pro Gly Lys Ser Thr Gly Glu Ala Met Glu Leu Met Glu Gln Leu Ala
835 840 845
Ser Lys Leu Pro Thr Gly Val Gly Tyr Asp Trp Thr Gly Met Ser Tyr
850 855 860
Gln Glu Arg Leu Ser Gly Asn Gln Ala Pro Ser Leu Tyr Ala Ile Ser
865 870 875 880
Leu Ile Val Val Phe Leu Cys Leu Ala Ala Leu Tyr Glu Ser Trp Ser
885 890 895
Ile Pro Phe Ser Val Met Leu Val Val Pro Leu Gly Val Ile Gly Ala
900 905 910
Leu Leu Ala Ala Thr Phe Arg Gly Leu Thr Asn Asp Val Tyr Phe Gln
915 920 925
Val Gly Leu Leu Thr Thr Ile Gly Leu Ser Ala Lys Asn Ala Ile Leu
930 935 940
Ile Val Glu Phe Ala Lys Asp Leu Met Asp Lys Glu Gly Lys Gly Leu
945 950 955 960
Ile Glu Ala Thr Leu Asp Ala Val Arg Met Arg Leu Arg Pro Ile Leu
965 970 975
Met Thr Ser Leu Ala Phe Ile Leu Gly Val Met Pro Leu Val Ile Ser
980 985 990
Thr Gly Ala Gly Ser Gly Ala Gln Asn Ala Val Gly Thr Gly Val Met
995 1000 1005
Gly Gly Met Val Thr Ala Thr Val Leu Ala Ile Phe Phe Val Pro
1010 1015 1020
Val Phe Phe Val Val Val Arg Arg Arg Phe Ser Arg Lys Asn Glu
1025 1030 1035
Asp Ile Glu His Ser His Thr Val Asp His His
1040 1045
<210> 11
<211> 399
<212> DNA
<213> 大肠杆菌(Escherichia coli)
<400> 11
atgagtaaga cactgaacat tatctggcaa tatttacgcg ctttcgtcct gatttatgcc 60
tgcctgtatg caggcatttt cattgcttcc ctgctaccgg taaccattcc gggcagcatc 120
atcgggatgc tgatcctgtt tgtcctgctg gccttgcaaa ttcttccggc aaaatgggtc 180
aatccggggt gctacgtact gattcgctat atggcgctat tgtttgtgcc gattggcgta 240
ggcgtcatgc aatattttga tttgctccgc gcacagtttg gcccggtagt ggtttcctgt 300
gcagtcagta cgctggtggt ttttctggtg gtgagctgga gttcgcaact ggtacacggt 360
gaacgtaaag tcgtaggtca gaaaggatca gaagaatga 399
<210> 12
<211> 132
<212> PRT
<213> 大肠杆菌(Escherichia coli)
<400> 12
Met Ser Lys Thr Leu Asn Ile Ile Trp Gln Tyr Leu Arg Ala Phe Val
1 5 10 15
Leu Ile Tyr Ala Cys Leu Tyr Ala Gly Ile Phe Ile Ala Ser Leu Leu
20 25 30
Pro Val Thr Ile Pro Gly Ser Ile Ile Gly Met Leu Ile Leu Phe Val
35 40 45
Leu Leu Ala Leu Gln Ile Leu Pro Ala Lys Trp Val Asn Pro Gly Cys
50 55 60
Tyr Val Leu Ile Arg Tyr Met Ala Leu Leu Phe Val Pro Ile Gly Val
65 70 75 80
Gly Val Met Gln Tyr Phe Asp Leu Leu Arg Ala Gln Phe Gly Pro Val
85 90 95
Val Val Ser Cys Ala Val Ser Thr Leu Val Val Phe Leu Val Val Ser
100 105 110
Trp Ser Ser Gln Leu Val His Gly Glu Arg Lys Val Val Gly Gln Lys
115 120 125
Gly Ser Glu Glu
130
<210> 13
<211> 1101
<212> DNA
<213> 大肠杆菌(Escherichia coli)
<400> 13
atgcgaggga aaaaacgcat tgggttattg tttttgctga tagcggttgt ggttggtggc 60
ggcgggttat tgctggcgca aaaagtctta cataaaacgt cggatacagc attttgcctt 120
tcctgccact cgatgagtaa accttttgag gaatatcagg gaactgtcca cttttcgaac 180
cagaaaggga tacgtgcgga atgtgccgat tgccatattc caaagtcagg gatggattat 240
ttatttgcta aattaaaggc atctaaagat atttatcatg aatttgttag cggcaaaata 300
gacagtgacg ataagttcga agctcatcgc caggaaatgg ccgaaacagt atggaaagaa 360
ttaaaagcca ctgactctgc aacgtgccgt agttgccatt cttttgatgc catggatatt 420
gcctcgcaaa gtgaatctgc gcagaaaatg cataacaaag cacaaaagga cagcgaaacc 480
tgtatcgatt gtcataaagg cattgcccat tttccgccag aaataaaaat ggatgacaac 540
gcggcgcatg agctggaaag tcaggccgct acttcagtta ctaatggcgc acatatttat 600
cctttcaaaa cttctcacat aggcgagctg gctaccgtga atcctggaac cgatctcacc 660
gtcgttgatg ccagtggcaa acagccgatc gttctgttgc agggttatca aatgcagggc 720
agtgaaaaca cgctctacct ggcggcaggt caacggctgg cgctagccac attaagtgaa 780
gaaggtatca aggcgctcac tgtaaacggg gaatggcagg ctgacgaata cggcaatcaa 840
tggcgtcagg cgtctttaca gggtgcgctt accgatcccg cattagcgga ccgtaaaccg 900
ctatggcaat acgctgaaaa acttgacgat acctattgcg ctggttgtca tgcccctatt 960
gccgccgacc attacaccgt caatgcgtgg ccgtccattg ccaaaggaat gggggcacga 1020
accagcatga gcgaaaacga actggacatt ttaacgcggt atttccagta caacgccaaa 1080
gatattaccg agaaacagtg a 1101
<210> 14
<211> 366
<212> PRT
<213> 大肠杆菌(Escherichia coli)
<400> 14
Met Arg Gly Lys Lys Arg Ile Gly Leu Leu Phe Leu Leu Ile Ala Val
1 5 10 15
Val Val Gly Gly Gly Gly Leu Leu Leu Ala Gln Lys Val Leu His Lys
20 25 30
Thr Ser Asp Thr Ala Phe Cys Leu Ser Cys His Ser Met Ser Lys Pro
35 40 45
Phe Glu Glu Tyr Gln Gly Thr Val His Phe Ser Asn Gln Lys Gly Ile
50 55 60
Arg Ala Glu Cys Ala Asp Cys His Ile Pro Lys Ser Gly Met Asp Tyr
65 70 75 80
Leu Phe Ala Lys Leu Lys Ala Ser Lys Asp Ile Tyr His Glu Phe Val
85 90 95
Ser Gly Lys Ile Asp Ser Asp Asp Lys Phe Glu Ala His Arg Gln Glu
100 105 110
Met Ala Glu Thr Val Trp Lys Glu Leu Lys Ala Thr Asp Ser Ala Thr
115 120 125
Cys Arg Ser Cys His Ser Phe Asp Ala Met Asp Ile Ala Ser Gln Ser
130 135 140
Glu Ser Ala Gln Lys Met His Asn Lys Ala Gln Lys Asp Ser Glu Thr
145 150 155 160
Cys Ile Asp Cys His Lys Gly Ile Ala His Phe Pro Pro Glu Ile Lys
165 170 175
Met Asp Asp Asn Ala Ala His Glu Leu Glu Ser Gln Ala Ala Thr Ser
180 185 190
Val Thr Asn Gly Ala His Ile Tyr Pro Phe Lys Thr Ser His Ile Gly
195 200 205
Glu Leu Ala Thr Val Asn Pro Gly Thr Asp Leu Thr Val Val Asp Ala
210 215 220
Ser Gly Lys Gln Pro Ile Val Leu Leu Gln Gly Tyr Gln Met Gln Gly
225 230 235 240
Ser Glu Asn Thr Leu Tyr Leu Ala Ala Gly Gln Arg Leu Ala Leu Ala
245 250 255
Thr Leu Ser Glu Glu Gly Ile Lys Ala Leu Thr Val Asn Gly Glu Trp
260 265 270
Gln Ala Asp Glu Tyr Gly Asn Gln Trp Arg Gln Ala Ser Leu Gln Gly
275 280 285
Ala Leu Thr Asp Pro Ala Leu Ala Asp Arg Lys Pro Leu Trp Gln Tyr
290 295 300
Ala Glu Lys Leu Asp Asp Thr Tyr Cys Ala Gly Cys His Ala Pro Ile
305 310 315 320
Ala Ala Asp His Tyr Thr Val Asn Ala Trp Pro Ser Ile Ala Lys Gly
325 330 335
Met Gly Ala Arg Thr Ser Met Ser Glu Asn Glu Leu Asp Ile Leu Thr
340 345 350
Arg Tyr Phe Gln Tyr Asn Ala Lys Asp Ile Thr Glu Lys Gln
355 360 365
<210> 15
<211> 972
<212> DNA
<213> 大肠杆菌(Escherichia coli)
<400> 15
atggaaacga aaaaaaataa tagcgaatac attcctgagt ttgataaatc ctttcgccac 60
ccgcgctact ggggagcatg gctgggcgta gcagcgatgg cgggtatcgc tttaacgccg 120
ccaaagttcc gtgatcccat tctggcacgg ctgggacgtt ttgccggacg actgggaaaa 180
agctcacgcc gtcgtgcgtt aatcaatctg tcgctctgct ttccagaacg tagtgaagct 240
gaacgcgaag cgattgttga tgagatgttt gccaccgcgc cgcaagcgat ggcaatgatg 300
gctgagttgg caatacgcgg gccggagaaa attcagccgc gcgttgactg gcaagggctg 360
gagatcatcg aagagatgcg gcgtaataac gagaaagtta tctttctggt gccgcacggt 420
tgggccgtcg atattcctgc catgctgatg gcctcgcaag ggcagaaaat ggcagcgatg 480
ttccataatc agggcaaccc ggtttttgat tatgtctgga acacggtgcg tcgtcgcttt 540
ggcggtcgtc tgcatgcgag aaatgacggt attaaaccat tcatccagtc ggtacgtcag 600
gggtactggg gatattattt acccgatcag gatcatggcc cagagcacag cgaatttgtg 660
gatttctttg ccacctataa agcgacgttg cccgcgattg gtcgtttgat gaaagtgtgc 720
cgtgcgcgcg ttgtaccgct gtttccgatt tatgatggca agacgcatcg tctgacgatt 780
caggtgcgcc caccgatgga tgatctgtta gaggcggatg atcatacgat tgcgcggcgg 840
atgaatgaag aagtcgagat ttttgttggt ccgcgaccag aacaatacac ctggatacta 900
aaattgctga aaactcgcaa accgggcgaa atccagccgt ataagcgcaa agatctttat 960
cccatcaaat aa 972
<210> 16
<211> 323
<212> PRT
<213> 大肠杆菌(Escherichia coli)
<400> 16
Met Glu Thr Lys Lys Asn Asn Ser Glu Tyr Ile Pro Glu Phe Asp Lys
1 5 10 15
Ser Phe Arg His Pro Arg Tyr Trp Gly Ala Trp Leu Gly Val Ala Ala
20 25 30
Met Ala Gly Ile Ala Leu Thr Pro Pro Lys Phe Arg Asp Pro Ile Leu
35 40 45
Ala Arg Leu Gly Arg Phe Ala Gly Arg Leu Gly Lys Ser Ser Arg Arg
50 55 60
Arg Ala Leu Ile Asn Leu Ser Leu Cys Phe Pro Glu Arg Ser Glu Ala
65 70 75 80
Glu Arg Glu Ala Ile Val Asp Glu Met Phe Ala Thr Ala Pro Gln Ala
85 90 95
Met Ala Met Met Ala Glu Leu Ala Ile Arg Gly Pro Glu Lys Ile Gln
100 105 110
Pro Arg Val Asp Trp Gln Gly Leu Glu Ile Ile Glu Glu Met Arg Arg
115 120 125
Asn Asn Glu Lys Val Ile Phe Leu Val Pro His Gly Trp Ala Val Asp
130 135 140
Ile Pro Ala Met Leu Met Ala Ser Gln Gly Gln Lys Met Ala Ala Met
145 150 155 160
Phe His Asn Gln Gly Asn Pro Val Phe Asp Tyr Val Trp Asn Thr Val
165 170 175
Arg Arg Arg Phe Gly Gly Arg Leu His Ala Arg Asn Asp Gly Ile Lys
180 185 190
Pro Phe Ile Gln Ser Val Arg Gln Gly Tyr Trp Gly Tyr Tyr Leu Pro
195 200 205
Asp Gln Asp His Gly Pro Glu His Ser Glu Phe Val Asp Phe Phe Ala
210 215 220
Thr Tyr Lys Ala Thr Leu Pro Ala Ile Gly Arg Leu Met Lys Val Cys
225 230 235 240
Arg Ala Arg Val Val Pro Leu Phe Pro Ile Tyr Asp Gly Lys Thr His
245 250 255
Arg Leu Thr Ile Gln Val Arg Pro Pro Met Asp Asp Leu Leu Glu Ala
260 265 270
Asp Asp His Thr Ile Ala Arg Arg Met Asn Glu Glu Val Glu Ile Phe
275 280 285
Val Gly Pro Arg Pro Glu Gln Tyr Thr Trp Ile Leu Lys Leu Leu Lys
290 295 300
Thr Arg Lys Pro Gly Glu Ile Gln Pro Tyr Lys Arg Lys Asp Leu Tyr
305 310 315 320
Pro Ile Lys
<210> 17
<211> 1917
<212> DNA
<213> 大肠杆菌(Escherichia coli)
<400> 17
atgggtaaaa taattggtat cgacctgggt actaccaact cttgtgtagc gattatggat 60
ggcaccactc ctcgcgtgct ggagaacgcc gaaggcgatc gcaccacgcc ttctatcatt 120
gcctataccc aggatggtga aactctagtt ggtcagccgg ctaaacgtca ggcagtgacg 180
aacccgcaaa acactctgtt tgcgattaaa cgcctgattg gtcgccgctt ccaggacgaa 240
gaagtacagc gtgatgtttc catcatgccg ttcaaaatta ttgctgctga taacggcgac 300
gcatgggtcg aagttaaagg ccagaaaatg gcaccgccgc agatttctgc tgaagtgctg 360
aaaaaaatga agaaaaccgc tgaagattac ctgggtgaac cggtaactga agctgttatc 420
accgtaccgg catactttaa cgatgctcag cgtcaggcaa ccaaagacgc aggccgtatc 480
gctggtctgg aagtaaaacg tatcatcaac gaaccgaccg cagctgcgct ggcttacggt 540
ctggacaaag gcactggcaa ccgtactatc gcggtttatg acctgggtgg tggtactttc 600
gatatttcta ttatcgaaat cgacgaagtt gacggcgaaa aaaccttcga agttctggca 660
accaacggtg atacccacct ggggggtgaa gacttcgaca gccgtctgat caactatctg 720
gttgaagaat tcaagaaaga tcagggcatt gacctgcgca acgatccgct ggcaatgcag 780
cgcctgaaag aagcggcaga aaaagcgaaa atcgaactgt cttccgctca gcagaccgac 840
gttaacctgc catacatcac tgcagacgcg accggtccga aacacatgaa catcaaagtg 900
actcgtgcga aactggaaag cctggttgaa gatctggtaa accgttccat tgagccgctg 960
aaagttgcac tgcaggacgc tggcctgtcc gtatctgata tcgacgacgt tatcctcgtt 1020
ggtggtcaga ctcgtatgcc aatggttcag aagaaagttg ctgagttctt tggtaaagag 1080
ccgcgtaaag acgttaaccc ggacgaagct gtagcaatcg gtgctgctgt tcagggtggt 1140
gttctgactg gtgacgtaaa agacgtactg ctgctggacg ttaccccgct gtctctgggt 1200
atcgaaacca tgggcggtgt gatgacgacg ctgatcgcga aaaacaccac tatcccgacc 1260
aagcacagcc aggtgttctc taccgctgaa gacaaccagt ctgcggtaac catccatgtg 1320
ctgcagggtg aacgtaaacg tgcggctgat aacaaatctc tgggtcagtt caacctagat 1380
ggtatcaacc cggcaccgcg cggcatgccg cagatcgaag ttaccttcga tatcgatgct 1440
gacggtatcc tgcacgtttc cgcgaaagat aaaaacagcg gtaaagagca gaagatcacc 1500
atcaaggctt cttctggtct gaacgaagat gaaatccaga aaatggtacg cgacgcagaa 1560
gctaacgccg aagctgaccg taagtttgaa gagctggtac agactcgcaa ccagggcgac 1620
catctgctgc acagcacccg taagcaggtt gaagaagcag gcgacaaact gccggctgac 1680
gacaaaactg ctatcgagtc tgcgctgact gcactggaaa ctgctctgaa aggtgaagac 1740
aaagccgcta tcgaagcgaa aatgcaggaa ctggcacagg tttcccagaa actgatggaa 1800
atcgcccagc agcaacatgc ccagcagcag actgccggtg ctgatgcttc tgcaaacaac 1860
gcgaaagatg acgatgttgt cgacgctgaa tttgaagaag tcaaagacaa aaaataa 1917
<210> 18
<211> 638
<212> PRT
<213> 大肠杆菌(Escherichia coli)
<400> 18
Met Gly Lys Ile Ile Gly Ile Asp Leu Gly Thr Thr Asn Ser Cys Val
1 5 10 15
Ala Ile Met Asp Gly Thr Thr Pro Arg Val Leu Glu Asn Ala Glu Gly
20 25 30
Asp Arg Thr Thr Pro Ser Ile Ile Ala Tyr Thr Gln Asp Gly Glu Thr
35 40 45
Leu Val Gly Gln Pro Ala Lys Arg Gln Ala Val Thr Asn Pro Gln Asn
50 55 60
Thr Leu Phe Ala Ile Lys Arg Leu Ile Gly Arg Arg Phe Gln Asp Glu
65 70 75 80
Glu Val Gln Arg Asp Val Ser Ile Met Pro Phe Lys Ile Ile Ala Ala
85 90 95
Asp Asn Gly Asp Ala Trp Val Glu Val Lys Gly Gln Lys Met Ala Pro
100 105 110
Pro Gln Ile Ser Ala Glu Val Leu Lys Lys Met Lys Lys Thr Ala Glu
115 120 125
Asp Tyr Leu Gly Glu Pro Val Thr Glu Ala Val Ile Thr Val Pro Ala
130 135 140
Tyr Phe Asn Asp Ala Gln Arg Gln Ala Thr Lys Asp Ala Gly Arg Ile
145 150 155 160
Ala Gly Leu Glu Val Lys Arg Ile Ile Asn Glu Pro Thr Ala Ala Ala
165 170 175
Leu Ala Tyr Gly Leu Asp Lys Gly Thr Gly Asn Arg Thr Ile Ala Val
180 185 190
Tyr Asp Leu Gly Gly Gly Thr Phe Asp Ile Ser Ile Ile Glu Ile Asp
195 200 205
Glu Val Asp Gly Glu Lys Thr Phe Glu Val Leu Ala Thr Asn Gly Asp
210 215 220
Thr His Leu Gly Gly Glu Asp Phe Asp Ser Arg Leu Ile Asn Tyr Leu
225 230 235 240
Val Glu Glu Phe Lys Lys Asp Gln Gly Ile Asp Leu Arg Asn Asp Pro
245 250 255
Leu Ala Met Gln Arg Leu Lys Glu Ala Ala Glu Lys Ala Lys Ile Glu
260 265 270
Leu Ser Ser Ala Gln Gln Thr Asp Val Asn Leu Pro Tyr Ile Thr Ala
275 280 285
Asp Ala Thr Gly Pro Lys His Met Asn Ile Lys Val Thr Arg Ala Lys
290 295 300
Leu Glu Ser Leu Val Glu Asp Leu Val Asn Arg Ser Ile Glu Pro Leu
305 310 315 320
Lys Val Ala Leu Gln Asp Ala Gly Leu Ser Val Ser Asp Ile Asp Asp
325 330 335
Val Ile Leu Val Gly Gly Gln Thr Arg Met Pro Met Val Gln Lys Lys
340 345 350
Val Ala Glu Phe Phe Gly Lys Glu Pro Arg Lys Asp Val Asn Pro Asp
355 360 365
Glu Ala Val Ala Ile Gly Ala Ala Val Gln Gly Gly Val Leu Thr Gly
370 375 380
Asp Val Lys Asp Val Leu Leu Leu Asp Val Thr Pro Leu Ser Leu Gly
385 390 395 400
Ile Glu Thr Met Gly Gly Val Met Thr Thr Leu Ile Ala Lys Asn Thr
405 410 415
Thr Ile Pro Thr Lys His Ser Gln Val Phe Ser Thr Ala Glu Asp Asn
420 425 430
Gln Ser Ala Val Thr Ile His Val Leu Gln Gly Glu Arg Lys Arg Ala
435 440 445
Ala Asp Asn Lys Ser Leu Gly Gln Phe Asn Leu Asp Gly Ile Asn Pro
450 455 460
Ala Pro Arg Gly Met Pro Gln Ile Glu Val Thr Phe Asp Ile Asp Ala
465 470 475 480
Asp Gly Ile Leu His Val Ser Ala Lys Asp Lys Asn Ser Gly Lys Glu
485 490 495
Gln Lys Ile Thr Ile Lys Ala Ser Ser Gly Leu Asn Glu Asp Glu Ile
500 505 510
Gln Lys Met Val Arg Asp Ala Glu Ala Asn Ala Glu Ala Asp Arg Lys
515 520 525
Phe Glu Glu Leu Val Gln Thr Arg Asn Gln Gly Asp His Leu Leu His
530 535 540
Ser Thr Arg Lys Gln Val Glu Glu Ala Gly Asp Lys Leu Pro Ala Asp
545 550 555 560
Asp Lys Thr Ala Ile Glu Ser Ala Leu Thr Ala Leu Glu Thr Ala Leu
565 570 575
Lys Gly Glu Asp Lys Ala Ala Ile Glu Ala Lys Met Gln Glu Leu Ala
580 585 590
Gln Val Ser Gln Lys Leu Met Glu Ile Ala Gln Gln Gln His Ala Gln
595 600 605
Gln Gln Thr Ala Gly Ala Asp Ala Ser Ala Asn Asn Ala Lys Asp Asp
610 615 620
Asp Val Val Asp Ala Glu Phe Glu Glu Val Lys Asp Lys Lys
625 630 635
<210> 19
<211> 1647
<212> DNA
<213> 大肠杆菌(Escherichia coli)
<400> 19
atggcagcta aagacgtaaa attcggtaac gacgctcgtg tgaaaatgct gcgcggcgta 60
aacgtactgg cagatgcagt gaaagttacc ctcggtccaa aaggccgtaa cgtagttctg 120
gataaatctt tcggtgcacc gaccatcacc aaagatggtg tttccgttgc tcgtgaaatc 180
gaactggaag acaagttcga aaatatgggt gcgcagatgg tgaaagaagt tgcctctaaa 240
gcaaacgacg ctgcaggcga cggtaccacc actgcaaccg tactggctca ggctatcatc 300
actgaaggtc tgaaagctgt tgctgcgggc atgaacccga tggacctgaa acgtggtatc 360
gacaaagcgg ttaccgctgc agttgaagaa ctgaaagcgc tgtccgtacc atgctctgac 420
tctaaagcga ttgctcaggt tggtaccatc tccgctaact ccgacgaaac cgtaggtaaa 480
ctgatcgctg aagcgatgga caaagtcggt aaagaaggcg ttatcaccgt tgaagacggt 540
accggtctgc aggacgaact ggacgtggtt gaaggtatgc agttcgaccg tggctacctg 600
tctccttact tcatcaacaa gccggaaact ggcgcagtag aactggaaag cccgttcatc 660
ctgctggctg acaagaaaat ctccaacatc cgcgaaatgc tgccggttct ggaagctgtt 720
gccaaagcag gcaaaccgct gctgatcatc gctgaagatg tagaaggcga agcgctggca 780
actctggttg ttaacaccat gcgtggcatc gtgaaagtcg ctgcggttaa agcaccgggc 840
ttcggcgatc gtcgtaaagc tatgctgcag gatatcgcaa ccctgactgg cggtaccgtg 900
atctctgaag agatcggtat ggagctggaa aaagcaaccc tggaagacct gggtcaggct 960
aaacgtgttg tgatcaacaa agacaccacc actatcatcg atggcgtggg tgaagaagct 1020
gcaatccagg gccgtgttgc tcagatccgt cagcagattg aagaagcaac ttctgactac 1080
gaccgtgaaa aactgcagga acgcgtagcg aaactggcag gcggcgttgc agttatcaaa 1140
gtgggtgctg ctaccgaagt tgaaatgaaa gagaaaaaag cacgcgttga agatgccctg 1200
cacgcgaccc gtgctgcggt agaagaaggc gtggttgctg gtggtggtgt tgcgctgatc 1260
cgcgtagcgt ctaaactggc tgacctgcgt ggtcagaacg aagaccagaa cgtgggtatc 1320
aaagttgcac tgcgtgcaat ggaagctccg ctgcgtcaga tcgtattgaa ctgcggcgaa 1380
gaaccgtctg ttgttgctaa caccgttaaa ggcggcgacg gcaactacgg ttacaacgca 1440
gcaaccgaag aatacggcaa catgatcgac atgggtatcc tggatccaac caaagtaact 1500
cgttctgctc tgcagtacgc agcttctgtg gctggcctga tgatcaccac cgaatgcatg 1560
gttaccgacc tgccgaaaaa cgatgcagct gacttaggcg ctgctggcgg tatgggcggc 1620
atgggtggca tgggcggcat gatgtaa 1647
<210> 20
<211> 548
<212> PRT
<213> 大肠杆菌(Escherichia coli)
<400> 20
Met Ala Ala Lys Asp Val Lys Phe Gly Asn Asp Ala Arg Val Lys Met
1 5 10 15
Leu Arg Gly Val Asn Val Leu Ala Asp Ala Val Lys Val Thr Leu Gly
20 25 30
Pro Lys Gly Arg Asn Val Val Leu Asp Lys Ser Phe Gly Ala Pro Thr
35 40 45
Ile Thr Lys Asp Gly Val Ser Val Ala Arg Glu Ile Glu Leu Glu Asp
50 55 60
Lys Phe Glu Asn Met Gly Ala Gln Met Val Lys Glu Val Ala Ser Lys
65 70 75 80
Ala Asn Asp Ala Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala
85 90 95
Gln Ala Ile Ile Thr Glu Gly Leu Lys Ala Val Ala Ala Gly Met Asn
100 105 110
Pro Met Asp Leu Lys Arg Gly Ile Asp Lys Ala Val Thr Ala Ala Val
115 120 125
Glu Glu Leu Lys Ala Leu Ser Val Pro Cys Ser Asp Ser Lys Ala Ile
130 135 140
Ala Gln Val Gly Thr Ile Ser Ala Asn Ser Asp Glu Thr Val Gly Lys
145 150 155 160
Leu Ile Ala Glu Ala Met Asp Lys Val Gly Lys Glu Gly Val Ile Thr
165 170 175
Val Glu Asp Gly Thr Gly Leu Gln Asp Glu Leu Asp Val Val Glu Gly
180 185 190
Met Gln Phe Asp Arg Gly Tyr Leu Ser Pro Tyr Phe Ile Asn Lys Pro
195 200 205
Glu Thr Gly Ala Val Glu Leu Glu Ser Pro Phe Ile Leu Leu Ala Asp
210 215 220
Lys Lys Ile Ser Asn Ile Arg Glu Met Leu Pro Val Leu Glu Ala Val
225 230 235 240
Ala Lys Ala Gly Lys Pro Leu Leu Ile Ile Ala Glu Asp Val Glu Gly
245 250 255
Glu Ala Leu Ala Thr Leu Val Val Asn Thr Met Arg Gly Ile Val Lys
260 265 270
Val Ala Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met
275 280 285
Leu Gln Asp Ile Ala Thr Leu Thr Gly Gly Thr Val Ile Ser Glu Glu
290 295 300
Ile Gly Met Glu Leu Glu Lys Ala Thr Leu Glu Asp Leu Gly Gln Ala
305 310 315 320
Lys Arg Val Val Ile Asn Lys Asp Thr Thr Thr Ile Ile Asp Gly Val
325 330 335
Gly Glu Glu Ala Ala Ile Gln Gly Arg Val Ala Gln Ile Arg Gln Gln
340 345 350
Ile Glu Glu Ala Thr Ser Asp Tyr Asp Arg Glu Lys Leu Gln Glu Arg
355 360 365
Val Ala Lys Leu Ala Gly Gly Val Ala Val Ile Lys Val Gly Ala Ala
370 375 380
Thr Glu Val Glu Met Lys Glu Lys Lys Ala Arg Val Glu Asp Ala Leu
385 390 395 400
His Ala Thr Arg Ala Ala Val Glu Glu Gly Val Val Ala Gly Gly Gly
405 410 415
Val Ala Leu Ile Arg Val Ala Ser Lys Leu Ala Asp Leu Arg Gly Gln
420 425 430
Asn Glu Asp Gln Asn Val Gly Ile Lys Val Ala Leu Arg Ala Met Glu
435 440 445
Ala Pro Leu Arg Gln Ile Val Leu Asn Cys Gly Glu Glu Pro Ser Val
450 455 460
Val Ala Asn Thr Val Lys Gly Gly Asp Gly Asn Tyr Gly Tyr Asn Ala
465 470 475 480
Ala Thr Glu Glu Tyr Gly Asn Met Ile Asp Met Gly Ile Leu Asp Pro
485 490 495
Thr Lys Val Thr Arg Ser Ala Leu Gln Tyr Ala Ala Ser Val Ala Gly
500 505 510
Leu Met Ile Thr Thr Glu Cys Met Val Thr Asp Leu Pro Lys Asn Asp
515 520 525
Ala Ala Asp Leu Gly Ala Ala Gly Gly Met Gly Gly Met Gly Gly Met
530 535 540
Gly Gly Met Met
545
<210> 21
<211> 291
<212> DNA
<213> 大肠杆菌(Escherichia coli)
<400> 21
atgcaaaaca caactcatga caacgtaatt ctggagctca ccgttcgcaa ccatccgggc 60
gtaatgaccc acgtttgtgg cctttttgcc cgccgcgctt ttaacgttga aggcattctt 120
tgtctgccga ttcaggacag cgacaaaagc catatctggc tactggtcaa tgacgaccag 180
cgtctggagc agatgataag ccaaatcgat aagctggaag atgtcgtgaa agtgcagcgt 240
aatcagtccg atccgacgat gtttaacaag atcgcggtgt tttttcagta a 291
<210> 22
<211> 96
<212> PRT
<213> 大肠杆菌(Escherichia coli)
<400> 22
Met Gln Asn Thr Thr His Asp Asn Val Ile Leu Glu Leu Thr Val Arg
1 5 10 15
Asn His Pro Gly Val Met Thr His Val Cys Gly Leu Phe Ala Arg Arg
20 25 30
Ala Phe Asn Val Glu Gly Ile Leu Cys Leu Pro Ile Gln Asp Ser Asp
35 40 45
Lys Ser His Ile Trp Leu Leu Val Asn Asp Asp Gln Arg Leu Glu Gln
50 55 60
Met Ile Ser Gln Ile Asp Lys Leu Glu Asp Val Val Lys Val Gln Arg
65 70 75 80
Asn Gln Ser Asp Pro Thr Met Phe Asn Lys Ile Ala Val Phe Phe Gln
85 90 95
<210> 23
<211> 1167
<212> DNA
<213> 大肠杆菌(Escherichia coli)
<400> 23
atgatcaaga ttggcgttat cgccgatgat tttaccggcg cgacggatat cgccagtttt 60
ctggtggaaa acggtctacc aacggtacaa attaacggtg ttccaacagg taaaatgccg 120
gaagcaatcg acgcactggt gatcagcctg aaaacgcgct cctgtccagt ggttgaagcc 180
acacagcaat cgctggcggc tctgagctgg ttgcaacagc aaggttgcaa acagatctat 240
ttcaaatact gctctacttt cgacagtacg gcgaaaggta atattggccc ggttaccgat 300
gccttaatgg atgctctcga cacgccgttt acggtcttct ctccggccct gccggtcaac 360
ggacgtacgg tttatcaggg gtatttgttc gtaatgaatc aactgctggc cgaatccggg 420
atgcgccatc acccggtaaa tcccatgacc gacagctatc ttccccgtct ggttgaagcg 480
caatccacag ggcgctgcgg cgtcgtttcg gcacatgttt tcgaacaagg tgtggatgcc 540
gttcgtcaag agctggctcg cttacagcaa gagggctacc gctacgcggt gcttgatgcg 600
ctgaccgaac accatctgga aattcaggga gaagccttgc gcgatgcccc actggtaacg 660
ggcggttctg gtctggcgat tggcctggcc cggcagtggg cgcaagaaaa cggtaaccag 720
gctcgcaaag cagggcgtcc gctcgctggg cgcggcgtag tgctctccgg ttcatgctct 780
caaatgacca accgccaggt agcacattac cgtcaaattg caccagcccg tgaagttgat 840
gtggcacgct gcctctcaat tgaaactctg gccgcttatg cacacgaact ggcagagtgg 900
gttctgggcc aggaaagtgt acttgctcca ctggtttttg ccaccgccag cactgacgca 960
ttggcagcaa ttcaacagca atacggtgca caaaaagcca gtcaggcagt agaaacactg 1020
ttttctcaac tagcggcgcg gttagcagcg gaaggcgtga cacgctttat tgtcgcaggc 1080
ggtgagacct ccggcgtagt cacacagagc ctgggaataa aagggtttca tattggccca 1140
accatttccc cggcgtgccg tgggtaa 1167
<210> 24
<211> 420
<212> PRT
<213> 大肠杆菌(Escherichia coli)
<400> 24
Met Ile Lys Ile Gly Val Ile Ala Asp Asp Phe Thr Gly Ala Thr Asp
1 5 10 15
Ile Ala Ser Phe Leu Val Glu Asn Gly Leu Pro Thr Val Gln Ile Asn
20 25 30
Gly Val Pro Thr Gly Lys Met Pro Glu Ala Ile Asp Ala Leu Val Ile
35 40 45
Ser Leu Lys Thr Arg Ser Cys Pro Val Val Glu Ala Thr Gln Gln Ser
50 55 60
Leu Ala Ala Leu Ser Trp Leu Gln Gln Gln Gly Cys Lys Gln Ile Tyr
65 70 75 80
Phe Lys Tyr Cys Ser Thr Phe Asp Ser Thr Ala Lys Gly Asn Ile Gly
85 90 95
Pro Val Thr Asp Ala Leu Met Asp Ala Leu Asp Thr Pro Phe Thr Val
100 105 110
Phe Ser Pro Ala Leu Pro Val Asn Gly Arg Thr Val Tyr Gln Gly Tyr
115 120 125
Leu Phe Val Met Asn Gln Leu Leu Ala Glu Ser Gly Met Arg His His
130 135 140
Pro Val Asn Pro Met Thr Asp Ser Tyr Leu Pro Arg Leu Val Glu Ala
145 150 155 160
Gln Ser Thr Gly Arg Cys Gly Val Val Ser Ala His Val Phe Glu Gln
165 170 175
Gly Val Asp Ala Val Arg Gln Glu Leu Ala Arg Leu Gln Gln Glu Gly
180 185 190
Tyr Arg Tyr Ala Val Leu Asp Ala Leu Thr Glu His His Leu Glu Ile
195 200 205
Gln Gly Glu Ala Leu Arg Asp Ala Pro Leu Val Thr Gly Gly Ser Gly
210 215 220
Leu Ala Ile Gly Leu Ala Arg Gln Trp Ala Gln Glu Asn Gly Asn Gln
225 230 235 240
Ala Arg Glu Ala Gly His Pro Leu Ala Gly Arg Gly Val Val Leu Ser
245 250 255
Gly Ser Cys Ser Gln Met Thr Asn Arg Gln Val Ala His Tyr Arg Gln
260 265 270
Ile Ala Pro Ala Arg Glu Val Asp Val Ala Arg Cys Leu Ser Thr Glu
275 280 285
Thr Leu Ala Ala Tyr Ala His Glu Leu Ala Glu Trp Val Leu Gly Gln
290 295 300
Glu Ser Val Leu Ala Pro Leu Val Phe Ala Thr Ala Ser Thr Asp Ala
305 310 315 320
Leu Ala Ala Ile Gln Gln Gln Tyr Gly Ala Gln Lys Ala Ser Gln Ala
325 330 335
Val Glu Thr Leu Phe Ser Gln Leu Ala Ala Arg Leu Ala Ala Glu Gly
340 345 350
Val Thr Arg Phe Ile Val Ala Gly Gly Glu Thr Ser Gly Val Val Thr
355 360 365
Gln Ser Leu Gly Ile Lys Gly Phe His Ile Gly Pro Thr Ile Ser Pro
370 375 380
Gly Val Pro Trp Val Asn Ala Leu Asp Lys Pro Val Ser Leu Ala Leu
385 390 395 400
Lys Ser Gly Asn Phe Gly Asp Glu Ala Phe Phe Ser Arg Ala Gln Arg
405 410 415
Glu Phe Leu Ser
420
<210> 25
<211> 4029
<212> DNA
<213> 大肠杆菌(Escherichia coli)
<400> 25
atggtttact cctataccga gaaaaaacgt attcgtaagg attttggtaa acgtccacaa 60
gttctggatg taccttatct cctttctatc cagcttgact cgtttcagaa atttatcgag 120
caagatcctg aagggcagta tggtctggaa gctgctttcc gttccgtatt cccgattcag 180
agctacagcg gtaattccga gctgcaatac gtcagctacc gccttggcga accggtgttt 240
gacgtccagg aatgtcaaat ccgtggcgtg acctattccg caccgctgcg cgttaaactg 300
cgtctggtga tctatgagcg cgaagcgccg gaaggcaccg taaaagacat taaagaacaa 360
gaagtctaca tgggcgaaat tccgctcatg acagacaacg gtacctttgt tatcaacggt 420
actgagcgtg ttatcgtttc ccagctgcac cgtagtccgg gcgtcttctt tgactccgac 480
aaaggtaaaa cccactcttc gggtaaagtg ctgtataacg cgcgtatcat cccttaccgt 540
ggttcctggc tggacttcga attcgatccg aaggacaacc tgttcgtacg tatcgaccgt 600
cgccgtaaac tgcctgcgac catcattctg cgcgccctga actacaccac agagcagatc 660
ctcgacctgt tctttgaaaa agttatcttt gaaatccgtg ataacaagct gcagatggaa 720
ctggtgccgg aacgcctgcg tggtgaaacc gcatcttttg acatcgaagc taacggtaaa 780
gtgtacgtag aaaaaggccg ccgtatcact gcgcgccaca ttcgccagct ggaaaaagac 840
gacgtcaaac tgatcgaagt cccggttgag tacatcgcag gtaaagtggt tgctaaagac 900
tatattgatg agtctaccgg cgagctgatc tgcgcagcga acatggagct gagcctggat 960
ctgctggcta agctgagcca gtctggtcac aagcgtatcg aaacgctgtt caccaacgat 1020
ctggatcacg gcccatatat ctctgaaacc ttacgtgtcg acccaactaa cgaccgtctg 1080
agcgcactgg tagaaatcta ccgcatgatg cgccctggcg agccgccgac tcgtgaagca 1140
gctgaaagcc tgttcgagaa cctgttcttc tccgaagacc gttatgactt gtctgcggtt 1200
ggtcgtatga agttcaaccg ttctctgctg cgcgaagaaa tcgaaggttc cggtatcctg 1260
agcaaagacg acatcattga tgttatgaaa aagctcatcg atatccgtaa cggtaaaggc 1320
gaagtcgatg atatcgacca cctcggcaac cgtcgtatcc gttccgttgg cgaaatggcg 1380
gaaaaccagt tccgcgttgg cctggtacgt gtagagcgtg cggtgaaaga gcgtctgtct 1440
ctgggcgatc tggataccct gatgccacag gatatgatca acgccaagcc gatttccgca 1500
gcagtgaaag agttcttcgg ttccagccag ctgtctcagt ttatggacca gaacaacccg 1560
ctgtctgaga ttacgcacaa acgtcgtatc tccgcactcg gcccaggcgg tctgacccgt 1620
gaacgtgcag gcttcgaagt tcgagacgta cacccgactc actacggtcg cgtatgtcca 1680
atcgaaaccc ctgaaggtcc gaacatcggt ctgatcaact ctctgtccgt gtacgcacag 1740
actaacgaat acggcttcct tgagactccg tatcgtaaag tgaccgacgg tgttgtaact 1800
gacgaaattc actacctgtc tgctatcgaa gaaggcaact acgttatcgc ccaggcgaac 1860
tccaacttgg atgaagaagg ccacttcgta gaagacctgg taacttgccg tagcaaaggc 1920
gaatccagct tgttcagccg cgaccaggtt gactacatgg acgtatccac ccagcaggtg 1980
gtatccgtcg gtgcgtccct gatcccgttc ctggaacacg atgacgccaa ccgtgcattg 2040
atgggtgcga acatgcaacg tcaggccgtt ccgactctgc gcgctgataa gccgctggtt 2100
ggtactggta tggaacgtgc tgttgccgtt gactccggtg taactgcggt agctaaacgt 2160
ggtggtgtcg ttcagtacgt ggatgcttcc cgtatcgtta tcaaagttaa cgaagacgag 2220
atgtatccgg gtgaagcagg tatcgacatc tacaacctga ccaaatacac ccgttctaac 2280
cagaacacct gtatcaacca gatgccgtgt gtgtctctgg gtgaaccggt tgaacgtggc 2340
gacgtgctgg cagacggtcc gtccaccgac ctcggtgaac tggcgcttgg tcagaacatg 2400
cgcgtagcgt tcatgccgtg gaatggttac aacttcgaag actccatcct cgtatccgag 2460
cgtgttgttc aggaagaccg tttcaccacc atccacattc aggaactggc gtgtgtgtcc 2520
cgtgacacca agctgggtcc ggaagagatc accgctgaca tcccgaacgt gggtgaagct 2580
gcgctctcca aactggatga atccggtatc gtttacattg gtgcggaagt gaccggtggc 2640
gacattctgg ttggtaaggt aacgccgaaa ggtgaaactc agctgacccc agaagaaaaa 2700
ctgctgcgtg cgatcttcgg tgagaaagcc tctgacgtta aagactcttc tctgcgcgta 2760
ccaaacggtg tatccggtac ggttatcgac gttcaggtct ttactcgcga tggcgtagaa 2820
aaagacaaac gtgcgctgga aatcgaagaa atgcagctca aacaggcgaa gaaagacctg 2880
tctgaagaac tgcagatcct cgaagcgggt ctgttcagcc gtatccgtgc tgtgctggta 2940
gccggtggcg ttgaagctga gaagctcgac aaactgccgc gcgatcgctg gctggagctg 3000
ggcctgacag acgaagagaa acaaaatcag ctggaacagc tggctgagca gtatgacgaa 3060
ctgaaacacg agttcgagaa gaaactcgaa gcgaaacgcc gcaaaatcac ccagggcgac 3120
gatctggcac cgggcgtgct gaagattgtt aaggtatatc tggcggttaa acgccgtatc 3180
cagcctggtg acaagatggc aggtcgtcac ggtaacaagg gtgtaatttc taagatcaac 3240
ccgatcgaag atatgcctta cgatgaaaac ggtacgccgg tagacatcgt actgaacccg 3300
ctgggcgtac cgtctcgtat gaacatcggt cagatcctcg aaacccacct gggtatggct 3360
gcgaaaggta tcggcgacaa gatcaacgcc atgctgaaac agcagcaaga agtcgcgaaa 3420
ctgcgcgaat tcatccagcg tgcgtacgat ctgggcgctg acgttcgtca gaaagttgac 3480
ctgagtacct tcagcgatga agaagttatg cgtctggctg aaaacctgcg caaaggtatg 3540
ccaatcgcaa cgccggtgtt cgacggtgcg aaagaagcag aaattaaaga gctgctgaaa 3600
cttggcgacc tgccgacttc cggtcagatc cgcctgtacg atggtcgcac tggtgaacag 3660
ttcgagcgtc cggtaaccgt tggttacatg tacatgctga aactgaacca cctggtcgac 3720
gacaagatgc acgcgcgttc caccggttct tacagcctgg ttactcagca gccgctgggt 3780
ggtaaggcac agttcggtgg tcagcgtttc ggggagatgg aagtgtgggc gctggaagca 3840
tacggcgcag catacaccct gcaggaaatg ctcaccgtta agtctgatga cgtgaacggt 3900
cgtaccaaga tgtataaaaa catcgtggac ggcaaccatc agatggagcc gggcatgcca 3960
gaatccttca acgtattgtt gaaagagatt cgttcgctgg gtatcaacat cgaactggaa 4020
gacgagtaa 4029
<210> 26
<211> 1342
<212> PRT
<213> 大肠杆菌(Escherichia coli)
<400> 26
Met Val Tyr Ser Tyr Thr Glu Lys Lys Arg Ile Arg Lys Asp Phe Gly
1 5 10 15
Lys Arg Pro Gln Val Leu Asp Val Pro Tyr Leu Leu Ser Ile Gln Leu
20 25 30
Asp Ser Phe Gln Lys Phe Ile Glu Gln Asp Pro Glu Gly Gln Tyr Gly
35 40 45
Leu Glu Ala Ala Phe Arg Ser Val Phe Pro Ile Gln Ser Tyr Ser Gly
50 55 60
Asn Ser Glu Leu Gln Tyr Val Ser Tyr Arg Leu Gly Glu Pro Val Phe
65 70 75 80
Asp Val Gln Glu Cys Gln Ile Arg Gly Val Thr Tyr Ser Ala Pro Leu
85 90 95
Arg Val Lys Leu Arg Leu Val Ile Tyr Glu Arg Glu Ala Pro Glu Gly
100 105 110
Thr Val Lys Asp Ile Lys Glu Gln Glu Val Tyr Met Gly Glu Ile Pro
115 120 125
Leu Met Thr Asp Asn Gly Thr Phe Val Ile Asn Gly Thr Glu Arg Val
130 135 140
Ile Val Ser Gln Leu His Arg Ser Pro Gly Val Phe Phe Asp Ser Asp
145 150 155 160
Lys Gly Lys Thr His Ser Ser Gly Lys Val Leu Tyr Asn Ala Arg Ile
165 170 175
Ile Pro Tyr Arg Gly Ser Trp Leu Asp Phe Glu Phe Asp Pro Lys Asp
180 185 190
Asn Leu Phe Val Arg Ile Asp Arg Arg Arg Lys Leu Pro Ala Thr Ile
195 200 205
Ile Leu Arg Ala Leu Asn Tyr Thr Thr Glu Gln Ile Leu Asp Leu Phe
210 215 220
Phe Glu Lys Val Ile Phe Glu Ile Arg Asp Asn Lys Leu Gln Met Glu
225 230 235 240
Leu Val Pro Glu Arg Leu Arg Gly Glu Thr Ala Ser Phe Asp Ile Glu
245 250 255
Ala Asn Gly Lys Val Tyr Val Glu Lys Gly Arg Arg Ile Thr Ala Arg
260 265 270
His Ile Arg Gln Leu Glu Lys Asp Asp Val Lys Leu Ile Glu Val Pro
275 280 285
Val Glu Tyr Ile Ala Gly Lys Val Val Ala Lys Asp Tyr Ile Asp Glu
290 295 300
Ser Thr Gly Glu Leu Ile Cys Ala Ala Asn Met Glu Leu Ser Leu Asp
305 310 315 320
Leu Leu Ala Lys Leu Ser Gln Ser Gly His Lys Arg Ile Glu Thr Leu
325 330 335
Phe Thr Asn Asp Leu Asp His Gly Pro Tyr Ile Ser Glu Thr Leu Arg
340 345 350
Val Asp Pro Thr Asn Asp Arg Leu Ser Ala Leu Val Glu Ile Tyr Arg
355 360 365
Met Met Arg Pro Gly Glu Pro Pro Thr Arg Glu Ala Ala Glu Ser Leu
370 375 380
Phe Glu Asn Leu Phe Phe Ser Glu Asp Arg Tyr Asp Leu Ser Ala Val
385 390 395 400
Gly Arg Met Lys Phe Asn Arg Ser Leu Leu Arg Glu Glu Ile Glu Gly
405 410 415
Ser Gly Ile Leu Ser Lys Asp Asp Ile Ile Asp Val Met Lys Lys Leu
420 425 430
Ile Asp Ile Arg Asn Gly Lys Gly Glu Val Asp Asp Ile Asp His Leu
435 440 445
Gly Asn Arg Arg Ile Arg Ser Val Gly Glu Met Ala Glu Asn Gln Phe
450 455 460
Arg Val Gly Leu Val Arg Val Glu Arg Ala Val Lys Glu Arg Leu Ser
465 470 475 480
Leu Gly Asp Leu Asp Thr Leu Met Pro Gln Asp Met Ile Asn Ala Lys
485 490 495
Pro Ile Ser Ala Ala Val Lys Glu Phe Phe Gly Ser Ser Gln Leu Ser
500 505 510
Gln Phe Met Asp Gln Asn Asn Pro Leu Ser Glu Ile Thr His Lys Arg
515 520 525
Arg Ile Ser Ala Leu Gly Pro Gly Gly Leu Thr Arg Glu Arg Ala Gly
530 535 540
Phe Glu Val Arg Asp Val His Pro Thr His Tyr Gly Arg Val Cys Pro
545 550 555 560
Ile Glu Thr Pro Glu Gly Pro Asn Ile Gly Leu Ile Asn Ser Leu Ser
565 570 575
Val Tyr Ala Gln Thr Asn Glu Tyr Gly Phe Leu Glu Thr Pro Tyr Arg
580 585 590
Lys Val Thr Asp Gly Val Val Thr Asp Glu Ile His Tyr Leu Ser Ala
595 600 605
Ile Glu Glu Gly Asn Tyr Val Ile Ala Gln Ala Asn Ser Asn Leu Asp
610 615 620
Glu Glu Gly His Phe Val Glu Asp Leu Val Thr Cys Arg Ser Lys Gly
625 630 635 640
Glu Ser Ser Leu Phe Ser Arg Asp Gln Val Asp Tyr Met Asp Val Ser
645 650 655
Thr Gln Gln Val Val Ser Val Gly Ala Ser Leu Ile Pro Phe Leu Glu
660 665 670
His Asp Asp Ala Asn Arg Ala Leu Met Gly Ala Asn Met Gln Arg Gln
675 680 685
Ala Val Pro Thr Leu Arg Ala Asp Lys Pro Leu Val Gly Thr Gly Met
690 695 700
Glu Arg Ala Val Ala Val Asp Ser Gly Val Thr Ala Val Ala Lys Arg
705 710 715 720
Gly Gly Val Val Gln Tyr Val Asp Ala Ser Arg Ile Val Ile Lys Val
725 730 735
Asn Glu Asp Glu Met Tyr Pro Gly Glu Ala Gly Ile Asp Ile Tyr Asn
740 745 750
Leu Thr Lys Tyr Thr Arg Ser Asn Gln Asn Thr Cys Ile Asn Gln Met
755 760 765
Pro Cys Val Ser Leu Gly Glu Pro Val Glu Arg Gly Asp Val Leu Ala
770 775 780
Asp Gly Pro Ser Thr Asp Leu Gly Glu Leu Ala Leu Gly Gln Asn Met
785 790 795 800
Arg Val Ala Phe Met Pro Trp Asn Gly Tyr Asn Phe Glu Asp Ser Ile
805 810 815
Leu Val Ser Glu Arg Val Val Gln Glu Asp Arg Phe Thr Thr Ile His
820 825 830
Ile Gln Glu Leu Ala Cys Val Ser Arg Asp Thr Lys Leu Gly Pro Glu
835 840 845
Glu Ile Thr Ala Asp Ile Pro Asn Val Gly Glu Ala Ala Leu Ser Lys
850 855 860
Leu Asp Glu Ser Gly Ile Val Tyr Ile Gly Ala Glu Val Thr Gly Gly
865 870 875 880
Asp Ile Leu Val Gly Lys Val Thr Pro Lys Gly Glu Thr Gln Leu Thr
885 890 895
Pro Glu Glu Lys Leu Leu Arg Ala Ile Phe Gly Glu Lys Ala Ser Asp
900 905 910
Val Lys Asp Ser Ser Leu Arg Val Pro Asn Gly Val Ser Gly Thr Val
915 920 925
Ile Asp Val Gln Val Phe Thr Arg Asp Gly Val Glu Lys Asp Lys Arg
930 935 940
Ala Leu Glu Ile Glu Glu Met Gln Leu Lys Gln Ala Lys Lys Asp Leu
945 950 955 960
Ser Glu Glu Leu Gln Ile Leu Glu Ala Gly Leu Phe Ser Arg Ile Arg
965 970 975
Ala Val Leu Val Ala Gly Gly Val Glu Ala Glu Lys Leu Asp Lys Leu
980 985 990
Pro Arg Asp Arg Trp Leu Glu Leu Gly Leu Thr Asp Glu Glu Lys Gln
995 1000 1005
Asn Gln Leu Glu Gln Leu Ala Glu Gln Tyr Asp Glu Leu Lys His
1010 1015 1020
Glu Phe Glu Lys Lys Leu Glu Ala Lys Arg Arg Lys Ile Thr Gln
1025 1030 1035
Gly Asp Asp Leu Ala Pro Gly Val Leu Lys Ile Val Lys Val Tyr
1040 1045 1050
Leu Ala Val Lys Arg Arg Ile Gln Pro Gly Asp Lys Met Ala Gly
1055 1060 1065
Arg His Gly Asn Lys Gly Val Ile Ser Lys Ile Asn Pro Ile Glu
1070 1075 1080
Asp Met Pro Tyr Asp Glu Asn Gly Thr Pro Val Asp Ile Val Leu
1085 1090 1095
Asn Pro Leu Gly Val Pro Ser Arg Met Asn Ile Gly Gln Ile Leu
1100 1105 1110
Glu Thr His Leu Gly Met Ala Ala Lys Gly Ile Gly Asp Lys Ile
1115 1120 1125
Asn Ala Met Leu Lys Gln Gln Gln Glu Val Ala Lys Leu Arg Glu
1130 1135 1140
Phe Ile Gln Arg Ala Tyr Asp Leu Gly Ala Asp Val Arg Gln Lys
1145 1150 1155
Val Asp Leu Ser Thr Phe Ser Asp Glu Glu Val Met Arg Leu Ala
1160 1165 1170
Glu Asn Leu Arg Lys Gly Met Pro Ile Ala Thr Pro Val Phe Asp
1175 1180 1185
Gly Ala Lys Glu Ala Glu Ile Lys Glu Leu Leu Lys Leu Gly Asp
1190 1195 1200
Leu Pro Thr Ser Gly Gln Ile Arg Leu Tyr Asp Gly Arg Thr Gly
1205 1210 1215
Glu Gln Phe Glu Arg Pro Val Thr Val Gly Tyr Met Tyr Met Leu
1220 1225 1230
Lys Leu Asn His Leu Val Asp Asp Lys Met His Ala Arg Ser Thr
1235 1240 1245
Gly Ser Tyr Ser Leu Val Thr Gln Gln Pro Leu Gly Gly Lys Ala
1250 1255 1260
Gln Phe Gly Gly Gln Arg Phe Gly Glu Met Glu Val Trp Ala Leu
1265 1270 1275
Glu Ala Tyr Gly Ala Ala Tyr Thr Leu Gln Glu Met Leu Thr Val
1280 1285 1290
Lys Ser Asp Asp Val Asn Gly Arg Thr Lys Met Tyr Lys Asn Ile
1295 1300 1305
Val Asp Gly Asn His Gln Met Glu Pro Gly Met Pro Glu Ser Phe
1310 1315 1320
Asn Val Leu Leu Lys Glu Ile Arg Ser Leu Gly Ile Asn Ile Glu
1325 1330 1335
Leu Glu Asp Glu
1340
<210> 27
<211> 4224
<212> DNA
<213> 大肠杆菌(Escherichia coli)
<400> 27
gtgaaagatt tattaaagtt tctgaaagcg cagactaaaa ccgaagagtt tgatgcgatc 60
aaaattgctc tggcttcgcc agacatgatc cgttcatggt ctttcggtga agttaaaaag 120
ccggaaacca tcaactaccg tacgttcaaa ccagaacgtg acggcctttt ctgcgcccgt 180
atctttgggc cggtaaaaga ttacgagtgc ctgtgcggta agtacaagcg cctgaaacac 240
cgtggcgtca tctgtgagaa gtgcggcgtt gaagtgaccc agactaaagt acgccgtgag 300
cgtatgggcc acatcgaact ggcttccccg actgcgcaca tctggttcct gaaatcgctg 360
ccgtcccgta tcggtctgct gctcgatatg ccgctgcgcg atatcgaacg cgtactgtac 420
tttgaatcct atgtggttat cgaaggcggt atgaccaacc tggaacgtca gcagatcctg 480
actgaagagc agtatctgga cgcgctggaa gagttcggtg acgaattcga cgcgaagatg 540
ggggcggaag caatccaggc tctgctgaag agcatggatc tggagcaaga gtgcgaacag 600
ctgcgtgaag agctgaacga aaccaactcc gaaaccaagc gtaaaaagct gaccaagcgt 660
atcaaactgc tggaagcgtt cgttcagtct ggtaacaaac cagagtggat gatcctgacc 720
gttctgccgg tactgccgcc agatctgcgt ccgctggttc cgctggatgg tggtcgtttc 780
gcgacttctg acctgaacga tctgtatcgt cgcgtcatta accgtaacaa ccgtctgaaa 840
cgtctgctgg atctggctgc gccggacatc atcgtacgta acgaaaaacg tatgctgcag 900
gaagcggtag acgccctgct ggataacggt cgtcgcggtc gtgcgatcac cggttctaac 960
aagcgtcctc tgaaatcttt ggccgacatg atcaaaggta aacagggtcg tttccgtcag 1020
aacctgctcg gtaagcgtgt tgactactcc ggtcgttctg taatcaccgt aggtccatac 1080
ctgcgtctgc atcagtgcgg tctgccgaag aaaatggcac tggagctgtt caaaccgttc 1140
atctacggca agctggaact gcgtggtctt gctaccacca ttaaagctgc gaagaaaatg 1200
gttgagcgcg aagaagctgt cgtttgggat atcctggacg aagttatccg cgaacacccg 1260
gtactgctga accgtgcacc gactctgcac cgtctgggta tccaggcatt tgaaccggta 1320
ctgatcgaag gtaaagctat ccagctgcac ccgctggttt gtgcggcata taacgccgac 1380
ttcgatggtg accagatggc tgttcacgta ccgctgacgc tggaagccca gctggaagcg 1440
cgtgcgctga tgatgtctac caacaacatc ctgtccccgg cgaacggcga accaatcatc 1500
gttccgtctc aggacgttgt actgggtctg tactacatga cccgtgactg tgttaacgcc 1560
aaaggcgaag gcatggtgct gactggcccg aaagaagcag aacgtctgta tcgctctggt 1620
ctggcttctc tgcatgcgcg cgttaaagtg cgtatcaccg agtatgaaaa agatgctaac 1680
ggtgaattag tagcgaaaac cagcctgaaa gacacgactg ttggccgtgc cattctgtgg 1740
atgattgtac cgaaaggtct gccttactcc atcgtcaacc aggcgctggg taaaaaagca 1800
atctccaaaa tgctgaacac ctgctaccgc attctcggtc tgaaaccgac cgttattttt 1860
gcggaccaga tcatgtacac cggcttcgcc tatgcagcgc gttctggtgc atctgttggt 1920
atcgatgaca tggtcatccc ggagaagaaa cacgaaatca tctccgaggc agaagcagaa 1980
gttgctgaaa ttcaggagca gttccagtct ggtctggtaa ctgcgggcga acgctacaac 2040
aaagttatcg atatctgggc tgcggcgaac gatcgtgtat ccaaagcgat gatggataac 2100
ctgcaaactg aaaccgtgat taaccgtgac ggtcaggaag agaagcaggt ttccttcaac 2160
agcatctaca tgatggccga ctccggtgcg cgtggttctg cggcacagat tcgtcagctt 2220
gctggtatgc gtggtctgat ggcgaagccg gatggctcca tcatcgaaac gccaatcacc 2280
gcgaacttcc gtgaaggtct gaacgtactc cagtacttca tctccaccca cggtgctcgt 2340
aaaggtctgg cggataccgc actgaaaact gcgaactccg gttacctgac tcgtcgtctg 2400
gttgacgtgg cgcaggacct ggtggttacc gaagacgatt gtggtaccca tgaaggtatc 2460
atgatgactc cggttatcga gggtggtgac gttaaagagc cgctgcgcga tcgcgtactg 2520
ggtcgtgtaa ctgctgaaga cgttctgaag ccgggtactg ctgatatcct cgttccgcgc 2580
aacacgctgc tgcacgaaca gtggtgtgac ctgctggaag agaactctgt cgacgcggtt 2640
aaagtacgtt ctgttgtatc ttgtgacacc gactttggtg tatgtgcgca ctgctacggt 2700
cgtgacctgg cgcgtggcca catcatcaac aagggtgaag caatcggtgt tatcgcggca 2760
cagtccatcg gtgaaccggg tacacagctg accatgcgta cgttccacat cggtggtgcg 2820
gcatctcgtg cggctgctga atccagcatc caagtgaaaa acaaaggtag catcaagctc 2880
agcaacgtga agtcggttgt gaactccagc ggtaaactgg ttatcacttc ccgtaatact 2940
gaactgaaac tgatcgacga attcggtcgt actaaagaaa gctacaaagt accttacggt 3000
gcggtactgg cgaaaggcga tggcgaacag gttgctggcg gcgaaaccgt tgcaaactgg 3060
gacccgcaca ccatgccggt tatcaccgaa gtaagcggtt ttgtacgctt tactgacatg 3120
atcgacggcc agaccattac gcgtcagacc gacgaactga ccggtctgtc ttcgctggtg 3180
gttctggatt ccgcagaacg taccgcaggt ggtaaagatc tgcgtccggc actgaaaatc 3240
gttgatgctc agggtaacga cgttctgatc ccaggtaccg atatgccagc gcagtacttc 3300
ctgccgggta aagcgattgt tcagctggaa gatggcgtac agatcagctc tggtgacacc 3360
ctggcgcgta ttccgcagga atccggcggt accaaggaca tcaccggtgg tctgccgcgc 3420
gttgcggacc tgttcgaagc acgtcgtccg aaagagccgg caatcctggc tgaaatcagc 3480
ggtatcgttt ccttcggtaa agaaaccaaa ggtaaacgtc gtctggttat caccccggta 3540
gacggtagcg atccgtacga agagatgatt ccgaaatggc gtcagctcaa cgtgttcgaa 3600
ggtgaacgtg tagaacgtgg tgacgtaatt tccgacggtc cggaagcgcc gcacgacatt 3660
ctgcgtctgc gtggtgttca tgctgttact cgttacatcg ttaacgaagt acaggacgta 3720
taccgtctgc agggcgttaa gattaacgat aaacacatcg aagttatcgt tcgtcagatg 3780
ctgcgtaaag ctaccatcgt taacgcgggt agctccgact tcctggaagg cgaacaggtt 3840
gaatactctc gcgtcaagat cgcaaaccgc gaactggaag cgaacggcaa agtgggtgca 3900
acttactccc gcgatctgct gggtatcacc aaagcgtctc tggcaaccga gtccttcatc 3960
tccgcggcat cgttccagga gaccactcgc gtgctgaccg aagcagccgt tgcgggcaaa 4020
cgcgacgaac tgcgcggcct gaaagagaac gttatcgtgg gtcgtctgat cccggcaggt 4080
accggttacg cgtaccacca ggatcgtatg cgtcgccgtg ctgcgggtga agctccggct 4140
gcaccgcagg tgactgcaga agacgcatct gccagcctgg cagaactgct gaacgcaggt 4200
ctgggcggtt ctgataacga gtaa 4224
<210> 28
<211> 1407
<212> PRT
<213> 大肠杆菌(Escherichia coli)
<400> 28
Met Lys Asp Leu Leu Lys Phe Leu Lys Ala Gln Thr Lys Thr Glu Glu
1 5 10 15
Phe Asp Ala Ile Lys Ile Ala Leu Ala Ser Pro Asp Met Ile Arg Ser
20 25 30
Trp Ser Phe Gly Glu Val Lys Lys Pro Glu Thr Ile Asn Tyr Arg Thr
35 40 45
Phe Lys Pro Glu Arg Asp Gly Leu Phe Cys Ala Arg Ile Phe Gly Pro
50 55 60
Val Lys Asp Tyr Glu Cys Leu Cys Gly Lys Tyr Lys Arg Leu Lys His
65 70 75 80
Arg Gly Val Ile Cys Glu Lys Cys Gly Val Glu Val Thr Gln Thr Lys
85 90 95
Val Arg Arg Glu Arg Met Gly His Ile Glu Leu Ala Ser Pro Thr Ala
100 105 110
His Ile Trp Phe Leu Lys Ser Leu Pro Ser Arg Ile Gly Leu Leu Leu
115 120 125
Asp Met Pro Leu Arg Asp Ile Glu Arg Val Leu Tyr Phe Glu Ser Tyr
130 135 140
Val Val Ile Glu Gly Gly Met Thr Asn Leu Glu Arg Gln Gln Ile Leu
145 150 155 160
Thr Glu Glu Gln Tyr Leu Asp Ala Leu Glu Glu Phe Gly Asp Glu Phe
165 170 175
Asp Ala Lys Met Gly Ala Glu Ala Ile Gln Ala Leu Leu Lys Ser Met
180 185 190
Asp Leu Glu Gln Glu Cys Glu Gln Leu Arg Glu Glu Leu Asn Glu Thr
195 200 205
Asn Ser Glu Thr Lys Arg Lys Lys Leu Thr Lys Arg Ile Lys Leu Leu
210 215 220
Glu Ala Phe Val Gln Ser Gly Asn Lys Pro Glu Trp Met Ile Leu Thr
225 230 235 240
Val Leu Pro Val Leu Pro Pro Asp Leu Arg Pro Leu Val Pro Leu Asp
245 250 255
Gly Gly Arg Phe Ala Thr Ser Asp Leu Asn Asp Leu Tyr Arg Arg Val
260 265 270
Ile Asn Arg Asn Asn Arg Leu Lys Arg Leu Leu Asp Leu Ala Ala Pro
275 280 285
Asp Ile Ile Val Arg Asn Glu Lys Arg Met Leu Gln Glu Ala Val Asp
290 295 300
Ala Leu Leu Asp Asn Gly Arg Arg Gly Arg Ala Ile Thr Gly Ser Asn
305 310 315 320
Lys Arg Pro Leu Lys Ser Leu Ala Asp Met Ile Lys Gly Lys Gln Gly
325 330 335
Arg Phe Arg Gln Asn Leu Leu Gly Lys Arg Val Asp Tyr Ser Gly Arg
340 345 350
Ser Val Ile Thr Val Gly Pro Tyr Leu Arg Leu His Gln Cys Gly Leu
355 360 365
Pro Lys Lys Met Ala Leu Glu Leu Phe Lys Pro Phe Ile Tyr Gly Lys
370 375 380
Leu Glu Leu Arg Gly Leu Ala Thr Thr Ile Lys Ala Ala Lys Lys Met
385 390 395 400
Val Glu Arg Glu Glu Ala Val Val Trp Asp Ile Leu Asp Glu Val Ile
405 410 415
Arg Glu His Pro Val Leu Leu Asn Arg Ala Pro Thr Leu His Arg Leu
420 425 430
Gly Ile Gln Ala Phe Glu Pro Val Leu Ile Glu Gly Lys Ala Ile Gln
435 440 445
Leu His Pro Leu Val Cys Ala Ala Tyr Asn Ala Asp Phe Asp Gly Asp
450 455 460
Gln Met Ala Val His Val Pro Leu Thr Leu Glu Ala Gln Leu Glu Ala
465 470 475 480
Arg Ala Leu Met Met Ser Thr Asn Asn Ile Leu Ser Pro Ala Asn Gly
485 490 495
Glu Pro Ile Ile Val Pro Ser Gln Asp Val Val Leu Gly Leu Tyr Tyr
500 505 510
Met Thr Arg Asp Cys Val Asn Ala Lys Gly Glu Gly Met Val Leu Thr
515 520 525
Gly Pro Lys Glu Ala Glu Arg Leu Tyr Arg Ser Gly Leu Ala Ser Leu
530 535 540
His Ala Arg Val Lys Val Arg Ile Thr Glu Tyr Glu Lys Asp Ala Asn
545 550 555 560
Gly Glu Leu Val Ala Lys Thr Ser Leu Lys Asp Thr Thr Val Gly Arg
565 570 575
Ala Ile Leu Trp Met Ile Val Pro Lys Gly Leu Pro Tyr Ser Ile Val
580 585 590
Asn Gln Ala Leu Gly Lys Lys Ala Ile Ser Lys Met Leu Asn Thr Cys
595 600 605
Tyr Arg Ile Leu Gly Leu Lys Pro Thr Val Ile Phe Ala Asp Gln Ile
610 615 620
Met Tyr Thr Gly Phe Ala Tyr Ala Ala Arg Ser Gly Ala Ser Val Gly
625 630 635 640
Ile Asp Asp Met Val Ile Pro Glu Lys Lys His Glu Ile Ile Ser Glu
645 650 655
Ala Glu Ala Glu Val Ala Glu Ile Gln Glu Gln Phe Gln Ser Gly Leu
660 665 670
Val Thr Ala Gly Glu Arg Tyr Asn Lys Val Ile Asp Ile Trp Ala Ala
675 680 685
Ala Asn Asp Arg Val Ser Lys Ala Met Met Asp Asn Leu Gln Thr Glu
690 695 700
Thr Val Ile Asn Arg Asp Gly Gln Glu Glu Lys Gln Val Ser Phe Asn
705 710 715 720
Ser Ile Tyr Met Met Ala Asp Ser Gly Ala Arg Gly Ser Ala Ala Gln
725 730 735
Ile Arg Gln Leu Ala Gly Met Arg Gly Leu Met Ala Lys Pro Asp Gly
740 745 750
Ser Ile Ile Glu Thr Pro Ile Thr Ala Asn Phe Arg Glu Gly Leu Asn
755 760 765
Val Leu Gln Tyr Phe Ile Ser Thr His Gly Ala Arg Lys Gly Leu Ala
770 775 780
Asp Thr Ala Leu Lys Thr Ala Asn Ser Gly Tyr Leu Thr Arg Arg Leu
785 790 795 800
Val Asp Val Ala Gln Asp Leu Val Val Thr Glu Asp Asp Cys Gly Thr
805 810 815
His Glu Gly Ile Met Met Thr Pro Val Ile Glu Gly Gly Asp Val Lys
820 825 830
Glu Pro Leu Arg Asp Arg Val Leu Gly Arg Val Thr Ala Glu Asp Val
835 840 845
Leu Lys Pro Gly Thr Ala Asp Ile Leu Val Pro Arg Asn Thr Leu Leu
850 855 860
His Glu Gln Trp Cys Asp Leu Leu Glu Glu Asn Ser Val Asp Ala Val
865 870 875 880
Lys Val Arg Ser Val Val Ser Cys Asp Thr Asp Phe Gly Val Cys Ala
885 890 895
His Cys Tyr Gly Arg Asp Leu Ala Arg Gly His Ile Ile Asn Lys Gly
900 905 910
Glu Ala Ile Gly Val Ile Ala Ala Gln Ser Ile Gly Glu Pro Gly Thr
915 920 925
Gln Leu Thr Met Arg Thr Phe His Ile Gly Gly Ala Ala Ser Arg Ala
930 935 940
Ala Ala Glu Ser Ser Ile Gln Val Lys Asn Lys Gly Ser Ile Lys Leu
945 950 955 960
Ser Asn Val Lys Ser Val Val Asn Ser Ser Gly Lys Leu Val Ile Thr
965 970 975
Ser Arg Asn Thr Glu Leu Lys Leu Ile Asp Glu Phe Gly Arg Thr Lys
980 985 990
Glu Ser Tyr Lys Val Pro Tyr Gly Ala Val Leu Ala Lys Gly Asp Gly
995 1000 1005
Glu Gln Val Ala Gly Gly Glu Thr Val Ala Asn Trp Asp Pro His
1010 1015 1020
Thr Met Pro Val Ile Thr Glu Val Ser Gly Phe Val Arg Phe Thr
1025 1030 1035
Asp Met Ile Asp Gly Gln Thr Ile Thr Arg Gln Thr Asp Glu Leu
1040 1045 1050
Thr Gly Leu Ser Ser Leu Val Val Leu Asp Ser Ala Glu Arg Thr
1055 1060 1065
Ala Gly Gly Lys Asp Leu Arg Pro Ala Leu Lys Ile Val Asp Ala
1070 1075 1080
Gln Gly Asn Asp Val Leu Ile Pro Gly Thr Asp Met Pro Ala Gln
1085 1090 1095
Tyr Phe Leu Pro Gly Lys Ala Ile Val Gln Leu Glu Asp Gly Val
1100 1105 1110
Gln Ile Ser Ser Gly Asp Thr Leu Ala Arg Ile Pro Gln Glu Ser
1115 1120 1125
Gly Gly Thr Lys Asp Ile Thr Gly Gly Leu Pro Arg Val Ala Asp
1130 1135 1140
Leu Phe Glu Ala Arg Arg Pro Lys Glu Pro Ala Ile Leu Ala Glu
1145 1150 1155
Ile Ser Gly Ile Val Ser Phe Gly Lys Glu Thr Lys Gly Lys Arg
1160 1165 1170
Arg Leu Val Ile Thr Pro Val Asp Gly Ser Asp Pro Tyr Glu Glu
1175 1180 1185
Met Ile Pro Lys Trp Arg Gln Leu Asn Val Phe Glu Gly Glu Arg
1190 1195 1200
Val Glu Arg Gly Asp Val Ile Ser Asp Gly Pro Glu Ala Pro His
1205 1210 1215
Asp Ile Leu Arg Leu Arg Gly Val His Ala Val Thr Arg Tyr Ile
1220 1225 1230
Val Asn Glu Val Gln Asp Val Tyr Arg Leu Gln Gly Val Lys Ile
1235 1240 1245
Asn Asp Lys His Ile Glu Val Ile Val Arg Gln Met Leu Arg Lys
1250 1255 1260
Ala Thr Ile Val Asn Ala Gly Ser Ser Asp Phe Leu Glu Gly Glu
1265 1270 1275
Gln Val Glu Tyr Ser Arg Val Lys Ile Ala Asn Arg Glu Leu Glu
1280 1285 1290
Ala Asn Gly Lys Val Gly Ala Thr Tyr Ser Arg Asp Leu Leu Gly
1295 1300 1305
Ile Thr Lys Ala Ser Leu Ala Thr Glu Ser Phe Ile Ser Ala Ala
1310 1315 1320
Ser Phe Gln Glu Thr Thr Arg Val Leu Thr Glu Ala Ala Val Ala
1325 1330 1335
Gly Lys Arg Asp Glu Leu Arg Gly Leu Lys Glu Asn Val Ile Val
1340 1345 1350
Gly Arg Leu Ile Pro Ala Gly Thr Gly Tyr Ala Tyr His Gln Asp
1355 1360 1365
Arg Met Arg Arg Arg Ala Ala Gly Glu Ala Pro Ala Ala Pro Gln
1370 1375 1380
Val Thr Ala Glu Asp Ala Ser Ala Ser Leu Ala Glu Leu Leu Asn
1385 1390 1395
Ala Gly Leu Gly Gly Ser Asp Asn Glu
1400 1405
<210> 29
<211> 720
<212> DNA
<213> 大肠杆菌(Escherichia coli)
<400> 29
atgcaagaga actacaagat tctggtggtc gatgacgaca tgcgcctgcg tgcgctgctg 60
gaacgttatc tcaccgaaca aggcttccag gttcgaagcg tcgctaatgc agaacagatg 120
gatcgcctgc tgactcgtga atctttccat cttatggtac tggatttaat gttacctggt 180
gaagatggct tgtcgatttg ccgacgtctt cgtagtcaga gcaacccgat gccgatcatt 240
atggtgacgg cgaaagggga agaagtggac cgtatcgtag gcctggagat tggcgctgac 300
gactacattc caaaaccgtt taacccgcgt gaactgctgg cccgtatccg tgcggtgctg 360
cgtcgtcagg cgaacgaact gccaggcgca ccgtcacagg aagaggcggt aattgctttc 420
ggtaagttca aacttaacct cggtacgcgc gaaatgttcc gcgaagacga gccgatgccg 480
ctcaccagcg gtgagtttgc ggtactgaag gcactggtca gccatccgcg tgagccgctc 540
tcccgcgata agctgatgaa ccttgcccgt ggtcgtgaat attccgcaat ggaacgctcc 600
atcgacgtgc agatttcgcg tctgcgccgc atggtggaag aagatccagc gcatccgcgt 660
tacattcaga ccgtctgggg tctgggctac gtctttgtac cggacggctc taaagcatga 720
<210> 30
<211> 239
<212> PRT
<213> 大肠杆菌(Escherichia coli)
<400> 30
Met Gln Glu Asn Tyr Lys Ile Leu Val Val Asp Asp Asp Met Arg Leu
1 5 10 15
Arg Ala Leu Leu Glu Arg Tyr Leu Thr Glu Gln Gly Phe Gln Val Arg
20 25 30
Ser Val Ala Asn Ala Glu Gln Met Asp Arg Leu Leu Thr Arg Glu Ser
35 40 45
Phe His Leu Met Val Leu Asp Leu Met Leu Pro Gly Glu Asp Gly Leu
50 55 60
Ser Ile Cys Arg Arg Leu Arg Ser Gln Ser Asn Pro Met Pro Ile Ile
65 70 75 80
Met Val Thr Ala Lys Gly Glu Glu Val Asp Arg Ile Val Gly Leu Glu
85 90 95
Ile Gly Ala Asp Asp Tyr Ile Pro Lys Pro Phe Asn Pro Arg Glu Leu
100 105 110
Leu Ala Arg Ile Arg Ala Val Leu Arg Arg Gln Ala Asn Glu Leu Pro
115 120 125
Gly Ala Pro Ser Gln Glu Glu Ala Val Ile Ala Phe Gly Lys Phe Lys
130 135 140
Leu Asn Leu Gly Thr Arg Glu Met Phe Arg Glu Asp Glu Pro Met Pro
145 150 155 160
Leu Thr Ser Gly Glu Phe Ala Val Leu Lys Ala Leu Val Ser His Pro
165 170 175
Arg Glu Pro Leu Ser Arg Asp Lys Leu Met Asn Leu Ala Arg Gly Arg
180 185 190
Glu Tyr Ser Ala Met Glu Arg Ser Ile Asp Val Gln Ile Ser Arg Leu
195 200 205
Arg Arg Met Val Glu Glu Asp Pro Ala His Pro Arg Tyr Ile Gln Thr
210 215 220
Val Trp Gly Leu Gly Tyr Val Phe Val Pro Asp Gly Ser Lys Ala
225 230 235
<210> 31
<211> 672
<212> DNA
<213> 大肠杆菌(Escherichia coli)
<400> 31
atgcgcgtac tggttgttga agacaatgcg ttgttacgtc accaccttaa agttcagatt 60
caggatgctg gtcatcaggt cgatgacgca gaagatgcca aagaagccga ttattatctc 120
aatgaacata taccggatat tgcgattgtc gatctcggat tgccagacga ggacggtctg 180
tcactgattc gccgctggcg tagcaacgat gtttcactgc cgattctggt attaaccgcc 240
cgtgaaagct ggcaggacaa agtcgaagta ttaagtgccg gtgctgatga ttatgtgact 300
aaaccgtttc atattgaaga ggtgatggcg cgaatgcagg cattaatgcg gcgtaatagc 360
ggtctggctt cacaggtcat ttcgctcccc ccgtttcagg ttgatctctc tcgccgtgaa 420
ttatctatta atgacgaagt gatcaaactg accgcgttcg aatacactat tatggaaacg 480
ttgatacgca ataatggcaa agtggtcagc aaagattcgt taatgctcca actctatccg 540
gatgcggagc tgcgggaaag ccataccatt gatgtactga tgggacgtct gcgcaaaaaa 600
attcaggcac aatatcccca agaagtgatt accaccgttc gcggccaggg ctatctgttc 660
gaattgcgct ga 672
<210> 32
<211> 223
<212> PRT
<213> 大肠杆菌(Escherichia coli)
<400> 32
Met Arg Val Leu Val Val Glu Asp Asn Ala Leu Leu Arg His His Leu
1 5 10 15
Lys Val Gln Ile Gln Asp Ala Gly His Gln Val Asp Asp Ala Glu Asp
20 25 30
Ala Lys Glu Ala Asp Tyr Tyr Leu Asn Glu His Ile Pro Asp Ile Ala
35 40 45
Ile Val Asp Leu Gly Leu Pro Asp Glu Asp Gly Leu Ser Leu Ile Arg
50 55 60
Arg Trp Arg Ser Asn Asp Val Ser Leu Pro Ile Leu Val Leu Thr Ala
65 70 75 80
Arg Glu Ser Trp Gln Asp Lys Val Glu Val Leu Ser Ala Gly Ala Asp
85 90 95
Asp Tyr Val Thr Lys Pro Phe His Ile Glu Glu Val Met Ala Arg Met
100 105 110
Gln Ala Leu Met Arg Arg Asn Ser Gly Leu Ala Ser Gln Val Ile Ser
115 120 125
Leu Pro Pro Phe Gln Val Asp Leu Ser Arg Arg Glu Leu Ser Ile Asn
130 135 140
Asp Glu Val Ile Lys Leu Thr Ala Phe Glu Tyr Thr Ile Met Glu Thr
145 150 155 160
Leu Ile Arg Asn Asn Gly Lys Val Val Ser Lys Asp Ser Leu Met Leu
165 170 175
Gln Leu Tyr Pro Asp Ala Glu Leu Arg Glu Ser His Thr Ile Asp Val
180 185 190
Leu Met Gly Arg Leu Arg Lys Lys Ile Gln Ala Gln Tyr Pro Gln Glu
195 200 205
Val Ile Thr Thr Val Arg Gly Gln Gly Tyr Leu Phe Glu Leu Arg
210 215 220
<210> 33
<211> 12
<212> DNA
<213> 人工序列(Artificial Sequence)
<220>
<223> 引物序列
<400> 33
ctgtctctta ta 12
Claims (65)
1.一种用于产生甲基丙烯酸酯的方法,所述方法包括:
a)在发酵培养基中提供与野生型微生物相比具有增加的对甲基丙烯酸C3-C12酯的耐受性的遗传修饰的微生物,和
b)使所述微生物在产生甲基丙烯酸C3-C12酯的条件下生长。
2.根据权利要求1所述的方法,其中所述微生物当在约37℃在液体培养基中生长时耐受至少20%v/v的甲基丙烯酸C3-C12酯。
3.根据权利要求1或2所述的方法,其中所述微生物是细菌。
4.根据前述权利要求所述的方法,其中所述微生物选自肠杆菌科(Enterobacteriaceae)。
5.根据前述权利要求所述的方法,其中所述细菌选自以下属:埃希氏菌属(Escherichia)、肠杆菌属(Enterobacter)、泛菌属(Pantoea)、克雷伯氏菌属(Klebsiella)、沙雷氏菌属(Serratia)、欧文氏菌属(Erwinia)、沙门氏菌属(Salmonella)、摩根氏菌属(Morganella)、短杆菌属(Brevibacterium)、棒状杆菌属(Corynebacterium)、微杆菌属(Microbacterium)、脂环酸芽孢杆菌属(Alicyclobacillus)、芽孢杆菌属(Bacillus)、氢杆菌属(Hydrogenobacter)、甲烷球菌属(Methanococcus)、醋杆菌属(Acetobacter)、不动杆菌属(Acinetobacter)、土壤杆菌属(Agrobacterium)、固氮根瘤菌属(Axorhizobium)、固氮菌属(Azotobacter)、无形体属(Anaplasma)、拟杆菌属(Bacteroides)、巴尔通体属(Bartonella)、鲍特菌属(Bordetella)、疏螺旋体属(Borrelia)、布鲁氏菌属(Brucella)、伯克霍尔德菌属(Burkholderia)、荚膜杆菌属(Calymmatobacterium)、弯曲菌属(Campylobacter)、衣原体属(Chlamydia)、嗜衣原体属(Chlamydophila)、梭菌属(Clostridium)、柯克斯体属(Coxiella)、贪铜菌属(Cupriavidus)、埃立克体属(Ehrlichia)、肠球菌属(Enterococcus)、弗朗西斯氏菌属(Francisella)、梭杆菌属(Fusobacterium)、加德纳菌属(Gardnerella)、嗜血杆菌属(Haemophilus)、螺杆菌属(Helicobacter)、甲烷杆菌属(Methanobacterium)、微球菌属(Micrococcus)、莫拉菌属(Moraxella)、分支杆菌属(Mycobacterium)、支原体属(Mycoplasma)、奈瑟菌属(Neisseria)、巴斯德菌属(Pasteurella)、消化链球菌属(Peptostreptococcus)、卟啉单胞菌属(Porphyromonas)、普氏菌属(Prevotella)、假单胞菌属(Pseudomonas)、根瘤菌属(Rhizobium)、立克次氏体属(Rickettsia)、罗沙利马体属(Rochalimaea)、罗氏菌属(Rothia)、志贺菌属(Shigella)、葡萄球菌属(Staphylococcus)、寡养单胞菌属(Stenotrophomonas)、链球菌属(Streptococcus)、密螺旋体属(Treponema)、弧菌属(Vibrio)、沃尔巴克氏体属(Wolbachia)、耶尔森菌属(Yersinia)。
6.根据权利要求5所述的方法,其中所述细菌是大肠杆菌(E.coli)。
7.根据前述权利要求所述的方法,其中所述遗传修饰的微生物在编码调节氧化应激响应和细菌多药耐药系统或形成氧化应激响应和细菌多药耐药系统的一部分的蛋白的一种或更多种核酸中包含突变。
8.根据前述权利要求所述的方法,其中所述遗传修饰的微生物在一种或更多种以下核酸序列中包含突变:soxR核酸序列、acrR核酸序列、rob核酸序列和/或marR核酸序列。
9.根据权利要求8所述的方法,其中所述soxR核酸序列包含SEQ ID NO.1或其同源物,所述acrR核酸序列包含SEQ ID NO.2或其同源物,所述rob核酸序列包含SEQ ID NO.5或其同源物,并且所述marR核酸序列包含SEQ ID NO.7或其同源物。
10.根据权利要求8或9所述的方法,其中突变体soxR核酸序列编码在DNA结合结构域或FE-S簇结构域中具有突变的突变体SoxR蛋白。
11.根据权利要求10所述的方法,其中所述突变选自以下之一:根据SEQ ID NO.2中列出的SoxR中R20被另一种氨基酸的取代、R20被另一种氨基酸的取代、残基146的缺失或残基139处的截短。
12.根据权利要求权利要求8至11中任一项所述的方法,其中突变体acrR核酸序列编码在DNA结合结构域或配体结合结构域中具有突变的突变体AcrR蛋白。
13.根据权利要求12所述的方法,其中所述突变选自以下之一:根据SEQ ID NO.3中列出的AcrR中V29被另一种氨基酸的取代,T32、A191或位置49处的移码突变。
14.根据权利要求8至13所述的方法,其中突变体rob核酸序列编码在N末端DNA结合结构域或C末端结构域中具有突变的突变体Rob蛋白。
15.根据权利要求14所述的方法,其中所述突变选自以下之一:根据SEQ ID NO.5中列出的Rob中A70被另一种氨基酸的取代或R156被另一种氨基酸的取代。
16.根据权利要求8至15中任一项所述的方法,其中突变体marR核酸序列编码在DNA结合结构域中具有突变的MarR蛋白。
17.根据权利要求16所述的方法,其中所述突变是根据SEQ ID NO.7中列出的MarR中V84被另一种氨基酸的取代。
18.根据前述权利要求所述的方法,其中所述遗传修饰的微生物在acrR核酸序列和一个或更多个以下列核酸序列中包含突变:soxR核酸序列、rob核酸序列和/或marR核酸序列。
19.根据前述权利要求所述的方法,其中所述遗传修饰的生物体包含一个或更多个另外的突变,例如选自表1a或1b的突变。
20.根据权利要求19所述的方法,其中所述一个或更多个另外的突变处于编码调节氧化应激响应和细菌多药耐药系统或形成氧化应激响应和细菌多药耐药系统的一部分的蛋白的核酸中。
21.根据前述权利要求所述的方法,其中所述微生物还包含编码甲基丙烯酸C3-C12酯生物合成途径的酶的载体或构建体。
22.根据前述权利要求所述的方法,其中所述微生物还包含增强甲基丙烯酸C3-C12酯的产生的载体或构建体。
23.根据前述权利要求所述的方法,所述方法包括从所述发酵培养基中取出所述甲基丙烯酸C3-C12酯和使所取出的甲基丙烯酸C3-C12酯与甲醇进行酯交换的步骤。
24.根据前述权利要求所述的方法,其中所述甲基丙烯酸C3-C12酯是BMA。
25.一种用于增加微生物对甲基丙烯酸C3-C12酯的耐受性的方法,所述方法包括将突变引入编码调节氧化应激响应和细菌多药耐药系统或形成氧化应激响应和细菌多药耐药系统的一部分的蛋白的核酸中。
26.一种用于使微生物在甲基丙烯酸C3-C12酯存在下生长或维持的方法,所述方法包括在产生甲基丙烯酸C3-C12酯的条件下,在发酵培养基中提供与野生型微生物相比具有增加的对甲基丙烯酸C3-C12酯的耐受性的遗传修饰的微生物。
27.一种发酵培养基,所述发酵培养基包含甲基丙烯酸C3-C12酯和与野生型微生物相比具有增加的对甲基丙烯酸C3-C12酯的耐受性的遗传修饰的微生物。
28.与野生型微生物相比具有增加的对甲基丙烯酸C3-C12酯的耐受性的遗传修饰的微生物用于产生甲基丙烯酸C3-C12酯的用途。
29.根据权利要求28所述的用途,其中所述微生物选自肠杆菌科。
30.根据权利要求28或29所述的用途,其中所述细菌选自以下属:埃希氏菌属、肠杆菌属、泛菌属、克雷伯氏菌属、沙雷氏菌属、欧文氏菌属、沙门氏菌属、摩根氏菌属、短杆菌属、棒状杆菌属、微杆菌属、脂环酸芽孢杆菌属、芽孢杆菌属、氢杆菌属、甲烷球菌属、醋杆菌属、不动杆菌属、土壤杆菌属、固氮根瘤菌属、固氮菌属、无形体属、拟杆菌属、巴尔通体属、鲍特菌属、疏螺旋体属、布鲁氏菌属、伯克霍尔德菌属、荚膜杆菌属、弯曲菌属、衣原体属、嗜衣原体属、梭菌属、柯克斯体属、贪铜菌属、埃立克体属、肠球菌属、弗朗西斯氏菌属、梭杆菌属、加德纳菌属、嗜血杆菌属、螺杆菌属、甲烷杆菌属、微球菌属、莫拉菌属、分支杆菌属、支原体属、奈瑟菌属、巴斯德菌属、消化链球菌属、卟啉单胞菌属、普氏菌属、假单胞菌属、根瘤菌属、立克次氏体属、罗沙利马体属、罗氏菌属、志贺菌属、葡萄球菌属、寡养单胞菌属、链球菌属、密螺旋体属、弧菌属、沃尔巴克氏体属、耶尔森菌属。
31.根据权利要求30所述的用途,其中所述细菌是大肠杆菌。
32.根据权利要求28至31中任一项所述的用途,其中遗传修饰的微生物在编码调节氧化应激响应和细菌多药耐药系统或形成氧化应激响应和细菌多药耐药系统的一部分的蛋白的一种或更多种核酸中包含突变。
33.根据权利要求28至31中任一项所述的用途,其中所述遗传修饰的微生物在soxR核酸序列、acrR核酸序列、rob核酸序列和/或marR核酸序列中包含突变。
34.根据权利要求33所述的用途,其中所述soxR核酸序列包含SEQ ID NO.1或其同源物,所述acrR核酸序列包含SEQ ID NO.2或其同源物,所述rob核酸序列包含SEQ ID NO.5或其同源物,并且所述marR核酸序列包含SEQ ID NO.7或其同源物。
35.根据权利要求34所述的用途,其中突变体soxR核酸序列编码在DNA结合结构域或FE-S簇结构域中具有突变的突变体SoxR蛋白。
36.根据权利要求35所述的用途,其中所述突变选自以下之一:根据SEQ ID NO.2中列出的SoxR中R20被另一种氨基酸的取代、R20被另一种氨基酸的取代、残基146的缺失或残基139处的截短。
37.根据权利要求33至36中任一项所述的用途,其中突变体acrR核酸序列编码在DNA结合结构域或配体结合结构域中具有突变的突变体AcrR蛋白。
38.根据权利要求37所述的用途,其中所述突变选自以下之一:根据SEQ ID NO.3中列出的AcrR中V29被另一种氨基酸的取代,T32、A191或位置49处的移码突变。
39.根据权利要求33至38中任一项所述的用途,其中突变体rob核酸序列编码在N末端DNA结合结构域或C末端结构域中具有突变的突变体Rob蛋白。
40.根据权利要求39所述的用途,其中所述突变是根据SEQ ID NO.5中列出的Rob中A70被另一种氨基酸的取代或R156被另一种氨基酸的取代。
41.根据权利要求33至40中任一项所述的用途,其中突变体marR核酸序列编码在DNA结合结构域中具有突变的MarR蛋白。
42.根据权利要求41所述的用途,其中所述突变是根据SEQ ID NO.7中列出的MarR中V84被另一种氨基酸的取代。
43.根据权利要求28至42中任一项所述的用途,其中所述遗传修饰的生物体包含一个或更多个另外的突变。
44.根据权利要求43所述的用途,其中所述一个或更多个另外的突变处于编码调节氧化应激响应和细菌多药耐药系统或形成氧化应激响应和细菌多药耐药系统的一部分的蛋白的核酸中。
45.一种遗传修饰的微生物,其中所述微生物包含以下突变之一:a)编码其中V84被另一种氨基酸取代的突变体蛋白的marR核酸序列中的突变;b)编码其中A70或R156被另一种氨基酸取代的突变体蛋白的Rob核酸序列中的突变;c)编码其中R20被另一种氨基酸取代、残基146被缺失或蛋白在残基139处被截短的突变体蛋白的soxR核酸序列中的突变;或d)编码在AcrR中T32、A191或位置49处的移码突变的突变体蛋白的acrR核酸序列中的突变;或e)选自以下的两种或更多种核酸中的突变:soxR核酸或其同源物、acrR核酸或其同源物、marR核酸或其同源物和/或rob核酸或其同源物,其中所述核酸编码突变体蛋白。
46.根据权利要求45所述的遗传修饰的微生物,其中所述微生物选自肠杆菌科。
47.根据权利要求45或46所述的遗传修饰的微生物,其中所述细菌选自以下属:埃希氏菌属、肠杆菌属、泛菌属、克雷伯氏菌属、沙雷氏菌属、欧文氏菌属、沙门氏菌属、摩根氏菌属、短杆菌属、棒状杆菌属、微杆菌属、脂环酸芽孢杆菌属、芽孢杆菌属、氢杆菌属、甲烷球菌属、醋杆菌属、不动杆菌属、土壤杆菌属、固氮根瘤菌属、固氮菌属、无形体属、拟杆菌属、巴尔通体属、鲍特菌属、疏螺旋体属、布鲁氏菌属、伯克霍尔德菌属、荚膜杆菌属、弯曲菌属、衣原体属、嗜衣原体属、梭菌属、柯克斯体属、贪铜菌属、埃立克体属、肠球菌属、弗朗西斯氏菌属、梭杆菌属、加德纳菌属、嗜血杆菌属、螺杆菌属、甲烷杆菌属、微球菌属、莫拉菌属、分支杆菌属、支原体属、奈瑟菌属、巴斯德菌属、消化链球菌属、卟啉单胞菌属、普氏菌属、假单胞菌属、根瘤菌属、立克次氏体属、罗沙利马体属、罗氏菌属、志贺菌属、葡萄球菌属、寡养单胞菌属、链球菌属、密螺旋体属、弧菌属、沃尔巴克氏体属、耶尔森菌属。
48.根据权利要求47所述的遗传修饰的微生物,其中所述细菌是大肠杆菌。
49.根据权利要求45至48中任一项所述的遗传修饰的微生物,其中所述遗传修饰的微生物在编码调节氧化应激响应和细菌多药耐药系统或形成氧化应激响应和细菌多药耐药系统的一部分的蛋白的一种或更多种核酸中包含突变。
50.根据权利要求45至49中任一项所述的遗传修饰的微生物,其中所述遗传修饰的微生物在soxR核酸序列、acrR核酸序列、rob核酸序列和/或marR核酸序列中包含突变。
51.根据权利要求50所述的遗传修饰的微生物,其中所述soxR核酸序列包含SEQ IDNO.1或其同源物,所述acrR核酸序列包含SEQ ID NO.2或其同源物,所述rob核酸序列包含SEQ ID NO.5或其同源物,并且所述marR核酸序列包含SEQ ID NO.7或其同源物。
52.根据权利要求50或51所述的遗传修饰的微生物,其中突变体soxR核酸序列编码在DNA结合结构域或FE-S簇结构域中具有突变的突变体SoxR蛋白。
53.根据权利要求52所述的遗传修饰的微生物,其中所述突变选自以下之一:根据SEQID NO.2中列出的SoxR中R20被另一种氨基酸的取代、R20被另一种氨基酸的取代、残基146的缺失或残基139处的截短。
54.根据权利要求50至53中任一项所述的遗传修饰的微生物,其中所述突变体acrR核酸序列编码在DNA结合结构域或配体结合结构域中具有突变的突变体AcrR蛋白。
55.根据权利要求54所述的遗传修饰的微生物,其中所述突变选自以下之一:根据SEQID NO.3中列出的AcrR中V29被另一种氨基酸的取代,T32、A191或位置49处的移码突变。
56.根据权利要求50至55中任一项所述的遗传修饰的微生物,其中所述突变体rob核酸序列编码在N末端DNA结合结构域或C末端结构域中具有突变的突变体Rob蛋白。
57.根据权利要求56所述的遗传修饰的微生物,其中所述突变是根据SEQ ID NO.5中列出的Rob中A70被另一种氨基酸的取代或R156被另一种氨基酸的取代。
58.根据权利要求50至57中任一项所述的遗传修饰的微生物,其中所述突变体marR核酸序列编码在DNA结合结构域中具有突变的MarR蛋白。
59.根据权利要求58所述的遗传修饰的微生物,其中所述突变是根据SEQ ID NO.7中列出的MarR中V84被另一种氨基酸的取代。
60.根据权利要求45至59中任一项所述的遗传修饰的微生物,其中所述遗传修饰的生物体包含选自表1a或1b的一个或更多个突变。
61.根据权利要求60所述的遗传修饰的微生物,其中所述一个或更多个另外的突变处于编码调节氧化应激响应和细菌多药耐药系统或形成所述氧化应激响应和所述细菌多药耐药系统的一部分的蛋白的核酸中。
62.根据权利要求45至61中任一项所述的遗传修饰的微生物,所述遗传修饰的微生物还包含增强甲基丙烯酸C3-C12酯的产生的遗传构建体或载体。
63.根据权利要求45至62中任一项所述的分离的遗传修饰的微生物在产生甲基丙烯酸C3-C12酯中的用途。
64.一种用于分离甲基丙烯酸酯耐受性微生物的方法,所述方法包括:
a)在发酵培养基中提供微生物
b)使所述微生物与甲基丙烯酸酯接触;和
c)分离步骤(b)的存活微生物
其中所述存活微生物当在约37℃在液体培养基中生长时耐受至少20%v/v的甲基丙烯酸酯。
65.一种甲基丙烯酸酯耐受性微生物,所述甲基丙烯酸酯耐受性微生物通过权利要求64所述的方法获得或能够获得。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GBGB1808424.4A GB201808424D0 (en) | 2018-05-23 | 2018-05-23 | Methods for producing BMA and MMA using genetically modified microorganisms |
GB1808424.4 | 2018-05-23 | ||
PCT/GB2019/051427 WO2019224548A2 (en) | 2018-05-23 | 2019-05-23 | Methods for the production of methacrylates |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112469813A true CN112469813A (zh) | 2021-03-09 |
Family
ID=62812251
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201980049273.6A Pending CN112469813A (zh) | 2018-05-23 | 2019-05-23 | 用于产生甲基丙烯酸酯的方法 |
Country Status (12)
Country | Link |
---|---|
US (1) | US11781160B2 (zh) |
EP (1) | EP3797154A2 (zh) |
JP (1) | JP2021525085A (zh) |
KR (1) | KR20210015881A (zh) |
CN (1) | CN112469813A (zh) |
BR (1) | BR112020023467A2 (zh) |
CA (1) | CA3098529A1 (zh) |
GB (1) | GB201808424D0 (zh) |
PH (1) | PH12020551791A1 (zh) |
SG (1) | SG11202010677XA (zh) |
TW (1) | TW202003856A (zh) |
WO (1) | WO2019224548A2 (zh) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110229828B (zh) * | 2019-05-31 | 2022-03-15 | 天津大学 | 大肠杆菌全局调控因子的突变基因soxR及应用 |
WO2023141347A2 (en) * | 2022-01-24 | 2023-07-27 | Gusto Global, Llc | Single-loci and multi-loci targeted single point amplicon fragment sequencing |
GB2621337A (en) * | 2022-08-08 | 2024-02-14 | Mitsubishi Chemical Uk Ltd | Process for the biological production of methacrylic acid and derivatives thereof |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1999017607A2 (en) * | 1997-10-02 | 1999-04-15 | Trustees Of Tufts College | Methods and compositions for reducing bacterial tolerance of disinfectants and organic solvents |
CN103930558A (zh) * | 2011-04-01 | 2014-07-16 | 基因组股份公司 | 用于生产甲基丙烯酸和甲基丙烯酸酯类的微生物以及与之相关的方法 |
US20150191756A1 (en) * | 2012-09-10 | 2015-07-09 | Mitsubishi Rayon Co., Ltd | Method for producing methacrylic acid and/or ester thereof |
CN108026548A (zh) * | 2015-05-19 | 2018-05-11 | 卢塞特英国国际有限公司 | 生物制备甲基丙烯酸及其衍生物的方法 |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2688292C (en) | 2007-06-01 | 2015-01-27 | Evonik Roehm Gmbh | A process for preparing methacrylic acid or methacrylic esters |
PL2816112T3 (pl) | 2009-12-10 | 2019-03-29 | Regents Of The University Of Minnesota | Modyfikacja DNA za pośrednictwem efektorów TAL |
KR101543833B1 (ko) | 2012-06-26 | 2015-08-11 | 이화여자대학교 산학협력단 | 유기용매에 대한 저항성이 향상된 균주 및 이의 용도 |
US8697359B1 (en) | 2012-12-12 | 2014-04-15 | The Broad Institute, Inc. | CRISPR-Cas systems and methods for altering expression of gene products |
CN105705650A (zh) | 2013-08-28 | 2016-06-22 | 英威达技术有限责任公司 | 用于生物合成异丁烯酸盐的方法 |
GB201619827D0 (en) * | 2016-11-23 | 2017-01-04 | Lucite Int Uk Ltd | Process for the production of methyl methacrylate |
-
2018
- 2018-05-23 GB GBGB1808424.4A patent/GB201808424D0/en not_active Ceased
-
2019
- 2019-05-23 EP EP19730430.6A patent/EP3797154A2/en active Pending
- 2019-05-23 JP JP2020565467A patent/JP2021525085A/ja active Pending
- 2019-05-23 CN CN201980049273.6A patent/CN112469813A/zh active Pending
- 2019-05-23 SG SG11202010677XA patent/SG11202010677XA/en unknown
- 2019-05-23 CA CA3098529A patent/CA3098529A1/en active Pending
- 2019-05-23 BR BR112020023467-2A patent/BR112020023467A2/pt unknown
- 2019-05-23 US US17/057,051 patent/US11781160B2/en active Active
- 2019-05-23 WO PCT/GB2019/051427 patent/WO2019224548A2/en unknown
- 2019-05-23 KR KR1020207037043A patent/KR20210015881A/ko not_active Application Discontinuation
- 2019-05-23 TW TW108117803A patent/TW202003856A/zh unknown
-
2020
- 2020-10-27 PH PH12020551791A patent/PH12020551791A1/en unknown
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1999017607A2 (en) * | 1997-10-02 | 1999-04-15 | Trustees Of Tufts College | Methods and compositions for reducing bacterial tolerance of disinfectants and organic solvents |
CN103930558A (zh) * | 2011-04-01 | 2014-07-16 | 基因组股份公司 | 用于生产甲基丙烯酸和甲基丙烯酸酯类的微生物以及与之相关的方法 |
US20150191756A1 (en) * | 2012-09-10 | 2015-07-09 | Mitsubishi Rayon Co., Ltd | Method for producing methacrylic acid and/or ester thereof |
CN108026548A (zh) * | 2015-05-19 | 2018-05-11 | 卢塞特英国国际有限公司 | 生物制备甲基丙烯酸及其衍生物的方法 |
Non-Patent Citations (1)
Title |
---|
REI WATANABE ET AL.: "Contributions of mutations in acrR and marR genes to organic solvent tolerance in Escherichia coli", 《AMB EXPRESS》, vol. 2, no. 1, pages 1 * |
Also Published As
Publication number | Publication date |
---|---|
GB201808424D0 (en) | 2018-07-11 |
JP2021525085A (ja) | 2021-09-24 |
TW202003856A (zh) | 2020-01-16 |
BR112020023467A2 (pt) | 2021-03-30 |
PH12020551791A1 (en) | 2021-06-21 |
CA3098529A1 (en) | 2019-11-28 |
KR20210015881A (ko) | 2021-02-10 |
US20210207177A1 (en) | 2021-07-08 |
SG11202010677XA (en) | 2020-11-27 |
US11781160B2 (en) | 2023-10-10 |
EP3797154A2 (en) | 2021-03-31 |
WO2019224548A2 (en) | 2019-11-28 |
WO2019224548A3 (en) | 2019-12-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109072245B (zh) | 用于c1固定菌的crispr/cas系统 | |
US11753660B2 (en) | Process for the biological production of methacrylic acid and derivatives thereof | |
CN107406821B (zh) | 用于生产3-羟基丙酸的突变宿主细胞 | |
CN104718282A (zh) | 用于生产脂肪酸和脂肪酸衍生产物的微生物及方法 | |
CN113748211A (zh) | 用于4-羟甲基糠醛及其衍生物的体内合成的方法 | |
US11781160B2 (en) | Methods for the production of methacrylates | |
US20200318146A1 (en) | Metabolic engineering for simultaneous consumption of xylose and glucose for production of chemicals from second generation sugars | |
WO2019113463A1 (en) | Improving algal lipid productivity via genetic modification of a tpr domain containing protein | |
CN117693588A (zh) | 用于改进乙二醇的生物产生的微生物和方法 | |
KR20230156819A (ko) | 아세토락테이트 탈카복실화효소 유전자위에 녹인이 있는 미생물 | |
KR102683624B1 (ko) | 기능적 dna 서열의 안정화된 카피 수를 갖는 미생물 및 관련 방법 | |
JP6723002B2 (ja) | 油脂の製造方法 | |
US12098168B2 (en) | XylR mutant for improved xylose utilization or improved co-utilization of glucose and xylose preliminary | |
JP5737650B2 (ja) | アセトイン産生細胞および当該細胞を用いたアセトインの製造方法 | |
GB2621337A (en) | Process for the biological production of methacrylic acid and derivatives thereof | |
CN117836419A (zh) | 转导免疫细胞的方法 | |
KR20200009462A (ko) | 레귤론 변이를 통한 지방산 및 지방산 유도체 생산에서 불포화도 감소 | |
BR112017024654B1 (pt) | Processo de produzir ácido metacrílico, microorganismo recombinante, processo de fermentação, e, método para preparar polímeros ou copolímeros de ácido metacrílico ou ésteres do ácido metacrílico | |
BR122024008251A2 (pt) | Processo de produzir ácido metacrílico, microorganismo recombinante, processo de fermentação, método para preparar polímeros ou copolímeros de ácido metacrílico ou ésteres do ácido metacrílico |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information | ||
CB02 | Change of applicant information |
Address after: Bai Jinhan Applicant after: Mitsubishi Chemical UK Ltd. Address before: Durham, England Applicant before: Mitsubishi Chemical UK Ltd. |