CN117858952A - 编辑香蕉基因的方法 - Google Patents
编辑香蕉基因的方法 Download PDFInfo
- Publication number
- CN117858952A CN117858952A CN202280056874.1A CN202280056874A CN117858952A CN 117858952 A CN117858952 A CN 117858952A CN 202280056874 A CN202280056874 A CN 202280056874A CN 117858952 A CN117858952 A CN 117858952A
- Authority
- CN
- China
- Prior art keywords
- banana
- leu
- ala
- gly
- val
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 235000018290 Musa x paradisiaca Nutrition 0.000 title claims abstract description 353
- 238000000034 method Methods 0.000 title claims abstract description 227
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 91
- 240000005561 Musa balbisiana Species 0.000 title 1
- 241000234295 Musa Species 0.000 claims abstract description 388
- 108010000700 Acetolactate synthase Proteins 0.000 claims abstract description 362
- 241000196324 Embryophyta Species 0.000 claims abstract description 240
- 230000035772 mutation Effects 0.000 claims abstract description 238
- 210000002257 embryonic structure Anatomy 0.000 claims abstract description 151
- 240000000905 Nymphoides indica Species 0.000 claims abstract description 78
- 235000017590 Nymphoides indica Nutrition 0.000 claims abstract description 78
- 210000004027 cell Anatomy 0.000 claims description 286
- 239000003112 inhibitor Substances 0.000 claims description 175
- 239000005496 Chlorsulfuron Substances 0.000 claims description 145
- VJYIFXVZLXQVHO-UHFFFAOYSA-N chlorsulfuron Chemical group COC1=NC(C)=NC(NC(=O)NS(=O)(=O)C=2C(=CC=CC=2)Cl)=N1 VJYIFXVZLXQVHO-UHFFFAOYSA-N 0.000 claims description 145
- 210000001161 mammalian embryo Anatomy 0.000 claims description 133
- 101000664887 Homo sapiens Superoxide dismutase [Cu-Zn] Proteins 0.000 claims description 100
- 102100038836 Superoxide dismutase [Cu-Zn] Human genes 0.000 claims description 100
- 239000002609 medium Substances 0.000 claims description 82
- 108020005004 Guide RNA Proteins 0.000 claims description 80
- 101000776160 Homo sapiens Alsin Proteins 0.000 claims description 80
- 239000003795 chemical substances by application Substances 0.000 claims description 80
- 102100032047 Alsin Human genes 0.000 claims description 73
- 239000006285 cell suspension Substances 0.000 claims description 66
- 150000007523 nucleic acids Chemical class 0.000 claims description 57
- 230000000408 embryogenic effect Effects 0.000 claims description 56
- 102000039446 nucleic acids Human genes 0.000 claims description 54
- 108020004707 nucleic acids Proteins 0.000 claims description 54
- 101150001232 ALS gene Proteins 0.000 claims description 51
- 108010042407 Endonucleases Proteins 0.000 claims description 50
- 102000004533 Endonucleases Human genes 0.000 claims description 50
- 230000009466 transformation Effects 0.000 claims description 49
- 210000001938 protoplast Anatomy 0.000 claims description 45
- 238000012258 culturing Methods 0.000 claims description 36
- 230000013020 embryo development Effects 0.000 claims description 36
- 241000589158 Agrobacterium Species 0.000 claims description 35
- 238000006467 substitution reaction Methods 0.000 claims description 29
- 230000008685 targeting Effects 0.000 claims description 26
- 235000013399 edible fruits Nutrition 0.000 claims description 24
- 238000003205 genotyping method Methods 0.000 claims description 18
- 101150100395 ALS1 gene Proteins 0.000 claims description 17
- 101150036028 ALS2 gene Proteins 0.000 claims description 17
- 101001091423 Agaricus bisporus Polyphenol oxidase 2 Proteins 0.000 claims description 17
- 101000611523 Arabidopsis thaliana Protoporphyrinogen oxidase 2, chloroplastic/mitochondrial Proteins 0.000 claims description 17
- 239000003550 marker Substances 0.000 claims description 17
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 17
- 229940100389 Sulfonylurea Drugs 0.000 claims description 15
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 15
- 108090001025 Iron regulatory protein 1 Proteins 0.000 claims description 14
- 230000001404 mediated effect Effects 0.000 claims description 14
- 102000004901 Iron regulatory protein 1 Human genes 0.000 claims description 13
- 239000003153 chemical reaction reagent Substances 0.000 claims description 12
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 12
- YROXIXLRRCOBKF-UHFFFAOYSA-N sulfonylurea Chemical class OC(=N)N=S(=O)=O YROXIXLRRCOBKF-UHFFFAOYSA-N 0.000 claims description 12
- 239000002245 particle Substances 0.000 claims description 10
- 102220189664 rs116596858 Human genes 0.000 claims description 10
- 238000001890 transfection Methods 0.000 claims description 10
- 230000004927 fusion Effects 0.000 claims description 9
- 229920001184 polypeptide Polymers 0.000 claims description 9
- CAAMSDWKXXPUJR-UHFFFAOYSA-N 3,5-dihydro-4H-imidazol-4-one Chemical compound O=C1CNC=N1 CAAMSDWKXXPUJR-UHFFFAOYSA-N 0.000 claims description 7
- 102100026846 Cytidine deaminase Human genes 0.000 claims description 7
- 108010031325 Cytidine deaminase Proteins 0.000 claims description 7
- 238000003306 harvesting Methods 0.000 claims description 7
- -1 pyrimidinyloxybenzoate (pyrimidinyl oxybenzoate) Chemical compound 0.000 claims description 7
- 102200058601 rs1800566 Human genes 0.000 claims description 6
- 238000004520 electroporation Methods 0.000 claims description 5
- PRZRAMLXTKZUHF-UHFFFAOYSA-N 5-oxo-n-sulfonyl-4h-triazole-1-carboxamide Chemical compound O=S(=O)=NC(=O)N1N=NCC1=O PRZRAMLXTKZUHF-UHFFFAOYSA-N 0.000 claims description 4
- 102100040958 Aconitate hydratase, mitochondrial Human genes 0.000 claims description 4
- 108010052875 Adenine deaminase Proteins 0.000 claims description 4
- 101000965314 Homo sapiens Aconitate hydratase, mitochondrial Proteins 0.000 claims description 4
- 101000799318 Homo sapiens Long-chain-fatty-acid-CoA ligase 1 Proteins 0.000 claims description 4
- 101000780205 Homo sapiens Long-chain-fatty-acid-CoA ligase 5 Proteins 0.000 claims description 4
- 101000780202 Homo sapiens Long-chain-fatty-acid-CoA ligase 6 Proteins 0.000 claims description 4
- 102100033995 Long-chain-fatty-acid-CoA ligase 1 Human genes 0.000 claims description 4
- 102100034337 Long-chain-fatty-acid-CoA ligase 6 Human genes 0.000 claims description 4
- 235000013305 food Nutrition 0.000 claims description 4
- 239000001963 growth medium Substances 0.000 claims description 4
- 239000002105 nanoparticle Substances 0.000 claims description 4
- 238000012545 processing Methods 0.000 claims description 4
- 101001091417 Agaricus bisporus Polyphenol oxidase 1 Proteins 0.000 claims description 2
- 101000611540 Agaricus bisporus Polyphenol oxidase 4 Proteins 0.000 claims description 2
- 101100353062 Anopheles gambiae PPO8 gene Proteins 0.000 claims description 2
- 101000611521 Arabidopsis thaliana Protoporphyrinogen oxidase 1, chloroplastic Proteins 0.000 claims description 2
- YWBFPKPWMSWWEA-UHFFFAOYSA-O triazolopyrimidine Chemical compound BrC1=CC=CC(C=2N=C3N=CN[N+]3=C(NCC=3C=CN=CC=3)C=2)=C1 YWBFPKPWMSWWEA-UHFFFAOYSA-O 0.000 claims description 2
- 108020001991 Protoporphyrinogen Oxidase Proteins 0.000 claims 1
- 108020004414 DNA Proteins 0.000 description 116
- 210000001519 tissue Anatomy 0.000 description 91
- 230000009261 transgenic effect Effects 0.000 description 67
- 239000013612 plasmid Substances 0.000 description 52
- 108091034117 Oligonucleotide Proteins 0.000 description 47
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 30
- 230000014509 gene expression Effects 0.000 description 30
- 230000001965 increasing effect Effects 0.000 description 30
- 108020004705 Codon Proteins 0.000 description 25
- 239000002773 nucleotide Substances 0.000 description 25
- 125000003729 nucleotide group Chemical group 0.000 description 24
- 235000021015 bananas Nutrition 0.000 description 22
- 238000011161 development Methods 0.000 description 21
- 239000012882 rooting medium Substances 0.000 description 21
- 108010061238 threonyl-glycine Proteins 0.000 description 21
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 20
- 230000000694 effects Effects 0.000 description 20
- 230000012010 growth Effects 0.000 description 20
- 108010031396 Catechol oxidase Proteins 0.000 description 18
- 108700028369 Alleles Proteins 0.000 description 17
- 102000030523 Catechol oxidase Human genes 0.000 description 17
- 241001528553 Malus asiatica Species 0.000 description 17
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 17
- 230000010354 integration Effects 0.000 description 17
- 239000013615 primer Substances 0.000 description 17
- 230000001488 breeding effect Effects 0.000 description 16
- 230000018109 developmental process Effects 0.000 description 16
- 238000009395 breeding Methods 0.000 description 15
- 239000000203 mixture Substances 0.000 description 15
- 238000005507 spraying Methods 0.000 description 15
- 108010010888 1-aminocyclopropane-1-carboxylic acid oxidase Proteins 0.000 description 14
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 14
- 239000004009 herbicide Substances 0.000 description 14
- 108010057821 leucylproline Proteins 0.000 description 14
- 238000011529 RT qPCR Methods 0.000 description 13
- 244000061456 Solanum tuberosum Species 0.000 description 13
- 235000002595 Solanum tuberosum Nutrition 0.000 description 13
- 108010078144 glutaminyl-glycine Proteins 0.000 description 13
- 102000040430 polynucleotide Human genes 0.000 description 13
- 108091033319 polynucleotide Proteins 0.000 description 13
- 239000002157 polynucleotide Substances 0.000 description 13
- 239000000523 sample Substances 0.000 description 13
- 235000003805 Musa ABB Group Nutrition 0.000 description 12
- 235000015266 Plantago major Nutrition 0.000 description 12
- 108700019146 Transgenes Proteins 0.000 description 12
- 241000607479 Yersinia pestis Species 0.000 description 12
- 235000001014 amino acid Nutrition 0.000 description 12
- 238000002474 experimental method Methods 0.000 description 12
- 108010050848 glycylleucine Proteins 0.000 description 12
- 108010054155 lysyllysine Proteins 0.000 description 12
- 108091093088 Amplicon Proteins 0.000 description 11
- 244000292211 Canna coccinea Species 0.000 description 11
- 102000004190 Enzymes Human genes 0.000 description 11
- 108090000790 Enzymes Proteins 0.000 description 11
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 11
- 101100288095 Klebsiella pneumoniae neo gene Proteins 0.000 description 11
- 108091027544 Subgenomic mRNA Proteins 0.000 description 11
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 11
- 108010005233 alanylglutamic acid Proteins 0.000 description 11
- 229940088598 enzyme Drugs 0.000 description 11
- 102000004169 proteins and genes Human genes 0.000 description 11
- 108010030526 1-aminocyclopropanecarboxylate synthase Proteins 0.000 description 10
- 241000219194 Arabidopsis Species 0.000 description 10
- 108091033409 CRISPR Proteins 0.000 description 10
- 235000005273 Canna coccinea Nutrition 0.000 description 10
- 238000006243 chemical reaction Methods 0.000 description 10
- 108010089804 glycyl-threonine Proteins 0.000 description 10
- 239000000463 material Substances 0.000 description 10
- 241000894007 species Species 0.000 description 10
- 238000012360 testing method Methods 0.000 description 10
- 240000003768 Solanum lycopersicum Species 0.000 description 9
- 108010008355 arginyl-glutamine Proteins 0.000 description 9
- 238000010362 genome editing Methods 0.000 description 9
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 9
- 235000018102 proteins Nutrition 0.000 description 9
- 239000013598 vector Substances 0.000 description 9
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 8
- 108010051330 Arg-Pro-Gly-Pro Proteins 0.000 description 8
- 102000053602 DNA Human genes 0.000 description 8
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 8
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 8
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 8
- 240000007594 Oryza sativa Species 0.000 description 8
- 235000007164 Oryza sativa Nutrition 0.000 description 8
- 208000026487 Triploidy Diseases 0.000 description 8
- OJOBTAOGJIWAGB-UHFFFAOYSA-N acetosyringone Chemical compound COC1=CC(C(C)=O)=CC(OC)=C1O OJOBTAOGJIWAGB-UHFFFAOYSA-N 0.000 description 8
- 108010049041 glutamylalanine Proteins 0.000 description 8
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 8
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 8
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 8
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 8
- 230000004048 modification Effects 0.000 description 8
- 238000012986 modification Methods 0.000 description 8
- 108010031719 prolyl-serine Proteins 0.000 description 8
- 230000008929 regeneration Effects 0.000 description 8
- 238000011069 regeneration method Methods 0.000 description 8
- 235000009566 rice Nutrition 0.000 description 8
- 230000005783 single-strand break Effects 0.000 description 8
- 239000007921 spray Substances 0.000 description 8
- 230000035882 stress Effects 0.000 description 8
- 239000000725 suspension Substances 0.000 description 8
- 108010037335 tyrosyl-prolyl-glycyl-glycine Proteins 0.000 description 8
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 7
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 7
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 7
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 7
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 7
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 7
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 7
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 7
- GETCJHFFECHWHI-QXEWZRGKSA-N Met-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCSC)N GETCJHFFECHWHI-QXEWZRGKSA-N 0.000 description 7
- 108010079364 N-glycylalanine Proteins 0.000 description 7
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 7
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 7
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 7
- 108010077515 glycylproline Proteins 0.000 description 7
- 230000002363 herbicidal effect Effects 0.000 description 7
- 108010034529 leucyl-lysine Proteins 0.000 description 7
- 230000035800 maturation Effects 0.000 description 7
- 239000000047 product Substances 0.000 description 7
- 230000002829 reductive effect Effects 0.000 description 7
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 7
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 6
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 6
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 6
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 6
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 6
- 241000894006 Bacteria Species 0.000 description 6
- LYSHSHHDBVKJRN-JBDRJPRFSA-N Cys-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N LYSHSHHDBVKJRN-JBDRJPRFSA-N 0.000 description 6
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 6
- PODFFOWWLUPNMN-DCAQKATOSA-N Gln-His-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PODFFOWWLUPNMN-DCAQKATOSA-N 0.000 description 6
- IIMZHVKZBGSEKZ-SZMVWBNQSA-N Gln-Trp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O IIMZHVKZBGSEKZ-SZMVWBNQSA-N 0.000 description 6
- WTJIWXMJESRHMM-XDTLVQLUSA-N Gln-Tyr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O WTJIWXMJESRHMM-XDTLVQLUSA-N 0.000 description 6
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 6
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 6
- 108010050006 Gly-Asp-Gly-Arg Proteins 0.000 description 6
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 6
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 6
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 6
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- 206010020649 Hyperkeratosis Diseases 0.000 description 6
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 6
- HYLIOBDWPQNLKI-HVTMNAMFSA-N Ile-His-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HYLIOBDWPQNLKI-HVTMNAMFSA-N 0.000 description 6
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 6
- JCGMFFQQHJQASB-PYJNHQTQSA-N Ile-Val-His Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O JCGMFFQQHJQASB-PYJNHQTQSA-N 0.000 description 6
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 6
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 6
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 6
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 6
- 244000291473 Musa acuminata Species 0.000 description 6
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 6
- NOFBJKKOPKJDCO-KKXDTOCCSA-N Phe-Ala-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NOFBJKKOPKJDCO-KKXDTOCCSA-N 0.000 description 6
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 6
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 6
- NIOYDASGXWLHEZ-CIUDSAMLSA-N Ser-Met-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOYDASGXWLHEZ-CIUDSAMLSA-N 0.000 description 6
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 6
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 6
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 6
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 6
- JONPRIHUYSPIMA-UWJYBYFXSA-N Tyr-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JONPRIHUYSPIMA-UWJYBYFXSA-N 0.000 description 6
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 6
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 6
- 108010087924 alanylproline Proteins 0.000 description 6
- 150000001413 amino acids Chemical class 0.000 description 6
- 108010093581 aspartyl-proline Proteins 0.000 description 6
- 108010038633 aspartylglutamate Proteins 0.000 description 6
- 108010092854 aspartyllysine Proteins 0.000 description 6
- 108010010147 glycylglutamine Proteins 0.000 description 6
- 108010087823 glycyltyrosine Proteins 0.000 description 6
- 108010025306 histidylleucine Proteins 0.000 description 6
- 230000002147 killing effect Effects 0.000 description 6
- 108010017391 lysylvaline Proteins 0.000 description 6
- 108010025488 pinealon Proteins 0.000 description 6
- 108010053725 prolylvaline Proteins 0.000 description 6
- 108010009962 valyltyrosine Proteins 0.000 description 6
- 101150073246 AGL1 gene Proteins 0.000 description 5
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 5
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 5
- 241000219195 Arabidopsis thaliana Species 0.000 description 5
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 5
- UIUXXFIKWQVMEX-UFYCRDLUSA-N Arg-Phe-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UIUXXFIKWQVMEX-UFYCRDLUSA-N 0.000 description 5
- LZLCLRQMUQWUHJ-GUBZILKMSA-N Asn-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N LZLCLRQMUQWUHJ-GUBZILKMSA-N 0.000 description 5
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 5
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 5
- 238000010453 CRISPR/Cas method Methods 0.000 description 5
- 102220605874 Cytosolic arginine sensor for mTORC1 subunit 2_D10A_mutation Human genes 0.000 description 5
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 5
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 5
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 5
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 5
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 5
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 5
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 5
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 5
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 5
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 5
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 5
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 5
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 5
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 5
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 5
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 5
- 229930195725 Mannitol Natural products 0.000 description 5
- TUZSWDCTCGTVDJ-PJODQICGSA-N Met-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 TUZSWDCTCGTVDJ-PJODQICGSA-N 0.000 description 5
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 5
- 101710163270 Nuclease Proteins 0.000 description 5
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 5
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 5
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 5
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 5
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 5
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 5
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 5
- 238000010459 TALEN Methods 0.000 description 5
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 5
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 5
- 241000700605 Viruses Species 0.000 description 5
- 108010017070 Zinc Finger Nucleases Proteins 0.000 description 5
- 108010036533 arginylvaline Proteins 0.000 description 5
- 108010047857 aspartylglycine Proteins 0.000 description 5
- 230000003115 biocidal effect Effects 0.000 description 5
- 230000008859 change Effects 0.000 description 5
- 150000001875 compounds Chemical class 0.000 description 5
- 230000034431 double-strand break repair via homologous recombination Effects 0.000 description 5
- 230000006801 homologous recombination Effects 0.000 description 5
- 238000002744 homologous recombination Methods 0.000 description 5
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 5
- 108010012058 leucyltyrosine Proteins 0.000 description 5
- 108010003700 lysyl aspartic acid Proteins 0.000 description 5
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 5
- 239000000594 mannitol Substances 0.000 description 5
- 235000010355 mannitol Nutrition 0.000 description 5
- 238000004519 manufacturing process Methods 0.000 description 5
- 238000012163 sequencing technique Methods 0.000 description 5
- 108010026333 seryl-proline Proteins 0.000 description 5
- 239000011734 sodium Substances 0.000 description 5
- 230000003442 weekly effect Effects 0.000 description 5
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 4
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 4
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 4
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 4
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 4
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 4
- JTZUZBADHGISJD-SRVKXCTJSA-N Arg-His-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JTZUZBADHGISJD-SRVKXCTJSA-N 0.000 description 4
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 4
- AMIQZQAAYGYKOP-FXQIFTODSA-N Arg-Ser-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O AMIQZQAAYGYKOP-FXQIFTODSA-N 0.000 description 4
- RCENDENBBJFJHZ-ACZMJKKPSA-N Asn-Asn-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCENDENBBJFJHZ-ACZMJKKPSA-N 0.000 description 4
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 4
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 4
- ZEDBMCPXPIYJLW-XHNCKOQMSA-N Asp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZEDBMCPXPIYJLW-XHNCKOQMSA-N 0.000 description 4
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 238000010354 CRISPR gene editing Methods 0.000 description 4
- 101100215147 Caenorhabditis elegans aco-1 gene Proteins 0.000 description 4
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 4
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 4
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 4
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 4
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 4
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 4
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 4
- RVGMVLVBDRQVKB-UWVGGRQHSA-N Gly-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN RVGMVLVBDRQVKB-UWVGGRQHSA-N 0.000 description 4
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 4
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 4
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 4
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 4
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 4
- IPFKIGNDTUOFAF-CYDGBPFRSA-N Ile-Val-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IPFKIGNDTUOFAF-CYDGBPFRSA-N 0.000 description 4
- 108010065920 Insulin Lispro Proteins 0.000 description 4
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 4
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 4
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 4
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 4
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 4
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 4
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 4
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 4
- HZLSUXCMSIBCRV-RVMXOQNASA-N Met-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N HZLSUXCMSIBCRV-RVMXOQNASA-N 0.000 description 4
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 4
- 108091028043 Nucleic acid sequence Proteins 0.000 description 4
- UNLYPPYNDXHGDG-IHRRRGAJSA-N Phe-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UNLYPPYNDXHGDG-IHRRRGAJSA-N 0.000 description 4
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 4
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 4
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 4
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 4
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 4
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 4
- 238000003559 RNA-seq method Methods 0.000 description 4
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 4
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 4
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 4
- 108020004682 Single-Stranded DNA Proteins 0.000 description 4
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 4
- 241000209140 Triticum Species 0.000 description 4
- 235000021307 Triticum Nutrition 0.000 description 4
- DVIIYMVCSUQOJG-QEJZJMRPSA-N Trp-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DVIIYMVCSUQOJG-QEJZJMRPSA-N 0.000 description 4
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 4
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 4
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 4
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 4
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 4
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 4
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 4
- 230000036579 abiotic stress Effects 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 4
- 230000004790 biotic stress Effects 0.000 description 4
- 235000013339 cereals Nutrition 0.000 description 4
- 238000012217 deletion Methods 0.000 description 4
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 4
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 4
- 108010037850 glycylvaline Proteins 0.000 description 4
- 108010085325 histidylproline Proteins 0.000 description 4
- 230000006780 non-homologous end joining Effects 0.000 description 4
- 210000000056 organ Anatomy 0.000 description 4
- 108010004914 prolylarginine Proteins 0.000 description 4
- 230000004083 survival effect Effects 0.000 description 4
- 230000001131 transforming effect Effects 0.000 description 4
- 230000001052 transient effect Effects 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- 108010020532 tyrosyl-proline Proteins 0.000 description 4
- 108010073969 valyllysine Proteins 0.000 description 4
- NUPJIGQFXCQJBK-UHFFFAOYSA-N 2-(4-isopropyl-4-methyl-5-oxo-4,5-dihydro-1H-imidazol-2-yl)-5-(methoxymethyl)nicotinic acid Chemical compound OC(=O)C1=CC(COC)=CN=C1C1=NC(C)(C(C)C)C(=O)N1 NUPJIGQFXCQJBK-UHFFFAOYSA-N 0.000 description 3
- CLQMBPJKHLGMQK-UHFFFAOYSA-N 2-(4-isopropyl-4-methyl-5-oxo-4,5-dihydro-1H-imidazol-2-yl)nicotinic acid Chemical compound N1C(=O)C(C(C)C)(C)N=C1C1=NC=CC=C1C(O)=O CLQMBPJKHLGMQK-UHFFFAOYSA-N 0.000 description 3
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 3
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 3
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 3
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 3
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 3
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 3
- AETQNIIFKCMVHP-UVBJJODRSA-N Ala-Trp-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AETQNIIFKCMVHP-UVBJJODRSA-N 0.000 description 3
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 3
- PVSNBTCXCQIXSE-JYJNAYRXSA-N Arg-Arg-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PVSNBTCXCQIXSE-JYJNAYRXSA-N 0.000 description 3
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 3
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 3
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 3
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 3
- SJPZTWAYTJPPBI-GUBZILKMSA-N Asn-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SJPZTWAYTJPPBI-GUBZILKMSA-N 0.000 description 3
- BZMWJLLUAKSIMH-FXQIFTODSA-N Asn-Glu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BZMWJLLUAKSIMH-FXQIFTODSA-N 0.000 description 3
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 3
- BIGRHVNFFJTHEB-UBHSHLNASA-N Asn-Trp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O BIGRHVNFFJTHEB-UBHSHLNASA-N 0.000 description 3
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 3
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 3
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 3
- 102000020313 Cell-Penetrating Peptides Human genes 0.000 description 3
- 108010051109 Cell-Penetrating Peptides Proteins 0.000 description 3
- WXKWQSDHEXKKNC-ZKWXMUAHSA-N Cys-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N WXKWQSDHEXKKNC-ZKWXMUAHSA-N 0.000 description 3
- 241000233866 Fungi Species 0.000 description 3
- 108091092584 GDNA Proteins 0.000 description 3
- LLRJEFPKIIBGJP-DCAQKATOSA-N Gln-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LLRJEFPKIIBGJP-DCAQKATOSA-N 0.000 description 3
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 3
- HPBKQFJXDUVNQV-FHWLQOOXSA-N Gln-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O HPBKQFJXDUVNQV-FHWLQOOXSA-N 0.000 description 3
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 3
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 3
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 3
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 3
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 3
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 3
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 3
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 3
- 229940113491 Glycosylase inhibitor Drugs 0.000 description 3
- 108091027305 Heteroduplex Proteins 0.000 description 3
- 241000238631 Hexapoda Species 0.000 description 3
- WMKXFMUJRCEGRP-SRVKXCTJSA-N His-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N WMKXFMUJRCEGRP-SRVKXCTJSA-N 0.000 description 3
- BZKDJRSZWLPJNI-SRVKXCTJSA-N His-His-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O BZKDJRSZWLPJNI-SRVKXCTJSA-N 0.000 description 3
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 3
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 3
- RVNOXPZHMUWCLW-GMOBBJLQSA-N Ile-Met-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RVNOXPZHMUWCLW-GMOBBJLQSA-N 0.000 description 3
- SVZFKLBRCYCIIY-CYDGBPFRSA-N Ile-Pro-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVZFKLBRCYCIIY-CYDGBPFRSA-N 0.000 description 3
- 239000005566 Imazamox Substances 0.000 description 3
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 3
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 3
- AXZGZMGRBDQTEY-SRVKXCTJSA-N Leu-Gln-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O AXZGZMGRBDQTEY-SRVKXCTJSA-N 0.000 description 3
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 3
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 3
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 3
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 3
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 3
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 3
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 3
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 3
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 3
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 3
- LQMHZERGCQJKAH-STQMWFEESA-N Met-Gly-Phe Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LQMHZERGCQJKAH-STQMWFEESA-N 0.000 description 3
- IQJMEDDVOGMTKT-SRVKXCTJSA-N Met-Val-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IQJMEDDVOGMTKT-SRVKXCTJSA-N 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 3
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 3
- KQCCDMFIALWGTL-GUBZILKMSA-N Pro-Asn-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 KQCCDMFIALWGTL-GUBZILKMSA-N 0.000 description 3
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 3
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 3
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 3
- BARPGRUZBKFJMA-SRVKXCTJSA-N Pro-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BARPGRUZBKFJMA-SRVKXCTJSA-N 0.000 description 3
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 3
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 3
- 102000004389 Ribonucleoproteins Human genes 0.000 description 3
- 108010081734 Ribonucleoproteins Proteins 0.000 description 3
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 3
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 3
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 3
- XQAPEISNMXNKGE-FXQIFTODSA-N Ser-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CS)C(=O)O XQAPEISNMXNKGE-FXQIFTODSA-N 0.000 description 3
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 3
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 3
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 3
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 3
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 3
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 3
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 3
- RSUXQZNWAOTBQF-XIRDDKMYSA-N Trp-Arg-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RSUXQZNWAOTBQF-XIRDDKMYSA-N 0.000 description 3
- NSOMQRHZMJMZIE-GVARAGBVSA-N Tyr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NSOMQRHZMJMZIE-GVARAGBVSA-N 0.000 description 3
- 102000006943 Uracil-DNA Glycosidase Human genes 0.000 description 3
- 108010072685 Uracil-DNA Glycosidase Proteins 0.000 description 3
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 3
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 3
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 3
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 3
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 3
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 3
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 3
- VSCIANXXVZOYOC-AVGNSLFASA-N Val-Pro-His Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VSCIANXXVZOYOC-AVGNSLFASA-N 0.000 description 3
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 3
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 3
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 3
- 240000008042 Zea mays Species 0.000 description 3
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 3
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 3
- 239000002671 adjuvant Substances 0.000 description 3
- 230000002411 adverse Effects 0.000 description 3
- 230000009418 agronomic effect Effects 0.000 description 3
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 3
- 108010011559 alanylphenylalanine Proteins 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 108010013835 arginine glutamate Proteins 0.000 description 3
- 108010062796 arginyllysine Proteins 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 150000005693 branched-chain amino acids Chemical class 0.000 description 3
- 230000003197 catalytic effect Effects 0.000 description 3
- 229960004261 cefotaxime Drugs 0.000 description 3
- AZZMGZXNTDTSME-JUZDKLSSSA-M cefotaxime sodium Chemical compound [Na+].N([C@@H]1C(N2C(=C(COC(C)=O)CS[C@@H]21)C([O-])=O)=O)C(=O)\C(=N/OC)C1=CSC(N)=N1 AZZMGZXNTDTSME-JUZDKLSSSA-M 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 108010016616 cysteinylglycine Proteins 0.000 description 3
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical class O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 3
- 230000003247 decreasing effect Effects 0.000 description 3
- 239000008367 deionised water Substances 0.000 description 3
- 229910021641 deionized water Inorganic materials 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 230000005782 double-strand break Effects 0.000 description 3
- 230000002222 downregulating effect Effects 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 239000012869 germination medium Substances 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- 108010079547 glutamylmethionine Proteins 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 238000010348 incorporation Methods 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 238000002955 isolation Methods 0.000 description 3
- 108010078274 isoleucylvaline Proteins 0.000 description 3
- 108010009298 lysylglutamic acid Proteins 0.000 description 3
- 235000009973 maize Nutrition 0.000 description 3
- 238000007726 management method Methods 0.000 description 3
- 230000004060 metabolic process Effects 0.000 description 3
- 108010005942 methionylglycine Proteins 0.000 description 3
- 239000013642 negative control Substances 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 108010024607 phenylalanylalanine Proteins 0.000 description 3
- 108010073101 phenylalanylleucine Proteins 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 108010070643 prolylglutamic acid Proteins 0.000 description 3
- 108010029020 prolylglycine Proteins 0.000 description 3
- 238000003762 quantitative reverse transcription PCR Methods 0.000 description 3
- 230000001172 regenerating effect Effects 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 230000005070 ripening Effects 0.000 description 3
- 238000007480 sanger sequencing Methods 0.000 description 3
- 229910052708 sodium Inorganic materials 0.000 description 3
- 230000002103 transcriptional effect Effects 0.000 description 3
- 230000010474 transient expression Effects 0.000 description 3
- JYEUMXHLPRZUAT-UHFFFAOYSA-N 1,2,3-triazine Chemical compound C1=CN=NN=C1 JYEUMXHLPRZUAT-UHFFFAOYSA-N 0.000 description 2
- RBSXHDIPCIWOMG-UHFFFAOYSA-N 1-(4,6-dimethoxypyrimidin-2-yl)-3-(2-ethylsulfonylimidazo[1,2-a]pyridin-3-yl)sulfonylurea Chemical compound CCS(=O)(=O)C=1N=C2C=CC=CN2C=1S(=O)(=O)NC(=O)NC1=NC(OC)=CC(OC)=N1 RBSXHDIPCIWOMG-UHFFFAOYSA-N 0.000 description 2
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 2
- QRBLKGHRWFGINE-UGWAGOLRSA-N 2-[2-[2-[[2-[[4-[[2-[[6-amino-2-[3-amino-1-[(2,3-diamino-3-oxopropyl)amino]-3-oxopropyl]-5-methylpyrimidine-4-carbonyl]amino]-3-[(2r,3s,4s,5s,6s)-3-[(2s,3r,4r,5s)-4-carbamoyl-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy-4,5-dihydroxy-6-(hydroxymethyl)- Chemical compound N=1C(C=2SC=C(N=2)C(N)=O)CSC=1CCNC(=O)C(C(C)=O)NC(=O)C(C)C(O)C(C)NC(=O)C(C(O[C@H]1[C@@]([C@@H](O)[C@H](O)[C@H](CO)O1)(C)O[C@H]1[C@@H]([C@](O)([C@@H](O)C(CO)O1)C(N)=O)O)C=1NC=NC=1)NC(=O)C1=NC(C(CC(N)=O)NCC(N)C(N)=O)=NC(N)=C1C QRBLKGHRWFGINE-UGWAGOLRSA-N 0.000 description 2
- CABMTIJINOIHOD-UHFFFAOYSA-N 2-[4-methyl-5-oxo-4-(propan-2-yl)-4,5-dihydro-1H-imidazol-2-yl]quinoline-3-carboxylic acid Chemical compound N1C(=O)C(C(C)C)(C)N=C1C1=NC2=CC=CC=C2C=C1C(O)=O CABMTIJINOIHOD-UHFFFAOYSA-N 0.000 description 2
- 108010020183 3-phosphoshikimate 1-carboxyvinyltransferase Proteins 0.000 description 2
- RRCWSLBKLVBFQD-UHFFFAOYSA-N 4-chloro-1-(2-methylpropyl)imidazo[4,5-c]quinoline Chemical class C1=CC=CC2=C3N(CC(C)C)C=NC3=C(Cl)N=C21 RRCWSLBKLVBFQD-UHFFFAOYSA-N 0.000 description 2
- 241000238876 Acari Species 0.000 description 2
- 102000007469 Actins Human genes 0.000 description 2
- 108010085238 Actins Proteins 0.000 description 2
- 229930024421 Adenine Natural products 0.000 description 2
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 2
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 2
- PJNSIUPOXFBHDM-GUBZILKMSA-N Ala-Arg-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O PJNSIUPOXFBHDM-GUBZILKMSA-N 0.000 description 2
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 2
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 2
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 2
- NJWJSLCQEDMGNC-MBLNEYKQSA-N Ala-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N)O NJWJSLCQEDMGNC-MBLNEYKQSA-N 0.000 description 2
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 2
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 2
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 2
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 2
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- KYDYGANDJHFBCW-DRZSPHRISA-N Ala-Phe-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KYDYGANDJHFBCW-DRZSPHRISA-N 0.000 description 2
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 2
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 2
- 101000852723 Arabidopsis thaliana Acetolactate synthase small subunit 1, chloroplastic Proteins 0.000 description 2
- 101000852721 Arabidopsis thaliana Acetolactate synthase small subunit 2, chloroplastic Proteins 0.000 description 2
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 2
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 2
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 2
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 2
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 2
- QKSAZKCRVQYYGS-UWVGGRQHSA-N Arg-Gly-His Chemical compound N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QKSAZKCRVQYYGS-UWVGGRQHSA-N 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 2
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 2
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 2
- JBIRFLWXWDSDTR-CYDGBPFRSA-N Arg-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N JBIRFLWXWDSDTR-CYDGBPFRSA-N 0.000 description 2
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 2
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 2
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 2
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 2
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 2
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 2
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 2
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 2
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 2
- AEZCCDMZZJOGII-DCAQKATOSA-N Asn-Met-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O AEZCCDMZZJOGII-DCAQKATOSA-N 0.000 description 2
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 2
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 2
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 description 2
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 2
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 2
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 2
- 101100378010 Bacillus subtilis (strain 168) accC1 gene Proteins 0.000 description 2
- 101100322122 Bacillus subtilis (strain 168) accC2 gene Proteins 0.000 description 2
- 239000002028 Biomass Substances 0.000 description 2
- 108010006654 Bleomycin Proteins 0.000 description 2
- 241001515826 Cassava vein mosaic virus Species 0.000 description 2
- 108091005960 Citrine Proteins 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- QFMCHXSGIZPBKG-ZLUOBGJFSA-N Cys-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N QFMCHXSGIZPBKG-ZLUOBGJFSA-N 0.000 description 2
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 2
- CHRCKSPMGYDLIA-SRVKXCTJSA-N Cys-Phe-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O CHRCKSPMGYDLIA-SRVKXCTJSA-N 0.000 description 2
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 2
- 108020001738 DNA Glycosylase Proteins 0.000 description 2
- 238000007400 DNA extraction Methods 0.000 description 2
- 102000028381 DNA glycosylase Human genes 0.000 description 2
- 208000005156 Dehydration Diseases 0.000 description 2
- VGGSQFUCUMXWEO-UHFFFAOYSA-N Ethene Chemical compound C=C VGGSQFUCUMXWEO-UHFFFAOYSA-N 0.000 description 2
- 239000005977 Ethylene Substances 0.000 description 2
- 241000237858 Gastropoda Species 0.000 description 2
- 229930182566 Gentamicin Natural products 0.000 description 2
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 2
- DRDSQGHKTLSNEA-GLLZPBPUSA-N Gln-Glu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRDSQGHKTLSNEA-GLLZPBPUSA-N 0.000 description 2
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 2
- ILKYYKRAULNYMS-JYJNAYRXSA-N Gln-Lys-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ILKYYKRAULNYMS-JYJNAYRXSA-N 0.000 description 2
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 2
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 2
- UTKICHUQEQBDGC-ACZMJKKPSA-N Glu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UTKICHUQEQBDGC-ACZMJKKPSA-N 0.000 description 2
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 2
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 2
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 2
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 2
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 2
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 2
- TWYFJOHWGCCRIR-DCAQKATOSA-N Glu-Pro-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYFJOHWGCCRIR-DCAQKATOSA-N 0.000 description 2
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 2
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 2
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 2
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 2
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 2
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 2
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 2
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 2
- ICUTTWWCDIIIEE-BQBZGAKWSA-N Gly-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN ICUTTWWCDIIIEE-BQBZGAKWSA-N 0.000 description 2
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 2
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 2
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 2
- DKJWUIYLMLUBDX-XPUUQOCRSA-N Gly-Val-Cys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O DKJWUIYLMLUBDX-XPUUQOCRSA-N 0.000 description 2
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 2
- OMNVOTCFQQLEQU-CIUDSAMLSA-N His-Asn-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMNVOTCFQQLEQU-CIUDSAMLSA-N 0.000 description 2
- VOKCBYNCZVSILJ-KKUMJFAQSA-N His-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O VOKCBYNCZVSILJ-KKUMJFAQSA-N 0.000 description 2
- UPGJWSUYENXOPV-HGNGGELXSA-N His-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N UPGJWSUYENXOPV-HGNGGELXSA-N 0.000 description 2
- LPZUKJALYGXBIE-SRVKXCTJSA-N His-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N LPZUKJALYGXBIE-SRVKXCTJSA-N 0.000 description 2
- BQFGKVYHKCNEMF-DCAQKATOSA-N His-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 BQFGKVYHKCNEMF-DCAQKATOSA-N 0.000 description 2
- XJFITURPHAKKAI-SRVKXCTJSA-N His-Pro-Gln Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CN=CN1 XJFITURPHAKKAI-SRVKXCTJSA-N 0.000 description 2
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 2
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 2
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 2
- ATXGFMOBVKSOMK-PEDHHIEDSA-N Ile-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N ATXGFMOBVKSOMK-PEDHHIEDSA-N 0.000 description 2
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 2
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 2
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 2
- LJKDGRWXYUTRSH-YVNDNENWSA-N Ile-Gln-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LJKDGRWXYUTRSH-YVNDNENWSA-N 0.000 description 2
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 2
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 2
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 2
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 2
- FHPZJWJWTWZKNA-LLLHUVSDSA-N Ile-Phe-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N FHPZJWJWTWZKNA-LLLHUVSDSA-N 0.000 description 2
- XHBYEMIUENPZLY-GMOBBJLQSA-N Ile-Pro-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O XHBYEMIUENPZLY-GMOBBJLQSA-N 0.000 description 2
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 2
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 2
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 2
- 239000005981 Imazaquin Substances 0.000 description 2
- XVOKUMIPKHGGTN-UHFFFAOYSA-N Imazethapyr Chemical compound OC(=O)C1=CC(CC)=CN=C1C1=NC(C)(C(C)C)C(=O)N1 XVOKUMIPKHGGTN-UHFFFAOYSA-N 0.000 description 2
- 108010025815 Kanamycin Kinase Proteins 0.000 description 2
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- DQPQTXMIRBUWKO-DCAQKATOSA-N Leu-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N DQPQTXMIRBUWKO-DCAQKATOSA-N 0.000 description 2
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 2
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 2
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 2
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 2
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 2
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 2
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 2
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 2
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 2
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 2
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 2
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 2
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 2
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 2
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 2
- MQMIRLVJXQNTRJ-SDDRHHMPSA-N Lys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O MQMIRLVJXQNTRJ-SDDRHHMPSA-N 0.000 description 2
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 2
- DKTNGXVSCZULPO-YUMQZZPRSA-N Lys-Gly-Cys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O DKTNGXVSCZULPO-YUMQZZPRSA-N 0.000 description 2
- CBNMHRCLYBJIIZ-XUXIUFHCSA-N Lys-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N CBNMHRCLYBJIIZ-XUXIUFHCSA-N 0.000 description 2
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 2
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 2
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 2
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 2
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 2
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 2
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 2
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 2
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 2
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 2
- OHMKUHXCDSCOMT-QXEWZRGKSA-N Met-Asn-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHMKUHXCDSCOMT-QXEWZRGKSA-N 0.000 description 2
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 description 2
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 2
- WXJXYMFUTRXRGO-UWVGGRQHSA-N Met-His-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 WXJXYMFUTRXRGO-UWVGGRQHSA-N 0.000 description 2
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 2
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 2
- XPVCDCMPKCERFT-GUBZILKMSA-N Met-Ser-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XPVCDCMPKCERFT-GUBZILKMSA-N 0.000 description 2
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- 108010047562 NGR peptide Proteins 0.000 description 2
- 241000244206 Nematoda Species 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 244000061176 Nicotiana tabacum Species 0.000 description 2
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 2
- 108020004485 Nonsense Codon Proteins 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 2
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 2
- RVRRHFPCEOVRKQ-KKUMJFAQSA-N Phe-His-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RVRRHFPCEOVRKQ-KKUMJFAQSA-N 0.000 description 2
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 2
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 2
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 2
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 2
- LTQCLFMNABRKSH-UHFFFAOYSA-N Phleomycin Natural products N=1C(C=2SC=C(N=2)C(N)=O)CSC=1CCNC(=O)C(C(O)C)NC(=O)C(C)C(O)C(C)NC(=O)C(C(OC1C(C(O)C(O)C(CO)O1)OC1C(C(OC(N)=O)C(O)C(CO)O1)O)C=1NC=NC=1)NC(=O)C1=NC(C(CC(N)=O)NCC(N)C(N)=O)=NC(N)=C1C LTQCLFMNABRKSH-UHFFFAOYSA-N 0.000 description 2
- 108010035235 Phleomycins Proteins 0.000 description 2
- 102100023712 Poly [ADP-ribose] polymerase 1 Human genes 0.000 description 2
- HPXVFFIIGOAQRV-DCAQKATOSA-N Pro-Arg-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O HPXVFFIIGOAQRV-DCAQKATOSA-N 0.000 description 2
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 2
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 2
- QEWBZBLXDKIQPS-STQMWFEESA-N Pro-Gly-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QEWBZBLXDKIQPS-STQMWFEESA-N 0.000 description 2
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 2
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 2
- SXMSEHDMNIUTSP-DCAQKATOSA-N Pro-Lys-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SXMSEHDMNIUTSP-DCAQKATOSA-N 0.000 description 2
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 2
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 2
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 2
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 2
- 108010026552 Proteome Proteins 0.000 description 2
- 239000005616 Rimsulfuron Substances 0.000 description 2
- 230000018199 S phase Effects 0.000 description 2
- 241000239226 Scorpiones Species 0.000 description 2
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 2
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 2
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 2
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 2
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 2
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 2
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 2
- QBUWQRKEHJXTOP-DCAQKATOSA-N Ser-His-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QBUWQRKEHJXTOP-DCAQKATOSA-N 0.000 description 2
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 2
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 2
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 2
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 2
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 2
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 2
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 2
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 235000002560 Solanum lycopersicum Nutrition 0.000 description 2
- 244000061458 Solanum melongena Species 0.000 description 2
- 235000002597 Solanum melongena Nutrition 0.000 description 2
- 239000005619 Sulfosulfuron Substances 0.000 description 2
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 2
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 2
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 2
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 2
- ZXIHABSKUITPTN-IXOXFDKPSA-N Thr-Lys-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O ZXIHABSKUITPTN-IXOXFDKPSA-N 0.000 description 2
- JMBRNXUOLJFURW-BEAPCOKYSA-N Thr-Phe-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N)O JMBRNXUOLJFURW-BEAPCOKYSA-N 0.000 description 2
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 2
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 2
- VGNLMPBYWWNQFS-ZEILLAHLSA-N Thr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O VGNLMPBYWWNQFS-ZEILLAHLSA-N 0.000 description 2
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 2
- ZOCJFNXUVSGBQI-HSHDSVGOSA-N Thr-Trp-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O ZOCJFNXUVSGBQI-HSHDSVGOSA-N 0.000 description 2
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 2
- 108091028113 Trans-activating crRNA Proteins 0.000 description 2
- UKWSFUSPGPBJGU-VFAJRCTISA-N Trp-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O UKWSFUSPGPBJGU-VFAJRCTISA-N 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 2
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 2
- OGPKMBOPMDTEDM-IHRRRGAJSA-N Tyr-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N OGPKMBOPMDTEDM-IHRRRGAJSA-N 0.000 description 2
- QVYFTFIBKCDHIE-ACRUOGEOSA-N Tyr-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O QVYFTFIBKCDHIE-ACRUOGEOSA-N 0.000 description 2
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 2
- IVXJODPZRWHCCR-JYJNAYRXSA-N Val-Arg-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IVXJODPZRWHCCR-JYJNAYRXSA-N 0.000 description 2
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 2
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 2
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 2
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 2
- OXVPMZVGCAPFIG-BQFCYCMXSA-N Val-Gln-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N OXVPMZVGCAPFIG-BQFCYCMXSA-N 0.000 description 2
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 2
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 2
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 2
- DLMNFMXSNGTSNJ-PYJNHQTQSA-N Val-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N DLMNFMXSNGTSNJ-PYJNHQTQSA-N 0.000 description 2
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 2
- WDIWOIRFNMLNKO-ULQDDVLXSA-N Val-Leu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WDIWOIRFNMLNKO-ULQDDVLXSA-N 0.000 description 2
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 2
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 2
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 2
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 2
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 2
- 101150070497 accC gene Proteins 0.000 description 2
- 230000001133 acceleration Effects 0.000 description 2
- 102000005421 acetyltransferase Human genes 0.000 description 2
- 108020002494 acetyltransferase Proteins 0.000 description 2
- 229960000643 adenine Drugs 0.000 description 2
- 238000013019 agitation Methods 0.000 description 2
- 235000004279 alanine Nutrition 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 229960001561 bleomycin Drugs 0.000 description 2
- OYVAGSVQBOHSSS-UAPAGMARSA-O bleomycin A2 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC=C(N=1)C=1SC=C(N=1)C(=O)NCCC[S+](C)C)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C OYVAGSVQBOHSSS-UAPAGMARSA-O 0.000 description 2
- 239000011035 citrine Substances 0.000 description 2
- 238000011109 contamination Methods 0.000 description 2
- 230000034994 death Effects 0.000 description 2
- 230000001934 delay Effects 0.000 description 2
- 239000000412 dendrimer Substances 0.000 description 2
- 229920000736 dendritic polymer Polymers 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- 108091006047 fluorescent proteins Proteins 0.000 description 2
- 102000034287 fluorescent proteins Human genes 0.000 description 2
- 235000013569 fruit product Nutrition 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 230000030279 gene silencing Effects 0.000 description 2
- 229960002518 gentamicin Drugs 0.000 description 2
- 230000035784 germination Effects 0.000 description 2
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 108010002685 hygromycin-B kinase Proteins 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 230000005764 inhibitory process Effects 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 108010034507 methionyltryptophan Proteins 0.000 description 2
- OVQLOYMGKFRYJP-UHFFFAOYSA-N n,n-dichlorothiohydroxylamine Chemical compound SN(Cl)Cl OVQLOYMGKFRYJP-UHFFFAOYSA-N 0.000 description 2
- 238000007481 next generation sequencing Methods 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 230000037434 nonsense mutation Effects 0.000 description 2
- 230000009437 off-target effect Effects 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 230000001717 pathogenic effect Effects 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 108010084572 phenylalanyl-valine Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 239000010452 phosphate Substances 0.000 description 2
- 238000003976 plant breeding Methods 0.000 description 2
- 229920000768 polyamine Polymers 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 230000003389 potentiating effect Effects 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 150000003147 proline derivatives Chemical class 0.000 description 2
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 2
- 101150075980 psbA gene Proteins 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- MEFOUWRMVYJCQC-UHFFFAOYSA-N rimsulfuron Chemical compound CCS(=O)(=O)C1=CC=CN=C1S(=O)(=O)NC(=O)NC1=NC(OC)=CC(OC)=N1 MEFOUWRMVYJCQC-UHFFFAOYSA-N 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 238000010187 selection method Methods 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 230000001568 sexual effect Effects 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 230000000392 somatic effect Effects 0.000 description 2
- 108010005652 splenotritin Proteins 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- 108091005946 superfolder green fluorescent proteins Proteins 0.000 description 2
- 238000004114 suspension culture Methods 0.000 description 2
- 208000024891 symptom Diseases 0.000 description 2
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 2
- 239000003053 toxin Substances 0.000 description 2
- 231100000765 toxin Toxicity 0.000 description 2
- 108700012359 toxins Proteins 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 238000000844 transformation Methods 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- YMXOXAPKZDWXLY-QWRGUYRKSA-N tribenuron methyl Chemical group COC(=O)[C@H]1CCCC[C@@H]1S(=O)(=O)NC(=O)N(C)C1=NC(C)=NC(OC)=N1 YMXOXAPKZDWXLY-QWRGUYRKSA-N 0.000 description 2
- 229940035893 uracil Drugs 0.000 description 2
- 230000001018 virulence Effects 0.000 description 2
- 108010027345 wheylin-1 peptide Proteins 0.000 description 2
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- RRBGTUQJDFBWNN-MUGJNUQGSA-N (2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-2,6-diaminohexanoyl]amino]hexanoyl]amino]hexanoyl]amino]hexanoic acid Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O RRBGTUQJDFBWNN-MUGJNUQGSA-N 0.000 description 1
- CDUVSERIDNVFDD-UHFFFAOYSA-N 2-pyrimidin-2-ylbenzenecarbothioic s-acid Chemical class OC(=S)C1=CC=CC=C1C1=NC=CC=N1 CDUVSERIDNVFDD-UHFFFAOYSA-N 0.000 description 1
- 102100037563 40S ribosomal protein S2 Human genes 0.000 description 1
- PVSGXWMWNRGTKE-UHFFFAOYSA-N 5-methyl-2-[4-methyl-5-oxo-4-(propan-2-yl)-4,5-dihydro-1H-imidazol-2-yl]pyridine-3-carboxylic acid Chemical compound N1C(=O)C(C(C)C)(C)N=C1C1=NC=C(C)C=C1C(O)=O PVSGXWMWNRGTKE-UHFFFAOYSA-N 0.000 description 1
- CSMHHVPDCJNKBW-UHFFFAOYSA-N 5-oxo-4-sulfonyltriazole-1-carboxamide Chemical class NC(=O)N1N=NC(=S(=O)=O)C1=O CSMHHVPDCJNKBW-UHFFFAOYSA-N 0.000 description 1
- 101710193129 Acetolactate synthase, chloroplastic Proteins 0.000 description 1
- 101000611536 Agaricus bisporus Polyphenol oxidase 3 Proteins 0.000 description 1
- 244000307697 Agrimonia eupatoria Species 0.000 description 1
- 235000016626 Agrimonia eupatoria Nutrition 0.000 description 1
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 1
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 1
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- JPGBXANAQYHTLA-DRZSPHRISA-N Ala-Gln-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JPGBXANAQYHTLA-DRZSPHRISA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- ZDYNWWQXFRUOEO-XDTLVQLUSA-N Ala-Gln-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDYNWWQXFRUOEO-XDTLVQLUSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 1
- VYMJAWXRWHJIMS-LKTVYLICSA-N Ala-Tyr-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VYMJAWXRWHJIMS-LKTVYLICSA-N 0.000 description 1
- DDPKBJZLAXLQGZ-KBIXCLLPSA-N Ala-Val-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DDPKBJZLAXLQGZ-KBIXCLLPSA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- 239000004382 Amylase Substances 0.000 description 1
- 101100084634 Arabidopsis thaliana PBE1 gene Proteins 0.000 description 1
- 241000239290 Araneae Species 0.000 description 1
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 1
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 1
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 1
- RRGPUNYIPJXJBU-GUBZILKMSA-N Arg-Asp-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O RRGPUNYIPJXJBU-GUBZILKMSA-N 0.000 description 1
- TTXYKSADPSNOIF-IHRRRGAJSA-N Arg-Asp-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O TTXYKSADPSNOIF-IHRRRGAJSA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 1
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 1
- DJAIOAKQIOGULM-DCAQKATOSA-N Arg-Glu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O DJAIOAKQIOGULM-DCAQKATOSA-N 0.000 description 1
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 1
- RKQRHMKFNBYOTN-IHRRRGAJSA-N Arg-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N RKQRHMKFNBYOTN-IHRRRGAJSA-N 0.000 description 1
- CRCCTGPNZUCAHE-DCAQKATOSA-N Arg-His-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 CRCCTGPNZUCAHE-DCAQKATOSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 1
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 1
- LCBSSOCDWUTQQV-SDDRHHMPSA-N Arg-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LCBSSOCDWUTQQV-SDDRHHMPSA-N 0.000 description 1
- KSUALAGYYLQSHJ-RCWTZXSCSA-N Arg-Met-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSUALAGYYLQSHJ-RCWTZXSCSA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 1
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 description 1
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 1
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 1
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- OKZOABJQOMAYEC-NUMRIWBASA-N Asn-Gln-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OKZOABJQOMAYEC-NUMRIWBASA-N 0.000 description 1
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 1
- QYXNFROWLZPWPC-FXQIFTODSA-N Asn-Glu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QYXNFROWLZPWPC-FXQIFTODSA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- ASCGFDYEKSRNPL-CIUDSAMLSA-N Asn-Glu-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O ASCGFDYEKSRNPL-CIUDSAMLSA-N 0.000 description 1
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 1
- IKLAUGBIDCDFOY-SRVKXCTJSA-N Asn-His-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IKLAUGBIDCDFOY-SRVKXCTJSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 1
- CDGHMJJJHYKMPA-DLOVCJGASA-N Asn-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)N)N CDGHMJJJHYKMPA-DLOVCJGASA-N 0.000 description 1
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 1
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 1
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 1
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 1
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 1
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 1
- PUUPMDXIHCOPJU-HJGDQZAQSA-N Asn-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O PUUPMDXIHCOPJU-HJGDQZAQSA-N 0.000 description 1
- FHCRKXCTKSHNOE-QEJZJMRPSA-N Asn-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FHCRKXCTKSHNOE-QEJZJMRPSA-N 0.000 description 1
- DATSKXOXPUAOLK-KKUMJFAQSA-N Asn-Tyr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DATSKXOXPUAOLK-KKUMJFAQSA-N 0.000 description 1
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 1
- LTDGPJKGJDIBQD-LAEOZQHASA-N Asn-Val-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LTDGPJKGJDIBQD-LAEOZQHASA-N 0.000 description 1
- SYZWMVSXBZCOBZ-QXEWZRGKSA-N Asn-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N SYZWMVSXBZCOBZ-QXEWZRGKSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 1
- GVPSCJQLUGIKAM-GUBZILKMSA-N Asp-Arg-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GVPSCJQLUGIKAM-GUBZILKMSA-N 0.000 description 1
- MFMJRYHVLLEMQM-DCAQKATOSA-N Asp-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N MFMJRYHVLLEMQM-DCAQKATOSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 1
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 1
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 1
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 1
- 108020004513 Bacterial RNA Proteins 0.000 description 1
- 241001647745 Banksia Species 0.000 description 1
- 239000005472 Bensulfuron methyl Substances 0.000 description 1
- 235000014698 Brassica juncea var multisecta Nutrition 0.000 description 1
- 235000006008 Brassica napus var napus Nutrition 0.000 description 1
- 240000000385 Brassica napus var. napus Species 0.000 description 1
- 235000006618 Brassica rapa subsp oleifera Nutrition 0.000 description 1
- 235000004977 Brassica sinapistrum Nutrition 0.000 description 1
- 108091079001 CRISPR RNA Proteins 0.000 description 1
- 108010059892 Cellulase Proteins 0.000 description 1
- 206010068051 Chimerism Diseases 0.000 description 1
- KIHRUISMQZVCNO-ZLUOBGJFSA-N Cys-Asp-Asp Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KIHRUISMQZVCNO-ZLUOBGJFSA-N 0.000 description 1
- GCDLPNRHPWBKJJ-WDSKDSINSA-N Cys-Gly-Glu Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GCDLPNRHPWBKJJ-WDSKDSINSA-N 0.000 description 1
- SSNJZBGOMNLSLA-CIUDSAMLSA-N Cys-Leu-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O SSNJZBGOMNLSLA-CIUDSAMLSA-N 0.000 description 1
- LHJDLVVQRJIURS-SRVKXCTJSA-N Cys-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N LHJDLVVQRJIURS-SRVKXCTJSA-N 0.000 description 1
- CAXGCBSRJLADPD-FXQIFTODSA-N Cys-Pro-Asn Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O CAXGCBSRJLADPD-FXQIFTODSA-N 0.000 description 1
- UBHPUQAWSSNQLQ-DCAQKATOSA-N Cys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O UBHPUQAWSSNQLQ-DCAQKATOSA-N 0.000 description 1
- BCFXQBXXDSEHRS-FXQIFTODSA-N Cys-Ser-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BCFXQBXXDSEHRS-FXQIFTODSA-N 0.000 description 1
- IXPSSIBVVKSOIE-SRVKXCTJSA-N Cys-Ser-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O IXPSSIBVVKSOIE-SRVKXCTJSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 102000003915 DNA Topoisomerases Human genes 0.000 description 1
- 108090000323 DNA Topoisomerases Proteins 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 230000033616 DNA repair Effects 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- 102100032249 Dystonin Human genes 0.000 description 1
- 108700039887 Essential Genes Proteins 0.000 description 1
- 239000005529 Florasulam Substances 0.000 description 1
- QZXATCCPQKOEIH-UHFFFAOYSA-N Florasulam Chemical compound N=1N2C(OC)=NC=C(F)C2=NC=1S(=O)(=O)NC1=C(F)C=CC=C1F QZXATCCPQKOEIH-UHFFFAOYSA-N 0.000 description 1
- RXCPQSJAVKGONC-UHFFFAOYSA-N Flumetsulam Chemical compound N1=C2N=C(C)C=CN2N=C1S(=O)(=O)NC1=C(F)C=CC=C1F RXCPQSJAVKGONC-UHFFFAOYSA-N 0.000 description 1
- 230000010337 G2 phase Effects 0.000 description 1
- 101150104463 GOS2 gene Proteins 0.000 description 1
- IGNGBUVODQLMRJ-CIUDSAMLSA-N Gln-Ala-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IGNGBUVODQLMRJ-CIUDSAMLSA-N 0.000 description 1
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 1
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 1
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 1
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 1
- JEFZIKRIDLHOIF-BYPYZUCNSA-N Gln-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(O)=O JEFZIKRIDLHOIF-BYPYZUCNSA-N 0.000 description 1
- DQPOBSRQNWOBNA-GUBZILKMSA-N Gln-His-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O DQPOBSRQNWOBNA-GUBZILKMSA-N 0.000 description 1
- NNXIQPMZGZUFJJ-AVGNSLFASA-N Gln-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N NNXIQPMZGZUFJJ-AVGNSLFASA-N 0.000 description 1
- OOLCSQQPSLIETN-JYJNAYRXSA-N Gln-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)O OOLCSQQPSLIETN-JYJNAYRXSA-N 0.000 description 1
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 1
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 1
- WEAVZFWWIPIANL-SRVKXCTJSA-N Gln-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N WEAVZFWWIPIANL-SRVKXCTJSA-N 0.000 description 1
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 1
- MSHXWFKYXJTLEZ-CIUDSAMLSA-N Gln-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MSHXWFKYXJTLEZ-CIUDSAMLSA-N 0.000 description 1
- MFHVAWMMKZBSRQ-ACZMJKKPSA-N Gln-Ser-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N MFHVAWMMKZBSRQ-ACZMJKKPSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 1
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 1
- DITJVHONFRJKJW-BPUTZDHNSA-N Gln-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DITJVHONFRJKJW-BPUTZDHNSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- APHGWLWMOXGZRL-DCAQKATOSA-N Glu-Glu-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O APHGWLWMOXGZRL-DCAQKATOSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 1
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- ZAPFAWQHBOHWLL-GUBZILKMSA-N Glu-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N ZAPFAWQHBOHWLL-GUBZILKMSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 1
- LWYUQLZOIORFFJ-XKBZYTNZSA-N Glu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O LWYUQLZOIORFFJ-XKBZYTNZSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 1
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 1
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- HFXJIZNEXNIZIJ-BQBZGAKWSA-N Gly-Glu-Gln Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFXJIZNEXNIZIJ-BQBZGAKWSA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- PCPOYRCAHPJXII-UWVGGRQHSA-N Gly-Lys-Met Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PCPOYRCAHPJXII-UWVGGRQHSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- BBTCXWTXOXUNFX-IUCAKERBSA-N Gly-Met-Arg Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O BBTCXWTXOXUNFX-IUCAKERBSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 1
- RCHFYMASWAZQQZ-ZANVPECISA-N Gly-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CN)=CNC2=C1 RCHFYMASWAZQQZ-ZANVPECISA-N 0.000 description 1
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 1
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 1
- WRFOZIJRODPLIA-QWRGUYRKSA-N Gly-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O WRFOZIJRODPLIA-QWRGUYRKSA-N 0.000 description 1
- PNUFMLXHOLFRLD-KBPBESRZSA-N Gly-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 PNUFMLXHOLFRLD-KBPBESRZSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- 239000005564 Halosulfuron methyl Substances 0.000 description 1
- 229920002488 Hemicellulose Polymers 0.000 description 1
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 1
- CIWILNZNBPIHEU-DCAQKATOSA-N His-Arg-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O CIWILNZNBPIHEU-DCAQKATOSA-N 0.000 description 1
- SVHKVHBPTOMLTO-DCAQKATOSA-N His-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SVHKVHBPTOMLTO-DCAQKATOSA-N 0.000 description 1
- CJGDTAHEMXLRMB-ULQDDVLXSA-N His-Arg-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CJGDTAHEMXLRMB-ULQDDVLXSA-N 0.000 description 1
- MVADCDSCFTXCBT-CIUDSAMLSA-N His-Asp-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MVADCDSCFTXCBT-CIUDSAMLSA-N 0.000 description 1
- SWSVTNGMKBDTBM-DCAQKATOSA-N His-Gln-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SWSVTNGMKBDTBM-DCAQKATOSA-N 0.000 description 1
- IMCHNUANCIGUKS-SRVKXCTJSA-N His-Glu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IMCHNUANCIGUKS-SRVKXCTJSA-N 0.000 description 1
- XMENRVZYPBKBIL-AVGNSLFASA-N His-Glu-His Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XMENRVZYPBKBIL-AVGNSLFASA-N 0.000 description 1
- IDQNVIWPPWAFSY-AVGNSLFASA-N His-His-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O IDQNVIWPPWAFSY-AVGNSLFASA-N 0.000 description 1
- STOOMQFEJUVAKR-KKUMJFAQSA-N His-His-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 STOOMQFEJUVAKR-KKUMJFAQSA-N 0.000 description 1
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 1
- LNDVNHOSZQPJGI-AVGNSLFASA-N His-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 LNDVNHOSZQPJGI-AVGNSLFASA-N 0.000 description 1
- FLXCRBXJRJSDHX-AVGNSLFASA-N His-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O FLXCRBXJRJSDHX-AVGNSLFASA-N 0.000 description 1
- CUEQQFOGARVNHU-VGDYDELISA-N His-Ser-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUEQQFOGARVNHU-VGDYDELISA-N 0.000 description 1
- PBJOQLUVSGXRSW-YTQUADARSA-N His-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CN=CN4)N)C(=O)O PBJOQLUVSGXRSW-YTQUADARSA-N 0.000 description 1
- WSWAUVHXQREQQG-JYJNAYRXSA-N His-Tyr-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O WSWAUVHXQREQQG-JYJNAYRXSA-N 0.000 description 1
- DAKSMIWQZPHRIB-BZSNNMDCSA-N His-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DAKSMIWQZPHRIB-BZSNNMDCSA-N 0.000 description 1
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 1
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 1
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 1
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 1
- 101001098029 Homo sapiens 40S ribosomal protein S2 Proteins 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- QYOGJYIRKACXEP-SLBDDTMCSA-N Ile-Asn-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N QYOGJYIRKACXEP-SLBDDTMCSA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 1
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 description 1
- JRYQSFOFUFXPTB-RWRJDSDZSA-N Ile-Gln-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N JRYQSFOFUFXPTB-RWRJDSDZSA-N 0.000 description 1
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 1
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 1
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 1
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- HUWYGQOISIJNMK-SIGLWIIPSA-N Ile-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HUWYGQOISIJNMK-SIGLWIIPSA-N 0.000 description 1
- PKGGWLOLRLOPGK-XUXIUFHCSA-N Ile-Leu-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PKGGWLOLRLOPGK-XUXIUFHCSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- WVUDHMBJNBWZBU-XUXIUFHCSA-N Ile-Lys-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N WVUDHMBJNBWZBU-XUXIUFHCSA-N 0.000 description 1
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- RCMNUBZKIIJCOI-ZPFDUUQYSA-N Ile-Met-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RCMNUBZKIIJCOI-ZPFDUUQYSA-N 0.000 description 1
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 1
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 1
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 1
- SWNRZNLXMXRCJC-VKOGCVSHSA-N Ile-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 SWNRZNLXMXRCJC-VKOGCVSHSA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- MPSBSKHOWJQHBS-IHRRRGAJSA-N Leu-His-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N MPSBSKHOWJQHBS-IHRRRGAJSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 1
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 1
- VVQJGYPTIYOFBR-IHRRRGAJSA-N Leu-Lys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N VVQJGYPTIYOFBR-IHRRRGAJSA-N 0.000 description 1
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 1
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 1
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 1
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 1
- ONHCDMBHPQIPAI-YTQUADARSA-N Leu-Trp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N ONHCDMBHPQIPAI-YTQUADARSA-N 0.000 description 1
- HQBOMRTVKVKFMN-WDSOQIARSA-N Leu-Trp-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O HQBOMRTVKVKFMN-WDSOQIARSA-N 0.000 description 1
- SXOFUVGLPHCPRQ-KKUMJFAQSA-N Leu-Tyr-Cys Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(O)=O SXOFUVGLPHCPRQ-KKUMJFAQSA-N 0.000 description 1
- OZTZJMUZVAVJGY-BZSNNMDCSA-N Leu-Tyr-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OZTZJMUZVAVJGY-BZSNNMDCSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 1
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 1
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 1
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 1
- DNEJSAIMVANNPA-DCAQKATOSA-N Lys-Asn-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DNEJSAIMVANNPA-DCAQKATOSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- PXHCFKXNSBJSTQ-KKUMJFAQSA-N Lys-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)O PXHCFKXNSBJSTQ-KKUMJFAQSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- PHHYNOUOUWYQRO-XIRDDKMYSA-N Lys-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N PHHYNOUOUWYQRO-XIRDDKMYSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- PGLGNCVOWIORQE-SRVKXCTJSA-N Lys-His-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O PGLGNCVOWIORQE-SRVKXCTJSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- XREQQOATSMMAJP-MGHWNKPDSA-N Lys-Ile-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XREQQOATSMMAJP-MGHWNKPDSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- GZGWILAQHOVXTD-DCAQKATOSA-N Lys-Met-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O GZGWILAQHOVXTD-DCAQKATOSA-N 0.000 description 1
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 1
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 1
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 1
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 1
- 241000218922 Magnoliophyta Species 0.000 description 1
- 241000196329 Marchantia polymorpha Species 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 1
- RRIHXWPHQSXHAQ-XUXIUFHCSA-N Met-Ile-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O RRIHXWPHQSXHAQ-XUXIUFHCSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 1
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 1
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 1
- MSSJHBAKDDIRMJ-SRVKXCTJSA-N Met-Lys-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MSSJHBAKDDIRMJ-SRVKXCTJSA-N 0.000 description 1
- WXXNVZMWHOLNRJ-AVGNSLFASA-N Met-Pro-Lys Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O WXXNVZMWHOLNRJ-AVGNSLFASA-N 0.000 description 1
- KSIPKXNIQOWMIC-RCWTZXSCSA-N Met-Thr-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KSIPKXNIQOWMIC-RCWTZXSCSA-N 0.000 description 1
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 1
- 235000009421 Myristica fragrans Nutrition 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 239000005586 Nicosulfuron Substances 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 239000005592 Penoxsulam Substances 0.000 description 1
- SYJGKVOENHZYMQ-UHFFFAOYSA-N Penoxsulam Chemical compound N1=C2C(OC)=CN=C(OC)N2N=C1NS(=O)(=O)C1=C(OCC(F)F)C=CC=C1C(F)(F)F SYJGKVOENHZYMQ-UHFFFAOYSA-N 0.000 description 1
- 101001066099 Phaseolus coccineus Gibberellin 2-beta-dioxygenase Proteins 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- NEHSHYOUIWBYSA-DCPHZVHLSA-N Phe-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=CC=C3)N NEHSHYOUIWBYSA-DCPHZVHLSA-N 0.000 description 1
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 1
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 1
- HCTXJGRYAACKOB-SRVKXCTJSA-N Phe-Asn-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HCTXJGRYAACKOB-SRVKXCTJSA-N 0.000 description 1
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 1
- YEEFZOKPYOUXMX-KKUMJFAQSA-N Phe-Gln-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O YEEFZOKPYOUXMX-KKUMJFAQSA-N 0.000 description 1
- GDBOREPXIRKSEQ-FHWLQOOXSA-N Phe-Gln-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GDBOREPXIRKSEQ-FHWLQOOXSA-N 0.000 description 1
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 1
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 1
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 1
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 1
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 1
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 1
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 1
- GHNVJQZQYKNTDX-HJWJTTGWSA-N Phe-Ile-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O GHNVJQZQYKNTDX-HJWJTTGWSA-N 0.000 description 1
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 1
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 1
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 1
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 1
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 1
- PTLMYJOMJLTMCB-KKUMJFAQSA-N Phe-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N PTLMYJOMJLTMCB-KKUMJFAQSA-N 0.000 description 1
- KAJLHCWRWDSROH-BZSNNMDCSA-N Phe-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 KAJLHCWRWDSROH-BZSNNMDCSA-N 0.000 description 1
- TXJJXEXCZBHDNA-ACRUOGEOSA-N Phe-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N TXJJXEXCZBHDNA-ACRUOGEOSA-N 0.000 description 1
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 1
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 1
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 1
- 108700001094 Plant Genes Proteins 0.000 description 1
- 241000013557 Plantaginaceae Species 0.000 description 1
- 108010064218 Poly (ADP-Ribose) Polymerase-1 Proteins 0.000 description 1
- 229920000776 Poly(Adenosine diphosphate-ribose) polymerase Polymers 0.000 description 1
- 239000004698 Polyethylene Substances 0.000 description 1
- 229920002594 Polyethylene Glycol 8000 Polymers 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 108010059820 Polygalacturonase Proteins 0.000 description 1
- 241000183024 Populus tremula Species 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 1
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- QBFONMUYNSNKIX-AVGNSLFASA-N Pro-Arg-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QBFONMUYNSNKIX-AVGNSLFASA-N 0.000 description 1
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 1
- WPQKSRHDTMRSJM-CIUDSAMLSA-N Pro-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 WPQKSRHDTMRSJM-CIUDSAMLSA-N 0.000 description 1
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 1
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- SSWJYJHXQOYTSP-SRVKXCTJSA-N Pro-His-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O SSWJYJHXQOYTSP-SRVKXCTJSA-N 0.000 description 1
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 1
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 1
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 1
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 1
- ZJXXCGZFYQQETF-CYDGBPFRSA-N Pro-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 ZJXXCGZFYQQETF-CYDGBPFRSA-N 0.000 description 1
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 1
- SVXXJYJCRNKDDE-AVGNSLFASA-N Pro-Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CN=CN1 SVXXJYJCRNKDDE-AVGNSLFASA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- RNEFESSBTOQSAC-DCAQKATOSA-N Pro-Ser-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O RNEFESSBTOQSAC-DCAQKATOSA-N 0.000 description 1
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 1
- AJJDPGVVNPUZCR-RHYQMDGZSA-N Pro-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1)O AJJDPGVVNPUZCR-RHYQMDGZSA-N 0.000 description 1
- RSTWKJFWBKFOFC-JYJNAYRXSA-N Pro-Trp-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O RSTWKJFWBKFOFC-JYJNAYRXSA-N 0.000 description 1
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 1
- QDDJNKWPTJHROJ-UFYCRDLUSA-N Pro-Tyr-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 QDDJNKWPTJHROJ-UFYCRDLUSA-N 0.000 description 1
- 239000005604 Prosulfuron Substances 0.000 description 1
- LTUNNEGNEKBSEH-UHFFFAOYSA-N Prosulfuron Chemical compound COC1=NC(C)=NC(NC(=O)NS(=O)(=O)C=2C(=CC=CC=2)CCC(F)(F)F)=N1 LTUNNEGNEKBSEH-UHFFFAOYSA-N 0.000 description 1
- CNILNQMBAHKMFS-UHFFFAOYSA-M Pyrithiobac-sodium Chemical compound [Na+].COC1=CC(OC)=NC(SC=2C(=C(Cl)C=CC=2)C([O-])=O)=N1 CNILNQMBAHKMFS-UHFFFAOYSA-M 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 239000013614 RNA sample Substances 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 240000000111 Saccharum officinarum Species 0.000 description 1
- 235000007201 Saccharum officinarum Nutrition 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- JJKSSJVYOVRJMZ-FXQIFTODSA-N Ser-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)CN=C(N)N JJKSSJVYOVRJMZ-FXQIFTODSA-N 0.000 description 1
- RZUOXAKGNHXZTB-GUBZILKMSA-N Ser-Arg-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O RZUOXAKGNHXZTB-GUBZILKMSA-N 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 1
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- WEQAYODCJHZSJZ-KKUMJFAQSA-N Ser-His-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 WEQAYODCJHZSJZ-KKUMJFAQSA-N 0.000 description 1
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 1
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 1
- SGZVZUCRAVSPKQ-FXQIFTODSA-N Ser-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N SGZVZUCRAVSPKQ-FXQIFTODSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 1
- 235000000338 Solanum laciniatum Nutrition 0.000 description 1
- 244000177152 Solanum laciniatum Species 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 208000035199 Tetraploidy Diseases 0.000 description 1
- 239000005623 Thifensulfuron-methyl Substances 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 1
- JMQUAZXYFAEOIH-XGEHTFHBSA-N Thr-Arg-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O JMQUAZXYFAEOIH-XGEHTFHBSA-N 0.000 description 1
- PKXHGEXFMIZSER-QTKMDUPCSA-N Thr-Arg-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PKXHGEXFMIZSER-QTKMDUPCSA-N 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- IRKWVRSEQFTGGV-VEVYYDQMSA-N Thr-Asn-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IRKWVRSEQFTGGV-VEVYYDQMSA-N 0.000 description 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 1
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 1
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 1
- FIFDDJFLNVAVMS-RHYQMDGZSA-N Thr-Leu-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O FIFDDJFLNVAVMS-RHYQMDGZSA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 1
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 1
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 1
- BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 description 1
- ABCLYRRGTZNIFU-BWAGICSOSA-N Thr-Tyr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O ABCLYRRGTZNIFU-BWAGICSOSA-N 0.000 description 1
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 description 1
- PEYSVKMXSLPQRU-FJHTZYQYSA-N Trp-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O PEYSVKMXSLPQRU-FJHTZYQYSA-N 0.000 description 1
- KZIQDVNORJKTMO-WDSOQIARSA-N Trp-Arg-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N KZIQDVNORJKTMO-WDSOQIARSA-N 0.000 description 1
- WACMTVIJWRNVSO-CWRNSKLLSA-N Trp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O WACMTVIJWRNVSO-CWRNSKLLSA-N 0.000 description 1
- PWPJLBWYRTVYQS-PMVMPFDFSA-N Trp-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PWPJLBWYRTVYQS-PMVMPFDFSA-N 0.000 description 1
- GEGYPBOPIGNZIF-CWRNSKLLSA-N Trp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O GEGYPBOPIGNZIF-CWRNSKLLSA-N 0.000 description 1
- GLNADSQYFUSGOU-GPTZEZBUSA-J Trypan blue Chemical compound [Na+].[Na+].[Na+].[Na+].C1=C(S([O-])(=O)=O)C=C2C=C(S([O-])(=O)=O)C(/N=N/C3=CC=C(C=C3C)C=3C=C(C(=CC=3)\N=N\C=3C(=CC4=CC(=CC(N)=C4C=3O)S([O-])(=O)=O)S([O-])(=O)=O)C)=C(O)C2=C1N GLNADSQYFUSGOU-GPTZEZBUSA-J 0.000 description 1
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 1
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 1
- CWVHKVVKAQIJKY-ACRUOGEOSA-N Tyr-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N CWVHKVVKAQIJKY-ACRUOGEOSA-N 0.000 description 1
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 1
- PSALWJCUIAQKFW-ACRUOGEOSA-N Tyr-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N PSALWJCUIAQKFW-ACRUOGEOSA-N 0.000 description 1
- JXGUUJMPCRXMSO-HJOGWXRNSA-N Tyr-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 JXGUUJMPCRXMSO-HJOGWXRNSA-N 0.000 description 1
- VBFVQTPETKJCQW-RPTUDFQQSA-N Tyr-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VBFVQTPETKJCQW-RPTUDFQQSA-N 0.000 description 1
- QHONGSVIVOFKAC-ULQDDVLXSA-N Tyr-Pro-His Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QHONGSVIVOFKAC-ULQDDVLXSA-N 0.000 description 1
- RCMWNNJFKNDKQR-UFYCRDLUSA-N Tyr-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 RCMWNNJFKNDKQR-UFYCRDLUSA-N 0.000 description 1
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 1
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 1
- SQUMHUZLJDUROQ-YDHLFZDLSA-N Tyr-Val-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O SQUMHUZLJDUROQ-YDHLFZDLSA-N 0.000 description 1
- ABSXSJZNRAQDDI-KJEVXHAQSA-N Tyr-Val-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ABSXSJZNRAQDDI-KJEVXHAQSA-N 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 1
- IQQYYFPCWKWUHW-YDHLFZDLSA-N Val-Asn-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N IQQYYFPCWKWUHW-YDHLFZDLSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- ZQGPWORGSNRQLN-NHCYSSNCSA-N Val-Asp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZQGPWORGSNRQLN-NHCYSSNCSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- OJOMXGVLFKYDKP-QXEWZRGKSA-N Val-Met-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OJOMXGVLFKYDKP-QXEWZRGKSA-N 0.000 description 1
- OFQGGTGZTOTLGH-NHCYSSNCSA-N Val-Met-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N OFQGGTGZTOTLGH-NHCYSSNCSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- HVRRJRMULCPNRO-BZSNNMDCSA-N Val-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 HVRRJRMULCPNRO-BZSNNMDCSA-N 0.000 description 1
- SUGRIIAOLCDLBD-ZOBUZTSGSA-N Val-Trp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SUGRIIAOLCDLBD-ZOBUZTSGSA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- VTIAEOKFUJJBTC-YDHLFZDLSA-N Val-Tyr-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VTIAEOKFUJJBTC-YDHLFZDLSA-N 0.000 description 1
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 208000036142 Viral infection Diseases 0.000 description 1
- HMNZFMSWFCAGGW-XPWSMXQVSA-N [3-[hydroxy(2-hydroxyethoxy)phosphoryl]oxy-2-[(e)-octadec-9-enoyl]oxypropyl] (e)-octadec-9-enoate Chemical compound CCCCCCCC\C=C\CCCCCCCC(=O)OCC(COP(O)(=O)OCCO)OC(=O)CCCCCCC\C=C\CCCCCCCC HMNZFMSWFCAGGW-XPWSMXQVSA-N 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 108010060499 acharan sulfate lyase 1 Proteins 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 101150063416 add gene Proteins 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 1
- 230000010310 bacterial transformation Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- XMQFTWRPUQYINF-UHFFFAOYSA-N bensulfuron-methyl Chemical group COC(=O)C1=CC=CC=C1CS(=O)(=O)NC(=O)NC1=NC(OC)=CC(OC)=N1 XMQFTWRPUQYINF-UHFFFAOYSA-N 0.000 description 1
- 238000007622 bioinformatic analysis Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- FUHMZYWBSHTEDZ-UHFFFAOYSA-M bispyribac-sodium Chemical compound [Na+].COC1=CC(OC)=NC(OC=2C(=C(OC=3N=C(OC)C=C(OC)N=3)C=CC=2)C([O-])=O)=N1 FUHMZYWBSHTEDZ-UHFFFAOYSA-M 0.000 description 1
- 210000004781 brain capillary Anatomy 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- FPPNZSSZRUTDAP-UWFZAAFLSA-N carbenicillin Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)C(C(O)=O)C1=CC=CC=C1 FPPNZSSZRUTDAP-UWFZAAFLSA-N 0.000 description 1
- 229960003669 carbenicillin Drugs 0.000 description 1
- 108020001778 catalytic domains Proteins 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000022131 cell cycle Effects 0.000 description 1
- 230000018486 cell cycle phase Effects 0.000 description 1
- 230000003833 cell viability Effects 0.000 description 1
- 108091092356 cellular DNA Proteins 0.000 description 1
- 108091092328 cellular RNA Proteins 0.000 description 1
- 230000010001 cellular homeostasis Effects 0.000 description 1
- 229940106157 cellulase Drugs 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000013043 chemical agent Substances 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- NSWAMPCUPHPTTC-UHFFFAOYSA-N chlorimuron-ethyl Chemical group CCOC(=O)C1=CC=CC=C1S(=O)(=O)NC(=O)NC1=NC(Cl)=CC(OC)=N1 NSWAMPCUPHPTTC-UHFFFAOYSA-N 0.000 description 1
- 210000003763 chloroplast Anatomy 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- BIKACRYIQSLICJ-UHFFFAOYSA-N cloransulam-methyl Chemical group N=1N2C(OCC)=NC(F)=CC2=NC=1S(=O)(=O)NC1=C(Cl)C=CC=C1C(=O)OC BIKACRYIQSLICJ-UHFFFAOYSA-N 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 238000002247 constant time method Methods 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- 238000012136 culture method Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000018044 dehydration Effects 0.000 description 1
- 238000006297 dehydration reaction Methods 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 238000002716 delivery method Methods 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 235000019621 digestibility Nutrition 0.000 description 1
- IZEKFCXSFNUWAM-UHFFFAOYSA-N dipyridamole Chemical compound C=12N=C(N(CCO)CCO)N=C(N3CCCCC3)C2=NC(N(CCO)CCO)=NC=1N1CCCCC1 IZEKFCXSFNUWAM-UHFFFAOYSA-N 0.000 description 1
- 229960002768 dipyridamole Drugs 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 235000021186 dishes Nutrition 0.000 description 1
- 230000003828 downregulation Effects 0.000 description 1
- 230000024346 drought recovery Effects 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 230000005684 electric field Effects 0.000 description 1
- 238000002635 electroconvulsive therapy Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 210000002308 embryonic cell Anatomy 0.000 description 1
- 210000002889 endothelial cell Anatomy 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000006862 enzymatic digestion Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 108010093305 exopolygalacturonase Proteins 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 239000000796 flavoring agent Substances 0.000 description 1
- 235000019634 flavors Nutrition 0.000 description 1
- 238000000799 fluorescence microscopy Methods 0.000 description 1
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 1
- FRIPRWYKBIOZJU-UHFFFAOYSA-N fluorone Chemical compound C1=CC=C2OC3=CC(=O)C=CC3=CC2=C1 FRIPRWYKBIOZJU-UHFFFAOYSA-N 0.000 description 1
- 230000037433 frameshift Effects 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000001476 gene delivery Methods 0.000 description 1
- 238000012637 gene transfection Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 230000008642 heat stress Effects 0.000 description 1
- 229910001385 heavy metal Inorganic materials 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 235000003642 hunger Nutrition 0.000 description 1
- 230000015784 hyperosmotic salinity response Effects 0.000 description 1
- 208000006278 hypochromic anemia Diseases 0.000 description 1
- 238000007901 in situ hybridization Methods 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 229920005610 lignin Polymers 0.000 description 1
- 230000004777 loss-of-function mutation Effects 0.000 description 1
- 108010010679 lysyl-valyl-leucyl-aspartic acid Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 239000001115 mace Substances 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 238000011880 melting curve analysis Methods 0.000 description 1
- 230000000442 meristematic effect Effects 0.000 description 1
- 231100000783 metal toxicity Toxicity 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- ZTYVMAQSHCZXLF-UHFFFAOYSA-N methyl 2-[[4,6-bis(difluoromethoxy)pyrimidin-2-yl]carbamoylsulfamoyl]benzoate Chemical group COC(=O)C1=CC=CC=C1S(=O)(=O)NC(=O)NC1=NC(OC(F)F)=CC(OC(F)F)=N1 ZTYVMAQSHCZXLF-UHFFFAOYSA-N 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 231100000668 minimum lethal dose Toxicity 0.000 description 1
- 239000002539 nanocarrier Substances 0.000 description 1
- 238000001426 native polyacrylamide gel electrophoresis Methods 0.000 description 1
- 238000007857 nested PCR Methods 0.000 description 1
- RTCOGUMHFFWOJV-UHFFFAOYSA-N nicosulfuron Chemical compound COC1=CC(OC)=NC(NC(=O)NS(=O)(=O)C=2C(=CC=CN=2)C(=O)N(C)C)=N1 RTCOGUMHFFWOJV-UHFFFAOYSA-N 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 230000005257 nucleotidylation Effects 0.000 description 1
- 235000018343 nutrient deficiency Nutrition 0.000 description 1
- ONDBSQWZWRGVIL-UHFFFAOYSA-N o-pyrimidin-2-yl benzenecarbothioate Chemical class C=1C=CC=CC=1C(=S)OC1=NC=CC=N1 ONDBSQWZWRGVIL-UHFFFAOYSA-N 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 230000003204 osmotic effect Effects 0.000 description 1
- 230000008723 osmotic stress Effects 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 244000045947 parasite Species 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000009340 pathogen transmission Effects 0.000 description 1
- 239000000575 pesticide Substances 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 230000005080 plant death Effects 0.000 description 1
- 230000008635 plant growth Effects 0.000 description 1
- 239000005648 plant growth regulator Substances 0.000 description 1
- 230000037039 plant physiology Effects 0.000 description 1
- 238000004161 plant tissue culture Methods 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 231100000614 poison Toxicity 0.000 description 1
- 239000002574 poison Substances 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 102000054765 polymorphisms of proteins Human genes 0.000 description 1
- 235000012015 potatoes Nutrition 0.000 description 1
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- OCNPXKLQSGAGKT-UHFFFAOYSA-N propyl benzenesulfonate Chemical compound CCCOS(=O)(=O)C1=CC=CC=C1 OCNPXKLQSGAGKT-UHFFFAOYSA-N 0.000 description 1
- 238000004451 qualitative analysis Methods 0.000 description 1
- 230000005855 radiation Effects 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 108010054624 red fluorescent protein Proteins 0.000 description 1
- 102000037983 regulatory factors Human genes 0.000 description 1
- 108091008025 regulatory factors Proteins 0.000 description 1
- 230000033458 reproduction Effects 0.000 description 1
- 230000001850 reproductive effect Effects 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- JQXXHWHPUNPDRT-WLSIYKJHSA-N rifampicin Chemical compound O([C@](C1=O)(C)O/C=C/[C@@H]([C@H]([C@@H](OC(C)=O)[C@H](C)[C@H](O)[C@H](C)[C@@H](O)[C@@H](C)\C=C\C=C(C)/C(=O)NC=2C(O)=C3C([O-])=C4C)C)OC)C4=C1C3=C(O)C=2\C=N\N1CC[NH+](C)CC1 JQXXHWHPUNPDRT-WLSIYKJHSA-N 0.000 description 1
- 229960001225 rifampicin Drugs 0.000 description 1
- 230000021749 root development Effects 0.000 description 1
- 101150071322 ruvC gene Proteins 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000013515 script Methods 0.000 description 1
- 230000021217 seedling development Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 108010015840 seryl-prolyl-lysyl-lysine Proteins 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 230000011869 shoot development Effects 0.000 description 1
- 230000037432 silent mutation Effects 0.000 description 1
- HBMJWWWQQXIZIP-UHFFFAOYSA-N silicon carbide Chemical compound [Si+]#[C-] HBMJWWWQQXIZIP-UHFFFAOYSA-N 0.000 description 1
- 229910010271 silicon carbide Inorganic materials 0.000 description 1
- 230000033443 single strand break repair Effects 0.000 description 1
- 230000003007 single stranded DNA break Effects 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 210000001082 somatic cell Anatomy 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 230000037351 starvation Effects 0.000 description 1
- HXJUTPCZVOIRIF-UHFFFAOYSA-N sulfolane Chemical compound O=S1(=O)CCCC1 HXJUTPCZVOIRIF-UHFFFAOYSA-N 0.000 description 1
- ZDXMLEQEMNLCQG-UHFFFAOYSA-N sulfometuron methyl Chemical group COC(=O)C1=CC=CC=C1S(=O)(=O)NC(=O)NC1=NC(C)=CC(C)=N1 ZDXMLEQEMNLCQG-UHFFFAOYSA-N 0.000 description 1
- BDHFUVZGWQCTTF-UHFFFAOYSA-M sulfonate Chemical compound [O-]S(=O)=O BDHFUVZGWQCTTF-UHFFFAOYSA-M 0.000 description 1
- XTHPWXDJESJLNJ-UHFFFAOYSA-N sulfurochloridic acid Chemical compound OS(Cl)(=O)=O XTHPWXDJESJLNJ-UHFFFAOYSA-N 0.000 description 1
- 230000002459 sustained effect Effects 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 108700029760 synthetic LTSP Proteins 0.000 description 1
- AHTPATJNIAFOLR-UHFFFAOYSA-N thifensulfuron-methyl Chemical group S1C=CC(S(=O)(=O)NC(=O)NC=2N=C(OC)N=C(C)N=2)=C1C(=O)OC AHTPATJNIAFOLR-UHFFFAOYSA-N 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical class CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 1
- 239000003104 tissue culture media Substances 0.000 description 1
- 230000003614 tolerogenic effect Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 239000012096 transfection reagent Substances 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 238000011426 transformation method Methods 0.000 description 1
- 230000017105 transposition Effects 0.000 description 1
- XOPFESVZMSQIKC-UHFFFAOYSA-N triasulfuron Chemical compound COC1=NC(C)=NC(NC(=O)NS(=O)(=O)C=2C(=CC=CC=2)OCCCl)=N1 XOPFESVZMSQIKC-UHFFFAOYSA-N 0.000 description 1
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 239000005723 virus inoculator Substances 0.000 description 1
- 239000002023 wood Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8271—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
- C12N15/8274—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for herbicide resistance
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H1/00—Processes for modifying genotypes ; Plants characterised by associated natural traits
- A01H1/04—Processes of selection involving genotypic or phenotypic markers; Methods of using phenotypic markers for selection
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H1/00—Processes for modifying genotypes ; Plants characterised by associated natural traits
- A01H1/12—Processes for modifying agronomic input traits, e.g. crop yield
- A01H1/122—Processes for modifying agronomic input traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
- A01H1/123—Processes for modifying agronomic input traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for herbicide resistance
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H5/00—Angiosperms, i.e. flowering plants, characterised by their plant parts; Angiosperms characterised otherwise than by their botanic taxonomy
- A01H5/08—Fruits
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01H—NEW PLANTS OR NON-TRANSGENIC PROCESSES FOR OBTAINING THEM; PLANT REPRODUCTION BY TISSUE CULTURE TECHNIQUES
- A01H6/00—Angiosperms, i.e. flowering plants, characterised by their botanic taxonomy
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8201—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
- C12N15/8209—Selection, visualisation of transformants, reporter constructs, e.g. antibiotic resistance markers
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8201—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
- C12N15/8213—Targeted insertion of genes into the plant genome by homologous recombination
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Organic Chemistry (AREA)
- Chemical & Material Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Botany (AREA)
- Developmental Biology & Embryology (AREA)
- Environmental Sciences (AREA)
- Plant Pathology (AREA)
- Cell Biology (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Physiology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
Abstract
本发明提供了用于产生在靶序列中携带所需突变的香蕉胚和植物的改进的方法,包括在至少一种内源乙酰乳酸合酶(ALS)基因中引入至少一种突变。本发明还提供了香蕉植物和通过此类方法产生的产品。
Description
技术领域
本发明涉及用于产生在靶序列中包括所需突变的香蕉胚和植物的改进方法,以及通过此类方法产生的香蕉植物和产品。
背景技术
现代基因编辑技术为产生改良作物提供了重要机会,例如具有更高的害虫抗性和所需性状。基因编辑提供了产生改良的非转基因作物的可能性,所述作物不包含来自其他物种的任何异源序列,这是消费者和监管者所优选的。在现代基因编辑技术发展之前,曾使用涉及用携带源自其他物种的异源启动子和标记序列的大质粒进行转化的技术尝试来改良无性作物如香蕉(例如在Ganapathi et al.,Plant Cell Reports,2001,20:157-162中)。然而,整合来自其他物种的异源序列与非转基因植物的产生不相容,并且引入的启动子和基因可能具有不可预测的效果,所述效果甚至与相关内源基因的效果差异极大。
使用诸如CRISPR的技术在香蕉中进行非转基因基因组编辑存在数种挑战。这种作物是通过无性繁殖产生的,并且香蕉的现代品种(特别是主要出口品种“卡文迪什(Cavendish)”)是单性结实的;他们产生无籽水果。因此,不可能分离出任何引入的基因序列(例如选择性标记物或基因编辑机制),如使用有性繁殖作物物种进行的那样,以提供非转基因的编辑的植物系。相反,为了在香蕉中实现非转基因的基因组编辑,必须在转基因没有稳定整合到基因组中的情况下进行编辑。然而,这使得鉴定携带所需编辑的香蕉植物非常困难。可对数千种植物进行基因分型以鉴定缺乏任何转化质粒的经编辑品系,但这是非常劳动密集型、耗时且昂贵的。
已经提出了将突变引入不同作物物种的各种方法,例如在WO 2018/202199中。然而,转化和发育香蕉植物的方法与用于有性繁殖谷类作物(如小麦和玉米)的方法差异极大。
需要在香蕉中编辑基因的改进方法。
发明内容
本发明人已经开发了用于产生在靶序列中包括所需突变的香蕉胚和植物的改进方法。所述方法利用靶序列和内源乙酰乳酸合酶(ALS)基因的共同编辑与ALS抑制剂选择的组合来提供用于编辑香蕉基因的高度高效且有效的策略。
所述ALS基因编码在支链氨基酸生物合成途径中发挥功能的蛋白质。ALS蛋白被磺酰脲家族的数种除草剂不可逆地竞争性抑制。这些化合物是强效除草剂并且可用作有效的广谱除草剂。已经鉴定出数种ALS氨基酸取代(天然存在的和人工生成的),其增加了对磺酰脲化合物的耐受性。这些通常全部是形成活性位点口袋(pocket)并改变除草剂化合物的结合潜力的氨基酸残基。
在本发明的方法中,试剂(如碱基编辑器)被用来编辑一个或多个香蕉ALS基因,例如用丝氨酸取代肽序列中的脯氨酸,并且此类取代赋予对ALS抑制剂(例如磺酰脲化合物绿磺隆(chlorsulfuron))的抗性。包括这些精确编辑的香蕉细胞、胚和植物在用ALS抑制剂处理后存活,而野生型植物或胚被杀死。同时,一个或多个靶序列也被编辑(“共同编辑”),例如通过用所述试剂靶向所述靶序列。以这种方式转化香蕉细胞获得可用ALS抑制剂处理的植物材料,选择ALS编辑的存在,并显著富集在靶序列中存在所需突变的群体。
因此,在第一方面,本发明提供了产生在一个或多个靶序列中包括至少一种所需突变的至少一种香蕉胚或植物的方法,所述方法包括:
a.使香蕉细胞与以下试剂接触:可操作以在靶序列中引入至少一种所需突变的至少一种试剂和可操作以在至少一种内源乙酰乳酸合酶(ALS)基因中引入至少一种突变以便提供对ALS抑制剂的抗性的至少一种试剂;
b.任选地培养所述香蕉细胞以产生香蕉细胞群体、组织或植物;
c.用ALS抑制剂处理所述香蕉细胞、细胞群体、组织或植物以选择至少一种香蕉细胞、组织或植物,所述至少一种香蕉细胞、组织或植物在至少一种内源乙酰乳酸合酶基因中被突变并且在靶序列中包括所需突变;和
d.从步骤(c)中选择的所述香蕉细胞、组织或植物产生至少一种香蕉胚或植物。
本发明还提供了产生在靶序列中包括所需突变的至少一种香蕉胚或植物的方法,所述方法包括:
a.将香蕉细胞与以下试剂接触:可操作以在靶序列中引入所需突变的至少一种试剂和可操作以在内源乙酰乳酸合酶(ALS)基因中引入突变的至少一种试剂,所述在内源乙酰乳酸合酶(ALS)基因中的突变提供对ALS抑制剂的抗性;
b.任选地培养所述香蕉细胞以产生香蕉细胞群体、组织或植物;
c.用ALS抑制剂处理所述香蕉细胞、细胞群体、组织或植物以选择至少一种香蕉细胞、组织或植物,所述至少一种香蕉细胞、组织或植物包括在内源乙酰乳酸合酶基因中的突变并且包括在靶序列中的所需突变;和
d.从步骤(c)中选择的所述香蕉细胞、组织或植物产生至少一种香蕉胚或植物。
实施例证明本发明的方法提供了在靶序列中富含所需突变的香蕉细胞群体、胚或香蕉植物。值得注意地,实施例证明将突变引入香蕉的内源ALS基因可有效提供对ALS抑制剂的抗性,并因此使用ALS抑制剂的选择可用于富集也在靶序列被编辑的香蕉胚和植物。实施例证明,没有必要纳入异源ALS序列或由强启动子驱动ALS序列的表达以在香蕉中提供对ALS抑制剂的抗性。本发明的方法特别有用,因为香蕉是无性的并且不可能培育香蕉以实现改进的性状或表型,并且不可能育出任何引入的序列。
与诸如小麦的作物不同,香蕉包括两个内源ALS基因-ALS1和ALS2。值得注意地,实施例表明,在ALS1或ALS2中以及在一个、两个或三个等位基因(香蕉是三倍体)引入突变,可以有效地提供对ALS抑制剂的抗性,并且实施例证明,没有必要突变所有的内源性ALS基因和等位基因。因此,本发明的方法可有效生成和选择在靶序列中包括所需突变的香蕉胚或植物,即使转化和编辑过程相对低效,因为在香蕉中在ALS1或ALS2任一处及在单个等位基因处的突变可提供对ALS抑制剂的抗性。
在优选的实施方案中,香蕉细胞是原生质体或胚性细胞,例如在原生质体细胞悬浮液中的原生质体或胚性细胞悬浮液中的胚性细胞。在此类方法中,步骤(a)包括接触原生质体细胞悬浮液或胚性细胞悬浮液。实施例证明,此类细胞和悬浮液可被高效且有效地与试剂(例如碱基编辑器)接触,并且可以同时编辑内源ALS基因和靶序列。
在优选的实施方案中,所述方法利用碱基编辑器并且所述可操作以引入突变的试剂包括碱基编辑器和向导RNA或包括编码碱基编辑器和向导RNA的核酸构建体。碱基编辑器特别适合同时编辑内源ALS基因和靶序列,而不需要纳入任何异源DNA,如实施例中所示。下面进一步讨论特别优选的碱基编辑器。
在某些实施方案中,所述方法包括在内源ALS1基因中引入突变。本发明人已经鉴定在香蕉中ALS1与ALS2相比以更高水平表达,因此突变所述内源ALS1基因可能提供增加的对ALS抑制剂的抗性。在某些实施方案中,所述方法不包括在内源ALS2基因中引入突变,并且例如,所述试剂不包括任何序列,例如靶向内源ALS2基因的向导RNA。
在某些实施方案中,所述方法包括在内源ALS2基因中引入突变。本发明人已鉴定在香蕉中ALS2与ALS1相比以更低水平表达,因此令人吃惊的是ALS2的突变足以提供对ALS抑制剂的抗性并允许选择成功编辑的香蕉胚或植物。突变内源ALS2基因可允许对在ALS2基因的多个等位基因处和所述靶序列的多个等位基因处被编辑的胚或植物的改进的选择,因为内源ALS2的较低的表达水平意味着具有单次编辑的胚和植物可对ALS抑制剂具有有限的抗性,这可允许增加ALS抑制剂处理以用于选择多次编辑。此外,突变内源ALS2基因可提供改进的选择以产生非转基因的、性状编辑的植物。此外,由于与ALS1相比,ALS2以较低水平表达,这可意味着ALS1是生物合成支链氨基酸的主要来源,因此突变ALS2将在较小程度上干扰细胞稳态。在某些实施方案中,所述方法不包括在内源ALS1基因中引入突变,并且例如,所述试剂不包括任何序列,例如靶向内源ALS1基因的向导RNA。
在某些实施方案中,所述方法包括在内源ALS1基因和内源ALS2基因两者中引入突变。此类方法可允许使用采用ALS抑制剂进行的更积极的选择(aggressive selection),以产生更高度富集所述靶序列中的所需突变的胚和植物群体。
在优选的实施方案中,步骤(c)包括在ALS抑制剂存在下孵育胚性细胞悬浮液或培养胚。实施例表明,在香蕉中可在胚性细胞悬浮液或胚阶段用ALS抑制剂选择,这可以提供更快、更高效的过程,因为没有必要等到幼苗(plantlet)发育。ALS是一种叶绿体定域酶,胚性细胞悬浮液和胚是非绿色组织,因此令人吃惊的是,在胚性细胞悬浮液或胚阶段用ALS抑制剂选择是有效的。在某些实施方案中,所述方法不包括用ALS抑制剂处理幼苗或植物。在胚阶段以及幼苗或植物阶段的ALS抑制剂处理可能会不必要地抑制生长和发育,而不会改善对共同编辑的植物的选择,例如因为在一或两个等位基因处的突变仅一个ALS基因可能不会提供对ALS抑制剂的功能性耐受(因为可能提供启动子驱动的异源ALS序列的过表达),因此共同编辑的植物仍然可能容易受持续的ALS处理的影响。在优选的实施方案中,所述方法包括用ALS抑制剂处理香蕉细胞并选择至少一种香蕉细胞,例如胚,其包括在内源乙酰乳酸合酶基因中的突变并且包括在靶序列中的所需突变。在某些实施方案中,所述方法包括在用ALS抑制剂处理后选择至少一种香蕉细胞,例如胚,然后在没有用ALS抑制剂进一步处理的情况下发育香蕉植物。在某些实施方案中,在组织培养之前进行步骤(c)。在某些实施方案中,步骤(a)包括接触香蕉胚性细胞悬浮液并且在组织培养之前进行步骤(c)。
在某些实施方案中,所述方法包括通过在包含ALS抑制剂的胚发育培养基中培养胚,例如用来自胚性细胞悬浮液(ECS)的细胞开始,用ALS抑制剂处理香蕉胚。在某些此类实施方案中,所述方法包括随后将胚转移至不包含ALS抑制剂的胚发育培养基中。本发明人已发现,选择后转移胚至不含ALS抑制剂的不同培养基有助于所选择的胚的发育。如上文所述,这可能因为在一或两个等位基因处突变仅一个ALS基因可能不会提供对ALS抑制剂的完全抗性(因为可能提供启动子驱动的异源ALS序列的过表达),因此共同编辑的植物仍然可能容易受持续的ALS处理的影响。
在某些实施方案中,本发明包括选择对用ALS抑制剂的处理具有抗性的至少一种香蕉细胞,例如胚、香蕉组织或香蕉植物,并将其与不具有抗性的细胞、组织或植物物理分离。在某些实施方案中,所述至少一种选择的细胞、组织或植物被转移至单独的板或生长培养基中,优选地不含ALS抑制剂。
在某些实施方案中,所述方法包括选择对ALS抑制剂的处理具有抗性的至少一种香蕉细胞,例如胚、香蕉组织或香蕉植物,并且随后选择还包括在靶序列中的所需突变的至少一种香蕉细胞,例如胚、香蕉组织或香蕉植物,例如通过对所述细胞、组织或植物进行基因分型。
在某些实施方案中,所述方法包括用ALS抑制剂处理香蕉胚或细胞群体,并且对抗性胚或细胞的选择包括将发育的胚与未发育的胚和细胞团(cell mass)分离,和任选地在新培养基上培养发育的胚。本发明人已经确定,发育的胚与未发育的胚和细胞团的物理分离改善了目标胚的选择/发育。ALS处理延迟香蕉胚的发育,但不必须杀死香蕉胚。因此,如果不将发育的胚与未发育的胚和细胞团分开,并且允许发育继续,那么区分抗性胚就变得更困难。这对于在香蕉中的选择和使用内源ALS基因的突变尤其重要,其中ALS抗性可能不完全。此外,分离胚可能减少胚嵌合。
在某些实施方案中,抗性胚被转移至多种不同的培养基,例如转移至第一培养基,然后转移至第二培养基。在某些实施方案中,所述第一培养基包含ALS抑制剂而所述第二培养基不包含(或反之亦然)。
在某些实施方案中,所述ALS抑制剂是磺酰脲,优选地绿磺隆。实施例证明,绿磺隆可有效用于香蕉。
在某些实施方案中,所述试剂包括核酸构建体并且所述接触包括农杆菌(Agrobacterium)介导的转化。实施例证明,农杆菌介导的转化对于在香蕉中的内源ALS基因和靶序列中引入突变特别高效且有效。
在某些实施方案中,步骤(c)包括用ALS抑制剂处理香蕉细胞,并且所述处理提供了香蕉细胞群体,并且所述群体中至少90%(例如至少95%、96%、97%、98%或99%)的细胞包括内源ALS1基因或所述内源ALS2基因或两者中的突变。实施例证明,本发明的方法可有效用于提供高度富集具有成功编辑事件的细胞的群体。
在某些实施方案中,所述方法不导致任何异源DNA的整合。在某些实施方案中,通过所述方法产生的所述至少一种香蕉胚或植物不包括异源DNA。实施例证明,使用本发明的方法可高效地引入所需突变,而不会整合任何异源DNA,这对种植者和消费者有吸引力,并且代表极大地减少了种植者希望获得某些类型的监管批准的潜在问题。
在优选的实施方案中,所述香蕉细胞是同源三倍体小果野蕉(Musa acuminata)“卡文迪什”亚群(subgroup)的香蕉细胞。实施例证明了对卡文迪什香蕉细胞和植物的有效编辑。在优选的实施方案中,所述香蕉细胞来自Grand Nain栽培种。
在某些实施方案中,所述方法包括在任一或两个内源ALS基因的一个或两个等位基因中引入突变以产生杂合香蕉植物。实施例证明,对ALS抑制剂的抗性可以通过仅突变任一或两个内源ALS基因的一个或两个等位基因案来提供。在某些实施方案中,所需突变被引入到靶序列的一个或两个等位基因中。
在某些实施方案中,所述方法包括将突变引入到内源ALS1基因的每个等位基因、或内源ALS2基因的每个等位基因、或内源ALS1基因和内源ALS2基因两者的每个等位基因中。在某些实施方案中,所需突变被引入到靶序列的每个等位基因中。实施例证明,本发明的方法可有效产生ALS抗性和所需突变纯合型的香蕉植物。
在优选的实施方案中,所述方法进一步包括从步骤(c)中选择的所述至少一种细胞或组织生长香蕉植物。在优选的实施方案中,所述方法进一步包括从所述香蕉植物收获果实并任选地将果实加工成食品产品。本发明所述的香蕉植物、香蕉果实和香蕉果实产品有利地携带在靶序列中的所需突变以及在内源ALS基因中的提供对ALS抑制剂的抗性的突变。在优选的实施方案中,所述香蕉植物、香蕉果实和香蕉果实产品不包括任何异源或引入的DNA。
在另一方面,其可与本文描述的任何实施方案组合,本发明提供了产生包括所需序列的至少一种香蕉胚或植物的方法,所述方法包括:
a.使香蕉细胞与包括所需序列和编码突变的香蕉乙酰乳酸合酶基因的序列的构建体接触,所述突变的香蕉乙酰乳酸合酶基因提供对ALS抑制剂的抗性;
b.任选地培养所述香蕉细胞以产生香蕉细胞群体、组织或植物;和
c.用ALS抑制剂处理所述香蕉细胞、细胞群体、组织或植物以选择至少一种香蕉细胞、组织或植物,所述至少一种香蕉细胞、组织或植物包括突变的乙酰乳酸合酶基因并包括所需序列;和
d.从步骤(c)中选择的香蕉细胞、组织或植物产生至少一种香蕉胚或植物。
实施例证明,突变的香蕉ALS基因可有效提供对ALS抑制剂的抗性,这使它们成为在转基因香蕉中有用的选择性标记物。优选地,所需序列和ALS基因都是香蕉序列,以把引入基因组中的异源DNA减至最少。在某些实施方案中,所需序列和ALS基因中的至少一种是香蕉序列。
在另一方面,其可与本文描述的任何实施方案组合,本发明提供产生对ALS抑制剂有抗性的至少一种香蕉胚或植物的方法,所述方法包括:
a.使香蕉细胞与包括编码突变的香蕉乙酰乳酸合酶基因的序列的构建体接触,所述突变的香蕉乙酰乳酸合酶基因提供对ALS抑制剂的抗性,或与可操作以在内源乙酰乳酸合酶基因中引入突变的试剂接触,所述在内源乙酰乳酸合酶基因中的突变提供对ALS抑制剂的抗性;
b.任选地培养所述香蕉细胞以产生香蕉细胞群体、组织或植物;
e.用ALS抑制剂处理所述香蕉细胞、细胞群体、组织或植物以选择至少一种香蕉细胞、组织或植物,所述至少一种香蕉细胞、组织或植物对ALS抑制剂有抗性并且包括所述突变的乙酰乳酸合酶基因或所述在内源乙酰乳酸合酶基因中的突变;和
c.从步骤(c)中选择的香蕉细胞、组织或植物产生至少一种香蕉胚或植物。
本发明还提供了包括编码突变的香蕉ALS基因的序列的香蕉细胞、香蕉胚或香蕉植物,所述突变的香蕉ALS基因提供对ALS抑制剂的抗性。本发明还提供了栽培香蕉植物的方法,包括将作物与ALS抑制剂接触以减少其他植物的生长,其中所述香蕉植物包括编码突变的香蕉乙酰乳酸合酶基因的序列,所述突变的香蕉乙酰乳酸合酶基因提供对ALS抑制剂的抗性,任选地其中香蕉植物包括在内源乙酰乳酸合酶基因的突变,所述突变提供对ALS抑制剂的抗性。实施例证明,突变的香蕉ALS基因可有效提供对ALS抑制剂的抗性,其允许产生对ALS抑制剂有抗性的香蕉植物。突变内源ALS序列,或者引入突变的香蕉序列,以把引入基因组的所述异源DNA减至最少。
在一个优选的实施方案中,本发明提供了产生至少一种香蕉胚或植物的方法,所述至少一种香蕉胚或植物包括在一个或多个靶序列中的至少一种所需突变,所述方法包括:
a.使香蕉细胞与至少一种核酸构建体接触,所述核酸构建体编码至少一种碱基编辑器、可操作以在靶序列中引入至少一种所需突变的至少一种向导RNA、和可操作以在内源乙酰乳酸合酶(ALS)基因中引入突变的至少一种向导RNA,所述在内源乙酰乳酸合酶(ALS)基因中的突变提供对绿磺隆的抗性,其中所述在内源ALS基因中的突变导致以下取代:ALS1中的Pro187Ser或ALS2中的Pro181Ser或两者;
b.任选地培养所述香蕉细胞以产生香蕉细胞群体、组织或植物;和
c.用绿磺隆处理所述香蕉细胞、细胞群体、组织或植物以选择至少一种香蕉细胞、组织或植物,所述至少一种香蕉细胞、组织或植物包括在内源乙酰乳酸合酶基因中的突变并且包括在靶序列中的所需突变;和
d.从步骤(c)中被选择的香蕉细胞、组织或植物产生至少一种香蕉胚或植物。
在一个优选的实施方案中,本发明提供了产生在靶序列中包括至少一种所需突变的至少一种香蕉胚或植物的方法,所述方法包括:
a.使香蕉细胞与以下核酸构建体接触:编码碱基编辑器和可操作以在靶序列中引入至少一种所需突变的向导RNA的核酸构建体,以及编码碱基编辑器和可操作以在内源乙酰乳酸合酶(ALS)基因中引入突变的向导RNA的核酸构建体,所述在内源乙酰乳酸合酶(ALS)基因中的突变提供对绿磺隆的抗性,其中所述在内源ALS基因中的突变导致以下取代:ALS1中的Pro187Ser或ALS2中的Pro181Ser;
b.任选地培养所述香蕉细胞以产生香蕉细胞群体、组织或植物;和
c.用绿磺隆处理所述香蕉细胞、细胞群体、组织或植物以选择至少一种香蕉细胞、组织或植物,所述至少一种香蕉细胞、组织或植物包括在内源乙酰乳酸合酶基因中的突变并且包括在靶序列中的所需突变;和
d.从步骤(c)中选择的所述香蕉细胞、组织或植物产生至少一种香蕉胚或植物。
在一个优选的实施方案中,本发明提供了产生至少一种香蕉胚或植物的方法,所述至少一种香蕉胚或植物在靶序列中包括至少一种所需突变,所述方法包括:
a.使香蕉胚性细胞悬浮液与一种或多种农杆菌T-DNA质粒接触,所述农杆菌T-DNA质粒编码可操作以在靶序列中引入至少一种所需突变的至少一种试剂和可操作以在内源性乙酰乳酸合酶(ALS)基因中引入突变的至少一种试剂,所述在内源性乙酰乳酸合酶(ALS)基因中的突变提供对ALS抑制剂的抗性
b.培养所述香蕉胚性细胞悬液;
c.在包含ALS抑制剂的胚发育培养基上培养胚以选择包括在内源乙酰乳酸合酶基因中的突变并且包括在靶序列中的所需突变的至少一种胚,和
d.任选地培养所述至少一种胚以产生植物。
在一个特别优选的实施方案中,本发明提供了产生至少一种香蕉胚或植物的方法,所述至少一种香蕉胚或植物在靶序列中包括至少一种所需突变,所述方法包括:
a.使香蕉胚性细胞悬浮液与以下核酸构建体接触:编码碱基编辑器和可操作以在靶序列中引入至少一种所需突变的向导RNA的核酸构建体以及编码碱基编辑器和可操作以在内源乙酰乳酸合酶(ALS)中引入突变的向导RNA的核酸构建体,所述在内源乙酰乳酸合酶(ALS)中的突变提供对绿磺隆的抗性,其中一种或多种核酸构建体存在于一个或多个农杆菌T-DNA质粒中,并且其中所述在内源ALS基因中的突变导致以下取代:ALS1中的Pro187Ser或ALS2中的Pro181Ser;
b.培养所述香蕉胚性细胞悬液;
c.在包含绿磺隆的胚发育培养基上培养胚以选择包括在内源乙酰乳酸合酶基因中的突变并且包括在靶序列中的所需突变的至少一种胚,和
d.任选地培养所述至少一种胚以产生植物。
在一个特别优选的实施方案中,本发明提供了产生至少一种香蕉胚或植物的方法,所述至少一种香蕉胚或植物在靶序列中包括至少一种所需突变,所述方法包括:
a.使香蕉胚性细胞悬浮液与以下核酸构建体接触:编码碱基编辑器和可操作以在靶序列中引入至少一种所需突变的向导RNA的核酸构建体以及编码碱基编辑器和可操作以在内源乙酰乳酸合酶(ALS)中引入突变的向导RNA的核酸构建体接触,所述在内源乙酰乳酸合酶(ALS)中的突变提供对绿磺隆的抗性,其中一种或多种核酸构建体存在于一个或多个农杆菌T-DNA质粒中,并且其中所述在内源ALS基因中的突变导致以下取代:ALS1中的Pro187Ser或ALS2中的Pro181Ser;
b.培养所述香蕉胚性细胞悬浮液;
c.在包含绿磺隆的胚发育培养基上培养胚以选择包括在内源乙酰乳酸合酶基因中的突变并且包括在靶序列中的所需突变的至少一种胚,其中所述胚不包括异源DNA;和
d.任选地培养所述至少一种胚以产生植物。
在一个特别优选的实施方案中,本发明提供了产生至少一种香蕉胚或植物的方法,所述至少一种香蕉胚或植物在靶序列中包括至少一种所需突变,所述方法包括:
a.使香蕉胚性细胞悬浮液与以下构建体接触:编码碱基编辑器和可操作以在靶序列中引入至少一种所需突变的向导RNA的核酸构建体以及编码碱基编辑器和可操作以在内源乙酰乳酸合酶(ALS)中引入突变的向导RNA的核酸构建体,所述在内源乙酰乳酸合酶(ALS)中的突变提供对绿磺隆的抗性,其中一种或多种核酸构建体存在于一个或多个农杆菌T-DNA质粒中,并且其中所述在所述内源ALS基因中的突变导致以下取代:ALS1中的Pro187Ser或ALS2中的Pro181Ser;
b.培养所述香蕉胚性细胞悬浮液;
c.在包含绿磺隆的胚发育培养基上培养胚以选择包括在内源乙酰乳酸合酶基因中的突变并且包括在靶序列中的所需突变的胚群体,其中至少1%、2%、3%、4%、5%或10%的胚不包含异源DNA;和
d.任选地培养至少一种胚以产生植物。
因此,本发明提供以下编号的实施方案:
1.一种产生在一个或多个靶序列中包括至少一种所需突变的至少一种香蕉胚或植物的方法,所述方法包括:
a.使香蕉细胞与以下试剂接触:可操作以在靶序列中引入至少一种所需突变的至少一种试剂和可操作以在至少一种内源乙酰乳酸合酶(ALS)基因中引入至少一种突变以便提供对ALS抑制剂的抗性的至少一种试剂;
b.任选地培养所述香蕉细胞以产生香蕉细胞群体、组织或植物;和
c.用ALS抑制剂处理所述香蕉细胞、细胞群体、组织或植物以选择在至少一种内源乙酰乳酸合酶基因中被突变并且包括在靶序列中的所需突变的至少一种香蕉细胞、组织或植物;和
d.从步骤(c)中选择的香蕉细胞、组织或植物产生至少一种香蕉胚或植物。
2.根据实施方案1所述的方法,其中所述香蕉细胞是原生质体或胚性细胞,例如其中所述香蕉细胞是原生质体细胞悬浮液中的原生质体或胚性细胞悬浮液中的胚性细胞,并且步骤(a)包括接触所述原生质体细胞悬浮液或胚性细胞悬浮液。
3.根据前述实施方案中任一项所述的方法,其中所述可操作以引入突变的试剂包括碱基编辑器和向导RNA或包括编码碱基编辑器和向导RNA的至少一种核酸构建体。
4.根据实施方案3所述的方法,其中所述碱基编辑器是融合多肽,所述融合多肽包括修饰的Cas,例如nCas9或dCas9,和胞苷脱氨酶部分或腺嘌呤脱氨酶部分,例如选自以下的碱基编辑器:APOBEC、BE1、BE2、BE3、HF-BE3、BE4、BE4max、BE4-GAM、YE1-BE3、EE-BE3、YE-BE3、YEE-BE3、VQR-BE3、VRER-BE3、Sa-BE3、Sa-BE4、SaBE4-Gam、SaKKH-BE3、Cas12a-BE、Target-AID、Target-AID-NG、xBE3、eA3A-BE3、A3A-BE3、BE-PLUS、TAM、CRISPR-X、ABE7.9、ABE7.10、ABE7.10*、xABE、ABESa、VQR-ABE、VRER-ABE和SaKKH-ABE。
5.根据实施方案1或实施方案2所述的方法,其中所述可操作以引入突变的试剂包括序列特异性核酸内切酶,任选地向导RNA和任选地供体模板,或包括编码序列特异性核酸内切酶、任选地向导RNA和任选地供体模板的至少一种核酸构建体。
6.根据实施方案5所述的方法,其中所述序列特异性核酸内切酶是大范围核酸酶、锌指核酸酶(ZFN)、转录激活子样效应子核酸酶(TALEN)、归巢核酸内切酶或CRISPR相关核酸内切酶,例如修饰的CRISPR相关核酸内切酶。
7.根据任一项前述实施方案所述的方法,其中所述可操作以在靶序列中引入至少一种所需突变的试剂包括靶向靶序列的向导RNA,和所述可操作以在内源ALS基因中引入突变的试剂包括靶向内源ALS1基因或内源ALS2基因或两者的向导RNA,或其中所述可操作以在靶序列中引入至少一种所需突变的试剂包括编码靶向靶序列的向导RNA的至少一种核酸构建体,和所述可操作以在内源ALS基因中引入突变的试剂包括编码靶向内源ALS1基因或内源ALS2基因或两者的向导RNA的至少一种核酸构建体。
8.根据任一项前述实施方案所述的方法,其中所述可操作以引入突变的试剂是存在于一个或多个(例如两个)质粒上的构建体。
9.根据任一项前述实施方案所述的方法,其中所述试剂包括核酸构建体并且所述接触包括农杆菌介导的转化、粒子轰击、原生质体转化、纳米粒子介导的转染或电穿孔。
10.根据任一项前述实施方案所述的方法,其中所述可操作以引入突变的试剂作为核糖核蛋白复合物被递送至香蕉细胞。
11.根据任一项前述实施方案所述的方法,其中步骤(c)包括在ALS抑制剂的存在下孵育胚性细胞悬浮液,或步骤(c)包括在包含ALS抑制剂的胚发育培养基(EDM)上培养胚。
12.根据实施方案11所述的方法,其中步骤(c)包括在包含ALS抑制剂胚性细胞悬浮液的胚发育培养基上培养胚,持续8-20周,例如8-16、10-16、10-14、12-14或约13周。
13.根据实施方案11或12所述的方法,其中步骤(c)包括在包含ALS抑制剂胚性细胞悬浮液的胚发育培养基上培养胚,所述ALS抑制剂胚性细胞悬浮液的浓度为10-80μg/L,例如10-70、15-65、15-55、20-55、20-50、20-40、20-30或约25μg/L。
14.根据实施方案11、12或13所述的方法,其中步骤(c)包括在ALS抑制剂存在下孵育所述胚性细胞悬浮液,持续至少3天,例如至少5、10或15天,任选地持续小于30天,例如小于25、20、15或10天。
15.根据实施方案11-14中任一项所述的方法,其中步骤(c)包括在ALS抑制剂存在下孵育所述胚性细胞悬浮液,所述ALS抑制剂浓度为至少500μg/L,例如至少1、2、3、4或5mg/L。
16.根据任一项前述实施方案所述的方法,包括在用ALS抑制剂的处理后选择至少一种胚并在没有用ALS抑制剂的进一步处理的情况下发育香蕉植物。
17.根据任一项前述实施方案所述的方法,额外包括随后将胚转移至不包含ALS抑制剂的胚发育培养基中。
18.根据任一项前述实施方案所述的方法,包括选择对用ALS抑制剂的处理具有抗性的至少一种香蕉细胞,例如胚、香蕉组织或香蕉植物,并将它们与不具有抗性的细胞、组织或植物物理分离,并且任选地转移所述至少一种选择的细胞、组织或植物至单独的板或生长培养基中,优选地不含ALS抑制剂。
19.根据任一项前述实施方案所述的方法,包括选择对ALS抑制剂的处理具有抗性的至少一种香蕉细胞,例如胚、香蕉组织或香蕉植物,并且随后选择还包括在靶序列中的所需突变的至少一种香蕉细胞,例如胚、香蕉组织或香蕉植物,例如通过对所述细胞、组织或植物进行基因分型。
20.根据任一项前述实施方案所述的方法,其中所述可操作以在靶序列中引入至少一种所需突变的试剂和/或可操作以在内源ALS基因中引入突变的试剂包括核酸构建体,所述核酸构建体额外编码选择性标记物,并且其中所述方法任选地包括选择不携带所述选择性标记物的胚或植物。
21.根据任一项前述实施方案所述的方法,其中所述内源乙酰乳酸合酶基因是内源乙酰乳酸合酶1(ALS1)基因或内源乙酰乳酸合酶2(ALS2)基因,或者其中突变是在ALS1和ALS2两者中引入。
22.根据任一项前述实施方案所述的方法,其中所述在内源ALS基因中的突变是在编码的氨基酸序列中引入的取代,优选地在香蕉ALS1中的Pro-187处或ALS2中的Pro-181处,最优选地在ALS1中的Pro187Ser或在ALS2中的Pro181Ser。
23.根据任一项前述实施方案所述的方法,其中步骤(a)包括使香蕉细胞与可操作以突变所述内源ALS1和ALS2基因的一种或多种试剂接触,以便提供对ALS抑制剂的抗性。
24.根据任一项前述实施方案所述的方法,其中所述ALS抑制剂是磺酰脲、咪唑啉酮、三唑并嘧啶、嘧啶基氧苯甲酸酯(pyrimidinyl oxybenzoate)或磺酰基氨基羰基三唑啉酮,优选地其中所述ALS抑制剂是绿磺隆。
25.根据任一项前述实施方案所述的方法,其中步骤(a)包括接触香蕉胚性细胞悬浮液并且其中步骤(c)是在组织培养之前进行。
26.根据任一项前述实施方案所述的方法,其中步骤(c)提供香蕉细胞群体并且所述群体中至少90%(例如至少95%、96%、97%、98%或99%)的细胞包括突变的ALS,任选地其中至少5%、10%或15%的细胞不包括异源DNA,进一步任选地其中至少1%、2%、3%、4%或5%的细胞包括在内源乙酰乳酸合酶基因中的突变并且包括在靶序列中的所需突变。
27.根据任一项前述实施方案所述的方法,其中通过所述方法产生的至少一种香蕉细胞、组织、胚或植物不包括异源DNA。
28.根据任一项前述实施方案所述的方法,其中所述靶序列是ACO、ACS或PPO中的一种或多种,例如ACO1、ACO2、ACS1、ACS2或PPO2。
29.根据任一项前述实施方案所述的方法,进一步包括从步骤(c)中选择的至少一种细胞或组织生长香蕉植物。
30.根据任一项前述实施方案所述的方法,进一步包括从所述香蕉植物收获果实。
31.根据实施方案30所述的方法,进一步包括将所述果实加工成食品产品。
32.根据前述实施方案中任一项所述的方法,其中所述香蕉细胞是同源三倍体小果野蕉(Musa acuminata)‘卡文迪什’亚群(subgroup)的品种/栽培种,例如Grand Nain。
33.通过根据任一项前述实施方案所述的方法产生的香蕉细胞、胚、细胞群体、组织或植物。
34.香蕉细胞、香蕉胚或香蕉植物,包括在内源ALS1基因或内源ALS2基因或两者中的突变,其提供对ALS抑制剂的抗性。
35.香蕉胚的群体,其中至少90%(例如至少95%、96%、97%、98%或99%)的胚包括突变的内源ALS1基因或突变的内源ALS2基因或两者,其提供对ALS抑制剂的抗性;任选地,其中至少部分的胚不包括异源DNA,所述胚包括在ALS1和/或ALS2中的提供对ALS抑制剂的抗性的突变。
36.根据实施方案34所述的香蕉细胞或植物或根据实施方案35所述的群体,额外包括在至少一种靶序列中的至少一种突变,所述在至少一种靶序列中的至少一种突变提供害虫抗性或表型上合乎需要的性状,任选地其中所述靶序列是ACO、ACS或PPO,例如ACO1、ACO2、ACS1、ACS2或PPO2,进一步任选地其中所述突变是终止密码子的引入。
37.根据实施方案34所述的香蕉细胞、香蕉胚或植物或根据实施方案35所述的群体,额外包括沉默序列中的突变,该突变改变其沉默特异性,使得当被表达时其降低靶基因的表达,任选地其中所述靶基因是害虫基因或其中所述靶基因的降低的表达提供表型上合乎需要的性状。
38.产生包括一个或多个所需序列的至少一种香蕉胚或植物的方法,所述方法包括:
a.使香蕉细胞与至少一种构建体接触,所述构建体携带所需序列和编码至少一种突变的香蕉乙酰乳酸合酶基因的序列,所述突变的香蕉乙酰乳酸合酶基因提供对ALS抑制剂的抗性;
b.任选地培养所述香蕉细胞以产生香蕉细胞群体、组织或植物;
c.用ALS抑制剂处理所述香蕉细胞、细胞群体、组织或植物以选择包括所述至少一种突变的乙酰乳酸合酶基因并且包括所需序列的至少一种香蕉细胞、组织或植物;和
d.从步骤(c)中选择的香蕉细胞、组织或植物产生至少一种香蕉胚或植物。
39.产生至少一种对ALS抑制剂具有抗性的香蕉胚或植物的方法,所述方法包括:
a.使香蕉细胞与至少一种构建体接触,所述构建体携带编码至少一种突变的香蕉乙酰乳酸合酶基因的序列,所述突变的香蕉乙酰乳酸合酶基因提供对ALS抑制剂的抗性,或与可操作以在内源乙酰乳酸合酶(ALS)基因中引入突变的至少一种试剂接触,所述在内源乙酰乳酸合酶(ALS)基因中的突变提供对ALS抑制剂的抗性;
b.任选地培养所述香蕉细胞以产生香蕉细胞群体、组织或植物;
c.用ALS抑制剂处理所述香蕉细胞、细胞群体、组织或植物以选择对ALS抑制剂具有抗性并且包括所述突变的乙酰乳酸合酶基因或所述在内源乙酰乳酸合酶基因中的突变的至少一种香蕉细胞、组织或植物;和
d.从步骤(c)中选择的所述香蕉细胞、组织或植物产生至少一种香蕉胚或植物。
40.根据实施方案38或39所述的方法,进一步包括从步骤(c)中选择的所述至少一种细胞或组织生长香蕉植物。
41.根据实施方案38-40中任一项所述的方法,进一步包括从所述香蕉植物收获果实。
42.根据实施方案41所述的方法,进一步包括将所述果实加工成食品产品。
43.通过根据实施方案38-40所述的方法产生的香蕉细胞、胚、组织或植物。
44.香蕉细胞、香蕉胚或香蕉植物,包括编码突变的香蕉乙酰乳酸合酶基因的序列,所述突变的香蕉乙酰乳酸合酶基因提供对ALS抑制剂突变的抗性。
45.香蕉胚群体,其中至少90%(例如至少95%、96%、97%、98%或99%)的胚包括编码突变的乙酰乳酸合酶基因的序列,所述突变的乙酰乳酸合酶基因提供对ALS抑制剂的抗性。
46.根据实施方案44所述的香蕉细胞、胚或植物或根据实施方案45所述的群体,额外包括提供害虫抗性或表型上合乎需要的性状的所需序列。
47.根据实施方案44所述的香蕉细胞、胚或植物或根据实施方案45所述的群体,额外包括所需序列,当其表达时降低靶基因的表达,任选地其中所述靶基因是害虫基因或其中所述靶基因的降低的表达提供表型上合乎需要的性状。
48.栽培香蕉植物的方法,包括使作物与ALS抑制剂接触以减少其他植物的生长,其中所述香蕉植物包括编码突变的香蕉乙酰乳酸合酶基因的序列,所述突变的香蕉乙酰乳酸合酶基因提供对ALS抑制剂的抗性,或其中所述香蕉植物包括在内源乙酰乳酸合酶基因中的突变,所述在内源乙酰乳酸合酶基因中的突变提供对ALS抑制剂的抗性,并且其中所述其他植物容易受到ALS抑制剂的影响;任选地,其中所述ALS抑制剂是绿磺隆。
49.ALS抑制剂作为除草剂用于栽培根据任一项前述实施方案生长的香蕉植物的用途。
附图说明
图1-绿磺隆(CSF)处理香蕉胚性细胞悬浮液后胚数量的定性分析。ECS样本被短时间暴露于毒素(3、5、10、15天暴露),然后它们被接种到胚发育培养基(EDM;不含CSF)中。暴露后两个月评估胚数量。
图2-CSF处理(0、0.5、5、50和100mg/L)5、10和15天后的香蕉ECS。
图3-CSF处理5、10和15天后,发育3个月的香蕉胚。
图4-香蕉幼苗在补充有CSF(0、0.05、0.5和5mg/L)的RM(生根培养基)中发育6周。
图5-香蕉幼苗在补充有CSF(0、0.05、0.5、5、50和100mg/L)的RM培养基中发育4周。
图6-CSF喷洒(0、0.05、0.5、5、50和100mg/L)后一个月香蕉幼苗在RM培养基中的发育。在无菌去离子水中制备CSF溶液(每个phytotray喷洒体积1mL(一个phytotray中五个芽(shoot)),在第0天喷洒一次)。
图7-CSF喷洒(0、0.05、0.5、5、50和100mg/L)后一个月香蕉幼苗在RM培养基中的发育。CSF溶液是在0.015% Silwett L-77中制备(喷洒模式如针对图6所讨论的)。
图8-每周喷洒CSF(0、0.05、0.5、5、50和100mg/L)后一个月香蕉幼苗在RM培养基中的发育。CSF溶液是在0.015% Silwett L-77中制备(每个phytotray喷洒体积1mL(一个phytotray中五个芽),在第0天以及第一周、第二周和第三周后喷洒)。
图9-CSF喷洒(0、0.05、0.5、5、50和100mg/L)后一个月香蕉幼苗在RM培养基中的发育。所述RM培养基还含有0.05mg/L的CSF作为佐剂。CSF溶液是在0.015% Silwett L-77中制备(喷洒模式如针对图6和7所讨论的)。
图10-每周喷洒CSF(0、0.05、0.5、5、50和100mg/L)后一个月香蕉幼苗在RM培养基中的发育。所述RM培养基还含有0.05mg/L的CSF作为佐剂。CSF溶液是在0.015% SilwettL-77中制备(喷洒模式如针对图8所讨论)。
图11-香蕉ALS1和ALS2的RPKM数据显示这些跨不同香蕉组织的基因的相对标准化表达谱(每千碱基转录的读段,每百万映射读段-RPKM-是转录表达的标准化单位)。
图12-使用RT-qPCR测量的香蕉不同发育阶段中的ALS1和ALS2的表达谱。显示的数据是标准化的,并且相对于ECS中的ALS1。误差线是平均值的标准差。“ddct”是指deltadelta CT方法,是测量相对标准化表达的标准方法。
图13-pMOL_0632_EI的质粒图
图14A-pMOL_0630_EI的质粒图
图14B-pMOL_0628_EI的质粒图
图15-A.用于在香蕉ALS1或ALS2基因中产生精确编辑的构建体的质粒构建体草图。B.被引入香蕉中以赋予对绿磺隆的抗性的特定的ALS1氨基酸变化P187S。还表明P187之前的缬氨酸密码子中的耐受沉默突变C至T。C.香蕉中ALS1和ALS2的特定序列部分,显示氨基酸残基和相应的核苷酸三连体(SEQ ID NO:57-59)。下划线是ALS1和ALS2的具体绿磺隆抗性靶:分别为P187和P181。以灰色突出显示的是sgRNA序列加上CGG原间隔子相邻基序(PAM)的位置。
图16-在CSF 25和50μg/L存在下发育三个月的农杆菌转化的香蕉胚。
图17-在CSF(0、25和50μg/L)中生长3个月的转化的ALS1(pMol_0630)香蕉胚中mcherry荧光的表达。Mcherry表达表明T-DNA的整合。使用明视场(BF)和mcherry滤镜(filter)拍摄照片。可能的瞬时编辑胚在绿色框中突出显示。20倍放大倍率;曝光BF-10ms,mcherry-200ms。
图18-在CSF(0、25和50μg/L)中生长3个月的转化的ALS2(pMol_0632)香蕉胚中mcherry荧光的表达。Mcherry表达表明T-DNA的整合。使用明视场(BF)和mcherry滤镜拍摄照片。可能的瞬时编辑胚在绿色框中突出显示。20倍放大倍率;曝光BF-10ms,mcherry-200ms。
图19-总结CSF选择后的编辑频率和转基因胚分析的表格。
图20-来自T-DNA的扩增子的标准化Ct值表明未成熟香蕉胚中存在转基因。本发明人基于每个单独引物组的WT(野生型)对照设置阈值(>25个循环)。使用管家基因对数据标准化,以解释微小的DNA浓度差异。通过qPCR分析,这些胚中有6-15%看来是非转基因的。
图21-ALS1测序结果
图22-ALS2测序结果
图23-PBE#1和PBE#2胚基因分型数据(仅ALS,无性状)
图24-PBE#3中鉴定的不同植物基因分型的选择。
图25-农杆菌转化和粒子轰击的比较。
具体实施方式
用ALS抑制剂的处理
本发明的方法利用用ALS抑制剂处理香蕉细胞、细胞群、组织或植物来选择包括内源乙酰乳酸合酶基因中的突变并且包括靶序列中的所需突变的至少一种香蕉细胞、组织或植物。ALS1和ALS2之一或两者可被突变,并且类似地,一个或多个靶序列可各自包括一种或多种突变。例如,可有一种靶突变、一个靶序列中的多种突变或跨不同靶序列的多种突变,并且在不同靶序列中具有相同或不同数量的突变。
在某些实施方案中,用于本发明方法的所述ALS抑制剂是咪唑啉酮、嘧啶基硫代苯甲酸酯、磺酰基氨基羰基三唑啉酮、磺酰脲或三唑并嘧啶。优选地,所述ALS抑制剂是磺酰脲。优选的咪唑啉酮包括甲基咪草酯(imazamethabenz-methyl)、咪草啶酸(imazamox)、甲咪唑烟酸(imazapic)、灭草烟(imazapyr)、灭草喹(imazaquin)和咪草烟(imazethapyr)。优选的嘧啶基硫代苯甲酸酯包括双嘧苯甲酸钠(bispyribac-sodium)和嘧草硫醚(pyrithiobac-sodium)。优选的磺酰氨基羰基三唑啉酮包括氟酮磺隆钠(flucarbazone-sodium)和丙苯磺隆钠(propoxycarbazone-sodium)。优选的磺酰脲包括苄嘧磺隆(bensulfuron-methyl)、氯嘧磺隆(chlorimuron-ethyl)、绿磺隆、甲酰胺磺隆(foramsulfuron)、氯吡嘧磺隆(halosulfuron-methyl)、甲基二磺隆(mesosulfuron-methyl)、甲磺隆(metsulfuron-methyl)、烟嘧磺隆(nicosulfuron)、氟嘧磺隆(primisulfuron-methyl)、氟磺隆(prosulfuron)、玉嘧磺隆(rimsulfuron)、甲嘧磺隆(sulfometuron-methyl)、磺酰磺隆(sulfosulfuron)、噻吩磺隆(thifensulfuron-methyl)、醚苯磺隆(triasulfuron)、苯磺隆(tribenuron-methyl)、三氟啶磺隆钠盐(trifloxysulfuron-sodium)和氟胺磺隆(triflusulfuron-methyl)。优选的三唑并嘧啶类包括氯酯磺草胺(cloransulam-methyl)、双氯磺草胺(diclosulam)、双氟磺草胺(florasulam)、唑嘧磺草胺(flumetsulam)、五氟磺草胺(penoxsulam)和啶磺草胺(pyroxsulam)。最优选地,用于本发明方法的ALS抑制剂是绿磺隆。引入的ALS突变可能提供对任何一种或多种ALS抑制剂的抗性,例如单个突变可提供对多种抑制剂的抗性,并且多种突变可以提供对不同抑制剂的抗性。
在优选的实施方案中,所述方法包括处理胚或胚群体,例如在包含ALS抑制剂的胚发育培养基上培养胚。如本文所用,“培养胚”、“孵育胚”和“处理胚”包括从胚性细胞产生胚和孵育发育中的胚。在此类实施方案中,所述胚优选地在包含ALS抑制剂的胚发育培养基上孵育。这意味着胚性细胞当发育成胚时是在胚发育培养基上培养的。在某些实施方案中,所述胚被培养直至观察到完全发育的胚。在某些实施方案中,所述胚在包含ALS抑制剂的胚发育培养基上孵育8-20周,例如8-16、10-16、10-14、12-14或约13周。在某些实施方案中,所述胚在包含ALS抑制剂的胚发育培养基上孵育,所述ALS抑制剂浓度为10-80μg/L,例如10-70、15-65、15-55、20-55、20-50、20-40、20-30或约25μg/L。在某些实施方案中,所述胚在包含ALS抑制剂的胚发育培养基上孵育,所述ALS抑制剂浓度为10-80μg/L,例如20-70、25-65、30-60、40-60、40-55、45-55或约50μg/L。在某些实施方案中,胚在包含ALS抑制剂的胚发育培养基上孵育,所述ALS抑制剂浓度为10-80μg/L,例如10-70、15-65、15-55、20-55、20-50、20-40、20-30或约25μg/L或浓度为10-80μg/L,例如20-70、25-65、30-60、40-60、40-55、45-55或约55μg/L,持续8-20周,例如8-16、10-16、10-14、12-14或约13周。
在某些实施方案中,在转化之后和ALS抑制剂选择之前,允许所述胚或胚性细胞悬浮液复原,例如1-14、3-10、5-10或约7天。
在某些实施方案中,所述方法包括处理胚性细胞悬浮液。在某些实施方案中,在ALS抑制剂存在下孵育所述胚性细胞悬浮液,持续至少3天,例如至少5、10或15天,任选地少于30天,例如少于25、20、15或10天。在某些实施方案中,在ALS抑制剂存在下孵育所述胚性细胞悬浮液,所述ALS抑制剂浓度为至少500μg/L,例如至少1、2、3、4或5mg/L。在某些实施方案中,在ALS抑制剂存在下孵育所述胚性细胞悬浮液,所述ALS抑制剂浓度为至少500μg/L,例如至少1、2、3、4或5mg/L,持续至少3天,例如至少5、10或15天,任选地少于30天,例如少于25、20、15或10天。
在某些实施方案中,所述方法包括通过在包含ALS抑制剂的胚发育培养基中培养胚(例如从来自ECS的细胞开始),用ALS抑制剂处理香蕉胚。在某些此类实施方案中,所述方法包括随后将胚转移至不包含ALS抑制剂的胚发育培养基中。在某些实施方案中,所述方法包括在包含ALS抑制剂的胚发育培养基中培养来自细胞悬浮液、细胞群体或细胞团的胚,观察对ALS抑制剂有抗性的胚的发育,以及物理分离那些胚与较少发育或未发育的剩余细胞。在某些实施方案中,发育的胚被转移至不包含ALS抑制剂的胚发育培养基。
在某些实施方案中,本发明包括选择对用ALS抑制剂的处理有抗性的至少一种香蕉细胞,例如胚、香蕉组织或香蕉植物,并将它们与不具有抗性的细胞、组织或植物物理分离。在某些实施方案中,所述至少一种选择的细胞、组织或植物被转移至单独的板或生长培养基中,优选地不含ALS抑制剂。
在某些实施方案中,用ALS抑制剂处理细胞(例如在ALS抑制剂存在下培养胚)提供细胞或胚的群体,其中至少90%对所述ALS抑制剂具有抗性,例如至少95%、96%、97%、98%或99%。在一些所述细胞或胚中,可整合用于引入突变的核酸构建体,在这种情况下,所述细胞或胚将被认为是转基因的。然而,在某些实施方案中,至少5%、至少10%或至少15%的细胞或胚将在内源ALS基因(即ALS1或ALS2或两者)处被编辑并且将不具有任何整合的异源基因,因此被认为是非转基因的。此外,一些细胞或胚也将在靶序列处被编辑并且将包括所需突变。在某些实施方案中,至少1%、2%、3%、4%或5%的细胞或胚包括在内源乙酰乳酸合酶基因中的突变并且包括在靶序列中的所需突变。
在某些实施方案中,所述方法包括处理植物或幼苗。在某些实施方案中,所述方法包括用ALS抑制剂喷洒植物或幼苗。在某些实施方案中,喷洒剂(spray)额外包括佐剂,例如Silwett L-77。在某些实施方案中,用ALS抑制剂喷洒所述植物或幼苗,浓度为至少1mg/L,例如至少2、3、4或5mg/L或1-10mg/L或2-7mg/L。在某些实施方案中,大约一周一次喷洒所述植物或幼苗。在某些实施方案中,每周喷洒所述植物或幼苗,持续四周。在某些实施方案中,第一次每周喷洒是在第一周开始时。在某些实施方案中,所述植物或幼苗被喷洒四次,优选地一周一次。在某些实施方案中,所述方法包括在已添加ALS抑制剂的生根培养基中生长植物或幼苗。在某些实施方案中,所述生根培养基包括小于1mg/L的ALS抑制剂,例如小于0.5mg/L或小于0.1mg/L,例如0.01-0.09或0.002-0.06mg/L或约0.005mg/L。在某些实施方案中,所述幼苗在含有ALS抑制剂的生根培养基中生长2-6周,例如3-5周、约1个月或4周。在某些实施方案中,所述处理包括喷洒植物或幼苗并在包含ALS抑制剂的生根培养基中生长它们。在某些实施方案中,植物或幼苗被暴露于ALS抑制剂2-6周,例如3-5周、约1个月或4周。
在某些实施方案中,所述方法包括在ALS抑制剂存在下培养胚,优选地使用上文所述的浓度和/或时间段,并且不包括使用ALS抑制剂的任何其他处理。在此类实施方案中,所述方法不包括用ALS抑制剂处理植物或幼苗,并且不包括用ALS抑制剂孵育ECS。在此类实施方案中,转化后的ECS在不包含ALS抑制剂的培养基中孵育,并且在不包含ALS抑制剂的培养基中生长植物和幼苗。
可操作以引入突变的试剂
本发明的方法利用试剂在靶序列中引入至少一种所需突变,并利用试剂在内源ALS基因中引入突变。通常,不同的试剂将被用来靶向靶序列和ALS基因,特别地,不同的序列特异性试剂,但是所述试剂可任选地共享组分,例如核酸内切酶或碱基编辑器。因此,所述方法可任选地包括使香蕉细胞与一种碱基编辑器或核酸内切酶、或一种编码碱基编辑器或核酸内切酶的构建体、或编码相同碱基编辑器或核酸内切酶的一种或多种构建体接触,其可被操作以引入在靶序列中突变和在ALS基因中的突变。在此类实施方案中,所述试剂还将包括序列特异性试剂。所述碱基编辑器或核酸内切酶可以由以下构建体编码:编码靶向ALS基因的一种或多种序列特异性试剂的构建体、或编码靶向靶序列的一种或多种序列特异性试剂的构建体,或可以使用具有多种序列特异性组分的单个构建体。优选地,用于在靶序列中引入至少一种所需突变的试剂和用于在内源ALS基因中引入突变的试剂两者将包括相同或相似的试剂,例如两者将包括相同或相似的碱基编辑器(例如,或者因为使用单个碱基编辑器或编码所述碱基编辑器的构建体,或者因为使用编码相同碱基编辑器的多个拷贝的一种或多种构建体),因为如果两个突变是以相同方式引入,则这可增加以下可能性:对ALS抑制具有抗性的香蕉细胞或植物携带在靶序列中的所需突变。还可能可以使用不同的碱基编辑器/核酸内切酶,例如ALS和靶基因编辑试剂可包括不同的碱基编辑器、不同的核酸内切酶或其组合。
在优选的实施方案中,所述试剂共同包括碱基编辑器和向导RNA,或者包括至少一种编码碱基编辑器和向导RNA的核酸构建体。碱基编辑器可有效引入特定突变,而无需整合异源DNA,如实施例所示。所述试剂可包括靶向内源ALS基因的一种向导RNA或多种向导RNA和/或靶向所述靶序列的一种向导RNA或多种向导RNA。任选地,所述试剂包括靶向内源ALS基因的一种向导RNA和靶向靶序列的多种向导RNA。
碱基编辑是一种基因组编辑方法,其使用来自CRISPR系统的组分与其他酶一起将点突变直接安装到细胞DNA或RNA中,而不造成双链DNA断裂。特别地,修饰的Cas(例如dCas9或nCas9)可以与其他酶(通常作为融合蛋白)一起用于碱基编辑。DNA碱基编辑器包括无催化活性的核酸酶,其与核碱基脱氨酶,和在某些情况下DNA糖基化酶抑制剂融合。RNA碱基编辑器使用靶向RNA的组分实现了类似的变化。碱基编辑器直接将一个碱基或碱基对转换为另一个碱基或碱基对,从而能够将点突变有效安装在非分裂细胞中,而不会产生过多的不需要的编辑副产物(Rees and Liu(2018),“Base Editing:Precision Chemistry on theGenome and Transcriptome of Living Cells”,Nature Reviews Genetics,19(12):770-788)。根据一些实施方案,所述碱基编辑器包括与腺嘌呤或胞苷脱氨酶融合的nCas9或dCas9,并且任选地与DNA糖基化酶抑制剂融合。优选的碱基编辑器是包括nCas9(D10A)、胞苷脱氨酶APOBEC和尿嘧啶糖基化酶抑制剂的融合。Zong et al.Nat Biotechnol.2017,35(5):438-440提供了示例性的合适碱基编辑器。在某些实施方案中,所述碱基编辑器具有腺嘌呤和胞苷脱氨酶活性并且是双脱氨酶碱基编辑器(Griinewald,etal.,2020,NatureBiotechnology,38:861864)。
在优选的实施方案中,碱基编辑器由SEQ ID NO:17或与SEQ ID NO:17具有至少70、80、85、90、95、98或99%序列同一性的序列编码。在优选的实施方案中,碱基编辑器包括SEQ ID NO:18或与SEQ ID NO:18具有至少70、80、85、90、95、98或99%序列同一性的序列。
考虑到的其他碱基编辑器包括APOBEC、BE1、BE2、BE3、HF-BE3、BE4、BE4max、BE4-GAM、YE1-BE3、EE-BE3、YE-BE3、YEE-BE3、VQR-BE3、VRER-BE3、Sa-BE3、Sa-BE4、SaBE4-Gam、SaKKH-BE3、Cas12a-BE、Target-AID、Target-AID-NG、xBE3、eA3A-BE3、A3A-BE3、BE-PLUS、TAM、CRISPR-X、ABE7.9、ABE7.10、ABE7.10*、xABE、ABESa、VQR-ABE、VRER-ABE和SaKKH-ABE(Rees and Liu(2018),“Base Editing:Precision Chemistry on the Genome andTranscriptome of Living Cells”,Nature Reviews Genetics,19(12):770-788,以及其中的参考文献)。进一步考虑到的碱基编辑器包括修饰的Cas12或Cas13,例如dCas12、dCas13、dCas12a、dCas12b、dCas13a、dCas13b、dCas13c或dCas13d。
碱基编辑器通常与向导RNA结合使用,以在靶序列中引入特定突变。本发明中使用的向导RNA(或gRNA)特异性针对靶序列或ALS基因。在本发明的一些实施方案中,一种或多种向导RNA包括选自以下的序列:SEQ ID NO:19和SEQ ID NO:20,或与SEQ ID NO:19和SEQID NO:20具有至少90%、95%、98%或99%序列同一性的序列。在本发明的某些实施方案中,向导RNA包括选自以下的序列:SEQ ID NO:19和SEQ ID NO:20的序列,或与SEQ ID NO:19和SEQ ID NO:20具有至少90%、95%、98%或99%序列同一性(用于靶向ALS1和/或ALS2)的序列,并包括靶向靶基因(例如ACO、ACS或PPO(特别地,ACO1、ACO2、ACS1、ACS2或PPO2))的序列,优选地其中所述靶向靶基因的序列选自:SEQ ID NO:21-26,或与SEQ ID NO:21-26具有至少90、95、98或99%序列同一性的序列。
在进一步的实施方案中,所述试剂包括序列特异性核酸内切酶、向导RNA和任选地供体模板,或者包括编码序列特异性核酸内切酶、向导RNA和任选地供体模板的至少一种核酸构建体。
优选地,所述核酸内切酶介导将特定突变引入ALS基因和靶序列中,例如通过HDR(同源定向修复),而不是随机插入缺失,例如通过NHEJ。
因此,在某些实施方案中,所述试剂包括至少一种核酸内切酶、在至少一种内源ALS基因中包括至少一种突变的至少一种供体模板,和在靶序列中包括至少一种所需突变的至少一种供体模板。类似地,在某些实施方案中,所述试剂包括至少一种核酸构建体,其编码至少一种核酸内切酶、在至少一种内源ALS基因中包括至少一种突变的至少一种供体模板、以及在靶序列中包括至少一种所需突变的至少一种供体模板。优选地,所述供体模板是供体寡核苷酸。此类实施方案可有效引入突变,而无需整合任何异源DNA序列,因此在某些实施方案中,所述方法不会导致整合任何异源DNA,并且在某些实施方案中,通过所述方法产生的至少一种香蕉胚或植物不包括异源DNA。与本发明的其他实施方案一样,用ALS抑制剂选择已经在内源ALS基因座处经受HDR编辑的植物,将富集在靶序列处也被HDR编辑(共同HDR)的植物。
如本文所用,术语“供体寡核苷酸”或“供体模板”是指外源核苷酸,即从外部引入香蕉细胞以在基因组中产生精确变化。供体寡核苷酸可以是合成的。在某些实施方案中,供体寡核苷酸是RNA寡核苷酸。在某些实施方案中,供体寡核苷酸是DNA寡核苷酸。在某些实施方案中,供体寡核苷酸包括单链供体寡核苷酸(ssODN)、双链供体寡核苷酸(dsODN)、双链DNA(dsDNA)、双链DNA-RNA双链体(DNA-RNA双链体)、双链DNA-RNA杂交体、单链DNA-RNA杂交体、单链DNA(ssDNA)、双链RNA(dsRNA)、单链RNA(ssRNA)。在某些实施方案中,所述供体寡核苷酸是以非表达载体形式或寡核苷酸形式提供。在某些实施方案中,供体寡核苷酸包括DNA供体质粒(例如环状或线性化质粒)。
在优选的实施方案中,供体模板仅引入对内源ALS序列和靶序列的最少改变。在某些实施方案中,相对于编码内源ALS序列和靶序列的序列,供体模板包括1-40,例如5-40、5-30、5-20、3-40、3-30、3-20、5-15、3-10、10-30或10-20个核苷酸添加、缺失和取代。在优选的实施方案中,包括在至少一种内源ALS基因中的至少一种突变的供体模板包括单核苷酸取代,如在下文标题为“内源乙酰乳酸合酶基因中的突变”的部分中更详述的。
可能使用任何适合的HDR供体模板的设计。在某些实施方案中,供体模板包括同源臂,例如两个同源臂,每个长度为10-1000个核苷酸,例如100-1000、200-1000、200-800、300-800、300-700、400-600个核苷酸。在某些实施方案中,模板包括化学修饰,例如硫代磷酸酯修饰。修饰可防止线性供体的连接和/或增强其稳定性。根据一个实施方案,供体寡核苷酸包括约50-5000、约100-5000、约250-5000、约500-5000、约750-5000、约1000-5000、约1500-5000、约2000-5000、约2500-5000、约3000-5000、约4000-5000、约50-4000、约100-4000、约250-4000、约500-4000、约750-4000、约1000-4000、约1500-4000、约2000-4000、约2500-4000、约3000-4000、约50-3000、约100-3000、约250-3000、约500-3000、约750-3000、约1000-3000、约1500-3000、约2000-3000、约50-2000、约100-2000、约250-2000、约500-2000、约750-2000、约1000-2000、约1500-2000、约50-1000、约100-1000、约250-1000、约500-1000、约750-1000、约50-750、约150-750、约250-750、约500-750、约50-500、约150-500、约200-500、约250-500、约350-500、约50-250、约150-250或约200-250个核苷酸。根据一个具体实施方案,包括ssODN(例如ssDNA或ssRNA)的供体寡核苷酸包括约200-500个核苷酸。根据进一步的实施方案,包括ssODN(例如ssDNA或ssRNA)的供体寡核苷酸包括约300-500个核苷酸或约400-500个核苷酸。根据一个具体实施方案,包括dsODN(例如dsDNA或dsRNA)的供体寡核苷酸包括约250-5000个核苷酸。
在某些实施方案中,所述供体模板的长度为40-200个核苷酸,例如50-180、60-150、60-120、70-120、70-110、80-110或80-100个核苷酸。
核酸内切酶是切割多核苷酸链内磷酸二酯键的酶,并且包括在特定位点切割DNA而不损坏碱基的限制性核酸内切酶。限制性核酸内切酶包括I型、II型、III型和IV型核酸内切酶,其还包括亚型。在I型和III型系统中,甲基化酶和限制活性两者都包括在单个复合物中。核酸内切酶还包括大范围核酸酶(meganuclease),也称为归巢核酸内切酶(HEase),其类似限制性核酸内切酶,在特定识别位点结合并切割。核酸内切酶允许对真核基因组(如植物基因组)的精确基因工程。在一些实施方案中,核酸内切酶失活并且催化死亡,例如在dCas9中,如下文进一步讨论的。
在一些实施方案中,核酸内切酶选自以下:大范围核酸酶、锌指核酸酶(ZFN)、转录激活子样效应子核酸酶(transcription-activator like effector nuclease,TALEN)、归巢核酸内切酶、CRISPR相关核酸内切酶和修饰的CRISPR相关核酸内切酶。根据一些实施方案,核酸内切酶是CRISPR相关核酸内切酶,任选地其中CRISPR相关核酸内切酶是Cas9。每种可能性代表本发明的单独实施例。
如本文所用,“CRISPR相关核酸内切酶”(或“Cas”)是指具有RNA引导的多核苷酸编辑活性的核酸内切酶,并且是用于基因组编辑的CRISPR/Cas系统的组分之一,其使用至少一种额外成分——“向导RNA”(gRNA)。在本发明的一些实施方案中,“CRISPR相关核酸内切酶”是“Cas9核酸内切酶”(或“Cas9”)。根据一些实施方案,“CRISPR相关核酸内切酶”可以是本领域中已知的任何Cas9,例如但不限于SpCas9、SaCas9、FnCas9、NmCas9、St1Cas9、BlatCas9(Shota Nakade、Takashi Yamamoto&Tetsushi Sakuma(2017),Bioengineered,8:3,265-273以及其中的参考文献)。在其他实施方案中,“CRISPR相关核酸内切酶”可以是Cpf1,例如但不限于AsCpf1或LbCpf1(Shota Nakade,Takashi Yamamoto&Tetsushi Sakuma(2017),Bioengineered,8:3,265-273,以及其中的参考文献)。在进一步的实施方案中,“CRISPR相关核酸内切酶”可以是任何已知的Cas12或Cas13,例如dCas12、dCas13、dCas12a、dCas12b、dCas13a、dCas13b、dCas13c或dCas13d。
如本文所用的术语“向导RNA”或“gRNA”可以互换使用,并且是指促进CRISPR相关核酸内切酶或修饰的CRISPR相关核酸内切酶特异性靶向靶序列(例如细胞中的基因组或游离(episomal)序列)的多核苷酸。根据一些实施方案,gRNA可以是嵌合/单分子的(包括单个RNA分子,也称为单向导RNA或sgRNA)或模块化的(包括多于一个单独的RNA分子,通常是可能连接的crRNA和tracrRNA),例如通过双链化(duplexing))。根据一些实施方案,gRNA是sgRNA。
sgRNA是一种RNA分子,其包括tracrRNA和crRNA(和连接环(connecting loop))。sgRNA包括编码靶同源序列(crRNA-CRISPR RNA)的核苷酸序列和在单个嵌合转录物中将crRNA连接到Cas核酸酶(tracrRNA-trans-活化的CRISPR RNA)的内源细菌RNA。crRNA的可变区赋予相关核酸内切酶的切割特异性,并且长度通常为20个核苷酸,但长度可以在约17至20个核苷酸之间。gRNA/Cas复合物通过gRNA序列和互补基因组DNA之间的碱基配对被募集到靶序列上。为了成功结合Cas,基因组靶序列还必须包括紧接在靶序列之后的正确原间隔子相邻基序(PAM)序列。gRNA/Cas复合物的结合将Cas定位于基因组靶序列,以便Cas可以切割DNA的两条链,导致双链(double-strand)或双链的(double-stranded)断裂。正如ZFN和TALEN一样,CRISPR/Cas产生的双链断裂可以通过HR(同源重组)或NHEJ(非同源末端连接)修复,并且在DNA修复过程中易受特定序列修饰的影响。Cas核酸酶有两个功能结构域:RuvC和HNH,每个功能结构域切割不同的DNA链。当这两个结构域都活化时,Cas会在基因组DNA中导致双链断裂。CRISPR/Cas的一个显著优势是该系统的高效率以及轻松产生合成gRNA的能力。这产生了一个可被容易地修改以靶向不同基因组位点和/或靶向同一位点处不同修饰的系统。此外,已经创建了能够同时靶向多个基因的方案。然而,gRNA序列和基因组DNA靶序列之间碱基配对相互作用的明显灵活性,允许与待被Cas切割的靶序列的不完全匹配(imperfect match)。
本发明的方法中使用的试剂可包括一种或多种向导RNA,特别是针对每个靶标的sgRNA。在优选的实施方案中,单向导RNA或sgRNA被用于在内源ALS基因中引入突变。在一些实施方案中,两个或更多个向导RNA或sgRNA被用于在靶序列中引入所需突变。
在某些实施方案中,单独的CRISPR向导RNA被用在本发明的试剂中。
含有单个失活催化结构域(RuvC-或HNH-)的Cas酶的经修饰版本称为“切口酶”。Cas切口酶只有一个活性核酸酶结构域,仅切割靶DNA中的一条链,产生单链断裂或“切口”。单链断裂(single-strand break)或单链的断裂(single-stranded break)或切口主要通过单链断裂修复机制被修复,该机制涉及蛋白质,例如但不仅为PARP(传感器)和XRCCl/LIGIII复合物(连接)。如果单链断裂(SSB)是由拓扑异构酶I毒物或由在天然存在的SSB上捕获(trap)PARP1的药物产生的,那么这些可持续存在,并且当细胞进入S期并且复制叉(replication fork)遇到此类SSB时,它们将变成单端DSB,其只能由HR修复。然而,由Cas切口酶引入的两个近端相反的链切口被当作双链断裂,这通常被称为“双切口”CRISPR系统。双切口基本上是非平行的DSB,其可像其他DSB一样由HR或NHEJ修复,取决于对基因靶标的所需效果以及供体序列的存在和细胞周期阶段(HR的丰度低得多,并且可仅发生在细胞周期的S期和G2期)。因此,如果特异性和减少的脱靶效应至关重要,那么使用Cas切口酶通过设计两个gRNA来创建双切口(靶序列紧密靠近且在基因组DNA相反链上)将减少脱靶效应,因为单独的gRNA将产生不太可能改变基因组DNA的切口,尽管这些事件并非不可能。
如本文所用,“修饰的CRISPR相关核酸内切酶”(或“修饰的Cas”)是指其中催化结构域已被改变和/或其融合到额外结构域的Cas。根据一些实施方案,“修饰的Cas”是指含有无活性的催化结构域(死亡Cas(dead cas),或dCas)且不具有核酸酶活性同时仍然能够基于gRNA特异性与DNA结合的Cas。根据一些实施方案,“修饰的Cas”是指具有切口酶活性(“nCas9”),从而诱导单链断裂的Cas。在一些实施方案中,修饰的CRISPR相关核酸内切酶是“修饰的Cas9核酸内切酶”,可能是无催化活性的Cas9(或“dCas9”)或Cas9切口酶(“nCas9”)。dCas可以作为DNA转录调控因子的平台,以通过将失活酶融合到已知的调控结构域来激活或抑制基因表达。例如,dCas单独与基因组DNA中的靶序列结合会干扰基因转录。有许多公开可用的工具可以帮助选择和/或设计靶序列以及用于不同物种中不同基因的生物信息学法确定的独特gRNA列表,如Feng Zhang lab's Target Finder,MichaelBoutros lab's Target Finder“E-CRISP”,the RGEN Tools:“Cas-OFFinder”,theCasFinder:Flexible algorithm for identifying specific Cas9 targets in genomes和CRISPR Optimal Target Finder。
为了使用CRISPR/Cas系统,向导RNA和碱基编辑器或核酸内切酶两者都应被引入靶细胞或作为核糖核蛋白复合物被递送。根据一些实施方案,通过引入表达碱基编辑器/Cas/修饰的Cas和gRNA的一种或多种载体,将碱基编辑器/Cas/修饰的Cas和至少一种gRNA提供给香蕉细胞。所述载体可在单个质粒上包括多个盒(cassette),或所述盒是由多个单独的质粒表达。在某些实施方案中,第一质粒编码特异性针对靶基因的向导和碱基编辑器,并且第二质粒编码特异性针对ALS基因(ALS1或ALS2或两者)向导和碱基编辑器。CRISPR质粒可商购获得(例如来自Addgene的px330质粒)。成簇规律间隔短回文重复序列(CRISPR)相关(Cas)向导RNA技术和Cas核酸内切酶用于修饰植物基因组的用途还至少公开于Svitashev et al.(2015),Plant Physiology,169(2):931-945;Kumar和Jain,2015,Journal of Experimental Botany,66:47-57;以及美国专利申请公开号20150082478中。
在某些实施方案中,可操作以引入突变的试剂是存在于一个或多个(例如两个)质粒上的构建体。因此,可操作以在ALS序列中引入突变的试剂和可操作以在靶序列中引入突变的试剂可由单个质粒的序列编码,或者所述序列可被分开至多个质粒上。
转化
本发明的方法包括使香蕉细胞或细胞群体与可操作以在靶序列中引入至少一种所需突变和在内源ALS基因中引入突变的试剂接触。可以使用任何合适的转化香蕉细胞的方法。因此,“接触”涵盖用于将试剂(例如核酸构建体)引入细胞中的任何合适方法,包括称为转化、转染和转导的方法。
在优选的实施方案中,所述试剂包括一种或多种核酸构建体,例如编码向导RNA和碱基编辑器或核酸内切酶的核酸构建体。在此类实施方案中,香蕉细胞或香蕉细胞群是用一种或多种更多核酸构建体转化。用于本发明的实施方案的核酸构建体可使用本领域技术人员公知的重组技术构建。此类核酸构建体可为商购获得的,适合于转化到植物中并且适合于在转化的细胞中表达所述试剂(例如向导RNA和碱基编辑器或核酸内切酶)。
在某些实施方案中,转化的香蕉细胞是原生质体。在优选的实施方案中,转化的香蕉细胞是胚细胞,例如胚性细胞悬浮液中的细胞。
核酸构建体通常包括启动子。如本文所用,所述“启动子”是植物可表达的,即能够诱导、赋予、激活或增强植物细胞、组织或器官中的表达。可用于本发明方法的启动子的实例包括但不限于肌动蛋白、CANV 35S、CaMV19S、GOS2。优选的启动子是CsVMV。待表达的试剂的核苷酸序列可针对植物表达进行优化。此类序列修饰的实例包括但不限于,改变的G/C含量以更接近在香蕉中通常发现的含量,以及去除通常在植物物种中发现的非典型密码子(称为密码子优化)。香蕉植物细胞可用本发明实施方案的DNA构建体稳定或瞬时转化。优选地,转化是瞬时的(至少在一些产生的香蕉细胞和植物中),因此没有整合异源DNA。在一些实施方案中,DNA构建体中的启动子包括Pol3启动子。Pol3启动子的实例包括但不限于,AtU6-29、AtU626、AtU3B、AtU3d和TaU6。在一些实施方案中,DNA构建体中的启动子包括Pol2启动子。Pol2启动子的实例包括但不限于,CaMV 35S、CaMV 19S、泛素、CVMV。在一些实施方案中,DNA构建体中的启动子包括35S启动子。在一些实施方案中,在DNA构建体中的启动子包括U6启动子。在一些实施方案中,在DNA构建体中的启动子包括可操作地连接至编码至少一种gRNA的核酸试剂的Pol3启动子(例如U6)和/或可操作地连接至编码CRISPR相关内切核酸酶或碱基编辑的核酸序列的Pol2启动子(例如CamV35S)。在一些实施方案中,启动子是CsVMV。在某些实施方案中,使用多个启动子。DNA构建体可用于通过农杆菌介导的转化进行的瞬时表达(Helens et al.(2005),Plant Methods 1:13)。在一些实施方案中,包括在DNA构建体中的核酸序列缺乏与香蕉植物细胞的基因组同源的序列(除任何向导序列之外),以避免整合到香蕉基因组中。
在本发明的上下文中可以使用各种克隆试剂盒。如本文所用,“DNA构建体”可以是二元载体。二元载体的实例是pICSL4723、pBIN19、pBHOl、pBinAR、pGPTV、pCAMBIA、pBIB-HYG、pBecks、pGreen或pPZP(Hajukiewicz,P et al.,Plant Molecular Biology,25,989(1994),和Hellens et al.Trends in Plant Science 5,446(2000))。在本发明的上下文中可用于各种DNA递送方法(例如转染、电穿孔、粒子轰击和病毒接种)的其他载体的实例是:pGE-sgRNA(Zhang et al.Nature Communications 2016 7:12697),pJIT163-Ubi-Cas9(Wang et al.Nature Biotechnology 2004,32,947-951),pICH47742::2x35S-5'UTR-hCas9(STOP)-NOST(Belhan et al.Plant Methods 2013,11;9(1):39)。
在其他实施方案中,所述试剂是以RNA形式提供给香蕉植物细胞。在其他实施方案中,碱基编辑器或核酸内切酶是以蛋白质形式提供给香蕉植物细胞,并且一种或多种向导RNA是以RNA形式提供给香蕉植物细胞。在一些实施方案中,所述试剂是作为核糖核蛋白复合物提供给香蕉植物细胞。
在其他实施方案中,所述试剂是以DNA形式提供给香蕉植物细胞。
有大量将DNA、RNA、肽和/或蛋白质,或核酸和肽的组合引入(接触)植物细胞的方法。这些包括,例如原生质体转化(美国专利号5,508,184);干燥/抑制介导的DNA摄取(Potrykus et al.(1985)Mol.Gen.Genet.199:183-8);电穿孔(美国专利号5,384,253);用碳化硅纤维搅拌(agitation)(美国专利号5,302,523和5,464,765);农杆菌介导的转化(美国专利号5,563,055、5,591,616、5,693,512、5,824,877、5,981,840和6,384,301);DNA包被粒子的加速(美国专利号5,015,580、5,550,318、5,538,880、6,160,208、6,399,861和6,403,865),RNA包被粒子、纳米粒子、纳米载体和细胞穿透肽的加速(WO201126644A2;WO2009046384A1;WO2008148223A1)。其他转染方法包括使用转染试剂(例如Lipofectin、ThermoFisher)、树形分子(dendrimer)(Kukowska-Latallo,J.F.et al.(1996),Proc.Natl.Acad.Sci.USA 93,4897-902),细胞穿透肽(Mae et al.(2005),“Internalisation of cell-penetrating peptides into tobacco protoplasts”,Biochimica et Biophysica Acta 1669(2):101-7)或聚胺(Zhang and Vinogradov(2010),“Short biodegradable polyamines for gene delivery and transfection ofbrain capillary endothelial cells”,J Control Release,143(3):359-366)。
在本发明的一些实施方案中,核酸试剂是使用粒子轰击或基因枪提供至香蕉植物细胞。在优选的实施方案中,核酸试剂是使用农杆菌转化提供至香蕉植物细胞。实施例中描述了合适的方案。在其他实施方案中,核酸试剂是使用原生质体转染提供至香蕉植物细胞。在其他实施方案中,核酸试剂是使用电穿孔提供至香蕉植物细胞。在其他实施方案中,核酸试剂是使用纳米粒子介导的转染提供至香蕉植物细胞。
实施例证明农杆菌介导的转化可有效将突变引入香蕉,并且可以比包括粒子轰击在内的其他转化方法更加高效和有效。农杆菌系统包括使用含有限定的DNA片段的质粒载体。优选地,本发明在农杆菌介导的转化中使用的试剂包括一种或多种T-DNA构建体。用于农杆菌介导的转化的T-DNA构建体和其他构建体可以整合到基因组中,但也可以瞬时表达。在优选的实施方案中,本发明中使用的载体适合于瞬时表达用于引入突变的试剂,并且可以适合于瞬时表达和整合。这允许产生一些在靶序列和内源ALS基因处成功突变并且是非转基因的胚或植物,如实施例中所示。
优选地,所述载体(例如用于农杆菌介导的转化的载体)额外包括选择性标记物,其可帮助选择不包括整合到其基因组中的载体的胚或植物。合适的标记物可包括抗生素选择性标记物。可以使用的抗生素选择性标记物的实例是新霉素磷酸转移酶II(nptII)和潮霉素磷酸转移酶(hpt)。可根据本教导使用的额外的标记物基因包括但不限于庆大霉素乙酰转移酶(accC3)抗性以及博莱霉素和腐草霉素抗性基因。进一步优选的标记物包括:提供三嗪抗性的突变型psbA,特别是G264S和I219V;和突变型烯醇式丙酮酰莽草酸-3-磷酸合酶(enolpyruvylshikimate-3-phosphate synthase,EPSPS),其赋予对EPSP合酶抑制剂的抗性,特别是色氨酸102突变、丙氨酸103突变和脯氨酸106突变。在优选的实施方案中,选择性标记物是荧光蛋白,其在实施例中被证明特别有效,例如mCherry、mTurquoise 2、GFP(例如sfGFP或pH-tdGFP)、Gamillus、mNeonGreen、mEYPF、mCitrine、Citrine或TagRFP。优选地,选择性标记物是mCherry。
用于本发明的优选的根瘤农杆菌(Agrobacterium tumefacien)是AGL1。
可使用用携带载体的农杆菌转化香蕉细胞的任何合适的方法。在优选的实施方案中,将农杆菌细胞悬浮液与香蕉ECS混合。然后可孵育混合物,然后可去除细菌悬浮液以提供可被培养的ECS细胞。在某些实施例中,应用热休克处理(例如在45℃下持续5分钟)。合适的方法记载于例如Shivani and Tiwari,Scientia Horticulturae,246:675-685)。
在内源乙酰乳酸合酶基因中的突变
本发明利用靶序列和内源ALS基因的共同编辑与ALS抑制剂选择的组合,来提供用于编辑香蕉基因的高效且有效的策略。因此,本发明涉及在内源合酶基因中的突变,所述在内源合酶基因中的突变提供对ALS抑制剂的抗性。
ALS基因编码在支链氨基酸生物合成途径中发挥功能的蛋白质。ALS蛋白被磺酰脲家族除草剂不可逆地竞争性抑制(Garcia et al.,2017PNAS,114(7),E1091-E1100)。已经鉴定数种ALS氨基酸取代(天然存在的和人工生成的),其增加对磺酰脲化合物的耐受性(Yu&Powles,2014a,Pest Management Science,70(9))。这些通常全部是形成活性位点口袋并改变除草剂化合物的结合潜力的氨基酸残基(Boutsalis et al.,1999,PesticideScience,55(5),507-516)。
一般而言,根据本发明引入的在内源ALS基因中的突变将是改变密码子编码的氨基酸的取代并由此导致ALS蛋白质序列中的氨基酸取代。本发明用于提供对ALS抑制剂的抗性的示例性氨基酸取代提供于下表1中,参考根据拟南芥ALS的氨基酸编号(改编自Yu&Powles,2014,Pest Management Science,70(9)。拟南芥ALS氨基酸序列可以与香蕉ALS1和ALS2氨基酸序列比对以鉴定相应的残基。可以将香蕉ALS1和ALS2氨基酸序列与所述香蕉ALS1和ALS2基因序列进行比较,以鉴定可被改变以改变编码的氨基酸序列的核苷酸。
表1
SU=磺酰脲,IMI=咪唑啉酮
优选地,根据本发明引入的在内源ALS基因中的突变是导致在编码的氨基酸序列中的取代的取代。优选的氨基酸取代在残基Gly-121、Ala-122、Met-124、Arg142、Val-196、Pro-197、Arg-199、Met-200、Ala-205、Phe-206、Gln-207、Lys-256、Met-351、His-352、Arg-373、Asp-375、Asp-376、Arg-377、Met-570、Val-571、Trp-574、Phe-578、Ser-653、Gly-654处,根据拟南芥ALS编号。某些取代消除了与磺酰脲和/或咪唑啉酮除草剂的结合,如表1所示。为了最佳性能,应使用适当的除草剂进行选择。
优选的突变残基是Pro-197、Trp-574-Leu和Ala-121,根据拟南芥ALS序列编号。
优选地,根据本发明引入的在内源ALS基因中的突变导致对应于拟南芥ALS序列中的Pro-197的残基的取代,其是香蕉ALS1的Pro-187并且ALS2中的Pro-181。最优选地,所述引入的取代包括ALS1中的Pro187Ser和/或ALS2中的Pro181Ser。用于引入这些突变的合适的向导RNA包括SEQ ID NO:19和SEQ ID NO:20,或与SEQ ID NO:19或SEQ ID NO:20具有至少90%、95%、98%或99%序列同一性的序列。在某些实施方案中,引入的取代包括ALS1中的Pro187Phe和/或ALS2中的Pro181Phe,其也被预期提供对ALS抑制剂的抗性。在某些实施方案中,通过本发明的方法获得的香蕉细胞或植物对于在ALS1处的多于一个突变和/或在ALS2处的多于一个突变是杂合的,并且引入的取代包括ALS1中的Pro187Ser和Pro187Phe和/或包括ALS2中的Pro181Ser和Pro181Phe。
在某些实施方案中,所述方法包括在内源ALS1基因中引入突变,例如在编码对应于拟南芥ALS序列中Pro-197的残基的密码子处。此类实施方案通常涉及使用靶向ALS1的向导RNA。在某些实施方案中,所述方法包括在内源ALS2基因中引入突变,例如在编码对应于所述拟南芥ALS序列中Pro-197的残基的密码子处。此类实施方案通常涉及使用靶向ALS2的向导RNA。在某些实施方案中,所述方法包括在内源ALS1和ALS2基因两者中引入突变,例如在编码对应于所述拟南芥ALS序列中Pro-197的残基的密码子处。此类实施方案通常可涉及使用靶向ALS1和ALS2的单独的向导RNA,但可涉及使用能够靶向ALS1和AL2两者的单个向导RNA。在某些实施方案中,所述方法不包括在内源ALS1基因中引入突变,并且例如,所述试剂不包括任何序列,例如靶向内源ALS1基因的向导RNA。
本发明的方法可涉及在内源ALS基因的一个、两个或三个等位基因处引入突变。在一些实施方案中,通过本发明的方法获得的香蕉细胞、组织或植物对于在ALS1、ALS2、或ALS1和ALS2处的突变是纯合的。在一些实施方案中,通过本发明的方法获得的香蕉细胞或植物对于在ALS1、ALS2、或ALS1和ALS2处的突变是杂合的。
因此,本发明方法中引入的在ALS基因中的突变没有必要提供对ALS抑制剂的完全抗性,使得用ALS抑制剂的处理没有效果。相反,根据所述方法产生的香蕉胚和植物在用ALS抑制剂处理后可表现出延迟的发育或减弱的生长。然而,ALS基因中的突变将提供对用ALS抑制剂的处理的抗性,因为根据所述方法产生的香蕉胚和植物在用ALS抑制剂处理下相对于野生型香蕉胚和植物将表现出改善的发育或生长。
在靶序列中的所需突变
本发明提供了用于产生在一个或多个靶序列中包括至少一种所需突变的香蕉胚和植物的改进方法。
本文所用的“突变”可指至少一种核苷酸插入、至少一种核苷酸缺失、插入-缺失(indel)、倒位(inversion)、至少一种核苷酸取代,或前述的任意组合。修饰可导致靶序列中的移码、错义突变、功能丧失突变、无义突变或功能获得突变。可操作以引入所需突变的试剂可使用标准技术根据所需要的突变的类型来设计。
在一些实施方案中,靶序列编码参与香蕉成熟的蛋白质,例如参与乙烯产生的蛋白质。在一些实施方案中,可靶向编码ACO(1-氨基环丙烷-1-羧酸氧化酶)和/或ACS(ACC-合酶)的序列。ACO和ACS参与乙烯产生。在某些实施方案中,所需突变降低ACO和/或ACS的表达或活性,这可延迟香蕉果实的成熟。在优选的实施方案中,所需突变在编码ACO或ACS或两者的序列中引入终止密码子。在优选的实施方案中,靶序列是ACO1(例如SEQ ID NO:46,或与SEQ ID NO:46具有至少90、95、98或99%序列同一性的序列)。在一些此类实施方案中,用于引入突变的一个或更多个向导RNA包括选自以下的序列:SEQ ID NO:24-26,或与SEQ IDNO:24-26具有至少90、95、98或99%序列同一性的序列。在某些实施方案中,所述靶是ACO并且所述靶序列选自SEQ ID NO:46和SEQ ID NO:47,或与SEQ ID NO:46或SEQ ID NO:47具有至少90%、95%、98%或99%序列同一性的序列。在某些实施方案中,所述靶是ACS并且所述靶序列选自SEQ ID NO:43、SEQ ID NO:44和SEQ ID NO:45,或与SEQ ID NO:43、SEQ ID NO:44或SEQ ID NO:45具有至少90%、95%、98%或99%序列同一性的序列。
在一些实施方案中,所述靶序列编码参与香蕉褐变的蛋白质。在一些实施方案中,靶序列是编码PPO(多酚氧化酶)的序列。PPO是一种被认为负责香蕉褐变的酶。在优选的实施方案中,靶序列是PPO2(例如SEQ ID NO:48,或与SEQ ID NO:48具有至少90、95、98或99%序列同一性的序列)。在某些实施方案中,靶序列是PPO1(例如SEQ ID NO:49,或与SEQ IDNO:49具有至少90、95、98或99%序列同一性的序列)。在某些实施方案中,靶序列是PPO3(例如,SEQ ID NO:50,或与SEQ ID NO:50具有至少90、95、98或99%序列同一性的序列)。在某些实施方案中,靶序列是PPO4(例如,SEQ ID NO:51,或与SEQ ID NO:51具有至少90、95、98或99%序列同一性的序列)。在某些实施方案中,靶序列是PPO5(例如,SEQ ID NO:52,或与SEQ ID NO:52具有至少90、95、98或99%序列同一性的序列)。在某些实施方案中,靶序列是PPO6(例如,SEQ ID NO:53,或与SEQ ID NO:53具有至少90、95、98或99%序列同一性的序列)。在某些实施方案中,靶序列是PPO7(例如,SEQ ID NO:54,或与SEQ ID NO:54具有至少90、95、98或99%序列同一性的序列)。在某些实施方案中,靶序列是PPO8(例如,SEQ ID NO:55,或与SEQ ID NO:55具有至少90、95、98或99%序列同一性的序列)。在某些实施方案中,靶序列是PPO9(例如,SEQ IDNO:56,或与SEQ ID NO:56具有至少90、95、98或99%序列同一性的序列)。在某些实施方案中,所需突变降低PPO的表达或活性,这可延迟香蕉果实的成熟。在优选的实施方案中,所需突变在编码PPO的序列中引入终止密码子。在一些此类实施方案中,用于引入突变的一种或更多种向导RNA包括选自以下的序列:SEQ ID NO:21-23,或与SEQ ID NO:21-23具有至少90%、95%、98%或99%序列同一性的序列。
任选地,可靶向多个序列以进行突变(例如,可同时靶向ACO、ACS和PPO基因的任何组合),并且对于每个序列靶标,可引入一个或多个突变。本发明可用于突变大量不同基因以实现各种效果。本发明特别可用于调节内源性基因以提供改进的性状并保护生物免受不同的生物和非生物胁迫,例如癌症、病毒、昆虫、真菌(fungi)、线虫、热(heat)、干旱、饥饿(starvation)等。此部分提供根据本发明被引入突变的优选的靶基因。优选地,所需突变降低基因的表达或所表达的蛋白质的活性。优选地,所述突变是引入终止密码子。
在某些实施方案中,根据本发明突变基因提供具有增加的胁迫耐受性、增加的产量、增加的生长速率或增加的产量品质的植物。在某些实施方案中,靶序列编码涉及胁迫耐受性、产量、生长速率或产量品质的蛋白质。
如本文中所使用的,短语“胁迫耐受性”是指植物耐受生物胁迫或非生物胁迫而不遭受代谢、生长、生产力和/或生活力(viability)的实质性改变的能力。
如本文中所使用的,短语“非生物胁迫”是指植物、植物细胞等暴露于非生命的(“非生物的”)环境、物理或化学试剂(其对于植物的代谢、生长、发育、繁殖或存活(统称为“生长”)具有不利影响)。可对植物施加非生物胁迫,例如由于环境因素,例如水(如,洪水、干旱或脱水)、厌氧条件(如,较低水平的氧气或较高水平的CO2)、异常渗透条件(如,渗透胁迫)、盐度或温度(如,热(hot)/加热(heat)、冷、冰冻或霜冻)、暴露于污染物(如,重金属毒性)、厌氧生活(anaerobiosis)、营养缺乏(如,氮缺乏或有限的氮))、大气污染或紫外线照射。
如本文中所使用的,短语“生物胁迫”是指植物、植物细胞等暴露于生命的(“生物的”)生物体(还包括病毒),其对植物的代谢、生长、发育、繁殖、产量或存活(统称为“生长”)具有不利影响。生物胁迫可通过例如细菌、病毒、真菌、寄生虫、有益的昆虫及有害的昆虫、杂草以及培育的或天然的植物所引起。
如本文中所使用的,短语“产量”或“植物产量”是指增加的植物生长(生长速率)、增加的作物生长、增加的生物量及/或增加的植物产品产量(包括谷物、果实、种子等)。
根据一个实施方案,为了产生具有增加的胁迫耐受性、增加的产量、增加的生长速率或增加的产量品质的植物,所需突变为在植物基因中,该基因赋予植物对胁迫、产量降低、生长速度降低或产量品质降低的敏感性。在此类实施方案中,所述突变可以使基因失活或降低其表达或活性。
根据本发明,可使用本领域已知的用于评估增加的胁迫耐受性的任何方法。评估增加的胁迫耐受性的示例性方法包括,但不限于如在Yoon,SK.,Bae,EK.,Lee,H.etal.Trees(2018)32:823.记载的,下调白杨木中的PagSAP1以增加盐胁迫耐受性,以及通过下调SlbZIP38以提高西红柿的耐旱性(Pan Y et al.Genes 2017,8,402;doi:10.3390/genes8120402)。
根据本发明,可使用本领域已知的用于评估增加的产量的任何方法。评估产量增加的示例性方法包括,但不限于如在Ar-Rafi Md.Faisal,et al.,AJPS>第8卷,第9期,2017年8月,DOI:10.4236/ajps.2017.89149中所记载的,在水稻中降低DST的表达;及如在WangY et al.,Mol Plant.2009Jan;2(1):191-200.doi:10.1093/mp/ssn088所记载的,在油菜中下调BnFTA导致产量增加。
根据本发明,可使用本领域已知的用于评估增加的生长速率的任何方法。评估增加的生长速率的示例性方法包括,但不限于,如在Marcelo de Freitas Lima etal.Biotechnology Research and Innovation(2017)1,14---25所记载的,降低拟南芥中BIG BROTHER或GA2-氧化酶的表达导致生长和生物量增加。
根据本发明,可使用本领域已知的用于评估增加的产量品质的任何方法。评估增加的产量品质的示例性方法包括但不限于,在水稻下调中OsCKX2导致产生更多的分蘖(tiller)、更多的谷物和更重的谷物,如Yeh S_Y et al.,Rice(N Y).2015;8:36所记载的;和降低许多植物中的OMT水平,其导致木质素积累改变,提高用于工业目的的材料的消化率,如在Verma SR and Dwivedi UN,South African Journal of Botany,第91卷,2014年3月,第107-125所记载的。
根据一个实施方案,所述方法进一步能够产生包括增加的甜度、增加的糖含量、增加的风味、改善的成熟控制、增加的水胁迫耐受性、增加的热胁迫耐受性及增加的盐耐受性的植物。本领域技术人员将知道如何利用本文描述的方法来选择靶基因序列用于突变。
如本文中所使用的,术语“害虫”是指直接或间接危害植物的生物体。直接的效果包括,例如,以植物叶子为食。间接的效果包括,例如,致病因素(disease agent)(如,病毒、细菌等)传播至植物。在后种情况下,害虫充当病原体传播的载体。
根据一个实施方案,害虫是无脊椎生物体。
示例性害虫包括,但不限于,昆虫(insect)、线虫(nematode)、蜗牛(snai)、蛞蝓(slug)、蜘蛛(spider)、毛毛虫(caterpillar)、蝎子(scorpion)、螨(mite)、蜱(tick)、真菌等。
待突变的植物或病原体靶基因的鉴定可使用本领域已知的任何方法来实现,例如通过常规的生物信息学分析。
进一步选择
在某些实施方案中,所述方法包括在使用ALS抑制剂的处理后的额外选择步骤。特别地,它可被用于选择不包括整合到其基因组中的异源DNA(例如编码所述试剂的核酸构建体)的香蕉胚或植物以及被确认在靶序列中包括至少一种所需突变的胚或植物。
在某些实施方案中,该方法包括在使用ALS抑制剂的选择后,选择在靶序列中包括至少一种所需突变的至少一种香蕉细胞或植物。所述突变可以是至少一个核苷酸插入;至少一个核苷酸缺失;至少一个核苷酸取代;以及前述的任意组合。选择在靶序列中包括至少一种所需突变的至少一种香蕉细胞或植物可包括检测突变的基因组序列的存在。
在优选的实施方案中,用ALS抑制剂处理香蕉细胞提供香蕉细胞群体,并且所述群体中至少1%、5%、10%或15%的细胞不包括异源DNA。在进一步优选的实施方案中,至少0.1%、1%、2%、3%、4%或5%的细胞包括在内源乙酰乳酸合酶基因中的突变并且包括在靶序列中的所需突变。
在某些实施方案中,所述方法包括在使用ALS抑制剂的选择后,选择不包括任何异源DNA(特别是编码可操作以引入突变的试剂的整合核酸构建体)的至少一种香蕉细胞或植物。此类至少一种香蕉细胞或植物的选择可包括确认在细胞或植物的基因组中不存在核酸构建体序列。
这可以使用本领域中已知的能够检测修饰或编辑事件的任何技术来实现,例如但不限于DNA测序(例如下一代测序)、电泳、基于酶的错配检测测定和杂交测定,如PCR、RT-PCR、RNase保护、原位杂交、引物延伸、DNA印迹、RNA印迹和斑点印迹分析。还可以使用用于检测单核苷酸多态性(SNP)的各种方法,例如基于T7核酸内切酶的PCR、异源双链体和Sanger测序。高分辨率熔解曲线分析(High-resolution melting analysis)是另一种验证编辑事件的存在的方法。另一种方法是异源双链体迁移率测定。也可以通过用天然聚丙烯酰胺凝胶电泳(PAGE)直接分析再杂交的PCR片段来检测突变。该方法利用了异源双链体和同源双链体DNA在聚丙烯酰胺凝胶中的差异迁移。验证编辑事件的存在的其他方法详细记载于Zischewski(2017),Biotechnology Advances 1(1):95-104中。
在优选的实施方案中,用于转化香蕉细胞的试剂是核酸构建体,例如T-DNA质粒,并且所述构建体额外编码选择性标记物,例如荧光标记物,例如mCherry。此类标记物可帮助选择非转基因胚,例如,不具有整合到其基因组中的构建体并且仅瞬时表达编辑试剂的非转基因胚,因为它们会不表达选择性标记物,如实施例中所证明的。
合适的标记物可包括抗生素选择性标记物。可以使用的抗生素选择性标记物的实例是新霉素磷酸转移酶II(nptII)和潮霉素磷酸转移酶(hpt)。可根据本教导使用的额外的标记物基因包括但不限于庆大霉素乙酰转移酶(accC3)抗性以及博莱霉素和腐草霉素抗性基因。进一步优选的标记物包括:提供三嗪抗性的突变型psbA,特别是G264S和I219V;和突变型烯醇式丙酮酰莽草酸-3-磷酸合酶(enolpyruvylshikimate-3-phosphate synthase,EPSPS),其赋予对EPSP合酶抑制剂的抗性,特别是色氨酸102突变、丙氨酸103突变和脯氨酸106突变。在优选的实施方案中,选择性标记物是荧光蛋白,其在实施例中被证明特别有效,例如mCherry、mTurquoise 2、GFP,例如sfGFP或pH-tdGFP、Gamillus、mNeonGreen、mEYPF、mCitrine、Citrine或TagRFP。优选地,选择性标记物是mCherry。
香蕉细胞、组织和植物
如本文所用,术语“香蕉”是指芭蕉(Musa)属植物,包括大蕉(plantain)。这些包括小果野蕉(Musa acuminata)(例如,小果野蕉Banksia、小果野蕉Calcutta和小果野蕉DH-Pahang)、野蕉(Musa balbisiana)、阿宽蕉(Musa itinerans)和同源三倍体小果野蕉‘卡文迪什’和‘大麦克’(Gros Michel)。根据优选的实施方案,香蕉是同源三倍体小果野蕉‘Cavendish’。根据进一步优选的实施方案,所述香蕉是Grand Nain栽培种。栽培香蕉是源自祖先物种小果野蕉(基因组AA)的不育同源三倍体(AAA)。此外,大蕉(AAB或ABB)是源自小果野蕉(AA)和野蕉(基因组BB)的杂交的不育的种间异源三倍体。栽培香蕉和大蕉的三倍体性质防止它们产生可育性种子,而野生物种是二倍体并可以产生可育性种子。在优选的实施方案中,所述香蕉植物是三倍体。考虑到了其他倍性,包括二倍体和四倍体。
如本文所用,术语“植物”是指整株植物、嫁接的植物、植物的祖先及后代、植物器官、植物组织和“植物部分”。如本文所用,“植物部分”包括分化和未分化组织,包括但不限于根(包括块茎)、根茎(rootstock)、茎、接穗(scion)、芽、果实、叶、花粉、种子、肿瘤组织和各种形式的细胞和培养物(例如单细胞、原生质体、胚、胚细胞和愈伤组织)。植物组织可在植物中或在植物器官、组织或细胞培养物中。
在具体实施方案中,植物部分是果实。果实包括诸如果肉和果皮的组织。在其他实施方案中,植物部分是种子。本文使用的术语“种子”是指能够发育成另一株此类植物的开花植物的繁殖单位。术语“植物器官”是指植物组织或构成植物的形态学上和功能上不同部分的一群组织。术语“基因组”是指存在于生物体、病毒或细胞器的每个细胞中的遗传物质(基因和非编码序列)的完整互补体(entire complement);和/或作为(单倍体)单位从父母一方继承的完整染色体组。“后代”包括植物的任何后续世代。
“转基因植物”包括例如在其基因组内包括通过转化步骤引入的异源多核苷酸的植物。异源多核苷酸可以被稳定地整合在基因组内,使得多核苷酸被传递到连续的世代。异源多核苷酸可以单独或作为重组DNA构建体的一部分被整合到基因组中。转基因植物还可以在其基因组内包括多于一种异源多核苷酸。每个异源多核苷酸可赋予转基因植物不同的性状。
如本文所用,“异源”多核苷酸包括起源于外来物种的序列。
“转基因”可包括任何细胞、细胞系、愈伤组织、组织、植物部分或植物,其基因型已通过异源核酸的存在而改变,包括最初如此改变的那些转基因以及通过有性杂交或无性繁殖从最初的转基因产生的那些。在一些植物中,引入到植物基因组中的异源多核苷酸可以通过育种去除。如上所述,该过程在香蕉中是不可能的。
根据优选的实施方案,本文记载的以及通过本发明的方法获得的香蕉细胞、香蕉植物或香蕉植物部分是非转基因的。根据一些实施方案,本文公开的方法产生非转基因的香蕉细胞、香蕉植物或香蕉植物细胞。
通过传统植物育种方法、本文记载的基因组编辑步骤(不会导致外源多核苷酸的插入)或通过天然存在的事件(例如随机异体受精、非重组病毒感染、非重组细菌转化、非重组转座或自发突变)进行的基因组(染色体或染色体外)的改变不旨在被视为转基因的。
在一些实施方案中,“香蕉植物”是香蕉育种株系,例如优良株系(elite line)或纯种株系(purebred line),或香蕉品种或育种种质。如本文所用,术语“育种株系”是指与野生品种或地区品种相反的具有商业价值或农艺学上合乎需要特征的栽培香蕉株系。该术语包括提及“优良育种株系”或“优良株系”,其代表用于产生商业F1杂种的基本上纯合的(通常近交的)植物株系。“优良育种株系”是通过育种和选择包括大量的农艺学上合乎需要性状的优异农艺学性能而获得的。“优良植物”是来自优良株系的任何植物。优异的农艺学性能是指如本文所定义的农艺学上合乎需要性状的所需组合,其中与非优良育种株系相比,需要大多数(优选地全部)农艺学上合乎需要性状在优良育种株系中被改进。优良育种株系基本上是纯合的并且优选地是近交株系。本文所用的术语“优良株系”是指通过育种和选择优异农艺学性能而获得的任何株系。
术语“栽培种”和“品种(variety)”在本文中可互换使用,并且意指已通过育种(例如,杂交和选择)有意开发的植物,目的是商业化(例如由农民和种植者使用)来产生农产品供自己消费或商业化。术语“育种种质”意指具有除“野生”状态之外的生物学状态的植物,其中“野生”状态表示植物或种质的原始非栽培或天然状态。
术语“育种种质”包括但不限于半天然、半野生、杂草、传统栽培种、地区品种(landrace)、育种材料、研究材料、育种株系、合成群体、杂种、创始原种(founder stock)/基础群体(base population)、近交株系(杂交栽培种的亲本)、分离群体、突变体/遗传原种、市场类别和先进/改进的栽培种。如本文所用,术语“纯种的”、“纯近交的”或“近交的”是可互换的,并且是指通过重复自交和/或回交获得的基本上纯合的植物或植物株系。
如本文所用,术语“香蕉植物细胞”是香蕉植物的细胞。香蕉植物细胞包括来自种子、悬浮培养物、胚、分生组织区、愈伤组织、叶、根、芽、配子体、孢子体、花粉、小孢子、胚性细胞、体细胞和原生质体的细胞。原生质体可以源自任何植物组织,例如但不限于根、叶、胚性细胞悬浮液、愈伤组织或幼苗组织。根据一些实施方案,香蕉植物细胞是胚性细胞悬浮液(ECS)的细胞。
本发明进一步提供了可通过本发明前述方法中的任一种获得的香蕉植物细胞。
在本发明的一些实施例中,所述方法进一步包括从所述香蕉植物细胞再生香蕉植物。
如本文所用,“再生”可包括通过首先生长香蕉植物细胞至发育成愈伤组织的群体,然后使用植物组织培养方法从愈伤组织再生芽(离体培养枝芽发生(caulogenesis)),从而使香蕉植物细胞(其包括原生质体)生长成整株香蕉植物。香蕉原生质体生长成愈伤组织以及随后的芽再生需要在必须定制的组织培养基中植物生长调节剂的适当平衡。原生质体也可用于植物育种,使用称为原生质体融合的技术。通过使用电场或聚乙二醇溶液来诱导来自不同物种的原生质体融合。该技术可用于在组织培养中产生体细胞杂种。原生质体再生的方法是本领域中公知的。影响原生质体的分离、培养和再生的若干因素:即基因分型、供体组织及其预处理、原生质体分离的酶处理、原生质体培养方法、培养物、培养基和物理环境。(参见Maheshwari et al.(1986),“Differentiation of Protoplasts and ofTransformed Plant Cells”:3-36.Springer-Verlag,Berlin)。再生的香蕉植株可进行选择。香蕉植物或其细胞可缺乏转基因,即“非转基因”。例如,香蕉植物可缺乏编码本发明的一些实施方案中使用的任何CRISPR/Cas系统的任何DNA构建体。
任何适当的培养基和培养技术可用于本发明的方法中。合适的示例性培养基和技术在例如Banana and Plantain embryogenic cell suspensions-INIBAP TechnicalGuidelines 8(Strosse et al.,Vézina和Picq,eds,2003)中提供。例如,在Strosse etal.中,合适的胚发育培养基(EDM)是MA3以及合适的细胞悬浮培养基(CS)是ZZ1(参见附录2)。
在一些实施方案中,本发明的方法进一步包括从所述香蕉植物收获果实。每个成年香蕉植物产生单串,其由许多香蕉果实或“手指”组成,并聚集在若干“手”中。在这方面,“收获”具有常规意义。例如,可以使用锋利的弯刀(curved knife)或砍刀(machete)手工切割香蕉串(通常包括2或3人)。所述收获通常发生在香蕉果实仍呈绿色且坚硬(firm)时——成熟前7至14天。
本发明进一步提供可通过前述方法获得的香蕉植物或植物部分。
本发明进一步提供从可通过本发明前述方法获得的香蕉植物收获的果实,其中所述果肉和/或果皮的特征在于对ALS抑制剂的抗性的表型和由在靶序列中的所需突变提供的表型。
本发明进一步提供包括编码突变的ALS多肽和/或包括突变的ALS多肽的核酸序列的香蕉植物或果实,其中所述突变的ALS多肽包括与SEQ ID NO:9具有至少80、85、90、95、97、98、99或100%序列同一性的序列,并且具有在残基187处脯氨酸的取代,优选地取代为丝氨酸,或其中所述突变的ALS多肽包括与SEQ ID NO:10具有至少80、85、90、95、97、98、99或100%序列同一性的序列,具有在残基181处脯氨酸的取代,优选地取代为丝氨酸。在进一步的实施方案中,所述取代是苯丙氨酸。
总结
应当理解,所公开的产品和方法的不同应用可针对本领域中的具体需要进行定制。还应理解,本文使用的术语仅用于描述本发明的特定实施方案,而并非旨在为限制性的。
此外,如在本说明书和所附实施方案中使用的,单数形式“一(a,an)”和“所述(the)”包括复数指示物,除非内容另有明确规定。因此,例如,提及“多肽”包括“多个多肽”等。
除非特别禁止,本文公开的方法的步骤可以任何适当的顺序执行,且列出的步骤的顺序不应被认为是限制性的。
本文引用的所有出版物、专利和专利申请均通过引用的方式全文纳入本申请。
实施例
实施例1:绿磺隆(CSF)杀灭曲线,以测试使用CSF作为除草剂进行选择的可行性
本实施例令人吃惊地证明,可使用ALS抑制剂(特别是绿磺隆(CSF))对香蕉施加选择压力,包括在胚性细胞悬浮液中的胚性细胞阶段,其为非绿色组织。因此,本实施例证明ALS抑制剂被用作用于香蕉的除草剂,包括作为选择试剂。
杀灭曲线-香蕉ECS
CSF浓度最初根据以下CSF浓度选择,其在下文列出的出版物中已被报道在其他植物物种中有效:
1.甘蔗愈伤组织-3.6、20、50、200、500μg/L:3.6μg/L是5周后用于选择的最佳浓度。
2.马铃薯体细胞-20μg/L
3.番茄植物-3.6μg/L
4.水稻原生质体-3.6μg/L,持续10dd
5.地钱(Marchantia polymorpha)孢子-0.5μM(180μg/L),持续7dd
1.van der Vyver C,et al.2013.In Vitro Cell Dev Biol-Plant 49:198-206.
2.Barrel P,et al.2017.BMC Biotechnology 17:49.
3.Veillet F,et al.2019.Int J Mol Sci 20:402.
4.Li Z.,et al.1992.Plant Physiol 100:662-668.
5.Ishizaki K,et al.2015.PLoS ONE 10(9):e0138876.
香蕉ECS中的杀灭曲线设置
将香蕉胚性细胞悬浮液(ECS)在离心管(falcon tube)中沉淀,并与CS(细胞悬浮液)培养基混合至最终细胞密度为1-3%。将等体积的混合物分入6个烧瓶中,并加入相应量的CSF至终浓度0、10、20、50、100和500μg/L(第1个杀灭曲线)或0、0.5、5、50和100mg/L(第2个杀灭曲线)。将ECS-CSF混合物置于/接种于6孔板中(每孔5mL)并在25℃下在黑暗中以65rpm孵育3、5、10、15天。
在每个时间点,将ECS收集在细胞过滤器(strainer)中并用CS(细胞悬浮液)培养基洗涤以除去所有CSF。从过滤器中收集细胞并接种到胚发育培养基(EDM)板中并发育2-3个月(直到观察到完全发育的胚)。每月对细胞/胚进行传代培养。每次处理设三个重复进行并独立处理。
香蕉胚数量的评估
CSF处理后2-3个月评估胚数量。在立体显微镜下对胚计数,使用50方格网来帮助计数。对于胚过多的平板,仅对50个中的3个方格计数,然后对所有平板的胚数量标准化。对于其他板,对所有胚计数。平均值±标准差(Stdev)是由3次技术重复(technicalreplicate)获得。
幼苗中的杀灭曲线设置
也在生根幼苗中评估了CSF的效果。测试了三种方法,(i)将CSF掺入生根培养基中(RM),(ii)用CSF溶液喷洒幼苗,以及(iii)在培养基中掺入较低浓度的CSF(0.5mg/L)并喷洒幼苗(同时)。
在生根幼苗中测试六种CSF浓度:0、0.05、0.5、5、50和100mg/L。
将正在生根的香蕉幼苗清除根部并与球茎分开,以得到单芽,用于用含有或不含CSF的RM培养基进行测试。将五到六个具有相似形态的芽接种到每个phytotray中。
对于喷洒试验,在无菌去离子水或0.015% Silwett L-77中制备CSF溶液(上述浓度)。将1mL溶液喷洒到每个phytotray上(每个phytotray 5个芽,因此5个芽合用喷洒1mL)。对于对照条件(0mg/LCSF),使用1mL去离子水。
结果
香蕉ECS中的第一种杀灭曲线
测试浓度-0、10、20、50、200、500μg/L。结果示出于图1中。
香蕉ECS的第二种杀灭曲线
测试浓度-0、0.5、5、50、100mg/L(第一条杀灭曲线的200倍)。结果示出于图2、图3和表2中。
表2-用CSF(0、0.5、5、50、100mg/L)进行短期处理(5、10和15天)后3个月发育的香蕉胚数量。平均值±标准差(Stdev)获自3个技术重复。
在香蕉ECS中进行两种CSF杀灭曲线,并在2-3个月后评估胚数量。这些实验的主要目的是确定CSF的最小致死剂量,即与无CSF的情况下获得的数量(对照)相比,将胚数量减少100倍的浓度。(将存活胚数量减少至对照条件下所获得数量的百分之一(1/100)的浓度。)
在第一种CSF杀灭曲线中,根据先前的参考文献,测试了较低浓度的CSF(0、10、20、50、200、500μg/L),持续3、5、10和15天。尽管不同的CSF浓度和不同的暴露时间,但在测试浓度下未观察到胚数量的重大差异,包括在用500μg/L CSF暴露15天的极端情况中。
所用的CSF浓度似乎不足以杀死香蕉ECS。决定增加第二种杀灭曲线的CSF浓度。
在第二种CSF杀灭曲线中,测试了更高浓度:0、0.5、5、50和100mg/L,持续5、10和15天。用CSF处理后,浓度高于5mg/L时观察到香蕉ECS中颜色变化。较高浓度的CSF对胚数量产生不利影响。在5mg/L的CSF下持续10天达到成功标准。观察到胚数量的高于200倍的下降(下降到少于二百分之一(1/200))。显示5天的处理对于杀死香蕉细胞是低效的E,15天的处理并不比10天更好。
香蕉生根幼苗的杀灭曲线
在培养基中掺入CSF-结果示出于图4和图5中。
用CSF喷洒香蕉幼苗-结果示出于图6、图7和图8中。
用CSF喷洒香蕉植株并在培养基中掺入0.05mg/L CSF-结果示出于图9和图10中。
在香蕉生根芽中测试了一系列CSF浓度(0、0.05、0.5、5、50、100mg/L)。将香蕉芽从根分离并分成单芽,将所述单芽接种到新培养基中以确定一个月暴露时CSF的效果。为了评估CSF在香蕉中的有效性,测试了三种方法:(i)在RM培养基中掺入毒素,(ii)喷洒芽,以及(iii)喷洒芽并在RM中掺入少量CSF(0.05mg/L)。
将0.5mg/L CSF添加至培养基中是有效的。幼苗没有发育并表现出褪绿表型。在存在CSF的情况下,即使在较低浓度(如0.05mg/L)也未观察到根发育。这些实验重复3次,结果一致。
喷洒香蕉芽似乎不如在培养基中掺入CSF有效。5mg/L的CSF是延迟芽发育的最低浓度。在CSF溶液中使用Silwett L-77会造成更严重的表型。每周用0.5mg/L的CSF喷洒对于延缓香蕉芽发育非常有效。
结合这两种方法,在培养基中掺入少量CSF(0.05mg/L)并喷洒CSF,导致症状严重程度增加。培养基中存在少量CSF就足以加强CSF溶液的效果(甚至使用模拟溶液喷洒(无CSF)时,观察到低水平的症状)。在这些条件下,香蕉芽病得很严重,在实验期间没有发育。
实施例2:香蕉ALS基因的鉴定
在马铃薯和番茄中已经描述了通过绿磺隆处理选择的特定ALS编辑(V eillet Fet al.2019.Int J Mol Sci 20:402)。这些ALS肽的序列被鉴定、下载并用于查询香蕉基因组/衍生蛋白质组(Droc et al.,The banana genome hub.Database,2013)。
马铃薯:
stALS1
>tr|M1D7U6|M1D7U6_SOLTU Acetolactate synthase OS=Solanum tuberosumOX=4113GN=102594064PE=3SV=1SEQ ID NO:1
stALS2剪接变体1
>tr|M1AAA1|M1AAA1_SOLTU Acetolactate synthase OS=Solanum tuberosumOX=4113GN=102595762PE=3SV=1SEQ ID NO:2
stALS2剪接变体2
>tr|M1AAA0|M1AAA0_SOLTU Acetolactate synthase OS=Solanum tuberosumOX=4113GN=102595762PE=3SV=1SEQ ID NO:3
番茄:
SlALS1
>tr|A0A3Q7FJD2|A0A3Q7FJD2_SOLLC Acetolactate synthase OS=Solanumlycopersicum OX=4081GN=101247891PE=3SV=1SEQ ID NO:4
SlALS2
>tr|A0A3Q7HE60|A0A3Q7HE60_SOLLC Acetolactate synthase OS=Solanumlycopersicum OX=4081PE=3SV=1
SEQ ID NO:5
ALS还已经在拟南芥中也被描述(Sathasivan et al.,1990):
AtALS1
>sp|P17597|ILVB_ARATH Acetolactate synthase,chloroplastic OS=Arabidopsis thaliana OX=3702GN=ALS PE=1SV=1
SEQ ID NO:6
五个马铃薯和番茄的氨基酸序列中的每一个都被用作TBLASTN BLAST搜索中的查询(query),该搜索针对Geneious Prime软件中的自定义香蕉基因组数据库(www.geneious.com)。对于每个查询,在氨基酸水平上只有两次命中超过30% ID(同一性)。这些示出于下表3中。在本文Ma06_g18100被称为MaALS1并且Ma10_g11980被称为MaALS2。
表3与香蕉蛋白质组中已知ALS基因最接近的同源物,以及被表示为查询和目标(subject)肽序列之间相同的残基的百分比的同一性。
MaALS1和ALS2的序列如下所示。
>ALS1 Ma06_g18100-Ma06_g18100
SEQ ID NO:7
>ALS2 Ma10_g11980-Ma10_g11980
SEQ ID NO:8
>ALS1 Ma06_g18100-Ma06_g18100-CDS翻译
SEQ ID NO:9
>ALS2 Ma10_g11980-Ma10_g11980-CDS翻译
SEQ ID NO:10
总之,香蕉中的两个推定ALS基因与拟南芥、番茄和马铃薯中已知ALS基因具有高水平同源性。目前的香蕉基因组“DH-Pahang v2.0”中没有发现其他候选基因(可从以下网址获得:https://banana-genome-hub.southgreen.fr/organism/1)。
实施例3:香蕉中ALS1/ALS2的表达分析
本实施例测试香蕉中ALS1和ALS2的转录表达谱。RNA提取自多不同香蕉组织中,如下表4所详述。使用Illumina Truseq总RNA试剂盒和植物Ribozero制备RNA文库,并使用HiSeq 300循环(2x 150bp)高输出进行测序。这个总RNA-seq生成每个样本约4400万个读段(Cambridge Genomic Services)。
表4用于生成在RNAseq数据集中使用的RNA样本的香蕉组织的样本描述。
使用trim_galore和fastQC修剪所得的RNA测序读段,并使用HISAT2从香蕉基因组hub(Droc 2013)映射到Musa acuminata DH Pahang v2(2016年1月开放)(Kim et al.,2019,Nature Biotechnology,37(8),907-915)。使用Kallisto进行计数和TPM计算(Bray,2016,Nature Biotechnology,34(5)),使用R和EdgeR脚本计算RPKM和TMM(McCarthy etal.,2012,Nucleic Acids Research,40(10),4288-4297,Robinson et al.,2009,Bioinformatics,26(1),139-140)。这些数据用于准备图11,该图示出在所有取样组织类型中香蕉ALS1和ALS2的RPKM值。
为了研究早期发育阶段中ALS1和ALS2的转录谱,并验证根和叶样本的RNAseq数据,进行了RT-qPCR。
所有组织样本均来自香蕉Grand Nain(GN_236),本发明人使用源自雄花(ECS)的胚细胞悬浮液、从这些细胞再生的衍生胚以及来自从这些胚萌发的植物的叶和根材料。
使用Direct-zol试剂盒提取RNA,按照制造商说明书,将样品在柱中进行DNase处理。在使用Turbo-DNase(Ambion)和Superscript IV VILO(Thermofisher)以等量RNA生成cDNA之前,评估浓度和质量。
为了评估每个样品中的转录物水平,本发明人在Roche LightCycler96系统中,在RT-qPCR反应中使用了PowerUpTMSYBRTMGreen Master Mix(Applied Biosystems)。使用Excel中的ΔΔCt方法分析数据,如表5和图12所示。ALS1和ALS2的表达被标准化为引物效率,并使用RPS2表达,并且所有数据均相对于ECS中的ALS1表达示出。
使用寡核苷酸引物生成扩增子:
ALS1F GGTCTTCGCTGCCTCTGGAT(SEQ ID NO:11),
ALS1R(引物2365)CCACCACCAACATAGAGGACT(SEQ ID NO:12),
ALS2F(引物2366)ATCTGTTCGCCTACCCTGGA(SEQ ID NO:13),
ALS2R GATGGATCTGGTGACCTCGAC(SEQ ID NO:14),
RPS2F TAGGGATTCCGACGATTTGTTT(SEQ ID NO:15)和
RPS2R TAGCGTCATCATTGGCTGGGA(SEQ ID NO:16)。
表5计算的相对标准化表达数据
图11和图12中呈现的数据显示,在所有测试的组织中,ALS1的表达高于ALS2。ALS1在香蕉的果实期表达最高,而ALS2的峰值表达在香蕉叶组织中。相对于叶、根和果实,ALS1和ALS2两者在ECS和胚中的表达较低。
实施例4:选择突变以在ALS1和ALS2中引入
ALS酶是若干除草剂的靶标:磺酰脲(SU)、咪唑啉酮(IMI)、嘧啶基硫代苯甲酸酯、磺酰基-氨基羰基-三唑啉酮和三唑并嘧啶。已知数百种杂草物种具有天然产生的对这些化合物的抗性。50种杂草的ALS基因的8个位置处有26个抗性氨基酸取代(Yu&Powles,2014a,Pest Management Science,70(9))。ALS中最频繁发生的突变位于Pro-197、Trp-574-Leu和Ala-121;这些突变通常赋予对特定除草剂的抗性。上表1示出了已记载的已知ALS突变,以及对于拟南芥中此类氨基酸位点的除草剂结合。
与这项工作相关的是,在所有杂草物种中发现的最常见突变是Pro-197。这种突变赋予对磺酰脲类化合物(例如绿磺隆)的抗性。
拟南芥仅具有一个ALS基因,并且氨基酸变化Pro-197-Ser赋予对磺酰脲类化合物绿磺隆的强抗性(Haughn et al.,1988,MGG Molecular&General Genetics,211(2),266-271)。ALS基因中的突变已被记载,并在多种作物物种中赋予对磺酰脲类化合物的抗性,包括番茄(Jung et al.,2004,Biochemical Journal,383(1),53-61和Veillet F,etal.2019.Int J Mol Sci 20:402),马铃薯(Barrel et al.,2017.BMC Biotechnology 17:49和Veillet F et al.2019)、玉米(Li et al.,1992.Plant Physiol 100:662-668)、烟草(Jung et al.,2004)和水稻(Wakasa et al.,2012,Plant Cell Reports,31(11),2075-2084)。
如前所述,香蕉有两个ALS同源物。为了鉴定对应于拟南芥的Pro-197残基的特定氨基酸残基,本发明人比较了来自若干植物物种的肽序列的比对(alignment)。
Pro197残基在下文中以粗体和下划线突出显示。在该区域中氨基酸序列的保守程度非常高。
请注意,在香蕉ALS1中,所需的脯氨酸突变为Pro187Ser,在ALS2中为Pro181Ser。
ALS2 Ma10_g11980-SEQ ID NO:10
ALS1 Ma06_g18100-SEQ ID NO:9
sp|P17597|ILVB_ARATH-SEQ ID NO:6
tr|A0A3Q7HE60|A0A3Q7HE60_SOLLC-SEQ ID NO:5
tr|M1AAA1|M1AAA1_SOLTU-SEQ ID NO:2
tr|M1AAA0|M1AAA0_SOLTU-SEQ ID NO:3
tr|M1D7U6|M1D7U6_SOLTU-SEQ ID NO:1
tr|A0A3Q7FJD2|A0A3Q7FJD2_SOLLC-SEQ ID NO:4
实施例5:ALS1或ALS2基因中突变的引入
本实施例中使用的质粒图谱示出于图13和14中。
为了在香蕉ALS1和ALS2基因中产生赋予对CSF的抗性的特定点突变(ALS1Pro187Ser;ALS2Pro181Ser),本发明人设计了精确碱基编辑器(PBE)表达构建体。本发明人的构建体是基于用于产生小麦CSF抗性的构建体(Zong et al.NatBiotechnol.2017,35(5):438-440),其使用融合至Cas9(nCAs9)的D10A突变形式的rAPOBEC(Komor et al.,2016,Nature,533)胞苷脱氨酶和尿嘧啶去糖基化酶,参见图13-15。该融合蛋白可使用sgRNA向导的Cas9将胞苷脱氨酶和尿嘧啶DNA糖基化酶(UDG)递送至与sgRNA同一条链上的DNA序列的精确编辑“窗口”,在此处它可修饰一个或更多个胞苷核苷酸碱基(C)为胸腺嘧啶核苷酸(T)碱基,例如,如果两个C碱基都被编辑,则ATCGC将变为ATTGT。“切口酶”nCas9(D10A)修饰修饰催化活性,使得当nCas9与sgRNA位点结合时,只有DNA双链体的一条链被切割。该融合物在下文中以核苷酸和肽fasta序列显示。
>rAPOBEC-nCas9-PBE-UDG rice codon optimised and Golden Gatedomesticated NUCLEOTIDE
SEQ ID NO:17
>rAPOBEC-nCas9-PBE-UDG rice codon optimised PEPTIDE
SEQ ID NO:18
为了引导PBE融合物以编辑香蕉ALS1和ALS2中的所需碱基,使用Geneious Prime2019设计向导RNA序列。
使用的具体向导序列是:
ALS1 sgRNA CAGGTCCCTCGCCGCATGAT(SEQ ID NO:19)和ALS2 sgRNACAGGTTCCCCGCCGCATGAT(SEQ ID NO:20)
为了证明这些sgRNA可以促进香蕉ALS1和ALS2基因中的所需编辑,本发明人进行原生质体编辑实验。在这里,本发明人从香蕉ECS细胞中产生原生质体,并用包含ALS1或ALS2向导的rAPOBEC PBE碱基编辑器构建体或带有非靶向sgRNA的对照转染它们(下文的方案),分别命名为pMOL0630、pMOL0632和pMOL0628。这些构建体还含有mcherry荧光基因标记盒。转染三天后,使用FACS Melody分选原生质体。对于每个质粒和阴性对照,最少分选2,000个原生质体。对20,000个事件进行原生质体计数,以测量每个质粒的转染效率;每个质粒的效率>1%,大多数质粒>3%。分选后,在DNA提取之前在-80℃下储存原生质体。
表6
使用扩增子测序和下一代测序评估ALS1和ALS2基因座中的编辑水平。使用QiagenDNeasy植物基因组DNA提取试剂盒从原生质体中纯化DNA,并在使用Q5 master mix聚合酶(NEB)且使用ALS1和ALS2的PCR引物的扩增反应中用作模板。
表7
接下来,使用ALS1和ALS2的引物对来自第一次PCR反应的稀释的扩增子进行巢式PCR。
表8
扩增子被送到Genewiz公司以使用Miseq技术测序。本发明人使用Geneious评估每个样品中的编辑水平,总结在表9中。
表9-使用原生质体群体的Ampseq进行的在ALS1和ALS2基因座中的编辑检测。
表9中呈现的数据证明碱基编辑构建体是活性的并且可以在香蕉细胞的转化群体中以高频率在香蕉中产生CSF抗性需要的所需编辑。
原生质体转染方案
将香蕉原生质体在过滤灭菌酶促消化混合物(2%纤维素酶RS、0.05%离析酶R-10、0.25%果胶酶Y-23、0.2%半纤维素(所有产品均来自Sigma))中从胚性细胞悬浮培养物中分离。将大约2.5mL沉淀的ECS接种到5mL酶促溶液中,并在黑暗中以40-60rpm孵育过夜。
孵育后,组织和缓冲液混合物通过40μm过滤器(Fisherbrand,FisherScientific)过滤,使用1vol 0.45M甘露醇清洗培养皿,然后通过同一过滤器以稀释酶溶液并最大化原生质体收集。收集原生质体并以100g离心4分钟去除酶溶液。然后将原生质体重悬于10mL 0.45M甘露醇中,并再次以100g离心(spin down)4分钟。将原生质体重悬于1mLMMG溶液(0.4M甘露醇、15mM MgCl2、4mM MES)中;使用血细胞计数器和台盼蓝评估浓度和细胞活力。
将原生质体稀释至最终密度1.5x10^6个细胞/mL,然后在转染前将其置于冰中。将20μg质粒DNA与200μL原生质体混合,然后将1vol的PEG溶液(4g PEG-8000、4mL0.5M甘露醇、1mL 1M CaCL2)添加到先前的混合物中。
室温孵育30分钟后,将原生质体用10mL 0.45M甘露醇洗涤,并通过100g离心4分钟收集。将原生质体重悬于2mL W5(154mM NaCl、125mM CaCl2、5mM KCl、2mM MES pH5.6)中,并转移至6孔Falcon培养板并在黑暗中于25℃下孵育3天。
实施例6:可使用CSF选择从ALS1/ALS2中被编辑的ECS细胞生长的香蕉胚
材料和方法
根癌农杆菌制备
将携带二元质粒pMol_0628、pMol_0630和pMol_0632的根癌农杆菌AGL1在补充有利福平50mg/L、卡那霉素100mg/L和羧苄青霉素50mg/L的LB固体培养基中从甘油中划线并在28℃下孵育2天。第二天,将细菌接种到补充有相同抗生素的MG/L培养基中(Sarkar etal.,Methods Mol Biol.2013;966:187–204中讨论),并在28℃下孵育过夜。下一天,将培养物以4000rpm离心10分钟以沉淀细胞,然后将其重悬于ABIM中(农杆菌诱导培养基;Sarkaret al.,Methods Mol Biol.2013;966:187–204中讨论)以诱导毒力并在室温下在黑暗中孵育过夜。使用NanoDrop2000测量细菌的OD600,并在补充有乙酰丁香酮(100μM)的CS液体培养基中标准化至0.6。
香蕉ECS农杆菌转化(agrotransformation)
将2/3日龄细胞的香蕉ECS沉淀并重悬于CS培养基中。根据转化次数将0.5mL悬浮液(约0.1mL ECS)分开至小Eppendorf管中。将细胞在45℃下孵育5分钟,然后将其在室温再孵育5分钟。然后,将1mL农杆菌悬浮液添加到每个香蕉ECS样品中。将混合物混合,以1000rpm离心5分钟,然后在室温下孵育25分钟。将香蕉细胞合并,并将细菌悬浮液除去。将细胞接种于补充有100μM乙酰丁香酮的CS平板中,并在室温下孵育3天以促进转化。
香蕉胚发育和发芽
3天后,从先前的培养基中收集香蕉ECS并转移至补充有头孢噻肟300mg/L的CS培养基中。一周后,将香蕉细胞转移至补充有cefo(头孢噻肟)300mg/L(对照),或cefo 300mg/L和CSF 25μg/L或cefo 300mg/L和CSF 50μg/L的EDM平板。将培养基每两周更新(refreshed)一次,直至胚发育。随后,将香蕉胚转移到不含CSF的成熟培养基(EMM-胚成熟培养基)中一个月,然后将其转移到不含CSF的发芽培养基(GM4)中直至芽发育。
荧光显微镜分析
通过检查胚中mCherry荧光的存在来检查香蕉胚中T-DNA的存在。
结果
ALS编辑的选择
将香蕉ECS用二元载体转化,所述二元载体包含mcherry和胞苷碱基编辑器、ALS1的向导(pMol_0630)和ALS2的向导(pMol_0632)以及GFP的模拟向导(pMol_0628,对照)。
在没有选择压力的情况下,在每个转化组中观察胚。在存在CSF的情况下,没有从对照转化(pMol_0628)中观察到胚(参见图16)。可用CSF选择获得胚,所述CSF选择使用含有用于ALS1和ALS2点突变的向导的质粒。正如预期,使用较高CSF浓度时,观察到较少的胚。
测量mcherry荧光以评估具有或不具有整合T-DNA(在基因组中)的胚的比例。针对所有条件拍摄若干照片,并观察到大量mcherry胚,大约占胚的90%(参见图17和18)。这些数据表明大约10%的胚瞬时表达T-DNA和编辑系统。为了支持该观察结果,进行了qPCR基因分型来检测T-DNA的存在。
为了测试ALS编辑的香蕉ECS和胚的抗性,本发明人使用农杆菌菌株AGL1转化香蕉ECS细胞,以从质粒pMOL0628、pMOL0630和pMOL0632转移T-DNA。本发明人还产生细胞的未转化的对照样本。将ECS细胞与AGL1菌株共培养3天,然后在添加头孢噻肟(300mg/L)的CS培养基上恢复7天。这段时间之后,将细胞暴露于不同的选择性胚发育培养基:EDM+头孢噻肟300、EDM+头孢噻肟300+绿磺隆(25μg/L)或EDM+头孢噻肟300+绿磺隆(50μg/L)。将含有细胞的平板在25℃下的黑暗中静止孵育,并且每2周更换一次培养基。3个月后,本发明人进行以下观察。
表10
从这些数据可以清楚地看出,ALS1和ALS2 PBE构建体pMOL0630和pMOL0632分别在产生CSF抗性胚方面是有效的。本发明人监测这些胚群体的mCherry荧光,可以看到大约90%的再生胚为强烈发荧光的。本发明人相信这意味着它们是转基因的并且已整合包含PBE和mcherry盒的T-DNA。
对于在CSF处理后存活的胚的池(pool),本发明人需要将CSF上的存活与ALS1/ALS2基因座的编辑相关联。为了实现这一点,本发明人对胚的96孔板取样,并进行扩增子测序以观察胚中的编辑频率,并进行qPCR来测量转基因的并已整合来自构建体的T-DNA的胚的比例。
胚的样品取自:
·用pMOL0630转化的非选择的(无CSF)胚
·用pMOL0632转化的非选择的(无CSF)胚
·用pMOL0630转化的选择的(25μg/LCSF)胚
·用pMOL0632转化的选择的(25μg/LCSF)胚
·WT胚
将纯化的基因组DNA样品用作PCR反应中的模板,使用如前所述的ALS1和ALS2引物:ALS1 2364和2365;ALS2 2366和2367。
本发明人在每个测试的胚中均发现ALS编辑,有一个胚除外。这些数据总结在图21的表中。
为了确定这些胚是否是转基因的,本发明人使用与T-DNA结合的引物组对gDNA样本进行qPCR。本发明人使用引物3339和3340来检测Cas9-PBE融合物,和2537和2538来检测mCherry。本发明人还从每个样本中扩增ALS1,并使用该扩增子的扩增Ct值作为对照(基因组拷贝数标准化)。
用5μl模板DNA建立所有反应。本发明人使用两个WT(野生型)胚DNA对照和一个水对照。
根据制造商的建议,本发明人使用Applied PowerUp SYBR Green master mix在Roche LightCycler96中进行PCR。
循环参数:
UDG激活(热启动)50℃ 2min
双锁DNA聚合酶95℃ 2min
40个循环:
变性95℃15秒
退火58℃15秒
延伸72℃1分钟
使用Roche Lightcycler 96软件分析原始数据,收集每个样品的Ct值,并导出到excel。在此,本发明人使用ALS1引物Ct水平对T-DNA扩增子的Ct值标准化。当标准化的时候,本发明人对数据作图,如图20所示。在此,本发明人观察到除了一个非选择的胚外,所有胚似乎都具有与WT样品相似的Ct值(每板8个WT样品,参考图20)。本发明人为CT值设置一个任意阈值水平(arbitrary threshold level),高于该阈值时,本发明人假定胚样品是非转基因的。从存在大量细胞碎片、死亡农杆菌细胞和其他未发育的ECS细胞的平板中收集胚组织。本发明人不会预期使用这种方法可观察到“干净”的qPCR结果,并且满意的是高Ct值(<~25)表明胚是非转基因的。
本发明人对任何候选瞬时编辑(非转基因)胚进行测序。ALS1的样品号C4,以及ALS2的样品G1、G3、H5、H8和H10(在图20中用星号突出显示)。
将靶基因使用引物扩增:
ALS1 2364和2365
ALS2 2366和2367
Q5 master mix聚合酶,35个循环。标准扩增条件。
将PCR产物纯化并送去Sanger测序,使用与用于扩增片段的引物相同的引物。
测序结果总结
ALS1(图21)
非选择的胚
在非选择的胚中,10个在向导位点为WT(A1-A10),1个在向导位点有2nt(连续)C-T变化(A11)。一个胚是WT对照并且是序列为WT(A12)。
其他(仅非转基因胚)
一个(非转基因)-编辑的2nt变化,向导位点中的C-T(C4)
一个(非转基因)-WT(迄今为止本发明人已鉴定的唯一非编辑、CSF选择的胚)(C11)
ALS2(图22)
非选择的胚
全部在向导位点为WT(B1-B12)
其他(仅非转基因胚)
H5-向导位点中的单个C-T编辑。
H8-向导位点中的单个C-T编辑。
H10-向导位点中的2ntC-T编辑。
G1-向导位点中的2ntC-T编辑。
G3-向导位点中的2ntC-T编辑。
结论
本实施例和上述数据提供了大量关键观察结果。首先,其证明香蕉细胞(特别是ECS细胞)可以使用由T-DNA构建体表达的碱基编辑器来编辑。第二,编辑香蕉中的任一内源ALS基因可提供对ALS抑制剂的抗性,并且用ALS抑制剂处理从转化的ECS细胞生长的胚可有效地选择在内源ALS基因处被编辑的胚。第三,很大一部分选择的胚不具有整合到它们的基因组中的T-DNA构建体,因此它们瞬时表达碱基编辑系统并且是非转基因的。第四,可以通过观察T-DNA构建体上存在的选择性标记物的表达的缺失来识别非转基因胚。
实施例7:可使用CSF来鉴定和选择从ECS细胞生长的并在ALS1/ALS2中共同编辑的香蕉胚,和所选择的靶基因
为了测试基于ALS的CSF选择也选择“共同编辑”的另一个性状位点的能力,本发明人设计了包含验证的ALS sgRNA、碱基编辑器和靶向香蕉基因的新性状sgRNA的构建体。
本发明人选择靶向两个香蕉性状:ACO(1-氨基环丙烷-1-羧酸氧化酶)和PPO(多酚氧化酶)。香蕉中的靶位点是:ACO1——Ma01_g11540和PPO2——Ma07_g03540。
本发明人设计sgRNA以在ACO1和PPO2位点的CDS(编码序列)中引入终止密码子作为无义突变。本发明人使用RGEN工具BE-Designer(http://www.rgenome.net/be-designer/)来设计sgRNA序列(CRISPR RGEN Tools,n.d.;Hwang et al.,BMCBioinformatics 19,542(2018)),这些在Geneious Prime中验证。
PPO2:
>Ma07_g03540-CRISPR向导2961
SEQ ID NO:21
AGAGCCAGCTGTTGTGGACTTGG
>Ma07_g03540-CRISPR向导2962
SEQ ID NO:22
GTTCCAGAAAGGGAGCGAGAAGG
>Ma07_g03540-CRISPR向导2963
SEQ ID NO:23
TGAACAGCCTAACCCTGGCGCGG
ACO1:
>ACO_Ma01_g11540-CRISPR向导2965
SEQ ID NO:24
GAAGCAAAAGTTCAAACAGTTGG
>ACO_Ma01_g11540-CRISPR向导2966
SEQ ID NO:25
CTGCGAGAACCTCGGCCTGGAGG
>ACO_Ma01_g11540-CRISPR向导2967
SEQ ID NO:26
GGTGCCAAAGTTCGGACCTTTGG
这些sgRNA纳入Golden Gate构建体中,与验证的ALS1和ALS2sgRNA sg2021和sg2023一起测试:
表11-选择用于测试的质粒
如之前在实施例6中所述,本发明人使用上表11中详述的新的共同编辑的质粒转化ECS。如前所述再生胚,在没有选择的培养基上,用25mg/L G418、25μg/L CSF或50μg/LCSF。
所有测试的质粒的初始基因分型:
表12-由选择的胚中的质粒列出的性状编辑%
表13-选择方法在胚中的性状编辑效率
实施例8-用靶向ALS1、ALS2、PPO2和ACO1的碱基编辑构建体转化后的绿磺隆(CSF)抗性香蕉芽和胚的基因分型。
结果总结和讨论
本实施例证明,在使用碱基编辑器和仅靶向ALS的gRNA转化和使用ALS抑制剂选择后,超过99%的所有测序的CSF选择的样品在ALS1或ALS2中被编辑。其中6-9%为非转基因的并在ALS1中被编辑,12-15%的胚为非转基因的并在ALS2中被编辑。在36个芽中,21个(58%)是转基因的,以及100%的它们在ALS1或ALS2中被编辑。1/36样品(3%)是非转基因的(无T-DNA整合),并且可能在ALS2的单个等位基因中被编辑。
在用碱基编辑器及靶向ALS和“性状”基因ACO1和PPO2的gRNA转化后,观察到高水平的共同编辑。在测试的376个胚中观察到最高达5%非转基因终止密码子编辑。在芽阶段,2/34样品(6%)是非转基因的(无T-DNA整合)。在两个非转基因芽中,一个仅在ALS2中被编辑,另一个在ALS2和PPO2中被编辑。因此,本发明人发现在香蕉芽中,非转基因的性状编辑频率为3%。
材料和方法
本发明人进行了两组不同的实验。
1.使用胞苷碱基编辑器(“CBE”)及靶向ASL1或ALS2的sgRNA在香蕉胚和芽中将ALS编辑频率和绿磺隆抗性基线化(baseline)。本发明人将这些称为实验PBE#1(Grand Naine,GN236)和PBE#2(G251)。
2.评估使用相同的CBE及靶向ALS(1或2)和性状基因(PPO2或ACO1)两者的sgRNA在香蕉胚和芽中产生“共同编辑”的可行性,称为实验PBE#3。
本发明人使用以下构建体转化香蕉ECS:
PBE#1和PBE#2
质粒
全部带有CsVMV-rAPOBEC 1CBE(rAPOBEC+nCas9(D10A)PBE,来自Zong etal.2017*)
pMol_0628(mcherry,sgGFP--CsVMV::mCherry::tg7-pCsVMV::nCas9PBE1::tg7-Dummy-pTaU6::sg1GFP_58
pMol_0630(mcherry,sgALS1--CsVMV::mCherry::tg7-pCsVMV::nCas9PBE1::tg7-taU6::MaALS1sg2_2021)
pMol_0632(mcherry,sgALS2--CsVMV::mCherry::tg7-pCsVMV::nCas9PBE1::tg7-taU6::MaALS2sg22023)
*Zong Y,WangY,Li C,Zhang R,Chen K,Ran Y,Qiu JL,Wang D,Gao C.Precisebaseediting in rice,wheat and maize with a Cas9-cytidine deaminase fusion.NatBiotechnol.2017May;35(5):438-440.doi:10.1038/nbt.3811.Epub 2017Feb 27.PMID:28244994.
PBE#3
质粒
全部带有CsVMV-rAPOBEC1CBE(rAPOBEC+nCas9(D10A)PBE,来自Zong etal.2017*)和nosP-NptII
pMol_0925(sgGFP-pNOS::Kan::tOCS-pCsVMV::nCas9PBE1::tg7-Dummy-pTaU6::sg1GFP_58-pTaU6::sg2GFP_59)
pMol_0934(sgGFP,sgALS1-pNOS::Kan::tOCS-pCsVMV::nCas9PBE1::tg7-taU6::MaALS1sg2_2021-pTaU6::sg1GFP_58-pTaU6::sg2GFP_59)
pMol_0935(sgGFP,sgALS2-pNOS::Kan::tOCS-pCsVMV::nCas9PBE1::tg7-taU6::MaALS2sg2_2023-pTaU6::sg1GFP_58-pTaU6::sg2GFP_59)
pMol_0926(sgPPO2,sgALS1-pNOS::Kan::tOCS-pCsVMV::nCas9PBE1::tg7-taU6::MaALS1sg2_2021-pTaU6::sg1PPO2_2961-pTaU6::sg2PPO2_2962-pTaU6::sg3PPO2_2963)
pMol_0927(sgPPO2,sgALS1-pNOS::Kan::tOCS-pCsVMV::nCas9PBE1::tg7-taU6::MaALS1sg2_2021-pTaU6::sg3PPO2_2963-pTaU6::sg4PPO2_2964)
pMol_0928(sgACO1,sgALS1-pNOS::Kan::tOCS-pCsVMV::nCas9PBE1::tg7-taU6::MaALS1sg2_2021-pTaU6::sg1ACO1_2965-pTaU6::sg2ACO1_2966-pTaU6::sg3ACO1_2967)
pMol_0929(sgPPO2,sgALS2-pNOS::Kan::tOCS-pCsVMV::nCas9PBE1::tg7-taU6::MaALS2sg2_2023-pTaU6::sg1PPO2_2961-pTaU6::sg2PPO2_2962-pTaU6::sg3PPO2_2963)
pMol_0930(sgPPO2,sgALS2-pNOS::Kan::tOCS-pCsVMV::nCas9PBE1::tg7-taU6::MaALS2sg2_2023-pTaU6::sg3PPO2_2963-pTaU6::sg4PPO2_2964)
pMol_0931(sgACO1,sgALS2-pNOS::Kan::tOCS-pCsVMV::nCas9PBE1::tg7-taU6::MaALS2sg2_2023-pTaU6::sg1ACO1_2965-pTaU6::sg2ACO1_2966-pTaU6::sg3ACO1_2967)
在CS+cefo300 mg/L中转化1周后将ECS“恢复”。
本发明人在4种不同的条件下再生胚:
1.无选择
2.仅来自PBE#3的稳定G418选择(仅在EDM中2-3个月)(PBE#1/2质粒仅具有mCherry,无NptII)
3.25μg/L绿磺隆(仅在EDM中2-3个月)
4.50μg/L绿磺隆(仅在EDM中2-3个月)
本发明人通过观察胚再生来评估CSF对每种质粒的选择效力。
对于PBE#1和PBE#3,本发明人仅在构建体包含碱基编辑器和靶向ALS1或ALS2的sgRNA时观察到CSF上的胚发育。每个单个转化平板上可再生超过200个胚。
本发明人在发育中的两个不同时间点进行基因分型:
-EDM阶段后的单个成熟胚
-RM培养基中幼苗的单个绿色叶端
基因分型方法和结果
提取来自胚和幼苗的gDNA,并且使用qPCR来测试植物基因组中整合的tDNA的存在。使用跨越tDNA(范围从左边界到右边界)的不同引物组(如下所列)对每个样本运行3次独立的qPCR。对内源香蕉基因(actin)运行额外的qPCR,以进行Ct值的标准化。所有qPCR均使用PowerUpTMGreen Master Mix(Thermo Fisher ScientificTM)在96系统(Roche)上运行。
为了进行编辑验证,使用High-Fidelity 2X Master Mix(New EnglandBiolabs,NEB)使用下列反应设置制备每个靶基因(als1、als2、ppo2、aco1)的扩增子。根据制造商的说明书,使用PCR&DNA Cleanup Kit试剂盒(NEB)纯化PCR产物。随后通过Sanger测序验证扩增子。
表14-用于香蕉胚和芽基因分型的引物列表
表15-Sanger扩增子反应设置
表17-qPCR反应设置
qPCR Powerup MM 2X | 12.5μL |
FW/Rev引物混合液 | 1.25μL |
模板DNA | 4μL |
H20 | 7.25μL |
总计 | 25μL |
表18
qPCR引物: | |
Actin(标准化) | 277,278(G0390,G0391) |
NptII | G0339,G0340 |
nCas9前 | (G0418,G0419) |
nCas9后 | (G0448,G0449) |
表19
PBE#1和PBE#2胚基因分型数据(仅限ALS,无性状)
本发明人发现6-9%的胚是非转基因的并在ALS1中被编辑以及12-15%的胚是非转基因的并在ALS2中被编辑。超过99%的所有测序的CSF选择的样品都在ALS1或ALS2中被编辑。在本次基因分型实验中,本发明人仅全面评估25μg/LCSF。这些数据如图23所示。
PBE#1芽基因分型数据(仅ALS,无性状)
用含有编码nCas9系统并表达sgRNA的质粒的农杆菌菌株转化胚性细胞悬浮液(ECS)。农杆菌转化是根据Kanna et al.,Molecular Breeding October 2004,Volume 14,Issue 3,pp 239-252进行。将胚性细胞与农杆菌共培养一到三天,然后将其转移到含有CSF的胚发育培养基(EDM)中,然后转移到不含CSF的成熟培养基中(相关培养基可在以下中找到:例如Strosse H.,R.Domergue,B.Panis,J.V.Escalant和F.2003,Banana andplantain embryogenic cell suspensions(A.Vézina和C.Picq,编辑).INIBAP TechnicalGuidelines 8,The International Network for the Improvement of Banana andPlantain,Montpellier,France)。将成熟的胚在发芽培养基中发芽(Strosse H.,R.Domergue,B.Panis,J.V.Escalant和F.2003.Banana and plantain embryogeniccell suspensions(A.Vézina和C.Picq,编辑).INIBAP Technical Guidelines 8.TheInternational Network for the Improvement of Banana and Plantain,Montpellier,France)和将幼芽转移到芽成熟培养基中,直至高度约1厘米。将芽转移到生根培养基中以进行幼苗发育。
在36个芽中,21个(58%)是转基因的,其中100%在ALS1或ALS2中被编辑。1/36样品(3%)是非转基因的(无T-DNA整合),并且可能在ALS2的单个等位基因中被编辑。
表20
PBE#3胚基因分型数据(ALS和性状基因“共同编辑”)
本发明人发现在CSF选择后ACO1和PPO2中的高水平的共同编辑。本发明人在测试的376个胚中观察到最高达5%的非转基因的终止密码子编辑。
本发明人在来自该实验中使用的8个质粒中的每一个的胚的小样品中进行编辑频率的初步筛选。由此,本发明人确定pMOL_0926和pMOL_0931显示出最高的编辑效率中的一些。本发明人对用这两种质粒转化后再生的胚进行4x96孔板取样——来自25和50μg/L CSF选择(总计共4x96孔板)。
表21-胚中质粒编辑效率的初步筛选(%):
当通过G418或CSF选择时,性状基因(PPO2或ACO1)中的编辑百分比。(每个数据点n=4)。频率似乎是构建体特异性的,例如pMOL_0927仅在2/12个测试的胚中具有PPO2编辑,而pMOL_0926则产生了9/12个PPO2编辑的胚。sgRNA效率也有显著差异(数据未显示)。
表22-仅用pMOL0926或pMOL0931转化后再生的胚的大规模基因分型,使用25或50μg/L CSF选择。
PBE#3(胚)pMOL_926
CSF25:
·在94个样本中,11个(11.7%)是非转基因的(无T-DNA整合)。
·5%(5个非转基因胚)在PPO2中具有生成终止密码子的编辑。
·所有测序的转基因样品(7)在PPO2性状基因中在sg2961和sg2962处的都具有生成终止密码子的编辑。
CSF50:
·在94个样本中,24个(25.5%)是非转基因的(无T-DNA整合)
·3%(3个非转基因)在PPO2中具有生成终止密码子的编辑。
·测序的单个转基因样品在PPO2基因座中在sg2962处具有生成终止密码子的编辑。
PBE#3(胚)pMOL_931:
CSF25:
·在94个样本中,16个(17%)是非转基因的(无T-DNA整合)。
·1%(1个非转基因)在ACO1性状基因中在sg2967位点处具有生成终止密码子的编辑。
·测序的两个转基因样品在ACO1性状基因中在sg2967位点处具有生成终止密码子的编辑。
CSF50:
·在94个样本中,11个(11.7%)是非转基因的(无T-DNA整合)
·7个非转基因胚在ACO1性状基因中在向导处具有生成终止密码子的编辑
·测序的两个转基因样品在ACO1性状基因中在sg2967位点处具有生成终止密码子的编辑。
PBE#3芽基因分型数据(ALS和性状“共同编辑”)
2/34的样品(6%)是非转基因的(无T-DNA整合)。在两个非转基因芽中,一个仅在ALS2中被编辑,另一个在ALS2和PPO2中被编辑。因此,本发明人发现该实验中非转基因的性状编辑频率为3%。
尚未完成对pMOL0927和pMOL0930芽和胚的基因分型。
表23--用pMOL_0926-pMOL_0931转化后再生的香蕉芽的基因分型,使用25或50ug/ L CSF选择。
·从取样的34个芽中,发现100%在ALS1或ALS2中具有编辑。
·发现53%(17)在性状基因中具有生成终止密码子的编辑。
·2/34样品(6%)是非转基因的(无T-DNA整合)。
·在两个非转基因芽中,一个仅在ALS2中被编辑,一个在ALS2和PPO2中被编辑。
·在测试的前34个芽中,整体瞬时共同编辑频率为3%。
芽计数和健康数据
在此,本发明人报告从实验PBE#1(仅ALS)和PBE#3(ALS和性状“共同编辑”)中生成和收集的芽数量。本发明人报告因污染或其他原因死亡而丢失的植物样品(“cone”)的数量。
PBE#1芽再生频率(仅ALS)
对于ALS1和ALS2靶标(分别),本发明人观察到来自50μg/L CSF的芽数量(17和37)少于来自25μg/L CSF的芽数量(36和61)。总体上来自25和50μg/L CSF(分别)的ALS2质粒转化产生的芽数量(61和37)高于ALS1(36和17)。本发明人没有发现植物存活与编辑(基因)靶标或用于选择的CSF浓度之间的任何相关性。
表24-CSF选择后产生的芽数量
PBE#3芽再生频率(ALS和性状“共同编辑”)
与PBE#1相比,对于ALS1和ALS2靶标(分别),本发明人观察到来自50μg/L CSF的芽数量(148和328)高于来自25μg/L CSF的芽数量(55和140)。与PBE#1一致,总体上ALS2质粒转化产生的芽数量(140和328)高于ALS1(55和148),来自25和50μg/L CSF(分别)。本发明人没有发现植物死亡(cone loss)与基因靶或选择之间的任何相关性。
来自质粒和选择方法的芽输出具有高度可变的频率。
PBE#3中鉴定的不同植物基因分型的选择如图24所示。
1.A8-转基因:在PPO2和ALS1中编辑(pMol_926)
2.A9-转基因:无性状编辑。在ALS1中编辑(pMol_927)
3.H12-可疑的非转基因。在ACO1(在两个向导位点)和ALS2中编辑(pMol_931)
4.B11-非转基因。在ALS2中编辑(pMol_930)
表25-总结表:
表26-芽频率的完整表(本表中每个细胞3次ECS转化):
总之,本发明人在这些发育的早期阶段观察到多种芽表型。总体而言,本发明人观察到未经G418或CSF选择的再生芽在外观上是最健康的。总体而言,使用G418或CSF进行的胚选择的再生芽之间似乎没有显著的频率差异。关键的是,本发明人发现来自PBE#3的健康的芽,它们是转基因的并在ALS和性状基因中都被编辑。在来自PBE#3的三个非转基因的性状基因编辑的芽中,两个是健康植物,一个因污染而损失。
结论
这些数据证明,在用25或50μg/L CSF选择后,用构建体进行的转化每次转化产生>200个胚。值得注意地,这些胚在ALS基因座和性状基因中始终以高百分比被编辑。本发明人观察到,在所有实验中在胚和芽中,在CSF选择的样品中的非转基因率>10%。总体上在PBE1和PBE3实验中,非转基因的编辑的芽的频率在3-6%之间。
本发明人没有观察到可被归因于ALS或性状编辑本身的对芽健康的任何负面影响。本发明人具有在ALS1或ALS2以及性状基因中被编辑的健康芽的实例(非转基因的和转基因的两者)。
总体上,本发明人确实发现选择对芽发育的负面影响-本发明人在EDM阶段应用选择(CSF和G418两者),并发现相对于来自同一实验的非选择的对照芽,芽发育延迟。这可以通过在等到芽更有活力后再将它们移至“cone”来减轻。
实施例9-农杆菌转化和轰击的比较
比较使用农杆菌和粒子轰击的转化。数据如图25所示。
用农杆菌转化后,在CSF存在下没有具有阴性对照质粒的胚发育。从两种“抗性”质粒观察到在CSF的存在下数百个胚发育。在较高量的CSF(50μg/L)的存在下,发育的胚较少。
在通过粒子轰击转化后,在CSF存在下再次没有具有阴性对照质粒的胚发育。轰击的细胞对处理更敏感,因此使用较低浓度的CSF进行选择。从两种“抗性”质粒观察到在CSF的存在下仅少数胚发育(绿色框)。看来CSF 25μg/L对于选择编辑的轰击细胞来说太强。
根癌农杆菌制备
将携带二元质粒pMol_0628、pMol_0630和pMol_0632的根癌农杆菌AGL1在补充有rif 50mg/L、kan 100mg/L和carb 50mg/L的LB固体培养基中从甘油中划线并在28℃下孵育2天。第二天,将细菌接种到补充有相同抗生素的MG/L培养基中,并在28℃下孵育过夜。下一天,将培养物以4000rpm离心10分钟以沉淀细胞,然后将其重悬于ABIM中(农杆菌诱导培养基)以诱导毒力并在室温下在黑暗中孵育过夜。使用NanoDrop2000测量细菌的OD600,并在补充有乙酰丁香酮(100μM)的CS液体培养基中标准化至0.6。
香蕉ECS农杆菌转化(agrotransformation)
将2/3日龄细胞的香蕉ECS沉淀并重悬于CS培养基中。根据转化次数将0.5mL悬浮液(约0.1mL ECS)分开至小Eppendorf管中。将细胞在45℃下孵育5分钟,并且然后将它们再冷却5分钟。然后,将1mL农杆菌悬浮液添加到香蕉ECS中。将混合物混合,以1000rpm离心5分钟,然后在室温下孵育25分钟。将香蕉细胞合并,并将细菌悬浮液除去。将细胞在室温下接种于补充有100μM乙酰丁香酮的CS平板中3天以促进转化。
香蕉胚发育和发芽
3天后,从先前的培养基中收集香蕉ECS并转移至补充有头孢噻肟300mg/L的CS培养基中。一周后,将香蕉细胞转移至补充有cefo 300mg/L(对照),或cefo 300mg/L和CSF 25μg/L或cefo 300mg/L和CSF 50μg/L的EDM平板。将培养基每两周更新一次,直至胚发育。随后,将香蕉胚转移到不含CSF的成熟培养基(EMM)中一个月,然后将其转移到不含CSF的发芽培养基(GM4)中直至芽发育。
实施例10-使用ALS抑制剂、G418或潮霉素的选择的比较
本实施例比较了使用ALS抑制剂、G418或潮霉素进行选择时所获得的效率。已发现,使用ALS抑制剂的选择提高开发/鉴定非转基因的性状编辑的香蕉样品的效率。
下表27显示在从单个香蕉胚或叶材料中提取的gDNA中检测到的编辑事件的频率。在每种情况下,再生材料是用含有T-DNA的构建体转化,所述构建体会在阳性转化株中产生抗性。
表27
本发明人观察到,与使用G418或潮霉素(64-71%)的稳定选择相比,CSF选择是本发明人用来产生香蕉转基因株系的最有效方法(75-84%)。在CSF选择的材料中观察到的编辑频率为90-94%,而本发明人在使用G418或潮霉素的稳定选择中仅观察到43-61%的编辑频率。
实施例11-用于经编辑的胚的ALS抑制剂处理的优化
在ECS再生中使用浓度为0、25、50和100μg/L的CSF获得数据(3个月处理)。
表28
CSF浓度 | 选择后恢复的WT胚的频率 | 选择后恢复的ALS编辑胚的频率 |
0 | 30000±2000 | 30000±2000 |
25 | 0 | 1500±500 |
50 | 0 | 750±250 |
100 | 0 | 0 |
其他序列
SEQ ID NO:43
ACS-小果野蕉
>Ma04_g31490.1
SEQ ID NO:44
ACS-小果野蕉
>Ma04_g35640.1
SEQ ID NO:45
ACS-小果野蕉
>Ma09_g19150.1
SEQ ID NO:46
ACO1-小果野蕉
>Ma01_g11540.1
SEQ ID NO:47
ACO-小果野蕉
>Ma07_g19730.1
SEQ ID NO:48
PPO2-小果野蕉
>Ma07_g03540
序列表
<110> TROPIC BIOSCIENCES UK LIMITED
<120> 编辑香蕉基因的方法
<130> N420173WO
<150> GB 2109586.4
<151> 2021-07-02
<160> 59
<170> PatentIn version 3.5
<210> 1
<211> 659
<212> PRT
<213> 马铃薯(Solanum tuberosum)
<400> 1
Met Ala Ala Ala Ala Ser Pro Ser Pro Cys Phe Ser Lys Thr Leu Pro
1 5 10 15
Pro Ser Ser Ser Lys Ser Ser Thr Ile Leu Pro Arg Ser Thr Phe Pro
20 25 30
Phe His Asn His Pro Gln Lys Ala Ser Pro Leu His Leu Thr His Thr
35 40 45
His His His Arg Arg Gly Phe Ala Val Ser Asn Val Val Ile Ser Thr
50 55 60
Thr Thr His Asn Asp Val Ser Glu Pro Glu Thr Phe Val Ser Arg Phe
65 70 75 80
Ala Pro Asp Glu Pro Arg Lys Gly Cys Asp Val Leu Val Glu Ala Leu
85 90 95
Glu Arg Glu Gly Val Thr Asp Val Phe Ala Tyr Pro Gly Gly Ala Ser
100 105 110
Met Glu Ile His Gln Ala Leu Thr Arg Ser Asn Ile Ile Arg Asn Val
115 120 125
Leu Pro Arg His Glu Gln Gly Gly Val Phe Ala Ala Glu Gly Tyr Ala
130 135 140
Arg Ala Thr Gly Phe Pro Gly Val Cys Ile Ala Thr Ser Gly Pro Gly
145 150 155 160
Ala Thr Asn Leu Val Ser Gly Leu Ala Asp Ala Leu Leu Asp Ser Ile
165 170 175
Pro Ile Val Ala Ile Thr Gly Gln Val Pro Arg Arg Met Ile Gly Thr
180 185 190
Asp Ala Phe Gln Glu Thr Pro Ile Val Glu Val Thr Arg Ser Ile Thr
195 200 205
Lys His Asn Tyr Leu Val Met Asp Val Glu Asp Ile Pro Arg Val Val
210 215 220
Arg Glu Ala Phe Phe Leu Ala Lys Ser Gly Arg Pro Gly Pro Val Leu
225 230 235 240
Ile Asp Val Pro Lys Asp Ile Gln Gln Gln Leu Val Ile Pro Asn Trp
245 250 255
Asp Gln Pro Met Arg Leu Pro Gly Tyr Met Ser Arg Leu Pro Lys Leu
260 265 270
Pro Asn Glu Met Leu Leu Glu Gln Ile Ile Arg Leu Ile Ser Glu Ser
275 280 285
Lys Lys Pro Val Leu Tyr Val Gly Gly Gly Cys Leu Gln Ser Ser Glu
290 295 300
Glu Leu Arg Arg Phe Val Glu Leu Thr Gly Ile Pro Val Ala Ser Thr
305 310 315 320
Leu Met Gly Leu Gly Ala Phe Pro Thr Gly Asp Glu Leu Ser Leu Gln
325 330 335
Met Leu Gly Met His Gly Thr Val Tyr Ala Asn Tyr Ala Val Asp Gly
340 345 350
Ser Asp Leu Leu Leu Ala Phe Gly Val Arg Phe Asp Asp Arg Val Thr
355 360 365
Gly Lys Leu Glu Ala Phe Ala Ser Arg Ala Lys Ile Val His Ile Asp
370 375 380
Ile Asp Ser Ala Glu Ile Gly Lys Asn Lys Gln Pro His Val Ser Ile
385 390 395 400
Cys Ala Asp Ile Lys Leu Ala Leu Gln Gly Leu Asn Ser Ile Leu Glu
405 410 415
Gly Lys Glu Gly Lys Leu Lys Leu Asp Phe Ser Ala Trp Arg Gln Glu
420 425 430
Leu Thr Glu Gln Lys Val Lys Tyr Pro Leu Ser Phe Lys Thr Phe Gly
435 440 445
Glu Ala Ile Pro Pro Gln Tyr Ala Ile Gln Val Leu Asp Glu Leu Thr
450 455 460
Asn Gly Asn Ala Ile Ile Ser Thr Gly Val Gly Gln His Gln Met Trp
465 470 475 480
Ala Ala Gln Tyr Tyr Lys Tyr Lys Lys Pro Arg Gln Trp Leu Thr Ser
485 490 495
Gly Gly Leu Gly Ala Met Gly Phe Gly Leu Pro Ala Ala Ile Gly Ala
500 505 510
Ala Val Gly Arg Pro Gly Glu Ile Val Val Asp Ile Asp Gly Asp Gly
515 520 525
Ser Phe Ile Met Asn Val Gln Glu Leu Ala Thr Ile Lys Val Glu Asn
530 535 540
Leu Pro Val Lys Ile Met Leu Leu Asn Asn Gln His Leu Gly Met Val
545 550 555 560
Val Gln Trp Glu Asp Arg Phe Tyr Lys Ala Asn Arg Ala His Thr Tyr
565 570 575
Leu Gly Asp Pro Ala Asn Glu Glu Glu Ile Phe Pro Asn Met Leu Lys
580 585 590
Phe Ala Glu Ala Cys Gly Val Pro Ala Ala Arg Val Ser His Arg Asp
595 600 605
Asp Leu Arg Ala Ala Ile Gln Lys Met Leu Asp Thr Pro Gly Pro Tyr
610 615 620
Leu Leu Asp Val Ile Val Pro His Gln Glu His Val Leu Pro Met Ile
625 630 635 640
Pro Ser Gly Gly Ala Phe Lys Asp Val Ile Thr Glu Gly Asp Gly Arg
645 650 655
Arg Ser Tyr
<210> 2
<211> 657
<212> PRT
<213> 马铃薯
<400> 2
Met Ala Ala Ala Ser Pro Ser Pro Cys Phe Ser Lys Asn Leu Pro Pro
1 5 10 15
Ser Ser Ser Lys Ser Ser Ile Leu Leu Pro Lys Ser Thr Phe Thr Phe
20 25 30
His Asn His Pro Lys Asn Thr Ser Pro Leu His Leu Thr His Thr Gln
35 40 45
His His Ser Arg Phe Thr Val Ser Asn Val Ile Leu Ser Thr Thr Thr
50 55 60
His Asn Asp Val Ser Glu Pro Glu Ile Phe Val Ser Arg Phe Ala Pro
65 70 75 80
Asp Glu Pro Arg Lys Gly Cys Asp Val Leu Val Glu Ala Leu Glu Arg
85 90 95
Glu Gly Val Lys Asp Val Phe Ala Tyr Pro Gly Gly Ala Ser Met Glu
100 105 110
Ile His Gln Ala Leu Thr Arg Ser Asn Ile Ile Arg Asn Val Leu Pro
115 120 125
Arg His Glu Gln Gly Gly Val Phe Ala Ala Glu Gly Tyr Ala Arg Ala
130 135 140
Thr Gly Phe Pro Gly Val Cys Ile Ala Thr Ser Gly Pro Gly Ala Thr
145 150 155 160
Asn Leu Val Ser Gly Leu Ala Asp Ala Leu Leu Asp Ser Ile Pro Ile
165 170 175
Val Ala Ile Thr Gly Gln Val Pro Arg Arg Met Ile Gly Thr Asp Ala
180 185 190
Phe Gln Glu Thr Pro Ile Val Glu Val Thr Arg Ser Ile Thr Lys His
195 200 205
Asn Tyr Leu Val Met Asp Val Glu Asp Ile Pro Arg Val Val Arg Glu
210 215 220
Ala Phe Phe Leu Ala Lys Ser Gly Arg Pro Gly Pro Val Leu Ile Asp
225 230 235 240
Val Pro Lys Asp Ile Gln Gln Gln Leu Val Ile Pro Asn Trp Asp Gln
245 250 255
Pro Met Arg Leu Pro Gly Tyr Ile Ser Arg Leu Pro Lys Leu Pro Asn
260 265 270
Glu Met Leu Leu Glu Gln Ile Val Arg Leu Ile Ser Glu Ser Lys Lys
275 280 285
Pro Val Leu Tyr Val Gly Gly Gly Cys Thr Gln Ser Ser Glu Glu Leu
290 295 300
Arg Arg Phe Val Glu Leu Thr Gly Ile Pro Val Ala Ser Thr Leu Met
305 310 315 320
Gly Leu Gly Thr Phe Pro Cys Gly Asp Glu Leu Ser Leu Gln Met Leu
325 330 335
Gly Met His Gly Thr Val Tyr Ala Asn Tyr Ala Val Asp Ser Ser Asp
340 345 350
Leu Leu Leu Ala Phe Gly Val Arg Phe Asp Asp Arg Val Thr Gly Lys
355 360 365
Leu Glu Ala Phe Ala Ser Arg Ala Lys Ile Val His Ile Asp Ile Asp
370 375 380
Ser Ala Glu Ile Gly Lys Asn Lys Gln Pro His Val Ser Ile Cys Ala
385 390 395 400
Asp Ile Lys Leu Ala Leu Gln Gly Leu Asn Ser Ile Leu Glu Gly Lys
405 410 415
Glu Gly Lys Leu Lys Leu Asp Phe Ser Ala Trp Arg Gln Glu Leu Thr
420 425 430
Glu Gln Lys Val Lys Tyr Pro Leu Asn Tyr Lys Thr Phe Gly Glu Ala
435 440 445
Ile Pro Pro Gln Tyr Ala Ile Gln Val Leu Asp Glu Leu Thr Asn Gly
450 455 460
Asn Ala Ile Ile Ser Thr Gly Val Gly Gln His Gln Met Trp Ala Ala
465 470 475 480
Gln Tyr Tyr Lys Tyr Lys Lys Pro Arg Gln Trp Leu Thr Ser Gly Gly
485 490 495
Leu Gly Ala Met Gly Phe Gly Leu Pro Ala Ala Ile Gly Ala Ala Val
500 505 510
Gly Arg Pro Gly Glu Ile Val Val Asp Ile Asp Gly Asp Gly Ser Phe
515 520 525
Ile Met Asn Val Gln Glu Leu Ala Thr Ile Lys Val Glu Asn Leu Pro
530 535 540
Val Lys Ile Met Leu Leu Asn Asn Gln His Leu Gly Met Val Val Gln
545 550 555 560
Trp Glu Asp Arg Phe Tyr Lys Ala Asn Arg Ala His Thr Tyr Leu Gly
565 570 575
Asn Pro Ala Asn Glu Glu Glu Ile Phe Pro Asn Met Leu Lys Phe Ala
580 585 590
Glu Ala Cys Gly Val Pro Ala Ala Arg Val Ser His Arg Asp Asp Leu
595 600 605
Arg Ala Ala Ile Gln Lys Met Leu Asp Thr Pro Gly Pro Tyr Leu Leu
610 615 620
Asp Val Ile Val Pro His Gln Glu His Val Leu Pro Met Ile Pro Ser
625 630 635 640
Gly Gly Ala Phe Lys Asp Val Ile Thr Glu Gly Asp Gly Arg Arg Ser
645 650 655
Tyr
<210> 3
<211> 588
<212> PRT
<213> 马铃薯
<400> 3
Met Ala Ala Ala Ser Pro Ser Pro Cys Phe Ser Lys Asn Leu Pro Pro
1 5 10 15
Ser Ser Ser Lys Ser Ser Ile Leu Leu Pro Lys Ser Thr Phe Thr Phe
20 25 30
His Asn His Pro Lys Asn Thr Ser Pro Leu His Leu Thr His Thr Gln
35 40 45
His His Ser Arg Phe Thr Val Ser Asn Val Ile Leu Ser Thr Thr Thr
50 55 60
His Asn Asp Val Ser Glu Pro Glu Ile Phe Val Ser Arg Phe Ala Pro
65 70 75 80
Asp Glu Pro Arg Lys Gly Cys Asp Val Leu Val Glu Ala Leu Glu Arg
85 90 95
Glu Gly Val Lys Asp Val Phe Ala Tyr Pro Gly Gly Ala Ser Met Glu
100 105 110
Ile His Gln Ala Leu Thr Arg Ser Asn Ile Ile Arg Asn Val Leu Pro
115 120 125
Arg His Glu Gln Gly Gly Val Phe Ala Ala Glu Gly Tyr Ala Arg Ala
130 135 140
Thr Gly Phe Pro Gly Val Cys Ile Ala Thr Ser Gly Pro Gly Ala Thr
145 150 155 160
Asn Leu Val Ser Gly Leu Ala Asp Ala Leu Leu Asp Ser Ile Pro Ile
165 170 175
Val Ala Ile Thr Gly Gln Val Pro Arg Arg Met Ile Gly Thr Asp Ala
180 185 190
Phe Gln Glu Thr Pro Ile Val Glu Val Thr Arg Ser Ile Thr Lys His
195 200 205
Asn Tyr Leu Val Met Asp Val Glu Asp Ile Pro Arg Val Val Arg Glu
210 215 220
Ala Phe Phe Leu Ala Lys Ser Gly Arg Pro Gly Pro Val Leu Ile Asp
225 230 235 240
Val Pro Lys Asp Ile Gln Gln Gln Leu Val Ile Pro Asn Trp Asp Gln
245 250 255
Pro Met Arg Leu Pro Gly Tyr Ile Ser Arg Leu Pro Lys Leu Pro Asn
260 265 270
Glu Met Leu Leu Glu Gln Ile Val Arg Leu Ile Ser Glu Ser Lys Lys
275 280 285
Pro Val Leu Tyr Val Gly Gly Gly Cys Thr Gln Ser Ser Glu Glu Leu
290 295 300
Arg Arg Phe Val Glu Leu Thr Gly Ile Pro Val Ala Ser Thr Leu Met
305 310 315 320
Gly Leu Gly Thr Phe Pro Cys Gly Asp Glu Leu Ser Leu Gln Met Leu
325 330 335
Gly Met His Gly Thr Val Tyr Ala Asn Tyr Ala Val Asp Ser Ser Asp
340 345 350
Leu Leu Leu Ala Phe Gly Val Arg Phe Asp Asp Arg Val Thr Gly Lys
355 360 365
Leu Glu Ala Phe Ala Ser Arg Ala Lys Ile Val His Ile Asp Ile Asp
370 375 380
Ser Ala Glu Ile Gly Lys Asn Lys Gln Pro His Val Ser Ile Cys Ala
385 390 395 400
Asp Ile Lys Leu Ala Leu Gln Gly Leu Asn Ser Ile Leu Glu Gly Lys
405 410 415
Glu Gly Lys Leu Lys Leu Asp Phe Ser Ala Trp Arg Gln Glu Leu Thr
420 425 430
Glu Gln Lys Val Lys Tyr Pro Leu Asn Tyr Lys Thr Phe Gly Glu Ala
435 440 445
Ile Pro Pro Gln Tyr Ala Ile Gln Val Leu Asp Glu Leu Thr Asn Gly
450 455 460
Asn Ala Ile Ile Ser Thr Gly Val Gly Gln His Gln Met Trp Ala Ala
465 470 475 480
Gln Tyr Tyr Lys Tyr Lys Lys Pro Arg Gln Trp Leu Thr Ser Gly Gly
485 490 495
Leu Gly Ala Met Gly Phe Gly Leu Pro Ala Ala Ile Gly Ala Ala Val
500 505 510
Gly Arg Pro Gly Glu Ile Val Val Asp Ile Asp Glu Ala Cys Gly Val
515 520 525
Pro Ala Ala Arg Val Ser His Arg Asp Asp Leu Arg Ala Ala Ile Gln
530 535 540
Lys Met Leu Asp Thr Pro Gly Pro Tyr Leu Leu Asp Val Ile Val Pro
545 550 555 560
His Gln Glu His Val Leu Pro Met Ile Pro Ser Gly Gly Ala Phe Lys
565 570 575
Asp Val Ile Thr Glu Gly Asp Gly Arg Arg Ser Tyr
580 585
<210> 4
<211> 659
<212> PRT
<213> 澳洲茄(Solanum laciniatum)
<400> 4
Met Ala Ala Ala Ala Ser Pro Ser Pro Cys Phe Ser Lys Thr Leu Pro
1 5 10 15
Pro Ser Ser Ser Lys Ser Ser Thr Ile Leu Pro Arg Ser Thr Phe Ser
20 25 30
Phe His Asn His Pro Gln Lys Ala Ser Pro Leu His Leu Ile His Ala
35 40 45
Gln His Asn Arg Arg Gly Phe Ala Val Ala Asn Val Val Ile Ser Thr
50 55 60
Thr Thr His Asn Asp Val Ser Glu Pro Glu Thr Phe Val Ser Arg Phe
65 70 75 80
Ala Pro Asp Glu Pro Arg Lys Gly Cys Asp Val Leu Val Glu Ala Leu
85 90 95
Glu Arg Glu Gly Val Thr Asp Val Phe Ala Tyr Pro Gly Gly Ala Ser
100 105 110
Met Glu Ile His Gln Ala Leu Thr Arg Ser Asn Ile Ile Arg Asn Val
115 120 125
Leu Pro Arg His Glu Gln Gly Gly Val Phe Ala Ala Glu Gly Tyr Ala
130 135 140
Arg Ala Thr Gly Phe Pro Gly Val Cys Ile Ala Thr Ser Gly Pro Gly
145 150 155 160
Ala Thr Asn Leu Val Ser Gly Leu Ala Asp Ala Leu Leu Asp Ser Ile
165 170 175
Pro Ile Val Ala Ile Thr Gly Gln Val Pro Arg Arg Met Ile Gly Thr
180 185 190
Asp Ala Phe Gln Glu Thr Pro Ile Val Glu Val Thr Arg Ser Ile Thr
195 200 205
Lys His Asn Tyr Leu Val Met Asp Val Glu Asp Ile Pro Arg Val Val
210 215 220
Arg Glu Ala Phe Phe Leu Ala Lys Ser Gly Arg Pro Gly Pro Val Leu
225 230 235 240
Ile Asp Val Pro Lys Asp Ile Gln Gln Gln Leu Val Ile Pro Asn Trp
245 250 255
Asp Gln Pro Met Arg Leu Pro Gly Tyr Met Ser Arg Leu Pro Lys Leu
260 265 270
Pro Asn Glu Met Leu Leu Glu Gln Ile Val Arg Leu Ile Ser Glu Ser
275 280 285
Lys Lys Pro Val Leu Tyr Val Gly Gly Gly Cys Ser Gln Ser Ser Glu
290 295 300
Glu Leu Arg Arg Phe Val Glu Leu Thr Gly Ile Pro Val Ala Ser Thr
305 310 315 320
Leu Met Gly Leu Gly Ala Phe Pro Thr Gly Asp Glu Leu Ser Leu Gln
325 330 335
Met Leu Gly Met His Gly Thr Val Tyr Ala Asn Tyr Ala Val Asp Ser
340 345 350
Ser Asp Leu Leu Leu Ala Phe Gly Val Arg Phe Asp Asp Arg Val Thr
355 360 365
Gly Lys Leu Glu Ala Phe Ala Ser Arg Ala Lys Ile Val His Ile Asp
370 375 380
Ile Asp Ser Ala Glu Ile Gly Lys Asn Lys Gln Pro His Val Ser Ile
385 390 395 400
Cys Ala Asp Ile Lys Leu Ala Leu Gln Gly Leu Asn Ser Ile Leu Glu
405 410 415
Gly Lys Glu Gly Lys Met Lys Leu Asp Phe Ser Ala Trp Arg Gln Glu
420 425 430
Leu Thr Glu Gln Lys Met Lys Tyr Pro Leu Asn Phe Lys Thr Phe Gly
435 440 445
Asp Ala Ile Pro Pro Gln Tyr Ala Ile Gln Val Leu Asp Glu Leu Thr
450 455 460
Asn Gly Asn Ala Ile Ile Ser Thr Gly Val Gly Gln His Gln Met Trp
465 470 475 480
Ala Ala Gln Tyr Tyr Lys Tyr Lys Lys Pro Arg Gln Trp Leu Thr Ser
485 490 495
Gly Gly Leu Gly Ala Met Gly Phe Gly Leu Pro Ala Ala Ile Gly Ala
500 505 510
Ala Val Gly Arg Pro Gly Glu Ile Val Val Asp Ile Asp Gly Asp Gly
515 520 525
Ser Phe Ile Met Asn Val Gln Glu Leu Ala Thr Ile Lys Val Glu Asn
530 535 540
Leu Pro Val Lys Ile Met Leu Leu Asn Asn Gln His Leu Gly Met Val
545 550 555 560
Val Gln Trp Glu Asp Arg Phe Tyr Lys Ala Asn Arg Ala His Thr Tyr
565 570 575
Leu Gly Asp Pro Ser Asn Glu Glu Glu Ile Phe Pro Asn Met Leu Lys
580 585 590
Phe Ala Glu Ala Cys Gly Val Pro Ala Ala Arg Val Ser His Arg Asp
595 600 605
Asp Leu Arg Ala Ala Ile Gln Lys Met Leu Asp Thr Pro Gly Pro Tyr
610 615 620
Leu Leu Asp Val Ile Val Pro His Gln Glu His Val Leu Pro Met Ile
625 630 635 640
Pro Ser Gly Gly Ala Phe Lys Asp Val Ile Thr Glu Gly Asp Gly Arg
645 650 655
Cys Ser Tyr
<210> 5
<211> 657
<212> PRT
<213> 澳洲茄
<400> 5
Met Ala Ala Ala Ser Pro Ser Pro Cys Phe Ser Lys Thr Leu Pro Pro
1 5 10 15
Ser Ser Ser Lys Ser Ser Thr Leu Leu Pro Lys Ser Thr Phe Thr Phe
20 25 30
His Asn His Pro Lys Lys Ala Ser Pro Leu His Leu Thr His Thr Gln
35 40 45
His His Ser Arg Phe Thr Val Ser Asn Val Ile Leu Ser Thr Thr Thr
50 55 60
His Asp Asp Val Ser Glu Pro Glu Ile Phe Val Ser Arg Phe Ala Pro
65 70 75 80
Asp Glu Pro Arg Lys Gly Cys Asp Val Leu Val Glu Ala Leu Glu Arg
85 90 95
Glu Gly Val Lys Asp Val Phe Ala Tyr Pro Gly Gly Ala Ser Met Glu
100 105 110
Ile His Gln Ala Leu Thr Arg Ser Asn Ile Ile Arg Asn Val Leu Pro
115 120 125
Arg His Glu Gln Gly Gly Val Phe Ala Ala Glu Gly Tyr Ala Arg Ala
130 135 140
Thr Gly Phe Pro Gly Val Cys Ile Ala Thr Ser Gly Pro Gly Ala Thr
145 150 155 160
Asn Leu Val Ser Gly Leu Ala Asp Ala Leu Leu Asp Ser Ile Pro Ile
165 170 175
Val Ala Ile Thr Gly Gln Val Pro Arg Arg Met Ile Gly Thr Asp Ala
180 185 190
Phe Gln Glu Thr Pro Ile Val Glu Val Thr Arg Ser Ile Thr Lys His
195 200 205
Asn Tyr Leu Val Met Asp Val Glu Asp Ile Pro Arg Val Val Arg Glu
210 215 220
Ala Phe Phe Leu Ala Lys Ser Gly Arg Pro Gly Pro Val Leu Ile Asp
225 230 235 240
Val Pro Lys Asp Ile Gln Gln Gln Leu Val Ile Pro Asn Trp Asp Gln
245 250 255
Pro Met Arg Leu Pro Gly Tyr Met Ser Arg Leu Pro Lys Leu Pro Asn
260 265 270
Glu Met Leu Leu Glu Gln Ile Val Arg Leu Ile Ser Glu Ser Lys Lys
275 280 285
Pro Val Leu Tyr Val Gly Gly Gly Cys Ser Gln Ser Ser Glu Glu Leu
290 295 300
Arg Arg Phe Val Glu Leu Thr Gly Ile Pro Val Ala Ser Thr Leu Met
305 310 315 320
Gly Leu Gly Ala Phe Pro Ser Gly Asp Glu Leu Ser Leu Gln Met Leu
325 330 335
Gly Met His Gly Thr Val Tyr Ala Asn Tyr Ala Val Asp Ser Ser Asp
340 345 350
Leu Leu Leu Ala Phe Gly Val Arg Phe Asp Asp Arg Val Thr Gly Lys
355 360 365
Leu Glu Ala Phe Ala Ser Arg Ala Lys Ile Val His Ile Asp Ile Asp
370 375 380
Ser Ala Glu Ile Gly Lys Asn Lys Gln Pro His Val Ser Ile Cys Ala
385 390 395 400
Asp Ile Lys Leu Ala Leu Gln Gly Leu Asn Ser Ile Phe Glu Ser Lys
405 410 415
Lys Gly Lys Leu Lys Leu Asp Phe Ser Ala Trp Arg Gln Glu Leu Thr
420 425 430
Glu Gln Lys Val Lys Tyr Pro Leu Asn Phe Lys Thr Phe Gly Glu Ala
435 440 445
Ile Pro Pro Gln Tyr Ala Ile Gln Val Leu Asp Glu Leu Thr Asn Gly
450 455 460
Asn Ala Ile Ile Ser Thr Gly Val Gly Gln His Gln Met Trp Ala Ala
465 470 475 480
Gln His Tyr Lys Tyr Lys Lys Pro Arg Gln Trp Leu Thr Ser Gly Gly
485 490 495
Leu Gly Ala Met Gly Phe Gly Leu Pro Ala Ala Ile Gly Ala Ala Val
500 505 510
Gly Arg Pro Gly Glu Ile Val Val Asp Ile Asp Gly Asp Gly Ser Phe
515 520 525
Ile Met Asn Val Gln Glu Leu Ala Thr Ile Lys Val Glu Asn Leu Pro
530 535 540
Val Lys Ile Met Leu Leu Asn Asn Gln His Leu Gly Met Val Val Gln
545 550 555 560
Trp Glu Asp Arg Phe Tyr Lys Ala Asn Arg Ala His Thr Tyr Leu Gly
565 570 575
Asn Pro Ala Asn Glu Glu Glu Ile Phe Pro Asn Met Leu Lys Phe Ala
580 585 590
Glu Ala Cys Gly Val Pro Ala Ala Arg Val Ser His Arg Asp Asp Leu
595 600 605
Arg Ala Ala Ile Gln Lys Met Leu Asp Thr Pro Gly Pro Tyr Leu Leu
610 615 620
Asp Val Ile Val Pro His Gln Glu His Val Leu Pro Met Ile Pro Ser
625 630 635 640
Gly Gly Ala Phe Lys Asp Val Ile Thr Glu Gly Asp Gly Arg Arg Ser
645 650 655
Tyr
<210> 6
<211> 670
<212> PRT
<213> 拟南芥(Arabidopsis thaliana)
<400> 6
Met Ala Ala Ala Thr Thr Thr Thr Thr Thr Ser Ser Ser Ile Ser Phe
1 5 10 15
Ser Thr Lys Pro Ser Pro Ser Ser Ser Lys Ser Pro Leu Pro Ile Ser
20 25 30
Arg Phe Ser Leu Pro Phe Ser Leu Asn Pro Asn Lys Ser Ser Ser Ser
35 40 45
Ser Arg Arg Arg Gly Ile Lys Ser Ser Ser Pro Ser Ser Ile Ser Ala
50 55 60
Val Leu Asn Thr Thr Thr Asn Val Thr Thr Thr Pro Ser Pro Thr Lys
65 70 75 80
Pro Thr Lys Pro Glu Thr Phe Ile Ser Arg Phe Ala Pro Asp Gln Pro
85 90 95
Arg Lys Gly Ala Asp Ile Leu Val Glu Ala Leu Glu Arg Gln Gly Val
100 105 110
Glu Thr Val Phe Ala Tyr Pro Gly Gly Ala Ser Met Glu Ile His Gln
115 120 125
Ala Leu Thr Arg Ser Ser Ser Ile Arg Asn Val Leu Pro Arg His Glu
130 135 140
Gln Gly Gly Val Phe Ala Ala Glu Gly Tyr Ala Arg Ser Ser Gly Lys
145 150 155 160
Pro Gly Ile Cys Ile Ala Thr Ser Gly Pro Gly Ala Thr Asn Leu Val
165 170 175
Ser Gly Leu Ala Asp Ala Leu Leu Asp Ser Val Pro Leu Val Ala Ile
180 185 190
Thr Gly Gln Val Pro Arg Arg Met Ile Gly Thr Asp Ala Phe Gln Glu
195 200 205
Thr Pro Ile Val Glu Val Thr Arg Ser Ile Thr Lys His Asn Tyr Leu
210 215 220
Val Met Asp Val Glu Asp Ile Pro Arg Ile Ile Glu Glu Ala Phe Phe
225 230 235 240
Leu Ala Thr Ser Gly Arg Pro Gly Pro Val Leu Val Asp Val Pro Lys
245 250 255
Asp Ile Gln Gln Gln Leu Ala Ile Pro Asn Trp Glu Gln Ala Met Arg
260 265 270
Leu Pro Gly Tyr Met Ser Arg Met Pro Lys Pro Pro Glu Asp Ser His
275 280 285
Leu Glu Gln Ile Val Arg Leu Ile Ser Glu Ser Lys Lys Pro Val Leu
290 295 300
Tyr Val Gly Gly Gly Cys Leu Asn Ser Ser Asp Glu Leu Gly Arg Phe
305 310 315 320
Val Glu Leu Thr Gly Ile Pro Val Ala Ser Thr Leu Met Gly Leu Gly
325 330 335
Ser Tyr Pro Cys Asp Asp Glu Leu Ser Leu His Met Leu Gly Met His
340 345 350
Gly Thr Val Tyr Ala Asn Tyr Ala Val Glu His Ser Asp Leu Leu Leu
355 360 365
Ala Phe Gly Val Arg Phe Asp Asp Arg Val Thr Gly Lys Leu Glu Ala
370 375 380
Phe Ala Ser Arg Ala Lys Ile Val His Ile Asp Ile Asp Ser Ala Glu
385 390 395 400
Ile Gly Lys Asn Lys Thr Pro His Val Ser Val Cys Gly Asp Val Lys
405 410 415
Leu Ala Leu Gln Gly Met Asn Lys Val Leu Glu Asn Arg Ala Glu Glu
420 425 430
Leu Lys Leu Asp Phe Gly Val Trp Arg Asn Glu Leu Asn Val Gln Lys
435 440 445
Gln Lys Phe Pro Leu Ser Phe Lys Thr Phe Gly Glu Ala Ile Pro Pro
450 455 460
Gln Tyr Ala Ile Lys Val Leu Asp Glu Leu Thr Asp Gly Lys Ala Ile
465 470 475 480
Ile Ser Thr Gly Val Gly Gln His Gln Met Trp Ala Ala Gln Phe Tyr
485 490 495
Asn Tyr Lys Lys Pro Arg Gln Trp Leu Ser Ser Gly Gly Leu Gly Ala
500 505 510
Met Gly Phe Gly Leu Pro Ala Ala Ile Gly Ala Ser Val Ala Asn Pro
515 520 525
Asp Ala Ile Val Val Asp Ile Asp Gly Asp Gly Ser Phe Ile Met Asn
530 535 540
Val Gln Glu Leu Ala Thr Ile Arg Val Glu Asn Leu Pro Val Lys Val
545 550 555 560
Leu Leu Leu Asn Asn Gln His Leu Gly Met Val Met Gln Trp Glu Asp
565 570 575
Arg Phe Tyr Lys Ala Asn Arg Ala His Thr Phe Leu Gly Asp Pro Ala
580 585 590
Gln Glu Asp Glu Ile Phe Pro Asn Met Leu Leu Phe Ala Ala Ala Cys
595 600 605
Gly Ile Pro Ala Ala Arg Val Thr Lys Lys Ala Asp Leu Arg Glu Ala
610 615 620
Ile Gln Thr Met Leu Asp Thr Pro Gly Pro Tyr Leu Leu Asp Val Ile
625 630 635 640
Cys Pro His Gln Glu His Val Leu Pro Met Ile Pro Ser Gly Gly Thr
645 650 655
Phe Asn Asp Val Ile Thr Glu Gly Asp Gly Arg Ile Lys Tyr
660 665 670
<210> 7
<211> 2351
<212> DNA
<213> 小果野蕉(Musa acuminata)
<400> 7
cagattaaaa aacccctttt tttcgtcccc caaatggctt cctctgcggc ggcggcggtg 60
gctgctgcgg gagccgtgat caccccatcg aagtcatccc acccgccctt ctccgctgcc 120
taccgcctcc ccttccccct ccccaagccc ctccattcac tctccacccg ccacccccac 180
gtccggccca tctccgcctc cgcggaccgc cgccagctcc cctccgctgc cgcccccact 240
gccgatgcca cgaccgcccc tctcctccgt aacttcgccc cggatgagcc ccgcaaaggc 300
gccgacatcc tagtcgaggc ccttgagcgg gagggcgtca ccgacctttt cgcctacccc 360
ggcggcgcct ccatggagat ccatcaggct ctcacccgct cgccatctat caccaaccac 420
ctcctccggc acgagcaggg ggaggtcttc gctgcctctg gatacgcccg ctccactggt 480
cgccctggcg tgtgcatcgc cacctccggc cctggcgcca ccaatctcgt ctccggtctc 540
gccgatgccc tccttgactc cgtccctctc gtcgccatca ccggccaggt ccctcgccgc 600
atgatcggca ccgacgcttt ccaggagacc cccatcgttg aggtcaccag atccatcacc 660
aagcacaact acctggtcct caacgttgat gacattcccc gcatcattaa agaagccttc 720
ttcgtagcca ccacgggccg ccctggcccg gtgctcgttg acatccccaa ggacatccag 780
cagcagcttg ccatccctgt atgggaccca ccgctccgcc ttcccggcta catctcccgc 840
ctccccaggc ttcctgcacg ccctctgctc gatcagatcc tccgcctggt gtcggaatcc 900
cgtcgcccag tcctctatgt tggtggtggc tgcttgaact ctagcgagga gcttcatcgg 960
tttgcagacc ttactggcat ccccattgca agtaccctaa tgggtctcgg ggcctatccc 1020
accgatgctg agctatcatt gaagatgttg ggaatgcatg ggacggtcta tgccaactat 1080
gccatcgata aagctgatct gctgcttgca tttggtgtaa ggtttgatga ccgtgtgact 1140
ggaaagattg aagcttttgc tagcagagca aagattgtgc acattgacat tgatccggct 1200
gagatcggga agaacaagca gccacacgtt tcaatctgtg ctgatgtgaa gttggcattg 1260
cagggaatga atgcattgat ggaggagact gagatttatc gaaagtttga cttttcgacc 1320
tggagagaag aactggacaa gcagaagaag atatacccat taaactataa gactttcggg 1380
gatcagattc ctccccagta tgcaattcag gtgcttgatg agctaacgaa tggggaggca 1440
atcattacta ctggtgttgg gcagcaccag atgtgggccg cacaatatta ctcatacaag 1500
agggcacggc agtggctgac atctgccgga ctaggtgcta tgggtttcgg tttaccagca 1560
gcagctggtg cagctgttgg gaatccaggc gttactgtcg tggacattga tggggatggg 1620
agtttccaga tgaatgctca ggaactggca atgattcgga ttgagaatct tcctgtcaag 1680
attatggtgt tgaacaacca gcatttgggc atggtcgtgc aatgggaaga tcgattctac 1740
catagaaaca gggcacatac ttatctgggg aacccagcaa atgagactga ggtattccct 1800
gatttcttga aaatctcaga ggggtatggc atacctgctg ctcgtgtcac gaagaagagt 1860
gaggttagag aagcaatcag gaagatgctg gaaacatcgg ggccttactt gttggatgtg 1920
atcgtgccac atgaggaaca tgtgttgcct atgattccaa gtggcggtgc gtttaaagat 1980
atgatacttg atggagatgg aagaacacca tattagtgct ttttaagata ggtaatcagt 2040
tcagttctgt tgtggcattg agtatcagtt ctgttctgtc gtggcttgaa ctatgtattt 2100
aggacactat agtactatct ttcctgttgg tatgatgagt ttttctctgg ctttggtcat 2160
atctatgaac tacgaaatgt tatagtcatg tctacgacct acgaggactg gtcaatcttt 2220
ttcttccgtc ctgttgtgtg ttttgatcat tagttattac atagtttgtt cgctatcact 2280
tatttggtat tgctgggtgt ccttgttgca ttttcaggga ttttgtttta taagcttaag 2340
aataatttcc t 2351
<210> 8
<211> 2177
<212> DNA
<213> 小果野蕉
<400> 8
cctctcgctc gctcgctcgc tcgctcctca gggaacccag tatggcctcc tctacgccgg 60
gggccgccat cgctccctcg aatccctcgc tctggccatt ctccgctgca tcccgtctct 120
tggtcccctt ccccaagcct ctcctgtccc tctccccgag ccaccgcaat gttcggccca 180
tctccgcgtc cgcggaccgc caccagaccc cctccgccgc cgcctccacc gccgctacca 240
ccgctgtccc cctcctccgt aactttgccc ccgacgaacc ccgcaagggc tctgacatcc 300
ttgtcgaggc cctcgagcgg gagggcgtca ccgatctgtt cgcctaccct ggaggcgcct 360
ccatggagat ccaccaggcc ctcacccgct ccccatccat caccaatcac ctcctccgcc 420
acgagcaggg cgagatcttc gccgcctccg gctacgcccg ctccaccggc cgcccgggcg 480
tgtgcatagc cacctccggc cccggcgcta ccaacctcgt ctccggcttc gcggacgcgc 540
tcctcgactc cgtccccctc gtcgccatca ccggccaggt tccccgccgc atgatcggca 600
cggacgcctt ccaggagacc cccattgtcg aggtcaccag atccatcacc aaacacaact 660
acctggtcct taacgtcgat gacattcccc gcatcattaa agaagccttc tttctcgcca 720
gcactggccg ccctggcccg gtgctggtcg acatccccaa ggacattcag cagcagcttg 780
ccatccctgt ttgggacccg ccgctacgcc ttcccggata catctcccgc ctccctaagc 840
ctccttcacg ctgtctgctt gaccaaatca tccgccttgt gtccgaatcc catcgcccgg 900
ttctctatgt tggtggcggc tgcttgaact ccagcgagga gcttcgccgg tttgcggacc 960
ttactggcat ccccatcgcg agtaccctga tgggtctcgg ggtctatccc accgatgcgg 1020
agctatcgtt gaagatgttg ggaatgcacg ggactgtcta tgccaactat tcggtcgata 1080
aagctgacct cttgctcgca cttggcgtca ggttcgacga tcgtgtgact ggaaagcttg 1140
aagcttttgc tagcagggca aagattgtgc acatcgatat agatccggct gagatcggga 1200
agaacaagca gccacatgtt tcattatgtg ctgatgtgag gttggctctg caggggatga 1260
atgcattgat ggaggagagt gggattcatc aaaagttcga tttttccacc tggaggaaag 1320
agctggacca actgaagaag gcatacccat tgagctataa gacttttggg gatctgattc 1380
ctccccagta tgcaattcag gtgcttgacg aactgacgaa tggggaagca atcatttcca 1440
ctggtgttgg gcagcaccag atgtgggccg cacagtatta ctcatacaag agggcacgcc 1500
agtggctgac atcgggcgga ctaggtgcta tggggtttgg tttaccagca gctgccggtg 1560
cagccgttgg gaacccagga gttactgtcg tcgacattga tggggatggg agtttccaga 1620
tgaatgctca ggagttggct atgattcgga tagagaatct ccctgtcaag atgatggtgt 1680
tgaacaacca gcatttgggc atggttgtac aatgggagga tcgattctac cgtagcaaca 1740
ggggacatac ttacttgggg aacccagcaa acgagagtga gatatttcct gatttcttga 1800
aaatcacaga agcgtatggt atacctgctg cccgtgtgac aaagaagagc gaggtcaggg 1860
aagcaatcag gaagatgctg aagacaccag gaccttactt gttggatgtg atcgtgccac 1920
atgaggagca tgtgttgcct atgattccta gtggcggtgc atttaaggat atgatcctcg 1980
atggagatgg aagaacgcca ccacactaat acattttaat ttaggtaatc agttctgttc 2040
tgttatggct tcgaactgtg aagtcaggaa atatgtagtg ctggctttcg tgttgttaca 2100
atgagctttg gctatagtca catctgtgaa ctttgagaat tggttgacct atttttcctg 2160
tctttttgtg cattttg 2177
<210> 9
<211> 660
<212> PRT
<213> 小果野蕉
<400> 9
Met Ala Ser Ser Ala Ala Ala Ala Val Ala Ala Ala Gly Ala Val Ile
1 5 10 15
Thr Pro Ser Lys Ser Ser His Pro Pro Phe Ser Ala Ala Tyr Arg Leu
20 25 30
Pro Phe Pro Leu Pro Lys Pro Leu His Ser Leu Ser Thr Arg His Pro
35 40 45
His Val Arg Pro Ile Ser Ala Ser Ala Asp Arg Arg Gln Leu Pro Ser
50 55 60
Ala Ala Ala Pro Thr Ala Asp Ala Thr Thr Ala Pro Leu Leu Arg Asn
65 70 75 80
Phe Ala Pro Asp Glu Pro Arg Lys Gly Ala Asp Ile Leu Val Glu Ala
85 90 95
Leu Glu Arg Glu Gly Val Thr Asp Leu Phe Ala Tyr Pro Gly Gly Ala
100 105 110
Ser Met Glu Ile His Gln Ala Leu Thr Arg Ser Pro Ser Ile Thr Asn
115 120 125
His Leu Leu Arg His Glu Gln Gly Glu Val Phe Ala Ala Ser Gly Tyr
130 135 140
Ala Arg Ser Thr Gly Arg Pro Gly Val Cys Ile Ala Thr Ser Gly Pro
145 150 155 160
Gly Ala Thr Asn Leu Val Ser Gly Leu Ala Asp Ala Leu Leu Asp Ser
165 170 175
Val Pro Leu Val Ala Ile Thr Gly Gln Val Pro Arg Arg Met Ile Gly
180 185 190
Thr Asp Ala Phe Gln Glu Thr Pro Ile Val Glu Val Thr Arg Ser Ile
195 200 205
Thr Lys His Asn Tyr Leu Val Leu Asn Val Asp Asp Ile Pro Arg Ile
210 215 220
Ile Lys Glu Ala Phe Phe Val Ala Thr Thr Gly Arg Pro Gly Pro Val
225 230 235 240
Leu Val Asp Ile Pro Lys Asp Ile Gln Gln Gln Leu Ala Ile Pro Val
245 250 255
Trp Asp Pro Pro Leu Arg Leu Pro Gly Tyr Ile Ser Arg Leu Pro Arg
260 265 270
Leu Pro Ala Arg Pro Leu Leu Asp Gln Ile Leu Arg Leu Val Ser Glu
275 280 285
Ser Arg Arg Pro Val Leu Tyr Val Gly Gly Gly Cys Leu Asn Ser Ser
290 295 300
Glu Glu Leu His Arg Phe Ala Asp Leu Thr Gly Ile Pro Ile Ala Ser
305 310 315 320
Thr Leu Met Gly Leu Gly Ala Tyr Pro Thr Asp Ala Glu Leu Ser Leu
325 330 335
Lys Met Leu Gly Met His Gly Thr Val Tyr Ala Asn Tyr Ala Ile Asp
340 345 350
Lys Ala Asp Leu Leu Leu Ala Phe Gly Val Arg Phe Asp Asp Arg Val
355 360 365
Thr Gly Lys Ile Glu Ala Phe Ala Ser Arg Ala Lys Ile Val His Ile
370 375 380
Asp Ile Asp Pro Ala Glu Ile Gly Lys Asn Lys Gln Pro His Val Ser
385 390 395 400
Ile Cys Ala Asp Val Lys Leu Ala Leu Gln Gly Met Asn Ala Leu Met
405 410 415
Glu Glu Thr Glu Ile Tyr Arg Lys Phe Asp Phe Ser Thr Trp Arg Glu
420 425 430
Glu Leu Asp Lys Gln Lys Lys Ile Tyr Pro Leu Asn Tyr Lys Thr Phe
435 440 445
Gly Asp Gln Ile Pro Pro Gln Tyr Ala Ile Gln Val Leu Asp Glu Leu
450 455 460
Thr Asn Gly Glu Ala Ile Ile Thr Thr Gly Val Gly Gln His Gln Met
465 470 475 480
Trp Ala Ala Gln Tyr Tyr Ser Tyr Lys Arg Ala Arg Gln Trp Leu Thr
485 490 495
Ser Ala Gly Leu Gly Ala Met Gly Phe Gly Leu Pro Ala Ala Ala Gly
500 505 510
Ala Ala Val Gly Asn Pro Gly Val Thr Val Val Asp Ile Asp Gly Asp
515 520 525
Gly Ser Phe Gln Met Asn Ala Gln Glu Leu Ala Met Ile Arg Ile Glu
530 535 540
Asn Leu Pro Val Lys Ile Met Val Leu Asn Asn Gln His Leu Gly Met
545 550 555 560
Val Val Gln Trp Glu Asp Arg Phe Tyr His Arg Asn Arg Ala His Thr
565 570 575
Tyr Leu Gly Asn Pro Ala Asn Glu Thr Glu Val Phe Pro Asp Phe Leu
580 585 590
Lys Ile Ser Glu Gly Tyr Gly Ile Pro Ala Ala Arg Val Thr Lys Lys
595 600 605
Ser Glu Val Arg Glu Ala Ile Arg Lys Met Leu Glu Thr Ser Gly Pro
610 615 620
Tyr Leu Leu Asp Val Ile Val Pro His Glu Glu His Val Leu Pro Met
625 630 635 640
Ile Pro Ser Gly Gly Ala Phe Lys Asp Met Ile Leu Asp Gly Asp Gly
645 650 655
Arg Thr Pro Tyr
660
<210> 10
<211> 655
<212> PRT
<213> 小果野蕉
<400> 10
Met Ala Ser Ser Thr Pro Gly Ala Ala Ile Ala Pro Ser Asn Pro Ser
1 5 10 15
Leu Trp Pro Phe Ser Ala Ala Ser Arg Leu Leu Val Pro Phe Pro Lys
20 25 30
Pro Leu Leu Ser Leu Ser Pro Ser His Arg Asn Val Arg Pro Ile Ser
35 40 45
Ala Ser Ala Asp Arg His Gln Thr Pro Ser Ala Ala Ala Ser Thr Ala
50 55 60
Ala Thr Thr Ala Val Pro Leu Leu Arg Asn Phe Ala Pro Asp Glu Pro
65 70 75 80
Arg Lys Gly Ser Asp Ile Leu Val Glu Ala Leu Glu Arg Glu Gly Val
85 90 95
Thr Asp Leu Phe Ala Tyr Pro Gly Gly Ala Ser Met Glu Ile His Gln
100 105 110
Ala Leu Thr Arg Ser Pro Ser Ile Thr Asn His Leu Leu Arg His Glu
115 120 125
Gln Gly Glu Ile Phe Ala Ala Ser Gly Tyr Ala Arg Ser Thr Gly Arg
130 135 140
Pro Gly Val Cys Ile Ala Thr Ser Gly Pro Gly Ala Thr Asn Leu Val
145 150 155 160
Ser Gly Phe Ala Asp Ala Leu Leu Asp Ser Val Pro Leu Val Ala Ile
165 170 175
Thr Gly Gln Val Pro Arg Arg Met Ile Gly Thr Asp Ala Phe Gln Glu
180 185 190
Thr Pro Ile Val Glu Val Thr Arg Ser Ile Thr Lys His Asn Tyr Leu
195 200 205
Val Leu Asn Val Asp Asp Ile Pro Arg Ile Ile Lys Glu Ala Phe Phe
210 215 220
Leu Ala Ser Thr Gly Arg Pro Gly Pro Val Leu Val Asp Ile Pro Lys
225 230 235 240
Asp Ile Gln Gln Gln Leu Ala Ile Pro Val Trp Asp Pro Pro Leu Arg
245 250 255
Leu Pro Gly Tyr Ile Ser Arg Leu Pro Lys Pro Pro Ser Arg Cys Leu
260 265 270
Leu Asp Gln Ile Ile Arg Leu Val Ser Glu Ser His Arg Pro Val Leu
275 280 285
Tyr Val Gly Gly Gly Cys Leu Asn Ser Ser Glu Glu Leu Arg Arg Phe
290 295 300
Ala Asp Leu Thr Gly Ile Pro Ile Ala Ser Thr Leu Met Gly Leu Gly
305 310 315 320
Val Tyr Pro Thr Asp Ala Glu Leu Ser Leu Lys Met Leu Gly Met His
325 330 335
Gly Thr Val Tyr Ala Asn Tyr Ser Val Asp Lys Ala Asp Leu Leu Leu
340 345 350
Ala Leu Gly Val Arg Phe Asp Asp Arg Val Thr Gly Lys Leu Glu Ala
355 360 365
Phe Ala Ser Arg Ala Lys Ile Val His Ile Asp Ile Asp Pro Ala Glu
370 375 380
Ile Gly Lys Asn Lys Gln Pro His Val Ser Leu Cys Ala Asp Val Arg
385 390 395 400
Leu Ala Leu Gln Gly Met Asn Ala Leu Met Glu Glu Ser Gly Ile His
405 410 415
Gln Lys Phe Asp Phe Ser Thr Trp Arg Lys Glu Leu Asp Gln Leu Lys
420 425 430
Lys Ala Tyr Pro Leu Ser Tyr Lys Thr Phe Gly Asp Leu Ile Pro Pro
435 440 445
Gln Tyr Ala Ile Gln Val Leu Asp Glu Leu Thr Asn Gly Glu Ala Ile
450 455 460
Ile Ser Thr Gly Val Gly Gln His Gln Met Trp Ala Ala Gln Tyr Tyr
465 470 475 480
Ser Tyr Lys Arg Ala Arg Gln Trp Leu Thr Ser Gly Gly Leu Gly Ala
485 490 495
Met Gly Phe Gly Leu Pro Ala Ala Ala Gly Ala Ala Val Gly Asn Pro
500 505 510
Gly Val Thr Val Val Asp Ile Asp Gly Asp Gly Ser Phe Gln Met Asn
515 520 525
Ala Gln Glu Leu Ala Met Ile Arg Ile Glu Asn Leu Pro Val Lys Met
530 535 540
Met Val Leu Asn Asn Gln His Leu Gly Met Val Val Gln Trp Glu Asp
545 550 555 560
Arg Phe Tyr Arg Ser Asn Arg Gly His Thr Tyr Leu Gly Asn Pro Ala
565 570 575
Asn Glu Ser Glu Ile Phe Pro Asp Phe Leu Lys Ile Thr Glu Ala Tyr
580 585 590
Gly Ile Pro Ala Ala Arg Val Thr Lys Lys Ser Glu Val Arg Glu Ala
595 600 605
Ile Arg Lys Met Leu Lys Thr Pro Gly Pro Tyr Leu Leu Asp Val Ile
610 615 620
Val Pro His Glu Glu His Val Leu Pro Met Ile Pro Ser Gly Gly Ala
625 630 635 640
Phe Lys Asp Met Ile Leu Asp Gly Asp Gly Arg Thr Pro Pro His
645 650 655
<210> 11
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 11
ggtcttcgct gcctctggat 20
<210> 12
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 12
ccaccaccaa catagaggac t 21
<210> 13
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 13
atctgttcgc ctaccctgga 20
<210> 14
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 14
gatggatctg gtgacctcga c 21
<210> 15
<211> 22
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 15
tagggattcc gacgatttgt tt 22
<210> 16
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 16
tagcgtcatc attggctggg a 21
<210> 17
<211> 5214
<212> DNA
<213> 人工序列
<220>
<223> 合成多核苷酸
<400> 17
atgccaaaga agaagaggaa ggtttcatcg gagacgggcc ctgttgctgt tgaccccacc 60
ctgcggcgga gaatcgagcc acacgagttc gaggtgttct tcgacccaag ggagctccgc 120
aaggagacgt gcctcctgta cgagatcaac tggggcggca ggcactccat ctggaggcac 180
accagccaaa acaccaacaa gcacgtggag gtcaacttca tcgagaagtt caccaccgag 240
aggtacttct gcccaaacac ccgctgctcc atcacctggt tcctgtcctg gagcccatgc 300
ggcgagtgct ccagggccat caccgagttc ctcagccgct acccacacgt caccctgttc 360
atctacatcg ccaggctcta ccaccacgcc gacccaagga acaggcaggg cctccgcgac 420
ctgatctcca gcggcgtgac catccaaatc atgaccgagc aggagtccgg ctactgctgg 480
aggaacttcg tcaactactc cccaagcaac gaggcccact ggccaaggta cccacacctc 540
tgggtgcgcc tctacgtgct cgagctgtac tgcatcatcc tcggcctgcc accatgcctc 600
aacatcctga ggcgcaagca accacagctg accttcttca ccatcgccct ccaaagctgc 660
cactaccaga ggctcccacc acacatcctg tgggctaccg gcctcaagtc cggcagcgag 720
acgccaggca cctccgagag cgctacgcct gaacttaagg acaagaagta ctcgatcggc 780
ctcgccatcg ggacgaactc agttggctgg gccgtgatca ccgacgagta caaggtgccc 840
tctaagaagt tcaaggtcct ggggaacacc gaccgccatt ccatcaagaa gaacctcatc 900
ggcgctctcc tgttcgacag cggggagacg gctgaggcta cgaggctcaa gagaaccgct 960
aggcgccggt acacgagaag gaagaacagg atctgctacc tccaagagat tttctccaac 1020
gagatggcca aggttgacga ttcattcttc caccgcctgg aggagtcttt cctcgtggag 1080
gaggataaga agcacgagcg gcatcccatc ttcggcaaca tcgtggacga ggttgcctac 1140
cacgagaagt accctacgat ctaccatctg cggaagaagc tcgtggactc caccgataag 1200
gcggacctca gactgatcta cctcgctctg gcccacatga tcaagttccg cggccatttc 1260
ctgatcgagg gggatctcaa cccagacaac agcgatgttg acaagctgtt catccaactc 1320
gtgcagacct acaaccaact cttcgaggag aacccgatca acgcctctgg cgtggacgcg 1380
aaggctatcc tgtccgcgag gctctcgaag tccaggaggc tggagaacct gatcgctcag 1440
ctcccaggcg agaagaagaa cggcctgttc gggaacctca tcgctctcag cctggggctc 1500
accccgaact tcaagtcgaa cttcgatctc gctgaggacg ccaagctgca actctccaag 1560
gacacctacg acgatgacct cgataacctc ctggcccaga tcggcgatca atacgcggac 1620
ctgttcctcg ctgccaagaa cctgtcggac gccatcctcc tgtcagatat cctccgcgtg 1680
aacaccgaga tcacgaaggc tccactctct gcctccatga tcaagcgcta cgacgagcac 1740
catcaggatc tgaccctcct gaaggcgctg gtccgccaac agctcccgga gaagtacaag 1800
gagattttct tcgatcagtc gaagaacggc tacgctgggt acatcgacgg cggggcctca 1860
caagaggagt tctacaagtt catcaagcca atcctggaga agatggacgg cacggaggag 1920
ctcctggtga agctcaacag ggaggacctc ctgcggaagc agagaacctt cgataacggc 1980
agcatccccc accaaatcca tctcggggag ctgcacgcca tcctgagaag gcaagaggac 2040
ttctaccctt tcctcaagga taaccgggag aagatcgaga agatcctgac cttcagaatc 2100
ccatactacg tcggccctct cgcgcggggg aactcaagat tcgcttggat gacccgcaag 2160
tctgaggaga cgatcacgcc gtggaacttc gaggaggtgg tggacaaggg cgctagcgct 2220
cagtcgttca tcgagaggat gaccaacttc gacaagaacc tgcccaacga gaaggtgctc 2280
cctaagcact cgctcctgta cgagtacttc accgtctaca acgagctcac gaaggtgaag 2340
tacgtcaccg agggcatgcg caagccagcg ttcctgtccg gggagcagaa gaaggctatc 2400
gtggacctcc tgttcaagac caaccggaag gtcacggtta agcaactcaa ggaggactac 2460
ttcaagaaga tcgagtgctt cgattcggtc gagatcagcg gcgttgagga ccgcttcaac 2520
gccagcctcg ggacctacca cgatctcctg aagatcatca aggataagga cttcctggac 2580
aacgaggaga acgaggatat cctggaggac atcgtgctga ccctcacgct gttcgaggac 2640
agggagatga tcgaggagcg cctcaagacg tacgcccatc tcttcgatga caaggtcatg 2700
aagcaactca agcgccggag atacaccggc tgggggaggc tgtcccgcaa gctcatcaac 2760
ggcatccggg acaagcagtc cggcaagacc atcctcgact tcctcaagag cgatggcttc 2820
gccaacagga acttcatgca actgatccac gatgacagcc tcaccttcaa ggaggatatc 2880
caaaaggctc aagtgagcgg ccagggggac tcgctgcacg agcatatcgc gaacctcgct 2940
ggctcccccg cgatcaagaa gggcatcctc cagaccgtga aggttgtgga cgagctcgtg 3000
aaggtcatgg gccggcacaa gcctgagaac atcgtcatcg agatggccag agagaaccaa 3060
accacgcaga aggggcaaaa gaactctagg gagcgcatga agcgcatcga ggagggcatc 3120
aaggagctgg ggtcccaaat cctcaaggag cacccagtgg agaacaccca actgcagaac 3180
gagaagctct acctgtacta cctccagaac ggcagggata tgtacgtgga ccaagagctg 3240
gatatcaacc gcctcagcga ttacgacgtc gatcatatcg ttccccagtc tttcctgaag 3300
gatgactcca tcgacaacaa ggtcctcacc aggtcggaca agaaccgcgg caagtcagat 3360
aacgttccat ctgaggaggt cgttaagaag atgaagaact actggaggca gctcctgaac 3420
gccaagctga tcacgcaaag gaagttcgac aacctcacca aggctgagag aggcgggctc 3480
tcagagctgg acaaggccgg cttcatcaag cggcagctgg tcgagacgag acaaatcacg 3540
aagcacgttg cgcaaatcct cgactctcgg atgaacacga agtacgatga gaacgacaag 3600
ctgatcaggg aggttaaggt gatcaccctg aagtctaagc tcgtctccga cttcaggaag 3660
gatttccagt tctacaaggt tcgcgagatc aacaactacc accatgccca tgacgcttac 3720
ctcaacgctg tggtcggcac cgctctgatc aagaagtacc caaagctgga gtccgagttc 3780
gtgtacgggg actacaaggt ttacgatgtg cgcaagatga tcgccaagtc ggagcaagag 3840
atcggcaagg ctaccgccaa gtacttcttc tactcaaaca tcatgaactt cttcaagacc 3900
gagatcacgc tggccaacgg cgagatccgg aagcgcccgc tcatcgagac gaacggcgag 3960
acgggggaga tcgtgtggga caagggcagg gatttcgcga ccgtccgcaa ggttctctcc 4020
atgccccagg tgaacatcgt caaaaagacc gaggtccaaa cgggcgggtt ctcaaaggag 4080
tctatcctgc ctaagcggaa cagcgacaag ctcatcgcca gaaagaagga ctgggaccca 4140
aagaagtacg gcgggttcga cagccctacc gtggcctact cggtcctggt tgtggcgaag 4200
gttgagaagg gcaagtccaa gaagctcaag agcgtgaagg agctcctggg gatcaccatc 4260
atggagaggt ccagcttcga gaagaaccca atcgacttcc tggaggccaa gggctacaag 4320
gaggtgaaga aggacctgat catcaagctc ccgaagtact ctctcttcga gctggagaac 4380
ggcaggaaga gaatgctggc ttccgctggc gagctccaga aggggaacga gctcgcgctg 4440
ccaagcaagt acgtgaactt cctctacctg gcttcccact acgagaagct caagggcagc 4500
ccggaggaca acgagcaaaa gcagctgttc gtcgagcagc acaagcatta cctcgacgag 4560
atcatcgagc aaatctccga gttcagcaag cgcgtgatcc tcgccgacgc gaacctggat 4620
aaggtcctct ccgcctacaa caagcaccgg gacaagccca tcagagagca agcggagaac 4680
atcatccatc tcttcaccct gacgaacctc ggcgctcctg ctgctttcaa gtacttcgac 4740
accacgatcg atcggaagag atacacctcc acgaaggagg tcctggacgc gaccctcatc 4800
caccagtcga tcaccggcct gtacgagacg aggatcgacc tctcacaact cggcggggat 4860
aagcgccccg cagcaaccaa gaaggcaggg caagcaaaga agaaaaagac gcgtgactcc 4920
ggcggcagca ccaacctgtc cgacatcatc gagaaggaga cgggcaagca actcgtgatc 4980
caggagagca tcctcatgct gccagaggag gtggaggagg tcatcggcaa caagccagag 5040
tccgacatcc tggtgcacac cgcctacgac gagtccaccg acgagaacgt catgctcctg 5100
accagcgacg ccccagagta caagccatgg gccctcgtca tccaggacag caacggggag 5160
aacaagatca agatgctgtc gggggggagc ccaaagaaga agcggaaggt gtag 5214
<210> 18
<211> 1737
<212> PRT
<213> 人工序列
<220>
<223> 合成多肽
<400> 18
Met Pro Lys Lys Lys Arg Lys Val Ser Ser Glu Thr Gly Pro Val Ala
1 5 10 15
Val Asp Pro Thr Leu Arg Arg Arg Ile Glu Pro His Glu Phe Glu Val
20 25 30
Phe Phe Asp Pro Arg Glu Leu Arg Lys Glu Thr Cys Leu Leu Tyr Glu
35 40 45
Ile Asn Trp Gly Gly Arg His Ser Ile Trp Arg His Thr Ser Gln Asn
50 55 60
Thr Asn Lys His Val Glu Val Asn Phe Ile Glu Lys Phe Thr Thr Glu
65 70 75 80
Arg Tyr Phe Cys Pro Asn Thr Arg Cys Ser Ile Thr Trp Phe Leu Ser
85 90 95
Trp Ser Pro Cys Gly Glu Cys Ser Arg Ala Ile Thr Glu Phe Leu Ser
100 105 110
Arg Tyr Pro His Val Thr Leu Phe Ile Tyr Ile Ala Arg Leu Tyr His
115 120 125
His Ala Asp Pro Arg Asn Arg Gln Gly Leu Arg Asp Leu Ile Ser Ser
130 135 140
Gly Val Thr Ile Gln Ile Met Thr Glu Gln Glu Ser Gly Tyr Cys Trp
145 150 155 160
Arg Asn Phe Val Asn Tyr Ser Pro Ser Asn Glu Ala His Trp Pro Arg
165 170 175
Tyr Pro His Leu Trp Val Arg Leu Tyr Val Leu Glu Leu Tyr Cys Ile
180 185 190
Ile Leu Gly Leu Pro Pro Cys Leu Asn Ile Leu Arg Arg Lys Gln Pro
195 200 205
Gln Leu Thr Phe Phe Thr Ile Ala Leu Gln Ser Cys His Tyr Gln Arg
210 215 220
Leu Pro Pro His Ile Leu Trp Ala Thr Gly Leu Lys Ser Gly Ser Glu
225 230 235 240
Thr Pro Gly Thr Ser Glu Ser Ala Thr Pro Glu Leu Lys Asp Lys Lys
245 250 255
Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val Gly Trp Ala Val
260 265 270
Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe Lys Val Leu Gly
275 280 285
Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile Gly Ala Leu Leu
290 295 300
Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu Lys Arg Thr Ala
305 310 315 320
Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys Tyr Leu Gln Glu
325 330 335
Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser Phe Phe His Arg
340 345 350
Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys His Glu Arg His
355 360 365
Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr His Glu Lys Tyr
370 375 380
Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp Ser Thr Asp Lys
385 390 395 400
Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His Met Ile Lys Phe
405 410 415
Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro Asp Asn Ser Asp
420 425 430
Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr Asn Gln Leu Phe
435 440 445
Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala Lys Ala Ile Leu
450 455 460
Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn Leu Ile Ala Gln
465 470 475 480
Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn Leu Ile Ala Leu
485 490 495
Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe Asp Leu Ala Glu
500 505 510
Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp Asp Asp Leu Asp
515 520 525
Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp Leu Phe Leu Ala
530 535 540
Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp Ile Leu Arg Val
545 550 555 560
Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser Met Ile Lys Arg
565 570 575
Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys Ala Leu Val Arg
580 585 590
Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe Asp Gln Ser Lys
595 600 605
Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser Gln Glu Glu Phe
610 615 620
Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp Gly Thr Glu Glu
625 630 635 640
Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg Lys Gln Arg Thr
645 650 655
Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu Gly Glu Leu His
660 665 670
Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe Leu Lys Asp Asn
675 680 685
Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile Pro Tyr Tyr Val
690 695 700
Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp Met Thr Arg Lys
705 710 715 720
Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu Val Val Asp Lys
725 730 735
Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr Asn Phe Asp Lys
740 745 750
Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser Leu Leu Tyr Glu
755 760 765
Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys Tyr Val Thr Glu
770 775 780
Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln Lys Lys Ala Ile
785 790 795 800
Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr Val Lys Gln Leu
805 810 815
Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp Ser Val Glu Ile
820 825 830
Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly Thr Tyr His Asp
835 840 845
Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp Asn Glu Glu Asn
850 855 860
Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr Leu Phe Glu Asp
865 870 875 880
Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala His Leu Phe Asp
885 890 895
Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr Thr Gly Trp Gly
900 905 910
Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp Lys Gln Ser Gly
915 920 925
Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe Ala Asn Arg Asn
930 935 940
Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe Lys Glu Asp Ile
945 950 955 960
Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu His Glu His Ile
965 970 975
Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly Ile Leu Gln Thr
980 985 990
Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly Arg His Lys Pro
995 1000 1005
Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln Thr Thr Gln
1010 1015 1020
Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile Glu Glu
1025 1030 1035
Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro Val
1040 1045 1050
Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
1055 1060 1065
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn
1070 1075 1080
Arg Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe
1085 1090 1095
Leu Lys Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp
1100 1105 1110
Lys Asn Arg Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val
1115 1120 1125
Lys Lys Met Lys Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu
1130 1135 1140
Ile Thr Gln Arg Lys Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly
1145 1150 1155
Gly Leu Ser Glu Leu Asp Lys Ala Gly Phe Ile Lys Arg Gln Leu
1160 1165 1170
Val Glu Thr Arg Gln Ile Thr Lys His Val Ala Gln Ile Leu Asp
1175 1180 1185
Ser Arg Met Asn Thr Lys Tyr Asp Glu Asn Asp Lys Leu Ile Arg
1190 1195 1200
Glu Val Lys Val Ile Thr Leu Lys Ser Lys Leu Val Ser Asp Phe
1205 1210 1215
Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg Glu Ile Asn Asn Tyr
1220 1225 1230
His His Ala His Asp Ala Tyr Leu Asn Ala Val Val Gly Thr Ala
1235 1240 1245
Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe Val Tyr Gly
1250 1255 1260
Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala Lys Ser Glu
1265 1270 1275
Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe Tyr Ser Asn
1280 1285 1290
Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala Asn Gly Glu
1295 1300 1305
Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu Thr Gly Glu
1310 1315 1320
Ile Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val Arg Lys Val
1325 1330 1335
Leu Ser Met Pro Gln Val Asn Ile Val Lys Lys Thr Glu Val Gln
1340 1345 1350
Thr Gly Gly Phe Ser Lys Glu Ser Ile Leu Pro Lys Arg Asn Ser
1355 1360 1365
Asp Lys Leu Ile Ala Arg Lys Lys Asp Trp Asp Pro Lys Lys Tyr
1370 1375 1380
Gly Gly Phe Asp Ser Pro Thr Val Ala Tyr Ser Val Leu Val Val
1385 1390 1395
Ala Lys Val Glu Lys Gly Lys Ser Lys Lys Leu Lys Ser Val Lys
1400 1405 1410
Glu Leu Leu Gly Ile Thr Ile Met Glu Arg Ser Ser Phe Glu Lys
1415 1420 1425
Asn Pro Ile Asp Phe Leu Glu Ala Lys Gly Tyr Lys Glu Val Lys
1430 1435 1440
Lys Asp Leu Ile Ile Lys Leu Pro Lys Tyr Ser Leu Phe Glu Leu
1445 1450 1455
Glu Asn Gly Arg Lys Arg Met Leu Ala Ser Ala Gly Glu Leu Gln
1460 1465 1470
Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr Val Asn Phe Leu
1475 1480 1485
Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser Pro Glu Asp
1490 1495 1500
Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys His Tyr Leu
1505 1510 1515
Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys Arg Val Ile
1520 1525 1530
Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala Tyr Asn Lys
1535 1540 1545
His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn Ile Ile His
1550 1555 1560
Leu Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala Ala Phe Lys Tyr
1565 1570 1575
Phe Asp Thr Thr Ile Asp Arg Lys Arg Tyr Thr Ser Thr Lys Glu
1580 1585 1590
Val Leu Asp Ala Thr Leu Ile His Gln Ser Ile Thr Gly Leu Tyr
1595 1600 1605
Glu Thr Arg Ile Asp Leu Ser Gln Leu Gly Gly Asp Lys Arg Pro
1610 1615 1620
Ala Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys Thr Arg
1625 1630 1635
Asp Ser Gly Gly Ser Thr Asn Leu Ser Asp Ile Ile Glu Lys Glu
1640 1645 1650
Thr Gly Lys Gln Leu Val Ile Gln Glu Ser Ile Leu Met Leu Pro
1655 1660 1665
Glu Glu Val Glu Glu Val Ile Gly Asn Lys Pro Glu Ser Asp Ile
1670 1675 1680
Leu Val His Thr Ala Tyr Asp Glu Ser Thr Asp Glu Asn Val Met
1685 1690 1695
Leu Leu Thr Ser Asp Ala Pro Glu Tyr Lys Pro Trp Ala Leu Val
1700 1705 1710
Ile Gln Asp Ser Asn Gly Glu Asn Lys Ile Lys Met Leu Ser Gly
1715 1720 1725
Gly Ser Pro Lys Lys Lys Arg Lys Val
1730 1735
<210> 19
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 19
caggtccctc gccgcatgat 20
<210> 20
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 20
caggttcccc gccgcatgat 20
<210> 21
<211> 23
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 21
agagccagct gttgtggact tgg 23
<210> 22
<211> 23
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 22
gttccagaaa gggagcgaga agg 23
<210> 23
<211> 23
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 23
tgaacagcct aaccctggcg cgg 23
<210> 24
<211> 23
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 24
gaagcaaaag ttcaaacagt tgg 23
<210> 25
<211> 23
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 25
ctgcgagaac ctcggcctgg agg 23
<210> 26
<211> 23
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 26
ggtgccaaag ttcggacctt tgg 23
<210> 27
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 27
acccaaaggt cgacgacaaa 20
<210> 28
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 28
catgaacgcc ctcatgttgt t 21
<210> 29
<211> 21
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 29
ttgatgccca cgagcatttg a 21
<210> 30
<211> 19
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 30
atccactggc catccttga 19
<210> 31
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 31
aggtgcccat gttctcgaag 20
<210> 32
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 32
aagtaggtca cgtccctccc 20
<210> 33
<211> 19
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 33
tccatcaggc tctcacccg 19
<210> 34
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 34
caacatagag aaccgggcga 20
<210> 35
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 35
accgaagccc ctcttaaccc 20
<210> 36
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 36
gtatggctga caccatcacc 20
<210> 37
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 37
cagacaatcg gctgctctga 20
<210> 38
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 38
gatgtttcgc ttggtggtcg 20
<210> 39
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 39
cctcgttgct tggggagtag 20
<210> 40
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 40
gccatcaccg agttcctcag 20
<210> 41
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 41
atcgacctct cacaactcgg 20
<210> 42
<211> 20
<212> DNA
<213> 人工序列
<220>
<223> 合成寡核苷酸
<400> 42
ttgttgccga tgacctcctc 20
<210> 43
<211> 1884
<212> DNA
<213> 小果野蕉
<400> 43
atggggattc ccggtgacga gatcctctcc agggtcgcta cgggcgatgg ccacggtgag 60
aacacctcgt acttcgatgg ctggaaggcc tacgataatg atcctttcca cccgattcat 120
aatcccaatg gtgtcatcca aatgggactc gcagaaaacc aggtaatgct tgtttctggc 180
tctgtccatt actttctcct cctcctgctg ctgctgctgc taatgggttt cggtctgcct 240
ttcctcagct ctgcttggac ttgatgcgag attggatcag gaagaatcca caggcttcta 300
tatgcaccaa ggagggcgtt tcagagttcg aagccatcgc taacttccag gactaccatg 360
gcctgccgga cttccgtaag gtaatcaccg tctgcagcca taatgcagct cctcgatccc 420
ttactcatgc gtgccatgaa cgatgagggc acagttggat cgatatgcgt tgctatagcc 480
gaaaggtaat gacgcgatca tctatggaaa tgcacaggcc attgccaagt tcatggagaa 540
agcgagagga ggacgagcca ggttcgaccc ggagcgcata gtgatgagcg gtggagccac 600
cggagctcaa gaaacgatcg cattttgtct ggccaatccc ggggacgcct tcctcattcc 660
gacgccatac tacccagcgt acgtatgcct gttgagtcaa cattctgatc tctcaagtaa 720
ttgcgtcgtc aacttccccg ttcgaacaaa tgttccagcc gaccaatcag tcgtgcaatg 780
acccaaacga cagtcaaact tttatctgcc tgagcattga ccaaaaccac accattcaac 840
gtaattgtgg tcatgcaatc cgacactaaa gaacgacatt tggttcttct caggttcgat 900
cgagacttca ggtggagaac tggagttcag ctcctcccta ttcgctgcca cagtcacgac 960
aacttcaaga tcaccgaagc cgagcttgct gctgcctacc ggaaggcgcg cgactctaag 1020
atcagggtta aaggaatact aataaccaac ccgtcgaatc ctctgggcac aaccatggac 1080
agggagacgc taagaaccct agtaagattc gcgaacgagg aaaggatcca cctagtctgc 1140
gacgagatct tctccggcac agtcttcgac gggccggaat atgtcagtgt ggcggagata 1200
ttgcaagagg atccgtcgac ctgcgacgga gacctaatcc acatcgtcta cagcctgtcg 1260
aaggacctcg gcgtccccgg attccgtgtc ggcatcatat actcgttcaa cgacgcggtg 1320
gtcagctgcg ctcggaggat gtccagcttc ggactggtct cgacgcagac tcaacgcctg 1380
cttgcttcca tgctgggaga cgacgacttc accaccgacc tcttggcgga gagcaggagg 1440
agattaatgc acaggcacag gacgtttact gccggcctcg aaggcgtcgg cattcgttgc 1500
ttacagagca acgccggact attctgctgg atgagcttga agcctctgct gaaagacgcc 1560
acggcggagg gcgaggtcga gctgtggcgg gtgatagtga acgaggtgaa gctcaacatc 1620
tctccggggt cctcgttcca ctgcaccgag ccggggtggt tcagggcgtg ctttgccaac 1680
atggacgagg agaccatgga gacggccctg cggcggatca ggacgttcgt gcgccgggcg 1740
aacgacgcag ctactgccgc caagaccaag aagaggtggg acacatcgct tcgcctgagc 1800
ttgccacgaa ggttcgagga gatgaccgtc ctgacaccgc gtctgatgtc tcctcgctct 1860
ccgctcgttc aggccgccac ctga 1884
<210> 44
<211> 2504
<212> DNA
<213> 小果野蕉
<400> 44
gcagcagctg cttctccttc ttctctgctc gcttcagcct tttccggtac gtacctgaga 60
taacgggtca catgaggatc tacggcgagg agcacccaaa tcagcagatc ctctctcgga 120
tcgcgaccaa cgacggccat ggcgagaact cctcctactt cgatgggtgg aaggcctacg 180
agaaggatcc tttccacctc accgacaacc ccacgggggt catccaaatg ggactcgcag 240
aaaaccaggt tagagttcct tcatggtgat gattaatcgc acatgccttc cgtcaattgc 300
cactccctgc ggttgctaat ctaatctgta tgtgggtttt gggtctttct ttcctcagct 360
ttccctcgac ttgatccgag actggatgaa gaagaacccg caggcttcga tctgcaccga 420
agaaggggtc tcagagttca aagcaattgc caactttcag gactatcatg gcctcccagc 480
cttccgaaag gtaatgattt caacccaaaa cgcagcgctg cagctgcttg tcctcactgt 540
ccaagtagct acatacgtcc aatatgataa agctgggact gacagccact tacggcccga 600
gccctgcctg ctcaccctgg ataagggata agctaatgat ggtgtgattt gctgacacgc 660
gcaggccatc gcccagttca tggagaaggt gagaggggga cgagccagat ttgacccaga 720
ccgcatcgtg atgagcggtg gagccaccgg tgctcaggaa accatcgcct tttgcctggc 780
tgatcctggc gaggccttct tgattccaac gccatattat ccggggtaag tgttcaggtg 840
tactaatcta ccgagttctt tatccggcag aggatctaat ggcatctgca tggtttccag 900
attcgatcga gacttcaggt ggaggacagg agttcagctc ctccccattc actgccacag 960
ttccaacaag ttcaagatca cccaagccgc actggagact gcttacagga aggctcgaaa 1020
ctcacacatt agagtcaaag gaatactggt gaccaaccca tcgaaccctc tgggcacaac 1080
catggacaga gagacgctga gaaccctagt cagcttcgtc aacgagaaaa ggatgcactt 1140
ggtgtgcgac gagatcttct ccggaaccgt cttcgacaag ccgagttacg tgagcgtctc 1200
cgaggtgatc gaagacgatc cctactgcga cagggatctg attcacatcg cctacagcct 1260
ctccaaggac ctgggcgtcc ctggcttccg cgtcggcgtc atatactcct acaacgacgc 1320
cgtggtcagc tgcgcgagga agatgtcgag ctttggactg gtctcgtcgc agacgcagca 1380
cctgctcgct tccatgttgg gagacgagga gttcaccacg agtttcttag cgacgagccg 1440
gacgaggttg tgcgggcggc gcagggtctt tacggacggc ctcaagcgag tcgggattca 1500
ttgcttggac ggcaacgcgg ggctgttctg ctggatggac ttgaggccgt tgctgaagga 1560
agcgacggtg gaggcggagc tccggctgtg gcgggtgatc atcaacgacg tgaagctcaa 1620
catctcgccg gggtcgtcct tccactgctc ggagccgggg tggttcaggg tatgcttcgc 1680
caacatggac gacacggcca tgaagatagc gctgaggagg atcgagagtt tcgtgtaccg 1740
ggagaacgac gccgctgtgc aggcgaagaa caagaggagg tgggacgaag cgctgcggct 1800
gagcttgcct cgtcggaggt tcgaggatcc gaccatcatg acaccacatc tgatgtctcc 1860
ccactcgcct ctcgttcaag ccgccacctg aaacatcgac agcggcgtgt ctgatgtcaa 1920
cgaaggttaa ttaccgtctg atatgttgca catttctttg ttctttggat tatttatttt 1980
tttttttttg ggaaaaatgg gttgaatgtt cccactaagt tatattagat tgttgttcgg 2040
tctcattcat gttataggaa acgaggatag aattgcttgc ctctctcttt cttttatata 2100
tggaaatatg ttacaattgg cctaagctta tttgatgaca ttaatttcac aagacaaagc 2160
cttctaatta atgtttcgga ccaaatgcag gagctcacta catacatttg ttacacttca 2220
tatgttcaaa attagtccag tttaccggtg actcagtttt aaaggttata aatggttctg 2280
attcaagtac ttatctttgg ttctgttaat tggttcaaac cgaatcgatt ttaatttaaa 2340
caatattaat ttaattaaat tttttaattg gtttaaatcg attaatcaaa tcagttgatc 2400
agggaaaata ttattgatgt cttactcaac tcgatatggt ctatactcac gtgcgtagga 2460
atgtccgaga tgtctctgag ataaaaacat cgtgctttcg tgat 2504
<210> 45
<211> 2294
<212> DNA
<213> 小果野蕉
<400> 45
tagctcgtgt tctcccttct ccccaggctt cccagtactc gcctaagatc gtaacgtcgg 60
caatggggct ccacgttgat gaacactcaa attacaatgt cctctccagc atcgcaacga 120
gcgatggcca cggggagaac tcctcatact tcgatggctg gaaggcctac gataatgatc 180
ctttccaccc catcgacaat cctcaggggg tcatccaaat gggacttgca gaaaaccagg 240
taaatgctgt ttcacaacta gttcggtaat tatggtagtt ttttcatggc ctatggccaa 300
aaatatgcct tccgtattct cctactactt ggaatgctaa cgggtgctgc gttttcctta 360
tctcagctct gcctggactt gatgcagcag tggatcaagc agaacccaca ggcttccatt 420
tgcaccggcg agggcgtttc cgagtttaag gacgtcgcga acttccaaga ctaccacggc 480
ctgccagact tccgaaaggt aataaccatc acagtgcagc tctttagtta gtccttatca 540
tgtcataaac tgtggaccct cgagaataga ttacatcact cagataaaag atgtgcgcat 600
tatgactcac gtacatgagt ccagaacttg tatctacttg taacgacgtc aagaggattc 660
tggaaatggt gcctgctggg ctagggacaa cctcactaga ttgctttgct gtttctgaaa 720
ggctaatgat gtgatttgtg gaaacacgca ggcgattgct aggttcatgg ggaaagcgag 780
aggaggagga gctacgttcg acccggagcg cattgtaatg agcggcggag ccaccggagc 840
tcaggaaacc atcgcatttt gtctagcgaa tcctggggag gccttcctga ttccaacgcc 900
atattatcca gggtacgtag acctatccta catcaagatt ttatgtttta tgtatatttc 960
acagtgacac taatctgttt taaagaaaac tgtttgagga tgagccgatc gaactacgga 1020
ggcaacatta atataatcca gcttactggt ataaccaaaa aattagtagt caatatttgc 1080
catcgcacga ctgtgacgtc gacaagacag tctcagtata ttatatttct taattaataa 1140
cgctacacca aaaccataac cgacctaccg gccgcttgag gtttctgcac tctccggcct 1200
cattatggat ctatcggttg atatatatat atatatatat gacagcgatt tcacatttcc 1260
tgcagcttcg atcgagactt tcggtggaga actggagttc aactcctccc tattcagtgc 1320
cacagcttcg acaacttcaa gatcaccgaa cccgcgctag ttactgccta tcaaagggca 1380
caaacagcta acatcagggt taaaggaatc ctggtaacca acccttcaaa ccctctgggt 1440
acaaccttgg acagagagac actgagaacc ttagtgagct tcgccaacga gaaacggatc 1500
cacttggtgt gcgacgagat attctcgggc accgtcttcg acaagcctac ctacgtcagc 1560
gtctccgaga tcgtggaaga ggaaccatac tacgacaggg acctaattca catcgtctac 1620
agtctgtcca aggatctcgg cgtccctgga ttccgcgtgg gtgtcattta ctcgtacaat 1680
gatgcagtgg tcagctgtgc tcggaagatg tccagctttg gactggtctc gactcaaacg 1740
cagcacctac tggcttccat gctgggagat gatgacttca caaccaaatt tttggcggag 1800
agcaggagga gattgtcgcg caggcacaaa tattttactg ctggcctcca cagagttgat 1860
atcaaatgtt tggagagcaa tgcggggcta ttctgctgga tgaacttgac gcatctgcta 1920
aacgaagcca cggtggaggc ggaactcaag ctgtggcgag tgataattaa ggaggtgaag 1980
ctcaacattt caccggggtc ttcgttccac tgctctgagc cggggtggtt cagggtgtgc 2040
ttcgccaaca tggacgataa caccatggaa accgcattga agaggatcag gaagtttgtg 2100
tcccccggga atcacactgc ggctgcgcaa gccaagaaga agaacaagag gtgggacgcg 2160
gcgctccgcc taagtttgcc tcgtcggttc gaggaactga gcatcatgac acctcgcctc 2220
atgtctcctc actcgcccct tgttcaggcc gccaactgat ggtgatggat gagcgtgggc 2280
gatattaacc gacg 2294
<210> 46
<211> 1384
<212> DNA
<213> 小果野蕉
<400> 46
ggctatatat aagtagcaac gtggagtgac gagtgggaat agacaaagga aatggcgtgc 60
tccttcccgg tcctcgactt ggagaagctc cgtggagagg agagagagca gtccatggac 120
ctccttcgtg acgcttgcga gaaatggggc ttctttgagg tgctctcatt cttttacgtg 180
aacaagttga tgcccacgag catttgaagc taattatcat ggctttccat ttgcttgctg 240
atgtagctgc tcaaccatgg gatctcgcat gagctgatgg acgaggtgga gaggcggacc 300
aaagcgcact acgagcaatg caggaagcaa aagttcaaac agttggcgtg caaggctctc 360
aagagcggac ccgggacgga tgtcaccgac atggactggg agagcacctt cttcctgcgc 420
catctccccg tctccaacat gtccgacttc ccagacatgg acgaggagta ccggtaccgc 480
ttcgatttcc ttcgttacag cgcacccccc accaccatcg actgtagtct gccccgacta 540
accttcgcct tcaggaaggc gatgacggaa ttcgcgacgg ggttagagaa gctggcggag 600
cgtcttctcg atctgctctg cgagaacctc ggcctggagg agggttacct caagaacgcc 660
ttctacggat ccaaaggtcc gaactttggc accaaggtga gcaactaccc gccatgccct 720
cgcccggagc tgatccacgg cctgcgagcc cacaccgacg ccggcggcat catcttgctc 780
ttccaagacg accgcgtcag cggcctccag cttctcaagg atggccagtg gatcgacgtg 840
ccgcccatgc accactccat cgtggtcaac ctcggagatc agatagaggt cctctctctc 900
tctctctctc tctctctttc tctctctgcg ggacaaaaga tcaaacaaca tgagggcgtt 960
catgcaggtg atcacgaacg gcaagtacaa gagcgtgctg caccgggtgg tggctcggag 1020
cgacggcaac aggatgtcga tcgcctcctt ctacaacccg agcggcgacg ccgtcatcta 1080
ccccgcgccc tccctggtcc agaaggaagc ggaggcgtac ccgaggtttg tgttcgagga 1140
ctatatgaag ctctacgtca cgcaaaagtt tcaagcgaag cagcccaggt ttgaagcaat 1200
gaaggccacg gtaacagtca atggccaacc tacgcctaca ccttaggaca ccacgacgtc 1260
tcacgtggag atgccaccat ctattagaat gtggcatcca attgtggaaa taataagcga 1320
agcactatga acgtggcttt ttttagtctc gagggttatg tcgtcgatcc aattttccac 1380
ttct 1384
<210> 47
<211> 1402
<212> DNA
<213> 小果野蕉
<400> 47
acactccaga tagaaagcac aagtgcaatc agggaagaaa gagcgtgtca tggattcctt 60
tccggttatc gacatggaga agcttttggg aagggagaga ggagaagcca tggagatcct 120
ccgagatgct tgcgagaaat ggggcttctt tgaggtgctg aagcatacat aactggtttt 180
gcttctttga actatatata tattgctaaa atgtactatt tgcacatgca atctgtgtgt 240
agattttaaa ccatggcatc tcacatgacc tcatggatga agtggagaag gtgaacaaag 300
accagtacaa caaatgcagg gagcaaaagt tcaacgagtt cgccaacaaa gcactggaaa 360
acgccgactc agaaatcgac cacctcgact gggaaagcac ctttttcctg cgtcatctcc 420
ccgtctccaa catttctgag atccccgatc ttgatgacca gtataggttg cacgatctga 480
tcatgatgtc atcttctggc ctggtctttt caccttgctc atcgtttcgt ttcttgggac 540
gatgactgcg tgcaggaagg cgatgaagga atttgcggca gagatggaga agctggcaga 600
gcggctgctc gacttgctgg gtgagaacct ggggctggag aaggggtacc tgaagaaagc 660
cttctctaat ggatccaagg ggccaacctt tgggaccaag gtcagcagct acccgccatg 720
cccgcgcccg gacctggtga agggcctgag ggcgcacacc gacgccggag gcatcatctt 780
gctcttccag gacgaccagg tcagcggcct gcagttcctc aaggacggcg agtggctgga 840
cgtgcccccc atgcgccacg ccatcgtcgt caacctcggc gaccagctcg aggtttgggt 900
cctctttgct ctcgtttccg ctgcccgtcg tctgtgatgt tgaatgcaac gaggtctgca 960
ggtaatcacc aatggcaagt acaagagcgt ggtgcaccgc gtggtggctc agactgatgg 1020
caacaggatg tcgattgcct ccttctacaa ccccgggagc gacgctgtga tcttcccggc 1080
ccccgctctt gtggagaagg aagcggagga gaagaaggag gtctatccga agttcgtgtt 1140
cgaggattac atgaagctct acgtcgggca taagttccag gccaaggagc caagattcga 1200
agccatgaaa gccatggaag cagttgccac ccacccaatc gctacctctt aagtgacagc 1260
ccccaagtta gtgcatgtcg ctgtacttcg cgttaggaag ctgtcgtcta tgtctatgta 1320
acccgatgga tgtgtggtat gtacgtgtgt gagccttttc taatgaagca aatcatataa 1380
tatatatata tatatatata ta 1402
<210> 48
<211> 1761
<212> DNA
<213> 小果野蕉
<400> 48
atggccggcc ttccttattc agctcctcac cctgccacca tctccgcttc ctccaactcc 60
tttgcatgcc ccttccgcag caaggggctt gtcttcccct accctaccag aagagcactc 120
catgttcgtc ccaacatcgc atgcaaggca ggcgaggagc acgagatcgc tgctaaggtc 180
gaccgacgcg acgtactcgt gggcctcggt gggctctgcg gagccgccgc tggccttggc 240
gggttcgata aagccgccct cgctaacccc attcaggccc ctgatctctc caagtgcggc 300
cctgccgacc tccccaccgg cgtgccagtc gtcaactgct gcccgcccta ccgtcccggt 360
gcgaagattg tggatttcaa gcggccgtcg ccgtcctccc cactccgcgt ccgccccgcc 420
gcccacttgg ttgaccccga gtacctggcc aagtacaaga aggccatcga gctcatgaag 480
gcgctcccgg ccgatgaccc tcgcaacttc atgcagcagg ccgacgtcca ctgcgcctac 540
tgcgacggcg cttacgacca gatcggcttc cccaaccttg agatccaagt ccacaacagc 600
tggctcttct tcccctggca ccgcttgtac ctctacttca acgagaggat cctcggcaag 660
ctcatcggcg acgacacctt ctcgctccct ttctggaact gggacgcacc cggcggaatg 720
atgctgcctt cgatctacgc cgatccttcg tcacccctct acgacaaact tcgcgacgcc 780
aagcaccaac ctcctgtcct tgtcgacctc gactacaatg gaaccgaccc aaccttcccc 840
gacgcccagc aaatcgatca caacctcaag atcatgtacc gccaagtctt ctccaacggc 900
aagacgccgt tgctgttctt aggctcagct taccgtgccg gtgaacagcc taaccctggc 960
gcgggctccg tcgagaacat gccgcacaac aacgtgcact tgtggaccgg cgaccgcacc 1020
cagcccaact tcgagaacat gggcaccttc tacgccgcgg cgcgcgaccc catcttcttc 1080
gcccaccacg ccaacatcga ccgcatgtgg tacctgtgga agaagctcag caggaagcac 1140
caggacttca atgactcgga ctggctcaaa gcttccttcc ttttctacga cgagaacgcc 1200
gacttagttc gggtcacggt caaggactgc ttggagaccg attggctgcg ctacacgtac 1260
caagacgtga agatcccatg ggtgaacgcc cgaccgactc ccaagctcgc caaggcgagg 1320
aaagccgcca gcagttcgct gaaagccacc gcggaggtgc agttccctgt gacgctggaa 1380
tccccggtca aagcgacggt gaagaggccc aaggtgggga ggagcggcaa ggagaaggaa 1440
gatgaggagg agatactcat agtggagggg atcgagttcg accgcgacta cttcatcaag 1500
ttcgacgtct tcgtgaacgc gacggagggc gacggcatca cggccggggc cagcgagttc 1560
gccggcagct tcgtgaacgt cccgcacaag cacaagcacc gcaaggatga gaataagctg 1620
aagacgaggc tgtgtcttgg aatcaccgac ctgctcgagg acatcggcgc ggaggacgac 1680
gacagcgtgc tcgtcaccat cgtgccgaag gcgggcaaag gaaaggtgtc cgtcggcggt 1740
cttcggattg acttttccaa g 1761
<210> 49
<211> 1734
<212> DNA
<213> 小果野蕉
<400> 49
atggcttcta tctcgcagct aatcactaca agcatcccca ccaccttttc tctctcctat 60
tcatgcccct tccctccgag aaccacggta tcgatctccg gcttcaaaca cctccaccac 120
gtttccccca tctcatgctc caccagagac cacaaccaac ccctcgtcga tcgccgccac 180
gtcctcgtcg gaataggcag cctctacggc gcctccgctg cactaacgtc cctccgcgag 240
gccagtgcgg caccgatcgc ggcgcccgac ctgtccgcat gcgggctggc tgacctccct 300
ccggatgcca ctccgacgaa ctgctgcccg ccgtccgcgg gagacgcgac cgagttcgtc 360
attcccgacc cgtcctcgcc cttgagggtg cgcccggcgg cccactcggt cgacaaagat 420
tacatagcta agttcgcgaa gggcgtcgct ctcatgaagg cgcttccggc cgacgacccc 480
cggaacttca ctcagcatgc caacgtgcac tgcgcctact gcgacggggc gtacagccaa 540
gtcggcttcc cggaccttga gctccaggtg cacaactcat ggctcttcct gccatggcac 600
cgctgctacc tctacttctt cgagagaata ctcgggaagt tgatcggcga cgacagcttc 660
gcgattccgt tctggaactg ggacgcccct gacgggatgc gattgccagc gatgtacgtg 720
gatcccacgt cgccgcttta cgatccccta agggatgcac agcatcagcc gccgacgtta 780
gtggatttgg acttcggagg gatcgatcct tcttccagtg ataagcagca gattgatcac 840
aacctcaagg ttatgtacag gcagatcgtc tcgaatgcac cgacaccgag gctcttcttc 900
ggaaacccct accgagccgg cgacaatccg aaccccggtg gcggctcgct tgagaacgtc 960
ccccacggac cggtccacgt ctggaccggc gaccgcagcc agtcggaact ggaggacatg 1020
ggcaacctgt actccgccgc tcgcgacccc gtcttctttg cccaccactc caacatcgac 1080
cgcatctgga acgtgtggaa gggtctcggt agccggcgca aggacctggc cgaccccgac 1140
tggctcgacg cctccttcgt cttctacgac gagaacgcca acctcgtcaa gatccgagtt 1200
cgcgactgca tcgactcaga caagctacgc tacgagtacc aggacgtcgg taacccatgg 1260
ctcaacacac gcccgacggt gacgtccgga gtgaggccga gagtggccgg agtggcgcat 1320
gcaaacgtcg tggagccgaa gtttccgata aagttggact cagtggtgac tgccaaggtg 1380
aagaggccaa aggcggcgag gaccaaggag gagaaggagg agaaggagga ggtgctggtg 1440
gttgaaggga tcgagctgga tcgagacgtg cacgtcaaat tcgacgtgtt cgtgaacgtg 1500
accgaccacg ggaaggtcgg gccggggggc cgggagctcg ccgggagctt cgtgaacgtg 1560
cctcacaggc acaagcatga caagatgagc aagcagctga agaccaggct gcagctgggc 1620
ttgactgagc tgttggagga tctcaaggct gatgggagca tcatggtgac tttggtgccg 1680
aggcagggga aggggaaggt gaaggttggc agtctcaaga tcgagttagt tgat 1734
<210> 50
<211> 1608
<212> DNA
<213> 小果野蕉
<400> 50
atggcagaga tcggcaatcc aaatgaaaac aacacctcaa tatttgccga ccgcaagccg 60
ccgaaacgcc tcgtgttgcc attcggcgtc cgaccaccag tggcagaggc actgacaaag 120
gttgatgcaa gtttctgcga ccccaagaac aacaatgagt gtataaactt aggatcccaa 180
ggaggcgagt gcgccgccaa ctgctgcctt cccatccctc ccggtgcgaa gattgtggat 240
ttcaagcggc catcgcggtc cacccccctc cgcgtccgcc ccgccgccca cttggtcgac 300
cccgagtacc tggccaagta caagaaggcc atcgagctca tgaaggcgct cccggccgac 360
gaccctcgca acttcatgca gcagtccaac gtccactgcg ttcactgcga cagcatcccc 420
gacaatgaca tccaagtcca ccagaactgg ttcttctacc cctggcaccg ctggtacctc 480
tacttcaacg agaggatcct cggcaagctc atcggcgacg acaacttcac gctccctttc 540
tggaactggg actcgctcgg cggaatgatg ctgccgtcga tctacgccga cccttcgtcg 600
cccctctacg acaaccttcg cgacgccaag caccaacctc ctttccttgt cgacttcgac 660
ttcaatggaa ccgacccagg cttcaccgac gcccagcaaa tcgatcacaa cctcaagatc 720
atgtaccgcc aattcttctc caacggcaag aagccgatgc tgttcttagg ctcaccttac 780
cgcgggggcg acaagcctaa ccccggcggg ggctccgtcg agaacacgcc gcacaacaac 840
gtgcacacgt ggaccggcga ccgtactcgt cccaacttcg aggacatggg caccttctac 900
tcggcggggc gcgaccccat cttcttcgcc caccacgcca acatcgatcg catgtggtcc 960
ctgtggaaga agctcagccg taagcaccgg gacttcaatg actcggactg gctgaaaact 1020
tccttcctct tctacgacga gaacgccgac ttagttcggg tcacggtcaa ggactgcttc 1080
aagacccggt ggctgcgcta caagtaccaa gacgtggaga tcccatgggt gaaagcccga 1140
ccgactccca agctcaccaa ggcgaggaaa gccgccagcg gatcgctgaa acccaccgcg 1200
gaggcgcagt tccctgtgac gctggaatcc ccggtcagcg cgacgctgaa gaggcccaag 1260
gtggggagga gccgcaagga gaaggaagag gaggaggaag tgctcatagt ggaggggatc 1320
gagttcgacc gcgaccagtt catcaagttc gacgtcatcg tgaacgcgac ggagggcgat 1380
ggcatcacgc ccgcggacag cgagttcgcc ggcagcttcg tgaacacccc gcacaggcac 1440
aggcacctca aggaggagaa caaggggacg acgaggctgt gtctggggat caccgacctg 1500
ctcgaggaca tcggcgggga ggccgacgac ggcgtgctcg tcaccatcgt acccaaggca 1560
ggcaagggca aggtttccgt cggcggtctt cggattgact tcaccaag 1608
<210> 51
<211> 1770
<212> DNA
<213> 小果野蕉
<400> 51
atgtccctgc tgttgaactc tagcttcacc ggtgcttcct ctgcatgcct cctccaacgg 60
gaaaggtccc gccgccgccg cctccacgtc cctggcgtga catgccgcca gggcagtaat 120
ggtgaccgca gcgatgccgc ccgccagcag cagtcgccgc cgctgctgga tcggcgcgac 180
atgctgttgg gtttaggagg gctttacggc gtgaccgcag gacccaaggt tctggcggcg 240
ccgataatgc cgccggatct gtccaagtgc taccctgcca ccgcacctgc cctcgacaac 300
aaatgctgcc cgccttacga ccccggcgag acgatctcgg agtacagctt ccccgctacg 360
cccctccggg tgcggcggcc ggcccatatc gtgaaggacg atcaggagta tatggacaag 420
tacaaggagg cagtgaggag gatgaagaat ctgccggcag accacccttg gaactactac 480
cagcaggcga acatccactg ccagtattgc aactacgcct accaccagca aaataccgac 540
gacgtgccca tccaggtcca cttcagctgg atcttcctcc catggcaccg ctactacctc 600
cacttctacg aaaggatcct cggcaagctc atcgacgacg acaccttcac catcccattc 660
tggaactggg acaccaagga cggcatgacg ttccccgcca tcttccagga tgcggcatcc 720
ccgctgtacg acccgaaacg cgaccaacgc cacgtcaagg acggcaagat cctcgacctc 780
aagtacgcct acaccgaaaa cactgcatcc gacagcgaga tcatacggga gaacctctgc 840
ttcatacaga agacgttcaa gcacagcctg tcgctggcgg aactgttcat gggggatccc 900
gtgcgcgcgg gggagaagga gatccaggag gctaatgggc agatggaagt catccacaat 960
gcggcgcaca tgtgggtcgg agagccggac ggatacaagg aaaacatggg ggacttctcc 1020
accgccgccc gcgattctgt tttcttctgc caccattgca atgtcgaccg catgtgggac 1080
atctaccgca acctccgcgg caaccgcgtc gagttcgaag acaacgactg gttggacagc 1140
accttcctct tccacgacga gaacgaacag ctcgtcaaag tcaagatgag cgactgcctc 1200
aacccgacca agcttcggta cacgttcgaa caagttcccc tcccatggct gggcaaaatc 1260
aattgccaga agacggcaga gacgaagtcc aaggccacga cggagctgtc gctgacgcgc 1320
gtgaacgaat tcgggacgac ggcccaggca ctcgacgcga gcaacccgct gcgggtgatc 1380
gtggcaaggc cgaagaagaa ccgcaagaag aaggagaagc aagagaaggt ggaggtgatt 1440
cagatcaagg atattcaggt gaccaccaac gagacagctc gcttcgacgt ctacgtcgcg 1500
gttccttacg gtgacctcgc cggacccgac tacggcgagt tcgcgggcag ctacgtgagg 1560
ctggcgcata ggatgaaggg aagcgacggg accgcagagc aggtccccaa gaagaaggga 1620
aagctcaagc tgggtattac gccgctgctc gaggacatcg atgctgagga cgccgacaag 1680
ttggtggtca ccctggttct ccgcaccggg agcgtcaccg tgggcggagt ttccatcaat 1740
ctcctgcaga cagattctac cgccgccatc 1770
<210> 52
<211> 1524
<212> DNA
<213> 小果野蕉
<400> 52
atgtccctgc tgttgaactc tagcttcacc ggtgcttcct ctgcatgcct cctccaacgg 60
gaaaggtccc gccgccgccg cctccacgtc cctggcgtga catgccgcca gggcagtaat 120
ggtgaccgca gcgatgccgc ccgccagcag cagtcgccgc tgctgctgga tcggcgcgac 180
atgctgttgg gtttaggagg gctttacggc gtgaccgcag gacccaaggt tctggcggcg 240
ccgataatgc cgccggatct gtccaagtgc taccctgcca ccgcacctgc cctcgacaac 300
aaatgctgcc cgccttacgt ccccggcgag acgatctcgg agtacagctt ccctgctacg 360
cccctccggg tgcggcggcc ggcccatatc gtgaaggacg atcaggagta tatggacaag 420
tacaaggagg cagtgaggag gatgaagaat ctgccggcag accacccttg gaactactac 480
cagcaggcga acatccactg ccagtattgc aactacgcct accaccagca aaatgccgac 540
gacgtgccca tccaggtcca cttcagctgg atcttcctcc catggcaccg ctactacctc 600
cacttctacg aaaggatcct cggcaagctc atcgacgacg acaccttcac catccccttc 660
tggaactggg acaccaagga cggcatgacg ttccccgcca tcttccagga tgcggcatcc 720
ccgctgtacg acccgaaacg cgaccaacgc cacgtcaagg acggcaagat cctcgacctc 780
aagtacgcca tcaccgaaaa tgaaagcact gcatccgaca gcgagatcat acgggagaac 840
ctctgcttca tacagaagac gttcaagcac agcctgtcgc tggcggagct gttcatgggg 900
gatcccgtgc gcgcggggga gaaggagatc caggaggcta acgggcagct ggaagtcatc 960
cacaatgcgg tgcacagttg ggtcggagag ccgagcggaa actatgaaga catgggcgac 1020
ttctccaccg ccgcccgcga ttctgttttc ttctgccacc attgcaatgt cgaccgcatg 1080
tgggacatct accgcaacct ccgcggcaac cgcgtcgagt tcgaagacaa cgactggttg 1140
gacagcacct tcctcttcca cgacgagaac gagcagctcg tcaaagtcaa gatgagggac 1200
tgcctcaacc cgaccaagct tcggtacacg ttcgagcaag ttcccctccc atggctgggc 1260
aaaatcaatt gccagaagac ggcagagacg aagtccaagg ccacgacgga gctgtcgctg 1320
aatcgcgtga acgaattcgg aacgacggcc caggcactcg acgcgagcaa cccgctgcgg 1380
gtgatcgtgg caaggccgaa gaagaaccgc aagaagaagg agaagcaaga gaaggtggag 1440
gtgattcaga tcaaggatat taaggtgacc accaacgaga cagctcgctt cgacgtctac 1500
gtcgcggtcc cttacggtga cctc 1524
<210> 53
<211> 1776
<212> DNA
<213> 小果野蕉
<400> 53
atgtctctcc tgttgaactc tagcctcacc ggagcttcct ctgcatgcct cctccgtcga 60
gaaaagtgcc gccgccgcgg ccgcggtcac gtccacggcg tgacatgcca ccaggggggt 120
aatgatgacc gcagagaggc cgcccggcag cagcggtccc ggttgctgct ggatcggcgc 180
gacatgctgt tgggggggtt aggagggctt tacggcgtga ccgcagggcc caaagttctg 240
gcggagccga taatgccgcc tgatctgtcc aagtgccacg atgccaacgc acctgccctc 300
gacaaccact gctgcccgcc ttacagtggc agcgagacga tcttggagta cgacttcccc 360
gctacgcccc tccgggtgcg gcgaccggcc cacctcgtga aggatgatca ggagtatatg 420
gacaagtaca aggaggccgt gaggaggatg aagaacctgc cggcagaaca cccttggaac 480
tactaccagc aggcgaacat ccactgccag tattgcaacg acgcctacta ccagcaaaat 540
accgacgacg tgcccgtcca ggtccacttc agctggatct tcctcccctg gcaccgctac 600
tacctccact tctacgagcg gatcctcggc aagctcatcg acgacgacac cttcaccatc 660
cccttctgga actgggacac caaggacggc atgacgttcc ccgccatctt ccaggatgcg 720
gcatccccgc tgtacgaccc gaaacgcgac caacgtcacg tcaaggacgg cgcgatcctc 780
gacctcaagt acgcctacac cgaaaacact gcatccgaca gcgagatcat acgggagaac 840
ctctgcttca tacaaaagac gttcaagcac agcctgtcgc tggcggagct gttcatgggg 900
gatcccgtgc gcgcggggga gaaggagatc caggaggcaa acgggcagct ggaagtcatc 960
cacaatgcgg cgcacatgtg ggtcggagag ccggacggat acaaggaaaa catgggcgac 1020
ttctccaccg cggcccgcga ttctgttttc ttctgccacc attccaatgt cgaccgcatg 1080
tgggacatct accgcaacct ccgcggcaac agcgtcgagt tcaacgacaa agactggttg 1140
cacagcacct tcctcttcca tgacgagaac gagcagctcg tcaaagtcaa gatccaggac 1200
tgccttaacc cgaccaagct tcggtacacg ttcgagcaag ttcccctccc atggctgggc 1260
aatataaatt gccagaagac ggcagagacg aagtccaagt ccacggcaga gctgtcgctg 1320
aagcgggtgg gcgaattcgg gacgacaccc aaggcgctcg acgcgagcaa cccgctgcgg 1380
gtgatcgtgg caaggccgaa gaagaaccgc aagaagaagg agaagcaaga gaaggtggag 1440
gtgctccaga tcaaggatat taaggtgacc accaacgaga cagctcgctt cgacgtctac 1500
gtcgccgttc cttacggtga cctcgccgga cccgactacg gcgagttcgt gggcagcttc 1560
gttaggctgg cgcataggaa gaagggaagc gacgggaccg aagagcaggg ccccaagaag 1620
aagggaaagc tcaagctggg tattacggcg ctgctcgagg acatcgatgc tgaggacgcc 1680
gacaagttgg tggtcaccct ggttctccgc accgggagcg tcactgtggg tggagtttcc 1740
atcaaactcc tgcagacaga tactcccgcc gtcatc 1776
<210> 54
<211> 1512
<212> DNA
<213> 小果野蕉
<400> 54
atggctctcc agttgaactc tagcttcacc ggagcttcct ctgcatgcct cctccatcgg 60
gaaaggtccc gccgcctcaa cgtccctgtc gtgacatgcc gccaggggaa taatgatgat 120
cgcagcgatg ccgctcgcca gcagaaatcc ccgtcactac tggatcggcg cgacatgctg 180
ctggggttag gagggcttta tggcttgacc gcaggaccca aagttctggc gaagccgata 240
atgccgcctg atctgtccaa gtgccacgat gccaaggcac ctgccctcga caaccactgc 300
tgcccgcctt acaaccccag cgagacgatc tcggagtacg gcttccccgc tacgcccctc 360
cgggtgcggc ggccggccca cctcgtgaag gacgatcagg agtatttgga caagtacaag 420
gaggccgtga ggaggatgaa gaatctgccg gcagaccacc cttggaacta ctaccagcag 480
gcgaacgtcc actgccagta ctgcaactac gcctactacc agcaaaatac cgacgacgtg 540
cccgtccagg tccacttcag ctggatcttc ctcccctggc accgctacta cctccacttc 600
tacgagcgga tcctcggcaa gctcatcgac gacgacacct tcaccatccc cttctggaac 660
tgggacacca aggacggcat gacgttcccc gccatcttcc acgaagagtc atccccgctg 720
tctgacacga aacgcgacca acgccacgtc aaggatggca agatcgtcga cctcaagtac 780
gcctacaccg aaaaccctgc ctccaacagc gagatcattc gagagaacct ctgcttcata 840
cagaagacgt tcaagcacag cctgtcgctg gcggagctgt tcatggggga tcccgtgcgc 900
gcgggggaga aggagatcca ggaggctaac gggcagctgg aagtcatcca caatgcggtg 960
cacatgtggg tcggagagcc gtgcggatac aaggaaaaca tgggcgattt ctctaccgcc 1020
gcccgcgatt ctgttttctt cagccaccac tccaatgtcg accgcttgtg ggaaatctac 1080
cggaacctcc gcggtaaccg cattgagttc gaagacaacg actggttgga cagtaccttc 1140
ctcttctacg acgagaacga gaagctcgtc aaagtcaaga tgggggactg cctcaacccg 1200
accaagcttc ggtatacgtt cgagcaagtt cctctcccat ggctgggcaa aattaattgc 1260
cagaagacga cagagacgaa gtccaagtcc acgacagaga tgtcgctgac gcgcgtggga 1320
gaattcggga cgacgcccaa ggcgctcgac gcgagcaacc cgctgcgggt gatcgtggca 1380
aggccgaaaa agaaccgcaa gaagaaggag aagcaagaga aggtggaggt gcttcagatc 1440
aatgatatta aggtgaccac caacgagaca gctcgcttcg acgtctacgt cacggttcct 1500
tacggtgacc tc 1512
<210> 55
<211> 1770
<212> DNA
<213> 小果野蕉
<400> 55
atggtcagcc ttcctaaagc tactcttcct ctctcctccc tctcccctcc ctccaactcc 60
aactccaact ccaactccaa ctcctttgca tgcgccttcc atttttctta ccctgataga 120
agacgccatg cccactccaa gatctcatgc aaagctagcg atgagcatga gatgactgca 180
aatgccaagc tcgaccgccg cgatgtgctc gttggcctcg gcgggctttg tggagccgct 240
gccggcctcg ggatcgacgg taaggccctc ggaaatccca tccaggcgcc tgatcttacc 300
aagtgcggcc ccgccgatct acccaaaggt gcaacgccca ccaactgctg cccgccttat 360
tttcccgaca agaagattat cgatttcaag cgtccgccga attcgtcacc cctccgtgtc 420
cgcccggccg cccacttggt cgactccgac tacctggaca agtataagaa ggcggtggag 480
ctcatgagag cactgccggc cgacgatccg cgcaacttca tgcagcaggc caatgtccac 540
tgcgcttact gtgatggcgc ctacgaccag atcggcttcc ccaacctcga gctccaagtc 600
cacaactcct ggctcttctt cccttggcac cgcttctacc tctacttcca cgagaggatc 660
ctcggcaagc tcataggcga cgacactttc gcccttcctt tctggaactg ggacgcgccc 720
ggcggcatga agctgccgtc gatctacgcc gatccttcgt cctcgctcta tgacaagttt 780
cgcgacgcca agcaccagcc gccggtcctc gtcgacctcg actacaacgg aaccgaccct 840
agtttcaccg acgcagagca gatcgatcag aacctcaaga tcatgtaccg gcaggtgatc 900
tccaacggca agacgccgtt gctgttctta ggctcggctt accgtgccgg cgacaaccca 960
aaccccggcg cgggctcgct cgagaacata ccacacggcc ccgtccacgg gtggactggc 1020
gacagaaacc aacccaatct cgaggacatg ggcaacttct actccgcggg gcgcgaccct 1080
atcttcttcg cccaccattc aaacgtcgac cgcatgtggt acttgtggaa gaagctcggc 1140
gggaagcatc aggactttaa cgataaggac tggctcaaca ccaccttcct cttctacgac 1200
gagaatgctg acttagttcg agtcaccctc aaggactgct tgcagccgga gtggcttcgt 1260
tacgattacc aagacgtcga gatcccgtgg ctgaagaccc ggccgactcc caaagccttg 1320
aaggcgcaga aaaccgcagc gaaaacactg aaagctacag cagagacgcc gttcccggtg 1380
acgctgcaat ccgcggtgag cacgacggtg aggaggccca aggtatcgag gagcggcaag 1440
gagaaggaag aggaagagga ggtcctcatc gtggagggga tcgagttcga ccgcgactac 1500
ttcataaagt tcgacgtctt cgtgaacgcc accgagggtg agggcatcac gccgggcgcc 1560
agcgagttcg cgggcagctt cgtcaacgtc ccgcacaagc acaagcacag caagaaggag 1620
aagaagctga agacgaggct ctgcctgggg atcactgacc tgctcgagga catcggggcg 1680
gaggacgacg acagcgtgct cgtcaccatc gtcccgaaag ccggaaaggg caaggtgtcg 1740
gtcgccggcc tccgcatcga tttcccaaat 1770
<210> 56
<211> 1722
<212> DNA
<213> 小果野蕉
<400> 56
atggagggca aacgatggtt gtctcttctc ctcctcgtgc ttgttctcgt gggcatctcc 60
atggatctcc cgagagaagc tccggcggct tcttctaata tcttgaagag ttcatctgcc 120
agaataccag tgaatcctca aggcggagaa cagagagatg gaagcaagag caagggcatt 180
cccctcaaag caaacctatc ggtttgccat gcttcattct cggacgccga tcgtccggtc 240
tactgttgcc cggcctggaa agacgccgac caaaccttgc tcgacttcga gttcccggat 300
ccgtcgtcgc cggtgcgtat ccgacggcct gcccatctcg tcgacgagga gttcgtggcc 360
aagtatgaga gggcggtggc catcatgaag cagatcccgc ctgaccatcc ccacaacttt 420
tggcgccagg ccaacatgca ctgcctctac tgcaccggcg cctacgacca gatgaactcc 480
tctgccctct tcaagatcca caggtcatgg ctcttcttcc cctggcaccg agccttcatc 540
tatttccacg agcgcatcct cgggaagttc atgggagacg acaccttcgc gctcccctac 600
tggagctggg acacccccga gggcatgtgg ttccccgaca tctaccggaa gggagctctg 660
aatgagacgg agcgcgacgc cattcaccta cgggaggccg ccgtcgatga cttcgactac 720
gtggatcatg acctcgccag cgacgtgcag atcgccgaca acctcgcgtt catgtaccac 780
cagatgatct cgggagcgaa gaagaccgag ctgttcatgg gttgcaagct gcggtccggc 840
gtcgaggggt ggtgtgatgg gcccgggacg atcgaagcgg cacctcacaa cacgttgcac 900
agctgggtag ggaacaggta caaccccgaa agagagaaca tgggggcgtt ctactccgcc 960
gcgcgagacg aagtgttctt cgcgcaccac tccaacatcg accgcatgtg gaccgtgtgg 1020
aagaagctgc acggcgacaa gccggagttc gtcgaccagg agtggctcga gtcggagttc 1080
accttctacg acgagaatgt gcgcctgcgc aggatcaagg tgcgcgacgt gttgaacata 1140
gacaaactca ggtaccggta cgaagacatc gacatgccat ggctcgctgc acgtcccaag 1200
ccttccgttc accctaagat cgcgcgcgac atattgaaga agcgcaatgg cgaaggcgta 1260
ctgagaatgc ccggcgaaac ggatcgttca caactctccg aatatggtag ctggacactg 1320
gacaagacca tcaccgtgag ggttgacagg ccaaggatca acaggacagg gcaagaaaaa 1380
gaggaagaag aggagatctt attggtctac ggaatcgata ctaagagaag cagattcgtc 1440
aaattcgatg tgttcatcaa cgtcgtcgac gaaaccgtgc tgagcccaaa gtcgagggag 1500
ttcgcaggga ccttcgtcaa tctccaccac gtctcgagga cgaaaagcca tgacgatggc 1560
ggcatggatt cgaagatgaa aagccacctt aagctcggta tatcggaact tttggaagac 1620
ctcgaggcag acgaagatga cagcatctgg gtgacactgg tgccaagagg cggcacgggg 1680
gtcaacacca ccgtggacgg cgtccggatc gactacatga ag 1722
<210> 57
<211> 39
<212> DNA
<213> 人工序列
<220>
<223> ALS1 Ma06_g18100
<400> 57
gtcgccatca ccggccaggt ccctcgccgc atgatcggc 39
<210> 58
<211> 13
<212> PRT
<213> 人工序列
<220>
<223> ALS1 Ma06_g18100
<400> 58
Val Ala Ile Thr Gly Gln Val Pro Arg Arg Met Ile Gly
1 5 10
<210> 59
<211> 39
<212> DNA
<213> 人工序列
<220>
<223> ALS2 Ma10_g11980
<400> 59
gtcgccatca ccggccaggt tccccgccgc atgatcggc 39
Claims (23)
1.一种产生在一个或多个靶序列中包括至少一种所需突变的至少一种香蕉胚或植物的方法,所述方法包括:
a.使香蕉细胞与以下试剂接触:可操作以在所述靶序列中引入至少一种所需突变的至少一种试剂和可操作以在至少一种内源乙酰乳酸合酶(ALS)基因中引入至少一种突变以便提供对ALS抑制剂的抗性的至少一种试剂;
b.任选地培养所述香蕉细胞以产生香蕉细胞群体、组织或植物;和
c.用ALS抑制剂处理所述香蕉细胞、细胞群体、组织或植物以选择在至少一种内源乙酰乳酸合酶基因中被突变并且在所述靶序列中包括所需突变的至少一种香蕉细胞、组织或植物;和
d.从步骤(c)中选择的所述香蕉细胞、组织或植物产生至少一种香蕉胚或植物。
2.根据权利要求1所述的方法,其中所述香蕉细胞是胚性细胞,例如其中所述香蕉细胞是在胚性细胞悬浮液中的胚性细胞,并且步骤(a)包括接触胚性细胞悬浮液。
3.根据权利要求1或权利要求2所述的方法,其中所述可操作以引入突变的试剂包括碱基编辑器和向导RNA,或包括编码碱基编辑器和向导RNA的至少一种核酸构建体。
4.根据权利要求3所述的方法,其中所述碱基编辑器是融合多肽,所述融合多肽包括经修饰的Cas,例如nCas9或dCas9,和胞苷脱氨酶部分或腺嘌呤脱氨酶部分,例如,选自以下的碱基编辑器:APOBEC、BE1、BE2、BE3、HF-BE3、BE4、BE4max、BE4-GAM、YE1-BE3、EE-BE3、YE-BE3、YEE-BE3、VQR-BE3、VRER-BE3、Sa-BE3、Sa-BE4、SaBE4-Gam、SaKKH-BE3、Cas12a-BE、Target-AID、Target-AID-NG、xBE3、eA3A-BE3、A3A-BE3、BE-PLUS、TAM、CRISPR-X、ABE7.9、ABE7.10、ABE7.10*、xABE、ABESa、VQR-ABE、VRER-ABE和SaKKH-ABE。
5.根据权利要求1或权利要求2所述的方法,其中所述突变是通过HDR引入的,并且所述可操作以引入突变的试剂包括序列特异性核酸内切酶、供体模板和任选地向导RNA,或包括至少一种编码序列特异性核酸内切酶、供体模板和任选地向导RNA的核酸构建体。
6.根据任一项前述权利要求所述的方法,其中所述可操作以在所述靶序列中引入所述至少一种所需突变的试剂包括靶向所述靶序列的向导RNA,并且所述可操作以在内源ALS基因中引入突变的试剂包括靶向所述内源ALS1基因或内源ALS2基因或两者的向导RNA,或其中所述可操作以在所述靶序列中引入所述至少一种所需突变的试剂包括编码靶向所述靶序列的向导RNA的至少一种核酸构建体,并且所述可操作以在内源ALS基因中引入突变的试剂包括编码靶向所述内源ALS1基因或内源ALS2基因或两者的向导RNA的至少一种核酸构建体。
7.根据任一项前述权利要求所述的方法,其中所述试剂包括核酸构建体并且所述接触包括农杆菌(Agrobacterium)介导的转化、粒子轰击、原生质体转化、纳米粒子介导的转染或电穿孔。
8.根据任一项前述权利要求所述的方法,其中步骤(c)包括在ALS抑制剂的存在下孵育胚性细胞悬浮液,或步骤(c)包括在包含ALS抑制剂的胚发育培养基(EDM)上培养胚。
9.根据权利要求8所述的方法,其中步骤(c)包括在包含ALS抑制剂胚性细胞悬浮液的胚发育培养基上培养胚8-20周,例如8-16、10-16、10-14、12-14或约13周。
10.根据权利要求8或9所述的方法,其中步骤(c)包括在包含ALS抑制剂胚性细胞悬浮液的胚发育培养基上培养胚,所述ALS抑制剂胚性细胞悬浮液的浓度为10-80μg/L,例如10-70、15-65、15-55、20-55、20-50、20-40、20-30或约25μg/L。
11.根据权利要求8-10中任一项所述的方法,包括在用ALS抑制剂的处理后选择至少一种胚并在没有用ALS抑制剂的进一步处理的情况下发育香蕉植物。
12.根据权利要求8-10中任一项所述的方法,额外包括随后将胚转移至不包含ALS抑制剂的胚发育培养基中。
13.根据任一项前述权利要求所述的方法,包括选择对用ALS抑制剂的处理具有抗性的至少一种香蕉细胞,例如胚、香蕉组织或香蕉植物,并将它们与不具有抗性的细胞、组织或植物物理分离,并且任选地将所述至少一种选择的细胞、组织或植物转移至单独的板或生长培养基中,优选地不含ALS抑制剂。
14.根据任一项前述权利要求所述的方法,包括选择对用ALS抑制剂的处理具有抗性的至少一种香蕉细胞,例如胚、香蕉组织或香蕉植物,并且随后选择还包括所述在靶序列中的所需突变的至少一种香蕉细胞,例如胚、香蕉组织或香蕉植物,例如通过对所述细胞、组织或植物进行基因分型。
15.根据任一项前述权利要求所述的方法,其中所述可操作以在靶序列中引入至少一种所需突变的试剂和/或所述可操作以在内源ALS基因中引入突变的试剂包括核酸构建体,所述核酸构建体额外编码选择性标记物,并且其中所述方法任选地包括选择不携带所述选择性标记物的胚或植物。
16.根据任一项前述权利要求所述的方法,其中所述内源乙酰乳酸合酶基因是内源乙酰乳酸合酶1(ALS1)基因或所述内源乙酰乳酸合酶2(ALS2)基因。
17.根据任一项前述权利要求所述的方法,其中所述在内源ALS基因中的突变是在编码的氨基酸序列中引入取代的取代,优选地在香蕉ALS1中的Pro-187处或ALS2中的Pro-181处,最优选地在ALS1中的Pro187Ser或在ALS2中的Pro181Ser。
18.根据任一项前述权利要求所述的方法,其中所述ALS抑制剂是磺酰脲、咪唑啉酮、三唑并嘧啶、嘧啶基氧苯甲酸酯(pyrimidinyl oxybenzoate)或磺酰基氨基羰基三唑啉酮,优选地其中所述ALS抑制剂是绿磺隆。
19.根据任一项前述权利要求所述的方法,其中步骤(c)包括用所述ALS抑制剂处理所述香蕉细胞,并且所述处理包括提供香蕉细胞群体且所述群体中至少90%,例如至少95%、96%、97%、98%或99%,的细胞包括突变的ALS,任选地其中至少1%、5%、10%或15%的细胞不包括异源DNA,进一步任选地其中至少0.1%、1%、2%、3%、4%或5%的细胞包括所述在内源乙酰乳酸合酶基因中的突变并且包括所述在靶序列中的所需突变。
20.根据任一项前述权利要求所述的方法,其中通过所述方法产生的所述至少一种香蕉胚或植物不包括异源DNA。
21.根据任一项前述权利要求所述的方法,所述靶序列为ACO、ACS或PPO,例如ACO1、ACO2、ACS1、ACS2或PPO2,或例如PPO1、PPO2、PPO8、PPO9或PPO4。
22.根据任一项前述权利要求所述的方法,进一步包括从步骤(c)中选择的至少一种细胞或组织生长香蕉植物,任选地进一步包括从所述香蕉植物收获果实,任选地进一步包括将所述果实加工成食品产品。
23.通过根据任一项前述权利要求所述的方法产生的香蕉细胞、细胞群体、组织或植物。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB2109586.4 | 2021-07-02 | ||
GBGB2109586.4A GB202109586D0 (en) | 2021-07-02 | 2021-07-02 | Method for editing banana genes |
PCT/EP2022/068060 WO2023275245A1 (en) | 2021-07-02 | 2022-06-30 | Method for editing banana genes |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117858952A true CN117858952A (zh) | 2024-04-09 |
Family
ID=77274500
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202280056874.1A Pending CN117858952A (zh) | 2021-07-02 | 2022-06-30 | 编辑香蕉基因的方法 |
Country Status (6)
Country | Link |
---|---|
EP (1) | EP4363593A1 (zh) |
CN (1) | CN117858952A (zh) |
AU (1) | AU2022303083A1 (zh) |
GB (1) | GB202109586D0 (zh) |
IL (1) | IL309850A (zh) |
WO (1) | WO2023275245A1 (zh) |
Family Cites Families (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0270496B1 (de) | 1986-12-05 | 1993-03-17 | Ciba-Geigy Ag | Verbessertes Verfahren zur Transformation von pflanzlichen Protoplasten |
US5015580A (en) | 1987-07-29 | 1991-05-14 | Agracetus | Particle-mediated transformation of soybean plants and lines |
US5416011A (en) | 1988-07-22 | 1995-05-16 | Monsanto Company | Method for soybean transformation and regeneration |
US5302523A (en) | 1989-06-21 | 1994-04-12 | Zeneca Limited | Transformation of plant cells |
US7705215B1 (en) | 1990-04-17 | 2010-04-27 | Dekalb Genetics Corporation | Methods and compositions for the production of stably transformed, fertile monocot plants and cells thereof |
US5550318A (en) | 1990-04-17 | 1996-08-27 | Dekalb Genetics Corporation | Methods and compositions for the production of stably transformed, fertile monocot plants and cells thereof |
WO1991010725A1 (en) | 1990-01-22 | 1991-07-25 | Dekalb Plant Genetics | Fertile transgenic corn plants |
US5484956A (en) | 1990-01-22 | 1996-01-16 | Dekalb Genetics Corporation | Fertile transgenic Zea mays plant comprising heterologous DNA encoding Bacillus thuringiensis endotoxin |
US6403865B1 (en) | 1990-08-24 | 2002-06-11 | Syngenta Investment Corp. | Method of producing transgenic maize using direct transformation of commercially important genotypes |
US5384253A (en) | 1990-12-28 | 1995-01-24 | Dekalb Genetics Corporation | Genetic transformation of maize cells by electroporation of cells pretreated with pectin degrading enzymes |
EP0604662B1 (en) | 1992-07-07 | 2008-06-18 | Japan Tobacco Inc. | Method of transforming monocotyledon |
AU670316B2 (en) | 1992-07-27 | 1996-07-11 | Pioneer Hi-Bred International, Inc. | An improved method of (agrobacterium)-mediated transformation of cultured soybean cells |
US5693512A (en) | 1996-03-01 | 1997-12-02 | The Ohio State Research Foundation | Method for transforming plant tissue by sonication |
US5981840A (en) | 1997-01-24 | 1999-11-09 | Pioneer Hi-Bred International, Inc. | Methods for agrobacterium-mediated transformation |
WO2000042207A2 (en) | 1999-01-14 | 2000-07-20 | Monsanto Technology Llc | Soybean transformation method |
BRPI0814244B1 (pt) | 2007-06-07 | 2018-10-09 | Agriculture And Agri Food Canada | método para obtenção de uma planta geneticamente engenheirada |
US8722410B2 (en) | 2007-10-05 | 2014-05-13 | Dow Agrosciences, Llc. | Methods for transferring molecular substances into plant cells |
ES2710179T3 (es) | 2009-09-07 | 2019-04-23 | Nobel Biocare Services Ag | Conjunto de implantación |
MX2016002306A (es) | 2013-08-22 | 2016-07-08 | Du Pont | Promotor u6 de polimerasa iii de soja y metodos de uso. |
WO2018202199A1 (en) | 2017-05-05 | 2018-11-08 | Institute Of Genetics And Developmental Biology, Chinese Academy Of Sciences | Methods for isolating cells without the use of transgenic marker sequences |
-
2021
- 2021-07-02 GB GBGB2109586.4A patent/GB202109586D0/en not_active Ceased
-
2022
- 2022-06-30 IL IL309850A patent/IL309850A/en unknown
- 2022-06-30 EP EP22743465.1A patent/EP4363593A1/en active Pending
- 2022-06-30 WO PCT/EP2022/068060 patent/WO2023275245A1/en active Application Filing
- 2022-06-30 CN CN202280056874.1A patent/CN117858952A/zh active Pending
- 2022-06-30 AU AU2022303083A patent/AU2022303083A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
EP4363593A1 (en) | 2024-05-08 |
GB202109586D0 (en) | 2021-08-18 |
AU2022303083A1 (en) | 2024-01-18 |
IL309850A (en) | 2024-02-01 |
WO2023275245A1 (en) | 2023-01-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP2023113841A (ja) | オリゴヌクレオチド仲介型遺伝子修復を使用した標的遺伝子修飾の効率を高めるための方法および組成物 | |
US9756871B2 (en) | TAL-mediated transfer DNA insertion | |
US11519000B2 (en) | Methodologies and compositions for creating targeted recombination and breaking linkage between traits | |
CN106795524A (zh) | 使用向导rna/cas内切核酸酶系统改变农学性状及其使用方法 | |
US20230365984A1 (en) | Compositions and methods for increasing shelf-life of banana | |
US10988775B2 (en) | Wheat plants resistant to powdery mildew | |
US11773398B2 (en) | Modified excisable 5307 maize transgenic locus lacking a selectable marker | |
US20220364105A1 (en) | Inir12 transgenic maize | |
US11369073B2 (en) | INIR12 transgenic maize | |
US11326177B2 (en) | INIR12 transgenic maize | |
US11359210B2 (en) | INIR12 transgenic maize | |
US20220030822A1 (en) | Inht26 transgenic soybean | |
WO2022026554A1 (en) | Inir12 transgenic maize | |
CN117858952A (zh) | 编辑香蕉基因的方法 | |
CN110959043A (zh) | 利用bcs1l基因和向导rna/cas核酸内切酶系统改良植物农艺性状的方法 | |
EP4311430A1 (en) | Chlorotoluron tolerance gene and methods of use thereof | |
Kölzsch | Approaches to generate herbicide resistant Taraxacum koksaghyz by directed and undirected mutagenesis of the acetohydroxyacid synthase | |
CN118647713A (zh) | 用于编辑植物的修饰的细菌 | |
Mahmoud | A first successful effort of heterologous transposons movement in oat genome | |
WO2024215720A1 (en) | Uorf editing to improve plant traits | |
WO2023130031A2 (en) | Inot1824 transgenic maize |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |