CN101583719B - 用于改变纤维长度和/或植物高度的核酸构建体和方法 - Google Patents
用于改变纤维长度和/或植物高度的核酸构建体和方法 Download PDFInfo
- Publication number
- CN101583719B CN101583719B CN200780046680.9A CN200780046680A CN101583719B CN 101583719 B CN101583719 B CN 101583719B CN 200780046680 A CN200780046680 A CN 200780046680A CN 101583719 B CN101583719 B CN 101583719B
- Authority
- CN
- China
- Prior art keywords
- leu
- gly
- ser
- glu
- thr
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims abstract description 36
- 150000007523 nucleic acids Chemical class 0.000 title claims abstract description 36
- 102000039446 nucleic acids Human genes 0.000 title claims abstract description 35
- 108020004707 nucleic acids Proteins 0.000 title claims abstract description 35
- 239000000835 fiber Substances 0.000 title abstract description 35
- 241000196324 Embryophyta Species 0.000 claims abstract description 209
- 108091000080 Phosphotransferase Proteins 0.000 claims abstract description 48
- 210000004027 cell Anatomy 0.000 claims description 68
- 108090000623 proteins and genes Proteins 0.000 claims description 68
- 210000002421 cell wall Anatomy 0.000 claims description 55
- 230000009261 transgenic effect Effects 0.000 claims description 50
- 102000020233 phosphotransferase Human genes 0.000 claims description 36
- 108091033319 polynucleotide Proteins 0.000 claims description 21
- 102000040430 polynucleotide Human genes 0.000 claims description 21
- 239000002157 polynucleotide Substances 0.000 claims description 21
- 241000219000 Populus Species 0.000 claims description 18
- 230000002018 overexpression Effects 0.000 claims description 10
- 235000013311 vegetables Nutrition 0.000 claims description 9
- 241000218631 Coniferophyta Species 0.000 claims description 6
- 244000166124 Eucalyptus globulus Species 0.000 claims description 5
- 241001233957 eudicotyledons Species 0.000 claims description 5
- 101150095161 C4H gene Proteins 0.000 claims description 4
- 101150106671 COMT gene Proteins 0.000 claims description 4
- 241000209510 Liliopsida Species 0.000 claims description 4
- 235000008331 Pinus X rigitaeda Nutrition 0.000 claims description 4
- 235000011613 Pinus brutia Nutrition 0.000 claims description 4
- 241000018646 Pinus brutia Species 0.000 claims description 4
- 101150117325 TUB gene Proteins 0.000 claims description 4
- 235000005205 Pinus Nutrition 0.000 claims description 2
- 241000218602 Pinus <genus> Species 0.000 claims description 2
- 229920001131 Pulp (paper) Polymers 0.000 claims description 2
- 230000008635 plant growth Effects 0.000 claims description 2
- 230000001737 promoting effect Effects 0.000 claims description 2
- 241000219195 Arabidopsis thaliana Species 0.000 abstract description 3
- 239000002028 Biomass Substances 0.000 abstract description 2
- 238000004537 pulping Methods 0.000 abstract 1
- 241000282326 Felis catus Species 0.000 description 45
- 244000166102 Eucalyptus leucoxylon Species 0.000 description 42
- 230000014509 gene expression Effects 0.000 description 22
- 108020004414 DNA Proteins 0.000 description 20
- 108010078144 glutaminyl-glycine Proteins 0.000 description 18
- 108010050848 glycylleucine Proteins 0.000 description 18
- 230000008859 change Effects 0.000 description 17
- 238000006243 chemical reaction Methods 0.000 description 17
- 239000002773 nucleotide Substances 0.000 description 17
- 125000003729 nucleotide group Chemical group 0.000 description 17
- 238000003752 polymerase chain reaction Methods 0.000 description 17
- 150000001413 amino acids Chemical class 0.000 description 16
- 239000002299 complementary DNA Substances 0.000 description 16
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 16
- 210000001519 tissue Anatomy 0.000 description 16
- 108010025306 histidylleucine Proteins 0.000 description 14
- 238000009396 hybridization Methods 0.000 description 14
- 235000018102 proteins Nutrition 0.000 description 13
- 102000004169 proteins and genes Human genes 0.000 description 13
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 12
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 12
- 108010060035 arginylproline Proteins 0.000 description 12
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 12
- UENPHLAAKDPZQY-XKBZYTNZSA-N Glu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O UENPHLAAKDPZQY-XKBZYTNZSA-N 0.000 description 10
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 10
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 10
- 239000002585 base Substances 0.000 description 10
- 108010054813 diprotin B Proteins 0.000 description 10
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 10
- 108090000765 processed proteins & peptides Proteins 0.000 description 10
- 108091028043 Nucleic acid sequence Proteins 0.000 description 9
- 108010047857 aspartylglycine Proteins 0.000 description 9
- 230000012010 growth Effects 0.000 description 9
- 239000003550 marker Substances 0.000 description 9
- 108020004999 messenger RNA Proteins 0.000 description 9
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 8
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 8
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 8
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 8
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 8
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 8
- KHGGWBRVRPHFMH-PEFMBERDSA-N Gln-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHGGWBRVRPHFMH-PEFMBERDSA-N 0.000 description 8
- NMROINAYXCACKF-WHFBIAKZSA-N Gly-Cys-Cys Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O NMROINAYXCACKF-WHFBIAKZSA-N 0.000 description 8
- 241000880493 Leptailurus serval Species 0.000 description 8
- GBDMISNMNXVTNV-XIRDDKMYSA-N Leu-Asp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O GBDMISNMNXVTNV-XIRDDKMYSA-N 0.000 description 8
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 8
- WPTHAGXMYDRPFD-SRVKXCTJSA-N Met-Lys-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O WPTHAGXMYDRPFD-SRVKXCTJSA-N 0.000 description 8
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 8
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 8
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 8
- 244000061176 Nicotiana tabacum Species 0.000 description 8
- 241000168036 Populus alba Species 0.000 description 8
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 8
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 8
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 8
- 108010044940 alanylglutamine Proteins 0.000 description 8
- 108010077245 asparaginyl-proline Proteins 0.000 description 8
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 8
- 238000010353 genetic engineering Methods 0.000 description 8
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 8
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 8
- 108010049041 glutamylalanine Proteins 0.000 description 8
- 108010089804 glycyl-threonine Proteins 0.000 description 8
- 108010012581 phenylalanylglutamate Proteins 0.000 description 8
- 229920001184 polypeptide Polymers 0.000 description 8
- 102000004196 processed proteins & peptides Human genes 0.000 description 8
- 108010061238 threonyl-glycine Proteins 0.000 description 8
- 241000589158 Agrobacterium Species 0.000 description 7
- 241000183024 Populus tremula Species 0.000 description 7
- 229940024606 amino acid Drugs 0.000 description 7
- 235000001014 amino acid Nutrition 0.000 description 7
- 238000007689 inspection Methods 0.000 description 7
- 238000005406 washing Methods 0.000 description 7
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 6
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 6
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 6
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 6
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 6
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 6
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 6
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 6
- 241000193830 Bacillus <bacterium> Species 0.000 description 6
- 241000894006 Bacteria Species 0.000 description 6
- 229920000742 Cotton Polymers 0.000 description 6
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 6
- MQVNVZUEPUIAFA-WDSKDSINSA-N Gly-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN MQVNVZUEPUIAFA-WDSKDSINSA-N 0.000 description 6
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 6
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 6
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 6
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 6
- CIWILNZNBPIHEU-DCAQKATOSA-N His-Arg-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O CIWILNZNBPIHEU-DCAQKATOSA-N 0.000 description 6
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 6
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 6
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 6
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 6
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 6
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 6
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 6
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 6
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 6
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 6
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 6
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 6
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 6
- WTHGNAAQXISJHP-AVGNSLFASA-N Met-Lys-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WTHGNAAQXISJHP-AVGNSLFASA-N 0.000 description 6
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 6
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 6
- 240000004923 Populus tremuloides Species 0.000 description 6
- QGOZJLYCGRYYRW-KKUMJFAQSA-N Pro-Glu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QGOZJLYCGRYYRW-KKUMJFAQSA-N 0.000 description 6
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 6
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 6
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 6
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 6
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 6
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 6
- XGEUYEOEZYFHRL-KKXDTOCCSA-N Tyr-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XGEUYEOEZYFHRL-KKXDTOCCSA-N 0.000 description 6
- RCMWNNJFKNDKQR-UFYCRDLUSA-N Tyr-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 RCMWNNJFKNDKQR-UFYCRDLUSA-N 0.000 description 6
- RIVVDNTUSRVTQT-IRIUXVKKSA-N Tyr-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O RIVVDNTUSRVTQT-IRIUXVKKSA-N 0.000 description 6
- KHPLUFDSWGDRHD-SLFFLAALSA-N Tyr-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O KHPLUFDSWGDRHD-SLFFLAALSA-N 0.000 description 6
- 108010008355 arginyl-glutamine Proteins 0.000 description 6
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 6
- 108010092854 aspartyllysine Proteins 0.000 description 6
- 230000010261 cell growth Effects 0.000 description 6
- 108010054812 diprotin A Proteins 0.000 description 6
- 210000000630 fibrocyte Anatomy 0.000 description 6
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 6
- 108010081551 glycylphenylalanine Proteins 0.000 description 6
- 108010087823 glycyltyrosine Proteins 0.000 description 6
- 108010027338 isoleucylcysteine Proteins 0.000 description 6
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 108010029020 prolylglycine Proteins 0.000 description 6
- 108010026333 seryl-proline Proteins 0.000 description 6
- 108010073969 valyllysine Proteins 0.000 description 6
- 230000002792 vascular Effects 0.000 description 6
- 244000283070 Abies balsamea Species 0.000 description 5
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 5
- 108091034117 Oligonucleotide Proteins 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 5
- 230000001276 controlling effect Effects 0.000 description 5
- 238000005520 cutting process Methods 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 5
- 239000013613 expression plasmid Substances 0.000 description 5
- 230000002068 genetic effect Effects 0.000 description 5
- 238000003780 insertion Methods 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- 230000007246 mechanism Effects 0.000 description 5
- 239000000523 sample Substances 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 108010051110 tyrosyl-lysine Proteins 0.000 description 5
- 235000007173 Abies balsamea Nutrition 0.000 description 4
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 4
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 4
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 4
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 4
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 4
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 4
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 4
- ITVINTQUZMQWJR-QXEWZRGKSA-N Arg-Asn-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ITVINTQUZMQWJR-QXEWZRGKSA-N 0.000 description 4
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 4
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 4
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 4
- OKZOABJQOMAYEC-NUMRIWBASA-N Asn-Gln-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OKZOABJQOMAYEC-NUMRIWBASA-N 0.000 description 4
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 4
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 4
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 4
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 4
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 4
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 4
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 4
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 4
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 4
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 4
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 4
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 4
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 4
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 4
- LDIKUWLAMDFHPU-FXQIFTODSA-N Cys-Cys-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LDIKUWLAMDFHPU-FXQIFTODSA-N 0.000 description 4
- ZIKWRNJXFIQECJ-CIUDSAMLSA-N Cys-Cys-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZIKWRNJXFIQECJ-CIUDSAMLSA-N 0.000 description 4
- DZIGZIIJIGGANI-FXQIFTODSA-N Cys-Glu-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DZIGZIIJIGGANI-FXQIFTODSA-N 0.000 description 4
- BSFFNUBDVYTDMV-WHFBIAKZSA-N Cys-Gly-Asn Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BSFFNUBDVYTDMV-WHFBIAKZSA-N 0.000 description 4
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 4
- VFGADOJXRLWTBU-JBDRJPRFSA-N Cys-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N VFGADOJXRLWTBU-JBDRJPRFSA-N 0.000 description 4
- HKALUUKHYNEDRS-GUBZILKMSA-N Cys-Leu-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HKALUUKHYNEDRS-GUBZILKMSA-N 0.000 description 4
- YWEHYKGJWHPGPY-XGEHTFHBSA-N Cys-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N)O YWEHYKGJWHPGPY-XGEHTFHBSA-N 0.000 description 4
- 240000001414 Eucalyptus viminalis Species 0.000 description 4
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 4
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 4
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 4
- WHVLABLIJYGVEK-QEWYBTABSA-N Gln-Phe-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WHVLABLIJYGVEK-QEWYBTABSA-N 0.000 description 4
- FTTHLXOMDMLKKW-FHWLQOOXSA-N Gln-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTTHLXOMDMLKKW-FHWLQOOXSA-N 0.000 description 4
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 4
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 4
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 4
- RFDHKPSHTXZKLL-IHRRRGAJSA-N Glu-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N RFDHKPSHTXZKLL-IHRRRGAJSA-N 0.000 description 4
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 4
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 4
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 4
- KXRORHJIRAOQPG-SOUVJXGZSA-N Glu-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KXRORHJIRAOQPG-SOUVJXGZSA-N 0.000 description 4
- LSYFGBRDBIQYAQ-FHWLQOOXSA-N Glu-Tyr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LSYFGBRDBIQYAQ-FHWLQOOXSA-N 0.000 description 4
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 4
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 4
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 4
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 4
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 4
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 4
- QPCVIQJVRGXUSA-LURJTMIESA-N Gly-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QPCVIQJVRGXUSA-LURJTMIESA-N 0.000 description 4
- HHSOPSCKAZKQHQ-PEXQALLHSA-N Gly-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN HHSOPSCKAZKQHQ-PEXQALLHSA-N 0.000 description 4
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 4
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 4
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 4
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 4
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 4
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 4
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 4
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 4
- SVHKVHBPTOMLTO-DCAQKATOSA-N His-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SVHKVHBPTOMLTO-DCAQKATOSA-N 0.000 description 4
- HQKADFMLECZIQJ-HVTMNAMFSA-N His-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N HQKADFMLECZIQJ-HVTMNAMFSA-N 0.000 description 4
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 4
- MVZASEMJYJPJSI-IHPCNDPISA-N His-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC3=CN=CN3)N MVZASEMJYJPJSI-IHPCNDPISA-N 0.000 description 4
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 4
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 4
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 4
- KMBPQYKVZBMRMH-PEFMBERDSA-N Ile-Gln-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KMBPQYKVZBMRMH-PEFMBERDSA-N 0.000 description 4
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 4
- AREBLHSMLMRICD-PYJNHQTQSA-N Ile-His-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AREBLHSMLMRICD-PYJNHQTQSA-N 0.000 description 4
- HUWYGQOISIJNMK-SIGLWIIPSA-N Ile-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HUWYGQOISIJNMK-SIGLWIIPSA-N 0.000 description 4
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 4
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 4
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 4
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 4
- RKQAYOWLSFLJEE-SVSWQMSJSA-N Ile-Thr-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N RKQAYOWLSFLJEE-SVSWQMSJSA-N 0.000 description 4
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 4
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 4
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 4
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 4
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 4
- DUBAVOVZNZKEQQ-AVGNSLFASA-N Leu-Arg-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCN=C(N)N DUBAVOVZNZKEQQ-AVGNSLFASA-N 0.000 description 4
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 4
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 4
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 4
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 4
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 4
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 4
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 4
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 4
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 4
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 4
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 4
- RNYLNYTYMXACRI-VFAJRCTISA-N Leu-Thr-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O RNYLNYTYMXACRI-VFAJRCTISA-N 0.000 description 4
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 4
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 4
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 4
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 4
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 4
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 4
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 4
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 4
- TVOOGUNBIWAURO-KATARQTJSA-N Lys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N)O TVOOGUNBIWAURO-KATARQTJSA-N 0.000 description 4
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 4
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 4
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 4
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 4
- VUYCNYVLKACHPA-KKUMJFAQSA-N Phe-Asp-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VUYCNYVLKACHPA-KKUMJFAQSA-N 0.000 description 4
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 4
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 4
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 4
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 4
- 241000202951 Populus grandidentata Species 0.000 description 4
- 235000011263 Populus tremuloides Nutrition 0.000 description 4
- XZGWNSIRZIUHHP-SRVKXCTJSA-N Pro-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 XZGWNSIRZIUHHP-SRVKXCTJSA-N 0.000 description 4
- QNZLIVROMORQFH-BQBZGAKWSA-N Pro-Gly-Cys Chemical compound C1C[C@H](NC1)C(=O)NCC(=O)N[C@@H](CS)C(=O)O QNZLIVROMORQFH-BQBZGAKWSA-N 0.000 description 4
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 4
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 4
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 4
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 4
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 4
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 4
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 4
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 4
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 4
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 4
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 4
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 4
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 4
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 4
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 4
- VISUNEBASWEMCU-SZMVWBNQSA-N Trp-Glu-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N VISUNEBASWEMCU-SZMVWBNQSA-N 0.000 description 4
- ZNFPUOSTMUMUDR-JRQIVUDYSA-N Tyr-Asn-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZNFPUOSTMUMUDR-JRQIVUDYSA-N 0.000 description 4
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 4
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 4
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 4
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 4
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 4
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 4
- OXGVAUFVTOPFFA-XPUUQOCRSA-N Val-Gly-Cys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OXGVAUFVTOPFFA-XPUUQOCRSA-N 0.000 description 4
- SVFRYKBZHUGKLP-QXEWZRGKSA-N Val-Met-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVFRYKBZHUGKLP-QXEWZRGKSA-N 0.000 description 4
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 4
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 4
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 4
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 4
- 108010005233 alanylglutamic acid Proteins 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 238000013459 approach Methods 0.000 description 4
- 239000006071 cream Substances 0.000 description 4
- 108010016616 cysteinylglycine Proteins 0.000 description 4
- 108010060199 cysteinylproline Proteins 0.000 description 4
- 108010069495 cysteinyltyrosine Proteins 0.000 description 4
- 238000013461 design Methods 0.000 description 4
- 239000013604 expression vector Substances 0.000 description 4
- 230000006870 function Effects 0.000 description 4
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 4
- 108010010147 glycylglutamine Proteins 0.000 description 4
- 108010077515 glycylproline Proteins 0.000 description 4
- 108010037850 glycylvaline Proteins 0.000 description 4
- 108010092114 histidylphenylalanine Proteins 0.000 description 4
- 108010018006 histidylserine Proteins 0.000 description 4
- 108010000761 leucylarginine Proteins 0.000 description 4
- 108010003700 lysyl aspartic acid Proteins 0.000 description 4
- 108010054155 lysyllysine Proteins 0.000 description 4
- 239000001814 pectin Substances 0.000 description 4
- 229920001277 pectin Polymers 0.000 description 4
- 235000010987 pectin Nutrition 0.000 description 4
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 4
- 108010051242 phenylalanylserine Proteins 0.000 description 4
- 238000002360 preparation method Methods 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- 210000002966 serum Anatomy 0.000 description 4
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 4
- 238000012546 transfer Methods 0.000 description 4
- 108010003137 tyrosyltyrosine Proteins 0.000 description 4
- SPIPSJXLZVTXJL-ZLUOBGJFSA-N Asn-Cys-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O SPIPSJXLZVTXJL-ZLUOBGJFSA-N 0.000 description 3
- 108091026890 Coding region Proteins 0.000 description 3
- 244000165963 Eucalyptus camaldulensis Species 0.000 description 3
- 235000004692 Eucalyptus globulus Nutrition 0.000 description 3
- 235000019134 Eucalyptus tereticornis Nutrition 0.000 description 3
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 3
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- WJUYPBBCSSLVJE-CIUDSAMLSA-N His-Asn-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N WJUYPBBCSSLVJE-CIUDSAMLSA-N 0.000 description 3
- 241000446313 Lamella Species 0.000 description 3
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 3
- 240000004658 Medicago sativa Species 0.000 description 3
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 description 3
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 3
- 241000218982 Populus nigra Species 0.000 description 3
- 241000218976 Populus trichocarpa Species 0.000 description 3
- 235000008572 Pseudotsuga menziesii Nutrition 0.000 description 3
- 240000001416 Pseudotsuga menziesii Species 0.000 description 3
- SOACHCFYJMCMHC-BWBBJGPYSA-N Ser-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O SOACHCFYJMCMHC-BWBBJGPYSA-N 0.000 description 3
- DHPPWTOLRWYIDS-XKBZYTNZSA-N Thr-Cys-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O DHPPWTOLRWYIDS-XKBZYTNZSA-N 0.000 description 3
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 3
- 240000007313 Tilia cordata Species 0.000 description 3
- 108700019146 Transgenes Proteins 0.000 description 3
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 3
- 244000274883 Urtica dioica Species 0.000 description 3
- 235000009108 Urtica dioica Nutrition 0.000 description 3
- 240000008042 Zea mays Species 0.000 description 3
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 3
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 230000000692 anti-sense effect Effects 0.000 description 3
- 238000009395 breeding Methods 0.000 description 3
- 230000004087 circulation Effects 0.000 description 3
- 235000005822 corn Nutrition 0.000 description 3
- 230000008034 disappearance Effects 0.000 description 3
- 238000006073 displacement reaction Methods 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 230000000968 intestinal effect Effects 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 230000008447 perception Effects 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 210000003491 skin Anatomy 0.000 description 3
- 239000002689 soil Substances 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- -1 tubA1 Proteins 0.000 description 3
- 239000002023 wood Substances 0.000 description 3
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 2
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 2
- SHYYAQLDNVHPFT-DLOVCJGASA-N Ala-Asn-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SHYYAQLDNVHPFT-DLOVCJGASA-N 0.000 description 2
- NJIFPLAJSVUQOZ-JBDRJPRFSA-N Ala-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C)N NJIFPLAJSVUQOZ-JBDRJPRFSA-N 0.000 description 2
- UQJUGHFKNKGHFQ-VZFHVOOUSA-N Ala-Cys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UQJUGHFKNKGHFQ-VZFHVOOUSA-N 0.000 description 2
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 2
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 2
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 2
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 2
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 2
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 2
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 2
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 2
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 2
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 2
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 2
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 2
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 2
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 2
- SFPRJVVDZNLUTG-OWLDWWDNSA-N Ala-Trp-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFPRJVVDZNLUTG-OWLDWWDNSA-N 0.000 description 2
- 241000722948 Apocynum cannabinum Species 0.000 description 2
- 101100484993 Arabidopsis thaliana WAK2 gene Proteins 0.000 description 2
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 2
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 2
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 2
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 2
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 2
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 2
- NAARDJBSSPUGCF-FXQIFTODSA-N Arg-Cys-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N NAARDJBSSPUGCF-FXQIFTODSA-N 0.000 description 2
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 2
- YBIAYFFIVAZXPK-AVGNSLFASA-N Arg-His-Arg Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YBIAYFFIVAZXPK-AVGNSLFASA-N 0.000 description 2
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 2
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 2
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 2
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 2
- FIQKRDXFTANIEJ-ULQDDVLXSA-N Arg-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FIQKRDXFTANIEJ-ULQDDVLXSA-N 0.000 description 2
- IGFJVXOATGZTHD-UHFFFAOYSA-N Arg-Phe-His Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccccc1)C(=O)NC(Cc2c[nH]cn2)C(=O)O IGFJVXOATGZTHD-UHFFFAOYSA-N 0.000 description 2
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 2
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 2
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 2
- STHNZYKCJHWULY-AVGNSLFASA-N Arg-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O STHNZYKCJHWULY-AVGNSLFASA-N 0.000 description 2
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 2
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 2
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 2
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 2
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 2
- PJOPLXOCKACMLK-KKUMJFAQSA-N Arg-Tyr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PJOPLXOCKACMLK-KKUMJFAQSA-N 0.000 description 2
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 2
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 2
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 2
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 2
- NTXNUXPCNRDMAF-WFBYXXMGSA-N Asn-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC(N)=O)C)C(O)=O)=CNC2=C1 NTXNUXPCNRDMAF-WFBYXXMGSA-N 0.000 description 2
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 2
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 2
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 2
- RCENDENBBJFJHZ-ACZMJKKPSA-N Asn-Asn-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCENDENBBJFJHZ-ACZMJKKPSA-N 0.000 description 2
- JRVABKHPWDRUJF-UBHSHLNASA-N Asn-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N JRVABKHPWDRUJF-UBHSHLNASA-N 0.000 description 2
- YQNBILXAUIAUCF-CIUDSAMLSA-N Asn-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N YQNBILXAUIAUCF-CIUDSAMLSA-N 0.000 description 2
- NKTLGLBAGUJEGA-BIIVOSGPSA-N Asn-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N)C(=O)O NKTLGLBAGUJEGA-BIIVOSGPSA-N 0.000 description 2
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 2
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 2
- QYXNFROWLZPWPC-FXQIFTODSA-N Asn-Glu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QYXNFROWLZPWPC-FXQIFTODSA-N 0.000 description 2
- OGMDXNFGPOPZTK-GUBZILKMSA-N Asn-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N OGMDXNFGPOPZTK-GUBZILKMSA-N 0.000 description 2
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 2
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 2
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 2
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 2
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 2
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 2
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 2
- SGAUXNZEFIEAAI-GARJFASQSA-N Asn-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)C(=O)O SGAUXNZEFIEAAI-GARJFASQSA-N 0.000 description 2
- AITGTTNYKAWKDR-CIUDSAMLSA-N Asn-His-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O AITGTTNYKAWKDR-CIUDSAMLSA-N 0.000 description 2
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 2
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 2
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 2
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 2
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 2
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 2
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 2
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 2
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 2
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 2
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 2
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 2
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 2
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 2
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 2
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 2
- PUUPMDXIHCOPJU-HJGDQZAQSA-N Asn-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O PUUPMDXIHCOPJU-HJGDQZAQSA-N 0.000 description 2
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 2
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 2
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 2
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 2
- SYZWMVSXBZCOBZ-QXEWZRGKSA-N Asn-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N SYZWMVSXBZCOBZ-QXEWZRGKSA-N 0.000 description 2
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 2
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 2
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 2
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 2
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 2
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 2
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 2
- UFAQGGZUXVLONR-AVGNSLFASA-N Asp-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)O UFAQGGZUXVLONR-AVGNSLFASA-N 0.000 description 2
- RATOMFTUDRYMKX-ACZMJKKPSA-N Asp-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N RATOMFTUDRYMKX-ACZMJKKPSA-N 0.000 description 2
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 2
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 2
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 2
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 2
- RQYMKRMRZWJGHC-BQBZGAKWSA-N Asp-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N RQYMKRMRZWJGHC-BQBZGAKWSA-N 0.000 description 2
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 2
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 2
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 2
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 2
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 2
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 2
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 2
- KRQFMDNIUOVRIF-KKUMJFAQSA-N Asp-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N KRQFMDNIUOVRIF-KKUMJFAQSA-N 0.000 description 2
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 2
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 2
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 2
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 2
- BOXNGMVEVOGXOJ-UBHSHLNASA-N Asp-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N BOXNGMVEVOGXOJ-UBHSHLNASA-N 0.000 description 2
- VHUKCUHLFMRHOD-MELADBBJSA-N Asp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O VHUKCUHLFMRHOD-MELADBBJSA-N 0.000 description 2
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 240000008564 Boehmeria nivea Species 0.000 description 2
- 241000219193 Brassicaceae Species 0.000 description 2
- 241000701489 Cauliflower mosaic virus Species 0.000 description 2
- 108091033380 Coding strand Proteins 0.000 description 2
- 244000050510 Cunninghamia lanceolata Species 0.000 description 2
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 2
- KLLFLHBKSJAUMZ-ACZMJKKPSA-N Cys-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N KLLFLHBKSJAUMZ-ACZMJKKPSA-N 0.000 description 2
- CPTUXCUWQIBZIF-ZLUOBGJFSA-N Cys-Asn-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CPTUXCUWQIBZIF-ZLUOBGJFSA-N 0.000 description 2
- VNLYIYOYUNGURO-ZLUOBGJFSA-N Cys-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N VNLYIYOYUNGURO-ZLUOBGJFSA-N 0.000 description 2
- HIPHJNWPLMUBQQ-ACZMJKKPSA-N Cys-Cys-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(N)=O HIPHJNWPLMUBQQ-ACZMJKKPSA-N 0.000 description 2
- MGAWEOHYNIMOQJ-ACZMJKKPSA-N Cys-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MGAWEOHYNIMOQJ-ACZMJKKPSA-N 0.000 description 2
- YRKJQKATZOTUEN-ACZMJKKPSA-N Cys-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N YRKJQKATZOTUEN-ACZMJKKPSA-N 0.000 description 2
- HHABWQIFXZPZCK-ACZMJKKPSA-N Cys-Gln-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N HHABWQIFXZPZCK-ACZMJKKPSA-N 0.000 description 2
- UCMIKRLLIOVDRJ-XKBZYTNZSA-N Cys-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)O UCMIKRLLIOVDRJ-XKBZYTNZSA-N 0.000 description 2
- RWGDABDXVXRLLH-ACZMJKKPSA-N Cys-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N RWGDABDXVXRLLH-ACZMJKKPSA-N 0.000 description 2
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 2
- VCIIDXDOPGHMDQ-WDSKDSINSA-N Cys-Gly-Gln Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VCIIDXDOPGHMDQ-WDSKDSINSA-N 0.000 description 2
- UXIYYUMGFNSGBK-XPUUQOCRSA-N Cys-Gly-Val Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O UXIYYUMGFNSGBK-XPUUQOCRSA-N 0.000 description 2
- BLGNLNRBABWDST-CIUDSAMLSA-N Cys-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BLGNLNRBABWDST-CIUDSAMLSA-N 0.000 description 2
- XLLSMEFANRROJE-GUBZILKMSA-N Cys-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XLLSMEFANRROJE-GUBZILKMSA-N 0.000 description 2
- MKMKILWCRQLDFJ-DCAQKATOSA-N Cys-Lys-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MKMKILWCRQLDFJ-DCAQKATOSA-N 0.000 description 2
- VDUPGIDTWNQAJD-CIUDSAMLSA-N Cys-Lys-Cys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O VDUPGIDTWNQAJD-CIUDSAMLSA-N 0.000 description 2
- GDNWBSFSHJVXKL-GUBZILKMSA-N Cys-Lys-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O GDNWBSFSHJVXKL-GUBZILKMSA-N 0.000 description 2
- CIVXDCMSSFGWAL-YUMQZZPRSA-N Cys-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N CIVXDCMSSFGWAL-YUMQZZPRSA-N 0.000 description 2
- XMVZMBGFIOQONW-GARJFASQSA-N Cys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)C(=O)O XMVZMBGFIOQONW-GARJFASQSA-N 0.000 description 2
- LHJDLVVQRJIURS-SRVKXCTJSA-N Cys-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N LHJDLVVQRJIURS-SRVKXCTJSA-N 0.000 description 2
- OZSBRCONEMXYOJ-AVGNSLFASA-N Cys-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N OZSBRCONEMXYOJ-AVGNSLFASA-N 0.000 description 2
- HMWBPUDETPKSSS-DCAQKATOSA-N Cys-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCCN)C(=O)O HMWBPUDETPKSSS-DCAQKATOSA-N 0.000 description 2
- SWJYSDXMTPMBHO-FXQIFTODSA-N Cys-Pro-Ser Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SWJYSDXMTPMBHO-FXQIFTODSA-N 0.000 description 2
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 2
- LKHMGNHQULEPFY-ACZMJKKPSA-N Cys-Ser-Glu Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O LKHMGNHQULEPFY-ACZMJKKPSA-N 0.000 description 2
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 2
- DRXOWZZHCSBUOI-YJRXYDGGSA-N Cys-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N)O DRXOWZZHCSBUOI-YJRXYDGGSA-N 0.000 description 2
- 240000008395 Elaeocarpus angustifolius Species 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 2
- 244000187785 Eucalyptus alba Species 0.000 description 2
- 241001480093 Eucalyptus cloeziana Species 0.000 description 2
- 241000006109 Eucalyptus delegatensis Species 0.000 description 2
- 241000006111 Eucalyptus dives Species 0.000 description 2
- 241001074688 Eucalyptus dunnii Species 0.000 description 2
- 244000011905 Eucalyptus globulus subsp bicostata Species 0.000 description 2
- 241001074706 Eucalyptus pellita Species 0.000 description 2
- 240000006361 Eucalyptus saligna Species 0.000 description 2
- 241000404037 Eucalyptus urophylla Species 0.000 description 2
- 241000701484 Figwort mosaic virus Species 0.000 description 2
- LZRMPXRYLLTAJX-GUBZILKMSA-N Gln-Arg-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZRMPXRYLLTAJX-GUBZILKMSA-N 0.000 description 2
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 2
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 2
- VVWWRZZMPSPVQU-KBIXCLLPSA-N Gln-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N VVWWRZZMPSPVQU-KBIXCLLPSA-N 0.000 description 2
- NKCZYEDZTKOFBG-GUBZILKMSA-N Gln-Gln-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NKCZYEDZTKOFBG-GUBZILKMSA-N 0.000 description 2
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 2
- JEFZIKRIDLHOIF-BYPYZUCNSA-N Gln-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(O)=O JEFZIKRIDLHOIF-BYPYZUCNSA-N 0.000 description 2
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 2
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 2
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 2
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 2
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 2
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 2
- WEAVZFWWIPIANL-SRVKXCTJSA-N Gln-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N WEAVZFWWIPIANL-SRVKXCTJSA-N 0.000 description 2
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 2
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 2
- DYVMTEWCGAVKSE-HJGDQZAQSA-N Gln-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O DYVMTEWCGAVKSE-HJGDQZAQSA-N 0.000 description 2
- XIYWAJQIWLXXAF-XKBZYTNZSA-N Gln-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XIYWAJQIWLXXAF-XKBZYTNZSA-N 0.000 description 2
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 2
- CVRUVYDNRPSKBM-QEJZJMRPSA-N Gln-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N CVRUVYDNRPSKBM-QEJZJMRPSA-N 0.000 description 2
- GTBXHETZPUURJE-KKUMJFAQSA-N Gln-Tyr-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GTBXHETZPUURJE-KKUMJFAQSA-N 0.000 description 2
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 2
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 2
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 2
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 2
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 2
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 2
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 2
- LJLPOZGRPLORTF-CIUDSAMLSA-N Glu-Asn-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LJLPOZGRPLORTF-CIUDSAMLSA-N 0.000 description 2
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 2
- NADWTMLCUDMDQI-ACZMJKKPSA-N Glu-Asp-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N NADWTMLCUDMDQI-ACZMJKKPSA-N 0.000 description 2
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 2
- KIMXNQXJJWWVIN-AVGNSLFASA-N Glu-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O KIMXNQXJJWWVIN-AVGNSLFASA-N 0.000 description 2
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 2
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 2
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 2
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 2
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 2
- LYCDZGLXQBPNQU-WDSKDSINSA-N Glu-Gly-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O LYCDZGLXQBPNQU-WDSKDSINSA-N 0.000 description 2
- VOORMNJKNBGYGK-YUMQZZPRSA-N Glu-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N VOORMNJKNBGYGK-YUMQZZPRSA-N 0.000 description 2
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 2
- BRKUZSLQMPNVFN-SRVKXCTJSA-N Glu-His-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BRKUZSLQMPNVFN-SRVKXCTJSA-N 0.000 description 2
- BIHMNDPWRUROFZ-JYJNAYRXSA-N Glu-His-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BIHMNDPWRUROFZ-JYJNAYRXSA-N 0.000 description 2
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 2
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 2
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 2
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 2
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 2
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 2
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 2
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 2
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 2
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 2
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 2
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 2
- UCZXXMREFIETQW-AVGNSLFASA-N Glu-Tyr-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O UCZXXMREFIETQW-AVGNSLFASA-N 0.000 description 2
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 2
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 2
- JLXVRFDTDUGQEE-YFKPBYRVSA-N Gly-Arg Chemical compound NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N JLXVRFDTDUGQEE-YFKPBYRVSA-N 0.000 description 2
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 2
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 2
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 2
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 2
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 2
- YDWZGVCXMVLDQH-WHFBIAKZSA-N Gly-Cys-Asn Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O YDWZGVCXMVLDQH-WHFBIAKZSA-N 0.000 description 2
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 2
- IANBSEOVTQNGBZ-BQBZGAKWSA-N Gly-Cys-Met Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(O)=O IANBSEOVTQNGBZ-BQBZGAKWSA-N 0.000 description 2
- GYAUWXXORNTCHU-QWRGUYRKSA-N Gly-Cys-Tyr Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 GYAUWXXORNTCHU-QWRGUYRKSA-N 0.000 description 2
- VOCMRCVMAPSSAL-IUCAKERBSA-N Gly-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN VOCMRCVMAPSSAL-IUCAKERBSA-N 0.000 description 2
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 2
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 2
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 2
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 2
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 2
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 2
- HPAIKDPJURGQLN-KBPBESRZSA-N Gly-His-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 HPAIKDPJURGQLN-KBPBESRZSA-N 0.000 description 2
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 2
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 2
- QGDOOCIPHSSADO-STQMWFEESA-N Gly-Met-Phe Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGDOOCIPHSSADO-STQMWFEESA-N 0.000 description 2
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 2
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 2
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 2
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 2
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 2
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 2
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 2
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 2
- GWNIGUKSRJBIHX-STQMWFEESA-N Gly-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN)O GWNIGUKSRJBIHX-STQMWFEESA-N 0.000 description 2
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 2
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 2
- DKJWUIYLMLUBDX-XPUUQOCRSA-N Gly-Val-Cys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O DKJWUIYLMLUBDX-XPUUQOCRSA-N 0.000 description 2
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 2
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 2
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- JBJNKUOMNZGQIM-PYJNHQTQSA-N His-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JBJNKUOMNZGQIM-PYJNHQTQSA-N 0.000 description 2
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 2
- MWAJSVTZZOUOBU-IHRRRGAJSA-N His-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 MWAJSVTZZOUOBU-IHRRRGAJSA-N 0.000 description 2
- NELVFWFDOKRTOR-SDDRHHMPSA-N His-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O NELVFWFDOKRTOR-SDDRHHMPSA-N 0.000 description 2
- STWGDDDFLUFCCA-GVXVVHGQSA-N His-Glu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O STWGDDDFLUFCCA-GVXVVHGQSA-N 0.000 description 2
- FYTCLUIYTYFGPT-YUMQZZPRSA-N His-Gly-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FYTCLUIYTYFGPT-YUMQZZPRSA-N 0.000 description 2
- IWXMHXYOACDSIA-PYJNHQTQSA-N His-Ile-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O IWXMHXYOACDSIA-PYJNHQTQSA-N 0.000 description 2
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 2
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 2
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 2
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 2
- ILUVWFTXAUYOBW-CUJWVEQBSA-N His-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N)O ILUVWFTXAUYOBW-CUJWVEQBSA-N 0.000 description 2
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 2
- 244000025221 Humulus lupulus Species 0.000 description 2
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 description 2
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 2
- IPYVXYDYLHVWHU-GMOBBJLQSA-N Ile-Asn-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N IPYVXYDYLHVWHU-GMOBBJLQSA-N 0.000 description 2
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 2
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 2
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 2
- SJIGTGZVQGLMGG-NAKRPEOUSA-N Ile-Cys-Arg Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)O SJIGTGZVQGLMGG-NAKRPEOUSA-N 0.000 description 2
- PFTFEWHJSAXGED-ZKWXMUAHSA-N Ile-Cys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N PFTFEWHJSAXGED-ZKWXMUAHSA-N 0.000 description 2
- VCYVLFAWCJRXFT-HJPIBITLSA-N Ile-Cys-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N VCYVLFAWCJRXFT-HJPIBITLSA-N 0.000 description 2
- IXEFKXAGHRQFAF-HVTMNAMFSA-N Ile-Glu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IXEFKXAGHRQFAF-HVTMNAMFSA-N 0.000 description 2
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 2
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 2
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 2
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 2
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 2
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 2
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 2
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 2
- FJWALBCCVIHZBS-QXEWZRGKSA-N Ile-Met-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N FJWALBCCVIHZBS-QXEWZRGKSA-N 0.000 description 2
- WSSGUVAKYCQSCT-XUXIUFHCSA-N Ile-Met-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)O)N WSSGUVAKYCQSCT-XUXIUFHCSA-N 0.000 description 2
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 2
- CZWANIQKACCEKW-CYDGBPFRSA-N Ile-Pro-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N CZWANIQKACCEKW-CYDGBPFRSA-N 0.000 description 2
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 2
- FBGXMKUWQFPHFB-JBDRJPRFSA-N Ile-Ser-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N FBGXMKUWQFPHFB-JBDRJPRFSA-N 0.000 description 2
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 2
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 2
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 2
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 2
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 2
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 2
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 2
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 2
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 2
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 2
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 2
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 2
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- YORLGJINWYYIMX-KKUMJFAQSA-N Leu-Cys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YORLGJINWYYIMX-KKUMJFAQSA-N 0.000 description 2
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 2
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 2
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 2
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 2
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 2
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 2
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 2
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 2
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 2
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 2
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 2
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 2
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 2
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 2
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 2
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 2
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 2
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 2
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 2
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 2
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 2
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 2
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 2
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- 235000004431 Linum usitatissimum Nutrition 0.000 description 2
- 240000006240 Linum usitatissimum Species 0.000 description 2
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 2
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 2
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 2
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 2
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 2
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 2
- XTONYTDATVADQH-CIUDSAMLSA-N Lys-Cys-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O XTONYTDATVADQH-CIUDSAMLSA-N 0.000 description 2
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 2
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 2
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 2
- CRNNMTHBMRFQNG-GUBZILKMSA-N Lys-Glu-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N CRNNMTHBMRFQNG-GUBZILKMSA-N 0.000 description 2
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 2
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 2
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 2
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 2
- FGMHXLULNHTPID-KKUMJFAQSA-N Lys-His-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CN=CN1 FGMHXLULNHTPID-KKUMJFAQSA-N 0.000 description 2
- YXTKSLRSRXKXNV-IHRRRGAJSA-N Lys-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N YXTKSLRSRXKXNV-IHRRRGAJSA-N 0.000 description 2
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 2
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 2
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 2
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 2
- MTBLFIQZECOEBY-IHRRRGAJSA-N Lys-Met-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O MTBLFIQZECOEBY-IHRRRGAJSA-N 0.000 description 2
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 2
- IPTUBUUIFRZMJK-ACRUOGEOSA-N Lys-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 IPTUBUUIFRZMJK-ACRUOGEOSA-N 0.000 description 2
- UDXSLGLHFUBRRM-OEAJRASXSA-N Lys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCCN)N)O UDXSLGLHFUBRRM-OEAJRASXSA-N 0.000 description 2
- MSSABBQOBUZFKZ-IHRRRGAJSA-N Lys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O MSSABBQOBUZFKZ-IHRRRGAJSA-N 0.000 description 2
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 2
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 2
- GVKINWYYLOLEFQ-XIRDDKMYSA-N Lys-Trp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O GVKINWYYLOLEFQ-XIRDDKMYSA-N 0.000 description 2
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 2
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 2
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 2
- 241000208949 Malpighiaceae Species 0.000 description 2
- 241000219071 Malvaceae Species 0.000 description 2
- IYXDSYWCVVXSKB-CIUDSAMLSA-N Met-Asn-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IYXDSYWCVVXSKB-CIUDSAMLSA-N 0.000 description 2
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 2
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 2
- UFOWQBYMUILSRK-IHRRRGAJSA-N Met-Lys-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 UFOWQBYMUILSRK-IHRRRGAJSA-N 0.000 description 2
- ZRACLHJYVRBJFC-ULQDDVLXSA-N Met-Lys-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZRACLHJYVRBJFC-ULQDDVLXSA-N 0.000 description 2
- IILAGWCGKJSBGB-IHRRRGAJSA-N Met-Phe-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IILAGWCGKJSBGB-IHRRRGAJSA-N 0.000 description 2
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 2
- CNFMPVYIVQUJOO-NHCYSSNCSA-N Met-Val-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O CNFMPVYIVQUJOO-NHCYSSNCSA-N 0.000 description 2
- 241000219926 Myrtaceae Species 0.000 description 2
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 2
- 108010047562 NGR peptide Proteins 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 240000007377 Petunia x hybrida Species 0.000 description 2
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 2
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 2
- YQNBKXUTWBRQCS-BVSLBCMMSA-N Phe-Arg-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 YQNBKXUTWBRQCS-BVSLBCMMSA-N 0.000 description 2
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 2
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 2
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 2
- UEXCHCYDPAIVDE-SRVKXCTJSA-N Phe-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEXCHCYDPAIVDE-SRVKXCTJSA-N 0.000 description 2
- LXUJDHOKVUYHRC-KKUMJFAQSA-N Phe-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N LXUJDHOKVUYHRC-KKUMJFAQSA-N 0.000 description 2
- GDBOREPXIRKSEQ-FHWLQOOXSA-N Phe-Gln-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GDBOREPXIRKSEQ-FHWLQOOXSA-N 0.000 description 2
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 2
- CDQCFGOQNYOICK-IHRRRGAJSA-N Phe-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDQCFGOQNYOICK-IHRRRGAJSA-N 0.000 description 2
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 2
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 2
- VJLLEKDQJSMHRU-STQMWFEESA-N Phe-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O VJLLEKDQJSMHRU-STQMWFEESA-N 0.000 description 2
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 2
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 2
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 2
- KPEIBEPEUAZWNS-ULQDDVLXSA-N Phe-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KPEIBEPEUAZWNS-ULQDDVLXSA-N 0.000 description 2
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 2
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 description 2
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 2
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 2
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 2
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 2
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 2
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 2
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 2
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 2
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 2
- GLUYKHMBGKQBHE-JYJNAYRXSA-N Phe-Val-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 GLUYKHMBGKQBHE-JYJNAYRXSA-N 0.000 description 2
- YUPRIZTWANWWHK-DZKIICNBSA-N Phe-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N YUPRIZTWANWWHK-DZKIICNBSA-N 0.000 description 2
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 2
- 235000000422 Phormium tenax Nutrition 0.000 description 2
- 240000009257 Phormium tenax Species 0.000 description 2
- 108700001094 Plant Genes Proteins 0.000 description 2
- 241000209504 Poaceae Species 0.000 description 2
- 241000218978 Populus deltoides Species 0.000 description 2
- 241000182989 Populus laurifolia Species 0.000 description 2
- 241000249899 Populus tomentosa Species 0.000 description 2
- 241000987883 Populus yunnanensis Species 0.000 description 2
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 2
- OLHDPZMYUSBGDE-GUBZILKMSA-N Pro-Arg-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O OLHDPZMYUSBGDE-GUBZILKMSA-N 0.000 description 2
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 2
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 2
- AHXPYZRZRMQOAU-QXEWZRGKSA-N Pro-Asn-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1)C(O)=O AHXPYZRZRMQOAU-QXEWZRGKSA-N 0.000 description 2
- HQVPQXMCQKXARZ-FXQIFTODSA-N Pro-Cys-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O HQVPQXMCQKXARZ-FXQIFTODSA-N 0.000 description 2
- XJROSHJRQTXWAE-XGEHTFHBSA-N Pro-Cys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XJROSHJRQTXWAE-XGEHTFHBSA-N 0.000 description 2
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 2
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 2
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 2
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 2
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 2
- NTXFLJULRHQMDC-GUBZILKMSA-N Pro-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 NTXFLJULRHQMDC-GUBZILKMSA-N 0.000 description 2
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 2
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 2
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 2
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 description 2
- ZYJMLBCDFPIGNL-JYJNAYRXSA-N Pro-Tyr-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O ZYJMLBCDFPIGNL-JYJNAYRXSA-N 0.000 description 2
- 240000001052 Prunus maximowiczii Species 0.000 description 2
- 108010079005 RDV peptide Proteins 0.000 description 2
- 108010003201 RGH 0205 Proteins 0.000 description 2
- 241000589180 Rhizobium Species 0.000 description 2
- UEJYSALTSUZXFV-SRVKXCTJSA-N Rigin Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O UEJYSALTSUZXFV-SRVKXCTJSA-N 0.000 description 2
- 241000124033 Salix Species 0.000 description 2
- 229920002684 Sepharose Polymers 0.000 description 2
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 2
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 2
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 2
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 2
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 2
- QGMLKFGTGXWAHF-IHRRRGAJSA-N Ser-Arg-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGMLKFGTGXWAHF-IHRRRGAJSA-N 0.000 description 2
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 2
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 2
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 2
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 2
- XSYJDGIDKRNWFX-SRVKXCTJSA-N Ser-Cys-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XSYJDGIDKRNWFX-SRVKXCTJSA-N 0.000 description 2
- MPPHJZYXDVDGOF-BWBBJGPYSA-N Ser-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CO MPPHJZYXDVDGOF-BWBBJGPYSA-N 0.000 description 2
- WKLJLEXEENIYQE-SRVKXCTJSA-N Ser-Cys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WKLJLEXEENIYQE-SRVKXCTJSA-N 0.000 description 2
- SWIQQMYVHIXPEK-FXQIFTODSA-N Ser-Cys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O SWIQQMYVHIXPEK-FXQIFTODSA-N 0.000 description 2
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 2
- SVWQEIRZHHNBIO-WHFBIAKZSA-N Ser-Gly-Cys Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CS)C(O)=O SVWQEIRZHHNBIO-WHFBIAKZSA-N 0.000 description 2
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 2
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 2
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 2
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 2
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 2
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 2
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 2
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- GVIGVIOEYBOTCB-XIRDDKMYSA-N Ser-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC(C)C)C(O)=O)=CNC2=C1 GVIGVIOEYBOTCB-XIRDDKMYSA-N 0.000 description 2
- JLPMFVAIQHCBDC-CIUDSAMLSA-N Ser-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N JLPMFVAIQHCBDC-CIUDSAMLSA-N 0.000 description 2
- SRKMDKACHDVPMD-SRVKXCTJSA-N Ser-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N SRKMDKACHDVPMD-SRVKXCTJSA-N 0.000 description 2
- JUTGONBTALQWMK-NAKRPEOUSA-N Ser-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N JUTGONBTALQWMK-NAKRPEOUSA-N 0.000 description 2
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 2
- WOJYIMBIKTWKJO-KKUMJFAQSA-N Ser-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N WOJYIMBIKTWKJO-KKUMJFAQSA-N 0.000 description 2
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 2
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 2
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 2
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 2
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 2
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 2
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 2
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 2
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 2
- 241001135312 Sinorhizobium Species 0.000 description 2
- 241000208292 Solanaceae Species 0.000 description 2
- 240000005622 Spartium junceum Species 0.000 description 2
- 235000007235 Spartium junceum Nutrition 0.000 description 2
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 2
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 2
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 2
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 2
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 2
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 2
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 2
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 2
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 2
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 2
- YAAPRMFURSENOZ-KATARQTJSA-N Thr-Cys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O YAAPRMFURSENOZ-KATARQTJSA-N 0.000 description 2
- VEWZSFGRQDUAJM-YJRXYDGGSA-N Thr-Cys-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O VEWZSFGRQDUAJM-YJRXYDGGSA-N 0.000 description 2
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 2
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 2
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 2
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 2
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 2
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 2
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 2
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 2
- VUSAEKOXGNEYNE-PBCZWWQYSA-N Thr-His-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VUSAEKOXGNEYNE-PBCZWWQYSA-N 0.000 description 2
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 2
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 2
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 2
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 2
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 2
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 2
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 2
- GUHLYMZJVXUIPO-RCWTZXSCSA-N Thr-Met-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GUHLYMZJVXUIPO-RCWTZXSCSA-N 0.000 description 2
- NZRUWPIYECBYRK-HTUGSXCWSA-N Thr-Phe-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O NZRUWPIYECBYRK-HTUGSXCWSA-N 0.000 description 2
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 2
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 2
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 2
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 2
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 2
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 2
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 2
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 2
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 2
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 2
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 2
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 2
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 2
- XGUAUKUYQHBUNY-SWRJLBSHSA-N Thr-Trp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XGUAUKUYQHBUNY-SWRJLBSHSA-N 0.000 description 2
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 2
- AXEJRUGTOJPZKG-XGEHTFHBSA-N Thr-Val-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O AXEJRUGTOJPZKG-XGEHTFHBSA-N 0.000 description 2
- CURFABYITJVKEW-QTKMDUPCSA-N Thr-Val-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O CURFABYITJVKEW-QTKMDUPCSA-N 0.000 description 2
- 241000218638 Thuja plicata Species 0.000 description 2
- 235000015450 Tilia cordata Nutrition 0.000 description 2
- 240000006909 Tilia x europaea Species 0.000 description 2
- RNDWCRUOGGQDKN-UBHSHLNASA-N Trp-Ser-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RNDWCRUOGGQDKN-UBHSHLNASA-N 0.000 description 2
- UPUNWAXSLPBMRK-XTWBLICNSA-N Trp-Thr-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UPUNWAXSLPBMRK-XTWBLICNSA-N 0.000 description 2
- IXTQGBGHWQEEDE-AVGNSLFASA-N Tyr-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IXTQGBGHWQEEDE-AVGNSLFASA-N 0.000 description 2
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 2
- FFCRCJZJARTYCG-KKUMJFAQSA-N Tyr-Cys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O FFCRCJZJARTYCG-KKUMJFAQSA-N 0.000 description 2
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 2
- HDSKHCBAVVWPCQ-FHWLQOOXSA-N Tyr-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HDSKHCBAVVWPCQ-FHWLQOOXSA-N 0.000 description 2
- GFJXBLSZOFWHAW-JYJNAYRXSA-N Tyr-His-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GFJXBLSZOFWHAW-JYJNAYRXSA-N 0.000 description 2
- JJNXZIPLIXIGBX-HJPIBITLSA-N Tyr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JJNXZIPLIXIGBX-HJPIBITLSA-N 0.000 description 2
- AXWBYOVVDRBOGU-SIUGBPQLSA-N Tyr-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AXWBYOVVDRBOGU-SIUGBPQLSA-N 0.000 description 2
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 2
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 2
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 2
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 2
- WTTRJMAZPDHPGS-KKXDTOCCSA-N Tyr-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(O)=O WTTRJMAZPDHPGS-KKXDTOCCSA-N 0.000 description 2
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 2
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 2
- QRCBQDPRKMYTMB-IHPCNDPISA-N Tyr-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N QRCBQDPRKMYTMB-IHPCNDPISA-N 0.000 description 2
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 description 2
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 2
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 2
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 2
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 2
- GBESYURLQOYWLU-LAEOZQHASA-N Val-Glu-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GBESYURLQOYWLU-LAEOZQHASA-N 0.000 description 2
- AHHJARQXFFGOKF-NRPADANISA-N Val-Glu-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N AHHJARQXFFGOKF-NRPADANISA-N 0.000 description 2
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 2
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 2
- RKIGNDAHUOOIMJ-BQFCYCMXSA-N Val-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 RKIGNDAHUOOIMJ-BQFCYCMXSA-N 0.000 description 2
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 2
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 2
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 2
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 2
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 2
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 2
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 2
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 2
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 2
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 2
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 2
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 2
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 2
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 2
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 2
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 2
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 2
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 2
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 2
- 244000042314 Vigna unguiculata Species 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 229920002522 Wood fibre Polymers 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 108010066829 alanyl-glutamyl-aspartylprolyine Proteins 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 2
- 108010041407 alanylaspartic acid Proteins 0.000 description 2
- 108010011559 alanylphenylalanine Proteins 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010036533 arginylvaline Proteins 0.000 description 2
- 210000001367 artery Anatomy 0.000 description 2
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 2
- 210000001106 artificial yeast chromosome Anatomy 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- QLULGSLAHXLKSR-UHFFFAOYSA-N azane;phosphane Chemical compound N.P QLULGSLAHXLKSR-UHFFFAOYSA-N 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 230000001488 breeding effect Effects 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 235000012343 cottonseed oil Nutrition 0.000 description 2
- 108010004073 cysteinylcysteine Proteins 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 108010009297 diglycyl-histidine Proteins 0.000 description 2
- 235000013399 edible fruits Nutrition 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 238000013467 fragmentation Methods 0.000 description 2
- 238000006062 fragmentation reaction Methods 0.000 description 2
- IAJOBQBIJHVGMQ-BYPYZUCNSA-N glufosinate-P Chemical compound CP(O)(=O)CC[C@H](N)C(O)=O IAJOBQBIJHVGMQ-BYPYZUCNSA-N 0.000 description 2
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 2
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 2
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 239000005090 green fluorescent protein Substances 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 2
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 230000035800 maturation Effects 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 230000004060 metabolic process Effects 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 108010058731 nopaline synthase Proteins 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 2
- 108010025488 pinealon Proteins 0.000 description 2
- 239000013612 plasmid Substances 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 108010015796 prolylisoleucine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 238000003757 reverse transcription PCR Methods 0.000 description 2
- 108010091078 rigin Proteins 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 108010005652 splenotritin Proteins 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 238000010257 thawing Methods 0.000 description 2
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 230000005026 transcription initiation Effects 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 108010078580 tyrosylleucine Proteins 0.000 description 2
- 210000003462 vein Anatomy 0.000 description 2
- FQVLRGLGWNWPSS-BXBUPLCLSA-N (4r,7s,10s,13s,16r)-16-acetamido-13-(1h-imidazol-5-ylmethyl)-10-methyl-6,9,12,15-tetraoxo-7-propan-2-yl-1,2-dithia-5,8,11,14-tetrazacycloheptadecane-4-carboxamide Chemical compound N1C(=O)[C@@H](NC(C)=O)CSSC[C@@H](C(N)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)NC(=O)[C@@H]1CC1=CN=CN1 FQVLRGLGWNWPSS-BXBUPLCLSA-N 0.000 description 1
- NWXMGUDVXFXRIG-WESIUVDSSA-N (4s,4as,5as,6s,12ar)-4-(dimethylamino)-1,6,10,11,12a-pentahydroxy-6-methyl-3,12-dioxo-4,4a,5,5a-tetrahydrotetracene-2-carboxamide Chemical compound C1=CC=C2[C@](O)(C)[C@H]3C[C@H]4[C@H](N(C)C)C(=O)C(C(N)=O)=C(O)[C@@]4(O)C(=O)C3=C(O)C2=C1O NWXMGUDVXFXRIG-WESIUVDSSA-N 0.000 description 1
- 101710169336 5'-deoxyadenosine deaminase Proteins 0.000 description 1
- 235000014081 Abies amabilis Nutrition 0.000 description 1
- 244000101408 Abies amabilis Species 0.000 description 1
- 244000166033 Abies lasiocarpa Species 0.000 description 1
- 235000004710 Abies lasiocarpa Nutrition 0.000 description 1
- 102000055025 Adenosine deaminases Human genes 0.000 description 1
- 101150021974 Adh1 gene Proteins 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- 241000589156 Agrobacterium rhizogenes Species 0.000 description 1
- 244000144725 Amygdalus communis Species 0.000 description 1
- 235000011437 Amygdalus communis Nutrition 0.000 description 1
- 102100033972 Amyloid protein-binding protein 2 Human genes 0.000 description 1
- 244000099147 Ananas comosus Species 0.000 description 1
- 235000007119 Ananas comosus Nutrition 0.000 description 1
- 241000219194 Arabidopsis Species 0.000 description 1
- FTMRPIVPSDVGCC-GUBZILKMSA-N Arg-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FTMRPIVPSDVGCC-GUBZILKMSA-N 0.000 description 1
- 241001071161 Asclepias Species 0.000 description 1
- 235000002470 Asclepias syriaca Nutrition 0.000 description 1
- 244000000594 Asclepias syriaca Species 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- 241000208838 Asteraceae Species 0.000 description 1
- 108010006654 Bleomycin Proteins 0.000 description 1
- 240000006248 Broussonetia kazinoki Species 0.000 description 1
- 241001674345 Callitropsis nootkatensis Species 0.000 description 1
- 241000069344 Calycogonium Species 0.000 description 1
- 244000180046 Camden woollybutt Species 0.000 description 1
- 235000003571 Camden woollybutt Nutrition 0.000 description 1
- 244000025254 Cannabis sativa Species 0.000 description 1
- 241000223782 Ciliophora Species 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 241001480083 Corymbia dolichocarpa Species 0.000 description 1
- 241001480160 Corymbia erythrophloia Species 0.000 description 1
- 241000219104 Cucurbitaceae Species 0.000 description 1
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 1
- 108010066133 D-octopine dehydrogenase Proteins 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 101710088194 Dehydrogenase Proteins 0.000 description 1
- 101000785279 Dictyostelium discoideum Calcium-transporting ATPase PAT1 Proteins 0.000 description 1
- 101100125027 Dictyostelium discoideum mhsp70 gene Proteins 0.000 description 1
- 101150071673 E6 gene Proteins 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 241001046947 Ectropis obliqua Species 0.000 description 1
- 102000002322 Egg Proteins Human genes 0.000 description 1
- 108010000912 Egg Proteins Proteins 0.000 description 1
- 244000148064 Enicostema verticillatum Species 0.000 description 1
- 241001074639 Eucalyptus albens Species 0.000 description 1
- 240000000991 Eucalyptus amygdalina Species 0.000 description 1
- 241000006103 Eucalyptus aromaphloia Species 0.000 description 1
- 241001074652 Eucalyptus baileyana Species 0.000 description 1
- 241001074641 Eucalyptus balladoniensis Species 0.000 description 1
- 241001465371 Eucalyptus botryoides Species 0.000 description 1
- 241001074643 Eucalyptus brachyandra Species 0.000 description 1
- 241001074642 Eucalyptus brassiana Species 0.000 description 1
- 241001074645 Eucalyptus brevistylis Species 0.000 description 1
- 241001074644 Eucalyptus brockwayi Species 0.000 description 1
- 241000006104 Eucalyptus ceracea Species 0.000 description 1
- 241000006105 Eucalyptus coccifera Species 0.000 description 1
- 244000187656 Eucalyptus cornuta Species 0.000 description 1
- 241000006106 Eucalyptus corticosa Species 0.000 description 1
- 244000187657 Eucalyptus crebra Species 0.000 description 1
- 241000006107 Eucalyptus croajingolensis Species 0.000 description 1
- 241001480120 Eucalyptus curtisii Species 0.000 description 1
- 241000006108 Eucalyptus dalrympleana Species 0.000 description 1
- 241001528275 Eucalyptus deglupta Species 0.000 description 1
- 241001074679 Eucalyptus delicata Species 0.000 description 1
- 241000396461 Eucalyptus diversicolor Species 0.000 description 1
- 241000006110 Eucalyptus diversifolia Species 0.000 description 1
- 241001074689 Eucalyptus dundasii Species 0.000 description 1
- 241001480110 Eucalyptus erythrocorys Species 0.000 description 1
- 241001074681 Eucalyptus eudesmioides Species 0.000 description 1
- 241001074680 Eucalyptus falcata Species 0.000 description 1
- 241001074683 Eucalyptus gamophylla Species 0.000 description 1
- 241001074682 Eucalyptus glaucina Species 0.000 description 1
- 244000005004 Eucalyptus globulus subsp globulus Species 0.000 description 1
- 241001074672 Eucalyptus globulus subsp. maidenii Species 0.000 description 1
- 241001074685 Eucalyptus gongylocarpa Species 0.000 description 1
- 241001074684 Eucalyptus guilfoylei Species 0.000 description 1
- 244000059939 Eucalyptus gunnii Species 0.000 description 1
- 241001074687 Eucalyptus hallii Species 0.000 description 1
- 241001074686 Eucalyptus houseana Species 0.000 description 1
- 241001074676 Eucalyptus jacksonii Species 0.000 description 1
- 241000006113 Eucalyptus lansdowneana Species 0.000 description 1
- 241001074678 Eucalyptus latisinensis Species 0.000 description 1
- 241001074677 Eucalyptus leucophloia Species 0.000 description 1
- 241001074670 Eucalyptus lockyeri Species 0.000 description 1
- 241001074669 Eucalyptus lucasii Species 0.000 description 1
- 241001074671 Eucalyptus marginata Species 0.000 description 1
- 241001074674 Eucalyptus megacarpa Species 0.000 description 1
- 241001074673 Eucalyptus melliodora Species 0.000 description 1
- 241001074675 Eucalyptus michaeliana Species 0.000 description 1
- 241001480112 Eucalyptus microcorys Species 0.000 description 1
- 240000001602 Eucalyptus microtheca Species 0.000 description 1
- 241001480094 Eucalyptus muelleriana Species 0.000 description 1
- 241000006114 Eucalyptus nitens Species 0.000 description 1
- 241001074711 Eucalyptus obtusiflora Species 0.000 description 1
- 241000333074 Eucalyptus occidentalis Species 0.000 description 1
- 241001074714 Eucalyptus optima Species 0.000 description 1
- 241000010285 Eucalyptus ovata Species 0.000 description 1
- 240000008166 Eucalyptus pachyphylla Species 0.000 description 1
- 241001074705 Eucalyptus perriniana Species 0.000 description 1
- 241001045277 Eucalyptus petiolaris Species 0.000 description 1
- 240000006934 Eucalyptus pilularis Species 0.000 description 1
- 240000003470 Eucalyptus piperita Species 0.000 description 1
- 241001074708 Eucalyptus platyphylla Species 0.000 description 1
- 241000604348 Eucalyptus pleurocarpa Species 0.000 description 1
- 241001074707 Eucalyptus polyanthemos Species 0.000 description 1
- 241001074710 Eucalyptus populnea Species 0.000 description 1
- 241001480095 Eucalyptus preissiana Species 0.000 description 1
- 241001074709 Eucalyptus pseudoglobulus Species 0.000 description 1
- 241000006118 Eucalyptus pulchella Species 0.000 description 1
- 240000003060 Eucalyptus radiata Species 0.000 description 1
- 244000234945 Eucalyptus radiata subsp radiata Species 0.000 description 1
- 241000006121 Eucalyptus regnans Species 0.000 description 1
- 241000006122 Eucalyptus risdonii Species 0.000 description 1
- 241000006123 Eucalyptus robertsonii Species 0.000 description 1
- 241000010288 Eucalyptus rodwayi Species 0.000 description 1
- 244000104951 Eucalyptus rubida Species 0.000 description 1
- 241001074696 Eucalyptus rubiginosa Species 0.000 description 1
- 241001074698 Eucalyptus salmonophloia Species 0.000 description 1
- 241001074697 Eucalyptus scoparia Species 0.000 description 1
- 241000006124 Eucalyptus sieberi Species 0.000 description 1
- 241000006125 Eucalyptus spathulata Species 0.000 description 1
- 241001074700 Eucalyptus staeri Species 0.000 description 1
- 241000006126 Eucalyptus stoatei Species 0.000 description 1
- 241001074699 Eucalyptus tenuipes Species 0.000 description 1
- 241000006128 Eucalyptus tenuiramis Species 0.000 description 1
- 240000007002 Eucalyptus tereticornis Species 0.000 description 1
- 241001074691 Eucalyptus tetrodonta Species 0.000 description 1
- 241001074694 Eucalyptus tindaliae Species 0.000 description 1
- 241001074693 Eucalyptus torquata Species 0.000 description 1
- 241000006130 Eucalyptus umbra Species 0.000 description 1
- 241000010292 Eucalyptus vernicosa Species 0.000 description 1
- 235000013366 Eucalyptus viminalis Nutrition 0.000 description 1
- 241001074695 Eucalyptus wandoo Species 0.000 description 1
- 241000396469 Eucalyptus wetarensis Species 0.000 description 1
- 241000006132 Eucalyptus willisii Species 0.000 description 1
- 241000006138 Eucalyptus willisii subsp. falciformis Species 0.000 description 1
- 241000006139 Eucalyptus willisii subsp. willisii Species 0.000 description 1
- 241000006140 Eucalyptus woodwardii Species 0.000 description 1
- 240000007836 Eurya nitida Species 0.000 description 1
- 241000234642 Festuca Species 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 1
- 229930182566 Gentamicin Natural products 0.000 description 1
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 1
- 241000212941 Glehnia Species 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 235000006200 Glycyrrhiza glabra Nutrition 0.000 description 1
- 239000005562 Glyphosate Substances 0.000 description 1
- 101150056393 H6 gene Proteins 0.000 description 1
- 101150031823 HSP70 gene Proteins 0.000 description 1
- DEOQGJUXUQGUJN-KKUMJFAQSA-N His-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DEOQGJUXUQGUJN-KKUMJFAQSA-N 0.000 description 1
- 101000779309 Homo sapiens Amyloid protein-binding protein 2 Proteins 0.000 description 1
- 101000713296 Homo sapiens Proton-coupled amino acid transporter 1 Proteins 0.000 description 1
- 101001059454 Homo sapiens Serine/threonine-protein kinase MARK2 Proteins 0.000 description 1
- 240000005979 Hordeum vulgare Species 0.000 description 1
- 235000007340 Hordeum vulgare Nutrition 0.000 description 1
- 235000008694 Humulus lupulus Nutrition 0.000 description 1
- YBGTWSFIGHUWQE-MXAVVETBSA-N Ile-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CN=CN1 YBGTWSFIGHUWQE-MXAVVETBSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 241001479557 Iris douglasiana Species 0.000 description 1
- 241000144823 Iris macrosiphon Species 0.000 description 1
- 241001479599 Iris purdyi Species 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- 241001675026 Larix potaninii Species 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 241000209082 Lolium Species 0.000 description 1
- 241001138437 Lophozonia moorei Species 0.000 description 1
- 241000218922 Magnoliophyta Species 0.000 description 1
- 241000970829 Mesorhizobium Species 0.000 description 1
- 102000029749 Microtubule Human genes 0.000 description 1
- 108091022875 Microtubule Proteins 0.000 description 1
- 240000005561 Musa balbisiana Species 0.000 description 1
- 235000018290 Musa x paradisiaca Nutrition 0.000 description 1
- 241000909578 Nectandra Species 0.000 description 1
- 241000208125 Nicotiana Species 0.000 description 1
- 108091092724 Noncoding DNA Proteins 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- IAJOBQBIJHVGMQ-UHFFFAOYSA-N Phosphinothricin Natural products CP(O)(=O)CCC(N)C(O)=O IAJOBQBIJHVGMQ-UHFFFAOYSA-N 0.000 description 1
- 240000000020 Picea glauca Species 0.000 description 1
- 235000008127 Picea glauca Nutrition 0.000 description 1
- 241000218595 Picea sitchensis Species 0.000 description 1
- 235000008593 Pinus contorta Nutrition 0.000 description 1
- 241000218606 Pinus contorta Species 0.000 description 1
- 235000011334 Pinus elliottii Nutrition 0.000 description 1
- 241000142776 Pinus elliottii Species 0.000 description 1
- 235000013267 Pinus ponderosa Nutrition 0.000 description 1
- 241000555277 Pinus ponderosa Species 0.000 description 1
- 235000008577 Pinus radiata Nutrition 0.000 description 1
- 241000218621 Pinus radiata Species 0.000 description 1
- 235000008566 Pinus taeda Nutrition 0.000 description 1
- 241000218679 Pinus taeda Species 0.000 description 1
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 1
- 241001657474 Populus alba x Populus glandulosa Species 0.000 description 1
- 241000161288 Populus candicans Species 0.000 description 1
- 241001278112 Populus euphratica Species 0.000 description 1
- 241001479460 Populus sieboldii x Populus grandidentata Species 0.000 description 1
- 241001023442 Populus suaveolens Species 0.000 description 1
- 241000789572 Populus szechuanica Species 0.000 description 1
- 241001600128 Populus tremula x Populus alba Species 0.000 description 1
- 241000217825 Populus tremula x Populus tremuloides Species 0.000 description 1
- 241000789575 Populus wilsonii Species 0.000 description 1
- 241000218981 Populus x canadensis Species 0.000 description 1
- 241000612120 Primula sieboldii Species 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 238000002123 RNA extraction Methods 0.000 description 1
- 241001633102 Rhizobiaceae Species 0.000 description 1
- 241000220010 Rhode Species 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 241000220317 Rosa Species 0.000 description 1
- 240000000111 Saccharum officinarum Species 0.000 description 1
- 235000007201 Saccharum officinarum Nutrition 0.000 description 1
- 108091081021 Sense strand Proteins 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 241001138418 Sequoia sempervirens Species 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 1
- 102100028904 Serine/threonine-protein kinase MARK2 Human genes 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 241000168254 Siro Species 0.000 description 1
- 235000002595 Solanum tuberosum Nutrition 0.000 description 1
- 244000061456 Solanum tuberosum Species 0.000 description 1
- 108010043934 Sucrose synthase Proteins 0.000 description 1
- 240000005572 Syzygium cordatum Species 0.000 description 1
- 240000006852 Tabernaemontana pauciflora Species 0.000 description 1
- 108020005038 Terminator Codon Proteins 0.000 description 1
- 108700007696 Tetrahydrofolate Dehydrogenase Proteins 0.000 description 1
- 235000008109 Thuja occidentalis Nutrition 0.000 description 1
- 240000003243 Thuja occidentalis Species 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 235000002168 Tilia europaea Nutrition 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- 241000219793 Trifolium Species 0.000 description 1
- 208000026487 Triploidy Diseases 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- 244000098338 Triticum aestivum Species 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 239000006035 Tryptophane Substances 0.000 description 1
- 240000003021 Tsuga heterophylla Species 0.000 description 1
- 235000008554 Tsuga heterophylla Nutrition 0.000 description 1
- 102000004243 Tubulin Human genes 0.000 description 1
- 108090000704 Tubulin Proteins 0.000 description 1
- 101150078824 UBQ3 gene Proteins 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- JYVKKBDANPZIAW-AVGNSLFASA-N Val-Arg-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N JYVKKBDANPZIAW-AVGNSLFASA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- 235000010726 Vigna sinensis Nutrition 0.000 description 1
- 235000010722 Vigna unguiculata Nutrition 0.000 description 1
- 235000009754 Vitis X bourquina Nutrition 0.000 description 1
- 235000012333 Vitis X labruscana Nutrition 0.000 description 1
- 240000006365 Vitis vinifera Species 0.000 description 1
- 235000014787 Vitis vinifera Nutrition 0.000 description 1
- 101150111972 WAK2 gene Proteins 0.000 description 1
- 101150117835 WAK4 gene Proteins 0.000 description 1
- 108010027570 Xanthine phosphoribosyltransferase Proteins 0.000 description 1
- 235000006801 Ximenia americana Nutrition 0.000 description 1
- 244000112726 Ximenia americana Species 0.000 description 1
- JUGOREOARAHOCO-UHFFFAOYSA-M acetylcholine chloride Chemical compound [Cl-].CC(=O)OCC[N+](C)(C)C JUGOREOARAHOCO-UHFFFAOYSA-M 0.000 description 1
- 230000009418 agronomic effect Effects 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- 235000020224 almond Nutrition 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 229960001561 bleomycin Drugs 0.000 description 1
- OYVAGSVQBOHSSS-UAPAGMARSA-O bleomycin A2 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC=C(N=1)C=1SC=C(N=1)C(=O)NCCC[S+](C)C)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C OYVAGSVQBOHSSS-UAPAGMARSA-O 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- RTYJTGSCYUUYAL-YCAHSCEMSA-L carbenicillin disodium Chemical compound [Na+].[Na+].N([C@H]1[C@H]2SC([C@@H](N2C1=O)C([O-])=O)(C)C)C(=O)C(C([O-])=O)C1=CC=CC=C1 RTYJTGSCYUUYAL-YCAHSCEMSA-L 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- RLGQACBPNDBWTB-UHFFFAOYSA-N cetyltrimethylammonium ion Chemical compound CCCCCCCCCCCCCCCC[N+](C)(C)C RLGQACBPNDBWTB-UHFFFAOYSA-N 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000012174 chinese wax Substances 0.000 description 1
- 239000004927 clay Substances 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 235000009508 confectionery Nutrition 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 230000001186 cumulative effect Effects 0.000 description 1
- 210000000172 cytosol Anatomy 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 238000013016 damping Methods 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 239000005549 deoxyribonucleoside Substances 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 101150052825 dnaK gene Proteins 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 210000002615 epidermis Anatomy 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- 230000004720 fertilization Effects 0.000 description 1
- 235000004426 flaxseed Nutrition 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- PGBHMTALBVVCIT-VCIWKGPPSA-N framycetin Chemical compound N[C@@H]1[C@@H](O)[C@H](O)[C@H](CN)O[C@@H]1O[C@H]1[C@@H](O)[C@H](O[C@H]2[C@@H]([C@@H](N)C[C@@H](N)[C@@H]2O)O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CN)O2)N)O[C@@H]1CO PGBHMTALBVVCIT-VCIWKGPPSA-N 0.000 description 1
- 230000008014 freezing Effects 0.000 description 1
- 238000007710 freezing Methods 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 108010025899 gelatin film Proteins 0.000 description 1
- 238000003208 gene overexpression Methods 0.000 description 1
- 229960002518 gentamicin Drugs 0.000 description 1
- 230000035784 germination Effects 0.000 description 1
- XDDAORKBJWWYJS-UHFFFAOYSA-N glyphosate Chemical compound OC(=O)CNCP(O)(O)=O XDDAORKBJWWYJS-UHFFFAOYSA-N 0.000 description 1
- 229940097068 glyphosate Drugs 0.000 description 1
- 239000008187 granular material Substances 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 230000008642 heat stress Effects 0.000 description 1
- 230000002363 herbicidal effect Effects 0.000 description 1
- 239000004009 herbicide Substances 0.000 description 1
- 235000012907 honey Nutrition 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 1
- YQYJSBFKSSDGFO-FWAVGLHBSA-N hygromycin A Chemical compound O[C@H]1[C@H](O)[C@H](C(=O)C)O[C@@H]1Oc1ccc(\C=C(/C)C(=O)N[C@@H]2[C@@H]([C@H]3OCO[C@H]3[C@@H](O)[C@@H]2O)O)cc1O YQYJSBFKSSDGFO-FWAVGLHBSA-N 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000006525 intracellular process Effects 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- XUWPJKDMEZSVTP-LTYMHZPRSA-N kalafungina Chemical compound O=C1C2=C(O)C=CC=C2C(=O)C2=C1[C@@H](C)O[C@H]1[C@@H]2OC(=O)C1 XUWPJKDMEZSVTP-LTYMHZPRSA-N 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 229940049920 malate Drugs 0.000 description 1
- BJEPYKJPYRNKOW-UHFFFAOYSA-N malic acid Chemical compound OC(=O)C(O)CC(O)=O BJEPYKJPYRNKOW-UHFFFAOYSA-N 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000000442 meristematic effect Effects 0.000 description 1
- 210000004688 microtubule Anatomy 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000009456 molecular mechanism Effects 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 239000006259 organic additive Substances 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 210000004681 ovum Anatomy 0.000 description 1
- 108020004410 pectinesterase Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 238000003976 plant breeding Methods 0.000 description 1
- 210000000745 plant chromosome Anatomy 0.000 description 1
- 208000008423 pleurisy Diseases 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000002040 relaxant effect Effects 0.000 description 1
- 230000008521 reorganization Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 230000000630 rising effect Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000012064 sodium phosphate buffer Substances 0.000 description 1
- 238000009331 sowing Methods 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 238000009987 spinning Methods 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000008961 swelling Effects 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 230000017423 tissue regeneration Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000011426 transformation method Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 239000001226 triphosphate Substances 0.000 description 1
- 235000011178 triphosphate Nutrition 0.000 description 1
- UNXRWKVEANCORM-UHFFFAOYSA-N triphosphoric acid Chemical compound OP(O)(=O)OP(O)(=O)OP(O)(O)=O UNXRWKVEANCORM-UHFFFAOYSA-N 0.000 description 1
- 229960004799 tryptophan Drugs 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 239000013598 vector Substances 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 229940089401 xylon Drugs 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8216—Methods for controlling, regulating or enhancing expression of transgenes in plant cells
- C12N15/8222—Developmentally regulated expression systems, tissue, organ specific, temporal or spatial regulation
- C12N15/8223—Vegetative tissue-specific promoters
- C12N15/8226—Stem-specific, e.g. including tubers, beets
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
- C12N9/1205—Phosphotransferases with an alcohol group as acceptor (2.7.1), e.g. protein kinases
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A40/00—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
- Y02A40/10—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
- Y02A40/146—Genetically Modified [GMO] plants, e.g. transgenic plants
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Organic Chemistry (AREA)
- Biomedical Technology (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Cell Biology (AREA)
- Medicinal Chemistry (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
本发明公开了用于改变纤维长度、植物高度和/或植物组织中的植物生物质的核酸构建体和方法。植物用编码拟南芥细胞壁关联激酶基因的构建体进行遗传工程化,当该基因在形成层/木质部优先启动子的控制下超表达时改变纤维长度和/或植物高度。带有细胞壁关联激酶基因的植物转化体表现出增加的纤维长度,这被认为是用于改善木质树木的制浆和造纸性能的一种性状。
Description
相关申请的交叉引用
本申请要求2006年12月20日提交的60/871048号美国临时申请的优先权,该临时申请的公开内容完整引入作为参考。
技术领域
本发明涉及分子生物学的领域和转化植物中基因表达的改变。更具体地,本发明涉及通过调控编码细胞壁关联激酶(wall-associatedkinase)(WAK)的基因的表达改变具有工业价值的植物中的纤维长度和/或植物高度。
背景技术
对木制品和木材来源的产品的持续增长的需要构成了一个全球范围的问题。据估计,现在世界森林采伐量早已达到了最大的可持续速率。因此,存在着对更多的木本植物的急迫需要,以及开发提高林业植物的农业性状(如更高的植物高度。更大的生物质产量和更长的木质部纤维长度)的方法的需要。例如,纤维均匀度和强度是大多数工业应用的共同要求。在制浆中,强度特性部分地由纤维长度确定。长的纤维由于其强度和结合性能而对于高强度纸的生产、提高纸浆产量和减少碱消耗量等方面是理想的。
我们可以举出桉(Eucalyptus)树作为说明木本植物重要性的例子,桉树是全球用于造纸工业的纤维的最大来源(Bamber,1985,Appita 38:210-216)。据估计,一千万至一千五百万公顷的土地种植有桉树。Verhaegen和Plomion,1996,Genome 39:1051-1061。桉树的主要优点是其非常高的生长率和在大范围条件下生长的能力,在热带和温带条件下都能生长。但是,与其它来源(如松树)的纤维相比,桉树纤维具有一项缺点,就是其长度显著较短。因此,由桉树纸浆制成的纸通常强度较弱且通常需要用其它来源的较长纤维进行强化,从而增加了生产成本。
纤维长度受到细胞延长(一种由内膨压和细胞壁的机械强度之间的相互作用引起的过程)的内源性调节的控制,但其机理和所涉及的基因还没有完全弄清楚。
木质部纤维细胞产生自位于维管形成层内的早已大量相当延长的纺锤状原始细胞。它们通过其径向壁的扩展而增大其直径,且另外通过侵入性顶端生长产生纤维细胞延长,这导致细胞长度高达数倍的增加。Gray-Mitsumune等,2004,Plant Physiol.135:1552-1564。
在顶端生长细胞中,膨胀发生在细胞表面的小的区域内,这导致管状的、细长的细胞。例如,杨树纤维在木质部的径向膨胀带中侵入性地延长,当完全分化时其长度平均达到其原始细胞长度的150%。Hussey等,2006,Annu.Rev.Plant Boil.57:109-125;Mellerowicz等,2001,Plant Mol.Biol.47:239-274。
纤维细胞的快速膨胀可能通过膨压对细胞壁施加的推压和细胞壁的松弛的协同作用而实现。在棉纤维中,细胞延长阶段接着膨压的显著上升,这源于观察到的苹果酸盐、糖和K+(主要的渗压剂(osmoticum))的积聚,因此导致水的内流和纤维细胞内高膨压的产生。Ruan等,2004,Plant Physiol.136:4104-4113。
液泡转化酶可以在膨压维持和细胞壁膨胀中具有重要作用。最近用拟南芥(Arabidopsis thaliana)进行的工作表明细胞壁关联激酶(WAK)可能调节液泡转化酶,因此在WAK和液泡转化酶之间建立了跨区室的联系。Kohorn等,2006,Plant J.46:307-316。
在拟南芥中,WAK由五个紧密连接且高度相似的基因进行编码,并在叶、分生组织和发生膨胀的细胞中表达。Wagner和Kohorn,2001,Plant Cell 13:303-318。
存在T-DNA插入WAK2基因中的突变的拟南芥苗比野生型的植物显著较矮,根比胚轴受到更大的影响。Kohorn等,2006,Plant J.46:307-316。
这些突变植物显示出液泡转化酶活性降低62%,且作者提出WAK2对作为调整溶质浓度的机制和膨压调节的一个要素的液泡转化酶的转录进行调控,因此提供了WAK调节细胞膨胀的一种可能的机制。
在拟南芥中可诱导的反义WAK2的表达导致WAK蛋白水平的50%的下降,产生细胞延长的后续损失,并因此产生矮生植物。当反义WAK4基因用于降低总WAK蛋白水平时也报道了类似的结果。Wagner和Kohorn,2001,Plant Cell 13:303-318;Lally等,2001,PlantCell 13:1317-1331。
现在也知道,细胞壁关联激酶包含可以连接到细胞壁的果胶分子上的细胞外域、跨越质膜且具有胞质丝氨酸/苏氨酸激酶域。He等,1999,Plant Mol.Biol.39:1189-1196。
当纤维在两末端发生显著延长(侵入性顶端生长)时,胞间层的性质限制了这一类型的细胞生长。发育的木质细胞的胞间层富含果胶,且侵入性顶端生长需要胞间层的分解。参见Berthold等,WO2006/068603。
通过其果胶连结,WAK有可能感知细胞壁环境中的变化,因此提供了一种将细胞壁感知与溶质代谢的调节联系起来的分子机制,已知其随后参与生长细胞的膨压维持和细胞膨胀。这种信息对于细胞膨胀或膨压的调节可能是相当有用的。Huang等,2007,FunctionalPlant Biology,34:499-507。
纤维的特性受到一组复杂的遗传因素的控制并且不容易采用经典育种方法育种。通过传统的林木育种,获得某些纤维特性的改良是可能的。例如,已经培育出比亲本树种具有更长的纤维的杨树种间三倍体杂交种。Aziz等,1996,Wood and pulp properties of aspen andits hybrids.TAPPI Proc.Pilping Conference.P.437-443。还有,考虑到传统林木育种的缺陷,如由于其长的世代周期导致的缓慢进展和产生具有理想性状的植物的难度,基因技术的发展可以显著缩短产生植物的新品种所需要的时间并使得能够在特定的树种中更紧密地以林业和制浆工业需要的性状为目标。
发明内容
在一个方面中,本发明提供包含与木质部优先启动子可操作地连接的WAK多核苷酸序列的核酸构建体,该启动子引起所述WAK多核苷酸序列的超表达。在一个实施方式中,所述木质部优先启动子选自TUB基因启动子、SuSy基因启动子、COMT基因启动子和C4H基因启动子。在另一个实施方式中,转基因植物包含所述核酸构建体,且该植物与同种的非转基因植物相比具有纤维长度和/或高度的增加。在进一步的实施方式中,所述植物是双子叶植物、单子叶植物、裸子植物或阔叶树。本发明进一步包括转基因植物的后代,以及由转基因植物制备的木浆和木纤维。
在另一方面中,本发明提供增加纤维长度和/或植物高度的方法,包括:(a)向植物细胞中引入包含与木质部优先启动子可操作地连接的WAK多核苷酸序列的核酸构建体,该木质部优先启动子引起所述WAK多核苷酸序列的超表达;(b)在促进植物生长的条件下培养所述植物细胞;和(c)选择与同种的非转基因植物相比具有增加的纤维长度和/或植物高度的转基因植物。
附图说明
图1示意性地说明包含驱动本发明的细胞壁关联激酶核苷酸序列表达的形成层/木质部优先启动子的本发明的植物表达质粒载体pALELLYX-WAK。
图2显示了用本发明的植物表达质粒载体pALELLYX-WAK转化的几种转基因家系和相应的对照非转基因植物的纤维长度。星号表示统计学显著较高的平均纤维长度值(P<0.05,t检验)。
图3显示用本发明的植物表达质粒载体pALELLYX-WAK转化的T1转基因植物(51B家系)的两种基因型的纤维长度。星号表示统计学显著较高的平均纤维长度值(P<0.05,t检验)。
图4显示用本发明的植物表达质粒载体pALELLYX-WAK转化的T1转基因植物(47B家系)的两种基因型的纤维长度。星号表示统计学显著较高的平均纤维长度值(P<0.05,t检验)。
图5显示用本发明的植物表达质粒载体pALELLYX-WAK转化的T1转基因植物(51B家系)的三种基因型的植物高度。星号表示统计学显著较高的平均植物高度值(P<0.05,t检验)。
具体实施方式
本发明涉及植物纤维长度和/或增大植物高度的基因操作的方法。
植物细胞壁是赋予各细胞稳定外形的高强度纤维网络。为增大,细胞选择性地松弛这一网络,使得它能够顺从由细胞膨压产生的膨胀力。随着细胞膨胀,存在着对膨压的补偿调节的更大的需要,这取决于细胞溶质代谢。
细胞壁关联激酶(WAK)可以通过其与果胶的连结感知细胞壁膨胀,从而如上面所概述的提供将这些信号转导到调节溶质变化的系统的机制。但是,先前在WAK上的工作并没有预示着植物中组织特异性方式的WAK基因的超表达导致纤维长度的显著变化以及植物高度的显著变化。该结果开启了改变对植物纤维、林业、制浆和造纸工业极重要的性状的途径。
因此,按照本发明的一个方面,提供了通过控制细胞壁关联激酶的活性改变植物组织中,如木质被子植物木质部的纤维细胞、裸子植物木质部的管胞细胞和棉籽的纤维细胞中的纤维长度的方法。按照本发明的这一方面,植物细胞或整个植株用细胞壁关联激酶编码序列遗传工程化,该序列在被子植物的木质纤维细胞、裸子植物的木质管胞或棉籽的纤维细胞中表达时引起细胞长度的增加。
本发明中使用的所有技术术语为生物化学、分子生物学和农学中常用的术语,且可以被本发明所属领域的普通技术人员理解。那些技术术语可以在以下材料中找到:MOLECULAR CLONING:ALABORATORY MANUAL,第三版,vol.1-3,编者Sambrook和Russel,Cold Spring Harbor Laboratory Press,Cold Spring Harbor,N.Y.,2001;CURRENT PROTOCOLS IN MOLECULAR BIOLOGY,编者Ausubel等,Greene Publishing Associates and Wiley-Interscience,NewYork,1988(具有定期更新);SHORT PROTOCOLS IN MOLECULARBIOLOGY:A COMPENDIUM OF METHODS FROM CURRENTPROTOCOLS IN MOLECULAR BIOLOGY,第五版,vol.1-2,编者Ausubel等,John Wiley&Sons,Inc.,2002;GENOME ANALYSIS:ALABORATORY MANUAL,vol.1-2,编者Green等,Cold SpringHarbor Laboratory Press,Cold Spring Harbor,N.Y.,1997。涉及植物生物学技术的方法学在本文中进行描述并在专著中详细说明,如METHODS IN PLANT MOLECULAR BIOLOGY:A LABORATORYCOURSE MANUAL,编者Maliga等,Cold Spring Harbor LaboratoryPress,Cold Spring Harbor,N.Y.,1995。在例如Innis等,PCRPROTOCOLS:A GUIDE TO METHODS AND APPLICATIONS,Academic Press,San Diego,1990及Dieffenbach和Dveksler,PCRPRIMER:A LABORATORY MANUAL,第二版,Cold Spring HarborLaboratory Press,Cold Spring Harbor,N.Y.,2003中描述了各种使用PCR的技术。PCR-引物对可通过已知技术,如使用为该目的设计的计算机程序(例如Primer,Version 0.5,1991,Whitehead Institute forBiomedical Research,Cambridge,MA)从已知序列获得。例如,在Beaucage和Caruthers,1981,Tetra.Letts.22:1859-1862和Matteucci和Caruthers,1981,J.Am.Chem.Soc.103:3185中描述了核酸化学合成的方法。
如在Sambrook等,MOLECULAR CLONING:A LABORATORYMANUAL,第二版(1989),Cold Spring Harbor Laboratory Press中所描述的进行限制性酶消化、磷酸化、连接和转化。除非另外说明,所有用于细菌细胞的生长和维持的试剂和材料都从AldrichChemicals(Milwaukee,WI)、DIFCO Laboratories(Detroit,MI)、Invitrogen(Gaithersburg,MD)或Sigma Chemical Company(St.Louis,MO)获得。
术语“编码”和“译码”指的是基因通过转录和翻译的机制向细胞提供信息的过程,由此一系列氨基酸可组装成特定的氨基酸序列以产生活性酶。由于遗传密码的简并性,DNA序列中的特定碱基变化不改变蛋白质的氨基酸序列。因此可以理解,对蛋白质的功能特性不产生实质影响的编码细胞壁关联激酶的DNA序列的改变是可预想的。
在本说明书中,“表达”表示由基因编码的蛋白质产物的产生。可选择地或另外地,“表达”表示由编码DNA分子发生的细胞内过程(包括转录和翻译)的组合以产生多肽。“超表达”指的是特定基因序列的表达,其中转基因生物体中mRNA或多肽的产生超过非转基因生物体中的产生水平。
术语“异源核酸”指的是通过人的努力引入到细胞(或细胞的原型)中的核酸、DNA或RNA。这种外源的核酸可以是其所引入的细胞中天然发现的序列的副本或其片段。
相反,术语“内源核酸”指的是存在于待遗传工程化的植物或生物体中的核酸、基因、多核苷酸、DNA、RNA、mRNA或cDNA。内源序列是待遗传工程化的植物或生物体“本身的”序列,即待遗传工程化的植物或生物体原产的序列。
术语“同源序列”指的是由于共同祖先和序列保守性而类似的多核苷酸或多肽序列。
术语“功能性同源物”指的是由于共同祖先和序列保守性而类似且在催化、细胞或生物体水平上具有相同或类似的功能的多核苷酸或多肽序列。
细胞壁关联激酶序列
在本说明书中,术语“细胞壁关联激酶多核苷酸序列”表示编码细胞壁关联激酶多肽的任何核酸、基因、多核苷酸、DNA、RNA、mRNA或cDNA分子,细胞壁关联激酶多肽的超表达改变纤维长度和/或植物高度。DNA或RNA可以是双链或单链的。单链DNA可以是编码链,也称为有义链,或者可以是非编码链,也称为反义链。说明这一类别的是包含从拟南芥鉴别的SEQ ID NO:1、3、5、7和9且可以用于增加纤维长度和/或植物高度的多核苷酸分子。
适用于本发明的细胞壁关联激酶多核苷酸序列可以从特征为存在WAK基因的极大数量的生物体中鉴别出来。虽然本发明中公开了前述核苷酸序列,但它们并不对本发明构成限制。因此,WAK序列可以进行鉴别并通过序列比较进行功能性解释。技术人员可以轻易使用向公众提供的序列分析程序和参数在合适的数据库(如GenBank)中鉴别功能相关的WAK序列。或者,采用基于本发明公开的DNA或蛋白质序列的合适杂交探针或引物筛查cDNA文库或基因组文库应导致功能相关的WAK序列(功能性同源物)的鉴别。本领域中还理解的是,具有较低的同一性水平的序列也可以借助于简并的寡核苷酸和基于PCR的方法进行分离。尽管本发明的多核苷酸是从拟南芥分离的,来自其它植物的功能性同源物也可用于产生具有增加的纤维长度和/或植物高度的植物。可以从中分离WAK基因的植物品种的例子包括双子叶植物,例如葫芦科(Cucurbitaceae)、茄科(Solanaceae)、十字花科(Brassicaceae)、蝶形花科(Papilionaceae)如紫花苜蓿(alfalfa)和豇豆(Vigna unguiculata)、锦葵科(Malvaceae)、菊科(Asteraceae)、金虎尾科(Malpighiaceae)如杨属(Populus)、桃金娘科(Myrtaceae)如桉属,和单子叶植物,例如禾本科(Gramineae),包括水稻、小麦、甘蔗、大麦和玉米。
在本说明书中,术语“细胞壁关联激酶多核苷酸序列”、“WAK多核苷酸序列”和“WAK DNA序列”也指具有能够在严格条件下与本发明公开的任何序列杂交且能够编码具有与包含本发明公开的SEQ ID NO:2、4、6、8或10氨基酸序列的蛋白质等同的WAK活性的多肽的核苷酸序列的任何核酸分子。该术语还包括与SEQ IDNO:1、SEQ ID NO:3、SEQ ID NO:5、SEQ ID NO:7或SEQ IDNO:9交叉杂交的序列,优选具有与SEQ ID NO:1、3、5、7和9的一种或多种至少65%的同源性或同一性的序列。本发明的核苷酸序列可以编码与本发明公开的SEQ ID NO:2、4、6、8或10中任何一种的预计基因产物同源的蛋白质。另外,本发明的核苷酸序列包括编码具有与本发明公开的SEQ ID NO:2、4、6、8或10中任何一种的氨基酸序列至少55%,优选至少60%,更优选至少70%,更优选至少80%,更优选至少90%和最优选至少95%的序列同一性的氨基酸序列的WAK多肽的序列。遗传密码的简并性使多核苷酸的核苷酸序列中的大多数变异成为可能,同时保持所编码的氨基酸序列。
这里的短语“严格条件”意味着本领域熟知的参数。当单链多核苷酸基于多种良好表征的物理化学力(如氢键、溶剂排斥和碱基堆积)结合时发生杂交。杂交的严格性反映了所涉及的核酸的序列同一性程度,从而严格性越高,两条多核苷酸链越相似。严格性受到多种因素的影响,包括存在于杂交液和洗涤液中的温度、盐浓度和组成、有机和非有机添加剂、溶剂等,及孵育(和次数)。本领域普通技术人员可以轻易地通过改变杂交反应和洗涤处理过程中的温度、杂交反应和洗涤处理过程中的盐浓度等等选择这些条件。
对于具有超过100个互补残基的互补核酸的杂交,在Southern或Northern印迹分析的滤膜上,“严格”杂交条件的例子是在限定的离子强度和pH下低于特定序列的热解链温度(Tm)5℃-20℃的温度。Tm是在限定的离子强度和pH下50%的目标序列与完美匹配的探针杂交的温度。在严格条件下杂交的核酸分子通常与基于完整cDNA或所选择的部分的探针杂交。更优选地,这里的“严格条件”指的是本领域中熟知的参数,例如65℃下在3.5x SSC、1x Denhardt′s液、25mM磷酸钠缓冲液(pH 7.0)、0.5%SDS和2mM EDTA中杂交18小时,随后在65℃下在2x SSC和0.1%SDS中对滤膜进行四次的20分钟洗涤,并在0.5x SSC和0.1%SDS或者0.3x SSC和0.1%SDS中进行最多20分钟的最终洗涤以达到更高的严格性,且0.1xSSC和0.1%SDS用于甚至更高的严格性。可以使用其它条件替代,只要严格性的程度等于本发明中使用0.5x SSC的最终洗涤提供的程度。对于较低相关同源物的鉴别,洗涤可以在较低温度(例如,50℃)下进行。一般,严格性通过升高洗涤温度和/或降低SSC的浓度而提高。
另外,合适的细胞壁关联激酶序列的类别包括由具有一个或多个缺失、置换、插入或添加的碱基的SEQ ID NO:1或3或5或7或9的变异体构成的核酸分子,该变异体编码在超表达时导致纤维长度和/或植物高度改变的多肽。这里所称的“具有一个或多个缺失、置换、插入或添加的碱基的碱基序列”为本领域普通技术人员广泛已知,甚至在通常具有生理活性的蛋白质的氨基酸序列具有一个或多个置换、缺失、插入或添加的氨基酸时保持生理活性。例如,聚A尾或者5’或3’末端非翻译区域可以被删除,且碱基可以被删除到氨基酸被删除的程度。碱基也可以被置换,只要不导致移码。碱基也可以被“添加”到氨基酸被添加的程度。但是,任何这类修饰不导致生理活性的丧失是非常重要的。在该环境中的修饰DNA可以通过改变本发明的DNA碱基序列而获得,从而例如,在特定位置的氨基酸通过定点诱变被置换、删除、插入或添加。Zoller&Smith,1982,Nucleic Acid Res.10:6487-6500。因此,术语“变异体”是与特定基因或蛋白质的标准的或给定的核苷酸或氨基酸序列偏离的核苷酸或氨基酸序列。变异体可以具有“保守的”改变,其中置换的氨基酸具有类似的结构或化学性质,例如亮氨酸被异亮氨酸取代。变异体可以具有“非保守的”改变,例如甘氨酸被色氨酸取代。类似的小变异也可以包括氨基酸缺失或插入,或者包括两者。确定哪些氨基酸残基可以被置换、插入或删除的指引可以使用本领域中公知的计算机程序,如Vector NTI Suite(InforMax,MD)软件找到。“变异体”还可以指“改组的基因(shuffled gene)”,例如在美国专利6506603、6132970、6165793和6117679中描述的。
获得WAK DNA序列的进一步途径是通过,例如使用合适的cDNA序列作为模板从合适的碱基从头合成。
核酸构建体
本发明包括包含一种或多种本发明的核酸序列的重组构建体。该构建体通常包含沿正或反方向插入核酸序列的载体,如质粒、粘粒、噬菌体、病毒(例如,植物病毒)、细菌人工染色体(BAC)、酵母人工染色体(YAC)等等。众多的合适载体是已知的且可商购得到,因此不需要在此反复说明。
重组核酸构建体可以使用标准技术制得。例如,用于转录的核苷酸序列可以通过用限制性内切酶处理包含所述序列的载体以切出适当的片段而获得。用于转录的核苷酸序列还可以通过退火和结合合成的寡核苷酸或通过在聚合酶链式反应(PCR)中使用合成的寡核苷酸以在各末端得到合适的限制性位点而产生。然后核苷酸序列克隆到包含合适的调控元件(如上游启动子和下游终止子序列)的载体中。通常,植物转化载体包括一个或多个在5’和3’调控序列的转录控制下的克隆植物编码序列(基因组的或cDNA)及一个可选择的标志。这类植物转化载体通常还包含启动子、转录启动开始位点、RNA加工信号(如剪接信号序列)、转录终止位点和/或多腺苷酸化信号。还可以存在增强子和导向序列。
本发明提供很可能引起转化的植物中纤维长度和植物高度改变的核酸分子。本发明的一个重要方面是其中细胞壁关联激酶编码核苷酸序列与一个或多个启动子可操作地连接的核酸构建体的用途,该启动子以组成型方式或在特定的细胞类型、器官或组织中驱动细胞壁关联激酶编码序列的表达,从而与非转基因植物的纤维长度相比改变转化植物的纤维长度。
可用于表达适用于本发明的细胞壁关联激酶序列的合适组成型植物启动子包括,但不限于花椰菜花叶病毒(CaMV)35S启动子、玉米和杨属聚泛素启动子,其在大多数植物组织中赋予组成型的高水平表达(参见,例如WO 2007/00611、美国专利5510474号、Odell等,Nature,1985,313:810-812)、胭脂氨酸合酶(nopaline synthase)启动子(An等,1988,Plant Physiol.88:547-552)、来自玄参花叶病毒的FMV启动子(美国专利5378619号)和章鱼氨酸合酶启动子(Fromm等,1989,Plant Cell 1:977-984)。
也可以选择启动子以使得表达在植物发育的确定时间点、或者由外部影响因素确定的时间点、或者以组织特异性的或组织优先的方式发生。例如,它可以确保纤维细胞中的特异性的或优先的表达(棉纤维-、木质部纤维-或额外木质纤维-特异性的或-优先的启动子)。
示例性的棉纤维-特异性的或-优先的启动子包括,例如棉CFACT1基因启动子(美国专利6995256号)、E6基因启动子(美国专利6096950号;John等,1996,Plant mol.Biol.30:297-306;John等,1996,Proc.Natl.Acad.Sci.93:12768-12773);H6基因启动子(John等,1995,Plant Physiol.108:669-676);GhTUB1基因启动子(Li等,2002,Plant Physiol.130:666-674)和FbL2A(Rinehart等,1996,PlantPhysiol.112:1331-1341和John等,1996,Proc.Natl.Acad.Sci.USA 93:12768-12773)。
维管系统-优先的或-特异性的启动子,如木质部优先启动子,可以用于影响本发明中的核酸分子的表达,具体来说在维管组织中,特别是在木质部组织中。因此,“木质部-优先的”意思是本发明的核酸分子在木质部中比在任何其它植物组织中更活跃。所选择的启动子应引起按照本发明的细胞壁关联激酶的超表达,从而改变细胞木质部的长度、改变宿主植物的高度或者同时改变两者。
合适的启动子的例子是,但不限于木质部优先的微管蛋白(TUB)基因启动子、咖啡酸3-O-甲基转移酶基因启动子(COMT)、蔗糖合成酶基因启动子(SuSy)和木质部优先香豆酸-4-羟基化酶(C4H)基因启动子。其它合适的木质部优先启动子在国际专利申请WO2005/096805中公开,该文献通过引用引入本申请。
也可以使用包括赋予组织特异性或组织优先表达的特定核苷酸区域的合成启动子,例如赋予木质部优先的表达的较大启动子内调控元件的鉴别。Seguin等,1997,Plant Mol.Biol.35:281-291;Torres-Schumann等,1996,Plant J.9:283-296和Leyva等,1992,PlantCell 4:263-271。
虽然基因表达率主要由启动子调节,表达的提高也可以通过以独立于方向的方式提高位置接近的基因的表达水平的增强子序列(如基因的内含部分)的鉴别和使用实现。在植物中,在基因构建体中启动子和基因编码序列之间包含一些内含子导致mRNA和蛋白质累积的增加。已知提高植物中的表达的内含子已在玉米基因中鉴别出来,例如hsp70、tubA1、Adh1、Sh1、UbH(Brown和Santino,美国专利5424412和5859347号;Jeon等,2000,Plant Physiol.123:1005-1014;Callis等,1987,Genes Dev.1:1183-1200;Vasil等,1989,Plant Physiol.91:1575-1579),且在双子叶植物基因中,如来自矮牵牛花(Petunia)的rbcS(Dean等,1989,Plant Cell 1:201-208);来自土豆的ST-LS1(Leon等,1991,Plant Physiol.95:968-972)及来自拟南芥的UBQ3(Norris等,1993,Plant Mol.Biol.21:895-906)和PAT1(Rose和Last,1997,Plant J.11:455-464)。
按照本发明的一个方面,细胞壁关联激酶序列被引入适用于植物转化的核酸构建体中。因此,提供包含在植物中有效的转录起始区域控制下的细胞壁关联激酶序列的核酸构建体,从而该构建体可以在宿主植物细胞中产生RNA。优选地,转录起始区域是维管或木质部-优先启动子(如以上所述启动子的任一种)的部分。如上所述,这种核酸构建体可用于改变植物中的细胞壁关联激酶基因表达。
表达载体还可以包含可选择标志,转化细胞可以通过该可选择标志在培养中鉴别。该标志可以与异源核酸分子,即与启动子可操作地连接的基因关联。如本发明中所使用的,术语“标志”指的是编码允许对包含该标志的植物或细胞进行选择或筛选的性状或基因型的基因。在植物中,例如标志基因编码抗生素或除草剂抗性。这能够从未转化或转染的细胞中选择转化的细胞。
合适的可选择标志的例子包括腺苷脱氨酶、二氢叶酸还原酶、潮霉素-B-磷酸转移酶、胸苷激酶、黄嘌呤-鸟嘌呤磷酸核糖基转移酶、草甘膦和草铵膦抗性及氨基-糖苷3’-O-磷酸转移酶(卡那霉素、新霉素和G418抗性)。这些标志可以包括对G418、潮霉素、博来霉素、卡那霉素和庆大霉素的抗性。构建体还可以包含可选择标志基因Bar,其赋予对除草剂膦丝菌素(phosphinothricin)类似物如草丁膦铵的抗性。Thompson等,EMBO J.6:2519-23(1987)。其它合适的选择标志也是已知的。
可以使用可视标志,如绿色荧光蛋白(GFP)。用于基于细胞分裂的控制鉴别或选择转化的植物的方法也已说明。参见John和VanMellaert,WO 2000/052168和Fabijansk等,WO 2001/059086。
也可以包括细菌或病毒源的复制序列以允许将载体克隆到细菌或噬菌体宿主中。优选地,使用广泛宿主范围的原核复制起点。可以包括用于细菌的可选择标志以允许选择携带需要的构建体的细菌细胞。合适的原核可选择标志还包括对抗生素(如卡那霉素或四环素)的抗性。
其它编码另外的功能的DNA序列也可以存在于载体中,如本领域中已知的。例如,当土壤杆菌属(Agrobacterium)为宿主时,可以包括T-DNA序列以利于后续的向植物染色体中的转移和整合。
用于遗传工程的植物
本发明包括植物,特别是阔叶树中的遗传操作以在维管组织中通过引入细胞壁关联激酶基因超表达,优选在木质部-优先或木质部-特异性启动子的控制下。该结果增加纤维长度和植物高度。
在本说明书中,术语“植物”表示可以进行遗传操作的任何含纤维植物材料,包括但不限于分化的或未分化的植物细胞、原生质体、全植物、植物组织或植物器官,或者植物的任何部分如叶、茎、根、芽、块茎、果实、根茎等等。
可以按照本发明工程化的植物包括,但不限于树木,如桉属的种(白桉(E.alba)、白花桉(E.albens),杏仁桉(E.amygdalina)、E.aromaphloia、圆叶桉(E.baileyana)、E.balladoniensis、双脉桉(E.bicostata)、葡萄桉(E.botryoides)、短蕊桉(E.brachyandra)、褐桉(E.brassiana)、短柱桉(E.brevistylis)、布罗韦桉(E.brockwayi)、钝盖赤桉(E.camaldulensis)、E.ceracea、大花序桉(E.cloeziana)、聚果桉(E.coccifera)、异心叶桉(E.cordata)、角蕾桉(E.cornuta)、E.corticosa、常桉(E.crebra)、E.croajingolensis、E.curtisii、山桉(E.dalrympleana)、剥桉(E.deglupta)、大桉(E.delegatensis)、E.delicata、卡瑞桉(E.diversicolor)、E.diversifolia、E.dives、E.dolichocarpa、E.dundasii、邓恩桉(E.dunnii),滨河白桉(E.elata)、E.erythrocorys、E.erythrophloia、E.eudesmoides、E.falcata、E.gamophylla、E.glaucina、蓝桉(E.globulus)、双脉蓝桉(E.globulus subsp.bicostata)、蓝桉原种(E.globulus subsp.globulus)、E.gongylocarpa、巨桉(E.grandis)、巨尾桉杂交种(E.grandis x urophylla)、E.guilfoylei、西达桉(E.gunnii)、E.hallii、E.houseana、E.jacksonii、E.lansdowneana、E.latisinensis、E.leucophloia、白藓叶桉(E.leucoxylon)、E.lockyeri、E.lucasii、直杆蓝桉(E.maidenii)、边缘桉(E.marginata)、E.megacarpa、蜜味桉(E.melliodora)、E.michaeliana、小帽桉(E.microcorys)、小套桉(E.microtheca)、缪勒纤皮桉(E.muelleriana)、亮果桉(E.nitens)、E.nitida、斜叶桉(E.oblique)、E.obtusiflora、西方桉(E.occidentalis)、E.optima、卵叶桉(E.ovata)、E.pachyphylla、雪桉(E.pauciflora)、粗皮桉(E.pellita)、穿叶桉(E.perriniana)、E.petiolaris、弹丸桉(E.pilularis)、E.piperita、阔叶桉(E.platyphylla)、多花桉(E.polyanthemos)、E.populnea、E.preissiana、E.pseudo globulus、E.pulchella、E.radiata、E.radiata subsp.radiata、王桉(E.regnans)、E.risdonii、E.robertsonii、E.rodwayi、河红桉(E.rubida)、赤褐桉(E.rubiginosa)、柳叶桉(E.saligna)、红皮桉(E.salmonophloia)、E.scoparia、银顶白蜡桉(E.sieberi)、E.spathulata、E.staeri、E.stoatei、E.tenuipes、E.tenuiramis、细叶桉(E.tereticornis)、E.tetragona、达尔文纤皮桉(E.tetrodonta)、E.tindaliae、E.torquata、E.umbra、尾叶桉(E.urophylla)、E.vernicosa、多枝桉(E.viminalis)、E.wandoo、韦塔桉(E.wetarensis)、E.willisii、E.willisii subsp.falciformis、E.willisii subsp.willisii、E.woodwardii),杨属种类(银白杨(P.alba)、银白杨大齿白杨杂交种(P.alba x P.grandidentata)、银白杨欧洲山杨杂交种(P.alba x P.tremula)、银白杨欧洲山杨杂交种(变种)(P.alba x P.tremula var.glandulosa)、银白杨美洲山杨杂交种(P.alba x P.tremuloides)、香脂杨(P.balsamifera)、毛果香脂杨(P.balsamifera subsp.trichocarpa)、毛果香脂杨美洲黑杨杂交种(P.balsamifera subsp.trichocarpa x P.deltoids)、缘毛杨(P.ciliate)、美洲黑杨(P.deltoids)、胡杨(P.euphratica)、欧美黑杨(P.euramericana)、杂交颤杨(P.kitakamiensis)、大叶杨(P.lasiocarpa)、苦杨(P.laurifolia)、马氏杨(P.maximowiczii)、毛果马氏杨香脂杨杂交种(P.maximowiczii x P.balsamifera subsp.trichocarpa)、黑杨(P.nigra)、西氏杨大齿白杨杂交种(P.sieboldii x P.grandidentata)、甜杨(P.suaveolens)、川杨(P.szechuanica)、毛白杨(P.tomentosa)、欧洲山杨(P.tremula)、欧洲山杨北美颤杨杂交种(P.tremula x P.tremuloides)、北美颤杨(P.tremuloides)、椅杨(P.wilsonii)、加拿大杨(P.Canadensis)、滇杨(P.yunnanensis)),松类如火炬松(Pinus taeda)、湿地松(Pinus elliotii)、美国黄松(Pinus ponderosa)、小干松(Pinuscontorta)和辐射松(Pinus radiata),花旗松(Pseudotsuga menziesii),美国西部铁杉(加拿大铁杉(Tsuga canadensis)),北美云杉(Piceaglauca),红杉(Sequoia sempervirens),真杉(true fir)如银杉(Abiesamabilis)和香脂冷杉(Abies balsamea)及雪松类如大侧柏(北美乔柏(Thuja plicata))和黄扁柏(Chamaecyparis nootkatensis)。
产生纤维的植物也包括在本发明中。示例性的作物是棉花(草棉种(Gossipium spp.))、亚麻(Linum usitatissimum)、小荨麻(异株荨麻(Urtica dioica))、蛇麻(Humulus lupulus)、椴树类(欧洲小叶椴(Tiliacordata)、欧洲椴(T.x.europaea)和阔叶椴(T.platyphyllus))、鹰爪豆(Spartium junceum)、苎麻(Boehmeria nivea)、楮(Broussonetyapapyrifera),新西兰麻(Phormium tenax)、罗布麻(磁麻(Apocynumcannabinum))、鸢尾类(道氏鸢尾(I.douglasiana)、I.macrosiphon和I.purdyi)、乳草类(马利筋种(Asclepia species))、凤梨、香蕉和其它。还包括饲料作物,如紫花苜蓿、黑麦草、羊茅和三叶草。
在本说明书中,“转基因植物”指的是已经引入核酸序列的植物,核酸序列包括,但不限于正常地不存在于宿主植物基因组中的基因、正常地不转录为RNA或翻译成蛋白质的核酸序列或者希望引入到野生型植物中的任何其它基因或核酸序列,如可能正常地存在于野生型植物中但希望进行遗传工程或改变表达的基因。“转基因植物”类别包括原始转化体和其世系中的包括转化体的植物,例如通过标准基因渗入或另一育种过程的途径。
“杂种植物”指的是由两个亲本植物的杂交(cross)产生的植物或其部分,其中一个亲本是本发明的遗传工程化植物。这种杂交可以通过,例如有性生殖天然发生或通过,例如体外核融合人工产生。植物育种的方法是公知的且在植物生物学领域的普通技术人员的能力范围内。
相反,未进行遗传操作的植物是对照植物,且称为“非转基因”或“对照”植物。非转基因植物可以是其基因组未通过引入包含本发明的多核苷酸序列或其片段的构建体而发生改变的植物。它也可以是从培养细胞或组织再生而没有通过引入包含本发明的多核苷酸序列的构建体而发生在先改变的植物,或者可以包含由转基因植物的自体受精产生的纯合隐性后代(即,不具有转基因的任何副本)。
可以预想,在一些情况中,本发明的转基因植物的基因组通过转基因的稳定引入而扩增。但是在其它情况中,引入的基因取代内源序列。按照本发明的相关优选基因是细胞壁关联激酶DNA序列,例如从拟南芥获得的序列。
遗传工程的方法
按照本发明的构建体可以使用合适的技术引入任何植物细胞中。单子叶和双子叶的被子或裸子植物细胞都可以以本领域已知的各种方法进行遗传工程。例如,参见Klein等,1993,Biotechnology 4:583-590;Bechtold等,1993,C.R.Acad.Sci.Paris 316:1194-1199;Koncz和Schell,1986,Mol.Gen.Genet.204:383-396;Paszkowski等,1984,EMBO J.3:2717-2722;Sagi等,1994,Plant Cell Rep.13:262-266。
例如,可以按照Nagel等,1990,Microbiol Lett 67:325的方法使用土壤杆菌,如根癌土壤杆菌(A.tumefaciens)和发根土壤杆菌(A.rhizogenes)。简单地说,土壤杆菌可以通过例如电穿孔与植物表达载体一起使用,这之后土壤杆菌通过例如公知的叶盘(leaf-disk)方法引入植物细胞中。
用于完成这一任务的其它方法包括,但不限于通过根瘤菌属(Rhizobium)、中华根瘤菌属(Sinorhizobium)或中慢生根瘤菌属(Mesorhizobium)的转化(Broothaerts等,2005,Nature 433:629-633)、电穿孔、基因枪轰击、磷酸钙沉淀、和聚乙二醇融合、转移到发芽花粉粒中、直接转化(Lorz等,1985,Mol.Genet.199:179-182)及本领域中已知的其它方法。如果采用了选择标志如卡那霉素抗性,这使得确定哪些细胞成功转化更容易。
上面讨论的土壤杆菌转化方法已知可用于转化双子叶植物。另外,de la Pena等,1987,Nature 325:274-276;Rhodes等,1988,Science240:204-207和Shimamoto等,1989,Nature 328:274-276(所有这些文献通过引用引入本申请)已经用土壤杆菌转化谷类单子叶植物。另外参见Bechtold和Pelletier,1998,Methods Mol.Biol.82:259-266,其表明真空渗入用于土壤杆菌介导的转化的用途。
蛋白质、多肽或核酸分子在特定细胞中的存在可以进行测量以确定,例如是否细胞已经被成功转化或转染。完成这种分析的能力是公知的,因而不需要在此反复说明。
定量纤维长度和植物高度
单词“纤维”通常用于对共有以下特征的各种各样植物细胞类型进行统一:具有细长的形状和厚细胞壁(通常,但并不总是,描述为次生壁)中具有丰富的纤维素。这种细胞壁可能木质化或可能未木质化,且这类细胞的原生质体可能在成熟时保持存活或不存活。在某些行业中,术语“纤维”通常包括厚壁传导细胞,如导管和管胞,和许多个体纤维细胞的纤维聚集体。为了本发明的目的,术语“纤维”包括:(a)木质部的传导和非传导细胞;(b)木质部外来源的纤维,包括来自韧皮部、树皮、基本组织和表皮的纤维和(c)来自茎、叶、根、种子和花或花序的纤维。
本发明的转基因植物特征是纤维长度增加,且优选高度也增加。遗传工程化植物中纤维长度的增加优选是通过发生细胞膨胀的植物组织中WAK的超表达实现的。在描述本发明的植物时,“纤维长度增加”指的是与野生型植物中纤维细胞的长度相比,植物中纤维细胞的长度量增大。纤维长度的量增大可以通过几种技术测量,例如数字化、Kajaani方法和纤维质量分析仪。Han等,1999,In:KenafProperties,Processing and Products,Mississipi State University,Ag&Bio Engineering,pp 149-167。
本发明的工程植物中的纤维长度比野生型植物的纤维长度长至少5-15%,优选至少10-30%,且最优选至少20-50%。
因为纤维长度增加可能跟随着植物高度的增加,本发明的转基因植物可以具增大的纤维长度和高度。因此在本说明书中,短语“植物高度增加”表示与野生型植物的高度相比植物高度的量增大。本发明的工程植物的高度可以增加到野生型植物高度的大约5%-大约90%,优选大约10%-大约75%,甚至更优选大约15%-大约65%的水平。
************************
下面给出获得细胞壁关联激酶基因以及通过土壤杆菌引入目标基因以产生植物转化体的特定实施例。它们是示例性的且不构成对本发明的限制。
实施例1
从拟南芥分离细胞壁关联激酶DNA序列
(a)从拟南芥茎的RNA制备和cDNA合成
三个月大的拟南芥植物的茎插条切割成小片,在液氮中冷冻并用于通过溴化十六烷三甲基铵(CTAB)提取方法进行RNA提取。Aldrich和Cullis,1993,Plant Mol.Biol.Report,11:128-141。cDNA池用于其中分离的总RNA用作模板的RT-PCR试验,且Superscript II逆转录酶(Invitrogen)和寡(dT)引物用于合成第一链cDNA。如下所述,双链cDNA使用基因特异性引物通过后续的聚合酶反应获得。
(b)引物设计
表现来自拟南芥的细胞壁关联激酶4mRNA的cDNA序列已确认并以登录号NM101974保存于GenBank中。基于这一序列,DNA寡聚体合成为PCR引物,包括编码细胞壁关联激酶4的主ORF的第一密码子ATG附近的区域或终止密码子附近的区域。
引物设计为扩增胞壁关联激酶4的ORF的整个编码区域,即从ATG到翻译终止密码子。引物的序列在下面给出:
WAK_NDE 长度:23 SEQ ID NO:11
CATATGAAAGTGCAGCGTCTGTT
WAK_XBA 长度:23 SEQ ID NO:12
TCTAGATCAGCGGCCTGCTTCAA
(c)PCR扩增
在(a)中获得的cDNA样品用作模板,且在(b)中设计的引物用于PCR。PCR步骤包括在94℃1分钟、50℃1分钟和72℃2分钟的40个循环,接着在72℃7分钟的额外延长步骤。PCR产物通过1.0%琼脂糖的凝胶电泳进行分离,接着进行电泳凝胶的溴化乙锭染色和在UV透射仪中检测扩增带。核实所检测的扩增带并用剃刀切割琼脂糖凝胶。凝胶片转移到1.5mL的微管中,且DNA片段使用GFX PCR净化和凝胶带纯化试剂盒(Amersham)分离和纯化。回收的DNA片段亚克隆到pGEM-T克隆载体(Promega)中,转化到大肠杆菌中,然后用于按照通常方式制备质粒DNA,其随后使用BigDye化学(AppliedBiosystems)通过双脱氧法测序以产生这里公开为SEQ ID NO:1的DNA序列,用于本发明的用途。
实施例2
转基因烟草(Nicotiana tabacum)植物的制备
在上面实施例1中获得的细胞壁关联激酶基因被引入植物宿主中以产生转基因烟草植物。
(a)构建体的制备和土壤杆菌的转化
表达构建体通过用合适的限制性内切酶切割上面实施例1中获得的细胞壁关联激酶基因以包括所有的可译框架并与合适的启动子一起将该基因插入植物转化载体pALELLYX-WAK(图1)中而制备。例如,实施例1中获得的细胞壁关联激酶基因被克隆到上述表达载体中来自美洲黑杨的木质部-优先管状蛋白基因(TUB)启动子的下游,如国际申请WO 2005/096805中所述。获得的表达构建体在大肠杆菌中扩增,然后通过冻融转化到根癌土壤杆菌LBA4404品系中。
(b)土壤杆菌介导的烟草转化
烟草属的种的转化使用Horsh等,1985,Science 227:1229的叶盘方法利用包含可操作地与木质部-优先基因的TUB启动子连接的(a)中获得的细胞壁关联激酶基因的核酸构建体完成。转化体在包含100mg/L的卡那霉素和500mg/L的羧苄青霉素(Sigma)的Murashige和Skoog培养基(Sigma,St.Louis,MO)上选择。使转化的烟草苗在Murashige和Skoog培养基上生根,且随后转移到土壤中并在温室中生长。
(c)外源基因插入宿主植物基因组中的PCR核实
PCR可用于核实基因构建体在转基因植物基因组中的整合。PCR反应混合物在50μL的总体积中含有100ng转化植物的基因组DNA及0.2μM的如上所述的各引物、100μM的各脱氧核糖核苷三磷酸、5μL的PCR缓冲液和2.5单位的AmpliTaq DNA聚合酶(AppliedBiosystems)。循环参数如下:94℃1分钟、50℃1分钟和72℃3分钟,40个循环,加上72℃5分钟的延伸。PCR产物在1%琼脂糖凝胶上进行电泳。
(d)转基因植物中转基因表达水平的确定
半定量RT-PCR用于检测转基因植物的茎组织中细胞壁关联激酶转录物的累积。总RNA使用Aldrich和Cullis,1993,Plant Mol.Biol.Report,11:128-141的CTAB方法从3个月大转基因烟草T0和T1植物的茎插条分离。
cDNA使用Superscript II核糖核酸酶H-RT(Invitrogen,USA)从500ng的总RNA合成。上面描述的引物与作为内部对照以使各样品中使用的总RNA的质量标准化的用于编码甘油醛-3-磷酸脱氢酶(GAPDH)的组成性基因的引物一起使用。PCR以第一链cDNA的12.5倍稀释在以下条件下进行:94℃3分钟和27个循环的94℃1分钟,52-60℃45秒,及72℃1分30秒。
实施例3
在维管组织中超表达细胞壁关联激酶基因的烟草转基因植物中纤维长度的增加
对应于5个月大转基因和对照植物的50%高度的茎区域在70℃下乙酸-过氧化物溶液中浸解48小时或者直到获得单细胞。细胞用番红染色并在装配有与个人计算机连接的照相机(Sony)的显微镜(LeicaDMIL)下检验。细胞(大约每个家系100个)使用“Image Tool”软件直接在筛分机(screen)测量。
已知按照实施例2中详细描述的过程表达转基因的三个转基因事件显示纤维长度的统计学显著的增加(图2)。转基因事件43B与对照植物相比表现出纤维长度21%的增加(P<0.05,t检验)。转基因事件47B与对照植物相比表现出纤维长度19%的增加(图2;P<0.05,t检验)。另外,转基因事件48B与对照植物相比表现出纤维长度15%的增加(图2;P<0.05,t检验)。
值得注意的是,通过果胶甲酯酶基因的超表达增加纤维长度的另一策略(Berthold等,WO 2006/068603)已经获得与对照植物相比转基因植物纤维长度仅5%的增加。
在生长到成熟后,T0事件自我受精(self)以产生T1品系。当与纯合隐性植物相比时,纯合显性的植物在纤维长度上存在着10%的显著增长(P<0.05,t检验)。这些结果在两个不同的家系中观察到(图3和图4)。
实施例4
在维管组织中超表达细胞壁关联激酶基因的烟草转基因植物中植物高度的增加
由转基因植物的自体受精产生的T1后代在播种3周后个别地移栽到花盆中。定期对生长进行测量直到首次开花(植物大约5个月大),并记录为总高度。
表现的结果是在不同品系的纯合显性植物中观察到植物高度增加的实例。对来自事件51B的三个基因型的植物高度进行了对比。纯合显性植物比纯合隐性植物高12%。半合子植物比纯合隐性植物高9%(P<0.05,t检验)(图5)。
实施例5
转基因杨属植物的制备
在上述实施例1中获得的基因引入植物宿主中以产生转基因杨属植物。
(a)构建体的制备和土壤杆菌的转化
表达构建体可以通过用合适的限制性内切酶切割上面实施例1中获得的细胞壁关联激酶基因以包括所有的可译框架,并与合适的启动子一起将该基因插入植物转化载体pALELLYX-WAK(图1)中而制备。例如,实施例1中获得的细胞壁关联激酶基因被克隆到上述表达载体中来自美洲黑杨的木质部-优先管状蛋白基因(TUB)启动子的下游,如国际申请WO 2005/096805中所述。获得的表达构建体在大肠杆菌中扩增,然后通过冻融转化到根癌土壤杆菌LBA4404家系中。
(b)土壤杆菌介导的杨属转化
野生型山杨使用携带包含可操作地与木质部-优先基因(TUB)的启动子连接的实施例1中获得的拟南芥细胞壁关联激酶基因的构建体的根癌土壤杆菌转化。来自体外微繁殖的植物的叶柄和节间茎段用作外植体。转化的苗在包含100mg/L的卡那霉素的再生培养基上选择并使其在Murashige和Skoog培养基上生根。所选择的植物随后转移到土壤中并在温室中生长。
序列表
<110>GERHARDT,Isabel Rodrigues
ARRUDA,Paulo
<120>用于改变纤维长度和/或植物高度的核酸构建体和方法
<130>
<140>
<141>
<150>US 60/871,048
<151>2006-12-20
<160>12
<210>1
<211>2217
<212>DNA
<213>拟南芥
<220>
<221>mRNA
<222>(1)...(2217)
<223>细胞壁关联激酶4,cDNA,完整CD
<400>1
atg aaa gtg cag cgt ctg ttc tta gta gct att ttc tgc ctc tct tat 48
Met Lys Val Gln Arg Leu Phe Leu Val Ala Ile Phe Cys Leu Ser Tyr
1 5 10 15
atg cag ctg gtc aag ggg caa acc ttg cct cgt tgc ccc gaa aaa tgt 96
Met Gln Leu Val Lys Gly Gln Thr Leu Pro Arg Cys Pro Glu Lys Cys
20 25 30
ggc aac gtt aca ctt gag tac cct ttt ggc ttt tct cca ggt tgt tgg 144
Gly Asn Val Thr Leu Glu Tyr Pro Phe Gly Phe Ser Pro Gly Cys Trp
35 40 45
cgt gcc gaa gat cct agt ttc aat ctc agt tgt gtg aac gag aat cta 192
Arg Ala Glu Asp Pro Ser Phe Asn Leu Ser Cys Val Asn Glu Asn Leu
50 55 60
ttc tat aag ggc ctt gaa gtg gtc gaa ata tct cac agc agc cag tta 240
Phe Tyr Lys Gly Leu Glu Val Val Glu Ile Ser His Ser Ser Gln Leu
65 70 75 80
cgc gtc cta tat cct gca tcc tac att tgc tac aac agc aaa gga aag 288
Arg Val Leu Tyr Pro Ala Ser Tyr Ile Cys Tyr Asn Ser Lys Gly Lys
85 90 95
ttc gct aaa ggg act tac tac tgg agt aat cta ggt aat ttg act ctc 336
Phe Ala Lys Gly Thr Tyr Tyr Trp Ser Asn Leu Gly Asn Leu Thr Leu
100 105 110
tcc ggc aac aac acg att act gca tta ggc tgt aac tcg tac gct ttt 384
Ser Gly Asn Asn Thr Ile Thr Ala Leu Gly Cys Asn Ser Tyr Ala Phe
115 120 125
gtg tcc tct aat gga act cga aga aac tca gtt gga tgc ata tca gca 432
Val Ser Ser Asn Gly Thr Arg Arg Asn Ser Val Gly Cys Ile Ser Ala
130 135 140
tgt gat gct ctt tcc cat gaa gca aat gga gaa tgt aat ggt gaa ggc 480
Cys Asp Ala Leu Ser His Glu Ala Asn Gly Glu Cys Asn Gly Glu Gly
145 150 155 160
tgc tgc cag aac ccc gtc cct gca ggg aac aat tgg tta ata gtc aga 528
Cys Cys Gln Asn Pro Val Pro Ala Gly Asn Asn Trp Leu Ile Val Arg
165 170 175
tca tat cgc ttt gac aac gac acg tca gtg caa cct atc tct gag ggt 576
Ser Tyr Arg Phe Asp Asn Asp Thr Ser Val Gln Pro Ile Ser Glu Gly
180 185 190
caa tgc atc tac gcc ttt ctc gtt gaa aat ggc aag ttt aag tac aat 624
Gln Cys Ile Tyr Ala Phe Leu Val Glu Asn Gly Lys Phe Lys Tyr Asn
195 200 205
gct tcg gac aaa tat tct tat ctg cag aat agg aat gtg ggg ttt cct 672
Ala Ser Asp Lys Tyr Ser Tyr Leu Gln Asn Arg Asn Val Gly Phe Pro
210 215 220
gtg gtc ttg gat tgg tct att agg gga gag aca tgt ggg caa gtt gga 720
Val Val Leu Asp Trp Ser Ile Arg Gly Glu Thr Cys Gly Gln ValGly
225 230 235 240
gaa aag aaa tgc ggt gtg aat ggc ata tgt tcc aat tct gct agt ggg 768
Glu Lys Lys Cys Gly Val Asn Gly Ile Cys Ser Asn Ser Ala Ser Gly
245 250 255
atc ggg tat aca tgc aaa tgc aaa gga ggt ttc cag ggg aat cca tat 816
Ile Gly Tyr Thr Cys Lys Cys Lys Gly Gly Phe Gln Gly Asn Pro Tyr
260 265 270
ctt caa aac ggt tgc caa gac atc aat gag tgt act act gct aat cct 864
Leu Gln Asn Gly Cys Gln Asp Ile Asn Glu Cys Thr Thr Ala Asn Pro
275 280 285
atc cat aaa cat aac tgc tcg ggt gac agc acc tgt gaa aac aag ttg 912
lle His Lys His Asn Cys Ser Gly Asp Ser Thr Cys Glu Asn Lys Leu
290 295 300
gga cac ttc cgt tgt aat tgt cga tct cgt tac gaa tta aat acc acc 960
Gly His Phe Arg Cys Asn Cys Arg Ser Arg Tyr Glu Leu Asn Thr Thr
305 310 315 320
act aat acc tgc aaa cct aaa ggc aat cct gaa tac gtt gaa tgg act 1008
Thr Asn Thr Cys Lys Pro Lys Gly Asn Pro Glu Tyr Val Glu Trp Thr
325 330 335
aca att gtt ctt gga acc act atc ggc ttc ttg gtc att ctg ctt gcc 1056
Thr Ile Val Leu Gly Thr Thr Ile Gly Phe Leu Val Ile Leu Leu Ala
340 345 350
att agc tgt ata gaa cat aaa atg aag aac acc aag gac acc gag ctc 1104
Ile Ser Cys Ile Glu His Lys Met Lys Asn Thr Lys Asp Thr Glu Leu
355 360 365
cga caa caa ttc ttc gag caa aat ggt ggc ggc atg ttg atg cag cga 1152
Arg Gln Gln Phe Phe Glu Gln Asn Gly Gly Gly Met Leu Met Gln Arg
370 375 380
ctc tca gga gca ggg cca tca aat gtt gat gtc aaa atc ttc act gag 1200
Leu Ser Gly Ala Gly Pro Ser Asn Val Asp Val Lys Ile Phe Thr Glu
385 390 395 400
gaa gga atg aag gaa gca act gat ggt tat gat gag aac aga atc ttg 1248
Glu Gly Met Lys Glu Ala Thr Asp Gly Tyr Asp Glu Asn Arg Ile Leu
405 410 415
ggc cag gga ggc caa gga aca gtc tac aaa ggt ata tta ccg gac aac 1296
Gly Gln Gly Gly Gln Gly Thr Val Tyr Lys Gly Ile Leu Pro Asp Asn
420 425 430
tcc ata gtt gct ata aag aaa gct cgg ctt gga gac aat agc caa gta 1344
Ser Ile Val Ala Ile Lys Lys Ala Arg Leu Gly Asp Asn Ser Gln Val
435 440 445
gag cag ttc atc aat gaa gtg ctt gtg ctt tca caa atc aac cat agg 1392
Glu Gln Phe Ile Asn Glu Val Leu Val Leu Ser Gln Ile Asn His Arg
450 455 460
aac gtg gtc aag ctc ttg ggc tgc tgt cta gag act gaa gtt ccc ttg 1440
Asn Val Val Lys Leu Leu Gly Cys Cys Leu Glu Thr Glu Val Pro Leu
465 470 475 480
ttg gtc tat gag ttc att tcc agt ggg acc ctt ttc gat cac tta cac 1488
Leu Val Tyr Glu Phe Ile Ser Ser Gly Thr Leu Phe Asp His Leu His
485 490 495
ggt tct atg ttt gat tct tct cta aca tgg gaa cat cgt ttg aga atg 1536
Gly Ser Met Phe Asp Ser Ser Leu Thr Trp Glu His Arg Leu Arg Met
500 505 510
gct gta gaa ata gct gga act ctt gct tat ctt cac tcc tct gct tct 1584
Ala Val Glu Ile Ala Gly Thr Leu Ala Tyr Leu His Ser Ser Ala Ser
515 520 525
ata cca atc atc cat cgc gat atc aaa act gca aat att ctt ctg gat 1632
Ile Pro Ile Ile His Arg Asp Ile Lys Thr Ala Asn Ile Leu Leu Asp
530 535 540
gaa aac tta act gca aaa gta gct gac ttt ggt gct tca agg ctg ata 1680
Glu Asn Leu Thr Ala Lys Val Ala Asp Phe Gly Ala Ser Arg Leu Ile
545 550 555 560
cca atg gat aaa gaa gac ctc gca act atg gtg caa gga act cta ggt 1728
Pro Met Asp Lys Glu Asp Leu Ala Thr Met Val Gln Gly Thr Leu Gly
565 570 575
tac cta gac cca gaa tat tac aac aca ggg ttg cta aac gaa aag agc 1776
Tyr Leu Asp Pro Glu Tyr Tyr Asn Thr Gly Leu Leu Asn Glu Lys Ser
580 585 590
gat gtt tat agc ttt ggg gta gtc cta atg gaa ctg tta tca ggt caa 1824
Asp Val Tyr Ser Phe Gly Val Val Leu Met Glu Leu Leu Ser Gly Gln
595 600 605
aag gca ttg tgc ttt gaa agg cca cag act tca aaa cat ata gtg agt 1872
Lys Ala Leu Cys Phe Glu Arg Pro Gln Thr Ser Lys His Ile Val Ser
610 615 620
tac ttt gcc tca gcc acg aaa gag aat agg ttg cac gag att att gat 1920
Tyr Phe Ala Ser Ala Thr Lys Glu Asn Arg Leu His Glu Ile Ile Asp
625 630 635 640
ggc caa gtg atg aac gag aat aat cag agg gag atc cag aaa gct gca 1968
Gly Gln Val Met Asn Glu Asn Asn Gln Arg Glu Ile Gln Lys Ala Ala
645 650 655
aga att gct gtt gag tgt aca aga ttg acg gga gaa gaa agg cca ggg 2016
Arg Ile Ala Val Glu Cys Thr Arg Leu Thr Gly Glu Glu Arg Pro Gly
660 665 670
atg aag gaa gta gct gca gag ctt gag gcc ttg aga gtc aca aaa acc 2064
Met Lys Glu Val Ala Ala Glu Leu Glu Ala Leu Arg Val Thr Lys Thr
675 680 685
aaa cat aag tgg tca gat gag tat cct gaa cag gag gat act gag cac 2112
Lys His Lys Trp Ser Asp Glu Tyr Pro Glu Gln Glu Asp Thr Glu His
690 695 700
ttg gtt ggt gtt caa aaa tta tca gca caa ggc gaa acc agc agc agc 2160
Leu Val Gly Val Gln Lys Leu Ser Ala Gln Gly Glu Thr Ser Ser Ser
705 710 715 720
att ggc tat gat agt atc agg aat gta gca ata ctg gac att gaa gca 2208
Ile Gly Tyr Asp Ser Ile Arg Asn Val Ala Ile Leu Asp Ile Glu Ala
725 730 735
ggc cgc tga 2217
Gly Arg
<210>2
<211>738
<212>PRT
<213>拟南芥
<220>
<400>2
Met Lys Val Gln Arg Leu Phe Leu Val Ala Ile Phe Cys Leu Ser Tyr
5 10 15
Met Gln Leu Val Lys Gly Gln Thr Leu Pro Arg Cys Pro Glu Lys Cys
20 25 30
Gly Asn Val Thr Leu Glu Tyr Pro Phe Gly Phe Ser Pro Gly Cys Trp
35 40 45
Arg Ala Glu Asp Pro Ser Phe Asn Leu Ser Cys Val Asn Glu Asn Leu
50 55 60
Phe Tyr Lys Gly Leu Glu Val Val Glu Ile Ser His Ser Ser Gln Leu
65 70 75 80
Arg Val Leu Tyr Pro Ala Ser Tyr Ile Cys Tyr Asn Ser Lys Gly Lys
85 90 95
Phe Ala Lys Gly Thr Tyr Tyr Trp Ser Asn Leu Gly Asn Leu Thr Leu
100 105 110
Ser Gly Asn Asn Thr Ile Thr Ala Leu Gly Cys Asn Ser Tyr Ala Phe
115 120 125
Val Ser Ser Asn Gly Thr Arg Arg Asn Ser Val Gly Cys Ile Ser Ala
130 135 140
Cys Asp Ala Leu Ser His Glu Ala Asn Gly Glu Cys Asn Gly Glu Gly
145 150 155 160
Cys Cys Gln Asn Pro Val Pro Ala Gly Asn Asn Trp Leu Ile Val Arg
165 170 175
Ser Tyr Arg Phe Asp Asn Asp Thr Ser Val Gln Pro Ile Ser Glu Gly
180 185 190
Gln Cys Ile Tyr Ala Phe Leu Val Glu Asn Gly Lys Phe Lys Tyr Asn
195 200 205
Ala Ser Asp Lys Tyr Ser Tyr Leu Gln Asn Arg Asn Val Gly Phe Pro
210 215 220
Val Val Leu Asp Trp Ser Ile Arg Gly Glu Thr Cys Gly Gln Val Gly
225 230 235 240
Glu Lys Lys Cys Gly Val Asn Gly Ile Cys Ser Asn Ser Ala Ser Gly
245 250 255
Ile Gly Tyr Thr Cys Lys Cys Lys Gly Gly Phe Gln Gly Asn Pro Tyr
260 265 270
Leu Gln Asn Gly Cys Gln Asp Ile Asn Glu Cys Thr Thr Ala Asn Pro
275 280 285
Ile His Lys His Asn Cys Ser Gly Asp Ser Thr Cys Glu Asn Lys Leu
290 295 300
Gly His Phe Arg Cys Asn Cys Arg Ser Arg Tyr Glu Leu Asn Thr Thr
305 310 315 320
Thr Asn Thr Cys Lys Pro Lys Gly Asn Pro Glu Tyr Val Glu Trp Thr
325 330 335
Thr Ile Val Leu Gly Thr Thr Ile Gly Phe Leu Val Ile Leu Leu Ala
340 345 350
Ile Ser Cys Ile Glu His Lys Met Lys Asn Thr Lys Asp Thr Glu Leu
355 360 365
Arg Gln Gln Phe Phe Glu Gln Asn Gly Gly Gly Met Leu Met Gln Arg
370 375 380
Leu Ser Gly Ala Gly Pro Ser Asn Val Asp Val Lys Ile Phe Thr Glu
385 390 395 400
Glu Gly Met Lys Glu Ala Thr Asp Gly Tyr Asp Glu Asn Arg Ile Leu
405 410 415
Gly Gln Gly Gly Gln Gly Thr Val Tyr Lys Gly Ile Leu Pro Asp Asn
420 425 430
Ser Ile Val Ala Ile Lys Lys Ala Arg Leu Gly Asp Asn Ser Gln Val
435 440 445
Glu Gln Phe Ile Asn Glu Val Leu Val Leu Ser Gln Ile Asn His Arg
450 455 460
Asn Val Val Lys Leu Leu Gly Cys Cys Leu Glu Thr Glu Val Pro Leu
465 470 475 480
Leu Val Tyr Glu Phe Ile Ser Ser Gly Thr Leu Phe Asp His Leu His
485 490 495
Gly Ser Met Phe Asp Ser Ser Leu Thr Trp Glu His Arg Leu Arg Met
500 505 510
Ala Val Glu Ile Ala Gly Thr Leu Ala Tyr Leu His Ser Ser Ala Ser
515 520 525
Ile Pro Ile Ile His Arg Asp Ile Lys Thr Ala Asn Ile Leu Leu Asp
530 535 540
Glu Asn Leu Thr Ala Lys Val Ala Asp Phe Gly Ala Ser Arg Leu Ile
545 550 555 560
Pro Met Asp Lys Glu Asp Leu Ala Thr Met Val Gln Gly Thr Leu Gly
565 570 575
Tyr Leu Asp Pro Glu Tyr Tyr Asn Thr Gly Leu Leu Asn Glu Lys Ser
580 585 590
Asp Val Tyr Ser Phe Gly Val Val Leu Met Glu Leu Leu Ser Gly Gln
595 600 605
Lys Ala Leu Cys Phe Glu Arg Pro Gln Thr Ser Lys His Ile Val Ser
610 615 620
Tyr Phe Ala Ser Ala Thr Lys Glu Asn Arg Leu His Glu Ile Ile Asp
625 630 635 640
Gly Gln Val Met Asn Glu Asn Asn Gln Arg Glu Ile Gln Lys Ala Ala
645 650 655
Arg Ile Ala Val Glu Cys Thr Arg Leu Thr Gly Glu Glu Arg Pro Gly
660 665 670
Met Lys Glu Val Ala Ala Glu Leu Glu Ala Leu Arg Val Thr Lys Thr
675 680 685
Lys His Lys Trp Ser Asp Glu Tyr Pro Glu Gln Glu Asp Thr Glu His
690 695 700
Leu Val Gly Val Gln Lys Leu Ser Ala Gln Gly Glu Thr Ser Ser Ser
705 710 715 720
Ile Gly Tyr Asp Ser Ile Arg Asn Val Ala Ile Leu Asp Ile Glu Ala
725 730 735
Gly Arg
<210>3
<211>2208
<212>DNA
<213>拟南芥
<220>
<221>mRNA
<222>(1)...(2208)
<223>细胞壁关联激酶1,cDNA,完整CDS
<400>3
atg aag gtg cag gag ggt ttg ttc ttg gtg gct att ttc ttc tcc ctt 48
Met Lys Val Gln Glu Gly Leu Phe Leu Val Ala Ile Phe Phe Ser Leu
5 10 15
gcg tgt acg cag ctg gtg aag ggg caa cat caa cct ggt gag aat tgc 96
Ala Cys Thr Gln Leu Val Lys Gly Gln His Gln Pro Gly Glu Asn Cys
20 25 30
caa aat aaa tgt ggc aac atc aca ata gag tac cct ttt ggc att tct 144
Gln Asn Lys Cys Gly Asn Ile Thr Ile Glu Tyr Pro Phe Gly Ile Ser
35 40 45
tca ggt tgt tac tat ccc gga aat gaa agt ttc agt atc acc tgt aag 192
Ser Gly Cys Tyr Tyr Pro Gly Asn Glu Ser Phe Ser Ile Thr Cys Lys
50 55 60
gaa gat agg cca cat gtc tta agc gac att gaa gtg gca aac ttt aat 240
Glu Asp Arg Pro His Val Leu Ser Asp Ile Glu Val Ala Asn Phe Asn
65 70 75 80
cac agc ggc cag cta caa gtt ctg ctt aat cga tcc tct act tgc tac 288
His Ser Gly Gln Leu Gln Val Leu Leu Asn Arg Ser Ser Thr Cys Tyr
85 90 95
gac gag caa gga aaa aaa act gag gag gac agt tct ttt aca ctg gaa 336
Asp Glu Gln Gly Lys Lys Thr Glu Glu Asp Ser Ser Phe Thr Leu Glu
100 105 110
aat tta tct ctt tcc gcc aac aac aag tta act gca gta ggc tgt aac 284
Asn Leu Ser Leu Ser Ala Asn Asn Lys Leu Thr Ala Val Gly Cys Asn
115 120 125
gct tta tca ctt ctg gac act ttt gga atg caa aac tac tca act gca 432
Ala Leu Ser Leu Leu Asp Thr Phe Gly Met Gln Asn Tyr Ser Thr Ala
130 135 140
tgc ttg tca tta tgc gat tct ccc cca gag gct gat gga gaa tgt aat 480
Cys Leu Ser Leu Cys Asp Ser Pro Pro Glu Ala Asp Gly Glu Cys Asn
145 150 155 160
ggt aga ggt tgc tgc aga gtc gac gtt tct gcc ccg ttg gat agc tat 528
Gly Arg Gly Cys Cys Arg Val Asp Val Ser Ala Pro Leu Asp Ser Tyr
165 170 175
aca ttc gaa act aca tca ggt cgc atc aag cac atg act tct ttt cac 576
Thr Phe Glu Thr Thr Ser Gly Arg Ile Lys His Met Thr Ser Phe His
180 185 190
gac ttt agt cct tgc acc tac gct ttt ctc gtt gaa gat gat aag ttc 624
Asp Phe Ser Pro Cys Thr Tyr Ala Phe Leu Val Glu Asp Asp Lys Phe
195 200 205
aac ttc agt tct aca gaa gat ctt ctg aat ctg cga aat gtc atg agg 672
Asn Phe Ser Ser Thr Glu Asp Leu Leu Asn Leu Arg Asn Val Met Arg
210 215 220
ttc cct gtg tta cta gat tgg tct gtt gga aat cag aca tgc gag caa 720
Phe Pro Val Leu Leu Asp Trp Ser Val Gly Asn Gln Thr Cys Glu Gln
225 230 235 240
gtt gga agc aca agc ata tgc ggt ggg aac agc act tgt ctc gat tct 768
Val Gly Ser Thr Ser Ile Cys Gly Gly Asn Ser Thr Cys Leu Asp Ser
245 250 255
act cct aga aac ggg tat atc tgc aga tgc aat gaa ggc ttt gat ggg 816
Thr Pro Arg Asn Gly Tyr Ile Cys Arg Cys Asn Glu Gly Phe Asp Gly
260 265 270
aat cca tac ctt tca gct ggt tgc caa gac gtc aat gag tgt act act 864
Asn Pro Tyr Leu Ser Ala Gly Cys Gln Asp Val Asn Glu Cys Thr Thr
275 280 285
agt agt act atc cat aga cat aac tgt tcg gat ccc aaa acc tgt aga 912
Ser Ser Thr Ile His Arg His Asn Cys Ser Asp Pro Lys Thr Cys Arg
290 295 300
aac aag gtt gga ggc ttc tat tgt aag tgt caa tct ggt tac cgc tta 960
Asn Lys Val Gly Gly Phe Tyr Cys Lys Cys Gln Ser Gly Tyr Arg Leu
305 310 315 320
gat acc acc act atg agc tgc aag cgt aaa gag ttt gca tgg act aca 1008
Asp Thr Thr Thr Met Ser Cys Lys Arg Lys Glu Phe Ala Trp Thr Thr
325 330 335
att ctt ctt gta acc acc atc ggc ttc ttg gtc att ctg ctt ggc gtt 1056
Ile Leu Leu Val Thr Thr Ile Gly Phe Leu Val Ile Leu Leu Gly Val
340 345 350
gcc tgt ata caa cag aga atg aag cac ctg aag gac acc aag ctc cga 1104
Ala Cys Ile Gln Gln Arg Met Lys His Leu Lys Asp Thr Lys Leu Arg
355 360 365
gaa caa ttc ttc gag caa aat ggt ggc ggc atg ttg aca caa cga ctc 1152
Glu Gln Phe Phe Glu Gln Asn Gly Gly Gly Met Leu Thr Gln Arg Leu
370 375 380
tca gga gca ggg ccg tca aat gtt gat gtc aaa atc ttt act gag gat 1200
Ser Gly Ala Gly Pro Ser Asn Val Asp Val Lys Ile Phe Thr Glu Asp
385 390 395 400
ggc atg aag aaa gca aca aat ggt tat gct gag agc agg atc ctg ggt 1248
Gly Met Lys Lys Ala Thr Asn Gly Tyr Ala Glu Ser Arg Ile Leu Gly
405 410 415
cag ggt ggc caa gga aca gtg tac aaa ggg ata ttg ccg gac aac tcc 1296
Gln Gly Gly Gln Gly Thr ValTyr Lys Gly Ile Leu Pro Asp Asn Ser
420 425 430
ata gtt gct ata aag aaa gcc cga ctt gga gac agt agc caa gta gag 1344
Ile Val Ala Ile Lys Lys Ala Arg Leu Gly Asp Ser Ser Gln Val Glu
435 440 445
cag ttc atc aat gaa gtg ctc gtg ctt tca caa atc aac cat agg aac 1392
Gln Phe Ile Asn Glu Val Leu Val Leu Ser Gln Ile Asn His Arg Asn
450 455 460
gta gtc aag ctc ttg ggc tgc tgt cta gag act gaa gtt ccc ttg ttg 1440
Val Val Lys Leu Leu Gly Cys Cys Leu Glu Thr Glu Val Pro Leu Leu
465 470 475 480
gtc tat gag ttc atc acc aat ggc aec ctt ttc gat cac ttg cat ggt 1488
Val Tyr Glu Phe Ile Thr Asn Gly Thr Leu Phe Asp His Leu His Gly
485 490 495
tcc atg att gat tct tcg ctt aca tgg gaa cac cgt ctg aag ata gca 1536
Ser Met Ile Asp Ser Ser Leu Thr Trp Glu His Arg Leu Lys Ile Ala
500 505 510
ata gaa gtc gct gga act ctt gca tat ctt cac tcc tct gct tct att 1584
Ile Glu Val Ala Gly Thr Leu Ala Tyr Leu His Ser Ser Ala Ser Ile
515 520 525
cca atc atc cat cgg gat atc aaa act gca aat att ctt ctg gat gta 1632
Pro Ile Ile His Arg Asp Ile Lys Thr Ala Asn Ile Leu Leu Asp Val
530 535 540
aac tta act gca aaa gta gct gac ttt ggt gct tca agg ctg ata cca 1608
Asn Leu Thr Ala Lys Val Ala Asp Phe Gly Ala Ser Arg Leu Ile Pro
545 550 555 560
atg gat aaa gaa gag ctc gaa act atg gtg caa ggc act cta ggt tac 1728
Met Asp Lys Glu Glu Leu Glu Thr Met Val Gln Gly Thr Leu Gly Tyr
565 570 575
cta gac cca gaa tat tac aac aca ggg ttg tta aac gaa aag agc gat 1776
Leu Asp Pro Glu Tyr Tyr Asn Thr Gly Leu Leu Asn Glu Lys Ser Asp
580 585 590
gtt tat agt ttt ggg gtc gtc cta atg gaa ctg ctc tca ggt caa aag 1824
Val Tyr Ser Phe Gly Val Val Leu Met Glu Leu Leu Ser Gly Gln Lys
595 600 605
gca ttg tgc ttt aaa cgg cca cag tcc tca aaa cat ctg gtg agt tac 1872
Ala Leu Cys Phe Lys Arg Pro Gln Ser Ser Lys His Leu Val Ser Tyr
610 615 620
ttt gcg act gcc aca aaa gag aat agg ttg gat gag att att ggc ggc 1920
Phe Ala Thr Ala Thr Lys Glu Asn Arg Leu Asp Glu Ile Ile Gly Gly
625 630 635 640
gaa gtg atg aac gag gat aat ctg aag gag atc cag gaa gct gca aga 1968
Glu Val Met Asn Glu Asp Asn Leu Lys Glu Ile Gln Glu Ala Ala Arg
645 650 655
att gct gca gag tgt aca agg cta atg gga gag gaa agg cca agg atg 2016
Ile Ala Ala Glu Cys Thr Arg Leu Met Gly Glu Glu Arg Pro Arg Met
660 665 670
aaa gaa gta gct gca aag cta gaa gcc ttg agg gtc gaa aaa acc aaa 2064
Lys Glu Val Ala Ala Lys Leu Glu Ala Leu Arg Val Glu Lys Thr Lys
675 680 685
cat aag tgg tcg gat cag tac cct gag gag aat gaa cac ttg att ggt 2112
His Lys Trp Ser Asp Gln Tyr Pro Glu Glu Asn Glu His Leu Ile Gly
690 695 700
ggt cac atc ttg tca gca caa ggc gaa acc agt agc agc att ggc tat 2160
Gly His Ile Leu Ser Ala Gln Gly Glu Thr Ser Ser Ser Ile Gly Tyr
705 710 715 720
gac agc atc aag aat gta gca ata ttg gac att gaa act ggc cgc tga 2208
Asp Ser Ile Lys Asn Val Ala Ile Leu Asp Ile Glu Thr Gly Arg
725 730 735
<210>4
<211>735
<212>PRT
<213>拟南芥
<220>
<400>4
Met Lys Val Gln Glu Gly Leu Phe Leu Val Ala Ile Phe Phe Ser Leu
5 10 15
Ala Cys Thr Gln Leu Val Lys Gly Gln His Gln Pro Gly Glu Asn Cys
20 25 30
Gln Asn Lys Cys Gly Asn Ile Thr Ile Glu Tyr Pro Phe Gly Ile Ser
35 40 45
Ser Gly Cys Tyr Tyr Pro Gly Asn Glu Ser Phe Ser Ile Thr Cys Lys
50 55 60
Glu Asp Arg Pro His Val Leu Ser Asp Ile Glu Val Ala Asn Phe Asn
65 70 75 80
His Ser Gly Gln Leu Gln Val Leu Leu Asn Arg Ser Ser Thr Cys Tyr
85 90 95
Asp Glu Gln Gly Lys Lys Thr Glu Glu Asp Ser Ser Phc Thr Leu Glu
100 105 110
Asn Leu Ser Leu Ser Ala Asn Asn Lys Leu Thr Ala Val Gly Cys Asn
115 120 125
Ala Leu Ser Leu Leu Asp Thr Phe Gly Met Gln Asn Tyr Ser Thr Ala
130 135 140
Gly Arg Gly Cys Cys Arg Val Asp Val Ser Ala Pro Leu Asp Ser Tyr
165 170 175
Thr Phe Glu Thr Thr Ser Gly Arg Ile Lys His Met Thr Ser Phe His
180 185 190
Asp Phe Ser Pro Cys Thr Tyr Ala Phe Leu Val Glu Asp Asp Lys Phe
195 200 205
Asn Phe Ser Ser Thr Glu Asp Leu Leu Asn Leu Arg Asn Val Met Arg
210 215 220
Phe Pro Val Leu Leu Asp Trp Ser Val Gly Asn Gln Thr Cys Glu Gln
225 230 235 240
Val Gly Ser Thr Ser Ile Cys Gly Gly Asn Ser Thr Cys Leu Asp Ser
245 250 255
Thr Pro Arg Asn Gly Tyr Ile Cys Arg Cys Asn Glu Gly Phe Asp Gly
260 265 270
Asn Pro Tyr Leu Ser Ala Gly Cys Gln Asp Val Asn Glu Cys Thr Thr
275 280 285
Ser Ser Thr Ile His Arg His Asn Cys Ser Asp Pro Lys Thr Cys Arg
290 295 300
Asn Lys Val Gly Gly Phe Tyr Cys Lys Cys Gln Ser Gly Tyr Arg Leu
305 310 315 320
Asp Thr Thr Thr Met Ser Cys Lys Arg Lys Glu Phe Ala Trp Thr Thr
325 330 335
Ile Leu Leu Val Thr Thr Ile Gly Phe Leu Val Ile Leu Leu Gly Val
340 345 350
Ala Cys Ile Gln Gln Arg Met Lys His Leu Lys Asp Thr Lys Leu Arg
355 360 365
Glu Gln Phe Phe Glu Gln Asn Gly Gly Gly Met Leu Thr Gln Arg Leu
370 375 380
Ser Gly Ala Gly Pro Ser Asn Val Asp Val Lys Ile Phe Thr Glu Asp
385 390 395 400
Gly Met Lys Lys Ala Thr Asn Gly Tyr Ala Glu Ser Arg Ile Leu Gly
405 410 415
Gln Gly Gly Gln Gly Thr Val Tyr Lys Gly Ile Leu Pro Asp Asn Ser
420 425 430
Ile Val Ala Ile Lys Lys Ala Arg Leu Gly Asp Ser Ser Gln Val Glu
435 440 445
Gln Phe Ile Asn Glu Val Leu Val Leu Ser Gln Ile Asn His Arg Asn
450 455 460
Val Val Lys Leu Leu Gly Cys Cys Leu Glu Thr Glu Val Pro Leu Leu
465 470 475 480
Val Tyr Glu Phe Ile Thr Asn Gly Thr Leu Phe Asp His Leu His Gly
485 490 495
Ser Met Ile Asp Ser Ser Leu Thr Trp Glu His Arg Leu Lys Ile Ala
500 505 510
Ile Glu Val Ala Gly Thr Leu Ala Tyr Leu His Ser Ser Ala Ser Ile
515 520 525
Pro Ile Ile His Arg Asp Ile Lys Thr Ala Asn Ile Leu Leu Asp Val
530 535 540
Asn Leu Thr Ala Lys Val Ala Asp Phe Gly Ala Ser Arg Leu Ile Pro
545 550 555 560
Met Asp Lys Glu Glu Leu Glu Thr Met Val Gln Gly Thr Leu Gly Tyr
565 570 575
Leu Asp Pro Glu Tyr Tyr Asn Thr Gly Leu Leu Asn Glu Lys Ser Asp
580 585 590
Val Tyr Ser Phe Gly Val Val Leu Met Glu Leu Leu Ser Gly Gln Lys
595 600 605
Ala Leu Cys Phe Lys Arg Pro Gln Ser Ser Lys His Leu Val Ser Tyr
610 615 620
Phe Ala Thr Ala Thr Lys Glu Asn Arg Leu Asp Glu Ile Ile Gly Gly
625 630 635 640
Glu Val Met Asn Glu Asp Asn Leu Lys Glu Ile Gln Glu Ala Ala Arg
645 650 655
Ile Ala Ala Glu Cys Thr Arg Leu Met Gly Glu Glu Arg Pro Arg Met
660 665 670
Lys Glu Val Ala Ala Lys Leu Glu Ala Leu Arg Val Glu Lys Thr Lys
675 680 685
His Lys Trp Ser Asp Gln Tyr Pro Glu Glu Asn Glu His Leu Ile Gly
690 695 700
Gly His Ile Leu Ser Ala Gln Gly Glu Thr Ser Ser Ser Ile Gly Tyr
705 710 715 720
Asp Ser Ile Lys Asn Val Ala Ile Leu Asp Ile Glu Thr Gly Arg
725 730 735
<210>5
<211>2199
<212>DNA
<213>拟南芥
<220>
<221>mRNA
<222>(1)...(2199)
<223>细胞壁关联激酶2,cDNA,完整CDS
<400>5
atg aag gta cag gag ggt ttg ttc gtg gtg gct gtt ttc tac ctt gct 48
Met Lys Val Gln Glu Gly Leu Phe Val Val Ala Val Phe Tyr Leu Ala
5 10 15
tat acg cag cta gtc aag ggg caa cct cgc aag gag tgc caa act aga 96
Tyr Thr Gln Leu Val Lys Gly Gln Pro Arg Lys Glu Cys Gln Thr Arg
20 25 30
tgt ggc aat gtc gca gtt gag tac cct ttt ggt act tct cca ggt tgt 144
Cys Gly Asn Val Ala Val Glu Tyr Pro Phe Gly Thr Ser Pro Gly Cys
35 40 45
tac tat ccc gga gat gaa agt ttc aat ctt act tgc aac gag caa gag 192
Tyr Tyr Pro Gly Asp Glu Ser Phe Asn Leu Thr Cys Asn Glu Gln Glu
50 55 60
aag ctc ttc ttt ggc aac atg cca gtc atc aac atg tct ctc agc ggc 240
Lys Leu Phe Phe Gly Asn Met Pro Val Ile Asn Met Ser Leu Ser Gly
65 70 75 80
cag ctt cgt gtt cgg cta gtt aga tcc aga gtt tgc tac gat agt caa 288
Gln Leu Arg Val Arg Leu Val Arg Ser Arg Val Cys Tyr Asp Ser Gln
85 90 95
gga aaa cag act gac tac att gcc cag cgg acc acc ctg ggt aat ttc 336
Gly Lys Gln Thr Asp Tyr Ile Ala Gln Arg Thr Thr Leu Gly Asn Phe
100 105 110
act ctc tct gaa ctt aac aga ttt act gta gta ggt tgt aac agt tac 384
Thr Leu Ser Glu Leu Asn Arg Phe Thr Val Val Gly Cys Asn Ser Tyr
115 120 125
gca ttt ctc cgc aca tct gga gtt gaa aaa tac tca act gga tgc ata 432
Ala Phe Leu Arg Thr Ser Gly Val Glu Lys Tyr Ser Thr Gly Cys Ile
130 135 140
tca ata tgt gat tct gcc aca acg aaa aac gga tca tgt tct ggt gaa 480
Ser Ile Cys Asp Ser Ala Thr Thr Lys Asn Gly Ser Cys Ser Gly Glu
145 150 155 160
ggt tgc tgc cag atc cct gtc cct aga gga tac tct ttt gtc aga gta 528
Gly Cys Cys Gln Ile Pro Val Pro Arg Gly Tyr Ser Phe Val Arg Val
165 170 175
aaa cca cat agc ttt cac aac cat cct act gtg cat ctg ttt aat cct 576
Lys Pro His Ser Phe His Asn His Pro Thr Val His Leu Phe Asn Pro
180 185 190
tgc acc tac gcc ttt ctc gtt gaa gat ggt atg ttt gac ttc cat gct 624
Cys Thr Tyr Ala Phe Leu Val Glu Asp Gly Met Phe Asp Phe His Ala
195 200 205
ttg gaa gat ctc aac aat ctg cga aat gtt act acg ttc cct gta gta 672
Leu Glu Asp Leu Asn Asn Leu Arg Asn Val Thr Thr Phe Pro Val Val
210 215 220
cta gat tgg tct atc gga gac aag act tgc aaa caa gta gaa tac agg 720
Leu Asp Trp Ser Ile Gly Asp Lys Thr Cys Lys Gln Val Glu Tyr Arg
225 230 235 240
ggc gtg tgt ggt ggt aac agc aca tgt ttc gat tct act ggt gga acc 768
Gly Val Cys Gly Gly Asn Ser Thr Cys Phe Asp Ser Thr Gly Gly Thr
245 250 255
ggg tat aac tgc aaa tgt tta gaa ggt ttt gag ggg aat cca tac ctt 816
Gly Tyr Asn Cys Lys Cys Leu Glu Gly Phe Glu Gly Asn Pro Tyr Leu
260 265 270
cca aac ggt tgt caa gac atc aat gaa tgt att agt agt aga cat aac 864
Pro Asn Gly Cys Gln Asp Ile Asn Glu Cys Ile Ser Ser Arg His Asn
275 280 285
tgt tcg gag cat agt acc tgt gaa aac acg aag ggg agc ttc aac tgt 912
Cys Ser Glu His Ser Thr Cys Glu Asn Thr Lys Gly Ser Phe Asn Cys
290 295 300
aac tgc cca tct ggt tac cgc aaa gat tcc ctt aat agc tgt act cgt 960
Asn Cys Pro Ser Gly Tyr Arg Lys Asp Ser Leu Asn Ser Cys Thr Arg
305 310 315 320
aaa gtc agg cct gaa tac ttt aga tgg act caa att ttt ctt gga acc 1008
Lys Val Arg Pro Glu Tyr Phe Arg Trp Thr Gln Ile Phe Leu Gly Thr
325 330 335
acc atc ggc ttc tcg gtt atc atg ctt ggg att agc tgt cta caa cag 1056
Thr Ile Gly Phe Ser Val Ile Met Leu Gly Ile Ser Cys Leu Gln Gln
340 345 350
aaa att aag cac cgg aag aac aca gag ctc cga caa aaa ttc ttc gag 1104
Lys Ile Lys His Arg Lys Asn Thr Glu Leu Arg Gln Lys Phe Phe Glu
355 360 365
caa aat ggt gga ggc atg ttg ata cag cga gtc tcg gga gca ggg cca 1152
Gln Asn Gly Gly Gly Met Leu Ile Gln Arg Val Ser Gly Ala Gly Pro
370 375 380
tca aat gtt gat gtc aaa atc ttc act gag aaa gga atg aag gaa gca 1200
Ser Asn Val Asp Val Lys Ile Phe Thr Glu Lys Gly Met Lys Glu Ala
385 390 395 400
act aat ggt tac cat gag agc aga atc ctg ggt cag gga ggc caa gga 1248
Thr Asn Gly Tyr His Glu Ser Arg Ile Leu Gly Gln Gly Gly Gln Gly
405 410 415
aca gtg tac aaa ggg ata ttg ccg gac aac tcc ata gtt gct ata aag 1296
Thr Val Tyr Lys Gly Ile Leu Pro Asp Asn Ser Ile Val Ala Ile Lys
420 425 430
aaa gct cgg ctt gga aac cgt agc caa gta gag cag ttc atc aac gaa 1344
Lys Ala Arg Leu Gly Asn Arg Ser Gln Val Glu Gln Phe Ile Asn Glu
435 440 445
gtg cta gtg ctt tca caa atc aac cat agg aac gtg gtc aag gtc ttg 1392
Val Leu Val Leu Ser Gln Ile Asn His Arg Asn Val Val Lys Val Leu
450 455 460
ggg tgt tgt tta gag aca gaa gtc ccc ttg ttg gtc tat gag ttc att 1440
Gly Cys Cys Leu Glu Thr Glu Val Pro Leu Leu Val Tyr Glu Phe Ile
465 470 475 480
aac agt ggt acc ctt ttc gat cac ttg cac ggt tcc ttg tat gat tct 1488
Asn Ser Gly Thr Leu Phe Asp His Leu His Gly Ser Leu Tyr Asp Ser
485 490 495
tca ctt aca tgg gag cac cgt ctg agg ata gca aca gaa gta gca gga
Ser Leu Thr Trp Glu His Arg Leu Arg Ile Ala Thr Glu Val Ala Gly 1536
500 505 510
agt ctt gca tat ctt cac tct tct gct tct att cca atc atc cac cga 1584
Ser Leu Ala Tyr Leu His Ser Ser Ala Ser Ile Pro Ile Ile His Arg
515 520 525
gat atc aag act gct aat att ctc ctg gat aaa aac tta act gca aaa 1632
Asp Ile Lys Thr Ala Asn Ile Leu Leu Asp Lys Asn Leu Thr Ala Lys
530 535 540
gta gct gac ttt ggt gca tca aga ttg ata ccg atg gat aaa gag cag 1680
Val Ala Asp Phe Gly Ala Ser Arg Leu Ile Pro Met Asp Lys Glu Gln
545 550 555 560
ctc aca aca ata gtg caa ggc act cta ggt tac cta gac cca gaa tat 1728
Leu Thr Thr Ile Val Gln Gly Thr Leu Gly Tyr Leu Asp Pro Glu Tyr
565 570 575
tac aac aca ggg ttg tta aac gaa aag agc gat gtt tat agt ttt ggg 1776
Tyr Asn Thr Gly Leu Leu Asn Glu Lys Ser Asp Val Tyr Ser Phe Gly
580 585 590
gtc gtc cta atg gaa ctg ctc tca ggt caa aag gca ttg tgt ttc gaa 1824
Val Val Leu Met Glu Leu Leu Ser Gly Gln Lys Ala Leu Cys Phe Glu
595 600 605
aga cca cat tgc cca aaa aat ctt gtg agt tgt ttt gct tct gcc aca 1872
Arg Pro His Cys Pro Lys Asn Leu Val Ser Cys Phe Ala Ser Ala Thr
610 615 620
aag aat aat agg ttc cat gaa att att gat ggg caa gtg atg aat gag 1920
Lys Asn Asn Arg Phe His Glu Ile Ile Asp Gly Gln Val Met Asn Glu
625 630 635 640
gat aac cag aga gag atc cag gaa gct gca aga att gct gca gag tgt 1968
Asp Asn Gln Arg Glu Ile Gln Glu Ala Ala Arg Ile Ala Ala Glu Cys
645 650 655
aca agg cta atg gga gag gaa agg cca agg atg aaa gaa gta gct gca 2016
Thr Arg Leu Met Gly Glu Glu Arg Pro Arg Met Lys Glu Val Ala Ala
660 665 670
gag tta gag gcc ttg aga gtt aaa aca act aaa tat aag tgg tcg gat 2064
Glu Leu Glu Ala Leu Arg Val Lys Thr Thr Lys Tyr Lys Trp Ser Asp
675 680 685
cag tat cgt gag aca ggg gag att gaa cac ttg ctc ggc gtt caa atc 2112
Gln Tyr Arg Glu Thr Gly Glu Ile Glu His Leu Leu Gly Val Gln Ile
690 695 700
ttg tca gca caa ggc gaa acc agt agc agc att ggc tat gac agc atc 2160
Leu Ser Ala Gln Gly Glu Thr Ser Ser Ser Ile Gly Tyr Asp Ser Ile
705 710 715 720
aga aat gta aca aca ttg gac att gaa gct ggc cgt tga 2199
Arg Asn Val Thr Thr Leu Asp Ile Glu Ala Gly Arg
725 730
<210>6
<211>732
<212>PRT
<213>拟南芥
<220>
<400>6
Met Lys Val Gln Glu Gly Leu Phe Val Val Ala Val Phe Tyr Leu Ala
5 10 15
Tyr Thr Gln Leu Val Lys Gly Gln Pro Arg Lys Glu Cys Gln Thr Arg
20 25 30
Cys Gly Asn Val Ala Val Glu Tyr Pro Phe Gly Thr Ser Pro Gly Cys
35 40 45
Tyr Tyr Pro Gly Asp Glu Ser Phe Asn Leu Thr Cys Asn Glu Gln Glu
50 55 60
Lys Leu Phe Phe Gly Asn Met Pro Val Ile Asn Met Ser Leu Ser Gly
65 70 75 80
Gln Leu Arg Val Arg Leu Val Arg Ser Arg ValCys Tyr Asp Ser Gln
85 90 95
Gly Lys Gln Thr Asp Tyr Ile Ala Gln Arg Thr Thr Leu Gly Asn Phe
100 105 110
Thr Leu Ser Glu Leu Asn Arg Phe Thr Val Val Gly Cys Asn Ser Tyr
115 120 125
Ala Phe Leu Arg Thr Ser Gly Val Glu Lys Tyr Ser Thr Gly Cys Ile
130 135 140
Ser Ile Cys Asp Ser Ala Thr Thr Lys Asn Gly Ser Cys Ser Gly Glu
145 150 155 160
Gly Cys Cys Gln Ile Pro Val Pro Arg Gly Tyr Ser Phe Val Arg Val
165 170 175
Lys Pro His Ser Phe His Asn His Pro Thr Val His Leu Phe Asn Pro
180 185 190
Cys Thr Tyr Ala Phe Leu Val Glu Asp Gly Met Phe Asp Phe His Ala
195 200 205
Leu Glu Asp Leu Asn Asn Leu Arg Asn Val Thr Thr Phe Pro Val Val
210 215 220
Leu Asp Trp Ser Ile Gly Asp Lys Thr Cys Lys Gln Val Glu Tyr Arg
225 230 235 240
Gly Val Cys Gly Gly Asn Ser Thr Cys Phe Asp Ser Thr Gly Gly Thr
245 250 255
Gly Tyr Asn Cys Lys Cys Leu Glu Gly Phe Glu Gly Asn Pro Tyr Leu
260 265 270
Pro Asn Gly Cys Gln Asp Ile Asn Glu Cys Ile Ser Ser Arg His Asn
275 280 285
Cys Ser Glu His Ser Thr Cys Glu Asn Thr Lys Gly Ser Phe Asn Cys
290 295 300
Asn Cys Pro Ser Gly Tyr Arg Lys Asp Ser Leu Asn Ser Cys Thr Arg
305 310 315 320
Lys Val Arg Pro Glu Tyr Phe Arg Trp Thr Gln Ile Phe Leu Gly Thr
325 330 335
Thr Ile Gly Phe Ser Val Ile Met Leu Gly Ile Ser Cys Leu Gln Gln
340 345 350
Lys Ile Lys His Arg Lys Asn Thr Glu Leu Arg Gln Lys Phe Phe Glu
355 360 365
Gln Asn Gly Gly Gly Met Leu Ile Gln Arg Val Ser Gly Ala Gly Pro
370 375 380
Ser Asn Val Asp Val Lys Ile Phe Thr Glu Lys Gly Met Lys Glu Ala
385 390 395 400
Thr Asn Gly Tyr His Glu Ser Arg Ile Leu Gly Gln Gly Gly Gln Gly
405 410 415
Thr Val Tyr Lys Gly Ile Leu Pro Asp Asn Ser Ile Val Ala Ile Lys
420 425 430
Lys Ala Arg Leu Gly Asn Arg Ser Gln Val Glu Gln Phe Ile Asn Glu
435 440 445
Val Leu Val Leu Ser Gln Ile Asn His Arg Asn Val Val Lys Val Leu
450 455 460
Gly Cys Cys Leu Glu Thr Glu Val Pro Leu Leu Val Tyr Glu Phe Ile
465 470 475 480
Asn Ser Gly Thr Leu Phe Asp His Leu His Gly Ser Leu Tyr Asp Ser
485 490 495
Ser Leu Thr Trp Glu His Arg Leu Arg Ile Ala Thr Glu Val Ala Gly
500 505 510
Ser Leu Ala Tyr Leu His Ser Ser Ala Ser Ile Pro Ile Ile His Arg
515 520 525
Asp Ile Lys Thr Ala Asn Ile Leu Leu Asp Lys Asn Leu Thr Ala Lys
530 535 540
Val Ala Asp Phe Gly Ala Ser Arg Leu Ile Pro Met Asp Lys Glu Gln
545 550 555 560
Leu Thr Thr Ile Val Gln Gly Thr Leu Gly Tyr Leu Asp Pro Glu Tyr
565 570 575
Tyr Asn Thr Gly Leu Leu Asn Glu Lys Ser Asp Val Tyr Ser Phe Gly
580 585 590
Val Val Leu Met Glu Leu Leu Ser Gly Gln Lys Ala Leu Cys Phe Glu
595 600 605
Arg Pro His Cys Pro Lys Asn Leu Val Ser Cys Phe Ala Ser Ala Thr
610 615 620
Lys Asn Asn Arg Phe His Glu Ile Ile Asp Gly Gln Val Met Asn Glu
625 630 635 640
Asp Asn Gln Arg Glu Ile Gln Glu Ala Ala Arg Ile Ala Ala Glu Cys
645 650 655
Thr Arg Leu Met Gly Glu Glu Arg Pro Arg Met Lys Glu Val Ala Ala
660 665 670
Glu Leu Glu Ala Leu Arg Val Lys Thr Thr Lys Tyr Lys Trp Ser Asp
675 680 685
Gln Tyr Arg Glu Thr Gly Glu Ile Glu His Leu Leu Gly Val Gln Ile
690 695 700
Leu Ser Ala Gln Gly Glu Thr Ser Ser Ser Ile Gly Tyr Asp Ser Ile
705 710 715 720
Arg Asn Val Thr Thr Leu Asp Ile Glu Ala Gly Arg
725 730
<210>7
<211>2226
<212>DNA
<213>拟南芥
<220>
<221>mRNA
<222>(1)...(2226)
<223>细胞壁关联激酶3,cDNA,完整CDS
<400>7
atg aag rrc cag gag ggr grg ttc ttg gtg gtt att ttc ttc ctt gca 48
Met Lys Phe Gln Glu Gly Val Phe Leu Val Val Ile Phe Phe Leu Ala
5 10 15
tat act cag ctt gtg aag ggg caa cat caa cct cgc gaa gat tgt aaa 96
Tyr Thr Gln Leu Val Lys Gly Gln His Gln Pro Arg Glu Asp Cys Lys
20 25 30
ctt aaa tgt gga aac gtc aca ata gag tac cct ttt ggt att tct aca 144
Leu Lys Cys Gly Asn Val Thr Ile Glu Tyr Pro Phe Gly Ile Ser Thr
35 40 45
ggt tgt tac tat ccc gga gat gat aat ttc aat ctc acc tgt gtc gtg 192
Gly Cys Tyr Tyr Pro Gly Asp Asp Asn Phe Asn Leu Thr Cys Val Val
50 55 60
gaa gag aag cta cta ctc ttt ggc atc att caa gtg acc aat att tct 240
Glu Glu Lys Leu Leu Leu Phe Gly Ile Ile Gln Val Thr Asn Ile Ser
65 70 75 80
cac agt ggc cat gta agt gta ctg ttt gaa cga ttc tct gaa tgc tac 288
His Ser Gly His Val Ser Val Leu Phe Glu Arg Phe Ser Glu Cys Tyr
85 90 95
gag cag aaa aat gag act aat gga act gcc ctc ggg tat cag ctg ggt 336
Glu Gln Lys Asn Glu Thr Asn Gly Thr Ala Leu Gly Tyr Gln Leu Gly
100 105 110
agt agt ttc tct ctc tcc tcc aac aac aag ttt act tta gta gga tgt 384
Ser Ser Phe Ser Leu Ser Ser Asn Asn Lys Phe Thr Leu Val Gly Cys
115 120 125
aac gct tta tca ctt ttg agc act ttt gga aag caa aac tac tca act 432
Asn Ala Leu Ser Leu Leu Ser Thr Phe Gly Lys Gln Asn Tyr Ser Thr
130 135 140
gga tgc ttg tca tta tgc aat tct caa cca gag gca aat gga aga tgt 480
Gly Cys Leu Ser Leu Cys Asn Ser Gln Pro Glu Ala Asn Gly Arg Cys
145 150 155 160
aat ggt gta ggt tgc tgc aca aca gag gac ttc tct gtc ccg ttc gat 528
Asn Gly Val Gly Cys Cys Thr Thr Glu Asp Phe Ser Val Pro Phe Asp
165 170 175
agc gat aca ttc caa ttt ggc tca gtt cgc ttg aga aac caa gtt aat 576
Ser Asp Thr Phe Gln Phe Gly Ser Val Arg Leu Arg Asn Gln Val Asn
180 185 190
aat tcc tta gat cta ttt aat act tcg gta tat cag ttt aat cct tgc 624
Asn Ser Leu Asp Leu Phe Asn Thr Ser Val Tyr Gln Phe Asn Pro Cys
195 200 205
acc tac gct ttt ctc gtt gaa gat ggt aag ttt aac ttc gat tct tca 672
Thr Tyr Ala Phe Leu Val Glu Asp Gly Lys Phe Asn Phe Asp Ser Ser
210 215 220
aaa gat ctt aag aat ctg agg aat gtc acg agg ttc cct gtg gca cta 720
Lys Asp Leu Lys Asn Leu Arg Asn Val Thr Arg Phe Pro Val Ala Leu
225 230 235 240
gat tgg tct att gga aac cag aca tgt gag caa gct gga agc aca aga 768
Asp Trp Ser Ile Gly Asn Gln Thr Cys Glu Gln Ala Gly Ser Thr Arg
245 250 255
ata tgc ggt aag aac agc tca tgt tac aat tct act act aga aac ggg 816
Ile Cys Gly Lys Asn Ser Ser Cys Tyr Asn Ser Thr Thr Arg Asn Gly
260 265 270
tat atc tgc aaa tgt aat gaa ggt tat gat ggg aat cca tac cgt tca 864
Tyr Ile Cys Lys Cys Asn Glu Gly Tyr Asp Gly Asn Pro Tyr Arg Ser
275 280 285
gag ggt tgc aaa gac atc gat gag tgt att agt gat aca cat aac tgt 912
Glu Gly Cys Lys Asp Ile Asp Glu Cys Ile Ser Asp Thr His Asn Cys
290 295 300
tcg gat cca aaa acc tgt aga aac agg gat gga ggc ttc gat tgt aag 960
Ser Asp Pro Lys Thr Cys Arg Asn Arg Asp Gly Gly Phe Asp Cys Lys
305 310 315 320
tgt cca tct ggt tac gac tta aac tcc agt atg agc tgc acg agg ccc 1008
Cys Pro Ser Gly Tyr Asp Leu Asn Ser Ser Met Ser Cys Thr Arg Pro
325 330 335
gaa tac aaa cgg act cga att ttt ctt gta atc ata atc ggc gtc ttg 1056
Glu Tyr Lys Arg Thr Arg Ile Phe Leu Val Ile Ile Ile Gly Val Leu
340 345 350
gtc ctc ctg ctt gct gcg atc tgt ata caa cat gca acg aag caa agg 1104
Val Leu Leu Leu Ala Ala Ile Cys Ile Gln His Ala Thr Lys Gln Arg
355 360 365
aag tat acc aag ctc cga cga caa ttc ttt gag caa aat ggt ggt ggc 1152
Lys Tyr Thr Lys Leu Arg Arg Gln Phe Phe Glu Gln Asn Gly Gly Gly
370 375 380
atg ttg ata cag cga ctt tca gga gca ggg ttg tca aac att gat ttc 1200
Met Leu Ile Gln Arg Leu Ser Gly Ala Gly Leu Ser Asn Ile Asp Phe
385 390 395 400
aaa atc ttt act gag gaa ggc atg aaa gag gca act aat ggc tat gat 1248
Lys Ile Phe Thr Glu Glu Gly Met Lys Glu Ala Thr Asn Gly Tyr Asp
405 410 415
gag agc aga atc ttg ggc cag gga ggt caa gga aca gtc tac aaa ggg 1296
Glu Ser Arg Ile Leu Gly Gln Gly Gly Gln Gly Thr Val Tyr Lys Gly
420 425 430
ata ttg ccg gac aac act atc gtt gct ata aag aaa gct cgg ctt gca 1344
Ile Leu Pro Asp Asn Thr Ile Val Ala Ile Lys Lys Ala Arg Leu Ala
435 440 445
gac agt aga caa gta gat cag ttc atc cac gaa gtg ctc gtg ctt tca 1392
Asp Ser Arg Gln Val Asp Gln Phe Ile His Glu Val Leu Val Leu Ser
450 455 460
caa att aac cac agg aac gtg gtc aag atc ttg ggt tgc tgt cta gag 1440
Gln Ile Asn His Arg Asn Val Val Lys Ile Leu Gly Cys Cys Leu Glu
465 470 475 480
act gaa gtc ccc ttg ttg gtc tat gaa ttc att acc aat ggc acc ctt 1488
Thr Glu Val Pro Leu Leu Val Tyr Glu Phe Ile Thr Asn Gly Thr Leu
485 490 495
ttc gat cac ttg cac ggt tcc att ttt gat tct tct ctt aca tgg gaa 1536
Phe Asp His Leu His Gly Ser Ile Phe Asp Ser Ser Leu Thr Trp Glu
500 505 510
cac cgc ctc aga ata gcg ata gaa gtc gct gga act ctt gct tat ctt 1584
His Arg Leu Arg Ile Ala Ile Glu Val Ala Gly Thr Leu Ala Tyr Leu
515 520 525
cac tcc tct gct tct att cca atc atc cat cgc gat atc aaa act gca 1632
His Ser Ser Ala Ser Ile Pro Ile Ile His Arg Asp Ile Lys Thr Ala
530 535 540
aat att ctc ttg gat gaa aac tta act gca aaa gta gcc gac ttt ggc 1680
Asn Ile Leu Leu Asp Glu Asn Leu Thr Ala Lys Val Ala Asp Phe Gly
545 550 555 560
gct tct aag ctt ata cca atg gat aaa gag cag ctc aca act atg gtg 1728
Ala Ser Lys Leu Ile Pro Met Asp Lys Glu Gln Leu Thr Thr Met Val
565 570 575
caa ggc act cta ggc tat tta gac cca gaa tac tat acc aca ggg ctt 1776
Gln Gly Thr Leu Gly Tyr Leu Asp Pro Glu Tyr Tyr Thr Thr Gly Leu
580 585 590
ctg aac gag aag agc gat gtg tat agc ttt ggg gta gtc ctc atg gaa 1824
Leu Asn Glu Lys Ser Asp Val Tyr Ser Phe Gly Val Val Leu Met Glu
595 600 605
ctg ctc tca ggt caa aag gca ttg tgc ttt gaa cgg cca cag gct tca 1872
Leu Leu Ser Gly Gln Lys Ala Leu Cys Phe Glu Arg Pro Gln Ala Ser
610 615 620
aaa cat ttg gtg agt tac ttt gtt tct gcc acg gaa gag aat agg ttg 1920
Lys His Leu Val Ser Tyr Phe Val Ser Ala Thr Glu Glu Asn Arg Leu
625 630 635 640
cat gag att att gac gac caa gtg ttg aac gag gat aat ctg aag gag 1968
His Glu Ile Ile Asp Asp Gln Val Leu Asn Glu Asp Asn Leu Lys Glu
645 650 655
atc cag gaa gct gca aga att gct gca gag tgt aca agg cta atg gga 2016
Ile Gln Glu Ala Ala Arg Ile Ala Ala Glu Cys Thr Arg Leu Met Gly
660 665 670
gag gaa agg cca agg atg aaa gaa gta gct gca aag cta gaa gcc ttg 2064
Glu Glu Arg Pro Arg Met Lys Glu Val Ala Ala Lys Leu Glu Ala Leu
675 680 685
agg gtc gag aaa acc aaa cat aag tgg tcg gat cag tat cct gag gag 2112
Arg Val Glu Lys Thr Lys His Lys Trp Ser Asp Gln Tyr Pro Glu Glu
690 695 700
aat gaa cac ttg att ggt ggt cac atc ttg tct gca caa ggc gaa acc 2160
Asn Glu His Leu Ile Gly Gly His Ile Leu Ser Ala Gln Gly Glu Thr
705 710 715 720
agt agc agc att ggc tat gat agc atc aaa aat gta gca ata ttg gac 2208
Ser Ser Ser Ile Gly Tyr Asp Ser Ile Lys Asn Val Ala Ile Leu Asp
725 730 735
att gaa act ggc cgc tga 2226
Ile Glu Thr Gly Arg
740
<210>8
<211>741
<212>PRT
<213>拟南芥
<220>
<400>8
Met Lys Phe Gln Glu Gly Val Phe Leu Val Val Ile Phe Phe Leu Ala
5 10 15
Tyr Thr Gln Leu Val Lys Gly Gln His Gln Pro Arg Glu Asp Cys Lys
20 25 30
Leu Lys Cys Gly Asn Val Thr Ile Glu Tyr Pro Phe Gly Ile Ser Thr
35 40 45
Gly Cys Tyr Tyr Pro Gly Asp Asp Asn Phe Asn Leu Thr Cys Val Val
50 55 60
Glu Glu Lys Leu Leu Leu Phe Gly Ile Ile Gln Val Thr Asn Ile Ser
65 70 75 80
His Ser Gly His Val Ser Val Leu Phe Glu Arg Phe Ser Glu Cys Tyr
85 90 95
Glu Gln Lys Asn Glu Thr Asn Gly Thr Ala Leu Gly Tyr Gln Leu Gly
100 105 110
Ser Ser Phe Ser Leu Ser Ser Asn Asn Lys Phe Thr Leu Val Gly Cys
115 120 125
Asn Ala Leu Ser Leu Leu Ser Thr Phe Gly Lys Gln Asn Tyr Ser Thr
130 135 140
Gly Cys Leu Ser Leu Cys Asn Ser Gln Pro Glu Ala Asn Gly Arg Cys
145 150 155 160
Asn Gly Val Gly Cys Cys Thr Thr Glu Asp Phe Ser Val Pro Phe Asp
165 170 175
Ser Asp Thr Phe Gln Phe Gly Ser Val Arg Leu Arg Asn Gln Val Asn
180 185 190
Asn Ser Leu Asp Leu Phe Asn Thr Ser Val Tyr Gln Phe Asn Pro Cys
195 200 205
Thr Tyr Ala Phe Leu Val Glu Asp Gly Lys Phe Asn Phe Asp Ser Ser
210 215 220
Lys Asp Leu Lys Asn Leu Arg Asn Val Thr Arg Phe Pro Val Ala Leu
225 230 235 240
Asp Trp Ser Ile Gly Asn Gln Thr Cys Glu Gln Ala Gly Ser Thr Arg
245 250 255
Ile Cys Gly Lys Asn Ser Ser Cys Tyr Asn Ser Thr Thr Arg Asn Gly
260 265 270
Tyr Ile Cys Lys Cys Asn Glu Gly Tyr Asp Gly Asn Pro Tyr Arg Ser
275 280 285
Glu Gly Cys Lys Asp Ile Asp Glu Cys Ile Ser Asp Thr His Asn Cys
290 295 300
Ser Asp Pro Lys Thr Cys Arg Asn Arg Asp Gly Gly Phe Asp Cys Lys
305 310 315 320
Cys Pro Ser Gly Tyr Asp Leu Asn Ser Ser Met Ser Cys Thr Arg Pro
325 330 335
Glu Tyr Lys Arg Thr Arg Ile Phe Leu Val Ile Ile Ile Gly Val Leu
340 345 350
Val Leu Leu Leu Ala Ala Ile Cys Ile Gln His Ala Thr Lys Gln Arg
355 360 365
Lys Tyr Thr Lys Leu Arg Arg Gln Phe Phe Glu Gln Asn Gly Gly Gly
370 375 380
Met Leu Ile Gln Arg Leu Ser Gly Ala Gly Leu Ser Asn Ile Asp Phe
385 390 395 400
Lys Ile Phe Thr Glu Glu Gly Met Lys Glu Ala Thr Asn Gly Tyr Asp
405 410 415
Glu Ser Arg Ile Leu Gly Gln Gly Gly Gln Gly Thr Val Tyr Lys Gly
420 425 430
Ile Leu Pro Asp Asn Thr Ile Val Ala Ile Lys Lys Ala Arg Leu Ala
435 440 445
Asp Ser Arg Gln Val Asp Gln Phe Ile His Glu Val Leu Val Leu Ser
450 455 460
Gln Ile Asn His Arg Asn Val Val Lys Ile Leu Gly Cys Cys Leu Glu
465 470 475 480
Thr Glu Val Pro Leu Leu Val Tyr Glu Phe Ile Thr Asn Gly Thr Leu
485 490 495
Phe Asp His Leu His Gly Ser Ile Phe Asp Ser Ser Leu Thr Trp Glu
500 505 510
His Arg Leu Arg Ile Ala Ile Glu Val Ala Gly Thr Leu Ala Tyr Leu
515 520 525
His Ser Ser Ala Ser Ile Pro Ile Ile His Arg Asp Ile Lys Thr Ala
530 535 540
Asn Ile Leu Leu Asp Glu Asn Leu Thr Ala Lys Val Ala Asp Phe Gly
545 550 555 560
Ala Ser Lys Leu Ile Pro Met Asp Lys Glu Gln Leu Thr Thr Met Val
565 570 575
Gln Gly Thr Leu Gly Tyr Leu Asp Pro Glu Tyr Tyr Thr Thr Gly Leu
580 585 590
Leu Asn Glu Lys Ser Asp Val Tyr Ser Phe Gly Val Val Leu Met Glu
595 600 605
Leu Leu Ser Gly Gln Lys Ala Leu Cys Phe Glu Arg Pro Gln Ala Ser
610 615 620
Lys His Leu Val Ser Tyr Phe Val Ser Ala Thr Glu Glu Asn Arg Leu
625 630 635 640
His Glu Ile Ile Asp Asp Gln Val Leu Asn Glu Asp Asn Leu Lys Glu
645 650 655
Ile Gln Glu Ala Ala Arg Ile Ala Ala Glu Cys Thr Arg Leu Met Gly
660 665 670
Glu Glu Arg Pro Arg Met Lys Glu Val Ala Ala Lys Leu Glu Ala Leu
675 680 685
Arg Val Glu Lys Thr Lys His Lys Trp Ser Asp Gln Tyr Pro Glu Glu
690 695 700
Asn Glu His Leu Ile Gly Gly His Ile Leu Ser Ala Gln Gly Glu Thr
705 710 715 720
Ser Ser Ser Ile Gly Tyr Asp Ser Ile Lys Asn Val Ala Ile Leu Asp
725 730 735
Ile Glu Thr Gly Arg
740
<210>9
<211>2202
<212>DNA
<213>拟南芥
<220>
<221>mRNA
<222>(1)...(2202)
<223>细胞壁关联激酶5,cDNA,完整CDS
<400>9
atg aag gtg cat agt ctg ttc ttg atg gct att ttc ttc tac cta gca 48
Met Lys Val His Ser Leu Phe Leu Met Ala Ile Phe Phe Tyr Leu Ala
5 10 15
tat acg cag ctg gtc aag gcg caa cct cgc gat gat tgc caa act aga 96
Tyr Thr Gln Leu Val Lys Ala Gln Pro Arg Asp Asp Cys Gln Thr Arg
20 25 30
tgt ggt gac gtc cca att gat tac cct ttt ggt att tct aca ggt tgt 144
Cys Gly Asp Val Pro Ile Asp Tyr Pro Phe Gly Ile Ser Thr Gly Cys
35 40 45
tac tac ccc gga gat gat agc ttc aat att acc tgt gag gaa gat aaa 192
Tyr Tyr Pro Gly Asp Asp Ser Phe Asn Ile Thr Cys Glu Glu Asp Lys
50 55 60
cca aat gtc tta agc aac att gaa gtg cta aac ttt aat cat agc ggc 240
Pro Asn Val Leu Ser Asn Ile Glu Val Leu Asn Phe Asn His Ser Gly
65 70 75 80
cag cta cgc ggt ctg att cct cga tcc act gtt tgc tac gac cag caa 288
Gln Leu Arg Gly Leu Ile Pro Arg Ser Thr Val Cys Tyr Asp Gln Gln
85 90 95
aca aat aat gat ttc gag tcc ctc tgg ttt cgg ttg gat aat tta tct 336
Thr Asn Asn Asp Phe Glu Ser Leu Trp Phe Arg Lou Asp Asn Leu Ser
100 105 110
ttc tcc ccc aat aac aag ttt act tta gta ggc tgt aac gct tgg gca 384
Phe Ser Pro Asn Asn Lys Phe Thr Leu Val Gly Cys Asn Ala Trp Ala
115 120 125
ctt cta agc act ttt gga ata caa aac tac tca act gga tgt atg tca 432
Leu Leu Ser Thr Phe Gly Ile Gln Asn Tyr Ser Thr Gly Cys Met Ser
130 135 140
tta tgc gat act ccc ccg ccg cca aat agt aaa tgt aat ggt gtt ggt 480
Leu Cys Asp Thr Pro Pro Pro Pro Asn Ser Lys Cys Asn Gly Val Gly
145 150 155 160
tgc tgc aga aca gag gta tct atc ccc ttg gat agc cat aga att gaa 528
Cys Cys Arg Thr Glu Val Ser Ile Pro Leu Asp Ser His Arg Ile Glu
165 170 175
act caa cca tct cgc ttc gaa aac atg act tcc gtg gag cac ttt aat 576
Thr Gln Pro Ser Arg Phe Glu Asn Met Thr Ser Val Glu His Phe Asn
180 185 190
cct tgc agc tac gct ttt ttc gtt gaa gat ggt atg ttt aac ttc agt 624
Pro Cys Ser Tyr Ala Phe Phe Val Glu Asp Gly Met Phe Asn Phe Ser
195 200 205
tct tta gaa gat ctt aag gat ctg cga aat gtc acg agg ttc cct gtg 672
Ser Leu Glu Asp Leu Lys Asp Leu Arg Asn Val Thr Arg Phe Pro Val
210 215 220
tta cta gat tgg tct att gga aac cag aca tgt gag caa gtt gta ggt 720
Leu Leu Asp Trp Ser Ile Gly Asn Gln Thr Cys Glu Gln Val Val Gly
225 230 235 240
aga aac ata tgt ggt ggg aac agc aca tgt ttt gat tct act cgt gga 768
Arg Asn Ile Cys Gly Gly Asn Ser Thr Cys Phe Asp Ser Thr Arg Gly
245 250 255
aag ggt tat aac tgc aag tgt tta caa ggt ttt gat ggg aat cca tac 816
Lys Gly Tyr Asn Cys Lys Cys Leu Gln Gly Phe Asp Gly Asn Pro Tyr
260 265 270
ctt tcg gac ggt tgc caa gac atc aat gag tgt act acc cgt ata cat 864
Leu Ser Asp Gly Cys Gln Asp Ile Asn Glu Cys Thr Thr Arg Ile His
275 280 285
aac tgt tcg gat acc agc aca tgt gaa aac aca ctt gga agc ttc cat 912
Asn Cys Ser Asp Thr Ser Thr Cys Glu Asn Thr Leu Gly Ser Phe His
290 295 300
tgt cag tgc cca tct ggt tct gat tta aat acc act act atg agc tgc 960
Cys Gln Cys Pro Ser Gly Ser Asp Leu Asn Thr Thr Thr Met Ser Cys
305 310 315 320
att gac aca cct aaa gaa gag cct aag tac tta gga tgg act act gtt 1008
Ile Asp Thr Pro Lys Glu Glu Pro Lys Tyr Leu Gly Trp Thr Thr Val
325 330 335
ctt ctt gga acc acc atc gga ttc tta atc atc ttg ctt acc att agc 1056
Leu Leu Gly Thr Thr Ile Gly Phe Leu Ile Ile Leu Leu Thr Ile Ser
340 345 350
tat ata caa caa aaa atg agg cac cga aaa aac acc gag ctg cga caa 1104
Tyr Ile Gln Gln Lys Met Arg His Arg Lys Asn Thr Glu Leu Arg Gln
355 360 365
caa ttc ttc gag caa aat ggt ggt ggc atg ttg ata cag cga ctc tca 1152
Gln Phe Phe Glu Gln Asn Gly Gly Gly Met Leu Ile Gln Arg Leu Ser
370 375 380
gga gca ggg cca tca aat gtg gat gtc aaa atc ttt act gaa gaa ggc 1200
Gly Ala Gly Pro Ser Asn Val Asp Val Lys Ile Phe Thr Glu Glu Gly
385 390 395 400
atg aag gaa gca act gat ggt tat aat gag agc aga atc cta ggc cag 1248
Met Lys Glu Ala Thr Asp Gly Tyr Asn Glu Ser Arg Ile Leu Gly Gln
405 410 415
gga gga caa gga aca gtc tac aaa ggg ata ttg caa gat aac tcc att 1296
Gly Gly Gln Gly Thr Val Tyr Lys Gly Ile Leu Gln Asp Asn Ser Ile
420 425 430
gtt gct ata aag aaa gct cga ctt gga gac cgt agc caa gta gag cag 1344
Val Ala Ile Lys Lys Ala Arg Leu Gly Asp Arg Ser Gln Val Glu Gln
435 440 445
ttc atc aac gaa gtg cta gtg ctt tca caa ata aac cac agg aac gtg 1392
Phe Ile Asn Glu Val Leu Val Leu Ser Gln Ile Asn His Arg Asn Val
450 455 460
gtc aaa ctc ttg ggc tgt tgt cta gag act gaa gtt ccc ttg ttg gtc 1440
Val Lys Leu Leu Gly Cys Cys Leu Glu Thr Glu Val Pro Leu Leu Val
465 470 475 480
tat gag ttc att tcc agt ggc act ctt ttt gat cac ttg cac ggt tct 1488
Tyr Glu Phe Ile Ser Ser Gly Thr Leu Phe Asp His Leu His Gly Ser
485 490 495
atg ttt gat tct tcg ctt aca tgg gaa cac cgt ctg agg ata gcc ata 1536
Met Phe Asp Ser Ser Leu Thr Trp Glu His Arg Leu Arg Ile Ala Ile
500 505 510
gaa gtt gct gga act ctt gca tat ctt cac tcc tat gct tct att cca 1584
Glu Val Ala Gly Thr Leu Ala Tyr Leu His Ser Tyr Ala Ser Ile Pro
515 520 525
atc atc cac cga gat gtc aag act gct aac att ctc ctc gat gaa aac 1632
Ile Ile His Arg Asp Val Lys Thr Ala Asn Ile Leu Leu Asp Glu Asn
530 535 540
tta act gca aaa gta gct gat ttt ggt gca tca agg ctg ata ccg atg 1680
Leu Thr Ala Lys Val Ala Asp Phe Gly Ala Ser Arg Leu Ile Pro Met
545 550 555 560
gac caa gag cag ctc aca act atg gtt caa gga act ctt ggc tat tta 1728
Asp Gln Glu Gln Leu Thr Thr Met Val Gln Gly Thr Leu Gly Tyr Leu
565 570 575
gac cct gaa tac tac aat aca ggg ctt ctg aac gaa aag agc gat gtt 1776
Asp Pro Glu Tyr Tyr Asn Thr Gly Leu Leu Asn Glu Lys Ser Asp Val
580 585 590
tat agc ttt ggg gta gtc ctc atg gaa ctg ctc tca ggt gaa aag gca 1824
Tyr Ser Phe Gly Val Val Leu Met Glu Leu Leu Ser Gly Glu Lys Ala
595 600 605
tta tgc ttt gaa cgg cca caa agc tca aaa cat cta gtg agt tac ttt 1872
Leu Cys Phe Glu Arg Pro Gln Ser Ser Lys His Leu Val Ser Tyr Phe
610 615 620
gtt tct gcc atg aaa gaa aat agg ttg cat gag att att gac ggt caa 1920
Val Ser Ala Met Lys Glu Asn Arg Leu His Glu Ile Ile Asp Gly Gln
625 630 635 640
gtt atg aac gag tat aat cag agg gag atc cag gaa tct gca aga att 1968
Val Met Asn Glu Tyr Asn Gln Arg Glu Ile Gln Glu Ser Ala Arg Ile
645 650 655
gct gtt gag tgt aca aga att atg gga gag gaa agg cca agt atg aaa 2016
Ala Val Glu Cys Thr Arg Ile Met Gly Glu Glu Arg Pro Ser Met Lys
660 665 670
gaa gta gct gca gag tta gag gcc ttg aga gtc aaa aca acc aaa cat 2064
Glu Val Ala Ala Glu Leu Glu Ala Leu Arg Val Lys Thr Thr Lys His
675 680 685
cag tgg tca gat caa tat ccc aag gag gtt gag cat ttg ctt ggt gtt 2112
Gln Trp Ser Asp Gln Tyr Pro Lys Glu Val Glu His Leu Leu Gly Val
690 695 700
caa atc tta tcg acg caa ggt gat acc agt agc att ggc tat gac agc 2160
Gln Ile Leu Ser Thr Gln Gly Asp Thr Ser Ser Ile Gly Tyr Asp Ser
705 710 715 720
atc cag aat gta aca agg ttg gac att gaa act ggc cgc tga 2202
Ile Gln Asn Val Thr Arg Leu Asp Ile Glu Thr Gly Arg
725 730
<210>10
<211>733
<212>PRT
<213>拟南芥
<220>
<400>10
Met Lys Val His Ser Leu Phe Leu Met Ala Ile Phe Phe Tyr Leu Ala
5 10 15
Tyr Thr Gln Leu Val Lys Ala Gln Pro Arg Asp Asp Cys Gln Thr Arg
20 25 30
Cys Gly Asp Val Pro Ile Asp Tyr Pro Phe Gly Ile Ser Thr Gly Cys
35 40 45
Tyr Tyr Pro Gly Asp Asp Ser Phe Asn Ile Thr Cys Glu Glu Asp Lys
50 55 60
Pro Asn Val Leu Ser Asn Ile Glu Val Leu Asn Phe Asn His Ser Gly
65 70 75 80
Gln Leu Arg Gly Leu Ile Pro Arg Ser Thr Val Cys Tyr Asp Gln Gln
85 90 95
Thr Asn Asn Asp Phe Glu Ser Leu Trp Phe Arg Leu Asp Asn Leu Ser
100 105 110
Phe Ser Pro Asn Asn Lys Phe Thr Leu Val Gly Cys Asn Ala Trp Ala
115 120 125
Leu Leu Ser Thr Phe Gly Ile Gln Asn Tyr Ser Thr Gly Cys Met Ser
130 135 140
Leu Cys Asp Thr Pro Pro Pro Pro Asn Ser Lys Cys Asn Gly Val Gly
145 150 155 160
Cys Cys Arg Thr Glu Val Ser Ile Pro Leu Asp Ser His Arg Ile Glu
165 170 175
Thr Gln Pro Ser Arg Phe Glu Asn Met Thr Ser Val Glu His Phe Asn
180 185 190
Pro Cys Ser Tyr Ala Phe Phe Val Glu Asp Gly Met Phe Asn Phe Ser
195 200 205
Ser Leu Glu Asp Leu Lys Asp Leu Arg Asn Val Thr Arg Phe Pro Val
210 215 220
Leu Leu Asp Trp Ser Ile Gly Asn Gln Thr Cys Glu Gln Val Val Gly
225 230 235 240
Arg Asn Ile Cys Gly Gly Asn Ser Thr Cys Phe Asp Ser Thr Arg Gly
245 250 255
Lys Gly Tyr Asn Cys Lys Cys Leu Gln Gly Phe Asp Gly Asn Pro Tyr
260 265 270
Leu Ser Asp Gly Cys Gln Asp Ile Asn Glu Cys Thr Thr Arg Ile His
275 280 285
Asn Cys Ser Asp Thr Ser Thr Cys Glu Asn Thr Leu Gly Ser Phe His
290 295 300
Cys Gln Cys Pro Ser Gly Ser Asp Leu Asn Thr Thr Thr Met Ser Cys
305 310 315 320
Ile Asp Thr Pro Lys Glu Glu Pro Lys Tyr Leu Gly Trp Thr Thr Val
325 330 335
Leu Leu Gly Thr Thr Ile Gly Phe Leu Ile Ile Leu Leu Thr Ile Ser
340 345 350
Tyr Ile Gln Gln Lys Met Arg His Arg Lys Asn Thr Glu Leu Arg Gln
355 360 365
Gln Phe Phe Glu Gln Asn Gly Gly Gly Met Leu Ile Gln Arg Leu Ser
370 375 380
Gly Ala Gly Pro Ser Asn Val Asp Val Lys Ile Phe Thr Glu Glu Gly
385 390 395 400
Met Lys Glu Ala Thr Asp Gly Tyr Asn Glu Ser Arg Ile Leu Gly Gln
405 410 415
Gly Gly Gln Gly Thr Val Tyr Lys Gly Ile Leu Gln Asp Asn Ser Ile
420 425 430
Val Ala Ile Lys Lys Ala Arg Leu Gly Asp Arg Ser Gln Val Glu Gln
435 440 445
Phe Ile Asn Glu Val Leu Val Leu Ser Gln Ile Asn His Arg Asn Val
450 455 460
Val Lys Leu Leu Gly Cys Cys Leu Glu Thr Glu Val Pro Leu Leu Val
465 470 475 480
Tyr Glu Phe Ile Ser Ser Gly Thr Leu Phe Asp His Leu His Gly Ser
485 490 495
Met Phe Asp Ser Ser Leu Thr Trp Glu His Arg Leu Arg Ile Ala Ile
500 505 510
Glu Val Ala Gly Thr Leu Ala Tyr Leu His Ser Tyr Ala Ser Ile Pro
515 520 525
Ile Ile His Arg Asp Val Lys Thr Ala Asn Ile Leu Leu Asp Glu Asn
530 535 540
Leu Thr Ala Lys Val Ala Asp Phe Gly Ala Ser Arg Leu Ile Pro Met
545 550 555 560
Asp Gln Glu Gln Leu Thr Thr Met Val Gln Gly Thr Leu Gly Tyr Leu
565 570 575
Asp Pro Glu Tyr Tyr Asn Thr Gly Leu Leu Asn Glu Lys Ser Asp Val
580 585 590
Tyr Ser Phe Gly Val Val Leu Met Glu Leu Leu Ser Gly Glu Lys Ala
595 600 605
Leu Cys Phe Glu Arg Pro Gln Ser Ser Lys His Leu Val Ser Tyr Phe
610 615 620
Val Ser Ala Met Lys Glu Asn Arg Leu His Glu Ile Ile Asp Gly Gln
625 630 635 640
Val Met Asn Glu Tyr Asn Gln Arg Glu Ile Gln Glu Ser Ala Arg Ile
645 650 655
Ala Val Glu Cys Thr Arg Ile Met Gly Glu Glu Arg Pro Ser Met Lys
660 665 670
Glu Val Ala Ala Glu Leu Glu Ala Leu Arg Val Lys Thr Thr Lys His
675 680 685
Gln Trp Ser Asp Gln Tyr Pro Lys Glu Val Glu His Leu Leu Gly Val
690 695 700
Gln Ile Leu Ser Thr Gln Gly Asp Thr Ser Ser Ile Gly Tyr Asp Ser
705 710 715 720
Ile Gln Asn Val Thr Arg Leu Asp Ile Glu Thr Gly Arg
725 730
<210>11
<211>23
<212>DNA
<213>人工序列
<220>
<223>合成寡核苷酸(WAK_NDE PCR引物)
<400>11
catatgaaag tgcagcgtct gtt 23
<210>12
<211>23
<212>DNA
<213>人工序列
<220>
<223>合成寡核苷酸(WAK_XBA PCR引物)
<400>12
tctagatcag cggcctgctt caa 23
Claims (15)
1.一种用于增加纤维长度和/或植物高度的核酸构建体,包含与木质部优先启动子可操作地连接的编码功能性细胞壁关联激酶4的多核苷酸序列,该启动子引起所述编码功能性细胞壁关联激酶4的多核苷酸序列的超表达。
2.如权利要求1所述的构建体,其中所述木质部优先启动子选自TUB基因启动子、SuSy基因启动子、COMT基因启动子和C4H基因启动子。
3.一种包含权利要求1的核酸构建体的转基因植物细胞。
4.如权利要求3所述的转基因植物细胞,其中所述木质部优先启动子选自TUB基因启动子、SuSy基因启动子、COMT基因启动子和C4H基因启动子。
5.如权利要求3所述的转基因植物细胞,其中所述植物是双子叶植物。
6.如权利要求3所述的转基因植物细胞,其中所述植物是单子叶植物。
7.如权利要求3所述的转基因植物细胞,其中所述植物是裸子植物。
8.如权利要求3所述的转基因植物细胞,其中所述植物是阔叶树。
9.如权利要求8所述的转基因植物细胞,其中所述阔叶树是桉(Eucalyptus)树。
10.如权利要求8所述的转基因植物细胞,其中所述阔叶树是杨(Populus)树。
11.如权利要求7所述的转基因植物细胞,其中所述裸子植物是松(Pinus)树。
12.增加纤维长度和/或植物高度的方法,包括:
(a)向植物细胞中引入权利要求1或2的核酸构建体;
(b)在促进植物生长的条件下培养所述植物细胞;和
(c)选择与同种的非转基因植物相比具有增加的纤维长度和/或植物高度的转基因植物。
13.如权利要求12所述的方法,其中所述木质部优先启动子选自TUB基因启动子、SuSy基因启动子、COMT基因启动子和C4H基因启动子。
14.从包含权利要求3的转基因植物细胞的转基因植物获得的木浆。
15.从包含权利要求3的转基因植物细胞的转基因植物获得的木纤维。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US87104806P | 2006-12-20 | 2006-12-20 | |
US60/871,048 | 2006-12-20 | ||
PCT/BR2007/000357 WO2008074115A1 (en) | 2006-12-20 | 2007-12-20 | Nucleic acid constructs and methods for altering plant fiber length and/or plant height |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101583719A CN101583719A (zh) | 2009-11-18 |
CN101583719B true CN101583719B (zh) | 2015-03-25 |
Family
ID=39535914
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200780046680.9A Expired - Fee Related CN101583719B (zh) | 2006-12-20 | 2007-12-20 | 用于改变纤维长度和/或植物高度的核酸构建体和方法 |
Country Status (7)
Country | Link |
---|---|
US (1) | US9080181B2 (zh) |
EP (1) | EP2094852B1 (zh) |
CN (1) | CN101583719B (zh) |
AU (1) | AU2007335207B2 (zh) |
BR (1) | BRPI0720468A2 (zh) |
CA (1) | CA2672771C (zh) |
WO (1) | WO2008074115A1 (zh) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010060099A2 (en) * | 2008-11-24 | 2010-05-27 | The Regents Of The University Of California | Wall-associated kinase-like polypeptide mediates nutritional status perception and response |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2005096805A2 (en) * | 2004-04-06 | 2005-10-20 | Alellyx S.A. | Cambium/xylem-preferred promoters and uses thereof |
CN1780915A (zh) * | 2003-04-28 | 2006-05-31 | 瑞典树木科技公司 | 组织特异性启动子 |
Family Cites Families (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA1339684C (en) | 1988-05-17 | 1998-02-24 | Peter H. Quail | Plant ubquitin promoter system |
ES2150900T3 (es) | 1989-10-31 | 2000-12-16 | Monsanto Co | Promotor para plantas transgenicas. |
US5593874A (en) | 1992-03-19 | 1997-01-14 | Monsanto Company | Enhanced expression in plants |
US6096950A (en) | 1992-05-18 | 2000-08-01 | Monsanto Company | Cotton fiber-specific promoters |
US6117679A (en) | 1994-02-17 | 2000-09-12 | Maxygen, Inc. | Methods for generating polynucleotides having desired characteristics by iterative selection and recombination |
US6165793A (en) | 1996-03-25 | 2000-12-26 | Maxygen, Inc. | Methods for generating polynucleotides having desired characteristics by iterative selection and recombination |
US5605793A (en) | 1994-02-17 | 1997-02-25 | Affymax Technologies N.V. | Methods for in vitro recombination |
EP1161541A4 (en) | 1999-02-26 | 2004-11-24 | Cropdesign Nv | METHOD FOR MODIFYING THE MORPHOLOGY, BIOCHEMISTRY OR PHYSIOLOGY OF PLANTS USING SUBSTRATES COMPRISING CDC25 |
WO2001035725A1 (en) | 1999-11-17 | 2001-05-25 | Mendel Biotechnology, Inc. | Yield-related genes |
EP1261725A2 (en) | 2000-02-08 | 2002-12-04 | Sakata Seed Corporation | Methods and constructs for agrobacterium-mediated plant transformation |
SE0000751D0 (sv) | 2000-03-07 | 2000-03-07 | Swetree Genomics Ab | Transgenic trees and methods for their production |
AU782244B2 (en) | 2000-08-01 | 2005-07-14 | Temasek Life Sciences Laboratory Limited | Isolation and characterization of a fiber-specific actin promoter from cotton |
WO2002015675A1 (en) | 2000-08-22 | 2002-02-28 | Mendel Biotechnology, Inc. | Genes for modifying plant traits iv |
EP1402037A1 (en) * | 2001-06-22 | 2004-03-31 | Syngenta Participations AG | Plant genes involved in defense against pathogens |
AP2691A (en) | 2004-08-18 | 2013-07-16 | Alellyx Sa | Polynucleotides, Dna constructs and methods for the alteration of plant lignin content and/or composition |
SE0403132D0 (sv) | 2004-12-21 | 2004-12-21 | Swetree Technologies Ab | New transgenic plants and methods for their production |
GB0513248D0 (en) | 2005-06-29 | 2005-08-03 | Boc Group Plc | Gas dispenser |
-
2007
- 2007-12-20 AU AU2007335207A patent/AU2007335207B2/en not_active Ceased
- 2007-12-20 CN CN200780046680.9A patent/CN101583719B/zh not_active Expired - Fee Related
- 2007-12-20 EP EP07845482.4A patent/EP2094852B1/en not_active Not-in-force
- 2007-12-20 BR BRPI0720468-0A patent/BRPI0720468A2/pt not_active IP Right Cessation
- 2007-12-20 US US12/520,282 patent/US9080181B2/en not_active Expired - Fee Related
- 2007-12-20 WO PCT/BR2007/000357 patent/WO2008074115A1/en active Application Filing
- 2007-12-20 CA CA2672771A patent/CA2672771C/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1780915A (zh) * | 2003-04-28 | 2006-05-31 | 瑞典树木科技公司 | 组织特异性启动子 |
WO2005096805A2 (en) * | 2004-04-06 | 2005-10-20 | Alellyx S.A. | Cambium/xylem-preferred promoters and uses thereof |
Non-Patent Citations (2)
Title |
---|
Bruce D. Kohorn等.An Arabidopsis cell wall-associated kinase required for invertase activity and cell growth.《The Plant Journal》.2006,第46卷(第2期),p307–316. * |
David Lally等.Antisense Expression of a Cell Wall–Associated Protein Kinase, WAK4, Inhibits Cell Elongation and Alters Morphology.《The Plant Cell》.2001,p1317–1331. * |
Also Published As
Publication number | Publication date |
---|---|
EP2094852A1 (en) | 2009-09-02 |
WO2008074115A8 (en) | 2009-08-13 |
AU2007335207A1 (en) | 2008-06-26 |
CA2672771C (en) | 2016-11-29 |
EP2094852A4 (en) | 2010-06-30 |
CN101583719A (zh) | 2009-11-18 |
AU2007335207B2 (en) | 2012-08-02 |
EP2094852B1 (en) | 2014-01-22 |
US9080181B2 (en) | 2015-07-14 |
BRPI0720468A2 (pt) | 2014-01-14 |
US20100095405A1 (en) | 2010-04-15 |
CA2672771A1 (en) | 2008-06-26 |
WO2008074115A1 (en) | 2008-06-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11613760B2 (en) | Compositions and methods for increasing plant growth and improving multiple yield-related traits | |
AU2007308770B2 (en) | Method for modifying plant architecture and enhancing plant biomass and/or sucrose yield | |
US20110004958A1 (en) | Compositions for silencing the expression of gibberellin 2-oxidase and uses thereof | |
EP1963353A2 (en) | Wood and cell wall gene microarray | |
JP4880819B2 (ja) | 植物においてセルロース生合成を増強し、リグニン生合成を改変する方法 | |
CN101578370B (zh) | 编码c3hc4家族植物蛋白质的核酸分子和改变植物纤维素和木素含量的方法 | |
CN101583719B (zh) | 用于改变纤维长度和/或植物高度的核酸构建体和方法 | |
CN101495637B (zh) | 用于改变植物纤维素含量的多核苷酸、dna构建体和方法 | |
Xu | Overexpression of a new cellulose synthase gene ('PuCesA6') from Ussuri poplar ('Populus ussuriensis') exhibited a dwarf phenotype in transgenic tobacco | |
CN113416747B (zh) | 一种创建温度敏感型雄性不育植物的方法 | |
WO2004092380A1 (ja) | イネニコチアナミンシンターゼ遺伝子プロモーター、およびその利用 | |
AU2011265556B2 (en) | Polynucleotides, DNA constructs and methods for the alteration of plant lignin content and/or composition | |
WO2009104181A1 (en) | Plants having genetically modified lignin content and methods of producing same | |
Chiang et al. | Methods of modifying lignin in plants by transformation with a 4-coumarate coenzyme a ligase nucleic acid | |
Chiang et al. | Genetic engineering of plants through manipulation of lignin biosynthesis | |
CA2578311A1 (en) | Altering wood density | |
BRPI0720468B1 (pt) | Nucleic acid constructs and methods for altering the length of the fiber of the plant and / or the height of the plant |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
ASS | Succession or assignment of patent right |
Owner name: MONSANTO BRAZIL CO., LTD. Free format text: FORMER OWNER: ALELLYX SA Effective date: 20131129 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20131129 Address after: Sao Paulo Applicant after: Monsanto Do Brasil Ltda Address before: Brazil Campinas Applicant before: Alexx GmbH |
|
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20150325 |
|
CF01 | Termination of patent right due to non-payment of annual fee |