CN111218457A - 一种水稻mit2基因及其编码蛋白与应用 - Google Patents
一种水稻mit2基因及其编码蛋白与应用 Download PDFInfo
- Publication number
- CN111218457A CN111218457A CN202010302486.XA CN202010302486A CN111218457A CN 111218457 A CN111218457 A CN 111218457A CN 202010302486 A CN202010302486 A CN 202010302486A CN 111218457 A CN111218457 A CN 111218457A
- Authority
- CN
- China
- Prior art keywords
- rice
- mit2
- gene
- application
- ser
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 235000007164 Oryza sativa Nutrition 0.000 title claims abstract description 111
- 235000009566 rice Nutrition 0.000 title claims abstract description 107
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 103
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 23
- 240000007594 Oryza sativa Species 0.000 title description 92
- 241000196324 Embryophyta Species 0.000 claims abstract description 40
- 235000013339 cereals Nutrition 0.000 claims abstract description 11
- 230000006872 improvement Effects 0.000 claims abstract description 6
- 230000002401 inhibitory effect Effects 0.000 claims abstract description 5
- 238000009395 breeding Methods 0.000 claims abstract description 3
- 230000001488 breeding effect Effects 0.000 claims abstract description 3
- 241000209094 Oryza Species 0.000 claims abstract 26
- 239000013598 vector Substances 0.000 claims description 13
- 239000012620 biological material Substances 0.000 claims description 10
- 230000009261 transgenic effect Effects 0.000 claims description 9
- 239000003795 chemical substances by application Substances 0.000 claims description 8
- 230000035772 mutation Effects 0.000 claims description 8
- 239000002773 nucleotide Substances 0.000 claims description 8
- 125000003729 nucleotide group Chemical group 0.000 claims description 8
- 230000035558 fertility Effects 0.000 claims description 5
- 241000894006 Bacteria Species 0.000 claims description 3
- 238000004519 manufacturing process Methods 0.000 claims description 3
- 239000013612 plasmid Substances 0.000 claims description 3
- 244000038559 crop plants Species 0.000 claims description 2
- 230000009467 reduction Effects 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 230000015572 biosynthetic process Effects 0.000 abstract description 15
- 238000002474 experimental method Methods 0.000 abstract description 5
- 230000002068 genetic effect Effects 0.000 abstract description 5
- 108091026890 Coding region Proteins 0.000 abstract description 3
- 238000013461 design Methods 0.000 abstract description 2
- 231100000502 fertility decrease Toxicity 0.000 abstract description 2
- 108020004414 DNA Proteins 0.000 description 33
- 229930192334 Auxin Natural products 0.000 description 17
- 239000002363 auxin Substances 0.000 description 17
- SEOVTRFCIGRIMH-UHFFFAOYSA-N indole-3-acetic acid Chemical compound C1=CC=C2C(CC(=O)O)=CNC2=C1 SEOVTRFCIGRIMH-UHFFFAOYSA-N 0.000 description 17
- XHSDUVBUZOUAOQ-WJQMYRPNSA-N (3e,3ar,8bs)-3-[[(2r)-4-methyl-5-oxo-2h-furan-2-yl]oxymethylidene]-4,8b-dihydro-3ah-indeno[1,2-b]furan-2-one Chemical compound O1C(=O)C(C)=C[C@@H]1O\C=C/1C(=O)O[C@@H]2C3=CC=CC=C3C[C@@H]2\1 XHSDUVBUZOUAOQ-WJQMYRPNSA-N 0.000 description 14
- 230000006870 function Effects 0.000 description 14
- 238000000034 method Methods 0.000 description 14
- 230000012010 growth Effects 0.000 description 10
- 230000000295 complement effect Effects 0.000 description 9
- 238000011161 development Methods 0.000 description 9
- 230000018109 developmental process Effects 0.000 description 9
- 210000001519 tissue Anatomy 0.000 description 9
- 230000014509 gene expression Effects 0.000 description 7
- 238000003786 synthesis reaction Methods 0.000 description 7
- 150000001413 amino acids Chemical group 0.000 description 6
- 230000033228 biological regulation Effects 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 230000019491 signal transduction Effects 0.000 description 6
- 238000010367 cloning Methods 0.000 description 5
- 239000003375 plant hormone Substances 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 4
- 210000000349 chromosome Anatomy 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 239000012634 fragment Substances 0.000 description 4
- 230000002018 overexpression Effects 0.000 description 4
- 230000001105 regulatory effect Effects 0.000 description 4
- 238000012163 sequencing technique Methods 0.000 description 4
- 206010020649 Hyperkeratosis Diseases 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- 101000625767 Oryza sativa subsp. japonica Transcription factor TB1 Proteins 0.000 description 3
- 210000004027 cell Anatomy 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- UQHKFADEQIVWID-UHFFFAOYSA-N cytokinin Natural products C1=NC=2C(NCC=C(CO)C)=NC=NC=2N1C1CC(O)C(CO)O1 UQHKFADEQIVWID-UHFFFAOYSA-N 0.000 description 3
- 239000004062 cytokinin Substances 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 239000005556 hormone Substances 0.000 description 3
- 229940088597 hormone Drugs 0.000 description 3
- 101150044508 key gene Proteins 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 238000002703 mutagenesis Methods 0.000 description 3
- 231100000350 mutagenesis Toxicity 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- 238000005204 segregation Methods 0.000 description 3
- 230000004960 subcellular localization Effects 0.000 description 3
- 108010073969 valyllysine Proteins 0.000 description 3
- 238000012795 verification Methods 0.000 description 3
- 241000589158 Agrobacterium Species 0.000 description 2
- 241000219195 Arabidopsis thaliana Species 0.000 description 2
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 2
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 2
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 2
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 2
- XEEYBQQBJWHFJM-UHFFFAOYSA-N Iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 2
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 2
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 2
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 2
- 240000008467 Oryza sativa Japonica Group Species 0.000 description 2
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 2
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 2
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 2
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 2
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 2
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 2
- 239000004480 active ingredient Substances 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 238000005034 decoration Methods 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 210000005069 ears Anatomy 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 238000013507 mapping Methods 0.000 description 2
- 108010085203 methionylmethionine Proteins 0.000 description 2
- 210000004940 nucleus Anatomy 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- 230000001850 reproductive effect Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000010186 staining Methods 0.000 description 2
- 108010061238 threonyl-glycine Proteins 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 230000009105 vegetative growth Effects 0.000 description 2
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- PPINMSZPTPRQQB-NHCYSSNCSA-N 2-[[(2s)-1-[(2s)-2-[[(2s)-2-amino-3-methylbutanoyl]amino]propanoyl]pyrrolidine-2-carbonyl]amino]acetic acid Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PPINMSZPTPRQQB-NHCYSSNCSA-N 0.000 description 1
- OZRFYUJEXYKQDV-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-3-carboxypropanoyl)amino]-3-carboxypropanoyl]amino]-3-carboxypropanoyl]amino]butanedioic acid Chemical compound OC(=O)CC(N)C(=O)NC(CC(O)=O)C(=O)NC(CC(O)=O)C(=O)NC(CC(O)=O)C(O)=O OZRFYUJEXYKQDV-UHFFFAOYSA-N 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 1
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- PHQXWZGXKAFWAZ-ZLIFDBKOSA-N Ala-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 PHQXWZGXKAFWAZ-ZLIFDBKOSA-N 0.000 description 1
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 1
- 108700011581 Arabidopsis MAX2 Proteins 0.000 description 1
- 101100129496 Arabidopsis thaliana CYP711A1 gene Proteins 0.000 description 1
- 101100129499 Arabidopsis thaliana MAX2 gene Proteins 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- JGDGLDNAQJJGJI-AVGNSLFASA-N Arg-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N JGDGLDNAQJJGJI-AVGNSLFASA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- SNBHMYQRNCJSOJ-CIUDSAMLSA-N Arg-Gln-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SNBHMYQRNCJSOJ-CIUDSAMLSA-N 0.000 description 1
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- NYDIVDKTULRINZ-AVGNSLFASA-N Arg-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NYDIVDKTULRINZ-AVGNSLFASA-N 0.000 description 1
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- YFHATWYGAAXQCF-JYJNAYRXSA-N Arg-Pro-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YFHATWYGAAXQCF-JYJNAYRXSA-N 0.000 description 1
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 1
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 1
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- PAXHINASXXXILC-SRVKXCTJSA-N Asn-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)O PAXHINASXXXILC-SRVKXCTJSA-N 0.000 description 1
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 1
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- QUAWOKPCAKCHQL-SRVKXCTJSA-N Asn-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QUAWOKPCAKCHQL-SRVKXCTJSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- WXVGISRWSYGEDK-KKUMJFAQSA-N Asn-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N WXVGISRWSYGEDK-KKUMJFAQSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 1
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 1
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 1
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 1
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- ICZWAZVKLACMKR-CIUDSAMLSA-N Asp-His-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 ICZWAZVKLACMKR-CIUDSAMLSA-N 0.000 description 1
- AITKTFCQOBRJTG-CIUDSAMLSA-N Asp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N AITKTFCQOBRJTG-CIUDSAMLSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 1
- XAPPCWUWHNWCPQ-PBCZWWQYSA-N Asp-Thr-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XAPPCWUWHNWCPQ-PBCZWWQYSA-N 0.000 description 1
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 1
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 1
- 241000195940 Bryophyta Species 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- OIMUAKUQOUEPCZ-WHFBIAKZSA-N Cys-Asn-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIMUAKUQOUEPCZ-WHFBIAKZSA-N 0.000 description 1
- HQZGVYJBRSISDT-BQBZGAKWSA-N Cys-Gly-Arg Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQZGVYJBRSISDT-BQBZGAKWSA-N 0.000 description 1
- OXFOKRAFNYSREH-BJDJZHNGSA-N Cys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N OXFOKRAFNYSREH-BJDJZHNGSA-N 0.000 description 1
- KKUVRYLJEXJSGX-MXAVVETBSA-N Cys-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N KKUVRYLJEXJSGX-MXAVVETBSA-N 0.000 description 1
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 1
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 101100083446 Danio rerio plekhh1 gene Proteins 0.000 description 1
- 102000016680 Dioxygenases Human genes 0.000 description 1
- 108010028143 Dioxygenases Proteins 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000371 Esterases Proteins 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 102000018700 F-Box Proteins Human genes 0.000 description 1
- 108091072033 F-box protein family Proteins 0.000 description 1
- 108091070973 GRAS family Proteins 0.000 description 1
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 1
- XOKGKOQWADCLFQ-GARJFASQSA-N Gln-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XOKGKOQWADCLFQ-GARJFASQSA-N 0.000 description 1
- CKNUKHBRCSMKMO-XHNCKOQMSA-N Gln-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O CKNUKHBRCSMKMO-XHNCKOQMSA-N 0.000 description 1
- GQZDDFRXSDGUNG-YVNDNENWSA-N Gln-Ile-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O GQZDDFRXSDGUNG-YVNDNENWSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- ILKYYKRAULNYMS-JYJNAYRXSA-N Gln-Lys-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ILKYYKRAULNYMS-JYJNAYRXSA-N 0.000 description 1
- XUZQMPGBGFQJMY-SRVKXCTJSA-N Gln-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N XUZQMPGBGFQJMY-SRVKXCTJSA-N 0.000 description 1
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 1
- OKARHJKJTKFQBM-ACZMJKKPSA-N Gln-Ser-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OKARHJKJTKFQBM-ACZMJKKPSA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- GJLXZITZLUUXMJ-NHCYSSNCSA-N Gln-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GJLXZITZLUUXMJ-NHCYSSNCSA-N 0.000 description 1
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 1
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 1
- CHDWDBPJOZVZSE-KKUMJFAQSA-N Glu-Phe-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CHDWDBPJOZVZSE-KKUMJFAQSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- ZAPFAWQHBOHWLL-GUBZILKMSA-N Glu-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N ZAPFAWQHBOHWLL-GUBZILKMSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- DXMOIVCNJIJQSC-QEJZJMRPSA-N Glu-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N DXMOIVCNJIJQSC-QEJZJMRPSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- 102000053187 Glucuronidase Human genes 0.000 description 1
- 108010060309 Glucuronidase Proteins 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- VOCMRCVMAPSSAL-IUCAKERBSA-N Gly-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN VOCMRCVMAPSSAL-IUCAKERBSA-N 0.000 description 1
- JUBDONGMHASUCN-IUCAKERBSA-N Gly-Glu-His Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O JUBDONGMHASUCN-IUCAKERBSA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- YOSQCYUFZGPIPC-PBCZWWQYSA-N His-Asp-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YOSQCYUFZGPIPC-PBCZWWQYSA-N 0.000 description 1
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 1
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 1
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 1
- IAYPZSHNZQHQNO-KKUMJFAQSA-N His-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N IAYPZSHNZQHQNO-KKUMJFAQSA-N 0.000 description 1
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 1
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 1
- 108090000604 Hydrolases Proteins 0.000 description 1
- 102000004157 Hydrolases Human genes 0.000 description 1
- JXUGDUWBMKIJDC-NAKRPEOUSA-N Ile-Ala-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JXUGDUWBMKIJDC-NAKRPEOUSA-N 0.000 description 1
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 1
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- KLJKJVXDHVUMMZ-KKPKCPPISA-N Ile-Phe-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N KLJKJVXDHVUMMZ-KKPKCPPISA-N 0.000 description 1
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 1
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 1
- RWHRUZORDWZESH-ZQINRCPSSA-N Ile-Trp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RWHRUZORDWZESH-ZQINRCPSSA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- NHHKSOGJYNQENP-SRVKXCTJSA-N Leu-Cys-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N NHHKSOGJYNQENP-SRVKXCTJSA-N 0.000 description 1
- HQPHMEPBNUHPKD-XIRDDKMYSA-N Leu-Cys-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N HQPHMEPBNUHPKD-XIRDDKMYSA-N 0.000 description 1
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 1
- AXZGZMGRBDQTEY-SRVKXCTJSA-N Leu-Gln-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O AXZGZMGRBDQTEY-SRVKXCTJSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 1
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- TVEOVCYCYGKVPP-HSCHXYMDSA-N Leu-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N TVEOVCYCYGKVPP-HSCHXYMDSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- LQUIENKUVKPNIC-ULQDDVLXSA-N Leu-Met-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LQUIENKUVKPNIC-ULQDDVLXSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 1
- GGNOBVSOZPHLCE-GUBZILKMSA-N Lys-Gln-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GGNOBVSOZPHLCE-GUBZILKMSA-N 0.000 description 1
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 1
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 1
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- JPYPRVHMKRFTAT-KKUMJFAQSA-N Lys-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N JPYPRVHMKRFTAT-KKUMJFAQSA-N 0.000 description 1
- MSSABBQOBUZFKZ-IHRRRGAJSA-N Lys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O MSSABBQOBUZFKZ-IHRRRGAJSA-N 0.000 description 1
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 1
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 1
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 1
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- 101150081330 MOC1 gene Proteins 0.000 description 1
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 1
- OLWAOWXIADGIJG-AVGNSLFASA-N Met-Arg-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(O)=O OLWAOWXIADGIJG-AVGNSLFASA-N 0.000 description 1
- OBVHKUFUDCPZDW-JYJNAYRXSA-N Met-Arg-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OBVHKUFUDCPZDW-JYJNAYRXSA-N 0.000 description 1
- ORRNBLTZBBESPN-HJWJTTGWSA-N Met-Ile-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ORRNBLTZBBESPN-HJWJTTGWSA-N 0.000 description 1
- QZPXMHVKPHJNTR-DCAQKATOSA-N Met-Leu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O QZPXMHVKPHJNTR-DCAQKATOSA-N 0.000 description 1
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 1
- WTHGNAAQXISJHP-AVGNSLFASA-N Met-Lys-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WTHGNAAQXISJHP-AVGNSLFASA-N 0.000 description 1
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 101100043229 Oryza sativa subsp. japonica SPL14 gene Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 240000007377 Petunia x hybrida Species 0.000 description 1
- WSXKXSBOJXEZDV-DLOVCJGASA-N Phe-Ala-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@H](C)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 WSXKXSBOJXEZDV-DLOVCJGASA-N 0.000 description 1
- CPTJPDZTFNKFOU-MXAVVETBSA-N Phe-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N CPTJPDZTFNKFOU-MXAVVETBSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 1
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 1
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 1
- 240000004713 Pisum sativum Species 0.000 description 1
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- OLTFZQIYCNOBLI-DCAQKATOSA-N Pro-Cys-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O OLTFZQIYCNOBLI-DCAQKATOSA-N 0.000 description 1
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 1
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 1
- 102100040125 Prokineticin-2 Human genes 0.000 description 1
- 244000184734 Pyrus japonica Species 0.000 description 1
- 108700005079 Recessive Genes Proteins 0.000 description 1
- 102000052708 Recessive Genes Human genes 0.000 description 1
- 108700005075 Regulator Genes Proteins 0.000 description 1
- 101100041989 Schizosaccharomyces pombe (strain 972 / ATCC 24843) sds23 gene Proteins 0.000 description 1
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 1
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 1
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 1
- SNNSYBWPPVAXQW-ZLUOBGJFSA-N Ser-Cys-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)O SNNSYBWPPVAXQW-ZLUOBGJFSA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- GYXVUTAOICLGKJ-ACZMJKKPSA-N Ser-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N GYXVUTAOICLGKJ-ACZMJKKPSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- IFLVBVIYADZIQO-DCAQKATOSA-N Ser-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N IFLVBVIYADZIQO-DCAQKATOSA-N 0.000 description 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 1
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- ATEQEHCGZKBEMU-GQGQLFGLSA-N Ser-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N ATEQEHCGZKBEMU-GQGQLFGLSA-N 0.000 description 1
- VAIWUNAAPZZGRI-IHPCNDPISA-N Ser-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N VAIWUNAAPZZGRI-IHPCNDPISA-N 0.000 description 1
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- 108020005038 Terminator Codon Proteins 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 1
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 1
- ABCLYRRGTZNIFU-BWAGICSOSA-N Thr-Tyr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O ABCLYRRGTZNIFU-BWAGICSOSA-N 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- GEGYPBOPIGNZIF-CWRNSKLLSA-N Trp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O GEGYPBOPIGNZIF-CWRNSKLLSA-N 0.000 description 1
- MXKUGFHWYYKVDV-SZMVWBNQSA-N Trp-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(C)C)C(O)=O MXKUGFHWYYKVDV-SZMVWBNQSA-N 0.000 description 1
- BVWADTBVGZHSLW-IHRRRGAJSA-N Tyr-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BVWADTBVGZHSLW-IHRRRGAJSA-N 0.000 description 1
- BARBHMSSVWPKPZ-IHRRRGAJSA-N Tyr-Asp-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BARBHMSSVWPKPZ-IHRRRGAJSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- VJOWWOGRNXRQMF-UVBJJODRSA-N Val-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 VJOWWOGRNXRQMF-UVBJJODRSA-N 0.000 description 1
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 1
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- MLADEWAIYAPAAU-IHRRRGAJSA-N Val-Lys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MLADEWAIYAPAAU-IHRRRGAJSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- YQMILNREHKTFBS-IHRRRGAJSA-N Val-Phe-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YQMILNREHKTFBS-IHRRRGAJSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- ODUHAIXFXFACDY-SRVKXCTJSA-N Val-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C ODUHAIXFXFACDY-SRVKXCTJSA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- 235000007241 Zea diploperennis Nutrition 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- 235000017556 Zea mays subsp parviglumis Nutrition 0.000 description 1
- 241000172407 Zea mays subsp. huehuetenangensis Species 0.000 description 1
- 230000009418 agronomic effect Effects 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010047495 alanylglycine Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010021908 aspartyl-aspartyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 230000028446 budding cell bud growth Effects 0.000 description 1
- 235000021466 carotenoid Nutrition 0.000 description 1
- 150000001747 carotenoids Chemical class 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 230000005059 dormancy Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 230000009123 feedback regulation Effects 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 238000012252 genetic analysis Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 230000035784 germination Effects 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010040856 glutamyl-cysteinyl-alanine Proteins 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 1
- 108010038983 glycyl-histidyl-lysine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 229910052742 iron Inorganic materials 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 235000009973 maize Nutrition 0.000 description 1
- 108010093708 mamba intestinal toxin 1 Proteins 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- 230000035764 nutrition Effects 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 1
- 230000001863 plant nutrition Effects 0.000 description 1
- 210000002706 plastid Anatomy 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 238000003753 real-time PCR Methods 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000022983 regulation of cell cycle Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 229930004725 sesquiterpene Natural products 0.000 description 1
- -1 sesquiterpene compounds Chemical class 0.000 description 1
- 238000007493 shaping process Methods 0.000 description 1
- 230000008054 signal transmission Effects 0.000 description 1
- 102000035025 signaling receptors Human genes 0.000 description 1
- 108091005475 signaling receptors Proteins 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000002195 synergetic effect Effects 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 238000011426 transformation method Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 230000034512 ubiquitination Effects 0.000 description 1
- 230000003827 upregulation Effects 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 108010072644 valyl-alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 230000002792 vascular Effects 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/415—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from plants
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Molecular Biology (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biophysics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biochemistry (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Physics & Mathematics (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Cell Biology (AREA)
- Botany (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
本发明提供一种水稻MIT2基因及其编码的蛋白,其编码区序列如SEQ ID No.2所示,其编码蛋白序列如SEQ ID No.3所示。该基因在多个组织中表达。实验证明MIT2具有抑制水稻分蘖的功能,使水稻植株变高。本发明还提供了MIT2基因的突变基因及其等位突变体mit2‑1,它们使水稻表现为株高变矮、分蘖中度增多,枝梗数变少,育性降低,籽粒变小,颖壳开口。本发明提供的水稻MIT2基因及其突变体在水稻种质资源的遗传改良育种中作用重大,有望对水稻株型的形成、分蘖数进行调控,进而对株型或分蘖数定向设计以提高水稻生产力。
Description
技术领域
本发明属于基因工程领域,具体地说,涉及一种水稻MIT2基因、其编码的蛋白及其在抑制水稻分枝中的应用。
背景技术
水稻(Oryza sativa)属于禾本科稻属,是最重要的粮食作物之一。水稻分蘖是一个非常重要的农艺性状,一个有效分蘖对应一个穗。分蘖是决定水稻产量的关键因素。水稻分蘖不仅是水稻产量相关的重要因素,也是在生物讨论中与植物株型相关的核心主题。与分蘖相关的一些基因和Quantitative Traits Location (QTL)已经被定位出来。因此,进一步挖掘水稻分蘖调控基因,阐明分蘖机制,对于水稻株型塑造,提高水稻产量意义重大。
水稻分蘖的形成过程分为两个步骤,即分蘖芽的形成和分蘖芽的伸长。水稻每片叶的叶腋里能形成一个分蘖芽(也称腋芽、侧芽),通常只有位于茎秆基部不伸长节间上的腋芽才可以进行伸长形成分蘖,而位于茎杆上部伸长节间上的腋芽一般不能伸长,而是处于休眠状态。因此,水稻的分蘖数量不仅取决于形成分蘖芽的数目,更取决于能够伸长的分蘖芽的数目。水稻分蘖的形成是一个复杂的生物学现象,受遗传因素、植物激素和外界环境等多种因素的共同调控。其中参与植物分枝调控的激素主要包括生长素、细胞分裂素、独脚金内酯及其衍生物,这些激素协同作用,共同调控植物的分枝发育。外界环境条件包括光照、温度、水分、栽培密度、插秧深度以及植株的营养状况也会影响水稻分蘖的分化和生长。
第一个被发现的调控水稻分蘖的关键基因是MOC1,MOC1属于GRAS家族,编码一个转录调控因子。在分蘖芽中表达,在腋生分生组织营养生长与生殖生长的不同阶段调控分蘖芽的起始和生长。水稻突变体moc1几乎完全丧失分蘖能力,只有1个主茎,没有分蘖出现,圆锥花序上的小穗也明显减少,说明MOC1在水稻分蘖和穗分枝上起到正调控的作用。水稻TB1基因是根据玉米的分支调控基因TEOSINTE BRANCHED 1 (TB1)的相似序列克隆出来的。
研究显示OsTB1处于独脚金内酯(Strigolactones)信号途径的下游,这两个基因都编码推测的转录因子,它具有一个基本的螺旋-环-螺旋状的DNA结合基序,命名为TCP结构域。OsTB1的基因位点证明与玉米TB1是同源基因。发现OsTB1的过表达使分蘖数减少,而在fc1突变体中分蘖增加,并且OsTB1功能缺失。这些结果显示OsTB1负调控水稻分枝,和玉米中TB1基因功能相似。以水稻多分蘖突变体te为材料,通过图位克隆和转基因验证,分离到一个水稻分蘖的负调控因子TE,具有保守的细胞周期调控功能。实验显示 TE和MOC1存在直接的相互作用;遗传学实验也证明TE和MOC1在同一条信号通路上。TE通过介导MOC1的降解来抑制水稻的分蘖。IPA1蛋白可以与前述的水稻分蘖负调节因子OsTB1基因的启动子区GTAC序列相结合,这种结合可以发生在茎尖和幼穗中。ostb1与ipa1的双突变体表型显示,ostb1的突变可以抑制ipa1的分蘖表型,说明ipa1突变体的分蘖表型可能是由于IPA1基因通过OsTB1直接调控的,预示OsTB1可能参与了IPA1介导的针对水稻分蘖的转录调节。
生长素(Auxin) 是第一种被发现的植物激素,合成于植物茎尖与幼嫩的叶中,在植物生长发育过程中具有重要的作用。生长素通过顶端优势抑制分蘖芽的生长;生长素由上往下主动运输,并直接输送到分蘖芽中,抑制分蘖芽的生长发育。YUC是生长素合成途径中的一个关键基因,主要在水稻分生组织、维管组织中表达。研究发现,YUC的缺失会导致生长素的合成大幅减少,从而使植物失去顶端优势,水稻分蘖芽就会打破休眠并开始正常的生长发发育,最终发育成新的分蘖。OsPIN1是编码生长素转运蛋白的基因,定位在木质部薄壁细胞中,调控生长素向根中转运。OsPIN1的功能缺失会导致生长素运输量的降低,使生长素对侧芽生长发育的抑制效果减弱,促进水稻分蘖芽的伸长。
独角金内酯是一类倍半萜烯化合物,鉴定为一种新的植物激素。在阐明水稻分蘖方面重大的突破就是发现了独脚金内酯,它主要合成于根部,由类胡萝卜素代谢产生,由下向上运输,抑制分蘖芽的生长。独角金内酯就是除生长素和细胞分裂素外一种新的能够抑制侧枝发育的植物激素。目前,独角金内酯已经从许多植物中分离得到,而涉及独角金内酯合成与信号转导的基因也陆续在高等植物如拟南芥、豌豆、矮牵牛、水稻以及低等植物苔藓中被发现。在水稻中,多个基因被证实参与独角金内酯介导的侧枝发育过程。D17/ HTD1和D10编码的胡萝卜素裂解双加氧酶7( CCD7)和 8( CCD8)是独角金内酯合成途径的两个关键酶。研究人员从腋生分枝增多的拟南芥突变体中克隆了MAX1至MAX4基因,MAX1,MAX3和MAX4都参与独脚金内酯的生物合成,MAX2参与独脚金内酯的信号传递。水稻中D3编码一个F-BOX蛋白家族成员,是拟南芥MAX2的同源蛋白,通过形成SCF(Skp1-Cull-in-F-box)蛋白复合体,参与蛋白泛素化并介导蛋白质降解,是独角金内酯信号转导途径的关键基因。水稻基因D27和D14,也被证实是独角金内酯合成与信号转导途径的成员。D27编码一个新的含铁蛋白,并影响生长素的极性运输( PAT),参与独角金内酯的合成。D14/ D88/ HTD2编码一个水解酶/ 酯酶,是独角金内酯的信号受体。这些水稻突变体都表现为多分蘖、矮秆表型,独角金内酯控制腋芽的发育,并和生长素、细胞分裂素一起,共同决定水稻的株型。
大量的生理生化数据表明,激素之间存在着一定联系。生长素通过促进独角金内酯的合成来抑制水稻分蘖的形成。而独角金内酯信号分子也通过一条反馈途径抑制生长素信号途径和另外一种未知的反馈调节信号,这两种信号通路均可调节独角金内酯在质体内的生物合成。独角金内酯和生长素能根据环境信号相互调节对方在植物体内的含量,并形成一个动态的反馈循环,共同调节植物的分枝。尽管关于植物激素调控水稻的分蘖已有大量的研究报道,但是它们之间具体的相互关系以及它们是如何协作调控分蘖芽的生长发育还需要进一步的研究。
发明内容
本发明目的是提供水稻MIT2基因及其编码的蛋白与应用。
本发明以水稻日本晴为材料进行EMS诱变,希望筛选到能够调控水稻分蘖,影响水稻产量的关键基因。发明人致力于筛选水稻的分蘖数目中度程度增加的突变体。因为过多的分蘖对水稻高产不利,大部分营养用于水稻的营养生长,而不利于生殖生长。从突变体库中发明人筛选得到多个具有中度分蘖数目的突变体,命名为Moderately IncreasedTiller(MIT)突变体,获得突变体按照发现的顺序分别命名为MIT1,MIT2……等等。这些突变体的突变基因不同,在调控水稻分蘖过程中的作用也不尽相同。本发明针对MIT2基因及其功能进行了研究。发明人发现EMS诱变的水稻突变体中有呈现半矮,植株分蘖数目中度增多的突变体mit2,图位克隆和功能互补实验发现MIT2基因突变后引起植株变矮,分蘖数目中度程度增加。MIT2基因编码一个功能未知的蛋白。本发明将阐述MIT2在调控水稻分蘖过程中的作用。
本发明首先提供了水稻MIT2蛋白,其具有:
1)如SEQ ID No.3所示的氨基酸序列;或
2)SEQ ID No.3所示的氨基酸序列经取代、缺失和/或增加一个或多个氨基酸且具有同等活性的由1)衍生的蛋白质。
本发明提供了编码水稻MIT2蛋白的基因,其具有:
1)SEQ ID No.2所示的核苷酸序列;或
2)SEQ ID No.2所示核苷酸序列经取代、缺失和/或增加一个或几个核苷酸;或
3)在严格条件下与1)限定的DNA序列杂交的核苷酸序列。
本发明提供了含有编码水稻MIT2蛋白的基因的生物材料,所述生物材料为质粒、载体、宿主菌或转化植物细胞。
本发明提供了上述水稻MIT2蛋白或其编码基因或含有其编码基因的生物材料的以下任一种或多种应用:
(1)在制备转基因植物中的应用;
(2)在水稻种质资源改良中的应用;
(3)在保持水稻育性中的应用;
(4)在使水稻籽粒增大中的应用;
(5)是增加水稻株高中的应用
(6)抑制水稻分蘖中的应用。
第二方面,本发明提供了一种水稻MIT2基因突变基因,其为水稻MIT2基因第8个外显子上,CDS的第2413位的碱基T缺失,所述CDS的序列为SEQ ID No.2所示的核苷酸序列。
第三方面,本发明提供了所述水稻MIT2基因突变基因的等位突变体,所述等位突变体的突变位点发生在第一个内含子的3’端拼接位点处,由原来的AG变成AA,第二个外显子的5’端的前两个碱基AG被剪切掉。
本发明提供一种生物材料,含有上述水稻MIT2基因突变基因或上述的等位突变体,所述生物材料为质粒、载体、宿主菌或转化植物细胞。
第四方面,本发明提供了所述水稻MIT2基因突变基因或所述的等位突变体或含有其的生物材料的以下任一种或多种应用,
(1)在农作物改良育种、制种中的应用;
(2)在降低农作物株高中的应用;
(3)在增加水稻分蘖数中的应用;
(4)在降低水稻育性中的应用;
(5)在培育籽粒变小的水稻中的应用;
(6)在提高水稻产量中的应用。
上述应用中,所述降低农作物株高为与野生型相比降低至50%-60%株高。本发明的实施例中,发现水稻MIT2基因突变基因或所述的等位突变体的表型为成熟期株高稍矮,为野生型的60%。
上述应用中所述增加水稻分蘖数是与野生型相比增加2-3倍的分蘖数。在本发明的实施例中,发现水稻MIT2基因突变基因或所述的等位突变体增加水稻分蘖数是野生型的2倍。
本发明的优点在于:本发明提供了水稻MIT2基因(核苷酸序列如SEQ ID No.1所示,编码区核苷酸序列如SEQ ID No.2)及其编码的蛋白(氨基酸序列如SEQ ID No.3所示)。因EMS诱变获得的水稻MIT2基因突变体及该突变基因的等位突变体与野生型相比有若干基因的缺失或突变,采用实验室常用的琼脂糖电泳就能开展分子检测,即能够实现鉴别,不需要特别的检测技术和方法。将过表达MIT2基因功能互补载体和自身启动子驱动的MIT2基因组互补载体通过农杆菌介导转化mit2突变体的愈伤组织,发现互补转基因植株的表型完全恢复正常,分蘖数和株高都和野生型一致,说明MIT2基因具有抑制水稻分蘖的功能,有望对水稻株型的形成进行调控进而对株型定向设计,以提高水稻生产力,在水稻种质资源改良中筛选效果明显,经济价值巨大。
附图说明
图1为野生型日本晴与突变体的表型。(A)成熟期表现;(B)分蘖表型;(C)突变体具有高节位分蘖;(D)穗型;(E)穗部枝梗表现;(F)籽粒表型;(G)株高统计;(H)分蘖数统计。
图2为MIT2基因定位与结构图。(A)粗定位;(B)精细定位;(C)MIT2基因结构图。
图3为MIT2基因的功能互补验证。(A) 过表达MIT2能够回补突变体表型;(B)自身启动子驱动的MIT2基因组能够回补突变体表型;(C)株高统计;(D)分蘖数统计。
图4为MIT2在水稻各个组织中的表达。
图5为水稻组织GUS染色结果。(A)节间;(B)叶鞘;(C)幼穗;(D)小穗;(E)颖壳;(F)茎基部;(G)分蘖芽;(H)根。
图6为MIT2蛋白的亚细胞定位。
图7为载体pCAMBIA 1305.1AP FH-N结构示意图。
图8为pCAMBIA 1305.1-GFPC结构示意图。
具体实施方式
以下实施例用于说明本发明,但不用来限制本发明的范围。在不背离本发明精神和实质的情况下,对本发明方法、步骤或条件所作的修改或替换,均属于本发明的范围。
若未特别指明,实施例中所用的技术手段为本领域技术人员所熟知的常规手段;若未特别指明,实施例中所用试剂均为市售。
实施例1 突变体的获得与表型分析
通过EMS化学诱变粳稻品种日本晴,得到一个分蘖中度增多突变体Moderately Increased Tiller-2 (mit2)及其等位突变体mit2-1。mit2突变为碱基缺失,其等位突变体mit2-1有着与其相似的矮化多分蘖表型,突变为碱基替换。
表型分析表明,突变体与野生型相比有着多方面的改变(图1)。水稻mit2突变体植株高度较野生型变矮,成熟期株高稍矮,约为野生型的60%,各节间等比例一定程度的缩短;mit2突变体分蘖中度增多,约为野生型的二倍。突变体mit2及其等位突变体枝梗数变少,育性降低,籽粒变小,颖壳开口。
实施例2 水稻MIT2基因的获得和功能互补验证
1、水稻MIT2基因的获得
本发明的MIT2基因是采用图位克隆方法通过突变体mit2克隆得到的。将纯合突变体mit2与表型正常且多态性高的籼稻品种Dular杂交,所有杂交 F1 中个体均表现为株高和分蘖数正常。F2获得分离群体,进行遗传分析与基因定位。对F2代发生性状分离的株系分析表明,野生型与突变体的分离比例符合3:1遗传分离比例,由此表明该突变性状受一对隐性基因控制。
为了定位控制该分蘖中度增多突变体基因的位置,我们利用F1代自交构建的 F2分离群体作为定位的群体,以 BSA 法选取l0株 F2 突变单株构建DNA混池,利用均匀分布于水稻12条染色体上的170个Indel标记,将候选基因定位于第9号染色体上(图2的A)。为了明确候选基因在第9号染色体上的位置,选用50个单株利用实验室开发的均匀分布在整个染色体上的分子标记(表1)进行了定位,结果显示候选基因定位在R9-1和R9-2标记之间。为了精细定位目的基因,将F2定位群体扩大到6380株矮化分蘖中度增多个体,并开发了6对新的Indel标记,最终将目的基因定位在标记M3和M4之间(图2的B)。这两个标记之间的物理距离约为1.2 Mb,包含13个候选基因。并对其进行测序,发现基因LOC_Os09g06560发生突变。
LOC_Os09g06560基因组DNA全长6325 bp (如SEQ ID No.1所示序列),包含12个外显子和11个内含子,编码区全长3378 bp (如SEQ ID No.2所示序列),编码1125个氨基酸 (如SEQ ID No.3所示序列)。共设计13对引物进行测序,对基因测序结果分析发现,mit2突变位点位于第8个外显子上,CDS的第2413个碱基T缺失,造成翻译过程从第805个氨基酸开始移位。其等位突变体mit2-1突变位点发生在第一个内含子的3’端拼接位点处,由原来的AG变成AA,测序发现该突变导致该基因在拼接上发生错误,第二个外显子的5’端的前两个碱基AG被剪切掉(图2的C)。
2、MIT2基因的功能互补验证
本发明将MIT2基因克隆至植物表达载体pCAMBIA 1305.1AP FH-N和pCAMBIA 1305.1- GFPC(本实验室保存,载体结构如图7和图8所示),为了进行功能互补实验,分别构建了过表达MIT2基因功能互补载体(pCAMBIA 1305.1-Actinpro-FLAG-HA-MIT2CDS)和由自身启动子驱动的MIT2基因组互补载体(pCAMBIA 1305.1-MIT2pro-MIT2genomic DNA-GFP),分别带有FLAG和GFP标签,获得的转基因植株命名为B267和B270。
过表达MIT2基因功能互补载体(pCAMBIA 1305.1-Actinpro-FLAG-HA-MIT2CDS)的构建:在MIT2全长CDS的 5’端引入NcoI位点,3’ 端引入SpeI位点,片段长度为3378 bp。所用引物为09g06560S1SPF:CGAACGATAGCCATGGCCATGATATTTCAGCTAAGA AATGCG和09g06560S1SPR:GGTAGGATCCACTAGTCACTTTCACCT TCTTGCTGTTAGC。
自身启动子驱动的MIT2基因组互补载体(pCAMBIA 1305.1-MIT2pro-MIT2genomic DNA-CGFP)的构建:在MIT2基因ATG上游2460bp默认为MIT2基因的启动子,在其5’端位置引入 EcoRI 位点,MIT2基因组序列终止密码子TAA前的3’ 端引入KpnI位点。MIT2启动子序列2460bp;MIT2基因组序列5300bp,片段全长7760bp。启动子所用引物为:promF: CCATGATTACGAATTCTGCTATGCCGTTAGGTA GCAC和promR: CCCTTGCTCACCATGGTACCCTGAAATATCATTGCTAACCATCA;基因组所用引物序列为GDNAF: CAATGATATTTCAGGGTACCATGATATTTCAGCTAAGAAATGCG和GDNAR: CCCTTGCTCACC ATGGTACCCACTTTCACCTTCTTGCTGTTAGC。将两种片段一同重组到pCAMBIA1305.1-CGFP中去。
将构建好的两个表达载体用电击法转入农杆菌EHA105中,水稻mit2突变体结的种子诱导愈伤作为受体材料,用农杆菌介导的转化方法进行水稻的转化。获得的转基因植株的表型完全恢复正常,分蘖数和株高都和野生型一致(图3),突变体枝梗数,育性以及硬壳开口情况也恢复成野生型的表型,说明过表达MIT2能够回补突变体表型、自身启动子驱动的MIT2基因组能够回补突变体表型。以上结果证明了LOC_Os09g06560就是目的基因MIT2。
实施例3 水稻MIT2基因表达模式
为明确MIT2基因的组织表达模式,提取粳稻品种日本晴不同组织(种子、根、茎、叶、叶鞘、幼穗、花、分蘖芽、幼苗、幼根)的RNA,反转录为cDNA,以水稻Ubiquitin基因为内参,采用Real-time PCR的方法检测水稻各个组织中该基因表达水平,结果显示MIT2基因在水稻上述组织中都有表达,其中在幼根中表达量最高,其次在幼穗种子、幼苗中表达量较高,在叶鞘、分蘖芽中也有表达,在种子和成熟根中相对较低(图4)。
实施例4 水稻组织GUS染色结果
MIT2promoter::GUS载体构建:MIT2启动子区通过PCR扩增获得,在5’ 端引入BamHI位点,3’端引入NcoI位点, 片段长为2460 bp,重组到pCAMBIA1305.1的BamHI和NcoI位点中去。所用正向引物:CGGTACCCGGGGATCCTGCTATGCCGTTAGGTAGCAC;反向引物:CTCAGATCTACCATGGGAAATATCATTGCTAACCATCA。利用农杆菌侵染水稻愈伤的方法获得转基因植株,用于分析基因组织表达模式。
利用GUS(β-glucuronidase)基因作为报告基因检测MIT2基因表达的空间分布情况。在节间、叶鞘、茎基部和分蘖芽中均能检测到GUS活性。在成熟根、花药中GUS几乎没有活性(图5)。
实施例5 MIT2蛋白的亚细胞定位
在实施例2中构建的自身启动子驱动的MIT2基因组互补载体融合有GFP标签,获得的转基因植株B270。在转基因种子萌发时期,用激光共聚焦显微镜进行观察水稻根尖GFP的亚细胞定位,结果显示MIT2-GFP蛋白在根尖部位主要在细胞核中表达(图6)。
以上所述仅是本发明的优选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明技术原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也应视为本发明的保护范围。
序列表
<110> 中国农业科学院作物科学研究所
<120> 一种水稻MIT2基因及其编码蛋白与应用
<130> KHP201111154.6YS
<160> 31
<170> SIPOSequenceListing 1.0
<210> 1
<211> 6325
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 1
catcgccaac gcccgctcca gcctttcggc aacggagcct cctccacgcc acgtcagcct 60
cttcggtgac gacctccttg cctccaccgc tgccaaacgc gccggcctcg tcgtccgacc 120
gtcacttgca ttgtcctcgt cctccgccat cgaccatgac cgtgcttcgt catgtcgccc 180
tgtgccgccc cgctcctcca ccgggccgcc gtcaccgcga gcgccattgc tgttgaggga 240
ttcggcaccg ccggagttgc ccgctcccaa tcttgctgca ggagggctag atgcgcatcc 300
ggcgtggcca aattggccgc gccgagtact ctacccaccg gtgcatgcca ctccacctcc 360
cgtgccgaga agaagcgtca ttaacttgtt ttgcctgcca ccgccttctt ggttggccgc 420
tcggtttccg acggtggcga tgcagcaaga aagagtgagg aaggtggcgg tgttttgccg 480
ccagccgccc gcgcaagagc aatgctggga caaggaaaaa aaaatccaaa agtatataac 540
gtgctggact ttgactaaaa agcacaattt tttttattgt gatggttagc aatgatattt 600
cagctaagaa atgcgatagt taaacatata ttccttttga gtttattttt gaaaatatcc 660
ctaccttacc tccgccactg cggccgtgaa attcggtgta tttttcccgc ggtactccac 720
caaaagcttc ccgccaaacg tcctttcccg cgcgggggga cgcgccaaaa cccaccaaaa 780
tgtctgcaac caaaacccca ccccgcgtat cgtcaatctc tggaaacccc taacaaattc 840
ctgaacaccc gccgccacca tagcttcagc ctcgtgggtg aagaatctct cgtcgtcgtc 900
gtcggtgtca ccatgaaagg gcgcgcggtg aagctccgag aggcgcacaa ggccggctcg 960
ccagtcttct gctccgttgc gtggggccaa ggcgggcagc atgtcgtcac cgcatccgcc 1020
gccgacgtgg ccatcctcat ccatgacgcc gctgcggtcg ccgccgccgg tggccggagc 1080
tcgggctccg cggctgcggc ggcgctttcc acgatccggc ttcacaagga tggcgtcacg 1140
gcgctcgccg tcgcgccggg ctccggcgcg tcgctggcgt ccggctccat cgatcactcc 1200
gtcaagttct gttctttccc aggttcttag atctgactgc cccggatgca atttctccta 1260
atctccgtca tttccacggc tatatttgtg aaattttgct tcggatttcc ttccagaggg 1320
ggtgttccag agcaatatcg cccggttcac cctgccgatc cggtcactgg ccttcaacaa 1380
gaaggggact ctgctggcgg cggccggaga cgacgacggc atcaagttga ttgccaccat 1440
cgacaacacc atctccaaag tgctcaaggg ccacaaggga tcggtaaccg ggttgtcttt 1500
cgatcccaga aacgattatt tggcatcaat tgacaccttc ggcacagtca tcttctggga 1560
tctctgcacg gggactgaag cccgtagtct gaagcggatt gcgccgacat ttggttcaga 1620
ccactcaatc aacaatgccc tgtgctggag ccctgatggg cagttccttg ctgttccggg 1680
attgaggaat aatgtggtca tgtatgatag ggacaccggt gaggaggtgt tcactctgaa 1740
aggggagcat gagcaaccag tgtgtagtct ctgctggtct ccaaatggga ggtacctagt 1800
cactgctgga ttggataagc aggttctgat ctgggatgtg aagtcaaagc aggatgttga 1860
gaggcagaag ttcgatgaaa ggatatgtag cttggcttgg aaacctgaaa gtaatgctgt 1920
agcagtgatc gacgtaactg gcagatttgg catttgggaa tcggtcatcc cgtcgacttt 1980
gaaatcgccc acagagggtg cacctgacct gaactctact aaggttcctt tgtttgatga 2040
cgaggatgat gaggagaggc cgagtacctc tggtggactg gatgatgatg atgatgatga 2100
aagtcttggt gaattaggtc cattcaacca caagagattg aggaggaagt caacctatca 2160
tgatcactca aatggagata gtgaagatga ggatctgata cttcagatgg agtcacgcaa 2220
gagaatgaaa gatacacata gagataacaa ggaggttgct gataaggcaa taggtgattc 2280
agcaacttca gtaagactgg ttacagcaag aatgcaaact gcatttcagc ctgggtccac 2340
accacctcaa cctggcaagc gaaatttcct tgcctacaat atgcttggaa gtatcactac 2400
tatcgaaaat gaggggcatt cacatgtaga ggtaaaatct tctcaccctc tatcttataa 2460
gccattgtat cctctacttg tttgcagctt ggatgtgaat aaaccatccg aagttacttt 2520
gtttttcagg tagacttcca tgacaccgga agaggtccta gagttccttc gatgactgat 2580
tattttggtt tcacaatggc tgcactgaat gaatcaggaa gtgtctttgc aaatccatgc 2640
aagggtgaca agaatatgag cactcttatg taccgccctt tcagtagttg ggcaggcaac 2700
agtgaggtaa gttaactaaa tgaaattgtt gtttgccagc ttctgagata gggttgaagt 2760
ttaccacttt ggtgcttact ggactaattt gaaaattact tagtggtcaa tgaggtttga 2820
gggagaagaa gtgaaggctg tggctgttgg tgttggatgg gtcgctgcag ttacaacttt 2880
aaattttctg cgcattttca cagagggagg gttgcaggtt ttgtaacttc ctcaagcttt 2940
gtatttaagt ccttttttcc tgatgaatat acttgttagt cagtttgtga agttcatttt 3000
tttcctgaaa gaaagctgac tccttattag ttccgaaata cccaaaaaac caggttgtgc 3060
ctggtaatca gtttgtgaag ttcatttttt tcctgaaaga aaactgcctc cttattagtt 3120
ccgtaatacc caaaaaacca ggttgcttta gtttttttca gtgctatcta ctagagcttt 3180
cataaggcca aaatagctgt gaagtattaa caaaagaatt tatagtattt taaaaacaat 3240
taatgttata gaagaaaatt agattcatgc gactctcgac atgtttccaa ttaaatttat 3300
tggtcaagga aacattctat actaccaaca tttaaatgct gaaaccattt tttatttcta 3360
catccaatgt acagccgaaa ttgtcccgcc ctaaccatca gaagcacaga aattgttgta 3420
ctatgtaacc actcatgaat tataacttgt ggggcacgaa ttataacttt gactttttgc 3480
actcaagaaa ataattcagg gaatgcactt tccatgtaga tactagtaac ttttatcacc 3540
atttctcacc aagaagtggt ctacggaaca catttacatt tttagatcat taaaaatgct 3600
gttcatgaca ttggtcgtga cttacatagc aaagccaaac tacatattac taatcttgga 3660
tggtttttct tgatacatgc gagctggggg ttacataagc attcatgaat ctgatttaca 3720
tcattccctg tttgttactt gaagtgggaa aaattcaatt taatttactc attttatatt 3780
gtacttttac ctagcatgtc ttgattttca gatgcatatc ctcggtccgt ggcccaatta 3840
aaaactcttt aagttttgtc aaacacatct cttgattttc agatgcatat cctctcagtc 3900
ggtggcccag tggttactgc ggcaggccat ggagatcagc tagcgattgt gtctcatgct 3960
tcagattgtc tttcatcagg agaccaggta ttcagtatat gttcgtttga cttttggagc 4020
tttgtgtagt gatgatatat gttgtagcta gctatcactt atcagattgt cagcacctcc 4080
tagtgtacca tgattccaat cgtgtgaacc aattcacagt acatatgaac tgatggttga 4140
aatggtacta caagattctg aattgaaaca tggatttgtt catgattaac cctttcaggt 4200
gctggatgtt aaagtactga agatatctga atgtgctcaa tcattgtcca gccgacttgt 4260
tttaactcct gcctctaaat tatcttggtt tggtttcagt gagaatggtg aacttagttc 4320
ctttgattcc aaggtatgac caatgaacac tgtcaacaat tcaacatatc tttagttata 4380
tcatgggagt tttaaactga atattgttat ttctagggaa tactgagggt cttttctggt 4440
caattcggtg gaagctggat tccaatattt aggtatagta tctgcctgac aggttatgca 4500
ttttactact accaaagcac acttaataat tatatccaac tgcagttcaa tcaaggcaag 4560
aaaatccgaa gatgaaagcc actgggtggt gggcttagat gctaataata tattctgcat 4620
tctatgcaag tccccggagt cctatccaca ggtatgccta aatgcaactt ctcatgtgta 4680
caacttttga gtgttgatgt tccaacaaaa atagtacttc tatatgcaat tctcaaatat 4740
tgaaatttgt ttgtcgtttg acgtctttga ctgatcttat caggtgatgc ccaaacctgt 4800
tttgacaata cttgagctgt catttcctct tgcatcatct gaccttggtg ccaatagttt 4860
ggaaactgaa ttcatgatga ggaaactgca tctctcacag gtgcactgtg tttctttcag 4920
ttttgaaggg cagtatctgt gatattgttt ttaatcgttg ctgtttatct cagattcaaa 4980
agaaaataga agaaatggct gctttgggtc tggacacgat tgcattagat gatgaagcat 5040
tcaacatgga ggctgcactt gaccggtgca tcttgaggct catctccagc tgctgcaatg 5100
gtacaagact tgaactatgc tttttaccca taaatagtgc tttgcattaa attatttctt 5160
gttttatttc ttgatttcaa aaggtgataa gcttgtacga gctaccgagc ttgcaaaatt 5220
actaacactg gagaagtcaa tgaagggagc attgatgctt gttacacgct taaaacttcc 5280
catattgcaa gaaaggttca gtgccatact cgaggtgaaa ttcacaaccc ttatactgaa 5340
cctgcgtact tttagcactc taaaaatggc acattcaata tttaagaatt tatcatatgt 5400
tcaactgaag gttatgcaat ggcttcttgt aggagatgat gctaaacaat gcaaaaattg 5460
ccaatacatc tggtgttttc tccaatagta atacaaacta ctcaccatca ccagcgttga 5520
gcactcaagc agtcccacca gctaaggttg tgcaaaatgg aaacagcttg aagttaccta 5580
cattgcctaa actgaatcct gccgcccaac gaagcaatcc aactgaatca aacaaggcag 5640
aggtagaaca agcagacaat ttgaaagaaa tcagtacaaa ggtttcacct gcacaaactc 5700
cgttagttaa aattccaaaa aacagtgaaa tgggtgtaaa aacgaagaaa gataatgatg 5760
gagcatcaca tgcaactaca gttgatcaga acccaaaggg aggcagtggt caggttggcc 5820
ttaaaaacaa gagcgtcgat agctgcaatg gtgtacagcc tcagcggcca gttaacccct 5880
tcgcgaaatc ctcatcaagc aaagaacagc catcatccct ttttgattcc atcaagaaga 5940
tgaaggtcga aaatgagaag gttgacaaag ctaacagcaa gaaggtgaaa gtgtaactgc 6000
ttgtcattat ctcatgatct tcgggagcaa accagcagtt aggtatcctc cacctttact 6060
gacatatctg tgcttatatt tttcatttca gttgtgataa tgtaggtgaa aatcactatt 6120
tacttcttga gctacgtttt tgtgtatggc ttcagagctt aatttctagt tatgaggtgg 6180
tacacaatgt atgatagtaa acctgtttag atagtttgct tgttcctttt gttgattgct 6240
ttgatataca attacatttt cgtctgatta tatttaggat ttatgtatct cctacagata 6300
tataatggag tgcgcaggtt ggctg 6325
<210> 2
<211> 3378
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 2
atgatatttc agctaagaaa tgcgatagtt aaacatatat tccttttgag tttatttttg 60
aaaatatccc taccttacct ccgccactgc ggccgtgaaa ttcggtgtat ttttcccgcg 120
gtactccacc aaaagcttcc cgccaaacgt cctttcccgc gcggggggac gcgccaaaac 180
ccaccaaaat gtctgcaacc aaaaccccac cccgcgtatc gtcaatctct ggaaacccct 240
aacaaattcc tgaacacccg ccgccaccat agcttcagcc tcgtgggtga agaatctctc 300
gtcgtcgtcg tcggtgtcac catgaaaggg cgcgcggtga agctccgaga ggcgcacaag 360
gccggctcgc cagtcttctg ctccgttgcg tggggccaag gcgggcagca tgtcgtcacc 420
gcatccgccg ccgacgtggc catcctcatc catgacgccg ctgcggtcgc cgccgccggt 480
ggccggagct cgggctccgc ggctgcggcg gcgctttcca cgatccggct tcacaaggat 540
ggcgtcacgg cgctcgccgt cgcgccgggc tccggcgcgt cgctggcgtc cggctccatc 600
gatcactccg tcaagttctg ttctttccca gagggggtgt tccagagcaa tatcgcccgg 660
ttcaccctgc cgatccggtc actggccttc aacaagaagg ggactctgct ggcggcggcc 720
ggagacgacg acggcatcaa gttgattgcc accatcgaca acaccatctc caaagtgctc 780
aagggccaca agggatcggt aaccgggttg tctttcgatc ccagaaacga ttatttggca 840
tcaattgaca ccttcggcac agtcatcttc tgggatctct gcacggggac tgaagcccgt 900
agtctgaagc ggattgcgcc gacatttggt tcagaccact caatcaacaa tgccctgtgc 960
tggagccctg atgggcagtt ccttgctgtt ccgggattga ggaataatgt ggtcatgtat 1020
gatagggaca ccggtgagga ggtgttcact ctgaaagggg agcatgagca accagtgtgt 1080
agtctctgct ggtctccaaa tgggaggtac ctagtcactg ctggattgga taagcaggtt 1140
ctgatctggg atgtgaagtc aaagcaggat gttgagaggc agaagttcga tgaaaggata 1200
tgtagcttgg cttggaaacc tgaaagtaat gctgtagcag tgatcgacgt aactggcaga 1260
tttggcattt gggaatcggt catcccgtcg actttgaaat cgcccacaga gggtgcacct 1320
gacctgaact ctactaaggt tcctttgttt gatgacgagg atgatgagga gaggccgagt 1380
acctctggtg gactggatga tgatgatgat gatgaaagtc ttggtgaatt aggtccattc 1440
aaccacaaga gattgaggag gaagtcaacc tatcatgatc actcaaatgg agatagtgaa 1500
gatgaggatc tgatacttca gatggagtca cgcaagagaa tgaaagatac acatagagat 1560
aacaaggagg ttgctgataa ggcaataggt gattcagcaa cttcagtaag actggttaca 1620
gcaagaatgc aaactgcatt tcagcctggg tccacaccac ctcaacctgg caagcgaaat 1680
ttccttgcct acaatatgct tggaagtatc actactatcg aaaatgaggg gcattcacat 1740
gtagaggtag acttccatga caccggaaga ggtcctagag ttccttcgat gactgattat 1800
tttggtttca caatggctgc actgaatgaa tcaggaagtg tctttgcaaa tccatgcaag 1860
ggtgacaaga atatgagcac tcttatgtac cgccctttca gtagttgggc aggcaacagt 1920
gagtggtcaa tgaggtttga gggagaagaa gtgaaggctg tggctgttgg tgttggatgg 1980
gtcgctgcag ttacaacttt aaattttctg cgcattttca cagagggagg gttgcagatg 2040
catatcctct cagtcggtgg cccagtggtt actgcggcag gccatggaga tcagctagcg 2100
attgtgtctc atgcttcaga ttgtctttca tcaggagacc aggtgctgga tgttaaagta 2160
ctgaagatat ctgaatgtgc tcaatcattg tccagccgac ttgttttaac tcctgcctct 2220
aaattatctt ggtttggttt cagtgagaat ggtgaactta gttcctttga ttccaaggga 2280
atactgaggg tcttttctgg tcaattcggt ggaagctgga ttccaatatt tagttcaatc 2340
aaggcaagaa aatccgaaga tgaaagccac tgggtggtgg gcttagatgc taataatata 2400
ttctgcattc tatgcaagtc cccggagtcc tatccacagg tgatgcccaa acctgttttg 2460
acaatacttg agctgtcatt tcctcttgca tcatctgacc ttggtgccaa tagtttggaa 2520
actgaattca tgatgaggaa actgcatctc tcacagattc aaaagaaaat agaagaaatg 2580
gctgctttgg gtctggacac gattgcatta gatgatgaag cattcaacat ggaggctgca 2640
cttgaccggt gcatcttgag gctcatctcc agctgctgca atggtgataa gcttgtacga 2700
gctaccgagc ttgcaaaatt actaacactg gagaagtcaa tgaagggagc attgatgctt 2760
gttacacgct taaaacttcc catattgcaa gaaaggttca gtgccatact cgaggagatg 2820
atgctaaaca atgcaaaaat tgccaataca tctggtgttt tctccaatag taatacaaac 2880
tactcaccat caccagcgtt gagcactcaa gcagtcccac cagctaaggt tgtgcaaaat 2940
ggaaacagct tgaagttacc tacattgcct aaactgaatc ctgccgccca acgaagcaat 3000
ccaactgaat caaacaaggc agaggtagaa caagcagaca atttgaaaga aatcagtaca 3060
aaggtttcac ctgcacaaac tccgttagtt aaaattccaa aaaacagtga aatgggtgta 3120
aaaacgaaga aagataatga tggagcatca catgcaacta cagttgatca gaacccaaag 3180
ggaggcagtg gtcaggttgg ccttaaaaac aagagcgtcg atagctgcaa tggtgtacag 3240
cctcagcggc cagttaaccc cttcgcgaaa tcctcatcaa gcaaagaaca gccatcatcc 3300
ctttttgatt ccatcaagaa gatgaaggtc gaaaatgaga aggttgacaa agctaacagc 3360
aagaaggtga aagtgtaa 3378
<210> 3
<211> 1125
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 3
Met Ile Phe Gln Leu Arg Asn Ala Ile Val Lys His Ile Phe Leu Leu
1 5 10 15
Ser Leu Phe Leu Lys Ile Ser Leu Pro Tyr Leu Arg His Cys Gly Arg
20 25 30
Glu Ile Arg Cys Ile Phe Pro Ala Val Leu His Gln Lys Leu Pro Ala
35 40 45
Lys Arg Pro Phe Pro Arg Gly Gly Thr Arg Gln Asn Pro Pro Lys Cys
50 55 60
Leu Gln Pro Lys Pro His Pro Ala Tyr Arg Gln Ser Leu Glu Thr Pro
65 70 75 80
Asn Lys Phe Leu Asn Thr Arg Arg His His Ser Phe Ser Leu Val Gly
85 90 95
Glu Glu Ser Leu Val Val Val Val Gly Val Thr Met Lys Gly Arg Ala
100 105 110
Val Lys Leu Arg Glu Ala His Lys Ala Gly Ser Pro Val Phe Cys Ser
115 120 125
Val Ala Trp Gly Gln Gly Gly Gln His Val Val Thr Ala Ser Ala Ala
130 135 140
Asp Val Ala Ile Leu Ile His Asp Ala Ala Ala Val Ala Ala Ala Gly
145 150 155 160
Gly Arg Ser Ser Gly Ser Ala Ala Ala Ala Ala Leu Ser Thr Ile Arg
165 170 175
Leu His Lys Asp Gly Val Thr Ala Leu Ala Val Ala Pro Gly Ser Gly
180 185 190
Ala Ser Leu Ala Ser Gly Ser Ile Asp His Ser Val Lys Phe Cys Ser
195 200 205
Phe Pro Glu Gly Val Phe Gln Ser Asn Ile Ala Arg Phe Thr Leu Pro
210 215 220
Ile Arg Ser Leu Ala Phe Asn Lys Lys Gly Thr Leu Leu Ala Ala Ala
225 230 235 240
Gly Asp Asp Asp Gly Ile Lys Leu Ile Ala Thr Ile Asp Asn Thr Ile
245 250 255
Ser Lys Val Leu Lys Gly His Lys Gly Ser Val Thr Gly Leu Ser Phe
260 265 270
Asp Pro Arg Asn Asp Tyr Leu Ala Ser Ile Asp Thr Phe Gly Thr Val
275 280 285
Ile Phe Trp Asp Leu Cys Thr Gly Thr Glu Ala Arg Ser Leu Lys Arg
290 295 300
Ile Ala Pro Thr Phe Gly Ser Asp His Ser Ile Asn Asn Ala Leu Cys
305 310 315 320
Trp Ser Pro Asp Gly Gln Phe Leu Ala Val Pro Gly Leu Arg Asn Asn
325 330 335
Val Val Met Tyr Asp Arg Asp Thr Gly Glu Glu Val Phe Thr Leu Lys
340 345 350
Gly Glu His Glu Gln Pro Val Cys Ser Leu Cys Trp Ser Pro Asn Gly
355 360 365
Arg Tyr Leu Val Thr Ala Gly Leu Asp Lys Gln Val Leu Ile Trp Asp
370 375 380
Val Lys Ser Lys Gln Asp Val Glu Arg Gln Lys Phe Asp Glu Arg Ile
385 390 395 400
Cys Ser Leu Ala Trp Lys Pro Glu Ser Asn Ala Val Ala Val Ile Asp
405 410 415
Val Thr Gly Arg Phe Gly Ile Trp Glu Ser Val Ile Pro Ser Thr Leu
420 425 430
Lys Ser Pro Thr Glu Gly Ala Pro Asp Leu Asn Ser Thr Lys Val Pro
435 440 445
Leu Phe Asp Asp Glu Asp Asp Glu Glu Arg Pro Ser Thr Ser Gly Gly
450 455 460
Leu Asp Asp Asp Asp Asp Asp Glu Ser Leu Gly Glu Leu Gly Pro Phe
465 470 475 480
Asn His Lys Arg Leu Arg Arg Lys Ser Thr Tyr His Asp His Ser Asn
485 490 495
Gly Asp Ser Glu Asp Glu Asp Leu Ile Leu Gln Met Glu Ser Arg Lys
500 505 510
Arg Met Lys Asp Thr His Arg Asp Asn Lys Glu Val Ala Asp Lys Ala
515 520 525
Ile Gly Asp Ser Ala Thr Ser Val Arg Leu Val Thr Ala Arg Met Gln
530 535 540
Thr Ala Phe Gln Pro Gly Ser Thr Pro Pro Gln Pro Gly Lys Arg Asn
545 550 555 560
Phe Leu Ala Tyr Asn Met Leu Gly Ser Ile Thr Thr Ile Glu Asn Glu
565 570 575
Gly His Ser His Val Glu Val Asp Phe His Asp Thr Gly Arg Gly Pro
580 585 590
Arg Val Pro Ser Met Thr Asp Tyr Phe Gly Phe Thr Met Ala Ala Leu
595 600 605
Asn Glu Ser Gly Ser Val Phe Ala Asn Pro Cys Lys Gly Asp Lys Asn
610 615 620
Met Ser Thr Leu Met Tyr Arg Pro Phe Ser Ser Trp Ala Gly Asn Ser
625 630 635 640
Glu Trp Ser Met Arg Phe Glu Gly Glu Glu Val Lys Ala Val Ala Val
645 650 655
Gly Val Gly Trp Val Ala Ala Val Thr Thr Leu Asn Phe Leu Arg Ile
660 665 670
Phe Thr Glu Gly Gly Leu Gln Met His Ile Leu Ser Val Gly Gly Pro
675 680 685
Val Val Thr Ala Ala Gly His Gly Asp Gln Leu Ala Ile Val Ser His
690 695 700
Ala Ser Asp Cys Leu Ser Ser Gly Asp Gln Val Leu Asp Val Lys Val
705 710 715 720
Leu Lys Ile Ser Glu Cys Ala Gln Ser Leu Ser Ser Arg Leu Val Leu
725 730 735
Thr Pro Ala Ser Lys Leu Ser Trp Phe Gly Phe Ser Glu Asn Gly Glu
740 745 750
Leu Ser Ser Phe Asp Ser Lys Gly Ile Leu Arg Val Phe Ser Gly Gln
755 760 765
Phe Gly Gly Ser Trp Ile Pro Ile Phe Ser Ser Ile Lys Ala Arg Lys
770 775 780
Ser Glu Asp Glu Ser His Trp Val Val Gly Leu Asp Ala Asn Asn Ile
785 790 795 800
Phe Cys Ile Leu Cys Lys Ser Pro Glu Ser Tyr Pro Gln Val Met Pro
805 810 815
Lys Pro Val Leu Thr Ile Leu Glu Leu Ser Phe Pro Leu Ala Ser Ser
820 825 830
Asp Leu Gly Ala Asn Ser Leu Glu Thr Glu Phe Met Met Arg Lys Leu
835 840 845
His Leu Ser Gln Ile Gln Lys Lys Ile Glu Glu Met Ala Ala Leu Gly
850 855 860
Leu Asp Thr Ile Ala Leu Asp Asp Glu Ala Phe Asn Met Glu Ala Ala
865 870 875 880
Leu Asp Arg Cys Ile Leu Arg Leu Ile Ser Ser Cys Cys Asn Gly Asp
885 890 895
Lys Leu Val Arg Ala Thr Glu Leu Ala Lys Leu Leu Thr Leu Glu Lys
900 905 910
Ser Met Lys Gly Ala Leu Met Leu Val Thr Arg Leu Lys Leu Pro Ile
915 920 925
Leu Gln Glu Arg Phe Ser Ala Ile Leu Glu Glu Met Met Leu Asn Asn
930 935 940
Ala Lys Ile Ala Asn Thr Ser Gly Val Phe Ser Asn Ser Asn Thr Asn
945 950 955 960
Tyr Ser Pro Ser Pro Ala Leu Ser Thr Gln Ala Val Pro Pro Ala Lys
965 970 975
Val Val Gln Asn Gly Asn Ser Leu Lys Leu Pro Thr Leu Pro Lys Leu
980 985 990
Asn Pro Ala Ala Gln Arg Ser Asn Pro Thr Glu Ser Asn Lys Ala Glu
995 1000 1005
Val Glu Gln Ala Asp Asn Leu Lys Glu Ile Ser Thr Lys Val Ser Pro
1010 1015 1020
Ala Gln Thr Pro Leu Val Lys Ile Pro Lys Asn Ser Glu Met Gly Val
1025 1030 1035 1040
Lys Thr Lys Lys Asp Asn Asp Gly Ala Ser His Ala Thr Thr Val Asp
1045 1050 1055
Gln Asn Pro Lys Gly Gly Ser Gly Gln Val Gly Leu Lys Asn Lys Ser
1060 1065 1070
Val Asp Ser Cys Asn Gly Val Gln Pro Gln Arg Pro Val Asn Pro Phe
1075 1080 1085
Ala Lys Ser Ser Ser Ser Lys Glu Gln Pro Ser Ser Leu Phe Asp Ser
1090 1095 1100
Ile Lys Lys Met Lys Val Glu Asn Glu Lys Val Asp Lys Ala Asn Ser
1105 1110 1115 1120
Lys Lys Val Lys Val
1125
<210> 4
<211> 21
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 4
gctacctagt caaattaatc g 21
<210> 5
<211> 21
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 5
gattaggcca agtaagtcca c 21
<210> 6
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 6
tgcatggtca cgttcctcat 20
<210> 7
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 7
attgcggagt gatgagagat 20
<210> 8
<211> 21
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 8
aaccaagcaa aagtcattgg a 21
<210> 9
<211> 21
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 9
cgagtaatat tttgggcgtc a 21
<210> 10
<211> 22
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 10
gcatcattag tcctggttag cg 22
<210> 11
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 11
gtggaactct ccactgctcc 20
<210> 12
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 12
gtgtagctat gggtaccatc 20
<210> 13
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 13
ggtctctcac tgtttctggt 20
<210> 14
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 14
gcgatcgggc aagttcaaaa 20
<210> 15
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 15
gggatctctg gaaaaaggac 20
<210> 16
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 16
tgtctgggaa gcttccaaca 20
<210> 17
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 17
ccaccaaagc cgactctata 20
<210> 18
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 18
gggtgattgg agatatgaca 20
<210> 19
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 19
agcatagcaa taatggccac 20
<210> 20
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 20
tttttgggga ataccctccc 20
<210> 21
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 21
ggtttggcac catgtggaaa 20
<210> 22
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 22
cataccttgc agtcctagaa 20
<210> 23
<211> 20
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 23
cgtggctagt ccatcaattc 20
<210> 24
<211> 42
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 24
cgaacgatag ccatggccat gatatttcag ctaagaaatg cg 42
<210> 25
<211> 40
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 25
ggtaggatcc actagtcact ttcaccttct tgctgttagc 40
<210> 26
<211> 37
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 26
ccatgattac gaattctgct atgccgttag gtagcac 37
<210> 27
<211> 44
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 27
cccttgctca ccatggtacc ctgaaatatc attgctaacc atca 44
<210> 28
<211> 44
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 28
caatgatatt tcagggtacc atgatatttc agctaagaaa tgcg 44
<210> 29
<211> 44
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 29
cccttgctca ccatggtacc cactttcacc ttcttgctgt tagc 44
<210> 30
<211> 37
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 30
cggtacccgg ggatcctgct atgccgttag gtagcac 37
<210> 31
<211> 38
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 31
ctcagatcta ccatgggaaa tatcattgct aaccatca 38
Claims (7)
1.水稻MIT2蛋白或其编码基因或含有其编码基因的生物材料的以下任一种或多种应用:
(1)在制备转基因植物中的应用;
(2)在水稻种质资源改良中的应用;
(3)在保持水稻育性中的应用;
(4)在使水稻籽粒增大中的应用;
(5)是增加水稻株高中的应用;
(6)抑制水稻分蘖中的应用;
所述水稻MIT2蛋白的氨基酸序列如SEQ ID No. 3所示。
2.一种水稻MIT2基因突变基因,其为水稻MIT2基因第8个外显子上,CDS的第2413位的碱基T缺失,所述CDS的序列为SEQ ID No.2所示的核苷酸序列。
3.权利要求2所述水稻MIT2基因突变基因的等位突变体,其特征在于,所述等位突变体的突变位点发生在第一个内含子的3’端拼接位点处,由原来的AG变成AA,第二个外显子的5’端的前两个碱基AG被剪切掉。
4.一种生物材料,其特征在于,含有权利要求2所述水稻MIT2基因突变基因或权利要求3所述的等位突变体,所述生物材料为质粒、载体、宿主菌、转化植物细胞。
5.权利要求2所述水稻MIT2基因突变基因或权利要求3所述的等位突变体或权利要求4所述的生物材料的以下任一种或多种应用,
(1)在农作物改良育种、制种中的应用;
(2)在降低农作物株高中的应用;
(3)在增加水稻分蘖数中的应用;
(4)在降低水稻育性中的应用;
(5)在培育籽粒变小的水稻中的应用;
(6)在提高水稻产量中的应用。
6.根据权利要求5所述的应用,其特征在于,所述降低农作物株高为与野生型相比降低至50%-60%株高。
7.根据权利要求5所述的应用,其特征在于,所述增加水稻分蘖数是与野生型相比增加2-3倍的分蘖数。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010302486.XA CN111218457B (zh) | 2020-04-17 | 2020-04-17 | 一种水稻mit2基因及其编码蛋白与应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010302486.XA CN111218457B (zh) | 2020-04-17 | 2020-04-17 | 一种水稻mit2基因及其编码蛋白与应用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111218457A true CN111218457A (zh) | 2020-06-02 |
CN111218457B CN111218457B (zh) | 2020-07-24 |
Family
ID=70808092
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010302486.XA Active CN111218457B (zh) | 2020-04-17 | 2020-04-17 | 一种水稻mit2基因及其编码蛋白与应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111218457B (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115551879A (zh) * | 2020-06-01 | 2022-12-30 | 中国农业科学院生物技术研究所 | Drw1蛋白调控水稻株高和种子大小的应用 |
CN116004558A (zh) * | 2020-11-02 | 2023-04-25 | 武汉大学 | 乙酰转移酶OsG2基因及其编码的蛋白质在调节水稻植株高度方面的应用 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104341525A (zh) * | 2013-08-09 | 2015-02-11 | 中国农业科学院作物科学研究所 | 水稻转录因子Os02g47744基因CDS序列的应用 |
US20170114359A1 (en) * | 2007-10-31 | 2017-04-27 | Monsanto Technology, Llc | Genes and uses for plant enhancement |
CN107312785A (zh) * | 2017-08-09 | 2017-11-03 | 四川农业大学 | OsKTN80b基因在降低水稻株高方面的应用 |
CN110331161A (zh) * | 2019-07-31 | 2019-10-15 | 湖南杂交水稻研究中心 | 利用显性黑色颖壳性状提高水稻遗传工程核不育系种子色选精度的方法 |
-
2020
- 2020-04-17 CN CN202010302486.XA patent/CN111218457B/zh active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20170114359A1 (en) * | 2007-10-31 | 2017-04-27 | Monsanto Technology, Llc | Genes and uses for plant enhancement |
CN104341525A (zh) * | 2013-08-09 | 2015-02-11 | 中国农业科学院作物科学研究所 | 水稻转录因子Os02g47744基因CDS序列的应用 |
CN107312785A (zh) * | 2017-08-09 | 2017-11-03 | 四川农业大学 | OsKTN80b基因在降低水稻株高方面的应用 |
CN110331161A (zh) * | 2019-07-31 | 2019-10-15 | 湖南杂交水稻研究中心 | 利用显性黑色颖壳性状提高水稻遗传工程核不育系种子色选精度的方法 |
Non-Patent Citations (1)
Title |
---|
无: "PREDICTED: Oryza sativa Japonica Group WD repeat and HMG-box DNA-binding protein 1 (LOC4346476), mRNA", 《NCBI 登录号:XM_015757073》 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115551879A (zh) * | 2020-06-01 | 2022-12-30 | 中国农业科学院生物技术研究所 | Drw1蛋白调控水稻株高和种子大小的应用 |
CN116004558A (zh) * | 2020-11-02 | 2023-04-25 | 武汉大学 | 乙酰转移酶OsG2基因及其编码的蛋白质在调节水稻植株高度方面的应用 |
CN116004558B (zh) * | 2020-11-02 | 2024-05-07 | 武汉大学 | 乙酰转移酶OsG2基因及其编码的蛋白质在调节水稻植株高度方面的应用 |
Also Published As
Publication number | Publication date |
---|---|
CN111218457B (zh) | 2020-07-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107164347B (zh) | 控制水稻茎秆粗度、分蘖数、穗粒数、千粒重和产量的理想株型基因npt1及其应用 | |
Zhao et al. | Identification and characterization of a new dwarf locus DS-4 encoding an Aux/IAA7 protein in Brassica napus | |
CA3175033A1 (en) | Autoflowering markers | |
KR100892904B1 (ko) | 개화 시기를 조절하는 유전자, 이를 이용한 형질전환 식물체, 및 개화 시기 조절 방법 | |
US10704054B2 (en) | Modulation of seed vigor | |
CN110195061A (zh) | 控制番茄果实形状的基因及克隆方法和应用 | |
Yang et al. | A novel HD‐Zip I/C2H2‐ZFP/WD‐repeat complex regulates the size of spine base in cucumber | |
CN112011567B (zh) | 水稻pal1基因及其编码蛋白与应用 | |
CN111218457B (zh) | 一种水稻mit2基因及其编码蛋白与应用 | |
Chen et al. | Genome-wide identification and characterization of the ALOG gene family in Petunia | |
CN113151323A (zh) | 一种控制玉米籽粒发育基因ZmRH4的克隆、功能研究及标记挖掘 | |
CN101747418A (zh) | 一种植物叶片卷曲调控基因及其应用 | |
JP2016171747A (ja) | 作物の収量に関わる遺伝子及びその利用 | |
JP3051874B2 (ja) | 植物を矮性化させる方法 | |
CN106399287B (zh) | 一种水稻mit1基因、其编码蛋白及应用 | |
CN110655561B (zh) | 玉米苞叶长度调控蛋白arr8及其编码基因与应用 | |
CN111826391B (zh) | 一种nhx2-gcd1双基因或其蛋白的应用 | |
CN108003227A (zh) | 一种水稻早花时相关蛋白及其编码基因 | |
CN110092819B (zh) | 玉米苞叶宽度调控蛋白arf2及其编码基因与应用 | |
CN109912703B (zh) | 蛋白质OsARE1在调控植物衰老中的应用 | |
CN117801082A (zh) | 水稻wtg2基因的应用 | |
CN109121420B (zh) | 耐寒植物 | |
CN107418958B (zh) | 水稻rcn20基因及其编码蛋白与应用 | |
CN108586595B (zh) | 水稻mis2基因及其编码蛋白与应用 | |
Li et al. | BTA2 regulates tiller angle and the shoot gravity response through controlling auxin content and distribution in rice |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |