CA2437960A1 - Als2 gene and amyotrophic lateral sclerosis type 2 - Google Patents
Als2 gene and amyotrophic lateral sclerosis type 2 Download PDFInfo
- Publication number
- CA2437960A1 CA2437960A1 CA002437960A CA2437960A CA2437960A1 CA 2437960 A1 CA2437960 A1 CA 2437960A1 CA 002437960 A CA002437960 A CA 002437960A CA 2437960 A CA2437960 A CA 2437960A CA 2437960 A1 CA2437960 A1 CA 2437960A1
- Authority
- CA
- Canada
- Prior art keywords
- leu
- ser
- gly
- glu
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 201000008266 amyotrophic lateral sclerosis type 2 Diseases 0.000 title claims abstract description 21
- 101150036028 ALS2 gene Proteins 0.000 title description 39
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 139
- 108091034117 Oligonucleotide Proteins 0.000 claims description 113
- 150000007523 nucleic acids Chemical class 0.000 claims description 96
- 102000039446 nucleic acids Human genes 0.000 claims description 88
- 108020004707 nucleic acids Proteins 0.000 claims description 88
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 74
- 238000000034 method Methods 0.000 claims description 73
- 102000004169 proteins and genes Human genes 0.000 claims description 59
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 40
- 230000035772 mutation Effects 0.000 claims description 35
- 239000002299 complementary DNA Substances 0.000 claims description 33
- 241000282414 Homo sapiens Species 0.000 claims description 31
- 239000012634 fragment Substances 0.000 claims description 31
- 150000001413 amino acids Chemical class 0.000 claims description 28
- 239000002773 nucleotide Substances 0.000 claims description 23
- 125000003729 nucleotide group Chemical group 0.000 claims description 23
- 239000013598 vector Substances 0.000 claims description 23
- 238000003745 diagnosis Methods 0.000 claims description 21
- 230000014509 gene expression Effects 0.000 claims description 20
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 claims description 17
- 230000000295 complement effect Effects 0.000 claims description 16
- 230000003321 amplification Effects 0.000 claims description 14
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 14
- 239000000523 sample Substances 0.000 claims description 13
- 241000124008 Mammalia Species 0.000 claims description 10
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 10
- 239000012472 biological sample Substances 0.000 claims description 10
- 238000002360 preparation method Methods 0.000 claims description 8
- 239000003814 drug Substances 0.000 claims description 7
- 239000000427 antigen Substances 0.000 claims description 6
- 108091007433 antigens Proteins 0.000 claims description 6
- 102000036639 antigens Human genes 0.000 claims description 6
- 239000000203 mixture Substances 0.000 claims description 4
- 241000283984 Rodentia Species 0.000 claims description 3
- 230000004071 biological effect Effects 0.000 claims description 3
- 239000008194 pharmaceutical composition Substances 0.000 claims description 3
- 108700026220 vif Genes Proteins 0.000 claims 1
- 102000013446 GTP Phosphohydrolases Human genes 0.000 abstract description 14
- 108091006109 GTPases Proteins 0.000 abstract description 14
- 230000001105 regulatory effect Effects 0.000 abstract description 11
- 108020004414 DNA Proteins 0.000 description 130
- 206010002026 amyotrophic lateral sclerosis Diseases 0.000 description 119
- 241000282326 Felis catus Species 0.000 description 51
- 210000004027 cell Anatomy 0.000 description 43
- 239000000047 product Substances 0.000 description 43
- 108010050848 glycylleucine Proteins 0.000 description 32
- 238000013518 transcription Methods 0.000 description 30
- 230000035897 transcription Effects 0.000 description 30
- 101000776160 Homo sapiens Alsin Proteins 0.000 description 28
- 238000003752 polymerase chain reaction Methods 0.000 description 23
- 241000699666 Mus <mouse, genus> Species 0.000 description 20
- 125000000539 amino acid group Chemical group 0.000 description 19
- 230000002950 deficient Effects 0.000 description 19
- 102100032047 Alsin Human genes 0.000 description 18
- 102000004196 processed proteins & peptides Human genes 0.000 description 17
- 210000000349 chromosome Anatomy 0.000 description 16
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 15
- 108020004999 messenger RNA Proteins 0.000 description 15
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 14
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 13
- 241000880493 Leptailurus serval Species 0.000 description 13
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 12
- 238000012217 deletion Methods 0.000 description 12
- 230000037430 deletion Effects 0.000 description 12
- 230000006870 function Effects 0.000 description 12
- 108010034529 leucyl-lysine Proteins 0.000 description 12
- 108091005573 modified proteins Proteins 0.000 description 12
- 102000035118 modified proteins Human genes 0.000 description 12
- 102000016285 Guanine Nucleotide Exchange Factors Human genes 0.000 description 11
- 108010067218 Guanine Nucleotide Exchange Factors Proteins 0.000 description 11
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 11
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 11
- 108010037850 glycylvaline Proteins 0.000 description 10
- 238000009396 hybridization Methods 0.000 description 10
- 108010070643 prolylglutamic acid Proteins 0.000 description 10
- 108010065920 Insulin Lispro Proteins 0.000 description 9
- 241001465754 Metazoa Species 0.000 description 9
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 9
- 108010047495 alanylglycine Proteins 0.000 description 9
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 9
- 108010013835 arginine glutamate Proteins 0.000 description 9
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 9
- 210000002569 neuron Anatomy 0.000 description 9
- 108010090894 prolylleucine Proteins 0.000 description 9
- 108010026333 seryl-proline Proteins 0.000 description 9
- 238000006467 substitution reaction Methods 0.000 description 9
- 108700024394 Exon Proteins 0.000 description 8
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 8
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 8
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 8
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 8
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 8
- 238000012408 PCR amplification Methods 0.000 description 8
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 8
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 8
- 108010010147 glycylglutamine Proteins 0.000 description 8
- 108010036413 histidylglycine Proteins 0.000 description 8
- 108010018006 histidylserine Proteins 0.000 description 8
- 108010057821 leucylproline Proteins 0.000 description 8
- 239000012528 membrane Substances 0.000 description 8
- 238000003757 reverse transcription PCR Methods 0.000 description 8
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 8
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 7
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 7
- 108700028369 Alleles Proteins 0.000 description 7
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 7
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 7
- RTOOAKXIJADOLL-GUBZILKMSA-N Glu-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N RTOOAKXIJADOLL-GUBZILKMSA-N 0.000 description 7
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 7
- 101001104102 Homo sapiens X-linked retinitis pigmentosa GTPase regulator Proteins 0.000 description 7
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 7
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 7
- 102100040092 X-linked retinitis pigmentosa GTPase regulator Human genes 0.000 description 7
- 108010092854 aspartyllysine Proteins 0.000 description 7
- 210000004556 brain Anatomy 0.000 description 7
- 108010078144 glutaminyl-glycine Proteins 0.000 description 7
- 108010092114 histidylphenylalanine Proteins 0.000 description 7
- 108010005942 methionylglycine Proteins 0.000 description 7
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 6
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 6
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 6
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 6
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 6
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 6
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 6
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 6
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 6
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 6
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 6
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 6
- 108700026244 Open Reading Frames Proteins 0.000 description 6
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 6
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 6
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 6
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 6
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 6
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 6
- 108010005233 alanylglutamic acid Proteins 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 6
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 108010016616 cysteinylglycine Proteins 0.000 description 6
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 6
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 6
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 6
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 6
- 108010049041 glutamylalanine Proteins 0.000 description 6
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 6
- 108010012581 phenylalanylglutamate Proteins 0.000 description 6
- 108010051242 phenylalanylserine Proteins 0.000 description 6
- 108091033319 polynucleotide Proteins 0.000 description 6
- 102000040430 polynucleotide Human genes 0.000 description 6
- 239000002157 polynucleotide Substances 0.000 description 6
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 6
- 108010061238 threonyl-glycine Proteins 0.000 description 6
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 5
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 5
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 5
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 5
- LFFOJBOTZUWINF-ZANVPECISA-N Ala-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O)=CNC2=C1 LFFOJBOTZUWINF-ZANVPECISA-N 0.000 description 5
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 5
- HBHMVBGGHDMPBF-GARJFASQSA-N Cys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N HBHMVBGGHDMPBF-GARJFASQSA-N 0.000 description 5
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 5
- ZCFNZTVIDMLUQC-SXNHZJKMSA-N Glu-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZCFNZTVIDMLUQC-SXNHZJKMSA-N 0.000 description 5
- GGEJHJIXRBTJPD-BYPYZUCNSA-N Gly-Asn-Gly Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GGEJHJIXRBTJPD-BYPYZUCNSA-N 0.000 description 5
- 101100055414 Homo sapiens ALS2 gene Proteins 0.000 description 5
- 101001077220 Homo sapiens Ras-associated and pleckstrin homology domains-containing protein 1 Proteins 0.000 description 5
- JJQQGCMKLOEGAV-OSUNSFLBSA-N Ile-Thr-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)O)N JJQQGCMKLOEGAV-OSUNSFLBSA-N 0.000 description 5
- 208000011823 Juvenile amyotrophic lateral sclerosis Diseases 0.000 description 5
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 5
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 5
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 5
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 5
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 5
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 5
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 5
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 5
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 5
- 238000000636 Northern blotting Methods 0.000 description 5
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 5
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 5
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 5
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 5
- 102100039977 Regulator of chromosome condensation Human genes 0.000 description 5
- 101710150974 Regulator of chromosome condensation Proteins 0.000 description 5
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 5
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 5
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 5
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 5
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 5
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 5
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 5
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 5
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 5
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 5
- 108010038633 aspartylglutamate Proteins 0.000 description 5
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 5
- 230000002068 genetic effect Effects 0.000 description 5
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 5
- 108010089804 glycyl-threonine Proteins 0.000 description 5
- 108010015792 glycyllysine Proteins 0.000 description 5
- 108010025306 histidylleucine Proteins 0.000 description 5
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 5
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 5
- 108010009298 lysylglutamic acid Proteins 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 210000003205 muscle Anatomy 0.000 description 5
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 5
- 108010031719 prolyl-serine Proteins 0.000 description 5
- 108010048818 seryl-histidine Proteins 0.000 description 5
- 210000000278 spinal cord Anatomy 0.000 description 5
- 238000012360 testing method Methods 0.000 description 5
- 210000001519 tissue Anatomy 0.000 description 5
- 108010073969 valyllysine Proteins 0.000 description 5
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 4
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 4
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 4
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 4
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 4
- MSILNNHVVMMTHZ-UWVGGRQHSA-N Arg-His-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 MSILNNHVVMMTHZ-UWVGGRQHSA-N 0.000 description 4
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 4
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 4
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 4
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 4
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 4
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 4
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 4
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 4
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 4
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 4
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 4
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 4
- KHNJVFYHIKLUPD-SRVKXCTJSA-N Gln-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHNJVFYHIKLUPD-SRVKXCTJSA-N 0.000 description 4
- RONJIBWTGKVKFY-HTUGSXCWSA-N Gln-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O RONJIBWTGKVKFY-HTUGSXCWSA-N 0.000 description 4
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 4
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 4
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 4
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 4
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 4
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 4
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 4
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 4
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 4
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 4
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 4
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 4
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 4
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 4
- OEROYDLRVAYIMQ-YUMQZZPRSA-N His-Gly-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O OEROYDLRVAYIMQ-YUMQZZPRSA-N 0.000 description 4
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 4
- PBWMCUAFLPMYPF-ZQINRCPSSA-N Ile-Trp-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PBWMCUAFLPMYPF-ZQINRCPSSA-N 0.000 description 4
- 108091092195 Intron Proteins 0.000 description 4
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 4
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 4
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 4
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 4
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 4
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 4
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 4
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 4
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 4
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 4
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 4
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 4
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 4
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 4
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 4
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 4
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 4
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 4
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 4
- MQMIRLVJXQNTRJ-SDDRHHMPSA-N Lys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O MQMIRLVJXQNTRJ-SDDRHHMPSA-N 0.000 description 4
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 4
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 4
- AWMMBHDKERMOID-YTQUADARSA-N Lys-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCCCN)N)C(=O)O AWMMBHDKERMOID-YTQUADARSA-N 0.000 description 4
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 4
- 101100055415 Mus musculus Als2 gene Proteins 0.000 description 4
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 4
- MYQCCQSMKNCNKY-KKUMJFAQSA-N Phe-His-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O)N MYQCCQSMKNCNKY-KKUMJFAQSA-N 0.000 description 4
- FZBGMXYQPACKNC-HJWJTTGWSA-N Phe-Pro-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FZBGMXYQPACKNC-HJWJTTGWSA-N 0.000 description 4
- APXXVISUHOLGEE-ILWGZMRPSA-N Phe-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CC=CC=C4)N)C(=O)O APXXVISUHOLGEE-ILWGZMRPSA-N 0.000 description 4
- NFLNBHLMLYALOO-DCAQKATOSA-N Pro-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 NFLNBHLMLYALOO-DCAQKATOSA-N 0.000 description 4
- 102100025208 Ras-associated and pleckstrin homology domains-containing protein 1 Human genes 0.000 description 4
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 4
- WKLJLEXEENIYQE-SRVKXCTJSA-N Ser-Cys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WKLJLEXEENIYQE-SRVKXCTJSA-N 0.000 description 4
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 4
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 4
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 4
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 4
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 4
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 4
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 4
- HJAXVYLCKDPPDF-SRVKXCTJSA-N Ser-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N HJAXVYLCKDPPDF-SRVKXCTJSA-N 0.000 description 4
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 4
- 238000002105 Southern blotting Methods 0.000 description 4
- XSTGOZBBXFKGHA-YJRXYDGGSA-N Thr-His-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O XSTGOZBBXFKGHA-YJRXYDGGSA-N 0.000 description 4
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 4
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 4
- XTOCLOATLKOZAU-JBACZVJFSA-N Tyr-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N XTOCLOATLKOZAU-JBACZVJFSA-N 0.000 description 4
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 4
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 4
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 4
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 4
- GMOLURHJBLOBFW-ONGXEEELSA-N Val-Gly-His Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMOLURHJBLOBFW-ONGXEEELSA-N 0.000 description 4
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 4
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 4
- SJLVYVZBFDTRCG-DCAQKATOSA-N Val-Lys-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N SJLVYVZBFDTRCG-DCAQKATOSA-N 0.000 description 4
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 4
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 4
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 4
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 4
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 4
- 239000002253 acid Substances 0.000 description 4
- 108010044940 alanylglutamine Proteins 0.000 description 4
- 108010087924 alanylproline Proteins 0.000 description 4
- 230000000692 anti-sense effect Effects 0.000 description 4
- 108010008355 arginyl-glutamine Proteins 0.000 description 4
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 4
- 108010093581 aspartyl-proline Proteins 0.000 description 4
- 108010068265 aspartyltyrosine Proteins 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 230000018109 developmental process Effects 0.000 description 4
- 210000003527 eukaryotic cell Anatomy 0.000 description 4
- 239000013604 expression vector Substances 0.000 description 4
- 210000003414 extremity Anatomy 0.000 description 4
- 208000019995 familial amyotrophic lateral sclerosis Diseases 0.000 description 4
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 4
- 238000010363 gene targeting Methods 0.000 description 4
- 238000001415 gene therapy Methods 0.000 description 4
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 4
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 4
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 4
- 108010050343 histidyl-alanyl-glutamine Proteins 0.000 description 4
- 210000005260 human cell Anatomy 0.000 description 4
- 238000007901 in situ hybridization Methods 0.000 description 4
- 108010000761 leucylarginine Proteins 0.000 description 4
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 4
- 108010056582 methionylglutamic acid Proteins 0.000 description 4
- 108010034507 methionyltryptophan Proteins 0.000 description 4
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 4
- 108010029020 prolylglycine Proteins 0.000 description 4
- 102000005912 ran GTP Binding Protein Human genes 0.000 description 4
- 108010005652 splenotritin Proteins 0.000 description 4
- 238000002560 therapeutic procedure Methods 0.000 description 4
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 4
- 108010078580 tyrosylleucine Proteins 0.000 description 4
- JUEUYDRZJNQZGR-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]amino]acetyl]amino]-3-phenylpropanoic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JUEUYDRZJNQZGR-UHFFFAOYSA-N 0.000 description 3
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 3
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 3
- DAEFQZCYZKRTLR-ZLUOBGJFSA-N Ala-Cys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O DAEFQZCYZKRTLR-ZLUOBGJFSA-N 0.000 description 3
- IYCZBJXFSZSHPN-DLOVCJGASA-N Ala-Cys-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IYCZBJXFSZSHPN-DLOVCJGASA-N 0.000 description 3
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 3
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 3
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 3
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 3
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 3
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 3
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 3
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 3
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 3
- 101100049539 Arabidopsis thaliana VPS9A gene Proteins 0.000 description 3
- NAARDJBSSPUGCF-FXQIFTODSA-N Arg-Cys-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N NAARDJBSSPUGCF-FXQIFTODSA-N 0.000 description 3
- RYRQZJVFDVWURI-SRVKXCTJSA-N Arg-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N RYRQZJVFDVWURI-SRVKXCTJSA-N 0.000 description 3
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 3
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 3
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 3
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 3
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 3
- LVHMEJJWEXBMKK-GMOBBJLQSA-N Asn-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N LVHMEJJWEXBMKK-GMOBBJLQSA-N 0.000 description 3
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 3
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 description 3
- WKGJGVGTEZGFSW-FXQIFTODSA-N Asp-Asn-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O WKGJGVGTEZGFSW-FXQIFTODSA-N 0.000 description 3
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 3
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 3
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 3
- YRZIYQGXTSBRLT-AVGNSLFASA-N Asp-Phe-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YRZIYQGXTSBRLT-AVGNSLFASA-N 0.000 description 3
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 3
- LEYKQPDPZJIRTA-AQZXSJQPSA-N Asp-Trp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LEYKQPDPZJIRTA-AQZXSJQPSA-N 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- 206010010904 Convulsion Diseases 0.000 description 3
- VCIIDXDOPGHMDQ-WDSKDSINSA-N Cys-Gly-Gln Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VCIIDXDOPGHMDQ-WDSKDSINSA-N 0.000 description 3
- BBQIWFFTTQTNOC-AVGNSLFASA-N Cys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N BBQIWFFTTQTNOC-AVGNSLFASA-N 0.000 description 3
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 3
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 3
- ULXXDWZMMSQBDC-ACZMJKKPSA-N Gln-Asp-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ULXXDWZMMSQBDC-ACZMJKKPSA-N 0.000 description 3
- MFLMFRZBAJSGHK-ACZMJKKPSA-N Gln-Cys-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N MFLMFRZBAJSGHK-ACZMJKKPSA-N 0.000 description 3
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 3
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 3
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 3
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 3
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 3
- QBEWLBKBGXVVPD-RYUDHWBXSA-N Gln-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N QBEWLBKBGXVVPD-RYUDHWBXSA-N 0.000 description 3
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 3
- SJMJMEWQMBJYPR-DZKIICNBSA-N Gln-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N SJMJMEWQMBJYPR-DZKIICNBSA-N 0.000 description 3
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 3
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 3
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 3
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 3
- VSMQDIVEBXPKRT-QEJZJMRPSA-N Glu-Cys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N VSMQDIVEBXPKRT-QEJZJMRPSA-N 0.000 description 3
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 3
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 3
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 3
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 3
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 3
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 3
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 3
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 3
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 3
- QGAJQIGFFIQJJK-IHRRRGAJSA-N Glu-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QGAJQIGFFIQJJK-IHRRRGAJSA-N 0.000 description 3
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 3
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 3
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 3
- XXGQRGQPGFYECI-WDSKDSINSA-N Gly-Cys-Glu Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(O)=O XXGQRGQPGFYECI-WDSKDSINSA-N 0.000 description 3
- VUUOMYFPWDYETE-WDSKDSINSA-N Gly-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN VUUOMYFPWDYETE-WDSKDSINSA-N 0.000 description 3
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 3
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 3
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 3
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 3
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 3
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 3
- PCPOYRCAHPJXII-UWVGGRQHSA-N Gly-Lys-Met Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PCPOYRCAHPJXII-UWVGGRQHSA-N 0.000 description 3
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 3
- QGDOOCIPHSSADO-STQMWFEESA-N Gly-Met-Phe Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGDOOCIPHSSADO-STQMWFEESA-N 0.000 description 3
- TTYVAUJGNMVTRN-GJZGRUSLSA-N Gly-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)CN TTYVAUJGNMVTRN-GJZGRUSLSA-N 0.000 description 3
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 3
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 3
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 3
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 3
- UVUIXIVPKVMONA-CIUDSAMLSA-N His-Cys-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CN=CN1 UVUIXIVPKVMONA-CIUDSAMLSA-N 0.000 description 3
- PYNUBZSXKQKAHL-UWVGGRQHSA-N His-Gly-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O PYNUBZSXKQKAHL-UWVGGRQHSA-N 0.000 description 3
- VYUXYMRNGALHEA-DLOVCJGASA-N His-Leu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O VYUXYMRNGALHEA-DLOVCJGASA-N 0.000 description 3
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 3
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 3
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 3
- WEWCEPOYKANMGZ-MMWGEVLESA-N Ile-Cys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N WEWCEPOYKANMGZ-MMWGEVLESA-N 0.000 description 3
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 3
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 3
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 3
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 3
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 3
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 3
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 3
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 3
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 3
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 3
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 3
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 3
- JLYUZRKPDKHUTC-WDSOQIARSA-N Leu-Pro-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JLYUZRKPDKHUTC-WDSOQIARSA-N 0.000 description 3
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 3
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 3
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 3
- CNWDWAMPKVYJJB-NUTKFTJISA-N Leu-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CNWDWAMPKVYJJB-NUTKFTJISA-N 0.000 description 3
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 3
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 3
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 3
- ULUQBUKAPDUKOC-GVXVVHGQSA-N Lys-Glu-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O ULUQBUKAPDUKOC-GVXVVHGQSA-N 0.000 description 3
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 3
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 3
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 3
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 3
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 3
- INMBONMDMGPADT-AVGNSLFASA-N Lys-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N INMBONMDMGPADT-AVGNSLFASA-N 0.000 description 3
- SVSQSPICRKBMSZ-SRVKXCTJSA-N Lys-Pro-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O SVSQSPICRKBMSZ-SRVKXCTJSA-N 0.000 description 3
- KXYLFJIQDIMURW-IHPCNDPISA-N Lys-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCCN)=CNC2=C1 KXYLFJIQDIMURW-IHPCNDPISA-N 0.000 description 3
- NQOQDINRVQCAKD-ULQDDVLXSA-N Lys-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N NQOQDINRVQCAKD-ULQDDVLXSA-N 0.000 description 3
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 3
- VWJFOUBDZIUXGA-AVGNSLFASA-N Lys-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N VWJFOUBDZIUXGA-AVGNSLFASA-N 0.000 description 3
- YLBUMXYVQCHBPR-ULQDDVLXSA-N Met-Leu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YLBUMXYVQCHBPR-ULQDDVLXSA-N 0.000 description 3
- NSMXRFMGZYTFEX-KJEVXHAQSA-N Met-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCSC)N)O NSMXRFMGZYTFEX-KJEVXHAQSA-N 0.000 description 3
- PNHRPOWKRRJATF-IHRRRGAJSA-N Met-Tyr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 PNHRPOWKRRJATF-IHRRRGAJSA-N 0.000 description 3
- 108010021466 Mutant Proteins Proteins 0.000 description 3
- 102000008300 Mutant Proteins Human genes 0.000 description 3
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 3
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 3
- 108010066427 N-valyltryptophan Proteins 0.000 description 3
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 3
- 241000761456 Nops Species 0.000 description 3
- 238000002944 PCR assay Methods 0.000 description 3
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 3
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 3
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 3
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 3
- LCWXSALTPTZKNM-CIUDSAMLSA-N Pro-Cys-Glu Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O LCWXSALTPTZKNM-CIUDSAMLSA-N 0.000 description 3
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 3
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 3
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 3
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 3
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 3
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 3
- 101100049541 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) VPS9 gene Proteins 0.000 description 3
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 3
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 3
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 3
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 3
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 3
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 3
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 3
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 3
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 3
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 3
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 3
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 3
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 3
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 3
- SIEZEMFJLYRUMK-YTWAJWBKSA-N Thr-Met-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N)O SIEZEMFJLYRUMK-YTWAJWBKSA-N 0.000 description 3
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 3
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 3
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 3
- DZIKVMCFXIIETR-JSGCOSHPSA-N Trp-Gly-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O DZIKVMCFXIIETR-JSGCOSHPSA-N 0.000 description 3
- NXQAOORHSYJRGH-AAEUAGOBSA-N Trp-Gly-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 NXQAOORHSYJRGH-AAEUAGOBSA-N 0.000 description 3
- HJXOFWKCWLHYIJ-SZMVWBNQSA-N Trp-Lys-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HJXOFWKCWLHYIJ-SZMVWBNQSA-N 0.000 description 3
- GNCPKOZDOCQRAF-BPUTZDHNSA-N Trp-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N GNCPKOZDOCQRAF-BPUTZDHNSA-N 0.000 description 3
- ILTXFANLDMJWPR-SIUGBPQLSA-N Tyr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N ILTXFANLDMJWPR-SIUGBPQLSA-N 0.000 description 3
- AGDDLOQMXUQPDY-BZSNNMDCSA-N Tyr-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O AGDDLOQMXUQPDY-BZSNNMDCSA-N 0.000 description 3
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 3
- VKYDVKAKGDNZED-STECZYCISA-N Tyr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N VKYDVKAKGDNZED-STECZYCISA-N 0.000 description 3
- WOCYUGQDXPTQPY-FXQIFTODSA-N Val-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N WOCYUGQDXPTQPY-FXQIFTODSA-N 0.000 description 3
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 3
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 3
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 3
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 3
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 3
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 3
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 3
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 3
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 3
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 3
- 108010068380 arginylarginine Proteins 0.000 description 3
- 108010062796 arginyllysine Proteins 0.000 description 3
- 108010036533 arginylvaline Proteins 0.000 description 3
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 3
- 239000000969 carrier Substances 0.000 description 3
- 238000012512 characterization method Methods 0.000 description 3
- 230000036461 convulsion Effects 0.000 description 3
- 108010069495 cysteinyltyrosine Proteins 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 238000002405 diagnostic procedure Methods 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 3
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 3
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 3
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 3
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 3
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 3
- 210000003917 human chromosome Anatomy 0.000 description 3
- 238000002955 isolation Methods 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 3
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 3
- 108010012058 leucyltyrosine Proteins 0.000 description 3
- 210000004698 lymphocyte Anatomy 0.000 description 3
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 3
- 108010054155 lysyllysine Proteins 0.000 description 3
- 108010017391 lysylvaline Proteins 0.000 description 3
- 238000013507 mapping Methods 0.000 description 3
- 210000002161 motor neuron Anatomy 0.000 description 3
- 230000003387 muscular Effects 0.000 description 3
- 108010077112 prolyl-proline Proteins 0.000 description 3
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 230000006798 recombination Effects 0.000 description 3
- 238000005215 recombination Methods 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- 102000007268 rho GTP-Binding Proteins Human genes 0.000 description 3
- 108010033674 rho GTP-Binding Proteins Proteins 0.000 description 3
- 238000012216 screening Methods 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 108010071207 serylmethionine Proteins 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 230000008685 targeting Effects 0.000 description 3
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 3
- 108010038745 tryptophylglycine Proteins 0.000 description 3
- 238000005406 washing Methods 0.000 description 3
- PIDRBUDUWHBYSR-UHFFFAOYSA-N 1-[2-[[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O PIDRBUDUWHBYSR-UHFFFAOYSA-N 0.000 description 2
- 102000007469 Actins Human genes 0.000 description 2
- 108010085238 Actins Proteins 0.000 description 2
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 2
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 2
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 2
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 2
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 2
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 2
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 2
- QKHWNPQNOHEFST-VZFHVOOUSA-N Ala-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N)O QKHWNPQNOHEFST-VZFHVOOUSA-N 0.000 description 2
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 2
- XKXAZPSREVUCRT-BPNCWPANSA-N Ala-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=C(O)C=C1 XKXAZPSREVUCRT-BPNCWPANSA-N 0.000 description 2
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 2
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 2
- VDBKFYYIBLXEIF-GUBZILKMSA-N Arg-Gln-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VDBKFYYIBLXEIF-GUBZILKMSA-N 0.000 description 2
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 2
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 2
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 2
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 2
- CVXXSWQORBZAAA-SRVKXCTJSA-N Arg-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N CVXXSWQORBZAAA-SRVKXCTJSA-N 0.000 description 2
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 2
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 2
- QBQVKUNBCAFXSV-ULQDDVLXSA-N Arg-Lys-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QBQVKUNBCAFXSV-ULQDDVLXSA-N 0.000 description 2
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 2
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 2
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 2
- YHZQOSXDTFRZKU-WDSOQIARSA-N Arg-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 YHZQOSXDTFRZKU-WDSOQIARSA-N 0.000 description 2
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 2
- QGNXYDHVERJIAY-ACZMJKKPSA-N Asn-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGNXYDHVERJIAY-ACZMJKKPSA-N 0.000 description 2
- RAKKBBHMTJSXOY-XVYDVKMFSA-N Asn-His-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O RAKKBBHMTJSXOY-XVYDVKMFSA-N 0.000 description 2
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 2
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 2
- AEZCCDMZZJOGII-DCAQKATOSA-N Asn-Met-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O AEZCCDMZZJOGII-DCAQKATOSA-N 0.000 description 2
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 2
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 2
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 2
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 2
- CPYHLXSGDBDULY-IHPCNDPISA-N Asn-Trp-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CPYHLXSGDBDULY-IHPCNDPISA-N 0.000 description 2
- JPPLRQVZMZFOSX-UWJYBYFXSA-N Asn-Tyr-Ala Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 JPPLRQVZMZFOSX-UWJYBYFXSA-N 0.000 description 2
- CXBOKJPLEYUPGB-FXQIFTODSA-N Asp-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N CXBOKJPLEYUPGB-FXQIFTODSA-N 0.000 description 2
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 2
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 2
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 2
- SWTQDYFZVOJVLL-KKUMJFAQSA-N Asp-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)O SWTQDYFZVOJVLL-KKUMJFAQSA-N 0.000 description 2
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 2
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 2
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 2
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 2
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 2
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 2
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 2
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 2
- 206010003694 Atrophy Diseases 0.000 description 2
- 244000063299 Bacillus subtilis Species 0.000 description 2
- 235000014469 Bacillus subtilis Nutrition 0.000 description 2
- 102100021676 Baculoviral IAP repeat-containing protein 1 Human genes 0.000 description 2
- 206010006542 Bulbar palsy Diseases 0.000 description 2
- 102100035933 Calcium-responsive transcription factor Human genes 0.000 description 2
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 2
- BMHBJCVEXUBGFI-BIIVOSGPSA-N Cys-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CS)N)C(=O)O BMHBJCVEXUBGFI-BIIVOSGPSA-N 0.000 description 2
- CVLIHKBUPSFRQP-WHFBIAKZSA-N Cys-Gly-Ala Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C)C(O)=O CVLIHKBUPSFRQP-WHFBIAKZSA-N 0.000 description 2
- HKALUUKHYNEDRS-GUBZILKMSA-N Cys-Leu-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HKALUUKHYNEDRS-GUBZILKMSA-N 0.000 description 2
- OZSBRCONEMXYOJ-AVGNSLFASA-N Cys-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N OZSBRCONEMXYOJ-AVGNSLFASA-N 0.000 description 2
- WTXCNOPZMQRTNN-BWBBJGPYSA-N Cys-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)O WTXCNOPZMQRTNN-BWBBJGPYSA-N 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- JFOKLAPFYCTNHW-SRVKXCTJSA-N Gln-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N JFOKLAPFYCTNHW-SRVKXCTJSA-N 0.000 description 2
- WMOMPXKOKASNBK-PEFMBERDSA-N Gln-Asn-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WMOMPXKOKASNBK-PEFMBERDSA-N 0.000 description 2
- UZMWDBOHAOSCCH-ACZMJKKPSA-N Gln-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O UZMWDBOHAOSCCH-ACZMJKKPSA-N 0.000 description 2
- LOJYQMFIIJVETK-WDSKDSINSA-N Gln-Gln Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LOJYQMFIIJVETK-WDSKDSINSA-N 0.000 description 2
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 2
- BVELAHPZLYLZDJ-HGNGGELXSA-N Gln-His-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O BVELAHPZLYLZDJ-HGNGGELXSA-N 0.000 description 2
- NROSLUJMIQGFKS-IUCAKERBSA-N Gln-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N NROSLUJMIQGFKS-IUCAKERBSA-N 0.000 description 2
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 2
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 2
- DRNMNLKUUKKPIA-HTUGSXCWSA-N Gln-Phe-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)CCC(N)=O)C(O)=O DRNMNLKUUKKPIA-HTUGSXCWSA-N 0.000 description 2
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 2
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 2
- CSMHMEATMDCQNY-DZKIICNBSA-N Gln-Val-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CSMHMEATMDCQNY-DZKIICNBSA-N 0.000 description 2
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 2
- IYAUFWMUCGBFMQ-CIUDSAMLSA-N Glu-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N IYAUFWMUCGBFMQ-CIUDSAMLSA-N 0.000 description 2
- OJGLIOXAKGFFDW-SRVKXCTJSA-N Glu-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N OJGLIOXAKGFFDW-SRVKXCTJSA-N 0.000 description 2
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 2
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 2
- RQNYYRHRKSVKAB-GUBZILKMSA-N Glu-Cys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O RQNYYRHRKSVKAB-GUBZILKMSA-N 0.000 description 2
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 2
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 2
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 2
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 2
- WDTAKCUOIKHCTB-NKIYYHGXSA-N Glu-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N)O WDTAKCUOIKHCTB-NKIYYHGXSA-N 0.000 description 2
- YVYVMJNUENBOOL-KBIXCLLPSA-N Glu-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N YVYVMJNUENBOOL-KBIXCLLPSA-N 0.000 description 2
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 2
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 2
- NTHIHAUEXVTXQG-KKUMJFAQSA-N Glu-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O NTHIHAUEXVTXQG-KKUMJFAQSA-N 0.000 description 2
- QOOFKCCZZWTCEP-AVGNSLFASA-N Glu-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QOOFKCCZZWTCEP-AVGNSLFASA-N 0.000 description 2
- XIJOPMSILDNVNJ-ZVZYQTTQSA-N Glu-Val-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XIJOPMSILDNVNJ-ZVZYQTTQSA-N 0.000 description 2
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 2
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 2
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 2
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 2
- RQZGFWKQLPJOEQ-YUMQZZPRSA-N Gly-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN)CN=C(N)N RQZGFWKQLPJOEQ-YUMQZZPRSA-N 0.000 description 2
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 2
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 2
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 2
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 2
- VOCMRCVMAPSSAL-IUCAKERBSA-N Gly-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN VOCMRCVMAPSSAL-IUCAKERBSA-N 0.000 description 2
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 2
- ZOTGXWMKUFSKEU-QXEWZRGKSA-N Gly-Ile-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O ZOTGXWMKUFSKEU-QXEWZRGKSA-N 0.000 description 2
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 2
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 2
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 2
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 2
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 2
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 2
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 2
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- VHHYJBSXXMPQGZ-AVGNSLFASA-N His-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N VHHYJBSXXMPQGZ-AVGNSLFASA-N 0.000 description 2
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 2
- OSZUPUINVNPCOE-SDDRHHMPSA-N His-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OSZUPUINVNPCOE-SDDRHHMPSA-N 0.000 description 2
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 2
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 2
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 2
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 2
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 2
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 2
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 2
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 2
- QLBXWYXMLHAREM-PYJNHQTQSA-N His-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N QLBXWYXMLHAREM-PYJNHQTQSA-N 0.000 description 2
- 101000715597 Homo sapiens Calcium-responsive transcription factor Proteins 0.000 description 2
- 101000601056 Homo sapiens NIF3-like protein 1 Proteins 0.000 description 2
- 101000642656 Homo sapiens STE20-related kinase adapter protein beta Proteins 0.000 description 2
- 101000610729 Homo sapiens Trafficking kinesin-binding protein 2 Proteins 0.000 description 2
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 2
- HLYBGMZJVDHJEO-CYDGBPFRSA-N Ile-Arg-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HLYBGMZJVDHJEO-CYDGBPFRSA-N 0.000 description 2
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 2
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 2
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 2
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 2
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 2
- DNKDIDZHXZAGRY-HJWJTTGWSA-N Ile-Met-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N DNKDIDZHXZAGRY-HJWJTTGWSA-N 0.000 description 2
- VOCZPDONPURUHV-QEWYBTABSA-N Ile-Phe-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VOCZPDONPURUHV-QEWYBTABSA-N 0.000 description 2
- XHBYEMIUENPZLY-GMOBBJLQSA-N Ile-Pro-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O XHBYEMIUENPZLY-GMOBBJLQSA-N 0.000 description 2
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 2
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 2
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 2
- 208000026350 Inborn Genetic disease Diseases 0.000 description 2
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 2
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 2
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 2
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 2
- PIHFVNPEAHFNLN-KKUMJFAQSA-N Leu-Cys-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N PIHFVNPEAHFNLN-KKUMJFAQSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 2
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 2
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 2
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 2
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 2
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 2
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 2
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- FKQPWMZLIIATBA-AJNGGQMLSA-N Leu-Lys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FKQPWMZLIIATBA-AJNGGQMLSA-N 0.000 description 2
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 2
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 2
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 2
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 2
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 2
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 2
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 2
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 2
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 2
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 2
- 102100022119 Lipoprotein lipase Human genes 0.000 description 2
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 2
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 2
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 2
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 2
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 2
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 2
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 2
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 2
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 2
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 2
- PINHPJWGVBKQII-SRVKXCTJSA-N Lys-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N PINHPJWGVBKQII-SRVKXCTJSA-N 0.000 description 2
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 2
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 2
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 2
- IPSDPDAOSAEWCN-RHYQMDGZSA-N Lys-Met-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IPSDPDAOSAEWCN-RHYQMDGZSA-N 0.000 description 2
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 2
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 2
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 2
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 2
- RYOLKFYZBHMYFW-WDSOQIARSA-N Lys-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 RYOLKFYZBHMYFW-WDSOQIARSA-N 0.000 description 2
- YUTZYVTZDVZBJJ-IHPCNDPISA-N Lys-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 YUTZYVTZDVZBJJ-IHPCNDPISA-N 0.000 description 2
- PPNCMJARTHYNEC-MEYUZBJRSA-N Lys-Tyr-Thr Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)CC1=CC=C(O)C=C1 PPNCMJARTHYNEC-MEYUZBJRSA-N 0.000 description 2
- 102100023261 MAGUK p55 subfamily member 4 Human genes 0.000 description 2
- UAPZLLPGGOOCRO-IHRRRGAJSA-N Met-Asn-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N UAPZLLPGGOOCRO-IHRRRGAJSA-N 0.000 description 2
- DNDVVILEHVMWIS-LPEHRKFASA-N Met-Asp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DNDVVILEHVMWIS-LPEHRKFASA-N 0.000 description 2
- UOENBSHXYCHSAU-YUMQZZPRSA-N Met-Gln-Gly Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UOENBSHXYCHSAU-YUMQZZPRSA-N 0.000 description 2
- NHXXGBXJTLRGJI-GUBZILKMSA-N Met-Pro-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O NHXXGBXJTLRGJI-GUBZILKMSA-N 0.000 description 2
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 2
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 2
- 241000699660 Mus musculus Species 0.000 description 2
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 102100037380 NIF3-like protein 1 Human genes 0.000 description 2
- 108010006696 Neuronal Apoptosis-Inhibitory Protein Proteins 0.000 description 2
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 2
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 2
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 2
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 2
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 2
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 2
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 2
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 2
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 2
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 2
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 2
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 2
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 2
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 2
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 2
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 2
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 2
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 2
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 2
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 2
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 2
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 2
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 2
- CMOIIANLNNYUTP-SRVKXCTJSA-N Pro-Gln-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CMOIIANLNNYUTP-SRVKXCTJSA-N 0.000 description 2
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 2
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 2
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 2
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 2
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 2
- QEWBZBLXDKIQPS-STQMWFEESA-N Pro-Gly-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QEWBZBLXDKIQPS-STQMWFEESA-N 0.000 description 2
- SOACYAXADBWDDT-CYDGBPFRSA-N Pro-Ile-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SOACYAXADBWDDT-CYDGBPFRSA-N 0.000 description 2
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 2
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 2
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 2
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 2
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 description 2
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 2
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 2
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 2
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 2
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 2
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 2
- 108091034057 RNA (poly(A)) Proteins 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- 208000007014 Retinitis pigmentosa Diseases 0.000 description 2
- 108010053823 Rho Guanine Nucleotide Exchange Factors Proteins 0.000 description 2
- 102100035929 STE20-related kinase adapter protein beta Human genes 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 108091081021 Sense strand Proteins 0.000 description 2
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 2
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 2
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 2
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 2
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 2
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 2
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 2
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 2
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 2
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 2
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 2
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 2
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 2
- UICKAKRRRBTILH-GUBZILKMSA-N Ser-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N UICKAKRRRBTILH-GUBZILKMSA-N 0.000 description 2
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 2
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 2
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 2
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 2
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 2
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 2
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 2
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 2
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 2
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 2
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 2
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 2
- BCAVNDNYOGTQMQ-AAEUAGOBSA-N Ser-Trp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O BCAVNDNYOGTQMQ-AAEUAGOBSA-N 0.000 description 2
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- 102000008221 Superoxide Dismutase-1 Human genes 0.000 description 2
- 108010021188 Superoxide Dismutase-1 Proteins 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 2
- IRKWVRSEQFTGGV-VEVYYDQMSA-N Thr-Asn-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IRKWVRSEQFTGGV-VEVYYDQMSA-N 0.000 description 2
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 2
- LKEKWDJCJSPXNI-IRIUXVKKSA-N Thr-Glu-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LKEKWDJCJSPXNI-IRIUXVKKSA-N 0.000 description 2
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 2
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 2
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 2
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 2
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 2
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 2
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 2
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 2
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 2
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 2
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 2
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 2
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 2
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 2
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 2
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 2
- 102100040377 Trafficking kinesin-binding protein 2 Human genes 0.000 description 2
- IQGJAHMZWBTRIF-UBHSHLNASA-N Trp-Asp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IQGJAHMZWBTRIF-UBHSHLNASA-N 0.000 description 2
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 2
- RWAYYYOZMHMEGD-XIRDDKMYSA-N Trp-Leu-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 RWAYYYOZMHMEGD-XIRDDKMYSA-N 0.000 description 2
- WHJVRIBYQWHRQA-NQCBNZPSSA-N Trp-Phe-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=CC=C1 WHJVRIBYQWHRQA-NQCBNZPSSA-N 0.000 description 2
- SUGLEXVWEJOCGN-ONUFPDRFSA-N Trp-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N)O SUGLEXVWEJOCGN-ONUFPDRFSA-N 0.000 description 2
- KPEVFMGKBCMTJF-SZMVWBNQSA-N Trp-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N KPEVFMGKBCMTJF-SZMVWBNQSA-N 0.000 description 2
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 2
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 2
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 2
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 2
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 2
- QQCCSDWLVIEPSF-BVSLBCMMSA-N Tyr-Met-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 QQCCSDWLVIEPSF-BVSLBCMMSA-N 0.000 description 2
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 2
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 2
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 2
- LDKDSFQSEUOCOO-RPTUDFQQSA-N Tyr-Thr-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LDKDSFQSEUOCOO-RPTUDFQQSA-N 0.000 description 2
- DJSYPCWZPNHQQE-FHWLQOOXSA-N Tyr-Tyr-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=C(O)C=C1 DJSYPCWZPNHQQE-FHWLQOOXSA-N 0.000 description 2
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 description 2
- LABUITCFCAABSV-UHFFFAOYSA-N Val-Ala-Tyr Natural products CC(C)C(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-UHFFFAOYSA-N 0.000 description 2
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 2
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 2
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 2
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 2
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 2
- VXCAZHCVDBQMTP-NRPADANISA-N Val-Cys-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VXCAZHCVDBQMTP-NRPADANISA-N 0.000 description 2
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 2
- AHHJARQXFFGOKF-NRPADANISA-N Val-Glu-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N AHHJARQXFFGOKF-NRPADANISA-N 0.000 description 2
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 2
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 2
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 2
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 2
- MLADEWAIYAPAAU-IHRRRGAJSA-N Val-Lys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MLADEWAIYAPAAU-IHRRRGAJSA-N 0.000 description 2
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 2
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 2
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 2
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 2
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 2
- NGXQOQNXSGOYOI-BQFCYCMXSA-N Val-Trp-Gln Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 NGXQOQNXSGOYOI-BQFCYCMXSA-N 0.000 description 2
- QTXGUIMEHKCPBH-FHWLQOOXSA-N Val-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 QTXGUIMEHKCPBH-FHWLQOOXSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 230000002378 acidificating effect Effects 0.000 description 2
- 210000002226 anterior horn cell Anatomy 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 230000037444 atrophy Effects 0.000 description 2
- 230000008335 axon cargo transport Effects 0.000 description 2
- 230000037429 base substitution Effects 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 229910000389 calcium phosphate Inorganic materials 0.000 description 2
- 239000001506 calcium phosphate Substances 0.000 description 2
- 235000011010 calcium phosphates Nutrition 0.000 description 2
- 210000001638 cerebellum Anatomy 0.000 description 2
- 210000003710 cerebral cortex Anatomy 0.000 description 2
- 230000001684 chronic effect Effects 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000009833 condensation Methods 0.000 description 2
- 230000005494 condensation Effects 0.000 description 2
- 230000007547 defect Effects 0.000 description 2
- 210000001947 dentate gyrus Anatomy 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 238000002509 fluorescent in situ hybridization Methods 0.000 description 2
- 230000037433 frameshift Effects 0.000 description 2
- 102000054766 genetic haplotypes Human genes 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 108010084389 glycyltryptophan Proteins 0.000 description 2
- 239000005090 green fluorescent protein Substances 0.000 description 2
- 102000009543 guanyl-nucleotide exchange factor activity proteins Human genes 0.000 description 2
- 102000049526 human ALS2 Human genes 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 230000000366 juvenile effect Effects 0.000 description 2
- 239000002502 liposome Substances 0.000 description 2
- 108010043322 lysyl-tryptophyl-alpha-lysine Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 238000002844 melting Methods 0.000 description 2
- 230000008018 melting Effects 0.000 description 2
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 230000001537 neural effect Effects 0.000 description 2
- 230000007935 neutral effect Effects 0.000 description 2
- 239000002853 nucleic acid probe Substances 0.000 description 2
- 210000004940 nucleus Anatomy 0.000 description 2
- 239000002751 oligonucleotide probe Substances 0.000 description 2
- 229920001184 polypeptide Polymers 0.000 description 2
- 201000002241 progressive bulbar palsy Diseases 0.000 description 2
- 230000000750 progressive effect Effects 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 210000000449 purkinje cell Anatomy 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 108010043680 somatostatin(7-10) Proteins 0.000 description 2
- 230000004083 survival effect Effects 0.000 description 2
- 229940124597 therapeutic agent Drugs 0.000 description 2
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 2
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 210000003934 vacuole Anatomy 0.000 description 2
- RCGFMNKLEKXILD-XYCLDAKMSA-N (2s,3r)-2-[[(2s)-6-amino-2-[[(2s)-2-[[(2s)-2-amino-3-phenylpropanoyl]amino]-3-(1h-indol-3-yl)propanoyl]amino]hexanoyl]amino]-3-hydroxybutanoic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@H](O)C)C(O)=O)C1=CC=CC=C1 RCGFMNKLEKXILD-XYCLDAKMSA-N 0.000 description 1
- 101150028074 2 gene Proteins 0.000 description 1
- QPIROHVZMLYRNN-YRNJLPRFSA-N 2-[(3s,6s,9s,12s,15r)-15-[[(2s,3r)-2-[[(2s)-2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]-3-hydroxypropanoyl]amino]-3-hydroxybutanoyl]amino]-9-benzyl-6-[(2s)-butan-2-yl]-3-(2-methylsulfanylethyl)-2,5,8,11,14-pentaoxo-1-thia-4,7,10,13-tetrazacyclohe Chemical compound C([C@H]1C(=O)N[C@H](C(N[C@@H](CCSC)C(=O)SC[C@@H](C(=O)N[C@@H](CC(O)=O)C(=O)N1)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)[C@@H](C)O)=O)[C@@H](C)CC)C1=CC=CC=C1 QPIROHVZMLYRNN-YRNJLPRFSA-N 0.000 description 1
- BFSVOASYOCHEOV-UHFFFAOYSA-N 2-diethylaminoethanol Chemical compound CCN(CC)CCO BFSVOASYOCHEOV-UHFFFAOYSA-N 0.000 description 1
- 101150001232 ALS gene Proteins 0.000 description 1
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- JPGBXANAQYHTLA-DRZSPHRISA-N Ala-Gln-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JPGBXANAQYHTLA-DRZSPHRISA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 1
- DYJJJCHDHLEFDW-FXQIFTODSA-N Ala-Pro-Cys Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N DYJJJCHDHLEFDW-FXQIFTODSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- FSXDWQGEWZQBPJ-HERUPUMHSA-N Ala-Trp-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FSXDWQGEWZQBPJ-HERUPUMHSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 1
- KJGNDQCYBNBXDA-GUBZILKMSA-N Arg-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N KJGNDQCYBNBXDA-GUBZILKMSA-N 0.000 description 1
- PVSNBTCXCQIXSE-JYJNAYRXSA-N Arg-Arg-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PVSNBTCXCQIXSE-JYJNAYRXSA-N 0.000 description 1
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 1
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- PTVGLOCPAVYPFG-CIUDSAMLSA-N Arg-Gln-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PTVGLOCPAVYPFG-CIUDSAMLSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 1
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 1
- UIUXXFIKWQVMEX-UFYCRDLUSA-N Arg-Phe-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UIUXXFIKWQVMEX-UFYCRDLUSA-N 0.000 description 1
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- GMRGSBAMMMVDGG-GUBZILKMSA-N Asn-Arg-Arg Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N GMRGSBAMMMVDGG-GUBZILKMSA-N 0.000 description 1
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 1
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 1
- JZDZLBJVYWIIQU-AVGNSLFASA-N Asn-Glu-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JZDZLBJVYWIIQU-AVGNSLFASA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- AITGTTNYKAWKDR-CIUDSAMLSA-N Asn-His-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O AITGTTNYKAWKDR-CIUDSAMLSA-N 0.000 description 1
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- VOGCFWDZYYTEOY-DCAQKATOSA-N Asn-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N VOGCFWDZYYTEOY-DCAQKATOSA-N 0.000 description 1
- MYVBTYXSWILFCG-BQBZGAKWSA-N Asn-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N MYVBTYXSWILFCG-BQBZGAKWSA-N 0.000 description 1
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- SYZWMVSXBZCOBZ-QXEWZRGKSA-N Asn-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N SYZWMVSXBZCOBZ-QXEWZRGKSA-N 0.000 description 1
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 1
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 1
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 1
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 1
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 1
- RATOMFTUDRYMKX-ACZMJKKPSA-N Asp-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N RATOMFTUDRYMKX-ACZMJKKPSA-N 0.000 description 1
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- OEDJQRXNDRUGEU-SRVKXCTJSA-N Asp-Leu-His Chemical compound N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O OEDJQRXNDRUGEU-SRVKXCTJSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 1
- WWOYXVBGHAHQBG-FXQIFTODSA-N Asp-Met-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O WWOYXVBGHAHQBG-FXQIFTODSA-N 0.000 description 1
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 1
- OQMGSMNZVHYDTQ-ZKWXMUAHSA-N Asp-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N OQMGSMNZVHYDTQ-ZKWXMUAHSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- 241000020089 Atacta Species 0.000 description 1
- 102100025422 Bone morphogenetic protein receptor type-2 Human genes 0.000 description 1
- 102100025752 CASP8 and FADD-like apoptosis regulator Human genes 0.000 description 1
- 101100356682 Caenorhabditis elegans rho-1 gene Proteins 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 101100302136 Chlorobium chlorochromatii (strain CaD3) rimP gene Proteins 0.000 description 1
- 108010077544 Chromatin Proteins 0.000 description 1
- 208000031404 Chromosome Aberrations Diseases 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 102100033250 Cyclin-dependent kinase 15 Human genes 0.000 description 1
- ASHTVGGFIMESRD-LKXGYXEUSA-N Cys-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)O ASHTVGGFIMESRD-LKXGYXEUSA-N 0.000 description 1
- KEBJBKIASQVRJS-WDSKDSINSA-N Cys-Gln-Gly Chemical compound C(CC(=O)N)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N KEBJBKIASQVRJS-WDSKDSINSA-N 0.000 description 1
- UIKLEGZPIOXFHJ-DLOVCJGASA-N Cys-Phe-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O UIKLEGZPIOXFHJ-DLOVCJGASA-N 0.000 description 1
- WZJLBUPPZRZNTO-CIUDSAMLSA-N Cys-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N WZJLBUPPZRZNTO-CIUDSAMLSA-N 0.000 description 1
- HPZAJRPYUIHDIN-BZSNNMDCSA-N Cys-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CS)N HPZAJRPYUIHDIN-BZSNNMDCSA-N 0.000 description 1
- 206010067477 Cytogenetic abnormality Diseases 0.000 description 1
- 102100039498 Cytotoxic T-lymphocyte protein 4 Human genes 0.000 description 1
- 101710112752 Cytotoxin Proteins 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 101100382570 Danio rerio caspb gene Proteins 0.000 description 1
- 208000019505 Deglutition disease Diseases 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 102000016607 Diphtheria Toxin Human genes 0.000 description 1
- 108010053187 Diphtheria Toxin Proteins 0.000 description 1
- 102100028571 Disabled homolog 2-interacting protein Human genes 0.000 description 1
- 101710191399 Disabled homolog 2-interacting protein Proteins 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 208000000059 Dyspnea Diseases 0.000 description 1
- 206010013975 Dyspnoeas Diseases 0.000 description 1
- 102100040305 Flagellum-associated coiled-coil domain-containing protein 1 Human genes 0.000 description 1
- 102100039676 Frizzled-7 Human genes 0.000 description 1
- 201000011240 Frontotemporal dementia Diseases 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 1
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 1
- BTSPOOHJBYJRKO-CIUDSAMLSA-N Gln-Asp-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BTSPOOHJBYJRKO-CIUDSAMLSA-N 0.000 description 1
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 1
- OFPWCBGRYAOLMU-AVGNSLFASA-N Gln-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OFPWCBGRYAOLMU-AVGNSLFASA-N 0.000 description 1
- CITDWMLWXNUQKD-FXQIFTODSA-N Gln-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CITDWMLWXNUQKD-FXQIFTODSA-N 0.000 description 1
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 description 1
- QQAPDATZKKTBIY-YUMQZZPRSA-N Gln-Gly-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O QQAPDATZKKTBIY-YUMQZZPRSA-N 0.000 description 1
- DWDBJWAXPXXYLP-SRVKXCTJSA-N Gln-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DWDBJWAXPXXYLP-SRVKXCTJSA-N 0.000 description 1
- GQZDDFRXSDGUNG-YVNDNENWSA-N Gln-Ile-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O GQZDDFRXSDGUNG-YVNDNENWSA-N 0.000 description 1
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 1
- YRHZWVKUFWCEPW-GLLZPBPUSA-N Gln-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YRHZWVKUFWCEPW-GLLZPBPUSA-N 0.000 description 1
- NSEKYCAADBNQFE-XIRDDKMYSA-N Gln-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 NSEKYCAADBNQFE-XIRDDKMYSA-N 0.000 description 1
- NVHJGTGTUGEWCG-ZVZYQTTQSA-N Gln-Trp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O NVHJGTGTUGEWCG-ZVZYQTTQSA-N 0.000 description 1
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 1
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- SVZIKUHLRKVZIF-GUBZILKMSA-N Glu-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N SVZIKUHLRKVZIF-GUBZILKMSA-N 0.000 description 1
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 1
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 1
- RZJIZCXOYDRDBX-UHFFFAOYSA-N Glu-Glu-His-Thr Chemical compound OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(=O)NC(C(=O)NC(C(O)C)C(O)=O)CC1=CN=CN1 RZJIZCXOYDRDBX-UHFFFAOYSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 1
- XIKYNVKEUINBGL-IUCAKERBSA-N Glu-His-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O XIKYNVKEUINBGL-IUCAKERBSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 1
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 1
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 1
- CBOVGULVQSVMPT-CIUDSAMLSA-N Glu-Pro-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O CBOVGULVQSVMPT-CIUDSAMLSA-N 0.000 description 1
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 1
- GUOWMVFLAJNPDY-CIUDSAMLSA-N Glu-Ser-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GUOWMVFLAJNPDY-CIUDSAMLSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 1
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- 108010070675 Glutathione transferase Proteins 0.000 description 1
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 1
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 1
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- SABZDFAAOJATBR-QWRGUYRKSA-N Gly-Cys-Phe Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SABZDFAAOJATBR-QWRGUYRKSA-N 0.000 description 1
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 1
- LJXWZPHEMJSNRC-KBPBESRZSA-N Gly-Gln-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LJXWZPHEMJSNRC-KBPBESRZSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- HFXJIZNEXNIZIJ-BQBZGAKWSA-N Gly-Glu-Gln Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFXJIZNEXNIZIJ-BQBZGAKWSA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- JNGJGFMFXREJNF-KBPBESRZSA-N Gly-Glu-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JNGJGFMFXREJNF-KBPBESRZSA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 1
- NZOAFWHVAFJERA-OALUTQOASA-N Gly-Phe-Trp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NZOAFWHVAFJERA-OALUTQOASA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 1
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
- KBBFOULZCHWGJX-KBPBESRZSA-N Gly-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN)O KBBFOULZCHWGJX-KBPBESRZSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- COZMNNJEGNPDED-HOCLYGCPSA-N Gly-Val-Trp Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O COZMNNJEGNPDED-HOCLYGCPSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- 102100029100 Hematopoietic prostaglandin D synthase Human genes 0.000 description 1
- 208000028782 Hereditary disease Diseases 0.000 description 1
- HXKZJLWGSWQKEA-LSJOCFKGSA-N His-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 HXKZJLWGSWQKEA-LSJOCFKGSA-N 0.000 description 1
- SDTPKSOWFXBACN-GUBZILKMSA-N His-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O SDTPKSOWFXBACN-GUBZILKMSA-N 0.000 description 1
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 1
- HAPWZEVRQYGLSG-IUCAKERBSA-N His-Gly-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O HAPWZEVRQYGLSG-IUCAKERBSA-N 0.000 description 1
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 1
- NDKSHNQINMRKHT-PEXQALLHSA-N His-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N NDKSHNQINMRKHT-PEXQALLHSA-N 0.000 description 1
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 1
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 1
- STGQSBKUYSPPIG-CIUDSAMLSA-N His-Ser-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 STGQSBKUYSPPIG-CIUDSAMLSA-N 0.000 description 1
- BFOGZWSSGMLYKV-DCAQKATOSA-N His-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N BFOGZWSSGMLYKV-DCAQKATOSA-N 0.000 description 1
- PDLQNLSEJXOQNQ-IHPCNDPISA-N His-Trp-Lys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(O)=O)C1=CN=CN1 PDLQNLSEJXOQNQ-IHPCNDPISA-N 0.000 description 1
- CGAMSLMBYJHMDY-ONGXEEELSA-N His-Val-Gly Chemical compound CC(C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N CGAMSLMBYJHMDY-ONGXEEELSA-N 0.000 description 1
- 101000934635 Homo sapiens Bone morphogenetic protein receptor type-2 Proteins 0.000 description 1
- 101000914211 Homo sapiens CASP8 and FADD-like apoptosis regulator Proteins 0.000 description 1
- 101000983518 Homo sapiens Caspase-10 Proteins 0.000 description 1
- 101000944355 Homo sapiens Cyclin-dependent kinase 15 Proteins 0.000 description 1
- 101000889276 Homo sapiens Cytotoxic T-lymphocyte protein 4 Proteins 0.000 description 1
- 101000891676 Homo sapiens Flagellum-associated coiled-coil domain-containing protein 1 Proteins 0.000 description 1
- 101000885797 Homo sapiens Frizzled-7 Proteins 0.000 description 1
- 101001042104 Homo sapiens Inducible T-cell costimulator Proteins 0.000 description 1
- 101001033293 Homo sapiens Interleukin enhancer-binding factor 3 Proteins 0.000 description 1
- 101001115423 Homo sapiens MAGUK p55 subfamily member 4 Proteins 0.000 description 1
- 101100077756 Homo sapiens MPP4 gene Proteins 0.000 description 1
- 101001128623 Homo sapiens NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 3 Proteins 0.000 description 1
- 101000582320 Homo sapiens Neurogenic differentiation factor 6 Proteins 0.000 description 1
- 101000613984 Homo sapiens Origin recognition complex subunit 2 Proteins 0.000 description 1
- 101001091194 Homo sapiens Peptidyl-prolyl cis-trans isomerase G Proteins 0.000 description 1
- 101000752249 Homo sapiens Rho guanine nucleotide exchange factor 3 Proteins 0.000 description 1
- 101000803747 Homo sapiens Ribosome biogenesis protein WDR12 Proteins 0.000 description 1
- 101000914514 Homo sapiens T-cell-specific surface glycoprotein CD28 Proteins 0.000 description 1
- 101000798539 Homo sapiens Transmembrane protein 237 Proteins 0.000 description 1
- 101000802734 Homo sapiens eIF5-mimic protein 2 Proteins 0.000 description 1
- 101000829171 Hypocrea virens (strain Gv29-8 / FGSC 10586) Effector TSP1 Proteins 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- FJWALBCCVIHZBS-QXEWZRGKSA-N Ile-Met-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N FJWALBCCVIHZBS-QXEWZRGKSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 1
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 1
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 1
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 1
- 102000017727 Immunoglobulin Variable Region Human genes 0.000 description 1
- 108010067060 Immunoglobulin Variable Region Proteins 0.000 description 1
- 102100021317 Inducible T-cell costimulator Human genes 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- -1 LOC57404 Proteins 0.000 description 1
- 241000914716 Lenarchus rho Species 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 1
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 1
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 1
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 1
- KXODZBLFVFSLAI-AVGNSLFASA-N Leu-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KXODZBLFVFSLAI-AVGNSLFASA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 1
- WXDRGWBQZIMJDE-ULQDDVLXSA-N Leu-Phe-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O WXDRGWBQZIMJDE-ULQDDVLXSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- SIGZKCWZEBFNAK-QAETUUGQSA-N Leu-Ser-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SIGZKCWZEBFNAK-QAETUUGQSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- URHJPNHRQMQGOZ-RHYQMDGZSA-N Leu-Thr-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O URHJPNHRQMQGOZ-RHYQMDGZSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 1
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- VHFFQUSNFFIZBT-CIUDSAMLSA-N Lys-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N VHFFQUSNFFIZBT-CIUDSAMLSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 1
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 1
- YRWCPXOFBKTCFY-NUTKFTJISA-N Lys-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N YRWCPXOFBKTCFY-NUTKFTJISA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 1
- KZJQUYFDSCFSCO-DLOVCJGASA-N Lys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N KZJQUYFDSCFSCO-DLOVCJGASA-N 0.000 description 1
- VLMNBMFYRMGEMB-QWRGUYRKSA-N Lys-His-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 VLMNBMFYRMGEMB-QWRGUYRKSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- 101710137272 Membrane-associated guanylate kinase, WW and PDZ domain-containing protein 2 Proteins 0.000 description 1
- 208000024556 Mendelian disease Diseases 0.000 description 1
- VIZLHGTVGKBBKO-AVGNSLFASA-N Met-Arg-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VIZLHGTVGKBBKO-AVGNSLFASA-N 0.000 description 1
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 1
- XMMWDTUFTZMQFD-GMOBBJLQSA-N Met-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCSC XMMWDTUFTZMQFD-GMOBBJLQSA-N 0.000 description 1
- WWWGMQHQSAUXBU-BQBZGAKWSA-N Met-Gly-Asn Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O WWWGMQHQSAUXBU-BQBZGAKWSA-N 0.000 description 1
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 1
- UZVKFARGHHMQGX-IUCAKERBSA-N Met-Gly-Met Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCSC UZVKFARGHHMQGX-IUCAKERBSA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 1
- WPTHAGXMYDRPFD-SRVKXCTJSA-N Met-Lys-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O WPTHAGXMYDRPFD-SRVKXCTJSA-N 0.000 description 1
- UFOWQBYMUILSRK-IHRRRGAJSA-N Met-Lys-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 UFOWQBYMUILSRK-IHRRRGAJSA-N 0.000 description 1
- LNXGEYIEEUZGGH-JYJNAYRXSA-N Met-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=CC=C1 LNXGEYIEEUZGGH-JYJNAYRXSA-N 0.000 description 1
- RSOMVHWMIAZNLE-HJWJTTGWSA-N Met-Phe-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSOMVHWMIAZNLE-HJWJTTGWSA-N 0.000 description 1
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 1
- 108091092878 Microsatellite Proteins 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- 102100032195 NADH dehydrogenase [ubiquinone] 1 beta subcomplex subunit 3 Human genes 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 102100030589 Neurogenic differentiation factor 6 Human genes 0.000 description 1
- 108091005461 Nucleic proteins Chemical group 0.000 description 1
- 102100040608 Origin recognition complex subunit 2 Human genes 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102100034850 Peptidyl-prolyl cis-trans isomerase G Human genes 0.000 description 1
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 1
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 1
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 1
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 1
- HQPWNHXERZCIHP-PMVMPFDFSA-N Phe-Leu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 HQPWNHXERZCIHP-PMVMPFDFSA-N 0.000 description 1
- XOHJOMKCRLHGCY-UNQGMJICSA-N Phe-Pro-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOHJOMKCRLHGCY-UNQGMJICSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 1
- 206010035226 Plasma cell myeloma Diseases 0.000 description 1
- 102000010995 Pleckstrin homology domains Human genes 0.000 description 1
- 108050001185 Pleckstrin homology domains Proteins 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 1
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 1
- PZSCUPVOJGKHEP-CIUDSAMLSA-N Pro-Gln-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PZSCUPVOJGKHEP-CIUDSAMLSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- QGOZJLYCGRYYRW-KKUMJFAQSA-N Pro-Glu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QGOZJLYCGRYYRW-KKUMJFAQSA-N 0.000 description 1
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 1
- BFXZQMWKTYWGCF-PYJNHQTQSA-N Pro-His-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BFXZQMWKTYWGCF-PYJNHQTQSA-N 0.000 description 1
- YTWNSIDWAFSEEI-RWMBFGLXSA-N Pro-His-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N3CCC[C@@H]3C(=O)O YTWNSIDWAFSEEI-RWMBFGLXSA-N 0.000 description 1
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 1
- ANESFYPBAJPYNJ-SDDRHHMPSA-N Pro-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ANESFYPBAJPYNJ-SDDRHHMPSA-N 0.000 description 1
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- RTQKBZIRDWZLDF-BZSNNMDCSA-N Pro-Pro-Trp Chemical compound C([C@H]1C(=O)N[C@@H](CC=2C3=CC=CC=C3NC=2)C(=O)O)CCN1C(=O)[C@@H]1CCCN1 RTQKBZIRDWZLDF-BZSNNMDCSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 101150111584 RHOA gene Proteins 0.000 description 1
- 239000013614 RNA sample Substances 0.000 description 1
- 238000010240 RT-PCR analysis Methods 0.000 description 1
- 102000014113 Rabaptin-5 Human genes 0.000 description 1
- 108050003998 Rabaptin-5 Proteins 0.000 description 1
- 241000700159 Rattus Species 0.000 description 1
- 206010038910 Retinitis Diseases 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 102100021689 Rho guanine nucleotide exchange factor 3 Human genes 0.000 description 1
- 102100035119 Ribosome biogenesis protein WDR12 Human genes 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- JJKSSJVYOVRJMZ-FXQIFTODSA-N Ser-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)CN=C(N)N JJKSSJVYOVRJMZ-FXQIFTODSA-N 0.000 description 1
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 1
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 1
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 1
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- KJKQUQXDEKMPDK-FXQIFTODSA-N Ser-Met-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O KJKQUQXDEKMPDK-FXQIFTODSA-N 0.000 description 1
- HEYZPTCCEIWHRO-IHRRRGAJSA-N Ser-Met-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HEYZPTCCEIWHRO-IHRRRGAJSA-N 0.000 description 1
- WOJYIMBIKTWKJO-KKUMJFAQSA-N Ser-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N WOJYIMBIKTWKJO-KKUMJFAQSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- MHVXPTAMDHLTHB-IHPCNDPISA-N Ser-Phe-Trp Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MHVXPTAMDHLTHB-IHPCNDPISA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 1
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- IAOHCSQDQDWRQU-GUBZILKMSA-N Ser-Val-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IAOHCSQDQDWRQU-GUBZILKMSA-N 0.000 description 1
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- 102100026940 Small ubiquitin-related modifier 1 Human genes 0.000 description 1
- 102000005588 Son of Sevenless Proteins Human genes 0.000 description 1
- 108010059447 Son of Sevenless Proteins Proteins 0.000 description 1
- 206010041415 Spastic paralysis Diseases 0.000 description 1
- 108010012715 Superoxide dismutase Proteins 0.000 description 1
- 102100027213 T-cell-specific surface glycoprotein CD28 Human genes 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 1
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- SIMKLINEDYOTKL-MBLNEYKQSA-N Thr-His-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C)C(=O)O)N)O SIMKLINEDYOTKL-MBLNEYKQSA-N 0.000 description 1
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- QFCQNHITJPRQTB-IEGACIPQSA-N Thr-Lys-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O QFCQNHITJPRQTB-IEGACIPQSA-N 0.000 description 1
- DNCUODYZAMHLCV-XGEHTFHBSA-N Thr-Pro-Cys Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N)O DNCUODYZAMHLCV-XGEHTFHBSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 1
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 1
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 1
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- XEVHXNLPUBVQEX-DVJZZOLTSA-N Thr-Trp-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N)O XEVHXNLPUBVQEX-DVJZZOLTSA-N 0.000 description 1
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 1
- XVHAUVJXBFGUPC-RPTUDFQQSA-N Thr-Tyr-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XVHAUVJXBFGUPC-RPTUDFQQSA-N 0.000 description 1
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 102100032480 Transmembrane protein 237 Human genes 0.000 description 1
- MQVGIFJSFFVGFW-XEGUGMAKSA-N Trp-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MQVGIFJSFFVGFW-XEGUGMAKSA-N 0.000 description 1
- RSUXQZNWAOTBQF-XIRDDKMYSA-N Trp-Arg-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RSUXQZNWAOTBQF-XIRDDKMYSA-N 0.000 description 1
- TWJDQTTXXZDJKV-BPUTZDHNSA-N Trp-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O TWJDQTTXXZDJKV-BPUTZDHNSA-N 0.000 description 1
- OBWQLWYNNZPWGX-QEJZJMRPSA-N Trp-Gln-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OBWQLWYNNZPWGX-QEJZJMRPSA-N 0.000 description 1
- WLBZWXXGSOLJBA-HOCLYGCPSA-N Trp-Gly-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 WLBZWXXGSOLJBA-HOCLYGCPSA-N 0.000 description 1
- NLLARHRWSFNEMH-NUTKFTJISA-N Trp-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NLLARHRWSFNEMH-NUTKFTJISA-N 0.000 description 1
- ULHASJWZGUEUNN-XIRDDKMYSA-N Trp-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O ULHASJWZGUEUNN-XIRDDKMYSA-N 0.000 description 1
- WKQNLTQSCYXKQK-VFAJRCTISA-N Trp-Lys-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WKQNLTQSCYXKQK-VFAJRCTISA-N 0.000 description 1
- HTGJDTPQYFMKNC-VFAJRCTISA-N Trp-Thr-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 HTGJDTPQYFMKNC-VFAJRCTISA-N 0.000 description 1
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 1
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 1
- KDGFPPHLXCEQRN-STECZYCISA-N Tyr-Arg-Ile Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDGFPPHLXCEQRN-STECZYCISA-N 0.000 description 1
- JHORGUYURUBVOM-KKUMJFAQSA-N Tyr-His-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O JHORGUYURUBVOM-KKUMJFAQSA-N 0.000 description 1
- USYGMBIIUDLYHJ-GVARAGBVSA-N Tyr-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 USYGMBIIUDLYHJ-GVARAGBVSA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- PSALWJCUIAQKFW-ACRUOGEOSA-N Tyr-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N PSALWJCUIAQKFW-ACRUOGEOSA-N 0.000 description 1
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 1
- VXFXIBCCVLJCJT-JYJNAYRXSA-N Tyr-Pro-Pro Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(O)=O VXFXIBCCVLJCJT-JYJNAYRXSA-N 0.000 description 1
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 1
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 1
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 1
- JQOMHZMWQHXALX-FHWLQOOXSA-N Tyr-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JQOMHZMWQHXALX-FHWLQOOXSA-N 0.000 description 1
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 1
- 108091007492 Ubiquitin-like domain 1 Proteins 0.000 description 1
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 1
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 1
- IVXJODPZRWHCCR-JYJNAYRXSA-N Val-Arg-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IVXJODPZRWHCCR-JYJNAYRXSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- KVRLNEILGGVBJX-IHRRRGAJSA-N Val-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CN=CN1 KVRLNEILGGVBJX-IHRRRGAJSA-N 0.000 description 1
- PYXQBKJPHNCTNW-CYDGBPFRSA-N Val-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N PYXQBKJPHNCTNW-CYDGBPFRSA-N 0.000 description 1
- OJOMXGVLFKYDKP-QXEWZRGKSA-N Val-Met-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OJOMXGVLFKYDKP-QXEWZRGKSA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 1
- SVLAAUGFIHSJPK-JYJNAYRXSA-N Val-Trp-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N SVLAAUGFIHSJPK-JYJNAYRXSA-N 0.000 description 1
- OEVFFOBAXHBXKM-HSHDSVGOSA-N Val-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N)O OEVFFOBAXHBXKM-HSHDSVGOSA-N 0.000 description 1
- PDASTHRLDFOZMG-JYJNAYRXSA-N Val-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 PDASTHRLDFOZMG-JYJNAYRXSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 239000012190 activator Substances 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- VREFGVBLTWBCJP-UHFFFAOYSA-N alprazolam Chemical compound C12=CC(Cl)=CC=C2N2C(C)=NN=C2CN=C1C1=CC=CC=C1 VREFGVBLTWBCJP-UHFFFAOYSA-N 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 238000003277 amino acid sequence analysis Methods 0.000 description 1
- 230000019552 anatomical structure morphogenesis Effects 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 210000001106 artificial yeast chromosome Anatomy 0.000 description 1
- 230000001174 ascending effect Effects 0.000 description 1
- 208000021024 autosomal recessive inheritance Diseases 0.000 description 1
- 210000003050 axon Anatomy 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 210000002459 blastocyst Anatomy 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 108700021031 cdc Genes Proteins 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 230000002490 cerebral effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 210000003483 chromatin Anatomy 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- TVZPLCNGKSPOJA-UHFFFAOYSA-N copper zinc Chemical compound [Cu].[Zn] TVZPLCNGKSPOJA-UHFFFAOYSA-N 0.000 description 1
- 239000013601 cosmid vector Substances 0.000 description 1
- 210000003792 cranial nerve Anatomy 0.000 description 1
- 238000009402 cross-breeding Methods 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 210000004292 cytoskeleton Anatomy 0.000 description 1
- 231100000599 cytotoxic agent Toxicity 0.000 description 1
- 239000002619 cytotoxin Substances 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 238000006073 displacement reaction Methods 0.000 description 1
- 238000010494 dissociation reaction Methods 0.000 description 1
- 230000005593 dissociations Effects 0.000 description 1
- 102100035859 eIF5-mimic protein 2 Human genes 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000002567 electromyography Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000006862 enzymatic digestion Effects 0.000 description 1
- 238000010230 functional analysis Methods 0.000 description 1
- 230000007045 gastrulation Effects 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 210000002503 granulosa cell Anatomy 0.000 description 1
- 230000000971 hippocampal effect Effects 0.000 description 1
- 210000001320 hippocampus Anatomy 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 210000004408 hybridoma Anatomy 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 230000006122 isoprenylation Effects 0.000 description 1
- 108010012212 junctophilin Proteins 0.000 description 1
- 102000019028 junctophilin Human genes 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 230000000394 mitotic effect Effects 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- 208000005264 motor neuron disease Diseases 0.000 description 1
- 201000000050 myeloid neoplasm Diseases 0.000 description 1
- 230000007498 myristoylation Effects 0.000 description 1
- 210000005036 nerve Anatomy 0.000 description 1
- 230000004770 neurodegeneration Effects 0.000 description 1
- 208000015122 neurodegenerative disease Diseases 0.000 description 1
- 210000000956 olfactory bulb Anatomy 0.000 description 1
- 210000000287 oocyte Anatomy 0.000 description 1
- 239000003960 organic solvent Substances 0.000 description 1
- 230000000242 pagocytic effect Effects 0.000 description 1
- 210000001769 parahippocampal gyrus Anatomy 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 230000007115 recruitment Effects 0.000 description 1
- 102000037983 regulatory factors Human genes 0.000 description 1
- 108091008025 regulatory factors Proteins 0.000 description 1
- 230000009711 regulatory function Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 210000000880 retinal rod photoreceptor cell Anatomy 0.000 description 1
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 1
- 238000005096 rolling process Methods 0.000 description 1
- 238000005185 salting out Methods 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 210000003900 secondary neuron Anatomy 0.000 description 1
- 238000005204 segregation Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 230000008054 signal transmission Effects 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 210000001082 somatic cell Anatomy 0.000 description 1
- 230000020347 spindle assembly Effects 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 210000000130 stem cell Anatomy 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 108091035539 telomere Proteins 0.000 description 1
- 210000003411 telomere Anatomy 0.000 description 1
- 102000055501 telomere Human genes 0.000 description 1
- 229940126585 therapeutic drug Drugs 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 238000011830 transgenic mouse model Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- 108010029384 tryptophyl-histidine Proteins 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 210000004291 uterus Anatomy 0.000 description 1
- 210000000707 wrist Anatomy 0.000 description 1
- 239000011701 zinc Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P21/00—Drugs for disorders of the muscular or neuromuscular system
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P25/00—Drugs for disorders of the nervous system
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
- C07K14/4701—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
- C07K14/4702—Regulators; Modulating activity
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
- G01N33/6893—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids related to diseases not provided for elsewhere
- G01N33/6896—Neurological disorders, e.g. Alzheimer's disease
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2217/00—Genetically modified animals
- A01K2217/05—Animals comprising random inserted nucleic acids (transgenic)
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2217/00—Genetically modified animals
- A01K2217/07—Animals genetically altered by homologous recombination
- A01K2217/075—Animals genetically altered by homologous recombination inducing loss of function, i.e. knock out
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/156—Polymorphic or mutational markers
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/158—Expression markers
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2800/00—Detection or diagnosis of diseases
- G01N2800/28—Neurological disorders
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Medicinal Chemistry (AREA)
- Neurology (AREA)
- Analytical Chemistry (AREA)
- Genetics & Genomics (AREA)
- Immunology (AREA)
- Zoology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Neurosurgery (AREA)
- Veterinary Medicine (AREA)
- Urology & Nephrology (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- Pharmacology & Pharmacy (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- General Chemical & Material Sciences (AREA)
- Pathology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Physics & Mathematics (AREA)
- Biotechnology (AREA)
- Hematology (AREA)
- Microbiology (AREA)
- Wood Science & Technology (AREA)
- General Engineering & Computer Science (AREA)
- Cell Biology (AREA)
- Toxicology (AREA)
- Gastroenterology & Hepatology (AREA)
- Food Science & Technology (AREA)
- General Physics & Mathematics (AREA)
- Orthopedic Medicine & Surgery (AREA)
Abstract
Candidate gene causing Amyotrophic Lateral Sclerosis type 2 (ALS2), encoding a putative GTP exchange factor (GEF) or GTPase regulatory factor.
Description
Related Applications This application claims priority from United States application no. 60/267,723 filed S February 12, 2001; Japanese application no. 2001-116973 filed April 16, 2001; and, United States application no. 60/318,352 filed September 12, 2001, which applications are hereby incorporated by reference.
Field of the Invention This invention relates to genetic causes of amyotrophic lateral sclerosis of type 2 ("ALS2").
Backeround of the Invention Amyotrophic lateral sclerosis ("ALS") is a progressive neurodegenerative disease in which distal and proximal motor neurons are selectively degenerated'. Its cause is ambiguous and its onset is mostly at middle age and thereafter. Its rate of onset is about 2-6 per 100,000 persons and begins with lowering of muscular strength and myoatrophy of wrist muscle as a secondary neuron hindrance resulting in bulbar paralysis symptoms such as atrophy of muscle of limbs, atrophy of tongue, alalia, dysphagia and dyspnea.
No therapeutic method has been established yet and most of the afflicted die within five years from onset.
Juvenile amyotrophic lateral sclerosis of type 2 ("ALS2"; OMIM2151002) is a somatically recessive type hereditary disease. Although the frequency of its onset is rare, muscular convulsion of limbs, face and throat is gradually expressed in persons of teens or twenties and becomes chronic by bulbar paralysis as described above.
Amyotrophic lateral sclerosis of type 2 has been mapped to the 1.7 cM interval flanked by D2SI16 and D2S2237 on human chromosome 2q333'4. Alterations in 391 exons and their flanking regions derived from 43 non-overlapping transcripts have been noted within this intervals'6.
ALS is a very severe disease and there is a need for development of means for its early detection or diagnosis and for treatment.
SUBSTITUTE SHEET (RULE 26) Summary of the Invention We have now identified a gene associated with amyotrophic lateral sclerosis type 2, termed the ALS2 or ALS2CR6 gene. This gene is expressed in various human tissues including neurons in the brain and spinal cord, and encodes a protein with homology to RanGED and RhoGEF.
This invention now provides mammalian ALS2 genes and mutant versions thereof as well as peptides (including proteins) encoded by such genes. Also included are fragments and nucleic acids derived from these genes, corresponding peptides, and oligonucleotides suitable for use as amplification primers and/or probes. Antibodies to the peptides of this invention are also provided.
This invention also provides methods of diagnosis of ALS2 which may include identifying in a patient at risk, an altered ALS2 gene or protein. The patient may be tested to characterize one or more mutations in the gene or protein produced. Such a mutation may comprise the A261de1 mutation or the AGI548de1 mutations described herein.
This invention also provides nucleic acids which correspond to a region of the gene, which nucleic acids typically hybridize to at least about 6, at least about 10, at least about 15, at least about 20, or at least about 25 consecutive nucleotides of an ALS2 sequence as described herein, or to complements of such sequences, or to naturally occurring mutants or allelic variants thereof. The probes or primers may be chosen to be capable of distinguishing (such as by amplification or hybridization) allelic variants, including the A261 del and AGI548de1 mutations described herein. Such probes or primers may fiuther include a label which is capable of being detected. This invention also provides kits for identifying ALS2 genes, including those comprising alleles associated with an ALS2 disease state, wherein the kits may comprise a probe or primer as described herein.
The kit may further comprise instructions for using the probes or primers to distinguish alleles as described herein.
This invention also provides vectors containing nucleic acids of this invention, including vectors adapted for expression of such nucleic acids in a target cell or organism.
Such vectors may comprise appropriate transcription regulatory elements for directing transcription of the nucleic acids in a target cell or organism. Nucleic acids and peptides of this invention may be expressed in bacterial as well as eukaryotic cells, including SUBSTITUTE SHEET (RULE 26) mammalian cells. Such vectors may be adapted to express nucleic acids of this invention in a reverse direction so as to generate anti-sense transcription products.
This invention also provides non-human mammals comprising a genome in which an ALS2 gene has been mutated, including by deletion. Such a mammal may be a mouse and S methods for altering the murine genome such as to produce an ALS2 "knock-out" mouse, are described herein and are known in the art.
This invention also provides the use of nucleic acids and peptides as disclosed herein for the preparation of medicaments for treatment of ALS2 or in the treatment of ALS2.
This invention also provides methods of treating patients for ALS2, which methods may comprise testing the patient to diagnose or characterize an ALS2 disease state. A
patient may be treated for ALS2, for example by administering to the patient or by otherwise providing a native form or functional fragment or derivative of the ALS2 peptide described herein or such other therapeutic agent as which will restore function of the protein in a patient. Also included in this invention are vectors suitable for use in gene therapy and gene therapy methodologies whereby a patient is treated to restore the function of ALS2 by delivering or producing a functional gene for expression in the patient. Gene therapy vectors may, for example, be adeno-associated vector, such as those known in the art.
General methods for gene therapy are also known in the art.
This invention includes a human ALS2 gene which is present in human second chromosome q33 region and may code for a GTPase regulatory factor. The gene may encode an amino acid sequence of SEQ ID N0:2. cDNA synthesized from mRNA that may be transcribed by this gene has a base sequence of SEQ ID NO:1.
This invention includes a human ALS2 mutated gene which is related to amyotrophic lateral sclerosis of type 2 and codes for a modified protein having an amino acid sequence of SEQ ID NO: 3 or SEQ ID N0:84, by a deficiency of one or two bases of the above human ALS2 gene.
This invention includes nucleic acids purified from genomic DNA, mRNA or cDNA
as well as synthesized nucleic acids.
This invention includes oligonucleotides which hybridize to ALS2 genes and variants thereof, preferably under.stringent conditions.
SUBSTITUTE SHEET (RULE 26) This invention includes kits comprising oligonucleotides or oligonucleotide primer sets which may be used to carry out amplification of ALS2 encoded nucleic acids, for example by the polymerase chain reaction (PCR).
This invention includes oligonucleotide probes which hybridize to the regions containing base deficient sites (A261de1 and AGI548de1J in ALS2 under stringent conditions.
This invention includes oligonucleotide primer sets which carry out a PCR
amplification of the region containing a base deficient site in ALS2 as described herein. A
specific example of this primer set is a pair of synthetic oligonucleotides comprising the base sequences of SEQ ID NO: 6 and NO: 7 or a pair of synthetic oligonucleotides comprising the base sequences of SEQ ID NO: 8 and NO: 9.
This invention includes recombinant vectors comprising the above nucleic acids and cells transcribed by said recombinant vectors.
This invention includes a GTPase regulatory factor or a GEF which is characterized in being an expression product of an ALS2 gene as described herein.
Embodiments of such GTPase regulatory factors of GEF's are recombinant proteins produced by the transformed cells transformed according to this invention..
This invention includes a peptide comprising an amino acid sequence having continuous 5 or more acid amino residues in the first to the 46th amino acid sequence in SEQ ID NO: 2 and also a peptide comprising an amino acid sequence having continuous 5 or more acid amino residues in the 47th to the 1657th amino acid sequence in SEQ ID NO:
2. These peptides may be used for production of antibodies.
This invention also provides a modified protein which may be an expression product of a mutant human ALS2 gene and which comprises the amino acid sequence of SEQ
ID
NO: 3. An embodiment of this modified protein is a recombinant protein produced by a transformed cell.
This invention includes an antibody which recognizes peptides (including proteins) as disclosed herein. Embodiments of this antibody are an antibody which is prepared using a peptide according to this invention as an antigen, including a peptide comprising an amino acid sequence having continuous 5 or more acid amino residues in the first to the 46th amino acid sequence in SEQ ID NO: 2 and also an antibody which is prepared using a peptide SUBSTITUTE SHEET (RULE 26) comprising an amino acid sequence having continuous S or more acid amino residues in the 47th to the 1657th amino acid sequence in SEQ ID NO: 2 as an antigen.
This invention furthermore provides methods for the diagnosis of amyotrophic lateral sclerosis of type 2 which is characterized in detecting ALS2 mutated genes. An 5 embodiment of this method for the diagnosis it that genomic DNA of the cells of a person to be diagnosed is subjected to a PCR amplification using a primer set comprising a pair of synthetic oligonucleotides comprising the base sequences of SEQ ID NO: 6 and NO: 7 or a pair of synthetic oligonucleotides comprising the base sequences of SEQ ID NO:
8 and NO:
9, the resulting DNA fragments are treated with a restriction enzyme NarI and the said person where each of the DNA fragments is divided into two fragments is judged to be suffering from amyotrophic lateral sclerosis of type 2.
This invention also provides a method for the diagnosis of amyotrophic lateral sclerosis of type 2 which is characterized in that the transcribed product of an ALS2 gene or mutated gene is detected. In an embodiment of this diagnostic method, the transcribed product is cDNA or mRNA of the gene of an ALS2 mutated gene or the modified protein expressed by the said mutated gene. An embodiment of the case of detection of the modified protein is a method for the detection of the protein where the antibody recognizing the first to the 46th amino acid sequences in SEQ ID NO: 2 reacts but the antibody recognizing the 47th to the 1657th amino acid sequence region in SEQ ID NO: 2 does not react.
Further, this invention provides a mouse ALS2 gene which may have an amino acid sequence of SEQ ID NO:S as well as nucleic acids derived therefrom including nucleic acids synthesized or purified from genomic DNA, mRNA or cDNA of the mouse gene or a complementary sequence thereof.
This invention also provides a gene-defective non-human mammal such as a rodent, preferably a mouse, where function of an ALS2 gene is substantially deficient.
Also provided are tissues of such a mouse.
The human ALS2 gene according to this invention is a genomic gene which has 33 introns and 34 exons, exists in a genomic DNA of 80.3 kb adjacent to a polymorphic DNA
marker D2S2309 in human second chromosome q 33 region (refer to Fig. 1) and codes for a SUBSTITUTE SHEET (RULE 26) human GTPase regulatory factor having an amino acid sequence of SEQ ID N0:2.
In this ALS2 gene, its cDNA has a base sequence of SEQ ID NO: 1.
This invention provides an isolated nucleic acid that codes for a peptide having at least about 75, 80, 85, 90, 95, 97 or 100% identity to all of an amino acid sequence selected from the group consisting of SEQ ID N0:2; SEQ ID N0:3; SEQ ID N0:5; SEQ ID
N0:84; and, amino acids 372-1657 of SEQ ID N0:2. Also provided are the peptides encoded by these nucleic acids.
This invention also provides an isolated nucleic acid consisting essentially of a nucleotide sequence having at least about 75, 80, 85, 90, 95, 97 or 100%
identity to all of a nucleotide sequence or a complementary sequence thereof, selected from the group consisting of SEQ ID NO:1; SEQ ID N0:4; nucleotides 124-5094 of SEQ ID NO:1;
nucleotides 1225-5094 of SEQ ID NO:1; and, nucleotides 124-5076 of SEQ ID
N0:4. Also provided are the peptides encoded by these nucleic acids.
The nucleic acids of this invention may be joined to a second nucleic acid not naturally associated with the nucleic acid of this invention. By not naturally associated, it is meant that the second nucleic acid is not part of an ALS2 gene and is not directly joined to an ALS2 gene in the genome of a mammal.
This invention also provides an oligonucleotide of 6 to 75 nucleotides, wherein the oligonucleotide hybridizes to a nucleic acid of this invention or a complementary sequence thereof, under stringent conditions. An oligonucleotide of this invention may be joined to a label, which is any moiety suitable for detectable labelling of the nucleic acid or for binding of the nucleic acid to a non-nucleic acid moiety.
This invention also provides a peptide consisting essentially of a sequence of at least 5 contiguous amino acids from a sequence selected from the group consisting of amino acids 1-46 of SEQ ID N0:2; amino acids 47-1657 of SEQ ID N0:2; SEQ ID N0:3;
amino acids 43-49 of SEQ ID N0:3; SEQ ID N0:84; and amino acids 476 to 545 of SEQ ID
N0:84. These peptides are useful, for example in raising antibodies of this invention and for investigating the function of the ALS2 protein.
This invention also provides a non-human mammal comprising a mutated gene, wherein the gene but for the mutation would encode a protein having at least about 75, 80, 85, 90, 95, 97 or 100% sequence identity to all of SEQ ID N0:2 or SEQ ID N0:5.
SUBSTITUTE SHEET (RULE 26) This invention also provides a method for the diagnosis of amyotrophic lateral sclerosis type 2 in a patient, comprising detecting the presence of a mutation in a gene that encodes a protein having at least about 75, 80, 85, 90, 95, 97 or 100%
sequence identity to SEQ ID N0:2 in a patient or a biological sample from a patient.
This invention also provides a method for the diagnosis of amyotrophic lateral sclerosis type 2, comprising detecting the presence or absence of a protein having at least about 75, 80, 85, 90, 95, 97 or 100% sequence identity to all of SEQ ID N0:2 in a patient or a biological sample from a patient.
This invention also provides a method for the diagnosis of amyotrophic lateral sclerosis type 2, comprising detecting the presence or absence of a protein having at least about 75, 80, 85, 90, 95, 97 or 100% sequence identity to all of SEQ ID N0:3 or SEQ ID
N0:84 in a patient or a biological sample from a patient.
In the diagnostic methods of this invention, sequences may be compared to determine the presence of mutations; oligonucleotides may be used to detect hybridization to nucleic acids of the patient; amplification of nucleic acids of the patient may be performed;
proteins of the patient may be contacted with antibodies of this invention; or proteins produced in the patient may be evaluated for the function of ALS2 protein.
This invention also provides a method of treatment of amyotrophic lateral sclerosis type 2, comprising administering a peptide, a nucleic acid, or a pharmaceutical composition comprising the peptide or nucleic acid to a patient in need thereof, wherein the peptide comprises an amino acid sequence having at least about 75, 80, 85, 90, 95, 97 or 100%
identity to SEQ ID N0:2 or a fragment thereof, and wherein the nucleic acid codes for said peptide.
This invention also provides a method of treatment of amyotrophic lateral sclerosis type 2, comprising administering a composition to a patient in need thereof, wherein the composition mimics the biological activity of the peptide of SEQ ID NO. 2 or a fragment thereof.
This invention also provides the use of a peptide or a nucleic acid for preparation of a medicament for treatment of amyotrophic lateral sclerosis type 2, wherein the peptide comprises an amino acid sequence having at least about 75, 80, 85, 90, 95, 97 or 100%
identity to SEQ ID N0:2 or a fragment thereof, and the nucleic acid codes for said peptide.
SUBSTITUTE SHEET (RULE 26) In this specification the term "isolated" with reference to a nucleic acid or peptide means that a nucleic acid is separate from the genome of a cell, a peptide is separate from a cell but does not mean that the subject matter has been obtained from a genome or a cell. In some instances, nucleic acids and peptides of this invention may be synthesized using conventional techniques.
Two nucleic acid or protein sequences are considered substantially identical if, when optimally aligned, they share at least about 70% sequence identity. In alternative embodiments, sequence identity may for example be at least 75%, at least 90%
or at least 95%. Optimal alignment of sequences for comparisons of identity may be conducted using a variety of algorithms, such as the local homology algorithm of Smith and Waterman,1981, Adv. Appl. Math 2: 482, the homology alignment algorithm of Needleman and Wunsch, 1970, J. Mol. Biol. 48:443, the search for similarity method of Pearson and Lipman, 1988, Proc. Natl. Acad. Sci. USA 85: 2444, and the computerised implementations of these algorithms (such as GAP, BESTFIT, FASTA and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, Madison, WI, U.S.A.). Sequence alignment may also be carned out using the BLAST algorithm, described in Altschul et al., 1990, J.
Mol. Biol. 215:403-10 (using the published default settings).
Nucleic acid sequences of the invention may in some embodiments be substantially identical, such as substantially identical gene targeting substrates and target sequences. The substantial identity of such sequences may be reflected in percentage of identity when optimally aligned that may for example be greater than 50%, 80% to 100%, at least 80%, at least 90% or at least 95%, which in the case of gene targeting substrates may refer to the identity of a portion of the gene targeting substrate with a portion of the target sequence, wherein the degree of identity may facilitate homologous pairing and recombination and/or repair. An alternative indication that two nucleic acid sequences are substantially identical is that the two sequences hybridize to each other under moderately stringent, or preferably stringent, conditions. Hybridization to filter-bound sequences under moderately stringent conditions may, for example, be performed in 0.5 M NaHP04, 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65°C, and washing in 0.2 x SSC/0.1% SDS at 42°C (see Ausubel, et al. (eds), 1989, Current Protocols in Molecular Biology, Vol. 1, Green Publishing Associates, Inc., and John Wiley & Sons, Inc., New York, at p. 2.10.3).
Alternatively, SUBSTITUTE SHEET (RULE 26) hybridization to filter-bound sequences under stringent conditions may, for example, be performed in 0.5 M NaHP04, 7% SDS, 1 mM EDTA at 65°C, and washing in 0.1 x SSC/0.1% SDS at 68°C (see Ausubel, et al. (eds), 1989, supra).
Hybridization conditions may be modified in accordance with known methods depending on the sequence of interest (see Tijssen, 1993, Laboratory Technigues in Biochemistry and Molecular Biology --Hybridization with Nucleic Acid Probes, Part I, Chapter 2 "Overview of principles of hybridization and the strategy of nucleic acid probe assays", Elsevier, New York).
Generally, stringent conditions are selected to be about 5°C lower than the thermal melting point for the specific sequence at a defined ionic strength and pH.
It is well known in the art that some modifications and changes can be made in the structure of a polypeptide without substantially altering the biological function of that peptide, to obtain a biologically equivalent polypeptide. In one aspect of the invention, LPL
S447X therapeutics may include peptides that differ from a portion of the wild-type LPL
sequence by conservative amino acid substitutions. As used herein, the term "conserved amino acid substitutions" refers to the substitution of one amino acid for another at a given location in the peptide, where the substitution can be made without loss of function. In making such changes, substitutions of like amino acid residues can be made, for example, on the basis of relative similarity of side-chain substituents, for example, their size, charge, hydrophobicity, hydrophilicity, and the like, and such substitutions may be assayed for their effect on the function of the peptide by routine testing.
In some embodiments, conserved amino acid substitutions may be made where an amino acid residue is substituted for another having a similar hydrophilicity value (e.g., within a value of plus or minus 2.0), where the following hydrophilicity values are assigned to amino acid residues (as detailed in United States Patent No. 4,554,101, incorporated herein by reference): Arg (+3.0); Lys (+3.0); Asp (+3.0); Glu (+3.0); Ser (+0.3); Asn (+0.2);
Gln (+p.2); Gly (0); Pro (-0.5); Thr (-0.4); Ala (-0.5); His (-0.5); Cys (-1.0); Met (-1.3); Val (-1.5); Leu (-1.8); Ile (-1.8); Tyr (-2.3); Phe (-2.5); and Trp (-3.4).
In alternative embodiments, conserved amino acid substitutions may be made where an amino acid residue is substituted for another having a similar hydropathic index (e.g., within a value of plus or minus 2.0). In such embodiments, each amino acid residue may be SUBSTITUTE SHEET (RULE 26) assigned a hydropathic index on the basis of its hydrophobicity and charge characteristics, as follows: Ile (+4.5); Val (+4.2); Leu (+3.8); Phe (+2.8); Cys (+2.5); Met (+1.9); Ala (+1.8);
Gly (-0.4); Thr (-0.7); Ser (-0.8); Trp (-0.9); Tyr (-1.3); Pro (-1.6); His (-3.2); Glu (-3.5); Gln (-3.5); Asp (-3.5); Asn (-3.5); Lys (-3.9); and Arg (-4.5).
In alternative embodiments, conserved amino acid substitutions may be made where an amino acid residue is substituted for another in the same class, where the amino acids are divided into non-polar, acidic, basic and neutral classes, as follows: non-polar: Ala, Val, Leu, Ile, Phe, Trp, Pro, Met; acidic: Asp, Glu; basic: Lys, Arg, His; neutral:
Gly, Ser, Thr, 10 Cys, Asn, Gln, Tyr.
Brief DescriRtion of the Drawings Figure 1 is a transcription map of 3Mb region of human chromosome 2q33 including an ALS2 candidate region. The white open rectangle is between D2S116 to D2S2237z'3.
Positions of 7 STS markers, 12 polymorphic DNA markers and 42 independent transcription units are shown. Polarity of 38 transcription units are shown by arrows. The location of the ALS2 gene is designated "ALS2CR6" which term may be used interchangeably for below.
Figure 2 shows a process for the detection of ALS2 associated mutations. "a"
is an example of the Tunisian and Kuwaiti ALS2 families. Genotypes of the members constituting a family is shown based on previously reported results3'4. "b"
shows the result of sequence determination of mutation (A261 del) in genomic DNA of the Tunisian ALS2 family. Patient 10797 is A261 del of a homozygotic type and the carrier 10784 is a heterozygotic type. The sequence determination was carried out for PCR
products. "c"
shows the results of determination of mutation (AGI548de1) in the genomic DNA
in the Kuwaiti ALS2 family. Sequence of the reverse strand of exon 5 in the region of interest are shown. Individual 18279 is a normal sibling, who is unaffected by ALS2 and carries two normal haplotypes. The box in this sequence indicates the position of the bases deleted in affected members. Individual 18281 is an unaffected parent .who carnes one disease haplotype. The overlapping normal and mutated sequences are shown. Individual 18275 is SUBSTITUTE SHEET (RULE 26) affected and the figure shows a homozygous CT deletion in the reverse strand of exon 5:
The position of the deleted bases is indicated by the arrow. The corresponding forward sequence and coded normal amino acids and novel amino acids produced by frameshifting are indicated. "d" shows segregation of the A261de1 mutation in the Tunisian ALS2 family.
The presence of the deletion was assayed by the digestion with NarI, which only cuts mutated gene product. For exon-PCR products, the 339 by fragment representing the normal allele was cleaved into two fragments (225 by and 113 bp) in the mutant allele. For RT-PCR product, the 302 by product which represents the normal allele was cleaved into two fragments (195 by and 106 bp) in the mutant allele.
Figure 3 shows northern blot analysis of the ALS2 (ALS2CR6) mRNA. In "a", a northern blot containing 2 ~g of poly A+ mRNA of many adult human tissues is hybridized with exon 4 of ALS2 cDNA. In the lower drawings, the same blot is hybridized with human ~3-actin cDNA for confirmation of the property and the comparative load of RNA. In the left, size of the ALS2 transcript is shown. In "b", northern blot containing 10 pg of total RNA obtained normal whole brain and 20 ~g of total RNA obtained from lymphocytes of patients and healthy persons (10788 persons) was hybridized to exon 4 of the ALS2 cDNA.
The right panel shows an agarose gel electrophoresis of an RNA sample.
Figure 4 is a comparison of amino acid sequences in human ALS2CR6 and mouse homolog mALS2CR6. The same residues are shown by frames. There are shown the position of the additional three amino acid residues of the Tunisian mutant protein (starting from the 47th amino acid residue), the position of the 25 amino acid residues (starting from the 372nd residue) of a short variant part of the ALS2 gene and the position of the additional 70 amino acid residues of the Kuwaiti mutant protein (starting from the 476th residue).
Figure S shows an expression of ALS2 mRNA in brain and spinal cord of adult mouse. "a" is an arrow-like whole image of an RNA/RNA in situ hybridization using an antisense ALS2 riboprobe while "b" is a control image using a sense strand probe.
Significant expression was noted in neurons of hippocampus and dentate gyrus (c and g), Purkinje cells of cerebellum (d and h), neurons of cerebral cortex (e and i) and cinerea of spinal cord including anterior horn cells (f and j). A scale bar shows a length of 10 pm.
Figure 6 is a result of an amino acid sequence analysis. "a" is a schematic chart of domains and motifs in normal and mutated ALS2 protein. RCCl is a regulatory factor for SUBSTITUTE SHEET (RULE 26) chromosome condensation, DH is a homologous domain to Dbl, PH is a pleckstrin-homologous domain, MORN is membrane structure and recognition nexus and VPS9 is a vacuole protein for discrimination of 9 domains. "b" is comparison of amino acid sequences of RCCl repeat-containing regions for human ALS2 (hALS2CR6), mouse ALS2 S (mALS2CR6), human (h) RCC I , human (h) RPGR and mouse (m) RPGR. The amino acid residues shown by open frames are the same. Conserved amino acid residues are abundantly contained as well. Positions of the seven blades corresponding to RCC 1 are shown according to the literature3o.
Figure 7 is a chart that compares the wild type human, mouse, and short human variant of the ALS2 proteins and the coding products of the A261de1 (Tunisian) and AGI548de1 (Kuwaiti) mutations.
Detailed Description of the Invention The locus of a 1.7 cM region specified by microsatellite markers D2S116 and D2S2237 of a human second chromosome q 33 region has been mapped3'4. The inventors previously prepared a physical map on the basis of YACBAC/PAC of 3Mb genomic region covering the candidate region in Figure 15'6. Sequences of cDNA clones and EST's have now been analyzed and 42 non-duplicated transcription units including 10 new genes mapped. 411 pairs in of primers were designed depending upon genomic DNA of 14 persons of a family of ALS2 (Fig. 2a) and 6 normal control persons having no kinship with the former was amplified by PCR. Seventy-seven base sequence polymorphs of introns or exons were identified by determining the sequence for all of the PCR products.
Among them, a gene having base deletions related to onset of ALS2 was identified.
The ALS2 gene also includes restriction regions and regulatory regions (promoter/enhancer, suppressor, etc.) which function in expression of protein which is coded thereby. Such restriction and regulation regions are useful for clarifying the functions of the ALS2 gene product as a GEF or a GTPase regulatory factor.
This ALS2 gene may, for example, by isolated by screening a human genome library using pure polynucleotide or oligonucleotide comprising a base sequence of SEQ ID
NO: 1 or a partial sequence thereof as a probe. The resulting genomic gene may be amplified by commonly used genetic amplifying methods such as, for example, a PCR
SUBSTITUTE SHEET (RULE 26) (polymerase chain reaction) method, an NASBN (nucleic acid sequence based amplification) method, a TMA (transcription-mediated amplification) method or an SDA
(strand displacement amplification) method.
A pure polynucleotide (DNA fragments and RNA fragments) may also be prepared from this ALS2 genomic gene, mRNA transcribed by this gene or cDNA synthesized from mRNA. For example, cDNA may be synthesized using poly(A) + RNA extracted from human cells as a template. The human cells may be either those excised from human body by operation, etc. or incubated cells. cDNA may be synthesized by known methods (Mol.
Cell Biol., 2, 161-170, 1982; J. Gene, 25, 263-269, 1983; Gene, 150, 243-250, 1994). One may also synthesize cDNA by an RT-PCR method using an oligonucleotide as a primer and mRNA isolated from human cells as a template. Specifically, the cDNA prepared as such has a base sequence of SEQ ID NO: 1. Those polynucleotides may be used for recombinant expression of a human GTPase regulatory factor.
The oligonucleotides of this invention are DNA fragments or RNA fragments which 1 S hybridize to the above-mentioned ALS2 or the above-mentioned nucleic acids under stringent conditions. For example it is a continuous DNA fragment of 10-100 by in the base sequence of SEQ ID NO: 1. Here, stringent conditions means a condition whereby a specific hybrid formation of target with a probe is made possible by salt concentration, concentration of organic solvent (such as formamide), or temperature condition during hybridization and washing steps. Methods are described in U. S. Patent No.
6,100,037.
One methodology for creating stringent hybridization conditions is [insert B &
K) A primer set of this invention is typically a pair of oligonucleotides for amplification of ALS2 gene or related nucleic acids. Such a primer set may be designed on the basis of the base sequence of SEQ ID NO: 1, synthesized and subjected to purification using known methods. Size (base numbers) of the primer preferably is 15-40 bases or more preferably, 15-30 bases which specificity anneal with a template DNA. However, when LA
(long accurate) PCR is carried out, it is effective to use primers in excess of 30 bases. A pair (two) primers comprising sense strand (5'-terminal side) and antisense strand (3'-terminal side) should not be complementary. In addition, a self complementary sequence is to be avoided in a primer to prevent the formation of a hairpin structure. Further, in order to ensure a stable bond to a template DNA, the GC content should be about SO% and occurrence of GC-SUBSTITUTE SHEET (RULE 26) rich or AT-rich regions in a primer should be avoided. Since an annealing temperature is dependent upon Tm (melting temperature), primers having Tm of 55-65°C
are chosen so as to prepare a PCR product having a high specificity. The final concentration of the primer used in PCR should be about 0.1 to about 1 ~M. It is possible to use commercially available software for designing a primer including the OligoTM software [manufactured by National Bioscience Inc. (U. S. A.)] and GenetyxT"" software [manufactured by Software Development KK (Japan)}.
Mutated ALS2 genes may be obtained by a method where a DNA library prepared from cells of a patient thought to be suffering from ALS2 is screened with a probe which hybridizes to a region containing mutant (e.g. a base deficient site) under a stringent condition. Pure polynucleotide (DNA fragment or RNA fragment) may be obtained from genomic DNA, mRNA or cDNA of an ALS2 mutated gene or a complementary sequence thereof. For example, an ALS2 mutated gene comprises a nucleic acid where the 261 st base a of SEQ ID NO: 1 is deficient. Such a polynucleotide may be used for recombinant 1 S production of ALS2 modified protein or for diagnosis of ALS2.
A primer set for a PCR amplification of ALS2, including various regions having base deficient sites in mutated ALS2 is (for example) a pair of synthetic oligonucleotides comprising base sequences of SEQ ID NO: 6 and NO: 7. This primer set is capable of a PCR amplification of the region (339 bp) including exon 3 and introns before and after that in the ALS2 gene. Another primer set may be composed of synthetic oligonucleotides comprising base sequences of SEQ ID NO: 8 and NO: 9 and is capable of PCR
amplification of exons 2-4 (302 bp) of the ALS2 gene using RNA as a template.
Any PCR
product not cleaved by the restriction enzyme NarI is derived from the normal ALS2 gene but PCR products derived from a mutated ALS2 gene may be cleaved by NarI to give two fragments (Fig. 2c).
A recombinant vector of this invention may be a cloning vector or an expression vector. Vectors will be constructed depending upon the type of the polynucleotide as an insert or upon the object for use. For example, when an ALS2 protein or a modified protein thereof is produced using cDNA or its ORF region as an insert, there may be used an expression vector for an in vitro transcription or an expression suitable for each of prokaryotic cells such as Escherichia coli and Bacillus subtilis and eukaryotic cells such as SUBSTITUTE SHEET (RULE 26) yeast, insect cells and mammalian cells. When a genomic DNA of the ALS2 gene or a mutated gene thereof is used as an insert, it is also possible to use a BAC
(bacterial artificial chromosome) vector or a cosmid vector. Such recombinant vectors are also useful, for example, as probes for diagnosis of chromosome abnormality by hybridization including 5 fluorescent in situ hybridization (FISH). Further, a nucleic acid derived from a normal ALS2 gene may be recombined in a virus vector such as adenovirus or the like and the product may be used for genetic therapy.
In the manufacture of ALS2 peptide (including protein), a transformed cell of this invention may be a prokaryotic cell such as Escherichia coli and Bacillus subtilis or an 10 eukaryotic cell such as from yeast, insects, and mammals. In addition, cells (such as blood stem cells) derived from a patient suffering from ALS2 which are transformed by a virus vector of this invention in which a nucleic acid derived from a normal ALS2 gene is recombined, may be used for a genetic therapy of ALS2. Such transformed cells may be prepared by introducing a recombinant vector into cells by means of known methods such as 15 electroporation, calcium phosphate method, liposome method and DEAE dextran method.
A peptide of this invention may be an expression product of a normal ALS2 gene or an expression product of a mutated ALS2 gene. The normal gene product is a GTPase transcription factor or GEF having an amino acid sequence of SEQ ID NO: 2.
Peptides of this invention are useful as immunogens for the preparation of an antibody, as target molecules for the development of therapeutic agents for ALS2, etc. These peptides may be prepared by methods involving isolating peptides from the cells of healthy persons or patients suffering from ALS2. Methods of chemical synthesis on the basis of a desired amino acid sequence from SEQ ID N0:2 or SEQ ID N0:3, etc. and (preferably) by production and isolation or purification from the above-mentioned transformed cells. Such transformed cells are incubated and isolation and purification are carried out for the culture by, for example, means of treatment with a modifier such as urea or with a surface-active agent, ultrasonic wave treatment, enzymatic digestion, precipitation by salting out or by solvent, dialysis, centrifugal separation, ultrafiltration, gel filtration, SDS-PAGE, isoelectric electrophoresis, ion exchange chromatography, hydrophobic chromatography, affinity chromatography and reversed phase chromatography. Such proteins may include fused proteins with any other protein. For example, fused proteins with glutathione-S-transferase SUBSTITUTE SHEET (RULE 26) (GST) or green fluorescent protein (GFP) may be exemplified. In addition, the protein expressed in cells may be subjected to various kinds of modifications in the cells after being translated. Accordingly, modified proteins are also included in the coverage of the protein of this invention. Examples of the modification after translation as such are elimination of N-terminal methionine, N-terminal acetylation, addition of sugar chain, limited decomposition by intracellular protease, myristoylation, isoprenylation and phosphorylation.
An antibody of this invention is a polyclonal antibody or monoclonal antibody which recognizes a peptide of this invention. Examples include an antibody prepared using a peptide comprising an amino acid sequence of continuous 5 amino acid residues or more of the first to the 46th amino acid sequence in SEQ ID NO: 2 as an antigen and an antibody prepared using a peptide comprising an amino acid sequence of continuous 5 amino acid residues or more of the 47th to the 1657th amino acid sequence in SEQ ID NO: 2 as an antigen. When those two kinds of antibodies are used, it is possible to detect and differentiate normal and A261 del mutant proteins. The antibody of this invention includes all molecules which are able to bind to an epitope of an ALS2 protein or other peptide of this invention, and all of Fab, F(ab')2, Fv fragments, etc. thereof. Such an antibody can be obtained from serum after an animal is immunized using ALS2 derived protein or peptide as an antigen. Alternatively, the above expression vectors for eukaryotic cells may be introduced into muscle or skin of animals by injection or particle gun and then serum is collected therefrom. Examples of animals that may be used are mouse, rat, rabbit, goat, chicken, etc. When B cells collected from the spleen of an immunized animal are fused with myeloma cells to produce a hybridoma, it is possible to produce monoclonal antibodies.
The diagnostic method of this invention is one in which an ALS2 mutated gene or a transcription product of an ALS2 mutated gene is detected whereby the risk of onset of ALS2 may be estimated. Particularly amenable are persons of known ALS2 families although diagnosis is not limited thereto.
Genomic DNA of a person to be diagnosed may be subjected to a PCR
amplification using any of the above-mentioned primer sets or other oligonucleotides of this invention.
The resulting DNA fragment may be treated with one or more restriction enzymes such as NarI, and the person to be diagnosed where the DNA fragment is cleaved into fragments different from cleaving product produced from a person not suffering from ALS2 is SUBSTITUTE SHEET (RULE 26) indicative of a patient suffering from ALS2 or a person with some risk of ALS2 in view of the presence of a mutation in the ALS2 gene.
It is also possible to detect the ALS2 mutated genes by (for example) an allele specific oligonucleotide probe method, an oligonucleotide ligation assay method, a PCR
SSCP method, a PCR-CFLP method, a PCR-PHFA method, an invader method, an RCA
(rolling circle amplification) method and a primer oligo base extension method.
In detecting transcription products of ALS2 mutated genes, diagnosis may be carried out by determining the sequence of mRNA of the person to be diagnosed or cDNA
thereof.
It is also possible to carry out the diagnosis in such a manner that an ALS2 gene of a person to be diagnosed or cDNA thereof is recombined with an expression vector, transfected to cells and the expression product thereof measured.
Expression products of normal and mutant ALS2 genes may be assessed by measurement of molecular weight. For example, the frame shift caused by deletion of one base in normal ALS2 gene whereupon the modified protein is changed to a low-molecular protein (SEQ ID NO: 3) comprising the first to the 46th amino acid residues of SEQ ID NO:
2 and three amino acid residues (Pro-Ser-Glu) newly coded by the frame shift results in a product having a molecular weight easily comparable to naturally occurnng gene products of the normal ALS2 gene. Further, diagnosis may be also carned out by the above antibody provided by this invention in which the ALS2 modified protein reacts with an antibody recognizing (for example) the first to the 46th amino acid sequence in SEQ ID
NO: 2 or a region comprising amino acids 43-49 of SEQ ID N0:3, but does not react with an antibody recognizing the 47th to the 1657th amino acid sequence region in SEQ ID NO: 2.
Antibodies specific for amino acids 476 to 545 of SEQ ID N0:84 as compared to any of the amino acids of SEQ ID N0:2 could be similarly used for diagnosis of the AGI548de1.
Diagnosis using antibodies may, for example, be carried out with an ELIZA
method.
The-mouse ALS2 gene of this invention is a mouse genomic gene isolated as a homolog of the human ALS2 gene and which codes for a mouse ALS2 protein comprising an amino acid sequence of SEQ ID NO: 5. Its cDNA has a base sequence of SEQ ID
NO: 4.
This gene may be used for the preparation of a "knock-out" mouse.
Such a "knock-out" mouse can be prepared by known gene targeting methods (Science, 244: 1288-1292, 1989) or generally according to the following exarriple.
SUBSTITUTE SHEET (RULE 26) First, a DNA fragment of the mouse ALS2 gene including the initiation codon of the gene is modified whereupon a defective DNA fragment which deletes expression of the ALS2 gene is obtained. This defective DNA fragment is used for the preparation of a targeting vector for introduction of the modification into a mouse totipotent cell (ES cell) according to known methods (such as the method described in Science, 244: 1288-1292, 1989). For example, genomic DNA comprising the ALS2 gene is substituted or inserted with a resistant gene to a cytotoxin to prepare a recombinant plasmid DNA
possessing the defective gene having a sequence homologous to the genomic DNA of the ALS2 gene at both terminals (the targeting vector). It is also possible for the resistant gene to be connected to a sequence such as PGKl promoter and PGKl polyadenylation signal for controlling the expression. It is preferred that the genomic DNA site of the ALS2CR6 gene which is substituted with or inserted by resistant gene be a genomic DNA region containing an exon region containing an initiation codon.
There are no particular limitations on such target vectors except that it will have a sequence which is homologous to genomic DNA of the ALS2 gene and a resistance sequence or other sequence useful for cell sorting (such as diphtheria toxin A
gene and thymidine kinase gene of herpes virus). A promoter and enhancer may be appropriately combined and used. The targeting vector is then introduced into an ES
(embryonic stem) cell according to known methods (e.g. Nature, 292: 154-156, 1981). Such methods include electric pulse, a liposome and calcium phosphate. When recombination efficiency of the gene to be introduced is of concern, the electric pulse methods is preferred.
DNA in each of the ES cells into which gene is introduced is extracted and, by means of a southern blot analysis or a PCR assay, cells are selected in which a homologous gene recombination has taken place between the wild type ALS2 gene existing on the chromosome and the introduced defective ALS2 gene fragment resulting in placement of the defective gene fragment in the chromosome.
An ES cell having a defective gene prepared above may be injected into a blastocyst of a wild type animal and chimera-embryos obtained which are transplanted to the uterus of a preliminary parent. Resulting progeny are selected for the ALS2 defective gene and bred.
Selection may be carned out by checking the difference in the color of hair or by extraction of DNA from a part of the body (such as the tail end) followed by conducting a southern blot SUBSTITUTE SHEET (RULE 26) analysis, a PCR assay after extraction of DNA, etc. As to the offspring obtained by a crossbreeding of animal of a wild type with a chimera animal where the ALS2 defective gene is in the generative cells, a southern blot analysis, a PCR assay or the like may be carned out using the DNA extracted from a part of the body (such as the tail end) as a material to identify a heterozygote into which the ALS2 defective gene is introduced. A
heterozygote possessing the ALS2 defective gene which is stable in all generative cells and somatic cells may be bred to produce progeny in which the ALS2 gene is completely knocked-out".
An animal prepared as such may be used for analysis of function of ALS2 gene in onset of ALS2 and for screening of therapeutic drugs or development of therapeutic methods as an ALS2 model animal.
Methods and results of procedures carned out for cloning of the ALS2 gene and for functional analysis thereof are shown.
1. Methods 1-1. ALS2 family Sixteen cases including 8 individuals suffering from the disease obtained from a Tunisian consanginous ALS2 family (literature 2) were analyzed. The characteristic of ALS2 is a progressive convulsion of muscles of the limbs and the face accompanied by distal myoatrophy of the hand and the foot. Age of onset is between 3 and 10 years age (literature 2). According to biopsy of nerves and muscles and also to electromyography test, there was confirmed deletion of distal motor neuron (literature 2). When a gene type of the ' polymorphic DNA markers was analyzed together with clinical test data, ALS2 was clearly an autosomal recessive inheritance.
1-2. Transcription map Genome Data Base (GDB) (http://www.gdbwww.gdb.org) and UniGene (http://www.ncbi.nJm.nih.gov) of the Biotechnology Information Center (NCBI) which were open to the public for discriminating the sequence of transcribed DNA mapped within an objective region were retrieved. Sequence of genomic DNA. overlapped with the objective region of ALS2 was retrieved from the "nr" or "htgs" data base of GenBank and utilized as the object for the test when a BLAST retrieval to the dbEST data base is conducted. In order to isolate the transcript of a full length, there were carned out RT-PCR, 5'-RACE and cDNA
SUBSTITUTE SHEET (RULE 26) library screening. In addition, EST clone was purchased from Research Genetics and sequencing for DNA was carned out for measuring the insertion of the whole clone.
Sequence of double stranded DNA was determined by conducting a dideoxy sequencing using a BigDye Terminator Cycle Sequencing Kit (ABI) and an AB 1377DNA
sequences.
5 All sequences of EST data, PCR products and DNA obtained from cDNA clone were determined and an estimated independent transcription unit was established.
Then each unit was mapped on a physical map by a PCR method.
1-3. Identification of exon In order to determine the constitution of intron and exon of the transcription DNA, 10 genomic DNA sequence data open to the public in GenBank data base was compared with the sequence of cDNA using a Sequences Version 3.0 (Gene Codes Corporation) program according to the descriptions of BLAST (literature 28) and literatures (5 and 6).
1-4. PCR
Exon and intron/exon boundaries were subjected to a PCR amplification. ExTaq 15 polyrnerase (Takara) was used and a cycle of 95°C for 15 seconds, 60°C for 30 seconds and 72°C for 30 seconds was repeated for 35 times whereby about SO mg of genomic DNA were amplified by a PCR. In order to detect the deficient form of the transcription DNA, an RT-PCR was carried out. Total RNA from lymphocytes of four patients of a family of ALS2 and two carriers was isolated. Total RNA extracted from a healthy human brain was 20 purchased from Clontech. An RT-PCR was carried out using a Superscript pre-amplification system (Gibco-BRL) according to the protocol of the manufacturer. The oligonucleotide primer for such a PCR was designed using Primer 3.0 (http://www-genome.wi.mit.edu). Table 1 lists the primers used for amplification of ALS2 (ALS2CR6).
SUBSTITUTE SHEET (RULE 26) %
%
%
N
?C W
W W
a y ai 0 4 a a a a .-r M V1 l~ O~ .~ M V1 I~ 01 .-, M v'1 t~ O~ .-r M U1 l~ O~ .r M h I~ O~ .~ M
V1 l~ Ov .~ M V~ l~ O~ .-~ M
az ~~n~.~.-~NNNNNMMMMM~~~~W llNllWll~l1\plp~D~pvp[w[v[vI~[vpppp N ~
Ivl ,.1 . . . . . ~j . 1.1 . . . . . . . . 1 p1 1.1 ~~, a ~, ~, ~, ~, ~, 1 ~, ~, r, ~ ~, a ~, ~, ~, .., ~~, .~, ~ ~, ~, ~ 1 r~, r, r, m~, r, r, ~, a 1 1 1 R 1 1 1 1 1 U 1 1 1 1 1 H 1 ' 1 1 1 1 1 1 1 1 1 O O O~ 1 1 1 1 1 1 1 1 O U
U t U U O U O U O O O O O O U ~ O O U O O O O U t t H U U U U .U U U O U
O U t H O ~ O H O U H O H F F O p U O t p p U O U O p O t O p F t tHUU ~tHOUUO U U4FOtUt OUO,~~tOtWIHU~U
U F U U H ~ U ~ U t U H U ~ ~ O t OO t O O t U U 44 U
K U ~ ~ U ~ H t ~ U ~ O U t ~ ~ O U t ~ E ~ t ~ t t t O U ~ H t H
,- t U H U O U p t H U
U ~ U O U ~ U H H R O ~ O ~ O ~ U ~ t U U ~ ~ t U U O
D U R t ~ t O U ~ ~ ~ K H U ~ U O U O U ~ O t Et., .~ ~ ('~-~ OU
t t O U U U U 4 t t U t t U U H t H
Gi. t t ~ O O ~ ~ H ~ O ~ ~ ~ U ~ U ~ 4 ~ (~. ~ ~ t O U U O H
ai ~ U U t H .t 0 t H t H H t O U U U E O H U H O
U ~ U U ~ U t O ~ O O U U U O ~ O H t U O ~ Wt U ~ U ~ ~ ~ ~ ~ U
> E R N ~ U t U t ~ t O O O O O U O ~ t t U ~ ~ U O ~ H ~ O O
U R O U O t 4 ~ R O v O H O F O t t OU.U~~OEUEF~U~V~~UE~~E,D.,tD4~~OEt~~,'~U~~~~~UE
'W t ~ H O U ~G ~C O t O O U H U H ~ R t U ~c L~ ~ H U O H U ~ U ii E ~ O U O
O H
yf1 1f1 Vf 1I1 Iff ~!1 1I1 1f1 1!1 1f1 ~fl t!1 U1 1l1 If1 1f1 If1 111 1f1 1f1 off 1f1 Nnff 1f1 1f1 1f1 1f1 V1 1f1 1f1 1f1 1f1 U1 1f1 1f1 ~!1 ~R
z ON~~~o00NNNN~MMMMM~~~~ttO~v~lh~~~ ~~~~~h~o~00N0 W
.
rf 1 r1 wm1 O n 1'n ~ ~n n w1 1~1 O e~1 ,., ,., ,.~ 1., ,.1 w1 i1 In ~ e~ ~n r1 r1 n e~ - . wwn. 1.1 w1 w1 1 1 1 O 1 1 1 1 1 1 t t 1 1 1 1 1 1 1 U 1 1 O 1 1 t U 1 1 1 1 AI 1 1 1 1 1 O U O R U O U U~ U U O U U U U U U U N H U U 1 U U U O
U O U ~ D R eU~ ~ 4 E ~ ~ O U O E ~ ~ U ~ O ~ R U ~ ~ U R O V U O
U O U H't U 4 H O H.t ~ U U H t U O U O ~ O t ,." U ~ ~ N O ~ ~ ~ ~ F H ~ ~ R U ~ R ~ ~ ~ U ~ ~ U ~ ~ ~ 4 a) U t t U U U t O U U U t U t t t t t O ' it O ~ ~ ~ ~ U U O E E F U U E E O O' U ~ F ~ U ~ U 4 ~ ~ U U ~ O U F O
U U t H t U U t t O t t O t ~ t U U t U t ~ 4 O O
U ~ U t t t ~ p O U ~ ~ ~ O H O H U O U U U U U t O H O ~ ~ U ~ ,~ H U 4 F O U t H 4 O O K U U p 4 ~ ~ t U
U K R ~ ~ ~ U . O t ~ O O U t ~ t H .~ ~ ~ ~ p O ~ ~ ~ H t O U 4 H
t U E U F U U t U t t O H H O O K U O
4 ~ D ~ U 4 O 4 ~ U O O ~ N E U ~ O O ~ 4 R V ~ F F ~ ~ O p O O
O O t O F O U U H U U t H O O O t H ~ ~ t0 ~ ~ O
O ~ ~ ~ U '~"' p . U ~ ~ ~ U O U ~ ~ ~ ~ ~ < ~ O ~ U U ~ H ~ ~ ~ ~ N U ~ O O
O H U U U
1f1 1f1 If1 1f1 ~I1 1f1 1f1 Ifl vf1 1!1 ~f1 ~f1 ~f1 t11 V1 N ~f1 V1 1!1 1f1 1f1 1f1 1f1 N If1 II1 1!1 1I1 1f1 1f1 1f1 1f1 1f1 1f1 1f1 1f1 1f1 1f1 ~ 01 r1 01 1T U1 m O N PI P1 01 1~ m b O N O of O CD CD Of ~ O U1 O O 1f1 N O
V If1 O Il1 N N1 b 1~1 .D (3. ,.1 m Pf N 1~1 01 01 N d 1~ m b d N 00 b 01 If1 V 0f m 1~ r1 1f1 O 10 N
1f1 b I~ 0f m O t~ 'i 01 m O
O ~, Y1 PI f'1 V d b V P1 V 1~1 Pf V A1 V V Q N Q P1 N PI Pf ~ Q N N P1 d n N
1~1 N V 1~1 1~1 f~l V V
pr 'i N
1 1 w N d 1f1 ' 10 i~ m r1 N P1 d 1f1 r-1 N 1~1 V V O O O O O O r1 N P1 d 1f1 b r m pf O r1 N PI O O O O O 1f1 10 r 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 ., ', ., .-1 o d a o m o 0 0 0 0 exr o m m o axr o o m d m m a o m ~ o 0 0'o n o d d d o 0 0 0 0 0 0 0 o x x O O 'O O O r1 ~ O r1 ,-1 r1 r1 ,-1 ,-1 ,-1 'i ,-1 r1 ,H r1 'i r1 r1 r1 ,~ ~ N
N N N N~ N N N N N 0 V
f~ f~ ~ t~ h N r1 N r1 .i v-1 n.1 ,-1 e1 r1 ,~i ri r1 ,-1 r1 ,-1 ,1 r1 ,~i ,1 ,1 N N N N N N N N N N V V
I1 N Pf !1 N r1 e~1 '1 rl-1 r1 r1 '1 'r1 r1 r1 'i r1 n-1 r1 n-1 n-1 e1 e~l rl-1 v-1 n-1 'i r1 e-1 r1 n-1 r1 r1 e-1 N N
VJ Vl N fA N tp VJ 41 N V1 N N N tp VJ N UJ 4J V1 UJ tp V! N V! f!7 f7J 41 H
V1 N N N if! N N VJ H
.a .7 .7 J .7 .7 .7 J .7 .a J J .7 .7 .a .7 .7 ~7 '7 .7 .7 .7 J a .7 .7 .7 .7 .7 .7 .7 .7 .7 J .7 ~i ~7 t t t 4 4 4 t t 4 4 4 t Q t t t t t t nt 4 t t 4 4 K t 4 t t t t t t t 4 t 4 U U U 'U U U U U U U U U U U U U U U U U U U U U U U U U U U U U U U U U U U
o .r N In v In to r m o1 0 .-1 N r1 vm b r ao 01 0 .-1 N n a v1 b e~ m of o .1 N <n v k aHrlrlrlrlrlr-1e-irIrINNNNNNNNNN 1~1P1NI~P1 1~1 W
SUBSTITUTE SHEET (RULE 26) 1-5. Analysis of mutation In order to detect the mutation of the DNA sequence at the exon or the intron/exon boundary, a DNA sequence of a PCR product of exon was determined. The DNA
sequence of a PCR product of exon was analyzed using the same oligonucleotide as a primer. The S sequence in the data base open to the public was compared with the DNA
sequence obtained from patients, carriers and healthy persons and changes in the nucleotide were discriminated.
[0044]
It was also confirmed that a new NarI site was formed (A261 del) after the treatment with NarI by means of an RT-PCR amplification of exons 2-4 or a PCR
amplification of exon 3. As to the primers for exon 3-PCR, there were used 5' CCTAGTCATCCATGTGCTGG-3' (SEQ ID NO: 6) and 5' TCCCATACCTGACCTTCCAC-3' (SEQ ID NO: 7). As to the primers for the RT-PCR of exons 2-4, there were used 5'-CTTGATAGACTTTCTGTAAAGAAG-3' (SEQ ID NO: 8) and 5'-GGCTACTTGGACAAATCTCCACTG-3' (SEQ ID NO: 9). Decomposed product with NarI was separated by 1.5% agarose gel.
1-6. Northern blot analysis Northern (MTN) blot (Clontech) of many human adult tissues was hybridized with exon 4 labelled with 32P-dCTP of ALS2CR6 or human (3-actin cDNA in a Perfect Hyb hybridizing solution (Toyobo). The membrane was washed with 0.1 x SSC
containing 1%
of SDS and subjected to an X-ray film (Bio-MAX, Kodak).
1-7. mRNA in situ hybridization Antisense and sense cRNA probes were prepared from two mouse cDNA clones m2-as and m2-s. Those mouse cDNA clones covered a part of mouse mALS2CR6 cDNA
(from the 1732nd to the 2685th bases of SEQ ID NO: 4; 954 bp) and inserted into pCR2.1 (Invitrogen) in an opposite direction. The probes were prepared according to the protocol of the manufacturer (Roche Molecular Biochemicals) by an in vitro transcription reaction where digoxigenin-labelled UTP and T7 polymerase were mixed. Preparation of the sample and method for the in situ hybridization were in accordance with the literature (29).
1-8. Retrieval of the data base Each of DNA and amino acid sequences was compared with the data base of sequences of nucleotide and protein which were not overlapped each other using BLASTN
SUBSTITUTE SHEET (RULE 26) and BLASTP. Domain and motive of protein were identified by MOTIF servers of Genome Net Japan (http://www.~enome.ad.jp), search launcher of BCM
(http://www.hgse.hem.tmc.edu/Search.launcher) and CD search of NCBI
(http://www.hcbi.nlm.nih.~ov).
S 2. Results The inventors have prepared a physical map on the basis of YAC/BAC/PAC of genomic region of 3 Mb covering a complete candidate region to ALS2 (literatures 5 and 6).
Sequences of EST and cDNA clone were analyzed within a broad area and, at the same time, this physical map was used for the mapping of 43 independent transcription units including previously analyzed 18 genes (KIAA0005, CLKl, PP1L3, ORC2L, NDUFB3, CFLAR, CASP10, CASPB, FZD7, NOPS, UBL1, BMPR2, FLJ10881, LOC57404, AIP-1, CD28, CTLA4 and AILIM) and new 10 full-length transcription products (ALS2CR1, ALS2CR2, ALS2CR3, ALS2CR4, ALS2CR5/MPP4, ALS2CR6, ALS2CR7, ALS2CR8, ALS2CR9 and ALS2CR12). Those genetic sequences were present in the locus of ALS2 (Fig.
1 ).
[0045]
Juvenile ALS2 is rare and has a sign that, in teens and twenties, muscular convulsion of limbs, face and throat gradually expresses. Since ALS2 is recessively hereditary, it is predicted that this ALS2 disease may take place by a loss of a functional mutation. . Big deletion or translocation in the ALS2 locus was investigated by a mapping of STS/EST
content on the basis of a PCR and a southern blot analysis but that was not detected. After that, small deletion or base substitution in exon or intron-exon boundary was investigated.
In order to detect those mutations, each gene was analyzed and an intron/exon boundary thereof was determined. Until now, 395 exons have been identified from 42 genes. In order to amplify exon and flanking sequence thereof including consensus sequence to splicing donor and acceptor, 411 primers in total were designed and those primers were used to amplify the genomic DNA of 10 normal control persons who were not related to 14 persons of the ALS2 family (Fig. 2a) by PCR. Sequence of each of those PCR products was determined whereby 77 sequence polymorphs in total of intron or exon were identified.
Among those 77 polymorphs, 8 mutations contained in 4 different genes were related to ALS2 (Table 2).
SUBSTITUTE SHEET (RULE 26) Table 2 Gene Region Normal ALS2 NOPS intron 2 tatctc(T~9aattct NOPS intron 6 gttttg(TTG 2ttttta ~ (TTG)3 ALS2CR6 intron 2 ggtaaAtcattt ALS2CR6 exon 3 gcaggcAgccctc -~ A261 deletion*
ALS2CR8 intron 6 gtcagtAttataa ALS2CR9 exon 4 ctccagCatggac ~ T (3rd codon) ALS2CR9 intron 7 ttgggaTtttttt ALS2CR9 intron 8 aaaataCggatat .-~ T
Among those sequence mutations, one nucleotide deletion (A261 del) noted in exon 3 of ALS2CR6 broke the reading frame and it is suggested that such a mutation mutates the protein. All of the suspicious hetero-conjugative carriers show a duplicated sequence pattern starting from the first nucleotide after the deficient part (Fig. 2b). This deletion clearly moves together with an ALS2 expression type (Fig. 2c) and is not noted in 533 normal control individuals of various races (data not shown). In other mutations, one base substitution from C to T in exon 4 of ALS2CR9 gene is included (C873T).
However, this mutation corresponds to the third codon and, therefore, it does not change the amino acid residue. In order to detect a splicing error which is made latent or manifest by other sequence mutation, an RT-PCR was carned out using total RNA extracted from lymphocytes of patients and healthy control persons but no sequence mutation of mRNA
was detected (data not shown). Accordingly, the mutation related to ALS2 of intron or exon region does not cause a splicing error. From those results, it has been confirmed that deletion of one base in exon 3 of ALS2CR6 (A261 del; Table 1 ) is mutation concerning ALS2.
ALS2CR6 gene contains 33 introns and 34 exons and is present in a genomic DNA
of 80.3 kb adjacent to a polymorphic DNA marker D2S2309 (Fig. 1).
Transcription polarity of the ALS2CR6 gene is in the direction of central body from telomere. An transcription product (mRNA) has a full length of 6394 by (SEQ ID NO: 1) having a single open reading frame (ORF) with a length of 4974 nucleotides (124-5,097 nt) and codes for a SUBSTITUTE SHEET (RULE 26) protein of 184 KDa comprising 1,657 amino acid residues. Polyadenylated estimated signal (AATAAA: 6,375-6,380 nt) and poly(A) region are clear. A short ALS2CR6 transcription product in a full length of 2,651 by having 1,191 by ORF coding for a 396 amino acid sequence was identified as well. This short variant splices a 5'-donor site after exon 4 and, S as a result, a stop codon is formed after 25 amino acid residues of intron 4. Being correspondent to those results, 2 transcription products of about 6.5 kb and 2.6 kb were identified in many adult human tissues by a northern blot analysis (Fig. 3a).
Except the liver where short transcription products are mostly expressed, both transcription products showed the similar expression pattern. It has been confirmed that a big transcription product of 6.5 10 kb is expressed in a slightly higher level than a transcription product of 2.6 kb and is most abundantly expressed in the cerebellum. This gene has been also confirmed to be expressed in cells of ALS2 patients (Fig. 3b).
Further, a mouse homolog of ALS2CR6 was isolated and named mALS2CR6. A
transcription product of mALS2CR6 is in a full length of 6,349 by (SEQ ID NO:
4) having 15 one ORF of 4,956 by (124-5,076 nt) and codes for a protein of 183 kDa comprising 1651 amino acids (SEQ ID NO: 5). The ORF as a whole is well reserved in a DNA level (87%
same) and a protein level (91 % same; 94% similar; Fig. 4) between human being and mouse and it is suggested that ALS2CR6 gene is a gene which is well reserved in mammals.
In order to check the localization property of expression of mALS2CR6 transcription 20 product in the brain and the spinal cord of mouse, an in situ hybridization using riboprobe corresponding to a part of mALS2CR6 cDNA was carned out. The result was that, as shown in Fig. 5, the mALS2CR6 transcription products were expressed in various levels in nerve cells from the brain to the spinal cord, especially in neurons of hippocampal and dentate gyrus, cerebellar Purkinje cells, neurons of cerebral cortex and spinal cinerea 25 including anterior horn cells. In addition, a significant expression was noted in neurons of olfactory bulb, basal nucleus and cranial nerve nucleus as well.
Human ALS2CR6 protein showed many interesting properties (Fig. 6a). The first property is present in a region of A-terminal side of ALS2CR6 and it showed a high homology to RCC 1 (regulatory factor for concentrating the chromosome;
literature 7) and RPGR (GTPase for pigmentary retinitis; literature 8)(Fig. 8). RCC1 and RPGR
protein acts as a guanine nucleotide exchange factor (GEF) for GTPase like Ran. The second property is SUBSTITUTE SHEET (RULE 26) that ALS2CR6 has a Dbl-homologous (DH) domain and a pleckstrin-homologous (PH) domain and both domains are typical domains noted in RhoGEF protein (literatures 9 and 10). In addition, VPS9 domain is noted in a C-terminal region as well. VPS9 domain is noted in many GEF including Vps9 (literature 11 ) and Rabex-S (literature 12) and each is said to mediate the selection of vacuole protein and the phagocytic transportation. Two MORN motives comprising 14 amino acids (literature 13) were noted as well.
According to the recent study for junctophilin containing an MORN motive, this motive is shown to contribute in bonding of plasma membrane (literature 13). It has been known that GEF is related to a GDP bonding form of GTPase and promotes the dissociation of GDP
and bonding of GTP whereby GTPase is activated. Since it has been known (literatures 18 and 19) that GEF plays an important role in many signal transmission cascades (literature 14), neuron formation (literature 15), membrane transportation (literature 16) and formation of actin cell skeleton (literature 17), it is likely that ALS2CR6 acts as a regulatory factor/activator of Ran-related GTPase, regulates the formation of membrane, and acts in a (membrane) transportation of cells including neurons.
According to an RT-PCR analysis, a transcription product of mutated ALS2CR6 gene is transcribed from chromosomes of the patient (Fig. 2c) and produces a modified protein comprising 49 amino acids having three new residues (Pro-Ser-Glu) at C-terminal (Fig. 6a). Since this modified protein has no functional domain corresponding to ALS2CR6 protein, it seems to make the inherent function lost. Accordingly, the A261de1 mutation noted in this ALS2CR6 is related to the fact that ALS2 is recessively hereditary.
A recent finding that ALS is related to defect for the transportation of axon and the formation of cell skeleton (literatures 20, 21 and 22) induces a hypothesis that ALS2CR6 gene corresponds to ALS2 and that ALS2 is generated by the defect of membrane structure due to lacking in a regulatory function of membrane structure Ran-related GTPase.
ALS2CR6 gene is the second ALS gene succeeding to the determination of role of copper-zinc superoxide desmutase (SDS-1) in~ ALS. Mutation of SOD-1 is related to the form of tardive autosomal dominance (literature 23).
Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, it will be readily apparent to those of skill in the art in light of the teachings of this invention that changes and SUBSTITUTE SHEET (RULE 26) modification may be made thereto without departing from the spirit or scope of the appended claims. All patents, patent applications and publications referred to herein are hereby incorporated by reference.
S
References 1. Siddique, T., Nijhawan, D.& Hentati, A. Molecular genetic basis of familiar ALS.
Neurology 47, S27-S35 (1996).
2. Ben Hamida, M., Hentati, F. & Ben Hamida, C. Hereditary motor system diseases (chronic juvenile amyotrophic lateral sclerosis). Brain 113, 347-363 (1990).
3. Hentati, A. et al. Linkage of recessive familial amyotrophic lateral sclerosis to chromosome 2q33-q35. Nature Genet. 7, 425-428 (1994).
4. Hosler, B.A. et al. Refined mapping and characterization of the recessive familial amyotrophic lateral sclerosis locus (ALS2) on chromosome 2q33. Neuro genetics 2, 34-42 (1998).
5. Hadano, S. et al. A yeast artificial chromosome-based physical map of the juvenile amyotrophic lateral sclerosis (ALS2) critical region on human chromosome 2q33-q34.
Genomics 55, 106-112 (1999).
6. Hadano, S. et al. Cloning and characterization of three novel genes, ALS2CR1, ALS2CR2, and ALS2CR3, in the juvenile amyotrophic lateral sclerosis (ALS2) critical region at chromosome 2q33-q34: candidate genes for ALS2. Genomics 71, 200-213 (2001 ).
SUBSTITUTE SHEET (RULE 26) 7. Lerman-Sagie, T., Filiano, J., Smith, D.W. & Korson, M. Infantile onset of hereditary ascending spastic paralysis with bulbar involvement. J. Child Neurol. 11, 54-(1996).
8. Ohtsubo, M. et al. Isolation and characterization of the active cDNA of the human cell cycle gene (RCC1) involved in the regulation of onset of chromosome condensation.
Genes Dev. 1,585-593 (1987).
9. Meindl, A. et al. A gene (RPGR) with homology to the RCCI guanine nucleotide exchange factor is mutated in X-linked retinitis pigmentosa (RP3). Nature Genet. 13, 35-42 (1996).
10. Renault, L. et al. The 1.7 ~ crystal structure of the regulator of chromosome condensation (RCC1) reveals a seven-bladed propeller. Nature 392, 97-101 (1998).
11. Carazo-Salas, R.E. et al. Generation of GTP-bound Ran by RCC1 is required for chromatin mitotic spindle formation. Nature 400, 178-181 (1999).
12. Soisson, S.M., Nimnual, A.S., Uy, M., Bar-Sagi, D. & Kuriyan, J. Crystal structure of the Dbl and pleckstrin homology domains from the human Son of sevenless protein.
Cell 95, 259-268 (1998).
13. Hama, H., Tall, G. G. & Horazdovsky, B.F. Vps9p is a guanine nucleotide exchange factor involved in vesicle-mediated vacuolar protein transport. J. Biol. Chem.
274, 15284-15291 (1999).
14. Horiuchi, H. et al. A novel RabS GDP/GTP exchange factor complexed to Rabaptin-5 links nucleotide exchange to effector recruitment and function. Cell 90, 1149-(1997).
SUBSTITUTE SHEET (RULE 26) 15. Takeshima, H., Kamazaki, S., Nishi, M., Iino, M. & Kangawa, K.
Junctophilins: a novel family of functional membrane complex proteins. Mol. Cell 6, 11-22 (2000).
16. Barrett, K., Leptin, M. & Settleman, J. The Rho GTPase and a putative RhoGEF
mediate a signaling pathway for the cell shape changes in Drosophila gastrulation.
Cell 91, 905-915 (1997).
17. Martijn, F.B.G. et al. Identification of a novel, putative Rho-specific GDP/GTP
exchange factor and a RhoA-binding protein: control of neuronal morphology.
.I.
Cell. Biol. 137, 1603-1613 (1997).
18. Roepman, R. et al. The retinitis pigmentosa GTPase regulator (RPGR) interacts with novel transport-like proteins in the outer segments of rod photoreceptors.
Hum. Mol.
Genet. 9, 2095-2105 (2000).
19. Luo, L. Rho GTPases in neuronal morphogenesis. Nature Rev. Neurosci. 1, (2000).
20. Hall, A. Rho GTPases and the actin cytoskeleton. Science 279, 509-514 (1998).
21. Culbertson, M.R. RNA surveillance. Unforeseen consequences for gene expression, inherited genetic disorders and cancer. Trends Genet. 15, 74-80 (1999).
22. Collard, J.-F., Cote, F. & Julien, J.-P. Defective axonal transport in a transgenic mouse model of amyotrophic lateral sclerosis. Nature 375, 61-64 (1995).
23. Williamson, T.L. & Cleveland, D.W. Slowing of axonal transport is a very early event in the toxicity of ALS-linked SOD1 mutants to motor neurons. Nature Neurosci.
2, SO-56 (1999).
SUBSTITUTE SHEET (RULE 26) 24. Rosen, D.R. et al. Mutations in Cu/Zn superoxide dismutase gene are associated with familial amyotrophic lateral sclerosis. Nature 362, 59-62 (1993).
25. Chance, P.F. et al. Linkage of the gene for an autosomal dominant form of juvenile 5 amyotrophic lateral sclerosis to chromosome 9q34. Am. J. Hum. Genet. 62, 633-(1998).
26. Blair, LP. et al. A gene for autosomal dominant juvenile amyotrophic lateral sclerosis (ALS4) localizes to a 500-kb interval on chromosome 9q34. Neurogenetics 3, 1-6 10 (2000).
27. Hentati, A. et al. Linkage of a commoner form of recessive amyotrophic lateral sclerosis to chromosome 15q15-q22 markers. Neurogenetics 2, 55-60 (1998).
15 28. Hosler, B.A. et al. Linkage of familial amyotrophic lateral sclerosis with frontotemporal dementia to chromosome 9q21-q22. JAMA 284, 1664-1669 (2000).
29. Altschul, S.F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389-3402 (1997).
Field of the Invention This invention relates to genetic causes of amyotrophic lateral sclerosis of type 2 ("ALS2").
Backeround of the Invention Amyotrophic lateral sclerosis ("ALS") is a progressive neurodegenerative disease in which distal and proximal motor neurons are selectively degenerated'. Its cause is ambiguous and its onset is mostly at middle age and thereafter. Its rate of onset is about 2-6 per 100,000 persons and begins with lowering of muscular strength and myoatrophy of wrist muscle as a secondary neuron hindrance resulting in bulbar paralysis symptoms such as atrophy of muscle of limbs, atrophy of tongue, alalia, dysphagia and dyspnea.
No therapeutic method has been established yet and most of the afflicted die within five years from onset.
Juvenile amyotrophic lateral sclerosis of type 2 ("ALS2"; OMIM2151002) is a somatically recessive type hereditary disease. Although the frequency of its onset is rare, muscular convulsion of limbs, face and throat is gradually expressed in persons of teens or twenties and becomes chronic by bulbar paralysis as described above.
Amyotrophic lateral sclerosis of type 2 has been mapped to the 1.7 cM interval flanked by D2SI16 and D2S2237 on human chromosome 2q333'4. Alterations in 391 exons and their flanking regions derived from 43 non-overlapping transcripts have been noted within this intervals'6.
ALS is a very severe disease and there is a need for development of means for its early detection or diagnosis and for treatment.
SUBSTITUTE SHEET (RULE 26) Summary of the Invention We have now identified a gene associated with amyotrophic lateral sclerosis type 2, termed the ALS2 or ALS2CR6 gene. This gene is expressed in various human tissues including neurons in the brain and spinal cord, and encodes a protein with homology to RanGED and RhoGEF.
This invention now provides mammalian ALS2 genes and mutant versions thereof as well as peptides (including proteins) encoded by such genes. Also included are fragments and nucleic acids derived from these genes, corresponding peptides, and oligonucleotides suitable for use as amplification primers and/or probes. Antibodies to the peptides of this invention are also provided.
This invention also provides methods of diagnosis of ALS2 which may include identifying in a patient at risk, an altered ALS2 gene or protein. The patient may be tested to characterize one or more mutations in the gene or protein produced. Such a mutation may comprise the A261de1 mutation or the AGI548de1 mutations described herein.
This invention also provides nucleic acids which correspond to a region of the gene, which nucleic acids typically hybridize to at least about 6, at least about 10, at least about 15, at least about 20, or at least about 25 consecutive nucleotides of an ALS2 sequence as described herein, or to complements of such sequences, or to naturally occurring mutants or allelic variants thereof. The probes or primers may be chosen to be capable of distinguishing (such as by amplification or hybridization) allelic variants, including the A261 del and AGI548de1 mutations described herein. Such probes or primers may fiuther include a label which is capable of being detected. This invention also provides kits for identifying ALS2 genes, including those comprising alleles associated with an ALS2 disease state, wherein the kits may comprise a probe or primer as described herein.
The kit may further comprise instructions for using the probes or primers to distinguish alleles as described herein.
This invention also provides vectors containing nucleic acids of this invention, including vectors adapted for expression of such nucleic acids in a target cell or organism.
Such vectors may comprise appropriate transcription regulatory elements for directing transcription of the nucleic acids in a target cell or organism. Nucleic acids and peptides of this invention may be expressed in bacterial as well as eukaryotic cells, including SUBSTITUTE SHEET (RULE 26) mammalian cells. Such vectors may be adapted to express nucleic acids of this invention in a reverse direction so as to generate anti-sense transcription products.
This invention also provides non-human mammals comprising a genome in which an ALS2 gene has been mutated, including by deletion. Such a mammal may be a mouse and S methods for altering the murine genome such as to produce an ALS2 "knock-out" mouse, are described herein and are known in the art.
This invention also provides the use of nucleic acids and peptides as disclosed herein for the preparation of medicaments for treatment of ALS2 or in the treatment of ALS2.
This invention also provides methods of treating patients for ALS2, which methods may comprise testing the patient to diagnose or characterize an ALS2 disease state. A
patient may be treated for ALS2, for example by administering to the patient or by otherwise providing a native form or functional fragment or derivative of the ALS2 peptide described herein or such other therapeutic agent as which will restore function of the protein in a patient. Also included in this invention are vectors suitable for use in gene therapy and gene therapy methodologies whereby a patient is treated to restore the function of ALS2 by delivering or producing a functional gene for expression in the patient. Gene therapy vectors may, for example, be adeno-associated vector, such as those known in the art.
General methods for gene therapy are also known in the art.
This invention includes a human ALS2 gene which is present in human second chromosome q33 region and may code for a GTPase regulatory factor. The gene may encode an amino acid sequence of SEQ ID N0:2. cDNA synthesized from mRNA that may be transcribed by this gene has a base sequence of SEQ ID NO:1.
This invention includes a human ALS2 mutated gene which is related to amyotrophic lateral sclerosis of type 2 and codes for a modified protein having an amino acid sequence of SEQ ID NO: 3 or SEQ ID N0:84, by a deficiency of one or two bases of the above human ALS2 gene.
This invention includes nucleic acids purified from genomic DNA, mRNA or cDNA
as well as synthesized nucleic acids.
This invention includes oligonucleotides which hybridize to ALS2 genes and variants thereof, preferably under.stringent conditions.
SUBSTITUTE SHEET (RULE 26) This invention includes kits comprising oligonucleotides or oligonucleotide primer sets which may be used to carry out amplification of ALS2 encoded nucleic acids, for example by the polymerase chain reaction (PCR).
This invention includes oligonucleotide probes which hybridize to the regions containing base deficient sites (A261de1 and AGI548de1J in ALS2 under stringent conditions.
This invention includes oligonucleotide primer sets which carry out a PCR
amplification of the region containing a base deficient site in ALS2 as described herein. A
specific example of this primer set is a pair of synthetic oligonucleotides comprising the base sequences of SEQ ID NO: 6 and NO: 7 or a pair of synthetic oligonucleotides comprising the base sequences of SEQ ID NO: 8 and NO: 9.
This invention includes recombinant vectors comprising the above nucleic acids and cells transcribed by said recombinant vectors.
This invention includes a GTPase regulatory factor or a GEF which is characterized in being an expression product of an ALS2 gene as described herein.
Embodiments of such GTPase regulatory factors of GEF's are recombinant proteins produced by the transformed cells transformed according to this invention..
This invention includes a peptide comprising an amino acid sequence having continuous 5 or more acid amino residues in the first to the 46th amino acid sequence in SEQ ID NO: 2 and also a peptide comprising an amino acid sequence having continuous 5 or more acid amino residues in the 47th to the 1657th amino acid sequence in SEQ ID NO:
2. These peptides may be used for production of antibodies.
This invention also provides a modified protein which may be an expression product of a mutant human ALS2 gene and which comprises the amino acid sequence of SEQ
ID
NO: 3. An embodiment of this modified protein is a recombinant protein produced by a transformed cell.
This invention includes an antibody which recognizes peptides (including proteins) as disclosed herein. Embodiments of this antibody are an antibody which is prepared using a peptide according to this invention as an antigen, including a peptide comprising an amino acid sequence having continuous 5 or more acid amino residues in the first to the 46th amino acid sequence in SEQ ID NO: 2 and also an antibody which is prepared using a peptide SUBSTITUTE SHEET (RULE 26) comprising an amino acid sequence having continuous S or more acid amino residues in the 47th to the 1657th amino acid sequence in SEQ ID NO: 2 as an antigen.
This invention furthermore provides methods for the diagnosis of amyotrophic lateral sclerosis of type 2 which is characterized in detecting ALS2 mutated genes. An 5 embodiment of this method for the diagnosis it that genomic DNA of the cells of a person to be diagnosed is subjected to a PCR amplification using a primer set comprising a pair of synthetic oligonucleotides comprising the base sequences of SEQ ID NO: 6 and NO: 7 or a pair of synthetic oligonucleotides comprising the base sequences of SEQ ID NO:
8 and NO:
9, the resulting DNA fragments are treated with a restriction enzyme NarI and the said person where each of the DNA fragments is divided into two fragments is judged to be suffering from amyotrophic lateral sclerosis of type 2.
This invention also provides a method for the diagnosis of amyotrophic lateral sclerosis of type 2 which is characterized in that the transcribed product of an ALS2 gene or mutated gene is detected. In an embodiment of this diagnostic method, the transcribed product is cDNA or mRNA of the gene of an ALS2 mutated gene or the modified protein expressed by the said mutated gene. An embodiment of the case of detection of the modified protein is a method for the detection of the protein where the antibody recognizing the first to the 46th amino acid sequences in SEQ ID NO: 2 reacts but the antibody recognizing the 47th to the 1657th amino acid sequence region in SEQ ID NO: 2 does not react.
Further, this invention provides a mouse ALS2 gene which may have an amino acid sequence of SEQ ID NO:S as well as nucleic acids derived therefrom including nucleic acids synthesized or purified from genomic DNA, mRNA or cDNA of the mouse gene or a complementary sequence thereof.
This invention also provides a gene-defective non-human mammal such as a rodent, preferably a mouse, where function of an ALS2 gene is substantially deficient.
Also provided are tissues of such a mouse.
The human ALS2 gene according to this invention is a genomic gene which has 33 introns and 34 exons, exists in a genomic DNA of 80.3 kb adjacent to a polymorphic DNA
marker D2S2309 in human second chromosome q 33 region (refer to Fig. 1) and codes for a SUBSTITUTE SHEET (RULE 26) human GTPase regulatory factor having an amino acid sequence of SEQ ID N0:2.
In this ALS2 gene, its cDNA has a base sequence of SEQ ID NO: 1.
This invention provides an isolated nucleic acid that codes for a peptide having at least about 75, 80, 85, 90, 95, 97 or 100% identity to all of an amino acid sequence selected from the group consisting of SEQ ID N0:2; SEQ ID N0:3; SEQ ID N0:5; SEQ ID
N0:84; and, amino acids 372-1657 of SEQ ID N0:2. Also provided are the peptides encoded by these nucleic acids.
This invention also provides an isolated nucleic acid consisting essentially of a nucleotide sequence having at least about 75, 80, 85, 90, 95, 97 or 100%
identity to all of a nucleotide sequence or a complementary sequence thereof, selected from the group consisting of SEQ ID NO:1; SEQ ID N0:4; nucleotides 124-5094 of SEQ ID NO:1;
nucleotides 1225-5094 of SEQ ID NO:1; and, nucleotides 124-5076 of SEQ ID
N0:4. Also provided are the peptides encoded by these nucleic acids.
The nucleic acids of this invention may be joined to a second nucleic acid not naturally associated with the nucleic acid of this invention. By not naturally associated, it is meant that the second nucleic acid is not part of an ALS2 gene and is not directly joined to an ALS2 gene in the genome of a mammal.
This invention also provides an oligonucleotide of 6 to 75 nucleotides, wherein the oligonucleotide hybridizes to a nucleic acid of this invention or a complementary sequence thereof, under stringent conditions. An oligonucleotide of this invention may be joined to a label, which is any moiety suitable for detectable labelling of the nucleic acid or for binding of the nucleic acid to a non-nucleic acid moiety.
This invention also provides a peptide consisting essentially of a sequence of at least 5 contiguous amino acids from a sequence selected from the group consisting of amino acids 1-46 of SEQ ID N0:2; amino acids 47-1657 of SEQ ID N0:2; SEQ ID N0:3;
amino acids 43-49 of SEQ ID N0:3; SEQ ID N0:84; and amino acids 476 to 545 of SEQ ID
N0:84. These peptides are useful, for example in raising antibodies of this invention and for investigating the function of the ALS2 protein.
This invention also provides a non-human mammal comprising a mutated gene, wherein the gene but for the mutation would encode a protein having at least about 75, 80, 85, 90, 95, 97 or 100% sequence identity to all of SEQ ID N0:2 or SEQ ID N0:5.
SUBSTITUTE SHEET (RULE 26) This invention also provides a method for the diagnosis of amyotrophic lateral sclerosis type 2 in a patient, comprising detecting the presence of a mutation in a gene that encodes a protein having at least about 75, 80, 85, 90, 95, 97 or 100%
sequence identity to SEQ ID N0:2 in a patient or a biological sample from a patient.
This invention also provides a method for the diagnosis of amyotrophic lateral sclerosis type 2, comprising detecting the presence or absence of a protein having at least about 75, 80, 85, 90, 95, 97 or 100% sequence identity to all of SEQ ID N0:2 in a patient or a biological sample from a patient.
This invention also provides a method for the diagnosis of amyotrophic lateral sclerosis type 2, comprising detecting the presence or absence of a protein having at least about 75, 80, 85, 90, 95, 97 or 100% sequence identity to all of SEQ ID N0:3 or SEQ ID
N0:84 in a patient or a biological sample from a patient.
In the diagnostic methods of this invention, sequences may be compared to determine the presence of mutations; oligonucleotides may be used to detect hybridization to nucleic acids of the patient; amplification of nucleic acids of the patient may be performed;
proteins of the patient may be contacted with antibodies of this invention; or proteins produced in the patient may be evaluated for the function of ALS2 protein.
This invention also provides a method of treatment of amyotrophic lateral sclerosis type 2, comprising administering a peptide, a nucleic acid, or a pharmaceutical composition comprising the peptide or nucleic acid to a patient in need thereof, wherein the peptide comprises an amino acid sequence having at least about 75, 80, 85, 90, 95, 97 or 100%
identity to SEQ ID N0:2 or a fragment thereof, and wherein the nucleic acid codes for said peptide.
This invention also provides a method of treatment of amyotrophic lateral sclerosis type 2, comprising administering a composition to a patient in need thereof, wherein the composition mimics the biological activity of the peptide of SEQ ID NO. 2 or a fragment thereof.
This invention also provides the use of a peptide or a nucleic acid for preparation of a medicament for treatment of amyotrophic lateral sclerosis type 2, wherein the peptide comprises an amino acid sequence having at least about 75, 80, 85, 90, 95, 97 or 100%
identity to SEQ ID N0:2 or a fragment thereof, and the nucleic acid codes for said peptide.
SUBSTITUTE SHEET (RULE 26) In this specification the term "isolated" with reference to a nucleic acid or peptide means that a nucleic acid is separate from the genome of a cell, a peptide is separate from a cell but does not mean that the subject matter has been obtained from a genome or a cell. In some instances, nucleic acids and peptides of this invention may be synthesized using conventional techniques.
Two nucleic acid or protein sequences are considered substantially identical if, when optimally aligned, they share at least about 70% sequence identity. In alternative embodiments, sequence identity may for example be at least 75%, at least 90%
or at least 95%. Optimal alignment of sequences for comparisons of identity may be conducted using a variety of algorithms, such as the local homology algorithm of Smith and Waterman,1981, Adv. Appl. Math 2: 482, the homology alignment algorithm of Needleman and Wunsch, 1970, J. Mol. Biol. 48:443, the search for similarity method of Pearson and Lipman, 1988, Proc. Natl. Acad. Sci. USA 85: 2444, and the computerised implementations of these algorithms (such as GAP, BESTFIT, FASTA and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, Madison, WI, U.S.A.). Sequence alignment may also be carned out using the BLAST algorithm, described in Altschul et al., 1990, J.
Mol. Biol. 215:403-10 (using the published default settings).
Nucleic acid sequences of the invention may in some embodiments be substantially identical, such as substantially identical gene targeting substrates and target sequences. The substantial identity of such sequences may be reflected in percentage of identity when optimally aligned that may for example be greater than 50%, 80% to 100%, at least 80%, at least 90% or at least 95%, which in the case of gene targeting substrates may refer to the identity of a portion of the gene targeting substrate with a portion of the target sequence, wherein the degree of identity may facilitate homologous pairing and recombination and/or repair. An alternative indication that two nucleic acid sequences are substantially identical is that the two sequences hybridize to each other under moderately stringent, or preferably stringent, conditions. Hybridization to filter-bound sequences under moderately stringent conditions may, for example, be performed in 0.5 M NaHP04, 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65°C, and washing in 0.2 x SSC/0.1% SDS at 42°C (see Ausubel, et al. (eds), 1989, Current Protocols in Molecular Biology, Vol. 1, Green Publishing Associates, Inc., and John Wiley & Sons, Inc., New York, at p. 2.10.3).
Alternatively, SUBSTITUTE SHEET (RULE 26) hybridization to filter-bound sequences under stringent conditions may, for example, be performed in 0.5 M NaHP04, 7% SDS, 1 mM EDTA at 65°C, and washing in 0.1 x SSC/0.1% SDS at 68°C (see Ausubel, et al. (eds), 1989, supra).
Hybridization conditions may be modified in accordance with known methods depending on the sequence of interest (see Tijssen, 1993, Laboratory Technigues in Biochemistry and Molecular Biology --Hybridization with Nucleic Acid Probes, Part I, Chapter 2 "Overview of principles of hybridization and the strategy of nucleic acid probe assays", Elsevier, New York).
Generally, stringent conditions are selected to be about 5°C lower than the thermal melting point for the specific sequence at a defined ionic strength and pH.
It is well known in the art that some modifications and changes can be made in the structure of a polypeptide without substantially altering the biological function of that peptide, to obtain a biologically equivalent polypeptide. In one aspect of the invention, LPL
S447X therapeutics may include peptides that differ from a portion of the wild-type LPL
sequence by conservative amino acid substitutions. As used herein, the term "conserved amino acid substitutions" refers to the substitution of one amino acid for another at a given location in the peptide, where the substitution can be made without loss of function. In making such changes, substitutions of like amino acid residues can be made, for example, on the basis of relative similarity of side-chain substituents, for example, their size, charge, hydrophobicity, hydrophilicity, and the like, and such substitutions may be assayed for their effect on the function of the peptide by routine testing.
In some embodiments, conserved amino acid substitutions may be made where an amino acid residue is substituted for another having a similar hydrophilicity value (e.g., within a value of plus or minus 2.0), where the following hydrophilicity values are assigned to amino acid residues (as detailed in United States Patent No. 4,554,101, incorporated herein by reference): Arg (+3.0); Lys (+3.0); Asp (+3.0); Glu (+3.0); Ser (+0.3); Asn (+0.2);
Gln (+p.2); Gly (0); Pro (-0.5); Thr (-0.4); Ala (-0.5); His (-0.5); Cys (-1.0); Met (-1.3); Val (-1.5); Leu (-1.8); Ile (-1.8); Tyr (-2.3); Phe (-2.5); and Trp (-3.4).
In alternative embodiments, conserved amino acid substitutions may be made where an amino acid residue is substituted for another having a similar hydropathic index (e.g., within a value of plus or minus 2.0). In such embodiments, each amino acid residue may be SUBSTITUTE SHEET (RULE 26) assigned a hydropathic index on the basis of its hydrophobicity and charge characteristics, as follows: Ile (+4.5); Val (+4.2); Leu (+3.8); Phe (+2.8); Cys (+2.5); Met (+1.9); Ala (+1.8);
Gly (-0.4); Thr (-0.7); Ser (-0.8); Trp (-0.9); Tyr (-1.3); Pro (-1.6); His (-3.2); Glu (-3.5); Gln (-3.5); Asp (-3.5); Asn (-3.5); Lys (-3.9); and Arg (-4.5).
In alternative embodiments, conserved amino acid substitutions may be made where an amino acid residue is substituted for another in the same class, where the amino acids are divided into non-polar, acidic, basic and neutral classes, as follows: non-polar: Ala, Val, Leu, Ile, Phe, Trp, Pro, Met; acidic: Asp, Glu; basic: Lys, Arg, His; neutral:
Gly, Ser, Thr, 10 Cys, Asn, Gln, Tyr.
Brief DescriRtion of the Drawings Figure 1 is a transcription map of 3Mb region of human chromosome 2q33 including an ALS2 candidate region. The white open rectangle is between D2S116 to D2S2237z'3.
Positions of 7 STS markers, 12 polymorphic DNA markers and 42 independent transcription units are shown. Polarity of 38 transcription units are shown by arrows. The location of the ALS2 gene is designated "ALS2CR6" which term may be used interchangeably for below.
Figure 2 shows a process for the detection of ALS2 associated mutations. "a"
is an example of the Tunisian and Kuwaiti ALS2 families. Genotypes of the members constituting a family is shown based on previously reported results3'4. "b"
shows the result of sequence determination of mutation (A261 del) in genomic DNA of the Tunisian ALS2 family. Patient 10797 is A261 del of a homozygotic type and the carrier 10784 is a heterozygotic type. The sequence determination was carried out for PCR
products. "c"
shows the results of determination of mutation (AGI548de1) in the genomic DNA
in the Kuwaiti ALS2 family. Sequence of the reverse strand of exon 5 in the region of interest are shown. Individual 18279 is a normal sibling, who is unaffected by ALS2 and carries two normal haplotypes. The box in this sequence indicates the position of the bases deleted in affected members. Individual 18281 is an unaffected parent .who carnes one disease haplotype. The overlapping normal and mutated sequences are shown. Individual 18275 is SUBSTITUTE SHEET (RULE 26) affected and the figure shows a homozygous CT deletion in the reverse strand of exon 5:
The position of the deleted bases is indicated by the arrow. The corresponding forward sequence and coded normal amino acids and novel amino acids produced by frameshifting are indicated. "d" shows segregation of the A261de1 mutation in the Tunisian ALS2 family.
The presence of the deletion was assayed by the digestion with NarI, which only cuts mutated gene product. For exon-PCR products, the 339 by fragment representing the normal allele was cleaved into two fragments (225 by and 113 bp) in the mutant allele. For RT-PCR product, the 302 by product which represents the normal allele was cleaved into two fragments (195 by and 106 bp) in the mutant allele.
Figure 3 shows northern blot analysis of the ALS2 (ALS2CR6) mRNA. In "a", a northern blot containing 2 ~g of poly A+ mRNA of many adult human tissues is hybridized with exon 4 of ALS2 cDNA. In the lower drawings, the same blot is hybridized with human ~3-actin cDNA for confirmation of the property and the comparative load of RNA. In the left, size of the ALS2 transcript is shown. In "b", northern blot containing 10 pg of total RNA obtained normal whole brain and 20 ~g of total RNA obtained from lymphocytes of patients and healthy persons (10788 persons) was hybridized to exon 4 of the ALS2 cDNA.
The right panel shows an agarose gel electrophoresis of an RNA sample.
Figure 4 is a comparison of amino acid sequences in human ALS2CR6 and mouse homolog mALS2CR6. The same residues are shown by frames. There are shown the position of the additional three amino acid residues of the Tunisian mutant protein (starting from the 47th amino acid residue), the position of the 25 amino acid residues (starting from the 372nd residue) of a short variant part of the ALS2 gene and the position of the additional 70 amino acid residues of the Kuwaiti mutant protein (starting from the 476th residue).
Figure S shows an expression of ALS2 mRNA in brain and spinal cord of adult mouse. "a" is an arrow-like whole image of an RNA/RNA in situ hybridization using an antisense ALS2 riboprobe while "b" is a control image using a sense strand probe.
Significant expression was noted in neurons of hippocampus and dentate gyrus (c and g), Purkinje cells of cerebellum (d and h), neurons of cerebral cortex (e and i) and cinerea of spinal cord including anterior horn cells (f and j). A scale bar shows a length of 10 pm.
Figure 6 is a result of an amino acid sequence analysis. "a" is a schematic chart of domains and motifs in normal and mutated ALS2 protein. RCCl is a regulatory factor for SUBSTITUTE SHEET (RULE 26) chromosome condensation, DH is a homologous domain to Dbl, PH is a pleckstrin-homologous domain, MORN is membrane structure and recognition nexus and VPS9 is a vacuole protein for discrimination of 9 domains. "b" is comparison of amino acid sequences of RCCl repeat-containing regions for human ALS2 (hALS2CR6), mouse ALS2 S (mALS2CR6), human (h) RCC I , human (h) RPGR and mouse (m) RPGR. The amino acid residues shown by open frames are the same. Conserved amino acid residues are abundantly contained as well. Positions of the seven blades corresponding to RCC 1 are shown according to the literature3o.
Figure 7 is a chart that compares the wild type human, mouse, and short human variant of the ALS2 proteins and the coding products of the A261de1 (Tunisian) and AGI548de1 (Kuwaiti) mutations.
Detailed Description of the Invention The locus of a 1.7 cM region specified by microsatellite markers D2S116 and D2S2237 of a human second chromosome q 33 region has been mapped3'4. The inventors previously prepared a physical map on the basis of YACBAC/PAC of 3Mb genomic region covering the candidate region in Figure 15'6. Sequences of cDNA clones and EST's have now been analyzed and 42 non-duplicated transcription units including 10 new genes mapped. 411 pairs in of primers were designed depending upon genomic DNA of 14 persons of a family of ALS2 (Fig. 2a) and 6 normal control persons having no kinship with the former was amplified by PCR. Seventy-seven base sequence polymorphs of introns or exons were identified by determining the sequence for all of the PCR products.
Among them, a gene having base deletions related to onset of ALS2 was identified.
The ALS2 gene also includes restriction regions and regulatory regions (promoter/enhancer, suppressor, etc.) which function in expression of protein which is coded thereby. Such restriction and regulation regions are useful for clarifying the functions of the ALS2 gene product as a GEF or a GTPase regulatory factor.
This ALS2 gene may, for example, by isolated by screening a human genome library using pure polynucleotide or oligonucleotide comprising a base sequence of SEQ ID
NO: 1 or a partial sequence thereof as a probe. The resulting genomic gene may be amplified by commonly used genetic amplifying methods such as, for example, a PCR
SUBSTITUTE SHEET (RULE 26) (polymerase chain reaction) method, an NASBN (nucleic acid sequence based amplification) method, a TMA (transcription-mediated amplification) method or an SDA
(strand displacement amplification) method.
A pure polynucleotide (DNA fragments and RNA fragments) may also be prepared from this ALS2 genomic gene, mRNA transcribed by this gene or cDNA synthesized from mRNA. For example, cDNA may be synthesized using poly(A) + RNA extracted from human cells as a template. The human cells may be either those excised from human body by operation, etc. or incubated cells. cDNA may be synthesized by known methods (Mol.
Cell Biol., 2, 161-170, 1982; J. Gene, 25, 263-269, 1983; Gene, 150, 243-250, 1994). One may also synthesize cDNA by an RT-PCR method using an oligonucleotide as a primer and mRNA isolated from human cells as a template. Specifically, the cDNA prepared as such has a base sequence of SEQ ID NO: 1. Those polynucleotides may be used for recombinant expression of a human GTPase regulatory factor.
The oligonucleotides of this invention are DNA fragments or RNA fragments which 1 S hybridize to the above-mentioned ALS2 or the above-mentioned nucleic acids under stringent conditions. For example it is a continuous DNA fragment of 10-100 by in the base sequence of SEQ ID NO: 1. Here, stringent conditions means a condition whereby a specific hybrid formation of target with a probe is made possible by salt concentration, concentration of organic solvent (such as formamide), or temperature condition during hybridization and washing steps. Methods are described in U. S. Patent No.
6,100,037.
One methodology for creating stringent hybridization conditions is [insert B &
K) A primer set of this invention is typically a pair of oligonucleotides for amplification of ALS2 gene or related nucleic acids. Such a primer set may be designed on the basis of the base sequence of SEQ ID NO: 1, synthesized and subjected to purification using known methods. Size (base numbers) of the primer preferably is 15-40 bases or more preferably, 15-30 bases which specificity anneal with a template DNA. However, when LA
(long accurate) PCR is carried out, it is effective to use primers in excess of 30 bases. A pair (two) primers comprising sense strand (5'-terminal side) and antisense strand (3'-terminal side) should not be complementary. In addition, a self complementary sequence is to be avoided in a primer to prevent the formation of a hairpin structure. Further, in order to ensure a stable bond to a template DNA, the GC content should be about SO% and occurrence of GC-SUBSTITUTE SHEET (RULE 26) rich or AT-rich regions in a primer should be avoided. Since an annealing temperature is dependent upon Tm (melting temperature), primers having Tm of 55-65°C
are chosen so as to prepare a PCR product having a high specificity. The final concentration of the primer used in PCR should be about 0.1 to about 1 ~M. It is possible to use commercially available software for designing a primer including the OligoTM software [manufactured by National Bioscience Inc. (U. S. A.)] and GenetyxT"" software [manufactured by Software Development KK (Japan)}.
Mutated ALS2 genes may be obtained by a method where a DNA library prepared from cells of a patient thought to be suffering from ALS2 is screened with a probe which hybridizes to a region containing mutant (e.g. a base deficient site) under a stringent condition. Pure polynucleotide (DNA fragment or RNA fragment) may be obtained from genomic DNA, mRNA or cDNA of an ALS2 mutated gene or a complementary sequence thereof. For example, an ALS2 mutated gene comprises a nucleic acid where the 261 st base a of SEQ ID NO: 1 is deficient. Such a polynucleotide may be used for recombinant 1 S production of ALS2 modified protein or for diagnosis of ALS2.
A primer set for a PCR amplification of ALS2, including various regions having base deficient sites in mutated ALS2 is (for example) a pair of synthetic oligonucleotides comprising base sequences of SEQ ID NO: 6 and NO: 7. This primer set is capable of a PCR amplification of the region (339 bp) including exon 3 and introns before and after that in the ALS2 gene. Another primer set may be composed of synthetic oligonucleotides comprising base sequences of SEQ ID NO: 8 and NO: 9 and is capable of PCR
amplification of exons 2-4 (302 bp) of the ALS2 gene using RNA as a template.
Any PCR
product not cleaved by the restriction enzyme NarI is derived from the normal ALS2 gene but PCR products derived from a mutated ALS2 gene may be cleaved by NarI to give two fragments (Fig. 2c).
A recombinant vector of this invention may be a cloning vector or an expression vector. Vectors will be constructed depending upon the type of the polynucleotide as an insert or upon the object for use. For example, when an ALS2 protein or a modified protein thereof is produced using cDNA or its ORF region as an insert, there may be used an expression vector for an in vitro transcription or an expression suitable for each of prokaryotic cells such as Escherichia coli and Bacillus subtilis and eukaryotic cells such as SUBSTITUTE SHEET (RULE 26) yeast, insect cells and mammalian cells. When a genomic DNA of the ALS2 gene or a mutated gene thereof is used as an insert, it is also possible to use a BAC
(bacterial artificial chromosome) vector or a cosmid vector. Such recombinant vectors are also useful, for example, as probes for diagnosis of chromosome abnormality by hybridization including 5 fluorescent in situ hybridization (FISH). Further, a nucleic acid derived from a normal ALS2 gene may be recombined in a virus vector such as adenovirus or the like and the product may be used for genetic therapy.
In the manufacture of ALS2 peptide (including protein), a transformed cell of this invention may be a prokaryotic cell such as Escherichia coli and Bacillus subtilis or an 10 eukaryotic cell such as from yeast, insects, and mammals. In addition, cells (such as blood stem cells) derived from a patient suffering from ALS2 which are transformed by a virus vector of this invention in which a nucleic acid derived from a normal ALS2 gene is recombined, may be used for a genetic therapy of ALS2. Such transformed cells may be prepared by introducing a recombinant vector into cells by means of known methods such as 15 electroporation, calcium phosphate method, liposome method and DEAE dextran method.
A peptide of this invention may be an expression product of a normal ALS2 gene or an expression product of a mutated ALS2 gene. The normal gene product is a GTPase transcription factor or GEF having an amino acid sequence of SEQ ID NO: 2.
Peptides of this invention are useful as immunogens for the preparation of an antibody, as target molecules for the development of therapeutic agents for ALS2, etc. These peptides may be prepared by methods involving isolating peptides from the cells of healthy persons or patients suffering from ALS2. Methods of chemical synthesis on the basis of a desired amino acid sequence from SEQ ID N0:2 or SEQ ID N0:3, etc. and (preferably) by production and isolation or purification from the above-mentioned transformed cells. Such transformed cells are incubated and isolation and purification are carried out for the culture by, for example, means of treatment with a modifier such as urea or with a surface-active agent, ultrasonic wave treatment, enzymatic digestion, precipitation by salting out or by solvent, dialysis, centrifugal separation, ultrafiltration, gel filtration, SDS-PAGE, isoelectric electrophoresis, ion exchange chromatography, hydrophobic chromatography, affinity chromatography and reversed phase chromatography. Such proteins may include fused proteins with any other protein. For example, fused proteins with glutathione-S-transferase SUBSTITUTE SHEET (RULE 26) (GST) or green fluorescent protein (GFP) may be exemplified. In addition, the protein expressed in cells may be subjected to various kinds of modifications in the cells after being translated. Accordingly, modified proteins are also included in the coverage of the protein of this invention. Examples of the modification after translation as such are elimination of N-terminal methionine, N-terminal acetylation, addition of sugar chain, limited decomposition by intracellular protease, myristoylation, isoprenylation and phosphorylation.
An antibody of this invention is a polyclonal antibody or monoclonal antibody which recognizes a peptide of this invention. Examples include an antibody prepared using a peptide comprising an amino acid sequence of continuous 5 amino acid residues or more of the first to the 46th amino acid sequence in SEQ ID NO: 2 as an antigen and an antibody prepared using a peptide comprising an amino acid sequence of continuous 5 amino acid residues or more of the 47th to the 1657th amino acid sequence in SEQ ID NO: 2 as an antigen. When those two kinds of antibodies are used, it is possible to detect and differentiate normal and A261 del mutant proteins. The antibody of this invention includes all molecules which are able to bind to an epitope of an ALS2 protein or other peptide of this invention, and all of Fab, F(ab')2, Fv fragments, etc. thereof. Such an antibody can be obtained from serum after an animal is immunized using ALS2 derived protein or peptide as an antigen. Alternatively, the above expression vectors for eukaryotic cells may be introduced into muscle or skin of animals by injection or particle gun and then serum is collected therefrom. Examples of animals that may be used are mouse, rat, rabbit, goat, chicken, etc. When B cells collected from the spleen of an immunized animal are fused with myeloma cells to produce a hybridoma, it is possible to produce monoclonal antibodies.
The diagnostic method of this invention is one in which an ALS2 mutated gene or a transcription product of an ALS2 mutated gene is detected whereby the risk of onset of ALS2 may be estimated. Particularly amenable are persons of known ALS2 families although diagnosis is not limited thereto.
Genomic DNA of a person to be diagnosed may be subjected to a PCR
amplification using any of the above-mentioned primer sets or other oligonucleotides of this invention.
The resulting DNA fragment may be treated with one or more restriction enzymes such as NarI, and the person to be diagnosed where the DNA fragment is cleaved into fragments different from cleaving product produced from a person not suffering from ALS2 is SUBSTITUTE SHEET (RULE 26) indicative of a patient suffering from ALS2 or a person with some risk of ALS2 in view of the presence of a mutation in the ALS2 gene.
It is also possible to detect the ALS2 mutated genes by (for example) an allele specific oligonucleotide probe method, an oligonucleotide ligation assay method, a PCR
SSCP method, a PCR-CFLP method, a PCR-PHFA method, an invader method, an RCA
(rolling circle amplification) method and a primer oligo base extension method.
In detecting transcription products of ALS2 mutated genes, diagnosis may be carried out by determining the sequence of mRNA of the person to be diagnosed or cDNA
thereof.
It is also possible to carry out the diagnosis in such a manner that an ALS2 gene of a person to be diagnosed or cDNA thereof is recombined with an expression vector, transfected to cells and the expression product thereof measured.
Expression products of normal and mutant ALS2 genes may be assessed by measurement of molecular weight. For example, the frame shift caused by deletion of one base in normal ALS2 gene whereupon the modified protein is changed to a low-molecular protein (SEQ ID NO: 3) comprising the first to the 46th amino acid residues of SEQ ID NO:
2 and three amino acid residues (Pro-Ser-Glu) newly coded by the frame shift results in a product having a molecular weight easily comparable to naturally occurnng gene products of the normal ALS2 gene. Further, diagnosis may be also carned out by the above antibody provided by this invention in which the ALS2 modified protein reacts with an antibody recognizing (for example) the first to the 46th amino acid sequence in SEQ ID
NO: 2 or a region comprising amino acids 43-49 of SEQ ID N0:3, but does not react with an antibody recognizing the 47th to the 1657th amino acid sequence region in SEQ ID NO: 2.
Antibodies specific for amino acids 476 to 545 of SEQ ID N0:84 as compared to any of the amino acids of SEQ ID N0:2 could be similarly used for diagnosis of the AGI548de1.
Diagnosis using antibodies may, for example, be carried out with an ELIZA
method.
The-mouse ALS2 gene of this invention is a mouse genomic gene isolated as a homolog of the human ALS2 gene and which codes for a mouse ALS2 protein comprising an amino acid sequence of SEQ ID NO: 5. Its cDNA has a base sequence of SEQ ID
NO: 4.
This gene may be used for the preparation of a "knock-out" mouse.
Such a "knock-out" mouse can be prepared by known gene targeting methods (Science, 244: 1288-1292, 1989) or generally according to the following exarriple.
SUBSTITUTE SHEET (RULE 26) First, a DNA fragment of the mouse ALS2 gene including the initiation codon of the gene is modified whereupon a defective DNA fragment which deletes expression of the ALS2 gene is obtained. This defective DNA fragment is used for the preparation of a targeting vector for introduction of the modification into a mouse totipotent cell (ES cell) according to known methods (such as the method described in Science, 244: 1288-1292, 1989). For example, genomic DNA comprising the ALS2 gene is substituted or inserted with a resistant gene to a cytotoxin to prepare a recombinant plasmid DNA
possessing the defective gene having a sequence homologous to the genomic DNA of the ALS2 gene at both terminals (the targeting vector). It is also possible for the resistant gene to be connected to a sequence such as PGKl promoter and PGKl polyadenylation signal for controlling the expression. It is preferred that the genomic DNA site of the ALS2CR6 gene which is substituted with or inserted by resistant gene be a genomic DNA region containing an exon region containing an initiation codon.
There are no particular limitations on such target vectors except that it will have a sequence which is homologous to genomic DNA of the ALS2 gene and a resistance sequence or other sequence useful for cell sorting (such as diphtheria toxin A
gene and thymidine kinase gene of herpes virus). A promoter and enhancer may be appropriately combined and used. The targeting vector is then introduced into an ES
(embryonic stem) cell according to known methods (e.g. Nature, 292: 154-156, 1981). Such methods include electric pulse, a liposome and calcium phosphate. When recombination efficiency of the gene to be introduced is of concern, the electric pulse methods is preferred.
DNA in each of the ES cells into which gene is introduced is extracted and, by means of a southern blot analysis or a PCR assay, cells are selected in which a homologous gene recombination has taken place between the wild type ALS2 gene existing on the chromosome and the introduced defective ALS2 gene fragment resulting in placement of the defective gene fragment in the chromosome.
An ES cell having a defective gene prepared above may be injected into a blastocyst of a wild type animal and chimera-embryos obtained which are transplanted to the uterus of a preliminary parent. Resulting progeny are selected for the ALS2 defective gene and bred.
Selection may be carned out by checking the difference in the color of hair or by extraction of DNA from a part of the body (such as the tail end) followed by conducting a southern blot SUBSTITUTE SHEET (RULE 26) analysis, a PCR assay after extraction of DNA, etc. As to the offspring obtained by a crossbreeding of animal of a wild type with a chimera animal where the ALS2 defective gene is in the generative cells, a southern blot analysis, a PCR assay or the like may be carned out using the DNA extracted from a part of the body (such as the tail end) as a material to identify a heterozygote into which the ALS2 defective gene is introduced. A
heterozygote possessing the ALS2 defective gene which is stable in all generative cells and somatic cells may be bred to produce progeny in which the ALS2 gene is completely knocked-out".
An animal prepared as such may be used for analysis of function of ALS2 gene in onset of ALS2 and for screening of therapeutic drugs or development of therapeutic methods as an ALS2 model animal.
Methods and results of procedures carned out for cloning of the ALS2 gene and for functional analysis thereof are shown.
1. Methods 1-1. ALS2 family Sixteen cases including 8 individuals suffering from the disease obtained from a Tunisian consanginous ALS2 family (literature 2) were analyzed. The characteristic of ALS2 is a progressive convulsion of muscles of the limbs and the face accompanied by distal myoatrophy of the hand and the foot. Age of onset is between 3 and 10 years age (literature 2). According to biopsy of nerves and muscles and also to electromyography test, there was confirmed deletion of distal motor neuron (literature 2). When a gene type of the ' polymorphic DNA markers was analyzed together with clinical test data, ALS2 was clearly an autosomal recessive inheritance.
1-2. Transcription map Genome Data Base (GDB) (http://www.gdbwww.gdb.org) and UniGene (http://www.ncbi.nJm.nih.gov) of the Biotechnology Information Center (NCBI) which were open to the public for discriminating the sequence of transcribed DNA mapped within an objective region were retrieved. Sequence of genomic DNA. overlapped with the objective region of ALS2 was retrieved from the "nr" or "htgs" data base of GenBank and utilized as the object for the test when a BLAST retrieval to the dbEST data base is conducted. In order to isolate the transcript of a full length, there were carned out RT-PCR, 5'-RACE and cDNA
SUBSTITUTE SHEET (RULE 26) library screening. In addition, EST clone was purchased from Research Genetics and sequencing for DNA was carned out for measuring the insertion of the whole clone.
Sequence of double stranded DNA was determined by conducting a dideoxy sequencing using a BigDye Terminator Cycle Sequencing Kit (ABI) and an AB 1377DNA
sequences.
5 All sequences of EST data, PCR products and DNA obtained from cDNA clone were determined and an estimated independent transcription unit was established.
Then each unit was mapped on a physical map by a PCR method.
1-3. Identification of exon In order to determine the constitution of intron and exon of the transcription DNA, 10 genomic DNA sequence data open to the public in GenBank data base was compared with the sequence of cDNA using a Sequences Version 3.0 (Gene Codes Corporation) program according to the descriptions of BLAST (literature 28) and literatures (5 and 6).
1-4. PCR
Exon and intron/exon boundaries were subjected to a PCR amplification. ExTaq 15 polyrnerase (Takara) was used and a cycle of 95°C for 15 seconds, 60°C for 30 seconds and 72°C for 30 seconds was repeated for 35 times whereby about SO mg of genomic DNA were amplified by a PCR. In order to detect the deficient form of the transcription DNA, an RT-PCR was carried out. Total RNA from lymphocytes of four patients of a family of ALS2 and two carriers was isolated. Total RNA extracted from a healthy human brain was 20 purchased from Clontech. An RT-PCR was carried out using a Superscript pre-amplification system (Gibco-BRL) according to the protocol of the manufacturer. The oligonucleotide primer for such a PCR was designed using Primer 3.0 (http://www-genome.wi.mit.edu). Table 1 lists the primers used for amplification of ALS2 (ALS2CR6).
SUBSTITUTE SHEET (RULE 26) %
%
%
N
?C W
W W
a y ai 0 4 a a a a .-r M V1 l~ O~ .~ M V1 I~ 01 .-, M v'1 t~ O~ .-r M U1 l~ O~ .r M h I~ O~ .~ M
V1 l~ Ov .~ M V~ l~ O~ .-~ M
az ~~n~.~.-~NNNNNMMMMM~~~~W llNllWll~l1\plp~D~pvp[w[v[vI~[vpppp N ~
Ivl ,.1 . . . . . ~j . 1.1 . . . . . . . . 1 p1 1.1 ~~, a ~, ~, ~, ~, ~, 1 ~, ~, r, ~ ~, a ~, ~, ~, .., ~~, .~, ~ ~, ~, ~ 1 r~, r, r, m~, r, r, ~, a 1 1 1 R 1 1 1 1 1 U 1 1 1 1 1 H 1 ' 1 1 1 1 1 1 1 1 1 O O O~ 1 1 1 1 1 1 1 1 O U
U t U U O U O U O O O O O O U ~ O O U O O O O U t t H U U U U .U U U O U
O U t H O ~ O H O U H O H F F O p U O t p p U O U O p O t O p F t tHUU ~tHOUUO U U4FOtUt OUO,~~tOtWIHU~U
U F U U H ~ U ~ U t U H U ~ ~ O t OO t O O t U U 44 U
K U ~ ~ U ~ H t ~ U ~ O U t ~ ~ O U t ~ E ~ t ~ t t t O U ~ H t H
,- t U H U O U p t H U
U ~ U O U ~ U H H R O ~ O ~ O ~ U ~ t U U ~ ~ t U U O
D U R t ~ t O U ~ ~ ~ K H U ~ U O U O U ~ O t Et., .~ ~ ('~-~ OU
t t O U U U U 4 t t U t t U U H t H
Gi. t t ~ O O ~ ~ H ~ O ~ ~ ~ U ~ U ~ 4 ~ (~. ~ ~ t O U U O H
ai ~ U U t H .t 0 t H t H H t O U U U E O H U H O
U ~ U U ~ U t O ~ O O U U U O ~ O H t U O ~ Wt U ~ U ~ ~ ~ ~ ~ U
> E R N ~ U t U t ~ t O O O O O U O ~ t t U ~ ~ U O ~ H ~ O O
U R O U O t 4 ~ R O v O H O F O t t OU.U~~OEUEF~U~V~~UE~~E,D.,tD4~~OEt~~,'~U~~~~~UE
'W t ~ H O U ~G ~C O t O O U H U H ~ R t U ~c L~ ~ H U O H U ~ U ii E ~ O U O
O H
yf1 1f1 Vf 1I1 Iff ~!1 1I1 1f1 1!1 1f1 ~fl t!1 U1 1l1 If1 1f1 If1 111 1f1 1f1 off 1f1 Nnff 1f1 1f1 1f1 1f1 V1 1f1 1f1 1f1 1f1 U1 1f1 1f1 ~!1 ~R
z ON~~~o00NNNN~MMMMM~~~~ttO~v~lh~~~ ~~~~~h~o~00N0 W
.
rf 1 r1 wm1 O n 1'n ~ ~n n w1 1~1 O e~1 ,., ,., ,.~ 1., ,.1 w1 i1 In ~ e~ ~n r1 r1 n e~ - . wwn. 1.1 w1 w1 1 1 1 O 1 1 1 1 1 1 t t 1 1 1 1 1 1 1 U 1 1 O 1 1 t U 1 1 1 1 AI 1 1 1 1 1 O U O R U O U U~ U U O U U U U U U U N H U U 1 U U U O
U O U ~ D R eU~ ~ 4 E ~ ~ O U O E ~ ~ U ~ O ~ R U ~ ~ U R O V U O
U O U H't U 4 H O H.t ~ U U H t U O U O ~ O t ,." U ~ ~ N O ~ ~ ~ ~ F H ~ ~ R U ~ R ~ ~ ~ U ~ ~ U ~ ~ ~ 4 a) U t t U U U t O U U U t U t t t t t O ' it O ~ ~ ~ ~ U U O E E F U U E E O O' U ~ F ~ U ~ U 4 ~ ~ U U ~ O U F O
U U t H t U U t t O t t O t ~ t U U t U t ~ 4 O O
U ~ U t t t ~ p O U ~ ~ ~ O H O H U O U U U U U t O H O ~ ~ U ~ ,~ H U 4 F O U t H 4 O O K U U p 4 ~ ~ t U
U K R ~ ~ ~ U . O t ~ O O U t ~ t H .~ ~ ~ ~ p O ~ ~ ~ H t O U 4 H
t U E U F U U t U t t O H H O O K U O
4 ~ D ~ U 4 O 4 ~ U O O ~ N E U ~ O O ~ 4 R V ~ F F ~ ~ O p O O
O O t O F O U U H U U t H O O O t H ~ ~ t0 ~ ~ O
O ~ ~ ~ U '~"' p . U ~ ~ ~ U O U ~ ~ ~ ~ ~ < ~ O ~ U U ~ H ~ ~ ~ ~ N U ~ O O
O H U U U
1f1 1f1 If1 1f1 ~I1 1f1 1f1 Ifl vf1 1!1 ~f1 ~f1 ~f1 t11 V1 N ~f1 V1 1!1 1f1 1f1 1f1 1f1 N If1 II1 1!1 1I1 1f1 1f1 1f1 1f1 1f1 1f1 1f1 1f1 1f1 1f1 ~ 01 r1 01 1T U1 m O N PI P1 01 1~ m b O N O of O CD CD Of ~ O U1 O O 1f1 N O
V If1 O Il1 N N1 b 1~1 .D (3. ,.1 m Pf N 1~1 01 01 N d 1~ m b d N 00 b 01 If1 V 0f m 1~ r1 1f1 O 10 N
1f1 b I~ 0f m O t~ 'i 01 m O
O ~, Y1 PI f'1 V d b V P1 V 1~1 Pf V A1 V V Q N Q P1 N PI Pf ~ Q N N P1 d n N
1~1 N V 1~1 1~1 f~l V V
pr 'i N
1 1 w N d 1f1 ' 10 i~ m r1 N P1 d 1f1 r-1 N 1~1 V V O O O O O O r1 N P1 d 1f1 b r m pf O r1 N PI O O O O O 1f1 10 r 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 ., ', ., .-1 o d a o m o 0 0 0 0 exr o m m o axr o o m d m m a o m ~ o 0 0'o n o d d d o 0 0 0 0 0 0 0 o x x O O 'O O O r1 ~ O r1 ,-1 r1 r1 ,-1 ,-1 ,-1 'i ,-1 r1 ,H r1 'i r1 r1 r1 ,~ ~ N
N N N N~ N N N N N 0 V
f~ f~ ~ t~ h N r1 N r1 .i v-1 n.1 ,-1 e1 r1 ,~i ri r1 ,-1 r1 ,-1 ,1 r1 ,~i ,1 ,1 N N N N N N N N N N V V
I1 N Pf !1 N r1 e~1 '1 rl-1 r1 r1 '1 'r1 r1 r1 'i r1 n-1 r1 n-1 n-1 e1 e~l rl-1 v-1 n-1 'i r1 e-1 r1 n-1 r1 r1 e-1 N N
VJ Vl N fA N tp VJ 41 N V1 N N N tp VJ N UJ 4J V1 UJ tp V! N V! f!7 f7J 41 H
V1 N N N if! N N VJ H
.a .7 .7 J .7 .7 .7 J .7 .a J J .7 .7 .a .7 .7 ~7 '7 .7 .7 .7 J a .7 .7 .7 .7 .7 .7 .7 .7 .7 J .7 ~i ~7 t t t 4 4 4 t t 4 4 4 t Q t t t t t t nt 4 t t 4 4 K t 4 t t t t t t t 4 t 4 U U U 'U U U U U U U U U U U U U U U U U U U U U U U U U U U U U U U U U U U
o .r N In v In to r m o1 0 .-1 N r1 vm b r ao 01 0 .-1 N n a v1 b e~ m of o .1 N <n v k aHrlrlrlrlrlr-1e-irIrINNNNNNNNNN 1~1P1NI~P1 1~1 W
SUBSTITUTE SHEET (RULE 26) 1-5. Analysis of mutation In order to detect the mutation of the DNA sequence at the exon or the intron/exon boundary, a DNA sequence of a PCR product of exon was determined. The DNA
sequence of a PCR product of exon was analyzed using the same oligonucleotide as a primer. The S sequence in the data base open to the public was compared with the DNA
sequence obtained from patients, carriers and healthy persons and changes in the nucleotide were discriminated.
[0044]
It was also confirmed that a new NarI site was formed (A261 del) after the treatment with NarI by means of an RT-PCR amplification of exons 2-4 or a PCR
amplification of exon 3. As to the primers for exon 3-PCR, there were used 5' CCTAGTCATCCATGTGCTGG-3' (SEQ ID NO: 6) and 5' TCCCATACCTGACCTTCCAC-3' (SEQ ID NO: 7). As to the primers for the RT-PCR of exons 2-4, there were used 5'-CTTGATAGACTTTCTGTAAAGAAG-3' (SEQ ID NO: 8) and 5'-GGCTACTTGGACAAATCTCCACTG-3' (SEQ ID NO: 9). Decomposed product with NarI was separated by 1.5% agarose gel.
1-6. Northern blot analysis Northern (MTN) blot (Clontech) of many human adult tissues was hybridized with exon 4 labelled with 32P-dCTP of ALS2CR6 or human (3-actin cDNA in a Perfect Hyb hybridizing solution (Toyobo). The membrane was washed with 0.1 x SSC
containing 1%
of SDS and subjected to an X-ray film (Bio-MAX, Kodak).
1-7. mRNA in situ hybridization Antisense and sense cRNA probes were prepared from two mouse cDNA clones m2-as and m2-s. Those mouse cDNA clones covered a part of mouse mALS2CR6 cDNA
(from the 1732nd to the 2685th bases of SEQ ID NO: 4; 954 bp) and inserted into pCR2.1 (Invitrogen) in an opposite direction. The probes were prepared according to the protocol of the manufacturer (Roche Molecular Biochemicals) by an in vitro transcription reaction where digoxigenin-labelled UTP and T7 polymerase were mixed. Preparation of the sample and method for the in situ hybridization were in accordance with the literature (29).
1-8. Retrieval of the data base Each of DNA and amino acid sequences was compared with the data base of sequences of nucleotide and protein which were not overlapped each other using BLASTN
SUBSTITUTE SHEET (RULE 26) and BLASTP. Domain and motive of protein were identified by MOTIF servers of Genome Net Japan (http://www.~enome.ad.jp), search launcher of BCM
(http://www.hgse.hem.tmc.edu/Search.launcher) and CD search of NCBI
(http://www.hcbi.nlm.nih.~ov).
S 2. Results The inventors have prepared a physical map on the basis of YAC/BAC/PAC of genomic region of 3 Mb covering a complete candidate region to ALS2 (literatures 5 and 6).
Sequences of EST and cDNA clone were analyzed within a broad area and, at the same time, this physical map was used for the mapping of 43 independent transcription units including previously analyzed 18 genes (KIAA0005, CLKl, PP1L3, ORC2L, NDUFB3, CFLAR, CASP10, CASPB, FZD7, NOPS, UBL1, BMPR2, FLJ10881, LOC57404, AIP-1, CD28, CTLA4 and AILIM) and new 10 full-length transcription products (ALS2CR1, ALS2CR2, ALS2CR3, ALS2CR4, ALS2CR5/MPP4, ALS2CR6, ALS2CR7, ALS2CR8, ALS2CR9 and ALS2CR12). Those genetic sequences were present in the locus of ALS2 (Fig.
1 ).
[0045]
Juvenile ALS2 is rare and has a sign that, in teens and twenties, muscular convulsion of limbs, face and throat gradually expresses. Since ALS2 is recessively hereditary, it is predicted that this ALS2 disease may take place by a loss of a functional mutation. . Big deletion or translocation in the ALS2 locus was investigated by a mapping of STS/EST
content on the basis of a PCR and a southern blot analysis but that was not detected. After that, small deletion or base substitution in exon or intron-exon boundary was investigated.
In order to detect those mutations, each gene was analyzed and an intron/exon boundary thereof was determined. Until now, 395 exons have been identified from 42 genes. In order to amplify exon and flanking sequence thereof including consensus sequence to splicing donor and acceptor, 411 primers in total were designed and those primers were used to amplify the genomic DNA of 10 normal control persons who were not related to 14 persons of the ALS2 family (Fig. 2a) by PCR. Sequence of each of those PCR products was determined whereby 77 sequence polymorphs in total of intron or exon were identified.
Among those 77 polymorphs, 8 mutations contained in 4 different genes were related to ALS2 (Table 2).
SUBSTITUTE SHEET (RULE 26) Table 2 Gene Region Normal ALS2 NOPS intron 2 tatctc(T~9aattct NOPS intron 6 gttttg(TTG 2ttttta ~ (TTG)3 ALS2CR6 intron 2 ggtaaAtcattt ALS2CR6 exon 3 gcaggcAgccctc -~ A261 deletion*
ALS2CR8 intron 6 gtcagtAttataa ALS2CR9 exon 4 ctccagCatggac ~ T (3rd codon) ALS2CR9 intron 7 ttgggaTtttttt ALS2CR9 intron 8 aaaataCggatat .-~ T
Among those sequence mutations, one nucleotide deletion (A261 del) noted in exon 3 of ALS2CR6 broke the reading frame and it is suggested that such a mutation mutates the protein. All of the suspicious hetero-conjugative carriers show a duplicated sequence pattern starting from the first nucleotide after the deficient part (Fig. 2b). This deletion clearly moves together with an ALS2 expression type (Fig. 2c) and is not noted in 533 normal control individuals of various races (data not shown). In other mutations, one base substitution from C to T in exon 4 of ALS2CR9 gene is included (C873T).
However, this mutation corresponds to the third codon and, therefore, it does not change the amino acid residue. In order to detect a splicing error which is made latent or manifest by other sequence mutation, an RT-PCR was carned out using total RNA extracted from lymphocytes of patients and healthy control persons but no sequence mutation of mRNA
was detected (data not shown). Accordingly, the mutation related to ALS2 of intron or exon region does not cause a splicing error. From those results, it has been confirmed that deletion of one base in exon 3 of ALS2CR6 (A261 del; Table 1 ) is mutation concerning ALS2.
ALS2CR6 gene contains 33 introns and 34 exons and is present in a genomic DNA
of 80.3 kb adjacent to a polymorphic DNA marker D2S2309 (Fig. 1).
Transcription polarity of the ALS2CR6 gene is in the direction of central body from telomere. An transcription product (mRNA) has a full length of 6394 by (SEQ ID NO: 1) having a single open reading frame (ORF) with a length of 4974 nucleotides (124-5,097 nt) and codes for a SUBSTITUTE SHEET (RULE 26) protein of 184 KDa comprising 1,657 amino acid residues. Polyadenylated estimated signal (AATAAA: 6,375-6,380 nt) and poly(A) region are clear. A short ALS2CR6 transcription product in a full length of 2,651 by having 1,191 by ORF coding for a 396 amino acid sequence was identified as well. This short variant splices a 5'-donor site after exon 4 and, S as a result, a stop codon is formed after 25 amino acid residues of intron 4. Being correspondent to those results, 2 transcription products of about 6.5 kb and 2.6 kb were identified in many adult human tissues by a northern blot analysis (Fig. 3a).
Except the liver where short transcription products are mostly expressed, both transcription products showed the similar expression pattern. It has been confirmed that a big transcription product of 6.5 10 kb is expressed in a slightly higher level than a transcription product of 2.6 kb and is most abundantly expressed in the cerebellum. This gene has been also confirmed to be expressed in cells of ALS2 patients (Fig. 3b).
Further, a mouse homolog of ALS2CR6 was isolated and named mALS2CR6. A
transcription product of mALS2CR6 is in a full length of 6,349 by (SEQ ID NO:
4) having 15 one ORF of 4,956 by (124-5,076 nt) and codes for a protein of 183 kDa comprising 1651 amino acids (SEQ ID NO: 5). The ORF as a whole is well reserved in a DNA level (87%
same) and a protein level (91 % same; 94% similar; Fig. 4) between human being and mouse and it is suggested that ALS2CR6 gene is a gene which is well reserved in mammals.
In order to check the localization property of expression of mALS2CR6 transcription 20 product in the brain and the spinal cord of mouse, an in situ hybridization using riboprobe corresponding to a part of mALS2CR6 cDNA was carned out. The result was that, as shown in Fig. 5, the mALS2CR6 transcription products were expressed in various levels in nerve cells from the brain to the spinal cord, especially in neurons of hippocampal and dentate gyrus, cerebellar Purkinje cells, neurons of cerebral cortex and spinal cinerea 25 including anterior horn cells. In addition, a significant expression was noted in neurons of olfactory bulb, basal nucleus and cranial nerve nucleus as well.
Human ALS2CR6 protein showed many interesting properties (Fig. 6a). The first property is present in a region of A-terminal side of ALS2CR6 and it showed a high homology to RCC 1 (regulatory factor for concentrating the chromosome;
literature 7) and RPGR (GTPase for pigmentary retinitis; literature 8)(Fig. 8). RCC1 and RPGR
protein acts as a guanine nucleotide exchange factor (GEF) for GTPase like Ran. The second property is SUBSTITUTE SHEET (RULE 26) that ALS2CR6 has a Dbl-homologous (DH) domain and a pleckstrin-homologous (PH) domain and both domains are typical domains noted in RhoGEF protein (literatures 9 and 10). In addition, VPS9 domain is noted in a C-terminal region as well. VPS9 domain is noted in many GEF including Vps9 (literature 11 ) and Rabex-S (literature 12) and each is said to mediate the selection of vacuole protein and the phagocytic transportation. Two MORN motives comprising 14 amino acids (literature 13) were noted as well.
According to the recent study for junctophilin containing an MORN motive, this motive is shown to contribute in bonding of plasma membrane (literature 13). It has been known that GEF is related to a GDP bonding form of GTPase and promotes the dissociation of GDP
and bonding of GTP whereby GTPase is activated. Since it has been known (literatures 18 and 19) that GEF plays an important role in many signal transmission cascades (literature 14), neuron formation (literature 15), membrane transportation (literature 16) and formation of actin cell skeleton (literature 17), it is likely that ALS2CR6 acts as a regulatory factor/activator of Ran-related GTPase, regulates the formation of membrane, and acts in a (membrane) transportation of cells including neurons.
According to an RT-PCR analysis, a transcription product of mutated ALS2CR6 gene is transcribed from chromosomes of the patient (Fig. 2c) and produces a modified protein comprising 49 amino acids having three new residues (Pro-Ser-Glu) at C-terminal (Fig. 6a). Since this modified protein has no functional domain corresponding to ALS2CR6 protein, it seems to make the inherent function lost. Accordingly, the A261de1 mutation noted in this ALS2CR6 is related to the fact that ALS2 is recessively hereditary.
A recent finding that ALS is related to defect for the transportation of axon and the formation of cell skeleton (literatures 20, 21 and 22) induces a hypothesis that ALS2CR6 gene corresponds to ALS2 and that ALS2 is generated by the defect of membrane structure due to lacking in a regulatory function of membrane structure Ran-related GTPase.
ALS2CR6 gene is the second ALS gene succeeding to the determination of role of copper-zinc superoxide desmutase (SDS-1) in~ ALS. Mutation of SOD-1 is related to the form of tardive autosomal dominance (literature 23).
Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, it will be readily apparent to those of skill in the art in light of the teachings of this invention that changes and SUBSTITUTE SHEET (RULE 26) modification may be made thereto without departing from the spirit or scope of the appended claims. All patents, patent applications and publications referred to herein are hereby incorporated by reference.
S
References 1. Siddique, T., Nijhawan, D.& Hentati, A. Molecular genetic basis of familiar ALS.
Neurology 47, S27-S35 (1996).
2. Ben Hamida, M., Hentati, F. & Ben Hamida, C. Hereditary motor system diseases (chronic juvenile amyotrophic lateral sclerosis). Brain 113, 347-363 (1990).
3. Hentati, A. et al. Linkage of recessive familial amyotrophic lateral sclerosis to chromosome 2q33-q35. Nature Genet. 7, 425-428 (1994).
4. Hosler, B.A. et al. Refined mapping and characterization of the recessive familial amyotrophic lateral sclerosis locus (ALS2) on chromosome 2q33. Neuro genetics 2, 34-42 (1998).
5. Hadano, S. et al. A yeast artificial chromosome-based physical map of the juvenile amyotrophic lateral sclerosis (ALS2) critical region on human chromosome 2q33-q34.
Genomics 55, 106-112 (1999).
6. Hadano, S. et al. Cloning and characterization of three novel genes, ALS2CR1, ALS2CR2, and ALS2CR3, in the juvenile amyotrophic lateral sclerosis (ALS2) critical region at chromosome 2q33-q34: candidate genes for ALS2. Genomics 71, 200-213 (2001 ).
SUBSTITUTE SHEET (RULE 26) 7. Lerman-Sagie, T., Filiano, J., Smith, D.W. & Korson, M. Infantile onset of hereditary ascending spastic paralysis with bulbar involvement. J. Child Neurol. 11, 54-(1996).
8. Ohtsubo, M. et al. Isolation and characterization of the active cDNA of the human cell cycle gene (RCC1) involved in the regulation of onset of chromosome condensation.
Genes Dev. 1,585-593 (1987).
9. Meindl, A. et al. A gene (RPGR) with homology to the RCCI guanine nucleotide exchange factor is mutated in X-linked retinitis pigmentosa (RP3). Nature Genet. 13, 35-42 (1996).
10. Renault, L. et al. The 1.7 ~ crystal structure of the regulator of chromosome condensation (RCC1) reveals a seven-bladed propeller. Nature 392, 97-101 (1998).
11. Carazo-Salas, R.E. et al. Generation of GTP-bound Ran by RCC1 is required for chromatin mitotic spindle formation. Nature 400, 178-181 (1999).
12. Soisson, S.M., Nimnual, A.S., Uy, M., Bar-Sagi, D. & Kuriyan, J. Crystal structure of the Dbl and pleckstrin homology domains from the human Son of sevenless protein.
Cell 95, 259-268 (1998).
13. Hama, H., Tall, G. G. & Horazdovsky, B.F. Vps9p is a guanine nucleotide exchange factor involved in vesicle-mediated vacuolar protein transport. J. Biol. Chem.
274, 15284-15291 (1999).
14. Horiuchi, H. et al. A novel RabS GDP/GTP exchange factor complexed to Rabaptin-5 links nucleotide exchange to effector recruitment and function. Cell 90, 1149-(1997).
SUBSTITUTE SHEET (RULE 26) 15. Takeshima, H., Kamazaki, S., Nishi, M., Iino, M. & Kangawa, K.
Junctophilins: a novel family of functional membrane complex proteins. Mol. Cell 6, 11-22 (2000).
16. Barrett, K., Leptin, M. & Settleman, J. The Rho GTPase and a putative RhoGEF
mediate a signaling pathway for the cell shape changes in Drosophila gastrulation.
Cell 91, 905-915 (1997).
17. Martijn, F.B.G. et al. Identification of a novel, putative Rho-specific GDP/GTP
exchange factor and a RhoA-binding protein: control of neuronal morphology.
.I.
Cell. Biol. 137, 1603-1613 (1997).
18. Roepman, R. et al. The retinitis pigmentosa GTPase regulator (RPGR) interacts with novel transport-like proteins in the outer segments of rod photoreceptors.
Hum. Mol.
Genet. 9, 2095-2105 (2000).
19. Luo, L. Rho GTPases in neuronal morphogenesis. Nature Rev. Neurosci. 1, (2000).
20. Hall, A. Rho GTPases and the actin cytoskeleton. Science 279, 509-514 (1998).
21. Culbertson, M.R. RNA surveillance. Unforeseen consequences for gene expression, inherited genetic disorders and cancer. Trends Genet. 15, 74-80 (1999).
22. Collard, J.-F., Cote, F. & Julien, J.-P. Defective axonal transport in a transgenic mouse model of amyotrophic lateral sclerosis. Nature 375, 61-64 (1995).
23. Williamson, T.L. & Cleveland, D.W. Slowing of axonal transport is a very early event in the toxicity of ALS-linked SOD1 mutants to motor neurons. Nature Neurosci.
2, SO-56 (1999).
SUBSTITUTE SHEET (RULE 26) 24. Rosen, D.R. et al. Mutations in Cu/Zn superoxide dismutase gene are associated with familial amyotrophic lateral sclerosis. Nature 362, 59-62 (1993).
25. Chance, P.F. et al. Linkage of the gene for an autosomal dominant form of juvenile 5 amyotrophic lateral sclerosis to chromosome 9q34. Am. J. Hum. Genet. 62, 633-(1998).
26. Blair, LP. et al. A gene for autosomal dominant juvenile amyotrophic lateral sclerosis (ALS4) localizes to a 500-kb interval on chromosome 9q34. Neurogenetics 3, 1-6 10 (2000).
27. Hentati, A. et al. Linkage of a commoner form of recessive amyotrophic lateral sclerosis to chromosome 15q15-q22 markers. Neurogenetics 2, 55-60 (1998).
15 28. Hosler, B.A. et al. Linkage of familial amyotrophic lateral sclerosis with frontotemporal dementia to chromosome 9q21-q22. JAMA 284, 1664-1669 (2000).
29. Altschul, S.F. et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389-3402 (1997).
30. Matsumoto, K. et al. Neuronal apoptosis inhibitory protein (NAIP) may enhance the survival of granulosa cells thus indirectly affecting oocyte survival. Mol.
Repod. Dev.
54, 103-111 (1999).
SUBSTITUTE SHEET (RULE 26) SEQUENCE LISTING
<110> The University of British Columbia; and Japan Science and Technology Corporation <120> ALS2 Gene and Amyotrophic Lateral Sclerosis Type 2 <130> 80021-346 <140>
<141>
<150> US 60/267,723 <151> 2001-02-12 <150> JP 2001-116973 <151> 2001-04-16 <150> US 60/318,352 <151> 2001-09-12 <160> 84 <170> PatentIn Ver. 2.0 <210> 1 <211> 6394 <212> DNA
<213> Homo Sapiens <220>
<221> CDS
<222> (124)..(5097) <400> 1 ggacccactg ggttgccaag ctcgcgccgg atgcggagcg cggtgctgcc ggtggagctt 60 caggtcttga tagactttct gtaaagaagg aatgatttgg tgatggagtg ttcccactga 120 ccg atg gac tca aag aag aga agc tca aca gag gca gaa gga tcc aag 168 Met Asp Ser Lys Lys Arg Ser Ser Thr Glu Ala Glu Gly Ser Lys gaa aga ggc ctg gtc cat atc tgg cag gca gga tcc ttt ccc ata aca 216 Glu Arg Gly Leu Val His Ile Trp Gln Ala Gly Ser Phe Pro Ile Thr cca gag aga ttg cca ggc tgg gga gga aag act gtt ttg cag gca gcc 264 Pro Glu Arg Leu Pro Gly Trp Gly Gly Lys Thr Val Leu Gln Ala Ala ctc gga gtg aaa cat gga gtt ctt ctg act gaa gat ggt gag gtc tac 312 Leu Gly Val Lys His Gly Val Leu Leu Thr Glu Asp Gly Glu Val Tyr agc ttt ggg act ctt ccc tgg aga agt gga cca gtg gag att tgt cca 360 Ser Phe Gly Thr Leu Pro Trp Arg Ser Gly Pro Val Glu Ile Cys Pro agt agc ccc att cta gaa aat gcc ctg gtt ggg caa tat gtt att act 408 Ser Ser Pro Ile Leu Glu Asn Ala Leu Val Gly Gln Tyr Val Ile Thr SUBSTITUTE SHEET (RULE 26) gtg gca aca gga agc ttc cat agt gga gca gtg aca gac aat ggt gtc 456 Val Ala Thr Gly Ser Phe His Ser Gly Ala Val Thr Asp Asn Gly Val gcg tac atg tgg gga gag aat tct get ggc cag tgt gca gta gcc aac 504 Ala Tyr Met Trp Gly Glu Asn Ser Ala Gly Gln Cys Ala Val Ala Asn cag cag tat gtg ccg gaa cca aat cct gtc agc att get gat tct gag 552 Gln Gln Tyr Val Pro Glu Pro Asn Pro Val Ser Ile Ala Asp Ser Glu gcc agc cct ttg tta gca gtc agg att tta cag ttg gcg tgt ggc gag 600 Ala Ser Pro Leu Leu Ala Val Arg Ile Leu Gln Leu Ala Cys Gly Glu gag cac act ctg gca ttg tca ata agc aga gag att tgg gca tgg ggt 648 Glu His Thr Leu Ala Leu Ser Ile Ser Arg Glu Ile Trp Ala Trp Gly acc ggt tgt cag ttg ggt ctc att acc act gcc ttc cca gtg aca aag 696 Thr Gly Cys Gln Leu Gly Leu Ile Thr Thr Ala Phe Pro Val Thr Lys ccg caa aag gta gaa cat ctt get ggg cga gtg gtg ctt caa gtt gcc 744 Pro Gln Lys Val Glu His Leu Ala Gly Arg Val Val Leu Gln Val Ala tgt ggt get ttc cac agc tta gcc ctt gta caa tgc ctc cct tcc cag 792 Cys Gly Ala Phe His Ser Leu Ala Leu Val Gln Cys Leu Pro Ser Gln gat ctg aag cca gtc cca gaa cga tgc aac cag tgc agc cag ctc ttg 840 Asp Leu Lys Pro Val Pro Glu Arg Cys Asn Gln Cys Ser Gln Leu Leu att act atg act gac aaa gaa gac cat gtg att ata tca gac.agt cat 888 Ile Thr Met Thr Asp Lys Glu Asp His Val Ile Ile Ser Asp Ser His tgt tgc cca tta ggt gtg aca ctg aca gaa tct cag gca gaa aac cat 936 Cys Cys Pro Leu Gly Val Thr Leu Thr Glu Ser Gln Ala Glu Asn His gcc agc act get ctc agc ccc tcc act gaa acc ctt gac agg cag gaa 984 Ala Ser Thr Ala Leu Ser Pro Ser Thr Glu Thr Leu Asp Arg Gln Glu gaa gta ttt gag aac act ctt gta gca aat gat cag tct gtt get act 1032 Glu Val Phe Glu Asn Thr Leu Val Ala Asn Asp Gln Ser Val Ala Thr gaa ctg aat gca gta agt get cag atc aca agc agc gat gcc atg tcc 1080 Glu Leu Asn Ala Val Ser Ala Gln Ile Thr Ser Ser Asp Ala Met Ser tct caa caa aat gtc atg gga aca act gaa att tcc tct gcc aga aac 1128 Ser Gln Gln Asn Val Met Gly Thr Thr Glu Ile Ser Ser Ala Arg Asn ata cca tca tac cct gac acc caa gca gtc aat gaa tac cta cgg aaa 1176 SUBSTITUTE SHEET (RULE 26) Ile Pro Ser Tyr Pro Asp Thr Gln Ala Val Asn Glu Tyr Leu Arg Lys ctg tca gat cat tca gta aga gag gac tca gag cat ggt gaa aag cca 1224 Leu Ser Asp His Ser Val Arg Glu Asp Ser Glu His Gly Glu Lys Pro atg cca tct cag cct ctt tta gaa gaa gca att cct aat ctc cac agc 1272 Met Pro Ser Gln Pro Leu Leu Glu Glu Ala Ile Pro Asn Leu His Ser ccg cct acc aca agc acc tca gcc cta aac agc ctg gtg gtc tct tgt 1320 Pro Pro Thr Thr Ser Thr Ser Ala Leu Asn Ser Leu Val Val Ser Cys gca tct get gtt ggt gtg aga gtg get get act tat gaa get ggt gcc 1368 Ala Ser Ala Val Gly Val Arg Val Ala Ala Thr Tyr Glu Ala Gly Ala ttg tca ctg aag aaa gtt atg aac ttt tat agt aca acc cct tgt gaa 1416 Leu Ser Leu Lys Lys Val Met Asn Phe Tyr Ser Thr Thr Pro Cys Glu act gga get cag gca ggc agt agt gcc att ggc ccc gaa ggt ttg aaa 1464 Thr Gly Ala Gln Ala Gly Ser Ser Ala Ile Gly Pro Glu Gly Leu Lys gat agc agg gaa gaa cag gtt aaa cag gaa tca atg caa gga aag aaa 1512 Asp Ser Arg Glu Glu Gln Val Lys Gln Glu Ser Met Gln Gly Lys Lys agt tca agt ctt gtg gat atc aga gaa gaa gaa aca gag gga ggc agt 1560 Ser Ser Ser Leu Val Asp Ile Arg Glu Glu Glu Thr Glu Gly Gly Ser cga aga ctc tcc ctc cct gga ttg ttg tca caa gtt tcc ccc agg ctc 1608 Arg Arg Leu Ser Leu Pro Gly Leu Leu Ser Gln Val Ser Pro Arg Leu tta aga aag get gca cgg gtg aaa acg agg aca gtg gtt ctg acc ccc 1656 Leu Arg Lys Ala Ala Arg Val Lys Thr Arg Thr Val Val Leu Thr Pro aca tac agt gga gaa gca gat gcg ctc ctg cct tct ctg aga aca gaa 1704 Thr Tyr Ser Gly Glu Ala Asp Ala Leu Leu Pro Ser Leu Arg Thr Glu gtg tgg acc tgg ggg aaa ggg aag gaa ggg cag ctg ggg cac ggc gat 1752 Val Trp Thr Trp Gly Lys Gly Lys Glu Gly Gln Leu Gly His Gly Asp gtt ctg cct agg ctt caa ccg ttg tgt gta aaa tgt ctg gat ggc aaa 1800 Val Leu Pro Arg Leu Gln Pro Leu Cys Val Lys Cys Leu Asp Gly Lys gaa gta atc cat ctg gag gca ggt ggt tac cat tct ctt gca ctt act 1848 Glu Val Ile His Leu Glu Ala Gly Gly Tyr His Ser Leu Ala Leu Thr gcg aaa tcc cag gtt tac tca tgg ggt agc aat acc ttt ggt caa ctt 1896 Ala Lys Ser Gln Val Tyr Ser Trp Gly Ser Asn Thr Phe Gly Gln Leu SUBSTITUTE SHEET (RULE 26) ggg cat tcc gat ttt cca aca aca gtt cct cgt ctt gca aag ata agc 1944 Gly His Ser Asp Phe Pro Thr Thr Val Pro Arg Leu Ala Lys Ile Ser agt gaa aat gga gtc tgg agc ata get gca ggc agg gat tat tcc ctg 1992 Ser Glu Asn Gly Val Trp Ser Ile Ala Ala Gly Arg Asp Tyr Ser Leu ttt tta gtg gat aca gaa gac ttc cag cct ggg tta tat tac agt ggc 2040 Phe Leu Val Asp Thr Glu Asp Phe Gln Pro Gly Leu Tyr Tyr Ser Gly cga cag gac cct aca gaa ggt gac aac ctt cca gag aat cac agt ggt 2088 Arg Gln Asp Pro Thr Glu Gly Asp Asn Leu Pro Glu Asn His Ser Gly tct aag act cca gta ctt ctc tcc tgt agt aag ctt gga tat ata agc 2136 Ser Lys Thr Pro Val Leu Leu Ser Cys Ser Lys Leu Gly Tyr Ile Ser aga gtg aca gca gga aaa gat agc tat tta gcc ttg gtg gat aaa aac 2184 Arg Val Thr Ala Gly Lys Asp Ser Tyr Leu Ala Leu Val Asp Lys Asn att atg ggg tat att gcc agt ctc cac gag tta get act aca gaa aga 2232 Ile Met Gly Tyr Ile Ala Ser Leu His Glu Leu Ala Thr Thr Glu Arg cga ttc tat tca aaa cta agt gat atc aaa tct cag att ctc agg cct 2280 Arg Phe Tyr Ser Lys Leu Ser Asp Ile Lys Ser Gln Ile Leu Arg Pro ctt ctc agt tta gaa aat ttg ggc act aca act aca gtc cag ctg ttg 2328 Leu Leu Ser Leu Glu Asn Leu Gly Thr Thr Thr Thr Val Gln Leu Leu cag gag gtg get agc cga ttc agc aag ctg tgt tac ctc att ggt cag 2376 Gln Glu Val Ala Ser Arg Phe Ser Lys Leu Cys Tyr Leu Ile Gly Gln cat gga gcc tca ttg agc agc ttc ctt cat ggg gta aag gaa gcc agg 2424 His Gly Ala Ser Leu Ser Ser Phe Leu His Gly Val Lys Glu Ala Arg agt ttg gtc atc ctg aag cat tca agt ctc ttc ttg gat agt tat aca 2472 Ser Leu Val Ile Leu Lys His Ser Ser Leu Phe Leu Asp Ser Tyr Thr gag tat tgc aca tct att aca aat ttc ctg gtt atg gga gga ttc cag 2520 Glu Tyr Cys Thr Ser Ile Thr Asn Phe Leu Val Met Gly Gly Phe Gln ctt ctt get aag cct gcc att gat ttc cta aat aaa aac caa gag ctg 2568 Leu Leu Ala Lys Pro Ala Ile Asp Phe Leu Asn Lys Asn Gln Glu Leu ttg caa gat ttg tca gaa gtg aat gac gaa aac act cag ttg atg gaa 2616 Leu Gln Asp Leu Ser Glu Val Asn Asp Glu Asn Thr Gln Leu Met Glu ata ctg aat act ttg ttt ttc ttg cca atc aga cga ctt cat aat tac 2664 SUBSTITUTE SHEET (RULE 26) Ile Leu Asn Thr Leu Phe Phe Leu Pro Ile Arg Arg Leu His Asn Tyr gca aaa gtt ttg cta aag ctt get act tgt ttt gaa gtg gca tct cca 2712 Ala Lys Val Leu Leu Lys Leu Ala Thr Cys Phe Glu Val Ala Ser Pro gaa tat cag aaa ctg cag gat tcc agt tct tgt tat gag tgt ctt get 2760 Glu Tyr Gln Lys Leu Gln Asp Ser Ser Ser Cys Tyr Glu Cys Leu Ala ctc cat ctc ggc agg aaa agg aag gaa gca gaa tac aca ctg ggc ttc 2808 Leu His Leu Gly Arg Lys Arg Lys Glu Ala Glu Tyr Thr Leu Gly Phe tgg aag acc ttc ccc gga aaa atg acg gat tcc ttg agg aag cca gag 2856 Trp Lys Thr Phe Pro Gly Lys Met Thr Asp Ser Leu Arg Lys Pro Glu cgt cga ctg ctg tgt gag agt agt aac cga gcc ctg tct ctg cag cat 2904 Arg Arg Leu Leu Cys Glu Ser Ser Asn Arg Ala Leu Ser Leu Gln His get ggg agg ttt tcc gtg aat tgg ttc att ctc ttt aat gat gcc ctg 2952 Ala Gly Arg Phe Ser Val Asn Trp Phe Ile Leu Phe Asn Asp Ala Leu gtc cat gcc cag ttc tcc acg cac cat gtt ttc cct ctg gcc acg ctg 3000 Val His Ala Gln Phe Ser Thr His His Val Phe Pro Leu Ala Thr Leu tgg gca gag cca ctg tct gaa gaa get ggt ggt gtg aat ggc tta aag 3048 Trp Ala Glu Pro Leu Ser Glu Glu Ala Gly Gly Val Asn Gly Leu Lys ata act aca cct gag gag cag ttc act ctc att tca tct aca ccc cag 3096 Ile Thr Thr Pro Glu Glu Gln Phe Thr Leu Ile Ser Ser Thr Pro Gln gaa aag aca aag tgg cta cga get ata agc caa gcc gta gat cag get 3144 Glu Lys Thr Lys Trp Leu Arg Ala Ile Ser Gln Ala Val Asp Gln Ala ttg aga ggg atg tct gat ctc ccc cct tat gga agt ggt agc agt gtt 3192 Leu Arg Gly Met Ser Asp Leu Pro Pro Tyr Gly Ser Gly Ser Ser Val cag aga cag gaa cca ccc att tca cgc agt gcc aaa tat act ttc tac 3240 Gln Arg Gln Glu Pro Pro Ile Ser Arg Ser Ala Lys Tyr Thr Phe Tyr aag gat cct cgc cta aag gat gcc acc tat gat gga cgc tgg ctt tca 3288 Lys Asp Pro Arg Leu Lys Asp Ala Thr Tyr Asp Gly Arg Trp Leu Ser ggg aag cct cat ggc aga ggg gtt ttg aag tgg cct gat gga aag atg 3336 Gly Lys Pro His Gly Arg Gly Val Leu Lys Trp Pro Asp Gly Lys Met tat tct ggc atg ttc agg aat ggc ttg gaa gat ggg tat gga gaa tac 3384 Tyr Ser Gly Met Phe Arg Asn Gly Leu Glu Asp Gly Tyr Gly Glu Tyr SUBSTITUTE SHEET (RULE 26) aga atc cca aac aag gca atg aac aaa gaa gac cat tat gtg ggc cat 3432 Arg Ile Pro Asn Lys Ala Met Asn Lys Glu Asp His Tyr Val Gly His tgg aaa gaa gga aaa atg tgc ggt caa gga gtc tac agc tat get tct 3480 Trp Lys Glu Gly Lys Met Cys Gly Gln Gly Val Tyr Ser Tyr Ala Ser ggt gaa gta ttt gag ggc tgt ttt caa gat aat atg cgt cat ggt cat 3528 Gly Glu Val Phe Glu Gly Cys Phe Gln Asp Asn Met Arg His Gly His ggt ctt cta cga agt ggg aaa ttg acg tcc tct tct cct agt atg ttc 3576 Gly Leu Leu Arg Ser Gly Lys Leu Thr Ser Ser Ser Pro Ser Met Phe att ggc cag tgg gta atg gat aag aaa gca gga tat ggt gtc ttt gat 3624 Ile Gly Gln Trp Val Met Asp Lys Lys Ala Gly Tyr Gly Val Phe Asp gat atc act agg ggg gaa aag tat atg gga atg tgg caa gat gat gtg 3672 Asp Ile Thr Arg Gly Glu Lys Tyr Met Gly Met Trp Gln Asp Asp Val tgt caa ggg aat ggt gtg gtg gtt acc cag ttt gga tta tac tac gag 3720 Cys Gln Gly Asn Gly Val Val Val Thr Gln Phe Gly Leu Tyr Tyr Glu ggc aac ttt cac ctt aat aaa atg atg gga aat ggg gtt ttg ctt tcc 3768 Gly Asn Phe His Leu Asn Lys Met Met Gly Asn Gly Val Leu Leu Ser gaa gat gat act atc tat gaa gga gaa ttt tca gat gac tgg act ctt 3816 Glu Asp Asp Thr Ile Tyr Glu Gly Glu Phe Ser Asp Asp Trp Thr Leu agt gga aag gga aca ctg act atg cca aat gga gac tac att gaa ggt 3864 Ser Gly Lys Gly Thr Leu Thr Met Pro Asn Gly Asp Tyr Ile Glu Gly tat ttt agt gga gaa tgg gga tct ggg ata aaa atc act gga acc tac 3912 Tyr Phe Ser Gly Glu Trp Gly Ser Gly Ile Lys Ile Thr Gly Thr Tyr ttc aaa cct agt cta tat gag agt gat aaa gac aga cct aaa gtt ttc 3960 Phe Lys Pro Ser Leu Tyr Glu Ser Asp Lys Asp Arg Pro Lys Val Phe agg aag cta gga aac ctg gca gtg cca get gat gag aag tgg aaa gcg 4008 Arg Lys Leu Gly Asn Leu Ala Val Pro Ala Asp Glu Lys Trp Lys Ala gtg ttt gac gaa tgt tgg cgc caa ctg ggc tgt gag ggc cca ggc caa 4056 Val Phe Asp Glu Cys Trp Arg Gln Leu Gly Cys Glu Gly Pro Gly Gln ggg gaa gtt tgg aaa gca tgg gac aat att get gtg gcc ttg acc acc 4104 Gly Glu Val Trp Lys Ala Trp Asp Asn Ile Ala Val Ala Leu Thr Thr agt cgg cgc cag cac aga gac agt cca gaa ata ttg agt cgt tca cag 4152 SUBSTITUTE SHEET (RULE 26) Ser Arg Arg Gln His Arg Asp Ser Pro Glu Lle Leu Ser Arg Ser Gln act cag aca cta gag agt ttg gaa ttc att cca cag cat gtt ggt gcc 4200 Thr Gln Thr Leu Glu Ser Leu Glu Phe Ile Pro Gln His Val Gly Ala ttc tct gtg gag aaa tat gat gac atc agg aaa tat tta ata aag gcc 4248 Phe Ser Val Glu Lys Tyr Asp Asp Ile Arg Lys Tyr Leu Ile Lys Ala tgt gac act cct ctg cac ccc ctg ggc agg ctt gtg gag aca ctg gtt 4296 Cys Asp Thr Pro Leu His Pro Leu Gly Arg Leu Val Glu Thr Leu Val gca gtg tat aga atg aca tac gtg ggc gta gga gcc aac cgc agg tta 4344 Ala Val Tyr Arg Met Thr Tyr Val Gly Val Gly Ala Asn Arg Arg Leu ttg cag gag get gta aag gag att aag tcc tat ctt aag cga att ttc 4392 Leu Gln Glu Ala Val Lys Glu Ile Lys Ser Tyr Leu Lys Arg Ile Phe cag ctg gtg agg ttc tta ttt cct gag ctg cct gaa gaa ggc agc aca 4440 Gln Leu Val Arg Phe Leu Phe Pro Glu Leu Pro Glu Glu Gly Ser Thr att cct ctc tct get cct ctg cca acc gaa agg aag tct ttt tgc act 4488 Ile Pro Leu Ser Ala Pro Leu Pro Thr Glu Arg Lys Ser Phe Cys Thr ggg aag tca gat tcc cga tct gaa tca cca gag cca ggt tat gta gta 4536 Gly Lys Ser Asp Ser Arg Ser Glu Ser Pro Glu Pro Gly Tyr Val Val acg agt tct gga tta ttg ctt cct gtg ctg cta cct cgg ctc tac cca 4584 Thr Ser Ser Gly Leu Leu Leu Pro Val Leu Leu Pro Arg Leu Tyr Pro ccg ctg ttt atg ctt tat get ttg gat aat gat cgc gag gaa gac att 4632 Pro Leu Phe Met Leu Tyr Ala Leu Asp Asn Asp Arg Glu Glu Asp Ile tac tgg gaa tgt gtc ctt cga cta aat aag cag cca gat att get ctc 4680 Tyr Trp Glu Cys Val Leu Arg Leu Asn Lys Gln Pro Asp Ile Ala Leu ctg ggc ttt ctt ggg gtg cag agg aaa ttt tgg cca gca acc ttg tca 4728 Leu Gly Phe Leu Gly Val Gln Arg Lys Phe Trp Pro Ala Thr Leu Ser atc ctt gga gag agt aaa aag gtt ttg cca acc acg aaa gat get tgt 4776 Ile Leu Gly Glu Ser Lys Lys Val Leu Pro Thr Thr Lys Asp Ala Cys ttt gcc tca gca gta gaa tgt ctg cag cag atc agc aca aca ttt acc 4824 Phe Ala Ser Ala Val Glu Cys Leu Gln Gln Ile Ser Thr Thr Phe Thr cca tca gac aaa ctt aag gtc atc cag cag act ttt gag gag atc tct 4872 Pro Ser Asp Lys Leu Lys Val Ile Gln Gln Thr Phe Glu Glu Ile Ser SUBSTITUTE SHEET (RULE 26) cag agt gtc ctg gcg tca ctc cac gaa gac ttc ttg tgg tcc atg gat 4920 Gln Ser Val Leu Ala Ser Leu His Glu Asp Phe Leu Trp Ser Met Asp gac ttg ttt cct gtt ttc tta tat gtg gtg cta cgg gcc agg att agg 4968 Asp Leu Phe Pro Val Phe Leu Tyr Val Val Leu Arg Ala Arg Ile Arg aat tta ggc tct gag gta cac ctc att gag gat cta atg gac ccc tat 5016 Asn Leu Gly Ser Glu Val His Leu Ile Glu Asp Leu Met Asp Pro Tyr ctt cag cat ggg gaa cag ggt ata atg ttc acc acc ttg aag gca tgt 5064 Leu Gln His Gly Glu Gln Gly Ile Met Phe Thr Thr Leu Lys Ala Cys tac tac cag att cag cgt gag aag ctt aac tag gctgcataac agcttgaaaa 5117 Tyr Tyr Gln Ile Gln Arg Glu Lys Leu Asn ctggattatc tactacagag tgttataaca ccatctggag tcttcctgta gtggcaaaaa 5177 agaacagtgt tgaaattgga aaggactttg tgttatttag gttgttagaa tgagccttac 5237 caataataag agccctgagc ccagaaaaaa ggactgtata gtttaaaggg aggattgaaa 5297 gggaggtaaa aaatcagatt agaccagttc ttggcctatg ataagttcca aaaataccat 5357 ttatctacta tttgaaaaaa gaagaggata tcccttccta cagtaaaggg tatgtcagct 5417 acatgaagtt gtaagaaaag cttccagtag agcttcttat attaaagaag ttgatggata 5477 tttttgaatt tctggtttgc ctgaatccac ctgcagttac cccgatccgt ttgcaagaac 5537 cagatcgtac ttgaaactat agtggccaca ctctgccttc ctgagtccct tccagtcatg 5597 tgtgcatcat gtctctttgc caagggaggg gagaaaggaa cttttaaact gcagttttaa 5657 ctttttctaa gctgtttctt gatgggagag gttctgtgca aaactaccac attctgtccc 5717 caaaatgtgg aatgcatcca aataggagtc ttctgcctct taacttaaaa gaacatagga 5777 attttgtttt tggtttcttt atcatgctac agagagtgaa tacactggaa ttcagacacc 5837 gactctgagc tgctaggaac ctcatttgtc catgtgcaaa cgctgtattc caaggcctgt 5897 gaatggcagc ctgaggaagt tttgcatgca ggctgtgttt tcgagcagga ctaacaactg 5957 ggaaataagc aaaaaactgc atcgatcccc agcctggtgt tgttcttccc tatacttcac 6017 actgaactca ggatgggaag aaaaaggaaa caagctttgg ctttttccat ctcaaaagta 6077 ttgtggcacc tcaacatttc agtgttttgc tttttaaaaa atgccctatt gtaagttgtt 6137 ggtttatact gtataagtaa cactagtagc tgttttgaat aacataggtg ctcttcctca 6197 tctcatctcc tacaccgtgg tgagcataca gagtgtcctg atttgtgtta agtgactgag 6257 aagatgttaa ttacttttga aaaaggatca tggtttttgc tctactttat aatcaagaca 6317 agtgtttatt aaaatactgt tttggaatgt tggctgtaat gtaacagcaa ttttcataat 6377 aaaaggcatt catcttt 6394 <210> 2 <211> 1657 <212> PRT
<213> Homo Sapiens <400> 2 Met Asp Ser Lys Lys Arg Ser Ser Thr Glu Ala Glu Gly Ser Lys Glu Arg Gly Leu Val His Ile Trp Gln Ala Gly Ser Phe Pro Ile Thr Pro Glu Arg Leu Pro Gly Trp Gly Gly Lys Thr Val Leu Gln Ala Ala Leu Gly Val Lys His Gly Val Leu Leu Thr Glu Asp Gly Glu Val Tyr Ser Phe Gly Thr Leu Pro Trp Arg Ser Gly Pro Val Glu Ile Cys Pro Ser Ser Pro Ile Leu Glu Asn Ala Leu Val Gly Gln Tyr Val Ile Thr Val SUBSTITUTE SHEET (RULE 26) Ala Thr Gly Ser Phe His Ser Gly Ala Val Thr Asp Asn Gly Val Ala Tyr Met Trp Gly Glu Asn Ser Ala Gly Gln Cys Ala Val Ala Asn Gln Gln Tyr Val Pro Glu Pro Asn Pro Val Ser Ile Ala Asp Ser Glu Ala Ser Pro Leu Leu Ala Val Arg Ile Leu Gln Leu Ala Cys Gly Glu Glu His Thr Leu Ala Leu Ser Ile Ser Arg Glu Ile Trp Ala Trp Gly Thr Gly Cys Gln Leu Gly Leu Ile Thr Thr Ala Phe Pro Val Thr Lys Pro Gln Lys Val Glu His Leu Ala Gly Arg Val Val Leu Gln Val Ala Cys Gly Ala Phe His Ser Leu Ala Leu Val Gln Cys Leu Pro Ser Gln Asp Leu Lys Pro Val Pro Glu Arg Cys Asn Gln Cys Ser Gln Leu Leu Ile Thr Met Thr Asp Lys Glu Asp His Val Ile Ile Ser Asp Ser His Cys Cys Pro Leu Gly Val Thr Leu Thr Glu Ser Gln Ala Glu Asn His Ala Ser Thr Ala Leu Ser Pro Ser Thr Glu Thr Leu Asp Arg Gln Glu Glu Val Phe Glu Asn Thr Leu Val Ala Asn Asp Gln Ser Val Ala Thr Glu Leu Asn Ala Val Ser Ala Gln Ile Thr Ser Ser Asp Ala Met Ser Ser Gln Gln Asn Val Met Gly Thr Thr Glu Ile Ser Ser Ala Arg Asn Ile Pro Ser Tyr Pro Asp Thr Gln Ala Val Asn Glu Tyr Leu Arg Lys Leu Ser Asp His Ser Val Arg Glu Asp Ser Glu His Gly Glu Lys Pro Met Pro Ser Gln Pro Leu Leu Glu Glu Ala Ile Pro Asn Leu His Ser Pro Pro Thr Thr Ser Thr Ser Ala Leu Asn Ser Leu Val Val Ser Cys Ala Ser Ala Val Gly Val Arg Val Ala Ala Thr Tyr Glu Ala Gly Ala Leu Ser Leu Lys Lys Val Met Asn Phe Tyr Ser Thr Thr Pro Cys Glu Thr Gly Ala Gln Ala Gly Ser Ser Ala Ile Gly Pro Glu Gly Leu Lys Asp Ser Arg Glu Glu Gln Val Lys Gln Glu Ser Met Gln Gly Lys Lys Ser Ser Ser Leu Val Asp Ile Arg Glu Glu Glu Thr Glu Gly Gly Ser Arg Arg Leu Ser Leu Pro Gly Leu Leu Ser Gln Val Ser Pro Arg Leu Leu Arg Lys Ala Ala Arg Val Lys Thr Arg Thr Val Val Leu Thr Pro Thr Tyr Ser Gly Glu Ala Asp Ala Leu Leu Pro Ser Leu Arg Thr Glu Val Trp Thr Trp Gly Lys Gly Lys Glu Gly Gln Leu Gly His Gly Asp Val Leu Pro Arg Leu Gln Pro Leu Cys Val Lys Cys Leu Asp Gly Lys Glu Val Ile His Leu Glu Ala Gly Gly Tyr His Ser Leu Ala Leu Thr Ala Lys Ser Gln Val Tyr Ser Trp Gly Ser Asn Thr Phe Gly Gln Leu Gly SUBSTITUTE SHEET (RULE 26) His Ser Asp Phe Pro Thr Thr Val Pro Arg Leu Ala Lys Ile Ser Ser Glu Asn Gly Val Trp Ser Ile Ala Ala Gly Arg Asp Tyr Ser Leu Phe Leu Val Asp Thr Glu Asp Phe Gln Pro Gly Leu Tyr Tyr Ser Gly Arg Gln Asp Pro Thr Glu Gly Asp Asn Leu Pro Glu Asn His Ser Gly Ser Lys Thr Pro Val Leu Leu Ser Cys Ser Lys Leu Gly Tyr Ile Ser Arg Val Thr Ala Gly Lys Asp Ser Tyr Leu Ala Leu Val Asp Lys Asn Ile Met Gly Tyr Ile Ala Ser Leu His Glu Leu Ala Thr Thr Glu Arg Arg Phe Tyr Ser Lys Leu Ser Asp Ile Lys Ser Gln Ile Leu Arg Pro Leu Leu Ser Leu Glu Asn Leu Gly Thr Thr Thr Thr Val Gln Leu Leu Gln Glu Val Ala Ser Arg Phe Ser Lys Leu Cys Tyr Leu Ile Gly Gln His Gly Ala Ser Leu Ser Ser Phe Leu His Gly Val Lys Glu Ala Arg Ser Leu Val Ile Leu Lys His Ser Ser Leu Phe Leu Asp Ser Tyr Thr Glu Tyr Cys Thr Ser Ile Thr Asn Phe Leu Val Met Gly Gly Phe Gln Leu Leu Ala Lys Pro Ala Ile Asp Phe Leu Asn Lys Asn Gln Glu Leu Leu Gln Asp Leu Ser Glu Val Asn Asp Glu Asn Thr Gln Leu Met Glu Ile Leu Asn Thr Leu Phe Phe Leu Pro Ile Arg Arg Leu His Asn Tyr Ala Lys Val Leu Leu Lys Leu Ala Thr Cys Phe Glu Val Ala Ser Pro Glu Tyr Gln Lys Leu Gln Asp Ser Ser Ser Cys Tyr Glu Cys Leu Ala Leu His Leu Gly Arg Lys Arg Lys Glu Ala Glu Tyr Thr Leu Gly Phe Trp Lys Thr Phe Pro Gly Lys Met Thr Asp Ser Leu Arg Lys Pro Glu Arg Arg Leu Leu Cys Glu Ser Ser Asn Arg Ala Leu Ser Leu Gln His Ala Gly Arg Phe Ser Val Asn Trp Phe Ile Leu Phe Asn Asp Ala Leu Val His Ala Gln Phe Ser Thr His His Val Phe Pro Leu Ala Thr Leu Trp Ala Glu Pro Leu Ser Glu Glu Ala Gly Gly Val Asn Gly Leu Lys Ile Thr Thr Pro Glu Glu Gln Phe Thr Leu Ile Ser Ser Thr Pro Gln Glu Lys Thr Lys Trp Leu Arg Ala Ile Ser Gln Ala Val Asp Gln Ala Leu Arg Gly Met Ser Asp Leu Pro Pro Tyr Gly Ser Gly Ser Ser Val Gln Arg Gln Glu Pro Pro Ile Ser Arg Ser Ala Lys Tyr Thr Phe Tyr Lys Asp Pro Arg Leu Lys Asp Ala Thr Tyr Asp Gly Arg Trp Leu Ser Gly Lys Pro His Gly Arg Gly Val Leu Lys Trp Pro Asp Gly Lys Met Tyr Ser Gly Met Phe Arg Asn Gly Leu Glu Asp Gly Tyr Gly Glu Tyr Arg SUBSTITUTE SHEET (RULE 26) Ile Pro Asn Lys Ala Met Asn Lys Glu Asp His Tyr Val Gly His Trp Lys Glu Gly Lys Met Cys Gly Gln Gly Val Tyr Ser Tyr Ala Ser Gly Glu Val Phe Glu Gly Cys Phe Gln Asp Asn Met Arg His Gly His Gly Leu Leu Arg Ser Gly Lys Leu Thr Ser Ser Ser Pro Ser Met Phe Ile Gly Gln Trp Val Met Asp Lys Lys Ala Gly Tyr Gly Val Phe Asp Asp Ile Thr Arg Gly Glu Lys Tyr Met Gly Met Trp Gln Asp Asp Val Cys Gln Gly Asn Gly Val Val Val Thr Gln Phe Gly Leu Tyr Tyr Glu Gly Asn Phe His Leu Asn Lys Met Met Gly Asn Gly Val Leu Leu Ser Glu Asp Asp Thr Ile Tyr Glu Gly Glu Phe Ser Asp Asp Trp Thr Leu Ser Gly Lys Gly Thr Leu Thr Met Pro Asn Gly Asp Tyr Ile Glu Gly Tyr Phe Ser Gly Glu Trp Gly Ser Gly Ile Lys Ile Thr Gly Thr Tyr Phe Lys Pro Ser Leu Tyr Glu Ser Asp Lys Asp Arg Pro Lys Val Phe Arg Lys Leu Gly Asn Leu Ala Val Pro Ala Asp Glu Lys Trp Lys Ala Val Phe Asp Glu Cys Trp Arg Gln Leu Gly Cys Glu Gly Pro Gly Gln Gly Glu Val Trp Lys Ala Trp Asp Asn Ile Ala Val Ala Leu Thr Thr Ser Arg Arg Gln His Arg Asp Ser Pro Glu Ile Leu Ser Arg Ser Gln Thr Gln Thr Leu Glu Ser Leu Glu Phe Ile Pro Gln His Val Gly Ala Phe Ser Val Glu Lys Tyr Asp Asp Ile Arg Lys Tyr Leu Ile Lys Ala Cys Asp Thr Pro Leu His Pro Leu Gly Arg Leu Val Glu Thr Leu Val Ala Val Tyr Arg Met Thr Tyr Val Gly Val Gly Ala Asn Arg Arg Leu Leu Gln Glu Ala Val Lys Glu Ile Lys Ser Tyr Leu Lys Arg Ile Phe Gln Leu Val Arg Phe Leu Phe Pro Glu Leu Pro Glu Glu Gly Ser Thr Ile Pro Leu Ser Ala Pro Leu Pro Thr Glu Arg Lys Ser Phe Cys Thr Gly Lys Ser Asp Ser Arg Ser Glu Ser Pro Glu Pro Gly Tyr Val Val Thr Ser Ser Gly Leu Leu Leu Pro Val Leu Leu Pro Arg Leu Tyr Pro Pro Leu Phe Met Leu Tyr Ala Leu Asp Asn Asp Arg Glu Glu Asp Ile Tyr Trp Glu Cys Val Leu Arg Leu Asn Lys Gln Pro Asp Ile Ala Leu Leu Gly Phe Leu Gly Val Gln Arg Lys Phe Trp Pro Ala Thr Leu Ser Ile Leu Gly Glu Ser Lys Lys Val Leu Pro Thr Thr Lys Asp Ala Cys Phe Ala Ser Ala Val Glu Cys Leu Gln Gln Ile Ser Thr Thr Phe Thr Pro Ser Asp Lys Leu Lys Val Ile Gln Gln Thr Phe Glu Glu Ile Ser Gln SUBSTITUTE SHEET (RULE 26) Ser Val Leu Ala Ser Leu His Glu Asp Phe Leu Trp Ser Met Asp Asp Leu Phe Pro Val Phe Leu Tyr Val Val Leu Arg Ala Arg Ile Arg Asn Leu Gly Ser Glu Val His Leu Ile Glu Asp Leu Met Asp Pro Tyr Leu Gln His Gly Glu Gln Gly Ile Met Phe Thr Thr Leu Lys Ala Cys Tyr Tyr Gln Ile Gln Arg Glu Lys Leu Asn <210> 3 <211> 49 <212> PRT
<213> Homo Sapiens <400> 3 Met Asp Ser Lys Lys Arg Ser Ser Thr Glu Ala Glu Gly Ser Lys Glu Arg Gly Leu Val His Ile Trp Gln Ala Gly Ser Phe Pro Ile Thr Pro Glu Arg Leu Pro Gly Trp Gly Gly Lys Thr Val Leu Gln Ala Pro Ser Glu <210> 4 <211> 6349 <212> DNA
<213> Mus musculus <220>
<221> CDS
<222> (124)..(5079) <400> 4 ccacgcgtcc ggcggtgcag tcgggctcgc gccgggagaa gagcgcggag ctgcgggagc 60 gtcaggtctt gagagagctt ttgctaatgg gatggtttgg tgatggagta ctcctcctga 120 ccg atg gac tca aag aag aaa agc tca aca gag gca gaa gga tcc aaa 168 Met Asp Ser Lys Lys Lys Ser Ser Thr Glu Ala Glu Gly Ser Lys gaa aga ggc cta gtc cat gtc tgg cag gca gga tcc ttt tct cta aca 216 Glu Arg Gly Leu Val His Val Trp Gln Ala Gly Ser Phe Ser Leu Thr cca gag agg ttg cca ggc tgg ggt gga aag aca gtt ctt cag gca gcc 264 Pro Glu Arg Leu Pro Gly Trp Gly Gly Lys Thr Val Leu Gln Ala Ala ctt ggt gtg agg cat gga gtt ctt ctg act gaa gat ggt gag gtc tac 312 Leu Gly Val Arg His Gly Val Leu Leu Thr Glu Asp Gly Glu Val Tyr agc ttt ggg act ctt ccc tgg aaa agt gaa tca gca gaa att tgt cca 360 Ser Phe Gly Thr Leu Pro Trp Lys Ser Glu Ser Ala Glu Ile Cys Pro agc agc ccc ctt cta gaa agt gcc ctg gtt ggg cat cat gtt att act 408 Ser Ser Pro Leu Leu Glu Ser Ala Leu Val Gly His His Val Ile Thr SUBSTITUTE SHEET (RULE 26) gtg gca aca ggg agc ttc cac agt gga gca gtg aca gag agc ggg gtg 456 Val Ala Thr Gly Ser Phe His Ser Gly Ala Val Thr Glu Ser Gly Val gtg tac atg tgg gga gag aat get gcc ggg cag tgt gcg gta get aac 504 Val Tyr Met Trp Gly Glu Asn Ala Ala Gly Gln Cys Ala Val Ala Asn cag cag tat gtg ccg gag ccg agt cct gtc agc att tct gac tcg gag 552 Gln Gln Tyr Val Pro Glu Pro Ser Pro Val Ser Ile Ser Asp Ser Glu acc agc ccg tca tta gca gtt agg att ctg caa ttg gca tgt ggc gag 600 Thr Ser Pro Ser Leu Ala Val Arg Ile Leu Gln Leu Ala Cys Gly Glu gaa cac aca ctg gca ttg tca ctc agc aga gag atc tgg gca tgg ggc 648 Glu His Thr Leu Ala Leu Ser Leu Ser Arg Glu Ile Trp Ala Trp Gly acc ggc tgt cag ctg ggc ctc atc acc acc act ttc cca gtg aca aag 696 Thr Gly Cys Gln Leu Gly Leu Ile Thr Thr Thr Phe Pro Val Thr Lys cca cag aag gtg gaa cac ctt get gga cga gtg gtg ctc cag gtg gcc 744 Pro Gln Lys Val Glu His Leu Ala Gly Arg Val Val Leu Gln Val Ala tgc ggt gca ttc cac agc ctt gca ctt gtg cag tgc ctc cct cct cag 792 Cys Gly Ala Phe His Ser Leu Ala Leu Val Gln Cys Leu Pro Pro Gln gat ctg aag cca gtc cca gag aga tgc aat cag tgc agc cag ctg ctc 840 Asp Leu Lys Pro Val Pro Glu Arg Cys Asn Gln Cys Ser Gln Leu Leu atc acc atg aca gac aaa gag gac cat gtg ata ata tcg gac agc cat 888 Ile Thr Met Thr Asp Lys Glu Asp His Val Ile Ile Ser Asp Ser His tgc tgc cct tta ggt gtg aca ttg tcc gag tct caa gca gaa aag cat 936 Cys Cys Pro Leu Gly Val Thr Leu Ser Glu Ser Gln Ala Glu Lys His gcc agc cct get ccc agc cct cac cca gag gca ctg gat gag cag gga 984 Ala Ser Pro Ala Pro Ser Pro His Pro Glu Ala Leu Asp Glu Gln Gly gag gtg ttt gag aac acg gtg gta gaa get gaa ctg aac atg gga agc 1032 Glu Val Phe Glu Asn Thr Val Val Glu Ala Glu Leu Asn Met Gly Ser agt cag acc aca agt ggc agt gcc att tcc acc cag cag aac atc gtg 1080 Ser Gln Thr Thr Ser Gly Ser Ala Ile Ser Thr Gln Gln Asn Ile Val gga aca get gaa gtg tct tct gcc aga aca get ccg tca tac cca gac 1128 Gly Thr Ala Glu Val Ser Ser Ala Arg Thr Ala Pro Ser Tyr Pro Asp acc cat gcg gta act gca tac ctg cag aag ctg tca gag cat tcg atg 1176 Thr His Ala Val Thr Ala Tyr Leu Gln Lys Leu Ser Glu His Ser Met SUBSTITUTE SHEET (RULE 26) agg gag aac cat gag cct gga gaa aag cca ccc cag gtc cag cct ctt 1224 Arg Glu Asn His Glu Pro Gly Glu Lys Pro Pro Gln Val Gln Pro Leu gta gaa gaa gca gtt cct gat ctt cac agt cca cca acc aca agc acc 1272 Val Glu Glu Ala Val Pro Asp Leu His Ser Pro Pro Thr Thr Ser Thr tca gcc ctc aac agc ttg gtg gtc tcc tgt gca tct get gtt ggt gtc 1320 Ser Ala Leu Asn Ser Leu Val Val Ser Cys Ala Ser Ala Val Gly Val aga gtg get gcc acc tat gaa get ggg gcc ttg tct ctc aag aaa gtt 1368 Arg Val Ala Ala Thr Tyr Glu Ala Gly Ala Leu Ser Leu Lys Lys Val atg aac ttt tac agc act gcc ccc tgc gag acg gca get cag tcg ggc 1416 Met Asn Phe Tyr Ser Thr Ala Pro Cys Glu Thr Ala Ala Gln Ser Gly agt gcc tcc aca ggc cca gaa agt ctg aaa gat ctc cga gaa gag cag 1464 Ser Ala Ser Thr Gly Pro Glu Ser Leu Lys Asp Leu Arg Glu Glu Gln gtg aaa cag gag tca ctg caa ggg aag aaa agc tca agt ctc atg gac 1512 Val Lys Gln Glu Ser Leu Gln Gly Lys Lys Ser Ser Ser Leu Met Asp atc aga gag gaa gag tcg gag gga ggg agt cga aga ctc tcc ctc cca 1560 Ile Arg Glu Glu Glu Ser Glu Gly Gly Ser Arg Arg Leu Ser Leu Pro ggg ttg ttg tcg caa gtt tcc ccc agg ctc tta agg aag get gcg agg 1608 Gly Leu Leu Ser Gln Val Ser Pro Arg Leu Leu Arg Lys Ala Ala Arg gtg aaa act cgg aca gtg gtt ctg act ccc aca tac agt gga gaa gca 1656 Val Lys Thr Arg Thr Val Val Leu Thr Pro Thr Tyr Ser Gly Glu Ala gat gcc ctt ctg cct tcc ctg agg aca gag gtg tgg acc tgg ggg aaa 1704 Asp Ala Leu Leu Pro Ser Leu Arg Thr Glu Val Trp Thr Trp Gly Lys ggc aag gaa ggg cag cta ggg cac ggc gac gtc ctg ccc agg ctt cag 1752 Gly Lys Glu Gly Gln Leu Gly His Gly Asp Val Leu Pro Arg Leu Gln ccg ttg tgt gtc aag tgt ctg gat ggt aaa gag gta atc cac ctg gag 1800 Pro Leu Cys Val Lys Cys Leu Asp Gly Lys Glu Val Ile His Leu Glu gcg ggc ggc tcc cac tcc ctc gca ctc act gcg aaa tct cag gtt tac 1848 Ala Gly Gly Ser His Ser Leu Ala Leu Thr Ala Lys Ser Gln Val Tyr tca tgg ggc agt aat acc ttt ggt cag ctt ggg cat tct gag ttt cca 1896 Ser Trp Gly Ser Asn Thr Phe Gly Gln Leu Gly His Ser Glu Phe Pro SUBSTITUTE SHEET (RULE 26) aca acg gtt cct cga ctc tca aag gtt agc agt gaa aat gga gtc tgg 1944 Thr Thr Val Pro Arg Leu Ser Lys Val Ser Ser Glu Asn Gly Val Trp agt gta get gca ggc caa gat tat tcc ttg ttt tta gtg gac acg gaa 1992 Ser Val Ala Ala Gly Gln Asp Tyr Ser Leu Phe Leu Val Asp Thr Glu gac ttc cag cct ggg ttg tat tac agt ggc cga cag gac cgt gca gaa 2040 Asp Phe Gln Pro Gly Leu Tyr Tyr Ser Gly Arg Gln Asp Arg Ala Glu ggt gat acc ctg cca gag aat ccc agt ggt aca aag act cca gta ctt 2088 Gly Asp Thr Leu Pro Glu Asn Pro Ser Gly Thr Lys Thr Pro Val Leu ctc tcc tgt agt aag ctt gga tac ata agc aga gta aca gca gga aaa 2136 Leu Ser Cys Ser Lys Leu Gly Tyr Ile Ser Arg Val Thr Ala Gly Lys gat agc tat cta gcc ttg gtg gat aag aac atc atg gga tac atc gcc 2184 Asp Ser Tyr Leu Ala Leu Val Asp Lys Asn Ile Met Gly Tyr Ile Ala agt ctc cat gag ttg get tct aca gaa aga cgg ttt tac tca aaa ctg 2232 Ser Leu His Glu Leu Ala Ser Thr Glu Arg Arg Phe Tyr Ser Lys Leu agc gaa atc aaa tca cag ata ctt agg cct ctt ctc agt tta gaa aat 2280 Ser Glu Ile Lys Ser Gln Ile Leu Arg Pro Leu Leu Ser Leu Glu Asn ttg ggc aca gtg acc act gtc cag ctg ttg cag gaa gtt gcc agc cgg 2328 Leu Gly Thr Val Thr Thr Val Gln Leu Leu Gln Glu Val Ala Ser Arg ttc agc aag ttg tgt tac ctc att ggg cag cat gga gcc tca cta agc 2376 Phe Ser Lys Leu Cys Tyr Leu Ile Gly Gln His Gly Ala Ser Leu Ser agc tac cta cag ggt atg aag gaa gcc agc agc ctg gtc atc atg aag 2424 Ser Tyr Leu Gln Gly Met Lys Glu Ala Ser Ser Leu Val Ile Met Lys cac tca agt ctt ttc ctg gac agc tac aca gag tac tgc aca tca gtt 2472 His Ser Ser Leu Phe Leu Asp Ser Tyr Thr Glu Tyr Cys Thr Ser Val tca aat ttc ctg gtt atg gga gga ttc cag ctt ctt get aag cct gcc 2520 Ser Asn Phe Leu Val Met Gly Gly Phe Gln Leu Leu Ala Lys Pro Ala att gat ttc cta aat aaa aac caa gaa ctc ttg caa gat ttg tca gaa 2568 Ile Asp Phe Leu Asn Lys Asn Gln Glu Leu Leu Gln Asp Leu Ser Glu gtg aat gat gag aac act cag ttg atg gaa atc ctg aac atg ctg ttt 2616 Val Asn Asp Glu Asn Thr Gln Leu Met Glu Ile Leu Asn Met Leu Phe ttc ttg cca atc aga cga ctt cat aat tat gca aaa gtt ttg cta aag 2664 Phe Leu Pro Ile Arg Arg Leu His Asn Tyr Ala Lys Val Leu Leu Lys SUBSTITUTE SHEET (RULE 26) ctt gcc act tgc ttt gaa gtg aca tct cca gag tat caa aag ctg cag 2712 Leu Ala Thr Cys Phe Glu Val Thr Ser Pro Glu Tyr Gln Lys Leu Gln gat tcc agt tct tgc tat gag tct ctt get ctc cat ctt ggc aag aag 2760 Asp Ser Ser Ser Cys Tyr Glu Ser Leu Ala Leu His Leu Gly Lys Lys agg aag gaa gca gag tac aca ctg agc ttc tgg aag acc ttt cct ggg 2808 Arg Lys Glu Ala Glu Tyr Thr Leu Ser Phe Trp Lys Thr Phe Pro Gly aaa atg acg gat tcc ttg agg aag cca gag cgc cgg ctg ctg tgt gag 2856 Lys Met Thr Asp Ser Leu Arg Lys Pro Glu Arg Arg Leu Leu Cys Glu agc agt aac cga gcc ctc tcc ctg cag cat gcc ggc agg ttt tct gtg 2904 Ser Ser Asn Arg Ala Leu Ser Leu Gln His Ala Gly Arg Phe Ser Val aat tgg ttc att ctc ttc aat gat gcc ctg gtc cat get cag ttc tct 2952 Asn Trp Phe Ile Leu Phe Asn Asp Ala Leu Val His Ala Gln Phe Ser aca cac cac gtg ttc cct ttg gcc aca ctc tgg gca gag cca cta tct 3000 Thr His His Val Phe Pro Leu Ala Thr Leu Trp Ala Glu Pro Leu Ser gaa gaa get ggt agc gtg aat ggc tta aag ata act aca cct gaa gaa 3048 Glu Glu Ala Gly Ser Val Asn Gly Leu Lys Ile Thr Thr Pro Glu Glu caa ttc aca ctc att tct tca aca ccc cag gaa aag acc aag tgg ctt 3096 Gln Phe Thr Leu Ile Ser Ser Thr Pro Gln Glu Lys Thr Lys Trp Leu cgg get att agc caa get gtg gat cag get ttg agg ggg acg tcc gat 3144 Arg Ala Ile Ser Gln Ala Val Asp Gln Ala Leu Arg Gly Thr Ser Asp ttc cca ctt tac gga ggc ggc agc agt gtt cag aga cag gaa cca ccc 3192 Phe Pro Leu Tyr Gly Gly Gly Ser Ser Val Gln Arg Gln Glu Pro Pro atc tca aga agt gcc aaa tac act ttc tac aag gat act cgc cta aag 3240 Ile Ser Arg Ser Ala Lys Tyr Thr Phe Tyr Lys Asp Thr Arg Leu Lys gat gcc act tac gat ggg cgc tgg ctt tca ggg aag cct cat ggc agg 3288 Asp Ala Thr Tyr Asp Gly Arg Trp Leu Ser Gly Lys Pro His Gly Arg ggt gtt ctg aag tgg cct gat gga aag atg tac tct ggc atg ttc agg 3336 Gly Val Leu Lys Trp Pro Asp Gly Lys Met Tyr Ser Gly Met Phe Arg aat ggc ttg gaa gat gga tat ggt gaa tac aga atc cct aac aag gcc 3384 Asn Gly Leu Glu Asp Gly Tyr Gly Glu Tyr Arg Ile Pro Asn Lys Ala SUBSTITUTE SHEET (RULE 26) ctg aac aaa gaa gac cat tat gta ggc cat tgg aaa gag ggg aaa atg 3432 Leu Asn Lys Glu Asp His Tyr Val Gly His Trp Lys Glu Gly Lys Met tgt ggg caa gga gtc tac agc tat gcc tct ggt gaa gtg ttt gaa ggc 3480 Cys Gly Gln Gly Val Tyr Ser Tyr Ala Ser Gly Glu Val Phe Glu Gly tgc ttt caa gat aac atg cgc cat ggg cat ggt ctg ctc cgg agt gga 3528 Cys Phe Gln Asp Asn Met Arg His Gly His Gly Leu Leu Arg Ser Gly aaa ctg act tct tct tct cct agc atg ttc att ggc cag tgg gta atg 3576 Lys Leu Thr Ser Ser Ser Pro Ser Met Phe Ile Gly Gln Trp Val Met gat aag aaa gca gga tat ggc gtc ttt gat gat atc acc agg gga gaa 3624 Asp Lys Lys Ala Gly Tyr Gly Val Phe Asp Asp Ile Thr Arg Gly Glu aag tac atg gga atg tgg cag gat gat gtg tgc caa ggg aat ggg gta 3672 Lys Tyr Met Gly Met Trp Gln Asp Asp Val Cys Gln Gly Asn Gly Val gta gtc acc cag ttt ggg tta tac tac gaa ggc aac ttc cac ctg aat 3720 Val Val Thr Gln Phe Gly Leu Tyr Tyr Glu Gly Asn Phe His Leu Asn aag atg atg gga aat ggg gtt ttg ctt tct gaa gat gat acc atc tat 3768 Lys Met Met Gly Asn Gly Val Leu Leu Ser Glu Asp Asp Thr Ile Tyr gaa gga gaa ttt tcc gat gac tgg aca ctt agt gga aag gga acg ctg 3816 Glu Gly Glu Phe Ser Asp Asp Trp Thr Leu Ser Gly Lys Gly Thr Leu act atg cca cat gga gat tat att gaa ggt tat ttt agt gga gaa tgg 3864 Thr Met Pro His Gly Asp Tyr Ile Glu Gly Tyr Phe Ser Gly Glu Trp gga tct ggg ata aaa atc act ggg acc tac ttc aaa cct agc ctg tat 3912 Gly Ser Gly Ile Lys Ile Thr Gly Thr Tyr Phe Lys Pro Ser Leu Tyr gag agc gat aag gac aag ccc aaa gcc ttc agg aag ctg ggg aac ctg 3960 Glu Ser Asp Lys Asp Lys Pro Lys Ala Phe Arg Lys Leu Gly Asn Leu gcc gtg gca gca gac gag aaa tgg aga gca gtg ttt gaa gaa tgc tgg 4008 Ala Val Ala Ala Asp Glu Lys Trp Arg Ala Val Phe Glu Glu Cys Trp cac cag ctg ggc tgt gag agc cca ggc caa ggg gag gtt tgg aaa gca 4056 His Gln Leu Gly Cys Glu Ser Pro Gly Gln Gly Glu Val Trp Lys Ala tgg gat aat att get gtg gcc ttg acc acg aac cgt cgc cag cat aaa 4104 Trp Asp Asn Ile Ala Val Ala Leu Thr Thr Asn Arg Arg Gln His Lys gac agt cca gaa ata cta agc cgc tct cag act cag acc ctg gag agt 4152 Asp Ser Pro Glu Ile Leu Ser Arg Ser Gln Thr Gln Thr Leu Glu Ser SUBSTITUTE SHEET (RULE 26) ct ttg gag tac att ccc cag cac att ggc gcc ttc tct gtg gag aaa tat 4200 Leu Glu Tyr Ile Pro Gln His Ile Gly Ala Phe Ser Val Glu Lys Tyr gat gac atc aag aag tat tta ata aag gcc tgt gat act cct ctg cac 4248 Asp Asp Ile Lys Lys Tyr Leu Ile Lys Ala Cys Asp Thr Pro Leu His cca ctg ggc agg ctt gtg gag acc ctg gtt gcg gtg tat aga atg aca 4296 Pro Leu Gly Arg Leu Val Glu Thr Leu Val Ala Val Tyr Arg Met Thr tat gtg ggt gta ggg gcc aac cgc cgg tta ctg cag gaa get gtg aag 4344 Tyr Val Gly Val Gly Ala Asn Arg Arg Leu Leu Gln Glu Ala Val Lys gag att aaa tct tat ctc aag agg att ttc cag ctt gtg agg ttc ttg 4392 Glu Ile Lys Ser Tyr Leu Lys Arg Ile Phe Gln Leu Val Arg Phe Leu ttt cct gag ctt cct gag gag ggc agc aca att cct ctt tct get cct 4440 Phe Pro Glu Leu Pro Glu Glu Gly Ser Thr Ile Pro Leu Ser Ala Pro ctg ccc act gga agg aga tcc ttc tgt act ggg aaa ttg gat tcc aga 4488 Leu Pro Thr Gly Arg Arg Ser Phe Cys Thr Gly Lys Leu Asp Ser Arg tcc gag tca cca gaa cca ggt tat gta gta aca agt tct ggc tta ctg 4536 Ser Glu Ser Pro Glu Pro Gly Tyr Val Val Thr Ser Ser Gly Leu Leu ctt ccg gtg ctg ctg ccg cgg ctc tac cca cct ctc ttc atg ctc tat 4584 Leu Pro Val Leu Leu Pro Arg Leu Tyr Pro Pro Leu Phe Met Leu Tyr gcc ctg gat aat gac cga gag gaa gac att tac tgg gaa tgt gtg ctt 4632 Ala Leu Asp Asn Asp Arg Glu Glu Asp Ile Tyr Trp Glu Cys Val Leu cga cta aac aag cag cca gat att get ctc ttg ggc ttc ctt gga gta 4680 Arg Leu Asn Lys Gln Pro Asp Ile Ala Leu Leu Gly Phe Leu Gly Val cag aaa aaa ttc tgg cca gcc acc ttg tca atc ctt gga gag agt aaa 4728 Gln Lys Lys Phe Trp Pro Ala Thr Leu Ser Ile Leu Gly Glu Ser Lys aag gtg ttg tca acc aca aag gat get tgc ttt gca tct gca gta gaa 4776 Lys Val Leu Ser Thr Thr Lys Asp Ala Cys Phe Ala Ser Ala Val Glu tgc ctg cag cag atc agc aca aca ttt act cca tca gac aag ctt aaa 4824 Cys Leu Gln Gln Ile Ser Thr Thr Phe Thr Pro Ser Asp Lys Leu Lys gtg atc cag cag acc ttt gaa gag atc tcc cag agt gtc ctt gca tcg 4872 Val Ile Gln Gln Thr Phe Glu Glu Ile Ser Gln Ser Val Leu Ala Ser SUBSTITUTE SHEET (RULE 26) ctg cag gag gac ttc ctc tgg tcc atg gat gac ttg ttc ccc gtc ttc 4920 Leu Gln Glu Asp Phe Leu Trp Ser Met Asp Asp Leu Phe Pro Val Phe tta tac gtg gtg ctg cgg gcc agg att cgg aac ttg ggc tct gaa gtt 4968 Leu Tyr Val Val Leu Arg Ala Arg Ile Arg Asn Leu Gly Ser Glu Val cac ctc att gag gat ctg atg gac ccc ttt ctc cag cat ggg gaa caa 5016 His Leu Ile Glu Asp Leu Met Asp Pro Phe Leu Gln His Gly Glu Gln ggc atc atg ttc acc acc ttg aag gcc tgt tac ttc cag att cag cgg 5064 Gly Ile Met Phe Thr Thr Leu Lys Ala Cys Tyr Phe Gln Ile Gln Arg gag aag ctt aac tag ggcgcctgac agcttgagga ccggattatc tgctgcggag 5119 Glu Lys Leu Asn gctacagcta tggcacaggc accgactgga ggctgatggg gcaaagaaca gtgttgaata 5179 cagaatggac ttttgtgcta ttttggttgt aatttctgag ccttactaat aattagagcc 5239 cagcatggaa aacatactgt atcattcaaa tggagactgg aaaaggagat agggatagag 5299 tagagtcttt ggcctgtgct gagatccaca cacctactta gaaaaggaaa ctggttaccc 5359 tttcctgtag tgaaagctct cagctccatg cagttccagg aaacctttcc aggaaagctg 5419 cttagatgaa aagaagttga tgactgtgtt taagctcctg gtttgtctaa ttccatttgc 5479 agttacccaa taccctttgg caaggagcag gttttacttg aaactgaagc agccatccct 5539 tgccttccta gacctctcgc tcccaggcac aagtgcagca tgctactttg ctaggggtgg 5599 gggtggggga gaagaagttt taaactgtag ttttaacctt ttgtaagccc ctttaccaag 5659 gcatttgtgg tcagagagct cccacggggt gactatgaca tcctggtccc ctcgtggaat 5719 gcatccacat aggatcttct gcctgctgac tgaaaagaac ataggaatac actggagtgc 5779 aaacactgcc gtgccaagct gctccaaacc tcactgatcc gaggcccact gcctacccag 5839 gaggcccact gcctacccag gaggcccgta agcttcttag cacaagcttt gtgtggagac 5899 tgaagatctg cacatgtgag gaagcaggga gctacagtgg ccctcagccc agtctgcggg 5959 tcttccctct acctcacact gaactcagaa gggaaggaag gagagacgca catgggattc 6019 tcccacctca gaagtattgt gacagcaccg cataaccacg gtttgctctt ttacaagcag 6079 cctcacaagt gtgggttgtg ggtgtgcgct ggagcagtgc cactcgtagc tgtttggata 6139 ccacaggtgc tcttccgtct catctgctgt actcggaggc gagcgcagtg gcctgactca 6199 tgggaaatga ctcagcaggc ggcaactact tttgaaaagg atcatgattt ccgagctact 6259 ttataatcaa gacaagcatt tgttaacata ctgttttgga atgttggctg taatgtaaca 6319 gcagttttca taataaatga cattcatctc 6349 <210> 5 <211> 1651 <212> PRT
<213> Mus musculus <400> 5 Met Asp Ser Lys Lys Lys Ser Ser Thr Glu Ala Glu Gly Ser Lys Glu Arg Gly Leu Val His Val Trp Gln Ala Gly Ser Phe Ser Leu Thr Pro Glu Arg Leu Pro Gly Trp Gly Gly Lys Thr Val Leu Gln Ala Ala Leu Gly Val Arg His Gly Val Leu Leu Thr Glu Asp Gly Glu Val Tyr Ser Phe Gly Thr Leu Pro Trp Lys Ser Glu Ser Ala Glu Ile Cys Pro Ser Ser Pro Leu Leu Glu Ser Ala Leu Val Gly His His Val Ile Thr Val Ala Thr Gly Ser Phe His Ser Gly Ala Val Thr Glu Ser Gly Val Val SUBSTITUTE SHEET (RULE 26) Tyr Met Trp Gly Glu Asn Ala Ala Gly Gln Cys Ala Val Ala Asn Gln Gln Tyr Val Pro Glu Pro Ser Pro Val Ser Ile Ser Asp Ser Glu Thr Ser Pro Ser Leu Ala Val Arg Ile Leu Gln Leu Ala Cys Gly Glu Glu His Thr Leu Ala Leu Ser Leu Ser Arg Glu Ile Trp Ala Trp Gly Thr Gly Cys Gln Leu Gly Leu Ile Thr Thr Thr Phe Pro Val Thr Lys Pro Gln Lys Val Glu His Leu Ala Gly Arg Val Val Leu Gln Val Ala Cys Gly Ala Phe His Ser Leu Ala Leu Val Gln Cys Leu Pro Pro Gln Asp Leu Lys Pro Val Pro Glu Arg Cys Asn Gln Cys Ser Gln Leu Leu Ile Thr Met Thr Asp Lys Glu Asp His Val Ile Ile Ser Asp Ser His Cys Cys Pro Leu Gly Val Thr Leu Ser Glu Ser Gln Ala Glu Lys His Ala Ser Pro Ala Pro Ser Pro His Pro Glu Ala Leu Asp Glu Gln Gly Glu Val Phe Glu Asn Thr Val Val Glu Ala Glu Leu Asn Met Gly Ser Ser Gln Thr Thr Ser Gly Ser Ala Ile Ser Thr Gln Gln Asn Ile Val Gly Thr Ala Glu Val Ser Ser Ala Arg Thr Ala Pro Ser Tyr Pro Asp Thr His Ala Val Thr Ala Tyr Leu Gln Lys Leu Ser Glu His Ser Met Arg Glu Asn His Glu Pro Gly Glu Lys Pro Pro Gln Val Gln Pro Leu Val Glu Glu Ala Val Pro Asp Leu His Ser Pro Pro Thr Thr Ser Thr Ser Ala Leu Asn Ser Leu Val Val Ser Cys Ala Ser Ala Val Gly Val Arg Val Ala Ala Thr Tyr Glu Ala Gly Ala Leu Ser Leu Lys Lys Val Met Asn Phe Tyr Ser Thr Ala Pro Cys Glu Thr Ala Ala Gln Ser Gly Ser Ala Ser Thr Gly Pro Glu Ser Leu Lys Asp Leu Arg Glu Glu Gln Val Lys Gln Glu Ser Leu Gln Gly Lys Lys Ser Ser Ser Leu Met Asp Ile Arg Glu Glu Glu Ser Glu Gly Gly Ser Arg Arg Leu Ser Leu Pro Gly Leu Leu Ser Gln Val Ser Pro Arg Leu Leu Arg Lys Ala Ala Arg Val Lys Thr Arg Thr Val Val Leu Thr Pro Thr Tyr Ser Gly Glu Ala Asp Ala Leu Leu Pro Ser Leu Arg Thr Glu Val Trp Thr Trp Gly Lys Gly Lys Glu Gly Gln Leu Gly His Gly Asp Val Leu Pro Arg Leu Gln Pro Leu Cys Val Lys Cys Leu Asp Gly Lys Glu Val Ile His Leu Glu Ala Gly Gly Ser His Ser Leu Ala Leu Thr Ala Lys Ser Gln Val Tyr Ser Trp Gly Ser Asn Thr Phe Gly Gln Leu Gly His Ser Glu Phe Pro Thr Thr Val Pro Arg Leu Ser Lys Val Ser Ser Glu Asn Gly Val Trp Ser SUBSTITUTE SHEET (RULE 26) Val Ala Ala Gly Gln Asp Tyr Ser Leu Phe Leu Val Asp Thr Glu Asp Phe Gln Pro Gly Leu Tyr Tyr Ser Gly Arg Gln Asp Arg Ala Glu Gly Asp Thr Leu Pro Glu Asn Pro Ser Gly Thr Lys Thr Pro Val Leu Leu Ser Cys Ser Lys Leu Gly Tyr Ile Ser Arg Val Thr Ala Gly Lys Asp Ser Tyr Leu Ala Leu Val Asp Lys Asn Ile Met Gly Tyr Ile Ala Ser Leu His Glu Leu Ala Ser Thr Glu Arg Arg Phe Tyr Ser Lys Leu Ser Glu Ile Lys Ser Gln Ile Leu Arg Pro Leu Leu Ser Leu Glu Asn Leu Gly Thr Val Thr Thr Val Gln Leu Leu Gln Glu Val Ala Ser Arg Phe Ser Lys Leu Cys Tyr Leu Ile Gly Gln His Gly Ala Ser Leu Ser Ser Tyr Leu Gln Gly Met Lys Glu Ala Ser Ser Leu Val Ile Met Lys His Ser Ser Leu Phe Leu Asp Ser Tyr Thr Glu Tyr Cys Thr Ser Val Ser Asn Phe Leu Val Met Gly Gly Phe Gln Leu Leu Ala Lys Pro Ala Ile Asp Phe Leu Asn Lys Asn Gln Glu Leu Leu Gln Asp Leu Ser Glu Val Asn Asp Glu Asn Thr Gln Leu Met Glu Ile Leu Asn Met Leu Phe Phe Leu Pro Ile Arg Arg Leu His Asn Tyr Ala Lys Val Leu Leu Lys Leu Ala Thr Cys Phe Glu Val Thr Ser Pro Glu Tyr Gln Lys Leu Gln Asp Ser Ser Ser Cys Tyr Glu Ser Leu Ala Leu His Leu Gly Lys Lys Arg Lys Glu Ala Glu Tyr Thr Leu Ser Phe Trp Lys Thr Phe Pro Gly Lys Met Thr Asp Ser Leu Arg Lys Pro Glu Arg Arg Leu Leu Cys Glu Ser Ser Asn Arg Ala Leu Ser Leu Gln His Ala Gly Arg Phe Ser Val Asn Trp Phe Ile Leu Phe Asn Asp Ala Leu Val His Ala Gln Phe Ser Thr His His Val Phe Pro Leu Ala Thr Leu Trp Ala Glu Pro Leu Ser Glu Glu Ala Gly Ser Val Asn Gly Leu Lys Ile Thr Thr Pro Glu Glu Gln Phe Thr Leu Ile Ser Ser Thr Pro Gln Glu Lys Thr Lys Trp Leu Arg Ala Ile Ser Gln Ala Val Asp Gln Ala Leu Arg Gly Thr Ser Asp Phe Pro Leu Tyr Gly Gly Gly Ser Ser Val Gln Arg Gln Glu Pro Pro Ile Ser Arg Ser Ala Lys Tyr Thr Phe Tyr Lys Asp Thr Arg Leu Lys Asp Ala Thr Tyr Asp Gly Arg Trp Leu Ser Gly Lys Pro His Gly Arg Gly Val Leu Lys Trp Pro Asp Gly Lys Met Tyr Ser Gly Met Phe Arg Asn Gly Leu Glu Asp Gly Tyr Gly Glu Tyr Arg Ile Pro Asn Lys Ala Leu Asn Lys Glu Asp His Tyr Val Gly His Trp Lys Glu Gly Lys Met Cys SUBSTITUTE SHEET (RULE 26) Gly Gln Gly Val Tyr Ser Tyr Ala Ser Gly Glu Val Phe Glu Gly Cys Phe Gln Asp Asn Met Arg His Gly His Gly Leu Leu Arg Ser Gly Lys Leu Thr Ser Ser Ser Pro Ser Met Phe Ile Gly Gln Trp Val Met Asp Lys Lys Ala Gly Tyr Gly Val Phe Asp Asp Ile Thr Arg Gly Glu Lys Tyr Met Gly Met Trp Gln Asp Asp Val Cys Gln Gly Asn Gly Val Val Val Thr Gln Phe Gly Leu Tyr Tyr Glu Gly Asn Phe His Leu Asn Lys Met Met Gly Asn Gly Val Leu Leu Ser Glu Asp Asp Thr Ile Tyr Glu Gly Glu Phe Ser Asp Asp Trp Thr Leu Ser Gly Lys Gly Thr Leu Thr Met Pro His Gly Asp Tyr Ile Glu Gly Tyr Phe Ser Gly Glu Trp Gly Ser Gly Ile Lys Ile Thr Gly Thr Tyr Phe Lys Pro Ser Leu Tyr Glu Ser Asp Lys Asp Lys Pro Lys Ala Phe Arg Lys Leu Gly Asn Leu Ala Val Ala Ala Asp Glu Lys Trp Arg Ala Val Phe Glu Glu Cys Trp His Gln Leu Gly Cys Glu Ser Pro Gly Gln Gly Glu Val Trp Lys Ala Trp Asp Asn Ile Ala Val Ala Leu Thr Thr Asn Arg Arg Gln His Lys Asp Ser Pro Glu Ile Leu Ser Arg Ser Gln Thr Gln Thr Leu Glu Ser Leu Glu Tyr Ile Pro Gln His Ile Gly Ala Phe Ser Val Glu Lys Tyr Asp Asp Ile Lys Lys Tyr Leu Ile Lys Ala Cys Asp Thr Pro Leu His Pro Leu Gly Arg Leu Val Glu Thr Leu Val Ala Val Tyr Arg Met Thr Tyr Val Gly Val Gly Ala Asn Arg Arg Leu Leu Gln Glu Ala Val Lys Glu Ile Lys Ser Tyr Leu Lys Arg Ile Phe Gln Leu Val Arg Phe Leu Phe Pro Glu Leu Pro Glu Glu Gly Ser Thr Ile Pro Leu Ser Ala Pro Leu Pro Thr Gly Arg Arg Ser Phe Cys Thr Gly Lys Leu Asp Ser Arg Ser Glu Ser Pro Glu Pro Gly Tyr Val Val Thr Ser Ser Gly Leu Leu Leu Pro Val Leu Leu Pro Arg Leu Tyr Pro Pro Leu Phe Met Leu Tyr Ala Leu Asp Asn Asp Arg Glu Glu Asp Ile Tyr Trp Glu Cys Val Leu Arg Leu Asn Lys Gln Pro Asp Ile Ala Leu Leu Gly Phe Leu Gly Val Gln Lys Lys Phe Trp Pro Ala Thr Leu Ser Ile Leu Gly Glu Ser Lys Lys Val Leu Ser Thr Thr Lys Asp Ala Cys Phe Ala Ser Ala Val Glu Cys Leu Gln Gln Ile Ser Thr Thr Phe Thr Pro Ser Asp Lys Leu Lys Val Ile Gln Gln Thr Phe Glu Glu Ile Ser Gln Ser Val Leu Ala Ser Leu Gln Glu Asp Phe Leu Trp Ser Met Asp Asp Leu Phe Pro Val Phe Leu SUBSTETUTE SHEET (RULE 26) TyrValVal Leu Ala Arg Ile Arg Leu Gly Ser ValHis Arg Asn Glu LeuIleGlu Asp Met Asp Pro Phe Gln His Gly GlnGly Leu Leu Glu IleMetPhe Thr Leu Lys Ala Cys Phe Gln Ile ArgGlu Thr Tyr Gln LysLeuAsn <210> 6 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Artificial Sequence: Synthesized Oligonucleotide <400> 6 cctagtcatc catgtgctgg 20 <210> 7 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Artificial Sequence: Synthesized Oligonucleotide <400> 7 tcccatacct gaccttccac 20 <210> 8 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Artificial Sequence: Synthesized Oligonucleotide <400> 8 cttgatagac tttctgtaaa gaag 24 <210> 9 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Artificial Sequence: Synthesized Oligonucleotide <400> 9 ggctacttgg acaaatctcc actg 24 <210> 10 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide Sk)BSTfTUTE SHEET (RULE 26) <400> 10 ggagagactg tgctcccaag 20 <210> 11 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 11 agccctcctt agccaatagc 20 <210> 12 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 12 taagcttagt gggcaggctc 20 <210> 13 <211> 22 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 13 ttcccactta acaaccatca ac 22 <210> 14 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 14 ccaatttggt taaatctata gggg 24 <210> 15 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 15 gacaatgcca gagtgtgctc 20 SUBSTITUTE SHEET (RULE 26) <210> 16 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 16 ccagcccttt gttagcagtc 20 <210> 17 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 17 cttcttcctg cctgtcaagg 20 <210> 18 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 18 ttgtacaatg cctcccttcc 20 <210> 19 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 19 agcccaacat gacacctttc 20 <210> 20 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 20 gattgcttgt tgcataaggg 20 <210> 21 SUBSTITUTE SHEET (RULE 26) <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 21 atacagcatg cgatgtcagg 20 <210> 22 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 22 ctggactccc actccttcac 20 <210> 23 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 23 gctagaagag cccagatttc c 21 <210> 24 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 24 tgactttgtg tgcctgtgtg 20 <210> 25 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 25 ataccctgga aaatctgggg 20 <210> 26 <211> 20 <212> DNA
SUBSTITUTE SHEET (RULE 26) <213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 26 tttgcgcatt atctctggtc 20 <210> 27 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 27 gtacgtatga aattcccccg 20 <210> 28 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 28 ttccgtctta ctcctgcacc 20 <210> 29 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 29 gccttaggat ccaattcctg 20 <210> 30 <211> 23 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 30 caatgatgta ctgatgaacc agc 23 <210> 31 <211> 20 <212> DNA
<213> Artificial Sequence SUBSTITUTE SHEET (RULE 26) <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 31 cctgatggtt taatggtggg 20 <210> 32 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 32 gcacatggca acaggttaag 20 <210> 33 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 33 tccttggcag aataaccctg 20 <210> 34 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 34 cccctaccac tccctttacc 20 <210> 35 <211> 23 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 35 ccagtggcta atagtacctg tcc 23 <210> 36 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized SUBSTITUTE SHEET (RULE 26) oligonucleotide <400> 36 tggatgcatg attcatttcc 20 <210> 37 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 37 tccttggctt tccaaatgtc 20 <210> 38 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 38 ctatcctggg gtctctgctg 20 <210> 39 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 39 tgctatcgaa atggttgctg 20 <210> 40 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 40 agctacgacc agcaaattcc 20 <210> 41 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide SUBSTITUTE SHEET (RULE 26) <400> 41 ~
ataggggtcc acctttcagg 20 <210> 42 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 42 aaggggatat gggcagagtc 20 <210> 43 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 43 aaatgcttgc ttggttttgg 20 <210> 44 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 44 aaagggcatc ttcattgcac 20 <210> 45 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 45 cacaagaggc agaaagagcc 20 <210> 46 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 46 aatgcttgat gaattgttgc c 21 SUBSTITUTE SHEET (RULE 26) <210> 47 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 47 atgatcatcc tcaccccagg 20 <210> 48 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 48 ttgaagattt atgcctgggg 20 <210> 49 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 49 tgaggtcaca cggctatcag 20 <210> 50 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 50 gtgtagtggg gctgatgtcc 20 <210> 51 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 51 tggctatgca aacattcagg 20 <210> 52 SUBSTITUTE SHEET (RULE 26) <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 52 aatgcaaaat accacacatg g 21 <210> 53 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 53 tcattggctt aaactgtggg 20 <210> 54 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 54 caacctaggg ttgatgcctg 20 <210> 55 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 55 catcttcgga aagcaaaacc . 20 <210> 56 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 56 ctttggggat atgactgcgt 20 <210> 57 <211> 26 <212> DNA
Repod. Dev.
54, 103-111 (1999).
SUBSTITUTE SHEET (RULE 26) SEQUENCE LISTING
<110> The University of British Columbia; and Japan Science and Technology Corporation <120> ALS2 Gene and Amyotrophic Lateral Sclerosis Type 2 <130> 80021-346 <140>
<141>
<150> US 60/267,723 <151> 2001-02-12 <150> JP 2001-116973 <151> 2001-04-16 <150> US 60/318,352 <151> 2001-09-12 <160> 84 <170> PatentIn Ver. 2.0 <210> 1 <211> 6394 <212> DNA
<213> Homo Sapiens <220>
<221> CDS
<222> (124)..(5097) <400> 1 ggacccactg ggttgccaag ctcgcgccgg atgcggagcg cggtgctgcc ggtggagctt 60 caggtcttga tagactttct gtaaagaagg aatgatttgg tgatggagtg ttcccactga 120 ccg atg gac tca aag aag aga agc tca aca gag gca gaa gga tcc aag 168 Met Asp Ser Lys Lys Arg Ser Ser Thr Glu Ala Glu Gly Ser Lys gaa aga ggc ctg gtc cat atc tgg cag gca gga tcc ttt ccc ata aca 216 Glu Arg Gly Leu Val His Ile Trp Gln Ala Gly Ser Phe Pro Ile Thr cca gag aga ttg cca ggc tgg gga gga aag act gtt ttg cag gca gcc 264 Pro Glu Arg Leu Pro Gly Trp Gly Gly Lys Thr Val Leu Gln Ala Ala ctc gga gtg aaa cat gga gtt ctt ctg act gaa gat ggt gag gtc tac 312 Leu Gly Val Lys His Gly Val Leu Leu Thr Glu Asp Gly Glu Val Tyr agc ttt ggg act ctt ccc tgg aga agt gga cca gtg gag att tgt cca 360 Ser Phe Gly Thr Leu Pro Trp Arg Ser Gly Pro Val Glu Ile Cys Pro agt agc ccc att cta gaa aat gcc ctg gtt ggg caa tat gtt att act 408 Ser Ser Pro Ile Leu Glu Asn Ala Leu Val Gly Gln Tyr Val Ile Thr SUBSTITUTE SHEET (RULE 26) gtg gca aca gga agc ttc cat agt gga gca gtg aca gac aat ggt gtc 456 Val Ala Thr Gly Ser Phe His Ser Gly Ala Val Thr Asp Asn Gly Val gcg tac atg tgg gga gag aat tct get ggc cag tgt gca gta gcc aac 504 Ala Tyr Met Trp Gly Glu Asn Ser Ala Gly Gln Cys Ala Val Ala Asn cag cag tat gtg ccg gaa cca aat cct gtc agc att get gat tct gag 552 Gln Gln Tyr Val Pro Glu Pro Asn Pro Val Ser Ile Ala Asp Ser Glu gcc agc cct ttg tta gca gtc agg att tta cag ttg gcg tgt ggc gag 600 Ala Ser Pro Leu Leu Ala Val Arg Ile Leu Gln Leu Ala Cys Gly Glu gag cac act ctg gca ttg tca ata agc aga gag att tgg gca tgg ggt 648 Glu His Thr Leu Ala Leu Ser Ile Ser Arg Glu Ile Trp Ala Trp Gly acc ggt tgt cag ttg ggt ctc att acc act gcc ttc cca gtg aca aag 696 Thr Gly Cys Gln Leu Gly Leu Ile Thr Thr Ala Phe Pro Val Thr Lys ccg caa aag gta gaa cat ctt get ggg cga gtg gtg ctt caa gtt gcc 744 Pro Gln Lys Val Glu His Leu Ala Gly Arg Val Val Leu Gln Val Ala tgt ggt get ttc cac agc tta gcc ctt gta caa tgc ctc cct tcc cag 792 Cys Gly Ala Phe His Ser Leu Ala Leu Val Gln Cys Leu Pro Ser Gln gat ctg aag cca gtc cca gaa cga tgc aac cag tgc agc cag ctc ttg 840 Asp Leu Lys Pro Val Pro Glu Arg Cys Asn Gln Cys Ser Gln Leu Leu att act atg act gac aaa gaa gac cat gtg att ata tca gac.agt cat 888 Ile Thr Met Thr Asp Lys Glu Asp His Val Ile Ile Ser Asp Ser His tgt tgc cca tta ggt gtg aca ctg aca gaa tct cag gca gaa aac cat 936 Cys Cys Pro Leu Gly Val Thr Leu Thr Glu Ser Gln Ala Glu Asn His gcc agc act get ctc agc ccc tcc act gaa acc ctt gac agg cag gaa 984 Ala Ser Thr Ala Leu Ser Pro Ser Thr Glu Thr Leu Asp Arg Gln Glu gaa gta ttt gag aac act ctt gta gca aat gat cag tct gtt get act 1032 Glu Val Phe Glu Asn Thr Leu Val Ala Asn Asp Gln Ser Val Ala Thr gaa ctg aat gca gta agt get cag atc aca agc agc gat gcc atg tcc 1080 Glu Leu Asn Ala Val Ser Ala Gln Ile Thr Ser Ser Asp Ala Met Ser tct caa caa aat gtc atg gga aca act gaa att tcc tct gcc aga aac 1128 Ser Gln Gln Asn Val Met Gly Thr Thr Glu Ile Ser Ser Ala Arg Asn ata cca tca tac cct gac acc caa gca gtc aat gaa tac cta cgg aaa 1176 SUBSTITUTE SHEET (RULE 26) Ile Pro Ser Tyr Pro Asp Thr Gln Ala Val Asn Glu Tyr Leu Arg Lys ctg tca gat cat tca gta aga gag gac tca gag cat ggt gaa aag cca 1224 Leu Ser Asp His Ser Val Arg Glu Asp Ser Glu His Gly Glu Lys Pro atg cca tct cag cct ctt tta gaa gaa gca att cct aat ctc cac agc 1272 Met Pro Ser Gln Pro Leu Leu Glu Glu Ala Ile Pro Asn Leu His Ser ccg cct acc aca agc acc tca gcc cta aac agc ctg gtg gtc tct tgt 1320 Pro Pro Thr Thr Ser Thr Ser Ala Leu Asn Ser Leu Val Val Ser Cys gca tct get gtt ggt gtg aga gtg get get act tat gaa get ggt gcc 1368 Ala Ser Ala Val Gly Val Arg Val Ala Ala Thr Tyr Glu Ala Gly Ala ttg tca ctg aag aaa gtt atg aac ttt tat agt aca acc cct tgt gaa 1416 Leu Ser Leu Lys Lys Val Met Asn Phe Tyr Ser Thr Thr Pro Cys Glu act gga get cag gca ggc agt agt gcc att ggc ccc gaa ggt ttg aaa 1464 Thr Gly Ala Gln Ala Gly Ser Ser Ala Ile Gly Pro Glu Gly Leu Lys gat agc agg gaa gaa cag gtt aaa cag gaa tca atg caa gga aag aaa 1512 Asp Ser Arg Glu Glu Gln Val Lys Gln Glu Ser Met Gln Gly Lys Lys agt tca agt ctt gtg gat atc aga gaa gaa gaa aca gag gga ggc agt 1560 Ser Ser Ser Leu Val Asp Ile Arg Glu Glu Glu Thr Glu Gly Gly Ser cga aga ctc tcc ctc cct gga ttg ttg tca caa gtt tcc ccc agg ctc 1608 Arg Arg Leu Ser Leu Pro Gly Leu Leu Ser Gln Val Ser Pro Arg Leu tta aga aag get gca cgg gtg aaa acg agg aca gtg gtt ctg acc ccc 1656 Leu Arg Lys Ala Ala Arg Val Lys Thr Arg Thr Val Val Leu Thr Pro aca tac agt gga gaa gca gat gcg ctc ctg cct tct ctg aga aca gaa 1704 Thr Tyr Ser Gly Glu Ala Asp Ala Leu Leu Pro Ser Leu Arg Thr Glu gtg tgg acc tgg ggg aaa ggg aag gaa ggg cag ctg ggg cac ggc gat 1752 Val Trp Thr Trp Gly Lys Gly Lys Glu Gly Gln Leu Gly His Gly Asp gtt ctg cct agg ctt caa ccg ttg tgt gta aaa tgt ctg gat ggc aaa 1800 Val Leu Pro Arg Leu Gln Pro Leu Cys Val Lys Cys Leu Asp Gly Lys gaa gta atc cat ctg gag gca ggt ggt tac cat tct ctt gca ctt act 1848 Glu Val Ile His Leu Glu Ala Gly Gly Tyr His Ser Leu Ala Leu Thr gcg aaa tcc cag gtt tac tca tgg ggt agc aat acc ttt ggt caa ctt 1896 Ala Lys Ser Gln Val Tyr Ser Trp Gly Ser Asn Thr Phe Gly Gln Leu SUBSTITUTE SHEET (RULE 26) ggg cat tcc gat ttt cca aca aca gtt cct cgt ctt gca aag ata agc 1944 Gly His Ser Asp Phe Pro Thr Thr Val Pro Arg Leu Ala Lys Ile Ser agt gaa aat gga gtc tgg agc ata get gca ggc agg gat tat tcc ctg 1992 Ser Glu Asn Gly Val Trp Ser Ile Ala Ala Gly Arg Asp Tyr Ser Leu ttt tta gtg gat aca gaa gac ttc cag cct ggg tta tat tac agt ggc 2040 Phe Leu Val Asp Thr Glu Asp Phe Gln Pro Gly Leu Tyr Tyr Ser Gly cga cag gac cct aca gaa ggt gac aac ctt cca gag aat cac agt ggt 2088 Arg Gln Asp Pro Thr Glu Gly Asp Asn Leu Pro Glu Asn His Ser Gly tct aag act cca gta ctt ctc tcc tgt agt aag ctt gga tat ata agc 2136 Ser Lys Thr Pro Val Leu Leu Ser Cys Ser Lys Leu Gly Tyr Ile Ser aga gtg aca gca gga aaa gat agc tat tta gcc ttg gtg gat aaa aac 2184 Arg Val Thr Ala Gly Lys Asp Ser Tyr Leu Ala Leu Val Asp Lys Asn att atg ggg tat att gcc agt ctc cac gag tta get act aca gaa aga 2232 Ile Met Gly Tyr Ile Ala Ser Leu His Glu Leu Ala Thr Thr Glu Arg cga ttc tat tca aaa cta agt gat atc aaa tct cag att ctc agg cct 2280 Arg Phe Tyr Ser Lys Leu Ser Asp Ile Lys Ser Gln Ile Leu Arg Pro ctt ctc agt tta gaa aat ttg ggc act aca act aca gtc cag ctg ttg 2328 Leu Leu Ser Leu Glu Asn Leu Gly Thr Thr Thr Thr Val Gln Leu Leu cag gag gtg get agc cga ttc agc aag ctg tgt tac ctc att ggt cag 2376 Gln Glu Val Ala Ser Arg Phe Ser Lys Leu Cys Tyr Leu Ile Gly Gln cat gga gcc tca ttg agc agc ttc ctt cat ggg gta aag gaa gcc agg 2424 His Gly Ala Ser Leu Ser Ser Phe Leu His Gly Val Lys Glu Ala Arg agt ttg gtc atc ctg aag cat tca agt ctc ttc ttg gat agt tat aca 2472 Ser Leu Val Ile Leu Lys His Ser Ser Leu Phe Leu Asp Ser Tyr Thr gag tat tgc aca tct att aca aat ttc ctg gtt atg gga gga ttc cag 2520 Glu Tyr Cys Thr Ser Ile Thr Asn Phe Leu Val Met Gly Gly Phe Gln ctt ctt get aag cct gcc att gat ttc cta aat aaa aac caa gag ctg 2568 Leu Leu Ala Lys Pro Ala Ile Asp Phe Leu Asn Lys Asn Gln Glu Leu ttg caa gat ttg tca gaa gtg aat gac gaa aac act cag ttg atg gaa 2616 Leu Gln Asp Leu Ser Glu Val Asn Asp Glu Asn Thr Gln Leu Met Glu ata ctg aat act ttg ttt ttc ttg cca atc aga cga ctt cat aat tac 2664 SUBSTITUTE SHEET (RULE 26) Ile Leu Asn Thr Leu Phe Phe Leu Pro Ile Arg Arg Leu His Asn Tyr gca aaa gtt ttg cta aag ctt get act tgt ttt gaa gtg gca tct cca 2712 Ala Lys Val Leu Leu Lys Leu Ala Thr Cys Phe Glu Val Ala Ser Pro gaa tat cag aaa ctg cag gat tcc agt tct tgt tat gag tgt ctt get 2760 Glu Tyr Gln Lys Leu Gln Asp Ser Ser Ser Cys Tyr Glu Cys Leu Ala ctc cat ctc ggc agg aaa agg aag gaa gca gaa tac aca ctg ggc ttc 2808 Leu His Leu Gly Arg Lys Arg Lys Glu Ala Glu Tyr Thr Leu Gly Phe tgg aag acc ttc ccc gga aaa atg acg gat tcc ttg agg aag cca gag 2856 Trp Lys Thr Phe Pro Gly Lys Met Thr Asp Ser Leu Arg Lys Pro Glu cgt cga ctg ctg tgt gag agt agt aac cga gcc ctg tct ctg cag cat 2904 Arg Arg Leu Leu Cys Glu Ser Ser Asn Arg Ala Leu Ser Leu Gln His get ggg agg ttt tcc gtg aat tgg ttc att ctc ttt aat gat gcc ctg 2952 Ala Gly Arg Phe Ser Val Asn Trp Phe Ile Leu Phe Asn Asp Ala Leu gtc cat gcc cag ttc tcc acg cac cat gtt ttc cct ctg gcc acg ctg 3000 Val His Ala Gln Phe Ser Thr His His Val Phe Pro Leu Ala Thr Leu tgg gca gag cca ctg tct gaa gaa get ggt ggt gtg aat ggc tta aag 3048 Trp Ala Glu Pro Leu Ser Glu Glu Ala Gly Gly Val Asn Gly Leu Lys ata act aca cct gag gag cag ttc act ctc att tca tct aca ccc cag 3096 Ile Thr Thr Pro Glu Glu Gln Phe Thr Leu Ile Ser Ser Thr Pro Gln gaa aag aca aag tgg cta cga get ata agc caa gcc gta gat cag get 3144 Glu Lys Thr Lys Trp Leu Arg Ala Ile Ser Gln Ala Val Asp Gln Ala ttg aga ggg atg tct gat ctc ccc cct tat gga agt ggt agc agt gtt 3192 Leu Arg Gly Met Ser Asp Leu Pro Pro Tyr Gly Ser Gly Ser Ser Val cag aga cag gaa cca ccc att tca cgc agt gcc aaa tat act ttc tac 3240 Gln Arg Gln Glu Pro Pro Ile Ser Arg Ser Ala Lys Tyr Thr Phe Tyr aag gat cct cgc cta aag gat gcc acc tat gat gga cgc tgg ctt tca 3288 Lys Asp Pro Arg Leu Lys Asp Ala Thr Tyr Asp Gly Arg Trp Leu Ser ggg aag cct cat ggc aga ggg gtt ttg aag tgg cct gat gga aag atg 3336 Gly Lys Pro His Gly Arg Gly Val Leu Lys Trp Pro Asp Gly Lys Met tat tct ggc atg ttc agg aat ggc ttg gaa gat ggg tat gga gaa tac 3384 Tyr Ser Gly Met Phe Arg Asn Gly Leu Glu Asp Gly Tyr Gly Glu Tyr SUBSTITUTE SHEET (RULE 26) aga atc cca aac aag gca atg aac aaa gaa gac cat tat gtg ggc cat 3432 Arg Ile Pro Asn Lys Ala Met Asn Lys Glu Asp His Tyr Val Gly His tgg aaa gaa gga aaa atg tgc ggt caa gga gtc tac agc tat get tct 3480 Trp Lys Glu Gly Lys Met Cys Gly Gln Gly Val Tyr Ser Tyr Ala Ser ggt gaa gta ttt gag ggc tgt ttt caa gat aat atg cgt cat ggt cat 3528 Gly Glu Val Phe Glu Gly Cys Phe Gln Asp Asn Met Arg His Gly His ggt ctt cta cga agt ggg aaa ttg acg tcc tct tct cct agt atg ttc 3576 Gly Leu Leu Arg Ser Gly Lys Leu Thr Ser Ser Ser Pro Ser Met Phe att ggc cag tgg gta atg gat aag aaa gca gga tat ggt gtc ttt gat 3624 Ile Gly Gln Trp Val Met Asp Lys Lys Ala Gly Tyr Gly Val Phe Asp gat atc act agg ggg gaa aag tat atg gga atg tgg caa gat gat gtg 3672 Asp Ile Thr Arg Gly Glu Lys Tyr Met Gly Met Trp Gln Asp Asp Val tgt caa ggg aat ggt gtg gtg gtt acc cag ttt gga tta tac tac gag 3720 Cys Gln Gly Asn Gly Val Val Val Thr Gln Phe Gly Leu Tyr Tyr Glu ggc aac ttt cac ctt aat aaa atg atg gga aat ggg gtt ttg ctt tcc 3768 Gly Asn Phe His Leu Asn Lys Met Met Gly Asn Gly Val Leu Leu Ser gaa gat gat act atc tat gaa gga gaa ttt tca gat gac tgg act ctt 3816 Glu Asp Asp Thr Ile Tyr Glu Gly Glu Phe Ser Asp Asp Trp Thr Leu agt gga aag gga aca ctg act atg cca aat gga gac tac att gaa ggt 3864 Ser Gly Lys Gly Thr Leu Thr Met Pro Asn Gly Asp Tyr Ile Glu Gly tat ttt agt gga gaa tgg gga tct ggg ata aaa atc act gga acc tac 3912 Tyr Phe Ser Gly Glu Trp Gly Ser Gly Ile Lys Ile Thr Gly Thr Tyr ttc aaa cct agt cta tat gag agt gat aaa gac aga cct aaa gtt ttc 3960 Phe Lys Pro Ser Leu Tyr Glu Ser Asp Lys Asp Arg Pro Lys Val Phe agg aag cta gga aac ctg gca gtg cca get gat gag aag tgg aaa gcg 4008 Arg Lys Leu Gly Asn Leu Ala Val Pro Ala Asp Glu Lys Trp Lys Ala gtg ttt gac gaa tgt tgg cgc caa ctg ggc tgt gag ggc cca ggc caa 4056 Val Phe Asp Glu Cys Trp Arg Gln Leu Gly Cys Glu Gly Pro Gly Gln ggg gaa gtt tgg aaa gca tgg gac aat att get gtg gcc ttg acc acc 4104 Gly Glu Val Trp Lys Ala Trp Asp Asn Ile Ala Val Ala Leu Thr Thr agt cgg cgc cag cac aga gac agt cca gaa ata ttg agt cgt tca cag 4152 SUBSTITUTE SHEET (RULE 26) Ser Arg Arg Gln His Arg Asp Ser Pro Glu Lle Leu Ser Arg Ser Gln act cag aca cta gag agt ttg gaa ttc att cca cag cat gtt ggt gcc 4200 Thr Gln Thr Leu Glu Ser Leu Glu Phe Ile Pro Gln His Val Gly Ala ttc tct gtg gag aaa tat gat gac atc agg aaa tat tta ata aag gcc 4248 Phe Ser Val Glu Lys Tyr Asp Asp Ile Arg Lys Tyr Leu Ile Lys Ala tgt gac act cct ctg cac ccc ctg ggc agg ctt gtg gag aca ctg gtt 4296 Cys Asp Thr Pro Leu His Pro Leu Gly Arg Leu Val Glu Thr Leu Val gca gtg tat aga atg aca tac gtg ggc gta gga gcc aac cgc agg tta 4344 Ala Val Tyr Arg Met Thr Tyr Val Gly Val Gly Ala Asn Arg Arg Leu ttg cag gag get gta aag gag att aag tcc tat ctt aag cga att ttc 4392 Leu Gln Glu Ala Val Lys Glu Ile Lys Ser Tyr Leu Lys Arg Ile Phe cag ctg gtg agg ttc tta ttt cct gag ctg cct gaa gaa ggc agc aca 4440 Gln Leu Val Arg Phe Leu Phe Pro Glu Leu Pro Glu Glu Gly Ser Thr att cct ctc tct get cct ctg cca acc gaa agg aag tct ttt tgc act 4488 Ile Pro Leu Ser Ala Pro Leu Pro Thr Glu Arg Lys Ser Phe Cys Thr ggg aag tca gat tcc cga tct gaa tca cca gag cca ggt tat gta gta 4536 Gly Lys Ser Asp Ser Arg Ser Glu Ser Pro Glu Pro Gly Tyr Val Val acg agt tct gga tta ttg ctt cct gtg ctg cta cct cgg ctc tac cca 4584 Thr Ser Ser Gly Leu Leu Leu Pro Val Leu Leu Pro Arg Leu Tyr Pro ccg ctg ttt atg ctt tat get ttg gat aat gat cgc gag gaa gac att 4632 Pro Leu Phe Met Leu Tyr Ala Leu Asp Asn Asp Arg Glu Glu Asp Ile tac tgg gaa tgt gtc ctt cga cta aat aag cag cca gat att get ctc 4680 Tyr Trp Glu Cys Val Leu Arg Leu Asn Lys Gln Pro Asp Ile Ala Leu ctg ggc ttt ctt ggg gtg cag agg aaa ttt tgg cca gca acc ttg tca 4728 Leu Gly Phe Leu Gly Val Gln Arg Lys Phe Trp Pro Ala Thr Leu Ser atc ctt gga gag agt aaa aag gtt ttg cca acc acg aaa gat get tgt 4776 Ile Leu Gly Glu Ser Lys Lys Val Leu Pro Thr Thr Lys Asp Ala Cys ttt gcc tca gca gta gaa tgt ctg cag cag atc agc aca aca ttt acc 4824 Phe Ala Ser Ala Val Glu Cys Leu Gln Gln Ile Ser Thr Thr Phe Thr cca tca gac aaa ctt aag gtc atc cag cag act ttt gag gag atc tct 4872 Pro Ser Asp Lys Leu Lys Val Ile Gln Gln Thr Phe Glu Glu Ile Ser SUBSTITUTE SHEET (RULE 26) cag agt gtc ctg gcg tca ctc cac gaa gac ttc ttg tgg tcc atg gat 4920 Gln Ser Val Leu Ala Ser Leu His Glu Asp Phe Leu Trp Ser Met Asp gac ttg ttt cct gtt ttc tta tat gtg gtg cta cgg gcc agg att agg 4968 Asp Leu Phe Pro Val Phe Leu Tyr Val Val Leu Arg Ala Arg Ile Arg aat tta ggc tct gag gta cac ctc att gag gat cta atg gac ccc tat 5016 Asn Leu Gly Ser Glu Val His Leu Ile Glu Asp Leu Met Asp Pro Tyr ctt cag cat ggg gaa cag ggt ata atg ttc acc acc ttg aag gca tgt 5064 Leu Gln His Gly Glu Gln Gly Ile Met Phe Thr Thr Leu Lys Ala Cys tac tac cag att cag cgt gag aag ctt aac tag gctgcataac agcttgaaaa 5117 Tyr Tyr Gln Ile Gln Arg Glu Lys Leu Asn ctggattatc tactacagag tgttataaca ccatctggag tcttcctgta gtggcaaaaa 5177 agaacagtgt tgaaattgga aaggactttg tgttatttag gttgttagaa tgagccttac 5237 caataataag agccctgagc ccagaaaaaa ggactgtata gtttaaaggg aggattgaaa 5297 gggaggtaaa aaatcagatt agaccagttc ttggcctatg ataagttcca aaaataccat 5357 ttatctacta tttgaaaaaa gaagaggata tcccttccta cagtaaaggg tatgtcagct 5417 acatgaagtt gtaagaaaag cttccagtag agcttcttat attaaagaag ttgatggata 5477 tttttgaatt tctggtttgc ctgaatccac ctgcagttac cccgatccgt ttgcaagaac 5537 cagatcgtac ttgaaactat agtggccaca ctctgccttc ctgagtccct tccagtcatg 5597 tgtgcatcat gtctctttgc caagggaggg gagaaaggaa cttttaaact gcagttttaa 5657 ctttttctaa gctgtttctt gatgggagag gttctgtgca aaactaccac attctgtccc 5717 caaaatgtgg aatgcatcca aataggagtc ttctgcctct taacttaaaa gaacatagga 5777 attttgtttt tggtttcttt atcatgctac agagagtgaa tacactggaa ttcagacacc 5837 gactctgagc tgctaggaac ctcatttgtc catgtgcaaa cgctgtattc caaggcctgt 5897 gaatggcagc ctgaggaagt tttgcatgca ggctgtgttt tcgagcagga ctaacaactg 5957 ggaaataagc aaaaaactgc atcgatcccc agcctggtgt tgttcttccc tatacttcac 6017 actgaactca ggatgggaag aaaaaggaaa caagctttgg ctttttccat ctcaaaagta 6077 ttgtggcacc tcaacatttc agtgttttgc tttttaaaaa atgccctatt gtaagttgtt 6137 ggtttatact gtataagtaa cactagtagc tgttttgaat aacataggtg ctcttcctca 6197 tctcatctcc tacaccgtgg tgagcataca gagtgtcctg atttgtgtta agtgactgag 6257 aagatgttaa ttacttttga aaaaggatca tggtttttgc tctactttat aatcaagaca 6317 agtgtttatt aaaatactgt tttggaatgt tggctgtaat gtaacagcaa ttttcataat 6377 aaaaggcatt catcttt 6394 <210> 2 <211> 1657 <212> PRT
<213> Homo Sapiens <400> 2 Met Asp Ser Lys Lys Arg Ser Ser Thr Glu Ala Glu Gly Ser Lys Glu Arg Gly Leu Val His Ile Trp Gln Ala Gly Ser Phe Pro Ile Thr Pro Glu Arg Leu Pro Gly Trp Gly Gly Lys Thr Val Leu Gln Ala Ala Leu Gly Val Lys His Gly Val Leu Leu Thr Glu Asp Gly Glu Val Tyr Ser Phe Gly Thr Leu Pro Trp Arg Ser Gly Pro Val Glu Ile Cys Pro Ser Ser Pro Ile Leu Glu Asn Ala Leu Val Gly Gln Tyr Val Ile Thr Val SUBSTITUTE SHEET (RULE 26) Ala Thr Gly Ser Phe His Ser Gly Ala Val Thr Asp Asn Gly Val Ala Tyr Met Trp Gly Glu Asn Ser Ala Gly Gln Cys Ala Val Ala Asn Gln Gln Tyr Val Pro Glu Pro Asn Pro Val Ser Ile Ala Asp Ser Glu Ala Ser Pro Leu Leu Ala Val Arg Ile Leu Gln Leu Ala Cys Gly Glu Glu His Thr Leu Ala Leu Ser Ile Ser Arg Glu Ile Trp Ala Trp Gly Thr Gly Cys Gln Leu Gly Leu Ile Thr Thr Ala Phe Pro Val Thr Lys Pro Gln Lys Val Glu His Leu Ala Gly Arg Val Val Leu Gln Val Ala Cys Gly Ala Phe His Ser Leu Ala Leu Val Gln Cys Leu Pro Ser Gln Asp Leu Lys Pro Val Pro Glu Arg Cys Asn Gln Cys Ser Gln Leu Leu Ile Thr Met Thr Asp Lys Glu Asp His Val Ile Ile Ser Asp Ser His Cys Cys Pro Leu Gly Val Thr Leu Thr Glu Ser Gln Ala Glu Asn His Ala Ser Thr Ala Leu Ser Pro Ser Thr Glu Thr Leu Asp Arg Gln Glu Glu Val Phe Glu Asn Thr Leu Val Ala Asn Asp Gln Ser Val Ala Thr Glu Leu Asn Ala Val Ser Ala Gln Ile Thr Ser Ser Asp Ala Met Ser Ser Gln Gln Asn Val Met Gly Thr Thr Glu Ile Ser Ser Ala Arg Asn Ile Pro Ser Tyr Pro Asp Thr Gln Ala Val Asn Glu Tyr Leu Arg Lys Leu Ser Asp His Ser Val Arg Glu Asp Ser Glu His Gly Glu Lys Pro Met Pro Ser Gln Pro Leu Leu Glu Glu Ala Ile Pro Asn Leu His Ser Pro Pro Thr Thr Ser Thr Ser Ala Leu Asn Ser Leu Val Val Ser Cys Ala Ser Ala Val Gly Val Arg Val Ala Ala Thr Tyr Glu Ala Gly Ala Leu Ser Leu Lys Lys Val Met Asn Phe Tyr Ser Thr Thr Pro Cys Glu Thr Gly Ala Gln Ala Gly Ser Ser Ala Ile Gly Pro Glu Gly Leu Lys Asp Ser Arg Glu Glu Gln Val Lys Gln Glu Ser Met Gln Gly Lys Lys Ser Ser Ser Leu Val Asp Ile Arg Glu Glu Glu Thr Glu Gly Gly Ser Arg Arg Leu Ser Leu Pro Gly Leu Leu Ser Gln Val Ser Pro Arg Leu Leu Arg Lys Ala Ala Arg Val Lys Thr Arg Thr Val Val Leu Thr Pro Thr Tyr Ser Gly Glu Ala Asp Ala Leu Leu Pro Ser Leu Arg Thr Glu Val Trp Thr Trp Gly Lys Gly Lys Glu Gly Gln Leu Gly His Gly Asp Val Leu Pro Arg Leu Gln Pro Leu Cys Val Lys Cys Leu Asp Gly Lys Glu Val Ile His Leu Glu Ala Gly Gly Tyr His Ser Leu Ala Leu Thr Ala Lys Ser Gln Val Tyr Ser Trp Gly Ser Asn Thr Phe Gly Gln Leu Gly SUBSTITUTE SHEET (RULE 26) His Ser Asp Phe Pro Thr Thr Val Pro Arg Leu Ala Lys Ile Ser Ser Glu Asn Gly Val Trp Ser Ile Ala Ala Gly Arg Asp Tyr Ser Leu Phe Leu Val Asp Thr Glu Asp Phe Gln Pro Gly Leu Tyr Tyr Ser Gly Arg Gln Asp Pro Thr Glu Gly Asp Asn Leu Pro Glu Asn His Ser Gly Ser Lys Thr Pro Val Leu Leu Ser Cys Ser Lys Leu Gly Tyr Ile Ser Arg Val Thr Ala Gly Lys Asp Ser Tyr Leu Ala Leu Val Asp Lys Asn Ile Met Gly Tyr Ile Ala Ser Leu His Glu Leu Ala Thr Thr Glu Arg Arg Phe Tyr Ser Lys Leu Ser Asp Ile Lys Ser Gln Ile Leu Arg Pro Leu Leu Ser Leu Glu Asn Leu Gly Thr Thr Thr Thr Val Gln Leu Leu Gln Glu Val Ala Ser Arg Phe Ser Lys Leu Cys Tyr Leu Ile Gly Gln His Gly Ala Ser Leu Ser Ser Phe Leu His Gly Val Lys Glu Ala Arg Ser Leu Val Ile Leu Lys His Ser Ser Leu Phe Leu Asp Ser Tyr Thr Glu Tyr Cys Thr Ser Ile Thr Asn Phe Leu Val Met Gly Gly Phe Gln Leu Leu Ala Lys Pro Ala Ile Asp Phe Leu Asn Lys Asn Gln Glu Leu Leu Gln Asp Leu Ser Glu Val Asn Asp Glu Asn Thr Gln Leu Met Glu Ile Leu Asn Thr Leu Phe Phe Leu Pro Ile Arg Arg Leu His Asn Tyr Ala Lys Val Leu Leu Lys Leu Ala Thr Cys Phe Glu Val Ala Ser Pro Glu Tyr Gln Lys Leu Gln Asp Ser Ser Ser Cys Tyr Glu Cys Leu Ala Leu His Leu Gly Arg Lys Arg Lys Glu Ala Glu Tyr Thr Leu Gly Phe Trp Lys Thr Phe Pro Gly Lys Met Thr Asp Ser Leu Arg Lys Pro Glu Arg Arg Leu Leu Cys Glu Ser Ser Asn Arg Ala Leu Ser Leu Gln His Ala Gly Arg Phe Ser Val Asn Trp Phe Ile Leu Phe Asn Asp Ala Leu Val His Ala Gln Phe Ser Thr His His Val Phe Pro Leu Ala Thr Leu Trp Ala Glu Pro Leu Ser Glu Glu Ala Gly Gly Val Asn Gly Leu Lys Ile Thr Thr Pro Glu Glu Gln Phe Thr Leu Ile Ser Ser Thr Pro Gln Glu Lys Thr Lys Trp Leu Arg Ala Ile Ser Gln Ala Val Asp Gln Ala Leu Arg Gly Met Ser Asp Leu Pro Pro Tyr Gly Ser Gly Ser Ser Val Gln Arg Gln Glu Pro Pro Ile Ser Arg Ser Ala Lys Tyr Thr Phe Tyr Lys Asp Pro Arg Leu Lys Asp Ala Thr Tyr Asp Gly Arg Trp Leu Ser Gly Lys Pro His Gly Arg Gly Val Leu Lys Trp Pro Asp Gly Lys Met Tyr Ser Gly Met Phe Arg Asn Gly Leu Glu Asp Gly Tyr Gly Glu Tyr Arg SUBSTITUTE SHEET (RULE 26) Ile Pro Asn Lys Ala Met Asn Lys Glu Asp His Tyr Val Gly His Trp Lys Glu Gly Lys Met Cys Gly Gln Gly Val Tyr Ser Tyr Ala Ser Gly Glu Val Phe Glu Gly Cys Phe Gln Asp Asn Met Arg His Gly His Gly Leu Leu Arg Ser Gly Lys Leu Thr Ser Ser Ser Pro Ser Met Phe Ile Gly Gln Trp Val Met Asp Lys Lys Ala Gly Tyr Gly Val Phe Asp Asp Ile Thr Arg Gly Glu Lys Tyr Met Gly Met Trp Gln Asp Asp Val Cys Gln Gly Asn Gly Val Val Val Thr Gln Phe Gly Leu Tyr Tyr Glu Gly Asn Phe His Leu Asn Lys Met Met Gly Asn Gly Val Leu Leu Ser Glu Asp Asp Thr Ile Tyr Glu Gly Glu Phe Ser Asp Asp Trp Thr Leu Ser Gly Lys Gly Thr Leu Thr Met Pro Asn Gly Asp Tyr Ile Glu Gly Tyr Phe Ser Gly Glu Trp Gly Ser Gly Ile Lys Ile Thr Gly Thr Tyr Phe Lys Pro Ser Leu Tyr Glu Ser Asp Lys Asp Arg Pro Lys Val Phe Arg Lys Leu Gly Asn Leu Ala Val Pro Ala Asp Glu Lys Trp Lys Ala Val Phe Asp Glu Cys Trp Arg Gln Leu Gly Cys Glu Gly Pro Gly Gln Gly Glu Val Trp Lys Ala Trp Asp Asn Ile Ala Val Ala Leu Thr Thr Ser Arg Arg Gln His Arg Asp Ser Pro Glu Ile Leu Ser Arg Ser Gln Thr Gln Thr Leu Glu Ser Leu Glu Phe Ile Pro Gln His Val Gly Ala Phe Ser Val Glu Lys Tyr Asp Asp Ile Arg Lys Tyr Leu Ile Lys Ala Cys Asp Thr Pro Leu His Pro Leu Gly Arg Leu Val Glu Thr Leu Val Ala Val Tyr Arg Met Thr Tyr Val Gly Val Gly Ala Asn Arg Arg Leu Leu Gln Glu Ala Val Lys Glu Ile Lys Ser Tyr Leu Lys Arg Ile Phe Gln Leu Val Arg Phe Leu Phe Pro Glu Leu Pro Glu Glu Gly Ser Thr Ile Pro Leu Ser Ala Pro Leu Pro Thr Glu Arg Lys Ser Phe Cys Thr Gly Lys Ser Asp Ser Arg Ser Glu Ser Pro Glu Pro Gly Tyr Val Val Thr Ser Ser Gly Leu Leu Leu Pro Val Leu Leu Pro Arg Leu Tyr Pro Pro Leu Phe Met Leu Tyr Ala Leu Asp Asn Asp Arg Glu Glu Asp Ile Tyr Trp Glu Cys Val Leu Arg Leu Asn Lys Gln Pro Asp Ile Ala Leu Leu Gly Phe Leu Gly Val Gln Arg Lys Phe Trp Pro Ala Thr Leu Ser Ile Leu Gly Glu Ser Lys Lys Val Leu Pro Thr Thr Lys Asp Ala Cys Phe Ala Ser Ala Val Glu Cys Leu Gln Gln Ile Ser Thr Thr Phe Thr Pro Ser Asp Lys Leu Lys Val Ile Gln Gln Thr Phe Glu Glu Ile Ser Gln SUBSTITUTE SHEET (RULE 26) Ser Val Leu Ala Ser Leu His Glu Asp Phe Leu Trp Ser Met Asp Asp Leu Phe Pro Val Phe Leu Tyr Val Val Leu Arg Ala Arg Ile Arg Asn Leu Gly Ser Glu Val His Leu Ile Glu Asp Leu Met Asp Pro Tyr Leu Gln His Gly Glu Gln Gly Ile Met Phe Thr Thr Leu Lys Ala Cys Tyr Tyr Gln Ile Gln Arg Glu Lys Leu Asn <210> 3 <211> 49 <212> PRT
<213> Homo Sapiens <400> 3 Met Asp Ser Lys Lys Arg Ser Ser Thr Glu Ala Glu Gly Ser Lys Glu Arg Gly Leu Val His Ile Trp Gln Ala Gly Ser Phe Pro Ile Thr Pro Glu Arg Leu Pro Gly Trp Gly Gly Lys Thr Val Leu Gln Ala Pro Ser Glu <210> 4 <211> 6349 <212> DNA
<213> Mus musculus <220>
<221> CDS
<222> (124)..(5079) <400> 4 ccacgcgtcc ggcggtgcag tcgggctcgc gccgggagaa gagcgcggag ctgcgggagc 60 gtcaggtctt gagagagctt ttgctaatgg gatggtttgg tgatggagta ctcctcctga 120 ccg atg gac tca aag aag aaa agc tca aca gag gca gaa gga tcc aaa 168 Met Asp Ser Lys Lys Lys Ser Ser Thr Glu Ala Glu Gly Ser Lys gaa aga ggc cta gtc cat gtc tgg cag gca gga tcc ttt tct cta aca 216 Glu Arg Gly Leu Val His Val Trp Gln Ala Gly Ser Phe Ser Leu Thr cca gag agg ttg cca ggc tgg ggt gga aag aca gtt ctt cag gca gcc 264 Pro Glu Arg Leu Pro Gly Trp Gly Gly Lys Thr Val Leu Gln Ala Ala ctt ggt gtg agg cat gga gtt ctt ctg act gaa gat ggt gag gtc tac 312 Leu Gly Val Arg His Gly Val Leu Leu Thr Glu Asp Gly Glu Val Tyr agc ttt ggg act ctt ccc tgg aaa agt gaa tca gca gaa att tgt cca 360 Ser Phe Gly Thr Leu Pro Trp Lys Ser Glu Ser Ala Glu Ile Cys Pro agc agc ccc ctt cta gaa agt gcc ctg gtt ggg cat cat gtt att act 408 Ser Ser Pro Leu Leu Glu Ser Ala Leu Val Gly His His Val Ile Thr SUBSTITUTE SHEET (RULE 26) gtg gca aca ggg agc ttc cac agt gga gca gtg aca gag agc ggg gtg 456 Val Ala Thr Gly Ser Phe His Ser Gly Ala Val Thr Glu Ser Gly Val gtg tac atg tgg gga gag aat get gcc ggg cag tgt gcg gta get aac 504 Val Tyr Met Trp Gly Glu Asn Ala Ala Gly Gln Cys Ala Val Ala Asn cag cag tat gtg ccg gag ccg agt cct gtc agc att tct gac tcg gag 552 Gln Gln Tyr Val Pro Glu Pro Ser Pro Val Ser Ile Ser Asp Ser Glu acc agc ccg tca tta gca gtt agg att ctg caa ttg gca tgt ggc gag 600 Thr Ser Pro Ser Leu Ala Val Arg Ile Leu Gln Leu Ala Cys Gly Glu gaa cac aca ctg gca ttg tca ctc agc aga gag atc tgg gca tgg ggc 648 Glu His Thr Leu Ala Leu Ser Leu Ser Arg Glu Ile Trp Ala Trp Gly acc ggc tgt cag ctg ggc ctc atc acc acc act ttc cca gtg aca aag 696 Thr Gly Cys Gln Leu Gly Leu Ile Thr Thr Thr Phe Pro Val Thr Lys cca cag aag gtg gaa cac ctt get gga cga gtg gtg ctc cag gtg gcc 744 Pro Gln Lys Val Glu His Leu Ala Gly Arg Val Val Leu Gln Val Ala tgc ggt gca ttc cac agc ctt gca ctt gtg cag tgc ctc cct cct cag 792 Cys Gly Ala Phe His Ser Leu Ala Leu Val Gln Cys Leu Pro Pro Gln gat ctg aag cca gtc cca gag aga tgc aat cag tgc agc cag ctg ctc 840 Asp Leu Lys Pro Val Pro Glu Arg Cys Asn Gln Cys Ser Gln Leu Leu atc acc atg aca gac aaa gag gac cat gtg ata ata tcg gac agc cat 888 Ile Thr Met Thr Asp Lys Glu Asp His Val Ile Ile Ser Asp Ser His tgc tgc cct tta ggt gtg aca ttg tcc gag tct caa gca gaa aag cat 936 Cys Cys Pro Leu Gly Val Thr Leu Ser Glu Ser Gln Ala Glu Lys His gcc agc cct get ccc agc cct cac cca gag gca ctg gat gag cag gga 984 Ala Ser Pro Ala Pro Ser Pro His Pro Glu Ala Leu Asp Glu Gln Gly gag gtg ttt gag aac acg gtg gta gaa get gaa ctg aac atg gga agc 1032 Glu Val Phe Glu Asn Thr Val Val Glu Ala Glu Leu Asn Met Gly Ser agt cag acc aca agt ggc agt gcc att tcc acc cag cag aac atc gtg 1080 Ser Gln Thr Thr Ser Gly Ser Ala Ile Ser Thr Gln Gln Asn Ile Val gga aca get gaa gtg tct tct gcc aga aca get ccg tca tac cca gac 1128 Gly Thr Ala Glu Val Ser Ser Ala Arg Thr Ala Pro Ser Tyr Pro Asp acc cat gcg gta act gca tac ctg cag aag ctg tca gag cat tcg atg 1176 Thr His Ala Val Thr Ala Tyr Leu Gln Lys Leu Ser Glu His Ser Met SUBSTITUTE SHEET (RULE 26) agg gag aac cat gag cct gga gaa aag cca ccc cag gtc cag cct ctt 1224 Arg Glu Asn His Glu Pro Gly Glu Lys Pro Pro Gln Val Gln Pro Leu gta gaa gaa gca gtt cct gat ctt cac agt cca cca acc aca agc acc 1272 Val Glu Glu Ala Val Pro Asp Leu His Ser Pro Pro Thr Thr Ser Thr tca gcc ctc aac agc ttg gtg gtc tcc tgt gca tct get gtt ggt gtc 1320 Ser Ala Leu Asn Ser Leu Val Val Ser Cys Ala Ser Ala Val Gly Val aga gtg get gcc acc tat gaa get ggg gcc ttg tct ctc aag aaa gtt 1368 Arg Val Ala Ala Thr Tyr Glu Ala Gly Ala Leu Ser Leu Lys Lys Val atg aac ttt tac agc act gcc ccc tgc gag acg gca get cag tcg ggc 1416 Met Asn Phe Tyr Ser Thr Ala Pro Cys Glu Thr Ala Ala Gln Ser Gly agt gcc tcc aca ggc cca gaa agt ctg aaa gat ctc cga gaa gag cag 1464 Ser Ala Ser Thr Gly Pro Glu Ser Leu Lys Asp Leu Arg Glu Glu Gln gtg aaa cag gag tca ctg caa ggg aag aaa agc tca agt ctc atg gac 1512 Val Lys Gln Glu Ser Leu Gln Gly Lys Lys Ser Ser Ser Leu Met Asp atc aga gag gaa gag tcg gag gga ggg agt cga aga ctc tcc ctc cca 1560 Ile Arg Glu Glu Glu Ser Glu Gly Gly Ser Arg Arg Leu Ser Leu Pro ggg ttg ttg tcg caa gtt tcc ccc agg ctc tta agg aag get gcg agg 1608 Gly Leu Leu Ser Gln Val Ser Pro Arg Leu Leu Arg Lys Ala Ala Arg gtg aaa act cgg aca gtg gtt ctg act ccc aca tac agt gga gaa gca 1656 Val Lys Thr Arg Thr Val Val Leu Thr Pro Thr Tyr Ser Gly Glu Ala gat gcc ctt ctg cct tcc ctg agg aca gag gtg tgg acc tgg ggg aaa 1704 Asp Ala Leu Leu Pro Ser Leu Arg Thr Glu Val Trp Thr Trp Gly Lys ggc aag gaa ggg cag cta ggg cac ggc gac gtc ctg ccc agg ctt cag 1752 Gly Lys Glu Gly Gln Leu Gly His Gly Asp Val Leu Pro Arg Leu Gln ccg ttg tgt gtc aag tgt ctg gat ggt aaa gag gta atc cac ctg gag 1800 Pro Leu Cys Val Lys Cys Leu Asp Gly Lys Glu Val Ile His Leu Glu gcg ggc ggc tcc cac tcc ctc gca ctc act gcg aaa tct cag gtt tac 1848 Ala Gly Gly Ser His Ser Leu Ala Leu Thr Ala Lys Ser Gln Val Tyr tca tgg ggc agt aat acc ttt ggt cag ctt ggg cat tct gag ttt cca 1896 Ser Trp Gly Ser Asn Thr Phe Gly Gln Leu Gly His Ser Glu Phe Pro SUBSTITUTE SHEET (RULE 26) aca acg gtt cct cga ctc tca aag gtt agc agt gaa aat gga gtc tgg 1944 Thr Thr Val Pro Arg Leu Ser Lys Val Ser Ser Glu Asn Gly Val Trp agt gta get gca ggc caa gat tat tcc ttg ttt tta gtg gac acg gaa 1992 Ser Val Ala Ala Gly Gln Asp Tyr Ser Leu Phe Leu Val Asp Thr Glu gac ttc cag cct ggg ttg tat tac agt ggc cga cag gac cgt gca gaa 2040 Asp Phe Gln Pro Gly Leu Tyr Tyr Ser Gly Arg Gln Asp Arg Ala Glu ggt gat acc ctg cca gag aat ccc agt ggt aca aag act cca gta ctt 2088 Gly Asp Thr Leu Pro Glu Asn Pro Ser Gly Thr Lys Thr Pro Val Leu ctc tcc tgt agt aag ctt gga tac ata agc aga gta aca gca gga aaa 2136 Leu Ser Cys Ser Lys Leu Gly Tyr Ile Ser Arg Val Thr Ala Gly Lys gat agc tat cta gcc ttg gtg gat aag aac atc atg gga tac atc gcc 2184 Asp Ser Tyr Leu Ala Leu Val Asp Lys Asn Ile Met Gly Tyr Ile Ala agt ctc cat gag ttg get tct aca gaa aga cgg ttt tac tca aaa ctg 2232 Ser Leu His Glu Leu Ala Ser Thr Glu Arg Arg Phe Tyr Ser Lys Leu agc gaa atc aaa tca cag ata ctt agg cct ctt ctc agt tta gaa aat 2280 Ser Glu Ile Lys Ser Gln Ile Leu Arg Pro Leu Leu Ser Leu Glu Asn ttg ggc aca gtg acc act gtc cag ctg ttg cag gaa gtt gcc agc cgg 2328 Leu Gly Thr Val Thr Thr Val Gln Leu Leu Gln Glu Val Ala Ser Arg ttc agc aag ttg tgt tac ctc att ggg cag cat gga gcc tca cta agc 2376 Phe Ser Lys Leu Cys Tyr Leu Ile Gly Gln His Gly Ala Ser Leu Ser agc tac cta cag ggt atg aag gaa gcc agc agc ctg gtc atc atg aag 2424 Ser Tyr Leu Gln Gly Met Lys Glu Ala Ser Ser Leu Val Ile Met Lys cac tca agt ctt ttc ctg gac agc tac aca gag tac tgc aca tca gtt 2472 His Ser Ser Leu Phe Leu Asp Ser Tyr Thr Glu Tyr Cys Thr Ser Val tca aat ttc ctg gtt atg gga gga ttc cag ctt ctt get aag cct gcc 2520 Ser Asn Phe Leu Val Met Gly Gly Phe Gln Leu Leu Ala Lys Pro Ala att gat ttc cta aat aaa aac caa gaa ctc ttg caa gat ttg tca gaa 2568 Ile Asp Phe Leu Asn Lys Asn Gln Glu Leu Leu Gln Asp Leu Ser Glu gtg aat gat gag aac act cag ttg atg gaa atc ctg aac atg ctg ttt 2616 Val Asn Asp Glu Asn Thr Gln Leu Met Glu Ile Leu Asn Met Leu Phe ttc ttg cca atc aga cga ctt cat aat tat gca aaa gtt ttg cta aag 2664 Phe Leu Pro Ile Arg Arg Leu His Asn Tyr Ala Lys Val Leu Leu Lys SUBSTITUTE SHEET (RULE 26) ctt gcc act tgc ttt gaa gtg aca tct cca gag tat caa aag ctg cag 2712 Leu Ala Thr Cys Phe Glu Val Thr Ser Pro Glu Tyr Gln Lys Leu Gln gat tcc agt tct tgc tat gag tct ctt get ctc cat ctt ggc aag aag 2760 Asp Ser Ser Ser Cys Tyr Glu Ser Leu Ala Leu His Leu Gly Lys Lys agg aag gaa gca gag tac aca ctg agc ttc tgg aag acc ttt cct ggg 2808 Arg Lys Glu Ala Glu Tyr Thr Leu Ser Phe Trp Lys Thr Phe Pro Gly aaa atg acg gat tcc ttg agg aag cca gag cgc cgg ctg ctg tgt gag 2856 Lys Met Thr Asp Ser Leu Arg Lys Pro Glu Arg Arg Leu Leu Cys Glu agc agt aac cga gcc ctc tcc ctg cag cat gcc ggc agg ttt tct gtg 2904 Ser Ser Asn Arg Ala Leu Ser Leu Gln His Ala Gly Arg Phe Ser Val aat tgg ttc att ctc ttc aat gat gcc ctg gtc cat get cag ttc tct 2952 Asn Trp Phe Ile Leu Phe Asn Asp Ala Leu Val His Ala Gln Phe Ser aca cac cac gtg ttc cct ttg gcc aca ctc tgg gca gag cca cta tct 3000 Thr His His Val Phe Pro Leu Ala Thr Leu Trp Ala Glu Pro Leu Ser gaa gaa get ggt agc gtg aat ggc tta aag ata act aca cct gaa gaa 3048 Glu Glu Ala Gly Ser Val Asn Gly Leu Lys Ile Thr Thr Pro Glu Glu caa ttc aca ctc att tct tca aca ccc cag gaa aag acc aag tgg ctt 3096 Gln Phe Thr Leu Ile Ser Ser Thr Pro Gln Glu Lys Thr Lys Trp Leu cgg get att agc caa get gtg gat cag get ttg agg ggg acg tcc gat 3144 Arg Ala Ile Ser Gln Ala Val Asp Gln Ala Leu Arg Gly Thr Ser Asp ttc cca ctt tac gga ggc ggc agc agt gtt cag aga cag gaa cca ccc 3192 Phe Pro Leu Tyr Gly Gly Gly Ser Ser Val Gln Arg Gln Glu Pro Pro atc tca aga agt gcc aaa tac act ttc tac aag gat act cgc cta aag 3240 Ile Ser Arg Ser Ala Lys Tyr Thr Phe Tyr Lys Asp Thr Arg Leu Lys gat gcc act tac gat ggg cgc tgg ctt tca ggg aag cct cat ggc agg 3288 Asp Ala Thr Tyr Asp Gly Arg Trp Leu Ser Gly Lys Pro His Gly Arg ggt gtt ctg aag tgg cct gat gga aag atg tac tct ggc atg ttc agg 3336 Gly Val Leu Lys Trp Pro Asp Gly Lys Met Tyr Ser Gly Met Phe Arg aat ggc ttg gaa gat gga tat ggt gaa tac aga atc cct aac aag gcc 3384 Asn Gly Leu Glu Asp Gly Tyr Gly Glu Tyr Arg Ile Pro Asn Lys Ala SUBSTITUTE SHEET (RULE 26) ctg aac aaa gaa gac cat tat gta ggc cat tgg aaa gag ggg aaa atg 3432 Leu Asn Lys Glu Asp His Tyr Val Gly His Trp Lys Glu Gly Lys Met tgt ggg caa gga gtc tac agc tat gcc tct ggt gaa gtg ttt gaa ggc 3480 Cys Gly Gln Gly Val Tyr Ser Tyr Ala Ser Gly Glu Val Phe Glu Gly tgc ttt caa gat aac atg cgc cat ggg cat ggt ctg ctc cgg agt gga 3528 Cys Phe Gln Asp Asn Met Arg His Gly His Gly Leu Leu Arg Ser Gly aaa ctg act tct tct tct cct agc atg ttc att ggc cag tgg gta atg 3576 Lys Leu Thr Ser Ser Ser Pro Ser Met Phe Ile Gly Gln Trp Val Met gat aag aaa gca gga tat ggc gtc ttt gat gat atc acc agg gga gaa 3624 Asp Lys Lys Ala Gly Tyr Gly Val Phe Asp Asp Ile Thr Arg Gly Glu aag tac atg gga atg tgg cag gat gat gtg tgc caa ggg aat ggg gta 3672 Lys Tyr Met Gly Met Trp Gln Asp Asp Val Cys Gln Gly Asn Gly Val gta gtc acc cag ttt ggg tta tac tac gaa ggc aac ttc cac ctg aat 3720 Val Val Thr Gln Phe Gly Leu Tyr Tyr Glu Gly Asn Phe His Leu Asn aag atg atg gga aat ggg gtt ttg ctt tct gaa gat gat acc atc tat 3768 Lys Met Met Gly Asn Gly Val Leu Leu Ser Glu Asp Asp Thr Ile Tyr gaa gga gaa ttt tcc gat gac tgg aca ctt agt gga aag gga acg ctg 3816 Glu Gly Glu Phe Ser Asp Asp Trp Thr Leu Ser Gly Lys Gly Thr Leu act atg cca cat gga gat tat att gaa ggt tat ttt agt gga gaa tgg 3864 Thr Met Pro His Gly Asp Tyr Ile Glu Gly Tyr Phe Ser Gly Glu Trp gga tct ggg ata aaa atc act ggg acc tac ttc aaa cct agc ctg tat 3912 Gly Ser Gly Ile Lys Ile Thr Gly Thr Tyr Phe Lys Pro Ser Leu Tyr gag agc gat aag gac aag ccc aaa gcc ttc agg aag ctg ggg aac ctg 3960 Glu Ser Asp Lys Asp Lys Pro Lys Ala Phe Arg Lys Leu Gly Asn Leu gcc gtg gca gca gac gag aaa tgg aga gca gtg ttt gaa gaa tgc tgg 4008 Ala Val Ala Ala Asp Glu Lys Trp Arg Ala Val Phe Glu Glu Cys Trp cac cag ctg ggc tgt gag agc cca ggc caa ggg gag gtt tgg aaa gca 4056 His Gln Leu Gly Cys Glu Ser Pro Gly Gln Gly Glu Val Trp Lys Ala tgg gat aat att get gtg gcc ttg acc acg aac cgt cgc cag cat aaa 4104 Trp Asp Asn Ile Ala Val Ala Leu Thr Thr Asn Arg Arg Gln His Lys gac agt cca gaa ata cta agc cgc tct cag act cag acc ctg gag agt 4152 Asp Ser Pro Glu Ile Leu Ser Arg Ser Gln Thr Gln Thr Leu Glu Ser SUBSTITUTE SHEET (RULE 26) ct ttg gag tac att ccc cag cac att ggc gcc ttc tct gtg gag aaa tat 4200 Leu Glu Tyr Ile Pro Gln His Ile Gly Ala Phe Ser Val Glu Lys Tyr gat gac atc aag aag tat tta ata aag gcc tgt gat act cct ctg cac 4248 Asp Asp Ile Lys Lys Tyr Leu Ile Lys Ala Cys Asp Thr Pro Leu His cca ctg ggc agg ctt gtg gag acc ctg gtt gcg gtg tat aga atg aca 4296 Pro Leu Gly Arg Leu Val Glu Thr Leu Val Ala Val Tyr Arg Met Thr tat gtg ggt gta ggg gcc aac cgc cgg tta ctg cag gaa get gtg aag 4344 Tyr Val Gly Val Gly Ala Asn Arg Arg Leu Leu Gln Glu Ala Val Lys gag att aaa tct tat ctc aag agg att ttc cag ctt gtg agg ttc ttg 4392 Glu Ile Lys Ser Tyr Leu Lys Arg Ile Phe Gln Leu Val Arg Phe Leu ttt cct gag ctt cct gag gag ggc agc aca att cct ctt tct get cct 4440 Phe Pro Glu Leu Pro Glu Glu Gly Ser Thr Ile Pro Leu Ser Ala Pro ctg ccc act gga agg aga tcc ttc tgt act ggg aaa ttg gat tcc aga 4488 Leu Pro Thr Gly Arg Arg Ser Phe Cys Thr Gly Lys Leu Asp Ser Arg tcc gag tca cca gaa cca ggt tat gta gta aca agt tct ggc tta ctg 4536 Ser Glu Ser Pro Glu Pro Gly Tyr Val Val Thr Ser Ser Gly Leu Leu ctt ccg gtg ctg ctg ccg cgg ctc tac cca cct ctc ttc atg ctc tat 4584 Leu Pro Val Leu Leu Pro Arg Leu Tyr Pro Pro Leu Phe Met Leu Tyr gcc ctg gat aat gac cga gag gaa gac att tac tgg gaa tgt gtg ctt 4632 Ala Leu Asp Asn Asp Arg Glu Glu Asp Ile Tyr Trp Glu Cys Val Leu cga cta aac aag cag cca gat att get ctc ttg ggc ttc ctt gga gta 4680 Arg Leu Asn Lys Gln Pro Asp Ile Ala Leu Leu Gly Phe Leu Gly Val cag aaa aaa ttc tgg cca gcc acc ttg tca atc ctt gga gag agt aaa 4728 Gln Lys Lys Phe Trp Pro Ala Thr Leu Ser Ile Leu Gly Glu Ser Lys aag gtg ttg tca acc aca aag gat get tgc ttt gca tct gca gta gaa 4776 Lys Val Leu Ser Thr Thr Lys Asp Ala Cys Phe Ala Ser Ala Val Glu tgc ctg cag cag atc agc aca aca ttt act cca tca gac aag ctt aaa 4824 Cys Leu Gln Gln Ile Ser Thr Thr Phe Thr Pro Ser Asp Lys Leu Lys gtg atc cag cag acc ttt gaa gag atc tcc cag agt gtc ctt gca tcg 4872 Val Ile Gln Gln Thr Phe Glu Glu Ile Ser Gln Ser Val Leu Ala Ser SUBSTITUTE SHEET (RULE 26) ctg cag gag gac ttc ctc tgg tcc atg gat gac ttg ttc ccc gtc ttc 4920 Leu Gln Glu Asp Phe Leu Trp Ser Met Asp Asp Leu Phe Pro Val Phe tta tac gtg gtg ctg cgg gcc agg att cgg aac ttg ggc tct gaa gtt 4968 Leu Tyr Val Val Leu Arg Ala Arg Ile Arg Asn Leu Gly Ser Glu Val cac ctc att gag gat ctg atg gac ccc ttt ctc cag cat ggg gaa caa 5016 His Leu Ile Glu Asp Leu Met Asp Pro Phe Leu Gln His Gly Glu Gln ggc atc atg ttc acc acc ttg aag gcc tgt tac ttc cag att cag cgg 5064 Gly Ile Met Phe Thr Thr Leu Lys Ala Cys Tyr Phe Gln Ile Gln Arg gag aag ctt aac tag ggcgcctgac agcttgagga ccggattatc tgctgcggag 5119 Glu Lys Leu Asn gctacagcta tggcacaggc accgactgga ggctgatggg gcaaagaaca gtgttgaata 5179 cagaatggac ttttgtgcta ttttggttgt aatttctgag ccttactaat aattagagcc 5239 cagcatggaa aacatactgt atcattcaaa tggagactgg aaaaggagat agggatagag 5299 tagagtcttt ggcctgtgct gagatccaca cacctactta gaaaaggaaa ctggttaccc 5359 tttcctgtag tgaaagctct cagctccatg cagttccagg aaacctttcc aggaaagctg 5419 cttagatgaa aagaagttga tgactgtgtt taagctcctg gtttgtctaa ttccatttgc 5479 agttacccaa taccctttgg caaggagcag gttttacttg aaactgaagc agccatccct 5539 tgccttccta gacctctcgc tcccaggcac aagtgcagca tgctactttg ctaggggtgg 5599 gggtggggga gaagaagttt taaactgtag ttttaacctt ttgtaagccc ctttaccaag 5659 gcatttgtgg tcagagagct cccacggggt gactatgaca tcctggtccc ctcgtggaat 5719 gcatccacat aggatcttct gcctgctgac tgaaaagaac ataggaatac actggagtgc 5779 aaacactgcc gtgccaagct gctccaaacc tcactgatcc gaggcccact gcctacccag 5839 gaggcccact gcctacccag gaggcccgta agcttcttag cacaagcttt gtgtggagac 5899 tgaagatctg cacatgtgag gaagcaggga gctacagtgg ccctcagccc agtctgcggg 5959 tcttccctct acctcacact gaactcagaa gggaaggaag gagagacgca catgggattc 6019 tcccacctca gaagtattgt gacagcaccg cataaccacg gtttgctctt ttacaagcag 6079 cctcacaagt gtgggttgtg ggtgtgcgct ggagcagtgc cactcgtagc tgtttggata 6139 ccacaggtgc tcttccgtct catctgctgt actcggaggc gagcgcagtg gcctgactca 6199 tgggaaatga ctcagcaggc ggcaactact tttgaaaagg atcatgattt ccgagctact 6259 ttataatcaa gacaagcatt tgttaacata ctgttttgga atgttggctg taatgtaaca 6319 gcagttttca taataaatga cattcatctc 6349 <210> 5 <211> 1651 <212> PRT
<213> Mus musculus <400> 5 Met Asp Ser Lys Lys Lys Ser Ser Thr Glu Ala Glu Gly Ser Lys Glu Arg Gly Leu Val His Val Trp Gln Ala Gly Ser Phe Ser Leu Thr Pro Glu Arg Leu Pro Gly Trp Gly Gly Lys Thr Val Leu Gln Ala Ala Leu Gly Val Arg His Gly Val Leu Leu Thr Glu Asp Gly Glu Val Tyr Ser Phe Gly Thr Leu Pro Trp Lys Ser Glu Ser Ala Glu Ile Cys Pro Ser Ser Pro Leu Leu Glu Ser Ala Leu Val Gly His His Val Ile Thr Val Ala Thr Gly Ser Phe His Ser Gly Ala Val Thr Glu Ser Gly Val Val SUBSTITUTE SHEET (RULE 26) Tyr Met Trp Gly Glu Asn Ala Ala Gly Gln Cys Ala Val Ala Asn Gln Gln Tyr Val Pro Glu Pro Ser Pro Val Ser Ile Ser Asp Ser Glu Thr Ser Pro Ser Leu Ala Val Arg Ile Leu Gln Leu Ala Cys Gly Glu Glu His Thr Leu Ala Leu Ser Leu Ser Arg Glu Ile Trp Ala Trp Gly Thr Gly Cys Gln Leu Gly Leu Ile Thr Thr Thr Phe Pro Val Thr Lys Pro Gln Lys Val Glu His Leu Ala Gly Arg Val Val Leu Gln Val Ala Cys Gly Ala Phe His Ser Leu Ala Leu Val Gln Cys Leu Pro Pro Gln Asp Leu Lys Pro Val Pro Glu Arg Cys Asn Gln Cys Ser Gln Leu Leu Ile Thr Met Thr Asp Lys Glu Asp His Val Ile Ile Ser Asp Ser His Cys Cys Pro Leu Gly Val Thr Leu Ser Glu Ser Gln Ala Glu Lys His Ala Ser Pro Ala Pro Ser Pro His Pro Glu Ala Leu Asp Glu Gln Gly Glu Val Phe Glu Asn Thr Val Val Glu Ala Glu Leu Asn Met Gly Ser Ser Gln Thr Thr Ser Gly Ser Ala Ile Ser Thr Gln Gln Asn Ile Val Gly Thr Ala Glu Val Ser Ser Ala Arg Thr Ala Pro Ser Tyr Pro Asp Thr His Ala Val Thr Ala Tyr Leu Gln Lys Leu Ser Glu His Ser Met Arg Glu Asn His Glu Pro Gly Glu Lys Pro Pro Gln Val Gln Pro Leu Val Glu Glu Ala Val Pro Asp Leu His Ser Pro Pro Thr Thr Ser Thr Ser Ala Leu Asn Ser Leu Val Val Ser Cys Ala Ser Ala Val Gly Val Arg Val Ala Ala Thr Tyr Glu Ala Gly Ala Leu Ser Leu Lys Lys Val Met Asn Phe Tyr Ser Thr Ala Pro Cys Glu Thr Ala Ala Gln Ser Gly Ser Ala Ser Thr Gly Pro Glu Ser Leu Lys Asp Leu Arg Glu Glu Gln Val Lys Gln Glu Ser Leu Gln Gly Lys Lys Ser Ser Ser Leu Met Asp Ile Arg Glu Glu Glu Ser Glu Gly Gly Ser Arg Arg Leu Ser Leu Pro Gly Leu Leu Ser Gln Val Ser Pro Arg Leu Leu Arg Lys Ala Ala Arg Val Lys Thr Arg Thr Val Val Leu Thr Pro Thr Tyr Ser Gly Glu Ala Asp Ala Leu Leu Pro Ser Leu Arg Thr Glu Val Trp Thr Trp Gly Lys Gly Lys Glu Gly Gln Leu Gly His Gly Asp Val Leu Pro Arg Leu Gln Pro Leu Cys Val Lys Cys Leu Asp Gly Lys Glu Val Ile His Leu Glu Ala Gly Gly Ser His Ser Leu Ala Leu Thr Ala Lys Ser Gln Val Tyr Ser Trp Gly Ser Asn Thr Phe Gly Gln Leu Gly His Ser Glu Phe Pro Thr Thr Val Pro Arg Leu Ser Lys Val Ser Ser Glu Asn Gly Val Trp Ser SUBSTITUTE SHEET (RULE 26) Val Ala Ala Gly Gln Asp Tyr Ser Leu Phe Leu Val Asp Thr Glu Asp Phe Gln Pro Gly Leu Tyr Tyr Ser Gly Arg Gln Asp Arg Ala Glu Gly Asp Thr Leu Pro Glu Asn Pro Ser Gly Thr Lys Thr Pro Val Leu Leu Ser Cys Ser Lys Leu Gly Tyr Ile Ser Arg Val Thr Ala Gly Lys Asp Ser Tyr Leu Ala Leu Val Asp Lys Asn Ile Met Gly Tyr Ile Ala Ser Leu His Glu Leu Ala Ser Thr Glu Arg Arg Phe Tyr Ser Lys Leu Ser Glu Ile Lys Ser Gln Ile Leu Arg Pro Leu Leu Ser Leu Glu Asn Leu Gly Thr Val Thr Thr Val Gln Leu Leu Gln Glu Val Ala Ser Arg Phe Ser Lys Leu Cys Tyr Leu Ile Gly Gln His Gly Ala Ser Leu Ser Ser Tyr Leu Gln Gly Met Lys Glu Ala Ser Ser Leu Val Ile Met Lys His Ser Ser Leu Phe Leu Asp Ser Tyr Thr Glu Tyr Cys Thr Ser Val Ser Asn Phe Leu Val Met Gly Gly Phe Gln Leu Leu Ala Lys Pro Ala Ile Asp Phe Leu Asn Lys Asn Gln Glu Leu Leu Gln Asp Leu Ser Glu Val Asn Asp Glu Asn Thr Gln Leu Met Glu Ile Leu Asn Met Leu Phe Phe Leu Pro Ile Arg Arg Leu His Asn Tyr Ala Lys Val Leu Leu Lys Leu Ala Thr Cys Phe Glu Val Thr Ser Pro Glu Tyr Gln Lys Leu Gln Asp Ser Ser Ser Cys Tyr Glu Ser Leu Ala Leu His Leu Gly Lys Lys Arg Lys Glu Ala Glu Tyr Thr Leu Ser Phe Trp Lys Thr Phe Pro Gly Lys Met Thr Asp Ser Leu Arg Lys Pro Glu Arg Arg Leu Leu Cys Glu Ser Ser Asn Arg Ala Leu Ser Leu Gln His Ala Gly Arg Phe Ser Val Asn Trp Phe Ile Leu Phe Asn Asp Ala Leu Val His Ala Gln Phe Ser Thr His His Val Phe Pro Leu Ala Thr Leu Trp Ala Glu Pro Leu Ser Glu Glu Ala Gly Ser Val Asn Gly Leu Lys Ile Thr Thr Pro Glu Glu Gln Phe Thr Leu Ile Ser Ser Thr Pro Gln Glu Lys Thr Lys Trp Leu Arg Ala Ile Ser Gln Ala Val Asp Gln Ala Leu Arg Gly Thr Ser Asp Phe Pro Leu Tyr Gly Gly Gly Ser Ser Val Gln Arg Gln Glu Pro Pro Ile Ser Arg Ser Ala Lys Tyr Thr Phe Tyr Lys Asp Thr Arg Leu Lys Asp Ala Thr Tyr Asp Gly Arg Trp Leu Ser Gly Lys Pro His Gly Arg Gly Val Leu Lys Trp Pro Asp Gly Lys Met Tyr Ser Gly Met Phe Arg Asn Gly Leu Glu Asp Gly Tyr Gly Glu Tyr Arg Ile Pro Asn Lys Ala Leu Asn Lys Glu Asp His Tyr Val Gly His Trp Lys Glu Gly Lys Met Cys SUBSTITUTE SHEET (RULE 26) Gly Gln Gly Val Tyr Ser Tyr Ala Ser Gly Glu Val Phe Glu Gly Cys Phe Gln Asp Asn Met Arg His Gly His Gly Leu Leu Arg Ser Gly Lys Leu Thr Ser Ser Ser Pro Ser Met Phe Ile Gly Gln Trp Val Met Asp Lys Lys Ala Gly Tyr Gly Val Phe Asp Asp Ile Thr Arg Gly Glu Lys Tyr Met Gly Met Trp Gln Asp Asp Val Cys Gln Gly Asn Gly Val Val Val Thr Gln Phe Gly Leu Tyr Tyr Glu Gly Asn Phe His Leu Asn Lys Met Met Gly Asn Gly Val Leu Leu Ser Glu Asp Asp Thr Ile Tyr Glu Gly Glu Phe Ser Asp Asp Trp Thr Leu Ser Gly Lys Gly Thr Leu Thr Met Pro His Gly Asp Tyr Ile Glu Gly Tyr Phe Ser Gly Glu Trp Gly Ser Gly Ile Lys Ile Thr Gly Thr Tyr Phe Lys Pro Ser Leu Tyr Glu Ser Asp Lys Asp Lys Pro Lys Ala Phe Arg Lys Leu Gly Asn Leu Ala Val Ala Ala Asp Glu Lys Trp Arg Ala Val Phe Glu Glu Cys Trp His Gln Leu Gly Cys Glu Ser Pro Gly Gln Gly Glu Val Trp Lys Ala Trp Asp Asn Ile Ala Val Ala Leu Thr Thr Asn Arg Arg Gln His Lys Asp Ser Pro Glu Ile Leu Ser Arg Ser Gln Thr Gln Thr Leu Glu Ser Leu Glu Tyr Ile Pro Gln His Ile Gly Ala Phe Ser Val Glu Lys Tyr Asp Asp Ile Lys Lys Tyr Leu Ile Lys Ala Cys Asp Thr Pro Leu His Pro Leu Gly Arg Leu Val Glu Thr Leu Val Ala Val Tyr Arg Met Thr Tyr Val Gly Val Gly Ala Asn Arg Arg Leu Leu Gln Glu Ala Val Lys Glu Ile Lys Ser Tyr Leu Lys Arg Ile Phe Gln Leu Val Arg Phe Leu Phe Pro Glu Leu Pro Glu Glu Gly Ser Thr Ile Pro Leu Ser Ala Pro Leu Pro Thr Gly Arg Arg Ser Phe Cys Thr Gly Lys Leu Asp Ser Arg Ser Glu Ser Pro Glu Pro Gly Tyr Val Val Thr Ser Ser Gly Leu Leu Leu Pro Val Leu Leu Pro Arg Leu Tyr Pro Pro Leu Phe Met Leu Tyr Ala Leu Asp Asn Asp Arg Glu Glu Asp Ile Tyr Trp Glu Cys Val Leu Arg Leu Asn Lys Gln Pro Asp Ile Ala Leu Leu Gly Phe Leu Gly Val Gln Lys Lys Phe Trp Pro Ala Thr Leu Ser Ile Leu Gly Glu Ser Lys Lys Val Leu Ser Thr Thr Lys Asp Ala Cys Phe Ala Ser Ala Val Glu Cys Leu Gln Gln Ile Ser Thr Thr Phe Thr Pro Ser Asp Lys Leu Lys Val Ile Gln Gln Thr Phe Glu Glu Ile Ser Gln Ser Val Leu Ala Ser Leu Gln Glu Asp Phe Leu Trp Ser Met Asp Asp Leu Phe Pro Val Phe Leu SUBSTETUTE SHEET (RULE 26) TyrValVal Leu Ala Arg Ile Arg Leu Gly Ser ValHis Arg Asn Glu LeuIleGlu Asp Met Asp Pro Phe Gln His Gly GlnGly Leu Leu Glu IleMetPhe Thr Leu Lys Ala Cys Phe Gln Ile ArgGlu Thr Tyr Gln LysLeuAsn <210> 6 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Artificial Sequence: Synthesized Oligonucleotide <400> 6 cctagtcatc catgtgctgg 20 <210> 7 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Artificial Sequence: Synthesized Oligonucleotide <400> 7 tcccatacct gaccttccac 20 <210> 8 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Artificial Sequence: Synthesized Oligonucleotide <400> 8 cttgatagac tttctgtaaa gaag 24 <210> 9 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Artificial Sequence: Synthesized Oligonucleotide <400> 9 ggctacttgg acaaatctcc actg 24 <210> 10 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide Sk)BSTfTUTE SHEET (RULE 26) <400> 10 ggagagactg tgctcccaag 20 <210> 11 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 11 agccctcctt agccaatagc 20 <210> 12 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 12 taagcttagt gggcaggctc 20 <210> 13 <211> 22 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 13 ttcccactta acaaccatca ac 22 <210> 14 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 14 ccaatttggt taaatctata gggg 24 <210> 15 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 15 gacaatgcca gagtgtgctc 20 SUBSTITUTE SHEET (RULE 26) <210> 16 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 16 ccagcccttt gttagcagtc 20 <210> 17 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 17 cttcttcctg cctgtcaagg 20 <210> 18 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 18 ttgtacaatg cctcccttcc 20 <210> 19 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 19 agcccaacat gacacctttc 20 <210> 20 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 20 gattgcttgt tgcataaggg 20 <210> 21 SUBSTITUTE SHEET (RULE 26) <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 21 atacagcatg cgatgtcagg 20 <210> 22 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 22 ctggactccc actccttcac 20 <210> 23 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 23 gctagaagag cccagatttc c 21 <210> 24 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 24 tgactttgtg tgcctgtgtg 20 <210> 25 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 25 ataccctgga aaatctgggg 20 <210> 26 <211> 20 <212> DNA
SUBSTITUTE SHEET (RULE 26) <213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 26 tttgcgcatt atctctggtc 20 <210> 27 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 27 gtacgtatga aattcccccg 20 <210> 28 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 28 ttccgtctta ctcctgcacc 20 <210> 29 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 29 gccttaggat ccaattcctg 20 <210> 30 <211> 23 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 30 caatgatgta ctgatgaacc agc 23 <210> 31 <211> 20 <212> DNA
<213> Artificial Sequence SUBSTITUTE SHEET (RULE 26) <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 31 cctgatggtt taatggtggg 20 <210> 32 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 32 gcacatggca acaggttaag 20 <210> 33 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 33 tccttggcag aataaccctg 20 <210> 34 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 34 cccctaccac tccctttacc 20 <210> 35 <211> 23 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 35 ccagtggcta atagtacctg tcc 23 <210> 36 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized SUBSTITUTE SHEET (RULE 26) oligonucleotide <400> 36 tggatgcatg attcatttcc 20 <210> 37 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 37 tccttggctt tccaaatgtc 20 <210> 38 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 38 ctatcctggg gtctctgctg 20 <210> 39 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 39 tgctatcgaa atggttgctg 20 <210> 40 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 40 agctacgacc agcaaattcc 20 <210> 41 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide SUBSTITUTE SHEET (RULE 26) <400> 41 ~
ataggggtcc acctttcagg 20 <210> 42 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 42 aaggggatat gggcagagtc 20 <210> 43 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 43 aaatgcttgc ttggttttgg 20 <210> 44 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 44 aaagggcatc ttcattgcac 20 <210> 45 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 45 cacaagaggc agaaagagcc 20 <210> 46 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 46 aatgcttgat gaattgttgc c 21 SUBSTITUTE SHEET (RULE 26) <210> 47 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 47 atgatcatcc tcaccccagg 20 <210> 48 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 48 ttgaagattt atgcctgggg 20 <210> 49 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 49 tgaggtcaca cggctatcag 20 <210> 50 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 50 gtgtagtggg gctgatgtcc 20 <210> 51 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 51 tggctatgca aacattcagg 20 <210> 52 SUBSTITUTE SHEET (RULE 26) <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 52 aatgcaaaat accacacatg g 21 <210> 53 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 53 tcattggctt aaactgtggg 20 <210> 54 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 54 caacctaggg ttgatgcctg 20 <210> 55 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 55 catcttcgga aagcaaaacc . 20 <210> 56 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 56 ctttggggat atgactgcgt 20 <210> 57 <211> 26 <212> DNA
SUBSTITUTE SHEET (RULE 26) <213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 57 gtaaaagaat ttattaggga gaaaaa 26 <210> 58 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 58 ttcctctaac cccacatttt attc 24 <210> 59 <211> 26 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 59 tgcttttaaa atattaacca gctttg 26 <210> 60 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 60 tcagtcttgg cagttttggt c 21 <210> 61 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 61 ctgctgtatg ttgagcaggt g 21 <210> 62 <211> 20 <212> DNA
<213> Artificial Sequence SUBSTITUTE SHEET (RULE 26) <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 62 tggatgctcc actttgactg 20 <210> 63 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 63 ttaagaaccc ccttgagtgc 20 <210> 64 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 64 ttcctggtcc caaaattgac 20 <210> 65 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 65 cagggtgaaa ctacccaagc 20 <210> 66 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 66 ttttatgctt ttcaaccccc 20 <210> 67 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized SUBSTITUTE SHEET (RULE 26) oligonucleotide <400> 67 acacactttc tcgctgggac 20 <210> 68 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 68 tgatctgagc acaaaggctg 20 <210> 69 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 69 taaacagcgg tgggtagagc 20 <210> 70 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 70 aatgctcctt ttctcccact c 21 <210> 71 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 71 tgccaaattt ccaataatgc 20 <210> 72 <211> 19 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide SUBSTITUTE SHEET (RULE 26) <400> 72 taatggggac aaggaagcc 19 <210> 73 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 73 gctgaggcaa aacaagcatc 20 <210> 74 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 74 ccaaagacct gcactctgac 20 <210> 75 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 75 ctggcttggc tctctcctac 20 <210> 76 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 76 aaaaagcacg atcaaatggc 20 <210> 77 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 77 ggaagagcgt actcctgctg 20 SUBSTITUTE SHEET (RULE 26) <210> 78 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 78 gcaggagtac gctcttccac 20 <210> 79 <211> 22 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 79 gaacaaaatg tgctctaaag gc 22 <210> 80 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 80 tctttttctc tctggggcag 20 <210> 81 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 81 tgccttctgt gttttaccct g 21 <210> 82 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 82 gaagggaaca gggaaaagtg 20 <210> 83 SUBSTITUTE SHEET (RULE 26) <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 83 ttacctccct ttcaatcctc c 21 <210> 84 <211> 545 <212> PRT
<213> Homo sapiens <400> 84 Met Asp Ser Lys Lys Arg Ser Ser Thr Glu Ala Glu Gly Ser Lys Glu Arg Gly Leu Val His Ile Trp Gln Ala Gly Ser Phe Pro Ile Thr Pro Glu Arg Leu Pro Gly Trp Gly Gly Lys Thr Val Leu Gln Ala Ala Leu Gly Val Lys His Gly Val Leu Leu Thr Glu Asp Gly Glu Val Tyr Ser Phe Gly Thr Leu Pro Trp Arg Ser Gly Pro Val Glu Ile Cys Pro Ser Ser Pro Ile Leu Glu Asn Ala Leu Val Gly Gln Tyr Val Ile Thr Val Ala Thr Gly Ser Phe His Ser Gly Ala Val Thr Asp Asn Gly Val Ala Tyr Met Trp Gly Glu Asn Ser Ala Gly Gln Cys Ala Val Ala Asn Gln Gln Tyr Val Pro Glu Pro Asn Pro Val Ser Ile Ala Asp Ser Glu Ala Ser Pro Leu Leu Ala Val Arg Ile Leu Gln Leu Ala Cys Gly Glu Glu 145 . 150 155 160 His Thr Leu Ala Leu Ser Ile Ser Arg Glu Ile Trp Ala Trp Gly Thr Gly Cys Gln Leu Gly Leu Ile Thr Thr Ala Phe Pro Val Thr Lys Pro Gln Lys Val Glu His Leu Ala Gly Arg Val Val Leu Gln Val Ala Cys Gly Ala Phe His Ser Leu Ala Leu Val Gln Cys Leu Pro Ser Gln Asp Leu Lys Pro Val Pro Glu Arg Cys Asn Gln Cys Ser Gln Leu Leu Ile Thr Met Thr Asp Lys Glu Asp His Val Ile Ile Ser Asp Ser His Cys Cys Pro Leu Gly Val Thr Leu Thr Glu Ser Gln Ala Glu Asn His Ala Ser Thr Ala Leu Ser Pro Ser Thr Glu Thr Leu Asp Arg Gln Glu Glu Val Phe Glu Asn Thr Leu Val Ala Asn Asp Gln Ser Val Ala Thr Glu Leu Asn Ala Val Ser Ala Gln Ile Thr Ser Ser Asp Ala Met Ser Ser Gln Gln Asn Val Met Gly Thr Thr Glu Ile Ser Ser Ala Arg Asn Ile Pro Ser Tyr Pro Asp Thr Gln Ala Val Asn Glu Tyr Leu Arg Lys Leu SUBSTITUTE SHEET (RULE 26) Ser Asp His Ser Val Arg Glu Asp Ser Glu His Gly Glu Lys Pro Met Pro Ser Gln Pro Leu Leu Glu Glu Ala Ile Pro Asn Leu His Ser Pro Pro Thr Thr Ser Thr Ser Ala Leu Asn Ser Leu Val Val Ser Cys Ala Ser Ala Val Gly Val Arg Val Ala Ala Thr Tyr Glu Ala Gly Ala Leu Ser Leu Lys Lys Val Met Asn Phe Tyr Ser Thr Thr Pro Cys Glu Thr Gly Ala Gln Ala Gly Ser Ser Ala Ile Gly Pro Glu Gly Leu Lys Asp Ser Arg Glu Glu Gln Val Lys Gln Glu Ser Met Gln Gly Lys Lys Ser Ser Ser Leu Val Asp Ile Arg Glu Glu Glu Thr Gly Arg Gln Ser Lys Thr Leu Pro Pro Trp Ile Val Val Thr Ser Phe Pro Gln Ala Leu Lys Lys Gly Cys Thr Gly Glu Asn Glu Asp Ser Gly Ser Asp Pro His Ile Gln Trp Arg Ser Arg Cys Ala Pro Ala Phe Ser Glu Asn Arg Ser Val Asp Leu Gly Glu Arg Glu Gly Arg Ala Ala Gly Ala Arg Arg Cys Ser Ala SUBSTITUTE SHEET (RULE 26)
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 57 gtaaaagaat ttattaggga gaaaaa 26 <210> 58 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 58 ttcctctaac cccacatttt attc 24 <210> 59 <211> 26 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 59 tgcttttaaa atattaacca gctttg 26 <210> 60 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 60 tcagtcttgg cagttttggt c 21 <210> 61 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 61 ctgctgtatg ttgagcaggt g 21 <210> 62 <211> 20 <212> DNA
<213> Artificial Sequence SUBSTITUTE SHEET (RULE 26) <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 62 tggatgctcc actttgactg 20 <210> 63 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 63 ttaagaaccc ccttgagtgc 20 <210> 64 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 64 ttcctggtcc caaaattgac 20 <210> 65 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 65 cagggtgaaa ctacccaagc 20 <210> 66 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 66 ttttatgctt ttcaaccccc 20 <210> 67 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized SUBSTITUTE SHEET (RULE 26) oligonucleotide <400> 67 acacactttc tcgctgggac 20 <210> 68 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 68 tgatctgagc acaaaggctg 20 <210> 69 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 69 taaacagcgg tgggtagagc 20 <210> 70 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 70 aatgctcctt ttctcccact c 21 <210> 71 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 71 tgccaaattt ccaataatgc 20 <210> 72 <211> 19 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide SUBSTITUTE SHEET (RULE 26) <400> 72 taatggggac aaggaagcc 19 <210> 73 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 73 gctgaggcaa aacaagcatc 20 <210> 74 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 74 ccaaagacct gcactctgac 20 <210> 75 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 75 ctggcttggc tctctcctac 20 <210> 76 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 76 aaaaagcacg atcaaatggc 20 <210> 77 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence: synthesized oligonucleotide <400> 77 ggaagagcgt actcctgctg 20 SUBSTITUTE SHEET (RULE 26) <210> 78 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 78 gcaggagtac gctcttccac 20 <210> 79 <211> 22 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 79 gaacaaaatg tgctctaaag gc 22 <210> 80 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 80 tctttttctc tctggggcag 20 <210> 81 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 81 tgccttctgt gttttaccct g 21 <210> 82 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 82 gaagggaaca gggaaaagtg 20 <210> 83 SUBSTITUTE SHEET (RULE 26) <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: synthesized oligonucleotide <400> 83 ttacctccct ttcaatcctc c 21 <210> 84 <211> 545 <212> PRT
<213> Homo sapiens <400> 84 Met Asp Ser Lys Lys Arg Ser Ser Thr Glu Ala Glu Gly Ser Lys Glu Arg Gly Leu Val His Ile Trp Gln Ala Gly Ser Phe Pro Ile Thr Pro Glu Arg Leu Pro Gly Trp Gly Gly Lys Thr Val Leu Gln Ala Ala Leu Gly Val Lys His Gly Val Leu Leu Thr Glu Asp Gly Glu Val Tyr Ser Phe Gly Thr Leu Pro Trp Arg Ser Gly Pro Val Glu Ile Cys Pro Ser Ser Pro Ile Leu Glu Asn Ala Leu Val Gly Gln Tyr Val Ile Thr Val Ala Thr Gly Ser Phe His Ser Gly Ala Val Thr Asp Asn Gly Val Ala Tyr Met Trp Gly Glu Asn Ser Ala Gly Gln Cys Ala Val Ala Asn Gln Gln Tyr Val Pro Glu Pro Asn Pro Val Ser Ile Ala Asp Ser Glu Ala Ser Pro Leu Leu Ala Val Arg Ile Leu Gln Leu Ala Cys Gly Glu Glu 145 . 150 155 160 His Thr Leu Ala Leu Ser Ile Ser Arg Glu Ile Trp Ala Trp Gly Thr Gly Cys Gln Leu Gly Leu Ile Thr Thr Ala Phe Pro Val Thr Lys Pro Gln Lys Val Glu His Leu Ala Gly Arg Val Val Leu Gln Val Ala Cys Gly Ala Phe His Ser Leu Ala Leu Val Gln Cys Leu Pro Ser Gln Asp Leu Lys Pro Val Pro Glu Arg Cys Asn Gln Cys Ser Gln Leu Leu Ile Thr Met Thr Asp Lys Glu Asp His Val Ile Ile Ser Asp Ser His Cys Cys Pro Leu Gly Val Thr Leu Thr Glu Ser Gln Ala Glu Asn His Ala Ser Thr Ala Leu Ser Pro Ser Thr Glu Thr Leu Asp Arg Gln Glu Glu Val Phe Glu Asn Thr Leu Val Ala Asn Asp Gln Ser Val Ala Thr Glu Leu Asn Ala Val Ser Ala Gln Ile Thr Ser Ser Asp Ala Met Ser Ser Gln Gln Asn Val Met Gly Thr Thr Glu Ile Ser Ser Ala Arg Asn Ile Pro Ser Tyr Pro Asp Thr Gln Ala Val Asn Glu Tyr Leu Arg Lys Leu SUBSTITUTE SHEET (RULE 26) Ser Asp His Ser Val Arg Glu Asp Ser Glu His Gly Glu Lys Pro Met Pro Ser Gln Pro Leu Leu Glu Glu Ala Ile Pro Asn Leu His Ser Pro Pro Thr Thr Ser Thr Ser Ala Leu Asn Ser Leu Val Val Ser Cys Ala Ser Ala Val Gly Val Arg Val Ala Ala Thr Tyr Glu Ala Gly Ala Leu Ser Leu Lys Lys Val Met Asn Phe Tyr Ser Thr Thr Pro Cys Glu Thr Gly Ala Gln Ala Gly Ser Ser Ala Ile Gly Pro Glu Gly Leu Lys Asp Ser Arg Glu Glu Gln Val Lys Gln Glu Ser Met Gln Gly Lys Lys Ser Ser Ser Leu Val Asp Ile Arg Glu Glu Glu Thr Gly Arg Gln Ser Lys Thr Leu Pro Pro Trp Ile Val Val Thr Ser Phe Pro Gln Ala Leu Lys Lys Gly Cys Thr Gly Glu Asn Glu Asp Ser Gly Ser Asp Pro His Ile Gln Trp Arg Ser Arg Cys Ala Pro Ala Phe Ser Glu Asn Arg Ser Val Asp Leu Gly Glu Arg Glu Gly Arg Ala Ala Gly Ala Arg Arg Cys Ser Ala SUBSTITUTE SHEET (RULE 26)
Claims (62)
1. An isolated nucleic acid that codes for a peptide having at least 75%
identity to all of an amino acid sequence selected from the group consisting of SEQ ID NO:2; SEQ
ID
NO:3; SEQ ID NO:5; SEQ ID NO:84; and, amino acids 372-1657 of SEQ ID NO:2.
identity to all of an amino acid sequence selected from the group consisting of SEQ ID NO:2; SEQ
ID
NO:3; SEQ ID NO:5; SEQ ID NO:84; and, amino acids 372-1657 of SEQ ID NO:2.
2. The nucleic acid of claim 1 that codes for a peptide having about 80% or more sequence identity to the selected sequence.
3. The nucleic acid of claim 1 that codes for a peptide having about 85% or more sequence identity to the selected sequence.
4. The nucleic acid of claim 1 that codes for a peptide having about 90% or more sequence identity to the selected sequence.
5. The nucleic acid of claim 1 that codes a peptide having about 95% or more sequence identity to the selected sequence.
6. The nucleic acid of any one of claims 1-5, wherein the selected sequence is SEQ ID
NO:2.
NO:2.
7. The nucleic acid of claim 1, wherein the selected sequence is SEQ ID NO:3.
8. The nucleic acid of claim 1, wherein the selected sequence is SEQ ID NO:5.
9. The nucleic acid of claim 1, wherein the selected sequence is SEQ ID NO:84.
10. An isolated nucleic acid consisting essentially of a nucleotide sequence having at least 75% identity to all of a nucleotide sequence or a complementary sequence thereof, selected from the group consisting of SEQ ID NO:1; SEQ ID NO:4; nucleotides of SEQ ID NO:1; nucleotides 1225-5094 of SEQ ID NO:1; and, nucleotides 124-5076 of SEQ ID NO:4.
11. The nucleic acid of claim 10 having about 80% or more sequence identity to the selected sequence or complementary sequence thereof.
12. The nucleic acid of claim 10 having about 85% or more sequence identity to the selected sequence or complementary sequence thereof.
13. The nucleic acid of claim 10 having about 90% or more sequence identity to the selected sequence or complementary sequence thereof.
14. The nucleic acid of claim 10 having about 95% or more sequence identity to the selected sequence or complementary sequence thereof.
15. The nucleic acid of any one of claims 10-14, wherein the selected sequence is SEQ
ID NO:1.
ID NO:1.
16. The nucleic acid of any one of claims 10-14, wherein the selected sequence is SEQ
ID NO:4.
ID NO:4.
17. The nucleic acid of any one of claims 10-14, wherein the selected sequence is nucleotides 124-5094 of SEQ ID NO:1.
18. The nucleic acid of any one of claims 10-14, wherein the selected sequence is amino acids 124-5076 of SEQ ID NO:4.
19. The isolated nucleic acid of any one of claims 1-18 joined to a second nucleic acid, wherein the second nucleic acid is not naturally associated with the isolated nucleic acid.
20. A recombinant vector comprising a nucleic acid according to any one of claims 1-19.
21. A cell comprising a nucleic acid of claim 19 or a vector of claim 20.
22. An oligonucleotide of 6 to 75 nucleotides, wherein the oligonucleotide hybridizes to a nucleic acid according to any one of claims 1-18 or a complementary sequence thereof, under stringent conditions.
23. The oligonucleotide of claim 22 of about 10 to about 40 nucleotides.
24. The oligonucleotide of claim 22 of about 15 to about 30 nucleotides.
25. The oligonucleotide of claim 22 of about 15 to about 25 nucleotides.
26. The oligonucleotide of any one of claims 22-25 capable of hybridizing under stringent conditions to a nucleic acid encoding a peptide consisting of the sequence of SEQ
ID NO:3 or the complementary nucleic acid sequence thereof, but not to a nucleic acid encoding a peptide consisting of the sequence of SEQ ID NO:2 or the complementary nucleic acid sequence thereof.
ID NO:3 or the complementary nucleic acid sequence thereof, but not to a nucleic acid encoding a peptide consisting of the sequence of SEQ ID NO:2 or the complementary nucleic acid sequence thereof.
27. The oligonucleotide of any one of claims 22-25 capable of hybridizing under stringent conditions to a nucleic acid encoding a peptide consisting of the sequence of SEQ
ID NO:84 or the complementary nucleic acid sequence thereof, but not to a nucleic acid encoding a peptide consisting of the sequence of SEQ ID NO:2 or the complementary nucleic acid sequence thereof.
ID NO:84 or the complementary nucleic acid sequence thereof, but not to a nucleic acid encoding a peptide consisting of the sequence of SEQ ID NO:2 or the complementary nucleic acid sequence thereof.
28. The oligonucleotide of any one of claims 22-27 joined to a label.
29. A kit comprising two or more different oligonucleotides according to any one of claims 22-27 for use in nucleic acid amplification.
30. An isolated peptide comprising a sequence of amino acids coded by a nucleic acid according to any one of claims 1-19 or a recombinant vector according to claim 20.
31. A peptide consisting essentially of a sequence of at least 5 contiguous amino acids from a sequence selected from the group consisting of amino acids 1-46 of SEQ
ID NO:2;
amino acids 47-1657 of SEQ ID NO:2; SEQ ID NO:3; amino acids 43-49 of SEQ ID
NO:3;
SEQ ID NO:84; and amino acids 476 to 545 of SEQ ID NO:84.
ID NO:2;
amino acids 47-1657 of SEQ ID NO:2; SEQ ID NO:3; amino acids 43-49 of SEQ ID
NO:3;
SEQ ID NO:84; and amino acids 476 to 545 of SEQ ID NO:84.
32. A peptide comprising at least 5 contiguous amino acids from amino acids 43-49 of SEQ ID NO:3 or amino acids 476 to 545 of SEQ ID NO:84.
33. An antibody which binds a peptide according to any one of claims 30-32.
34. The antibody of claim 33 prepared by using a peptide according to any one of claims 30-32 as an antigen.
35. A non-human mammal comprising a mutated gene, wherein the gene but for the mutation would encode a protein having at least 75% sequence identity to all of SEQ ID
NO:2 or SEQ ID NO:5.
NO:2 or SEQ ID NO:5.
36. The mammal of claim 35, wherein the protein has at least 85% sequence identity to all of SEQ ID NO:1 or SEQ ID NO:2.
37. The mammal of claim 35 or 36, wherein the mutated gene does not express a protein having biological activity.
38. The mammal of claim 35, 36, or 37, wherein the mutated gene is incapable of expression of a protein.
39. The mammal of any one of claims 35-38, wherein the mammal is a rodent.
40. The mammal of claim 39, wherein the rodent is a mouse.
41. A method for the diagnosis of amyotrophic lateral sclerosis type 2 in a patient, comprising detecting the presence of a mutation in a gene that encodes a protein having at least 75% sequence identity to SEQ ID NO:2.
42. The method of claim 41, wherein the protein has at least about 90%
sequence identity to SEQ ID NO:2.
sequence identity to SEQ ID NO:2.
43. The method of claim 41, wherein the protein has at least about 95%
sequence identity to SEQ ID NO:2.
sequence identity to SEQ ID NO:2.
44. The method of claim 41, wherein the protein has at least about 97%
sequence identity to SEQ ID NO:2.
sequence identity to SEQ ID NO:2.
45. The method of claim 41, wherein the protein has essentially the sequence of SEQ ID
NO:2 but for the presence of the mutation.
NO:2 but for the presence of the mutation.
46. The method of any one of claims 41-45, comprising detecting the presence of the mutation in a biological sample from the patient.
47. The method of any one of claims 41-46, wherein the detecting comprises comparing a sequence of the gene, a RNA transcript of the gene, and a cDNA made from the RNA
transcript, or a protein expressed by the gene from a human patient, to SEQ ID
NO:1, wherein a difference in sequence is indicative of mutation.
transcript, or a protein expressed by the gene from a human patient, to SEQ ID
NO:1, wherein a difference in sequence is indicative of mutation.
48. The method of claim 46, comprising contacting nucleic acids obtained from the biological sample or cDNA made from said nucleic acids, with one or more oligonucleotides according to any one of claims 22 to 28.
49. The method of claim 46, comprising detecting whether the one or more oligonucleotides hybridize to said nucleic acids or cDNA, under stringent conditions.
50. The method of claim 46, comprising amplification of nucleic acids or cDNA
to which two or more of said oligonucleotides hybridize, and determining the presence of an amplified product.
to which two or more of said oligonucleotides hybridize, and determining the presence of an amplified product.
51. A method for the diagnosis of amyotrophic lateral sclerosis type 2, comprising detecting the presence or absence of a protein having at least 85% sequence identity to all of SEQ ID NO:2 in a patient.
52. A method for the diagnosis of amyotrophic lateral sclerosis type 2, comprising detecting the presence or absence of a protein having at least 95% sequence identity to all of SEQ ID NO:2 in a patient.
53. The method of claim 51, wherein the detecting comprises determining whether a protein having at least 85% sequence identity to all of SEQ ID NO:2 is present in a biological sample from the patient.
54. The method of claim 52, wherein the detecting comprises determining whether a protein having at least 95% sequence identity to all of SEQ ID NO:2 is present in a biological sample from the patient.
55. A method for the diagnosis of amyotrophic lateral sclerosis type 2, comprising detecting the presence or absence of a protein having at least 85% sequence identity to all of SEQ ID NO:3 or SEQ ID NO:84 in a biological sample from the patient.
56. A method for the diagnosis of amyotrophic lateral sclerosis type 2, comprising detecting the presence or absence of a protein having at least 95% sequence identity to all of SEQ ID NO:3 or SEQ ID NO:84 in a biological sample from the patient.
57. The method of any one of claims 51-56, comprising contacting an antibody according to any one of claims 33 or 34 with a biological sample from the patient and determining whether the antibody binds to protein in the sample.
58. A method of treatment of amyotrophic lateral sclerosis type 2, comprising administering a peptide, a nucleic acid, or a pharmaceutical composition comprising the peptide or nucleic acid to a patient in need thereof, wherein the peptide comprises an amino acid sequence having at least 90% identity to SEQ ID NO:2 or a fragment thereof, and the nucleic acid codes for said peptide.
59. A method of treatment of amyotrophic lateral sclerosis type 2, comprising administering a peptide, a nucleic acid, or a pharmaceutical composition comprising the peptide or nucleic acid to a patient in need thereof, wherein the peptide comprises an amino acid sequence having at least 95% identity to SEQ ID NO:2 or a fragment thereof, and the nucleic acid codes for said peptide.
60. A method of treatment of amyotrophic lateral sclerosis type 2, comprising administering a composition to a patient in need thereof, wherein the composition mimics the biological activity of the peptide of SEQ ID NO. 2.
61. The use of a peptide or a nucleic acid for preparation of a medicament for treatment of amyotrophic lateral sclerosis type 2, wherein the peptide comprises an amino acid sequence having at least 90% identity to SEQ ID NO:2 or a fragment thereof, and the nucleic acid codes for said peptide.
62. The use of a peptide or a nucleic acid for preparation of a medicament for treatment of amyotrophic lateral sclerosis type 2, wherein the peptide comprises an amino acid sequence having at least 95% identity to SEQ ID NO:2 or a fragment thereof, and the nucleic acid codes for said peptide.
Applications Claiming Priority (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US26772301P | 2001-02-12 | 2001-02-12 | |
US60/267,723 | 2001-02-12 | ||
JP2001116973A JP2002306179A (en) | 2001-04-16 | 2001-04-16 | Als2cr6 gene and method for diagnosing type 2 amyotrophic lateral sclerosis |
JP2001-116973 | 2001-04-16 | ||
US31835201P | 2001-09-12 | 2001-09-12 | |
US60/318,352 | 2001-09-12 | ||
PCT/CA2002/000147 WO2002072822A2 (en) | 2001-02-12 | 2002-02-12 | Als2 gene and amyotrophic lateral sclerosis type 2 |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2437960A1 true CA2437960A1 (en) | 2002-09-19 |
Family
ID=27346540
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002437960A Abandoned CA2437960A1 (en) | 2001-02-12 | 2002-02-12 | Als2 gene and amyotrophic lateral sclerosis type 2 |
Country Status (3)
Country | Link |
---|---|
JP (1) | JP2005525079A (en) |
CA (1) | CA2437960A1 (en) |
WO (1) | WO2002072822A2 (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2016210123A1 (en) * | 2015-06-26 | 2016-12-29 | Genervon Biopharmaceuticals, LLC | Methods of using gm604 in modulating als disease biomarkers leading to prognosis and therapeutic treatment for als disease |
WO2018080573A1 (en) * | 2016-10-28 | 2018-05-03 | Massachusetts Institute Of Technology | Crispr/cas global regulator screening platform |
CN111593031B (en) * | 2020-05-07 | 2022-04-08 | 海南波莲水稻基因科技有限公司 | Rice ALS mutant gene, plant transgenic screening vector pCALSm3 containing gene and application thereof |
-
2002
- 2002-02-12 CA CA002437960A patent/CA2437960A1/en not_active Abandoned
- 2002-02-12 JP JP2002571877A patent/JP2005525079A/en not_active Withdrawn
- 2002-02-12 WO PCT/CA2002/000147 patent/WO2002072822A2/en active Application Filing
Also Published As
Publication number | Publication date |
---|---|
JP2005525079A (en) | 2005-08-25 |
WO2002072822A3 (en) | 2003-04-24 |
WO2002072822A2 (en) | 2002-09-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2554380C (en) | Mecp2e1 gene | |
Paoloni-Giacobino et al. | Cloning of a novel human neural cell adhesion molecule gene (NCAM2) that maps to chromosome region 21q21 and is potentially involved in Down syndrome | |
US20030190639A1 (en) | Genes involved in intestinal inflamatory diseases and use thereof | |
JP3449419B2 (en) | Long QT syndrome gene encoding KVLQT1 and its association with minK | |
JP2004508835A (en) | Human osteoporosis gene | |
KR100781481B1 (en) | Schizophrenia related gene and protein | |
CA2415127A1 (en) | Down syndrome critical region 1-like 1 proteins | |
US9618521B2 (en) | Sperm-specific cation channel, Catsper2, and uses therefor | |
US8273548B2 (en) | Nucleic acids encoding a human PAP1 polypeptide | |
US7339029B2 (en) | Sperm-specific cation channel, CatSper2, and uses therefor | |
CA2437960A1 (en) | Als2 gene and amyotrophic lateral sclerosis type 2 | |
US20040137450A1 (en) | Als2 gene and amyotrophic lateral sclerosis type 2 | |
US6310182B1 (en) | Compositions for the diagnosis and treatment of Chediak-Higashi syndrome | |
US5837535A (en) | Neuronal-neonatal gene: neuronatin | |
US20040086913A1 (en) | Human genes and gene expression products XVI | |
AU767263B2 (en) | Nucleic acids encoding human CIDE-B protein and polymorphic markers thereof | |
US20020102551A1 (en) | Nope polypeptides, encoding nucleic acids and methods of use | |
WO2009127211A1 (en) | Methods and kits for determining spinal dysmyelination | |
US6670465B1 (en) | Retinal calcium channel (alpha)1F-subunit gene | |
US20040242468A1 (en) | Gene involved in mineral deposition and uses thereof | |
WO2003000111A2 (en) | Human ugrp (uteroglobin-related protein) 1 promoter and its use | |
WO1999029860A1 (en) | Cadherin-like polypeptides, methods and compositions related thereto | |
US20030215833A1 (en) | Variant cleavage stimulation factor and its encoding nucleic acid | |
JPH09238685A (en) | Cerebral ischemia-related gene | |
JP2001128686A (en) | New protein and gene encoding the same |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FZDE | Dead |