KR20140033619A - Fusion protein comprising Zfyve9 and composition for cancer diagnosis comprising the same - Google Patents
- Publication number
- KR20140033619A (application KR1020120099613A)
- Authority
- KR
- South Korea
- Prior art keywords
- protein
- fragment
- fusion
- seq
- exon
Concepts (ID, name, type, query match, sections, count)
- 108020001507 fusion proteins Proteins 0.000 title claims abstract description 242
- 102000037865 fusion proteins Human genes 0.000 title claims abstract description 241
- 206010028980 Neoplasm Diseases 0.000 title claims abstract description 104
- 201000011510 cancer Diseases 0.000 title claims abstract description 84
- 239000000203 mixture Substances 0.000 title claims abstract description 23
- 101000964562 Homo sapiens Zinc finger FYVE domain-containing protein 9 Proteins 0.000 claims abstract description 43
- 102100040801 Zinc finger FYVE domain-containing protein 9 Human genes 0.000 claims abstract description 43
- 238000000034 method Methods 0.000 claims abstract description 43
- 238000011282 treatment Methods 0.000 claims abstract description 15
- 239000003112 inhibitor Substances 0.000 claims abstract description 14
- 238000012216 screening Methods 0.000 claims abstract description 11
- 238000003745 diagnosis Methods 0.000 claims abstract description 10
- 230000002265 prevention Effects 0.000 claims abstract 2
- 239000012634 fragment Substances 0.000 claims description 463
- 108090000623 proteins and genes Proteins 0.000 claims description 412
- 230000004927 fusion Effects 0.000 claims description 257
- 239000002773 nucleotide Substances 0.000 claims description 180
- 125000003729 nucleotide group Chemical group 0.000 claims description 180
- 102000004169 proteins and genes Human genes 0.000 claims description 161
- 108091033319 polynucleotide Proteins 0.000 claims description 44
- 102000040430 polynucleotide Human genes 0.000 claims description 44
- 239000002157 polynucleotide Substances 0.000 claims description 44
- 102100031048 Coiled-coil domain-containing protein 6 Human genes 0.000 claims description 37
- 101000777370 Homo sapiens Coiled-coil domain-containing protein 6 Proteins 0.000 claims description 37
- 230000014509 gene expression Effects 0.000 claims description 36
- 102100040038 Amyloid beta precursor like protein 2 Human genes 0.000 claims description 35
- 102100030708 GTPase KRas Human genes 0.000 claims description 35
- 101000584612 Homo sapiens GTPase KRas Proteins 0.000 claims description 35
- 101000890401 Homo sapiens Amyloid beta precursor like protein 2 Proteins 0.000 claims description 34
- 101000830603 Homo sapiens Tumor necrosis factor ligand superfamily member 11 Proteins 0.000 claims description 34
- 102100040362 Tumor protein D53 Human genes 0.000 claims description 34
- 102100037236 Tyrosine-protein kinase receptor UFO Human genes 0.000 claims description 34
- 102100026434 BCAS3 microtubule associated cell migration factor Human genes 0.000 claims description 33
- 101000766273 Homo sapiens BCAS3 microtubule associated cell migration factor Proteins 0.000 claims description 33
- 102100029987 Erbin Human genes 0.000 claims description 31
- 102100023728 MAP3K12-binding inhibitory protein 1 Human genes 0.000 claims description 31
- 102100033059 Mitogen-activated protein kinase kinase kinase 3 Human genes 0.000 claims description 31
- 101001018298 Homo sapiens Microtubule-associated serine/threonine-protein kinase 4 Proteins 0.000 claims description 30
- 102100033252 Microtubule-associated serine/threonine-protein kinase 4 Human genes 0.000 claims description 30
- 102100024568 Tumor necrosis factor ligand superfamily member 11 Human genes 0.000 claims description 30
- 102100024154 Cadherin-13 Human genes 0.000 claims description 29
- 101000766249 Homo sapiens tRNA (guanine(10)-N2)-methyltransferase homolog Proteins 0.000 claims description 29
- 102100026307 tRNA (guanine(10)-N2)-methyltransferase homolog Human genes 0.000 claims description 29
- 101000762243 Homo sapiens Cadherin-13 Proteins 0.000 claims description 28
- 102100026441 Adhesion G-protein coupled receptor D1 Human genes 0.000 claims description 27
- 208000020816 lung neoplasm Diseases 0.000 claims description 24
- 206010058467 Lung neoplasm malignant Diseases 0.000 claims description 21
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 21
- 201000005202 lung cancer Diseases 0.000 claims description 21
- 239000000126 substance Substances 0.000 claims description 19
- 208000002154 non-small cell lung carcinoma Diseases 0.000 claims description 15
- 208000029729 tumor suppressor gene on chromosome 11 Diseases 0.000 claims description 15
- 239000007787 solid Substances 0.000 claims description 14
- 239000012472 biological sample Substances 0.000 claims description 13
- 101000716750 Homo sapiens Protein SCAF11 Proteins 0.000 claims description 12
- 102100020876 Protein SCAF11 Human genes 0.000 claims description 12
- 101150038994 PDGFRA gene Proteins 0.000 claims description 10
- 239000002246 antineoplastic agent Substances 0.000 claims description 10
- 108020004414 DNA Proteins 0.000 claims description 9
- 102000053602 DNA Human genes 0.000 claims description 9
- 108020004999 messenger RNA Proteins 0.000 claims description 9
- 108091023037 Aptamer Proteins 0.000 claims description 8
- 230000001093 anti-cancer Effects 0.000 claims description 4
- 230000007423 decrease Effects 0.000 claims description 4
- 108020004459 Small interfering RNA Proteins 0.000 claims description 3
- 230000000295 complement effect Effects 0.000 claims description 3
- 230000019491 signal transduction Effects 0.000 claims description 3
- 108091027967 Small hairpin RNA Proteins 0.000 claims description 2
- 102000002287 alpha Subunit Glycoprotein Hormones Human genes 0.000 claims description 2
- 108010000732 alpha Subunit Glycoprotein Hormones Proteins 0.000 claims description 2
- 229940043355 kinase inhibitor Drugs 0.000 claims description 2
- 239000003757 phosphotransferase inhibitor Substances 0.000 claims description 2
- 239000004055 small Interfering RNA Substances 0.000 claims description 2
- 108091008794 FGF receptors Proteins 0.000 claims 3
- 101000718219 Homo sapiens Adhesion G-protein coupled receptor D1 Proteins 0.000 claims 3
- 101001010810 Homo sapiens Erbin Proteins 0.000 claims 3
- 101000978544 Homo sapiens MAP3K12-binding inhibitory protein 1 Proteins 0.000 claims 3
- 101001018145 Homo sapiens Mitogen-activated protein kinase kinase kinase 3 Proteins 0.000 claims 3
- 101001059989 Homo sapiens Mitogen-activated protein kinase kinase kinase kinase 3 Proteins 0.000 claims 3
- 101001026852 Homo sapiens Protein kinase C epsilon type Proteins 0.000 claims 3
- 101000686031 Homo sapiens Proto-oncogene tyrosine-protein kinase ROS Proteins 0.000 claims 3
- 101000844686 Homo sapiens Thioredoxin reductase 1, cytoplasmic Proteins 0.000 claims 3
- 101000610794 Homo sapiens Tumor protein D53 Proteins 0.000 claims 3
- 102100028193 Mitogen-activated protein kinase kinase kinase kinase 3 Human genes 0.000 claims 3
- 102100037339 Protein kinase C epsilon type Human genes 0.000 claims 3
- 102100023347 Proto-oncogene tyrosine-protein kinase ROS Human genes 0.000 claims 3
- 102100031208 Thioredoxin reductase 1, cytoplasmic Human genes 0.000 claims 3
- 125000003275 alpha amino acid group Chemical group 0.000 claims 3
- 102000052178 fibroblast growth factor receptor activity proteins Human genes 0.000 claims 3
- 239000003550 marker Substances 0.000 abstract description 5
- 239000003795 chemical substances by application Substances 0.000 abstract description 2
- 150000001413 amino acids Chemical group 0.000 description 117
- 210000004027 cell Anatomy 0.000 description 57
- 238000003776 cleavage reaction Methods 0.000 description 57
- 230000007017 scission Effects 0.000 description 57
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 39
- 102100023600 Fibroblast growth factor receptor 2 Human genes 0.000 description 38
- 101710182389 Fibroblast growth factor receptor 2 Proteins 0.000 description 38
- 210000000349 chromosome Anatomy 0.000 description 33
- 102100023275 Dual specificity mitogen-activated protein kinase kinase 3 Human genes 0.000 description 32
- 101700035123 Erbin Proteins 0.000 description 32
- 108010068355 MAP Kinase Kinase 3 Proteins 0.000 description 32
- 101710190245 Tumor protein D53 Proteins 0.000 description 31
- 108010023337 axl receptor tyrosine kinase Proteins 0.000 description 31
- 230000003426 interchromosomal effect Effects 0.000 description 29
- 101710096319 Adhesion G-protein coupled receptor D1 Proteins 0.000 description 28
- 108010075645 MAP Kinase Kinase Kinase 3 Proteins 0.000 description 28
- 101710132748 MAP3K12-binding inhibitory protein 1 Proteins 0.000 description 28
- 102000015840 Protein kinase C, epsilon Human genes 0.000 description 28
- 108050004067 Protein kinase C, epsilon Proteins 0.000 description 28
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 28
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 27
- 239000002299 complementary DNA Substances 0.000 description 27
- 108010050848 glycylleucine Proteins 0.000 description 26
- 210000001519 tissue Anatomy 0.000 description 26
- 108010034529 leucyl-lysine Proteins 0.000 description 24
- 208000035970 Chromosome Breakpoints Diseases 0.000 description 23
- 210000004899 c-terminal region Anatomy 0.000 description 23
- 108090000765 processed proteins & peptides Proteins 0.000 description 23
- 210000003917 human chromosome Anatomy 0.000 description 21
- 208000010507 Adenocarcinoma of Lung Diseases 0.000 description 19
- 102000004196 processed proteins & peptides Human genes 0.000 description 19
- 201000005249 lung adenocarcinoma Diseases 0.000 description 18
- 229920001184 polypeptide Polymers 0.000 description 18
- 108010031719 prolyl-serine Proteins 0.000 description 16
- 108010092854 aspartyllysine Proteins 0.000 description 15
- 108010057821 leucylproline Proteins 0.000 description 15
- 241000880493 Leptailurus serval Species 0.000 description 14
- 108010013835 arginine glutamate Proteins 0.000 description 14
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 14
- 230000035772 mutation Effects 0.000 description 14
- 108010026333 seryl-proline Proteins 0.000 description 14
- 108010073969 valyllysine Proteins 0.000 description 14
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 12
- 108010005233 alanylglutamic acid Proteins 0.000 description 12
- 108010062796 arginyllysine Proteins 0.000 description 12
- 108010049041 glutamylalanine Proteins 0.000 description 12
- 108010009298 lysylglutamic acid Proteins 0.000 description 12
- 108010070643 prolylglutamic acid Proteins 0.000 description 11
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 10
- 108020004705 Codon Proteins 0.000 description 10
- 102100022404 E3 ubiquitin-protein ligase Midline-1 Human genes 0.000 description 10
- 101000680670 Homo sapiens E3 ubiquitin-protein ligase Midline-1 Proteins 0.000 description 10
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 10
- 108091000080 Phosphotransferase Proteins 0.000 description 10
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 10
- 238000003018 immunoassay Methods 0.000 description 10
- 108010017391 lysylvaline Proteins 0.000 description 10
- 102000020233 phosphotransferase Human genes 0.000 description 10
- 102100033793 ALK tyrosine kinase receptor Human genes 0.000 description 9
- 101001050559 Homo sapiens Kinesin-1 heavy chain Proteins 0.000 description 9
- 102100023422 Kinesin-1 heavy chain Human genes 0.000 description 9
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 9
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 9
- 238000001514 detection method Methods 0.000 description 9
- 102000052116 epidermal growth factor receptor activity proteins Human genes 0.000 description 9
- 108700015053 epidermal growth factor receptor activity proteins Proteins 0.000 description 9
- 108010025306 histidylleucine Proteins 0.000 description 9
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 9
- YOHYSYJDKVYCJI-UHFFFAOYSA-N n-[3-[[6-[3-(trifluoromethyl)anilino]pyrimidin-4-yl]amino]phenyl]cyclopropanecarboxamide Chemical compound FC(F)(F)C1=CC=CC(NC=2N=CN=C(NC=3C=C(NC(=O)C4CC4)C=CC=3)C=2)=C1 YOHYSYJDKVYCJI-UHFFFAOYSA-N 0.000 description 9
- 238000012163 sequencing technique Methods 0.000 description 9
- 108010048818 seryl-histidine Proteins 0.000 description 9
- 108010087924 alanylproline Proteins 0.000 description 8
- 238000003556 assay Methods 0.000 description 8
- 108010089804 glycyl-threonine Proteins 0.000 description 8
- 108010003700 lysyl aspartic acid Proteins 0.000 description 8
- 108010064235 lysylglycine Proteins 0.000 description 8
- 108010054155 lysyllysine Proteins 0.000 description 8
- 108010056582 methionylglutamic acid Proteins 0.000 description 8
- 210000004897 n-terminal region Anatomy 0.000 description 8
- 108010051242 phenylalanylserine Proteins 0.000 description 8
- 108010077112 prolyl-proline Proteins 0.000 description 8
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 7
- 108010044191 Dynamin II Proteins 0.000 description 7
- 102100021238 Dynamin-2 Human genes 0.000 description 7
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 7
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 7
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 7
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 7
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 7
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 7
- 101001126417 Homo sapiens Platelet-derived growth factor receptor alpha Proteins 0.000 description 7
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 7
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 7
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 7
- 102100030485 Platelet-derived growth factor receptor alpha Human genes 0.000 description 7
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 7
- 108010044940 alanylglutamine Proteins 0.000 description 7
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 7
- 108010060035 arginylproline Proteins 0.000 description 7
- 108010047857 aspartylglycine Proteins 0.000 description 7
- 108010068265 aspartyltyrosine Proteins 0.000 description 7
- 101150088071 fgfr2 gene Proteins 0.000 description 7
- 238000000684 flow cytometry Methods 0.000 description 7
- 108010079547 glutamylmethionine Proteins 0.000 description 7
- 108010015792 glycyllysine Proteins 0.000 description 7
- 108010018625 phenylalanylarginine Proteins 0.000 description 7
- 238000012360 testing method Methods 0.000 description 7
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 7
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 6
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 6
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 6
- 102100027100 Echinoderm microtubule-associated protein-like 4 Human genes 0.000 description 6
- 102000004190 Enzymes Human genes 0.000 description 6
- 108090000790 Enzymes Proteins 0.000 description 6
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 6
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 6
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 6
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 6
- 101100219901 Homo sapiens CCDC6 gene Proteins 0.000 description 6
- 101001057929 Homo sapiens Echinoderm microtubule-associated protein-like 4 Proteins 0.000 description 6
- 101000633314 Homo sapiens Nicotinamide riboside kinase 2 Proteins 0.000 description 6
- 101001086862 Homo sapiens Pulmonary surfactant-associated protein B Proteins 0.000 description 6
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 6
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 6
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 6
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 6
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 6
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 6
- 102100029560 Nicotinamide riboside kinase 2 Human genes 0.000 description 6
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 6
- 102100032617 Pulmonary surfactant-associated protein B Human genes 0.000 description 6
- 101150035397 Ros1 gene Proteins 0.000 description 6
- VGYVVSQFSSKZRJ-OEAJRASXSA-N Thr-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=CC=C1 VGYVVSQFSSKZRJ-OEAJRASXSA-N 0.000 description 6
- 108010041407 alanylaspartic acid Proteins 0.000 description 6
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 6
- 108010068380 arginylarginine Proteins 0.000 description 6
- 108010038633 aspartylglutamate Proteins 0.000 description 6
- 101150027332 cit gene Proteins 0.000 description 6
- 108010060199 cysteinylproline Proteins 0.000 description 6
- 239000003814 drug Substances 0.000 description 6
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 6
- 238000010166 immunofluorescence Methods 0.000 description 6
- 238000003364 immunohistochemistry Methods 0.000 description 6
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 6
- 108010012581 phenylalanylglutamate Proteins 0.000 description 6
- 238000003127 radioimmunoassay Methods 0.000 description 6
- 238000007480 sanger sequencing Methods 0.000 description 6
- 230000004083 survival effect Effects 0.000 description 6
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 5
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 5
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 5
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 5
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 5
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 5
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 5
- 101150018445 Axl gene Proteins 0.000 description 5
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 5
- JCOSMKPAOYDKRO-AVGNSLFASA-N His-Glu-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N JCOSMKPAOYDKRO-AVGNSLFASA-N 0.000 description 5
- 101000887201 Homo sapiens Polyamine-transporting ATPase 13A2 Proteins 0.000 description 5
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 5
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 5
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 5
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 5
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 5
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 5
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 5
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 5
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 5
- ZCWWVXAXWUAEPZ-SRVKXCTJSA-N Lys-Met-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZCWWVXAXWUAEPZ-SRVKXCTJSA-N 0.000 description 5
- 101150072825 Mbip gene Proteins 0.000 description 5
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 5
- 108010079364 N-glycylalanine Proteins 0.000 description 5
- 102100039917 Polyamine-transporting ATPase 13A2 Human genes 0.000 description 5
- 238000003559 RNA-seq method Methods 0.000 description 5
- 101150072799 SCAF11 gene Proteins 0.000 description 5
- 108091006576 SLC34A2 Proteins 0.000 description 5
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 5
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 5
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 5
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 5
- 102100038437 Sodium-dependent phosphate transport protein 2B Human genes 0.000 description 5
- YHRCLOURJWJABF-WDSOQIARSA-N Trp-His-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N YHRCLOURJWJABF-WDSOQIARSA-N 0.000 description 5
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 5
- PYPZMFDMCCWNST-NAKRPEOUSA-N Val-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N PYPZMFDMCCWNST-NAKRPEOUSA-N 0.000 description 5
- IEBGHUMBJXIXHM-AVGNSLFASA-N Val-Lys-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N IEBGHUMBJXIXHM-AVGNSLFASA-N 0.000 description 5
- 108010081404 acein-2 Proteins 0.000 description 5
- 108010070944 alanylhistidine Proteins 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 108010008355 arginyl-glutamine Proteins 0.000 description 5
- 229940079593 drug Drugs 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 230000012010 growth Effects 0.000 description 5
- 108010092114 histidylphenylalanine Proteins 0.000 description 5
- 108010085325 histidylproline Proteins 0.000 description 5
- 239000000463 material Substances 0.000 description 5
- 101150085141 APLP2 gene Proteins 0.000 description 4
- UQJUGHFKNKGHFQ-VZFHVOOUSA-N Ala-Cys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UQJUGHFKNKGHFQ-VZFHVOOUSA-N 0.000 description 4
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 4
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 4
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 4
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 4
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 4
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 4
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 4
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 4
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 4
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 4
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 4
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 4
- QRHYAUYXBVVDSB-LKXGYXEUSA-N Asn-Cys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QRHYAUYXBVVDSB-LKXGYXEUSA-N 0.000 description 4
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 4
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 4
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 4
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 4
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 4
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 4
- GYNUXDMCDILYIQ-QRTARXTBSA-N Asp-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N GYNUXDMCDILYIQ-QRTARXTBSA-N 0.000 description 4
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 4
- 101150051491 CDH13 gene Proteins 0.000 description 4
- XOKGKOQWADCLFQ-GARJFASQSA-N Gln-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XOKGKOQWADCLFQ-GARJFASQSA-N 0.000 description 4
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 4
- IWUFOVSLWADEJC-AVGNSLFASA-N Gln-His-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IWUFOVSLWADEJC-AVGNSLFASA-N 0.000 description 4
- MTCXQQINVAFZKW-MNXVOIDGSA-N Gln-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MTCXQQINVAFZKW-MNXVOIDGSA-N 0.000 description 4
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 4
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 4
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 4
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 4
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 4
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 4
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 4
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 4
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 4
- 102100030595 HLA class II histocompatibility antigen gamma chain Human genes 0.000 description 4
- JBJNKUOMNZGQIM-PYJNHQTQSA-N His-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JBJNKUOMNZGQIM-PYJNHQTQSA-N 0.000 description 4
- 101100218527 Homo sapiens BCAS3 gene Proteins 0.000 description 4
- 101001082627 Homo sapiens HLA class II histocompatibility antigen gamma chain Proteins 0.000 description 4
- 101001003102 Homo sapiens Hypoxia up-regulated protein 1 Proteins 0.000 description 4
- 101000599056 Homo sapiens Interleukin-6 receptor subunit beta Proteins 0.000 description 4
- 101100236409 Homo sapiens MAP3K3 gene Proteins 0.000 description 4
- 101100400483 Homo sapiens MAST4 gene Proteins 0.000 description 4
- 101001074602 Homo sapiens Protein PIMREG Proteins 0.000 description 4
- 101000651017 Homo sapiens Pulmonary surfactant-associated protein A2 Proteins 0.000 description 4
- 101000814514 Homo sapiens XIAP-associated factor 1 Proteins 0.000 description 4
- 102100020755 Hypoxia up-regulated protein 1 Human genes 0.000 description 4
- 101150018316 Igsf3 gene Proteins 0.000 description 4
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 4
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 4
- 102100022519 Immunoglobulin superfamily member 3 Human genes 0.000 description 4
- 102100037795 Interleukin-6 receptor subunit beta Human genes 0.000 description 4
- 101150105104 Kras gene Proteins 0.000 description 4
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 4
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 4
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 4
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 4
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 4
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 4
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 4
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 4
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 4
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 4
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 4
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 4
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 4
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 4
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 description 4
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 4
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 4
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 4
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 4
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 4
- 241000124008 Mammalia Species 0.000 description 4
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 4
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 4
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 4
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 4
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 4
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 4
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 4
- 102100036258 Protein PIMREG Human genes 0.000 description 4
- 102100027773 Pulmonary surfactant-associated protein A2 Human genes 0.000 description 4
- 101100495925 Schizosaccharomyces pombe (strain 972 / ATCC 24843) chr3 gene Proteins 0.000 description 4
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 4
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 4
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 4
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 4
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 4
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 4
- AABIBDJHSKIMJK-FXQIFTODSA-N Ser-Ser-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O AABIBDJHSKIMJK-FXQIFTODSA-N 0.000 description 4
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 4
- 101150074234 TXNRD1 gene Proteins 0.000 description 4
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 4
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 4
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 4
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 4
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 4
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 4
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 4
- 101150033619 Tpd52l1 gene Proteins 0.000 description 4
- OFTGYORHQMSPAI-PJODQICGSA-N Trp-Met-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O OFTGYORHQMSPAI-PJODQICGSA-N 0.000 description 4
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 4
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 4
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 4
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 4
- 102100039488 XIAP-associated factor 1 Human genes 0.000 description 4
- 101150063269 ZFYVE9 gene Proteins 0.000 description 4
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 150000001875 compounds Chemical class 0.000 description 4
- 108010016616 cysteinylglycine Proteins 0.000 description 4
- 239000012091 fetal bovine serum Substances 0.000 description 4
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 4
- 238000002509 fluorescent in situ hybridization Methods 0.000 description 4
- 230000002068 genetic effect Effects 0.000 description 4
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 4
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 4
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 4
- 108010018006 histidylserine Proteins 0.000 description 4
- 108010012058 leucyltyrosine Proteins 0.000 description 4
- 108010038320 lysylphenylalanine Proteins 0.000 description 4
- 108010085203 methionylmethionine Proteins 0.000 description 4
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 4
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 4
- 108010079317 prolyl-tyrosine Proteins 0.000 description 4
- 108010004914 prolylarginine Proteins 0.000 description 4
- 108010005652 splenotritin Proteins 0.000 description 4
- 101150110709 trmt11 gene Proteins 0.000 description 4
- PQFMROVJTOPVDF-JBDRJPRFSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-carboxypropanoyl]amino]-3-carboxypropanoyl]amino]-4-carboxybutanoyl]amino]butanedioic acid Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PQFMROVJTOPVDF-JBDRJPRFSA-N 0.000 description 3
- 102100027520 ATP synthase mitochondrial F1 complex assembly factor 2 Human genes 0.000 description 3
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 3
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 3
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 3
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 3
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 3
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 3
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 3
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 3
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 3
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 3
- MCYJBCKCAPERSE-FXQIFTODSA-N Arg-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N MCYJBCKCAPERSE-FXQIFTODSA-N 0.000 description 3
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 3
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 3
- RCAUJZASOAFTAJ-FXQIFTODSA-N Arg-Asp-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N RCAUJZASOAFTAJ-FXQIFTODSA-N 0.000 description 3
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 3
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 3
- IGULQRCJLQQPSM-DCAQKATOSA-N Arg-Cys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IGULQRCJLQQPSM-DCAQKATOSA-N 0.000 description 3
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 3
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 3
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 3
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 3
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 3
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 3
- SYAUZLVLXCDRSH-IUCAKERBSA-N Arg-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N SYAUZLVLXCDRSH-IUCAKERBSA-N 0.000 description 3
- ZJEDSBGPBXVBMP-PYJNHQTQSA-N Arg-His-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZJEDSBGPBXVBMP-PYJNHQTQSA-N 0.000 description 3
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 3
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 3
- OGSQONVYSTZIJB-WDSOQIARSA-N Arg-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OGSQONVYSTZIJB-WDSOQIARSA-N 0.000 description 3
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 3
- AFNHFVVOJZBIJD-GUBZILKMSA-N Arg-Met-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O AFNHFVVOJZBIJD-GUBZILKMSA-N 0.000 description 3
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 3
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 3
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 3
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 3
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 3
- SUEIIIFUBHDCCS-PBCZWWQYSA-N Asn-His-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUEIIIFUBHDCCS-PBCZWWQYSA-N 0.000 description 3
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 3
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 3
- NNDSLVWAQAUPPP-GUBZILKMSA-N Asn-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N NNDSLVWAQAUPPP-GUBZILKMSA-N 0.000 description 3
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 3
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 3
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 3
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 3
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 3
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 3
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 3
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 3
- CZECQDPEMSVPDH-MNXVOIDGSA-N Asp-Leu-Val-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CZECQDPEMSVPDH-MNXVOIDGSA-N 0.000 description 3
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 3
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 3
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 3
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 3
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 3
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 3
- 102100029892 Bromodomain and WD repeat-containing protein 1 Human genes 0.000 description 3
- 102100029968 Calreticulin Human genes 0.000 description 3
- 102100038564 Carboxymethylenebutenolidase homolog Human genes 0.000 description 3
- 102100033129 Centrosomal protein of 112 kDa Human genes 0.000 description 3
- 102100030791 Colorectal cancer-associated protein 2 Human genes 0.000 description 3
- OZHXXYOHPLLLMI-CIUDSAMLSA-N Cys-Lys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OZHXXYOHPLLLMI-CIUDSAMLSA-N 0.000 description 3
- CIVXDCMSSFGWAL-YUMQZZPRSA-N Cys-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N CIVXDCMSSFGWAL-YUMQZZPRSA-N 0.000 description 3
- DRXOWZZHCSBUOI-YJRXYDGGSA-N Cys-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N)O DRXOWZZHCSBUOI-YJRXYDGGSA-N 0.000 description 3
- ALTQTAKGRFLRLR-GUBZILKMSA-N Cys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N ALTQTAKGRFLRLR-GUBZILKMSA-N 0.000 description 3
- 108010090461 DFG peptide Proteins 0.000 description 3
- 102100023966 Dynein light chain Tctex-type 4 Human genes 0.000 description 3
- 102100028043 Fibroblast growth factor 3 Human genes 0.000 description 3
- 102100021066 Fibroblast growth factor receptor substrate 2 Human genes 0.000 description 3
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 3
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 3
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 3
- LFIVHGMKWFGUGK-IHRRRGAJSA-N Gln-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LFIVHGMKWFGUGK-IHRRRGAJSA-N 0.000 description 3
- XQEAVUJIRZRLQQ-SZMVWBNQSA-N Gln-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CCC(=O)N)N XQEAVUJIRZRLQQ-SZMVWBNQSA-N 0.000 description 3
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 3
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 3
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 3
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 3
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 3
- KEBACWCLVOXFNC-DCAQKATOSA-N Glu-Arg-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KEBACWCLVOXFNC-DCAQKATOSA-N 0.000 description 3
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 3
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 3
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 3
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 3
- GYCPQVFKCPPRQB-GUBZILKMSA-N Glu-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N GYCPQVFKCPPRQB-GUBZILKMSA-N 0.000 description 3
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 3
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 3
- VGUYMZGLJUJRBV-YVNDNENWSA-N Glu-Ile-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VGUYMZGLJUJRBV-YVNDNENWSA-N 0.000 description 3
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 3
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 3
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 3
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 3
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 3
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 3
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 3
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 3
- QVXWAFZDWRLXTI-NWLDYVSISA-N Glu-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QVXWAFZDWRLXTI-NWLDYVSISA-N 0.000 description 3
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 3
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 3
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 3
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 3
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 3
- SABZDFAAOJATBR-QWRGUYRKSA-N Gly-Cys-Phe Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SABZDFAAOJATBR-QWRGUYRKSA-N 0.000 description 3
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 3
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 3
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 3
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 3
- SJLKKOZFHSJJAW-YUMQZZPRSA-N Gly-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN SJLKKOZFHSJJAW-YUMQZZPRSA-N 0.000 description 3
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 3
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 3
- HXKZJLWGSWQKEA-LSJOCFKGSA-N His-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 HXKZJLWGSWQKEA-LSJOCFKGSA-N 0.000 description 3
- SVHKVHBPTOMLTO-DCAQKATOSA-N His-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SVHKVHBPTOMLTO-DCAQKATOSA-N 0.000 description 3
- DVHGLDYMGWTYKW-GUBZILKMSA-N His-Gln-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DVHGLDYMGWTYKW-GUBZILKMSA-N 0.000 description 3
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 3
- XIGFLVCAVQQGNS-IHRRRGAJSA-N His-Pro-His Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 XIGFLVCAVQQGNS-IHRRRGAJSA-N 0.000 description 3
- 101000936108 Homo sapiens ATP synthase mitochondrial F1 complex assembly factor 2 Proteins 0.000 description 3
- 101000794040 Homo sapiens Bromodomain and WD repeat-containing protein 1 Proteins 0.000 description 3
- 101000793651 Homo sapiens Calreticulin Proteins 0.000 description 3
- 101000882691 Homo sapiens Carboxymethylenebutenolidase homolog Proteins 0.000 description 3
- 101000944325 Homo sapiens Centrosomal protein of 112 kDa Proteins 0.000 description 3
- 101000920097 Homo sapiens Colorectal cancer-associated protein 2 Proteins 0.000 description 3
- 101000904008 Homo sapiens Dynein light chain Tctex-type 4 Proteins 0.000 description 3
- 101000807547 Homo sapiens E3 ubiquitin-protein ligase UBR4 Proteins 0.000 description 3
- 101001060280 Homo sapiens Fibroblast growth factor 3 Proteins 0.000 description 3
- 101000818410 Homo sapiens Fibroblast growth factor receptor substrate 2 Proteins 0.000 description 3
- 101000615932 Homo sapiens Mannosyl-oligosaccharide 1,2-alpha-mannosidase IB Proteins 0.000 description 3
- 101001013017 Homo sapiens Mesoderm induction early response protein 2 Proteins 0.000 description 3
- 101001030625 Homo sapiens Mucin-like protein 1 Proteins 0.000 description 3
- 101000588491 Homo sapiens NADH dehydrogenase (ubiquinone) complex I, assembly factor 6 Proteins 0.000 description 3
- 101000579580 Homo sapiens Protein LSM14 homolog A Proteins 0.000 description 3
- 101001062098 Homo sapiens RNA-binding protein 14 Proteins 0.000 description 3
- 101000731726 Homo sapiens Rho guanine nucleotide exchange factor 16 Proteins 0.000 description 3
- 101000587436 Homo sapiens Serine/arginine-rich splicing factor 4 Proteins 0.000 description 3
- 101000846996 Homo sapiens Tetratricopeptide repeat protein 19, mitochondrial Proteins 0.000 description 3
- 101000708392 Homo sapiens U5 small nuclear ribonucleoprotein 40 kDa protein Proteins 0.000 description 3
- 101000644655 Homo sapiens Ubiquitin-conjugating enzyme E2 E1 Proteins 0.000 description 3
- CISBRYJZMFWOHJ-JBDRJPRFSA-N Ile-Ala-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N CISBRYJZMFWOHJ-JBDRJPRFSA-N 0.000 description 3
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 3
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 3
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 3
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 3
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 3
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 3
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 3
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 3
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 3
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 3
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 3
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 3
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 3
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 3
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 3
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 3
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 3
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 3
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 3
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 3
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 3
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 3
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 3
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 3
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 3
- GSSMYQHXZNERFX-WDSOQIARSA-N Leu-Met-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N GSSMYQHXZNERFX-WDSOQIARSA-N 0.000 description 3
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 3
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 3
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 3
- SEOXPEFQEOYURL-PMVMPFDFSA-N Leu-Tyr-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SEOXPEFQEOYURL-PMVMPFDFSA-N 0.000 description 3
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 3
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 3
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 3
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 3
- WXJKFRMKJORORD-DCAQKATOSA-N Lys-Arg-Ala Chemical compound NC(=N)NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CCCCN WXJKFRMKJORORD-DCAQKATOSA-N 0.000 description 3
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 3
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 3
- CKSXSQUVEYCDIW-AVGNSLFASA-N Lys-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N CKSXSQUVEYCDIW-AVGNSLFASA-N 0.000 description 3
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 3
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 3
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 3
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 3
- SFQPJNQDUUYCLA-BJDJZHNGSA-N Lys-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N SFQPJNQDUUYCLA-BJDJZHNGSA-N 0.000 description 3
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 3
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 3
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 3
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 3
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 3
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 3
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 3
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 3
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 3
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 3
- 101150011852 MAP4K3 gene Proteins 0.000 description 3
- 102100021767 Mannosyl-oligosaccharide 1,2-alpha-mannosidase IB Human genes 0.000 description 3
- 102100029625 Mesoderm induction early response protein 2 Human genes 0.000 description 3
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 3
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 3
- CHLJXFMOQGYDNH-SZMVWBNQSA-N Met-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 CHLJXFMOQGYDNH-SZMVWBNQSA-N 0.000 description 3
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 description 3
- RAAVFTFEAUAVIY-DCAQKATOSA-N Met-Glu-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N RAAVFTFEAUAVIY-DCAQKATOSA-N 0.000 description 3
- VBGGTAPDGFQMKF-AVGNSLFASA-N Met-Lys-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O VBGGTAPDGFQMKF-AVGNSLFASA-N 0.000 description 3
- LUYURUYVNYGKGM-RCWTZXSCSA-N Met-Pro-Thr Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUYURUYVNYGKGM-RCWTZXSCSA-N 0.000 description 3
- LBSWWNKMVPAXOI-GUBZILKMSA-N Met-Val-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O LBSWWNKMVPAXOI-GUBZILKMSA-N 0.000 description 3
- IQJMEDDVOGMTKT-SRVKXCTJSA-N Met-Val-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IQJMEDDVOGMTKT-SRVKXCTJSA-N 0.000 description 3
- 102100025272 Monocarboxylate transporter 2 Human genes 0.000 description 3
- 102100038565 Mucin-like protein 1 Human genes 0.000 description 3
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 3
- 102100031377 NADH dehydrogenase (ubiquinone) complex I, assembly factor 6 Human genes 0.000 description 3
- 108010047562 NGR peptide Proteins 0.000 description 3
- 101150010978 PRKCE gene Proteins 0.000 description 3
- YMORXCKTSSGYIG-IHRRRGAJSA-N Phe-Arg-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N YMORXCKTSSGYIG-IHRRRGAJSA-N 0.000 description 3
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 3
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 3
- RGZYXNFHYRFNNS-MXAVVETBSA-N Phe-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGZYXNFHYRFNNS-MXAVVETBSA-N 0.000 description 3
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 3
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 3
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 3
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 3
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 3
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 3
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 3
- RPLMFKUKFZOTER-AVGNSLFASA-N Pro-Met-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 RPLMFKUKFZOTER-AVGNSLFASA-N 0.000 description 3
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 3
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 3
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 3
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 3
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 3
- AJJDPGVVNPUZCR-RHYQMDGZSA-N Pro-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1)O AJJDPGVVNPUZCR-RHYQMDGZSA-N 0.000 description 3
- QMABBZHZMDXHKU-FKBYEOEOSA-N Pro-Tyr-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QMABBZHZMDXHKU-FKBYEOEOSA-N 0.000 description 3
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 3
- 102100028259 Protein LSM14 homolog A Human genes 0.000 description 3
- 108090000412 Protein-Tyrosine Kinases Proteins 0.000 description 3
- 102100029250 RNA-binding protein 14 Human genes 0.000 description 3
- 102100032436 Rho guanine nucleotide exchange factor 16 Human genes 0.000 description 3
- 108091006604 SLC16A7 Proteins 0.000 description 3
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 3
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 3
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 3
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 3
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 3
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 3
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 3
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 3
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 3
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 3
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 3
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 3
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 3
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 3
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 3
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 3
- 102100029705 Serine/arginine-rich splicing factor 4 Human genes 0.000 description 3
- 102100031473 Tetratricopeptide repeat protein 19, mitochondrial Human genes 0.000 description 3
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 3
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 3
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 3
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 3
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 3
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 3
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 3
- ZBKDBZUTTXINIX-RWRJDSDZSA-N Thr-Ile-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZBKDBZUTTXINIX-RWRJDSDZSA-N 0.000 description 3
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 3
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 3
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 3
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 3
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 3
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 3
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 3
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 3
- OKAMOYTUQMIFJO-JBACZVJFSA-N Trp-Glu-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 OKAMOYTUQMIFJO-JBACZVJFSA-N 0.000 description 3
- HQJOVVWAPQPYDS-ZFWWWQNUSA-N Trp-Gly-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQJOVVWAPQPYDS-ZFWWWQNUSA-N 0.000 description 3
- ORQGVWIUHICVKE-KCTSRDHCSA-N Trp-His-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O ORQGVWIUHICVKE-KCTSRDHCSA-N 0.000 description 3
- DYIXEGROAOVQPK-VFAJRCTISA-N Trp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DYIXEGROAOVQPK-VFAJRCTISA-N 0.000 description 3
- PEVVXUGSAKEPEN-AVGNSLFASA-N Tyr-Asn-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PEVVXUGSAKEPEN-AVGNSLFASA-N 0.000 description 3
- WEFIPBYPXZYPHD-HJPIBITLSA-N Tyr-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WEFIPBYPXZYPHD-HJPIBITLSA-N 0.000 description 3
- UXUFNBVCPAWACG-SIUGBPQLSA-N Tyr-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N UXUFNBVCPAWACG-SIUGBPQLSA-N 0.000 description 3
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 3
- HHFMNAVFGBYSAT-IGISWZIWSA-N Tyr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HHFMNAVFGBYSAT-IGISWZIWSA-N 0.000 description 3
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 3
- HNERGSKJJZQGEA-JYJNAYRXSA-N Tyr-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HNERGSKJJZQGEA-JYJNAYRXSA-N 0.000 description 3
- WPRVVBVWIUWLOH-UFYCRDLUSA-N Tyr-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N WPRVVBVWIUWLOH-UFYCRDLUSA-N 0.000 description 3
- MDXLPNRXCFOBTL-BZSNNMDCSA-N Tyr-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MDXLPNRXCFOBTL-BZSNNMDCSA-N 0.000 description 3
- VKYDVKAKGDNZED-STECZYCISA-N Tyr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N VKYDVKAKGDNZED-STECZYCISA-N 0.000 description 3
- 102100031471 U5 small nuclear ribonucleoprotein 40 kDa protein Human genes 0.000 description 3
- 102000003442 UBR4 Human genes 0.000 description 3
- 102100020711 Ubiquitin-conjugating enzyme E2 E1 Human genes 0.000 description 3
- 108010064997 VPY tripeptide Proteins 0.000 description 3
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 3
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 3
- IQQYYFPCWKWUHW-YDHLFZDLSA-N Val-Asn-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N IQQYYFPCWKWUHW-YDHLFZDLSA-N 0.000 description 3
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 3
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 3
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 3
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 3
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 3
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 3
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 3
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 3
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 3
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 3
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 3
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 3
- ODUHAIXFXFACDY-SRVKXCTJSA-N Val-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C ODUHAIXFXFACDY-SRVKXCTJSA-N 0.000 description 3
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 3
- 230000004913 activation Effects 0.000 description 3
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 3
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 3
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 3
- 108010021908 aspartyl-aspartyl-glutamyl-aspartic acid Proteins 0.000 description 3
- 101150104145 cga gene Proteins 0.000 description 3
- 108010054813 diprotin B Proteins 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 3
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 3
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 3
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 3
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 3
- 108010008671 glycyl-tryptophyl-methionine Proteins 0.000 description 3
- 108010084760 glycyl-tyrosyl-glycyl-aspartate Proteins 0.000 description 3
- 108010020688 glycylhistidine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 108010009932 leucyl-alanyl-glycyl-valine Proteins 0.000 description 3
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 3
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 3
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 3
- 108010090894 prolylleucine Proteins 0.000 description 3
- 108010029895 rubimetide Proteins 0.000 description 3
- 108010071207 serylmethionine Proteins 0.000 description 3
- 108010036387 trimethionine Proteins 0.000 description 3
- 108700004896 tripeptide FEG Proteins 0.000 description 3
- 108010080629 tryptophan-leucine Proteins 0.000 description 3
- 108010044292 tryptophyltyrosine Proteins 0.000 description 3
- 108010072644 valyl-alanyl-prolyl-glycine Proteins 0.000 description 3
- 108010027345 wheylin-1 peptide Proteins 0.000 description 3
- 238000007482 whole exome sequencing Methods 0.000 description 3
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- QWTLUPDHBKBULE-UHFFFAOYSA-N 2-[[2-[[2-[[2-[[2-[[2-[[2-[[2-[[2-[(2-aminoacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetic acid Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QWTLUPDHBKBULE-UHFFFAOYSA-N 0.000 description 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- 102100021029 Activating signal cointegrator 1 complex subunit 3 Human genes 0.000 description 2
- 229920001817 Agar Polymers 0.000 description 2
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 2
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 2
- WRDANSJTFOHBPI-FXQIFTODSA-N Ala-Arg-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N WRDANSJTFOHBPI-FXQIFTODSA-N 0.000 description 2
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 2
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 2
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 2
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 2
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 2
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 2
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 2
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- ZPXCNXMJEZKRLU-LSJOCFKGSA-N Ala-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 ZPXCNXMJEZKRLU-LSJOCFKGSA-N 0.000 description 2
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 2
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 2
- FUKFQILQFQKHLE-DCAQKATOSA-N Ala-Lys-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O FUKFQILQFQKHLE-DCAQKATOSA-N 0.000 description 2
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 2
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 2
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 2
- KLKARCOHVHLAJP-UWJYBYFXSA-N Ala-Tyr-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CS)C(O)=O KLKARCOHVHLAJP-UWJYBYFXSA-N 0.000 description 2
- 102100037982 Alpha-1,6-mannosylglycoprotein 6-beta-N-acetylglucosaminyltransferase A Human genes 0.000 description 2
- 102100028661 Amine oxidase [flavin-containing] A Human genes 0.000 description 2
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 2
- KGSJCPBERYUXCN-BPNCWPANSA-N Arg-Ala-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KGSJCPBERYUXCN-BPNCWPANSA-N 0.000 description 2
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 2
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 2
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 2
- DQNLFLGFZAUIOW-FXQIFTODSA-N Arg-Cys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DQNLFLGFZAUIOW-FXQIFTODSA-N 0.000 description 2
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 2
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 2
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 2
- DJAIOAKQIOGULM-DCAQKATOSA-N Arg-Glu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O DJAIOAKQIOGULM-DCAQKATOSA-N 0.000 description 2
- HPSVTWMFWCHKFN-GARJFASQSA-N Arg-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O HPSVTWMFWCHKFN-GARJFASQSA-N 0.000 description 2
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 2
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 2
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 2
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 2
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 2
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 2
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 2
- CVXXSWQORBZAAA-SRVKXCTJSA-N Arg-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N CVXXSWQORBZAAA-SRVKXCTJSA-N 0.000 description 2
- JOADBFCFJGNIKF-GUBZILKMSA-N Arg-Met-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O JOADBFCFJGNIKF-GUBZILKMSA-N 0.000 description 2
- GITAWLWBTMJPKH-AVGNSLFASA-N Arg-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GITAWLWBTMJPKH-AVGNSLFASA-N 0.000 description 2
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 2
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 2
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 2
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 2
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 2
- BWMMKQPATDUYKB-IHRRRGAJSA-N Arg-Tyr-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=C(O)C=C1 BWMMKQPATDUYKB-IHRRRGAJSA-N 0.000 description 2
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 2
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 2
- ZPMNECSEJXXNBE-CIUDSAMLSA-N Asn-Cys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZPMNECSEJXXNBE-CIUDSAMLSA-N 0.000 description 2
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 2
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 2
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 2
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 2
- GIQCDTKOIPUDSG-GARJFASQSA-N Asn-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N)C(=O)O GIQCDTKOIPUDSG-GARJFASQSA-N 0.000 description 2
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 2
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 2
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 2
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 2
- RSMIHCFQDCVVBR-CIUDSAMLSA-N Asp-Gln-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RSMIHCFQDCVVBR-CIUDSAMLSA-N 0.000 description 2
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 2
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 2
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 2
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- WNGZKSVJFDZICU-XIRDDKMYSA-N Asp-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N WNGZKSVJFDZICU-XIRDDKMYSA-N 0.000 description 2
- WZUZGDANRQPCDD-SRVKXCTJSA-N Asp-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N WZUZGDANRQPCDD-SRVKXCTJSA-N 0.000 description 2
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 2
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 2
- WTNLLMQAFPOCTJ-GARJFASQSA-N Cys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N)C(=O)O WTNLLMQAFPOCTJ-GARJFASQSA-N 0.000 description 2
- UVZFZTWNHOQWNK-NAKRPEOUSA-N Cys-Ile-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UVZFZTWNHOQWNK-NAKRPEOUSA-N 0.000 description 2
- KJJASVYBTKRYSN-FXQIFTODSA-N Cys-Pro-Asp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC(=O)O)C(=O)O KJJASVYBTKRYSN-FXQIFTODSA-N 0.000 description 2
- MWVDDZUTWXFYHL-XKBZYTNZSA-N Cys-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)O MWVDDZUTWXFYHL-XKBZYTNZSA-N 0.000 description 2
- 102100024426 Dihydropyrimidinase-related protein 2 Human genes 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 2
- JFOKLAPFYCTNHW-SRVKXCTJSA-N Gln-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N JFOKLAPFYCTNHW-SRVKXCTJSA-N 0.000 description 2
- GHYJGDCPHMSFEJ-GUBZILKMSA-N Gln-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N GHYJGDCPHMSFEJ-GUBZILKMSA-N 0.000 description 2
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 2
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 2
- MQJDLNRXBOELJW-KKUMJFAQSA-N Gln-Pro-Phe Chemical compound N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O MQJDLNRXBOELJW-KKUMJFAQSA-N 0.000 description 2
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 2
- UTOQQOMEJDPDMX-ACZMJKKPSA-N Gln-Ser-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O UTOQQOMEJDPDMX-ACZMJKKPSA-N 0.000 description 2
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 2
- DITJVHONFRJKJW-BPUTZDHNSA-N Gln-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DITJVHONFRJKJW-BPUTZDHNSA-N 0.000 description 2
- GJLXZITZLUUXMJ-NHCYSSNCSA-N Gln-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GJLXZITZLUUXMJ-NHCYSSNCSA-N 0.000 description 2
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 2
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 2
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 2
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 2
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 2
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 2
- ZZIFPJZQHRJERU-WDSKDSINSA-N Glu-Cys-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZZIFPJZQHRJERU-WDSKDSINSA-N 0.000 description 2
- KVBPDJIFRQUQFY-ACZMJKKPSA-N Glu-Cys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O KVBPDJIFRQUQFY-ACZMJKKPSA-N 0.000 description 2
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 2
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 2
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 2
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 2
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 2
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 2
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 2
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 2
- QLPYYTDOUQNJGQ-AVGNSLFASA-N Glu-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N QLPYYTDOUQNJGQ-AVGNSLFASA-N 0.000 description 2
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 2
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 2
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 2
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 2
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 2
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 2
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 2
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 2
- HLYCMRDRWGSTPZ-CIUDSAMLSA-N Glu-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CS)C(=O)O HLYCMRDRWGSTPZ-CIUDSAMLSA-N 0.000 description 2
- ARIORLIIMJACKZ-KKUMJFAQSA-N Glu-Pro-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ARIORLIIMJACKZ-KKUMJFAQSA-N 0.000 description 2
- GTFYQOVVVJASOA-ACZMJKKPSA-N Glu-Ser-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N GTFYQOVVVJASOA-ACZMJKKPSA-N 0.000 description 2
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 2
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 2
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 2
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 2
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 2
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 2
- XLFHCWHXKSFVIB-BQBZGAKWSA-N Gly-Gln-Gln Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLFHCWHXKSFVIB-BQBZGAKWSA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 2
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 2
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 2
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 2
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 2
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 2
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 2
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 2
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 2
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 2
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 2
- 102000003886 Glycoproteins Human genes 0.000 description 2
- 108090000288 Glycoproteins Proteins 0.000 description 2
- 108010066705 H-cadherin Proteins 0.000 description 2
- TXLQHACKRLWYCM-DCAQKATOSA-N His-Glu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O TXLQHACKRLWYCM-DCAQKATOSA-N 0.000 description 2
- YXXKBPJEIYFGOD-MGHWNKPDSA-N His-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N YXXKBPJEIYFGOD-MGHWNKPDSA-N 0.000 description 2
- JSQIXEHORHLQEE-MEYUZBJRSA-N His-Phe-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JSQIXEHORHLQEE-MEYUZBJRSA-N 0.000 description 2
- 102100029076 Histamine N-methyltransferase Human genes 0.000 description 2
- 101000784211 Homo sapiens Activating signal cointegrator 1 complex subunit 3 Proteins 0.000 description 2
- 101000951392 Homo sapiens Alpha-1,6-mannosylglycoprotein 6-beta-N-acetylglucosaminyltransferase A Proteins 0.000 description 2
- 101000694718 Homo sapiens Amine oxidase [flavin-containing] A Proteins 0.000 description 2
- 101001053503 Homo sapiens Dihydropyrimidinase-related protein 2 Proteins 0.000 description 2
- 101000988655 Homo sapiens Histamine N-methyltransferase Proteins 0.000 description 2
- 101000613960 Homo sapiens Lysine-specific histone demethylase 1B Proteins 0.000 description 2
- 101001011906 Homo sapiens Matrix metalloproteinase-14 Proteins 0.000 description 2
- 101001007909 Homo sapiens Nuclear pore complex protein Nup93 Proteins 0.000 description 2
- 101001116302 Homo sapiens Platelet endothelial cell adhesion molecule Proteins 0.000 description 2
- 101001130308 Homo sapiens Ras-related protein Rab-21 Proteins 0.000 description 2
- 101000823935 Homo sapiens Serine palmitoyltransferase 3 Proteins 0.000 description 2
- 101000836849 Homo sapiens Signal-induced proliferation-associated 1-like protein 3 Proteins 0.000 description 2
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 2
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 2
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 2
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 2
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 2
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 2
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 2
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 2
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 2
- REXAUQBGSGDEJY-IGISWZIWSA-N Ile-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N REXAUQBGSGDEJY-IGISWZIWSA-N 0.000 description 2
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 2
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 2
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 2
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 2
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 2
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 2
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 2
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 2
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 2
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 2
- WCTCIIAGNMFYAO-DCAQKATOSA-N Leu-Cys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O WCTCIIAGNMFYAO-DCAQKATOSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 2
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 2
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 2
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 2
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 2
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 2
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 2
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 2
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 2
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 2
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 2
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 2
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 2
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 2
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 2
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 2
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 2
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 2
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 2
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 2
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 2
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 2
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 2
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 2
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 2
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 2
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 2
- YKIRNDPUWONXQN-GUBZILKMSA-N Lys-Asn-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKIRNDPUWONXQN-GUBZILKMSA-N 0.000 description 2
- MKBIVWXCFINCLE-SRVKXCTJSA-N Lys-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N MKBIVWXCFINCLE-SRVKXCTJSA-N 0.000 description 2
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 2
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 2
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 2
- WTZUSCUIVPVCRH-SRVKXCTJSA-N Lys-Gln-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WTZUSCUIVPVCRH-SRVKXCTJSA-N 0.000 description 2
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 2
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 2
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 2
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 2
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 2
- DKTNGXVSCZULPO-YUMQZZPRSA-N Lys-Gly-Cys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O DKTNGXVSCZULPO-YUMQZZPRSA-N 0.000 description 2
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 2
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 2
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 2
- 102100040596 Lysine-specific histone demethylase 1B Human genes 0.000 description 2
- 102100030216 Matrix metalloproteinase-14 Human genes 0.000 description 2
- BXNZDLVLGYYFIB-FXQIFTODSA-N Met-Asn-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N BXNZDLVLGYYFIB-FXQIFTODSA-N 0.000 description 2
- YORIKIDJCPKBON-YUMQZZPRSA-N Met-Glu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YORIKIDJCPKBON-YUMQZZPRSA-N 0.000 description 2
- BCRQJDMZQUHQSV-STQMWFEESA-N Met-Gly-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BCRQJDMZQUHQSV-STQMWFEESA-N 0.000 description 2
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 2
- WTHGNAAQXISJHP-AVGNSLFASA-N Met-Lys-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WTHGNAAQXISJHP-AVGNSLFASA-N 0.000 description 2
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 101100068676 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) gln-1 gene Proteins 0.000 description 2
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 2
- 102100027585 Nuclear pore complex protein Nup93 Human genes 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 2
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 2
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 2
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 2
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 2
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 2
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 2
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 2
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 2
- 102100024616 Platelet endothelial cell adhesion molecule Human genes 0.000 description 2
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 2
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 2
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 2
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 2
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 2
- WPQKSRHDTMRSJM-CIUDSAMLSA-N Pro-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 WPQKSRHDTMRSJM-CIUDSAMLSA-N 0.000 description 2
- KTFZQPLSPLWLKN-KKUMJFAQSA-N Pro-Gln-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KTFZQPLSPLWLKN-KKUMJFAQSA-N 0.000 description 2
- VPFGPKIWSDVTOY-SRVKXCTJSA-N Pro-Glu-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O VPFGPKIWSDVTOY-SRVKXCTJSA-N 0.000 description 2
- LCUOTSLIVGSGAU-AVGNSLFASA-N Pro-His-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LCUOTSLIVGSGAU-AVGNSLFASA-N 0.000 description 2
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 2
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 2
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 2
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 2
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 2
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 2
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 2
- 102000004022 Protein-Tyrosine Kinases Human genes 0.000 description 2
- 102000020146 Rab21 Human genes 0.000 description 2
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 2
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 2
- IYCBDVBJWDXQRR-FXQIFTODSA-N Ser-Ala-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IYCBDVBJWDXQRR-FXQIFTODSA-N 0.000 description 2
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 2
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 2
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 2
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 2
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 2
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 2
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 2
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 2
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 2
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 2
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 2
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 2
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 2
- QJKPECIAWNNKIT-KKUMJFAQSA-N Ser-Lys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QJKPECIAWNNKIT-KKUMJFAQSA-N 0.000 description 2
- FZEUTKVQGMVGHW-AVGNSLFASA-N Ser-Phe-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZEUTKVQGMVGHW-AVGNSLFASA-N 0.000 description 2
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 2
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 2
- 102100022070 Serine palmitoyltransferase 3 Human genes 0.000 description 2
- 102100027099 Signal-induced proliferation-associated 1-like protein 3 Human genes 0.000 description 2
- 206010041067 Small cell lung cancer Diseases 0.000 description 2
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 2
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 2
- UDNVOQMPQBEITB-MEYUZBJRSA-N Thr-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UDNVOQMPQBEITB-MEYUZBJRSA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- MEBDIIKMUUNBSB-RPTUDFQQSA-N Thr-Phe-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MEBDIIKMUUNBSB-RPTUDFQQSA-N 0.000 description 2
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 2
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 2
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 2
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 2
- TZQWJCGVCIJDMU-HEIBUPTGSA-N Thr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N)O TZQWJCGVCIJDMU-HEIBUPTGSA-N 0.000 description 2
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 2
- PMDWYLVWHRTJIW-STQMWFEESA-N Tyr-Gly-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PMDWYLVWHRTJIW-STQMWFEESA-N 0.000 description 2
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 2
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 2
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 2
- 108010075653 Utrophin Proteins 0.000 description 2
- 102000011856 Utrophin Human genes 0.000 description 2
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 2
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 2
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 2
- CWOSXNKDOACNJN-BZSNNMDCSA-N Val-Arg-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N CWOSXNKDOACNJN-BZSNNMDCSA-N 0.000 description 2
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 2
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 2
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 2
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 2
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 2
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 2
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 2
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 2
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 2
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 2
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 2
- 102000002258 X-ray Repair Cross Complementing Protein 1 Human genes 0.000 description 2
- 108010000443 X-ray Repair Cross Complementing Protein 1 Proteins 0.000 description 2
- 238000002835 absorbance Methods 0.000 description 2
- 239000004480 active ingredient Substances 0.000 description 2
- 239000008272 agar Substances 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 238000000423 cell based assay Methods 0.000 description 2
- 238000012790 confirmation Methods 0.000 description 2
- 108010025198 decaglycine Proteins 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 230000037437 driver mutation Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 229940125874 fusion protein inhibitor Drugs 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- 230000009036 growth inhibition Effects 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 239000005556 hormone Substances 0.000 description 2
- 229940088597 hormone Drugs 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 230000028993 immune response Effects 0.000 description 2
- 238000003317 immunochromatography Methods 0.000 description 2
- 238000011532 immunohistochemical staining Methods 0.000 description 2
- 108010088381 isoleucyl-lysyl-valyl-alanyl-valine Proteins 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 2
- 238000004020 luminescence type Methods 0.000 description 2
- 210000004072 lung Anatomy 0.000 description 2
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- BASFCYQUMIYNBI-UHFFFAOYSA-N platinum Chemical compound [Pt] BASFCYQUMIYNBI-UHFFFAOYSA-N 0.000 description 2
- 108010053725 prolylvaline Proteins 0.000 description 2
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 2
- 238000003908 quality control method Methods 0.000 description 2
- 230000001177 retroviral effect Effects 0.000 description 2
- 208000000649 small cell carcinoma Diseases 0.000 description 2
- 208000000587 small cell lung carcinoma Diseases 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 238000002626 targeted therapy Methods 0.000 description 2
- 230000008685 targeting Effects 0.000 description 2
- 230000001225 therapeutic effect Effects 0.000 description 2
- 108010061238 threonyl-glycine Proteins 0.000 description 2
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 210000004881 tumor cell Anatomy 0.000 description 2
- 108010079202 tyrosyl-alanyl-cysteine Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 108010078580 tyrosylleucine Proteins 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- CNKBMTKICGGSCQ-ACRUOGEOSA-N (2S)-2-[[(2S)-2-[[(2S)-2,6-diamino-1-oxohexyl]amino]-1-oxo-3-phenylpropyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CNKBMTKICGGSCQ-ACRUOGEOSA-N 0.000 description 1
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 1
- COEXAQSTZUWMRI-STQMWFEESA-N (2s)-1-[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 COEXAQSTZUWMRI-STQMWFEESA-N 0.000 description 1
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 1
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 1
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- XQQUSYWGKLRJRA-RABCQHRBSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-6-amino-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]hexanoyl]amino]-3-methylbutanoyl]amino]propanoyl]amino]-3-methylbutanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XQQUSYWGKLRJRA-RABCQHRBSA-N 0.000 description 1
- RRBGTUQJDFBWNN-MUGJNUQGSA-N (2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-2,6-diaminohexanoyl]amino]hexanoyl]amino]hexanoyl]amino]hexanoic acid Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O RRBGTUQJDFBWNN-MUGJNUQGSA-N 0.000 description 1
- QMOQBVOBWVNSNO-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QMOQBVOBWVNSNO-UHFFFAOYSA-N 0.000 description 1
- XJFPXLWGZWAWRQ-UHFFFAOYSA-N 2-[[2-[[2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(O)=O XJFPXLWGZWAWRQ-UHFFFAOYSA-N 0.000 description 1
- 101710168331 ALK tyrosine kinase receptor Proteins 0.000 description 1
- 206010069754 Acquired gene mutation Diseases 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- FSBCNCKIQZZASN-GUBZILKMSA-N Ala-Arg-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O FSBCNCKIQZZASN-GUBZILKMSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 1
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 1
- SHYYAQLDNVHPFT-DLOVCJGASA-N Ala-Asn-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SHYYAQLDNVHPFT-DLOVCJGASA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- WCBVQNZTOKJWJS-ACZMJKKPSA-N Ala-Cys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WCBVQNZTOKJWJS-ACZMJKKPSA-N 0.000 description 1
- CXZFXHGJJPVUJE-CIUDSAMLSA-N Ala-Cys-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O)N CXZFXHGJJPVUJE-CIUDSAMLSA-N 0.000 description 1
- YEELWQSXYBJVSV-UWJYBYFXSA-N Ala-Cys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YEELWQSXYBJVSV-UWJYBYFXSA-N 0.000 description 1
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 1
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 1
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 1
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 1
- VWEWCZSUWOEEFM-WDSKDSINSA-N Ala-Gly-Ala-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(O)=O VWEWCZSUWOEEFM-WDSKDSINSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 1
- GRPHQEMIFDPKOE-HGNGGELXSA-N Ala-His-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GRPHQEMIFDPKOE-HGNGGELXSA-N 0.000 description 1
- HUUOZYZWNCXTFK-INTQDDNPSA-N Ala-His-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N HUUOZYZWNCXTFK-INTQDDNPSA-N 0.000 description 1
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 1
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- NMXKFWOEASXOGB-QSFUFRPTSA-N Ala-Ile-His Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NMXKFWOEASXOGB-QSFUFRPTSA-N 0.000 description 1
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- OPZJWMJPCNNZNT-DCAQKATOSA-N Ala-Leu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N OPZJWMJPCNNZNT-DCAQKATOSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- VHEVVUZDDUCAKU-FXQIFTODSA-N Ala-Met-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O VHEVVUZDDUCAKU-FXQIFTODSA-N 0.000 description 1
- IHRGVZXPTIQNIP-NAKRPEOUSA-N Ala-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C)N IHRGVZXPTIQNIP-NAKRPEOUSA-N 0.000 description 1
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- BDQNLQSWRAPHGU-DLOVCJGASA-N Ala-Phe-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N BDQNLQSWRAPHGU-DLOVCJGASA-N 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- CUOMGDPDITUMIJ-HZZBMVKVSA-N Ala-Phe-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 CUOMGDPDITUMIJ-HZZBMVKVSA-N 0.000 description 1
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 1
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 1
- QKHWNPQNOHEFST-VZFHVOOUSA-N Ala-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N)O QKHWNPQNOHEFST-VZFHVOOUSA-N 0.000 description 1
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 1
- AAWLEICNDUHIJM-MBLNEYKQSA-N Ala-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C)N)O AAWLEICNDUHIJM-MBLNEYKQSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- AETQNIIFKCMVHP-UVBJJODRSA-N Ala-Trp-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AETQNIIFKCMVHP-UVBJJODRSA-N 0.000 description 1
- TVUFMYKTYXTRPY-HERUPUMHSA-N Ala-Trp-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O TVUFMYKTYXTRPY-HERUPUMHSA-N 0.000 description 1
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 1
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 1
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- DHONNEYAZPNGSG-UBHSHLNASA-N Ala-Val-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DHONNEYAZPNGSG-UBHSHLNASA-N 0.000 description 1
- 101710168921 Amyloid beta precursor like protein 2 Proteins 0.000 description 1
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- RWWPBOUMKFBHAL-FXQIFTODSA-N Arg-Asn-Cys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O RWWPBOUMKFBHAL-FXQIFTODSA-N 0.000 description 1
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- BBYTXXRNSFUOOX-IHRRRGAJSA-N Arg-Cys-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BBYTXXRNSFUOOX-IHRRRGAJSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- BQBPFMNVOWDLHO-XIRDDKMYSA-N Arg-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N BQBPFMNVOWDLHO-XIRDDKMYSA-N 0.000 description 1
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 1
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 1
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 1
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 1
- GFMWTFHOZGLTLC-AVGNSLFASA-N Arg-His-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O GFMWTFHOZGLTLC-AVGNSLFASA-N 0.000 description 1
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 1
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 1
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 1
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 1
- NOZYDJOPOGKUSR-AVGNSLFASA-N Arg-Leu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O NOZYDJOPOGKUSR-AVGNSLFASA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- QBQVKUNBCAFXSV-ULQDDVLXSA-N Arg-Lys-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QBQVKUNBCAFXSV-ULQDDVLXSA-N 0.000 description 1
- VIINVRPKMUZYOI-DCAQKATOSA-N Arg-Met-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIINVRPKMUZYOI-DCAQKATOSA-N 0.000 description 1
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 1
- WCZXPVPHUMYLMS-VEVYYDQMSA-N Arg-Thr-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O WCZXPVPHUMYLMS-VEVYYDQMSA-N 0.000 description 1
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- JBQORRNSZGTLCV-WDSOQIARSA-N Arg-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 JBQORRNSZGTLCV-WDSOQIARSA-N 0.000 description 1
- ZCSHHTFOZULVLN-SZMVWBNQSA-N Arg-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 ZCSHHTFOZULVLN-SZMVWBNQSA-N 0.000 description 1
- CTAPSNCVKPOOSM-KKUMJFAQSA-N Arg-Tyr-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CTAPSNCVKPOOSM-KKUMJFAQSA-N 0.000 description 1
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- BRCVLJZIIFBSPF-ZLUOBGJFSA-N Asn-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N BRCVLJZIIFBSPF-ZLUOBGJFSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 1
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 1
- RCENDENBBJFJHZ-ACZMJKKPSA-N Asn-Asn-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCENDENBBJFJHZ-ACZMJKKPSA-N 0.000 description 1
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 1
- WVCJSDCHTUTONA-FXQIFTODSA-N Asn-Asp-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WVCJSDCHTUTONA-FXQIFTODSA-N 0.000 description 1
- HUAOKVVEVHACHR-CIUDSAMLSA-N Asn-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N HUAOKVVEVHACHR-CIUDSAMLSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- PAXHINASXXXILC-SRVKXCTJSA-N Asn-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)O PAXHINASXXXILC-SRVKXCTJSA-N 0.000 description 1
- XXAOXVBAWLMTDR-ZLUOBGJFSA-N Asn-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N XXAOXVBAWLMTDR-ZLUOBGJFSA-N 0.000 description 1
- AYKKKGFJXIDYLX-ACZMJKKPSA-N Asn-Gln-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AYKKKGFJXIDYLX-ACZMJKKPSA-N 0.000 description 1
- QGNXYDHVERJIAY-ACZMJKKPSA-N Asn-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGNXYDHVERJIAY-ACZMJKKPSA-N 0.000 description 1
- PQAIOUVVZCOLJK-FXQIFTODSA-N Asn-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PQAIOUVVZCOLJK-FXQIFTODSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- QNJIRRVTOXNGMH-GUBZILKMSA-N Asn-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(N)=O QNJIRRVTOXNGMH-GUBZILKMSA-N 0.000 description 1
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 1
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 1
- OGMDXNFGPOPZTK-GUBZILKMSA-N Asn-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N OGMDXNFGPOPZTK-GUBZILKMSA-N 0.000 description 1
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 description 1
- BKDDABUWNKGZCK-XHNCKOQMSA-N Asn-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O BKDDABUWNKGZCK-XHNCKOQMSA-N 0.000 description 1
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- PLVAAIPKSGUXDV-WHFBIAKZSA-N Asn-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)N PLVAAIPKSGUXDV-WHFBIAKZSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 1
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 1
- YGHCVNQOZZMHRZ-DJFWLOJKSA-N Asn-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N YGHCVNQOZZMHRZ-DJFWLOJKSA-N 0.000 description 1
- SGAUXNZEFIEAAI-GARJFASQSA-N Asn-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)C(=O)O SGAUXNZEFIEAAI-GARJFASQSA-N 0.000 description 1
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- LVHMEJJWEXBMKK-GMOBBJLQSA-N Asn-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N LVHMEJJWEXBMKK-GMOBBJLQSA-N 0.000 description 1
- XLZCLJRGGMBKLR-PCBIJLKTSA-N Asn-Ile-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XLZCLJRGGMBKLR-PCBIJLKTSA-N 0.000 description 1
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 1
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- UBGGJTMETLEXJD-DCAQKATOSA-N Asn-Leu-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O UBGGJTMETLEXJD-DCAQKATOSA-N 0.000 description 1
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 1
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- WXVGISRWSYGEDK-KKUMJFAQSA-N Asn-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N WXVGISRWSYGEDK-KKUMJFAQSA-N 0.000 description 1
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 1
- ICDDSTLEMLGSTB-GUBZILKMSA-N Asn-Met-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ICDDSTLEMLGSTB-GUBZILKMSA-N 0.000 description 1
- QGABLMITFKUQDF-DCAQKATOSA-N Asn-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGABLMITFKUQDF-DCAQKATOSA-N 0.000 description 1
- CDGHMJJJHYKMPA-DLOVCJGASA-N Asn-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC(=O)N)N CDGHMJJJHYKMPA-DLOVCJGASA-N 0.000 description 1
- ZVUMKOMKQCANOM-AVGNSLFASA-N Asn-Phe-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVUMKOMKQCANOM-AVGNSLFASA-N 0.000 description 1
- RBOBTTLFPRSXKZ-BZSNNMDCSA-N Asn-Phe-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RBOBTTLFPRSXKZ-BZSNNMDCSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- KYQJHBWHRASMKG-ZLUOBGJFSA-N Asn-Ser-Cys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O KYQJHBWHRASMKG-ZLUOBGJFSA-N 0.000 description 1
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 1
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 1
- XHTUGJCAEYOZOR-UBHSHLNASA-N Asn-Ser-Trp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XHTUGJCAEYOZOR-UBHSHLNASA-N 0.000 description 1
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 1
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 1
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 1
- JPPLRQVZMZFOSX-UWJYBYFXSA-N Asn-Tyr-Ala Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 JPPLRQVZMZFOSX-UWJYBYFXSA-N 0.000 description 1
- NJPLPRFQLBZAMH-IHRRRGAJSA-N Asn-Tyr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O NJPLPRFQLBZAMH-IHRRRGAJSA-N 0.000 description 1
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 1
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 1
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 1
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 1
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 1
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 1
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 1
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 1
- SNAWMGHSCHKSDK-GUBZILKMSA-N Asp-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N SNAWMGHSCHKSDK-GUBZILKMSA-N 0.000 description 1
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 1
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 1
- ZSJFGGSPCCHMNE-LAEOZQHASA-N Asp-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N ZSJFGGSPCCHMNE-LAEOZQHASA-N 0.000 description 1
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 1
- UBPMOJLRVMGTOQ-GARJFASQSA-N Asp-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)C(=O)O UBPMOJLRVMGTOQ-GARJFASQSA-N 0.000 description 1
- ICZWAZVKLACMKR-CIUDSAMLSA-N Asp-His-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 ICZWAZVKLACMKR-CIUDSAMLSA-N 0.000 description 1
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- LDLZOAJRXXBVGF-GMOBBJLQSA-N Asp-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)O)N LDLZOAJRXXBVGF-GMOBBJLQSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- AITKTFCQOBRJTG-CIUDSAMLSA-N Asp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N AITKTFCQOBRJTG-CIUDSAMLSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- QNIACYURSSCLRP-GUBZILKMSA-N Asp-Lys-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O QNIACYURSSCLRP-GUBZILKMSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- FQHBAQLBIXLWAG-DCAQKATOSA-N Asp-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N FQHBAQLBIXLWAG-DCAQKATOSA-N 0.000 description 1
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 1
- BPTFNDRZKBFMTH-DCAQKATOSA-N Asp-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N BPTFNDRZKBFMTH-DCAQKATOSA-N 0.000 description 1
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 1
- KOWYNSKRPUWSFG-IHPCNDPISA-N Asp-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC(=O)O)N KOWYNSKRPUWSFG-IHPCNDPISA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- LGGHQRZIJSYRHA-GUBZILKMSA-N Asp-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N LGGHQRZIJSYRHA-GUBZILKMSA-N 0.000 description 1
- BKOIIURTQAJHAT-GUBZILKMSA-N Asp-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 BKOIIURTQAJHAT-GUBZILKMSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 1
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 1
- RCGVPVZHKAXDPA-NYVOZVTQSA-N Asp-Trp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)NC(=O)[C@H](CC(=O)O)N RCGVPVZHKAXDPA-NYVOZVTQSA-N 0.000 description 1
- GHAHOJDCBRXAKC-IHPCNDPISA-N Asp-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N GHAHOJDCBRXAKC-IHPCNDPISA-N 0.000 description 1
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 1
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 1
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- 208000003174 Brain Neoplasms Diseases 0.000 description 1
- 206010006187 Breast cancer Diseases 0.000 description 1
- 208000026310 Breast neoplasm Diseases 0.000 description 1
- 102000014818 Cadherin-13 Human genes 0.000 description 1
- 201000009030 Carcinoma Diseases 0.000 description 1
- 102100028914 Catenin beta-1 Human genes 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 102100025567 Citron Rho-interacting kinase Human genes 0.000 description 1
- 206010009944 Colon cancer Diseases 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 1
- RRIJEABIXPKSGP-FXQIFTODSA-N Cys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CS RRIJEABIXPKSGP-FXQIFTODSA-N 0.000 description 1
- GMXSSZUVDNPRMA-FXQIFTODSA-N Cys-Arg-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GMXSSZUVDNPRMA-FXQIFTODSA-N 0.000 description 1
- XGIAHEUULGOZHH-GUBZILKMSA-N Cys-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N XGIAHEUULGOZHH-GUBZILKMSA-N 0.000 description 1
- NDUSUIGBMZCOIL-ZKWXMUAHSA-N Cys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N NDUSUIGBMZCOIL-ZKWXMUAHSA-N 0.000 description 1
- XABFFGOGKOORCG-CIUDSAMLSA-N Cys-Asp-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XABFFGOGKOORCG-CIUDSAMLSA-N 0.000 description 1
- DVKQPQKQDHHFTE-ZLUOBGJFSA-N Cys-Cys-Asn Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N)C(=O)N DVKQPQKQDHHFTE-ZLUOBGJFSA-N 0.000 description 1
- SFRQEQGPRTVDPO-NRPADANISA-N Cys-Gln-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O SFRQEQGPRTVDPO-NRPADANISA-N 0.000 description 1
- VBPGTULCFGKGTF-ACZMJKKPSA-N Cys-Glu-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VBPGTULCFGKGTF-ACZMJKKPSA-N 0.000 description 1
- VCIIDXDOPGHMDQ-WDSKDSINSA-N Cys-Gly-Gln Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VCIIDXDOPGHMDQ-WDSKDSINSA-N 0.000 description 1
- LBOLGUYQEPZSKM-YUMQZZPRSA-N Cys-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N LBOLGUYQEPZSKM-YUMQZZPRSA-N 0.000 description 1
- RRJOQIBQVZDVCW-SRVKXCTJSA-N Cys-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N RRJOQIBQVZDVCW-SRVKXCTJSA-N 0.000 description 1
- HAYVLBZZBDCKRA-SRVKXCTJSA-N Cys-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N HAYVLBZZBDCKRA-SRVKXCTJSA-N 0.000 description 1
- XIZWKXATMJODQW-KKUMJFAQSA-N Cys-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N XIZWKXATMJODQW-KKUMJFAQSA-N 0.000 description 1
- XGHYKIDVGYYHDC-JBDRJPRFSA-N Cys-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N XGHYKIDVGYYHDC-JBDRJPRFSA-N 0.000 description 1
- PRHGYQOSEHLDRW-VGDYDELISA-N Cys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N PRHGYQOSEHLDRW-VGDYDELISA-N 0.000 description 1
- BLGNLNRBABWDST-CIUDSAMLSA-N Cys-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BLGNLNRBABWDST-CIUDSAMLSA-N 0.000 description 1
- XLLSMEFANRROJE-GUBZILKMSA-N Cys-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XLLSMEFANRROJE-GUBZILKMSA-N 0.000 description 1
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 1
- UCSXXFRXHGUXCQ-SRVKXCTJSA-N Cys-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N UCSXXFRXHGUXCQ-SRVKXCTJSA-N 0.000 description 1
- HBHMVBGGHDMPBF-GARJFASQSA-N Cys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N HBHMVBGGHDMPBF-GARJFASQSA-N 0.000 description 1
- XZKJEOMFLDVXJG-KATARQTJSA-N Cys-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)N)O XZKJEOMFLDVXJG-KATARQTJSA-N 0.000 description 1
- OHLLDUNVMPPUMD-DCAQKATOSA-N Cys-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N OHLLDUNVMPPUMD-DCAQKATOSA-N 0.000 description 1
- OETOANMAHTWESF-KKUMJFAQSA-N Cys-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CS)N OETOANMAHTWESF-KKUMJFAQSA-N 0.000 description 1
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 1
- BCWIFCLVCRAIQK-ZLUOBGJFSA-N Cys-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O BCWIFCLVCRAIQK-ZLUOBGJFSA-N 0.000 description 1
- ZGERHCJBLPQPGV-ACZMJKKPSA-N Cys-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N ZGERHCJBLPQPGV-ACZMJKKPSA-N 0.000 description 1
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 1
- DQGIAOGALAQBGK-BWBBJGPYSA-N Cys-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O DQGIAOGALAQBGK-BWBBJGPYSA-N 0.000 description 1
- JLZCAZJGWNRXCI-XKBZYTNZSA-N Cys-Thr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O JLZCAZJGWNRXCI-XKBZYTNZSA-N 0.000 description 1
- GFAPBMCRSMSGDZ-XGEHTFHBSA-N Cys-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CS)N)O GFAPBMCRSMSGDZ-XGEHTFHBSA-N 0.000 description 1
- FANFRJOFTYCNRG-JYBASQMISA-N Cys-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N)O FANFRJOFTYCNRG-JYBASQMISA-N 0.000 description 1
- KFYPRIGJTICABD-XGEHTFHBSA-N Cys-Thr-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N)O KFYPRIGJTICABD-XGEHTFHBSA-N 0.000 description 1
- YFKWIIRWHGKSQQ-WFBYXXMGSA-N Cys-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CS)N YFKWIIRWHGKSQQ-WFBYXXMGSA-N 0.000 description 1
- IZJLAQMWJHCHTN-BPUTZDHNSA-N Cys-Trp-Arg Chemical compound N[C@@H](CS)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O IZJLAQMWJHCHTN-BPUTZDHNSA-N 0.000 description 1
- XAHWYEYOMSGKDA-CWRNSKLLSA-N Cys-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CS)N)C(=O)O XAHWYEYOMSGKDA-CWRNSKLLSA-N 0.000 description 1
- KXHAPEPORGOXDT-UWJYBYFXSA-N Cys-Tyr-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O KXHAPEPORGOXDT-UWJYBYFXSA-N 0.000 description 1
- WVWRADGCZPIJJR-IHRRRGAJSA-N Cys-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N WVWRADGCZPIJJR-IHRRRGAJSA-N 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 239000003298 DNA probe Substances 0.000 description 1
- 102400001368 Epidermal growth factor Human genes 0.000 description 1
- 101800003838 Epidermal growth factor Proteins 0.000 description 1
- 208000000461 Esophageal Neoplasms Diseases 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 229940127513 Fusion Protein Inhibitors Drugs 0.000 description 1
- 102100039788 GTPase NRas Human genes 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- LZRMPXRYLLTAJX-GUBZILKMSA-N Gln-Arg-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZRMPXRYLLTAJX-GUBZILKMSA-N 0.000 description 1
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 1
- KJRXLVZYJJLUCV-DCAQKATOSA-N Gln-Arg-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KJRXLVZYJJLUCV-DCAQKATOSA-N 0.000 description 1
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 1
- SSWAFVQFQWOJIJ-XIRDDKMYSA-N Gln-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N SSWAFVQFQWOJIJ-XIRDDKMYSA-N 0.000 description 1
- OETQLUYCMBARHJ-CIUDSAMLSA-N Gln-Asn-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OETQLUYCMBARHJ-CIUDSAMLSA-N 0.000 description 1
- WMOMPXKOKASNBK-PEFMBERDSA-N Gln-Asn-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WMOMPXKOKASNBK-PEFMBERDSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- PONUFVLSGMQFAI-AVGNSLFASA-N Gln-Asn-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PONUFVLSGMQFAI-AVGNSLFASA-N 0.000 description 1
- RKAQZCDMSUQTSS-FXQIFTODSA-N Gln-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RKAQZCDMSUQTSS-FXQIFTODSA-N 0.000 description 1
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 1
- KZEUVLLVULIPNX-GUBZILKMSA-N Gln-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N KZEUVLLVULIPNX-GUBZILKMSA-N 0.000 description 1
- WLODHVXYKYHLJD-ACZMJKKPSA-N Gln-Asp-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N WLODHVXYKYHLJD-ACZMJKKPSA-N 0.000 description 1
- COYGBRTZEVWZBW-XKBZYTNZSA-N Gln-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O COYGBRTZEVWZBW-XKBZYTNZSA-N 0.000 description 1
- XEZWLWNGUMXAJI-QEJZJMRPSA-N Gln-Cys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CS)NC(=O)[C@H](CCC(N)=O)N)C(O)=O)=CNC2=C1 XEZWLWNGUMXAJI-QEJZJMRPSA-N 0.000 description 1
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 1
- QYKBTDOAMKORGL-FXQIFTODSA-N Gln-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QYKBTDOAMKORGL-FXQIFTODSA-N 0.000 description 1
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 1
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 1
- NPTGGVQJYRSMCM-GLLZPBPUSA-N Gln-Gln-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPTGGVQJYRSMCM-GLLZPBPUSA-N 0.000 description 1
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 1
- BLOXULLYFRGYKZ-GUBZILKMSA-N Gln-Glu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BLOXULLYFRGYKZ-GUBZILKMSA-N 0.000 description 1
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 1
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 1
- NXPXQIZKDOXIHH-JSGCOSHPSA-N Gln-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N NXPXQIZKDOXIHH-JSGCOSHPSA-N 0.000 description 1
- GLAPJAHOPFSLKL-SRVKXCTJSA-N Gln-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N GLAPJAHOPFSLKL-SRVKXCTJSA-N 0.000 description 1
- ICDIMQAMJGDHSE-GUBZILKMSA-N Gln-His-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O ICDIMQAMJGDHSE-GUBZILKMSA-N 0.000 description 1
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 1
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 1
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 1
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 1
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 1
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 1
- XBWGJWXGUNSZAT-CIUDSAMLSA-N Gln-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N XBWGJWXGUNSZAT-CIUDSAMLSA-N 0.000 description 1
- LHMWTCWZARHLPV-CIUDSAMLSA-N Gln-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LHMWTCWZARHLPV-CIUDSAMLSA-N 0.000 description 1
- BZULIEARJFRINC-IHRRRGAJSA-N Gln-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BZULIEARJFRINC-IHRRRGAJSA-N 0.000 description 1
- DRNMNLKUUKKPIA-HTUGSXCWSA-N Gln-Phe-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)CCC(N)=O)C(O)=O DRNMNLKUUKKPIA-HTUGSXCWSA-N 0.000 description 1
- ZVQZXPADLZIQFF-FHWLQOOXSA-N Gln-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 ZVQZXPADLZIQFF-FHWLQOOXSA-N 0.000 description 1
- NJMYZEJORPYOTO-BQBZGAKWSA-N Gln-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O NJMYZEJORPYOTO-BQBZGAKWSA-N 0.000 description 1
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 1
- DOQUICBEISTQHE-CIUDSAMLSA-N Gln-Pro-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O DOQUICBEISTQHE-CIUDSAMLSA-N 0.000 description 1
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 1
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 1
- DCWNCMRZIZSZBL-KKUMJFAQSA-N Gln-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O DCWNCMRZIZSZBL-KKUMJFAQSA-N 0.000 description 1
- KVQOVQVGVKDZNW-GUBZILKMSA-N Gln-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KVQOVQVGVKDZNW-GUBZILKMSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 1
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 1
- ININBLZFFVOQIO-JHEQGTHGSA-N Gln-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O ININBLZFFVOQIO-JHEQGTHGSA-N 0.000 description 1
- XFHMVFKCQSHLKW-HJGDQZAQSA-N Gln-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XFHMVFKCQSHLKW-HJGDQZAQSA-N 0.000 description 1
- BETSEXMYBWCDAE-SZMVWBNQSA-N Gln-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BETSEXMYBWCDAE-SZMVWBNQSA-N 0.000 description 1
- WTJIWXMJESRHMM-XDTLVQLUSA-N Gln-Tyr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O WTJIWXMJESRHMM-XDTLVQLUSA-N 0.000 description 1
- WPJDPEOQUIXXOY-AVGNSLFASA-N Gln-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WPJDPEOQUIXXOY-AVGNSLFASA-N 0.000 description 1
- AKDOUBMVLRCHBD-SIUGBPQLSA-N Gln-Tyr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AKDOUBMVLRCHBD-SIUGBPQLSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 description 1
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 1
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- RSUVOPBMWMTVDI-XEGUGMAKSA-N Glu-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(O)=O)C)C(O)=O)=CNC2=C1 RSUVOPBMWMTVDI-XEGUGMAKSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- LJLPOZGRPLORTF-CIUDSAMLSA-N Glu-Asn-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LJLPOZGRPLORTF-CIUDSAMLSA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- FLQAKQOBSPFGKG-CIUDSAMLSA-N Glu-Cys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLQAKQOBSPFGKG-CIUDSAMLSA-N 0.000 description 1
- LSTFYPOGBGFIPP-FXQIFTODSA-N Glu-Cys-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O LSTFYPOGBGFIPP-FXQIFTODSA-N 0.000 description 1
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 1
- CJWANNXUTOATSJ-DCAQKATOSA-N Glu-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N CJWANNXUTOATSJ-DCAQKATOSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- IFZWDJWERARYFC-WNHJNPCNSA-N Glu-Glu-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 IFZWDJWERARYFC-WNHJNPCNSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- GGJOGFJIPPGNRK-JSGCOSHPSA-N Glu-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 GGJOGFJIPPGNRK-JSGCOSHPSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- XOIATPHFYVWFEU-DCAQKATOSA-N Glu-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOIATPHFYVWFEU-DCAQKATOSA-N 0.000 description 1
- XOFYVODYSNKPDK-AVGNSLFASA-N Glu-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XOFYVODYSNKPDK-AVGNSLFASA-N 0.000 description 1
- NJPQBTJSYCKCNS-HVTMNAMFSA-N Glu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N NJPQBTJSYCKCNS-HVTMNAMFSA-N 0.000 description 1
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- DWBBKNPKDHXIAC-SRVKXCTJSA-N Glu-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCC(O)=O DWBBKNPKDHXIAC-SRVKXCTJSA-N 0.000 description 1
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 1
- YGLCLCMAYUYZSG-AVGNSLFASA-N Glu-Lys-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 YGLCLCMAYUYZSG-AVGNSLFASA-N 0.000 description 1
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 1
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- NPMSEUWUMOSEFM-CIUDSAMLSA-N Glu-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N NPMSEUWUMOSEFM-CIUDSAMLSA-N 0.000 description 1
- ZWMYUDZLXAQHCK-CIUDSAMLSA-N Glu-Met-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O ZWMYUDZLXAQHCK-CIUDSAMLSA-N 0.000 description 1
- XNOWYPDMSLSRKP-GUBZILKMSA-N Glu-Met-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O XNOWYPDMSLSRKP-GUBZILKMSA-N 0.000 description 1
- JHSRJMUJOGLIHK-GUBZILKMSA-N Glu-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N JHSRJMUJOGLIHK-GUBZILKMSA-N 0.000 description 1
- YHOJJFFTSMWVGR-HJGDQZAQSA-N Glu-Met-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YHOJJFFTSMWVGR-HJGDQZAQSA-N 0.000 description 1
- HOIPREWORBVRLD-XIRDDKMYSA-N Glu-Met-Trp Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O HOIPREWORBVRLD-XIRDDKMYSA-N 0.000 description 1
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 1
- JDUKCSSHWNIQQZ-IHRRRGAJSA-N Glu-Phe-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JDUKCSSHWNIQQZ-IHRRRGAJSA-N 0.000 description 1
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 1
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 1
- PAZQYODKOZHXGA-SRVKXCTJSA-N Glu-Pro-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O PAZQYODKOZHXGA-SRVKXCTJSA-N 0.000 description 1
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- ZAPFAWQHBOHWLL-GUBZILKMSA-N Glu-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N ZAPFAWQHBOHWLL-GUBZILKMSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 1
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 1
- QGAJQIGFFIQJJK-IHRRRGAJSA-N Glu-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QGAJQIGFFIQJJK-IHRRRGAJSA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 1
- ZALGPUWUVHOGAE-GVXVVHGQSA-N Glu-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZALGPUWUVHOGAE-GVXVVHGQSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 1
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- RQZGFWKQLPJOEQ-YUMQZZPRSA-N Gly-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN)CN=C(N)N RQZGFWKQLPJOEQ-YUMQZZPRSA-N 0.000 description 1
- MXXXVOYFNVJHMA-IUCAKERBSA-N Gly-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN MXXXVOYFNVJHMA-IUCAKERBSA-N 0.000 description 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 1
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 1
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- MQVNVZUEPUIAFA-WDSKDSINSA-N Gly-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN MQVNVZUEPUIAFA-WDSKDSINSA-N 0.000 description 1
- YYQGVXNKAXUTJU-YUMQZZPRSA-N Gly-Cys-His Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O YYQGVXNKAXUTJU-YUMQZZPRSA-N 0.000 description 1
- PEZZSFLFXXFUQD-XPUUQOCRSA-N Gly-Cys-Val Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O PEZZSFLFXXFUQD-XPUUQOCRSA-N 0.000 description 1
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 1
- HPAIKDPJURGQLN-KBPBESRZSA-N Gly-His-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 HPAIKDPJURGQLN-KBPBESRZSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- JPAACTMBBBGAAR-HOTGVXAUSA-N Gly-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)CC(C)C)C(O)=O)=CNC2=C1 JPAACTMBBBGAAR-HOTGVXAUSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- OMOZPGCHVWOXHN-BQBZGAKWSA-N Gly-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)CN OMOZPGCHVWOXHN-BQBZGAKWSA-N 0.000 description 1
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 1
- JNGHLWWFPGIJER-STQMWFEESA-N Gly-Pro-Tyr Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JNGHLWWFPGIJER-STQMWFEESA-N 0.000 description 1
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 1
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 1
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 1
- BXDLTKLPPKBVEL-FJXKBIBVSA-N Gly-Thr-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O BXDLTKLPPKBVEL-FJXKBIBVSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- GULGDABMYTYMJZ-STQMWFEESA-N Gly-Trp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O GULGDABMYTYMJZ-STQMWFEESA-N 0.000 description 1
- RJVZMGQMJOQIAX-GJZGRUSLSA-N Gly-Trp-Met Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(O)=O RJVZMGQMJOQIAX-GJZGRUSLSA-N 0.000 description 1
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 1
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 1
- MBSSHYPAEHPSGY-LSJOCFKGSA-N His-Ala-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O MBSSHYPAEHPSGY-LSJOCFKGSA-N 0.000 description 1
- ZIMTWPHIKZEHSE-UWVGGRQHSA-N His-Arg-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O ZIMTWPHIKZEHSE-UWVGGRQHSA-N 0.000 description 1
- MWAJSVTZZOUOBU-IHRRRGAJSA-N His-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 MWAJSVTZZOUOBU-IHRRRGAJSA-N 0.000 description 1
- CJGDTAHEMXLRMB-ULQDDVLXSA-N His-Arg-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CJGDTAHEMXLRMB-ULQDDVLXSA-N 0.000 description 1
- MWWOPNQSBXEUHO-ULQDDVLXSA-N His-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 MWWOPNQSBXEUHO-ULQDDVLXSA-N 0.000 description 1
- QZAFGJNKLMNDEM-DCAQKATOSA-N His-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 QZAFGJNKLMNDEM-DCAQKATOSA-N 0.000 description 1
- UZZXGLOJRZKYEL-DJFWLOJKSA-N His-Asn-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UZZXGLOJRZKYEL-DJFWLOJKSA-N 0.000 description 1
- VIVSWEBJUHXCDS-DCAQKATOSA-N His-Asn-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O VIVSWEBJUHXCDS-DCAQKATOSA-N 0.000 description 1
- VLPMGIJPAWENQB-SRVKXCTJSA-N His-Cys-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O VLPMGIJPAWENQB-SRVKXCTJSA-N 0.000 description 1
- FYVHHKMHFPMBBG-GUBZILKMSA-N His-Gln-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FYVHHKMHFPMBBG-GUBZILKMSA-N 0.000 description 1
- ZNNNYCXPCKACHX-DCAQKATOSA-N His-Gln-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNNNYCXPCKACHX-DCAQKATOSA-N 0.000 description 1
- NELVFWFDOKRTOR-SDDRHHMPSA-N His-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O NELVFWFDOKRTOR-SDDRHHMPSA-N 0.000 description 1
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 1
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 1
- PGTISAJTWZPFGN-PEXQALLHSA-N His-Gly-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O PGTISAJTWZPFGN-PEXQALLHSA-N 0.000 description 1
- CSTNMMIHMYJGFR-IHRRRGAJSA-N His-His-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 CSTNMMIHMYJGFR-IHRRRGAJSA-N 0.000 description 1
- CTJHHEQNUNIYNN-SRVKXCTJSA-N His-His-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O CTJHHEQNUNIYNN-SRVKXCTJSA-N 0.000 description 1
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 1
- MFQVZYSPCIZFMR-MGHWNKPDSA-N His-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N MFQVZYSPCIZFMR-MGHWNKPDSA-N 0.000 description 1
- ZRSJXIKQXUGKRB-TUBUOCAGSA-N His-Ile-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZRSJXIKQXUGKRB-TUBUOCAGSA-N 0.000 description 1
- SKYULSWNBYAQMG-IHRRRGAJSA-N His-Leu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SKYULSWNBYAQMG-IHRRRGAJSA-N 0.000 description 1
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 1
- MJUUWJJEUOBDGW-IHRRRGAJSA-N His-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MJUUWJJEUOBDGW-IHRRRGAJSA-N 0.000 description 1
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 1
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 1
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 1
- KYFGGRHWLFZXPU-KKUMJFAQSA-N His-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N KYFGGRHWLFZXPU-KKUMJFAQSA-N 0.000 description 1
- BSVLMPMIXPQNKC-KBPBESRZSA-N His-Phe-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O BSVLMPMIXPQNKC-KBPBESRZSA-N 0.000 description 1
- SOYCWSKCUVDLMC-AVGNSLFASA-N His-Pro-Arg Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CCCNC(=N)N)C(=O)O SOYCWSKCUVDLMC-AVGNSLFASA-N 0.000 description 1
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 1
- PGXZHYYGOPKYKM-IHRRRGAJSA-N His-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CCCCN)C(=O)O PGXZHYYGOPKYKM-IHRRRGAJSA-N 0.000 description 1
- YEKYGQZUBCRNGH-DCAQKATOSA-N His-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CO)C(=O)O YEKYGQZUBCRNGH-DCAQKATOSA-N 0.000 description 1
- FLXCRBXJRJSDHX-AVGNSLFASA-N His-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O FLXCRBXJRJSDHX-AVGNSLFASA-N 0.000 description 1
- CWSZWFILCNSNEX-CIUDSAMLSA-N His-Ser-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CWSZWFILCNSNEX-CIUDSAMLSA-N 0.000 description 1
- STGQSBKUYSPPIG-CIUDSAMLSA-N His-Ser-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 STGQSBKUYSPPIG-CIUDSAMLSA-N 0.000 description 1
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 1
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- UOYGZBIPZYKGSH-SRVKXCTJSA-N His-Ser-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N UOYGZBIPZYKGSH-SRVKXCTJSA-N 0.000 description 1
- XHQYFGPIRUHQIB-PBCZWWQYSA-N His-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CN=CN1 XHQYFGPIRUHQIB-PBCZWWQYSA-N 0.000 description 1
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 1
- NBWATNYAUVSAEQ-ZEILLAHLSA-N His-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O NBWATNYAUVSAEQ-ZEILLAHLSA-N 0.000 description 1
- VXZZUXWAOMWWJH-QTKMDUPCSA-N His-Thr-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VXZZUXWAOMWWJH-QTKMDUPCSA-N 0.000 description 1
- FBOMZVOKCZMDIG-XQQFMLRXSA-N His-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBOMZVOKCZMDIG-XQQFMLRXSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101000916173 Homo sapiens Catenin beta-1 Proteins 0.000 description 1
- 101000856200 Homo sapiens Citron Rho-interacting kinase Proteins 0.000 description 1
- 101000744505 Homo sapiens GTPase NRas Proteins 0.000 description 1
- 101000605639 Homo sapiens Phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit alpha isoform Proteins 0.000 description 1
- 101000984753 Homo sapiens Serine/threonine-protein kinase B-raf Proteins 0.000 description 1
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- WUEIUSDAECDLQO-NAKRPEOUSA-N Ile-Ala-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)O)N WUEIUSDAECDLQO-NAKRPEOUSA-N 0.000 description 1
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 1
- CCHSQWLCOOZREA-GMOBBJLQSA-N Ile-Asp-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N CCHSQWLCOOZREA-GMOBBJLQSA-N 0.000 description 1
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 1
- LOXMWQOKYBGCHF-JBDRJPRFSA-N Ile-Cys-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O LOXMWQOKYBGCHF-JBDRJPRFSA-N 0.000 description 1
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 1
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 1
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 1
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 1
- JRYQSFOFUFXPTB-RWRJDSDZSA-N Ile-Gln-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N JRYQSFOFUFXPTB-RWRJDSDZSA-N 0.000 description 1
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 1
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 1
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 1
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 1
- ZXIGYKICRDFISM-DJFWLOJKSA-N Ile-His-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZXIGYKICRDFISM-DJFWLOJKSA-N 0.000 description 1
- UASTVUQJMLZWGG-PEXQALLHSA-N Ile-His-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N UASTVUQJMLZWGG-PEXQALLHSA-N 0.000 description 1
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 1
- CCYGNFBYUNHFSC-MGHWNKPDSA-N Ile-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CCYGNFBYUNHFSC-MGHWNKPDSA-N 0.000 description 1
- URWXDJAEEGBADB-TUBUOCAGSA-N Ile-His-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N URWXDJAEEGBADB-TUBUOCAGSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 1
- PKGGWLOLRLOPGK-XUXIUFHCSA-N Ile-Leu-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PKGGWLOLRLOPGK-XUXIUFHCSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- RQQCJTLBSJMVCR-DSYPUSFNSA-N Ile-Leu-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N RQQCJTLBSJMVCR-DSYPUSFNSA-N 0.000 description 1
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 1
- FTUZWJVSNZMLPI-RVMXOQNASA-N Ile-Met-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N FTUZWJVSNZMLPI-RVMXOQNASA-N 0.000 description 1
- UYNXBNHVWFNVIN-HJWJTTGWSA-N Ile-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 UYNXBNHVWFNVIN-HJWJTTGWSA-N 0.000 description 1
- VOCZPDONPURUHV-QEWYBTABSA-N Ile-Phe-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VOCZPDONPURUHV-QEWYBTABSA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 1
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 1
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 1
- BJECXJHLUJXPJQ-PYJNHQTQSA-N Ile-Pro-His Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N BJECXJHLUJXPJQ-PYJNHQTQSA-N 0.000 description 1
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 1
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 1
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 1
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- RWHRUZORDWZESH-ZQINRCPSSA-N Ile-Trp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RWHRUZORDWZESH-ZQINRCPSSA-N 0.000 description 1
- WKSHBPRUIRGWRZ-KCTSRDHCSA-N Ile-Trp-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N WKSHBPRUIRGWRZ-KCTSRDHCSA-N 0.000 description 1
- DTPGSUQHUMELQB-GVARAGBVSA-N Ile-Tyr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 DTPGSUQHUMELQB-GVARAGBVSA-N 0.000 description 1
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 1
- PMAOIIWHZHAPBT-HJPIBITLSA-N Ile-Tyr-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N PMAOIIWHZHAPBT-HJPIBITLSA-N 0.000 description 1
- JSLIXOUMAOUGBN-JUKXBJQTSA-N Ile-Tyr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JSLIXOUMAOUGBN-JUKXBJQTSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- DZMWFIRHFFVBHS-ZEWNOJEFSA-N Ile-Tyr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N DZMWFIRHFFVBHS-ZEWNOJEFSA-N 0.000 description 1
- WRDTXMBPHMBGIB-STECZYCISA-N Ile-Tyr-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 WRDTXMBPHMBGIB-STECZYCISA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 1
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 108010002386 Interleukin-3 Proteins 0.000 description 1
- 208000008839 Kidney Neoplasms Diseases 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 239000005411 L01XE02 - Gefitinib Substances 0.000 description 1
- 239000002146 L01XE16 - Crizotinib Substances 0.000 description 1
- 206010023774 Large cell lung cancer Diseases 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- UILIPCLTHRPCRB-XUXIUFHCSA-N Leu-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(C)C)N UILIPCLTHRPCRB-XUXIUFHCSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 1
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- PPTAQBNUFKTJKA-BJDJZHNGSA-N Leu-Cys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PPTAQBNUFKTJKA-BJDJZHNGSA-N 0.000 description 1
- YORLGJINWYYIMX-KKUMJFAQSA-N Leu-Cys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YORLGJINWYYIMX-KKUMJFAQSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 1
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 1
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- WMTOVWLLDGQGCV-GUBZILKMSA-N Leu-Glu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WMTOVWLLDGQGCV-GUBZILKMSA-N 0.000 description 1
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 1
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 1
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- TVEOVCYCYGKVPP-HSCHXYMDSA-N Leu-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N TVEOVCYCYGKVPP-HSCHXYMDSA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 1
- PPQRKXHCLYCBSP-IHRRRGAJSA-N Leu-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N PPQRKXHCLYCBSP-IHRRRGAJSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 1
- VVQJGYPTIYOFBR-IHRRRGAJSA-N Leu-Lys-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N VVQJGYPTIYOFBR-IHRRRGAJSA-N 0.000 description 1
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 1
- FIICHHJDINDXKG-IHPCNDPISA-N Leu-Lys-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O FIICHHJDINDXKG-IHPCNDPISA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 1
- AUNMOHYWTAPQLA-XUXIUFHCSA-N Leu-Met-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AUNMOHYWTAPQLA-XUXIUFHCSA-N 0.000 description 1
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 1
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 1
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- YLMIDMSLKLRNHX-HSCHXYMDSA-N Leu-Trp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YLMIDMSLKLRNHX-HSCHXYMDSA-N 0.000 description 1
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- PXHCFKXNSBJSTQ-KKUMJFAQSA-N Lys-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)O PXHCFKXNSBJSTQ-KKUMJFAQSA-N 0.000 description 1
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 1
- ZAWOJFFMBANLGE-CIUDSAMLSA-N Lys-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N ZAWOJFFMBANLGE-CIUDSAMLSA-N 0.000 description 1
- SSYOBDBNBQBSQE-SRVKXCTJSA-N Lys-Cys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O SSYOBDBNBQBSQE-SRVKXCTJSA-N 0.000 description 1
- BYEBKXRNDLTGFW-CIUDSAMLSA-N Lys-Cys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O BYEBKXRNDLTGFW-CIUDSAMLSA-N 0.000 description 1
- HEWWNLVEWBJBKA-WDCWCFNPSA-N Lys-Gln-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN HEWWNLVEWBJBKA-WDCWCFNPSA-N 0.000 description 1
- IRRZDAIFYHNIIN-JYJNAYRXSA-N Lys-Gln-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IRRZDAIFYHNIIN-JYJNAYRXSA-N 0.000 description 1
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- KZJQUYFDSCFSCO-DLOVCJGASA-N Lys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N KZJQUYFDSCFSCO-DLOVCJGASA-N 0.000 description 1
- KKFVKBWCXXLKIK-AVGNSLFASA-N Lys-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCCN)N KKFVKBWCXXLKIK-AVGNSLFASA-N 0.000 description 1
- SPCHLZUWJTYZFC-IHRRRGAJSA-N Lys-His-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O SPCHLZUWJTYZFC-IHRRRGAJSA-N 0.000 description 1
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- JQSIGLHQNSZZRL-KKUMJFAQSA-N Lys-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N JQSIGLHQNSZZRL-KKUMJFAQSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- GOVDTWNJCBRRBJ-DCAQKATOSA-N Lys-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N GOVDTWNJCBRRBJ-DCAQKATOSA-N 0.000 description 1
- GZGWILAQHOVXTD-DCAQKATOSA-N Lys-Met-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O GZGWILAQHOVXTD-DCAQKATOSA-N 0.000 description 1
- KFSALEZVQJYHCE-AVGNSLFASA-N Lys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N KFSALEZVQJYHCE-AVGNSLFASA-N 0.000 description 1
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 1
- TWPCWKVOZDUYAA-KKUMJFAQSA-N Lys-Phe-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O TWPCWKVOZDUYAA-KKUMJFAQSA-N 0.000 description 1
- SVSQSPICRKBMSZ-SRVKXCTJSA-N Lys-Pro-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O SVSQSPICRKBMSZ-SRVKXCTJSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 1
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 1
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 1
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 1
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 1
- YUTZYVTZDVZBJJ-IHPCNDPISA-N Lys-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 YUTZYVTZDVZBJJ-IHPCNDPISA-N 0.000 description 1
- GVKINWYYLOLEFQ-XIRDDKMYSA-N Lys-Trp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O GVKINWYYLOLEFQ-XIRDDKMYSA-N 0.000 description 1
- HONVOXINDBETTI-KKUMJFAQSA-N Lys-Tyr-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CC=C(O)C=C1 HONVOXINDBETTI-KKUMJFAQSA-N 0.000 description 1
- IMDJSVBFQKDDEQ-MGHWNKPDSA-N Lys-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N IMDJSVBFQKDDEQ-MGHWNKPDSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 1
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- XBAJINCXDBTJRH-WDSOQIARSA-N Lys-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N XBAJINCXDBTJRH-WDSOQIARSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- 102100026553 Mannose-binding protein C Human genes 0.000 description 1
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 1
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 1
- DLAFCQWUMFMZSN-GUBZILKMSA-N Met-Arg-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N DLAFCQWUMFMZSN-GUBZILKMSA-N 0.000 description 1
- DCHHUGLTVLJYKA-FXQIFTODSA-N Met-Asn-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DCHHUGLTVLJYKA-FXQIFTODSA-N 0.000 description 1
- NSGXXVIHCIAISP-CIUDSAMLSA-N Met-Asn-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O NSGXXVIHCIAISP-CIUDSAMLSA-N 0.000 description 1
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 1
- JQECLVNLAZGHRQ-CIUDSAMLSA-N Met-Asp-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O JQECLVNLAZGHRQ-CIUDSAMLSA-N 0.000 description 1
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 1
- FJVJLMZUIGMFFU-BQBZGAKWSA-N Met-Asp-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FJVJLMZUIGMFFU-BQBZGAKWSA-N 0.000 description 1
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 1
- OFNCSQNBSWGGNV-DCAQKATOSA-N Met-Cys-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 OFNCSQNBSWGGNV-DCAQKATOSA-N 0.000 description 1
- OXHSZBRPUGNMKW-DCAQKATOSA-N Met-Gln-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OXHSZBRPUGNMKW-DCAQKATOSA-N 0.000 description 1
- CRGKLOXHKICQOL-GARJFASQSA-N Met-Gln-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N CRGKLOXHKICQOL-GARJFASQSA-N 0.000 description 1
- PHWSCIFNNLLUFJ-NHCYSSNCSA-N Met-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N PHWSCIFNNLLUFJ-NHCYSSNCSA-N 0.000 description 1
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 1
- DJDFBVNNDAUPRW-GUBZILKMSA-N Met-Glu-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DJDFBVNNDAUPRW-GUBZILKMSA-N 0.000 description 1
- RVYDCISQIGHAFC-ZPFDUUQYSA-N Met-Ile-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O RVYDCISQIGHAFC-ZPFDUUQYSA-N 0.000 description 1
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 1
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 1
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 1
- JCMMNFZUKMMECJ-DCAQKATOSA-N Met-Lys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JCMMNFZUKMMECJ-DCAQKATOSA-N 0.000 description 1
- MSSJHBAKDDIRMJ-SRVKXCTJSA-N Met-Lys-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MSSJHBAKDDIRMJ-SRVKXCTJSA-N 0.000 description 1
- HOZNVKDCKZPRER-XUXIUFHCSA-N Met-Lys-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HOZNVKDCKZPRER-XUXIUFHCSA-N 0.000 description 1
- CNAGWYQWQDMUGC-IHRRRGAJSA-N Met-Phe-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CNAGWYQWQDMUGC-IHRRRGAJSA-N 0.000 description 1
- NHXXGBXJTLRGJI-GUBZILKMSA-N Met-Pro-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O NHXXGBXJTLRGJI-GUBZILKMSA-N 0.000 description 1
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 1
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 1
- SPSSJSICDYYTQN-HJGDQZAQSA-N Met-Thr-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O SPSSJSICDYYTQN-HJGDQZAQSA-N 0.000 description 1
- YIGCDRZMZNDENK-UNQGMJICSA-N Met-Thr-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YIGCDRZMZNDENK-UNQGMJICSA-N 0.000 description 1
- ATBJCCFCJXCNGZ-UFYCRDLUSA-N Met-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 ATBJCCFCJXCNGZ-UFYCRDLUSA-N 0.000 description 1
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 206010061309 Neoplasm progression Diseases 0.000 description 1
- 206010030155 Oesophageal carcinoma Diseases 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 108700020796 Oncogene Proteins 0.000 description 1
- 102000043276 Oncogene Human genes 0.000 description 1
- 206010033128 Ovarian cancer Diseases 0.000 description 1
- 206010061535 Ovarian neoplasm Diseases 0.000 description 1
- 108091008606 PDGF receptors Proteins 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 206010061902 Pancreatic neoplasm Diseases 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 1
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 1
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 1
- NOFBJKKOPKJDCO-KKXDTOCCSA-N Phe-Ala-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NOFBJKKOPKJDCO-KKXDTOCCSA-N 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- YQNBKXUTWBRQCS-BVSLBCMMSA-N Phe-Arg-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 YQNBKXUTWBRQCS-BVSLBCMMSA-N 0.000 description 1
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 1
- OXUMFAOVGFODPN-KKUMJFAQSA-N Phe-Asn-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OXUMFAOVGFODPN-KKUMJFAQSA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- JOXIIFVCSATTDH-IHPCNDPISA-N Phe-Asn-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JOXIIFVCSATTDH-IHPCNDPISA-N 0.000 description 1
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 1
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 1
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 1
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 1
- PDUVELWDJZOUEI-IHRRRGAJSA-N Phe-Cys-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PDUVELWDJZOUEI-IHRRRGAJSA-N 0.000 description 1
- FSPGBMWPNMRWDB-AVGNSLFASA-N Phe-Cys-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N FSPGBMWPNMRWDB-AVGNSLFASA-N 0.000 description 1
- SXJGROGVINAYSH-AVGNSLFASA-N Phe-Gln-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SXJGROGVINAYSH-AVGNSLFASA-N 0.000 description 1
- RJYBHZVWJPUSLB-QEWYBTABSA-N Phe-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N RJYBHZVWJPUSLB-QEWYBTABSA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- UEADQPLTYBWWTG-AVGNSLFASA-N Phe-Glu-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEADQPLTYBWWTG-AVGNSLFASA-N 0.000 description 1
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 1
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 1
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 1
- ZZVUXQCQPXSUFH-JBACZVJFSA-N Phe-Glu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 ZZVUXQCQPXSUFH-JBACZVJFSA-N 0.000 description 1
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 1
- JJHVFCUWLSKADD-ONGXEEELSA-N Phe-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O JJHVFCUWLSKADD-ONGXEEELSA-N 0.000 description 1
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 1
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 1
- SWCOXQLDICUYOL-ULQDDVLXSA-N Phe-His-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SWCOXQLDICUYOL-ULQDDVLXSA-N 0.000 description 1
- ISYSEOWLRQKQEQ-JYJNAYRXSA-N Phe-His-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISYSEOWLRQKQEQ-JYJNAYRXSA-N 0.000 description 1
- HTXVATDVCRFORF-MGHWNKPDSA-N Phe-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N HTXVATDVCRFORF-MGHWNKPDSA-N 0.000 description 1
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 1
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 1
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 1
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 description 1
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 1
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 description 1
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 1
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 description 1
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 1
- GZGPMBKUJDRICD-ULQDDVLXSA-N Phe-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O GZGPMBKUJDRICD-ULQDDVLXSA-N 0.000 description 1
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 1
- BSTPNLNKHKBONJ-HTUGSXCWSA-N Phe-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O BSTPNLNKHKBONJ-HTUGSXCWSA-N 0.000 description 1
- BTAIJUBAGLVFKQ-BVSLBCMMSA-N Phe-Trp-Val Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C(C)C)C(O)=O)C1=CC=CC=C1 BTAIJUBAGLVFKQ-BVSLBCMMSA-N 0.000 description 1
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 1
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 1
- AGTHXWTYCLLYMC-FHWLQOOXSA-N Phe-Tyr-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 AGTHXWTYCLLYMC-FHWLQOOXSA-N 0.000 description 1
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 1
- KIQUCMUULDXTAZ-HJOGWXRNSA-N Phe-Tyr-Tyr Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O KIQUCMUULDXTAZ-HJOGWXRNSA-N 0.000 description 1
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 1
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- XBCOOBCTVMMQSC-BVSLBCMMSA-N Phe-Val-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 XBCOOBCTVMMQSC-BVSLBCMMSA-N 0.000 description 1
- 102100038332 Phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit alpha isoform Human genes 0.000 description 1
- 108010001441 Phosphopeptides Proteins 0.000 description 1
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 1
- 102000011653 Platelet-Derived Growth Factor Receptors Human genes 0.000 description 1
- 239000004793 Polystyrene Substances 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 1
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 1
- DRVIASBABBMZTF-GUBZILKMSA-N Pro-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1 DRVIASBABBMZTF-GUBZILKMSA-N 0.000 description 1
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- GDXZRWYXJSGWIV-GMOBBJLQSA-N Pro-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 GDXZRWYXJSGWIV-GMOBBJLQSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- DIZLUAZLNDFDPR-CIUDSAMLSA-N Pro-Cys-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 DIZLUAZLNDFDPR-CIUDSAMLSA-N 0.000 description 1
- NOXSEHJOXCWRHK-DCAQKATOSA-N Pro-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 NOXSEHJOXCWRHK-DCAQKATOSA-N 0.000 description 1
- HQVPQXMCQKXARZ-FXQIFTODSA-N Pro-Cys-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O HQVPQXMCQKXARZ-FXQIFTODSA-N 0.000 description 1
- UPJGUQPLYWTISV-GUBZILKMSA-N Pro-Gln-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UPJGUQPLYWTISV-GUBZILKMSA-N 0.000 description 1
- LQZZPNDMYNZPFT-KKUMJFAQSA-N Pro-Gln-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LQZZPNDMYNZPFT-KKUMJFAQSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- PEYNRYREGPAOAK-LSJOCFKGSA-N Pro-His-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 PEYNRYREGPAOAK-LSJOCFKGSA-N 0.000 description 1
- BFXZQMWKTYWGCF-PYJNHQTQSA-N Pro-His-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BFXZQMWKTYWGCF-PYJNHQTQSA-N 0.000 description 1
- YTWNSIDWAFSEEI-RWMBFGLXSA-N Pro-His-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N3CCC[C@@H]3C(=O)O YTWNSIDWAFSEEI-RWMBFGLXSA-N 0.000 description 1
- BODDREDDDRZUCF-QTKMDUPCSA-N Pro-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2)O BODDREDDDRZUCF-QTKMDUPCSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 1
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- YAZNFQUKPUASKB-DCAQKATOSA-N Pro-Lys-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O YAZNFQUKPUASKB-DCAQKATOSA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 1
- DSGSTPRKNYHGCL-JYJNAYRXSA-N Pro-Phe-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DSGSTPRKNYHGCL-JYJNAYRXSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 1
- RNEFESSBTOQSAC-DCAQKATOSA-N Pro-Ser-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O RNEFESSBTOQSAC-DCAQKATOSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 1
- FZXSYIPVAFVYBH-KKUMJFAQSA-N Pro-Tyr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O FZXSYIPVAFVYBH-KKUMJFAQSA-N 0.000 description 1
- BXHRXLMCYSZSIY-STECZYCISA-N Pro-Tyr-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O BXHRXLMCYSZSIY-STECZYCISA-N 0.000 description 1
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 1
- IALSFJSONJZBKB-HRCADAONSA-N Pro-Tyr-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N3CCC[C@@H]3C(=O)O IALSFJSONJZBKB-HRCADAONSA-N 0.000 description 1
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 1
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 1
- 206010060862 Prostate cancer Diseases 0.000 description 1
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 238000011530 RNeasy Mini Kit Methods 0.000 description 1
- 239000012980 RPMI-1640 medium Substances 0.000 description 1
- 206010038389 Renal cancer Diseases 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- QWZIOCFPXMAXET-CIUDSAMLSA-N Ser-Arg-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QWZIOCFPXMAXET-CIUDSAMLSA-N 0.000 description 1
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 1
- VBKBDLMWICBSCY-IMJSIDKUSA-N Ser-Asp Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O VBKBDLMWICBSCY-IMJSIDKUSA-N 0.000 description 1
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 1
- DBIDZNUXSLXVRG-FXQIFTODSA-N Ser-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N DBIDZNUXSLXVRG-FXQIFTODSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- RNFKSBPHLTZHLU-WHFBIAKZSA-N Ser-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)O RNFKSBPHLTZHLU-WHFBIAKZSA-N 0.000 description 1
- INCNPLPRPOYTJI-JBDRJPRFSA-N Ser-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N INCNPLPRPOYTJI-JBDRJPRFSA-N 0.000 description 1
- MPPHJZYXDVDGOF-BWBBJGPYSA-N Ser-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CO MPPHJZYXDVDGOF-BWBBJGPYSA-N 0.000 description 1
- SWIQQMYVHIXPEK-FXQIFTODSA-N Ser-Cys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O SWIQQMYVHIXPEK-FXQIFTODSA-N 0.000 description 1
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 1
- MAWSJXHRLWVJEZ-ACZMJKKPSA-N Ser-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N MAWSJXHRLWVJEZ-ACZMJKKPSA-N 0.000 description 1
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- VMVNCJDKFOQOHM-GUBZILKMSA-N Ser-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N VMVNCJDKFOQOHM-GUBZILKMSA-N 0.000 description 1
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 1
- LAFKUZYWNCHOHT-WHFBIAKZSA-N Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O LAFKUZYWNCHOHT-WHFBIAKZSA-N 0.000 description 1
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- ZFVFHHZBCVNLGD-GUBZILKMSA-N Ser-His-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFVFHHZBCVNLGD-GUBZILKMSA-N 0.000 description 1
- UGHCUDLCCVVIJR-VGDYDELISA-N Ser-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N UGHCUDLCCVVIJR-VGDYDELISA-N 0.000 description 1
- CLKKNZQUQMZDGD-SRVKXCTJSA-N Ser-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CN=CN1 CLKKNZQUQMZDGD-SRVKXCTJSA-N 0.000 description 1
- MLSQXWSRHURDMF-GARJFASQSA-N Ser-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N)C(=O)O MLSQXWSRHURDMF-GARJFASQSA-N 0.000 description 1
- JEHPKECJCALLRW-CUJWVEQBSA-N Ser-His-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEHPKECJCALLRW-CUJWVEQBSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 1
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 1
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 1
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 1
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 description 1
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 1
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 1
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- BVLGVLWFIZFEAH-BPUTZDHNSA-N Ser-Pro-Trp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BVLGVLWFIZFEAH-BPUTZDHNSA-N 0.000 description 1
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- SOACHCFYJMCMHC-BWBBJGPYSA-N Ser-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O SOACHCFYJMCMHC-BWBBJGPYSA-N 0.000 description 1
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- NERYDXBVARJIQS-JYBASQMISA-N Ser-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N)O NERYDXBVARJIQS-JYBASQMISA-N 0.000 description 1
- VEVYMLNYMULSMS-AVGNSLFASA-N Ser-Tyr-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEVYMLNYMULSMS-AVGNSLFASA-N 0.000 description 1
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- OSFZCEQJLWCIBG-BZSNNMDCSA-N Ser-Tyr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSFZCEQJLWCIBG-BZSNNMDCSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 1
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 1
- 102100027103 Serine/threonine-protein kinase B-raf Human genes 0.000 description 1
- 208000005718 Stomach Neoplasms Diseases 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- 102000001639 Thioredoxin Reductase 1 Human genes 0.000 description 1
- 108010093836 Thioredoxin Reductase 1 Proteins 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 1
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 1
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 1
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 1
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 1
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 1
- DCCGCVLVVSAJFK-NUMRIWBASA-N Thr-Asp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O DCCGCVLVVSAJFK-NUMRIWBASA-N 0.000 description 1
- NOWXWJLVGTVJKM-PBCZWWQYSA-N Thr-Asp-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O NOWXWJLVGTVJKM-PBCZWWQYSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 1
- ASJDFGOPDCVXTG-KATARQTJSA-N Thr-Cys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ASJDFGOPDCVXTG-KATARQTJSA-N 0.000 description 1
- UZJDBCHMIQXLOQ-HEIBUPTGSA-N Thr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O UZJDBCHMIQXLOQ-HEIBUPTGSA-N 0.000 description 1
- RJBFAHKSFNNHAI-XKBZYTNZSA-N Thr-Gln-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O RJBFAHKSFNNHAI-XKBZYTNZSA-N 0.000 description 1
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 1
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 1
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- VOHWDZNIESHTFW-XKBZYTNZSA-N Thr-Glu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O VOHWDZNIESHTFW-XKBZYTNZSA-N 0.000 description 1
- WDFPMSHYMRBLKM-NKIYYHGXSA-N Thr-Glu-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O WDFPMSHYMRBLKM-NKIYYHGXSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 1
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 1
- KRGDDWVBBDLPSJ-CUJWVEQBSA-N Thr-His-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O KRGDDWVBBDLPSJ-CUJWVEQBSA-N 0.000 description 1
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 1
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 1
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- MCDVZTRGHNXTGK-HJGDQZAQSA-N Thr-Met-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O MCDVZTRGHNXTGK-HJGDQZAQSA-N 0.000 description 1
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 1
- KZURUCDWKDEAFZ-XVSYOHENSA-N Thr-Phe-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O KZURUCDWKDEAFZ-XVSYOHENSA-N 0.000 description 1
- GYUUYCIXELGTJS-MEYUZBJRSA-N Thr-Phe-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O GYUUYCIXELGTJS-MEYUZBJRSA-N 0.000 description 1
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 1
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 1
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 1
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 1
- YGZWVPBHYABGLT-KJEVXHAQSA-N Thr-Pro-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YGZWVPBHYABGLT-KJEVXHAQSA-N 0.000 description 1
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 1
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- NDLHSJWPCXKOGG-VLCNGCBASA-N Thr-Trp-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N)O NDLHSJWPCXKOGG-VLCNGCBASA-N 0.000 description 1
- NJGMALCNYAMYCB-JRQIVUDYSA-N Thr-Tyr-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJGMALCNYAMYCB-JRQIVUDYSA-N 0.000 description 1
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 1
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 1
- RPECVQBNONKZAT-WZLNRYEVSA-N Thr-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H]([C@@H](C)O)N RPECVQBNONKZAT-WZLNRYEVSA-N 0.000 description 1
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 1
- SJPDTIQHLBQPFO-VLCNGCBASA-N Thr-Tyr-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SJPDTIQHLBQPFO-VLCNGCBASA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 1
- CURFABYITJVKEW-QTKMDUPCSA-N Thr-Val-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O CURFABYITJVKEW-QTKMDUPCSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- 208000024770 Thyroid neoplasm Diseases 0.000 description 1
- WFZYXGSAPWKTHR-XEGUGMAKSA-N Trp-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WFZYXGSAPWKTHR-XEGUGMAKSA-N 0.000 description 1
- CXUFDWZBHKUGKK-CABZTGNLSA-N Trp-Ala-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O)=CNC2=C1 CXUFDWZBHKUGKK-CABZTGNLSA-N 0.000 description 1
- HYVLNORXQGKONN-NUTKFTJISA-N Trp-Ala-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 HYVLNORXQGKONN-NUTKFTJISA-N 0.000 description 1
- FOAJSVIXYCLTSC-PJODQICGSA-N Trp-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N FOAJSVIXYCLTSC-PJODQICGSA-N 0.000 description 1
- HYNAKPYFEYJMAS-XIRDDKMYSA-N Trp-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HYNAKPYFEYJMAS-XIRDDKMYSA-N 0.000 description 1
- TWJDQTTXXZDJKV-BPUTZDHNSA-N Trp-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O TWJDQTTXXZDJKV-BPUTZDHNSA-N 0.000 description 1
- YEGMNOHLZNGOCG-UBHSHLNASA-N Trp-Asn-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YEGMNOHLZNGOCG-UBHSHLNASA-N 0.000 description 1
- ADBFWLXCCKIXBQ-XIRDDKMYSA-N Trp-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ADBFWLXCCKIXBQ-XIRDDKMYSA-N 0.000 description 1
- GKUROEIXVURAAO-BPUTZDHNSA-N Trp-Asp-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GKUROEIXVURAAO-BPUTZDHNSA-N 0.000 description 1
- WACMTVIJWRNVSO-CWRNSKLLSA-N Trp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O WACMTVIJWRNVSO-CWRNSKLLSA-N 0.000 description 1
- SSNGFWKILJLTQM-QEJZJMRPSA-N Trp-Gln-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SSNGFWKILJLTQM-QEJZJMRPSA-N 0.000 description 1
- NXJZCPKZIKTYLX-XEGUGMAKSA-N Trp-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NXJZCPKZIKTYLX-XEGUGMAKSA-N 0.000 description 1
- WCTYCXZYBNKEIV-SXNHZJKMSA-N Trp-Glu-Ile Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)=CNC2=C1 WCTYCXZYBNKEIV-SXNHZJKMSA-N 0.000 description 1
- WSGPBCAGEGHKQJ-BBRMVZONSA-N Trp-Gly-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WSGPBCAGEGHKQJ-BBRMVZONSA-N 0.000 description 1
- YYXIWHBHTARPOG-HJXMPXNTSA-N Trp-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N YYXIWHBHTARPOG-HJXMPXNTSA-N 0.000 description 1
- AZBIIKDSDLVJAK-VHWLVUOQSA-N Trp-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N AZBIIKDSDLVJAK-VHWLVUOQSA-N 0.000 description 1
- SAKLWFSRZTZQAJ-GQGQLFGLSA-N Trp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SAKLWFSRZTZQAJ-GQGQLFGLSA-N 0.000 description 1
- VDUJEEQMRQCLHB-YTQUADARSA-N Trp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O VDUJEEQMRQCLHB-YTQUADARSA-N 0.000 description 1
- ULHASJWZGUEUNN-XIRDDKMYSA-N Trp-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O ULHASJWZGUEUNN-XIRDDKMYSA-N 0.000 description 1
- FBGDDUKYOBNZJL-WDSOQIARSA-N Trp-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N FBGDDUKYOBNZJL-WDSOQIARSA-N 0.000 description 1
- GIAMKIPJSRZVJB-IHPCNDPISA-N Trp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GIAMKIPJSRZVJB-IHPCNDPISA-N 0.000 description 1
- WMIUTJPFHMMUGY-ZFWWWQNUSA-N Trp-Pro-Gly Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)NCC(=O)O WMIUTJPFHMMUGY-ZFWWWQNUSA-N 0.000 description 1
- ADMHZNPMMVKGJW-BPUTZDHNSA-N Trp-Ser-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ADMHZNPMMVKGJW-BPUTZDHNSA-N 0.000 description 1
- UIRPULWLRODAEQ-QEJZJMRPSA-N Trp-Ser-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 UIRPULWLRODAEQ-QEJZJMRPSA-N 0.000 description 1
- GBEAUNVBIMLWIB-IHPCNDPISA-N Trp-Ser-Phe Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 GBEAUNVBIMLWIB-IHPCNDPISA-N 0.000 description 1
- RWTFCAMQLFNPTK-UMPQAUOISA-N Trp-Val-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)=CNC2=C1 RWTFCAMQLFNPTK-UMPQAUOISA-N 0.000 description 1
- QJBWZNTWJSZUOY-UWJYBYFXSA-N Tyr-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QJBWZNTWJSZUOY-UWJYBYFXSA-N 0.000 description 1
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 1
- IELISNUVHBKYBX-XDTLVQLUSA-N Tyr-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IELISNUVHBKYBX-XDTLVQLUSA-N 0.000 description 1
- NSOMQRHZMJMZIE-GVARAGBVSA-N Tyr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NSOMQRHZMJMZIE-GVARAGBVSA-N 0.000 description 1
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 1
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 1
- KDGFPPHLXCEQRN-STECZYCISA-N Tyr-Arg-Ile Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDGFPPHLXCEQRN-STECZYCISA-N 0.000 description 1
- GFZQWWDXJVGEMW-ULQDDVLXSA-N Tyr-Arg-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GFZQWWDXJVGEMW-ULQDDVLXSA-N 0.000 description 1
- SGFIXFAHVWJKTD-KJEVXHAQSA-N Tyr-Arg-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SGFIXFAHVWJKTD-KJEVXHAQSA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- JRXKIVGWMMIIOF-YDHLFZDLSA-N Tyr-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JRXKIVGWMMIIOF-YDHLFZDLSA-N 0.000 description 1
- DANHCMVVXDXOHN-SRVKXCTJSA-N Tyr-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DANHCMVVXDXOHN-SRVKXCTJSA-N 0.000 description 1
- GHUNBABNQPIETG-MELADBBJSA-N Tyr-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O GHUNBABNQPIETG-MELADBBJSA-N 0.000 description 1
- QHEGAOPHISYNDF-XDTLVQLUSA-N Tyr-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHEGAOPHISYNDF-XDTLVQLUSA-N 0.000 description 1
- ARPONUQDNWLXOZ-KKUMJFAQSA-N Tyr-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ARPONUQDNWLXOZ-KKUMJFAQSA-N 0.000 description 1
- NGALWFGCOMHUSN-AVGNSLFASA-N Tyr-Gln-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NGALWFGCOMHUSN-AVGNSLFASA-N 0.000 description 1
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 1
- DXUVJJRTVACXSO-KKUMJFAQSA-N Tyr-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DXUVJJRTVACXSO-KKUMJFAQSA-N 0.000 description 1
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 1
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 1
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 description 1
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 1
- RIFVTNDKUMSSMN-ULQDDVLXSA-N Tyr-His-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(O)=O RIFVTNDKUMSSMN-ULQDDVLXSA-N 0.000 description 1
- HVPPEXXUDXAPOM-MGHWNKPDSA-N Tyr-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HVPPEXXUDXAPOM-MGHWNKPDSA-N 0.000 description 1
- AZZLDIDWPZLCCW-ZEWNOJEFSA-N Tyr-Ile-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O AZZLDIDWPZLCCW-ZEWNOJEFSA-N 0.000 description 1
- FJBCEFPCVPHPPM-STECZYCISA-N Tyr-Ile-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O FJBCEFPCVPHPPM-STECZYCISA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 1
- SBLZVFCEOCWRLS-BPNCWPANSA-N Tyr-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SBLZVFCEOCWRLS-BPNCWPANSA-N 0.000 description 1
- KHUVIWRRFMPVHD-JYJNAYRXSA-N Tyr-Met-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O KHUVIWRRFMPVHD-JYJNAYRXSA-N 0.000 description 1
- WTTRJMAZPDHPGS-KKXDTOCCSA-N Tyr-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(O)=O WTTRJMAZPDHPGS-KKXDTOCCSA-N 0.000 description 1
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 1
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 1
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 1
- BCOBSVIZMQXKFY-KKUMJFAQSA-N Tyr-Ser-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O BCOBSVIZMQXKFY-KKUMJFAQSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 1
- AOIZTZRWMSPPAY-KAOXEZKKSA-N Tyr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O AOIZTZRWMSPPAY-KAOXEZKKSA-N 0.000 description 1
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 1
- KUXCBJFJURINGF-PXDAIIFMSA-N Tyr-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N KUXCBJFJURINGF-PXDAIIFMSA-N 0.000 description 1
- GPLTZEMVOCZVAV-UFYCRDLUSA-N Tyr-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 GPLTZEMVOCZVAV-UFYCRDLUSA-N 0.000 description 1
- KRXFXDCNKLANCP-CXTHYWKRSA-N Tyr-Tyr-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 KRXFXDCNKLANCP-CXTHYWKRSA-N 0.000 description 1
- AGDDLOQMXUQPDY-BZSNNMDCSA-N Tyr-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O AGDDLOQMXUQPDY-BZSNNMDCSA-N 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 1
- ZQGPWORGSNRQLN-NHCYSSNCSA-N Val-Asp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZQGPWORGSNRQLN-NHCYSSNCSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- PFMAFMPJJSHNDW-ZKWXMUAHSA-N Val-Cys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N PFMAFMPJJSHNDW-ZKWXMUAHSA-N 0.000 description 1
- BWVHQINTNLVWGZ-ZKWXMUAHSA-N Val-Cys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BWVHQINTNLVWGZ-ZKWXMUAHSA-N 0.000 description 1
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 1
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 1
- OXGVAUFVTOPFFA-XPUUQOCRSA-N Val-Gly-Cys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OXGVAUFVTOPFFA-XPUUQOCRSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- OPGWZDIYEYJVRX-AVGNSLFASA-N Val-His-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OPGWZDIYEYJVRX-AVGNSLFASA-N 0.000 description 1
- KVRLNEILGGVBJX-IHRRRGAJSA-N Val-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CN=CN1 KVRLNEILGGVBJX-IHRRRGAJSA-N 0.000 description 1
- ZTKGDWOUYRRAOQ-ULQDDVLXSA-N Val-His-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N ZTKGDWOUYRRAOQ-ULQDDVLXSA-N 0.000 description 1
- HQYVQDRYODWONX-DCAQKATOSA-N Val-His-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N HQYVQDRYODWONX-DCAQKATOSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 1
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- OJOMXGVLFKYDKP-QXEWZRGKSA-N Val-Met-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OJOMXGVLFKYDKP-QXEWZRGKSA-N 0.000 description 1
- IOETTZIEIBVWBZ-GUBZILKMSA-N Val-Met-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N IOETTZIEIBVWBZ-GUBZILKMSA-N 0.000 description 1
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 1
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 1
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- QWCZXKIFPWPQHR-JYJNAYRXSA-N Val-Pro-Tyr Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QWCZXKIFPWPQHR-JYJNAYRXSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- SUGRIIAOLCDLBD-ZOBUZTSGSA-N Val-Trp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SUGRIIAOLCDLBD-ZOBUZTSGSA-N 0.000 description 1
- SVLAAUGFIHSJPK-JYJNAYRXSA-N Val-Trp-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N SVLAAUGFIHSJPK-JYJNAYRXSA-N 0.000 description 1
- OEVFFOBAXHBXKM-HSHDSVGOSA-N Val-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N)O OEVFFOBAXHBXKM-HSHDSVGOSA-N 0.000 description 1
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- PGBMPFKFKXYROZ-UFYCRDLUSA-N Val-Tyr-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N PGBMPFKFKXYROZ-UFYCRDLUSA-N 0.000 description 1
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- WHNSHJJNWNSTSU-BZSNNMDCSA-N Val-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 WHNSHJJNWNSTSU-BZSNNMDCSA-N 0.000 description 1
- 230000001594 aberrant effect Effects 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 239000012491 analyte Substances 0.000 description 1
- 230000003042 antagonistic effect Effects 0.000 description 1
- 229940041181 antineoplastic drug Drugs 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 238000002820 assay format Methods 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 239000000090 biomarker Substances 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 239000010839 body fluid Substances 0.000 description 1
- 210000001185 bone marrow Anatomy 0.000 description 1
- 239000012830 cancer therapeutic Substances 0.000 description 1
- 230000005773 cancer-related death Effects 0.000 description 1
- 231100000504 carcinogenesis Toxicity 0.000 description 1
- 238000000006 cell growth inhibition assay Methods 0.000 description 1
- 238000001516 cell proliferation assay Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000002512 chemotherapy Methods 0.000 description 1
- 239000005515 coenzyme Substances 0.000 description 1
- 208000029742 colonic neoplasm Diseases 0.000 description 1
- 238000002591 computed tomography Methods 0.000 description 1
- 239000013068 control sample Substances 0.000 description 1
- 229960005061 crizotinib Drugs 0.000 description 1
- KTEIFNKAUNYNJU-GFCCVEGCSA-N crizotinib Chemical compound O([C@H](C)C=1C(=C(F)C=CC=1Cl)Cl)C(C(=NC=1)N)=CC=1C(=C1)C=NN1C1CCNCC1 KTEIFNKAUNYNJU-GFCCVEGCSA-N 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 108010069495 cysteinyltyrosine Proteins 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 238000007877 drug screening Methods 0.000 description 1
- 229940125436 dual inhibitor Drugs 0.000 description 1
- 229940116977 epidermal growth factor Drugs 0.000 description 1
- 201000004101 esophageal cancer Diseases 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 230000037433 frameshift Effects 0.000 description 1
- 206010017758 gastric cancer Diseases 0.000 description 1
- 229960002584 gefitinib Drugs 0.000 description 1
- XGALLCVXEZPNRQ-UHFFFAOYSA-N gefitinib Chemical compound C=12C=C(OCCCN3CCOCC3)C(OC)=CC2=NC=NC=1NC1=CC=C(F)C(Cl)=C1 XGALLCVXEZPNRQ-UHFFFAOYSA-N 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 230000004077 genetic alteration Effects 0.000 description 1
- 231100000118 genetic alteration Toxicity 0.000 description 1
- 102000054766 genetic haplotypes Human genes 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 230000007614 genetic variation Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 238000011331 genomic analysis Methods 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 239000012456 homogeneous solution Substances 0.000 description 1
- BHEPBYXIRTUNPN-UHFFFAOYSA-N hydridophosphorus(.) (triplet) Chemical compound [PH] BHEPBYXIRTUNPN-UHFFFAOYSA-N 0.000 description 1
- 230000000984 immunochemical effect Effects 0.000 description 1
- 239000003547 immunosorbent Substances 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 201000010982 kidney cancer Diseases 0.000 description 1
- 229920000126 latex Polymers 0.000 description 1
- 239000004816 latex Substances 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 201000007270 liver cancer Diseases 0.000 description 1
- 208000014018 liver neoplasm Diseases 0.000 description 1
- 210000005265 lung cell Anatomy 0.000 description 1
- 201000009546 lung large cell carcinoma Diseases 0.000 description 1
- 201000005243 lung squamous cell carcinoma Diseases 0.000 description 1
- 108010043322 lysyl-tryptophyl-alpha-lysine Proteins 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010010679 lysyl-valyl-leucyl-aspartic acid Proteins 0.000 description 1
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 239000002547 new drug Substances 0.000 description 1
- 238000007481 next generation sequencing Methods 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 201000002528 pancreatic cancer Diseases 0.000 description 1
- 208000008443 pancreatic carcinoma Diseases 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- 239000011574 phosphorus Substances 0.000 description 1
- 229910052697 platinum Inorganic materials 0.000 description 1
- 229920002223 polystyrene Polymers 0.000 description 1
- 238000010837 poor prognosis Methods 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 229950010131 puromycin Drugs 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 102000027426 receptor tyrosine kinases Human genes 0.000 description 1
- 108091008598 receptor tyrosine kinases Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- 239000000523 sample Substances 0.000 description 1
- 108091006024 signal transducing proteins Proteins 0.000 description 1
- 102000034285 signal transducing proteins Human genes 0.000 description 1
- 239000002924 silencing RNA Substances 0.000 description 1
- 230000000391 smoking effect Effects 0.000 description 1
- 230000000392 somatic effect Effects 0.000 description 1
- 230000037439 somatic mutation Effects 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 201000011549 stomach cancer Diseases 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 201000002510 thyroid cancer Diseases 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- 238000011269 treatment regimen Methods 0.000 description 1
- 230000005748 tumor development Effects 0.000 description 1
- 230000005751 tumor progression Effects 0.000 description 1
- 229940121358 tyrosine kinase inhibitor Drugs 0.000 description 1
- 239000005483 tyrosine kinase inhibitor Substances 0.000 description 1
- 150000004917 tyrosine kinase inhibitor derivatives Chemical class 0.000 description 1
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- VBEQCZHXXJYVRD-GACYYNSASA-N uroanthelone Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C(C)C)[C@@H](C)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)CNC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CS)NC(=O)CNC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O)C(C)C)[C@@H](C)CC)C1=CC=C(O)C=C1 VBEQCZHXXJYVRD-GACYYNSASA-N 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 230000035899 viability Effects 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K19/00—Hybrid peptides, i.e. peptides covalently bound to nucleic acids, or non-covalently bound protein-protein complexes
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
- A61K38/16—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- A61K38/17—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/15—Medicinal preparations ; Physical properties thereof, e.g. dissolubility
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/574—Immunoassay; Biospecific binding assay; Materials therefor for cancer
- G01N33/57407—Specifically defined cancers
- G01N33/57423—Specifically defined cancers of lung
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/178—Oligonucleotides characterized by their use miRNA, siRNA or ncRNA
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Immunology (AREA)
- Molecular Biology (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Biotechnology (AREA)
- Medicinal Chemistry (AREA)
- Pathology (AREA)
- Analytical Chemistry (AREA)
- Microbiology (AREA)
- General Engineering & Computer Science (AREA)
- Pharmacology & Pharmacy (AREA)
- Urology & Nephrology (AREA)
- General Physics & Mathematics (AREA)
- Hematology (AREA)
- Food Science & Technology (AREA)
- Oncology (AREA)
- Hospice & Palliative Care (AREA)
- Animal Behavior & Ethology (AREA)
- Gastroenterology & Hepatology (AREA)
- Plant Pathology (AREA)
- Epidemiology (AREA)
- Cell Biology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Peptides Or Proteins (AREA)
Abstract
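The present disclosure relates to the use of a fusion protein comprising ZFYVE9 (zinc finger, FYVE domain containing 9) as a cancer diagnostic marker and/or therapeutic target. Provided are a fusion protein comprising ZFYVE9, a composition for diagnosing cancer comprising a molecule that specifically binds to the fusion protein, a method of providing information for cancer diagnosis using the same, a composition for preventing and/or treating cancer comprising an inhibitor of the fusion protein, and a method of screening for cancer therapeutic agents using the fusion protein.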
Description
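The present invention relates to the use of a fusion protein comprising the ZFYVE9 protein or a fragment thereof as a cancer diagnostic marker and/or therapeutic target, and more particularly to a fusion protein of ZFYVE9 (zinc finger, FYVE domain containing 9) and CGA (glycoprotein hormones, alpha polypeptide), a composition for diagnosing cancer comprising a molecule that specifically binds to the fusion protein, a method of providing information for cancer diagnosis using the same, a composition for preventing and/or treating cancer comprising an inhibitor of the fusion protein, and a method of screening for anticancer agents using the fusion protein.
Lung cancer is one of the most common cancers in humans and is the leading cause of cancer-related death worldwide. Although the introduction of low-dose computed tomography screening has raised the rate of early diagnosis, lung cancer remains a deadly disease with a very poor prognosis. From a histopathological standpoint, lung cancer is classified into several types, of which lung adenocarcinoma is the most common; it occurs frequently even in never-smokers or light smokers and in female patients. Over the past decade, lung adenocarcinoma has been the focus of lung cancer research, and the understanding of lung adenocarcinoma on which the present invention draws has advanced in every respect, including pathology, molecular biology, genetics, radiology, and clinical treatment.
In particular, progress in understanding the major genetic alterations involved and the associated signaling pathways suggests a reclassification of lung adenocarcinoma based on the underlying genetic changes. Cancer cells carrying these genetic alterations, called driver mutations, have a survival and growth advantage over cells that lack them. Driver mutations occur in genes encoding signaling proteins such as kinases and can generate constitutively active survival signals that initiate and maintain tumorigenesis. At present, about 10 driver mutations are known in lung adenocarcinoma, and several drugs targeting these alterations have been reported to show notable results in recent trials. For example, gefitinib, an epidermal growth factor receptor (EGFR) tyrosine kinase inhibitor, achieved a response rate of about 70% and allowed patients with EGFR-activating mutations to survive for 10 months without tumor progression, an outcome superior to that of platinum-based doublet chemotherapy in recent clinical trials. Crizotinib, a dual inhibitor of anaplastic lymphoma kinase (ALK) and MET tyrosine kinase, has been shown to be effective in lung cancer patients harboring ALK fusion genes. Following these pivotal studies, treatment strategies for lung adenocarcinoma are shifting from histology-based approaches to driver-mutation-directed therapies. Despite much recent research, the molecular drivers of about 40% of lung adenocarcinomas remain unidentified. Interestingly, the frequencies of some driver mutations differ considerably between ethnic groups, and extensive genetic studies are therefore needed to develop new druggable targets and targeted therapies for lung cancer.
Accordingly, the present inventors completed the present invention by confirming that, in human solid tumors and in lung cancer in particular, the expression of a fusion protein in which all or part of ZFYVE9 (zinc finger, FYVE domain containing 9) is fused to all or part of a given fusion partner, for example CGA (glycoprotein hormones, alpha polypeptide), and/or the presence of a gene encoding it, is specifically observed.
Thus, one embodiment of the present invention provides, as a cancer diagnostic marker, a fusion protein of a ZFYVE9 (zinc finger, FYVE domain containing 9) protein or a fragment thereof and a CGA (glycoprotein hormones, alpha polypeptide) protein or a fragment thereof.
Another embodiment provides a polynucleotide molecule encoding the fusion protein.
Another embodiment provides a composition for diagnosing cancer comprising a molecule that specifically binds to the fusion protein and/or a polynucleotide capable of hybridizing with the polynucleotide molecule.
Another embodiment provides a method of providing information for cancer diagnosis, comprising measuring the expression of the fusion protein in a biological sample obtained from a patient.
Another embodiment provides a composition for preventing and/or treating cancer comprising, as an active ingredient, an inhibitor of the fusion protein and/or an inhibitor of the polynucleotide molecule encoding the fusion protein.
Another embodiment provides a method of screening for anticancer agents, comprising treating cells expressing the fusion protein with a candidate substance and measuring the expression level of the fusion protein, wherein the candidate substance is selected as an anticancer agent when the expression level of the fusion protein is decreased compared with that of the cells before treatment with the candidate substance or of cells not treated with the candidate substance.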
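The selection rule in the screening method can be illustrated with a minimal sketch. The function name, the use of replicate means, and the sample readings are assumptions made for illustration; the text specifies only that a candidate is selected when the expression level is decreased relative to untreated (or pre-treatment) cells, and it does not prescribe a statistic or threshold.

```python
from statistics import mean


def is_candidate_hit(untreated_levels, treated_levels):
    """Return True if mean fusion-protein expression is lower in treated cells.

    Both arguments are replicate measurements in the same arbitrary units
    (hypothetical inputs; the assay itself is not modeled here).
    """
    if not untreated_levels or not treated_levels:
        raise ValueError("each condition needs at least one measurement")
    return mean(treated_levels) < mean(untreated_levels)


# Hypothetical replicate readings, for illustration only.
print(is_candidate_hit([1.00, 0.95, 1.05], [0.40, 0.35, 0.45]))  # True
print(is_candidate_hit([1.00, 0.95, 1.05], [1.10, 1.00, 1.20]))  # False
```

In practice a significance test or a minimum fold change would likely be layered on top of this comparison; that choice is outside what the text states.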
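To apply appropriately targeted therapies to cancer patients, it is important to understand the molecular characteristics of the cancer. Massively parallel RNA sequencing is one effective means of understanding the overall genetic basis of cancer, including gene expression, point mutations, fusion genes, and alternative splicing.
The present inventors therefore studied lung cancer, a representative solid cancer. Specifically, genetic alterations were surveyed extensively in 200 surgical specimens of lung adenocarcinoma obtained from Korean patients; of these, 87 were analyzed by transcriptome sequencing combined with transcriptome sequencing (n=77) and whole-exome sequencing (n=76) of matched adjacent normal tissue samples, and fusion proteins specific to lung cancer were identified, thereby completing the present invention. Transcriptome sequencing is a suitable method for detecting driver mutations in cancer because it can examine not only somatic point mutations but also aberrant RNA variants such as fusion genes and alternative splicing. Although advances in genetic technology have made extensive genomic analysis of cancer possible, the present invention is the first large-scale study of lung adenocarcinoma using RNA sequencing.
More specifically, the transcriptomes of 87 surgical specimens obtained from Korean patients were analyzed in combination with exome and RNA sequencing results from 77 adjacent normal tissues. Gene expression patterns showed stronger perturbation in cancer tissue obtained from smokers. In addition, somatic mutations were identified in transforming genes such as EGFR, KRAS, NRAS, BRAF, PIK3CA, MET, and CTNNB1. The frequency of EGFR mutations was extremely high (~60%) in Korean patients, which may be explained by an EGFR haplotype that is predominant in Asians. The present inventors identified 30 chimeric transcripts involving ALK, RET, ROS1, and other tyrosine kinase genes (such as FGFR2, PDGFRA, and AXL), which appear very likely to be driver mutations in lung cancer.
As a result of this study, it was confirmed that surgical specimens from cancer (for example, non-small cell lung cancer, NSCLC) patients contain, as alterations expressed specifically in cancer tissue and not found in adjacent normal tissue samples from the same tissue as the surgical specimen, fusion genes in which two genes located on the same chromosome or on different chromosomes are fused, and/or fusion proteins produced by expression of those fusion genes.
The fusion genes found in the present invention to be specifically present in cancer tissue (for example, lung cancer tissue) are summarized in Table 1 below: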
No. | Donor Gene | Acceptor Gene | Chromosome (Donor; Acceptor) | Distance (Mb) |
1 | CCDC6 | ROS1 | chr10(q21.2);chr6(q22.1) | Interchromosomal |
2 | SCAF11 | PDGFRA | chr12(q12);chr4(q12) | Interchromosomal |
3 | FGFR2 | CIT | chr10(q26.13);chr12(q24.23) | Interchromosomal |
4 | AXL | MBIP | chr19(q13.2);chr14(q13.3) | Interchromosomal |
5 | APLP2 | TNFSF11 | chr11(q24.3);chr13(q14.11) | Interchromosomal |
6 | MAP4K3 | PRKCE | chr2(p22.1);chr2(p21) | 6.215 |
7 | BCAS3 | MAP3K3 | chr17(q23.2); chr17(q23.3) | 2.23 |
8 | KRAS | CDH13 | chr12(p12.1);chr16(q23.3) | Interchromosomal |
9 | ZFYVE9 | CGA | chr1(p32.3);chr6(q14.3) | Interchromosomal |
10 | ERBB2IP | MAST4 | chr5(q12.3);chr5(q12.3) | 0.515 |
11 | TPD52L1 | TRMT11 | chr6(q22.31); chr6(q22.32) | 0.723 |
12 | TXNRD1 | GPR133 | chr12(q23.3);chr12(q24.33) | 26.694 |
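The last column of the table distinguishes fusions between different chromosomes from fusions within one chromosome, for which a distance in Mb is listed. A minimal sketch of that distinction, working from the locus strings in the fourth column, is given here; the function names and the label "Intrachromosomal" are assumptions introduced for illustration.

```python
import re


def chromosome(locus):
    """Extract the chromosome name from a locus such as 'chr10(q21.2)'."""
    match = re.match(r"\s*(chr[0-9XY]+)", locus)
    if match is None:
        raise ValueError(f"unrecognized locus: {locus!r}")
    return match.group(1)


def fusion_type(loci):
    """Classify a 'donor;acceptor' locus pair by whether the chromosomes differ."""
    donor, acceptor = loci.split(";")
    if chromosome(donor) != chromosome(acceptor):
        return "Interchromosomal"
    return "Intrachromosomal"


print(fusion_type("chr1(p32.3);chr6(q14.3)"))      # Interchromosomal (ZFYVE9; CGA)
print(fusion_type("chr17(q23.2); chr17(q23.3)"))   # Intrachromosomal (BCAS3; MAP3K3)
```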
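Because the fusion genes listed in Table 1 were confirmed to be observed specifically in cancer tissue, the fusion genes and/or the fusion proteins are useful as biomarkers for cancer diagnosis and as targets for cancer treatment.
Accordingly, one embodiment of the present invention provides a fusion protein selected from the group consisting of:
a CCDC6-ROS1 fusion protein in which a CCDC6 protein or a fragment thereof is fused to a ROS1 protein or a fragment thereof;
an FGFR2-CIT fusion protein in which an FGFR2 protein or a fragment thereof is fused to a CIT protein or a fragment thereof;
an AXL-MBIP fusion protein in which an AXL protein or a fragment thereof is fused to an MBIP protein or a fragment thereof;
an APLP2-TNFSF11 fusion protein in which an APLP2 protein or a fragment thereof is fused to a TNFSF11 protein or a fragment thereof;
a MAP4K3-PRKCE fusion protein in which a MAP4K3 protein or a fragment thereof is fused to a PRKCE protein or a fragment thereof;
a BCAS3-MAP3K3 fusion protein in which a BCAS3 protein or a fragment thereof is fused to a MAP3K3 protein or a fragment thereof;
a KRAS-CDH13 fusion protein in which a KRAS protein or a fragment thereof is fused to a CDH13 protein or a fragment thereof;
a ZFYVE9-CGA fusion protein in which a ZFYVE9 protein or a fragment thereof is fused to a CGA protein or a fragment thereof;
an ERBB2IP-MAST4 fusion protein in which an ERBB2IP protein or a fragment thereof is fused to a MAST4 protein or a fragment thereof;
a TPD52L1-TRMT11 fusion protein in which a TPD52L1 protein or a fragment thereof is fused to a TRMT11 protein or a fragment thereof; and
a TXNRD1-GPR133 fusion protein in which a TXNRD1 protein or a fragment thereof is fused to a GPR133 protein or a fragment thereof.
Another embodiment provides a polynucleotide molecule encoding the fusion protein.
The fusion protein and/or the polynucleotide molecule encoding it can be usefully employed as a diagnostic marker for cancer.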
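The fusion partners and fusion proteins are described in more detail below. In this specification, the break point (or fusion site) of each protein fragment serving as a fusion partner is described with reference to the exons of the gene encoding it. Cleavage at any point within the intron between the last exon retained at the N-terminus (for a C-terminal fusion partner) or C-terminus (for an N-terminal fusion partner) of the fragment and the exon that is cleaved off and removed does not affect the amino acid sequence of the encoded protein fragment, so the actual break point may lie anywhere within that intron.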
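The point that the exact intronic break position does not change the encoded sequence can be illustrated with a toy model. The sequences here are made up, and the model assumes that the chimeric intron is always removed completely by splicing, which in a real gene requires the splice signals at both intron ends to remain intact.

```python
def splice(segments):
    """Join exon sequences and drop intron sequences, as splicing would."""
    return "".join(seq for kind, seq in segments if kind == "exon")


def fuse(donor_cut, acceptor_cut):
    """Build a toy fusion gene with break points inside the two introns."""
    donor_exon, donor_intron = "ATGGCTCAG", "gtaagtccca"        # made-up sequences
    acceptor_intron, acceptor_exon = "ttctcttcag", "TGGCATTAA"  # made-up sequences
    chimeric_intron = donor_intron[:donor_cut] + acceptor_intron[acceptor_cut:]
    return [("exon", donor_exon), ("intron", chimeric_intron), ("exon", acceptor_exon)]


# Try every combination of break positions within the two 10-base introns.
spliced = {splice(fuse(d, a)) for d in range(11) for a in range(11)}
print(spliced)  # {'ATGGCTCAGTGGCATTAA'}
```

Every break position yields the same spliced transcript, which is why the specification can describe break points at exon resolution.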
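The CCDC6 gene, which encodes the CCDC6 (coiled-coil domain containing 6) protein, may be of human origin and is located on human chromosome 10 (q21.2); the CCDC6 protein it encodes has a total length of 474 aa. The CCDC6 protein or a fragment of the CCDC6 protein is the N-terminal fusion partner of the CCDC6-ROS1 fusion protein. In a specific embodiment, the CCDC6 gene may have the nucleotide sequence provided under GenBank accession no. NM_005436, and the CCDC6 protein may be a protein having the amino acid sequence encoded by NM_005436.
The fragment of the CCDC6 protein may have the amino acid sequence encoded by the nucleotide sequence of NM_005436 from the first exon through exon 5 (bases 61572393-61572553 on chromosome 10, (-) strand), and may be in a form in which one nucleotide (c) that does not complete a codon is present at the 3' end of exon 5. In a specific embodiment, the gene encoding the CCDC6 protein fragment and the CCDC6 protein encoded thereby are summarized in Tables 2 and 3 below: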
CCDC6 gene (Accession No.) | Coding region of CCDC6 protein (CDS) | Coding region of CCDC6 protein fragment: by exon | Coding region of CCDC6 protein fragment: by cDNA | Break-point position on chromosome | Break-point nucleotide sequence |
NM_005436 | 233~1657 (1425bp) (SEQ ID NO: 1) | First exon through exon 5 | 233~1079 (846bp + 1nt(c); 847bp in total) (SEQ ID NO: 2) | chr10:[61572393 (3' end of exon 5) | gctgctcagttacagc (SEQ ID NO: 3) |
Full size of CCDC6 protein (a.a) | CCDC6 protein fragment region | Amino acid sequence at break point |
474aa (SEQ ID NO: 4) | 1~282aa + 1nt(c) (amino acid sequence: SEQ ID NO: 5) | AAQLQ + 1nt(c) (amino acid sequence: SEQ ID NO: 6) |
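The base-pair and residue counts in the two CCDC6 tables follow from inclusive coordinate arithmetic, sketched here. The helper names are illustrative, and reading the final codon of the CDS as a stop codon is an assumption inferred from the figures (1425 bp gives 475 codons, against a stated length of 474 aa).

```python
def span_length(start, end):
    """Number of bases in an inclusive, 1-based coordinate range."""
    return end - start + 1


def codons_and_remainder(start, end):
    """Split a range into whole codons plus leftover bases (0, 1 or 2)."""
    return divmod(span_length(start, end), 3)


# Full CDS of NM_005436 as listed: 233~1657.
print(span_length(233, 1657))           # 1425
print(codons_and_remainder(233, 1657))  # (475, 0): 474 residues plus, presumably, a stop codon

# Fragment through exon 5 as listed: 233~1079.
print(codons_and_remainder(233, 1079))  # (282, 1): 282 residues plus 1 nt (c)
```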
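The ROS1 gene, which encodes the ROS1 (receptor tyrosine kinase 1) protein, may be of human origin and is located on human chromosome 6 (q22.1); the ROS1 protein it encodes has a total length of 2347 aa. The ROS1 protein or a fragment of the ROS1 protein is the C-terminal fusion partner of the CCDC6-ROS1 fusion protein. In a specific embodiment, the ROS1 gene may have the nucleotide sequence provided under GenBank accession no. NM_002944, and the ROS1 protein may be a protein having the amino acid sequence encoded by NM_002944.
The fragment of the ROS1 protein may have the amino acid sequence encoded by the nucleotide sequence of NM_002944 from exon 35 (bases 117642422-117642557 on chromosome 6, (-) strand) through the last exon. Here, the first two nucleotides (tc) at the 5' end of exon 35 may not complete a codon by themselves; upon fusion with the CCDC6 protein fragment, they join the one additional nucleotide (c) carried by the CCDC6 fragment, as described above, to form a codon (ctc) encoding one amino acid (L). In a specific embodiment, the gene encoding the ROS1 protein fragment and the ROS1 protein encoded thereby are summarized in Tables 4 and 5 below: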
ROS1 gene (Accession No.) | Coding region of ROS1 protein (CDS) | Coding region of ROS1 protein fragment: by exon | Coding region of ROS1 protein fragment: by cDNA | Break-point position on chromosome | Break-point nucleotide sequence |
NM_002944 | 200~7243 (7044bp) (SEQ ID NO: 7) | Exon 35 through the last exon | 5841~7243 (2nt(tc) + 1401bp; 1403bp in total) (SEQ ID NO: 8) | chr6:117642557] (5' end of exon 35) | tctggcatagaagatta (SEQ ID NO: 9) |
Full size of ROS1 protein (a.a) | ROS1 protein fragment region | Amino acid sequence at break point |
2347aa (SEQ ID NO: 10) | 2nt(tc) + 1882~2347aa (466aa) (amino acid sequence: SEQ ID NO: 11) | 2nt(tc) + WHRRL (amino acid sequence: SEQ ID NO: 12) |
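The fusion gene encoding the 'CCDC6-ROS1 fusion protein in which a CCDC6 protein or a fragment thereof is fused to a ROS1 protein or a fragment thereof' (the CCDC6-ROS1 fusion gene) may comprise, at its 5' end, a polynucleotide molecule encoding the CCDC6 protein or fragment thereof described above and, at its 3' end, a polynucleotide molecule encoding the ROS1 protein or fragment thereof described above. In a specific embodiment, the CCDC6-ROS1 fusion gene may be a fusion gene in which the nucleotide sequence from the first exon through exon 5 of NM_005436 on the 5' side is joined to the nucleotide sequence from exon 35 through the last exon of NM_002944 on the 3' side. More specifically, the CCDC6-ROS1 fusion gene may be a fusion gene (SEQ ID NO: 13; fusion site: SEQ ID NO: 14) in which the nucleotide sequence from position 233 to position 1079 of NM_005436 (SEQ ID NO: 2) on the 5' side is joined to the nucleotide sequence from position 5841 to position 7243 of NM_002944 (SEQ ID NO: 8) on the 3' side (see FIG. 17).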
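The way the extra nucleotide (c) of the CCDC6 fragment and the first two nucleotides (tc) of ROS1 exon 35 combine into one codon can be checked with a short translation sketch. It uses the standard genetic code and assumes that the two break-point sequences listed in the tables abut directly in the fusion transcript; the fusion-site sequence itself (SEQ ID NO: 14) is not reproduced in the text, so this is an illustration and not the recorded junction.

```python
from itertools import product

# Standard genetic code, with codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna):
    """Translate lowercase DNA in frame 0; a trailing partial codon is ignored."""
    whole = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, whole, 3))


ccdc6_side = "gctgctcagttacagc"   # break-point sequence listed for CCDC6 (ends with the extra c)
ros1_side = "tctggcatagaagatta"   # break-point sequence listed for ROS1 (starts with tc)

print(translate(ccdc6_side))              # AAQLQ
print(translate(ccdc6_side + ros1_side))  # AAQLQLWHRRL
```

The sixth residue (L) comes from the codon ctc, which exists only in the joined sequence.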
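The CCDC6-ROS1 fusion protein is a fusion protein in which the CCDC6 protein or fragment described above on the N-terminal side is joined to the ROS1 protein or fragment described above on the C-terminal side. For example, it may have the amino acid sequence encoded by a fusion gene in which the nucleotide sequence from the first exon through exon 5 of NM_005436 on the 5' side is joined to the nucleotide sequence from exon 35 through the last exon of NM_002944 on the 3' side. In a specific embodiment, it may be a polypeptide molecule having the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 13 (SEQ ID NO: 15; fusion site: SEQ ID NO: 16, see FIG. 18), or a polypeptide molecule having at least 90%, specifically at least 95%, and more specifically at least 99% sequence homology with that sequence. An example of the CCDC6-ROS1 fusion gene is shown schematically in FIG. 1.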
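The 90%, 95%, and 99% thresholds can be made concrete with a minimal sketch of a percent-identity calculation. The text does not state how homology is computed, so the position-by-position comparison of two pre-aligned, equal-length sequences used here is an assumption, and the variant sequence is hypothetical.

```python
def percent_identity(seq_a, seq_b):
    """Percentage of matching positions between two pre-aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


reference = "AAQLQLWHRRL"  # junction residues AAQLQ + L + WHRRL given in the text
variant = "AAQLQLWHRKL"    # hypothetical variant with one substitution
print(percent_identity(reference, reference))          # 100.0
print(round(percent_identity(reference, variant), 1))  # 90.9
```

For full-length proteins an alignment step (with gaps) would come first; that step is deliberately left out of this sketch.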
The FGFR2 gene, which encodes the FGFR2 (fibroblast growth factor receptor 2) protein, may be of human origin and is located on human chromosome 10 (q26.13); the FGFR2 protein is the protein it encodes. The FGFR2 protein or a fragment of the FGFR2 protein is the N-terminal fusion partner of the FGFR2-CIT fusion protein. In a specific embodiment, the FGFR2 gene may have a nucleotide sequence provided under GenBank accession no. NM_001144914, NM_001144916, NM_001144915, NM_001144917, NM_001144918, NM_022970, NM_000141, NM_001144913, NM_001144919, or the like, and the FGFR2 protein may be a protein having the amino acid sequence encoded by any one of these nucleotide sequences.
The fragment of the FGFR2 protein may have the amino acid sequence encoded by the above nucleotide sequence from the first exon through exon 19 (bases 123243212-123243317 on chromosome 10, (-) strand). In a specific embodiment, the gene encoding the FGFR2 protein fragment and the FGFR2 protein encoded thereby are summarized in Tables 6 and 7 below:
FGFR2 gene (Accession No.) | Coding region of FGFR2 protein (CDS) | Coding region of FGFR2 protein fragment: by exon | Coding region of FGFR2 protein fragment: by cDNA | Break-point position on chromosome | Break-point nucleotide sequence |
NM_001144914 | 151~2280 (2130bp) (SEQ ID NO: 17) | First exon through exon 19 | 151~2115 (1965bp) (SEQ ID NO: 18) | chr10:[123243212 (3' end of exon 19) | ctcactctcacaaccaatgag (SEQ ID NO: 19) |
NM_001144916 | 442~2562 (2121bp) | | 442~2397 (1956bp) | | |
NM_001144915 | 320~2443 (2124bp) | | 320~2353 (2034bp) | | |
NM_001144917 | 648~2765 (2118bp) | | 648~2600 (1953bp) | | |
NM_001144918 | 648~2762 (2115bp) | | 648~2597 (1950bp) | | |
NM_022970 | 648~3116 (2469bp) | | 648~2951 (2304bp) | | |
NM_000141 | 648~3113 (2466bp) | | 648~2948 (2301bp) | | |
NM_001144913 | 151~2460 (1813bp) | | 151~2454 (2304bp) | | |
NM_001144919 | 648~2690 (2043bp) | | 648~2648 (2001bp) | | |
FGFR2 gene (Accession No.) | Full size of FGFR2 protein (a.a) | FGFR2 protein fragment region | Amino acid sequence at break point |
NM_001144914 | 709aa (SEQ ID NO: 20) | 1~655aa (SEQ ID NO: 21) | LTLTTNE (SEQ ID NO: 22) |
NM_001144916 | 706aa | 1~652aa | |
NM_001144915 | 707aa | 1~678aa | |
NM_001144917 | 705aa | 1~651aa | |
NM_001144918 | 704aa | 1~650aa | |
NM_022970 | 822aa | 1~768aa | |
NM_000141 | 821aa | 1~767aa | |
NM_001144913 | 769aa | 1~768aa | |
NM_001144919 | 680aa | 1~679aa |
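The CIT gene, which encodes the CIT [citron (rho-interacting, serine/threonine kinase 21)] protein, may be of human origin and is located on human chromosome 12 (q24.23); the CIT protein it encodes has a total length of 2027 aa. The CIT protein or a fragment of the CIT protein is the C-terminal fusion partner of the FGFR2-CIT fusion protein. In a specific embodiment, the CIT gene may have the nucleotide sequence provided under GenBank accession no. NM_007174, and the CIT protein may be a protein having the amino acid sequence encoded by that nucleotide sequence.
The fragment of the CIT protein may have the amino acid sequence encoded by the above nucleotide sequence from exon 24 (bases 120180216-120180269 on chromosome 12, (-) strand) through the last exon. In a specific embodiment, the gene encoding the CIT protein fragment and the CIT protein encoded thereby are summarized in Tables 8 and 9 below: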
CIT gene (Accession No.) | Coding region of CIT protein (CDS) | Coding region of CIT protein fragment: by exon | Coding region of CIT protein fragment: by cDNA | Break-point position on chromosome | Break-point nucleotide sequence |
NM_007174 | 57~6140 (6084bp) (SEQ ID NO: 23) | Exon 24 through the last exon | 2835~6140 (3306bp) (SEQ ID NO: 24) | chr12:120180269] (5' end of exon 24) | gcacatagagatgaaatccag (SEQ ID NO: 25) |
Full size of CIT protein (a.a) | CIT protein fragment region | Amino acid sequence at break point |
2027aa (SEQ ID NO: 26) | 927~2027aa (1101aa) (SEQ ID NO: 27) | AHRDEIQ (SEQ ID NO: 28) |
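The fusion gene encoding the 'FGFR2-CIT fusion protein in which an FGFR2 protein or a fragment thereof is fused to a CIT protein or a fragment thereof' (the FGFR2-CIT fusion gene) may comprise, at its 5' end, a polynucleotide molecule encoding the FGFR2 protein or fragment thereof described above and, at its 3' end, a polynucleotide molecule encoding the CIT protein or fragment thereof described above. In a specific embodiment, the FGFR2-CIT fusion gene may be a fusion gene in which the nucleotide sequence from the first exon through exon 19 of NM_001144914, NM_001144916, NM_001144915, NM_001144917, NM_001144918, NM_022970, NM_000141, NM_001144913, or NM_001144919 on the 5' side is joined to the nucleotide sequence from exon 24 through the last exon of NM_007174 on the 3' side. For example, the FGFR2-CIT fusion gene may be a fusion gene (SEQ ID NO: 29; fusion site: SEQ ID NO: 30) in which the nucleotide sequence from position 151 to position 2115 of NM_001144914 (SEQ ID NO: 18) on the 5' side is joined to the nucleotide sequence from position 2835 to position 6140 of NM_007174 (SEQ ID NO: 24) on the 3' side (see FIGS. 23a and 23b).
The FGFR2-CIT fusion protein is a fusion protein in which the FGFR2 protein or fragment described above on the N-terminal side is joined to the CIT protein or fragment described above on the C-terminal side. For example, it may have the amino acid sequence encoded by a fusion gene in which the nucleotide sequence from the first exon through exon 19 of NM_001144914, NM_001144916, NM_001144915, NM_001144917, NM_001144918, NM_022970, NM_000141, NM_001144913, or NM_001144919 on the 5' side is joined to the nucleotide sequence from exon 24 through the last exon of NM_007174 on the 3' side. In a specific embodiment, it may be a polypeptide molecule having the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 29 (SEQ ID NO: 31; fusion site: SEQ ID NO: 32, see FIG. 24), or a polypeptide molecule having at least 90%, specifically at least 95%, and more specifically at least 99% sequence homology with that sequence. An example of the FGFR2-CIT fusion gene is shown schematically in FIG. 3.
The AXL gene, which encodes the AXL (AXL receptor tyrosine kinase) protein, may be of human origin and is located on human chromosome 19 (q13.2); the AXL protein is encoded by the AXL gene. The AXL protein or a fragment of the AXL protein is the N-terminal fusion partner of the AXL-MBIP fusion protein. In a specific embodiment, the AXL gene may have a nucleotide sequence provided under GenBank accession no. NM_021913, NM_001699, or the like, and the AXL protein may be a protein having the amino acid sequence encoded by NM_021913, NM_001699, or the like.
The fragment of the AXL protein may have the amino acid sequence encoded by the above nucleotide sequence from the first exon through the 244th nucleotide of exon 20 (bases 41765458-41767670 on chromosome 19, (+) strand). In a specific embodiment, the gene encoding the AXL protein fragment and the AXL protein encoded thereby are summarized in Tables 10 and 11 below: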
AXL gene (Accession No.) | Coding region of AXL protein (CDS) | Coding region of AXL protein fragment: by exon | Coding region of AXL protein fragment: by cDNA | Break-point position on chromosome | Break-point nucleotide sequence |
NM_021913 | 191~2875 (2685bp) (SEQ ID NO: 33) | First exon through the 244th nucleotide of exon 20 | 191~2767 (2577bp) (SEQ ID NO: 34) | chr19:41765701] (3' side of the 244th nucleotide of exon 20) | ctcactgcggctgag (SEQ ID NO: 35) |
NM_001699 | 191~2848 (2658bp) | | 191~2740 (2550bp) | | |
AXL gene (Accession No.) | Full size of AXL protein (a.a) | AXL protein fragment region | Amino acid sequence at break point |
NM_021913 | 894aa (SEQ ID NO: 36) | 1~859aa (SEQ ID NO: 37) | LTAAE (SEQ ID NO: 38) |
NM_001699 | 885aa | 1~850aa | |
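The AXL break point lies inside exon 20, not at an exon boundary, so its genomic coordinate follows from the exon start and the nucleotide offset. A minimal sketch of that arithmetic is given here; the function name and the handling of the (-) strand are illustrative additions, with coordinates taken as inclusive.

```python
def nth_base_position(exon_start, exon_end, n, strand):
    """Genomic coordinate of the n-th transcribed base (1-based) of an exon.

    exon_start <= exon_end are inclusive genomic coordinates.
    """
    if strand not in ("+", "-"):
        raise ValueError("strand must be '+' or '-'")
    if not 1 <= n <= exon_end - exon_start + 1:
        raise ValueError("n lies outside the exon")
    # On the (-) strand transcription runs from the higher coordinate downward.
    return exon_start + n - 1 if strand == "+" else exon_end - n + 1


# AXL exon 20 as listed: chr19:41765458-41767670 on the (+) strand.
print(nth_base_position(41765458, 41767670, 244, "+"))  # 41765701
```

The result matches the break-point position listed for AXL (chr19:41765701).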
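The MBIP gene, which encodes the MBIP (MAP3K12 binding inhibitory protein 1) protein, may be of human origin and is located on human chromosome 14 (q13.3); the MBIP protein is encoded by the MBIP gene. The MBIP protein or a fragment of the MBIP protein is the C-terminal fusion partner of the AXL-MBIP fusion protein. In a specific embodiment, the MBIP gene may have a nucleotide sequence provided under GenBank accession no. NM_016586, NM_001144891, or the like, and the MBIP protein may be a protein having the amino acid sequence encoded by NM_016586, NM_001144891, or the like.
The fragment of the MBIP protein may have the amino acid sequence encoded by the above nucleotide sequence from exon 4 (bases 36783718-36783814 on chromosome 14, (-) strand) through the last exon. In a specific embodiment, the gene encoding the MBIP protein fragment and the MBIP protein encoded thereby are summarized in Tables 12 and 13 below: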
MBIP gene (Accession No.) | Coding region of MBIP protein (CDS) | Coding region of MBIP protein fragment: by exon | Coding region of MBIP protein fragment: by cDNA | Break-point position on chromosome | Break-point nucleotide sequence |
NM_016586 | 89~1123 (1035bp) (SEQ ID NO: 39) | Exon 4 through the last exon | 563~1123 (561bp) (SEQ ID NO: 40) | chr14:36783814] (5' end of exon 4) | attgacagacgaata (SEQ ID NO: 41) |
NM_001144891 | 89~1120 (1032bp) | | 563~1120 (558bp) | | |
MBIP gene (Accession No.) | Full size of MBIP protein (a.a) | MBIP protein fragment region | Amino acid sequence at break point |
NM_016586 | 344aa (SEQ ID NO: 42) | 159~344aa (186aa) (SEQ ID NO: 43) | IDRRI (SEQ ID NO: 44) |
NM_001144891 | 343aa | 159~343aa (185aa) | |
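The correspondence between the cDNA start of a C-terminal fragment and its first residue number can be sketched as follows. The function name is illustrative, positions are 1-based, and the returned offset indicates whether the position falls on the first base of a codon (0) or inside one (1 or 2).

```python
def residue_at(cdna_pos, cds_start):
    """Return (residue number, offset within codon) for a 1-based cDNA position."""
    if cdna_pos < cds_start:
        raise ValueError("position lies upstream of the coding region")
    codon_index, offset = divmod(cdna_pos - cds_start, 3)
    return codon_index + 1, offset


# MBIP, NM_016586: the CDS starts at 89 and the fragment starts at 563.
print(residue_at(563, 89))  # (159, 0)
```

The fragment therefore starts on the first base of codon 159, which agrees with the fragment region 159~344aa listed for NM_016586.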
상기 'AXL 단백질 또는 그의 단편과 MBIP 단백질 또는 그의 단편이 융합된 AXL-MBIP 융합 단백질'을 암호화하는 융합 유전자 (AXL-MBIP 융합 유전자)는 5'-말단에 상기한 바와 같은 AXL 단백질 또는 그의 단편을 암호화하는 폴리뉴클레오타이드 분자 및 3'-말단에 상기한 바와 같은 MBIP 단백질 또는 그의 단편을 암호화하는 폴리뉴클레오타이드 분자를 포함하는 것일 수 있다. 구체예에서, 상기 AXL-MBIP 융합 유전자는 5' 말단쪽에 NM_021913, NM_001699 등의 첫번째 엑손에서 엑손 20 중의 244번째 뉴클레오타이드까지의 뉴클레오타이드 서열과 3' 말단쪽에 NM_016586, NM_001144891 등의 엑손 4부터 마지막 엑손까지 뉴클레오타이드 서열이 연결된 융합 유전자일 수 있다. 보다 구체적으로, 상기 AXL-MBIP 융합 유전자는 5' 말단쪽에 NM_021913의 191번째부터 2767번째까지의 뉴클레오타이드 서열 (서열번호 34)과 3' 말단쪽에 NM_016586의 563번째부터 1123번째까지의 뉴클레오타이드 서열(서열번호 40)이 연결된 융합 유전자(서열번호 45; 융합부위: 서열번호 46)일 수 있다 (도 29 참조).
상기 AXL-MBIP 융합 단백질은 N-말단쪽에 상기한 바와 같은 AXL 단백질 또는 단편과, C-말단쪽에 상기한 바와 같은 MBIP 단백질 또는 단편이 연결된 융합 단백질로서, 예컨대, 5' 말단쪽에 NM_021913, NM_001699 등의 첫번째 엑손에서 엑손 20의 244번째 뉴클레오타이드까지의 뉴클레오타이드 서열과 3' 말단쪽에 NM_016586, NM_001144891 등의 엑손 4부터 마지막 엑손까지 뉴클레오타이드 서열이 연결된 융합 유전자에 의하여 암호화되는 아미노산 서열을 갖는 것일 수 있으며, 구체예에서, 서열번호 45의 뉴클레오타이드 서열에 의하여 암호화되는 아미노산 서열(서열번호 47; 융합부위: 서열번호 48, 도 30 참조) 또는 상기 서열과 적어도 90% 이상, 구체적으로 95% 이상, 보다 구체적으로 99% 이상의 서열 상동성을 갖는 폴리펩타이드 분자일 수 있다. 상기 AXL-MBIP 융합 유전자의 일 예를 도 4에 모식적으로 나타내었다.
The APLP2 gene encoding the APLP2 (amyloid beta (A4) precursor-like protein 2) protein may be of human origin and is located on human chromosome 11 (q24.3); the APLP2 protein is the protein encoded by it. The APLP2 protein or a fragment thereof is the N-terminal fusion partner of the APLP2-TNFSF11 fusion protein. In an embodiment, the APLP2 gene may have a nucleotide sequence provided under GenBank accession no. NM_001642, NM_001142276, NM_001142278, NM_001142277, NR_024516, NR_024515, or the like, and the APLP2 protein may have an amino acid sequence encoded by any one of these nucleotide sequences.
The fragment of the APLP2 protein may have an amino acid sequence encoded by the nucleotide sequence from the first exon through exon 12 (bases 129999933-130000061 on chromosome 11, (+) strand) of the above nucleotide sequence. In an embodiment, the genes encoding the APLP2 protein fragment and the APLP2 proteins encoded thereby are summarized in Tables 14 and 15 below:
APLP2 gene (Accession No.) | CDS of APLP2 protein | Coding region of APLP2 protein fragment (by exon) | Coding region of APLP2 protein fragment (by cDNA position) | Break-point position on chromosome | Break-point nucleotide sequence |
NM_001642 | 158~2449 (2292 bp) (SEQ ID NO: 49) | First exon through exon 12 | 158~1741 (1584 bp) (SEQ ID NO: 50) | chr11:130000061] (3' end of exon 12) | gcggcccagatgaaatcccag (SEQ ID NO: 51) |
NM_001142276 | 158~2413 (2256 bp) | | 158~1741 (1584 bp) | | |
NM_001142278 | 158~1726 (1569 bp) | | 158~1054 (897 bp) | | |
NM_001142277 | 158~2245 (2088 bp) | | 158~1573 (1416 bp) | | |
APLP2 gene (Accession No.) | Full size of APLP2 protein (aa) | APLP2 protein fragment region | Break-point amino acid sequence |
NM_001642 | 763 aa (SEQ ID NO: 52) | 1~528 aa (SEQ ID NO: 53) | AAQMKSQ (SEQ ID NO: 54) |
NM_001142276 | 751 aa | 1~528 aa | |
NM_001142278 | 522 aa | 1~299 aa | |
NM_001142277 | 695 aa | 1~472 aa | |
The TNFSF11 gene encoding the TNFSF11 (tumor necrosis factor (ligand) superfamily, member 11) protein may be of human origin and is located on human chromosome 13 (q14.11); the TNFSF11 protein is the protein encoded by it. The TNFSF11 protein or a fragment thereof is the C-terminal fusion partner of the APLP2-TNFSF11 fusion protein. In an embodiment, the TNFSF11 gene may have a nucleotide sequence provided under GenBank accession no. NM_033012, NM_003701, or the like, and the TNFSF11 protein may have an amino acid sequence encoded by any one of these nucleotide sequences.
The fragment of the TNFSF11 protein may have an amino acid sequence encoded by the nucleotide sequence from exon 6 (bases 43174888-43174933 on chromosome 13, (+) strand) through the last exon of the above nucleotide sequence. In an embodiment, the genes encoding the TNFSF11 protein fragment and the TNFSF11 proteins encoded thereby are summarized in Tables 16 and 17 below:
TNFSF11 gene (Accession No.) | CDS of TNFSF11 protein | Coding region of TNFSF11 protein fragment (by exon) | Coding region of TNFSF11 protein fragment (by cDNA position) | Break-point position on chromosome | Break-point nucleotide sequence |
NM_033012 | 530~1264 (735 bp) (SEQ ID NO: 55) | Exon 6 through the last exon | 698~1264 (567 bp) (SEQ ID NO: 56) | chr13:[43174888 (5' end of exon 6) | gaattacaacatatcgttgga (SEQ ID NO: 57) |
NM_003701 | 150~1122 (973 bp) | | 537~2198 (1662 bp) | | |
TNFSF11 gene (Accession No.) | Full size of TNFSF11 protein (aa) | TNFSF11 protein fragment region | Break-point amino acid sequence |
NM_033012 | 244 aa (SEQ ID NO: 58) | 57~244 aa (188 aa) (SEQ ID NO: 59) | ELQHIVG (SEQ ID NO: 60) |
NM_003701 | 315 aa | 130~315 aa (186 aa) | |
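The fusion gene encoding the 'APLP2-TNFSF11 fusion protein in which the APLP2 protein or a fragment thereof is fused to the TNFSF11 protein or a fragment thereof' (the APLP2-TNFSF11 fusion gene) may comprise, at the 5' end, a polynucleotide molecule encoding the APLP2 protein or fragment thereof as described above and, at the 3' end, a polynucleotide molecule encoding the TNFSF11 protein or fragment thereof as described above. In an embodiment, the APLP2-TNFSF11 fusion gene may be a fusion gene in which the nucleotide sequence from the first exon through exon 12 of NM_001642, NM_001142276, NM_001142278, NM_001142277, NR_024516, NR_024515, or the like, on the 5' side, is joined to the nucleotide sequence from exon 6 through the last exon of NM_033012, NM_003701, or the like, on the 3' side. More specifically, the APLP2-TNFSF11 fusion gene may be a fusion gene (SEQ ID NO: 61; fusion site: SEQ ID NO: 62) in which the nucleotide sequence from position 158 to position 1741 of NM_001642 (SEQ ID NO: 50), on the 5' side, is joined to the nucleotide sequence from position 698 to position 1264 of NM_033012 (SEQ ID NO: 56), on the 3' side (see FIG. 35).
The APLP2-TNFSF11 fusion protein is a fusion protein in which the APLP2 protein or fragment as described above, on the N-terminal side, is linked to the TNFSF11 protein or fragment as described above, on the C-terminal side. For example, it may have an amino acid sequence encoded by a fusion gene in which the nucleotide sequence from the first exon through exon 12 of NM_001642, NM_001142276, NM_001142278, NM_001142277, NR_024516, NR_024515, or the like, on the 5' side, is joined to the nucleotide sequence from exon 6 through the last exon of NM_033012, NM_003701, or the like, on the 3' side. In an embodiment, it may be the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 61 (SEQ ID NO: 63; fusion site: SEQ ID NO: 64; see FIG. 36), or a polypeptide molecule having at least 90%, specifically at least 95%, more specifically at least 99% sequence homology to that sequence. An example of the APLP2-TNFSF11 fusion gene is shown schematically in FIG. 5.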
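The joining of a 5' fragment and a 3' fragment by cDNA position, as described above for the APLP2-TNFSF11 fusion gene, can be sketched as plain string slicing. This is an illustration only: positions are assumed to be 1-based and inclusive, the helper names are hypothetical, and placeholder strings of `n` stand in for the real NM_001642 and NM_033012 sequences, which are not reproduced here.

```python
from typing import Tuple


def cdna_slice(seq: str, start: int, end: int) -> str:
    """Return positions start..end (1-based, inclusive) of a cDNA string."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates outside the sequence")
    return seq[start - 1:end]


def join_fragments(five_prime: str, five_range: Tuple[int, int],
                   three_prime: str, three_range: Tuple[int, int]) -> Tuple[str, int]:
    """Join a 5' fragment to a 3' fragment; also return the 0-based junction index."""
    left = cdna_slice(five_prime, *five_range)
    right = cdna_slice(three_prime, *three_range)
    return left + right, len(left)


# Placeholder sequences with the right lengths only (not real sequence data)
aplp2_placeholder = "n" * 1741
tnfsf11_placeholder = "n" * 1264
fusion, junction = join_fragments(aplp2_placeholder, (158, 1741),
                                  tnfsf11_placeholder, (698, 1264))
print(len(fusion), junction)
```

With the coordinates given in the text, the 5' part contributes 1584 bp and the 3' part 567 bp, so the joined coding sequence in this sketch is 2151 bp long with the junction after base 1584.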
The MAP4K3 gene encoding the MAP4K3 (mitogen-activated protein kinase kinase kinase kinase 3) protein may be of human origin and is located on human chromosome 2 (p22.1); the MAP4K3 protein encoded by it has a total length of 894 aa. The MAP4K3 protein or a fragment thereof is the N-terminal fusion partner of the MAP4K3-PRKCE fusion protein. In an embodiment, the MAP4K3 gene may have the nucleotide sequence provided under GenBank accession no. NM_003618, and the MAP4K3 protein may have the amino acid sequence encoded by NM_003618.
The fragment of the MAP4K3 protein may comprise an amino acid sequence encoded by the nucleotide sequence of exon 1 of NM_003618 (bases 39664033-39664219 on chromosome 2, (-) strand). In an embodiment, the gene encoding the MAP4K3 protein fragment and the MAP4K3 protein encoded thereby are summarized in Tables 18 and 19 below:
MAP4K3 gene (Accession No.) | CDS of MAP4K3 protein | Coding region of MAP4K3 protein fragment (by exon) | Coding region of MAP4K3 protein fragment (by cDNA position) | Break-point position on chromosome | Break-point nucleotide sequence |
NM_003618 | 326~3010 (2685 bp) (SEQ ID NO: 65) | Exon 1 | 326~421 (96 bp) (SEQ ID NO: 66) | chr2:[39664033 (3' end of exon 1) | acctacggcgacgtctacaag (SEQ ID NO: 67) |
Full size of MAP4K3 protein (aa) | MAP4K3 protein fragment region | Break-point amino acid sequence |
894 aa (SEQ ID NO: 68) | 1~32 aa (SEQ ID NO: 69) | TYGDVYK (SEQ ID NO: 70) |
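The relationship between a break-point nucleotide sequence and the corresponding break-point amino acid sequence in the tables above can be checked by in-frame translation. The following is a minimal sketch using the standard genetic code; the `translate` helper is hypothetical and assumes the sequence is given in reading frame, which holds for the MAP4K3 entry.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt: str) -> str:
    """Translate a DNA string read from its first base; trailing partial codons are ignored."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


# Break-point sequence of MAP4K3 (SEQ ID NO: 67 in the table above)
print(translate("acctacggcgacgtctacaag"))
```

Entries whose break point falls inside a codon carry one or two extra nucleotides and would need the frame offset taken into account before translating.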
The PRKCE gene encoding the PRKCE (protein kinase C, epsilon) protein may be of human origin and is located on human chromosome 2 (p21); the PRKCE protein encoded by it has a total length of 737 aa. The PRKCE protein or a fragment thereof is the C-terminal fusion partner of the MAP4K3-PRKCE fusion protein. In an embodiment, the PRKCE gene may have the nucleotide sequence provided under GenBank accession no. NM_005400, and the PRKCE protein may have the amino acid sequence encoded by NM_005400.
The fragment of the PRKCE protein may have an amino acid sequence encoded by the nucleotide sequence from exon 2 (bases 46070139-46070202 on chromosome 2, (+) strand) through the last exon of NM_005400. In an embodiment, the gene encoding the PRKCE protein fragment and the PRKCE protein encoded thereby are summarized in Tables 20 and 21 below:
PRKCE gene (Accession No.) | CDS of PRKCE protein | Coding region of PRKCE protein fragment (by exon) | Coding region of PRKCE protein fragment (by cDNA position) | Break-point position on chromosome | Break-point nucleotide sequence |
NM_005400 | 198~2411 (2214 bp) (SEQ ID NO: 71) | Exon 2 through the last exon | 546~2411 (1866 bp) (SEQ ID NO: 72) | chr2:[46070139 (5' end of exon 2) | attgatctggagccagaaggaaga (SEQ ID NO: 73) |
Full size of PRKCE protein (aa) | PRKCE protein fragment region | Break-point amino acid sequence |
737 aa (SEQ ID NO: 74) | 117~737 aa (621 aa) (SEQ ID NO: 75) | IDLEPEGR (SEQ ID NO: 76) |
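The fusion gene encoding the 'MAP4K3-PRKCE fusion protein in which the MAP4K3 protein or a fragment thereof is fused to the PRKCE protein or a fragment thereof' (the MAP4K3-PRKCE fusion gene) may comprise, at the 5' end, a polynucleotide molecule encoding the MAP4K3 protein or fragment thereof as described above and, at the 3' end, a polynucleotide molecule encoding the PRKCE protein or fragment thereof as described above. In an embodiment, the MAP4K3-PRKCE fusion gene may be a fusion gene in which the nucleotide sequence of exon 1 of NM_003618, on the 5' side, is joined to the nucleotide sequence from exon 2 through the last exon of NM_005400, on the 3' side. More specifically, the MAP4K3-PRKCE fusion gene may be a fusion gene (SEQ ID NO: 77; fusion site: SEQ ID NO: 78) in which the nucleotide sequence from position 326 to position 421 of NM_003618 (SEQ ID NO: 66), on the 5' side, is joined to the nucleotide sequence from position 546 to position 2411 of NM_005400 (SEQ ID NO: 72), on the 3' side (see FIG. 41).
The MAP4K3-PRKCE fusion protein is a fusion protein in which the MAP4K3 protein or fragment as described above, on the N-terminal side, is linked to the PRKCE protein or fragment as described above, on the C-terminal side. For example, it may have an amino acid sequence encoded by a fusion gene in which the nucleotide sequence of exon 1 of NM_003618, on the 5' side, is joined to the nucleotide sequence from exon 2 through the last exon of NM_005400, on the 3' side. In an embodiment, it may be the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 77 (SEQ ID NO: 79; fusion site: SEQ ID NO: 80; see FIG. 42), or a polypeptide molecule having at least 90%, specifically at least 95%, more specifically at least 99% sequence homology to that sequence. An example of the MAP4K3-PRKCE fusion gene is shown schematically in FIG. 6.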
The BCAS3 gene encoding the BCAS3 (breast carcinoma amplified sequence 3) protein may be of human origin and is located on human chromosome 17 (q23.2); the BCAS3 protein is the protein encoded by it. The BCAS3 protein or a fragment thereof is the N-terminal fusion partner of the BCAS3-MAP3K3 fusion protein. In an embodiment, the BCAS3 gene may have a nucleotide sequence provided under GenBank accession no. NM_017679, NM_001099432, or the like, and the BCAS3 protein may have an amino acid sequence encoded by any one of these nucleotide sequences.
The fragment of the BCAS3 protein may have an amino acid sequence encoded by the nucleotide sequence from the first exon through exon 23 (bases 59161828-59161925 on chromosome 17, (+) strand) of the above nucleotide sequence, and may be in a form in which one nucleotide (g) that does not form a codon is present at the 3' end of exon 23. In an embodiment, the genes encoding the BCAS3 protein fragment and the BCAS3 proteins encoded thereby are summarized in Tables 22 and 23 below:
BCAS3 gene (Accession No.) | CDS of BCAS3 protein | Coding region of BCAS3 protein fragment (by exon) | Coding region of BCAS3 protein fragment (by cDNA position) | Break-point position on chromosome | Break-point nucleotide sequence |
NM_001099432 | 110~2896 (2787 bp) (SEQ ID NO: 81) | First exon through exon 23 | 110~2579 (2469 bp + 1 nt (g); 2470 bp in total) (SEQ ID NO: 82) | chr17:59161925] (3' end of exon 23) | acagtgattgatgctgcctcag (SEQ ID NO: 83) |
NM_017679 | 110~2851 (2742 bp) | | 110~2534 (2425 bp) | | |
BCAS3 gene (Accession No.) | Full size of BCAS3 protein (aa) | BCAS3 protein fragment region | Break-point amino acid sequence |
NM_001099432 | 928 aa (SEQ ID NO: 84) | 1~823 aa + 1 nt (g) (amino acid sequence: SEQ ID NO: 85) | TVIDAAS + 1 nt (g) (amino acid sequence: SEQ ID NO: 86) |
NM_017679 | 913 aa | 1~808 aa | |
The MAP3K3 gene encoding the MAP3K3 (mitogen-activated protein kinase kinase kinase 3) protein may be of human origin and is located on human chromosome 17 (q23.3); the MAP3K3 protein is the protein encoded by it. The MAP3K3 protein or a fragment thereof is the C-terminal fusion partner of the BCAS3-MAP3K3 fusion protein. In an embodiment, the MAP3K3 gene may have a nucleotide sequence provided under GenBank accession no. NM_002401, NM_203351, or the like, and the MAP3K3 protein may have an amino acid sequence encoded by any one of these nucleotide sequences.
The fragment of the MAP3K3 protein may have an amino acid sequence encoded by the nucleotide sequence from exon 2 (bases 61710041-61710162 on chromosome 17, (+) strand) through the last exon of the above nucleotide sequence. Here, the first two nucleotides (ac) at the 5' end of exon 2 may be in a form that does not constitute a codon; upon fusion with the BCAS3 protein fragment they may be joined to the one additional nucleotide (g) carried by the BCAS3 protein fragment, as described above, to form a codon (gac) encoding a single amino acid (D). In an embodiment, the genes encoding the MAP3K3 protein fragment and the MAP3K3 proteins encoded thereby are summarized in Tables 24 and 25 below:
MAP3K3 gene (Accession No.) | CDS of MAP3K3 protein | Coding region of MAP3K3 protein fragment (by exon) | Coding region of MAP3K3 protein fragment (by cDNA position) | Break-point position on chromosome | Break-point nucleotide sequence |
NM_002401 | 320~2200 (1881 bp) (SEQ ID NO: 87) | Exon 2 through the last exon | 324~2200 (2 nt (ac) + 1875 bp; 1877 bp in total) (SEQ ID NO: 88) | chr17:61710041] (5' end of exon 2) | acgaacaggaggcattgaactca (SEQ ID NO: 89) |
NM_203351 | 320~2293 (1974 bp) | | 324~2293 (1970 bp) | | |
MAP3K3 gene (Accession No.) | Full size of MAP3K3 protein (aa) | MAP3K3 protein fragment region | Break-point amino acid sequence |
NM_002401 | 626 aa (SEQ ID NO: 90) | 3~626 aa (2 nt (ac) + 3~626 aa; 624 aa in total) (amino acid sequence: SEQ ID NO: 91) | 2 nt (ac) + EQEALNS (amino acid sequence: SEQ ID NO: 92) |
NM_203351 | 657 aa | 3~657 aa | |
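The split codon at the BCAS3-MAP3K3 junction described above (one leftover nucleotide from the 5' partner plus two leading nucleotides from the 3' partner) can be sketched as follows. This is an illustrative helper only; `junction_residue` is a hypothetical name, and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def junction_residue(left_overhang: str, right_overhang: str):
    """Combine the unpaired nucleotides on each side of a fusion junction into one codon.

    left_overhang: nucleotides left over at the 3' end of the upstream fragment.
    right_overhang: nucleotides at the 5' end of the downstream fragment that
    do not form a codon on their own.
    """
    codon = (left_overhang + right_overhang).upper()
    if len(codon) != 3:
        raise ValueError("overhangs must add up to exactly one codon (3 nt)")
    return codon.lower(), CODON_TABLE[codon]


# BCAS3 contributes 'g', MAP3K3 contributes 'ac'
print(junction_residue("g", "ac"))
```

The same helper applies to any junction where the overhangs on the two sides add up to three nucleotides.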
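The fusion gene encoding the 'BCAS3-MAP3K3 fusion protein in which the BCAS3 protein or a fragment thereof is fused to the MAP3K3 protein or a fragment thereof' (the BCAS3-MAP3K3 fusion gene) may comprise, at the 5' end, a polynucleotide molecule encoding the BCAS3 protein or fragment thereof as described above and, at the 3' end, a polynucleotide molecule encoding the MAP3K3 protein or fragment thereof as described above. In an embodiment, the BCAS3-MAP3K3 fusion gene may be a fusion gene in which the nucleotide sequence from the first exon through exon 23 of NM_017679, NM_001099432, or the like, on the 5' side, is joined to the nucleotide sequence from exon 2 through the last exon of NM_002401, NM_203351, or the like, on the 3' side. More specifically, the BCAS3-MAP3K3 fusion gene may be a fusion gene (SEQ ID NO: 93; fusion site: SEQ ID NO: 94) in which the nucleotide sequence from position 110 to position 2579 of NM_001099432 (SEQ ID NO: 82), on the 5' side, is joined to the nucleotide sequence from position 324 to position 2200 of NM_002401 (SEQ ID NO: 88), on the 3' side (see FIGS. 47a and 47b).
The BCAS3-MAP3K3 fusion protein is a fusion protein in which the BCAS3 protein or fragment as described above, on the N-terminal side, is linked to the MAP3K3 protein or fragment as described above, on the C-terminal side. For example, it may have an amino acid sequence encoded by a fusion gene in which the nucleotide sequence from the first exon through exon 23 of NM_017679, NM_001099432, or the like, on the 5' side, is joined to the nucleotide sequence from exon 2 through the last exon of NM_002401, NM_203351, or the like, on the 3' side. In an embodiment, it may be the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 93 (SEQ ID NO: 95; fusion site: SEQ ID NO: 96; see FIG. 48), or a polypeptide molecule having at least 90%, specifically at least 95%, more specifically at least 99% sequence homology to that sequence. An example of the BCAS3-MAP3K3 fusion gene is shown schematically in FIG. 7.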
The KRAS gene encoding the KRAS (Ki-ras2 Kirsten rat sarcoma viral oncogene homolog) protein may be of human origin and is located on human chromosome 12 (p12.1); the KRAS protein is the protein encoded by it. The KRAS protein or a fragment thereof is the N-terminal fusion partner of the KRAS-CDH13 fusion protein. In an embodiment, the KRAS gene may have a nucleotide sequence provided under GenBank accession no. NM_004985, NM_033360, or the like, and the KRAS protein may have an amino acid sequence encoded by any one of these nucleotide sequences.
The fragment of the KRAS protein may have an amino acid sequence encoded by the nucleotide sequence from the first exon through exon 4 (bases 25378548-25378707 on chromosome 12, (-) strand) of the above nucleotide sequence. In an embodiment, the genes encoding the KRAS protein fragment and the KRAS proteins encoded thereby are summarized in Tables 26 and 27 below:
KRAS gene (Accession No.) | CDS of KRAS protein | Coding region of KRAS protein fragment (by exon) | Coding region of KRAS protein fragment (by cDNA position) | Break-point position on chromosome | Break-point nucleotide sequence |
NM_004985 | 182~748 (567 bp) (SEQ ID NO: 97) | First exon through exon 4 | 182~631 (450 bp) (SEQ ID NO: 98) | chr12:[25378548 (3' end of exon 4) | acatcagcaaagacaagacag (SEQ ID NO: 99) |
NM_033360 | 182~751 (570 bp) | | 182~631 (450 bp) | | |
KRAS gene (Accession No.) | Full size of KRAS protein (aa) | KRAS protein fragment region | Break-point amino acid sequence |
NM_004985 | 188 aa (SEQ ID NO: 100) | 1~150 aa (SEQ ID NO: 101) | TSAKTRQ (SEQ ID NO: 102) |
NM_033360 | 189 aa | 1~150 aa | |
The CDH13 gene encoding the CDH13 (cadherin 13, H-cadherin) protein may be of human origin and is located on human chromosome 16 (q23.3); the CDH13 protein is the protein encoded by it. The CDH13 protein or a fragment thereof is the C-terminal fusion partner of the KRAS-CDH13 fusion protein. In an embodiment, the CDH13 gene may have the nucleotide sequence provided under GenBank accession no. NM_001257, and the CDH13 protein may have the amino acid sequence encoded by this nucleotide sequence.
The fragment of the CDH13 protein may have an amino acid sequence encoded by the nucleotide sequence from exon 5 (bases 83158990-83159106 on chromosome 16, (+) strand) through the last exon of the above nucleotide sequence. In an embodiment, the gene encoding the CDH13 protein fragment and the CDH13 protein encoded thereby are summarized in Tables 28 and 29 below:
CDH13 gene (Accession No.) | CDS of CDH13 protein | Coding region of CDH13 protein fragment (by exon) | Coding region of CDH13 protein fragment (by cDNA position) | Break-point position on chromosome | Break-point nucleotide sequence |
NM_001257 | 300~2441 (2142 bp) (SEQ ID NO: 103) | Exon 5 through the last exon | 666~2441 (1776 bp) (SEQ ID NO: 104) | chr16:[83158990 (5' end of exon 5) | gatatatttaaatttgcaaga (SEQ ID NO: 105) |
CDH13 gene (Accession No.) | Full size of CDH13 protein (aa) | CDH13 protein fragment region | Break-point amino acid sequence |
NM_001257 | 713 aa (SEQ ID NO: 106) | 123~713 aa (591 aa) (SEQ ID NO: 107) | DIFKFAR (SEQ ID NO: 108) |
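The fusion gene encoding the 'KRAS-CDH13 fusion protein in which the KRAS protein or a fragment thereof is fused to the CDH13 protein or a fragment thereof' (the KRAS-CDH13 fusion gene) may comprise, at the 5' end, a polynucleotide molecule encoding the KRAS protein or fragment thereof as described above and, at the 3' end, a polynucleotide molecule encoding the CDH13 protein or fragment thereof as described above. In an embodiment, the KRAS-CDH13 fusion gene may be a fusion gene in which the nucleotide sequence from the first exon through exon 4 of NM_004985, NM_033360, or the like, on the 5' side, is joined to the nucleotide sequence from exon 5 through the last exon of NM_001257, on the 3' side. More specifically, the KRAS-CDH13 fusion gene may be a fusion gene (SEQ ID NO: 109; fusion site: SEQ ID NO: 110) in which the nucleotide sequence from position 182 to position 631 of NM_004985 (SEQ ID NO: 98), on the 5' side, is joined to the nucleotide sequence from position 666 to position 2441 of NM_001257 (SEQ ID NO: 104), on the 3' side (see FIG. 53).
The KRAS-CDH13 fusion protein is a fusion protein in which the KRAS protein or fragment as described above, on the N-terminal side, is linked to the CDH13 protein or fragment as described above, on the C-terminal side. For example, it may have an amino acid sequence encoded by a fusion gene in which the nucleotide sequence from the first exon through exon 4 of NM_004985, NM_033360, or the like, on the 5' side, is joined to the nucleotide sequence from exon 5 through the last exon of NM_001257, on the 3' side. In an embodiment, it may be the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 109 (SEQ ID NO: 111; fusion site: SEQ ID NO: 112; see FIG. 54), or a polypeptide molecule having at least 90%, specifically at least 95%, more specifically at least 99% sequence homology to that sequence. An example of the KRAS-CDH13 fusion gene is shown schematically in FIG. 8.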
The ZFYVE9 gene encoding the ZFYVE9 (zinc finger, FYVE domain containing 9) protein may be of human origin and is located on human chromosome 1 (p32.3); the ZFYVE9 protein is the protein encoded by it. The ZFYVE9 protein or a fragment thereof is the N-terminal fusion partner of the ZFYVE9-CGA fusion protein. In an embodiment, the ZFYVE9 gene may have a nucleotide sequence provided under GenBank accession no. NM_007324, NM_004799, or the like, and the ZFYVE9 protein may have an amino acid sequence encoded by any one of these nucleotide sequences.
The fragment of the ZFYVE9 protein may have an amino acid sequence encoded by the nucleotide sequence from the first exon through exon 16 (bases 52803444-52803606 on chromosome 1, (+) strand) of the above nucleotide sequence, and may be in a form in which two nucleotides (gg) that do not form a codon are present at the 3' end of exon 16. In an embodiment, the genes encoding the ZFYVE9 protein fragment and the ZFYVE9 proteins encoded thereby are summarized in Tables 30 and 31 below:
ZFYVE9 gene (Accession No.) | CDS of ZFYVE9 protein | Coding region of ZFYVE9 protein fragment (by exon) | Coding region of ZFYVE9 protein fragment (by cDNA position) | Break-point position on chromosome | Break-point nucleotide sequence |
NM_007324 | 173~4273 (4101 bp) (SEQ ID NO: 113) | First exon through exon 16 | 173~3828 (3654 bp + 2 nt (gg); 3656 bp in total) (SEQ ID NO: 114) | chr1:52803606] (3' end of exon 16) | gacaagaacgttagcaaggg (SEQ ID NO: 115) |
NM_004799 | 173~4450 (4278 bp) | | 173~4005 (3833 bp) | | |
ZFYVE9 gene (Accession No.) | Full size of ZFYVE9 protein (aa) | ZFYVE9 protein fragment region | Break-point amino acid sequence |
NM_007324 | 1366 aa (SEQ ID NO: 116) | 1~1218 aa + 2 nt (gg) (amino acid sequence: SEQ ID NO: 117) | DKNVSK + 2 nt (gg) (amino acid sequence: SEQ ID NO: 118) |
NM_004799 | 1425 aa | 1~1277 aa + 2 nt | |
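The CGA gene encoding the CGA (glycoprotein hormones, alpha polypeptide) protein may be of human origin and is located on human chromosome 6 (q14.3); the CGA protein is the protein encoded by it. The CGA protein or a fragment thereof is the C-terminal fusion partner of the ZFYVE9-CGA fusion protein. In an embodiment, the CGA gene may have the nucleotide sequence provided under GenBank accession no. NM_000735, and the CGA protein may have the amino acid sequence encoded by that nucleotide sequence.
The fragment of the CGA protein may have an amino acid sequence encoded by the nucleotide sequence from exon 2 (bases 87797831-87797925 on chromosome 6, (-) strand) through the last exon of the above nucleotide sequence. Here, the first seven nucleotides (gagcgcc) at the 5' end of exon 2 are 5' UTR; upon fusion with the ZFYVE9 protein fragment they may be joined to the two additional nucleotides (gg) carried by the ZFYVE9 protein fragment, as described above, to encode a total of three amino acids (GSA). In an embodiment, the gene encoding the CGA protein fragment and the CGA protein encoded thereby are summarized in Tables 32 and 33 below:
CGA gene (Accession No.) | CDS of CGA protein | Coding region of CGA protein fragment (by exon) | Coding region of CGA protein fragment (by cDNA position) | Break-point position on chromosome | Break-point nucleotide sequence |
NM_000735 | 143~493 (351 bp) (SEQ ID NO: 119) | Exon 2 through the last exon | 5' UTR (7 bp) + 143~493 (358 bp in total) (SEQ ID NO: 120) | chr6:87797925] (5' end of exon 2) | gagcgcc (SEQ ID NO: 121) |
Full size of CGA protein (aa) | CGA protein fragment region | Break-point amino acid sequence |
116 aa (SEQ ID NO: 122) | 116 aa (SEQ ID NO: 123) | Break point occurs in the UTR |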
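The fusion gene encoding the 'ZFYVE9-CGA fusion protein in which the ZFYVE9 protein or a fragment thereof is fused to the CGA protein or a fragment thereof' (the ZFYVE9-CGA fusion gene) may comprise, at the 5' end, a polynucleotide molecule encoding the ZFYVE9 protein or fragment thereof as described above and, at the 3' end, a polynucleotide molecule encoding the CGA protein or fragment thereof as described above. In an embodiment, the ZFYVE9-CGA fusion gene may be a fusion gene in which the nucleotide sequence from the first exon through exon 16 of NM_007324, NM_004799, or the like, on the 5' side, is joined to the nucleotide sequence from the second exon through the last exon of NM_000735, including its 5' UTR (7 bp), on the 3' side. More specifically, the ZFYVE9-CGA fusion gene may be a fusion gene (SEQ ID NO: 124; fusion site: SEQ ID NO: 125) in which the nucleotide sequence from position 173 to position 3828 of NM_007324 (SEQ ID NO: 114), on the 5' side, is joined to the nucleotide sequence from position 136 to position 493 of NM_000735 (SEQ ID NO: 120), on the 3' side (see FIGS. 59a and 59b).
The ZFYVE9-CGA fusion protein is a fusion protein in which the ZFYVE9 protein or fragment as described above, on the N-terminal side, is linked to the CGA protein or fragment as described above, on the C-terminal side. For example, it may have an amino acid sequence encoded by a fusion gene in which the nucleotide sequence from the first exon through exon 16 of NM_007324, NM_004799, or the like, on the 5' side, is joined to the nucleotide sequence from the second exon through the last exon of NM_000735, including its 5' UTR (7 bp), on the 3' side. In an embodiment, it may be the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 124 (SEQ ID NO: 126; fusion site: SEQ ID NO: 127; see FIG. 60), or a polypeptide molecule having at least 90%, specifically at least 95%, more specifically at least 99% sequence homology to that sequence. An example of the ZFYVE9-CGA fusion gene is shown schematically in FIG. 9.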
The ERBB2IP gene encoding the ERBB2IP (erbb2 interacting protein) protein may be of human origin and is located on human chromosome 5 (q12.3); the ERBB2IP protein is the protein encoded by it. The ERBB2IP protein or a fragment thereof is the N-terminal fusion partner of the ERBB2IP-MAST4 fusion protein. In an embodiment, the ERBB2IP gene may have a nucleotide sequence provided under GenBank accession no. NM_018695, NM_001006600, or the like, and the ERBB2IP protein may have an amino acid sequence encoded by any one of these nucleotide sequences.
The fragment of the ERBB2IP protein may have an amino acid sequence encoded by the nucleotide sequence from the first exon through exon 26 (bases 65372703-65372777 on chromosome 5, (+) strand) of the above nucleotide sequence. In an embodiment, the genes encoding the ERBB2IP protein fragment and the ERBB2IP proteins encoded thereby are summarized in Tables 34 and 35 below:
ERBB2IP gene (Accession No.) | CDS of ERBB2IP protein | Coding region of ERBB2IP protein fragment (by exon) | Coding region of ERBB2IP protein fragment (by cDNA position) | Break-point position on chromosome | Break-point nucleotide sequence |
NM_001006600 | 311~4219 (3909 bp) (SEQ ID NO: 128) | First exon through exon 26 | 311~4111 (3801 bp) (SEQ ID NO: 129) | chr5:65372777] (3' end of exon 26) | cagccaggtgataaaattattcag (SEQ ID NO: 130) |
NM_018695 | 311~4426 (4116 bp) | | 311~4318 (4008 bp) | | |
ERBB2IP gene (Accession No.) | Full size of ERBB2IP protein (aa) | ERBB2IP protein fragment region | Break-point amino acid sequence |
NM_001006600 | 1302 aa (SEQ ID NO: 131) | 1~1267 aa (SEQ ID NO: 132) | QPGDKIIQ (SEQ ID NO: 133) |
NM_018695 | 1371 aa | 1~1336 aa | |
The MAST4 gene encoding the MAST4 (microtubule associated serine/threonine kinase family member 4) protein may be of human origin and is located on human chromosome 5 (q12.3); the MAST4 protein is the protein encoded by it. The MAST4 protein or a fragment thereof is the C-terminal fusion partner of the ERBB2IP-MAST4 fusion protein. In an embodiment, the MAST4 gene may have a nucleotide sequence provided under GenBank accession no. NM_001164664, NM_015183, or the like, and the MAST4 protein may have an amino acid sequence encoded by any one of these nucleotide sequences.
The fragment of the MAST4 protein may have an amino acid sequence encoded by the nucleotide sequence from exon 13 (bases 66400194-66400403 on chromosome 5, (+) strand) through the last exon of the above nucleotide sequence. In an embodiment, the genes encoding the MAST4 protein fragment and the MAST4 proteins encoded thereby are summarized in Tables 36 and 37 below:
MAST4 gene (Accession No.) | CDS of MAST4 protein | Coding region of MAST4 protein fragment (by exon) | Coding region of MAST4 protein fragment (by cDNA position) | Break-point position on chromosome | Break-point nucleotide sequence |
NM_001164664 | 309~8180 (7872 bp) (SEQ ID NO: 134) | Exon 13 through the last exon | 1455~8180 (6726 bp) (SEQ ID NO: 135) | chr5:[66400194 (5' end of exon 13) | gctacagctcagatggaagaacgt (SEQ ID NO: 136) |
NM_015183 | 69~7373 (7065 bp) | | 648~7373 (6726 bp) | | |
MAST4 gene (Accession No.) | Full size of MAST4 protein (aa) | MAST4 protein fragment region | Break-point amino acid sequence |
NM_001164664 | 2623 aa (SEQ ID NO: 137) | 383~2623 aa (2241 aa in total) (SEQ ID NO: 138) | ATAQMEER (SEQ ID NO: 139) |
NM_015183 | 2623 aa | 383~2623 aa (2241 aa) | |
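The fusion gene encoding the 'ERBB2IP-MAST4 fusion protein in which the ERBB2IP protein or a fragment thereof is fused to the MAST4 protein or a fragment thereof' (the ERBB2IP-MAST4 fusion gene) may comprise, at the 5' end, a polynucleotide molecule encoding the ERBB2IP protein or fragment thereof as described above and, at the 3' end, a polynucleotide molecule encoding the MAST4 protein or fragment thereof as described above. In an embodiment, the ERBB2IP-MAST4 fusion gene may be a fusion gene in which the nucleotide sequence from the first exon through exon 26 of NM_018695, NM_001006600, or the like, on the 5' side, is joined to the nucleotide sequence from exon 13 through the last exon of NM_001164664, NM_015183, or the like, on the 3' side. More specifically, the ERBB2IP-MAST4 fusion gene may be a fusion gene (SEQ ID NO: 140; fusion site: SEQ ID NO: 141) in which the nucleotide sequence from position 311 to position 4111 of NM_001006600 (SEQ ID NO: 129), on the 5' side, is joined to the nucleotide sequence from position 1455 to position 8180 of NM_001164664 (SEQ ID NO: 135), on the 3' side (see FIGS. 65a, 65b and 65c).
The ERBB2IP-MAST4 fusion protein is a fusion protein in which the ERBB2IP protein or fragment as described above, on the N-terminal side, is linked to the MAST4 protein or fragment as described above, on the C-terminal side. For example, it may have an amino acid sequence encoded by a fusion gene in which the nucleotide sequence from the first exon through exon 26 of NM_018695, NM_001006600, or the like, on the 5' side, is joined to the nucleotide sequence from exon 13 through the last exon of NM_001164664, NM_015183, or the like, on the 3' side. In an embodiment, it may be the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 140 (SEQ ID NO: 142; fusion site: SEQ ID NO: 143; see FIGS. 66a and 66b), or a polypeptide molecule having at least 90%, specifically at least 95%, more specifically at least 99% sequence homology to that sequence. An example of the ERBB2IP-MAST4 fusion gene is shown schematically in FIG. 10.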
The TPD52L1 gene encoding the TPD52L1 (tumor protein D52-like 1) protein may be of human origin and is located on human chromosome 6 (q22.31); the TPD52L1 protein is the protein encoded by it. The TPD52L1 protein or a fragment thereof is the N-terminal fusion partner of the TPD52L1-TRMT11 fusion protein. In an embodiment, the TPD52L1 gene may have a nucleotide sequence provided under GenBank accession no. NM_003287, NM_001003396, NM_001003397, NM_001003395, or the like, and the TPD52L1 protein may have an amino acid sequence encoded by any one of these nucleotide sequences.
The fragment of the TPD52L1 protein may have an amino acid sequence encoded by the nucleotide sequence from the first exon through exon 5 (bases 125569428-125569529 on chromosome 6, (+) strand) of the above nucleotide sequence, and may be in a form in which two nucleotides (ag) that do not form a codon are present at the 3' end of exon 5. In an embodiment, the genes encoding the TPD52L1 protein fragment and the TPD52L1 proteins encoded thereby are summarized in Tables 38 and 39 below:
TPD52L1 gene (Accession No.) | CDS of TPD52L1 protein | Coding region of TPD52L1 protein fragment (by exon) | Coding region of TPD52L1 protein fragment (by cDNA position) | Break-point position on chromosome | Break-point nucleotide sequence |
NM_001003395 | 328~855 (528 bp) (SEQ ID NO: 144) | First exon through exon 5 | 328~626 (297 bp + 2 nt (ag); 299 bp in total) (SEQ ID NO: 145) | chr6:125569529] (3' end of exon 5) | tcagcaagaagttcggagacatgag (SEQ ID NO: 146) |
NM_001003396 | 220~654 (435 bp) | | 220~605 (386 bp) | | |
NM_001003397 | 220~615 (396 bp) | | 220~605 (386 bp) | | |
NM_003287 | 220~834 (615 bp) | | 220~605 (386 bp) | | |
TPD52L1 gene (Accession No.) | Full size of TPD52L1 protein (aa) | TPD52L1 protein fragment region | Break-point amino acid sequence |
NM_001003395 | 175 aa (SEQ ID NO: 147) | 1~99 aa + 2 nt (ag) (amino acid sequence: SEQ ID NO: 148) | SKKFGDM + 2 nt (ag) (amino acid sequence: SEQ ID NO: 149) |
NM_001003396 | 144 aa | 1~128 aa | |
NM_001003397 | 131 aa | 1~128 aa | |
NM_003287 | 204 aa | 1~128 aa | |
The TRMT11 gene encoding the TRMT11 (tRNA methyltransferase 11 homolog) protein may be of human origin and is located on human chromosome 6 (q22.32); the TRMT11 protein is the protein encoded by it. The TRMT11 protein or a fragment thereof is the C-terminal fusion partner of the TPD52L1-TRMT11 fusion protein. In an embodiment, the TRMT11 gene may have the nucleotide sequence provided under GenBank accession no. NM_001031712, and the TRMT11 protein may have the amino acid sequence encoded by that nucleotide sequence.
The fragment of the TRMT11 protein may have an amino acid sequence encoded by the nucleotide sequence from exon 12 (bases 126342306-126342426 on chromosome 6, (+) strand) through the last exon of the above nucleotide sequence. Here, the first nucleotide (a) at the 5' end of exon 12 may be in a form that does not constitute a codon; upon fusion with the TPD52L1 protein fragment it may be joined to the two additional nucleotides (ag) carried by the TPD52L1 protein fragment, as described above, to form a codon (aga) encoding a single amino acid (R). In an embodiment, the gene encoding the TRMT11 protein fragment and the TRMT11 protein encoded thereby are summarized in Tables 40 and 41 below:
TRMT11 gene (Accession No.) | CDS of TRMT11 protein | Coding region of TRMT11 protein fragment (by exon) | Coding region of TRMT11 protein fragment (by cDNA position) | Break-point position on chromosome | Break-point nucleotide sequence |
NM_001031712 | 122~1513 (1392 bp) (SEQ ID NO: 150) | Exon 12 through the last exon | 1261~1513 (1 nt (a) + 252 bp; 253 bp in total) (SEQ ID NO: 151) | chr6:[126342306 (5' end of exon 12) | atacactgaagagatggtgcct (SEQ ID NO: 152) |
TRMT11 gene (Accession No.) | Full size of TRMT11 protein (aa) | TRMT11 protein fragment region | Break-point amino acid sequence |
NM_001031712 | 463 aa (SEQ ID NO: 153) | 1 nt (a) + 381~463 aa (83 aa in total) (amino acid sequence: SEQ ID NO: 154) | 1 nt (a) + YTEEMVP (amino acid sequence: SEQ ID NO: 155) |
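The fusion gene encoding the 'TPD52L1-TRMT11 fusion protein in which the TPD52L1 protein or a fragment thereof is fused to the TRMT11 protein or a fragment thereof' (the TPD52L1-TRMT11 fusion gene) may comprise, at the 5' end, a polynucleotide molecule encoding the TPD52L1 protein or fragment thereof as described above and, at the 3' end, a polynucleotide molecule encoding the TRMT11 protein or fragment thereof as described above. In an embodiment, the TPD52L1-TRMT11 fusion gene may be a fusion gene in which the nucleotide sequence from the first exon through exon 5 of NM_003287, NM_001003396, NM_001003397, NM_001003395, or the like, on the 5' side, is joined to the nucleotide sequence from exon 12 through the last exon of NM_001031712, on the 3' side. More specifically, the TPD52L1-TRMT11 fusion gene may be a fusion gene (SEQ ID NO: 156; fusion site: SEQ ID NO: 157) in which the nucleotide sequence from position 328 to position 626 of NM_001003395 (SEQ ID NO: 145), on the 5' side, is joined to the nucleotide sequence from position 1261 to position 1513 of NM_001031712 (SEQ ID NO: 151), on the 3' side (see FIG. 71).
The TPD52L1-TRMT11 fusion protein is a fusion protein in which the TPD52L1 protein or fragment as described above, on the N-terminal side, is linked to the TRMT11 protein or fragment as described above, on the C-terminal side. For example, it may have an amino acid sequence encoded by a fusion gene in which the nucleotide sequence from the first exon through exon 5 of NM_003287, NM_001003396, NM_001003397, NM_001003395, or the like, on the 5' side, is joined to the nucleotide sequence from exon 12 through the last exon of NM_001031712, on the 3' side. In an embodiment, it may be the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 156 (SEQ ID NO: 158; fusion site: SEQ ID NO: 159; see FIG. 72), or a polypeptide molecule having at least 90%, specifically at least 95%, more specifically at least 99% sequence homology to that sequence. An example of the TPD52L1-TRMT11 fusion gene is shown schematically in FIG. 11.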
The TXNRD1 gene encoding the TXNRD1 (thioredoxin reductase 1) protein may be of human origin and is located on human chromosome 12 (q23.3); the TXNRD1 protein is the protein encoded by it. The TXNRD1 protein or a fragment thereof is the N-terminal fusion partner of the TXNRD1-GPR133 fusion protein. In an embodiment, the TXNRD1 gene may have a nucleotide sequence provided under GenBank accession no. NM_003330, NM_001093771, NM_182729, NM_182743, NM_182742, or the like, and the TXNRD1 protein may have an amino acid sequence encoded by any one of these nucleotide sequences.
The fragment of the TXNRD1 protein may have an amino acid sequence encoded by the nucleotide sequence from the first exon through exon 17 (bases 104732917-104733051 on chromosome 12, (+) strand) of the above nucleotide sequence. In an embodiment, the genes encoding the TXNRD1 protein fragment and the TXNRD1 proteins encoded thereby are summarized in Tables 42 and 43 below:
TXNRD1 gene (Accession No.) | CDS of TXNRD1 protein | Coding region of TXNRD1 protein fragment (by exon) | Coding region of TXNRD1 protein fragment (by cDNA position) | Break-point position on chromosome | Break-point nucleotide sequence |
NM_003330 | 656~2311 (1656 bp) (SEQ ID NO: 160) | First exon through exon 17 | 656~2242 (1587 bp) (SEQ ID NO: 161) | chr12:104733051] (3' end of exon 17) | aatccaccctgtctgtgcagag (SEQ ID NO: 162) |
NM_001093771 | 25~1974 (1950 bp) | | 25~1905 (1881 bp) | | |
NM_182729 | 527~2074 (1548 bp) | | 527~2005 (1479 bp) | | |
NM_182743 | 465~1964 (1500 bp) | | 465~1895 (1431 bp) | | |
NM_182742 | 702~2201 (1500 bp) | | 702~2132 (1431 bp) | | |
TXNRD1 gene (Accession No.) | Full size of TXNRD1 protein (aa) | TXNRD1 protein fragment region | Break-point amino acid sequence |
NM_003330 | 551 aa (SEQ ID NO: 163) | 1~529 aa (SEQ ID NO: 164) | IHPVCAE (SEQ ID NO: 165) |
NM_001093771 | 649 aa | 1~627 aa | |
NM_182729 | 499 aa | 1~477 aa | |
NM_182743 | 499 aa | 1~477 aa | |
NM_182742 | 499 aa | 1~477 aa | |
The GPR133 gene encoding the GPR133 (G protein-coupled receptor 133) protein may be of human origin and is located on human chromosome 12 (q24.33); the GPR133 protein is the protein encoded by it. The GPR133 protein or a fragment thereof is the C-terminal fusion partner of the TXNRD1-GPR133 fusion protein. In an embodiment, the GPR133 gene may have the nucleotide sequence provided under GenBank accession no. NM_198827, and the GPR133 protein may have the amino acid sequence encoded by that nucleotide sequence.
The fragment of the GPR133 protein may have an amino acid sequence encoded by the nucleotide sequence from exon 14 (bases 131561346-131561419 on chromosome 12, (+) strand) through the last exon of the above nucleotide sequence. In an embodiment, the gene encoding the GPR133 protein fragment and the GPR133 protein encoded thereby are summarized in Tables 44 and 45 below:
GPR133 gene (Accession No.) | CDS of GPR133 protein | Coding region of GPR133 protein fragment (by exon) | Coding region of GPR133 protein fragment (by cDNA position) | Break-point position on chromosome | Break-point nucleotide sequence |
NM_198827 | 560~3184 (2625 bp) (SEQ ID NO: 166) | Exon 14 through the last exon | 2033~3184 (1152 bp) (SEQ ID NO: 167) | chr12:[131561346 (5' end of exon 14) | acacgtaagcagcac (SEQ ID NO: 168) |
GPR133 gene (Accession No.) | Full size of GPR133 protein (aa) | GPR133 protein fragment region | Break-point amino acid sequence |
NM_198827 | 874 aa (SEQ ID NO: 169) | 492~874 aa (383 aa in total) (SEQ ID NO: 170) | TRKQHS (SEQ ID NO: 171) |
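The fusion gene encoding the 'TXNRD1-GPR133 fusion protein in which the TXNRD1 protein or a fragment thereof is fused to the GPR133 protein or a fragment thereof' (the TXNRD1-GPR133 fusion gene) may comprise, at the 5' end, a polynucleotide molecule encoding the TXNRD1 protein or fragment thereof as described above and, at the 3' end, a polynucleotide molecule encoding the GPR133 protein or fragment thereof as described above. In an embodiment, the TXNRD1-GPR133 fusion gene may be a fusion gene in which the nucleotide sequence from the first exon through exon 17 of NM_003330, NM_001093771, NM_182729, NM_182743, NM_182742, or the like, on the 5' side, is joined to the nucleotide sequence from exon 14 through the last exon of NM_198827, on the 3' side. More specifically, the TXNRD1-GPR133 fusion gene may be a fusion gene (SEQ ID NO: 172; fusion site: SEQ ID NO: 173) in which the nucleotide sequence from position 656 to position 2242 of NM_003330 (SEQ ID NO: 161), on the 5' side, is joined to the nucleotide sequence from position 2033 to position 3184 of NM_198827 (SEQ ID NO: 167), on the 3' side.
The TXNRD1-GPR133 fusion protein is a fusion protein in which the TXNRD1 protein or fragment as described above, on the N-terminal side, is linked to the GPR133 protein or fragment as described above, on the C-terminal side. For example, it may have an amino acid sequence encoded by a fusion gene in which the nucleotide sequence from the first exon through exon 17 of NM_003330, NM_001093771, NM_182729, NM_182743, NM_182742, or the like, on the 5' side, is joined to the nucleotide sequence from exon 14 through the last exon of NM_198827, on the 3' side. In an embodiment, it may be the amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 172 (SEQ ID NO: 174; fusion site: SEQ ID NO: 175), or a polypeptide molecule having at least 90%, specifically at least 95%, more specifically at least 99% sequence homology to that sequence. An example of the TXNRD1-GPR133 fusion gene is shown schematically in FIG. 12.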
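As used herein, the first exon and the last exon mean, regardless of exon number, the exon located first and the exon located last, respectively, in the nucleotide sequence of a given accession number; exon numbers follow the numbering assigned in the NCBI sequence records.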
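Meanwhile, it was confirmed that in a fusion gene in which the 5' UTR region of the SCAF11 gene is fused to the PDGFRA gene or a fragment thereof, the expression level of the PDGFRA gene or fragment thereof is markedly increased, and that this phenomenon is observed specifically in cancer patients. Accordingly, another embodiment of the present invention provides a polynucleotide molecule in which the 5' UTR region of the SCAF11 gene is fused to the PDGFRA gene or a fragment thereof (the SCAF11-PDGFRA fusion gene), and the use of this polynucleotide molecule as a cancer diagnostic marker.
The SCAF11 gene encoding the SCAF11 (SR-related CTD-associated factor 11) protein may be of human origin and is located on human chromosome 12 (q12). The 5' UTR region of SCAF11 is the 5' fusion partner of the SCAF11-PDGFRA fusion gene. In an embodiment, the SCAF11 gene may have the nucleotide sequence provided under GenBank accession no. NM_004719, and the 5' UTR region of SCAF11 may be a polynucleotide molecule having the nucleotide sequence from position 1 to position 266 of NM_004719 (corresponding to exon 1; chromosomal position ((-) strand): chr12:46384136-46384401; chromosomal break point: chr12:[46384136; SEQ ID NO: 176; fusion site: SEQ ID NO: 177) (see FIG. 79).
The PDGFRA gene encoding the PDGFRA (platelet-derived growth factor receptor, alpha polypeptide) protein may be of human origin and is located on human chromosome 4 (q12). The PDGFRA gene or a fragment thereof is the 3' fusion partner of the SCAF11-PDGFRA fusion gene. In an embodiment, the PDGFRA gene may have the nucleotide sequence provided under GenBank accession no. NM_006206. The PDGFRA gene fragment comprises the CDS of NM_006206 (positions 332 to 3601; 3270 bp in total) and may additionally comprise, at the 5' end of the CDS, a 212 bp 5' UTR region (positions 120 to 331 of NM_006206). This fragment can be expressed as the gene fragment extending from exon 2 of NM_006206 (chromosomal position ((+) strand): chr4:55124924-55124984; chromosomal break point: chr12:120180269]) through the last exon, and may be a polynucleotide molecule having the sequence of SEQ ID NO: 178 (fusion site: SEQ ID NO: 179) (see FIG. 80).
The SCAF11-PDGFRA fusion gene may have the nucleotide sequence of SEQ ID NO: 180 (fusion site: SEQ ID NO: 181) (see FIG. 81).
An example of the SCAF11-PDGFRA fusion gene is shown schematically in FIG. 2.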
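Unless otherwise stated, all nucleotide sequences described herein that were determined by sequencing a DNA molecule were determined using an automated DNA sequencer (e.g., the Model 373 manufactured by Applied Biosystems, Inc.), and all amino acid sequences encoded by the determined nucleotide sequences were determined using an automated peptide sequencer. A nucleotide sequence determined by this automated approach may contain some errors relative to the actual sequence. For example, automatically determined nucleotide sequences typically have at least about 90%, specifically at least about 95%, more specifically at least about 99%, and still more specifically at least about 99.9% sequence homology to the actual nucleotide sequence of the sequenced DNA molecule. The determined nucleotide sequence may contain a single insertion or deletion relative to the actual sequence; such an insertion or deletion causes a frame shift when the nucleotide sequence is translated, so that the encoded amino acid sequence may differ completely from the actual amino acid sequence.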
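The effect of a single inserted base on the translated sequence, as described above, can be illustrated with a short sketch. The example reuses the APLP2 break-point sequence (SEQ ID NO: 51) given earlier in this specification; the inserted base and its position are arbitrary choices for illustration, and the helper names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt: str) -> str:
    """Translate from the first base; a trailing partial codon is ignored."""
    nt = nt.upper()
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


def insert_base(nt: str, position: int, base: str) -> str:
    """Insert one base after the given number of leading nucleotides."""
    return nt[:position] + base + nt[position:]


actual = "gcggcccagatgaaatcccag"          # in-frame, 7 codons
with_error = insert_base(actual, 3, "a")  # simulated single-base insertion

print(translate(actual))      # reading frame intact
print(translate(with_error))  # every codon after the insertion is shifted
```

In this sketch only the first residue survives; all residues downstream of the insertion change, which is the frame-shift effect the paragraph above describes.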
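Since the fusion proteins and/or fusion genes according to the present invention have been confirmed to be found or expressed specifically in patients with solid cancer, specifically lung cancer, and in particular non-small cell lung cancer (NSCLC) such as lung adenocarcinoma, the fusion proteins and/or the fusion genes encoding them and/or the SCAF11-PDGFRA fusion gene are useful as diagnostic markers for solid cancer, specifically lung cancer, and in particular non-small cell lung cancer (NSCLC) such as lung adenocarcinoma.
Accordingly, another embodiment of the present invention provides a composition for diagnosing cancer comprising a molecule that binds specifically to the fusion protein and/or a polynucleotide capable of hybridizing with the fusion gene.
The molecule that binds specifically to the fusion protein may be selected from the group consisting of antibodies, aptamers, and the like.
In addition, the polynucleotide capable of hybridizing with the fusion gene encoding the fusion protein, or with the SCAF11-PDGFRA fusion gene, means a polynucleotide that can amplify a DNA molecule consisting of 50 to 250, specifically 100 to 200, consecutive bases that include the fusion site within the fusion gene. To this end, the polynucleotide has a base sequence that is fully complementary, or at least 80% (e.g., 80-100%), specifically at least 90% (e.g., 90-100%), complementary, to a sequence of 20 to 100, specifically 25 to 50, bases adjacent to either end of that DNA molecule or to the complement of such a sequence, and is thereby capable of binding specifically to the DNA molecule. For example, primer pairs capable of hybridizing to each fusion gene are listed in Table 46 below.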
Fusion gene | Fusion site | Forward Primer Sequence | Reverse Primer Sequence |
CCDC6-ROS1 | SEQ ID NO: 14 | CCTGCAGGAAAAATTAGACCAG (SEQ ID NO: 182) | AGCTCAGCCAACTCTTTGTCTT (SEQ ID NO: 183) |
SCAF11-PDGFRA | SEQ ID NO: 181 | CAGCGGAGTCAGTGTCCTAGAG (SEQ ID NO: 184) | TGAGAAGACAGCCTAAGACCAG (SEQ ID NO: 185) |
FGFR2-CIT | SEQ ID NO: 30 | ACATGATGATGAGGGACTGTTG (SEQ ID NO: 186) | ACAGCTGTTACGAAGAGCATCA (SEQ ID NO: 187) |
AXL-MBIP | SEQ ID NO: 46 | GCCTGACGAAATCCTCTATGTC (SEQ ID NO: 188) | CAAAATTCCCTGACGTTGTTTT (SEQ ID NO: 189) |
APLP2-TNFSF11 | SEQ ID NO: 62 | TGCTGAGAACAAAGATCGCTTA (SEQ ID NO: 190) | TGTCGGTGGCATTAATAGTGAG (SEQ ID NO: 191) |
MAP4K3-PRKCE | SEQ ID NO: 78 | AGGAGGACTTCGAGCTGATTC (SEQ ID NO: 192) | ACGACCCTGAGAGATCGATGA (SEQ ID NO: 193) |
BCAS3-MAP3K3 | SEQ ID NO: 94 | CATCCCGTCCAGTCTCTGAT (SEQ ID NO: 194) | CTGCCTATTTGAGTGACCTGTG (SEQ ID NO: 195) |
KRAS-CDH13 | SEQ ID NO: 110 | GGAAATAAATGTGATTTGCCTTC (SEQ ID NO: 196) | AAGGCTGTCTCTGATTCTCTGG (SEQ ID NO: 197) |
ZFYVE9-CGA | SEQ ID NO: 126 | ACTGCAGAGAACATGGATTCCT (SEQ ID NO: 198) | GAATGGAGAACATGCAGAAACA (SEQ ID NO: 199) |
ERBB2IP-MAST4 | SEQ ID NO: 141 | AACAAGGGTACAACCTGAAGGA (SEQ ID NO: 200) | TCAAGGAAGTATCGTGAGGTGA (SEQ ID NO: 201) |
TPD52L1-TRMT11 | SEQ ID NO: 157 | GAAAACACATGAAACCCTGAGTC (SEQ ID NO: 202) | ATGTGTGACTGGAAAGCTTCTG (SEQ ID NO: 203) |
TXNRD1-GPR133 | SEQ ID NO: 173 | TCCAAATGCTGGAGAAGTTACA (SEQ ID NO: 204) | AGTACACGAAGACTCGGTTGCT (SEQ ID NO: 205) |
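The 80% complementarity criterion stated before Table 46 can be checked with a minimal sketch such as the following. It assumes a simple ungapped, position-by-position comparison between a primer and an equally long target site; the specification does not prescribe how complementarity is computed, and the function names and the mismatched site are hypothetical. The primer used is the SCAF11-PDGFRA reverse primer from Table 46.

```python
# Translation table mapping each base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def percent_complementarity(primer: str, site: str) -> float:
    """Percentage of primer positions that pair with the target site.

    Both sequences are written 5'->3'; the primer anneals antiparallel,
    so it is compared with the reverse complement of the site.
    """
    if len(primer) != len(site):
        raise ValueError("this sketch compares equal-length sequences only")
    expected = reverse_complement(site)
    matches = sum(p == e for p, e in zip(primer.upper(), expected))
    return 100.0 * matches / len(primer)


def can_hybridize(primer: str, site: str, threshold: float = 80.0) -> bool:
    """Apply the 'at least 80% complementary' criterion from the text."""
    return percent_complementarity(primer, site) >= threshold


reverse_primer = "TGAGAAGACAGCCTAAGACCAG"   # SEQ ID NO: 185 (Table 46)
site = reverse_complement(reverse_primer)   # perfectly matching target site
mismatched_site = "AA" + site[2:]           # hypothetical site with 2 mismatches

print(percent_complementarity(reverse_primer, site))  # 100.0
print(can_hybridize(reverse_primer, mismatched_site))  # True (20 of 22 positions pair)
```

Real primer design would also consider melting temperature and mismatch position, which this sketch deliberately leaves out.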
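Another embodiment provides a method of providing information for cancer diagnosis, comprising measuring expression of the fusion protein in a biological sample obtained from a patient.
In the method of providing information for cancer diagnosis, when expression of the fusion protein is detected in the biological sample, the patient may be determined to be a patient with cancer (solid cancer), specifically lung cancer, more specifically non-small cell lung cancer (NSCLC), and in particular lung adenocarcinoma.
Detection of fusion protein expression may be performed by detecting the presence of the fusion protein in the biological sample, or by detecting the presence of the fusion gene encoding the fusion protein or the corresponding mRNA. For example, the presence of the fusion protein can be detected with a conventional assay that uses a molecule specifically binding to the fusion protein (for example, an antibody or aptamer) and detects the interaction between the fusion protein and that molecule, such as immunochromatography, immunohistochemical staining, enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA), enzyme immunoassay (EIA), fluorescence immunoassay (FIA), luminescence immunoassay (LIA), Western blotting, or FACS.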
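An immunoassay may be performed as the conventional assay for detecting protein expression. Useful immunoassays may be homogeneous or heterogeneous. In a homogeneous assay, the immunological reaction usually involves a fusion protein-specific reagent (for example, a fusion polypeptide-specific antibody), a labeled analyte, and the biological sample to be analyzed. The signal arising from the label is modified, directly or indirectly, upon binding of the antibody to the labeled analyte. Both the immunological reaction and the detection of its extent are carried out in a homogeneous solution. The immunochemical labels that may be used may be one or more selected from the group consisting of free radicals, radioisotopes, fluorescent dyes, enzymes, bacteriophages, coenzymes, and the like.
Antibodies useful in practicing the methods described herein may be attached to a solid support suitable for a diagnostic assay (for example, wells, beads, plates, or slides made of materials such as latex or polystyrene) according to known techniques such as precipitation. Antibodies or other fusion protein-binding reagents may likewise be attached, according to known techniques, to detectable groups such as radiolabels (for example, 35S, 125I, 131I), enzyme labels (for example, horseradish peroxidase, alkaline phosphatase), and fluorescent labels (for example, fluorescein).
Cell-based assays such as flow cytometry (FC), immunohistochemistry (IHC), or immunofluorescence (IF) are particularly desirable for practicing the methods of the present invention, because such assay formats are clinically suitable, allow detection of expression of the kinase polypeptide within the fusion protein in vivo, and avoid the risk of artifactual changes in activity that result from manipulating cells obtained from, for example, a tumor sample in order to prepare extracts. Accordingly, in some preferred embodiments, the methods of the present invention are implemented in a cell-based assay format such as flow cytometry (FC), immunohistochemistry (IHC), or immunofluorescence (IF).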
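Flow cytometry (FC) may be employed to determine expression of the kinase polypeptide within the fusion protein in a mammalian tumor before, during, and after treatment with a targeted drug that inhibits the kinase activity of the fusion protein. For example, tumor cells derived from a bone marrow sample may be analyzed by flow cytometry for fusion protein expression and/or activation, as well as for markers identifying the cancer cell type, if so desired.
Immunohistochemical (IHC) staining may be employed to determine the expression and/or activation status of the kinase protein within the fusion protein in a mammalian cancer (for example, a solid cancer such as NSCLC) before, during, and after treatment with a targeted drug that inhibits the kinase activity of the fusion protein.
Immunofluorescence (IF) analysis may be employed to determine the expression and/or activation status of the kinase polypeptide within the fusion protein in a mammalian cancer before, during, and after treatment with a targeted drug that inhibits the kinase activity of the fusion protein.
In addition, a variety of other protocols, including enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA), and fluorescence-activated cell sorting (FACS), are known in the art and may also be used to diagnose altered or abnormal levels of fusion protein expression.
AQUA peptides for the detection/quantification of fusion protein expressed in a biological sample containing cells derived from a tumor may be prepared and used in standard AQUA assays. Accordingly, in some preferred embodiments of the methods of the present invention, the fusion protein-specific reagent may comprise an isotope-labeled phosphopeptide (AQUA peptide) corresponding to a peptide sequence that includes the fusion protein or its fusion junction, as described above.
In addition, the presence of a DNA molecule encoding the fusion protein and/or the corresponding mRNA can be detected by conventional methods such as PCR or FISH (fluorescent in situ hybridization) using a polynucleotide capable of hybridizing with the DNA molecule encoding the fusion protein. In one embodiment, the presence of the fusion protein-coding DNA molecule and/or the corresponding mRNA can be detected by combining whole-transcriptome (RNA) and whole-genome (DNA) sequencing based on massively parallel sequencing technology. Fusion protein-coding polynucleotide-specific reagents useful for this detection may be siRNAs, oligonucleotides, or DNA probes that can directly hybridize with, and detect, transcripts expressing the fusion or truncated polypeptide in a biological sample.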
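The patient may be a mammal, including primates such as humans and monkeys and rodents such as mice and rats, and specifically may be a human.
The biological sample may be a cell (for example, a lung cell), tissue (for example, lung tissue), body fluid (for example, blood), or the like isolated from the patient. More specifically, biological samples useful in the present invention may be obtained from a mammal in which a cancer (solid or non-solid) characterized by expression of the fusion protein is present or developing. In one embodiment, the mammal is a human, and the human may be a patient to be treated for a cancer such as NSCLC. The human patient may be one who is currently being treated with, or is being considered for treatment with, a therapeutic agent that inhibits the kinase within the fusion protein to be detected. A biological sample suitable for use in the present invention may comprise cells (or cell extracts) derived from a mammalian cancer.
The fusion protein of the present invention, which is specifically expressed in cancer patients, can be used as a new target for cancer therapy.
Accordingly, another embodiment of the present invention provides a composition for preventing and/or treating cancer comprising, as an active ingredient, one or more selected from the group consisting of an inhibitor of the fusion protein and an inhibitor of the polynucleotide molecule encoding the fusion protein.
The inhibitor of the fusion protein is a substance that binds to the fusion protein and abolishes or reduces its function, and may be one or more selected from the group consisting of an antibody against the fusion protein, an aptamer, a conventional kinase inhibitor (for fusion genes containing a tyrosine kinase: CCDC6-ROS1, SCAF11-PDGFRA, FGFR2-CIT, AXL-MBIP, MAP4K3-PRKCE, BCAS3-MAP3K3, ERBB2IP-MAST4), a signal transduction inhibitor, and the like. The inhibitor of the fusion DNA molecule encoding the fusion protein is a substance that binds to the DNA molecule and prevents its expression as the fusion protein, and may be one or more selected from the group consisting of siRNA, shRNA, aptamers, and the like that bind specifically to the DNA molecule.
The cancer to be diagnosed or treated with the composition for diagnosing cancer, the method of providing information for cancer diagnosis, and the composition for preventing and/or treating cancer may be any kind of solid cancer. For example, the solid cancer may be lung cancer, liver cancer, colorectal cancer, pancreatic cancer, gastric cancer, breast cancer, ovarian cancer, kidney cancer, thyroid cancer, esophageal cancer, prostate cancer, brain cancer, or the like. In an embodiment, the solid cancer may be lung cancer, more specifically small cell lung cancer (SCLC) or non-small cell lung cancer (NSCLC) such as lung adenocarcinoma, squamous cell lung carcinoma, or large cell lung carcinoma, and may be, for example, lung adenocarcinoma.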
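Another embodiment provides a method of screening for an anticancer agent using the fusion protein.
In one embodiment, the screening method comprises:
treating a cell expressing the fusion protein with a candidate substance; and
measuring the level of fusion protein expression in the cell,
wherein the candidate substance is determined to be an anticancer agent when the level of fusion protein expression in the cell treated with the candidate substance is reduced compared with the level before treatment or with that in a cell not treated with the candidate substance.
The screening method may further comprise, before the step of treating with the candidate substance, a step of measuring the level of fusion protein expression in the cell prior to treatment, so that the candidate substance is determined to be an anticancer agent when the expression level in the same cell after treatment is lower than before treatment. In another example, the screening method may comprise preparing cells expressing the fusion protein, treating some of these cells with a candidate compound, and measuring the level of fusion protein expression in the treated cells and in the remaining untreated cells, so that the candidate substance is determined to be an anticancer agent when the expression level in the treated cells is lower than in the untreated cells.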
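The decision rule of the screening method can be sketched as a simple comparison. The expression values below are invented relative units, and comparing replicate means without a statistical test is an assumption of this sketch rather than something the specification requires; the function name `expression_reduced` is hypothetical.

```python
from statistics import mean
from typing import Sequence


def expression_reduced(treated: Sequence[float], reference: Sequence[float]) -> bool:
    """Return True if mean fusion protein expression is lower after treatment.

    `reference` holds either pre-treatment measurements of the same cells
    or measurements of untreated control cells, as in the two variants of
    the screening method.
    """
    return mean(treated) < mean(reference)


# Hypothetical relative expression levels (for example, normalized band intensity).
untreated_cells = [1.00, 0.95, 1.05]
treated_cells = [0.40, 0.35, 0.45]

# A candidate is flagged as a potential anticancer agent when expression drops.
is_hit = expression_reduced(treated_cells, untreated_cells)
print(is_hit)  # True
```

In practice a threshold on the fold change or a significance test would usually be added before calling a candidate a hit.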
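The cells used in the screening method may include cells derived from a cancer in which one or more of the above fusion proteins is expressed and/or activated (for example, a cancer against which the anticancer agent being screened is desired to show activity), an extract of such cells, or a culture of such cells obtained by a conventional method.
The cell expressing the fusion protein may be a cancer cell as described above, in particular a solid cancer cell, specifically a lung cancer cell, and more specifically a lung adenocarcinoma cell. The expression level of the fusion protein may be measured by conventional protein assays such as immunochromatography, immunohistochemical staining, enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA), enzyme immunoassay (EIA), fluorescence immunoassay (FIA), luminescence immunoassay (LIA), Western blotting, or FACS. The candidate substance encompasses all natural or synthetic compounds and may be one or more selected from the group consisting of general compounds, DNA, RNA, proteins, and the like.
In practicing a method for determining whether a candidate substance inhibits progression of a tumor characterized by the fusion protein, a biological sample comprising cells derived from a mammalian xenograft may be employed. A preferred xenograft (or transplant recipient) may be a human tumor cell sample expressing the fusion protein, or a small mammal, such as a mouse, harboring such cells. In assessing the presence or expression of the fusion protein in a biological sample comprising cells derived from a mammalian cancer tumor, cells in which such a translocation and/or fusion protein does not occur may be employed as a control. Preferably, the control sample may comprise normal cells of the same tissue in which the fusion protein is not expressed (for example, normal cells surrounding the tumor), or cancer cells derived from a particular cancer (for example, NSCLC) in which the fusion protein is not expressed.
The types of cancer to be treated with an anticancer agent developed by the screening method are as described above.
The fusion proteins provided by the present invention, which are specifically expressed in lung cancer, in particular lung adenocarcinoma, and/or the DNA molecules/mRNAs encoding them, are useful as lung cancer diagnostic markers and, further, as therapeutic targets.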
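FIG. 1 illustrates an example of the structure of the CCDC6-ROS1 fusion protein.
FIG. 2 illustrates an example of the structure of the SCAF11-PDGFRA fusion protein.
FIG. 3 illustrates an example of the structure of the FGFR2-CIT fusion protein.
FIG. 4 illustrates an example of the structure of the AXL-MBIP fusion protein.
FIG. 5 illustrates an example of the structure of the APLP2-TNFSF11 fusion protein.
FIG. 6 illustrates an example of the structure of the MAP4K3-PRKCE fusion protein.
FIG. 7 illustrates an example of the structure of the BCAS3-MAP3K3 fusion protein.
FIG. 8 illustrates an example of the structure of the KRAS-CDH13 fusion protein.
FIG. 9 illustrates an example of the structure of the ZFYVE9-CGA fusion protein.
FIG. 10 illustrates an example of the structure of the ERBB2IP-MAST4 fusion protein.
FIG. 11 illustrates an example of the structure of the TPD52L1-TRMT11 fusion protein.
FIG. 12 illustrates an example of the structure of the TXNRD1-GPR133 fusion protein.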
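FIG. 13 shows the gene encoding a fragment of the CCDC6 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 14 shows a fragment of the CCDC6 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIGS. 15a and 15b show the gene encoding a fragment of the ROS1 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 16 shows a fragment of the ROS1 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 17 shows a CCDC6-ROS1 fusion gene according to one example: the 5'-terminal region (CCDC6 fragment) is in red italics, the 3'-terminal region (ROS1 fragment) is in blue, and the fusion site is shaded, bold, and underlined.
FIG. 18 shows a CCDC6-ROS1 fusion protein according to one example: the N-terminal region (CCDC6 fragment) is in red italics, the C-terminal region (ROS1 fragment) is in blue, and the fusion site is shaded, bold, and underlined (the 'L' shaded green in the middle is the amino acid encoded by a total of three nucleotides additionally included at the C-terminus of the CCDC6 fragment and the N-terminus of the ROS1 fragment).
FIG. 19 shows the gene encoding a fragment of the FGFR2 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 20 shows a fragment of the FGFR2 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIGS. 21a and 21b show the gene encoding a fragment of the CIT protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 22 shows a fragment of the CIT protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIGS. 23a and 23b show an FGFR2-CIT fusion gene according to one example: the 5'-terminal region (FGFR2 fragment) is in red italics, the 3'-terminal region (CIT fragment) is in blue, and the fusion site is shaded, bold, and underlined.
FIG. 24 shows the FGFR2-CIT fusion protein: the N-terminal region (FGFR2 fragment) is in red italics, the C-terminal region (CIT fragment) is in blue, and the fusion site is shaded, bold, and underlined.
FIG. 25 shows the gene encoding a fragment of the AXL protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 26 shows a fragment of the AXL protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 27 shows the gene encoding a fragment of the MBIP protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 28 shows a fragment of the MBIP protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 29 shows an AXL-MBIP fusion gene according to one example: the 5'-terminal region (AXL fragment) is in red italics, the 3'-terminal region (MBIP fragment) is in blue, and the fusion site is shaded, bold, and underlined.
FIG. 30 shows the AXL-MBIP fusion protein: the N-terminal region (AXL fragment) is in red italics, the C-terminal region (MBIP fragment) is in blue, and the fusion site is shaded, bold, and underlined.
FIG. 31 shows the gene encoding a fragment of the APLP2 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 32 shows a fragment of the APLP2 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 33 shows the gene encoding a fragment of the TNFSF11 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 34 shows a fragment of the TNFSF11 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 35 shows an APLP2-TNFSF11 fusion gene according to one example: the 5'-terminal region (APLP2 fragment) is in red italics, the 3'-terminal region (TNFSF11 fragment) is in blue, and the fusion site is shaded, bold, and underlined.
FIG. 36 shows the APLP2-TNFSF11 fusion protein: the N-terminal region (APLP2 fragment) is in red italics, the C-terminal region (TNFSF11 fragment) is in blue, and the fusion site is shaded, bold, and underlined.
FIG. 37 shows the gene encoding a fragment of the MAP4K3 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 38 shows a fragment of the MAP4K3 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 39 shows the gene encoding a fragment of the PRKCE protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 40 shows a fragment of the PRKCE protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 41 shows a MAP4K3-PRKCE fusion gene according to one example: the 5'-terminal region (MAP4K3 fragment) is in red italics, the 3'-terminal region (PRKCE fragment) is in blue, and the fusion site is shaded, bold, and underlined.
FIG. 42 shows the MAP4K3-PRKCE fusion protein: the N-terminal region (MAP4K3 fragment) is in red italics, the C-terminal region (PRKCE fragment) is in blue, and the fusion site is shaded, bold, and underlined.
FIG. 43 shows the gene encoding a fragment of the BCAS3 protein according to one example (gray shading) and the breakpoint (bold and underlined) (the italicized nucleotide (g) within the breakpoint does not form a complete codon).
FIG. 44 shows a fragment of the BCAS3 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 45 shows the gene encoding a fragment of the MAP3K3 protein according to one example (gray shading) and the breakpoint (bold and underlined) (the italicized nucleotides (ac) within the breakpoint do not form a complete codon).
FIG. 46 shows a fragment of the MAP3K3 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIGS. 47a and 47b show a BCAS3-MAP3K3 fusion gene according to one example: the 5'-terminal region (BCAS3 fragment) is in red italics, the 3'-terminal region (MAP3K3 fragment) is in blue, and the fusion site is shaded, bold, and underlined.
FIG. 48 shows a BCAS3-MAP3K3 fusion protein according to one example: the N-terminal region (BCAS3 fragment) is in red italics, the C-terminal region (MAP3K3 fragment) is in blue, and the fusion site is shaded, bold, and underlined (the 'D' shaded red in the middle is the amino acid encoded by a total of three nucleotides additionally included at the C-terminus of the BCAS3 fragment and the N-terminus of the MAP3K3 fragment).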
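FIG. 49 shows the gene encoding a fragment of the KRAS protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 50 shows a fragment of the KRAS protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 51 shows the gene encoding a fragment of the CDH13 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 52 shows a fragment of the CDH13 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 53 shows a KRAS-CDH13 fusion gene according to one example: the 5'-terminal region (KRAS fragment) is in red italics, the 3'-terminal region (CDH13 fragment) is in blue, and the fusion site is shaded, bold, and underlined.
FIG. 54 shows the KRAS-CDH13 fusion protein: the N-terminal region (KRAS fragment) is in red italics, the C-terminal region (CDH13 fragment) is in blue, and the fusion site is shaded, bold, and underlined.
FIGS. 55a and 55b show the gene encoding a fragment of the ZFYVE9 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 56 shows a fragment of the ZFYVE9 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 57 shows the gene encoding a fragment of the CGA protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 58 shows a fragment of the CGA protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIGS. 59a and 59b show a ZFYVE9-CGA fusion gene according to one example: the 5'-terminal region (ZFYVE9 fragment) is in red italics, the 3'-terminal region (CGA fragment) is in blue, and the fusion site is shaded, bold, and underlined.
FIG. 60 shows the ZFYVE9-CGA fusion protein: the N-terminal region (ZFYVE9 fragment) is in red italics, the C-terminal region (CGA fragment) is in blue, and the fusion site is shaded, bold, and underlined.
FIGS. 61a and 61b show the gene encoding a fragment of the ERBB2IP protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 62 shows a fragment of the ERBB2IP protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIGS. 63a, 63b, and 63c show the gene encoding a fragment of the MAST4 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 64 shows a fragment of the MAST4 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIGS. 65a, 65b, and 65c show an ERBB2IP-MAST4 fusion gene according to one example: the 5'-terminal region (ERBB2IP fragment) is in red italics, the 3'-terminal region (MAST4 fragment) is in blue, and the fusion site is shaded, bold, and underlined.
FIGS. 66a and 66b show the ERBB2IP-MAST4 fusion protein: the N-terminal region (ERBB2IP fragment) is in red italics, the C-terminal region (MAST4 fragment) is in blue, and the fusion site is shaded, bold, and underlined.
FIG. 67 shows the gene encoding a fragment of the TPD52L1 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 68 shows a fragment of the TPD52L1 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 69 shows the gene encoding a fragment of the TRMT11 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 70 shows a fragment of the TRMT11 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 71 shows a TPD52L1-TRMT11 fusion gene according to one example: the 5'-terminal region (TPD52L1 fragment) is in red italics, the 3'-terminal region (TRMT11 fragment) is in blue, and the fusion site is shaded, bold, and underlined.
FIG. 72 shows the TPD52L1-TRMT11 fusion protein: the N-terminal region (TPD52L1 fragment) is in red italics, the C-terminal region (TRMT11 fragment) is in blue, and the fusion site is shaded, bold, and underlined.
FIG. 73 shows the gene encoding a fragment of the TXNRD1 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 74 shows a fragment of the TXNRD1 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 75 shows the gene encoding a fragment of the GPR133 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 76 shows a fragment of the GPR133 protein according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 77 shows a TXNRD1-GPR133 fusion gene according to one example: the 5'-terminal region (TXNRD1 fragment) is in red italics, the 3'-terminal region (GPR133 fragment) is in blue, and the fusion site is shaded, bold, and underlined.
FIG. 78 shows the TXNRD1-GPR133 fusion protein: the N-terminal region (TXNRD1 fragment) is in red italics, the C-terminal region (GPR133 fragment) is in blue, and the fusion site is shaded, bold, and underlined.
FIG. 79 shows a SCAF11 gene fragment according to one example (gray shading) and the breakpoint (bold and underlined).
FIG. 80 shows a PDGFRA gene fragment according to one example (gray shading) and the breakpoint (5'UTR (12 bp); bold and underlined).
FIG. 81 shows a SCAF11-PDGFRA fusion gene according to one example: the 5'-terminal region (SCAF11 fragment) is in red italics, the 3'-terminal region (PDGFRA fragment) is in blue, and the fusion site is shaded, bold, and underlined.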
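Hereinafter, the present invention is described in detail by way of examples.
However, the following examples merely illustrate the present invention, and the scope of the invention is not limited to them.
Example 1: Preparation of cancer samples
Two hundred primary lung adenocarcinoma surgical specimens were collected from patients who underwent lobectomy at Seoul National University Hospital and The Catholic University of Korea Seoul St. Mary's Hospital (Seoul National University Hospital: n=164, from 2010 to September 2011; Seoul St. Mary's Hospital: n=36, samples deposited in the tissue bank during 2009-2011). Twenty patients from the inventors' previous study (Ju YS et al. Genome Res. 2012 22:436-445) were included in this cohort. For each patient, the diagnosis, sex, cancer stage, and smoking status were recorded. Of the 200 cancer patients, 54.5% (n=109) were female and 58.0% (n=116) were non-smokers.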
Following a previously reported method (Ju YS et al. Genome Res. 2012 22:436-445), screening genetic tests for three well-known driver mutations were performed on subsets of the 200 lung adenocarcinoma tissues (exons 18-21 of EGFR were tested by PCR and Sanger sequencing (n=164); exon 2 of KRAS was tested by PCR and Sanger sequencing (n=37); the EML4-ALK fusion gene was tested by fluorescent in-situ hybridization (FISH; n=163); Supplementary Figure 1; Supplementary Table 1). Of the 200 cancer tissues, 110 were positive for at least one of the driver mutations: EGFR (n=99), KRAS (n=6), and EML4-ALK (n=7). These mutations were essentially mutually exclusive, except in two samples (one EGFR+ KRAS+ and one EGFR+ EML4-ALK+). The driver mutations in the remaining 90 samples were unknown.
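The sample counts in this paragraph can be cross-checked by inclusion-exclusion. The sketch below uses only the numbers stated in the text (99, 6, and 7 positives, two double-positive samples, 200 tissues); the variable names are introduced here for illustration.

```python
# Counts reported for the 200 lung adenocarcinoma tissues.
total_samples = 200
positives = {"EGFR": 99, "KRAS": 6, "EML4-ALK": 7}

# Two samples carry two driver mutations each (EGFR+KRAS and EGFR+EML4-ALK),
# so each of them is counted twice in the per-gene totals.
double_positive_samples = 2

# Samples with at least one driver mutation, and samples with none.
with_driver = sum(positives.values()) - double_positive_samples
without_driver = total_samples - with_driver

print(with_driver)     # 110
print(without_driver)  # 90
```

The remainder of 90 agrees with the number of samples that were subsequently selected for RNA sequencing.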
These 90 samples were targeted for RNA sequencing. Excluding three samples that failed RNA quality control (RNA Integrity Number (RIN), Standardization of RNA Quality Control: Agilent Tech.), mRNA sequences were obtained from the remaining 87 lung adenocarcinoma samples. Transcriptome (n=77) and whole-exome (n=76) sequencing of adjacent normal lung tissue was then performed to compare cancer and normal tissue (Supplementary Table 1). All sequencing experiments were performed according to the methods described in Ju YS et al. Genome Res. 2012 22:436-445.
From the 164 samples (87 cancer tissue samples and 77 matched normal tissue samples; Supplementary Table 2), 14,038,673,860 paired-end 101 bp reads were generated. On average, RNA sequencing throughput was 9.77 and 7.38 Gbp for cancer and normal tissue, respectively. In whole-exome sequencing of the 76 normal tissues, a read depth of 32.96× over the targeted regions was obtained per paired normal tissue.
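The reported read count and per-sample throughputs can be checked against each other with a short calculation. The sketch assumes that each of the 14,038,673,860 counted reads contributes 101 bp; the variable names are illustrative only.

```python
# Figures reported for the RNA sequencing run.
total_reads = 14_038_673_860
read_length_bp = 101
n_cancer, n_normal = 87, 77
mean_gbp_cancer, mean_gbp_normal = 9.77, 7.38

# Total yield implied by the read count (in Gbp).
total_gbp_from_reads = total_reads * read_length_bp / 1e9

# Total yield implied by the per-sample averages (in Gbp).
total_gbp_from_means = n_cancer * mean_gbp_cancer + n_normal * mean_gbp_normal

relative_difference = abs(total_gbp_from_reads - total_gbp_from_means) / total_gbp_from_means

print(round(total_gbp_from_reads, 1))   # 1417.9
print(round(total_gbp_from_means, 2))   # 1418.25
print(relative_difference < 0.001)      # True
```

Both routes give roughly 1.42 Tbp, so the read count and the average throughputs are mutually consistent to within rounding.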
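Example 2: Fusion gene analysis
The analysis then focused on detecting fusion genes, because several transforming fusion genes have recently been detected as driver mutations in lung adenocarcinoma. Using the gene fusion program (GFP) described in Ju YS et al. Genome Res. 2012 22:436-445, 45 in-frame fusion transcripts were identified from the 87 cancer tissue samples.
To detect fusion genes by transcriptome sequencing, 101 bp from each end of cDNA fragments about 300 bp long were sequenced by next-generation sequencing (Ju YS et al. Genome Res. 2012 22:436-445), and discordant reads, in which the two ends consist of sequences from different genes, were identified. Exon-spanning reads were also identified, in which one end is generated across the breakpoint of the fusion gene and therefore consists of a combination of sequences from two different genes. Both discordant reads and exon-spanning reads indicate the presence of a fusion gene. Gene pairs supported by both discordant and exon-spanning reads were selected as final fusion gene candidates, and among these, in-frame fusion genes, in which the codon frame of the original genes is maintained at the amino acid sequence level, were selected as the final lung cancer fusion genes.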
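The selection logic described above (evidence from both discordant and exon-spanning reads, followed by an in-frame check) can be sketched as follows. This is not the gene fusion program (GFP) itself: the `ReadPair` structure, the function names, and the toy read pairs are hypothetical, gene orientation is ignored, and alignment is assumed to have been done already.

```python
from collections import defaultdict
from typing import Iterable, List, NamedTuple, Optional, Tuple


class ReadPair(NamedTuple):
    """Genes each end of a paired-end read aligns to (hypothetical structure)."""
    end1_genes: Tuple[str, ...]
    end2_genes: Tuple[str, ...]


def classify(pair: ReadPair) -> Tuple[str, Optional[Tuple[str, str]]]:
    """Label a read pair and return the gene pair it supports, if any."""
    for genes in (pair.end1_genes, pair.end2_genes):
        if len(set(genes)) == 2:  # one end crosses the fusion breakpoint
            return "exon-spanning", tuple(sorted(set(genes)))
    genes = set(pair.end1_genes) | set(pair.end2_genes)
    if len(genes) == 2:  # the two ends map to different genes
        return "discordant", tuple(sorted(genes))
    return "concordant", None


def fusion_candidates(pairs: Iterable[ReadPair]) -> List[Tuple[str, str]]:
    """Gene pairs supported by both discordant and exon-spanning reads."""
    evidence = defaultdict(set)
    for pair in pairs:
        kind, key = classify(pair)
        if key is not None:
            evidence[key].add(kind)
    required = {"discordant", "exon-spanning"}
    return sorted(key for key, kinds in evidence.items() if kinds == required)


def is_in_frame(donor_coding_bases: int, acceptor_cds_offset: int) -> bool:
    """True if the acceptor codon phase continues the donor reading frame.

    donor_coding_bases: coding bases contributed by the donor before the junction.
    acceptor_cds_offset: 0-based CDS position of the first retained acceptor base.
    """
    return donor_coding_bases % 3 == acceptor_cds_offset % 3


pairs = [
    ReadPair(("CCDC6",), ("ROS1",)),          # discordant
    ReadPair(("CCDC6", "ROS1"), ("ROS1",)),   # exon-spanning
    ReadPair(("KRAS",), ("CDH13",)),          # discordant only in this toy set
    ReadPair(("EGFR",), ("EGFR",)),           # concordant
]
print(fusion_candidates(pairs))  # [('CCDC6', 'ROS1')]
print(is_in_frame(4, 1))         # True
```

Requiring both kinds of evidence reduces false positives from mis-alignment, and the modulo-3 comparison is the simplest way to express that the codon frame is preserved across the junction.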
The fused genes of these 45 transcripts, the chromosome on which each gene is located, and the distance between the genes are summarized in Table 47 below.
Index | Donor Gene | Acceptor Gene | Chromosome (Donor;Acceptor) | Distance (Mb) |
1 | EML4 | ALK | chr2;chr2 | 12.252 |
2 | KIF5B | RET | chr10;chr10 | 11.227 |
2 | KIF5B | RET | chr10;chr10 | 11.227 |
2 | KIF5B | RET | chr10;chr10 | 11.227 |
2 | KIF5B | RET | chr10;chr10 | 11.227 |
3 | CD74 | ROS1 | chr5;chr6 | Interchromosomal |
4 | SLC34A2 | ROS1 | chr4;chr6 | Interchromosomal |
5 | CCDC6 | ROS1 | chr10;chr6 | Interchromosomal |
6 | SCAF11 | PDGFRA | chr12;chr4 | Interchromosomal |
7 | FGFR2 | CIT | chr10;chr12 | Interchromosomal |
8 | AXL | MBIP | chr19;chr14 | Interchromosomal |
9 | APLP2 | TNFSF11 | chr11;chr13 | Interchromosomal |
10 | MAP4K3 | PRKCE | chr2;chr2 | 6.215 |
11 | BCAS3 | MAP3K3 | chr17;chr17 | 2.23 |
12 | KRAS | CDH13 | chr12;chr16 | Interchromosomal |
13 | ZFYVE9 | CGA | chr1;chr6 | Interchromosomal |
14 | ERBB2IP | MAST4 | chr5;chr5 | 0.515 |
15 | TPD52L1 | TRMT11 | chr6;chr6 | 0.723 |
16 | TXNRD1 | GPR133 | chr12;chr12 | 26.694 |
17 | SRSF4 | SNRNP40 | chr1;chr1 | 2.224 |
18 | EDA | MID1 | chrX;chrX | 57.984 |
19 | HYOU1 | C11orf93 | chr11;chr11 | 7.736 |
20 | SLC16A7 | MUCL1 | chr12;chr12 | 4.831 |
21 | MIER2 | ITGB1BP3 | chr19;chr19 | 3.588 |
22 | RBM14 | FGF3 | chr11;chr11 | 3.211 |
23 | UBR4 | ATP13A2 | chr1;chr1 | 2.063 |
24 | TTC19 | ATPAF2 | chr17;chr17 | 1.989 |
25 | IGSF3 | MAN1A2 | chr1;chr1 | 0.7 |
26 | XAF1 | FAM64A | chr17;chr17 | 0.305 |
27 | IL6ST | KDM1B | chr5;chr6 | Interchromosomal |
28 | UBE2E1 | ASCC3 | chr3;chr6 | Interchromosomal |
29 | XRCC1 | MAL | chr19;chr2 | Interchromosomal |
30 | BRWD1 | CCDC46 | chr21;chr17 | Interchromosomal |
31 | SPTLC3 | MAOA | chr20;chrX | Interchromosomal |
32 | UTRN | OS9 | chr6;chr12 | Interchromosomal |
33 | LOC100306951 | NUP93 | chr17;chr16 | Interchromosomal |
34 | MGAT5 | HNMT | chr2;chr2 | 3.515 |
35 | MAP3K3 | PECAM1 | chr17;chr17 | 0.623 |
36 | CMBL | C8orf38 | chr5;chr8 | Interchromosomal |
37 | ITGB1BP3 | DNM2 | chr19;chr19 | 6.886 |
38 | LSM14A | SIPA1L3 | chr19;chr19 | 3.677 |
39 | RAB21 | FRS2 | chr12;chr12 | 2.175 |
40 | ARHGEF16 | TCTEX1D4 | chr1;chr1 | 41.874 |
41 | MMP14 | H19 | chr14;chr11 | Interchromosomal |
42 | H19 | CALR | chr11;chr19 | Interchromosomal |
43 | SFTPB | DPYSL2 | chr2;chr8 | Interchromosomal |
44 | SFTPA2 | SFTPB | chr10;chr2 | Interchromosomal |
45 | FTL | SFTPA2 | chr19;chr10 | Interchromosomal |
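Table 48 below shows, for each gene (including variants), the breakpoint of the donor region of the 45 transcripts (the 3'-terminal breakpoint with respect to the transcript), the amino acid sequence corresponding to the breakpoint, and the exon in which the breakpoint is located.
Index | Donor Breakpoint (RNA) | Donor protein sequence near breakpoint | Donor exon number | |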
1 | chr2:42522656] | TPGKGPK+1nt | EML4(NM_001145076,+strand),exon12(chr2:42522521-42522656;EML4(NM_019063,+strand),exon13(chr2:42522521-42522656; | |
2 | chr10:[32317356 | NNDVK | KIF5B(NM_004521,-strand),exon15(chr10:32317356-32317499; | |
2 | chr10:[32306980 | KVHKQ | KIF5B(NM_004521,-strand),exon23(chr10:32306980-32307084; | |
2 | chr10:[32317356 | NNDVK | KIF5B(NM_004521,-strand),exon15(chr10:32317356-32317499; | |
2 | chr10:[32317356 | NNDVK | KIF5B(NM_004521,-strand),exon15(chr10:32317356-32317499; | |
3 | chr5:[149784243 | DAPPK+1nt | CD74(NM_004355,-strand),exon6(chr5:149784243-149784330;CD74(NM_001025159,-strand),exon6(chr5:149784243-149784330; | |
4 | chr4:25678324] | SREAQ+1nt | SLC34A2(NM_006424,+strand),exon13(chr4:25677757-25680366;SLC34A2(NM_001177998,+strand),exon13(chr4:25677757-25680366;SLC34A2(NM_001177999,+strand),exon13(chr4:25677757-25680366; | |
5 | chr10:[61572393 | AAQLQ+1nt | CCDC6(NM_005436,-strand),exon5(chr10:61572393-61572553; | |
6 | chr12:[46384136 | 5UTR | SRSF2IP(NM_004719,-strand),exon1(chr12:46384136-46384401; | |
7 | chr10:[123243212 | LTLTTNE | FGFR2(NM_001144914,-strand),exon14(chr10:123243212-123243317;FGFR2(NM_001144916,-strand),exon14(chr10:123243212-123243317;FGFR2(NM_001144915,-strand),exon16(chr10:123243212-123243317;FGFR2(NM_001144917,-strand),exon15(chr10:123243212-123243317;FGFR2(NM_001144918,-strand),exon15(chr10:123243212-123243317;FGFR2(NM_022970,-strand),exon17(chr10:123243212-123243317;FGFR2(NM_000141,-strand),exon17(chr10:123243212-123243317;FGFR2(NM_001144913,-strand),exon16(chr10:123243212-123243317;FGFR2(NM_001144919,-strand),exon16(chr10:123243212-123243317; | |
8 | chr19:41765701] | LTAAE | AXL(NM_021913,+strand),exon20(chr19:41765458-41767670;AXL(NM_001699,+strand),exon19(chr19:41765458-41767670; | |
9 | chr11:130000061] | AAQMKSQ | APLP2(NM_001642,+strand),exon11(chr11:129999933-130000061;APLP2(NM_001142276,+strand),exon11(chr11:129999933-130000061;APLP2(NM_001142278,+strand),exon7(chr11:129999933-130000061;APLP2(NM_001142277,+strand),exon10(chr11:129999933-130000061;APLP2(NR_024516,+strand),exon8(chr11:129999933-130000061;APLP2(NR_024515,+strand),exon8(chr11:129999933-130000061; | |
10 | chr2:[39664033 | TYGDVYK | MAP4K3(NM_003618,-strand),exon1(chr2:39664033-39664219; | |
11 | chr17:59161925] | TVIDAAS+1nt | BCAS3(NM_017679,+strand),exon22(chr17:59161828-59161925;BCAS3(NM_001099432,+strand),exon23(chr17:59161828-59161925; | |
12 | chr12:[25378548 | TSAKTRQ | KRAS(NM_004985,-strand),exon4(chr12:25378548-25378707;KRAS(NM_033360,-strand),exon4(chr12:25378548-25378707; | |
13 | chr1:52803606] | DKNVSK+2nt | ZFYVE9(NM_007324,+strand),exon15(chr1:52803444-52803606;ZFYVE9(NM_004799,+strand),exon16(chr1:52803444-52803606; | |
14 | chr5:65372777] | QPGDKIIQ | ERBB2IP(NM_018695,+strand),exon24(chr5:65372703-65372777;ERBB2IP(NM_001006600,+strand),exon23(chr5:65372703-65372777; | |
15 | chr6:125569529] | SKKFGDM+2nt | TPD52L1(NM_003287,+strand),exon4(chr6:125569428-125569529;TPD52L1(NM_001003396,+strand),exon4(chr6:125569428-125569529;TPD52L1(NM_001003397,+strand),exon4(chr6:125569428-125569529;TPD52L1(NM_001003395,+strand),exon4(chr6:125569428-125569529; | |
16 | chr12:104733051] | IHPVCAE | TXNRD1(NM_001093771,+strand),exon16(chr12:104732917-104733051;TXNRD1(NM_003330,+strand),exon14(chr12:104732917-104733051;TXNRD1(NM_182729,+strand),exon14(chr12:104732917-104733051;TXNRD1(NM_182743,+strand),exon13(chr12:104732917-104733051;TXNRD1(NM_182742,+strand),exon13(chr12:104732917-104733051; | |
17 | chr1:[29485886 | SRCSWQDLK | SRSF4(NM_005626,-strand),exon3(chr1:29485886-29485998; | |
18 | chrX:68836548] | DSQDGHQ | EDA(NM_001005610,+strand),exon1(chrX:68835911-68836548;EDA(NM_001005613,+strand),exon1(chrX:68835911-68836548;EDA(NM_001399,+strand),exon1(chrX:68835911-68836548;EDA(NM_001005609,+strand),exon1(chrX:68835911-68836548;EDA(NM_001005612,+strand),exon1(chrX:68835911-68836548; | |
19 | chr11:[118921747 | SGVLSLDR | HYOU1(NM_006389,-strand),exon14(chr11:118921747-118921885;HYOU1(NM_001130991,-strand),exon14(chr11:118921747-118921885; | |
20 | chr12:60098799] | LAVMYAG+1nt | SLC16A7(NM_004731,+strand),exon2(chr12:60098553-60098799; | |
21 | chr19:[325635 | LNRHCEK+1nt | MIER2(NM_017550,-strand),exon7(chr19:325635-325704; | |
22 | chr11:66384528] | IECDVVK+1nt | RBM14(NM_006328,+strand),exon1(chr11:66384053-66384528; | |
23 | chr1:[19523635 | LSCLYA+1nt | UBR4(NM_020765,-strand),exon8(chr1:19523635-19523759; | |
24 | chr17:15930016] | AAVLMHR+1nt | TTC19(NM_017775,+strand),exon9(chr17:15929854-15930016; | |
25 | chr1:117156387] | VVNVQPT+1nt | IGSF3(NM_001542,-strand),exon4(chr1:117156387-117156797;IGSF3(NM_001007237,-strand),exon4(chr1:117156387-117156797; | |
26 | chr17:6663920] | EQAQLGK+1nt | XAF1(NM_017523,+strand),exon4(chr17:6663725-6663920;XAF1(NM_199139,+strand),exon3(chr17:6663725-6663920; | |
27 | chr5:[55290612 | 5UTR | IL6ST(NM_175767,-strand),exon1(chr5:55290612-55290821;IL6ST(NM_002184,-strand),exon1(chr5:55290612-55290821;IL6ST(NM_001190981,-strand),exon1(chr5:55290612-55290821; | |
28 | chr3:23847579] | 5UTR | UBE2E1(NM_003341,+strand),exon1(chr3:23847439-23847579;UBE2E1(NM_182666,+strand),exon1(chr3:23847439-23847579; | |
29 | chr19:[44079062 | TISVVLQ | XRCC1(NM_006297,-strand),exon2(chr19:44079062-44079154; | |
30 | chr21:[40604103 | WRKMDLR | BRWD1(NM_018963,-strand),exon25(chr21:40604103-40604210;BRWD1(NM_033656,-strand),exon25(chr21:40604103-40604210; | |
31 | chr20:13074224] | IRIFKHN+1nt | SPTLC3(NM_018327,+strand),exon6(chr20:13074131-13074224; | |
32 | chr6:144820563] | LDTEISWAK | UTRN(NM_007124,+strand),exon33(chr6:144820393-144820563; | |
33 | chr17:1420387] | 5UTR | LOC100306951(NR_028514,+strand),exon1(chr17:1420213-1420387; | |
34 | chr2:135028121] | LEKINVA+1nt | MGAT5(NM_002410,+strand),exon2(chr2:135027957-135028121; | |
35 | chr17:61723434] | EHNGER+2nt | MAP3K3(NM_002401,+strand),exon3(chr17:61723394-61723434;MAP3K3(NM_203351,+strand),exon4(chr17:61723394-61723434; | |
36 | chr5:[10307737 | 5UTR | CMBL(NM_138809,-strand),exon1(chr5:10307737-10308168; | |
37 | chr19:3942267] | ASQQDS+2nt | ITGB1BP3(NM_170678,+strand),exon8(chr19:3942081-3942412; | |
38 | chr19:34663668] | NSTVALAK+1nt | LSM14A(NM_015578,+strand),exon1(chr19:34663352-34663668;LSM14A(NM_001114093,+strand),exon1(chr19:34663352-34663668; | |
39 | chr12:72176438] | LFLDLCK+1nt | RAB21(NM_014999,+strand),exon6(chr12:72176350-72176438; | |
40 | chr1:3392626] | QLDFSKVK | ARHGEF16(NM_014448,+strand),exon10(chr1:3392534-3392626; | |
41 | chr14:23316426] | 3UTR | MMP14(NM_004995,+strand),exon10(chr14:23314917-23316802; | |
42 | chr11:[2018179 | noncoding | H19(NR_002196,-strand),exon1(chr11:2017748-2019065; | |
43 | chr2:[85885042 | 3UTR | SFTPB(NM_000542,-strand),exon12(chr2:85884441-85886805;SFTPB(NM_198843,-strand),exon12(chr2:85884441-85885978; | |
44 | chr10:[81319068 | GDPGPP+1nt | SFTPA2(NM_001098668,-strand),exon3(chr10:81319068-81319262; | |
45 | chr19:49468806] | NYSTDVE | FTL(NM_000146,+strand),exon1(chr19:49468566-49468866; |
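Table 49 below lists, for each gene (including variants), the breakpoint at the acceptor site of the 45 transcripts (the 5'-end breakpoint with respect to the transcript), the amino acid sequence at the breakpoint, and the exon in which the breakpoint is located.
Index | Acceptor Breakpoint (RNA) | Acceptor protein sequence near breakpoint | Acceptor exon number | ||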
1 | chr2:29446394] | 2nt+YRRKHQE | ALK(NM_004304,-strand),exon20(chr2:29446208-29446394; | ||
2 | chr10:[43612032 | EDPKWEF | RET(NM_020630,+strand),exon12(chr10:43612032-43612179;RET(NM_020975,+strand),exon12(chr10:43612032-43612179; | ||
2 | chr10:[43612032 | EDPKWEF | RET(NM_020630,+strand),exon12(chr10:43612032-43612179;RET(NM_020975,+strand),exon12(chr10:43612032-43612179; | ||
2 | chr10:[43612032 | EDPKWEF | RET(NM_020630,+strand),exon12(chr10:43612032-43612179;RET(NM_020975,+strand),exon12(chr10:43612032-43612179; | ||
2 | chr10:[43612032 | EDPKWEF | RET(NM_020630,+strand),exon12(chr10:43612032-43612179;RET(NM_020975,+strand),exon12(chr10:43612032-43612179; | ||
3 | chr6:117645578] | 2nt+DFWIP | ROS1(NM_002944,-strand),exon34(chr6:117645495-117645578; | ||
4 | chr6:117650609] | 2nt+GVPNK | ROS1(NM_002944,-strand),exon32(chr6:117650492-117650609; | ||
5 | chr6:117642557] | 2nt+WHRRL | ROS1(NM_002944,-strand),exon35(chr6:117642422-117642557; | ||
6 | chr4:[55124924 | 5UTR, in-frame | PDGFRA(NM_006206,+strand),exon2(chr4:55124924-55124984; | ||
7 | chr12:120180269] | AHRDEIQ | CIT(NM_007174,-strand),exon23(chr12:120180216-120180269; | ||
8 | chr14:36783814] | IDRRI | MBIP(NM_016586,-strand),exon4(chr14:36783718-36783814;MBIP(NM_001144891,-strand),exon4(chr14:36783718-36783814; | ||
9 | chr13:[43174888 | ELQHIVG | TNFSF11(NM_033012,+strand),exon5(chr13:43174888-43174933;TNFSF11(NM_003701,+strand),exon3(chr13:43174888-43174933; | ||
10 | chr2:[46070139 | IDLEPEGR | PRKCE(NM_005400,+strand),exon2(chr2:46070139-46070202; | ||
11 | chr17:61710041] | 2nt+EQEALNS | MAP3K3(NM_002401,+strand),exon2(chr17:61710041-61710162;MAP3K3(NM_203351,+strand),exon2(chr17:61710041-61710162; | ||
12 | chr16:[83158990 | DIFKFAR | CDH13(NM_001257,+strand),exon4(chr16:83158990-83159106; | ||
13 | chr6:87797925] | 5UTR, in-frame | CGA(NM_000735,-strand),exon2(chr6:87797831-87797925; | ||
14 | chr5:[66400194 | ATAQMEER | MAST4(NM_001164664,+strand),exon10(chr5:66400194-66400403;MAST4(NM_015183,+strand),exon9(chr5:66400194-66400403; | ||
15 | chr6:[126342306 | 1nt+YTEEMVP | TRMT11(NM_001031712,+strand),exon12(chr6:126342306-126342426; | ||
16 | chr12:[131561346 | TRKQHS | GPR133(NM_198827,+strand),exon14(chr12:131561346-131561419; | ||
17 | chr1:31744346] | VWDLRQN | SNRNP40(NM_004814,-strand),exon6(chr1:31744226-31744346; | ||
18 | chrX:10463731] | VNASRQE | MID1(NM_001193277,-strand),exon4(chrX:10463624-10463731;MID1(NM_000381,-strand),exon4(chrX:10463624-10463731;MID1(NM_033289,-strand),exon4(chrX:10463624-10463731;MID1(NM_001098624,-strand),exon4(chrX:10463624-10463731;MID1(NM_033290,-strand),exon4(chrX:10463624-10463731;MID1(NM_001193278,-strand),exon4(chrX:10463624-10463731;MID1(NM_001193279,-strand),exon3(chrX:10463624-10463731;MID1(NM_001193280,-strand),exon3(chrX:10463624-10463731; | ||
19 | chr11:[111175653 | 5UTR | C11orf93(NM_001136105,+strand),exon3(chr11:111175653-111175707; | ||
20 | chr12:[55248900 | 2nt+NPTTAAPAD | MUCL1(NM_058173,+strand),exon2(chr12:55248900-55248941; | ||
21 | chr19:[3942081 | 2nt+YLDGMKS | ITGB1BP3(NM_170678,+strand),exon8(chr19:3942081-3942412; | ||
22 | chr11:69631191] | 2nt+ILEITAV | FGF3(NM_005247,-strand),exon2(chr11:69631088-69631191; | ||
23 | chr1:17332273] | 2nt+SSPLVG | ATP13A2(NM_001141973,-strand),exon2(chr1:17332179-17332273;ATP13A2(NM_022089,-strand),exon2(chr1:17332179-17332273;ATP13A2(NM_001141974,-strand),exon2(chr1:17332179-17332273; | ||
24 | chr17:17931973] | 2nt+RKRFYQN | ATPAF2(NM_145691,-strand),exon2(chr17:17931929-17931973; | ||
25 | chr1:[118035769 | 2nt+HTSVGGLGD | MAN1A2(NM_006699,+strand),exon9(chr1:118035769-118035884; | ||
26 | chr17:[6348396 | 5UTR, in-frame | FAM64A(NM_001195228,+strand),exon2(chr17:6348396-6348724;FAM64A(NM_019013,+strand),exon2(chr17:6348396-6348724; | ||
27 | chr6:[18215238 | KKHSVLM | KDM1B(NM_153042,+strand),exon16(chr6:18215238-18215360; | ||
28 | chr6:100966018] | AMLDVAAN | ASCC3(NM_006828,-strand),exon38(chr6:100965867-100966018; | ||
29 | chr2:[95713704 | IFGGLVW | MAL(NM_022438,+strand),exon2(chr2:95713704-95713871;MAL(NM_002371,+strand),exon2(chr2:95713704-95713871; | ||
30 | chr17:63685336] | VLQDELE | CCDC46(NM_001037325,-strand),exon4(chr17:63685247-63685336;CCDC46(NM_145036,-strand),exon24(chr17:63685247-63685336; | ||
31 | chrX:[43542761 | 2nt+LSAAKLL | MAOA(NM_000240,+strand),exon2(chrX:43542761-43542855; | ||
32 | chr12:[58109543 | FLCDEGA | OS9(NM_006812,+strand),exon6(chr12:58109543-58109753;OS9(NM_001017956,+strand),exon6(chr12:58109543-58109753;OS9(NM_001017957,+strand),exon6(chr12:58109543-58109753;OS9(NM_001017958,+strand),exon6(chr12:58109543-58109753; | ||
33 | chr16:[56870513 | PGVIDKF | NUP93(NM_014669,+strand),exon17(chr16:56870513-56870629; | ||
34 | chr2:[138758488 | 2nt+EIDLQIL | HNMT(NM_006895,+strand),exon3(chr2:138758488-138758595; | ||
35 | chr17:62401205] | unidentified | PECAM1(NM_000442,-strand),exon1(chr17:62399864-62401205; | ||
36 | chr8:[96044223 | 5UTR | C8orf38(NM_152416,+strand),exon2(chr8:96044223-96044322; | ||
37 | chr19:[10870414 | 1nt+DFLPRGS | DNM2(NM_001190716,+strand),exon2(chr19:10870414-10870487;DNM2(NM_004945,+strand),exon2(chr19:10870414-10870487;DNM2(NM_001005361,+strand),exon2(chr19:10870414-10870487;DNM2(NM_001005362,+strand),exon2(chr19:10870414-10870487;DNM2(NM_001005360,+strand),exon2(chr19:10870414-10870487; | ||
38 | chr19:[38519729 | 5UTR | SIPA1L3(NM_015073,+strand),exon2(chr19:38519729-38519796; | ||
39 | chr12:[69924645 | 5UTR | FRS2(NM_006654,+strand),exon2(chr12:69924645-69924740;FRS2(NM_001042555,+strand),exon3(chr12:69924645-69924740; | ||
40 | chr1:45272510] | 5UTR | TCTEX1D4(NM_001013632,-strand),exon1(chr1:45272456-45272957; | ||
41 | chr11:2018689] | noncoding | H19(NR_002196,-strand),exon1(chr11:2017748-2019065; | ||
42 | chr19:[13054527 | AAEKQMK | CALR(NM_004343,+strand),exon9(chr19:13054527-13055304; | ||
43 | chr8:[26501052 | 1nt+SPPLSPD | DPYSL2(NM_001386,+strand),exon9(chr8:26500955-26501111; | ||
44 | chr2:85885494] | 3UTR | SFTPB(NM_000542,-strand),exon12(chr2:85884441-85886805;SFTPB(NM_198843,-strand),exon12(chr2:85884441-85885978; | ||
45 | chr10:81316285] | 3UTR | SFTPA2(NM_001098668,-strand),exon6(chr10:81315609-81317341; |
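The exon annotations in the last column of the table above follow a regular pattern, `GENE(ACCESSION,±strand),exonN(chr:start-end;`, repeated once per transcript variant. The following is a minimal Python sketch for parsing such a cell and checking whether a breakpoint coincides with an exon boundary. The pattern is inferred from the rows shown here, and the names `ExonAnnotation`, `parse_exon_annotations`, and `breakpoint_on_exon_edge` are hypothetical helpers, not part of the patent.

```python
import re
from typing import List, NamedTuple


class ExonAnnotation(NamedTuple):
    gene: str
    accession: str
    strand: str
    exon: int
    chrom: str
    start: int
    end: int


# Assumed cell layout: GENE(ACCESSION,+strand),exonN(chrN:start-end;
_PATTERN = re.compile(
    r"(?P<gene>[^(;]+)\((?P<acc>[A-Z]{2}_\d+),(?P<strand>[+-])strand\),"
    r"exon(?P<exon>\d+)\((?P<chrom>chr[0-9XY]+):(?P<start>\d+)-(?P<end>\d+);"
)


def parse_exon_annotations(cell: str) -> List[ExonAnnotation]:
    """Return one record per transcript variant listed in a table cell."""
    return [
        ExonAnnotation(
            m["gene"], m["acc"], m["strand"], int(m["exon"]),
            m["chrom"], int(m["start"]), int(m["end"]),
        )
        for m in _PATTERN.finditer(cell)
    ]


def breakpoint_on_exon_edge(breakpoint: str, ann: ExonAnnotation) -> bool:
    """True when a breakpoint such as 'chr10:[43612032' sits on an exon boundary."""
    chrom, pos = breakpoint.split(":")
    position = int(pos.strip("[]"))
    return chrom == ann.chrom and position in (ann.start, ann.end)


if __name__ == "__main__":
    cell = ("RET(NM_020630,+strand),exon12(chr10:43612032-43612179;"
            "RET(NM_020975,+strand),exon12(chr10:43612032-43612179;")
    for ann in parse_exon_annotations(cell):
        print(ann, breakpoint_on_exon_edge("chr10:[43612032", ann))
```

Most rows place the breakpoint on an exon boundary. A few, such as H19 and DPYSL2, place it inside the exon, which this check reports as `False`.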
Of the 45 fusion transcripts, 22 (48.9%) were intra-chromosomal fusions. Using PCR amplification of cDNA and Sanger sequencing, 29 (Nos. 1-29) of the 30 selected fusion genes were validated (see Table 50).
Index | Donor Gene | Acceptor Gene | Forward Primer Name | Forward Primer Sequence | Reverse Primer Name | Reverse Primer Sequence |
1 | KIF5B | RET | GF1_KIF5B:RET_F | TAAGGAAATGACCAACCACCAG | GF1_KIF5B:RET_R | CCTTGACCACTTTTCCAAATTC |
2 | KRAS | CDH13 | GF2_KRAS:CDH13_F | GGAAATAAATGTGATTTGCCTTC | GF2_KRAS:CDH13_R | AAGGCTGTCTCTGATTCTCTGG |
3 | APLP2 | TNFSF11 | GF3_APLP2:TNFSF11_F | TGCTGAGAACAAAGATCGCTTA | GF3_APLP2:TNFSF11_R | TGTCGGTGGCATTAATAGTGAG |
4 | ZFYVE9 | CGA | GF4_ZFYVE9:CGA_F | ACTGCAGAGAACATGGATTCCT | GF4_ZFYVE9:CGA_R | GAATGGAGAACATGCAGAAACA |
5 | CCDC6 | ROS1 | GF5_CCDC6:ROS1_F | CCTGCAGGAAAAATTAGACCAG | GF5_CCDC6:ROS1_R | AGCTCAGCCAACTCTTTGTCTT |
6 | FGFR2 | CIT | GF6_FGFR2:CIT_F | ACATGATGATGAGGGACTGTTG | GF6_FGFR2:CIT_R | ACAGCTGTTACGAAGAGCATCA |
7 | AXL | MBIP | GF7_AXL:MBIP_F | GCCTGACGAAATCCTCTATGTC | GF7_AXL:MBIP_R | CAAAATTCCCTGACGTTGTTTT |
8 | SCAF11 | PDGFRA | GF8_SCAF11:PDGFRA_F | CAGCGGAGTCAGTGTCCTAGAG | GF8_SCAF11:PDGFRA_R | TGAGAAGACAGCCTAAGACCAG |
9 | CD74 | ROS1 | GF9_CD74:ROS1_F | GTCTTTGAGAGCTGGATGCAC | GF9_CD74:ROS1_R | AGCTCAGCCAACTCTTTGTCTT |
10 | SLC34A2 | ROS1 | GF10_SLC34A2:ROS1_F | ATGCCGTCGTCTCCAAGTTC | GF10_SLC34A2:ROS1_R | ATCTTCAGCTTTCTCCCACTGT |
11 | TXNRD1 | GPR133 | GF11_TXNRD1:GPR133_F | TCCAAATGCTGGAGAAGTTACA | GF11_TXNRD1:GPR133_R | AGTACACGAAGACTCGGTTGCT |
12 | EML4 | ALK | GF12_EML4:ALK_F | GCCAAAATTTGTGCAGTGTTTA | GF12_EML4:ALK_R | GGAGCTTGCTCAGCTTGTACTC |
13 | HYOU1 | C11orf93 | GF13_HYOU1:C11orf93_F | CCAGAATCTGACCACAGTGAAG | GF13_HYOU1:C11orf93_R | AGAAGATGGTGCAACTGGGTCT |
14 | MAP4K3 | PRKCE | GF14_MAP4K3:PRKCE_F | AGGAGGACTTCGAGCTGATTC | GF14_MAP4K3:PRKCE_R | ACGACCCTGAGAGATCGATGA |
15 | RBM14 | FGF3 | GF15_RBM14:FGF3_F | CCAAGGCCTCTTAATACTTGGA | GF15_RBM14:FGF3_R | CATAGAGTCGTCCCCTCTTGTT |
16 | BCAS3 | MAP3K3 | GF16_BCAS3:MAP3K3_F | CATCCCGTCCAGTCTCTGAT | GF16_BCAS3:MAP3K3_R | CTGCCTATTTGAGTGACCTGTG |
17 | SRSF4 | SNRNP40 | GF17_SRSF4:SNRNP40_F | GAAGTGGCCGAGATAAATATGG | GF17_SRSF4:SNRNP40_R | TAAACTCAGGCCAGTCACTGAA |
18 | UBR4 | ATP13A2 | GF18_UBR4:ATP13A2_F | ACCCTTTCTCTACCTGTGTTGG | GF18_UBR4:ATP13A2_R | AGCTGAGGGGATCTATTGATGT |
19 | TTC19 | ATPAF2 | GF19_TTC19:ATPAF2_F | CGCTTTGATGAGGCCTATATTT | GF19_TTC19:ATPAF2_R | CTGTGTGATGCTGACATTCTGA |
20 | TPD52L1 | TRMT11 | GF20_TPD52L1:TRMT11_F | GAAAACACATGAAACCCTGAGTC | GF20_TPD52L1:TRMT11_R | ATGTGTGACTGGAAAGCTTCTG |
21 | IGSF3 | MAN1A2 | GF21_IGSF3:MAN1A2_F | CTGACCAGGGCGAATTCTACT | GF21_IGSF3:MAN1A2_R | TCTTGCCTCATGGTCTGTTTTA |
22 | ERBB2IP | MAST4 | GF22_ERBB2IP:MAST4_F | AACAAGGGTACAACCTGAAGGA | GF22_ERBB2IP:MAST4_R | TCAAGGAAGTATCGTGAGGTGA |
23 | XAF1 | FAM64A | GF23_XAF1:FAM64A_F | GGAGCTCCACGAGTCCTACTGT | GF23_XAF1:FAM64A_R | AGAGGTCTCCTGATGGCTGAC |
24 | MIER2 | ITGB1BP3 | GF24_MIER2:ITGB1BP3_F | AGATCATGGTGGGACCTCAGT | GF24_MIER2:ITGB1BP3_R | AGCAGCGAGTTCTGAATGTCTT |
25 | SLC16A7 | MUCL1 | GF25_SLC16A7:MUCL1_F | GTGGTTGGAGCAGCTTTTATCT | GF25_SLC16A7:MUCL1_R | TCATCATCAGCAGGACCAGTAG |
26 | ITGB1BP3 | DNM2 | GF26_ITGB1BP3:DNM2_F | CCTGGAAGACATTCAGAACTCG | GF26_ITGB1BP3:DNM2_R | TTTGAGAAGATGAGCTGCAGAA |
27 | ARHGEF16 | TCTEX1D4 | GF27_ARHGEF16:TCTEX1D4_F | GCATGGAGCAGATGTACACG | GF27_ARHGEF16:TCTEX1D4_R | TGTGTTTTAGAACAAGTGGATCAGA |
28 | CMBL | C8orf38 | GF29_CMBL:C8orf38_F | CTCTCCCAGGAGGCTACGACT | GF29_CMBL:C8orf38_R | TGAGCCAGTTCCACATTAAAGG |
29 | EDA | MID1 | GF30_EDA:MID1_F | TGACGTTGTGCTGCTACCTAGA | GF30_EDA:MID1_R | ATCTGTCGTCTTTGCTGAATGA |
30 | H19 | CALR | GF28_H19:CALR_F | CACCGCAATTCATTTAGTAGCA | GF28_H19:CALR_R | GCCTCTCTACAGCTCGTCCTT |
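As a quick sanity check on primers such as those in Table 50, the GC fraction and a rough melting temperature can be computed directly from the sequence. The sketch below uses the Wallace rule (2 ℃ per A/T, 4 ℃ per G/C), a coarse textbook approximation for short oligonucleotides. It is not a method stated in the patent, and the function names are hypothetical.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in the primer."""
    seq = primer.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(primer: str) -> int:
    """Rough melting temperature in degrees C by the Wallace rule: 2*(A+T) + 4*(G+C)."""
    seq = primer.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


# Primer pair copied from row 1 of Table 50 (KIF5B:RET).
PRIMERS = {
    "GF1_KIF5B:RET_F": "TAAGGAAATGACCAACCACCAG",
    "GF1_KIF5B:RET_R": "CCTTGACCACTTTTCCAAATTC",
}

if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(f"{name}: {len(seq)} nt, GC {gc_fraction(seq):.0%}, Tm ~{wallace_tm(seq)} C")
```

More accurate estimates would use a nearest-neighbor thermodynamic model with salt correction. The Wallace rule only flags primers that are clearly out of range.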
Example 3: Confirmation of fusion genes
The above findings were confirmed by PCR amplification and Sanger sequencing of genomic DNA and cDNA.
RNA was extracted from tumor samples using the RNeasy Mini Kit (Qiagen). DNA was extracted using the DNeasy Tissue Kit (Qiagen). For RT-PCR, first-strand cDNA was synthesized from 2.5 mg of total RNA with the SuperScript™ III First-Strand Synthesis System (Invitrogen) using oligo(dT)20. Each fusion gene was then amplified by PCR using the corresponding primer pair (see Table 50 above) and Taq DNA Polymerase High Fidelity (Invitrogen).
The PCR reaction was run at 95 ℃ for 10 minutes, followed by 30 cycles of 95 ℃ for 30 seconds, 62 ℃ for 30 seconds, and 72 ℃ for 30 seconds, and finally at 72 ℃ for 10 minutes. The PCR and Sanger sequencing primers for detecting the genomic deletion were: 5'-AACAAGGGTACAACCTGAAGGA-3' and 5'-TCAAGGAAGTATCGTGAGGTGA-3'. The primers for the fusion transcript were: 5'-AACAAGGGTACAACCTGAAGGA-3' and 5'-TCAAGGAAGTATCGTGAGGTGA-3'. All Sanger sequencing was performed according to the Macrogen Inc. manual (http://www.macrogen.com).
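The cycling conditions above can be written down as a small data structure, which makes the total programmed hold time easy to check. This is only an illustrative sketch: the class and variable names are hypothetical, and ramp times between temperatures are ignored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: int    # block temperature in degrees Celsius
    seconds: int   # hold time


INITIAL_DENATURATION = Step(95, 600)
CYCLE = (Step(95, 30), Step(62, 30), Step(72, 30))  # denature, anneal, extend
N_CYCLES = 30
FINAL_EXTENSION = Step(72, 600)


def total_hold_seconds() -> int:
    """Sum of programmed hold times, excluding ramping between steps."""
    per_cycle = sum(step.seconds for step in CYCLE)
    return INITIAL_DENATURATION.seconds + N_CYCLES * per_cycle + FINAL_EXTENSION.seconds


if __name__ == "__main__":
    print(f"Programmed hold time: {total_hold_seconds() / 60:.0f} min")
```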
Example 5: Test confirming that fusion protein inhibitors inhibit the growth of mammalian solid tumors
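To confirm that the fusion proteins proposed in the present invention drive the growth and survival of cell lines or tumors expressing them, the cells were treated with an inhibitor of the kinase or other domain in each fusion protein.
Cells expressing the fusion proteins were counted, and the cell growth inhibition assay was performed with the CellTiter 96 AQueous One Solution Cell Proliferation Assay (Promega) according to the manufacturer's recommendations. Briefly, 1,000 to 5,000 cells were seeded onto flat-bottom 96-well plates and grown in complete medium with 10% FBS. After 24 hours, the medium was replaced with 100 µl of complete growth medium with 10% FBS containing various concentrations of the inhibitor for each fusion protein, and the cells were incubated for a further 72 hours. Each drug concentration was applied to triplicate wells. At the end of the incubation, 20 µl of CellTiter 96 AQueous One Solution was added to each well and the plates were incubated for 1 to 4 hours. Absorbance was read at 490 nm using a microplate reader. Growth inhibition can be expressed as the mean ± SD of the absorbance read from treated cells relative to untreated cells. The assay was repeated at least three times. Such an assay can confirm whether the fusion proteins drive the growth and survival of the subset of human NSCLC tumors in which they are expressed, and that such cells can be inhibited by using a targeted inhibitor to inhibit the activity of the kinase or other domain in the fusion protein.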
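The readout described above reduces to simple arithmetic on triplicate absorbance values. The sketch below shows one way to compute it, assuming that treated wells are compared with an untreated control measured on the same plate. The absorbance numbers are made up for illustration and the function names are hypothetical.

```python
from statistics import mean, stdev
from typing import Sequence, Tuple


def summarize(wells: Sequence[float]) -> Tuple[float, float]:
    """Mean and sample standard deviation of replicate absorbance readings."""
    return mean(wells), stdev(wells)


def percent_of_control(treated: Sequence[float], control: Sequence[float]) -> float:
    """Mean treated absorbance as a percentage of the mean untreated absorbance."""
    return 100.0 * mean(treated) / mean(control)


if __name__ == "__main__":
    # Hypothetical A490 readings from triplicate wells (not data from the patent).
    control = [1.20, 1.25, 1.15]
    treated = [0.60, 0.66, 0.54]

    m, sd = summarize(treated)
    viability = percent_of_control(treated, control)
    print(f"treated A490: {m:.2f} +/- {sd:.2f}")
    print(f"viability: {viability:.1f}% of control, inhibition: {100 - viability:.1f}%")
```

In practice a blank (medium-only) well would normally be subtracted from every reading before taking the ratio. That step is omitted here because the text does not describe it.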
Example 6: Test confirming that fusion proteins promote the growth and survival of transformed mammalian cell lines
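NIH3T3 cells were transformed with constructs containing the coding cDNA of each fusion protein so as to express the fusion protein, in order to confirm whether expression of the fusion proteins proposed in the present invention can transform normal cells to a cancer phenotype. Briefly, cells were maintained in RPMI-1640 medium (Invitrogen) with 10% fetal bovine serum (FBS) (Sigma) and 1.0 ng/ml IL-3 (R&D Systems). Production of retroviral supernatant and transfection were performed as previously described. NIH3T3 cells were transduced with retroviral supernatant containing the pMXs-puro/fusion protein expression vector and selected with puromycin (2 µg/ml). Next, the ability of the transformed cells to grow in soft agar was measured. Such an assay can confirm that expression of the fusion proteins transforms NIH3T3 cells and that the fusion proteins promote the survival and growth of these cells in soft agar, and it indicates that inhibiting expression of these fusion proteins in the transformed cells results in reduced viability and increased apoptosis.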
<110> MACROGEN INC.
<120> Fusion Protein containing ZFYVE9 and composition for diagnosing
cancer
<130> DPP20120913KR
<160> 205
<170> KopatentIn 1.71
<210> 1
<211> 1425
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of CCDC6 gene (NM_005436)
<400> 1
atggcggaca gcgccagcga gagcgacacg gacggggcgg ggggcaacag cagcagctcg 60
gccgccatgc agtcgtcctg ctcgtcgacc tcgggcggcg gcggtggcgg cgggggaggc 120
ggcggcggtg ggaagtcggg gggcattgtc atctcgccgt tccgcctgga ggagctcacc 180
aaccgcctgg cctcgctgca gcaagagaac aaggtgctga agatagagct ggagacctac 240
aaactgaagt gcaaggcact gcaggaggag aaccgcgacc tgcgcaaagc cagcgtgacc 300
atccaagcca gggctgagca ggaagaagaa ttcattagta acactttatt caagaaaatt 360
caggctttgc agaaggagaa agaaaccctt gctgtaaatt atgagaaaga agaagaattc 420
ctcactaatg agctctccag aaaattgatg cagttgcagc atgagaaagc cgaactagaa 480
cagcatcttg aacaagagca ggaatttcag gtcaacaaac tgatgaagaa aattaaaaaa 540
ctggagaatg acaccatttc taagcaactt acattagaac agttgagacg ggagaagatt 600
gaccttgaaa atacattgga acaagaacaa gaagcactag ttaatcgcct ctggaaaagg 660
atggataagc ttgaagctga aaagcgaatc ctgcaggaaa aattagacca gcccgtctct 720
gctccaccat cgcctagaga tatctccatg gagattgatt ctccagaaaa tatgatgcgt 780
cacatcaggt ttttaaagaa tgaagtggaa cggctgaaga agcaactgag agctgctcag 840
ttacagcatt cagagaaaat ggcacagtat ctggaggagg aacgtcacat gagagaagag 900
aacttgaggc tccagaggaa gctgcagagg gagatggaga gaagagaagc cctctgtcga 960
cagctctccg agagtgagtc cagcttagaa atggacgacg aaaggtattt taatgagatg 1020
tctgcacaag gattaagacc tcgcactgtg tccagcccga tcccttacac accttctccg 1080
agttcaagca ggcctatatc acctggtcta tcatatgcaa gtcacacggt tggtttcacg 1140
ccaccaactt cactgactag agctggaatg tcttattaca attccccggg tcttcacgtg 1200
cagcacatgg gaacatccca tggtatcaca aggccttcac cacggagaag caacagtcct 1260
gacaaattca aacggcccac gccgcctcca tctcccaaca cacagacccc agtccagcca 1320
cctccgcctc cacctccgcc acccatgcag cccacggtcc cctcagcagc cacctcgcag 1380
cctactcctt cgcaacattc ggcgcacccc tcctcccagc cttaa 1425
<210> 2
<211> 847
<212> DNA
<213> Artificial Sequence
<220>
<223> CCDC6 gene fragment
<400> 2
atggcggaca gcgccagcga gagcgacacg gacggggcgg ggggcaacag cagcagctcg 60
gccgccatgc agtcgtcctg ctcgtcgacc tcgggcggcg gcggtggcgg cgggggaggc 120
ggcggcggtg ggaagtcggg gggcattgtc atctcgccgt tccgcctgga ggagctcacc 180
aaccgcctgg cctcgctgca gcaagagaac aaggtgctga agatagagct ggagacctac 240
aaactgaagt gcaaggcact gcaggaggag aaccgcgacc tgcgcaaagc cagcgtgacc 300
atccaagcca gggctgagca ggaagaagaa ttcattagta acactttatt caagaaaatt 360
caggctttgc agaaggagaa agaaaccctt gctgtaaatt atgagaaaga agaagaattc 420
ctcactaatg agctctccag aaaattgatg cagttgcagc atgagaaagc cgaactagaa 480
cagcatcttg aacaagagca ggaatttcag gtcaacaaac tgatgaagaa aattaaaaaa 540
ctggagaatg acaccatttc taagcaactt acattagaac agttgagacg ggagaagatt 600
gaccttgaaa atacattgga acaagaacaa gaagcactag ttaatcgcct ctggaaaagg 660
atggataagc ttgaagctga aaagcgaatc ctgcaggaaa aattagacca gcccgtctct 720
gctccaccat cgcctagaga tatctccatg gagattgatt ctccagaaaa tatgatgcgt 780
cacatcaggt ttttaaagaa tgaagtggaa cggctgaaga agcaactgag agctgctcag 840
ttacagc 847
<210> 3
<211> 16
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point sequence of CCDC6 gene fragment
<400> 3
gctgctcagt tacagc 16
<210> 4
<211> 474
<212> PRT
<213> Artificial Sequence
<220>
<223> CCDC6 protein
<400> 4
Met Ala Asp Ser Ala Ser Glu Ser Asp Thr Asp Gly Ala Gly Gly Asn
1 5 10 15
Ser Ser Ser Ser Ala Ala Met Gln Ser Ser Cys Ser Ser Thr Ser Gly
20 25 30
Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Lys Ser Gly Gly
35 40 45
Ile Val Ile Ser Pro Phe Arg Leu Glu Glu Leu Thr Asn Arg Leu Ala
50 55 60
Ser Leu Gln Gln Glu Asn Lys Val Leu Lys Ile Glu Leu Glu Thr Tyr
65 70 75 80
Lys Leu Lys Cys Lys Ala Leu Gln Glu Glu Asn Arg Asp Leu Arg Lys
85 90 95
Ala Ser Val Thr Ile Gln Ala Arg Ala Glu Gln Glu Glu Glu Phe Ile
100 105 110
Ser Asn Thr Leu Phe Lys Lys Ile Gln Ala Leu Gln Lys Glu Lys Glu
115 120 125
Thr Leu Ala Val Asn Tyr Glu Lys Glu Glu Glu Phe Leu Thr Asn Glu
130 135 140
Leu Ser Arg Lys Leu Met Gln Leu Gln His Glu Lys Ala Glu Leu Glu
145 150 155 160
Gln His Leu Glu Gln Glu Gln Glu Phe Gln Val Asn Lys Leu Met Lys
165 170 175
Lys Ile Lys Lys Leu Glu Asn Asp Thr Ile Ser Lys Gln Leu Thr Leu
180 185 190
Glu Gln Leu Arg Arg Glu Lys Ile Asp Leu Glu Asn Thr Leu Glu Gln
195 200 205
Glu Gln Glu Ala Leu Val Asn Arg Leu Trp Lys Arg Met Asp Lys Leu
210 215 220
Glu Ala Glu Lys Arg Ile Leu Gln Glu Lys Leu Asp Gln Pro Val Ser
225 230 235 240
Ala Pro Pro Ser Pro Arg Asp Ile Ser Met Glu Ile Asp Ser Pro Glu
245 250 255
Asn Met Met Arg His Ile Arg Phe Leu Lys Asn Glu Val Glu Arg Leu
260 265 270
Lys Lys Gln Leu Arg Ala Ala Gln Leu Gln His Ser Glu Lys Met Ala
275 280 285
Gln Tyr Leu Glu Glu Glu Arg His Met Arg Glu Glu Asn Leu Arg Leu
290 295 300
Gln Arg Lys Leu Gln Arg Glu Met Glu Arg Arg Glu Ala Leu Cys Arg
305 310 315 320
Gln Leu Ser Glu Ser Glu Ser Ser Leu Glu Met Asp Asp Glu Arg Tyr
325 330 335
Phe Asn Glu Met Ser Ala Gln Gly Leu Arg Pro Arg Thr Val Ser Ser
340 345 350
Pro Ile Pro Tyr Thr Pro Ser Pro Ser Ser Ser Arg Pro Ile Ser Pro
355 360 365
Gly Leu Ser Tyr Ala Ser His Thr Val Gly Phe Thr Pro Pro Thr Ser
370 375 380
Leu Thr Arg Ala Gly Met Ser Tyr Tyr Asn Ser Pro Gly Leu His Val
385 390 395 400
Gln His Met Gly Thr Ser His Gly Ile Thr Arg Pro Ser Pro Arg Arg
405 410 415
Ser Asn Ser Pro Asp Lys Phe Lys Arg Pro Thr Pro Pro Pro Ser Pro
420 425 430
Asn Thr Gln Thr Pro Val Gln Pro Pro Pro Pro Pro Pro Pro Pro Pro
435 440 445
Met Gln Pro Thr Val Pro Ser Ala Ala Thr Ser Gln Pro Thr Pro Ser
450 455 460
Gln His Ser Ala His Pro Ser Ser Gln Pro
465 470
<210> 5
<211> 282
<212> PRT
<213> Artificial Sequence
<220>
<223> CCDC6 protein fragment
<400> 5
Met Ala Asp Ser Ala Ser Glu Ser Asp Thr Asp Gly Ala Gly Gly Asn
1 5 10 15
Ser Ser Ser Ser Ala Ala Met Gln Ser Ser Cys Ser Ser Thr Ser Gly
20 25 30
Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Lys Ser Gly Gly
35 40 45
Ile Val Ile Ser Pro Phe Arg Leu Glu Glu Leu Thr Asn Arg Leu Ala
50 55 60
Ser Leu Gln Gln Glu Asn Lys Val Leu Lys Ile Glu Leu Glu Thr Tyr
65 70 75 80
Lys Leu Lys Cys Lys Ala Leu Gln Glu Glu Asn Arg Asp Leu Arg Lys
85 90 95
Ala Ser Val Thr Ile Gln Ala Arg Ala Glu Gln Glu Glu Glu Phe Ile
100 105 110
Ser Asn Thr Leu Phe Lys Lys Ile Gln Ala Leu Gln Lys Glu Lys Glu
115 120 125
Thr Leu Ala Val Asn Tyr Glu Lys Glu Glu Glu Phe Leu Thr Asn Glu
130 135 140
Leu Ser Arg Lys Leu Met Gln Leu Gln His Glu Lys Ala Glu Leu Glu
145 150 155 160
Gln His Leu Glu Gln Glu Gln Glu Phe Gln Val Asn Lys Leu Met Lys
165 170 175
Lys Ile Lys Lys Leu Glu Asn Asp Thr Ile Ser Lys Gln Leu Thr Leu
180 185 190
Glu Gln Leu Arg Arg Glu Lys Ile Asp Leu Glu Asn Thr Leu Glu Gln
195 200 205
Glu Gln Glu Ala Leu Val Asn Arg Leu Trp Lys Arg Met Asp Lys Leu
210 215 220
Glu Ala Glu Lys Arg Ile Leu Gln Glu Lys Leu Asp Gln Pro Val Ser
225 230 235 240
Ala Pro Pro Ser Pro Arg Asp Ile Ser Met Glu Ile Asp Ser Pro Glu
245 250 255
Asn Met Met Arg His Ile Arg Phe Leu Lys Asn Glu Val Glu Arg Leu
260 265 270
Lys Lys Gln Leu Arg Ala Ala Gln Leu Gln
275 280
<210> 6
<211> 5
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of CCDC6 protein fragment
<400> 6
Ala Ala Gln Leu Gln
1 5
<210> 7
<211> 7044
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of ROS1 gene (NM_002944)
<400> 7
atgaagaaca tttactgtct tattccgaag cttgtcaatt ttgcaactct tggctgccta 60
tggatttctg tggtgcagtg tacagtttta aatagctgcc taaagtcgtg tgtaactaat 120
ctgggccagc agcttgacct tggcacacca cataatctga gtgaaccgtg tatccaagga 180
tgtcactttt ggaactctgt agatcagaaa aactgtgctt taaagtgtcg ggagtcgtgt 240
gaggttggct gtagcagcgc ggaaggtgca tatgaagagg aagtactgga aaatgcagac 300
ctaccaactg ctccctttgc ttcttccatt ggaagccaca atatgacatt acgatggaaa 360
tctgcaaact tctctggagt aaaatacatc attcagtgga aatatgcaca acttctggga 420
agctggactt atactaagac tgtgtccaga ccgtcctatg tggtcaagcc cctgcacccc 480
ttcactgagt acattttccg agtggtttgg atcttcacag cgcagctgca gctctactcc 540
cctccaagtc ccagttacag gactcatcct catggagttc ctgaaactgc acctttgatt 600
aggaatattg agagctcaag tcccgacact gtggaagtca gctgggatcc acctcaattc 660
ccaggtggac ctattttggg ttataactta aggctgatca gcaaaaatca aaaattagat 720
gcagggacac agagaaccag tttccagttt tactccactt taccaaatac tatctacagg 780
ttttctattg cagcagtaaa tgaagttggt gagggtccag aagcagaatc tagtattacc 840
acttcatctt cagcagttca acaagaggaa cagtggctct ttttatccag aaaaacttct 900
ctaagaaaga gatctttaaa acatttagta gatgaagcac attgccttcg gttggatgct 960
atataccata atattacagg aatatctgtt gatgtccacc agcaaattgt ttatttctct 1020
gaaggaactc tcatatgggc gaagaaggct gccaacatgt ctgatgtatc tgacctgaga 1080
attttttaca gaggttcagg attaatttct tctatctcca tagattggct ttatcaaaga 1140
atgtatttca tcatggatga actggtatgt gtctgtgatt tagagaactg ctcaaacatc 1200
gaggaaatta ctccaccctc tattagtgca cctcaaaaaa ttgtggctga ttcatacaat 1260
gggtatgtct tttacctcct gagagatggc atttatagag cagaccttcc tgtaccatct 1320
ggccggtgtg cagaagctgt gcgtattgtg gagagttgca cgttaaagga ctttgcaatc 1380
aagccacaag ccaagcgaat catttacttc aatgacactg cccaagtctt catgtcaaca 1440
tttctggatg gctctgcttc ccatctcatc ctacctcgca tcccctttgc tgatgtgaaa 1500
agttttgctt gtgaaaacaa tgactttctt gtcacagatg gcaaggtcat tttccaacag 1560
gatgctttgt cttttaatga attcatcgtg ggatgtgacc tgagtcacat agaagaattt 1620
gggtttggta acttggtcat ctttggctca tcctcccagc tgcaccctct gccaggccgc 1680
ccgcaggagc tttcggtgct gtttggctct caccaggctc ttgttcaatg gaagcctcct 1740
gcccttgcca taggagccaa tgtcatcctg atcagtgata ttattgaact ctttgaatta 1800
ggcccttctg cctggcagaa ctggacctat gaggtgaaag tatccaccca agaccctcct 1860
gaagtcactc atattttctt gaacataagt ggaaccatgc tgaatgtacc tgagctgcag 1920
agtgctatga aatacaaggt ttctgtgaga gcaagttctc caaagaggcc aggcccctgg 1980
tcagagccct cagtgggtac taccctggtg ccagctagtg aaccaccatt tatcatggct 2040
gtgaaagaag atgggctttg gagtaaacca ttaaatagct ttggcccagg agagttctta 2100
tcctctgata taggaaatgt gtcagacatg gattggtata acaacagcct ctactacagt 2160
gacacgaaag gcgacgtttt tgtgtggctg ctgaatggga cggatatctc agagaattat 2220
cacctaccca gcattgcagg agcaggggct ttagcttttg agtggctggg tcactttctc 2280
tactgggctg gaaagacata tgtgatacaa aggcagtctg tgttgacggg acacacagac 2340
attgttaccc acgtgaagct attggtgaat gacatggtgg tggattcagt tggtggatat 2400
ctctactgga ccacactcta ttcagtggaa agcaccagac taaatgggga aagttccctt 2460
gtactacaga cacagccttg gttttctggg aaaaaggtaa ttgctctaac tttagacctc 2520
agtgatgggc tcctgtattg gttggttcaa gacagtcaat gtattcacct gtacacagct 2580
gttcttcggg gacagagcac tggggatacc accatcacag aatttgcagc ctggagtact 2640
tctgaaattt cccagaatgc actgatgtac tatagtggtc ggctgttctg gatcaatggc 2700
tttaggatta tcacaactca agaaataggt cagaaaacca gtgtctctgt tttggaacca 2760
gccagattta atcagttcac aattattcag acatccctta agcccctgcc agggaacttt 2820
tcctttaccc ctaaggttat tccagattct gttcaagagt cttcatttag gattgaagga 2880
aatgcttcaa gttttcaaat cctgtggaat ggtccccctg cggtagactg gggtgtagtt 2940
ttctacagtg tagaatttag tgctcattct aagttcttgg ctagtgaaca acactcttta 3000
cctgtattta ctgtggaagg actggaacct tatgccttat ttaatctttc tgtcactcct 3060
tatacctact ggggaaaggg ccccaaaaca tctctgtcac ttcgagcacc tgaaacagtt 3120
ccatcagcac cagagaaccc cagaatattt atattaccaa gtggaaaatg ctgcaacaag 3180
aatgaagttg tggtggaatt taggtggaac aaacctaagc atgaaaatgg ggtgttaaca 3240
aaatttgaaa ttttctacaa tatatccaat caaagtatta caaacaaaac atgtgaagac 3300
tggattgctg tcaatgtcac tccctcagtg atgtcttttc aacttgaagg catgagtccc 3360
agatgcttta ttgccttcca ggttagggcc tttacatcta aggggccagg accatatgct 3420
gacgttgtaa agtctacaac atcagaaatc aacccatttc ctcacctcat aactcttctt 3480
ggtaacaaga tagttttttt agatatggat caaaatcaag ttgtgtggac gttttcagca 3540
gaaagagtta tcagtgccgt ttgctacaca gctgataatg agatgggata ttatgctgaa 3600
ggggactcac tctttcttct gcacttgcac aatcgctcta gctctgagct tttccaagat 3660
tcactggttt ttgatatcac agttattaca attgactgga tttcaaggca cctctacttt 3720
gcactgaaag aatcacaaaa tggaatgcaa gtatttgatg ttgatcttga acacaaggtg 3780
aaatatccca gagaggtgaa gattcacaat aggaattcaa caataatttc tttttctgta 3840
tatcctcttt taagtcgctt gtattggaca gaagtttcca attttggcta ccagatgttc 3900
tactacagta ttatcagtca caccttgcac cgaattctgc aacccacagc tacaaaccaa 3960
caaaacaaaa ggaatcaatg ttcttgtaat gtgactgaat ttgagttaag tggagcaatg 4020
gctattgata cctctaacct agagaaacca ttgatatact ttgccaaagc acaagagatc 4080
tgggcaatgg atctggaagg ctgtcagtgt tggagagtta tcacagtacc tgctatgctc 4140
gcaggaaaaa cccttgttag cttaactgtg gatggagatc ttatatactg gatcatcaca 4200
gcaaaggaca gcacacagat ttatcaggca aagaaaggaa atggggccat cgtttcccag 4260
gtgaaggccc taaggagtag gcatatcttg gcttacagtt cagttatgca gccttttcca 4320
gataaagcgt ttctgtctct agcttcagac actgtggaac caactatact taatgccact 4380
aacactagcc tcacaatcag attacctctg gccaagacaa acctcacatg gtatggcatc 4440
accagcccta ctccaacata cctggtttat tatgcagaag ttaatgacag gaaaaacagc 4500
tctgacttga aatatagaat tctggaattt caggacagta tagctcttat tgaagattta 4560
caaccatttt caacatacat gatacagata gctgtaaaaa attattattc agatcctttg 4620
gaacatttac caccaggaaa agagatttgg ggaaaaacta aaaatggagt accagaggca 4680
gtgcagctca ttaatacaac tgtgcggtca gacaccagcc tcattatatc ttggagagaa 4740
tctcacaagc caaatggacc taaagaatca gtccgttatc agttggcaat ctcacacctg 4800
gccctaattc ctgaaactcc tctaagacaa agtgaatttc caaatggaag gctcactctc 4860
cttgttacta gactgtctgg tggaaatatt tatgtgttaa aggttcttgc ctgccactct 4920
gaggaaatgt ggtgtacaga gagtcatcct gtcactgtgg aaatgtttaa cacaccagag 4980
aaaccttatt ccttggttcc agagaacact agtttgcaat ttaattggaa ggctccattg 5040
aatgttaacc tcatcagatt ttgggttgag ctacagaagt ggaaatacaa tgagttttac 5100
catgttaaaa cttcatgcag ccaaggtcct gcttatgtct gtaatatcac aaatctacaa 5160
ccttatactt catataatgt cagagtagtg gtggtttata agacgggaga aaatagcacc 5220
tcacttccag aaagctttaa gacaaaagct ggagtcccaa ataaaccagg cattcccaaa 5280
ttactagaag ggagtaaaaa ttcaatacag tgggagaaag ctgaagataa tggatgtaga 5340
attacatact atatccttga gataagaaag agcacttcaa ataatttaca gaaccagaat 5400
ttaaggtgga agatgacatt taatggatcc tgcagtagtg tttgcacatg gaagtccaaa 5460
aacctgaaag gaatatttca gttcagagta gtagctgcaa ataatctagg gtttggtgaa 5520
tatagtggaa tcagtgagaa tattatatta gttggagatg atttttggat accagaaaca 5580
agtttcatac ttactattat agttggaata tttctggttg ttacaatccc actgaccttt 5640
gtctggcata gaagattaaa gaatcaaaaa agtgccaagg aaggggtgac agtgcttata 5700
aacgaagaca aagagttggc tgagctgcga ggtctggcag ccggagtagg cctggctaat 5760
gcctgctatg caatacatac tcttccaacc caagaggaga ttgaaaatct tcctgccttc 5820
cctcgggaaa aactgactct gcgtctcttg ctgggaagtg gagcctttgg agaagtgtat 5880
gaaggaacag cagtggacat cttaggagtt ggaagtggag aaatcaaagt agcagtgaag 5940
actttgaaga agggttccac agaccaggag aagattgaat tcctgaagga ggcacatctg 6000
atgagcaaat ttaatcatcc caacattctg aagcagcttg gagtttgtct gctgaatgaa 6060
ccccaataca ttatcctgga actgatggag ggaggagacc ttcttactta tttgcgtaaa 6120
gcccggatgg caacgtttta tggtccttta ctcaccttgg ttgaccttgt agacctgtgt 6180
gtagatattt caaaaggctg tgtctacttg gaacggatgc atttcattca cagggatctg 6240
gcagctagaa attgccttgt ttccgtgaaa gactatacca gtccacggat agtgaagatt 6300
ggagactttg gactcgccag agacatctat aaaaatgatt actatagaaa gagaggggaa 6360
ggcctgctcc cagttcggtg gatggctcca gaaagtttga tggatggaat cttcactact 6420
caatctgatg tatggtcttt tggaattctg atttgggaga ttttaactct tggtcatcag 6480
ccttatccag ctcattccaa ccttgatgtg ttaaactatg tgcaaacagg agggagactg 6540
gagccaccaa gaaattgtcc tgatgatctg tggaatttaa tgacccagtg ctgggctcaa 6600
gaacccgacc aaagacctac ttttcataga attcaggacc aacttcagtt attcagaaat 6660
tttttcttaa atagcattta taagtccaga gatgaagcaa acaacagtgg agtcataaat 6720
gaaagctttg aaggtgaaga tggcgatgtg atttgtttga attcagatga cattatgcca 6780
gttgctttaa tggaaacgaa gaaccgagaa gggttaaact atatggtact tgctacagaa 6840
tgtggccaag gtgaagaaaa gtctgagggt cctctaggct cccaggaatc tgaatcttgt 6900
ggtctgagga aagaagagaa ggaaccacat gcagacaaag atttctgcca agaaaaacaa 6960
gtggcttact gcccttctgg caagcctgaa ggcctgaact atgcctgtct cactcacagt 7020
ggatatggag atgggtctga ttaa 7044
<210> 8
<211> 1403
<212> DNA
<213> Artificial Sequence
<220>
<223> ROS1 gene fragment
<400> 8
tctggcatag aagattaaag aatcaaaaaa gtgccaagga aggggtgaca gtgcttataa 60
acgaagacaa agagttggct gagctgcgag gtctggcagc cggagtaggc ctggctaatg 120
cctgctatgc aatacatact cttccaaccc aagaggagat tgaaaatctt cctgccttcc 180
ctcgggaaaa actgactctg cgtctcttgc tgggaagtgg agcctttgga gaagtgtatg 240
aaggaacagc agtggacatc ttaggagttg gaagtggaga aatcaaagta gcagtgaaga 300
ctttgaagaa gggttccaca gaccaggaga agattgaatt cctgaaggag gcacatctga 360
tgagcaaatt taatcatccc aacattctga agcagcttgg agtttgtctg ctgaatgaac 420
cccaatacat tatcctggaa ctgatggagg gaggagacct tcttacttat ttgcgtaaag 480
cccggatggc aacgttttat ggtcctttac tcaccttggt tgaccttgta gacctgtgtg 540
tagatatttc aaaaggctgt gtctacttgg aacggatgca tttcattcac agggatctgg 600
cagctagaaa ttgccttgtt tccgtgaaag actataccag tccacggata gtgaagattg 660
gagactttgg actcgccaga gacatctata aaaatgatta ctatagaaag agaggggaag 720
gcctgctccc agttcggtgg atggctccag aaagtttgat ggatggaatc ttcactactc 780
aatctgatgt atggtctttt ggaattctga tttgggagat tttaactctt ggtcatcagc 840
cttatccagc tcattccaac cttgatgtgt taaactatgt gcaaacagga gggagactgg 900
agccaccaag aaattgtcct gatgatctgt ggaatttaat gacccagtgc tgggctcaag 960
aacccgacca aagacctact tttcatagaa ttcaggacca acttcagtta ttcagaaatt 1020
ttttcttaaa tagcatttat aagtccagag atgaagcaaa caacagtgga gtcataaatg 1080
aaagctttga aggtgaagat ggcgatgtga tttgtttgaa ttcagatgac attatgccag 1140
ttgctttaat ggaaacgaag aaccgagaag ggttaaacta tatggtactt gctacagaat 1200
gtggccaagg tgaagaaaag tctgagggtc ctctaggctc ccaggaatct gaatcttgtg 1260
gtctgaggaa agaagagaag gaaccacatg cagacaaaga tttctgccaa gaaaaacaag 1320
tggcttactg cccttctggc aagcctgaag gcctgaacta tgcctgtctc actcacagtg 1380
gatatggaga tgggtctgat taa 1403
<210> 9
<211> 17
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of ROS1 gene fragment
<400> 9
tctggcatag aagatta 17
<210> 10
<211> 2347
<212> PRT
<213> Artificial Sequence
<220>
<223> ROS1 protein
<400> 10
Met Lys Asn Ile Tyr Cys Leu Ile Pro Lys Leu Val Asn Phe Ala Thr
1 5 10 15
Leu Gly Cys Leu Trp Ile Ser Val Val Gln Cys Thr Val Leu Asn Ser
20 25 30
Cys Leu Lys Ser Cys Val Thr Asn Leu Gly Gln Gln Leu Asp Leu Gly
35 40 45
Thr Pro His Asn Leu Ser Glu Pro Cys Ile Gln Gly Cys His Phe Trp
50 55 60
Asn Ser Val Asp Gln Lys Asn Cys Ala Leu Lys Cys Arg Glu Ser Cys
65 70 75 80
Glu Val Gly Cys Ser Ser Ala Glu Gly Ala Tyr Glu Glu Glu Val Leu
85 90 95
Glu Asn Ala Asp Leu Pro Thr Ala Pro Phe Ala Ser Ser Ile Gly Ser
100 105 110
His Asn Met Thr Leu Arg Trp Lys Ser Ala Asn Phe Ser Gly Val Lys
115 120 125
Tyr Ile Ile Gln Trp Lys Tyr Ala Gln Leu Leu Gly Ser Trp Thr Tyr
130 135 140
Thr Lys Thr Val Ser Arg Pro Ser Tyr Val Val Lys Pro Leu His Pro
145 150 155 160
Phe Thr Glu Tyr Ile Phe Arg Val Val Trp Ile Phe Thr Ala Gln Leu
165 170 175
Gln Leu Tyr Ser Pro Pro Ser Pro Ser Tyr Arg Thr His Pro His Gly
180 185 190
Val Pro Glu Thr Ala Pro Leu Ile Arg Asn Ile Glu Ser Ser Ser Pro
195 200 205
Asp Thr Val Glu Val Ser Trp Asp Pro Pro Gln Phe Pro Gly Gly Pro
210 215 220
Ile Leu Gly Tyr Asn Leu Arg Leu Ile Ser Lys Asn Gln Lys Leu Asp
225 230 235 240
Ala Gly Thr Gln Arg Thr Ser Phe Gln Phe Tyr Ser Thr Leu Pro Asn
245 250 255
Thr Ile Tyr Arg Phe Ser Ile Ala Ala Val Asn Glu Val Gly Glu Gly
260 265 270
Pro Glu Ala Glu Ser Ser Ile Thr Thr Ser Ser Ser Ala Val Gln Gln
275 280 285
Glu Glu Gln Trp Leu Phe Leu Ser Arg Lys Thr Ser Leu Arg Lys Arg
290 295 300
Ser Leu Lys His Leu Val Asp Glu Ala His Cys Leu Arg Leu Asp Ala
305 310 315 320
Ile Tyr His Asn Ile Thr Gly Ile Ser Val Asp Val His Gln Gln Ile
325 330 335
Val Tyr Phe Ser Glu Gly Thr Leu Ile Trp Ala Lys Lys Ala Ala Asn
340 345 350
Met Ser Asp Val Ser Asp Leu Arg Ile Phe Tyr Arg Gly Ser Gly Leu
355 360 365
Ile Ser Ser Ile Ser Ile Asp Trp Leu Tyr Gln Arg Met Tyr Phe Ile
370 375 380
Met Asp Glu Leu Val Cys Val Cys Asp Leu Glu Asn Cys Ser Asn Ile
385 390 395 400
Glu Glu Ile Thr Pro Pro Ser Ile Ser Ala Pro Gln Lys Ile Val Ala
405 410 415
Asp Ser Tyr Asn Gly Tyr Val Phe Tyr Leu Leu Arg Asp Gly Ile Tyr
420 425 430
Arg Ala Asp Leu Pro Val Pro Ser Gly Arg Cys Ala Glu Ala Val Arg
435 440 445
Ile Val Glu Ser Cys Thr Leu Lys Asp Phe Ala Ile Lys Pro Gln Ala
450 455 460
Lys Arg Ile Ile Tyr Phe Asn Asp Thr Ala Gln Val Phe Met Ser Thr
465 470 475 480
Phe Leu Asp Gly Ser Ala Ser His Leu Ile Leu Pro Arg Ile Pro Phe
485 490 495
Ala Asp Val Lys Ser Phe Ala Cys Glu Asn Asn Asp Phe Leu Val Thr
500 505 510
Asp Gly Lys Val Ile Phe Gln Gln Asp Ala Leu Ser Phe Asn Glu Phe
515 520 525
Ile Val Gly Cys Asp Leu Ser His Ile Glu Glu Phe Gly Phe Gly Asn
530 535 540
Leu Val Ile Phe Gly Ser Ser Ser Gln Leu His Pro Leu Pro Gly Arg
545 550 555 560
Pro Gln Glu Leu Ser Val Leu Phe Gly Ser His Gln Ala Leu Val Gln
565 570 575
Trp Lys Pro Pro Ala Leu Ala Ile Gly Ala Asn Val Ile Leu Ile Ser
580 585 590
Asp Ile Ile Glu Leu Phe Glu Leu Gly Pro Ser Ala Trp Gln Asn Trp
595 600 605
Thr Tyr Glu Val Lys Val Ser Thr Gln Asp Pro Pro Glu Val Thr His
610 615 620
Ile Phe Leu Asn Ile Ser Gly Thr Met Leu Asn Val Pro Glu Leu Gln
625 630 635 640
Ser Ala Met Lys Tyr Lys Val Ser Val Arg Ala Ser Ser Pro Lys Arg
645 650 655
Pro Gly Pro Trp Ser Glu Pro Ser Val Gly Thr Thr Leu Val Pro Ala
660 665 670
Ser Glu Pro Pro Phe Ile Met Ala Val Lys Glu Asp Gly Leu Trp Ser
675 680 685
Lys Pro Leu Asn Ser Phe Gly Pro Gly Glu Phe Leu Ser Ser Asp Ile
690 695 700
Gly Asn Val Ser Asp Met Asp Trp Tyr Asn Asn Ser Leu Tyr Tyr Ser
705 710 715 720
Asp Thr Lys Gly Asp Val Phe Val Trp Leu Leu Asn Gly Thr Asp Ile
725 730 735
Ser Glu Asn Tyr His Leu Pro Ser Ile Ala Gly Ala Gly Ala Leu Ala
740 745 750
Phe Glu Trp Leu Gly His Phe Leu Tyr Trp Ala Gly Lys Thr Tyr Val
755 760 765
Ile Gln Arg Gln Ser Val Leu Thr Gly His Thr Asp Ile Val Thr His
770 775 780
Val Lys Leu Leu Val Asn Asp Met Val Val Asp Ser Val Gly Gly Tyr
785 790 795 800
Leu Tyr Trp Thr Thr Leu Tyr Ser Val Glu Ser Thr Arg Leu Asn Gly
805 810 815
Glu Ser Ser Leu Val Leu Gln Thr Gln Pro Trp Phe Ser Gly Lys Lys
820 825 830
Val Ile Ala Leu Thr Leu Asp Leu Ser Asp Gly Leu Leu Tyr Trp Leu
835 840 845
Val Gln Asp Ser Gln Cys Ile His Leu Tyr Thr Ala Val Leu Arg Gly
850 855 860
Gln Ser Thr Gly Asp Thr Thr Ile Thr Glu Phe Ala Ala Trp Ser Thr
865 870 875 880
Ser Glu Ile Ser Gln Asn Ala Leu Met Tyr Tyr Ser Gly Arg Leu Phe
885 890 895
Trp Ile Asn Gly Phe Arg Ile Ile Thr Thr Gln Glu Ile Gly Gln Lys
900 905 910
Thr Ser Val Ser Val Leu Glu Pro Ala Arg Phe Asn Gln Phe Thr Ile
915 920 925
Ile Gln Thr Ser Leu Lys Pro Leu Pro Gly Asn Phe Ser Phe Thr Pro
930 935 940
Lys Val Ile Pro Asp Ser Val Gln Glu Ser Ser Phe Arg Ile Glu Gly
945 950 955 960
Asn Ala Ser Ser Phe Gln Ile Leu Trp Asn Gly Pro Pro Ala Val Asp
965 970 975
Trp Gly Val Val Phe Tyr Ser Val Glu Phe Ser Ala His Ser Lys Phe
980 985 990
Leu Ala Ser Glu Gln His Ser Leu Pro Val Phe Thr Val Glu Gly Leu
995 1000 1005
Glu Pro Tyr Ala Leu Phe Asn Leu Ser Val Thr Pro Tyr Thr Tyr Trp
1010 1015 1020
Gly Lys Gly Pro Lys Thr Ser Leu Ser Leu Arg Ala Pro Glu Thr Val
1025 1030 1035 1040
Pro Ser Ala Pro Glu Asn Pro Arg Ile Phe Ile Leu Pro Ser Gly Lys
1045 1050 1055
Cys Cys Asn Lys Asn Glu Val Val Val Glu Phe Arg Trp Asn Lys Pro
1060 1065 1070
Lys His Glu Asn Gly Val Leu Thr Lys Phe Glu Ile Phe Tyr Asn Ile
1075 1080 1085
Ser Asn Gln Ser Ile Thr Asn Lys Thr Cys Glu Asp Trp Ile Ala Val
1090 1095 1100
Asn Val Thr Pro Ser Val Met Ser Phe Gln Leu Glu Gly Met Ser Pro
1105 1110 1115 1120
Arg Cys Phe Ile Ala Phe Gln Val Arg Ala Phe Thr Ser Lys Gly Pro
1125 1130 1135
Gly Pro Tyr Ala Asp Val Val Lys Ser Thr Thr Ser Glu Ile Asn Pro
1140 1145 1150
Phe Pro His Leu Ile Thr Leu Leu Gly Asn Lys Ile Val Phe Leu Asp
1155 1160 1165
Met Asp Gln Asn Gln Val Val Trp Thr Phe Ser Ala Glu Arg Val Ile
1170 1175 1180
Ser Ala Val Cys Tyr Thr Ala Asp Asn Glu Met Gly Tyr Tyr Ala Glu
1185 1190 1195 1200
Gly Asp Ser Leu Phe Leu Leu His Leu His Asn Arg Ser Ser Ser Glu
1205 1210 1215
Leu Phe Gln Asp Ser Leu Val Phe Asp Ile Thr Val Ile Thr Ile Asp
1220 1225 1230
Trp Ile Ser Arg His Leu Tyr Phe Ala Leu Lys Glu Ser Gln Asn Gly
1235 1240 1245
Met Gln Val Phe Asp Val Asp Leu Glu His Lys Val Lys Tyr Pro Arg
1250 1255 1260
Glu Val Lys Ile His Asn Arg Asn Ser Thr Ile Ile Ser Phe Ser Val
1265 1270 1275 1280
Tyr Pro Leu Leu Ser Arg Leu Tyr Trp Thr Glu Val Ser Asn Phe Gly
1285 1290 1295
Tyr Gln Met Phe Tyr Tyr Ser Ile Ile Ser His Thr Leu His Arg Ile
1300 1305 1310
Leu Gln Pro Thr Ala Thr Asn Gln Gln Asn Lys Arg Asn Gln Cys Ser
1315 1320 1325
Cys Asn Val Thr Glu Phe Glu Leu Ser Gly Ala Met Ala Ile Asp Thr
1330 1335 1340
Ser Asn Leu Glu Lys Pro Leu Ile Tyr Phe Ala Lys Ala Gln Glu Ile
1345 1350 1355 1360
Trp Ala Met Asp Leu Glu Gly Cys Gln Cys Trp Arg Val Ile Thr Val
1365 1370 1375
Pro Ala Met Leu Ala Gly Lys Thr Leu Val Ser Leu Thr Val Asp Gly
1380 1385 1390
Asp Leu Ile Tyr Trp Ile Ile Thr Ala Lys Asp Ser Thr Gln Ile Tyr
1395 1400 1405
Gln Ala Lys Lys Gly Asn Gly Ala Ile Val Ser Gln Val Lys Ala Leu
1410 1415 1420
Arg Ser Arg His Ile Leu Ala Tyr Ser Ser Val Met Gln Pro Phe Pro
1425 1430 1435 1440
Asp Lys Ala Phe Leu Ser Leu Ala Ser Asp Thr Val Glu Pro Thr Ile
1445 1450 1455
Leu Asn Ala Thr Asn Thr Ser Leu Thr Ile Arg Leu Pro Leu Ala Lys
1460 1465 1470
Thr Asn Leu Thr Trp Tyr Gly Ile Thr Ser Pro Thr Pro Thr Tyr Leu
1475 1480 1485
Val Tyr Tyr Ala Glu Val Asn Asp Arg Lys Asn Ser Ser Asp Leu Lys
1490 1495 1500
Tyr Arg Ile Leu Glu Phe Gln Asp Ser Ile Ala Leu Ile Glu Asp Leu
1505 1510 1515 1520
Gln Pro Phe Ser Thr Tyr Met Ile Gln Ile Ala Val Lys Asn Tyr Tyr
1525 1530 1535
Ser Asp Pro Leu Glu His Leu Pro Pro Gly Lys Glu Ile Trp Gly Lys
1540 1545 1550
Thr Lys Asn Gly Val Pro Glu Ala Val Gln Leu Ile Asn Thr Thr Val
1555 1560 1565
Arg Ser Asp Thr Ser Leu Ile Ile Ser Trp Arg Glu Ser His Lys Pro
1570 1575 1580
Asn Gly Pro Lys Glu Ser Val Arg Tyr Gln Leu Ala Ile Ser His Leu
1585 1590 1595 1600
Ala Leu Ile Pro Glu Thr Pro Leu Arg Gln Ser Glu Phe Pro Asn Gly
1605 1610 1615
Arg Leu Thr Leu Leu Val Thr Arg Leu Ser Gly Gly Asn Ile Tyr Val
1620 1625 1630
Leu Lys Val Leu Ala Cys His Ser Glu Glu Met Trp Cys Thr Glu Ser
1635 1640 1645
His Pro Val Thr Val Glu Met Phe Asn Thr Pro Glu Lys Pro Tyr Ser
1650 1655 1660
Leu Val Pro Glu Asn Thr Ser Leu Gln Phe Asn Trp Lys Ala Pro Leu
1665 1670 1675 1680
Asn Val Asn Leu Ile Arg Phe Trp Val Glu Leu Gln Lys Trp Lys Tyr
1685 1690 1695
Asn Glu Phe Tyr His Val Lys Thr Ser Cys Ser Gln Gly Pro Ala Tyr
1700 1705 1710
Val Cys Asn Ile Thr Asn Leu Gln Pro Tyr Thr Ser Tyr Asn Val Arg
1715 1720 1725
Val Val Val Val Tyr Lys Thr Gly Glu Asn Ser Thr Ser Leu Pro Glu
1730 1735 1740
Ser Phe Lys Thr Lys Ala Gly Val Pro Asn Lys Pro Gly Ile Pro Lys
1745 1750 1755 1760
Leu Leu Glu Gly Ser Lys Asn Ser Ile Gln Trp Glu Lys Ala Glu Asp
1765 1770 1775
Asn Gly Cys Arg Ile Thr Tyr Tyr Ile Leu Glu Ile Arg Lys Ser Thr
1780 1785 1790
Ser Asn Asn Leu Gln Asn Gln Asn Leu Arg Trp Lys Met Thr Phe Asn
1795 1800 1805
Gly Ser Cys Ser Ser Val Cys Thr Trp Lys Ser Lys Asn Leu Lys Gly
1810 1815 1820
Ile Phe Gln Phe Arg Val Val Ala Ala Asn Asn Leu Gly Phe Gly Glu
1825 1830 1835 1840
Tyr Ser Gly Ile Ser Glu Asn Ile Ile Leu Val Gly Asp Asp Phe Trp
1845 1850 1855
Ile Pro Glu Thr Ser Phe Ile Leu Thr Ile Ile Val Gly Ile Phe Leu
1860 1865 1870
Val Val Thr Ile Pro Leu Thr Phe Val Trp His Arg Arg Leu Lys Asn
1875 1880 1885
Gln Lys Ser Ala Lys Glu Gly Val Thr Val Leu Ile Asn Glu Asp Lys
1890 1895 1900
Glu Leu Ala Glu Leu Arg Gly Leu Ala Ala Gly Val Gly Leu Ala Asn
1905 1910 1915 1920
Ala Cys Tyr Ala Ile His Thr Leu Pro Thr Gln Glu Glu Ile Glu Asn
1925 1930 1935
Leu Pro Ala Phe Pro Arg Glu Lys Leu Thr Leu Arg Leu Leu Leu Gly
1940 1945 1950
Ser Gly Ala Phe Gly Glu Val Tyr Glu Gly Thr Ala Val Asp Ile Leu
1955 1960 1965
Gly Val Gly Ser Gly Glu Ile Lys Val Ala Val Lys Thr Leu Lys Lys
1970 1975 1980
Gly Ser Thr Asp Gln Glu Lys Ile Glu Phe Leu Lys Glu Ala His Leu
1985 1990 1995 2000
Met Ser Lys Phe Asn His Pro Asn Ile Leu Lys Gln Leu Gly Val Cys
2005 2010 2015
Leu Leu Asn Glu Pro Gln Tyr Ile Ile Leu Glu Leu Met Glu Gly Gly
2020 2025 2030
Asp Leu Leu Thr Tyr Leu Arg Lys Ala Arg Met Ala Thr Phe Tyr Gly
2035 2040 2045
Pro Leu Leu Thr Leu Val Asp Leu Val Asp Leu Cys Val Asp Ile Ser
2050 2055 2060
Lys Gly Cys Val Tyr Leu Glu Arg Met His Phe Ile His Arg Asp Leu
2065 2070 2075 2080
Ala Ala Arg Asn Cys Leu Val Ser Val Lys Asp Tyr Thr Ser Pro Arg
2085 2090 2095
Ile Val Lys Ile Gly Asp Phe Gly Leu Ala Arg Asp Ile Tyr Lys Asn
2100 2105 2110
Asp Tyr Tyr Arg Lys Arg Gly Glu Gly Leu Leu Pro Val Arg Trp Met
2115 2120 2125
Ala Pro Glu Ser Leu Met Asp Gly Ile Phe Thr Thr Gln Ser Asp Val
2130 2135 2140
Trp Ser Phe Gly Ile Leu Ile Trp Glu Ile Leu Thr Leu Gly His Gln
2145 2150 2155 2160
Pro Tyr Pro Ala His Ser Asn Leu Asp Val Leu Asn Tyr Val Gln Thr
2165 2170 2175
Gly Gly Arg Leu Glu Pro Pro Arg Asn Cys Pro Asp Asp Leu Trp Asn
2180 2185 2190
Leu Met Thr Gln Cys Trp Ala Gln Glu Pro Asp Gln Arg Pro Thr Phe
2195 2200 2205
His Arg Ile Gln Asp Gln Leu Gln Leu Phe Arg Asn Phe Phe Leu Asn
2210 2215 2220
Ser Ile Tyr Lys Ser Arg Asp Glu Ala Asn Asn Ser Gly Val Ile Asn
2225 2230 2235 2240
Glu Ser Phe Glu Gly Glu Asp Gly Asp Val Ile Cys Leu Asn Ser Asp
2245 2250 2255
Asp Ile Met Pro Val Ala Leu Met Glu Thr Lys Asn Arg Glu Gly Leu
2260 2265 2270
Asn Tyr Met Val Leu Ala Thr Glu Cys Gly Gln Gly Glu Glu Lys Ser
2275 2280 2285
Glu Gly Pro Leu Gly Ser Gln Glu Ser Glu Ser Cys Gly Leu Arg Lys
2290 2295 2300
Glu Glu Lys Glu Pro His Ala Asp Lys Asp Phe Cys Gln Glu Lys Gln
2305 2310 2315 2320
Val Ala Tyr Cys Pro Ser Gly Lys Pro Glu Gly Leu Asn Tyr Ala Cys
2325 2330 2335
Leu Thr His Ser Gly Tyr Gly Asp Gly Ser Asp
2340 2345
<210> 11
<211> 466
<212> PRT
<213> Artificial Sequence
<220>
<223> ROS1 protein fragment
<400> 11
Trp His Arg Arg Leu Lys Asn Gln Lys Ser Ala Lys Glu Gly Val Thr
1 5 10 15
Val Leu Ile Asn Glu Asp Lys Glu Leu Ala Glu Leu Arg Gly Leu Ala
20 25 30
Ala Gly Val Gly Leu Ala Asn Ala Cys Tyr Ala Ile His Thr Leu Pro
35 40 45
Thr Gln Glu Glu Ile Glu Asn Leu Pro Ala Phe Pro Arg Glu Lys Leu
50 55 60
Thr Leu Arg Leu Leu Leu Gly Ser Gly Ala Phe Gly Glu Val Tyr Glu
65 70 75 80
Gly Thr Ala Val Asp Ile Leu Gly Val Gly Ser Gly Glu Ile Lys Val
85 90 95
Ala Val Lys Thr Leu Lys Lys Gly Ser Thr Asp Gln Glu Lys Ile Glu
100 105 110
Phe Leu Lys Glu Ala His Leu Met Ser Lys Phe Asn His Pro Asn Ile
115 120 125
Leu Lys Gln Leu Gly Val Cys Leu Leu Asn Glu Pro Gln Tyr Ile Ile
130 135 140
Leu Glu Leu Met Glu Gly Gly Asp Leu Leu Thr Tyr Leu Arg Lys Ala
145 150 155 160
Arg Met Ala Thr Phe Tyr Gly Pro Leu Leu Thr Leu Val Asp Leu Val
165 170 175
Asp Leu Cys Val Asp Ile Ser Lys Gly Cys Val Tyr Leu Glu Arg Met
180 185 190
His Phe Ile His Arg Asp Leu Ala Ala Arg Asn Cys Leu Val Ser Val
195 200 205
Lys Asp Tyr Thr Ser Pro Arg Ile Val Lys Ile Gly Asp Phe Gly Leu
210 215 220
Ala Arg Asp Ile Tyr Lys Asn Asp Tyr Tyr Arg Lys Arg Gly Glu Gly
225 230 235 240
Leu Leu Pro Val Arg Trp Met Ala Pro Glu Ser Leu Met Asp Gly Ile
245 250 255
Phe Thr Thr Gln Ser Asp Val Trp Ser Phe Gly Ile Leu Ile Trp Glu
260 265 270
Ile Leu Thr Leu Gly His Gln Pro Tyr Pro Ala His Ser Asn Leu Asp
275 280 285
Val Leu Asn Tyr Val Gln Thr Gly Gly Arg Leu Glu Pro Pro Arg Asn
290 295 300
Cys Pro Asp Asp Leu Trp Asn Leu Met Thr Gln Cys Trp Ala Gln Glu
305 310 315 320
Pro Asp Gln Arg Pro Thr Phe His Arg Ile Gln Asp Gln Leu Gln Leu
325 330 335
Phe Arg Asn Phe Phe Leu Asn Ser Ile Tyr Lys Ser Arg Asp Glu Ala
340 345 350
Asn Asn Ser Gly Val Ile Asn Glu Ser Phe Glu Gly Glu Asp Gly Asp
355 360 365
Val Ile Cys Leu Asn Ser Asp Asp Ile Met Pro Val Ala Leu Met Glu
370 375 380
Thr Lys Asn Arg Glu Gly Leu Asn Tyr Met Val Leu Ala Thr Glu Cys
385 390 395 400
Gly Gln Gly Glu Glu Lys Ser Glu Gly Pro Leu Gly Ser Gln Glu Ser
405 410 415
Glu Ser Cys Gly Leu Arg Lys Glu Glu Lys Glu Pro His Ala Asp Lys
420 425 430
Asp Phe Cys Gln Glu Lys Gln Val Ala Tyr Cys Pro Ser Gly Lys Pro
435 440 445
Glu Gly Leu Asn Tyr Ala Cys Leu Thr His Ser Gly Tyr Gly Asp Gly
450 455 460
Ser Asp
465
<210> 12
<211> 5
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of ROS1 protein fragment
<400> 12
Trp His Arg Arg Leu
1 5
<210> 13
<211> 2250
<212> DNA
<213> Artificial Sequence
<220>
<223> CCDC6-ROS1 fusion gene
<400> 13
atggcggaca gcgccagcga gagcgacacg gacggggcgg ggggcaacag cagcagctcg 60
gccgccatgc agtcgtcctg ctcgtcgacc tcgggcggcg gcggtggcgg cgggggaggc 120
ggcggcggtg ggaagtcggg gggcattgtc atctcgccgt tccgcctgga ggagctcacc 180
aaccgcctgg cctcgctgca gcaagagaac aaggtgctga agatagagct ggagacctac 240
aaactgaagt gcaaggcact gcaggaggag aaccgcgacc tgcgcaaagc cagcgtgacc 300
atccaagcca gggctgagca ggaagaagaa ttcattagta acactttatt caagaaaatt 360
caggctttgc agaaggagaa agaaaccctt gctgtaaatt atgagaaaga agaagaattc 420
ctcactaatg agctctccag aaaattgatg cagttgcagc atgagaaagc cgaactagaa 480
cagcatcttg aacaagagca ggaatttcag gtcaacaaac tgatgaagaa aattaaaaaa 540
ctggagaatg acaccatttc taagcaactt acattagaac agttgagacg ggagaagatt 600
gaccttgaaa atacattgga acaagaacaa gaagcactag ttaatcgcct ctggaaaagg 660
atggataagc ttgaagctga aaagcgaatc ctgcaggaaa aattagacca gcccgtctct 720
gctccaccat cgcctagaga tatctccatg gagattgatt ctccagaaaa tatgatgcgt 780
cacatcaggt ttttaaagaa tgaagtggaa cggctgaaga agcaactgag agctgctcag 840
ttacagctct ggcatagaag attaaagaat caaaaaagtg ccaaggaagg ggtgacagtg 900
cttataaacg aagacaaaga gttggctgag ctgcgaggtc tggcagccgg agtaggcctg 960
gctaatgcct gctatgcaat acatactctt ccaacccaag aggagattga aaatcttcct 1020
gccttccctc gggaaaaact gactctgcgt ctcttgctgg gaagtggagc ctttggagaa 1080
gtgtatgaag gaacagcagt ggacatctta ggagttggaa gtggagaaat caaagtagca 1140
gtgaagactt tgaagaaggg ttccacagac caggagaaga ttgaattcct gaaggaggca 1200
catctgatga gcaaatttaa tcatcccaac attctgaagc agcttggagt ttgtctgctg 1260
aatgaacccc aatacattat cctggaactg atggagggag gagaccttct tacttatttg 1320
cgtaaagccc ggatggcaac gttttatggt cctttactca ccttggttga ccttgtagac 1380
ctgtgtgtag atatttcaaa aggctgtgtc tacttggaac ggatgcattt cattcacagg 1440
gatctggcag ctagaaattg ccttgtttcc gtgaaagact ataccagtcc acggatagtg 1500
aagattggag actttggact cgccagagac atctataaaa atgattacta tagaaagaga 1560
ggggaaggcc tgctcccagt tcggtggatg gctccagaaa gtttgatgga tggaatcttc 1620
actactcaat ctgatgtatg gtcttttgga attctgattt gggagatttt aactcttggt 1680
catcagcctt atccagctca ttccaacctt gatgtgttaa actatgtgca aacaggaggg 1740
agactggagc caccaagaaa ttgtcctgat gatctgtgga atttaatgac ccagtgctgg 1800
gctcaagaac ccgaccaaag acctactttt catagaattc aggaccaact tcagttattc 1860
agaaattttt tcttaaatag catttataag tccagagatg aagcaaacaa cagtggagtc 1920
ataaatgaaa gctttgaagg tgaagatggc gatgtgattt gtttgaattc agatgacatt 1980
atgccagttg ctttaatgga aacgaagaac cgagaagggt taaactatat ggtacttgct 2040
acagaatgtg gccaaggtga agaaaagtct gagggtcctc taggctccca ggaatctgaa 2100
tcttgtggtc tgaggaaaga agagaaggaa ccacatgcag acaaagattt ctgccaagaa 2160
aaacaagtgg cttactgccc ttctggcaag cctgaaggcc tgaactatgc ctgtctcact 2220
cacagtggat atggagatgg gtctgattaa 2250
<210> 14
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> Fused region of CCDC6-ROS1 fusion gene
<400> 14
gctgctcagt tacagctctg gcatagaaga tta 33
<210> 15
<211> 749
<212> PRT
<213> Artificial Sequence
<220>
<223> CCDC6-ROS1 fusion protein
<400> 15
Met Ala Asp Ser Ala Ser Glu Ser Asp Thr Asp Gly Ala Gly Gly Asn
1 5 10 15
Ser Ser Ser Ser Ala Ala Met Gln Ser Ser Cys Ser Ser Thr Ser Gly
20 25 30
Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Lys Ser Gly Gly
35 40 45
Ile Val Ile Ser Pro Phe Arg Leu Glu Glu Leu Thr Asn Arg Leu Ala
50 55 60
Ser Leu Gln Gln Glu Asn Lys Val Leu Lys Ile Glu Leu Glu Thr Tyr
65 70 75 80
Lys Leu Lys Cys Lys Ala Leu Gln Glu Glu Asn Arg Asp Leu Arg Lys
85 90 95
Ala Ser Val Thr Ile Gln Ala Arg Ala Glu Gln Glu Glu Glu Phe Ile
100 105 110
Ser Asn Thr Leu Phe Lys Lys Ile Gln Ala Leu Gln Lys Glu Lys Glu
115 120 125
Thr Leu Ala Val Asn Tyr Glu Lys Glu Glu Glu Phe Leu Thr Asn Glu
130 135 140
Leu Ser Arg Lys Leu Met Gln Leu Gln His Glu Lys Ala Glu Leu Glu
145 150 155 160
Gln His Leu Glu Gln Glu Gln Glu Phe Gln Val Asn Lys Leu Met Lys
165 170 175
Lys Ile Lys Lys Leu Glu Asn Asp Thr Ile Ser Lys Gln Leu Thr Leu
180 185 190
Glu Gln Leu Arg Arg Glu Lys Ile Asp Leu Glu Asn Thr Leu Glu Gln
195 200 205
Glu Gln Glu Ala Leu Val Asn Arg Leu Trp Lys Arg Met Asp Lys Leu
210 215 220
Glu Ala Glu Lys Arg Ile Leu Gln Glu Lys Leu Asp Gln Pro Val Ser
225 230 235 240
Ala Pro Pro Ser Pro Arg Asp Ile Ser Met Glu Ile Asp Ser Pro Glu
245 250 255
Asn Met Met Arg His Ile Arg Phe Leu Lys Asn Glu Val Glu Arg Leu
260 265 270
Lys Lys Gln Leu Arg Ala Ala Gln Leu Gln Leu Trp His Arg Arg Leu
275 280 285
Lys Asn Gln Lys Ser Ala Lys Glu Gly Val Thr Val Leu Ile Asn Glu
290 295 300
Asp Lys Glu Leu Ala Glu Leu Arg Gly Leu Ala Ala Gly Val Gly Leu
305 310 315 320
Ala Asn Ala Cys Tyr Ala Ile His Thr Leu Pro Thr Gln Glu Glu Ile
325 330 335
Glu Asn Leu Pro Ala Phe Pro Arg Glu Lys Leu Thr Leu Arg Leu Leu
340 345 350
Leu Gly Ser Gly Ala Phe Gly Glu Val Tyr Glu Gly Thr Ala Val Asp
355 360 365
Ile Leu Gly Val Gly Ser Gly Glu Ile Lys Val Ala Val Lys Thr Leu
370 375 380
Lys Lys Gly Ser Thr Asp Gln Glu Lys Ile Glu Phe Leu Lys Glu Ala
385 390 395 400
His Leu Met Ser Lys Phe Asn His Pro Asn Ile Leu Lys Gln Leu Gly
405 410 415
Val Cys Leu Leu Asn Glu Pro Gln Tyr Ile Ile Leu Glu Leu Met Glu
420 425 430
Gly Gly Asp Leu Leu Thr Tyr Leu Arg Lys Ala Arg Met Ala Thr Phe
435 440 445
Tyr Gly Pro Leu Leu Thr Leu Val Asp Leu Val Asp Leu Cys Val Asp
450 455 460
Ile Ser Lys Gly Cys Val Tyr Leu Glu Arg Met His Phe Ile His Arg
465 470 475 480
Asp Leu Ala Ala Arg Asn Cys Leu Val Ser Val Lys Asp Tyr Thr Ser
485 490 495
Pro Arg Ile Val Lys Ile Gly Asp Phe Gly Leu Ala Arg Asp Ile Tyr
500 505 510
Lys Asn Asp Tyr Tyr Arg Lys Arg Gly Glu Gly Leu Leu Pro Val Arg
515 520 525
Trp Met Ala Pro Glu Ser Leu Met Asp Gly Ile Phe Thr Thr Gln Ser
530 535 540
Asp Val Trp Ser Phe Gly Ile Leu Ile Trp Glu Ile Leu Thr Leu Gly
545 550 555 560
His Gln Pro Tyr Pro Ala His Ser Asn Leu Asp Val Leu Asn Tyr Val
565 570 575
Gln Thr Gly Gly Arg Leu Glu Pro Pro Arg Asn Cys Pro Asp Asp Leu
580 585 590
Trp Asn Leu Met Thr Gln Cys Trp Ala Gln Glu Pro Asp Gln Arg Pro
595 600 605
Thr Phe His Arg Ile Gln Asp Gln Leu Gln Leu Phe Arg Asn Phe Phe
610 615 620
Leu Asn Ser Ile Tyr Lys Ser Arg Asp Glu Ala Asn Asn Ser Gly Val
625 630 635 640
Ile Asn Glu Ser Phe Glu Gly Glu Asp Gly Asp Val Ile Cys Leu Asn
645 650 655
Ser Asp Asp Ile Met Pro Val Ala Leu Met Glu Thr Lys Asn Arg Glu
660 665 670
Gly Leu Asn Tyr Met Val Leu Ala Thr Glu Cys Gly Gln Gly Glu Glu
675 680 685
Lys Ser Glu Gly Pro Leu Gly Ser Gln Glu Ser Glu Ser Cys Gly Leu
690 695 700
Arg Lys Glu Glu Lys Glu Pro His Ala Asp Lys Asp Phe Cys Gln Glu
705 710 715 720
Lys Gln Val Ala Tyr Cys Pro Ser Gly Lys Pro Glu Gly Leu Asn Tyr
725 730 735
Ala Cys Leu Thr His Ser Gly Tyr Gly Asp Gly Ser Asp
740 745
<210> 16
<211> 11
<212> PRT
<213> Artificial Sequence
<220>
<223> Fused region of CCDC6-ROS1 fusion protein
<400> 16
Ala Ala Gln Leu Gln Leu Trp His Arg Arg Leu
1 5 10
<210> 17
<211> 2130
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of FGFR2 gene (NM_001144914)
<400> 17
atggtcagct ggggtcgttt catctgcctg gtcgtggtca ccatggcaac cttgtccctg 60
gcccggccct ccttcagttt agttgaggat accacattag agccagaaga gccaccaacc 120
aaataccaaa tctctcaacc agaagtgtac gtggctgcgc caggggagtc gctagaggtg 180
cgctgcctgt tgaaagatgc cgccgtgatc agttggacta aggatggggt gcacttgggg 240
cccaacaata ggacagtgct tattggggag tacttgcaga taaagggcgc cacgcctaga 300
gactccggcc tctatgcttg tactgccagt aggactgtag acagtgaaac ttggtacttc 360
atggtgaatg tcacagatgc catctcatcc ggagatgatg aggatgacac cgatggtgcg 420
gaagattttg tcagtgagaa cagtaacaac aagagagcac catactggac caacacagaa 480
aagatggaaa agcggctcca tgctgtgcct gcggccaaca ctgtcaagtt tcgctgccca 540
gccgggggga acccaatgcc aaccatgcgg tggctgaaaa acgggaagga gtttaagcag 600
gagcatcgca ttggaggcta caaggtacga aaccagcact ggagcctcat tatggaaagt 660
gtggtcccat ctgacaaggg aaattatacc tgtgtagtgg agaatgaata cgggtccatc 720
aatcacacgt accacctgga tgttgtggcg cctggaagag aaaaggagat tacagcttcc 780
ccagactacc tggagatagc catttactgc ataggggtct tcttaatcgc ctgtatggtg 840
gtaacagtca tcctgtgccg aatgaagaac acgaccaaga agccagactt cagcagccag 900
ccggctgtgc acaagctgac caaacgtatc cccctgcgga gacaggtaac agtttcggct 960
gagtccagct cctccatgaa ctccaacacc ccgctggtga ggataacaac acgcctctct 1020
tcaacggcag acacccccat gctggcaggg gtctccgagt atgaacttcc agaggaccca 1080
aaatgggagt ttccaagaga taagctgaca ctgggcaagc ccctgggaga aggttgcttt 1140
gggcaagtgg tcatggcgga agcagtggga attgacaaag acaagcccaa ggaggcggtc 1200
accgtggccg tgaagatgtt gaaagatgat gccacagaga aagacctttc tgatctggtg 1260
tcagagatgg agatgatgaa gatgattggg aaacacaaga atatcataaa tcttcttgga 1320
gcctgcacac aggatgggcc tctctatgtc atagttgagt atgcctctaa aggcaacctc 1380
cgagaatacc tccgagcccg gaggccaccc gggatggagt actcctatga cattaaccgt 1440
gttcctgagg agcagatgac cttcaaggac ttggtgtcat gcacctacca gctggccaga 1500
ggcatggagt acttggcttc ccaaaaatgt attcatcgag atttagcagc cagaaatgtt 1560
ttggtaacag aaaacaatgt gatgaaaata gcagactttg gactcgccag agatatcaac 1620
aatatagact attacaaaaa gaccaccaat gggcggcttc cagtcaagtg gatggctcca 1680
gaagccctgt ttgatagagt atacactcat cagagtgatg tctggtcctt cggggtgtta 1740
atgtgggaga tcttcacttt agggggctcg ccctacccag ggattcccgt ggaggaactt 1800
tttaagctgc tgaaggaagg acacagaatg gataagccag ccaactgcac caacgaactg 1860
tacatgatga tgagggactg ttggcatgca gtgccctccc agagaccaac gttcaagcag 1920
ttggtagaag acttggatcg aattctcact ctcacaacca atgaggaata cttggacctc 1980
agccaacctc tcgaacagta ttcacctagt taccctgaca caagaagttc ttgttcttca 2040
ggagatgatt ctgttttttc tccagacccc atgccttacg aaccatgcct tcctcagtat 2100
ccacacataa acggcagtgt taaaacatga 2130
<210> 18
<211> 1965
<212> DNA
<213> Artificial Sequence
<220>
<223> FGFR2 gene fragment
<400> 18
atggtcagct ggggtcgttt catctgcctg gtcgtggtca ccatggcaac cttgtccctg 60
gcccggccct ccttcagttt agttgaggat accacattag agccagaaga gccaccaacc 120
aaataccaaa tctctcaacc agaagtgtac gtggctgcgc caggggagtc gctagaggtg 180
cgctgcctgt tgaaagatgc cgccgtgatc agttggacta aggatggggt gcacttgggg 240
cccaacaata ggacagtgct tattggggag tacttgcaga taaagggcgc cacgcctaga 300
gactccggcc tctatgcttg tactgccagt aggactgtag acagtgaaac ttggtacttc 360
atggtgaatg tcacagatgc catctcatcc ggagatgatg aggatgacac cgatggtgcg 420
gaagattttg tcagtgagaa cagtaacaac aagagagcac catactggac caacacagaa 480
aagatggaaa agcggctcca tgctgtgcct gcggccaaca ctgtcaagtt tcgctgccca 540
gccgggggga acccaatgcc aaccatgcgg tggctgaaaa acgggaagga gtttaagcag 600
gagcatcgca ttggaggcta caaggtacga aaccagcact ggagcctcat tatggaaagt 660
gtggtcccat ctgacaaggg aaattatacc tgtgtagtgg agaatgaata cgggtccatc 720
aatcacacgt accacctgga tgttgtggcg cctggaagag aaaaggagat tacagcttcc 780
ccagactacc tggagatagc catttactgc ataggggtct tcttaatcgc ctgtatggtg 840
gtaacagtca tcctgtgccg aatgaagaac acgaccaaga agccagactt cagcagccag 900
ccggctgtgc acaagctgac caaacgtatc cccctgcgga gacaggtaac agtttcggct 960
gagtccagct cctccatgaa ctccaacacc ccgctggtga ggataacaac acgcctctct 1020
tcaacggcag acacccccat gctggcaggg gtctccgagt atgaacttcc agaggaccca 1080
aaatgggagt ttccaagaga taagctgaca ctgggcaagc ccctgggaga aggttgcttt 1140
gggcaagtgg tcatggcgga agcagtggga attgacaaag acaagcccaa ggaggcggtc 1200
accgtggccg tgaagatgtt gaaagatgat gccacagaga aagacctttc tgatctggtg 1260
tcagagatgg agatgatgaa gatgattggg aaacacaaga atatcataaa tcttcttgga 1320
gcctgcacac aggatgggcc tctctatgtc atagttgagt atgcctctaa aggcaacctc 1380
cgagaatacc tccgagcccg gaggccaccc gggatggagt actcctatga cattaaccgt 1440
gttcctgagg agcagatgac cttcaaggac ttggtgtcat gcacctacca gctggccaga 1500
ggcatggagt acttggcttc ccaaaaatgt attcatcgag atttagcagc cagaaatgtt 1560
ttggtaacag aaaacaatgt gatgaaaata gcagactttg gactcgccag agatatcaac 1620
aatatagact attacaaaaa gaccaccaat gggcggcttc cagtcaagtg gatggctcca 1680
gaagccctgt ttgatagagt atacactcat cagagtgatg tctggtcctt cggggtgtta 1740
atgtgggaga tcttcacttt agggggctcg ccctacccag ggattcccgt ggaggaactt 1800
tttaagctgc tgaaggaagg acacagaatg gataagccag ccaactgcac caacgaactg 1860
tacatgatga tgagggactg ttggcatgca gtgccctccc agagaccaac gttcaagcag 1920
ttggtagaag acttggatcg aattctcact ctcacaacca atgag 1965
<210> 19
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of FGFR2 gene fragment
<400> 19
ctcactctca caaccaatga g 21
<210> 20
<211> 709
<212> PRT
<213> Artificial Sequence
<220>
<223> FGFR2 protein
<400> 20
Met Val Ser Trp Gly Arg Phe Ile Cys Leu Val Val Val Thr Met Ala
1 5 10 15
Thr Leu Ser Leu Ala Arg Pro Ser Phe Ser Leu Val Glu Asp Thr Thr
20 25 30
Leu Glu Pro Glu Glu Pro Pro Thr Lys Tyr Gln Ile Ser Gln Pro Glu
35 40 45
Val Tyr Val Ala Ala Pro Gly Glu Ser Leu Glu Val Arg Cys Leu Leu
50 55 60
Lys Asp Ala Ala Val Ile Ser Trp Thr Lys Asp Gly Val His Leu Gly
65 70 75 80
Pro Asn Asn Arg Thr Val Leu Ile Gly Glu Tyr Leu Gln Ile Lys Gly
85 90 95
Ala Thr Pro Arg Asp Ser Gly Leu Tyr Ala Cys Thr Ala Ser Arg Thr
100 105 110
Val Asp Ser Glu Thr Trp Tyr Phe Met Val Asn Val Thr Asp Ala Ile
115 120 125
Ser Ser Gly Asp Asp Glu Asp Asp Thr Asp Gly Ala Glu Asp Phe Val
130 135 140
Ser Glu Asn Ser Asn Asn Lys Arg Ala Pro Tyr Trp Thr Asn Thr Glu
145 150 155 160
Lys Met Glu Lys Arg Leu His Ala Val Pro Ala Ala Asn Thr Val Lys
165 170 175
Phe Arg Cys Pro Ala Gly Gly Asn Pro Met Pro Thr Met Arg Trp Leu
180 185 190
Lys Asn Gly Lys Glu Phe Lys Gln Glu His Arg Ile Gly Gly Tyr Lys
195 200 205
Val Arg Asn Gln His Trp Ser Leu Ile Met Glu Ser Val Val Pro Ser
210 215 220
Asp Lys Gly Asn Tyr Thr Cys Val Val Glu Asn Glu Tyr Gly Ser Ile
225 230 235 240
Asn His Thr Tyr His Leu Asp Val Val Ala Pro Gly Arg Glu Lys Glu
245 250 255
Ile Thr Ala Ser Pro Asp Tyr Leu Glu Ile Ala Ile Tyr Cys Ile Gly
260 265 270
Val Phe Leu Ile Ala Cys Met Val Val Thr Val Ile Leu Cys Arg Met
275 280 285
Lys Asn Thr Thr Lys Lys Pro Asp Phe Ser Ser Gln Pro Ala Val His
290 295 300
Lys Leu Thr Lys Arg Ile Pro Leu Arg Arg Gln Val Thr Val Ser Ala
305 310 315 320
Glu Ser Ser Ser Ser Met Asn Ser Asn Thr Pro Leu Val Arg Ile Thr
325 330 335
Thr Arg Leu Ser Ser Thr Ala Asp Thr Pro Met Leu Ala Gly Val Ser
340 345 350
Glu Tyr Glu Leu Pro Glu Asp Pro Lys Trp Glu Phe Pro Arg Asp Lys
355 360 365
Leu Thr Leu Gly Lys Pro Leu Gly Glu Gly Cys Phe Gly Gln Val Val
370 375 380
Met Ala Glu Ala Val Gly Ile Asp Lys Asp Lys Pro Lys Glu Ala Val
385 390 395 400
Thr Val Ala Val Lys Met Leu Lys Asp Asp Ala Thr Glu Lys Asp Leu
405 410 415
Ser Asp Leu Val Ser Glu Met Glu Met Met Lys Met Ile Gly Lys His
420 425 430
Lys Asn Ile Ile Asn Leu Leu Gly Ala Cys Thr Gln Asp Gly Pro Leu
435 440 445
Tyr Val Ile Val Glu Tyr Ala Ser Lys Gly Asn Leu Arg Glu Tyr Leu
450 455 460
Arg Ala Arg Arg Pro Pro Gly Met Glu Tyr Ser Tyr Asp Ile Asn Arg
465 470 475 480
Val Pro Glu Glu Gln Met Thr Phe Lys Asp Leu Val Ser Cys Thr Tyr
485 490 495
Gln Leu Ala Arg Gly Met Glu Tyr Leu Ala Ser Gln Lys Cys Ile His
500 505 510
Arg Asp Leu Ala Ala Arg Asn Val Leu Val Thr Glu Asn Asn Val Met
515 520 525
Lys Ile Ala Asp Phe Gly Leu Ala Arg Asp Ile Asn Asn Ile Asp Tyr
530 535 540
Tyr Lys Lys Thr Thr Asn Gly Arg Leu Pro Val Lys Trp Met Ala Pro
545 550 555 560
Glu Ala Leu Phe Asp Arg Val Tyr Thr His Gln Ser Asp Val Trp Ser
565 570 575
Phe Gly Val Leu Met Trp Glu Ile Phe Thr Leu Gly Gly Ser Pro Tyr
580 585 590
Pro Gly Ile Pro Val Glu Glu Leu Phe Lys Leu Leu Lys Glu Gly His
595 600 605
Arg Met Asp Lys Pro Ala Asn Cys Thr Asn Glu Leu Tyr Met Met Met
610 615 620
Arg Asp Cys Trp His Ala Val Pro Ser Gln Arg Pro Thr Phe Lys Gln
625 630 635 640
Leu Val Glu Asp Leu Asp Arg Ile Leu Thr Leu Thr Thr Asn Glu Glu
645 650 655
Tyr Leu Asp Leu Ser Gln Pro Leu Glu Gln Tyr Ser Pro Ser Tyr Pro
660 665 670
Asp Thr Arg Ser Ser Cys Ser Ser Gly Asp Asp Ser Val Phe Ser Pro
675 680 685
Asp Pro Met Pro Tyr Glu Pro Cys Leu Pro Gln Tyr Pro His Ile Asn
690 695 700
Gly Ser Val Lys Thr
705
<210> 21
<211> 655
<212> PRT
<213> Artificial Sequence
<220>
<223> FGFR2 protein fragment
<400> 21
Met Val Ser Trp Gly Arg Phe Ile Cys Leu Val Val Val Thr Met Ala
1 5 10 15
Thr Leu Ser Leu Ala Arg Pro Ser Phe Ser Leu Val Glu Asp Thr Thr
20 25 30
Leu Glu Pro Glu Glu Pro Pro Thr Lys Tyr Gln Ile Ser Gln Pro Glu
35 40 45
Val Tyr Val Ala Ala Pro Gly Glu Ser Leu Glu Val Arg Cys Leu Leu
50 55 60
Lys Asp Ala Ala Val Ile Ser Trp Thr Lys Asp Gly Val His Leu Gly
65 70 75 80
Pro Asn Asn Arg Thr Val Leu Ile Gly Glu Tyr Leu Gln Ile Lys Gly
85 90 95
Ala Thr Pro Arg Asp Ser Gly Leu Tyr Ala Cys Thr Ala Ser Arg Thr
100 105 110
Val Asp Ser Glu Thr Trp Tyr Phe Met Val Asn Val Thr Asp Ala Ile
115 120 125
Ser Ser Gly Asp Asp Glu Asp Asp Thr Asp Gly Ala Glu Asp Phe Val
130 135 140
Ser Glu Asn Ser Asn Asn Lys Arg Ala Pro Tyr Trp Thr Asn Thr Glu
145 150 155 160
Lys Met Glu Lys Arg Leu His Ala Val Pro Ala Ala Asn Thr Val Lys
165 170 175
Phe Arg Cys Pro Ala Gly Gly Asn Pro Met Pro Thr Met Arg Trp Leu
180 185 190
Lys Asn Gly Lys Glu Phe Lys Gln Glu His Arg Ile Gly Gly Tyr Lys
195 200 205
Val Arg Asn Gln His Trp Ser Leu Ile Met Glu Ser Val Val Pro Ser
210 215 220
Asp Lys Gly Asn Tyr Thr Cys Val Val Glu Asn Glu Tyr Gly Ser Ile
225 230 235 240
Asn His Thr Tyr His Leu Asp Val Val Ala Pro Gly Arg Glu Lys Glu
245 250 255
Ile Thr Ala Ser Pro Asp Tyr Leu Glu Ile Ala Ile Tyr Cys Ile Gly
260 265 270
Val Phe Leu Ile Ala Cys Met Val Val Thr Val Ile Leu Cys Arg Met
275 280 285
Lys Asn Thr Thr Lys Lys Pro Asp Phe Ser Ser Gln Pro Ala Val His
290 295 300
Lys Leu Thr Lys Arg Ile Pro Leu Arg Arg Gln Val Thr Val Ser Ala
305 310 315 320
Glu Ser Ser Ser Ser Met Asn Ser Asn Thr Pro Leu Val Arg Ile Thr
325 330 335
Thr Arg Leu Ser Ser Thr Ala Asp Thr Pro Met Leu Ala Gly Val Ser
340 345 350
Glu Tyr Glu Leu Pro Glu Asp Pro Lys Trp Glu Phe Pro Arg Asp Lys
355 360 365
Leu Thr Leu Gly Lys Pro Leu Gly Glu Gly Cys Phe Gly Gln Val Val
370 375 380
Met Ala Glu Ala Val Gly Ile Asp Lys Asp Lys Pro Lys Glu Ala Val
385 390 395 400
Thr Val Ala Val Lys Met Leu Lys Asp Asp Ala Thr Glu Lys Asp Leu
405 410 415
Ser Asp Leu Val Ser Glu Met Glu Met Met Lys Met Ile Gly Lys His
420 425 430
Lys Asn Ile Ile Asn Leu Leu Gly Ala Cys Thr Gln Asp Gly Pro Leu
435 440 445
Tyr Val Ile Val Glu Tyr Ala Ser Lys Gly Asn Leu Arg Glu Tyr Leu
450 455 460
Arg Ala Arg Arg Pro Pro Gly Met Glu Tyr Ser Tyr Asp Ile Asn Arg
465 470 475 480
Val Pro Glu Glu Gln Met Thr Phe Lys Asp Leu Val Ser Cys Thr Tyr
485 490 495
Gln Leu Ala Arg Gly Met Glu Tyr Leu Ala Ser Gln Lys Cys Ile His
500 505 510
Arg Asp Leu Ala Ala Arg Asn Val Leu Val Thr Glu Asn Asn Val Met
515 520 525
Lys Ile Ala Asp Phe Gly Leu Ala Arg Asp Ile Asn Asn Ile Asp Tyr
530 535 540
Tyr Lys Lys Thr Thr Asn Gly Arg Leu Pro Val Lys Trp Met Ala Pro
545 550 555 560
Glu Ala Leu Phe Asp Arg Val Tyr Thr His Gln Ser Asp Val Trp Ser
565 570 575
Phe Gly Val Leu Met Trp Glu Ile Phe Thr Leu Gly Gly Ser Pro Tyr
580 585 590
Pro Gly Ile Pro Val Glu Glu Leu Phe Lys Leu Leu Lys Glu Gly His
595 600 605
Arg Met Asp Lys Pro Ala Asn Cys Thr Asn Glu Leu Tyr Met Met Met
610 615 620
Arg Asp Cys Trp His Ala Val Pro Ser Gln Arg Pro Thr Phe Lys Gln
625 630 635 640
Leu Val Glu Asp Leu Asp Arg Ile Leu Thr Leu Thr Thr Asn Glu
645 650 655
<210> 22
<211> 7
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of FGFR2 protein fragment
<400> 22
Leu Thr Leu Thr Thr Asn Glu
1 5
<210> 23
<211> 6084
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of CIT gene (NM_007174)
<400> 23
atgttgaagt tcaaatatgg agcgcggaat cctttggatg ctggtgctgc tgaacccatt 60
gccagccggg cctccaggct gaatctgttc ttccagggga aaccaccctt tatgactcaa 120
cagcagatgt ctcctctttc ccgagaaggg atattagatg ccctctttgt tctctttgaa 180
gaatgcagtc agcctgctct gatgaagatt aagcacgtga gcaactttgt ccggaagtat 240
tccgacacca tagctgagtt acaggagctc cagccttcgg caaaggactt cgaagtcaga 300
agtcttgtag gttgtggtca ctttgctgaa gtgcaggtgg taagagagaa agcaaccggg 360
gacatctatg ctatgaaagt gatgaagaag aaggctttat tggcccagga gcaggtttca 420
ttttttgagg aagagcggaa catattatct cgaagcacaa gcccgtggat cccccaatta 480
cagtatgcct ttcaggacaa aaatcacctt tatctggtca tggaatatca gcctggaggg 540
gacttgctgt cacttttgaa tagatatgag gaccagttag atgaaaacct gatacagttt 600
tacctagctg agctgatttt ggctgttcac agcgttcatc tgatgggata cgtgcatcga 660
gacatcaagc ctgagaacat tctcgttgac cgcacaggac acatcaagct ggtggatttt 720
ggatctgccg cgaaaatgaa ttcaaacaag atggtgaatg ccaaactccc gattgggacc 780
ccagattaca tggctcctga agtgctgact gtgatgaacg gggatggaaa aggcacctac 840
ggcctggact gtgactggtg gtcagtgggc gtgattgcct atgagatgat ttatgggaga 900
tcccccttcg cagagggaac ctctgccaga accttcaata acattatgaa tttccagcgg 960
tttttgaaat ttccagatga ccccaaagtg agcagtgact ttcttgatct gattcaaagc 1020
ttgttgtgcg gccagaaaga gagactgaag tttgaaggtc tttgctgcca tcctttcttc 1080
tctaaaattg actggaacaa cattcgtaac tctcctcccc ccttcgttcc caccctcaag 1140
tctgacgatg acacctccaa ttttgatgaa ccagagaaga attcgtgggt ttcatcctct 1200
ccgtgccagc tgagcccctc aggcttctcg ggtgaagaac tgccgtttgt ggggttttcg 1260
tacagcaagg cactggggat tcttggtaga tctgagtctg ttgtgtcggg tctggactcc 1320
cctgccaaga ctagctccat ggaaaagaaa cttctcatca aaagcaaaga gctacaagac 1380
tctcaggaca agtgtcacaa gatggagcag gaaatgaccc ggttacatcg gagagtgtca 1440
gaggtggagg ctgtgcttag tcagaaggag gtggagctga aggcctctga gactcagaga 1500
tccctcctgg agcaggacct tgctacctac atcacagaat gcagtagctt aaagcgaagt 1560
ttggagcaag cacggatgga ggtgtcccag gaggatgaca aagcactgca gcttctccat 1620
gatatcagag agcagagccg gaagctccaa gaaatcaaag agcaggagta ccaggctcaa 1680
gtggaagaaa tgaggttgat gatgaatcag ttggaagagg atcttgtctc agcaagaaga 1740
cggagtgatc tctacgaatc tgagctgaga gagtctcggc ttgctgctga agaattcaag 1800
cggaaagcga cagaatgtca gcataaactg ttgaaggcta aggatcaagg gaagcctgaa 1860
gtgggagaat atgcgaaact ggagaagatc aatgctgagc agcagctcaa aattcaggag 1920
ctccaagaga aactggagaa ggctgtaaaa gccagcacgg aggccaccga gctgctgcag 1980
aatatccgcc aggcaaagga gcgagccgag agggagctgg agaagctgca gaaccgagag 2040
gattcttctg aaggcatcag aaagaagctg gtggaagctg aggagctcga agagaaacat 2100
cgggaggccc aagtctcagc ccagcaccta gaagtgcacc tgaaacagaa agagcagcac 2160
tatgaggaaa agattaaagt gttggacaat cagataaaga aagacctggc tgacaaggag 2220
acactggaga acatgatgca gagacacgag gaggaggccc atgagaaggg caaaattctc 2280
agcgaacaga aggcgatgat caatgctatg gattccaaga tcagatccct ggaacagagg 2340
attgtggaac tgtctgaagc caataaactt gcagcaaata gcagtctttt tacccaaagg 2400
aacatgaagg cccaagaaga gatgatttct gaactcaggc aacagaaatt ttacctggag 2460
acacaggctg ggaagttgga ggcccagaac cgaaaactgg aggagcagct ggagaagatc 2520
agccaccaag accacagtga caagaatcgg ctgctggaac tggagacaag attgcgggag 2580
gtcagtctag agcacgagga gcagaaactg gagctcaagc gccagctcac agagctacag 2640
ctctccctgc aggagcgcga gtcacagttg acagccctgc aggctgcacg ggcggccctg 2700
gagagccagc ttcgccaggc gaagacagag ctggaagaga ccacagcaga agctgaagag 2760
gagatccagg cactcacggc acatagagat gaaatccagc gcaaatttga tgctcttcgt 2820
aacagctgta ctgtaatcac agacctggag gagcagctaa accagctgac cgaggacaac 2880
gctgaactca acaaccaaaa cttctacttg tccaaacaac tcgatgaggc ttctggcgcc 2940
aacgacgaga ttgtacaact gcgaagtgaa gtggaccatc tccgccggga gatcacggaa 3000
cgagagatgc agcttaccag ccagaagcaa acgatggagg ctctgaagac cacgtgcacc 3060
atgctggagg aacaggtcat ggatttggag gccctaaacg atgagctgct agaaaaagag 3120
cggcagtggg aggcctggag gagcgtcctg ggtgatgaga aatcccagtt tgagtgtcgg 3180
gttcgagagc tgcagagaat gctggacacc gagaaacaga gcagggcgag agccgatcag 3240
cggatcaccg agtctcgcca ggtggtggag ctggcagtga aggagcacaa ggctgagatt 3300
ctcgctctgc agcaggctct caaagagcag aagctgaagg ccgagagcct ctctgacaag 3360
ctcaatgacc tggagaagaa gcatgctatg cttgaaatga atgcccgaag cttacagcag 3420
aagctggaga ctgaacgaga gctcaaacag aggcttctgg aagagcaagc caaattacag 3480
cagcagatgg acctgcagaa aaatcacatt ttccgtctga ctcaaggact gcaagaagct 3540
ctagatcggg ctgatctact gaagacagaa agaagtgact tggagtatca gctggaaaac 3600
attcaggttc tctattctca tgaaaaggtg aaaatggaag gcactatttc tcaacaaacc 3660
aaactcattg attttctgca agccaaaatg gaccaacctg ctaaaaagaa aaagggttta 3720
tttagtcgac ggaaagagga ccctgcttta cccacacagg ttcctctgca gtacaatgag 3780
ctgaagctgg ccctggagaa ggagaaagct cgctgtgcag agctagagga agcccttcag 3840
aagacccgca tcgagctccg gtccgcccgg gaggaagctg cccaccgcaa agcaacggac 3900
cacccacacc catccacgcc agccaccgcg aggcagcaga tcgccatgtc cgccatcgtg 3960
cggtcgccag agcaccagcc cagtgccatg agcctgctgg ccccgccatc cagccgcaga 4020
aaggagtctt caactccaga ggaatttagt cggcgtctta aggaacgcat gcaccacaat 4080
attcctcacc gattcaacgt aggactgaac atgcgagcca caaagtgtgc tgtgtgtctg 4140
gataccgtgc actttggacg ccaggcatcc aaatgtctcg aatgtcaggt gatgtgtcac 4200
cccaagtgct ccacgtgctt gccagccacc tgcggcttgc ctgctgaata tgccacacac 4260
ttcaccgagg ccttctgccg tgacaaaatg aactccccag gtctccagac caaggagccc 4320
agcagcagct tgcacctgga agggtggatg aaggtgccca ggaataacaa acgaggacag 4380
caaggctggg acaggaagta cattgtcctg gagggatcaa aagtcctcat ttatgacaat 4440
gaagccagag aagctggaca gaggccggtg gaagaatttg agctgtgcct tcccgacggg 4500
gatgtatcta ttcatggtgc cgttggtgct tccgaactcg caaatacagc caaagcagat 4560
gtcccataca tactgaagat ggaatctcac ccgcacacca cctgctggcc cgggagaacc 4620
ctctacttgc tagctcccag cttccctgac aaacagcgct gggtcaccgc cttagaatca 4680
gttgtcgcag gtgggagagt ttctagggaa aaagcagaag ctgatgctaa actgcttgga 4740
aactccctgc tgaaactgga aggtgatgac cgtctagaca tgaactgcac gctgcccttc 4800
agtgaccagg tggtgttggt gggcaccgag gaagggctct acgccctgaa tgtcttgaaa 4860
aactccctaa cccatgtccc aggaattgga gcagtcttcc aaatttatat tatcaaggac 4920
ctggagaagc tactcatgat agcaggagaa gagcgggcac tgtgtcttgt ggacgtgaag 4980
aaagtgaaac agtccctggc ccagtcccac ctgcctgccc agcccgacat ctcacccaac 5040
atttttgaag ctgtcaaggg ctgccacttg tttggggcag gcaagattga gaacgggctc 5100
tgcatctgtg cagccatgcc cagcaaagtc gtcattctcc gctacaacga aaacctcagc 5160
aaatactgca tccggaaaga gatagagacc tcagagccct gcagctgtat ccacttcacc 5220
aattacagta tcctcattgg aaccaataaa ttctacgaaa tcgacatgaa gcagtacacg 5280
ctcgaggaat tcctggataa gaatgaccat tccttggcac ctgctgtgtt tgccgcctct 5340
tccaacagct tccctgtctc aatcgtgcag gtgaacagcg cagggcagcg agaggagtac 5400
ttgctgtgtt tccacgaatt tggagtgttc gtggattctt acggaagacg tagccgcaca 5460
gacgatctca agtggagtcg cttacctttg gcctttgcct acagagaacc ctatctgttt 5520
gtgacccact tcaactcact cgaagtaatt gagatccagg cacgctcctc agcagggacc 5580
cctgcccgag cgtacctgga catcccgaac ccgcgctacc tgggccctgc catttcctca 5640
ggagcgattt acttggcgtc ctcataccag gataaattaa gggtcatttg ctgcaaggga 5700
aacctcgtga aggagtccgg cactgaacac caccggggcc cgtccacctc ccgcagcagc 5760
cccaacaagc gaggcccacc cacgtacaac gagcacatca ccaagcgcgt ggcctccagc 5820
ccagcgccgc ccgaaggccc cagccacccg cgagagccaa gcacacccca ccgctaccgc 5880
gaggggcgga ccgagctgcg cagggacaag tctcctggcc gccccctgga gcgagagaag 5940
tcccccggcc ggatgctcag cacgcggaga gagcggtccc ccgggaggct gtttgaagac 6000
agcagcaggg gccggctgcc tgcgggagcc gtgaggaccc cgctgtccca ggtgaacaag 6060
gtctgggacc agtcttcagt ataa 6084
<210> 24
<211> 3306
<212> DNA
<213> Artificial Sequence
<220>
<223> CIT gene fragment
<400> 24
gcacatagag atgaaatcca gcgcaaattt gatgctcttc gtaacagctg tactgtaatc 60
acagacctgg aggagcagct aaaccagctg accgaggaca acgctgaact caacaaccaa 120
aacttctact tgtccaaaca actcgatgag gcttctggcg ccaacgacga gattgtacaa 180
ctgcgaagtg aagtggacca tctccgccgg gagatcacgg aacgagagat gcagcttacc 240
agccagaagc aaacgatgga ggctctgaag accacgtgca ccatgctgga ggaacaggtc 300
atggatttgg aggccctaaa cgatgagctg ctagaaaaag agcggcagtg ggaggcctgg 360
aggagcgtcc tgggtgatga gaaatcccag tttgagtgtc gggttcgaga gctgcagaga 420
atgctggaca ccgagaaaca gagcagggcg agagccgatc agcggatcac cgagtctcgc 480
caggtggtgg agctggcagt gaaggagcac aaggctgaga ttctcgctct gcagcaggct 540
ctcaaagagc agaagctgaa ggccgagagc ctctctgaca agctcaatga cctggagaag 600
aagcatgcta tgcttgaaat gaatgcccga agcttacagc agaagctgga gactgaacga 660
gagctcaaac agaggcttct ggaagagcaa gccaaattac agcagcagat ggacctgcag 720
aaaaatcaca ttttccgtct gactcaagga ctgcaagaag ctctagatcg ggctgatcta 780
ctgaagacag aaagaagtga cttggagtat cagctggaaa acattcaggt tctctattct 840
catgaaaagg tgaaaatgga aggcactatt tctcaacaaa ccaaactcat tgattttctg 900
caagccaaaa tggaccaacc tgctaaaaag aaaaagggtt tatttagtcg acggaaagag 960
gaccctgctt tacccacaca ggttcctctg cagtacaatg agctgaagct ggccctggag 1020
aaggagaaag ctcgctgtgc agagctagag gaagcccttc agaagacccg catcgagctc 1080
cggtccgccc gggaggaagc tgcccaccgc aaagcaacgg accacccaca cccatccacg 1140
ccagccaccg cgaggcagca gatcgccatg tccgccatcg tgcggtcgcc agagcaccag 1200
cccagtgcca tgagcctgct ggccccgcca tccagccgca gaaaggagtc ttcaactcca 1260
gaggaattta gtcggcgtct taaggaacgc atgcaccaca atattcctca ccgattcaac 1320
gtaggactga acatgcgagc cacaaagtgt gctgtgtgtc tggataccgt gcactttgga 1380
cgccaggcat ccaaatgtct cgaatgtcag gtgatgtgtc accccaagtg ctccacgtgc 1440
ttgccagcca cctgcggctt gcctgctgaa tatgccacac acttcaccga ggccttctgc 1500
cgtgacaaaa tgaactcccc aggtctccag accaaggagc ccagcagcag cttgcacctg 1560
gaagggtgga tgaaggtgcc caggaataac aaacgaggac agcaaggctg ggacaggaag 1620
tacattgtcc tggagggatc aaaagtcctc atttatgaca atgaagccag agaagctgga 1680
cagaggccgg tggaagaatt tgagctgtgc cttcccgacg gggatgtatc tattcatggt 1740
gccgttggtg cttccgaact cgcaaataca gccaaagcag atgtcccata catactgaag 1800
atggaatctc acccgcacac cacctgctgg cccgggagaa ccctctactt gctagctccc 1860
agcttccctg acaaacagcg ctgggtcacc gccttagaat cagttgtcgc aggtgggaga 1920
gtttctaggg aaaaagcaga agctgatgct aaactgcttg gaaactccct gctgaaactg 1980
gaaggtgatg accgtctaga catgaactgc acgctgccct tcagtgacca ggtggtgttg 2040
gtgggcaccg aggaagggct ctacgccctg aatgtcttga aaaactccct aacccatgtc 2100
ccaggaattg gagcagtctt ccaaatttat attatcaagg acctggagaa gctactcatg 2160
atagcaggag aagagcgggc actgtgtctt gtggacgtga agaaagtgaa acagtccctg 2220
gcccagtccc acctgcctgc ccagcccgac atctcaccca acatttttga agctgtcaag 2280
ggctgccact tgtttggggc aggcaagatt gagaacgggc tctgcatctg tgcagccatg 2340
cccagcaaag tcgtcattct ccgctacaac gaaaacctca gcaaatactg catccggaaa 2400
gagatagaga cctcagagcc ctgcagctgt atccacttca ccaattacag tatcctcatt 2460
ggaaccaata aattctacga aatcgacatg aagcagtaca cgctcgagga attcctggat 2520
aagaatgacc attccttggc acctgctgtg tttgccgcct cttccaacag cttccctgtc 2580
tcaatcgtgc aggtgaacag cgcagggcag cgagaggagt acttgctgtg tttccacgaa 2640
tttggagtgt tcgtggattc ttacggaaga cgtagccgca cagacgatct caagtggagt 2700
cgcttacctt tggcctttgc ctacagagaa ccctatctgt ttgtgaccca cttcaactca 2760
ctcgaagtaa ttgagatcca ggcacgctcc tcagcaggga cccctgcccg agcgtacctg 2820
gacatcccga acccgcgcta cctgggccct gccatttcct caggagcgat ttacttggcg 2880
tcctcatacc aggataaatt aagggtcatt tgctgcaagg gaaacctcgt gaaggagtcc 2940
ggcactgaac accaccgggg cccgtccacc tcccgcagca gccccaacaa gcgaggccca 3000
cccacgtaca acgagcacat caccaagcgc gtggcctcca gcccagcgcc gcccgaaggc 3060
cccagccacc cgcgagagcc aagcacaccc caccgctacc gcgaggggcg gaccgagctg 3120
cgcagggaca agtctcctgg ccgccccctg gagcgagaga agtcccccgg ccggatgctc 3180
agcacgcgga gagagcggtc ccccgggagg ctgtttgaag acagcagcag gggccggctg 3240
cctgcgggag ccgtgaggac cccgctgtcc caggtgaaca aggtctggga ccagtcttca 3300
gtataa 3306
<210> 25
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of CIT gene fragment
<400> 25
gcacatagag atgaaatcca g 21
<210> 26
<211> 2027
<212> PRT
<213> Artificial Sequence
<220>
<223> CIT protein
<400> 26
Met Leu Lys Phe Lys Tyr Gly Ala Arg Asn Pro Leu Asp Ala Gly Ala
1 5 10 15
Ala Glu Pro Ile Ala Ser Arg Ala Ser Arg Leu Asn Leu Phe Phe Gln
20 25 30
Gly Lys Pro Pro Phe Met Thr Gln Gln Gln Met Ser Pro Leu Ser Arg
35 40 45
Glu Gly Ile Leu Asp Ala Leu Phe Val Leu Phe Glu Glu Cys Ser Gln
50 55 60
Pro Ala Leu Met Lys Ile Lys His Val Ser Asn Phe Val Arg Lys Tyr
65 70 75 80
Ser Asp Thr Ile Ala Glu Leu Gln Glu Leu Gln Pro Ser Ala Lys Asp
85 90 95
Phe Glu Val Arg Ser Leu Val Gly Cys Gly His Phe Ala Glu Val Gln
100 105 110
Val Val Arg Glu Lys Ala Thr Gly Asp Ile Tyr Ala Met Lys Val Met
115 120 125
Lys Lys Lys Ala Leu Leu Ala Gln Glu Gln Val Ser Phe Phe Glu Glu
130 135 140
Glu Arg Asn Ile Leu Ser Arg Ser Thr Ser Pro Trp Ile Pro Gln Leu
145 150 155 160
Gln Tyr Ala Phe Gln Asp Lys Asn His Leu Tyr Leu Val Met Glu Tyr
165 170 175
Gln Pro Gly Gly Asp Leu Leu Ser Leu Leu Asn Arg Tyr Glu Asp Gln
180 185 190
Leu Asp Glu Asn Leu Ile Gln Phe Tyr Leu Ala Glu Leu Ile Leu Ala
195 200 205
Val His Ser Val His Leu Met Gly Tyr Val His Arg Asp Ile Lys Pro
210 215 220
Glu Asn Ile Leu Val Asp Arg Thr Gly His Ile Lys Leu Val Asp Phe
225 230 235 240
Gly Ser Ala Ala Lys Met Asn Ser Asn Lys Met Val Asn Ala Lys Leu
245 250 255
Pro Ile Gly Thr Pro Asp Tyr Met Ala Pro Glu Val Leu Thr Val Met
260 265 270
Asn Gly Asp Gly Lys Gly Thr Tyr Gly Leu Asp Cys Asp Trp Trp Ser
275 280 285
Val Gly Val Ile Ala Tyr Glu Met Ile Tyr Gly Arg Ser Pro Phe Ala
290 295 300
Glu Gly Thr Ser Ala Arg Thr Phe Asn Asn Ile Met Asn Phe Gln Arg
305 310 315 320
Phe Leu Lys Phe Pro Asp Asp Pro Lys Val Ser Ser Asp Phe Leu Asp
325 330 335
Leu Ile Gln Ser Leu Leu Cys Gly Gln Lys Glu Arg Leu Lys Phe Glu
340 345 350
Gly Leu Cys Cys His Pro Phe Phe Ser Lys Ile Asp Trp Asn Asn Ile
355 360 365
Arg Asn Ser Pro Pro Pro Phe Val Pro Thr Leu Lys Ser Asp Asp Asp
370 375 380
Thr Ser Asn Phe Asp Glu Pro Glu Lys Asn Ser Trp Val Ser Ser Ser
385 390 395 400
Pro Cys Gln Leu Ser Pro Ser Gly Phe Ser Gly Glu Glu Leu Pro Phe
405 410 415
Val Gly Phe Ser Tyr Ser Lys Ala Leu Gly Ile Leu Gly Arg Ser Glu
420 425 430
Ser Val Val Ser Gly Leu Asp Ser Pro Ala Lys Thr Ser Ser Met Glu
435 440 445
Lys Lys Leu Leu Ile Lys Ser Lys Glu Leu Gln Asp Ser Gln Asp Lys
450 455 460
Cys His Lys Met Glu Gln Glu Met Thr Arg Leu His Arg Arg Val Ser
465 470 475 480
Glu Val Glu Ala Val Leu Ser Gln Lys Glu Val Glu Leu Lys Ala Ser
485 490 495
Glu Thr Gln Arg Ser Leu Leu Glu Gln Asp Leu Ala Thr Tyr Ile Thr
500 505 510
Glu Cys Ser Ser Leu Lys Arg Ser Leu Glu Gln Ala Arg Met Glu Val
515 520 525
Ser Gln Glu Asp Asp Lys Ala Leu Gln Leu Leu His Asp Ile Arg Glu
530 535 540
Gln Ser Arg Lys Leu Gln Glu Ile Lys Glu Gln Glu Tyr Gln Ala Gln
545 550 555 560
Val Glu Glu Met Arg Leu Met Met Asn Gln Leu Glu Glu Asp Leu Val
565 570 575
Ser Ala Arg Arg Arg Ser Asp Leu Tyr Glu Ser Glu Leu Arg Glu Ser
580 585 590
Arg Leu Ala Ala Glu Glu Phe Lys Arg Lys Ala Thr Glu Cys Gln His
595 600 605
Lys Leu Leu Lys Ala Lys Asp Gln Gly Lys Pro Glu Val Gly Glu Tyr
610 615 620
Ala Lys Leu Glu Lys Ile Asn Ala Glu Gln Gln Leu Lys Ile Gln Glu
625 630 635 640
Leu Gln Glu Lys Leu Glu Lys Ala Val Lys Ala Ser Thr Glu Ala Thr
645 650 655
Glu Leu Leu Gln Asn Ile Arg Gln Ala Lys Glu Arg Ala Glu Arg Glu
660 665 670
Leu Glu Lys Leu Gln Asn Arg Glu Asp Ser Ser Glu Gly Ile Arg Lys
675 680 685
Lys Leu Val Glu Ala Glu Glu Leu Glu Glu Lys His Arg Glu Ala Gln
690 695 700
Val Ser Ala Gln His Leu Glu Val His Leu Lys Gln Lys Glu Gln His
705 710 715 720
Tyr Glu Glu Lys Ile Lys Val Leu Asp Asn Gln Ile Lys Lys Asp Leu
725 730 735
Ala Asp Lys Glu Thr Leu Glu Asn Met Met Gln Arg His Glu Glu Glu
740 745 750
Ala His Glu Lys Gly Lys Ile Leu Ser Glu Gln Lys Ala Met Ile Asn
755 760 765
Ala Met Asp Ser Lys Ile Arg Ser Leu Glu Gln Arg Ile Val Glu Leu
770 775 780
Ser Glu Ala Asn Lys Leu Ala Ala Asn Ser Ser Leu Phe Thr Gln Arg
785 790 795 800
Asn Met Lys Ala Gln Glu Glu Met Ile Ser Glu Leu Arg Gln Gln Lys
805 810 815
Phe Tyr Leu Glu Thr Gln Ala Gly Lys Leu Glu Ala Gln Asn Arg Lys
820 825 830
Leu Glu Glu Gln Leu Glu Lys Ile Ser His Gln Asp His Ser Asp Lys
835 840 845
Asn Arg Leu Leu Glu Leu Glu Thr Arg Leu Arg Glu Val Ser Leu Glu
850 855 860
His Glu Glu Gln Lys Leu Glu Leu Lys Arg Gln Leu Thr Glu Leu Gln
865 870 875 880
Leu Ser Leu Gln Glu Arg Glu Ser Gln Leu Thr Ala Leu Gln Ala Ala
885 890 895
Arg Ala Ala Leu Glu Ser Gln Leu Arg Gln Ala Lys Thr Glu Leu Glu
900 905 910
Glu Thr Thr Ala Glu Ala Glu Glu Glu Ile Gln Ala Leu Thr Ala His
915 920 925
Arg Asp Glu Ile Gln Arg Lys Phe Asp Ala Leu Arg Asn Ser Cys Thr
930 935 940
Val Ile Thr Asp Leu Glu Glu Gln Leu Asn Gln Leu Thr Glu Asp Asn
945 950 955 960
Ala Glu Leu Asn Asn Gln Asn Phe Tyr Leu Ser Lys Gln Leu Asp Glu
965 970 975
Ala Ser Gly Ala Asn Asp Glu Ile Val Gln Leu Arg Ser Glu Val Asp
980 985 990
His Leu Arg Arg Glu Ile Thr Glu Arg Glu Met Gln Leu Thr Ser Gln
995 1000 1005
Lys Gln Thr Met Glu Ala Leu Lys Thr Thr Cys Thr Met Leu Glu Glu
1010 1015 1020
Gln Val Met Asp Leu Glu Ala Leu Asn Asp Glu Leu Leu Glu Lys Glu
1025 1030 1035 1040
Arg Gln Trp Glu Ala Trp Arg Ser Val Leu Gly Asp Glu Lys Ser Gln
1045 1050 1055
Phe Glu Cys Arg Val Arg Glu Leu Gln Arg Met Leu Asp Thr Glu Lys
1060 1065 1070
Gln Ser Arg Ala Arg Ala Asp Gln Arg Ile Thr Glu Ser Arg Gln Val
1075 1080 1085
Val Glu Leu Ala Val Lys Glu His Lys Ala Glu Ile Leu Ala Leu Gln
1090 1095 1100
Gln Ala Leu Lys Glu Gln Lys Leu Lys Ala Glu Ser Leu Ser Asp Lys
1105 1110 1115 1120
Leu Asn Asp Leu Glu Lys Lys His Ala Met Leu Glu Met Asn Ala Arg
1125 1130 1135
Ser Leu Gln Gln Lys Leu Glu Thr Glu Arg Glu Leu Lys Gln Arg Leu
1140 1145 1150
Leu Glu Glu Gln Ala Lys Leu Gln Gln Gln Met Asp Leu Gln Lys Asn
1155 1160 1165
His Ile Phe Arg Leu Thr Gln Gly Leu Gln Glu Ala Leu Asp Arg Ala
1170 1175 1180
Asp Leu Leu Lys Thr Glu Arg Ser Asp Leu Glu Tyr Gln Leu Glu Asn
1185 1190 1195 1200
Ile Gln Val Leu Tyr Ser His Glu Lys Val Lys Met Glu Gly Thr Ile
1205 1210 1215
Ser Gln Gln Thr Lys Leu Ile Asp Phe Leu Gln Ala Lys Met Asp Gln
1220 1225 1230
Pro Ala Lys Lys Lys Lys Gly Leu Phe Ser Arg Arg Lys Glu Asp Pro
1235 1240 1245
Ala Leu Pro Thr Gln Val Pro Leu Gln Tyr Asn Glu Leu Lys Leu Ala
1250 1255 1260
Leu Glu Lys Glu Lys Ala Arg Cys Ala Glu Leu Glu Glu Ala Leu Gln
1265 1270 1275 1280
Lys Thr Arg Ile Glu Leu Arg Ser Ala Arg Glu Glu Ala Ala His Arg
1285 1290 1295
Lys Ala Thr Asp His Pro His Pro Ser Thr Pro Ala Thr Ala Arg Gln
1300 1305 1310
Gln Ile Ala Met Ser Ala Ile Val Arg Ser Pro Glu His Gln Pro Ser
1315 1320 1325
Ala Met Ser Leu Leu Ala Pro Pro Ser Ser Arg Arg Lys Glu Ser Ser
1330 1335 1340
Thr Pro Glu Glu Phe Ser Arg Arg Leu Lys Glu Arg Met His His Asn
1345 1350 1355 1360
Ile Pro His Arg Phe Asn Val Gly Leu Asn Met Arg Ala Thr Lys Cys
1365 1370 1375
Ala Val Cys Leu Asp Thr Val His Phe Gly Arg Gln Ala Ser Lys Cys
1380 1385 1390
Leu Glu Cys Gln Val Met Cys His Pro Lys Cys Ser Thr Cys Leu Pro
1395 1400 1405
Ala Thr Cys Gly Leu Pro Ala Glu Tyr Ala Thr His Phe Thr Glu Ala
1410 1415 1420
Phe Cys Arg Asp Lys Met Asn Ser Pro Gly Leu Gln Thr Lys Glu Pro
1425 1430 1435 1440
Ser Ser Ser Leu His Leu Glu Gly Trp Met Lys Val Pro Arg Asn Asn
1445 1450 1455
Lys Arg Gly Gln Gln Gly Trp Asp Arg Lys Tyr Ile Val Leu Glu Gly
1460 1465 1470
Ser Lys Val Leu Ile Tyr Asp Asn Glu Ala Arg Glu Ala Gly Gln Arg
1475 1480 1485
Pro Val Glu Glu Phe Glu Leu Cys Leu Pro Asp Gly Asp Val Ser Ile
1490 1495 1500
His Gly Ala Val Gly Ala Ser Glu Leu Ala Asn Thr Ala Lys Ala Asp
1505 1510 1515 1520
Val Pro Tyr Ile Leu Lys Met Glu Ser His Pro His Thr Thr Cys Trp
1525 1530 1535
Pro Gly Arg Thr Leu Tyr Leu Leu Ala Pro Ser Phe Pro Asp Lys Gln
1540 1545 1550
Arg Trp Val Thr Ala Leu Glu Ser Val Val Ala Gly Gly Arg Val Ser
1555 1560 1565
Arg Glu Lys Ala Glu Ala Asp Ala Lys Leu Leu Gly Asn Ser Leu Leu
1570 1575 1580
Lys Leu Glu Gly Asp Asp Arg Leu Asp Met Asn Cys Thr Leu Pro Phe
1585 1590 1595 1600
Ser Asp Gln Val Val Leu Val Gly Thr Glu Glu Gly Leu Tyr Ala Leu
1605 1610 1615
Asn Val Leu Lys Asn Ser Leu Thr His Val Pro Gly Ile Gly Ala Val
1620 1625 1630
Phe Gln Ile Tyr Ile Ile Lys Asp Leu Glu Lys Leu Leu Met Ile Ala
1635 1640 1645
Gly Glu Glu Arg Ala Leu Cys Leu Val Asp Val Lys Lys Val Lys Gln
1650 1655 1660
Ser Leu Ala Gln Ser His Leu Pro Ala Gln Pro Asp Ile Ser Pro Asn
1665 1670 1675 1680
Ile Phe Glu Ala Val Lys Gly Cys His Leu Phe Gly Ala Gly Lys Ile
1685 1690 1695
Glu Asn Gly Leu Cys Ile Cys Ala Ala Met Pro Ser Lys Val Val Ile
1700 1705 1710
Leu Arg Tyr Asn Glu Asn Leu Ser Lys Tyr Cys Ile Arg Lys Glu Ile
1715 1720 1725
Glu Thr Ser Glu Pro Cys Ser Cys Ile His Phe Thr Asn Tyr Ser Ile
1730 1735 1740
Leu Ile Gly Thr Asn Lys Phe Tyr Glu Ile Asp Met Lys Gln Tyr Thr
1745 1750 1755 1760
Leu Glu Glu Phe Leu Asp Lys Asn Asp His Ser Leu Ala Pro Ala Val
1765 1770 1775
Phe Ala Ala Ser Ser Asn Ser Phe Pro Val Ser Ile Val Gln Val Asn
1780 1785 1790
Ser Ala Gly Gln Arg Glu Glu Tyr Leu Leu Cys Phe His Glu Phe Gly
1795 1800 1805
Val Phe Val Asp Ser Tyr Gly Arg Arg Ser Arg Thr Asp Asp Leu Lys
1810 1815 1820
Trp Ser Arg Leu Pro Leu Ala Phe Ala Tyr Arg Glu Pro Tyr Leu Phe
1825 1830 1835 1840
Val Thr His Phe Asn Ser Leu Glu Val Ile Glu Ile Gln Ala Arg Ser
1845 1850 1855
Ser Ala Gly Thr Pro Ala Arg Ala Tyr Leu Asp Ile Pro Asn Pro Arg
1860 1865 1870
Tyr Leu Gly Pro Ala Ile Ser Ser Gly Ala Ile Tyr Leu Ala Ser Ser
1875 1880 1885
Tyr Gln Asp Lys Leu Arg Val Ile Cys Cys Lys Gly Asn Leu Val Lys
1890 1895 1900
Glu Ser Gly Thr Glu His His Arg Gly Pro Ser Thr Ser Arg Ser Ser
1905 1910 1915 1920
Pro Asn Lys Arg Gly Pro Pro Thr Tyr Asn Glu His Ile Thr Lys Arg
1925 1930 1935
Val Ala Ser Ser Pro Ala Pro Pro Glu Gly Pro Ser His Pro Arg Glu
1940 1945 1950
Pro Ser Thr Pro His Arg Tyr Arg Glu Gly Arg Thr Glu Leu Arg Arg
1955 1960 1965
Asp Lys Ser Pro Gly Arg Pro Leu Glu Arg Glu Lys Ser Pro Gly Arg
1970 1975 1980
Met Leu Ser Thr Arg Arg Glu Arg Ser Pro Gly Arg Leu Phe Glu Asp
1985 1990 1995 2000
Ser Ser Arg Gly Arg Leu Pro Ala Gly Ala Val Arg Thr Pro Leu Ser
2005 2010 2015
Gln Val Asn Lys Val Trp Asp Gln Ser Ser Val
2020 2025
<210> 27
<211> 1101
<212> PRT
<213> Artificial Sequence
<220>
<223> CIT protein fragment
<400> 27
Ala His Arg Asp Glu Ile Gln Arg Lys Phe Asp Ala Leu Arg Asn Ser
1 5 10 15
Cys Thr Val Ile Thr Asp Leu Glu Glu Gln Leu Asn Gln Leu Thr Glu
20 25 30
Asp Asn Ala Glu Leu Asn Asn Gln Asn Phe Tyr Leu Ser Lys Gln Leu
35 40 45
Asp Glu Ala Ser Gly Ala Asn Asp Glu Ile Val Gln Leu Arg Ser Glu
50 55 60
Val Asp His Leu Arg Arg Glu Ile Thr Glu Arg Glu Met Gln Leu Thr
65 70 75 80
Ser Gln Lys Gln Thr Met Glu Ala Leu Lys Thr Thr Cys Thr Met Leu
85 90 95
Glu Glu Gln Val Met Asp Leu Glu Ala Leu Asn Asp Glu Leu Leu Glu
100 105 110
Lys Glu Arg Gln Trp Glu Ala Trp Arg Ser Val Leu Gly Asp Glu Lys
115 120 125
Ser Gln Phe Glu Cys Arg Val Arg Glu Leu Gln Arg Met Leu Asp Thr
130 135 140
Glu Lys Gln Ser Arg Ala Arg Ala Asp Gln Arg Ile Thr Glu Ser Arg
145 150 155 160
Gln Val Val Glu Leu Ala Val Lys Glu His Lys Ala Glu Ile Leu Ala
165 170 175
Leu Gln Gln Ala Leu Lys Glu Gln Lys Leu Lys Ala Glu Ser Leu Ser
180 185 190
Asp Lys Leu Asn Asp Leu Glu Lys Lys His Ala Met Leu Glu Met Asn
195 200 205
Ala Arg Ser Leu Gln Gln Lys Leu Glu Thr Glu Arg Glu Leu Lys Gln
210 215 220
Arg Leu Leu Glu Glu Gln Ala Lys Leu Gln Gln Gln Met Asp Leu Gln
225 230 235 240
Lys Asn His Ile Phe Arg Leu Thr Gln Gly Leu Gln Glu Ala Leu Asp
245 250 255
Arg Ala Asp Leu Leu Lys Thr Glu Arg Ser Asp Leu Glu Tyr Gln Leu
260 265 270
Glu Asn Ile Gln Val Leu Tyr Ser His Glu Lys Val Lys Met Glu Gly
275 280 285
Thr Ile Ser Gln Gln Thr Lys Leu Ile Asp Phe Leu Gln Ala Lys Met
290 295 300
Asp Gln Pro Ala Lys Lys Lys Lys Gly Leu Phe Ser Arg Arg Lys Glu
305 310 315 320
Asp Pro Ala Leu Pro Thr Gln Val Pro Leu Gln Tyr Asn Glu Leu Lys
325 330 335
Leu Ala Leu Glu Lys Glu Lys Ala Arg Cys Ala Glu Leu Glu Glu Ala
340 345 350
Leu Gln Lys Thr Arg Ile Glu Leu Arg Ser Ala Arg Glu Glu Ala Ala
355 360 365
His Arg Lys Ala Thr Asp His Pro His Pro Ser Thr Pro Ala Thr Ala
370 375 380
Arg Gln Gln Ile Ala Met Ser Ala Ile Val Arg Ser Pro Glu His Gln
385 390 395 400
Pro Ser Ala Met Ser Leu Leu Ala Pro Pro Ser Ser Arg Arg Lys Glu
405 410 415
Ser Ser Thr Pro Glu Glu Phe Ser Arg Arg Leu Lys Glu Arg Met His
420 425 430
His Asn Ile Pro His Arg Phe Asn Val Gly Leu Asn Met Arg Ala Thr
435 440 445
Lys Cys Ala Val Cys Leu Asp Thr Val His Phe Gly Arg Gln Ala Ser
450 455 460
Lys Cys Leu Glu Cys Gln Val Met Cys His Pro Lys Cys Ser Thr Cys
465 470 475 480
Leu Pro Ala Thr Cys Gly Leu Pro Ala Glu Tyr Ala Thr His Phe Thr
485 490 495
Glu Ala Phe Cys Arg Asp Lys Met Asn Ser Pro Gly Leu Gln Thr Lys
500 505 510
Glu Pro Ser Ser Ser Leu His Leu Glu Gly Trp Met Lys Val Pro Arg
515 520 525
Asn Asn Lys Arg Gly Gln Gln Gly Trp Asp Arg Lys Tyr Ile Val Leu
530 535 540
Glu Gly Ser Lys Val Leu Ile Tyr Asp Asn Glu Ala Arg Glu Ala Gly
545 550 555 560
Gln Arg Pro Val Glu Glu Phe Glu Leu Cys Leu Pro Asp Gly Asp Val
565 570 575
Ser Ile His Gly Ala Val Gly Ala Ser Glu Leu Ala Asn Thr Ala Lys
580 585 590
Ala Asp Val Pro Tyr Ile Leu Lys Met Glu Ser His Pro His Thr Thr
595 600 605
Cys Trp Pro Gly Arg Thr Leu Tyr Leu Leu Ala Pro Ser Phe Pro Asp
610 615 620
Lys Gln Arg Trp Val Thr Ala Leu Glu Ser Val Val Ala Gly Gly Arg
625 630 635 640
Val Ser Arg Glu Lys Ala Glu Ala Asp Ala Lys Leu Leu Gly Asn Ser
645 650 655
Leu Leu Lys Leu Glu Gly Asp Asp Arg Leu Asp Met Asn Cys Thr Leu
660 665 670
Pro Phe Ser Asp Gln Val Val Leu Val Gly Thr Glu Glu Gly Leu Tyr
675 680 685
Ala Leu Asn Val Leu Lys Asn Ser Leu Thr His Val Pro Gly Ile Gly
690 695 700
Ala Val Phe Gln Ile Tyr Ile Ile Lys Asp Leu Glu Lys Leu Leu Met
705 710 715 720
Ile Ala Gly Glu Glu Arg Ala Leu Cys Leu Val Asp Val Lys Lys Val
725 730 735
Lys Gln Ser Leu Ala Gln Ser His Leu Pro Ala Gln Pro Asp Ile Ser
740 745 750
Pro Asn Ile Phe Glu Ala Val Lys Gly Cys His Leu Phe Gly Ala Gly
755 760 765
Lys Ile Glu Asn Gly Leu Cys Ile Cys Ala Ala Met Pro Ser Lys Val
770 775 780
Val Ile Leu Arg Tyr Asn Glu Asn Leu Ser Lys Tyr Cys Ile Arg Lys
785 790 795 800
Glu Ile Glu Thr Ser Glu Pro Cys Ser Cys Ile His Phe Thr Asn Tyr
805 810 815
Ser Ile Leu Ile Gly Thr Asn Lys Phe Tyr Glu Ile Asp Met Lys Gln
820 825 830
Tyr Thr Leu Glu Glu Phe Leu Asp Lys Asn Asp His Ser Leu Ala Pro
835 840 845
Ala Val Phe Ala Ala Ser Ser Asn Ser Phe Pro Val Ser Ile Val Gln
850 855 860
Val Asn Ser Ala Gly Gln Arg Glu Glu Tyr Leu Leu Cys Phe His Glu
865 870 875 880
Phe Gly Val Phe Val Asp Ser Tyr Gly Arg Arg Ser Arg Thr Asp Asp
885 890 895
Leu Lys Trp Ser Arg Leu Pro Leu Ala Phe Ala Tyr Arg Glu Pro Tyr
900 905 910
Leu Phe Val Thr His Phe Asn Ser Leu Glu Val Ile Glu Ile Gln Ala
915 920 925
Arg Ser Ser Ala Gly Thr Pro Ala Arg Ala Tyr Leu Asp Ile Pro Asn
930 935 940
Pro Arg Tyr Leu Gly Pro Ala Ile Ser Ser Gly Ala Ile Tyr Leu Ala
945 950 955 960
Ser Ser Tyr Gln Asp Lys Leu Arg Val Ile Cys Cys Lys Gly Asn Leu
965 970 975
Val Lys Glu Ser Gly Thr Glu His His Arg Gly Pro Ser Thr Ser Arg
980 985 990
Ser Ser Pro Asn Lys Arg Gly Pro Pro Thr Tyr Asn Glu His Ile Thr
995 1000 1005
Lys Arg Val Ala Ser Ser Pro Ala Pro Pro Glu Gly Pro Ser His Pro
1010 1015 1020
Arg Glu Pro Ser Thr Pro His Arg Tyr Arg Glu Gly Arg Thr Glu Leu
1025 1030 1035 1040
Arg Arg Asp Lys Ser Pro Gly Arg Pro Leu Glu Arg Glu Lys Ser Pro
1045 1050 1055
Gly Arg Met Leu Ser Thr Arg Arg Glu Arg Ser Pro Gly Arg Leu Phe
1060 1065 1070
Glu Asp Ser Ser Arg Gly Arg Leu Pro Ala Gly Ala Val Arg Thr Pro
1075 1080 1085
Leu Ser Gln Val Asn Lys Val Trp Asp Gln Ser Ser Val
1090 1095 1100
<210> 28
<211> 7
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of CIT protein fragment
<400> 28
Ala His Arg Asp Glu Ile Gln
1 5
<210> 29
<211> 5271
<212> DNA
<213> Artificial Sequence
<220>
<223> FGFR2-CIT fusion gene
<400> 29
atggtcagct ggggtcgttt catctgcctg gtcgtggtca ccatggcaac cttgtccctg 60
gcccggccct ccttcagttt agttgaggat accacattag agccagaaga gccaccaacc 120
aaataccaaa tctctcaacc agaagtgtac gtggctgcgc caggggagtc gctagaggtg 180
cgctgcctgt tgaaagatgc cgccgtgatc agttggacta aggatggggt gcacttgggg 240
cccaacaata ggacagtgct tattggggag tacttgcaga taaagggcgc cacgcctaga 300
gactccggcc tctatgcttg tactgccagt aggactgtag acagtgaaac ttggtacttc 360
atggtgaatg tcacagatgc catctcatcc ggagatgatg aggatgacac cgatggtgcg 420
gaagattttg tcagtgagaa cagtaacaac aagagagcac catactggac caacacagaa 480
aagatggaaa agcggctcca tgctgtgcct gcggccaaca ctgtcaagtt tcgctgccca 540
gccgggggga acccaatgcc aaccatgcgg tggctgaaaa acgggaagga gtttaagcag 600
gagcatcgca ttggaggcta caaggtacga aaccagcact ggagcctcat tatggaaagt 660
gtggtcccat ctgacaaggg aaattatacc tgtgtagtgg agaatgaata cgggtccatc 720
aatcacacgt accacctgga tgttgtggcg cctggaagag aaaaggagat tacagcttcc 780
ccagactacc tggagatagc catttactgc ataggggtct tcttaatcgc ctgtatggtg 840
gtaacagtca tcctgtgccg aatgaagaac acgaccaaga agccagactt cagcagccag 900
ccggctgtgc acaagctgac caaacgtatc cccctgcgga gacaggtaac agtttcggct 960
gagtccagct cctccatgaa ctccaacacc ccgctggtga ggataacaac acgcctctct 1020
tcaacggcag acacccccat gctggcaggg gtctccgagt atgaacttcc agaggaccca 1080
aaatgggagt ttccaagaga taagctgaca ctgggcaagc ccctgggaga aggttgcttt 1140
gggcaagtgg tcatggcgga agcagtggga attgacaaag acaagcccaa ggaggcggtc 1200
accgtggccg tgaagatgtt gaaagatgat gccacagaga aagacctttc tgatctggtg 1260
tcagagatgg agatgatgaa gatgattggg aaacacaaga atatcataaa tcttcttgga 1320
gcctgcacac aggatgggcc tctctatgtc atagttgagt atgcctctaa aggcaacctc 1380
cgagaatacc tccgagcccg gaggccaccc gggatggagt actcctatga cattaaccgt 1440
gttcctgagg agcagatgac cttcaaggac ttggtgtcat gcacctacca gctggccaga 1500
ggcatggagt acttggcttc ccaaaaatgt attcatcgag atttagcagc cagaaatgtt 1560
ttggtaacag aaaacaatgt gatgaaaata gcagactttg gactcgccag agatatcaac 1620
aatatagact attacaaaaa gaccaccaat gggcggcttc cagtcaagtg gatggctcca 1680
gaagccctgt ttgatagagt atacactcat cagagtgatg tctggtcctt cggggtgtta 1740
atgtgggaga tcttcacttt agggggctcg ccctacccag ggattcccgt ggaggaactt 1800
tttaagctgc tgaaggaagg acacagaatg gataagccag ccaactgcac caacgaactg 1860
tacatgatga tgagggactg ttggcatgca gtgccctccc agagaccaac gttcaagcag 1920
ttggtagaag acttggatcg aattctcact ctcacaacca atgaggcaca tagagatgaa 1980
atccagcgca aatttgatgc tcttcgtaac agctgtactg taatcacaga cctggaggag 2040
cagctaaacc agctgaccga ggacaacgct gaactcaaca accaaaactt ctacttgtcc 2100
aaacaactcg atgaggcttc tggcgccaac gacgagattg tacaactgcg aagtgaagtg 2160
gaccatctcc gccgggagat cacggaacga gagatgcagc ttaccagcca gaagcaaacg 2220
atggaggctc tgaagaccac gtgcaccatg ctggaggaac aggtcatgga tttggaggcc 2280
ctaaacgatg agctgctaga aaaagagcgg cagtgggagg cctggaggag cgtcctgggt 2340
gatgagaaat cccagtttga gtgtcgggtt cgagagctgc agagaatgct ggacaccgag 2400
aaacagagca gggcgagagc cgatcagcgg atcaccgagt ctcgccaggt ggtggagctg 2460
gcagtgaagg agcacaaggc tgagattctc gctctgcagc aggctctcaa agagcagaag 2520
ctgaaggccg agagcctctc tgacaagctc aatgacctgg agaagaagca tgctatgctt 2580
gaaatgaatg cccgaagctt acagcagaag ctggagactg aacgagagct caaacagagg 2640
cttctggaag agcaagccaa attacagcag cagatggacc tgcagaaaaa tcacattttc 2700
cgtctgactc aaggactgca agaagctcta gatcgggctg atctactgaa gacagaaaga 2760
agtgacttgg agtatcagct ggaaaacatt caggttctct attctcatga aaaggtgaaa 2820
atggaaggca ctatttctca acaaaccaaa ctcattgatt ttctgcaagc caaaatggac 2880
caacctgcta aaaagaaaaa gggtttattt agtcgacgga aagaggaccc tgctttaccc 2940
acacaggttc ctctgcagta caatgagctg aagctggccc tggagaagga gaaagctcgc 3000
tgtgcagagc tagaggaagc ccttcagaag acccgcatcg agctccggtc cgcccgggag 3060
gaagctgccc accgcaaagc aacggaccac ccacacccat ccacgccagc caccgcgagg 3120
cagcagatcg ccatgtccgc catcgtgcgg tcgccagagc accagcccag tgccatgagc 3180
ctgctggccc cgccatccag ccgcagaaag gagtcttcaa ctccagagga atttagtcgg 3240
cgtcttaagg aacgcatgca ccacaatatt cctcaccgat tcaacgtagg actgaacatg 3300
cgagccacaa agtgtgctgt gtgtctggat accgtgcact ttggacgcca ggcatccaaa 3360
tgtctcgaat gtcaggtgat gtgtcacccc aagtgctcca cgtgcttgcc agccacctgc 3420
ggcttgcctg ctgaatatgc cacacacttc accgaggcct tctgccgtga caaaatgaac 3480
tccccaggtc tccagaccaa ggagcccagc agcagcttgc acctggaagg gtggatgaag 3540
gtgcccagga ataacaaacg aggacagcaa ggctgggaca ggaagtacat tgtcctggag 3600
ggatcaaaag tcctcattta tgacaatgaa gccagagaag ctggacagag gccggtggaa 3660
gaatttgagc tgtgccttcc cgacggggat gtatctattc atggtgccgt tggtgcttcc 3720
gaactcgcaa atacagccaa agcagatgtc ccatacatac tgaagatgga atctcacccg 3780
cacaccacct gctggcccgg gagaaccctc tacttgctag ctcccagctt ccctgacaaa 3840
cagcgctggg tcaccgcctt agaatcagtt gtcgcaggtg ggagagtttc tagggaaaaa 3900
gcagaagctg atgctaaact gcttggaaac tccctgctga aactggaagg tgatgaccgt 3960
ctagacatga actgcacgct gcccttcagt gaccaggtgg tgttggtggg caccgaggaa 4020
gggctctacg ccctgaatgt cttgaaaaac tccctaaccc atgtcccagg aattggagca 4080
gtcttccaaa tttatattat caaggacctg gagaagctac tcatgatagc aggagaagag 4140
cgggcactgt gtcttgtgga cgtgaagaaa gtgaaacagt ccctggccca gtcccacctg 4200
cctgcccagc ccgacatctc acccaacatt tttgaagctg tcaagggctg ccacttgttt 4260
ggggcaggca agattgagaa cgggctctgc atctgtgcag ccatgcccag caaagtcgtc 4320
attctccgct acaacgaaaa cctcagcaaa tactgcatcc ggaaagagat agagacctca 4380
gagccctgca gctgtatcca cttcaccaat tacagtatcc tcattggaac caataaattc 4440
tacgaaatcg acatgaagca gtacacgctc gaggaattcc tggataagaa tgaccattcc 4500
ttggcacctg ctgtgtttgc cgcctcttcc aacagcttcc ctgtctcaat cgtgcaggtg 4560
aacagcgcag ggcagcgaga ggagtacttg ctgtgtttcc acgaatttgg agtgttcgtg 4620
gattcttacg gaagacgtag ccgcacagac gatctcaagt ggagtcgctt acctttggcc 4680
tttgcctaca gagaacccta tctgtttgtg acccacttca actcactcga agtaattgag 4740
atccaggcac gctcctcagc agggacccct gcccgagcgt acctggacat cccgaacccg 4800
cgctacctgg gccctgccat ttcctcagga gcgatttact tggcgtcctc ataccaggat 4860
aaattaaggg tcatttgctg caagggaaac ctcgtgaagg agtccggcac tgaacaccac 4920
cggggcccgt ccacctcccg cagcagcccc aacaagcgag gcccacccac gtacaacgag 4980
cacatcacca agcgcgtggc ctccagccca gcgccgcccg aaggccccag ccacccgcga 5040
gagccaagca caccccaccg ctaccgcgag gggcggaccg agctgcgcag ggacaagtct 5100
cctggccgcc ccctggagcg agagaagtcc cccggccgga tgctcagcac gcggagagag 5160
cggtcccccg ggaggctgtt tgaagacagc agcaggggcc ggctgcctgc gggagccgtg 5220
aggaccccgc tgtcccaggt gaacaaggtc tgggaccagt cttcagtata a 5271
<210> 30
<211> 42
<212> DNA
<213> Artificial Sequence
<220>
<223> Fused region of FGFR2-CIT fusion gene
<400> 30
ctcactctca caaccaatga ggcacataga gatgaaatcc ag 42
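The record layout used throughout this listing (numeric identifiers `<210>` for the sequence number, `<211>` for the length, `<212>` for the molecule type, `<213>` for the organism, `<223>` for free-text information, `<400>` for the residues) lends itself to a simple consistency check. The following is only a minimal sketch, assuming one record is available as a plain string; the helper names `parse_record` and `dna_residues` are hypothetical and are not part of the listing.

```python
import re

# One record copied from the listing (SEQ ID NO: 30).
RECORD = """\
<210> 30
<211> 42
<212> DNA
<213> Artificial Sequence
<220>
<223> Fused region of FGFR2-CIT fusion gene
<400> 30
ctcactctca caaccaatga ggcacataga gatgaaatcc ag 42
"""


def parse_record(text):
    """Split a record into its numeric-identifier fields and raw sequence lines."""
    fields = {}
    seq_lines = []
    in_sequence = False
    for line in text.splitlines():
        match = re.match(r"<(\d{3})>\s*(.*)", line)
        if match:
            fields[match.group(1)] = match.group(2).strip()
            # Lines after <400> hold the residues until the next identifier.
            in_sequence = match.group(1) == "400"
        elif in_sequence and line.strip():
            seq_lines.append(line)
    return fields, seq_lines


def dna_residues(seq_lines):
    """Join nucleotide blocks, dropping the running position count at line ends."""
    blocks = []
    for line in seq_lines:
        blocks.extend(tok for tok in line.split() if tok.isalpha())
    return "".join(blocks)


fields, seq_lines = parse_record(RECORD)
sequence = dna_residues(seq_lines)
print(fields["223"], len(sequence), int(fields["211"]))
```

Comparing `len(sequence)` with the declared `<211>` value is a cheap way to catch truncated or duplicated lines in a nucleotide record.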
<210> 31
<211> 1756
<212> PRT
<213> Artificial Sequence
<220>
<223> FGFR2-CIT fusion protein
<400> 31
Met Val Ser Trp Gly Arg Phe Ile Cys Leu Val Val Val Thr Met Ala
1 5 10 15
Thr Leu Ser Leu Ala Arg Pro Ser Phe Ser Leu Val Glu Asp Thr Thr
20 25 30
Leu Glu Pro Glu Glu Pro Pro Thr Lys Tyr Gln Ile Ser Gln Pro Glu
35 40 45
Val Tyr Val Ala Ala Pro Gly Glu Ser Leu Glu Val Arg Cys Leu Leu
50 55 60
Lys Asp Ala Ala Val Ile Ser Trp Thr Lys Asp Gly Val His Leu Gly
65 70 75 80
Pro Asn Asn Arg Thr Val Leu Ile Gly Glu Tyr Leu Gln Ile Lys Gly
85 90 95
Ala Thr Pro Arg Asp Ser Gly Leu Tyr Ala Cys Thr Ala Ser Arg Thr
100 105 110
Val Asp Ser Glu Thr Trp Tyr Phe Met Val Asn Val Thr Asp Ala Ile
115 120 125
Ser Ser Gly Asp Asp Glu Asp Asp Thr Asp Gly Ala Glu Asp Phe Val
130 135 140
Ser Glu Asn Ser Asn Asn Lys Arg Ala Pro Tyr Trp Thr Asn Thr Glu
145 150 155 160
Lys Met Glu Lys Arg Leu His Ala Val Pro Ala Ala Asn Thr Val Lys
165 170 175
Phe Arg Cys Pro Ala Gly Gly Asn Pro Met Pro Thr Met Arg Trp Leu
180 185 190
Lys Asn Gly Lys Glu Phe Lys Gln Glu His Arg Ile Gly Gly Tyr Lys
195 200 205
Val Arg Asn Gln His Trp Ser Leu Ile Met Glu Ser Val Val Pro Ser
210 215 220
Asp Lys Gly Asn Tyr Thr Cys Val Val Glu Asn Glu Tyr Gly Ser Ile
225 230 235 240
Asn His Thr Tyr His Leu Asp Val Val Ala Pro Gly Arg Glu Lys Glu
245 250 255
Ile Thr Ala Ser Pro Asp Tyr Leu Glu Ile Ala Ile Tyr Cys Ile Gly
260 265 270
Val Phe Leu Ile Ala Cys Met Val Val Thr Val Ile Leu Cys Arg Met
275 280 285
Lys Asn Thr Thr Lys Lys Pro Asp Phe Ser Ser Gln Pro Ala Val His
290 295 300
Lys Leu Thr Lys Arg Ile Pro Leu Arg Arg Gln Val Thr Val Ser Ala
305 310 315 320
Glu Ser Ser Ser Ser Met Asn Ser Asn Thr Pro Leu Val Arg Ile Thr
325 330 335
Thr Arg Leu Ser Ser Thr Ala Asp Thr Pro Met Leu Ala Gly Val Ser
340 345 350
Glu Tyr Glu Leu Pro Glu Asp Pro Lys Trp Glu Phe Pro Arg Asp Lys
355 360 365
Leu Thr Leu Gly Lys Pro Leu Gly Glu Gly Cys Phe Gly Gln Val Val
370 375 380
Met Ala Glu Ala Val Gly Ile Asp Lys Asp Lys Pro Lys Glu Ala Val
385 390 395 400
Thr Val Ala Val Lys Met Leu Lys Asp Asp Ala Thr Glu Lys Asp Leu
405 410 415
Ser Asp Leu Val Ser Glu Met Glu Met Met Lys Met Ile Gly Lys His
420 425 430
Lys Asn Ile Ile Asn Leu Leu Gly Ala Cys Thr Gln Asp Gly Pro Leu
435 440 445
Tyr Val Ile Val Glu Tyr Ala Ser Lys Gly Asn Leu Arg Glu Tyr Leu
450 455 460
Arg Ala Arg Arg Pro Pro Gly Met Glu Tyr Ser Tyr Asp Ile Asn Arg
465 470 475 480
Val Pro Glu Glu Gln Met Thr Phe Lys Asp Leu Val Ser Cys Thr Tyr
485 490 495
Gln Leu Ala Arg Gly Met Glu Tyr Leu Ala Ser Gln Lys Cys Ile His
500 505 510
Arg Asp Leu Ala Ala Arg Asn Val Leu Val Thr Glu Asn Asn Val Met
515 520 525
Lys Ile Ala Asp Phe Gly Leu Ala Arg Asp Ile Asn Asn Ile Asp Tyr
530 535 540
Tyr Lys Lys Thr Thr Asn Gly Arg Leu Pro Val Lys Trp Met Ala Pro
545 550 555 560
Glu Ala Leu Phe Asp Arg Val Tyr Thr His Gln Ser Asp Val Trp Ser
565 570 575
Phe Gly Val Leu Met Trp Glu Ile Phe Thr Leu Gly Gly Ser Pro Tyr
580 585 590
Pro Gly Ile Pro Val Glu Glu Leu Phe Lys Leu Leu Lys Glu Gly His
595 600 605
Arg Met Asp Lys Pro Ala Asn Cys Thr Asn Glu Leu Tyr Met Met Met
610 615 620
Arg Asp Cys Trp His Ala Val Pro Ser Gln Arg Pro Thr Phe Lys Gln
625 630 635 640
Leu Val Glu Asp Leu Asp Arg Ile Leu Thr Leu Thr Thr Asn Glu Ala
645 650 655
His Arg Asp Glu Ile Gln Arg Lys Phe Asp Ala Leu Arg Asn Ser Cys
660 665 670
Thr Val Ile Thr Asp Leu Glu Glu Gln Leu Asn Gln Leu Thr Glu Asp
675 680 685
Asn Ala Glu Leu Asn Asn Gln Asn Phe Tyr Leu Ser Lys Gln Leu Asp
690 695 700
Glu Ala Ser Gly Ala Asn Asp Glu Ile Val Gln Leu Arg Ser Glu Val
705 710 715 720
Asp His Leu Arg Arg Glu Ile Thr Glu Arg Glu Met Gln Leu Thr Ser
725 730 735
Gln Lys Gln Thr Met Glu Ala Leu Lys Thr Thr Cys Thr Met Leu Glu
740 745 750
Glu Gln Val Met Asp Leu Glu Ala Leu Asn Asp Glu Leu Leu Glu Lys
755 760 765
Glu Arg Gln Trp Glu Ala Trp Arg Ser Val Leu Gly Asp Glu Lys Ser
770 775 780
Gln Phe Glu Cys Arg Val Arg Glu Leu Gln Arg Met Leu Asp Thr Glu
785 790 795 800
Lys Gln Ser Arg Ala Arg Ala Asp Gln Arg Ile Thr Glu Ser Arg Gln
805 810 815
Val Val Glu Leu Ala Val Lys Glu His Lys Ala Glu Ile Leu Ala Leu
820 825 830
Gln Gln Ala Leu Lys Glu Gln Lys Leu Lys Ala Glu Ser Leu Ser Asp
835 840 845
Lys Leu Asn Asp Leu Glu Lys Lys His Ala Met Leu Glu Met Asn Ala
850 855 860
Arg Ser Leu Gln Gln Lys Leu Glu Thr Glu Arg Glu Leu Lys Gln Arg
865 870 875 880
Leu Leu Glu Glu Gln Ala Lys Leu Gln Gln Gln Met Asp Leu Gln Lys
885 890 895
Asn His Ile Phe Arg Leu Thr Gln Gly Leu Gln Glu Ala Leu Asp Arg
900 905 910
Ala Asp Leu Leu Lys Thr Glu Arg Ser Asp Leu Glu Tyr Gln Leu Glu
915 920 925
Asn Ile Gln Val Leu Tyr Ser His Glu Lys Val Lys Met Glu Gly Thr
930 935 940
Ile Ser Gln Gln Thr Lys Leu Ile Asp Phe Leu Gln Ala Lys Met Asp
945 950 955 960
Gln Pro Ala Lys Lys Lys Lys Gly Leu Phe Ser Arg Arg Lys Glu Asp
965 970 975
Pro Ala Leu Pro Thr Gln Val Pro Leu Gln Tyr Asn Glu Leu Lys Leu
980 985 990
Ala Leu Glu Lys Glu Lys Ala Arg Cys Ala Glu Leu Glu Glu Ala Leu
995 1000 1005
Gln Lys Thr Arg Ile Glu Leu Arg Ser Ala Arg Glu Glu Ala Ala His
1010 1015 1020
Arg Lys Ala Thr Asp His Pro His Pro Ser Thr Pro Ala Thr Ala Arg
1025 1030 1035 1040
Gln Gln Ile Ala Met Ser Ala Ile Val Arg Ser Pro Glu His Gln Pro
1045 1050 1055
Ser Ala Met Ser Leu Leu Ala Pro Pro Ser Ser Arg Arg Lys Glu Ser
1060 1065 1070
Ser Thr Pro Glu Glu Phe Ser Arg Arg Leu Lys Glu Arg Met His His
1075 1080 1085
Asn Ile Pro His Arg Phe Asn Val Gly Leu Asn Met Arg Ala Thr Lys
1090 1095 1100
Cys Ala Val Cys Leu Asp Thr Val His Phe Gly Arg Gln Ala Ser Lys
1105 1110 1115 1120
Cys Leu Glu Cys Gln Val Met Cys His Pro Lys Cys Ser Thr Cys Leu
1125 1130 1135
Pro Ala Thr Cys Gly Leu Pro Ala Glu Tyr Ala Thr His Phe Thr Glu
1140 1145 1150
Ala Phe Cys Arg Asp Lys Met Asn Ser Pro Gly Leu Gln Thr Lys Glu
1155 1160 1165
Pro Ser Ser Ser Leu His Leu Glu Gly Trp Met Lys Val Pro Arg Asn
1170 1175 1180
Asn Lys Arg Gly Gln Gln Gly Trp Asp Arg Lys Tyr Ile Val Leu Glu
1185 1190 1195 1200
Gly Ser Lys Val Leu Ile Tyr Asp Asn Glu Ala Arg Glu Ala Gly Gln
1205 1210 1215
Arg Pro Val Glu Glu Phe Glu Leu Cys Leu Pro Asp Gly Asp Val Ser
1220 1225 1230
Ile His Gly Ala Val Gly Ala Ser Glu Leu Ala Asn Thr Ala Lys Ala
1235 1240 1245
Asp Val Pro Tyr Ile Leu Lys Met Glu Ser His Pro His Thr Thr Cys
1250 1255 1260
Trp Pro Gly Arg Thr Leu Tyr Leu Leu Ala Pro Ser Phe Pro Asp Lys
1265 1270 1275 1280
Gln Arg Trp Val Thr Ala Leu Glu Ser Val Val Ala Gly Gly Arg Val
1285 1290 1295
Ser Arg Glu Lys Ala Glu Ala Asp Ala Lys Leu Leu Gly Asn Ser Leu
1300 1305 1310
Leu Lys Leu Glu Gly Asp Asp Arg Leu Asp Met Asn Cys Thr Leu Pro
1315 1320 1325
Phe Ser Asp Gln Val Val Leu Val Gly Thr Glu Glu Gly Leu Tyr Ala
1330 1335 1340
Leu Asn Val Leu Lys Asn Ser Leu Thr His Val Pro Gly Ile Gly Ala
1345 1350 1355 1360
Val Phe Gln Ile Tyr Ile Ile Lys Asp Leu Glu Lys Leu Leu Met Ile
1365 1370 1375
Ala Gly Glu Glu Arg Ala Leu Cys Leu Val Asp Val Lys Lys Val Lys
1380 1385 1390
Gln Ser Leu Ala Gln Ser His Leu Pro Ala Gln Pro Asp Ile Ser Pro
1395 1400 1405
Asn Ile Phe Glu Ala Val Lys Gly Cys His Leu Phe Gly Ala Gly Lys
1410 1415 1420
Ile Glu Asn Gly Leu Cys Ile Cys Ala Ala Met Pro Ser Lys Val Val
1425 1430 1435 1440
Ile Leu Arg Tyr Asn Glu Asn Leu Ser Lys Tyr Cys Ile Arg Lys Glu
1445 1450 1455
Ile Glu Thr Ser Glu Pro Cys Ser Cys Ile His Phe Thr Asn Tyr Ser
1460 1465 1470
Ile Leu Ile Gly Thr Asn Lys Phe Tyr Glu Ile Asp Met Lys Gln Tyr
1475 1480 1485
Thr Leu Glu Glu Phe Leu Asp Lys Asn Asp His Ser Leu Ala Pro Ala
1490 1495 1500
Val Phe Ala Ala Ser Ser Asn Ser Phe Pro Val Ser Ile Val Gln Val
1505 1510 1515 1520
Asn Ser Ala Gly Gln Arg Glu Glu Tyr Leu Leu Cys Phe His Glu Phe
1525 1530 1535
Gly Val Phe Val Asp Ser Tyr Gly Arg Arg Ser Arg Thr Asp Asp Leu
1540 1545 1550
Lys Trp Ser Arg Leu Pro Leu Ala Phe Ala Tyr Arg Glu Pro Tyr Leu
1555 1560 1565
Phe Val Thr His Phe Asn Ser Leu Glu Val Ile Glu Ile Gln Ala Arg
1570 1575 1580
Ser Ser Ala Gly Thr Pro Ala Arg Ala Tyr Leu Asp Ile Pro Asn Pro
1585 1590 1595 1600
Arg Tyr Leu Gly Pro Ala Ile Ser Ser Gly Ala Ile Tyr Leu Ala Ser
1605 1610 1615
Ser Tyr Gln Asp Lys Leu Arg Val Ile Cys Cys Lys Gly Asn Leu Val
1620 1625 1630
Lys Glu Ser Gly Thr Glu His His Arg Gly Pro Ser Thr Ser Arg Ser
1635 1640 1645
Ser Pro Asn Lys Arg Gly Pro Pro Thr Tyr Asn Glu His Ile Thr Lys
1650 1655 1660
Arg Val Ala Ser Ser Pro Ala Pro Pro Glu Gly Pro Ser His Pro Arg
1665 1670 1675 1680
Glu Pro Ser Thr Pro His Arg Tyr Arg Glu Gly Arg Thr Glu Leu Arg
1685 1690 1695
Arg Asp Lys Ser Pro Gly Arg Pro Leu Glu Arg Glu Lys Ser Pro Gly
1700 1705 1710
Arg Met Leu Ser Thr Arg Arg Glu Arg Ser Pro Gly Arg Leu Phe Glu
1715 1720 1725
Asp Ser Ser Arg Gly Arg Leu Pro Ala Gly Ala Val Arg Thr Pro Leu
1730 1735 1740
Ser Gln Val Asn Lys Val Trp Asp Gln Ser Ser Val
1745 1750 1755
<210> 32
<211> 14
<212> PRT
<213> Artificial Sequence
<220>
<223> Fused region of FGFR2-CIT fusion protein
<400> 32
Leu Thr Leu Thr Thr Asn Glu Ala His Arg Asp Glu Ile Gln
1 5 10
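The nucleotide fused region (SEQ ID NO: 30) and the peptide fused region (SEQ ID NO: 32) can be checked against each other by translating the former. The sketch below assumes the standard genetic code and that the 42-nucleotide region is read in frame from its first base; the function name `translate` and the lookup tables are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue names, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate(dna):
    """Translate an in-frame DNA string into a list of three-letter residues."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return [THREE_LETTER[CODON_TABLE[dna[i:i + 3]]] for i in range(0, len(dna), 3)]


# SEQ ID NO: 30 and SEQ ID NO: 32, copied from the listing.
SEQ_30 = "ctcactctca caaccaatga ggcacataga gatgaaatcc ag"
SEQ_32 = "Leu Thr Leu Thr Thr Asn Glu Ala His Arg Asp Glu Ile Gln"

peptide = " ".join(translate(SEQ_30))
print(peptide == SEQ_32)
```

The same function can be applied to the other break-point pairs in the listing, for example SEQ ID NO: 35 against SEQ ID NO: 38.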
<210> 33
<211> 2685
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of AXL gene (NM_021913)
<400> 33
atggcgtggc ggtgccccag gatgggcagg gtcccgctgg cctggtgctt ggcgctgtgc 60
ggctgggcgt gcatggcccc caggggcacg caggctgaag aaagtccctt cgtgggcaac 120
ccagggaata tcacaggtgc ccggggactc acgggcaccc ttcggtgtca gctccaggtt 180
cagggagagc cccccgaggt acattggctt cgggatggac agatcctgga gctcgcggac 240
agcacccaga cccaggtgcc cctgggtgag gatgaacagg atgactggat agtggtcagc 300
cagctcagaa tcacctccct gcagctttcc gacacgggac agtaccagtg tttggtgttt 360
ctgggacatc agaccttcgt gtcccagcct ggctatgttg ggctggaggg cttgccttac 420
ttcctggagg agcccgaaga caggactgtg gccgccaaca cccccttcaa cctgagctgc 480
caagctcagg gacccccaga gcccgtggac ctactctggc tccaggatgc tgtccccctg 540
gccacggctc caggtcacgg cccccagcgc agcctgcatg ttccagggct gaacaagaca 600
tcctctttct cctgcgaagc ccataacgcc aagggggtca ccacatcccg cacagccacc 660
atcacagtgc tcccccagca gccccgtaac ctccacctgg tctcccgcca acccacggag 720
ctggaggtgg cttggactcc aggcctgagc ggcatctacc ccctgaccca ctgcaccctg 780
caggctgtgc tgtcagacga tgggatgggc atccaggcgg gagaaccaga ccccccagag 840
gagcccctca cctcgcaagc atccgtgccc ccccatcagc ttcggctagg cagcctccat 900
cctcacaccc cttatcacat ccgcgtggca tgcaccagca gccagggccc ctcatcctgg 960
acccactggc ttcctgtgga gacgccggag ggagtgcccc tgggcccccc tgagaacatt 1020
agtgctacgc ggaatgggag ccaggccttc gtgcattggc aagagccccg ggcgcccctg 1080
cagggtaccc tgttagggta ccggctggcg tatcaaggcc aggacacccc agaggtgcta 1140
atggacatag ggctaaggca agaggtgacc ctggagctgc agggggacgg gtctgtgtcc 1200
aatctgacag tgtgtgtggc agcctacact gctgctgggg atggaccctg gagcctccca 1260
gtacccctgg aggcctggcg cccagggcaa gcacagccag tccaccagct ggtgaaggaa 1320
ccttcaactc ctgccttctc gtggccctgg tggtatgtac tgctaggagc agtcgtggcc 1380
gctgcctgtg tcctcatctt ggctctcttc cttgtccacc ggcgaaagaa ggagacccgt 1440
tatggagaag tgtttgaacc aacagtggaa agaggtgaac tggtagtcag gtaccgcgtg 1500
cgcaagtcct acagtcgtcg gaccactgaa gctaccttga acagcctggg catcagtgaa 1560
gagctgaagg agaagctgcg ggatgtgatg gtggaccggc acaaggtggc cctggggaag 1620
actctgggag agggagagtt tggagctgtg atggaaggcc agctcaacca ggacgactcc 1680
atcctcaagg tggctgtgaa gacgatgaag attgccatct gcacgaggtc agagctggag 1740
gatttcctga gtgaagcggt ctgcatgaag gaatttgacc atcccaacgt catgaggctc 1800
atcggtgtct gtttccaggg ttctgaacga gagagcttcc cagcacctgt ggtcatctta 1860
cctttcatga aacatggaga cctacacagc ttcctcctct attcccggct cggggaccag 1920
ccagtgtacc tgcccactca gatgctagtg aagttcatgg cagacatcgc cagtggcatg 1980
gagtatctga gtaccaagag attcatacac cgggacctgg cggccaggaa ctgcatgctg 2040
aatgagaaca tgtccgtgtg tgtggcggac ttcgggctct ccaagaagat ctacaatggg 2100
gactactacc gccagggacg tatcgccaag atgccagtca agtggattgc cattgagagt 2160
ctagctgacc gtgtctacac cagcaagagc gatgtgtggt ccttcggggt gacaatgtgg 2220
gagattgcca caagaggcca aaccccatat ccgggcgtgg agaacagcga gatttatgac 2280
tatctgcgcc agggaaatcg cctgaagcag cctgcggact gtctggatgg actgtatgcc 2340
ttgatgtcgc ggtgctggga gctaaatccc caggaccggc caagttttac agagctgcgg 2400
gaagatttgg agaacacact gaaggccttg cctcctgccc aggagcctga cgaaatcctc 2460
tatgtcaaca tggatgaggg tggaggttat cctgaacccc ctggagctgc aggaggagct 2520
gaccccccaa cccagccaga ccctaaggat tcctgtagct gcctcactgc ggctgaggtc 2580
catcctgctg gacgctatgt cctctgccct tccacaaccc ctagccccgc tcagcctgct 2640
gataggggct ccccagcagc cccagggcag gaggatggtg cctga 2685
<210> 34
<211> 2577
<212> DNA
<213> Artificial Sequence
<220>
<223> AXL gene fragment
<400> 34
atggcgtggc ggtgccccag gatgggcagg gtcccgctgg cctggtgctt ggcgctgtgc 60
ggctgggcgt gcatggcccc caggggcacg caggctgaag aaagtccctt cgtgggcaac 120
ccagggaata tcacaggtgc ccggggactc acgggcaccc ttcggtgtca gctccaggtt 180
cagggagagc cccccgaggt acattggctt cgggatggac agatcctgga gctcgcggac 240
agcacccaga cccaggtgcc cctgggtgag gatgaacagg atgactggat agtggtcagc 300
cagctcagaa tcacctccct gcagctttcc gacacgggac agtaccagtg tttggtgttt 360
ctgggacatc agaccttcgt gtcccagcct ggctatgttg ggctggaggg cttgccttac 420
ttcctggagg agcccgaaga caggactgtg gccgccaaca cccccttcaa cctgagctgc 480
caagctcagg gacccccaga gcccgtggac ctactctggc tccaggatgc tgtccccctg 540
gccacggctc caggtcacgg cccccagcgc agcctgcatg ttccagggct gaacaagaca 600
tcctctttct cctgcgaagc ccataacgcc aagggggtca ccacatcccg cacagccacc 660
atcacagtgc tcccccagca gccccgtaac ctccacctgg tctcccgcca acccacggag 720
ctggaggtgg cttggactcc aggcctgagc ggcatctacc ccctgaccca ctgcaccctg 780
caggctgtgc tgtcagacga tgggatgggc atccaggcgg gagaaccaga ccccccagag 840
gagcccctca cctcgcaagc atccgtgccc ccccatcagc ttcggctagg cagcctccat 900
cctcacaccc cttatcacat ccgcgtggca tgcaccagca gccagggccc ctcatcctgg 960
acccactggc ttcctgtgga gacgccggag ggagtgcccc tgggcccccc tgagaacatt 1020
agtgctacgc ggaatgggag ccaggccttc gtgcattggc aagagccccg ggcgcccctg 1080
cagggtaccc tgttagggta ccggctggcg tatcaaggcc aggacacccc agaggtgcta 1140
atggacatag ggctaaggca agaggtgacc ctggagctgc agggggacgg gtctgtgtcc 1200
aatctgacag tgtgtgtggc agcctacact gctgctgggg atggaccctg gagcctccca 1260
gtacccctgg aggcctggcg cccagggcaa gcacagccag tccaccagct ggtgaaggaa 1320
ccttcaactc ctgccttctc gtggccctgg tggtatgtac tgctaggagc agtcgtggcc 1380
gctgcctgtg tcctcatctt ggctctcttc cttgtccacc ggcgaaagaa ggagacccgt 1440
tatggagaag tgtttgaacc aacagtggaa agaggtgaac tggtagtcag gtaccgcgtg 1500
cgcaagtcct acagtcgtcg gaccactgaa gctaccttga acagcctggg catcagtgaa 1560
gagctgaagg agaagctgcg ggatgtgatg gtggaccggc acaaggtggc cctggggaag 1620
actctgggag agggagagtt tggagctgtg atggaaggcc agctcaacca ggacgactcc 1680
atcctcaagg tggctgtgaa gacgatgaag attgccatct gcacgaggtc agagctggag 1740
gatttcctga gtgaagcggt ctgcatgaag gaatttgacc atcccaacgt catgaggctc 1800
atcggtgtct gtttccaggg ttctgaacga gagagcttcc cagcacctgt ggtcatctta 1860
cctttcatga aacatggaga cctacacagc ttcctcctct attcccggct cggggaccag 1920
ccagtgtacc tgcccactca gatgctagtg aagttcatgg cagacatcgc cagtggcatg 1980
gagtatctga gtaccaagag attcatacac cgggacctgg cggccaggaa ctgcatgctg 2040
aatgagaaca tgtccgtgtg tgtggcggac ttcgggctct ccaagaagat ctacaatggg 2100
gactactacc gccagggacg tatcgccaag atgccagtca agtggattgc cattgagagt 2160
ctagctgacc gtgtctacac cagcaagagc gatgtgtggt ccttcggggt gacaatgtgg 2220
gagattgcca caagaggcca aaccccatat ccgggcgtgg agaacagcga gatttatgac 2280
tatctgcgcc agggaaatcg cctgaagcag cctgcggact gtctggatgg actgtatgcc 2340
ttgatgtcgc ggtgctggga gctaaatccc caggaccggc caagttttac agagctgcgg 2400
gaagatttgg agaacacact gaaggccttg cctcctgccc aggagcctga cgaaatcctc 2460
tatgtcaaca tggatgaggg tggaggttat cctgaacccc ctggagctgc aggaggagct 2520
gaccccccaa cccagccaga ccctaaggat tcctgtagct gcctcactgc ggctgag 2577
<210> 35
<211> 15
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of AXL gene fragment
<400> 35
ctcactgcgg ctgag 15
<210> 36
<211> 894
<212> PRT
<213> Artificial Sequence
<220>
<223> AXL protein
<400> 36
Met Ala Trp Arg Cys Pro Arg Met Gly Arg Val Pro Leu Ala Trp Cys
1 5 10 15
Leu Ala Leu Cys Gly Trp Ala Cys Met Ala Pro Arg Gly Thr Gln Ala
20 25 30
Glu Glu Ser Pro Phe Val Gly Asn Pro Gly Asn Ile Thr Gly Ala Arg
35 40 45
Gly Leu Thr Gly Thr Leu Arg Cys Gln Leu Gln Val Gln Gly Glu Pro
50 55 60
Pro Glu Val His Trp Leu Arg Asp Gly Gln Ile Leu Glu Leu Ala Asp
65 70 75 80
Ser Thr Gln Thr Gln Val Pro Leu Gly Glu Asp Glu Gln Asp Asp Trp
85 90 95
Ile Val Val Ser Gln Leu Arg Ile Thr Ser Leu Gln Leu Ser Asp Thr
100 105 110
Gly Gln Tyr Gln Cys Leu Val Phe Leu Gly His Gln Thr Phe Val Ser
115 120 125
Gln Pro Gly Tyr Val Gly Leu Glu Gly Leu Pro Tyr Phe Leu Glu Glu
130 135 140
Pro Glu Asp Arg Thr Val Ala Ala Asn Thr Pro Phe Asn Leu Ser Cys
145 150 155 160
Gln Ala Gln Gly Pro Pro Glu Pro Val Asp Leu Leu Trp Leu Gln Asp
165 170 175
Ala Val Pro Leu Ala Thr Ala Pro Gly His Gly Pro Gln Arg Ser Leu
180 185 190
His Val Pro Gly Leu Asn Lys Thr Ser Ser Phe Ser Cys Glu Ala His
195 200 205
Asn Ala Lys Gly Val Thr Thr Ser Arg Thr Ala Thr Ile Thr Val Leu
210 215 220
Pro Gln Gln Pro Arg Asn Leu His Leu Val Ser Arg Gln Pro Thr Glu
225 230 235 240
Leu Glu Val Ala Trp Thr Pro Gly Leu Ser Gly Ile Tyr Pro Leu Thr
245 250 255
His Cys Thr Leu Gln Ala Val Leu Ser Asp Asp Gly Met Gly Ile Gln
260 265 270
Ala Gly Glu Pro Asp Pro Pro Glu Glu Pro Leu Thr Ser Gln Ala Ser
275 280 285
Val Pro Pro His Gln Leu Arg Leu Gly Ser Leu His Pro His Thr Pro
290 295 300
Tyr His Ile Arg Val Ala Cys Thr Ser Ser Gln Gly Pro Ser Ser Trp
305 310 315 320
Thr His Trp Leu Pro Val Glu Thr Pro Glu Gly Val Pro Leu Gly Pro
325 330 335
Pro Glu Asn Ile Ser Ala Thr Arg Asn Gly Ser Gln Ala Phe Val His
340 345 350
Trp Gln Glu Pro Arg Ala Pro Leu Gln Gly Thr Leu Leu Gly Tyr Arg
355 360 365
Leu Ala Tyr Gln Gly Gln Asp Thr Pro Glu Val Leu Met Asp Ile Gly
370 375 380
Leu Arg Gln Glu Val Thr Leu Glu Leu Gln Gly Asp Gly Ser Val Ser
385 390 395 400
Asn Leu Thr Val Cys Val Ala Ala Tyr Thr Ala Ala Gly Asp Gly Pro
405 410 415
Trp Ser Leu Pro Val Pro Leu Glu Ala Trp Arg Pro Gly Gln Ala Gln
420 425 430
Pro Val His Gln Leu Val Lys Glu Pro Ser Thr Pro Ala Phe Ser Trp
435 440 445
Pro Trp Trp Tyr Val Leu Leu Gly Ala Val Val Ala Ala Ala Cys Val
450 455 460
Leu Ile Leu Ala Leu Phe Leu Val His Arg Arg Lys Lys Glu Thr Arg
465 470 475 480
Tyr Gly Glu Val Phe Glu Pro Thr Val Glu Arg Gly Glu Leu Val Val
485 490 495
Arg Tyr Arg Val Arg Lys Ser Tyr Ser Arg Arg Thr Thr Glu Ala Thr
500 505 510
Leu Asn Ser Leu Gly Ile Ser Glu Glu Leu Lys Glu Lys Leu Arg Asp
515 520 525
Val Met Val Asp Arg His Lys Val Ala Leu Gly Lys Thr Leu Gly Glu
530 535 540
Gly Glu Phe Gly Ala Val Met Glu Gly Gln Leu Asn Gln Asp Asp Ser
545 550 555 560
Ile Leu Lys Val Ala Val Lys Thr Met Lys Ile Ala Ile Cys Thr Arg
565 570 575
Ser Glu Leu Glu Asp Phe Leu Ser Glu Ala Val Cys Met Lys Glu Phe
580 585 590
Asp His Pro Asn Val Met Arg Leu Ile Gly Val Cys Phe Gln Gly Ser
595 600 605
Glu Arg Glu Ser Phe Pro Ala Pro Val Val Ile Leu Pro Phe Met Lys
610 615 620
His Gly Asp Leu His Ser Phe Leu Leu Tyr Ser Arg Leu Gly Asp Gln
625 630 635 640
Pro Val Tyr Leu Pro Thr Gln Met Leu Val Lys Phe Met Ala Asp Ile
645 650 655
Ala Ser Gly Met Glu Tyr Leu Ser Thr Lys Arg Phe Ile His Arg Asp
660 665 670
Leu Ala Ala Arg Asn Cys Met Leu Asn Glu Asn Met Ser Val Cys Val
675 680 685
Ala Asp Phe Gly Leu Ser Lys Lys Ile Tyr Asn Gly Asp Tyr Tyr Arg
690 695 700
Gln Gly Arg Ile Ala Lys Met Pro Val Lys Trp Ile Ala Ile Glu Ser
705 710 715 720
Leu Ala Asp Arg Val Tyr Thr Ser Lys Ser Asp Val Trp Ser Phe Gly
725 730 735
Val Thr Met Trp Glu Ile Ala Thr Arg Gly Gln Thr Pro Tyr Pro Gly
740 745 750
Val Glu Asn Ser Glu Ile Tyr Asp Tyr Leu Arg Gln Gly Asn Arg Leu
755 760 765
Lys Gln Pro Ala Asp Cys Leu Asp Gly Leu Tyr Ala Leu Met Ser Arg
770 775 780
Cys Trp Glu Leu Asn Pro Gln Asp Arg Pro Ser Phe Thr Glu Leu Arg
785 790 795 800
Glu Asp Leu Glu Asn Thr Leu Lys Ala Leu Pro Pro Ala Gln Glu Pro
805 810 815
Asp Glu Ile Leu Tyr Val Asn Met Asp Glu Gly Gly Gly Tyr Pro Glu
820 825 830
Pro Pro Gly Ala Ala Gly Gly Ala Asp Pro Pro Thr Gln Pro Asp Pro
835 840 845
Lys Asp Ser Cys Ser Cys Leu Thr Ala Ala Glu Val His Pro Ala Gly
850 855 860
Arg Tyr Val Leu Cys Pro Ser Thr Thr Pro Ser Pro Ala Gln Pro Ala
865 870 875 880
Asp Arg Gly Ser Pro Ala Ala Pro Gly Gln Glu Asp Gly Ala
885 890
<210> 37
<211> 859
<212> PRT
<213> Artificial Sequence
<220>
<223> AXL protein fragment
<400> 37
Met Ala Trp Arg Cys Pro Arg Met Gly Arg Val Pro Leu Ala Trp Cys
1 5 10 15
Leu Ala Leu Cys Gly Trp Ala Cys Met Ala Pro Arg Gly Thr Gln Ala
20 25 30
Glu Glu Ser Pro Phe Val Gly Asn Pro Gly Asn Ile Thr Gly Ala Arg
35 40 45
Gly Leu Thr Gly Thr Leu Arg Cys Gln Leu Gln Val Gln Gly Glu Pro
50 55 60
Pro Glu Val His Trp Leu Arg Asp Gly Gln Ile Leu Glu Leu Ala Asp
65 70 75 80
Ser Thr Gln Thr Gln Val Pro Leu Gly Glu Asp Glu Gln Asp Asp Trp
85 90 95
Ile Val Val Ser Gln Leu Arg Ile Thr Ser Leu Gln Leu Ser Asp Thr
100 105 110
Gly Gln Tyr Gln Cys Leu Val Phe Leu Gly His Gln Thr Phe Val Ser
115 120 125
Gln Pro Gly Tyr Val Gly Leu Glu Gly Leu Pro Tyr Phe Leu Glu Glu
130 135 140
Pro Glu Asp Arg Thr Val Ala Ala Asn Thr Pro Phe Asn Leu Ser Cys
145 150 155 160
Gln Ala Gln Gly Pro Pro Glu Pro Val Asp Leu Leu Trp Leu Gln Asp
165 170 175
Ala Val Pro Leu Ala Thr Ala Pro Gly His Gly Pro Gln Arg Ser Leu
180 185 190
His Val Pro Gly Leu Asn Lys Thr Ser Ser Phe Ser Cys Glu Ala His
195 200 205
Asn Ala Lys Gly Val Thr Thr Ser Arg Thr Ala Thr Ile Thr Val Leu
210 215 220
Pro Gln Gln Pro Arg Asn Leu His Leu Val Ser Arg Gln Pro Thr Glu
225 230 235 240
Leu Glu Val Ala Trp Thr Pro Gly Leu Ser Gly Ile Tyr Pro Leu Thr
245 250 255
His Cys Thr Leu Gln Ala Val Leu Ser Asp Asp Gly Met Gly Ile Gln
260 265 270
Ala Gly Glu Pro Asp Pro Pro Glu Glu Pro Leu Thr Ser Gln Ala Ser
275 280 285
Val Pro Pro His Gln Leu Arg Leu Gly Ser Leu His Pro His Thr Pro
290 295 300
Tyr His Ile Arg Val Ala Cys Thr Ser Ser Gln Gly Pro Ser Ser Trp
305 310 315 320
Thr His Trp Leu Pro Val Glu Thr Pro Glu Gly Val Pro Leu Gly Pro
325 330 335
Pro Glu Asn Ile Ser Ala Thr Arg Asn Gly Ser Gln Ala Phe Val His
340 345 350
Trp Gln Glu Pro Arg Ala Pro Leu Gln Gly Thr Leu Leu Gly Tyr Arg
355 360 365
Leu Ala Tyr Gln Gly Gln Asp Thr Pro Glu Val Leu Met Asp Ile Gly
370 375 380
Leu Arg Gln Glu Val Thr Leu Glu Leu Gln Gly Asp Gly Ser Val Ser
385 390 395 400
Asn Leu Thr Val Cys Val Ala Ala Tyr Thr Ala Ala Gly Asp Gly Pro
405 410 415
Trp Ser Leu Pro Val Pro Leu Glu Ala Trp Arg Pro Gly Gln Ala Gln
420 425 430
Pro Val His Gln Leu Val Lys Glu Pro Ser Thr Pro Ala Phe Ser Trp
435 440 445
Pro Trp Trp Tyr Val Leu Leu Gly Ala Val Val Ala Ala Ala Cys Val
450 455 460
Leu Ile Leu Ala Leu Phe Leu Val His Arg Arg Lys Lys Glu Thr Arg
465 470 475 480
Tyr Gly Glu Val Phe Glu Pro Thr Val Glu Arg Gly Glu Leu Val Val
485 490 495
Arg Tyr Arg Val Arg Lys Ser Tyr Ser Arg Arg Thr Thr Glu Ala Thr
500 505 510
Leu Asn Ser Leu Gly Ile Ser Glu Glu Leu Lys Glu Lys Leu Arg Asp
515 520 525
Val Met Val Asp Arg His Lys Val Ala Leu Gly Lys Thr Leu Gly Glu
530 535 540
Gly Glu Phe Gly Ala Val Met Glu Gly Gln Leu Asn Gln Asp Asp Ser
545 550 555 560
Ile Leu Lys Val Ala Val Lys Thr Met Lys Ile Ala Ile Cys Thr Arg
565 570 575
Ser Glu Leu Glu Asp Phe Leu Ser Glu Ala Val Cys Met Lys Glu Phe
580 585 590
Asp His Pro Asn Val Met Arg Leu Ile Gly Val Cys Phe Gln Gly Ser
595 600 605
Glu Arg Glu Ser Phe Pro Ala Pro Val Val Ile Leu Pro Phe Met Lys
610 615 620
His Gly Asp Leu His Ser Phe Leu Leu Tyr Ser Arg Leu Gly Asp Gln
625 630 635 640
Pro Val Tyr Leu Pro Thr Gln Met Leu Val Lys Phe Met Ala Asp Ile
645 650 655
Ala Ser Gly Met Glu Tyr Leu Ser Thr Lys Arg Phe Ile His Arg Asp
660 665 670
Leu Ala Ala Arg Asn Cys Met Leu Asn Glu Asn Met Ser Val Cys Val
675 680 685
Ala Asp Phe Gly Leu Ser Lys Lys Ile Tyr Asn Gly Asp Tyr Tyr Arg
690 695 700
Gln Gly Arg Ile Ala Lys Met Pro Val Lys Trp Ile Ala Ile Glu Ser
705 710 715 720
Leu Ala Asp Arg Val Tyr Thr Ser Lys Ser Asp Val Trp Ser Phe Gly
725 730 735
Val Thr Met Trp Glu Ile Ala Thr Arg Gly Gln Thr Pro Tyr Pro Gly
740 745 750
Val Glu Asn Ser Glu Ile Tyr Asp Tyr Leu Arg Gln Gly Asn Arg Leu
755 760 765
Lys Gln Pro Ala Asp Cys Leu Asp Gly Leu Tyr Ala Leu Met Ser Arg
770 775 780
Cys Trp Glu Leu Asn Pro Gln Asp Arg Pro Ser Phe Thr Glu Leu Arg
785 790 795 800
Glu Asp Leu Glu Asn Thr Leu Lys Ala Leu Pro Pro Ala Gln Glu Pro
805 810 815
Asp Glu Ile Leu Tyr Val Asn Met Asp Glu Gly Gly Gly Tyr Pro Glu
820 825 830
Pro Pro Gly Ala Ala Gly Gly Ala Asp Pro Pro Thr Gln Pro Asp Pro
835 840 845
Lys Asp Ser Cys Ser Cys Leu Thr Ala Ala Glu
850 855
<210> 38
<211> 5
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of AXL protein fragment
<400> 38
Leu Thr Ala Ala Glu
1 5
<210> 39
<211> 1035
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of MBIP gene (NM_016586)
<400> 39
atggctgctg ccacggagct taatcgcccg agcagcggtg acaggaacct ggagcgaaga 60
tgcagaccca acctctcccg agaggtgctc tacgaaatct ttcgctccct acacaccctg 120
gttggacagc ttgacctcag agatgatgtg gtgaaaatta caatcgattg gaacaagctc 180
cagagcctct cggcattcca gcctgcattg ctctttagtg cacttgaaca acacatttta 240
tatttacagc cttttttagc aaaacttcag tctccgatta aagaggagaa tacaactgct 300
gttgaagaga taggaagaac agaaatgggg aacaaaaatg aagtaaatga caaattttcc 360
attggcgacc tacaagagga agaaaagcac aaagaaagtg atttaagaga tgtgaaaaag 420
acacagatcc attttgatcc agaagtagtt cagataaagg ctggaaaagc agaaattgac 480
agacgaatat ctgcatttat tgaaagaaag caagctgaaa tcaatgaaaa caacgtcagg 540
gaattttgca atgttattga ttgtaatcaa gaaaatagtt gtgcaagaac tgatgcgatt 600
tttacccctt accccggatt taaaagtcac gtaaaagttt ctagagttgt gaatacatac 660
ggaccacaga ctagacctga aggaattcca gggtcaggtc ataaacctaa cagcatgctt 720
cgagactgtg gtaatcaggc tgtagaagaa cgactacaaa atattgaggc ccacttgcgg 780
ttacagacag gtggtccagt gccaagagac atttatcaga gaattaaaaa acttgaggat 840
aaaatccttg aattggaagg catctctcct gaatattttc agtctgtaag cttttctgga 900
aaaagaagaa aagttcaacc acctcaacag aactattcac tggctgaact tgatgagaaa 960
attagtgccc tcaaacaagc cctcctcaga aaatcaagag aagcagaatc catggcaacc 1020
caccaccttc catga 1035
<210> 40
<211> 561
<212> DNA
<213> Artificial Sequence
<220>
<223> MBIP gene fragment
<400> 40
attgacagac gaatatctgc atttattgaa agaaagcaag ctgaaatcaa tgaaaacaac 60
gtcagggaat tttgcaatgt tattgattgt aatcaagaaa atagttgtgc aagaactgat 120
gcgattttta ccccttaccc cggatttaaa agtcacgtaa aagtttctag agttgtgaat 180
acatacggac cacagactag acctgaagga attccagggt caggtcataa acctaacagc 240
atgcttcgag actgtggtaa tcaggctgta gaagaacgac tacaaaatat tgaggcccac 300
ttgcggttac agacaggtgg tccagtgcca agagacattt atcagagaat taaaaaactt 360
gaggataaaa tccttgaatt ggaaggcatc tctcctgaat attttcagtc tgtaagcttt 420
tctggaaaaa gaagaaaagt tcaaccacct caacagaact attcactggc tgaacttgat 480
gagaaaatta gtgccctcaa acaagccctc ctcagaaaat caagagaagc agaatccatg 540
gcaacccacc accttccatg a 561
<210> 41
<211> 15
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of MBIP gene fragment
<400> 41
attgacagac gaata 15
<210> 42
<211> 344
<212> PRT
<213> Artificial Sequence
<220>
<223> MBIP protein
<400> 42
Met Ala Ala Ala Thr Glu Leu Asn Arg Pro Ser Ser Gly Asp Arg Asn
1 5 10 15
Leu Glu Arg Arg Cys Arg Pro Asn Leu Ser Arg Glu Val Leu Tyr Glu
20 25 30
Ile Phe Arg Ser Leu His Thr Leu Val Gly Gln Leu Asp Leu Arg Asp
35 40 45
Asp Val Val Lys Ile Thr Ile Asp Trp Asn Lys Leu Gln Ser Leu Ser
50 55 60
Ala Phe Gln Pro Ala Leu Leu Phe Ser Ala Leu Glu Gln His Ile Leu
65 70 75 80
Tyr Leu Gln Pro Phe Leu Ala Lys Leu Gln Ser Pro Ile Lys Glu Glu
85 90 95
Asn Thr Thr Ala Val Glu Glu Ile Gly Arg Thr Glu Met Gly Asn Lys
100 105 110
Asn Glu Val Asn Asp Lys Phe Ser Ile Gly Asp Leu Gln Glu Glu Glu
115 120 125
Lys His Lys Glu Ser Asp Leu Arg Asp Val Lys Lys Thr Gln Ile His
130 135 140
Phe Asp Pro Glu Val Val Gln Ile Lys Ala Gly Lys Ala Glu Ile Asp
145 150 155 160
Arg Arg Ile Ser Ala Phe Ile Glu Arg Lys Gln Ala Glu Ile Asn Glu
165 170 175
Asn Asn Val Arg Glu Phe Cys Asn Val Ile Asp Cys Asn Gln Glu Asn
180 185 190
Ser Cys Ala Arg Thr Asp Ala Ile Phe Thr Pro Tyr Pro Gly Phe Lys
195 200 205
Ser His Val Lys Val Ser Arg Val Val Asn Thr Tyr Gly Pro Gln Thr
210 215 220
Arg Pro Glu Gly Ile Pro Gly Ser Gly His Lys Pro Asn Ser Met Leu
225 230 235 240
Arg Asp Cys Gly Asn Gln Ala Val Glu Glu Arg Leu Gln Asn Ile Glu
245 250 255
Ala His Leu Arg Leu Gln Thr Gly Gly Pro Val Pro Arg Asp Ile Tyr
260 265 270
Gln Arg Ile Lys Lys Leu Glu Asp Lys Ile Leu Glu Leu Glu Gly Ile
275 280 285
Ser Pro Glu Tyr Phe Gln Ser Val Ser Phe Ser Gly Lys Arg Arg Lys
290 295 300
Val Gln Pro Pro Gln Gln Asn Tyr Ser Leu Ala Glu Leu Asp Glu Lys
305 310 315 320
Ile Ser Ala Leu Lys Gln Ala Leu Leu Arg Lys Ser Arg Glu Ala Glu
325 330 335
Ser Met Ala Thr His His Leu Pro
340
<210> 43
<211> 186
<212> PRT
<213> Unknown
<220>
<223> MBIP protein fragment
<400> 43
Ile Asp Arg Arg Ile Ser Ala Phe Ile Glu Arg Lys Gln Ala Glu Ile
1 5 10 15
Asn Glu Asn Asn Val Arg Glu Phe Cys Asn Val Ile Asp Cys Asn Gln
20 25 30
Glu Asn Ser Cys Ala Arg Thr Asp Ala Ile Phe Thr Pro Tyr Pro Gly
35 40 45
Phe Lys Ser His Val Lys Val Ser Arg Val Val Asn Thr Tyr Gly Pro
50 55 60
Gln Thr Arg Pro Glu Gly Ile Pro Gly Ser Gly His Lys Pro Asn Ser
65 70 75 80
Met Leu Arg Asp Cys Gly Asn Gln Ala Val Glu Glu Arg Leu Gln Asn
85 90 95
Ile Glu Ala His Leu Arg Leu Gln Thr Gly Gly Pro Val Pro Arg Asp
100 105 110
Ile Tyr Gln Arg Ile Lys Lys Leu Glu Asp Lys Ile Leu Glu Leu Glu
115 120 125
Gly Ile Ser Pro Glu Tyr Phe Gln Ser Val Ser Phe Ser Gly Lys Arg
130 135 140
Arg Lys Val Gln Pro Pro Gln Gln Asn Tyr Ser Leu Ala Glu Leu Asp
145 150 155 160
Glu Lys Ile Ser Ala Leu Lys Gln Ala Leu Leu Arg Lys Ser Arg Glu
165 170 175
Ala Glu Ser Met Ala Thr His His Leu Pro
180 185
<210> 44
<211> 5
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of MBIP protein fragment
<400> 44
Ile Asp Arg Arg Ile
1 5
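The declared lengths of the AXL and MBIP fragments can be cross-checked against the declared length of the fusion gene in SEQ ID NO: 45. This is a minimal sketch using only the `<211>` values given in the listing; the expected fusion protein length is derived here under the assumption that the two fragments are joined in frame and that the MBIP fragment ends with a single stop codon. It is not a value quoted from the listing.

```python
# Declared <211> lengths copied from the listing.
AXL_FRAGMENT_NT = 2577   # SEQ ID NO: 34
MBIP_FRAGMENT_NT = 561   # SEQ ID NO: 40
FUSION_NT = 3138         # SEQ ID NO: 45
AXL_FRAGMENT_AA = 859    # SEQ ID NO: 37
MBIP_FRAGMENT_AA = 186   # SEQ ID NO: 43


def codon_count(nt_length):
    """Number of complete codons; rejects lengths that are not in frame."""
    if nt_length % 3 != 0:
        raise ValueError("length is not a multiple of three")
    return nt_length // 3


# The fusion gene should be exactly the two fragments placed end to end.
fusion_matches = AXL_FRAGMENT_NT + MBIP_FRAGMENT_NT == FUSION_NT

# The AXL fragment carries no stop codon; the MBIP fragment ends with one.
axl_matches = codon_count(AXL_FRAGMENT_NT) == AXL_FRAGMENT_AA
mbip_matches = codon_count(MBIP_FRAGMENT_NT) - 1 == MBIP_FRAGMENT_AA

# Derived (assumed in-frame) length of the encoded fusion protein.
expected_fusion_aa = codon_count(FUSION_NT) - 1

print(fusion_matches, axl_matches, mbip_matches, expected_fusion_aa)
```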
<210> 45
<211> 3138
<212> DNA
<213> Artificial Sequence
<220>
<223> AXL-MBIP fusion gene
<400> 45
atggcgtggc ggtgccccag gatgggcagg gtcccgctgg cctggtgctt ggcgctgtgc 60
ggctgggcgt gcatggcccc caggggcacg caggctgaag aaagtccctt cgtgggcaac 120
ccagggaata tcacaggtgc ccggggactc acgggcaccc ttcggtgtca gctccaggtt 180
cagggagagc cccccgaggt acattggctt cgggatggac agatcctgga gctcgcggac 240
agcacccaga cccaggtgcc cctgggtgag gatgaacagg atgactggat agtggtcagc 300
cagctcagaa tcacctccct gcagctttcc gacacgggac agtaccagtg tttggtgttt 360
ctgggacatc agaccttcgt gtcccagcct ggctatgttg ggctggaggg cttgccttac 420
ttcctggagg agcccgaaga caggactgtg gccgccaaca cccccttcaa cctgagctgc 480
caagctcagg gacccccaga gcccgtggac ctactctggc tccaggatgc tgtccccctg 540
gccacggctc caggtcacgg cccccagcgc agcctgcatg ttccagggct gaacaagaca 600
tcctctttct cctgcgaagc ccataacgcc aagggggtca ccacatcccg cacagccacc 660
atcacagtgc tcccccagca gccccgtaac ctccacctgg tctcccgcca acccacggag 720
ctggaggtgg cttggactcc aggcctgagc ggcatctacc ccctgaccca ctgcaccctg 780
caggctgtgc tgtcagacga tgggatgggc atccaggcgg gagaaccaga ccccccagag 840
gagcccctca cctcgcaagc atccgtgccc ccccatcagc ttcggctagg cagcctccat 900
cctcacaccc cttatcacat ccgcgtggca tgcaccagca gccagggccc ctcatcctgg 960
acccactggc ttcctgtgga gacgccggag ggagtgcccc tgggcccccc tgagaacatt 1020
agtgctacgc ggaatgggag ccaggccttc gtgcattggc aagagccccg ggcgcccctg 1080
cagggtaccc tgttagggta ccggctggcg tatcaaggcc aggacacccc agaggtgcta 1140
atggacatag ggctaaggca agaggtgacc ctggagctgc agggggacgg gtctgtgtcc 1200
aatctgacag tgtgtgtggc agcctacact gctgctgggg atggaccctg gagcctccca 1260
gtacccctgg aggcctggcg cccagggcaa gcacagccag tccaccagct ggtgaaggaa 1320
ccttcaactc ctgccttctc gtggccctgg tggtatgtac tgctaggagc agtcgtggcc 1380
gctgcctgtg tcctcatctt ggctctcttc cttgtccacc ggcgaaagaa ggagacccgt 1440
tatggagaag tgtttgaacc aacagtggaa agaggtgaac tggtagtcag gtaccgcgtg 1500
cgcaagtcct acagtcgtcg gaccactgaa gctaccttga acagcctggg catcagtgaa 1560
gagctgaagg agaagctgcg ggatgtgatg gtggaccggc acaaggtggc cctggggaag 1620
actctgggag agggagagtt tggagctgtg atggaaggcc agctcaacca ggacgactcc 1680
atcctcaagg tggctgtgaa gacgatgaag attgccatct gcacgaggtc agagctggag 1740
gatttcctga gtgaagcggt ctgcatgaag gaatttgacc atcccaacgt catgaggctc 1800
atcggtgtct gtttccaggg ttctgaacga gagagcttcc cagcacctgt ggtcatctta 1860
cctttcatga aacatggaga cctacacagc ttcctcctct attcccggct cggggaccag 1920
ccagtgtacc tgcccactca gatgctagtg aagttcatgg cagacatcgc cagtggcatg 1980
gagtatctga gtaccaagag attcatacac cgggacctgg cggccaggaa ctgcatgctg 2040
aatgagaaca tgtccgtgtg tgtggcggac ttcgggctct ccaagaagat ctacaatggg 2100
gactactacc gccagggacg tatcgccaag atgccagtca agtggattgc cattgagagt 2160
ctagctgacc gtgtctacac cagcaagagc gatgtgtggt ccttcggggt gacaatgtgg 2220
gagattgcca caagaggcca aaccccatat ccgggcgtgg agaacagcga gatttatgac 2280
tatctgcgcc agggaaatcg cctgaagcag cctgcggact gtctggatgg actgtatgcc 2340
ttgatgtcgc ggtgctggga gctaaatccc caggaccggc caagttttac agagctgcgg 2400
gaagatttgg agaacacact gaaggccttg cctcctgccc aggagcctga cgaaatcctc 2460
tatgtcaaca tggatgaggg tggaggttat cctgaacccc ctggagctgc aggaggagct 2520
gaccccccaa cccagccaga ccctaaggat tcctgtagct gcctcactgc ggctgagatt 2580
gacagacgaa tatctgcatt tattgaaaga aagcaagctg aaatcaatga aaacaacgtc 2640
agggaatttt gcaatgttat tgattgtaat caagaaaata gttgtgcaag aactgatgcg 2700
atttttaccc cttaccccgg atttaaaagt cacgtaaaag tttctagagt tgtgaataca 2760
tacggaccac agactagacc tgaaggaatt ccagggtcag gtcataaacc taacagcatg 2820
cttcgagact gtggtaatca ggctgtagaa gaacgactac aaaatattga ggcccacttg 2880
cggttacaga caggtggtcc agtgccaaga gacatttatc agagaattaa aaaacttgag 2940
gataaaatcc ttgaattgga aggcatctct cctgaatatt ttcagtctgt aagcttttct 3000
ggaaaaagaa gaaaagttca accacctcaa cagaactatt cactggctga acttgatgag 3060
aaaattagtg ccctcaaaca agccctcctc agaaaatcaa gagaagcaga atccatggca 3120
acccaccacc ttccatga 3138
<210> 46
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> Fused region of AXL-MBIPCIT fusion gene
<400> 46
ctcactgcgg ctgagattga cagacgaata 30
<210> 47
<211> 1045
<212> PRT
<213> Artificial Sequence
<220>
<223> AXL-MBIPCIT fusion protein
<400> 47
Met Ala Trp Arg Cys Pro Arg Met Gly Arg Val Pro Leu Ala Trp Cys
1 5 10 15
Leu Ala Leu Cys Gly Trp Ala Cys Met Ala Pro Arg Gly Thr Gln Ala
20 25 30
Glu Glu Ser Pro Phe Val Gly Asn Pro Gly Asn Ile Thr Gly Ala Arg
35 40 45
Gly Leu Thr Gly Thr Leu Arg Cys Gln Leu Gln Val Gln Gly Glu Pro
50 55 60
Pro Glu Val His Trp Leu Arg Asp Gly Gln Ile Leu Glu Leu Ala Asp
65 70 75 80
Ser Thr Gln Thr Gln Val Pro Leu Gly Glu Asp Glu Gln Asp Asp Trp
85 90 95
Ile Val Val Ser Gln Leu Arg Ile Thr Ser Leu Gln Leu Ser Asp Thr
100 105 110
Gly Gln Tyr Gln Cys Leu Val Phe Leu Gly His Gln Thr Phe Val Ser
115 120 125
Gln Pro Gly Tyr Val Gly Leu Glu Gly Leu Pro Tyr Phe Leu Glu Glu
130 135 140
Pro Glu Asp Arg Thr Val Ala Ala Asn Thr Pro Phe Asn Leu Ser Cys
145 150 155 160
Gln Ala Gln Gly Pro Pro Glu Pro Val Asp Leu Leu Trp Leu Gln Asp
165 170 175
Ala Val Pro Leu Ala Thr Ala Pro Gly His Gly Pro Gln Arg Ser Leu
180 185 190
His Val Pro Gly Leu Asn Lys Thr Ser Ser Phe Ser Cys Glu Ala His
195 200 205
Asn Ala Lys Gly Val Thr Thr Ser Arg Thr Ala Thr Ile Thr Val Leu
210 215 220
Pro Gln Gln Pro Arg Asn Leu His Leu Val Ser Arg Gln Pro Thr Glu
225 230 235 240
Leu Glu Val Ala Trp Thr Pro Gly Leu Ser Gly Ile Tyr Pro Leu Thr
245 250 255
His Cys Thr Leu Gln Ala Val Leu Ser Asp Asp Gly Met Gly Ile Gln
260 265 270
Ala Gly Glu Pro Asp Pro Pro Glu Glu Pro Leu Thr Ser Gln Ala Ser
275 280 285
Val Pro Pro His Gln Leu Arg Leu Gly Ser Leu His Pro His Thr Pro
290 295 300
Tyr His Ile Arg Val Ala Cys Thr Ser Ser Gln Gly Pro Ser Ser Trp
305 310 315 320
Thr His Trp Leu Pro Val Glu Thr Pro Glu Gly Val Pro Leu Gly Pro
325 330 335
Pro Glu Asn Ile Ser Ala Thr Arg Asn Gly Ser Gln Ala Phe Val His
340 345 350
Trp Gln Glu Pro Arg Ala Pro Leu Gln Gly Thr Leu Leu Gly Tyr Arg
355 360 365
Leu Ala Tyr Gln Gly Gln Asp Thr Pro Glu Val Leu Met Asp Ile Gly
370 375 380
Leu Arg Gln Glu Val Thr Leu Glu Leu Gln Gly Asp Gly Ser Val Ser
385 390 395 400
Asn Leu Thr Val Cys Val Ala Ala Tyr Thr Ala Ala Gly Asp Gly Pro
405 410 415
Trp Ser Leu Pro Val Pro Leu Glu Ala Trp Arg Pro Gly Gln Ala Gln
420 425 430
Pro Val His Gln Leu Val Lys Glu Pro Ser Thr Pro Ala Phe Ser Trp
435 440 445
Pro Trp Trp Tyr Val Leu Leu Gly Ala Val Val Ala Ala Ala Cys Val
450 455 460
Leu Ile Leu Ala Leu Phe Leu Val His Arg Arg Lys Lys Glu Thr Arg
465 470 475 480
Tyr Gly Glu Val Phe Glu Pro Thr Val Glu Arg Gly Glu Leu Val Val
485 490 495
Arg Tyr Arg Val Arg Lys Ser Tyr Ser Arg Arg Thr Thr Glu Ala Thr
500 505 510
Leu Asn Ser Leu Gly Ile Ser Glu Glu Leu Lys Glu Lys Leu Arg Asp
515 520 525
Val Met Val Asp Arg His Lys Val Ala Leu Gly Lys Thr Leu Gly Glu
530 535 540
Gly Glu Phe Gly Ala Val Met Glu Gly Gln Leu Asn Gln Asp Asp Ser
545 550 555 560
Ile Leu Lys Val Ala Val Lys Thr Met Lys Ile Ala Ile Cys Thr Arg
565 570 575
Ser Glu Leu Glu Asp Phe Leu Ser Glu Ala Val Cys Met Lys Glu Phe
580 585 590
Asp His Pro Asn Val Met Arg Leu Ile Gly Val Cys Phe Gln Gly Ser
595 600 605
Glu Arg Glu Ser Phe Pro Ala Pro Val Val Ile Leu Pro Phe Met Lys
610 615 620
His Gly Asp Leu His Ser Phe Leu Leu Tyr Ser Arg Leu Gly Asp Gln
625 630 635 640
Pro Val Tyr Leu Pro Thr Gln Met Leu Val Lys Phe Met Ala Asp Ile
645 650 655
Ala Ser Gly Met Glu Tyr Leu Ser Thr Lys Arg Phe Ile His Arg Asp
660 665 670
Leu Ala Ala Arg Asn Cys Met Leu Asn Glu Asn Met Ser Val Cys Val
675 680 685
Ala Asp Phe Gly Leu Ser Lys Lys Ile Tyr Asn Gly Asp Tyr Tyr Arg
690 695 700
Gln Gly Arg Ile Ala Lys Met Pro Val Lys Trp Ile Ala Ile Glu Ser
705 710 715 720
Leu Ala Asp Arg Val Tyr Thr Ser Lys Ser Asp Val Trp Ser Phe Gly
725 730 735
Val Thr Met Trp Glu Ile Ala Thr Arg Gly Gln Thr Pro Tyr Pro Gly
740 745 750
Val Glu Asn Ser Glu Ile Tyr Asp Tyr Leu Arg Gln Gly Asn Arg Leu
755 760 765
Lys Gln Pro Ala Asp Cys Leu Asp Gly Leu Tyr Ala Leu Met Ser Arg
770 775 780
Cys Trp Glu Leu Asn Pro Gln Asp Arg Pro Ser Phe Thr Glu Leu Arg
785 790 795 800
Glu Asp Leu Glu Asn Thr Leu Lys Ala Leu Pro Pro Ala Gln Glu Pro
805 810 815
Asp Glu Ile Leu Tyr Val Asn Met Asp Glu Gly Gly Gly Tyr Pro Glu
820 825 830
Pro Pro Gly Ala Ala Gly Gly Ala Asp Pro Pro Thr Gln Pro Asp Pro
835 840 845
Lys Asp Ser Cys Ser Cys Leu Thr Ala Ala Glu Ile Asp Arg Arg Ile
850 855 860
Ser Ala Phe Ile Glu Arg Lys Gln Ala Glu Ile Asn Glu Asn Asn Val
865 870 875 880
Arg Glu Phe Cys Asn Val Ile Asp Cys Asn Gln Glu Asn Ser Cys Ala
885 890 895
Arg Thr Asp Ala Ile Phe Thr Pro Tyr Pro Gly Phe Lys Ser His Val
900 905 910
Lys Val Ser Arg Val Val Asn Thr Tyr Gly Pro Gln Thr Arg Pro Glu
915 920 925
Gly Ile Pro Gly Ser Gly His Lys Pro Asn Ser Met Leu Arg Asp Cys
930 935 940
Gly Asn Gln Ala Val Glu Glu Arg Leu Gln Asn Ile Glu Ala His Leu
945 950 955 960
Arg Leu Gln Thr Gly Gly Pro Val Pro Arg Asp Ile Tyr Gln Arg Ile
965 970 975
Lys Lys Leu Glu Asp Lys Ile Leu Glu Leu Glu Gly Ile Ser Pro Glu
980 985 990
Tyr Phe Gln Ser Val Ser Phe Ser Gly Lys Arg Arg Lys Val Gln Pro
995 1000 1005
Pro Gln Gln Asn Tyr Ser Leu Ala Glu Leu Asp Glu Lys Ile Ser Ala
1010 1015 1020
Leu Lys Gln Ala Leu Leu Arg Lys Ser Arg Glu Ala Glu Ser Met Ala
1025 1030 1035 1040
Thr His His Leu Pro
1045
<210> 48
<211> 10
<212> PRT
<213> Artificial Sequence
<220>
<223> Fused region of AXL-MBIPCIT fusion protein
<400> 48
Leu Thr Ala Ala Glu Ile Asp Arg Arg Ile
1 5 10
<210> 49
<211> 2292
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of APLP2 gene (NM_001642)
<400> 49
atggcggcca ccgggaccgc ggccgccgca gccacgggca ggctcctgct tctgctgctg 60
gtggggctca cggcgcctgc cttggcgctg gccggctaca tcgaggctct tgcagccaat 120
gccggaacag gatttgctgt tgctgagcct caaatcgcaa tgttttgtgg gaagttaaat 180
atgcatgtga acattcagac tgggaaatgg gaacctgatc caacaggcac caagagctgc 240
tttgaaacaa aagaagaagt tcttcagtac tgtcaggaga tgtatccaga gctacagatc 300
acaaatgtga tggaggcaaa ccagcgggtt agtattgaca actggtgccg gagggacaaa 360
aagcaatgca agagtcgctt tgttacacct ttcaagtgtc tcgtgggtga atttgtaagt 420
gatgtcctgc tagttccaga aaagtgccag tttttccaca aagagcggat ggaggtgtgt 480
gagaatcacc agcactggca cacggtagtc aaagaggcat gtctgactca gggaatgacc 540
ttatatagct acggcatgct gctcccatgt ggggtagacc agttccatgg cactgaatat 600
gtgtgctgcc ctcagacaaa gattattgga tctgtgtcaa aagaagagga agaggaagat 660
gaagaggaag aggaagagga agatgaagag gaagactatg atgtttataa aagtgaattt 720
cctactgaag cagatctgga agacttcaca gaagcagctg tggatgagga tgatgaggat 780
gaggaagaag gggaggaagt ggtggaggac cgagattact actatgacac cttcaaagga 840
gatgactaca atgaggagaa tcctactgaa cccggcagcg acggcaccat gtcagacaag 900
gaaattactc atgatgtcaa agctgtctgc tcccaggagg cgatgacggg gccctgccgg 960
gccgtgatgc ctcgttggta cttcgacctc tccaagggaa agtgcgtgcg ctttatatat 1020
ggtggctgcg gcggcaacag gaacaatttt gagtctgagg attattgtat ggctgtgtgt 1080
aaagcgatga ttcctccaac tcctctgcca accaatgatg ttgatgtgta tttcgagacc 1140
tctgcagatg ataatgagca tgctcgcttc cagaaggcta aggagcagct ggagattcgg 1200
caccgcaacc gaatggacag ggtaaagaag gaatgggaag aggcagagct tcaagctaag 1260
aacctcccca aagcagagag gcagactctg attcagcact tccaagccat ggttaaagct 1320
ttagagaagg aagcagccag tgagaagcag cagctggtgg agacccacct ggcccgagtg 1380
gaagctatgc tgaatgaccg ccgtcggatg gctctggaga actacctggc tgccttgcag 1440
tctgacccgc cacggcctca tcgcattctc caggccttac ggcgttatgt ccgtgctgag 1500
aacaaagatc gcttacatac catccgtcat taccagcatg tgttggctgt tgacccagaa 1560
aaggcggccc agatgaaatc ccaggtgatg acacatctcc acgtgattga agaaaggagg 1620
aaccaaagcc tctctctgct ctacaaagta ccttatgtag cccaagaaat tcaagaggaa 1680
attgatgagc tccttcagga gcagcgtgca gatatggacc agttcactgc ctcaatctca 1740
gagacccctg tggacgtccg ggtgagctct gaggagagtg aggagatccc accgttccac 1800
cccttccacc ccttcccagc cctacctgag aacgaagaca ctcagccgga gttgtaccac 1860
ccaatgaaaa aaggatctgg agtgggagag caggatgggg gactgatcgg tgccgaagag 1920
aaagtgatta acagtaagaa taaagtggat gaaaacatgg tcattgacga gactctggat 1980
gttaaggaaa tgattttcaa tgccgagaga gttggaggcc tcgaggaaga gcgggaatcc 2040
gtgggcccac tgcgggagga cttcagtctg agtagcagtg ctctcattgg cctgctggtc 2100
atcgcagtgg ccattgccac ggtcatcgtc atcagcctgg tgatgctgag gaagaggcag 2160
tatggcacca tcagccacgg gatcgtggag gttgatccaa tgctcacccc agaagagcgt 2220
cacctgaaca agatgcagaa ccatggctat gagaacccca cctacaaata cctggagcag 2280
atgcagattt ag 2292
<210> 50
<211> 1584
<212> DNA
<213> Artificial Sequence
<220>
<223> APLP2 gene fragment
<400> 50
atggcggcca ccgggaccgc ggccgccgca gccacgggca ggctcctgct tctgctgctg 60
gtggggctca cggcgcctgc cttggcgctg gccggctaca tcgaggctct tgcagccaat 120
gccggaacag gatttgctgt tgctgagcct caaatcgcaa tgttttgtgg gaagttaaat 180
atgcatgtga acattcagac tgggaaatgg gaacctgatc caacaggcac caagagctgc 240
tttgaaacaa aagaagaagt tcttcagtac tgtcaggaga tgtatccaga gctacagatc 300
acaaatgtga tggaggcaaa ccagcgggtt agtattgaca actggtgccg gagggacaaa 360
aagcaatgca agagtcgctt tgttacacct ttcaagtgtc tcgtgggtga atttgtaagt 420
gatgtcctgc tagttccaga aaagtgccag tttttccaca aagagcggat ggaggtgtgt 480
gagaatcacc agcactggca cacggtagtc aaagaggcat gtctgactca gggaatgacc 540
ttatatagct acggcatgct gctcccatgt ggggtagacc agttccatgg cactgaatat 600
gtgtgctgcc ctcagacaaa gattattgga tctgtgtcaa aagaagagga agaggaagat 660
gaagaggaag aggaagagga agatgaagag gaagactatg atgtttataa aagtgaattt 720
cctactgaag cagatctgga agacttcaca gaagcagctg tggatgagga tgatgaggat 780
gaggaagaag gggaggaagt ggtggaggac cgagattact actatgacac cttcaaagga 840
gatgactaca atgaggagaa tcctactgaa cccggcagcg acggcaccat gtcagacaag 900
gaaattactc atgatgtcaa agctgtctgc tcccaggagg cgatgacggg gccctgccgg 960
gccgtgatgc ctcgttggta cttcgacctc tccaagggaa agtgcgtgcg ctttatatat 1020
ggtggctgcg gcggcaacag gaacaatttt gagtctgagg attattgtat ggctgtgtgt 1080
aaagcgatga ttcctccaac tcctctgcca accaatgatg ttgatgtgta tttcgagacc 1140
tctgcagatg ataatgagca tgctcgcttc cagaaggcta aggagcagct ggagattcgg 1200
caccgcaacc gaatggacag ggtaaagaag gaatgggaag aggcagagct tcaagctaag 1260
aacctcccca aagcagagag gcagactctg attcagcact tccaagccat ggttaaagct 1320
ttagagaagg aagcagccag tgagaagcag cagctggtgg agacccacct ggcccgagtg 1380
gaagctatgc tgaatgaccg ccgtcggatg gctctggaga actacctggc tgccttgcag 1440
tctgacccgc cacggcctca tcgcattctc caggccttac ggcgttatgt ccgtgctgag 1500
aacaaagatc gcttacatac catccgtcat taccagcatg tgttggctgt tgacccagaa 1560
aaggcggccc agatgaaatc ccag 1584
<210> 51
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of APLP2 gene fragment
<400> 51
gcggcccaga tgaaatccca g 21
<210> 52
<211> 763
<212> PRT
<213> Artificial Sequence
<220>
<223> APLP2 protein
<400> 52
Met Ala Ala Thr Gly Thr Ala Ala Ala Ala Ala Thr Gly Arg Leu Leu
1 5 10 15
Leu Leu Leu Leu Val Gly Leu Thr Ala Pro Ala Leu Ala Leu Ala Gly
20 25 30
Tyr Ile Glu Ala Leu Ala Ala Asn Ala Gly Thr Gly Phe Ala Val Ala
35 40 45
Glu Pro Gln Ile Ala Met Phe Cys Gly Lys Leu Asn Met His Val Asn
50 55 60
Ile Gln Thr Gly Lys Trp Glu Pro Asp Pro Thr Gly Thr Lys Ser Cys
65 70 75 80
Phe Glu Thr Lys Glu Glu Val Leu Gln Tyr Cys Gln Glu Met Tyr Pro
85 90 95
Glu Leu Gln Ile Thr Asn Val Met Glu Ala Asn Gln Arg Val Ser Ile
100 105 110
Asp Asn Trp Cys Arg Arg Asp Lys Lys Gln Cys Lys Ser Arg Phe Val
115 120 125
Thr Pro Phe Lys Cys Leu Val Gly Glu Phe Val Ser Asp Val Leu Leu
130 135 140
Val Pro Glu Lys Cys Gln Phe Phe His Lys Glu Arg Met Glu Val Cys
145 150 155 160
Glu Asn His Gln His Trp His Thr Val Val Lys Glu Ala Cys Leu Thr
165 170 175
Gln Gly Met Thr Leu Tyr Ser Tyr Gly Met Leu Leu Pro Cys Gly Val
180 185 190
Asp Gln Phe His Gly Thr Glu Tyr Val Cys Cys Pro Gln Thr Lys Ile
195 200 205
Ile Gly Ser Val Ser Lys Glu Glu Glu Glu Glu Asp Glu Glu Glu Glu
210 215 220
Glu Glu Glu Asp Glu Glu Glu Asp Tyr Asp Val Tyr Lys Ser Glu Phe
225 230 235 240
Pro Thr Glu Ala Asp Leu Glu Asp Phe Thr Glu Ala Ala Val Asp Glu
245 250 255
Asp Asp Glu Asp Glu Glu Glu Gly Glu Glu Val Val Glu Asp Arg Asp
260 265 270
Tyr Tyr Tyr Asp Thr Phe Lys Gly Asp Asp Tyr Asn Glu Glu Asn Pro
275 280 285
Thr Glu Pro Gly Ser Asp Gly Thr Met Ser Asp Lys Glu Ile Thr His
290 295 300
Asp Val Lys Ala Val Cys Ser Gln Glu Ala Met Thr Gly Pro Cys Arg
305 310 315 320
Ala Val Met Pro Arg Trp Tyr Phe Asp Leu Ser Lys Gly Lys Cys Val
325 330 335
Arg Phe Ile Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Glu Ser
340 345 350
Glu Asp Tyr Cys Met Ala Val Cys Lys Ala Met Ile Pro Pro Thr Pro
355 360 365
Leu Pro Thr Asn Asp Val Asp Val Tyr Phe Glu Thr Ser Ala Asp Asp
370 375 380
Asn Glu His Ala Arg Phe Gln Lys Ala Lys Glu Gln Leu Glu Ile Arg
385 390 395 400
His Arg Asn Arg Met Asp Arg Val Lys Lys Glu Trp Glu Glu Ala Glu
405 410 415
Leu Gln Ala Lys Asn Leu Pro Lys Ala Glu Arg Gln Thr Leu Ile Gln
420 425 430
His Phe Gln Ala Met Val Lys Ala Leu Glu Lys Glu Ala Ala Ser Glu
435 440 445
Lys Gln Gln Leu Val Glu Thr His Leu Ala Arg Val Glu Ala Met Leu
450 455 460
Asn Asp Arg Arg Arg Met Ala Leu Glu Asn Tyr Leu Ala Ala Leu Gln
465 470 475 480
Ser Asp Pro Pro Arg Pro His Arg Ile Leu Gln Ala Leu Arg Arg Tyr
485 490 495
Val Arg Ala Glu Asn Lys Asp Arg Leu His Thr Ile Arg His Tyr Gln
500 505 510
His Val Leu Ala Val Asp Pro Glu Lys Ala Ala Gln Met Lys Ser Gln
515 520 525
Val Met Thr His Leu His Val Ile Glu Glu Arg Arg Asn Gln Ser Leu
530 535 540
Ser Leu Leu Tyr Lys Val Pro Tyr Val Ala Gln Glu Ile Gln Glu Glu
545 550 555 560
Ile Asp Glu Leu Leu Gln Glu Gln Arg Ala Asp Met Asp Gln Phe Thr
565 570 575
Ala Ser Ile Ser Glu Thr Pro Val Asp Val Arg Val Ser Ser Glu Glu
580 585 590
Ser Glu Glu Ile Pro Pro Phe His Pro Phe His Pro Phe Pro Ala Leu
595 600 605
Pro Glu Asn Glu Asp Thr Gln Pro Glu Leu Tyr His Pro Met Lys Lys
610 615 620
Gly Ser Gly Val Gly Glu Gln Asp Gly Gly Leu Ile Gly Ala Glu Glu
625 630 635 640
Lys Val Ile Asn Ser Lys Asn Lys Val Asp Glu Asn Met Val Ile Asp
645 650 655
Glu Thr Leu Asp Val Lys Glu Met Ile Phe Asn Ala Glu Arg Val Gly
660 665 670
Gly Leu Glu Glu Glu Arg Glu Ser Val Gly Pro Leu Arg Glu Asp Phe
675 680 685
Ser Leu Ser Ser Ser Ala Leu Ile Gly Leu Leu Val Ile Ala Val Ala
690 695 700
Ile Ala Thr Val Ile Val Ile Ser Leu Val Met Leu Arg Lys Arg Gln
705 710 715 720
Tyr Gly Thr Ile Ser His Gly Ile Val Glu Val Asp Pro Met Leu Thr
725 730 735
Pro Glu Glu Arg His Leu Asn Lys Met Gln Asn His Gly Tyr Glu Asn
740 745 750
Pro Thr Tyr Lys Tyr Leu Glu Gln Met Gln Ile
755 760
<210> 53
<211> 528
<212> PRT
<213> Artificial Sequence
<220>
<223> APLP2 protein fragment
<400> 53
Met Ala Ala Thr Gly Thr Ala Ala Ala Ala Ala Thr Gly Arg Leu Leu
1 5 10 15
Leu Leu Leu Leu Val Gly Leu Thr Ala Pro Ala Leu Ala Leu Ala Gly
20 25 30
Tyr Ile Glu Ala Leu Ala Ala Asn Ala Gly Thr Gly Phe Ala Val Ala
35 40 45
Glu Pro Gln Ile Ala Met Phe Cys Gly Lys Leu Asn Met His Val Asn
50 55 60
Ile Gln Thr Gly Lys Trp Glu Pro Asp Pro Thr Gly Thr Lys Ser Cys
65 70 75 80
Phe Glu Thr Lys Glu Glu Val Leu Gln Tyr Cys Gln Glu Met Tyr Pro
85 90 95
Glu Leu Gln Ile Thr Asn Val Met Glu Ala Asn Gln Arg Val Ser Ile
100 105 110
Asp Asn Trp Cys Arg Arg Asp Lys Lys Gln Cys Lys Ser Arg Phe Val
115 120 125
Thr Pro Phe Lys Cys Leu Val Gly Glu Phe Val Ser Asp Val Leu Leu
130 135 140
Val Pro Glu Lys Cys Gln Phe Phe His Lys Glu Arg Met Glu Val Cys
145 150 155 160
Glu Asn His Gln His Trp His Thr Val Val Lys Glu Ala Cys Leu Thr
165 170 175
Gln Gly Met Thr Leu Tyr Ser Tyr Gly Met Leu Leu Pro Cys Gly Val
180 185 190
Asp Gln Phe His Gly Thr Glu Tyr Val Cys Cys Pro Gln Thr Lys Ile
195 200 205
Ile Gly Ser Val Ser Lys Glu Glu Glu Glu Glu Asp Glu Glu Glu Glu
210 215 220
Glu Glu Glu Asp Glu Glu Glu Asp Tyr Asp Val Tyr Lys Ser Glu Phe
225 230 235 240
Pro Thr Glu Ala Asp Leu Glu Asp Phe Thr Glu Ala Ala Val Asp Glu
245 250 255
Asp Asp Glu Asp Glu Glu Glu Gly Glu Glu Val Val Glu Asp Arg Asp
260 265 270
Tyr Tyr Tyr Asp Thr Phe Lys Gly Asp Asp Tyr Asn Glu Glu Asn Pro
275 280 285
Thr Glu Pro Gly Ser Asp Gly Thr Met Ser Asp Lys Glu Ile Thr His
290 295 300
Asp Val Lys Ala Val Cys Ser Gln Glu Ala Met Thr Gly Pro Cys Arg
305 310 315 320
Ala Val Met Pro Arg Trp Tyr Phe Asp Leu Ser Lys Gly Lys Cys Val
325 330 335
Arg Phe Ile Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Glu Ser
340 345 350
Glu Asp Tyr Cys Met Ala Val Cys Lys Ala Met Ile Pro Pro Thr Pro
355 360 365
Leu Pro Thr Asn Asp Val Asp Val Tyr Phe Glu Thr Ser Ala Asp Asp
370 375 380
Asn Glu His Ala Arg Phe Gln Lys Ala Lys Glu Gln Leu Glu Ile Arg
385 390 395 400
His Arg Asn Arg Met Asp Arg Val Lys Lys Glu Trp Glu Glu Ala Glu
405 410 415
Leu Gln Ala Lys Asn Leu Pro Lys Ala Glu Arg Gln Thr Leu Ile Gln
420 425 430
His Phe Gln Ala Met Val Lys Ala Leu Glu Lys Glu Ala Ala Ser Glu
435 440 445
Lys Gln Gln Leu Val Glu Thr His Leu Ala Arg Val Glu Ala Met Leu
450 455 460
Asn Asp Arg Arg Arg Met Ala Leu Glu Asn Tyr Leu Ala Ala Leu Gln
465 470 475 480
Ser Asp Pro Pro Arg Pro His Arg Ile Leu Gln Ala Leu Arg Arg Tyr
485 490 495
Val Arg Ala Glu Asn Lys Asp Arg Leu His Thr Ile Arg His Tyr Gln
500 505 510
His Val Leu Ala Val Asp Pro Glu Lys Ala Ala Gln Met Lys Ser Gln
515 520 525
<210> 54
<211> 7
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of APLP2 protein fragment
<400> 54
Ala Ala Gln Met Lys Ser Gln
1 5
<210> 55
<211> 735
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of TNFSF11 gene (NM_033012)
<400> 55
atggatccta atagaatatc agaagatggc actcactgca tttatagaat tttgagactc 60
catgaaaatg cagattttca agacacaact ctggagagtc aagatacaaa attaatacct 120
gattcatgta ggagaattaa acaggccttt caaggagctg tgcaaaagga attacaacat 180
atcgttggat cacagcacat cagagcagag aaagcgatgg tggatggctc atggttagat 240
ctggccaaga ggagcaagct tgaagctcag ccttttgctc atctcactat taatgccacc 300
gacatcccat ctggttccca taaagtgagt ctgtcctctt ggtaccatga tcggggttgg 360
gccaagatct ccaacatgac ttttagcaat ggaaaactaa tagttaatca ggatggcttt 420
tattacctgt atgccaacat ttgctttcga catcatgaaa cttcaggaga cctagctaca 480
gagtatcttc aactaatggt gtacgtcact aaaaccagca tcaaaatccc aagttctcat 540
accctgatga aaggaggaag caccaagtat tggtcaggga attctgaatt ccatttttat 600
tccataaacg ttggtggatt ttttaagtta cggtctggag aggaaatcag catcgaggtc 660
tccaacccct ccttactgga tccggatcag gatgcaacat actttggggc ttttaaagtt 720
cgagatatag attga 735
<210> 56
<211> 567
<212> DNA
<213> Artificial Sequence
<220>
<223> TNFSF11 gene fragment
<400> 56
gaattacaac atatcgttgg atcacagcac atcagagcag agaaagcgat ggtggatggc 60
tcatggttag atctggccaa gaggagcaag cttgaagctc agccttttgc tcatctcact 120
attaatgcca ccgacatccc atctggttcc cataaagtga gtctgtcctc ttggtaccat 180
gatcggggtt gggccaagat ctccaacatg acttttagca atggaaaact aatagttaat 240
caggatggct tttattacct gtatgccaac atttgctttc gacatcatga aacttcagga 300
gacctagcta cagagtatct tcaactaatg gtgtacgtca ctaaaaccag catcaaaatc 360
ccaagttctc ataccctgat gaaaggagga agcaccaagt attggtcagg gaattctgaa 420
ttccattttt attccataaa cgttggtgga ttttttaagt tacggtctgg agaggaaatc 480
agcatcgagg tctccaaccc ctccttactg gatccggatc aggatgcaac atactttggg 540
gcttttaaag ttcgagatat agattga 567
<210> 57
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of TNFSF11 gene fragment
<400> 57
gaattacaac atatcgttgg a 21
<210> 58
<211> 244
<212> PRT
<213> Artificial Sequence
<220>
<223> TNFSF11 protein
<400> 58
Met Asp Pro Asn Arg Ile Ser Glu Asp Gly Thr His Cys Ile Tyr Arg
1 5 10 15
Ile Leu Arg Leu His Glu Asn Ala Asp Phe Gln Asp Thr Thr Leu Glu
20 25 30
Ser Gln Asp Thr Lys Leu Ile Pro Asp Ser Cys Arg Arg Ile Lys Gln
35 40 45
Ala Phe Gln Gly Ala Val Gln Lys Glu Leu Gln His Ile Val Gly Ser
50 55 60
Gln His Ile Arg Ala Glu Lys Ala Met Val Asp Gly Ser Trp Leu Asp
65 70 75 80
Leu Ala Lys Arg Ser Lys Leu Glu Ala Gln Pro Phe Ala His Leu Thr
85 90 95
Ile Asn Ala Thr Asp Ile Pro Ser Gly Ser His Lys Val Ser Leu Ser
100 105 110
Ser Trp Tyr His Asp Arg Gly Trp Ala Lys Ile Ser Asn Met Thr Phe
115 120 125
Ser Asn Gly Lys Leu Ile Val Asn Gln Asp Gly Phe Tyr Tyr Leu Tyr
130 135 140
Ala Asn Ile Cys Phe Arg His His Glu Thr Ser Gly Asp Leu Ala Thr
145 150 155 160
Glu Tyr Leu Gln Leu Met Val Tyr Val Thr Lys Thr Ser Ile Lys Ile
165 170 175
Pro Ser Ser His Thr Leu Met Lys Gly Gly Ser Thr Lys Tyr Trp Ser
180 185 190
Gly Asn Ser Glu Phe His Phe Tyr Ser Ile Asn Val Gly Gly Phe Phe
195 200 205
Lys Leu Arg Ser Gly Glu Glu Ile Ser Ile Glu Val Ser Asn Pro Ser
210 215 220
Leu Leu Asp Pro Asp Gln Asp Ala Thr Tyr Phe Gly Ala Phe Lys Val
225 230 235 240
Arg Asp Ile Asp
<210> 59
<211> 188
<212> PRT
<213> Artificial Sequence
<220>
<223> TNFSF11 protein fragment
<400> 59
Glu Leu Gln His Ile Val Gly Ser Gln His Ile Arg Ala Glu Lys Ala
1 5 10 15
Met Val Asp Gly Ser Trp Leu Asp Leu Ala Lys Arg Ser Lys Leu Glu
20 25 30
Ala Gln Pro Phe Ala His Leu Thr Ile Asn Ala Thr Asp Ile Pro Ser
35 40 45
Gly Ser His Lys Val Ser Leu Ser Ser Trp Tyr His Asp Arg Gly Trp
50 55 60
Ala Lys Ile Ser Asn Met Thr Phe Ser Asn Gly Lys Leu Ile Val Asn
65 70 75 80
Gln Asp Gly Phe Tyr Tyr Leu Tyr Ala Asn Ile Cys Phe Arg His His
85 90 95
Glu Thr Ser Gly Asp Leu Ala Thr Glu Tyr Leu Gln Leu Met Val Tyr
100 105 110
Val Thr Lys Thr Ser Ile Lys Ile Pro Ser Ser His Thr Leu Met Lys
115 120 125
Gly Gly Ser Thr Lys Tyr Trp Ser Gly Asn Ser Glu Phe His Phe Tyr
130 135 140
Ser Ile Asn Val Gly Gly Phe Phe Lys Leu Arg Ser Gly Glu Glu Ile
145 150 155 160
Ser Ile Glu Val Ser Asn Pro Ser Leu Leu Asp Pro Asp Gln Asp Ala
165 170 175
Thr Tyr Phe Gly Ala Phe Lys Val Arg Asp Ile Asp
180 185
<210> 60
<211> 7
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of TNFSF11 protein fragment
<400> 60
Glu Leu Gln His Ile Val Gly
1 5
<210> 61
<211> 2151
<212> DNA
<213> Artificial Sequence
<220>
<223> APLP2-TNFSF11 fusion gene
<400> 61
atggcggcca ccgggaccgc ggccgccgca gccacgggca ggctcctgct tctgctgctg 60
gtggggctca cggcgcctgc cttggcgctg gccggctaca tcgaggctct tgcagccaat 120
gccggaacag gatttgctgt tgctgagcct caaatcgcaa tgttttgtgg gaagttaaat 180
atgcatgtga acattcagac tgggaaatgg gaacctgatc caacaggcac caagagctgc 240
tttgaaacaa aagaagaagt tcttcagtac tgtcaggaga tgtatccaga gctacagatc 300
acaaatgtga tggaggcaaa ccagcgggtt agtattgaca actggtgccg gagggacaaa 360
aagcaatgca agagtcgctt tgttacacct ttcaagtgtc tcgtgggtga atttgtaagt 420
gatgtcctgc tagttccaga aaagtgccag tttttccaca aagagcggat ggaggtgtgt 480
gagaatcacc agcactggca cacggtagtc aaagaggcat gtctgactca gggaatgacc 540
ttatatagct acggcatgct gctcccatgt ggggtagacc agttccatgg cactgaatat 600
gtgtgctgcc ctcagacaaa gattattgga tctgtgtcaa aagaagagga agaggaagat 660
gaagaggaag aggaagagga agatgaagag gaagactatg atgtttataa aagtgaattt 720
cctactgaag cagatctgga agacttcaca gaagcagctg tggatgagga tgatgaggat 780
gaggaagaag gggaggaagt ggtggaggac cgagattact actatgacac cttcaaagga 840
gatgactaca atgaggagaa tcctactgaa cccggcagcg acggcaccat gtcagacaag 900
gaaattactc atgatgtcaa agctgtctgc tcccaggagg cgatgacggg gccctgccgg 960
gccgtgatgc ctcgttggta cttcgacctc tccaagggaa agtgcgtgcg ctttatatat 1020
ggtggctgcg gcggcaacag gaacaatttt gagtctgagg attattgtat ggctgtgtgt 1080
aaagcgatga ttcctccaac tcctctgcca accaatgatg ttgatgtgta tttcgagacc 1140
tctgcagatg ataatgagca tgctcgcttc cagaaggcta aggagcagct ggagattcgg 1200
caccgcaacc gaatggacag ggtaaagaag gaatgggaag aggcagagct tcaagctaag 1260
aacctcccca aagcagagag gcagactctg attcagcact tccaagccat ggttaaagct 1320
ttagagaagg aagcagccag tgagaagcag cagctggtgg agacccacct ggcccgagtg 1380
gaagctatgc tgaatgaccg ccgtcggatg gctctggaga actacctggc tgccttgcag 1440
tctgacccgc cacggcctca tcgcattctc caggccttac ggcgttatgt ccgtgctgag 1500
aacaaagatc gcttacatac catccgtcat taccagcatg tgttggctgt tgacccagaa 1560
aaggcggccc agatgaaatc ccaggaatta caacatatcg ttggatcaca gcacatcaga 1620
gcagagaaag cgatggtgga tggctcatgg ttagatctgg ccaagaggag caagcttgaa 1680
gctcagcctt ttgctcatct cactattaat gccaccgaca tcccatctgg ttcccataaa 1740
gtgagtctgt cctcttggta ccatgatcgg ggttgggcca agatctccaa catgactttt 1800
agcaatggaa aactaatagt taatcaggat ggcttttatt acctgtatgc caacatttgc 1860
tttcgacatc atgaaacttc aggagaccta gctacagagt atcttcaact aatggtgtac 1920
gtcactaaaa ccagcatcaa aatcccaagt tctcataccc tgatgaaagg aggaagcacc 1980
aagtattggt cagggaattc tgaattccat ttttattcca taaacgttgg tggatttttt 2040
aagttacggt ctggagagga aatcagcatc gaggtctcca acccctcctt actggatccg 2100
gatcaggatg caacatactt tggggctttt aaagttcgag atatagattg a 2151
<210> 62
<211> 42
<212> DNA
<213> Artificial Sequence
<220>
<223> Fused region of APLP2-TNFSF11 fusion gene
<400> 62
gcggcccaga tgaaatccca ggaattacaa catatcgttg ga 42
<210> 63
<211> 716
<212> PRT
<213> Artificial Sequence
<220>
<223> APLP2-TNFSF11 fusion protein
<400> 63
Met Ala Ala Thr Gly Thr Ala Ala Ala Ala Ala Thr Gly Arg Leu Leu
1 5 10 15
Leu Leu Leu Leu Val Gly Leu Thr Ala Pro Ala Leu Ala Leu Ala Gly
20 25 30
Tyr Ile Glu Ala Leu Ala Ala Asn Ala Gly Thr Gly Phe Ala Val Ala
35 40 45
Glu Pro Gln Ile Ala Met Phe Cys Gly Lys Leu Asn Met His Val Asn
50 55 60
Ile Gln Thr Gly Lys Trp Glu Pro Asp Pro Thr Gly Thr Lys Ser Cys
65 70 75 80
Phe Glu Thr Lys Glu Glu Val Leu Gln Tyr Cys Gln Glu Met Tyr Pro
85 90 95
Glu Leu Gln Ile Thr Asn Val Met Glu Ala Asn Gln Arg Val Ser Ile
100 105 110
Asp Asn Trp Cys Arg Arg Asp Lys Lys Gln Cys Lys Ser Arg Phe Val
115 120 125
Thr Pro Phe Lys Cys Leu Val Gly Glu Phe Val Ser Asp Val Leu Leu
130 135 140
Val Pro Glu Lys Cys Gln Phe Phe His Lys Glu Arg Met Glu Val Cys
145 150 155 160
Glu Asn His Gln His Trp His Thr Val Val Lys Glu Ala Cys Leu Thr
165 170 175
Gln Gly Met Thr Leu Tyr Ser Tyr Gly Met Leu Leu Pro Cys Gly Val
180 185 190
Asp Gln Phe His Gly Thr Glu Tyr Val Cys Cys Pro Gln Thr Lys Ile
195 200 205
Ile Gly Ser Val Ser Lys Glu Glu Glu Glu Glu Asp Glu Glu Glu Glu
210 215 220
Glu Glu Glu Asp Glu Glu Glu Asp Tyr Asp Val Tyr Lys Ser Glu Phe
225 230 235 240
Pro Thr Glu Ala Asp Leu Glu Asp Phe Thr Glu Ala Ala Val Asp Glu
245 250 255
Asp Asp Glu Asp Glu Glu Glu Gly Glu Glu Val Val Glu Asp Arg Asp
260 265 270
Tyr Tyr Tyr Asp Thr Phe Lys Gly Asp Asp Tyr Asn Glu Glu Asn Pro
275 280 285
Thr Glu Pro Gly Ser Asp Gly Thr Met Ser Asp Lys Glu Ile Thr His
290 295 300
Asp Val Lys Ala Val Cys Ser Gln Glu Ala Met Thr Gly Pro Cys Arg
305 310 315 320
Ala Val Met Pro Arg Trp Tyr Phe Asp Leu Ser Lys Gly Lys Cys Val
325 330 335
Arg Phe Ile Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Glu Ser
340 345 350
Glu Asp Tyr Cys Met Ala Val Cys Lys Ala Met Ile Pro Pro Thr Pro
355 360 365
Leu Pro Thr Asn Asp Val Asp Val Tyr Phe Glu Thr Ser Ala Asp Asp
370 375 380
Asn Glu His Ala Arg Phe Gln Lys Ala Lys Glu Gln Leu Glu Ile Arg
385 390 395 400
His Arg Asn Arg Met Asp Arg Val Lys Lys Glu Trp Glu Glu Ala Glu
405 410 415
Leu Gln Ala Lys Asn Leu Pro Lys Ala Glu Arg Gln Thr Leu Ile Gln
420 425 430
His Phe Gln Ala Met Val Lys Ala Leu Glu Lys Glu Ala Ala Ser Glu
435 440 445
Lys Gln Gln Leu Val Glu Thr His Leu Ala Arg Val Glu Ala Met Leu
450 455 460
Asn Asp Arg Arg Arg Met Ala Leu Glu Asn Tyr Leu Ala Ala Leu Gln
465 470 475 480
Ser Asp Pro Pro Arg Pro His Arg Ile Leu Gln Ala Leu Arg Arg Tyr
485 490 495
Val Arg Ala Glu Asn Lys Asp Arg Leu His Thr Ile Arg His Tyr Gln
500 505 510
His Val Leu Ala Val Asp Pro Glu Lys Ala Ala Gln Met Lys Ser Gln
515 520 525
Glu Leu Gln His Ile Val Gly Ser Gln His Ile Arg Ala Glu Lys Ala
530 535 540
Met Val Asp Gly Ser Trp Leu Asp Leu Ala Lys Arg Ser Lys Leu Glu
545 550 555 560
Ala Gln Pro Phe Ala His Leu Thr Ile Asn Ala Thr Asp Ile Pro Ser
565 570 575
Gly Ser His Lys Val Ser Leu Ser Ser Trp Tyr His Asp Arg Gly Trp
580 585 590
Ala Lys Ile Ser Asn Met Thr Phe Ser Asn Gly Lys Leu Ile Val Asn
595 600 605
Gln Asp Gly Phe Tyr Tyr Leu Tyr Ala Asn Ile Cys Phe Arg His His
610 615 620
Glu Thr Ser Gly Asp Leu Ala Thr Glu Tyr Leu Gln Leu Met Val Tyr
625 630 635 640
Val Thr Lys Thr Ser Ile Lys Ile Pro Ser Ser His Thr Leu Met Lys
645 650 655
Gly Gly Ser Thr Lys Tyr Trp Ser Gly Asn Ser Glu Phe His Phe Tyr
660 665 670
Ser Ile Asn Val Gly Gly Phe Phe Lys Leu Arg Ser Gly Glu Glu Ile
675 680 685
Ser Ile Glu Val Ser Asn Pro Ser Leu Leu Asp Pro Asp Gln Asp Ala
690 695 700
Thr Tyr Phe Gly Ala Phe Lys Val Arg Asp Ile Asp
705 710 715
<210> 64
<211> 14
<212> PRT
<213> Artificial Sequence
<220>
<223> Fused region of APLP2-TNFSF11 fusion protein
<400> 64
Ala Ala Gln Met Lys Ser Gln Glu Leu Gln His Ile Val Gly
1 5 10
<210> 65
<211> 2685
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of MAP4K3 gene (NM_003618)
<400> 65
atgaaccccg gcttcgattt gtcccgccgg aacccgcagg aggacttcga gctgattcag 60
cgcatcggca gcggcaccta cggcgacgtc tacaaggcac ggaatgttaa cactggtgaa 120
ttagcagcaa ttaaagtaat aaaattggaa ccaggagaag actttgcagt tgtgcagcaa 180
gaaattatta tgatgaaaga ctgtaaacac ccaaatattg ttgcttattt tggaagctat 240
ctcaggcgag ataagctttg gatttgcatg gagttttgtg gaggtggttc tttacaggat 300
atttatcacg taactggacc tctgtcagaa ctgcaaattg catatgttag cagagaaaca 360
ctgcagggat tatattatct tcacagtaaa ggaaaaatgc acagagatat aaagggagct 420
aacattctat taacggataa tggtcatgtg aaattggctg attttggagt atctgcacag 480
ataacagcta caattgccaa acggaagtct ttcattggca caccatattg gatggctcca 540
gaagttgcag ctgttgagag gaaggggggt tacaatcaac tctgtgatct ctgggcagtg 600
ggaatcactg ccatagaact tgcagagctt cagcctccta tgtttgactt acacccaatg 660
agagcattat ttctaatgac aaaaagcaat tttcagcctc ctaaactaaa ggataaaatg 720
aaatggtcaa atagttttca tcactttgtg aaaatggcac ttaccaaaaa tccgaaaaaa 780
agacctactg ctgaaaaatt attacagcat ccttttgtaa cacaacattt gacacggtct 840
ttggcaatcg agctgttgga taaagtaaat aatccagatc attccactta ccatgatttc 900
gatgatgatg atcctgagcc tcttgttgct gtaccacata gaattcactc aacaagtaga 960
aacgtgagag aagaaaaaac acgctcagag ataacctttg gccaagtgaa atttgatcca 1020
cccttaagaa aggagacaga accacatcat gaacttcccg acagtgatgg ttttttggac 1080
agttcagaag aaatatacta cactgcaaga tctaatctgg atctgcaact ggaatatgga 1140
caaggacacc aaggtggtta ctttttaggt gcaaacaaga gtcttctcaa gtctgttgaa 1200
gaagaattgc atcagcgagg acacgtcgca catttagaag atgatgaagg agatgatgat 1260
gaatctaaac actcaactct gaaagcaaaa attccacctc ctttgccacc aaagcctaag 1320
tctatcttca taccacagga aatgcattct actgaggatg aaaatcaagg aacaatcaag 1380
agatgtccca tgtcagggag cccagcaaag ccatcccaag ttccacctag accaccacct 1440
cccagattac ccccacacaa acctgttgcc ttaggaaatg gaatgagctc cttccagtta 1500
aatggtgaac gagatggctc attatgtcaa caacagaatg aacatagagg cacaaacctt 1560
tcaagaaaag aaaagaaaga tgtaccaaag cctattagta atggtcttcc tccaacacct 1620
aaagtgcata tgggtgcatg tttttcaaaa gtttttaatg ggtgtccctt gaaaattcac 1680
tgtgcatcat catggataaa cccagataca agagatcagt acttgatatt tggtgccgaa 1740
gaagggattt ataccctcaa tcttaatgaa cttcatgaaa catcaatgga acagctattc 1800
cctcgaaggt gtacatggtt gtatgtaatg aacaattgct tgctatcaat atctggtaaa 1860
gcttctcagc tttattccca taatttacca gggctttttg attatgcaag acaaatgcaa 1920
aagttacctg ttgctattcc agcacacaaa ctccctgaca gaatactgcc aaggaaattt 1980
tctgtatcag caaaaatccc tgaaaccaaa tggtgccaga agtgttgtgt tgtaagaaat 2040
ccttacacgg gccataaata cctatgtgga gcacttcaga ctagcattgt tctattagaa 2100
tgggttgaac caatgcagaa atttatgtta attaagcaca tagattttcc tataccatgt 2160
ccacttagaa tgtttgaaat gctggtagtt cctgaacagg agtacccttt agtttgtgtt 2220
ggtgtcagta gaggtagaga cttcaaccaa gtggttcgat ttgagacggt caatccaaat 2280
tctacctctt catggtttac agaatcagat accccacaga caaatgttac tcatgtaacc 2340
caactggaga gagataccat ccttgtatgc ttggactgtt gtataaaaat agtaaatctc 2400
caaggaagat taaaatctag caggaaattg tcatcagaac tcacctttga tttccagatt 2460
gaatcaatag tgtgcctaca agacagtgtg ctagctttct ggaaacatgg aatgcaaggt 2520
agaagtttta gatctaatga ggtaacacaa gaaatttcag atagcacaag aattttcagg 2580
ctgcttggat ctgacagggt cgtggttttg gaaagtaggc caactgataa ccccacagca 2640
aatagcaatt tgtacatcct ggcgggtcat gaaaacagtt actga 2685
<210> 66
<211> 96
<212> DNA
<213> Artificial Sequence
<220>
<223> MAP4K3 gene fragment
<400> 66
atgaaccccg gcttcgattt gtcccgccgg aacccgcagg aggacttcga gctgattcag 60
cgcatcggca gcggcaccta cggcgacgtc tacaag 96
<210> 67
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of MAP4K3 gene fragment
<400> 67
acctacggcg acgtctacaa g 21
<210> 68
<211> 894
<212> PRT
<213> Artificial Sequence
<220>
<223> MAP4K3 protein
<400> 68
Met Asn Pro Gly Phe Asp Leu Ser Arg Arg Asn Pro Gln Glu Asp Phe
1 5 10 15
Glu Leu Ile Gln Arg Ile Gly Ser Gly Thr Tyr Gly Asp Val Tyr Lys
20 25 30
Ala Arg Asn Val Asn Thr Gly Glu Leu Ala Ala Ile Lys Val Ile Lys
35 40 45
Leu Glu Pro Gly Glu Asp Phe Ala Val Val Gln Gln Glu Ile Ile Met
50 55 60
Met Lys Asp Cys Lys His Pro Asn Ile Val Ala Tyr Phe Gly Ser Tyr
65 70 75 80
Leu Arg Arg Asp Lys Leu Trp Ile Cys Met Glu Phe Cys Gly Gly Gly
85 90 95
Ser Leu Gln Asp Ile Tyr His Val Thr Gly Pro Leu Ser Glu Leu Gln
100 105 110
Ile Ala Tyr Val Ser Arg Glu Thr Leu Gln Gly Leu Tyr Tyr Leu His
115 120 125
Ser Lys Gly Lys Met His Arg Asp Ile Lys Gly Ala Asn Ile Leu Leu
130 135 140
Thr Asp Asn Gly His Val Lys Leu Ala Asp Phe Gly Val Ser Ala Gln
145 150 155 160
Ile Thr Ala Thr Ile Ala Lys Arg Lys Ser Phe Ile Gly Thr Pro Tyr
165 170 175
Trp Met Ala Pro Glu Val Ala Ala Val Glu Arg Lys Gly Gly Tyr Asn
180 185 190
Gln Leu Cys Asp Leu Trp Ala Val Gly Ile Thr Ala Ile Glu Leu Ala
195 200 205
Glu Leu Gln Pro Pro Met Phe Asp Leu His Pro Met Arg Ala Leu Phe
210 215 220
Leu Met Thr Lys Ser Asn Phe Gln Pro Pro Lys Leu Lys Asp Lys Met
225 230 235 240
Lys Trp Ser Asn Ser Phe His His Phe Val Lys Met Ala Leu Thr Lys
245 250 255
Asn Pro Lys Lys Arg Pro Thr Ala Glu Lys Leu Leu Gln His Pro Phe
260 265 270
Val Thr Gln His Leu Thr Arg Ser Leu Ala Ile Glu Leu Leu Asp Lys
275 280 285
Val Asn Asn Pro Asp His Ser Thr Tyr His Asp Phe Asp Asp Asp Asp
290 295 300
Pro Glu Pro Leu Val Ala Val Pro His Arg Ile His Ser Thr Ser Arg
305 310 315 320
Asn Val Arg Glu Glu Lys Thr Arg Ser Glu Ile Thr Phe Gly Gln Val
325 330 335
Lys Phe Asp Pro Pro Leu Arg Lys Glu Thr Glu Pro His His Glu Leu
340 345 350
Pro Asp Ser Asp Gly Phe Leu Asp Ser Ser Glu Glu Ile Tyr Tyr Thr
355 360 365
Ala Arg Ser Asn Leu Asp Leu Gln Leu Glu Tyr Gly Gln Gly His Gln
370 375 380
Gly Gly Tyr Phe Leu Gly Ala Asn Lys Ser Leu Leu Lys Ser Val Glu
385 390 395 400
Glu Glu Leu His Gln Arg Gly His Val Ala His Leu Glu Asp Asp Glu
405 410 415
Gly Asp Asp Asp Glu Ser Lys His Ser Thr Leu Lys Ala Lys Ile Pro
420 425 430
Pro Pro Leu Pro Pro Lys Pro Lys Ser Ile Phe Ile Pro Gln Glu Met
435 440 445
His Ser Thr Glu Asp Glu Asn Gln Gly Thr Ile Lys Arg Cys Pro Met
450 455 460
Ser Gly Ser Pro Ala Lys Pro Ser Gln Val Pro Pro Arg Pro Pro Pro
465 470 475 480
Pro Arg Leu Pro Pro His Lys Pro Val Ala Leu Gly Asn Gly Met Ser
485 490 495
Ser Phe Gln Leu Asn Gly Glu Arg Asp Gly Ser Leu Cys Gln Gln Gln
500 505 510
Asn Glu His Arg Gly Thr Asn Leu Ser Arg Lys Glu Lys Lys Asp Val
515 520 525
Pro Lys Pro Ile Ser Asn Gly Leu Pro Pro Thr Pro Lys Val His Met
530 535 540
Gly Ala Cys Phe Ser Lys Val Phe Asn Gly Cys Pro Leu Lys Ile His
545 550 555 560
Cys Ala Ser Ser Trp Ile Asn Pro Asp Thr Arg Asp Gln Tyr Leu Ile
565 570 575
Phe Gly Ala Glu Glu Gly Ile Tyr Thr Leu Asn Leu Asn Glu Leu His
580 585 590
Glu Thr Ser Met Glu Gln Leu Phe Pro Arg Arg Cys Thr Trp Leu Tyr
595 600 605
Val Met Asn Asn Cys Leu Leu Ser Ile Ser Gly Lys Ala Ser Gln Leu
610 615 620
Tyr Ser His Asn Leu Pro Gly Leu Phe Asp Tyr Ala Arg Gln Met Gln
625 630 635 640
Lys Leu Pro Val Ala Ile Pro Ala His Lys Leu Pro Asp Arg Ile Leu
645 650 655
Pro Arg Lys Phe Ser Val Ser Ala Lys Ile Pro Glu Thr Lys Trp Cys
660 665 670
Gln Lys Cys Cys Val Val Arg Asn Pro Tyr Thr Gly His Lys Tyr Leu
675 680 685
Cys Gly Ala Leu Gln Thr Ser Ile Val Leu Leu Glu Trp Val Glu Pro
690 695 700
Met Gln Lys Phe Met Leu Ile Lys His Ile Asp Phe Pro Ile Pro Cys
705 710 715 720
Pro Leu Arg Met Phe Glu Met Leu Val Val Pro Glu Gln Glu Tyr Pro
725 730 735
Leu Val Cys Val Gly Val Ser Arg Gly Arg Asp Phe Asn Gln Val Val
740 745 750
Arg Phe Glu Thr Val Asn Pro Asn Ser Thr Ser Ser Trp Phe Thr Glu
755 760 765
Ser Asp Thr Pro Gln Thr Asn Val Thr His Val Thr Gln Leu Glu Arg
770 775 780
Asp Thr Ile Leu Val Cys Leu Asp Cys Cys Ile Lys Ile Val Asn Leu
785 790 795 800
Gln Gly Arg Leu Lys Ser Ser Arg Lys Leu Ser Ser Glu Leu Thr Phe
805 810 815
Asp Phe Gln Ile Glu Ser Ile Val Cys Leu Gln Asp Ser Val Leu Ala
820 825 830
Phe Trp Lys His Gly Met Gln Gly Arg Ser Phe Arg Ser Asn Glu Val
835 840 845
Thr Gln Glu Ile Ser Asp Ser Thr Arg Ile Phe Arg Leu Leu Gly Ser
850 855 860
Asp Arg Val Val Val Leu Glu Ser Arg Pro Thr Asp Asn Pro Thr Ala
865 870 875 880
Asn Ser Asn Leu Tyr Ile Leu Ala Gly His Glu Asn Ser Tyr
885 890
<210> 69
<211> 32
<212> PRT
<213> Artificial Sequence
<220>
<223> MAP4K3 protein fragment
<400> 69
Met Asn Pro Gly Phe Asp Leu Ser Arg Arg Asn Pro Gln Glu Asp Phe
1 5 10 15
Glu Leu Ile Gln Arg Ile Gly Ser Gly Thr Tyr Gly Asp Val Tyr Lys
20 25 30
<210> 70
<211> 7
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of MAP4K3 protein fragment
<400> 70
Thr Tyr Gly Asp Val Tyr Lys
1 5
<210> 71
<211> 2214
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of PRKCE gene (NM_005400)
<400> 71
atggtagtgt tcaatggcct tcttaagatc aaaatctgcg aggccgtgag cttgaagccc 60
acagcctggt cgctgcgcca tgcggtggga ccccggccgc agactttcct tctcgacccc 120
tacattgccc tcaatgtgga cgactcgcgc atcggccaaa cggccaccaa gcagaagacc 180
aacagcccgg cctggcacga cgagttcgtc accgatgtgt gcaacggacg caagatcgag 240
ctggctgtct ttcacgatgc ccccataggc tacgacgact tcgtggccaa ctgcaccatc 300
cagtttgagg agctgctgca gaacgggagc cgccacttcg aggactggat tgatctggag 360
ccagaaggaa gagtgtatgt gatcatcgat ctctcagggt cgtcgggtga agcccctaaa 420
gacaatgaag agcgtgtgtt cagggaacgc atgcggccga ggaagcggca gggggccgtc 480
aggcgcaggg tccatcaggt caacggccac aagttcatgg ccacctatct tcggcagccc 540
acctactgct cccattgcag agacttcatc tggggtgtca taggaaagca gggataccag 600
tgtcaagtct gcacctgcgt ggtccacaag cggtgccacg agctcataat cacaaagtgt 660
gctgggttaa agaagcagga gacccccgac caggtgggct cccagcggtt cagcgtcaac 720
atgccccaca agttcggtat ccacaactac aaggtcccta ccttctgcga tcactgtggg 780
tccctgctct ggggactctt gcggcagggt ttgcagtgta aagtctgcaa aatgaatgtt 840
caccgtcgat gtgagaccaa cgtggctccc aactgtggag tggatgccag aggaatcgcc 900
aaagtactgg ccgacctggg cgttacccca gacaaaatca ccaacagcgg ccagagaagg 960
aaaaagctca ttgctggtgc cgagtccccg cagcctgctt ctggaagctc accatctgag 1020
gaagatcgat ccaagtcagc acccacctcc ccttgtgacc aggaaataaa agaacttgag 1080
aacaacattc ggaaagcctt gtcatttgac aaccgaggag aggagcaccg ggcagcatcg 1140
tctcctgatg gccagctgat gagccccggt gagaatggcg aagtccggca aggccaggcc 1200
aagcgcctgg gcctggatga gttcaacttc atcaaggtgt tgggcaaagg cagctttggc 1260
aaggtcatgt tggcagaact caagggcaaa gatgaagtat atgctgtgaa ggtcttaaag 1320
aaggacgtca tccttcagga tgatgacgtg gactgcacaa tgacagagaa gaggattttg 1380
gctctggcac ggaaacaccc gtaccttacc caactctact gctgcttcca gaccaaggac 1440
cgcctctttt tcgtcatgga atatgtaaat ggtggagacc tcatgtttca gattcagcgc 1500
tcccgaaaat tcgacgagcc tcgttcacgg ttctatgctg cagaggtcac atcggccctc 1560
atgttcctcc accagcatgg agtcatctac agggatttga aactggacaa catccttctg 1620
gatgcagaag gtcactgcaa gctggctgac ttcgggatgt gcaaggaagg gattctgaat 1680
ggtgtgacga ccaccacgtt ctgtgggact cctgactaca tagctcctga gatcctgcag 1740
gagttggagt atggcccctc cgtggactgg tgggccctgg gggtgctgat gtacgagatg 1800
atggctggac agcctccctt tgaggccgac aatgaggacg acctatttga gtccatcctc 1860
catgacgacg tgctgtaccc agtctggctc agcaaggagg ctgtcagcat cttgaaagct 1920
ttcatgacga agaatcccca caagcgcctg ggctgtgtgg catcgcagaa tggcgaggac 1980
gccatcaagc agcacccatt cttcaaagag attgactggg tgctcctgga gcagaagaag 2040
atcaagccac ccttcaaacc acgcattaaa accaaaagag acgtcaataa ttttgaccaa 2100
gactttaccc gggaagagcc ggtactcacc cttgtggacg aagcaattgt aaagcagatc 2160
aaccaggagg aattcaaagg tttctcctac tttggtgaag acctgatgcc ctga 2214
<210> 72
<211> 1866
<212> DNA
<213> Artificial Sequence
<220>
<223> PRKCE gene fragment
<400> 72
attgatctgg agccagaagg aagagtgtat gtgatcatcg atctctcagg gtcgtcgggt 60
gaagccccta aagacaatga agagcgtgtg ttcagggaac gcatgcggcc gaggaagcgg 120
cagggggccg tcaggcgcag ggtccatcag gtcaacggcc acaagttcat ggccacctat 180
cttcggcagc ccacctactg ctcccattgc agagacttca tctggggtgt cataggaaag 240
cagggatacc agtgtcaagt ctgcacctgc gtggtccaca agcggtgcca cgagctcata 300
atcacaaagt gtgctgggtt aaagaagcag gagacccccg accaggtggg ctcccagcgg 360
ttcagcgtca acatgcccca caagttcggt atccacaact acaaggtccc taccttctgc 420
gatcactgtg ggtccctgct ctggggactc ttgcggcagg gtttgcagtg taaagtctgc 480
aaaatgaatg ttcaccgtcg atgtgagacc aacgtggctc ccaactgtgg agtggatgcc 540
agaggaatcg ccaaagtact ggccgacctg ggcgttaccc cagacaaaat caccaacagc 600
ggccagagaa ggaaaaagct cattgctggt gccgagtccc cgcagcctgc ttctggaagc 660
tcaccatctg aggaagatcg atccaagtca gcacccacct ccccttgtga ccaggaaata 720
aaagaacttg agaacaacat tcggaaagcc ttgtcatttg acaaccgagg agaggagcac 780
cgggcagcat cgtctcctga tggccagctg atgagccccg gtgagaatgg cgaagtccgg 840
caaggccagg ccaagcgcct gggcctggat gagttcaact tcatcaaggt gttgggcaaa 900
ggcagctttg gcaaggtcat gttggcagaa ctcaagggca aagatgaagt atatgctgtg 960
aaggtcttaa agaaggacgt catccttcag gatgatgacg tggactgcac aatgacagag 1020
aagaggattt tggctctggc acggaaacac ccgtacctta cccaactcta ctgctgcttc 1080
cagaccaagg accgcctctt tttcgtcatg gaatatgtaa atggtggaga cctcatgttt 1140
cagattcagc gctcccgaaa attcgacgag cctcgttcac ggttctatgc tgcagaggtc 1200
acatcggccc tcatgttcct ccaccagcat ggagtcatct acagggattt gaaactggac 1260
aacatccttc tggatgcaga aggtcactgc aagctggctg acttcgggat gtgcaaggaa 1320
gggattctga atggtgtgac gaccaccacg ttctgtggga ctcctgacta catagctcct 1380
gagatcctgc aggagttgga gtatggcccc tccgtggact ggtgggccct gggggtgctg 1440
atgtacgaga tgatggctgg acagcctccc tttgaggccg acaatgagga cgacctattt 1500
gagtccatcc tccatgacga cgtgctgtac ccagtctggc tcagcaagga ggctgtcagc 1560
atcttgaaag ctttcatgac gaagaatccc cacaagcgcc tgggctgtgt ggcatcgcag 1620
aatggcgagg acgccatcaa gcagcaccca ttcttcaaag agattgactg ggtgctcctg 1680
gagcagaaga agatcaagcc acccttcaaa ccacgcatta aaaccaaaag agacgtcaat 1740
aattttgacc aagactttac ccgggaagag ccggtactca cccttgtgga cgaagcaatt 1800
gtaaagcaga tcaaccagga ggaattcaaa ggtttctcct actttggtga agacctgatg 1860
ccctga 1866
<210> 73
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of PRKCE gene fragment
<400> 73
attgatctgg agccagaagg aaga 24
<210> 74
<211> 737
<212> PRT
<213> Artificial Sequence
<220>
<223> PRKCE protein
<400> 74
Met Val Val Phe Asn Gly Leu Leu Lys Ile Lys Ile Cys Glu Ala Val
1 5 10 15
Ser Leu Lys Pro Thr Ala Trp Ser Leu Arg His Ala Val Gly Pro Arg
20 25 30
Pro Gln Thr Phe Leu Leu Asp Pro Tyr Ile Ala Leu Asn Val Asp Asp
35 40 45
Ser Arg Ile Gly Gln Thr Ala Thr Lys Gln Lys Thr Asn Ser Pro Ala
50 55 60
Trp His Asp Glu Phe Val Thr Asp Val Cys Asn Gly Arg Lys Ile Glu
65 70 75 80
Leu Ala Val Phe His Asp Ala Pro Ile Gly Tyr Asp Asp Phe Val Ala
85 90 95
Asn Cys Thr Ile Gln Phe Glu Glu Leu Leu Gln Asn Gly Ser Arg His
100 105 110
Phe Glu Asp Trp Ile Asp Leu Glu Pro Glu Gly Arg Val Tyr Val Ile
115 120 125
Ile Asp Leu Ser Gly Ser Ser Gly Glu Ala Pro Lys Asp Asn Glu Glu
130 135 140
Arg Val Phe Arg Glu Arg Met Arg Pro Arg Lys Arg Gln Gly Ala Val
145 150 155 160
Arg Arg Arg Val His Gln Val Asn Gly His Lys Phe Met Ala Thr Tyr
165 170 175
Leu Arg Gln Pro Thr Tyr Cys Ser His Cys Arg Asp Phe Ile Trp Gly
180 185 190
Val Ile Gly Lys Gln Gly Tyr Gln Cys Gln Val Cys Thr Cys Val Val
195 200 205
His Lys Arg Cys His Glu Leu Ile Ile Thr Lys Cys Ala Gly Leu Lys
210 215 220
Lys Gln Glu Thr Pro Asp Gln Val Gly Ser Gln Arg Phe Ser Val Asn
225 230 235 240
Met Pro His Lys Phe Gly Ile His Asn Tyr Lys Val Pro Thr Phe Cys
245 250 255
Asp His Cys Gly Ser Leu Leu Trp Gly Leu Leu Arg Gln Gly Leu Gln
260 265 270
Cys Lys Val Cys Lys Met Asn Val His Arg Arg Cys Glu Thr Asn Val
275 280 285
Ala Pro Asn Cys Gly Val Asp Ala Arg Gly Ile Ala Lys Val Leu Ala
290 295 300
Asp Leu Gly Val Thr Pro Asp Lys Ile Thr Asn Ser Gly Gln Arg Arg
305 310 315 320
Lys Lys Leu Ile Ala Gly Ala Glu Ser Pro Gln Pro Ala Ser Gly Ser
325 330 335
Ser Pro Ser Glu Glu Asp Arg Ser Lys Ser Ala Pro Thr Ser Pro Cys
340 345 350
Asp Gln Glu Ile Lys Glu Leu Glu Asn Asn Ile Arg Lys Ala Leu Ser
355 360 365
Phe Asp Asn Arg Gly Glu Glu His Arg Ala Ala Ser Ser Pro Asp Gly
370 375 380
Gln Leu Met Ser Pro Gly Glu Asn Gly Glu Val Arg Gln Gly Gln Ala
385 390 395 400
Lys Arg Leu Gly Leu Asp Glu Phe Asn Phe Ile Lys Val Leu Gly Lys
405 410 415
Gly Ser Phe Gly Lys Val Met Leu Ala Glu Leu Lys Gly Lys Asp Glu
420 425 430
Val Tyr Ala Val Lys Val Leu Lys Lys Asp Val Ile Leu Gln Asp Asp
435 440 445
Asp Val Asp Cys Thr Met Thr Glu Lys Arg Ile Leu Ala Leu Ala Arg
450 455 460
Lys His Pro Tyr Leu Thr Gln Leu Tyr Cys Cys Phe Gln Thr Lys Asp
465 470 475 480
Arg Leu Phe Phe Val Met Glu Tyr Val Asn Gly Gly Asp Leu Met Phe
485 490 495
Gln Ile Gln Arg Ser Arg Lys Phe Asp Glu Pro Arg Ser Arg Phe Tyr
500 505 510
Ala Ala Glu Val Thr Ser Ala Leu Met Phe Leu His Gln His Gly Val
515 520 525
Ile Tyr Arg Asp Leu Lys Leu Asp Asn Ile Leu Leu Asp Ala Glu Gly
530 535 540
His Cys Lys Leu Ala Asp Phe Gly Met Cys Lys Glu Gly Ile Leu Asn
545 550 555 560
Gly Val Thr Thr Thr Thr Phe Cys Gly Thr Pro Asp Tyr Ile Ala Pro
565 570 575
Glu Ile Leu Gln Glu Leu Glu Tyr Gly Pro Ser Val Asp Trp Trp Ala
580 585 590
Leu Gly Val Leu Met Tyr Glu Met Met Ala Gly Gln Pro Pro Phe Glu
595 600 605
Ala Asp Asn Glu Asp Asp Leu Phe Glu Ser Ile Leu His Asp Asp Val
610 615 620
Leu Tyr Pro Val Trp Leu Ser Lys Glu Ala Val Ser Ile Leu Lys Ala
625 630 635 640
Phe Met Thr Lys Asn Pro His Lys Arg Leu Gly Cys Val Ala Ser Gln
645 650 655
Asn Gly Glu Asp Ala Ile Lys Gln His Pro Phe Phe Lys Glu Ile Asp
660 665 670
Trp Val Leu Leu Glu Gln Lys Lys Ile Lys Pro Pro Phe Lys Pro Arg
675 680 685
Ile Lys Thr Lys Arg Asp Val Asn Asn Phe Asp Gln Asp Phe Thr Arg
690 695 700
Glu Glu Pro Val Leu Thr Leu Val Asp Glu Ala Ile Val Lys Gln Ile
705 710 715 720
Asn Gln Glu Glu Phe Lys Gly Phe Ser Tyr Phe Gly Glu Asp Leu Met
725 730 735
Pro
<210> 75
<211> 621
<212> PRT
<213> Artificial Sequence
<220>
<223> PRKCE protein fragment
<400> 75
Ile Asp Leu Glu Pro Glu Gly Arg Val Tyr Val Ile Ile Asp Leu Ser
1 5 10 15
Gly Ser Ser Gly Glu Ala Pro Lys Asp Asn Glu Glu Arg Val Phe Arg
20 25 30
Glu Arg Met Arg Pro Arg Lys Arg Gln Gly Ala Val Arg Arg Arg Val
35 40 45
His Gln Val Asn Gly His Lys Phe Met Ala Thr Tyr Leu Arg Gln Pro
50 55 60
Thr Tyr Cys Ser His Cys Arg Asp Phe Ile Trp Gly Val Ile Gly Lys
65 70 75 80
Gln Gly Tyr Gln Cys Gln Val Cys Thr Cys Val Val His Lys Arg Cys
85 90 95
His Glu Leu Ile Ile Thr Lys Cys Ala Gly Leu Lys Lys Gln Glu Thr
100 105 110
Pro Asp Gln Val Gly Ser Gln Arg Phe Ser Val Asn Met Pro His Lys
115 120 125
Phe Gly Ile His Asn Tyr Lys Val Pro Thr Phe Cys Asp His Cys Gly
130 135 140
Ser Leu Leu Trp Gly Leu Leu Arg Gln Gly Leu Gln Cys Lys Val Cys
145 150 155 160
Lys Met Asn Val His Arg Arg Cys Glu Thr Asn Val Ala Pro Asn Cys
165 170 175
Gly Val Asp Ala Arg Gly Ile Ala Lys Val Leu Ala Asp Leu Gly Val
180 185 190
Thr Pro Asp Lys Ile Thr Asn Ser Gly Gln Arg Arg Lys Lys Leu Ile
195 200 205
Ala Gly Ala Glu Ser Pro Gln Pro Ala Ser Gly Ser Ser Pro Ser Glu
210 215 220
Glu Asp Arg Ser Lys Ser Ala Pro Thr Ser Pro Cys Asp Gln Glu Ile
225 230 235 240
Lys Glu Leu Glu Asn Asn Ile Arg Lys Ala Leu Ser Phe Asp Asn Arg
245 250 255
Gly Glu Glu His Arg Ala Ala Ser Ser Pro Asp Gly Gln Leu Met Ser
260 265 270
Pro Gly Glu Asn Gly Glu Val Arg Gln Gly Gln Ala Lys Arg Leu Gly
275 280 285
Leu Asp Glu Phe Asn Phe Ile Lys Val Leu Gly Lys Gly Ser Phe Gly
290 295 300
Lys Val Met Leu Ala Glu Leu Lys Gly Lys Asp Glu Val Tyr Ala Val
305 310 315 320
Lys Val Leu Lys Lys Asp Val Ile Leu Gln Asp Asp Asp Val Asp Cys
325 330 335
Thr Met Thr Glu Lys Arg Ile Leu Ala Leu Ala Arg Lys His Pro Tyr
340 345 350
Leu Thr Gln Leu Tyr Cys Cys Phe Gln Thr Lys Asp Arg Leu Phe Phe
355 360 365
Val Met Glu Tyr Val Asn Gly Gly Asp Leu Met Phe Gln Ile Gln Arg
370 375 380
Ser Arg Lys Phe Asp Glu Pro Arg Ser Arg Phe Tyr Ala Ala Glu Val
385 390 395 400
Thr Ser Ala Leu Met Phe Leu His Gln His Gly Val Ile Tyr Arg Asp
405 410 415
Leu Lys Leu Asp Asn Ile Leu Leu Asp Ala Glu Gly His Cys Lys Leu
420 425 430
Ala Asp Phe Gly Met Cys Lys Glu Gly Ile Leu Asn Gly Val Thr Thr
435 440 445
Thr Thr Phe Cys Gly Thr Pro Asp Tyr Ile Ala Pro Glu Ile Leu Gln
450 455 460
Glu Leu Glu Tyr Gly Pro Ser Val Asp Trp Trp Ala Leu Gly Val Leu
465 470 475 480
Met Tyr Glu Met Met Ala Gly Gln Pro Pro Phe Glu Ala Asp Asn Glu
485 490 495
Asp Asp Leu Phe Glu Ser Ile Leu His Asp Asp Val Leu Tyr Pro Val
500 505 510
Trp Leu Ser Lys Glu Ala Val Ser Ile Leu Lys Ala Phe Met Thr Lys
515 520 525
Asn Pro His Lys Arg Leu Gly Cys Val Ala Ser Gln Asn Gly Glu Asp
530 535 540
Ala Ile Lys Gln His Pro Phe Phe Lys Glu Ile Asp Trp Val Leu Leu
545 550 555 560
Glu Gln Lys Lys Ile Lys Pro Pro Phe Lys Pro Arg Ile Lys Thr Lys
565 570 575
Arg Asp Val Asn Asn Phe Asp Gln Asp Phe Thr Arg Glu Glu Pro Val
580 585 590
Leu Thr Leu Val Asp Glu Ala Ile Val Lys Gln Ile Asn Gln Glu Glu
595 600 605
Phe Lys Gly Phe Ser Tyr Phe Gly Glu Asp Leu Met Pro
610 615 620
<210> 76
<211> 8
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of PRKCE protein fragment
<400> 76
Ile Asp Leu Glu Pro Glu Gly Arg
1 5
<210> 77
<211> 1962
<212> DNA
<213> Artificial Sequence
<220>
<223> MAP4K3-PRKCE fusion gene
<400> 77
atgaaccccg gcttcgattt gtcccgccgg aacccgcagg aggacttcga gctgattcag 60
cgcatcggca gcggcaccta cggcgacgtc tacaagattg atctggagcc agaaggaaga 120
gtgtatgtga tcatcgatct ctcagggtcg tcgggtgaag cccctaaaga caatgaagag 180
cgtgtgttca gggaacgcat gcggccgagg aagcggcagg gggccgtcag gcgcagggtc 240
catcaggtca acggccacaa gttcatggcc acctatcttc ggcagcccac ctactgctcc 300
cattgcagag acttcatctg gggtgtcata ggaaagcagg gataccagtg tcaagtctgc 360
acctgcgtgg tccacaagcg gtgccacgag ctcataatca caaagtgtgc tgggttaaag 420
aagcaggaga cccccgacca ggtgggctcc cagcggttca gcgtcaacat gccccacaag 480
ttcggtatcc acaactacaa ggtccctacc ttctgcgatc actgtgggtc cctgctctgg 540
ggactcttgc ggcagggttt gcagtgtaaa gtctgcaaaa tgaatgttca ccgtcgatgt 600
gagaccaacg tggctcccaa ctgtggagtg gatgccagag gaatcgccaa agtactggcc 660
gacctgggcg ttaccccaga caaaatcacc aacagcggcc agagaaggaa aaagctcatt 720
gctggtgccg agtccccgca gcctgcttct ggaagctcac catctgagga agatcgatcc 780
aagtcagcac ccacctcccc ttgtgaccag gaaataaaag aacttgagaa caacattcgg 840
aaagccttgt catttgacaa ccgaggagag gagcaccggg cagcatcgtc tcctgatggc 900
cagctgatga gccccggtga gaatggcgaa gtccggcaag gccaggccaa gcgcctgggc 960
ctggatgagt tcaacttcat caaggtgttg ggcaaaggca gctttggcaa ggtcatgttg 1020
gcagaactca agggcaaaga tgaagtatat gctgtgaagg tcttaaagaa ggacgtcatc 1080
cttcaggatg atgacgtgga ctgcacaatg acagagaaga ggattttggc tctggcacgg 1140
aaacacccgt accttaccca actctactgc tgcttccaga ccaaggaccg cctctttttc 1200
gtcatggaat atgtaaatgg tggagacctc atgtttcaga ttcagcgctc ccgaaaattc 1260
gacgagcctc gttcacggtt ctatgctgca gaggtcacat cggccctcat gttcctccac 1320
cagcatggag tcatctacag ggatttgaaa ctggacaaca tccttctgga tgcagaaggt 1380
cactgcaagc tggctgactt cgggatgtgc aaggaaggga ttctgaatgg tgtgacgacc 1440
accacgttct gtgggactcc tgactacata gctcctgaga tcctgcagga gttggagtat 1500
ggcccctccg tggactggtg ggccctgggg gtgctgatgt acgagatgat ggctggacag 1560
cctccctttg aggccgacaa tgaggacgac ctatttgagt ccatcctcca tgacgacgtg 1620
ctgtacccag tctggctcag caaggaggct gtcagcatct tgaaagcttt catgacgaag 1680
aatccccaca agcgcctggg ctgtgtggca tcgcagaatg gcgaggacgc catcaagcag 1740
cacccattct tcaaagagat tgactgggtg ctcctggagc agaagaagat caagccaccc 1800
ttcaaaccac gcattaaaac caaaagagac gtcaataatt ttgaccaaga ctttacccgg 1860
gaagagccgg tactcaccct tgtggacgaa gcaattgtaa agcagatcaa ccaggaggaa 1920
ttcaaaggtt tctcctactt tggtgaagac ctgatgccct ga 1962
<210> 78
<211> 45
<212> DNA
<213> Artificial Sequence
<220>
<223> Fused region of MAP4K3-PRKCE fusion gene
<400> 78
acctacggcg acgtctacaa gattgatctg gagccagaag gaaga 45
<210> 79
<211> 653
<212> PRT
<213> Artificial Sequence
<220>
<223> MAP4K3-PRKCE fusion protein
<400> 79
Met Asn Pro Gly Phe Asp Leu Ser Arg Arg Asn Pro Gln Glu Asp Phe
1 5 10 15
Glu Leu Ile Gln Arg Ile Gly Ser Gly Thr Tyr Gly Asp Val Tyr Lys
20 25 30
Ile Asp Leu Glu Pro Glu Gly Arg Val Tyr Val Ile Ile Asp Leu Ser
35 40 45
Gly Ser Ser Gly Glu Ala Pro Lys Asp Asn Glu Glu Arg Val Phe Arg
50 55 60
Glu Arg Met Arg Pro Arg Lys Arg Gln Gly Ala Val Arg Arg Arg Val
65 70 75 80
His Gln Val Asn Gly His Lys Phe Met Ala Thr Tyr Leu Arg Gln Pro
85 90 95
Thr Tyr Cys Ser His Cys Arg Asp Phe Ile Trp Gly Val Ile Gly Lys
100 105 110
Gln Gly Tyr Gln Cys Gln Val Cys Thr Cys Val Val His Lys Arg Cys
115 120 125
His Glu Leu Ile Ile Thr Lys Cys Ala Gly Leu Lys Lys Gln Glu Thr
130 135 140
Pro Asp Gln Val Gly Ser Gln Arg Phe Ser Val Asn Met Pro His Lys
145 150 155 160
Phe Gly Ile His Asn Tyr Lys Val Pro Thr Phe Cys Asp His Cys Gly
165 170 175
Ser Leu Leu Trp Gly Leu Leu Arg Gln Gly Leu Gln Cys Lys Val Cys
180 185 190
Lys Met Asn Val His Arg Arg Cys Glu Thr Asn Val Ala Pro Asn Cys
195 200 205
Gly Val Asp Ala Arg Gly Ile Ala Lys Val Leu Ala Asp Leu Gly Val
210 215 220
Thr Pro Asp Lys Ile Thr Asn Ser Gly Gln Arg Arg Lys Lys Leu Ile
225 230 235 240
Ala Gly Ala Glu Ser Pro Gln Pro Ala Ser Gly Ser Ser Pro Ser Glu
245 250 255
Glu Asp Arg Ser Lys Ser Ala Pro Thr Ser Pro Cys Asp Gln Glu Ile
260 265 270
Lys Glu Leu Glu Asn Asn Ile Arg Lys Ala Leu Ser Phe Asp Asn Arg
275 280 285
Gly Glu Glu His Arg Ala Ala Ser Ser Pro Asp Gly Gln Leu Met Ser
290 295 300
Pro Gly Glu Asn Gly Glu Val Arg Gln Gly Gln Ala Lys Arg Leu Gly
305 310 315 320
Leu Asp Glu Phe Asn Phe Ile Lys Val Leu Gly Lys Gly Ser Phe Gly
325 330 335
Lys Val Met Leu Ala Glu Leu Lys Gly Lys Asp Glu Val Tyr Ala Val
340 345 350
Lys Val Leu Lys Lys Asp Val Ile Leu Gln Asp Asp Asp Val Asp Cys
355 360 365
Thr Met Thr Glu Lys Arg Ile Leu Ala Leu Ala Arg Lys His Pro Tyr
370 375 380
Leu Thr Gln Leu Tyr Cys Cys Phe Gln Thr Lys Asp Arg Leu Phe Phe
385 390 395 400
Val Met Glu Tyr Val Asn Gly Gly Asp Leu Met Phe Gln Ile Gln Arg
405 410 415
Ser Arg Lys Phe Asp Glu Pro Arg Ser Arg Phe Tyr Ala Ala Glu Val
420 425 430
Thr Ser Ala Leu Met Phe Leu His Gln His Gly Val Ile Tyr Arg Asp
435 440 445
Leu Lys Leu Asp Asn Ile Leu Leu Asp Ala Glu Gly His Cys Lys Leu
450 455 460
Ala Asp Phe Gly Met Cys Lys Glu Gly Ile Leu Asn Gly Val Thr Thr
465 470 475 480
Thr Thr Phe Cys Gly Thr Pro Asp Tyr Ile Ala Pro Glu Ile Leu Gln
485 490 495
Glu Leu Glu Tyr Gly Pro Ser Val Asp Trp Trp Ala Leu Gly Val Leu
500 505 510
Met Tyr Glu Met Met Ala Gly Gln Pro Pro Phe Glu Ala Asp Asn Glu
515 520 525
Asp Asp Leu Phe Glu Ser Ile Leu His Asp Asp Val Leu Tyr Pro Val
530 535 540
Trp Leu Ser Lys Glu Ala Val Ser Ile Leu Lys Ala Phe Met Thr Lys
545 550 555 560
Asn Pro His Lys Arg Leu Gly Cys Val Ala Ser Gln Asn Gly Glu Asp
565 570 575
Ala Ile Lys Gln His Pro Phe Phe Lys Glu Ile Asp Trp Val Leu Leu
580 585 590
Glu Gln Lys Lys Ile Lys Pro Pro Phe Lys Pro Arg Ile Lys Thr Lys
595 600 605
Arg Asp Val Asn Asn Phe Asp Gln Asp Phe Thr Arg Glu Glu Pro Val
610 615 620
Leu Thr Leu Val Asp Glu Ala Ile Val Lys Gln Ile Asn Gln Glu Glu
625 630 635 640
Phe Lys Gly Phe Ser Tyr Phe Gly Glu Asp Leu Met Pro
645 650
<210> 80
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Fused region of MAP4K3-PRKCE fusion protein
<400> 80
Thr Tyr Gly Asp Val Tyr Lys Ile Asp Leu Glu Pro Glu Gly Arg
1 5 10 15
<210> 81
<211> 2788
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of BCAS3 gene (NM_001099432)
<400> 81
atgaatgaag ctatggctac agattcccca agaagaccca gtcgttgtac tggtggagtt 60
gtggttcgcc cccaggctgt cacagagcag tcctacatgg aaagtgttgt gacttttctg 120
caggatgttg tgccacaggc ttacagtgga acacctctaa cagaagaaaa ggagaaaata 180
gtctgggtca gatttgaaaa tgcagattta aatgatacat caagaaatct ggaatttcat 240
gaaatacata gtactgggaa tgaaccgcct ttgttgatta tgattggcta cagtgatgga 300
atgcaggtct ggagcatccc tatcagtggc gaagcacaag agctcttctc tgttcgacat 360
ggcccaattc gagcggctag aatcttgcct gctccacagt ttggtgctca aaaatgtgat 420
aactttgctg aaaaaagacc cctccttggt gtttgtaaga gcattggatc ttctggcaca 480
agcccaccgt actgttgtgt ggatctgtat tcacttcgta ctggggagat ggtcaagtcc 540
attcaattta agacacctat ttatgatctc cattgcaata aacggatcct tgtcgtagtc 600
ttgcaggaga aaattgctgc ctttgatagc tgtactttca cgaagaaatt ctttgttaca 660
agctgctatc catgtccagg gccaaacatg aatcctattg ctcttgggag ccgctggctt 720
gcttatgcag aaaacaagtt gattcgatgt catcagtccc gtggtggagc ctgtggagac 780
aacattcagt cttatactgc cacagtcatt agtgctgcta aaacattgaa aagtggcctg 840
acaatggtag ggaaagtggt gactcagctg acaggcacac tgccttcagg tgtgacagaa 900
gatgatgttg ccatccacag taattcacgg cggagtcctt tggtcccagg catcatcaca 960
gttattgaca ccgaaaccgt tggagagggc caggtgcttg tgagtgagga ttctgacagt 1020
gatggcattg tggcccactt ccctgcccat gagaagccag tgtgctgcat ggcttttaat 1080
acaagtggaa tgcttctagt cacaacagac acccttggcc atgactttca tgtcttccaa 1140
attctgactc atccttggtc ctcatcacaa tgtgctgtcc accatctgta tactcttcac 1200
aggggagaaa ctgaagccaa agtacaggac atctgcttca gccatgactg tcgctgggtt 1260
gtggtcagta ctctccgggg tacttcccac gttttcccca tcaaccctta tggtggccag 1320
ccttgtgttc gtacacatat gtcaccacga gtagtgaatc gcatgagccg tttccagaaa 1380
agtgctggac tggaagagat tgaacaagaa ctgacgtcta agcaaggagg tcgctgtagc 1440
cctgttccag gtctatcaag cagcccttct gggtcaccct tgcatgggaa actgaacagc 1500
caagactcct ataacaattt taccaacaac aaccctggca accctcggct ctctcctctt 1560
cccagcttga tggtagtgat gcctcttgca caaatcaagc agccaatgac attggggacc 1620
atcaccaaac gaaccgggcc ttatctcttt ggagcggggt gtttttccat aaaagcccca 1680
tgcaaagtta aacctcctcc acaaatttca cccagcaaat cgatgggcgg agaattttgt 1740
gtggctgcta tcttcggaac atccaggtca tggtttgcaa ataatgcagg tctgaaaaga 1800
gaaaaagatc agtccaaaca agttgtagtt gagtccctgt acattatcag ttgctatggc 1860
accttagtgg aacacatgat ggagccgcga cccctcagca ctgcacccaa gattagtgac 1920
gacacaccac tggaaatgat gacatcgcct cgagccagct ggactctggt tagaacccct 1980
caatggaatg aattgcagcc accgtttaat gcaaaccacc ctctgctcct cgctgcagat 2040
gcagtacagt attatcagtt cctgcttgct ggcctggttc cccctggaag tcctgggccc 2100
attactcgac atgggtctta cgacagttta gcttctgacc atagtggaca ggaagatgaa 2160
gaatggcttt cccaggttga aattgtaaca cacactggac cccatagacg tctgtggatg 2220
ggtccacagt tccagttcaa aaccatccat ccctcaggcc aaaccacagt tatctcatcc 2280
agttcatctg tgttgcagtc tcatggtccg agtgacacgc cacagcctct tttggatttt 2340
gatacagatg atcttgatct caacagtctc aggatccagc cagtccgctc tgaccccgtc 2400
agcatgccag ggtcatcccg tccagtctct gatcgaaggg gagtttccac agtgattgat 2460
gctgcctcag ggtacctttg acaggagcgt gaccctgctg gaggtgtgcg ggagctggcc 2520
tgagggcttc gggctgcggc acatgtcctc catggagcac acggaggagg gcctccggga 2580
gcgacttgcc gacgccatgg ccgagtcacc tagccgggac gtcgtgggat ccggaacaga 2640
acttcagcga gagggaagca tcgagactct gagtaacagc tcaggctcca ccagcggcag 2700
cataccaaga aactttgatg gctaccgatc tccgctgccc accaatgaga gccagcccct 2760
cagcctcttc ccgactggct tcccgtag 2788
<210> 82
<211> 2470
<212> DNA
<213> Artificial Sequence
<220>
<223> BCAS3 gene fragment
<400> 82
atgaatgaag ctatggctac agattcccca agaagaccca gtcgttgtac tggtggagtt 60
gtggttcgcc cccaggctgt cacagagcag tcctacatgg aaagtgttgt gacttttctg 120
caggatgttg tgccacaggc ttacagtgga acacctctaa cagaagaaaa ggagaaaata 180
gtctgggtca gatttgaaaa tgcagattta aatgatacat caagaaatct ggaatttcat 240
gaaatacata gtactgggaa tgaaccgcct ttgttgatta tgattggcta cagtgatgga 300
atgcaggtct ggagcatccc tatcagtggc gaagcacaag agctcttctc tgttcgacat 360
ggcccaattc gagcggctag aatcttgcct gctccacagt ttggtgctca aaaatgtgat 420
aactttgctg aaaaaagacc cctccttggt gtttgtaaga gcattggatc ttctggcaca 480
agcccaccgt actgttgtgt ggatctgtat tcacttcgta ctggggagat ggtcaagtcc 540
attcaattta agacacctat ttatgatctc cattgcaata aacggatcct tgtcgtagtc 600
ttgcaggaga aaattgctgc ctttgatagc tgtactttca cgaagaaatt ctttgttaca 660
agctgctatc catgtccagg gccaaacatg aatcctattg ctcttgggag ccgctggctt 720
gcttatgcag aaaacaagtt gattcgatgt catcagtccc gtggtggagc ctgtggagac 780
aacattcagt cttatactgc cacagtcatt agtgctgcta aaacattgaa aagtggcctg 840
acaatggtag ggaaagtggt gactcagctg acaggcacac tgccttcagg tgtgacagaa 900
gatgatgttg ccatccacag taattcacgg cggagtcctt tggtcccagg catcatcaca 960
gttattgaca ccgaaaccgt tggagagggc caggtgcttg tgagtgagga ttctgacagt 1020
gatggcattg tggcccactt ccctgcccat gagaagccag tgtgctgcat ggcttttaat 1080
acaagtggaa tgcttctagt cacaacagac acccttggcc atgactttca tgtcttccaa 1140
attctgactc atccttggtc ctcatcacaa tgtgctgtcc accatctgta tactcttcac 1200
aggggagaaa ctgaagccaa agtacaggac atctgcttca gccatgactg tcgctgggtt 1260
gtggtcagta ctctccgggg tacttcccac gttttcccca tcaaccctta tggtggccag 1320
ccttgtgttc gtacacatat gtcaccacga gtagtgaatc gcatgagccg tttccagaaa 1380
agtgctggac tggaagagat tgaacaagaa ctgacgtcta agcaaggagg tcgctgtagc 1440
cctgttccag gtctatcaag cagcccttct gggtcaccct tgcatgggaa actgaacagc 1500
caagactcct ataacaattt taccaacaac aaccctggca accctcggct ctctcctctt 1560
cccagcttga tggtagtgat gcctcttgca caaatcaagc agccaatgac attggggacc 1620
atcaccaaac gaaccgggcc ttatctcttt ggagcggggt gtttttccat aaaagcccca 1680
tgcaaagtta aacctcctcc acaaatttca cccagcaaat cgatgggcgg agaattttgt 1740
gtggctgcta tcttcggaac atccaggtca tggtttgcaa ataatgcagg tctgaaaaga 1800
gaaaaagatc agtccaaaca agttgtagtt gagtccctgt acattatcag ttgctatggc 1860
accttagtgg aacacatgat ggagccgcga cccctcagca ctgcacccaa gattagtgac 1920
gacacaccac tggaaatgat gacatcgcct cgagccagct ggactctggt tagaacccct 1980
caatggaatg aattgcagcc accgtttaat gcaaaccacc ctctgctcct cgctgcagat 2040
gcagtacagt attatcagtt cctgcttgct ggcctggttc cccctggaag tcctgggccc 2100
attactcgac atgggtctta cgacagttta gcttctgacc atagtggaca ggaagatgaa 2160
gaatggcttt cccaggttga aattgtaaca cacactggac cccatagacg tctgtggatg 2220
ggtccacagt tccagttcaa aaccatccat ccctcaggcc aaaccacagt tatctcatcc 2280
agttcatctg tgttgcagtc tcatggtccg agtgacacgc cacagcctct tttggatttt 2340
gatacagatg atcttgatct caacagtctc aggatccagc cagtccgctc tgaccccgtc 2400
agcatgccag ggtcatcccg tccagtctct gatcgaaggg gagtttccac agtgattgat 2460
gctgcctcag 2470
<210> 83
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of BCAS3 gene fragment
<400> 83
acagtgattg atgctgcctc ag 22
<210> 84
<211> 928
<212> PRT
<213> Artificial Sequence
<220>
<223> BCAS3 protein
<400> 84
Met Asn Glu Ala Met Ala Thr Asp Ser Pro Arg Arg Pro Ser Arg Cys
1 5 10 15
Thr Gly Gly Val Val Val Arg Pro Gln Ala Val Thr Glu Gln Ser Tyr
20 25 30
Met Glu Ser Val Val Thr Phe Leu Gln Asp Val Val Pro Gln Ala Tyr
35 40 45
Ser Gly Thr Pro Leu Thr Glu Glu Lys Glu Lys Ile Val Trp Val Arg
50 55 60
Phe Glu Asn Ala Asp Leu Asn Asp Thr Ser Arg Asn Leu Glu Phe His
65 70 75 80
Glu Ile His Ser Thr Gly Asn Glu Pro Pro Leu Leu Ile Met Ile Gly
85 90 95
Tyr Ser Asp Gly Met Gln Val Trp Ser Ile Pro Ile Ser Gly Glu Ala
100 105 110
Gln Glu Leu Phe Ser Val Arg His Gly Pro Ile Arg Ala Ala Arg Ile
115 120 125
Leu Pro Ala Pro Gln Phe Gly Ala Gln Lys Cys Asp Asn Phe Ala Glu
130 135 140
Lys Arg Pro Leu Leu Gly Val Cys Lys Ser Ile Gly Ser Ser Gly Thr
145 150 155 160
Ser Pro Pro Tyr Cys Cys Val Asp Leu Tyr Ser Leu Arg Thr Gly Glu
165 170 175
Met Val Lys Ser Ile Gln Phe Lys Thr Pro Ile Tyr Asp Leu His Cys
180 185 190
Asn Lys Arg Ile Leu Val Val Val Leu Gln Glu Lys Ile Ala Ala Phe
195 200 205
Asp Ser Cys Thr Phe Thr Lys Lys Phe Phe Val Thr Ser Cys Tyr Pro
210 215 220
Cys Pro Gly Pro Asn Met Asn Pro Ile Ala Leu Gly Ser Arg Trp Leu
225 230 235 240
Ala Tyr Ala Glu Asn Lys Leu Ile Arg Cys His Gln Ser Arg Gly Gly
245 250 255
Ala Cys Gly Asp Asn Ile Gln Ser Tyr Thr Ala Thr Val Ile Ser Ala
260 265 270
Ala Lys Thr Leu Lys Ser Gly Leu Thr Met Val Gly Lys Val Val Thr
275 280 285
Gln Leu Thr Gly Thr Leu Pro Ser Gly Val Thr Glu Asp Asp Val Ala
290 295 300
Ile His Ser Asn Ser Arg Arg Ser Pro Leu Val Pro Gly Ile Ile Thr
305 310 315 320
Val Ile Asp Thr Glu Thr Val Gly Glu Gly Gln Val Leu Val Ser Glu
325 330 335
Asp Ser Asp Ser Asp Gly Ile Val Ala His Phe Pro Ala His Glu Lys
340 345 350
Pro Val Cys Cys Met Ala Phe Asn Thr Ser Gly Met Leu Leu Val Thr
355 360 365
Thr Asp Thr Leu Gly His Asp Phe His Val Phe Gln Ile Leu Thr His
370 375 380
Pro Trp Ser Ser Ser Gln Cys Ala Val His His Leu Tyr Thr Leu His
385 390 395 400
Arg Gly Glu Thr Glu Ala Lys Val Gln Asp Ile Cys Phe Ser His Asp
405 410 415
Cys Arg Trp Val Val Val Ser Thr Leu Arg Gly Thr Ser His Val Phe
420 425 430
Pro Ile Asn Pro Tyr Gly Gly Gln Pro Cys Val Arg Thr His Met Ser
435 440 445
Pro Arg Val Val Asn Arg Met Ser Arg Phe Gln Lys Ser Ala Gly Leu
450 455 460
Glu Glu Ile Glu Gln Glu Leu Thr Ser Lys Gln Gly Gly Arg Cys Ser
465 470 475 480
Pro Val Pro Gly Leu Ser Ser Ser Pro Ser Gly Ser Pro Leu His Gly
485 490 495
Lys Leu Asn Ser Gln Asp Ser Tyr Asn Asn Phe Thr Asn Asn Asn Pro
500 505 510
Gly Asn Pro Arg Leu Ser Pro Leu Pro Ser Leu Met Val Val Met Pro
515 520 525
Leu Ala Gln Ile Lys Gln Pro Met Thr Leu Gly Thr Ile Thr Lys Arg
530 535 540
Thr Gly Pro Tyr Leu Phe Gly Ala Gly Cys Phe Ser Ile Lys Ala Pro
545 550 555 560
Cys Lys Val Lys Pro Pro Pro Gln Ile Ser Pro Ser Lys Ser Met Gly
565 570 575
Gly Glu Phe Cys Val Ala Ala Ile Phe Gly Thr Ser Arg Ser Trp Phe
580 585 590
Ala Asn Asn Ala Gly Leu Lys Arg Glu Lys Asp Gln Ser Lys Gln Val
595 600 605
Val Val Glu Ser Leu Tyr Ile Ile Ser Cys Tyr Gly Thr Leu Val Glu
610 615 620
His Met Met Glu Pro Arg Pro Leu Ser Thr Ala Pro Lys Ile Ser Asp
625 630 635 640
Asp Thr Pro Leu Glu Met Met Thr Ser Pro Arg Ala Ser Trp Thr Leu
645 650 655
Val Arg Thr Pro Gln Trp Asn Glu Leu Gln Pro Pro Phe Asn Ala Asn
660 665 670
His Pro Leu Leu Leu Ala Ala Asp Ala Val Gln Tyr Tyr Gln Phe Leu
675 680 685
Leu Ala Gly Leu Val Pro Pro Gly Ser Pro Gly Pro Ile Thr Arg His
690 695 700
Gly Ser Tyr Asp Ser Leu Ala Ser Asp His Ser Gly Gln Glu Asp Glu
705 710 715 720
Glu Trp Leu Ser Gln Val Glu Ile Val Thr His Thr Gly Pro His Arg
725 730 735
Arg Leu Trp Met Gly Pro Gln Phe Gln Phe Lys Thr Ile His Pro Ser
740 745 750
Gly Gln Thr Thr Val Ile Ser Ser Ser Ser Ser Val Leu Gln Ser His
755 760 765
Gly Pro Ser Asp Thr Pro Gln Pro Leu Leu Asp Phe Asp Thr Asp Asp
770 775 780
Leu Asp Leu Asn Ser Leu Arg Ile Gln Pro Val Arg Ser Asp Pro Val
785 790 795 800
Ser Met Pro Gly Ser Ser Arg Pro Val Ser Asp Arg Arg Gly Val Ser
805 810 815
Thr Val Ile Asp Ala Ala Ser Gly Thr Phe Asp Arg Ser Val Thr Leu
820 825 830
Leu Glu Val Cys Gly Ser Trp Pro Glu Gly Phe Gly Leu Arg His Met
835 840 845
Ser Ser Met Glu His Thr Glu Glu Gly Leu Arg Glu Arg Leu Ala Asp
850 855 860
Ala Met Ala Glu Ser Pro Ser Arg Asp Val Val Gly Ser Gly Thr Glu
865 870 875 880
Leu Gln Arg Glu Gly Ser Ile Glu Thr Leu Ser Asn Ser Ser Gly Ser
885 890 895
Thr Ser Gly Ser Ile Pro Arg Asn Phe Asp Gly Tyr Arg Ser Pro Leu
900 905 910
Pro Thr Asn Glu Ser Gln Pro Leu Ser Leu Phe Pro Thr Gly Phe Pro
915 920 925
<210> 85
<211> 823
<212> PRT
<213> Artificial Sequence
<220>
<223> BCAS3 protein fragment
<400> 85
Met Asn Glu Ala Met Ala Thr Asp Ser Pro Arg Arg Pro Ser Arg Cys
1 5 10 15
Thr Gly Gly Val Val Val Arg Pro Gln Ala Val Thr Glu Gln Ser Tyr
20 25 30
Met Glu Ser Val Val Thr Phe Leu Gln Asp Val Val Pro Gln Ala Tyr
35 40 45
Ser Gly Thr Pro Leu Thr Glu Glu Lys Glu Lys Ile Val Trp Val Arg
50 55 60
Phe Glu Asn Ala Asp Leu Asn Asp Thr Ser Arg Asn Leu Glu Phe His
65 70 75 80
Glu Ile His Ser Thr Gly Asn Glu Pro Pro Leu Leu Ile Met Ile Gly
85 90 95
Tyr Ser Asp Gly Met Gln Val Trp Ser Ile Pro Ile Ser Gly Glu Ala
100 105 110
Gln Glu Leu Phe Ser Val Arg His Gly Pro Ile Arg Ala Ala Arg Ile
115 120 125
Leu Pro Ala Pro Gln Phe Gly Ala Gln Lys Cys Asp Asn Phe Ala Glu
130 135 140
Lys Arg Pro Leu Leu Gly Val Cys Lys Ser Ile Gly Ser Ser Gly Thr
145 150 155 160
Ser Pro Pro Tyr Cys Cys Val Asp Leu Tyr Ser Leu Arg Thr Gly Glu
165 170 175
Met Val Lys Ser Ile Gln Phe Lys Thr Pro Ile Tyr Asp Leu His Cys
180 185 190
Asn Lys Arg Ile Leu Val Val Val Leu Gln Glu Lys Ile Ala Ala Phe
195 200 205
Asp Ser Cys Thr Phe Thr Lys Lys Phe Phe Val Thr Ser Cys Tyr Pro
210 215 220
Cys Pro Gly Pro Asn Met Asn Pro Ile Ala Leu Gly Ser Arg Trp Leu
225 230 235 240
Ala Tyr Ala Glu Asn Lys Leu Ile Arg Cys His Gln Ser Arg Gly Gly
245 250 255
Ala Cys Gly Asp Asn Ile Gln Ser Tyr Thr Ala Thr Val Ile Ser Ala
260 265 270
Ala Lys Thr Leu Lys Ser Gly Leu Thr Met Val Gly Lys Val Val Thr
275 280 285
Gln Leu Thr Gly Thr Leu Pro Ser Gly Val Thr Glu Asp Asp Val Ala
290 295 300
Ile His Ser Asn Ser Arg Arg Ser Pro Leu Val Pro Gly Ile Ile Thr
305 310 315 320
Val Ile Asp Thr Glu Thr Val Gly Glu Gly Gln Val Leu Val Ser Glu
325 330 335
Asp Ser Asp Ser Asp Gly Ile Val Ala His Phe Pro Ala His Glu Lys
340 345 350
Pro Val Cys Cys Met Ala Phe Asn Thr Ser Gly Met Leu Leu Val Thr
355 360 365
Thr Asp Thr Leu Gly His Asp Phe His Val Phe Gln Ile Leu Thr His
370 375 380
Pro Trp Ser Ser Ser Gln Cys Ala Val His His Leu Tyr Thr Leu His
385 390 395 400
Arg Gly Glu Thr Glu Ala Lys Val Gln Asp Ile Cys Phe Ser His Asp
405 410 415
Cys Arg Trp Val Val Val Ser Thr Leu Arg Gly Thr Ser His Val Phe
420 425 430
Pro Ile Asn Pro Tyr Gly Gly Gln Pro Cys Val Arg Thr His Met Ser
435 440 445
Pro Arg Val Val Asn Arg Met Ser Arg Phe Gln Lys Ser Ala Gly Leu
450 455 460
Glu Glu Ile Glu Gln Glu Leu Thr Ser Lys Gln Gly Gly Arg Cys Ser
465 470 475 480
Pro Val Pro Gly Leu Ser Ser Ser Pro Ser Gly Ser Pro Leu His Gly
485 490 495
Lys Leu Asn Ser Gln Asp Ser Tyr Asn Asn Phe Thr Asn Asn Asn Pro
500 505 510
Gly Asn Pro Arg Leu Ser Pro Leu Pro Ser Leu Met Val Val Met Pro
515 520 525
Leu Ala Gln Ile Lys Gln Pro Met Thr Leu Gly Thr Ile Thr Lys Arg
530 535 540
Thr Gly Pro Tyr Leu Phe Gly Ala Gly Cys Phe Ser Ile Lys Ala Pro
545 550 555 560
Cys Lys Val Lys Pro Pro Pro Gln Ile Ser Pro Ser Lys Ser Met Gly
565 570 575
Gly Glu Phe Cys Val Ala Ala Ile Phe Gly Thr Ser Arg Ser Trp Phe
580 585 590
Ala Asn Asn Ala Gly Leu Lys Arg Glu Lys Asp Gln Ser Lys Gln Val
595 600 605
Val Val Glu Ser Leu Tyr Ile Ile Ser Cys Tyr Gly Thr Leu Val Glu
610 615 620
His Met Met Glu Pro Arg Pro Leu Ser Thr Ala Pro Lys Ile Ser Asp
625 630 635 640
Asp Thr Pro Leu Glu Met Met Thr Ser Pro Arg Ala Ser Trp Thr Leu
645 650 655
Val Arg Thr Pro Gln Trp Asn Glu Leu Gln Pro Pro Phe Asn Ala Asn
660 665 670
His Pro Leu Leu Leu Ala Ala Asp Ala Val Gln Tyr Tyr Gln Phe Leu
675 680 685
Leu Ala Gly Leu Val Pro Pro Gly Ser Pro Gly Pro Ile Thr Arg His
690 695 700
Gly Ser Tyr Asp Ser Leu Ala Ser Asp His Ser Gly Gln Glu Asp Glu
705 710 715 720
Glu Trp Leu Ser Gln Val Glu Ile Val Thr His Thr Gly Pro His Arg
725 730 735
Arg Leu Trp Met Gly Pro Gln Phe Gln Phe Lys Thr Ile His Pro Ser
740 745 750
Gly Gln Thr Thr Val Ile Ser Ser Ser Ser Ser Val Leu Gln Ser His
755 760 765
Gly Pro Ser Asp Thr Pro Gln Pro Leu Leu Asp Phe Asp Thr Asp Asp
770 775 780
Leu Asp Leu Asn Ser Leu Arg Ile Gln Pro Val Arg Ser Asp Pro Val
785 790 795 800
Ser Met Pro Gly Ser Ser Arg Pro Val Ser Asp Arg Arg Gly Val Ser
805 810 815
Thr Val Ile Asp Ala Ala Ser
820
<210> 86
<211> 7
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of BCAS3 protein fragment
<400> 86
Thr Val Ile Asp Ala Ala Ser
1 5
<210> 87
<211> 1883
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of MAP3K3 gene (NM_002401)
<400> 87
atggacacga acaggaggca ttgaactcaa tcatgaacga tctggtggcc ctccagatga 60
accgacgtca ccggatgcct ggatatgaga ccatgaagaa caaagacaca ggtcactcaa 120
ataggcagag tgacgtcaga atcaagttcg agcacaacgg ggagaggcga attatagcgt 180
tcagccggcc tgtgaaatat gaagatgtgg agcacaaggt gacaacagta tttggacaac 240
ctcttgatct acattacatg aacaatgagc tctccatcct gctgaaaaac caagatgatc 300
ttgataaagc aattgacatt ttagatagaa gctcaagcat gaaaagcctt aggatattgc 360
tgttgtccca ggacagaaac cataacagtt cctctcccca ctctggggtg tccagacagg 420
tgcggatcaa ggcttcccag tccgcagggg atataaatac tatctaccag ccccccgagc 480
ccagaagcag gcacctctct gtcagctccc agaaccctgg ccgaagctca cctccccctg 540
gctatgttcc tgagcggcag cagcacattg cccggcaggg gtcctacacc agcatcaaca 600
gtgaggggga gttcatccca gagaccagcg agcagtgcat gctggatccc ctgagcagtg 660
cagaaaattc cttgtctgga agctgccaat ccttggacag gtcagcagac agcccatcct 720
tccggaaatc acgaatgtcc cgtgcccaga gcttccctga caacagacag gaatactcag 780
atcgggaaac tcagctttat gacaaagggg tcaaaggtgg aacctacccc cggcgctacc 840
acgtgtctgt gcaccacaag gactacagtg atggcagaag aacatttccc cgaatacggc 900
gtcatcaagg caacttgttc accctggtgc cctccagccg ctccctgagc acaaatggcg 960
agaacatggg tctggctgtg caatacctgg acccccgtgg gcgcctgcgg agtgcggaca 1020
gcgagaatgc cctctctgtg caggagagga atgtgccaac caagtctccc agtgccccca 1080
tcaactggcg ccggggaaag ctcctgggcc agggtgcctt cggcagggtc tatttgtgct 1140
atgacgtgga cacgggacgt gaacttgctt ccaagcaggt ccaatttgat ccagacagtc 1200
ctgagacaag caaggaggtg agtgctctgg agtgcgagat ccagttgcta aagaacttgc 1260
agcatgagcg catcgtgcag tactatggct gtctgcggga ccgcgctgag aagaccctga 1320
ccatcttcat ggagtacatg ccagggggct cggtgaaaga ccagttgaag gcttacggtg 1380
ctctgacaga gagcgtgacc cgaaagtaca cgcggcagat cctggagggc atgtcctacc 1440
tgcacagcaa catgattgtt caccgggaca ttaagggagc caacatcctc cgagactctg 1500
ctgggaatgt aaagctgggg gactttgggg ccagcaaacg cctgcagacg atctgtatgt 1560
cggggacggg catgcgctcc gtcactggca caccctactg gatgagccct gaggtgatca 1620
gcggcgaggg ctatggaagg aaagcagacg tgtggagcct gggctgcact gtggtggaga 1680
tgctgacaga gaaaccaccg tgggcagagt atgaagctat ggccgccatc ttcaagattg 1740
ccacccagcc caccaatcct cagctgccct cccacatctc tgaacatggc cgggacttcc 1800
tgaggcgcat ttttgtggag gctcgccaga gaccttcagc tgaggagctg ctcacacacc 1860
actttgcaca gctcatgtac tga 1883
<210> 88
<211> 1877
<212> DNA
<213> Artificial Sequence
<220>
<223> MAP3K3 gene fragment
<400> 88
acgaacagga ggcattgaac tcaatcatga acgatctggt ggccctccag atgaaccgac 60
gtcaccggat gcctggatat gagaccatga agaacaaaga cacaggtcac tcaaataggc 120
agagtgacgt cagaatcaag ttcgagcaca acggggagag gcgaattata gcgttcagcc 180
ggcctgtgaa atatgaagat gtggagcaca aggtgacaac agtatttgga caacctcttg 240
atctacatta catgaacaat gagctctcca tcctgctgaa aaaccaagat gatcttgata 300
aagcaattga cattttagat agaagctcaa gcatgaaaag ccttaggata ttgctgttgt 360
cccaggacag aaaccataac agttcctctc cccactctgg ggtgtccaga caggtgcgga 420
tcaaggcttc ccagtccgca ggggatataa atactatcta ccagcccccc gagcccagaa 480
gcaggcacct ctctgtcagc tcccagaacc ctggccgaag ctcacctccc cctggctatg 540
ttcctgagcg gcagcagcac attgcccggc aggggtccta caccagcatc aacagtgagg 600
gggagttcat cccagagacc agcgagcagt gcatgctgga tcccctgagc agtgcagaaa 660
attccttgtc tggaagctgc caatccttgg acaggtcagc agacagccca tccttccgga 720
aatcacgaat gtcccgtgcc cagagcttcc ctgacaacag acaggaatac tcagatcggg 780
aaactcagct ttatgacaaa ggggtcaaag gtggaaccta cccccggcgc taccacgtgt 840
ctgtgcacca caaggactac agtgatggca gaagaacatt tccccgaata cggcgtcatc 900
aaggcaactt gttcaccctg gtgccctcca gccgctccct gagcacaaat ggcgagaaca 960
tgggtctggc tgtgcaatac ctggaccccc gtgggcgcct gcggagtgcg gacagcgaga 1020
atgccctctc tgtgcaggag aggaatgtgc caaccaagtc tcccagtgcc cccatcaact 1080
ggcgccgggg aaagctcctg ggccagggtg ccttcggcag ggtctatttg tgctatgacg 1140
tggacacggg acgtgaactt gcttccaagc aggtccaatt tgatccagac agtcctgaga 1200
caagcaagga ggtgagtgct ctggagtgcg agatccagtt gctaaagaac ttgcagcatg 1260
agcgcatcgt gcagtactat ggctgtctgc gggaccgcgc tgagaagacc ctgaccatct 1320
tcatggagta catgccaggg ggctcggtga aagaccagtt gaaggcttac ggtgctctga 1380
cagagagcgt gacccgaaag tacacgcggc agatcctgga gggcatgtcc tacctgcaca 1440
gcaacatgat tgttcaccgg gacattaagg gagccaacat cctccgagac tctgctggga 1500
atgtaaagct gggggacttt ggggccagca aacgcctgca gacgatctgt atgtcgggga 1560
cgggcatgcg ctccgtcact ggcacaccct actggatgag ccctgaggtg atcagcggcg 1620
agggctatgg aaggaaagca gacgtgtgga gcctgggctg cactgtggtg gagatgctga 1680
cagagaaacc accgtgggca gagtatgaag ctatggccgc catcttcaag attgccaccc 1740
agcccaccaa tcctcagctg ccctcccaca tctctgaaca tggccgggac ttcctgaggc 1800
gcatttttgt ggaggctcgc cagagacctt cagctgagga gctgctcaca caccactttg 1860
cacagctcat gtactga 1877
<210> 89
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of MAP3K3 gene fragment
<400> 89
acgaacagga ggcattgaac tca 23
<210> 90
<211> 626
<212> PRT
<213> Artificial Sequence
<220>
<223> MAP3K3 protein
<400> 90
Met Asp Glu Gln Glu Ala Leu Asn Ser Ile Met Asn Asp Leu Val Ala
1 5 10 15
Leu Gln Met Asn Arg Arg His Arg Met Pro Gly Tyr Glu Thr Met Lys
20 25 30
Asn Lys Asp Thr Gly His Ser Asn Arg Gln Ser Asp Val Arg Ile Lys
35 40 45
Phe Glu His Asn Gly Glu Arg Arg Ile Ile Ala Phe Ser Arg Pro Val
50 55 60
Lys Tyr Glu Asp Val Glu His Lys Val Thr Thr Val Phe Gly Gln Pro
65 70 75 80
Leu Asp Leu His Tyr Met Asn Asn Glu Leu Ser Ile Leu Leu Lys Asn
85 90 95
Gln Asp Asp Leu Asp Lys Ala Ile Asp Ile Leu Asp Arg Ser Ser Ser
100 105 110
Met Lys Ser Leu Arg Ile Leu Leu Leu Ser Gln Asp Arg Asn His Asn
115 120 125
Ser Ser Ser Pro His Ser Gly Val Ser Arg Gln Val Arg Ile Lys Ala
130 135 140
Ser Gln Ser Ala Gly Asp Ile Asn Thr Ile Tyr Gln Pro Pro Glu Pro
145 150 155 160
Arg Ser Arg His Leu Ser Val Ser Ser Gln Asn Pro Gly Arg Ser Ser
165 170 175
Pro Pro Pro Gly Tyr Val Pro Glu Arg Gln Gln His Ile Ala Arg Gln
180 185 190
Gly Ser Tyr Thr Ser Ile Asn Ser Glu Gly Glu Phe Ile Pro Glu Thr
195 200 205
Ser Glu Gln Cys Met Leu Asp Pro Leu Ser Ser Ala Glu Asn Ser Leu
210 215 220
Ser Gly Ser Cys Gln Ser Leu Asp Arg Ser Ala Asp Ser Pro Ser Phe
225 230 235 240
Arg Lys Ser Arg Met Ser Arg Ala Gln Ser Phe Pro Asp Asn Arg Gln
245 250 255
Glu Tyr Ser Asp Arg Glu Thr Gln Leu Tyr Asp Lys Gly Val Lys Gly
260 265 270
Gly Thr Tyr Pro Arg Arg Tyr His Val Ser Val His His Lys Asp Tyr
275 280 285
Ser Asp Gly Arg Arg Thr Phe Pro Arg Ile Arg Arg His Gln Gly Asn
290 295 300
Leu Phe Thr Leu Val Pro Ser Ser Arg Ser Leu Ser Thr Asn Gly Glu
305 310 315 320
Asn Met Gly Leu Ala Val Gln Tyr Leu Asp Pro Arg Gly Arg Leu Arg
325 330 335
Ser Ala Asp Ser Glu Asn Ala Leu Ser Val Gln Glu Arg Asn Val Pro
340 345 350
Thr Lys Ser Pro Ser Ala Pro Ile Asn Trp Arg Arg Gly Lys Leu Leu
355 360 365
Gly Gln Gly Ala Phe Gly Arg Val Tyr Leu Cys Tyr Asp Val Asp Thr
370 375 380
Gly Arg Glu Leu Ala Ser Lys Gln Val Gln Phe Asp Pro Asp Ser Pro
385 390 395 400
Glu Thr Ser Lys Glu Val Ser Ala Leu Glu Cys Glu Ile Gln Leu Leu
405 410 415
Lys Asn Leu Gln His Glu Arg Ile Val Gln Tyr Tyr Gly Cys Leu Arg
420 425 430
Asp Arg Ala Glu Lys Thr Leu Thr Ile Phe Met Glu Tyr Met Pro Gly
435 440 445
Gly Ser Val Lys Asp Gln Leu Lys Ala Tyr Gly Ala Leu Thr Glu Ser
450 455 460
Val Thr Arg Lys Tyr Thr Arg Gln Ile Leu Glu Gly Met Ser Tyr Leu
465 470 475 480
His Ser Asn Met Ile Val His Arg Asp Ile Lys Gly Ala Asn Ile Leu
485 490 495
Arg Asp Ser Ala Gly Asn Val Lys Leu Gly Asp Phe Gly Ala Ser Lys
500 505 510
Arg Leu Gln Thr Ile Cys Met Ser Gly Thr Gly Met Arg Ser Val Thr
515 520 525
Gly Thr Pro Tyr Trp Met Ser Pro Glu Val Ile Ser Gly Glu Gly Tyr
530 535 540
Gly Arg Lys Ala Asp Val Trp Ser Leu Gly Cys Thr Val Val Glu Met
545 550 555 560
Leu Thr Glu Lys Pro Pro Trp Ala Glu Tyr Glu Ala Met Ala Ala Ile
565 570 575
Phe Lys Ile Ala Thr Gln Pro Thr Asn Pro Gln Leu Pro Ser His Ile
580 585 590
Ser Glu His Gly Arg Asp Phe Leu Arg Arg Ile Phe Val Glu Ala Arg
595 600 605
Gln Arg Pro Ser Ala Glu Glu Leu Leu Thr His His Phe Ala Gln Leu
610 615 620
Met Tyr
625
<210> 91
<211> 624
<212> PRT
<213> Artificial Sequence
<220>
<223> MAP3K3 protein fragment
<400> 91
Glu Gln Glu Ala Leu Asn Ser Ile Met Asn Asp Leu Val Ala Leu Gln
1 5 10 15
Met Asn Arg Arg His Arg Met Pro Gly Tyr Glu Thr Met Lys Asn Lys
20 25 30
Asp Thr Gly His Ser Asn Arg Gln Ser Asp Val Arg Ile Lys Phe Glu
35 40 45
His Asn Gly Glu Arg Arg Ile Ile Ala Phe Ser Arg Pro Val Lys Tyr
50 55 60
Glu Asp Val Glu His Lys Val Thr Thr Val Phe Gly Gln Pro Leu Asp
65 70 75 80
Leu His Tyr Met Asn Asn Glu Leu Ser Ile Leu Leu Lys Asn Gln Asp
85 90 95
Asp Leu Asp Lys Ala Ile Asp Ile Leu Asp Arg Ser Ser Ser Met Lys
100 105 110
Ser Leu Arg Ile Leu Leu Leu Ser Gln Asp Arg Asn His Asn Ser Ser
115 120 125
Ser Pro His Ser Gly Val Ser Arg Gln Val Arg Ile Lys Ala Ser Gln
130 135 140
Ser Ala Gly Asp Ile Asn Thr Ile Tyr Gln Pro Pro Glu Pro Arg Ser
145 150 155 160
Arg His Leu Ser Val Ser Ser Gln Asn Pro Gly Arg Ser Ser Pro Pro
165 170 175
Pro Gly Tyr Val Pro Glu Arg Gln Gln His Ile Ala Arg Gln Gly Ser
180 185 190
Tyr Thr Ser Ile Asn Ser Glu Gly Glu Phe Ile Pro Glu Thr Ser Glu
195 200 205
Gln Cys Met Leu Asp Pro Leu Ser Ser Ala Glu Asn Ser Leu Ser Gly
210 215 220
Ser Cys Gln Ser Leu Asp Arg Ser Ala Asp Ser Pro Ser Phe Arg Lys
225 230 235 240
Ser Arg Met Ser Arg Ala Gln Ser Phe Pro Asp Asn Arg Gln Glu Tyr
245 250 255
Ser Asp Arg Glu Thr Gln Leu Tyr Asp Lys Gly Val Lys Gly Gly Thr
260 265 270
Tyr Pro Arg Arg Tyr His Val Ser Val His His Lys Asp Tyr Ser Asp
275 280 285
Gly Arg Arg Thr Phe Pro Arg Ile Arg Arg His Gln Gly Asn Leu Phe
290 295 300
Thr Leu Val Pro Ser Ser Arg Ser Leu Ser Thr Asn Gly Glu Asn Met
305 310 315 320
Gly Leu Ala Val Gln Tyr Leu Asp Pro Arg Gly Arg Leu Arg Ser Ala
325 330 335
Asp Ser Glu Asn Ala Leu Ser Val Gln Glu Arg Asn Val Pro Thr Lys
340 345 350
Ser Pro Ser Ala Pro Ile Asn Trp Arg Arg Gly Lys Leu Leu Gly Gln
355 360 365
Gly Ala Phe Gly Arg Val Tyr Leu Cys Tyr Asp Val Asp Thr Gly Arg
370 375 380
Glu Leu Ala Ser Lys Gln Val Gln Phe Asp Pro Asp Ser Pro Glu Thr
385 390 395 400
Ser Lys Glu Val Ser Ala Leu Glu Cys Glu Ile Gln Leu Leu Lys Asn
405 410 415
Leu Gln His Glu Arg Ile Val Gln Tyr Tyr Gly Cys Leu Arg Asp Arg
420 425 430
Ala Glu Lys Thr Leu Thr Ile Phe Met Glu Tyr Met Pro Gly Gly Ser
435 440 445
Val Lys Asp Gln Leu Lys Ala Tyr Gly Ala Leu Thr Glu Ser Val Thr
450 455 460
Arg Lys Tyr Thr Arg Gln Ile Leu Glu Gly Met Ser Tyr Leu His Ser
465 470 475 480
Asn Met Ile Val His Arg Asp Ile Lys Gly Ala Asn Ile Leu Arg Asp
485 490 495
Ser Ala Gly Asn Val Lys Leu Gly Asp Phe Gly Ala Ser Lys Arg Leu
500 505 510
Gln Thr Ile Cys Met Ser Gly Thr Gly Met Arg Ser Val Thr Gly Thr
515 520 525
Pro Tyr Trp Met Ser Pro Glu Val Ile Ser Gly Glu Gly Tyr Gly Arg
530 535 540
Lys Ala Asp Val Trp Ser Leu Gly Cys Thr Val Val Glu Met Leu Thr
545 550 555 560
Glu Lys Pro Pro Trp Ala Glu Tyr Glu Ala Met Ala Ala Ile Phe Lys
565 570 575
Ile Ala Thr Gln Pro Thr Asn Pro Gln Leu Pro Ser His Ile Ser Glu
580 585 590
His Gly Arg Asp Phe Leu Arg Arg Ile Phe Val Glu Ala Arg Gln Arg
595 600 605
Pro Ser Ala Glu Glu Leu Leu Thr His His Phe Ala Gln Leu Met Tyr
610 615 620
<210> 92
<211> 7
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of MAP3K3 protein fragment
<400> 92
Glu Gln Glu Ala Leu Asn Ser
1 5
<210> 93
<211> 4347
<212> DNA
<213> Artificial Sequence
<220>
<223> BCAS3-MAP3K3 fusion gene
<400> 93
atgaatgaag ctatggctac agattcccca agaagaccca gtcgttgtac tggtggagtt 60
gtggttcgcc cccaggctgt cacagagcag tcctacatgg aaagtgttgt gacttttctg 120
caggatgttg tgccacaggc ttacagtgga acacctctaa cagaagaaaa ggagaaaata 180
gtctgggtca gatttgaaaa tgcagattta aatgatacat caagaaatct ggaatttcat 240
gaaatacata gtactgggaa tgaaccgcct ttgttgatta tgattggcta cagtgatgga 300
atgcaggtct ggagcatccc tatcagtggc gaagcacaag agctcttctc tgttcgacat 360
ggcccaattc gagcggctag aatcttgcct gctccacagt ttggtgctca aaaatgtgat 420
aactttgctg aaaaaagacc cctccttggt gtttgtaaga gcattggatc ttctggcaca 480
agcccaccgt actgttgtgt ggatctgtat tcacttcgta ctggggagat ggtcaagtcc 540
attcaattta agacacctat ttatgatctc cattgcaata aacggatcct tgtcgtagtc 600
ttgcaggaga aaattgctgc ctttgatagc tgtactttca cgaagaaatt ctttgttaca 660
agctgctatc catgtccagg gccaaacatg aatcctattg ctcttgggag ccgctggctt 720
gcttatgcag aaaacaagtt gattcgatgt catcagtccc gtggtggagc ctgtggagac 780
aacattcagt cttatactgc cacagtcatt agtgctgcta aaacattgaa aagtggcctg 840
acaatggtag ggaaagtggt gactcagctg acaggcacac tgccttcagg tgtgacagaa 900
gatgatgttg ccatccacag taattcacgg cggagtcctt tggtcccagg catcatcaca 960
gttattgaca ccgaaaccgt tggagagggc caggtgcttg tgagtgagga ttctgacagt 1020
gatggcattg tggcccactt ccctgcccat gagaagccag tgtgctgcat ggcttttaat 1080
acaagtggaa tgcttctagt cacaacagac acccttggcc atgactttca tgtcttccaa 1140
attctgactc atccttggtc ctcatcacaa tgtgctgtcc accatctgta tactcttcac 1200
aggggagaaa ctgaagccaa agtacaggac atctgcttca gccatgactg tcgctgggtt 1260
gtggtcagta ctctccgggg tacttcccac gttttcccca tcaaccctta tggtggccag 1320
ccttgtgttc gtacacatat gtcaccacga gtagtgaatc gcatgagccg tttccagaaa 1380
agtgctggac tggaagagat tgaacaagaa ctgacgtcta agcaaggagg tcgctgtagc 1440
cctgttccag gtctatcaag cagcccttct gggtcaccct tgcatgggaa actgaacagc 1500
caagactcct ataacaattt taccaacaac aaccctggca accctcggct ctctcctctt 1560
cccagcttga tggtagtgat gcctcttgca caaatcaagc agccaatgac attggggacc 1620
atcaccaaac gaaccgggcc ttatctcttt ggagcggggt gtttttccat aaaagcccca 1680
tgcaaagtta aacctcctcc acaaatttca cccagcaaat cgatgggcgg agaattttgt 1740
gtggctgcta tcttcggaac atccaggtca tggtttgcaa ataatgcagg tctgaaaaga 1800
gaaaaagatc agtccaaaca agttgtagtt gagtccctgt acattatcag ttgctatggc 1860
accttagtgg aacacatgat ggagccgcga cccctcagca ctgcacccaa gattagtgac 1920
gacacaccac tggaaatgat gacatcgcct cgagccagct ggactctggt tagaacccct 1980
caatggaatg aattgcagcc accgtttaat gcaaaccacc ctctgctcct cgctgcagat 2040
gcagtacagt attatcagtt cctgcttgct ggcctggttc cccctggaag tcctgggccc 2100
attactcgac atgggtctta cgacagttta gcttctgacc atagtggaca ggaagatgaa 2160
gaatggcttt cccaggttga aattgtaaca cacactggac cccatagacg tctgtggatg 2220
ggtccacagt tccagttcaa aaccatccat ccctcaggcc aaaccacagt tatctcatcc 2280
agttcatctg tgttgcagtc tcatggtccg agtgacacgc cacagcctct tttggatttt 2340
gatacagatg atcttgatct caacagtctc aggatccagc cagtccgctc tgaccccgtc 2400
agcatgccag ggtcatcccg tccagtctct gatcgaaggg gagtttccac agtgattgat 2460
gctgcctcag acgaacagga ggcattgaac tcaatcatga acgatctggt ggccctccag 2520
atgaaccgac gtcaccggat gcctggatat gagaccatga agaacaaaga cacaggtcac 2580
tcaaataggc agagtgacgt cagaatcaag ttcgagcaca acggggagag gcgaattata 2640
gcgttcagcc ggcctgtgaa atatgaagat gtggagcaca aggtgacaac agtatttgga 2700
caacctcttg atctacatta catgaacaat gagctctcca tcctgctgaa aaaccaagat 2760
gatcttgata aagcaattga cattttagat agaagctcaa gcatgaaaag ccttaggata 2820
ttgctgttgt cccaggacag aaaccataac agttcctctc cccactctgg ggtgtccaga 2880
caggtgcgga tcaaggcttc ccagtccgca ggggatataa atactatcta ccagcccccc 2940
gagcccagaa gcaggcacct ctctgtcagc tcccagaacc ctggccgaag ctcacctccc 3000
cctggctatg ttcctgagcg gcagcagcac attgcccggc aggggtccta caccagcatc 3060
aacagtgagg gggagttcat cccagagacc agcgagcagt gcatgctgga tcccctgagc 3120
agtgcagaaa attccttgtc tggaagctgc caatccttgg acaggtcagc agacagccca 3180
tccttccgga aatcacgaat gtcccgtgcc cagagcttcc ctgacaacag acaggaatac 3240
tcagatcggg aaactcagct ttatgacaaa ggggtcaaag gtggaaccta cccccggcgc 3300
taccacgtgt ctgtgcacca caaggactac agtgatggca gaagaacatt tccccgaata 3360
cggcgtcatc aaggcaactt gttcaccctg gtgccctcca gccgctccct gagcacaaat 3420
ggcgagaaca tgggtctggc tgtgcaatac ctggaccccc gtgggcgcct gcggagtgcg 3480
gacagcgaga atgccctctc tgtgcaggag aggaatgtgc caaccaagtc tcccagtgcc 3540
cccatcaact ggcgccgggg aaagctcctg ggccagggtg ccttcggcag ggtctatttg 3600
tgctatgacg tggacacggg acgtgaactt gcttccaagc aggtccaatt tgatccagac 3660
agtcctgaga caagcaagga ggtgagtgct ctggagtgcg agatccagtt gctaaagaac 3720
ttgcagcatg agcgcatcgt gcagtactat ggctgtctgc gggaccgcgc tgagaagacc 3780
ctgaccatct tcatggagta catgccaggg ggctcggtga aagaccagtt gaaggcttac 3840
ggtgctctga cagagagcgt gacccgaaag tacacgcggc agatcctgga gggcatgtcc 3900
tacctgcaca gcaacatgat tgttcaccgg gacattaagg gagccaacat cctccgagac 3960
tctgctggga atgtaaagct gggggacttt ggggccagca aacgcctgca gacgatctgt 4020
atgtcgggga cgggcatgcg ctccgtcact ggcacaccct actggatgag ccctgaggtg 4080
atcagcggcg agggctatgg aaggaaagca gacgtgtgga gcctgggctg cactgtggtg 4140
gagatgctga cagagaaacc accgtgggca gagtatgaag ctatggccgc catcttcaag 4200
attgccaccc agcccaccaa tcctcagctg ccctcccaca tctctgaaca tggccgggac 4260
ttcctgaggc gcatttttgt ggaggctcgc cagagacctt cagctgagga gctgctcaca 4320
caccactttg cacagctcat gtactga 4347
<210> 94
<211> 45
<212> DNA
<213> Artificial Sequence
<220>
<223> Fused region of BCAS3-MAP3K3 fusion gene
<400> 94
acagtgattg atgctgcctc agacgaacag gaggcattga actca 45
<210> 95
<211> 1448
<212> PRT
<213> Artificial Sequence
<220>
<223> BCAS3-MAP3K3 fusion protein
<400> 95
Met Asn Glu Ala Met Ala Thr Asp Ser Pro Arg Arg Pro Ser Arg Cys
1 5 10 15
Thr Gly Gly Val Val Val Arg Pro Gln Ala Val Thr Glu Gln Ser Tyr
20 25 30
Met Glu Ser Val Val Thr Phe Leu Gln Asp Val Val Pro Gln Ala Tyr
35 40 45
Ser Gly Thr Pro Leu Thr Glu Glu Lys Glu Lys Ile Val Trp Val Arg
50 55 60
Phe Glu Asn Ala Asp Leu Asn Asp Thr Ser Arg Asn Leu Glu Phe His
65 70 75 80
Glu Ile His Ser Thr Gly Asn Glu Pro Pro Leu Leu Ile Met Ile Gly
85 90 95
Tyr Ser Asp Gly Met Gln Val Trp Ser Ile Pro Ile Ser Gly Glu Ala
100 105 110
Gln Glu Leu Phe Ser Val Arg His Gly Pro Ile Arg Ala Ala Arg Ile
115 120 125
Leu Pro Ala Pro Gln Phe Gly Ala Gln Lys Cys Asp Asn Phe Ala Glu
130 135 140
Lys Arg Pro Leu Leu Gly Val Cys Lys Ser Ile Gly Ser Ser Gly Thr
145 150 155 160
Ser Pro Pro Tyr Cys Cys Val Asp Leu Tyr Ser Leu Arg Thr Gly Glu
165 170 175
Met Val Lys Ser Ile Gln Phe Lys Thr Pro Ile Tyr Asp Leu His Cys
180 185 190
Asn Lys Arg Ile Leu Val Val Val Leu Gln Glu Lys Ile Ala Ala Phe
195 200 205
Asp Ser Cys Thr Phe Thr Lys Lys Phe Phe Val Thr Ser Cys Tyr Pro
210 215 220
Cys Pro Gly Pro Asn Met Asn Pro Ile Ala Leu Gly Ser Arg Trp Leu
225 230 235 240
Ala Tyr Ala Glu Asn Lys Leu Ile Arg Cys His Gln Ser Arg Gly Gly
245 250 255
Ala Cys Gly Asp Asn Ile Gln Ser Tyr Thr Ala Thr Val Ile Ser Ala
260 265 270
Ala Lys Thr Leu Lys Ser Gly Leu Thr Met Val Gly Lys Val Val Thr
275 280 285
Gln Leu Thr Gly Thr Leu Pro Ser Gly Val Thr Glu Asp Asp Val Ala
290 295 300
Ile His Ser Asn Ser Arg Arg Ser Pro Leu Val Pro Gly Ile Ile Thr
305 310 315 320
Val Ile Asp Thr Glu Thr Val Gly Glu Gly Gln Val Leu Val Ser Glu
325 330 335
Asp Ser Asp Ser Asp Gly Ile Val Ala His Phe Pro Ala His Glu Lys
340 345 350
Pro Val Cys Cys Met Ala Phe Asn Thr Ser Gly Met Leu Leu Val Thr
355 360 365
Thr Asp Thr Leu Gly His Asp Phe His Val Phe Gln Ile Leu Thr His
370 375 380
Pro Trp Ser Ser Ser Gln Cys Ala Val His His Leu Tyr Thr Leu His
385 390 395 400
Arg Gly Glu Thr Glu Ala Lys Val Gln Asp Ile Cys Phe Ser His Asp
405 410 415
Cys Arg Trp Val Val Val Ser Thr Leu Arg Gly Thr Ser His Val Phe
420 425 430
Pro Ile Asn Pro Tyr Gly Gly Gln Pro Cys Val Arg Thr His Met Ser
435 440 445
Pro Arg Val Val Asn Arg Met Ser Arg Phe Gln Lys Ser Ala Gly Leu
450 455 460
Glu Glu Ile Glu Gln Glu Leu Thr Ser Lys Gln Gly Gly Arg Cys Ser
465 470 475 480
Pro Val Pro Gly Leu Ser Ser Ser Pro Ser Gly Ser Pro Leu His Gly
485 490 495
Lys Leu Asn Ser Gln Asp Ser Tyr Asn Asn Phe Thr Asn Asn Asn Pro
500 505 510
Gly Asn Pro Arg Leu Ser Pro Leu Pro Ser Leu Met Val Val Met Pro
515 520 525
Leu Ala Gln Ile Lys Gln Pro Met Thr Leu Gly Thr Ile Thr Lys Arg
530 535 540
Thr Gly Pro Tyr Leu Phe Gly Ala Gly Cys Phe Ser Ile Lys Ala Pro
545 550 555 560
Cys Lys Val Lys Pro Pro Pro Gln Ile Ser Pro Ser Lys Ser Met Gly
565 570 575
Gly Glu Phe Cys Val Ala Ala Ile Phe Gly Thr Ser Arg Ser Trp Phe
580 585 590
Ala Asn Asn Ala Gly Leu Lys Arg Glu Lys Asp Gln Ser Lys Gln Val
595 600 605
Val Val Glu Ser Leu Tyr Ile Ile Ser Cys Tyr Gly Thr Leu Val Glu
610 615 620
His Met Met Glu Pro Arg Pro Leu Ser Thr Ala Pro Lys Ile Ser Asp
625 630 635 640
Asp Thr Pro Leu Glu Met Met Thr Ser Pro Arg Ala Ser Trp Thr Leu
645 650 655
Val Arg Thr Pro Gln Trp Asn Glu Leu Gln Pro Pro Phe Asn Ala Asn
660 665 670
His Pro Leu Leu Leu Ala Ala Asp Ala Val Gln Tyr Tyr Gln Phe Leu
675 680 685
Leu Ala Gly Leu Val Pro Pro Gly Ser Pro Gly Pro Ile Thr Arg His
690 695 700
Gly Ser Tyr Asp Ser Leu Ala Ser Asp His Ser Gly Gln Glu Asp Glu
705 710 715 720
Glu Trp Leu Ser Gln Val Glu Ile Val Thr His Thr Gly Pro His Arg
725 730 735
Arg Leu Trp Met Gly Pro Gln Phe Gln Phe Lys Thr Ile His Pro Ser
740 745 750
Gly Gln Thr Thr Val Ile Ser Ser Ser Ser Ser Val Leu Gln Ser His
755 760 765
Gly Pro Ser Asp Thr Pro Gln Pro Leu Leu Asp Phe Asp Thr Asp Asp
770 775 780
Leu Asp Leu Asn Ser Leu Arg Ile Gln Pro Val Arg Ser Asp Pro Val
785 790 795 800
Ser Met Pro Gly Ser Ser Arg Pro Val Ser Asp Arg Arg Gly Val Ser
805 810 815
Thr Val Ile Asp Ala Ala Ser Asp Glu Gln Glu Ala Leu Asn Ser Ile
820 825 830
Met Asn Asp Leu Val Ala Leu Gln Met Asn Arg Arg His Arg Met Pro
835 840 845
Gly Tyr Glu Thr Met Lys Asn Lys Asp Thr Gly His Ser Asn Arg Gln
850 855 860
Ser Asp Val Arg Ile Lys Phe Glu His Asn Gly Glu Arg Arg Ile Ile
865 870 875 880
Ala Phe Ser Arg Pro Val Lys Tyr Glu Asp Val Glu His Lys Val Thr
885 890 895
Thr Val Phe Gly Gln Pro Leu Asp Leu His Tyr Met Asn Asn Glu Leu
900 905 910
Ser Ile Leu Leu Lys Asn Gln Asp Asp Leu Asp Lys Ala Ile Asp Ile
915 920 925
Leu Asp Arg Ser Ser Ser Met Lys Ser Leu Arg Ile Leu Leu Leu Ser
930 935 940
Gln Asp Arg Asn His Asn Ser Ser Ser Pro His Ser Gly Val Ser Arg
945 950 955 960
Gln Val Arg Ile Lys Ala Ser Gln Ser Ala Gly Asp Ile Asn Thr Ile
965 970 975
Tyr Gln Pro Pro Glu Pro Arg Ser Arg His Leu Ser Val Ser Ser Gln
980 985 990
Asn Pro Gly Arg Ser Ser Pro Pro Pro Gly Tyr Val Pro Glu Arg Gln
995 1000 1005
Gln His Ile Ala Arg Gln Gly Ser Tyr Thr Ser Ile Asn Ser Glu Gly
1010 1015 1020
Glu Phe Ile Pro Glu Thr Ser Glu Gln Cys Met Leu Asp Pro Leu Ser
1025 1030 1035 1040
Ser Ala Glu Asn Ser Leu Ser Gly Ser Cys Gln Ser Leu Asp Arg Ser
1045 1050 1055
Ala Asp Ser Pro Ser Phe Arg Lys Ser Arg Met Ser Arg Ala Gln Ser
1060 1065 1070
Phe Pro Asp Asn Arg Gln Glu Tyr Ser Asp Arg Glu Thr Gln Leu Tyr
1075 1080 1085
Asp Lys Gly Val Lys Gly Gly Thr Tyr Pro Arg Arg Tyr His Val Ser
1090 1095 1100
Val His His Lys Asp Tyr Ser Asp Gly Arg Arg Thr Phe Pro Arg Ile
1105 1110 1115 1120
Arg Arg His Gln Gly Asn Leu Phe Thr Leu Val Pro Ser Ser Arg Ser
1125 1130 1135
Leu Ser Thr Asn Gly Glu Asn Met Gly Leu Ala Val Gln Tyr Leu Asp
1140 1145 1150
Pro Arg Gly Arg Leu Arg Ser Ala Asp Ser Glu Asn Ala Leu Ser Val
1155 1160 1165
Gln Glu Arg Asn Val Pro Thr Lys Ser Pro Ser Ala Pro Ile Asn Trp
1170 1175 1180
Arg Arg Gly Lys Leu Leu Gly Gln Gly Ala Phe Gly Arg Val Tyr Leu
1185 1190 1195 1200
Cys Tyr Asp Val Asp Thr Gly Arg Glu Leu Ala Ser Lys Gln Val Gln
1205 1210 1215
Phe Asp Pro Asp Ser Pro Glu Thr Ser Lys Glu Val Ser Ala Leu Glu
1220 1225 1230
Cys Glu Ile Gln Leu Leu Lys Asn Leu Gln His Glu Arg Ile Val Gln
1235 1240 1245
Tyr Tyr Gly Cys Leu Arg Asp Arg Ala Glu Lys Thr Leu Thr Ile Phe
1250 1255 1260
Met Glu Tyr Met Pro Gly Gly Ser Val Lys Asp Gln Leu Lys Ala Tyr
1265 1270 1275 1280
Gly Ala Leu Thr Glu Ser Val Thr Arg Lys Tyr Thr Arg Gln Ile Leu
1285 1290 1295
Glu Gly Met Ser Tyr Leu His Ser Asn Met Ile Val His Arg Asp Ile
1300 1305 1310
Lys Gly Ala Asn Ile Leu Arg Asp Ser Ala Gly Asn Val Lys Leu Gly
1315 1320 1325
Asp Phe Gly Ala Ser Lys Arg Leu Gln Thr Ile Cys Met Ser Gly Thr
1330 1335 1340
Gly Met Arg Ser Val Thr Gly Thr Pro Tyr Trp Met Ser Pro Glu Val
1345 1350 1355 1360
Ile Ser Gly Glu Gly Tyr Gly Arg Lys Ala Asp Val Trp Ser Leu Gly
1365 1370 1375
Cys Thr Val Val Glu Met Leu Thr Glu Lys Pro Pro Trp Ala Glu Tyr
1380 1385 1390
Glu Ala Met Ala Ala Ile Phe Lys Ile Ala Thr Gln Pro Thr Asn Pro
1395 1400 1405
Gln Leu Pro Ser His Ile Ser Glu His Gly Arg Asp Phe Leu Arg Arg
1410 1415 1420
Ile Phe Val Glu Ala Arg Gln Arg Pro Ser Ala Glu Glu Leu Leu Thr
1425 1430 1435 1440
His His Phe Ala Gln Leu Met Tyr
1445
<210> 96
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Fused region of BCAS3-MAP3K3 fusion protein
<400> 96
Thr Val Ile Asp Ala Ala Ser Asp Glu Gln Glu Ala Leu Asn Ser
1 5 10 15
<210> 97
<211> 567
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of KRAS gene (NM_004985)
<400> 97
atgactgaat ataaacttgt ggtagttgga gctggtggcg taggcaagag tgccttgacg 60
atacagctaa ttcagaatca ttttgtggac gaatatgatc caacaataga ggattcctac 120
aggaagcaag tagtaattga tggagaaacc tgtctcttgg atattctcga cacagcaggt 180
caagaggagt acagtgcaat gagggaccag tacatgagga ctggggaggg ctttctttgt 240
gtatttgcca taaataatac taaatcattt gaagatattc accattatag agaacaaatt 300
aaaagagtta aggactctga agatgtacct atggtcctag taggaaataa atgtgatttg 360
ccttctagaa cagtagacac aaaacaggct caggacttag caagaagtta tggaattcct 420
tttattgaaa catcagcaaa gacaagacag ggtgttgatg atgccttcta tacattagtt 480
cgagaaattc gaaaacataa agaaaagatg agcaaagatg gtaaaaagaa gaaaaagaag 540
tcaaagacaa agtgtgtaat tatgtaa 567
<210> 98
<211> 450
<212> DNA
<213> Artificial Sequence
<220>
<223> KRAS gene fragment
<400> 98
atgactgaat ataaacttgt ggtagttgga gctggtggcg taggcaagag tgccttgacg 60
atacagctaa ttcagaatca ttttgtggac gaatatgatc caacaataga ggattcctac 120
aggaagcaag tagtaattga tggagaaacc tgtctcttgg atattctcga cacagcaggt 180
caagaggagt acagtgcaat gagggaccag tacatgagga ctggggaggg ctttctttgt 240
gtatttgcca taaataatac taaatcattt gaagatattc accattatag agaacaaatt 300
aaaagagtta aggactctga agatgtacct atggtcctag taggaaataa atgtgatttg 360
ccttctagaa cagtagacac aaaacaggct caggacttag caagaagtta tggaattcct 420
tttattgaaa catcagcaaa gacaagacag 450
<210> 99
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of KRAS gene fragment
<400> 99
acatcagcaa agacaagaca g 21
<210> 100
<211> 188
<212> PRT
<213> Artificial Sequence
<220>
<223> KRAS protein
<400> 100
Met Thr Glu Tyr Lys Leu Val Val Val Gly Ala Gly Gly Val Gly Lys
1 5 10 15
Ser Ala Leu Thr Ile Gln Leu Ile Gln Asn His Phe Val Asp Glu Tyr
20 25 30
Asp Pro Thr Ile Glu Asp Ser Tyr Arg Lys Gln Val Val Ile Asp Gly
35 40 45
Glu Thr Cys Leu Leu Asp Ile Leu Asp Thr Ala Gly Gln Glu Glu Tyr
50 55 60
Ser Ala Met Arg Asp Gln Tyr Met Arg Thr Gly Glu Gly Phe Leu Cys
65 70 75 80
Val Phe Ala Ile Asn Asn Thr Lys Ser Phe Glu Asp Ile His His Tyr
85 90 95
Arg Glu Gln Ile Lys Arg Val Lys Asp Ser Glu Asp Val Pro Met Val
100 105 110
Leu Val Gly Asn Lys Cys Asp Leu Pro Ser Arg Thr Val Asp Thr Lys
115 120 125
Gln Ala Gln Asp Leu Ala Arg Ser Tyr Gly Ile Pro Phe Ile Glu Thr
130 135 140
Ser Ala Lys Thr Arg Gln Gly Val Asp Asp Ala Phe Tyr Thr Leu Val
145 150 155 160
Arg Glu Ile Arg Lys His Lys Glu Lys Met Ser Lys Asp Gly Lys Lys
165 170 175
Lys Lys Lys Lys Ser Lys Thr Lys Cys Val Ile Met
180 185
<210> 101
<211> 150
<212> PRT
<213> Artificial Sequence
<220>
<223> KRAS protein fragment
<400> 101
Met Thr Glu Tyr Lys Leu Val Val Val Gly Ala Gly Gly Val Gly Lys
1 5 10 15
Ser Ala Leu Thr Ile Gln Leu Ile Gln Asn His Phe Val Asp Glu Tyr
20 25 30
Asp Pro Thr Ile Glu Asp Ser Tyr Arg Lys Gln Val Val Ile Asp Gly
35 40 45
Glu Thr Cys Leu Leu Asp Ile Leu Asp Thr Ala Gly Gln Glu Glu Tyr
50 55 60
Ser Ala Met Arg Asp Gln Tyr Met Arg Thr Gly Glu Gly Phe Leu Cys
65 70 75 80
Val Phe Ala Ile Asn Asn Thr Lys Ser Phe Glu Asp Ile His His Tyr
85 90 95
Arg Glu Gln Ile Lys Arg Val Lys Asp Ser Glu Asp Val Pro Met Val
100 105 110
Leu Val Gly Asn Lys Cys Asp Leu Pro Ser Arg Thr Val Asp Thr Lys
115 120 125
Gln Ala Gln Asp Leu Ala Arg Ser Tyr Gly Ile Pro Phe Ile Glu Thr
130 135 140
Ser Ala Lys Thr Arg Gln
145 150
<210> 102
<211> 7
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of KRAS protein fragment
<400> 102
Thr Ser Ala Lys Thr Arg Gln
1 5
<210> 103
<211> 2142
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of CDH13 gene (NM_001257)
<400> 103
atgcagccga gaactccgct cgttctgtgc gttctcctgt cccaggtgct gctgctaaca 60
tctgcagaag atttggactg cactcctgga tttcagcaga aagtgttcca tatcaatcag 120
ccagctgaat tcattgagga ccagtcaatt ctaaacttga ccttcagtga ctgtaaggga 180
aacgacaagc tacgctatga ggtctcgagc ccatacttca aggtgaacag cgatggcggc 240
ttagttgctc tgagaaacat aactgcagtg ggcaaaactc tgttcgtcca tgcacggacc 300
ccccatgcgg aagatatggc agaactcgtg attgtcgggg ggaaagacat ccagggctcc 360
ttgcaggata tatttaaatt tgcaagaact tctcctgtcc caagacaaaa gaggtccatt 420
gtggtatctc ccattttaat tccagagaat cagagacagc ctttcccaag agatgttggc 480
aaggtagtcg atagtgacag gccagaaagg tccaagttcc ggctcactgg aaagggagtg 540
gatcaagagc ctaaaggaat tttcagaatc aatgagaaca cagggagcgt ctccgtgaca 600
cggaccttgg acagagaagt aatcgctgtt tatcaactat ttgtggagac cactgatgtc 660
aatggcaaaa ctctcgaggg gccggtgcct ctggaagtca ttgtgattga tcagaatgac 720
aaccgaccga tctttcggga aggcccctac atcggccacg tcatggaagg gtcacccaca 780
ggcaccacag tgatgcggat gacagccttt gatgcagatg acccagccac cgataatgcc 840
ctcctgcggt ataatatccg tcagcagacg cctgacaagc catctcccaa catgttctac 900
atcgatcctg agaaaggaga cattgtcact gttgtgtcac ctgcgctgct ggaccgagag 960
actctggaaa atcccaagta tgaactgatc atcgaggctc aagatatggc tggactggat 1020
gttggattaa caggcacggc cacagccacg atcatgatcg atgacaaaaa tgatcactca 1080
ccaaaattca ccaagaaaga gtttcaagcc acagtcgagg aaggagctgt gggagttatt 1140
gtcaatttga cagttgaaga taaggatgac cccaccacag gtgcatggag ggctgcctac 1200
accatcatca acggaaaccc cgggcagagc tttgaaatcc acaccaaccc tcaaaccaac 1260
gaagggatgc tttctgttgt caaaccattg gactatgaaa tttctgcctt ccacaccctg 1320
ctgatcaaag tggaaaatga agacccactc gtacccgacg tctcctacgg ccccagctcc 1380
acagccaccg tccacatcac tgtcctggat gtcaacgagg gcccagtctt ctacccagac 1440
cccatgatgg tgaccaggca ggaggacctc tctgtgggca gcgtgctgct gacagtgaat 1500
gccacggacc ccgactccct gcagcatcaa accatcaggt attctgttta caaggaccca 1560
gcaggttggc tgaatattaa ccccatcaat gggactgttg acaccacagc tgtgctggac 1620
cgtgagtccc catttgtcga caacagcgtg tacactgctc tcttcctggc aattgacagt 1680
ggcaaccctc ccgctacggg cactgggact ttgctgataa ccctggagga cgtgaatgac 1740
aatgccccgt tcatttaccc cacagtagct gaagtctgtg atgatgccaa aaacctcagt 1800
gtagtcattt tgggagcatc agataaggat cttcacccga atacagatcc tttcaaattt 1860
gaaatccaca aacaagctgt tcctgataaa gtctggaaga tctccaagat caacaataca 1920
cacgccctgg taagccttct tcaaaatctg aacaaagcaa actacaacct gcccatcatg 1980
gtgacagatt cagggaaacc acccatgacg aatatcacag atctcagggt acaagtgtgc 2040
tcctgcagga attccaaagt ggactgcaac gcggcagggg ccctgcgctt cagcctgccc 2100
tcagtcctgc tcctcagcct cttcagctta gcttgtctgt ga 2142
<210> 104
<211> 1776
<212> DNA
<213> Artificial Sequence
<220>
<223> CDH13 gene fragment
<400> 104
gatatattta aatttgcaag aacttctcct gtcccaagac aaaagaggtc cattgtggta 60
tctcccattt taattccaga gaatcagaga cagcctttcc caagagatgt tggcaaggta 120
gtcgatagtg acaggccaga aaggtccaag ttccggctca ctggaaaggg agtggatcaa 180
gagcctaaag gaattttcag aatcaatgag aacacaggga gcgtctccgt gacacggacc 240
ttggacagag aagtaatcgc tgtttatcaa ctatttgtgg agaccactga tgtcaatggc 300
aaaactctcg aggggccggt gcctctggaa gtcattgtga ttgatcagaa tgacaaccga 360
ccgatctttc gggaaggccc ctacatcggc cacgtcatgg aagggtcacc cacaggcacc 420
acagtgatgc ggatgacagc ctttgatgca gatgacccag ccaccgataa tgccctcctg 480
cggtataata tccgtcagca gacgcctgac aagccatctc ccaacatgtt ctacatcgat 540
cctgagaaag gagacattgt cactgttgtg tcacctgcgc tgctggaccg agagactctg 600
gaaaatccca agtatgaact gatcatcgag gctcaagata tggctggact ggatgttgga 660
ttaacaggca cggccacagc cacgatcatg atcgatgaca aaaatgatca ctcaccaaaa 720
ttcaccaaga aagagtttca agccacagtc gaggaaggag ctgtgggagt tattgtcaat 780
ttgacagttg aagataagga tgaccccacc acaggtgcat ggagggctgc ctacaccatc 840
atcaacggaa accccgggca gagctttgaa atccacacca accctcaaac caacgaaggg 900
atgctttctg ttgtcaaacc attggactat gaaatttctg ccttccacac cctgctgatc 960
aaagtggaaa atgaagaccc actcgtaccc gacgtctcct acggccccag ctccacagcc 1020
accgtccaca tcactgtcct ggatgtcaac gagggcccag tcttctaccc agaccccatg 1080
atggtgacca ggcaggagga cctctctgtg ggcagcgtgc tgctgacagt gaatgccacg 1140
gaccccgact ccctgcagca tcaaaccatc aggtattctg tttacaagga cccagcaggt 1200
tggctgaata ttaaccccat caatgggact gttgacacca cagctgtgct ggaccgtgag 1260
tccccatttg tcgacaacag cgtgtacact gctctcttcc tggcaattga cagtggcaac 1320
cctcccgcta cgggcactgg gactttgctg ataaccctgg aggacgtgaa tgacaatgcc 1380
ccgttcattt accccacagt agctgaagtc tgtgatgatg ccaaaaacct cagtgtagtc 1440
attttgggag catcagataa ggatcttcac ccgaatacag atcctttcaa atttgaaatc 1500
cacaaacaag ctgttcctga taaagtctgg aagatctcca agatcaacaa tacacacgcc 1560
ctggtaagcc ttcttcaaaa tctgaacaaa gcaaactaca acctgcccat catggtgaca 1620
gattcaggga aaccacccat gacgaatatc acagatctca gggtacaagt gtgctcctgc 1680
aggaattcca aagtggactg caacgcggca ggggccctgc gcttcagcct gccctcagtc 1740
ctgctcctca gcctcttcag cttagcttgt ctgtga 1776
<210> 105
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of CDH13 gene fragment
<400> 105
gatatattta aatttgcaag a 21
<210> 106
<211> 713
<212> PRT
<213> Artificial Sequence
<220>
<223> CDH13 protein
<400> 106
Met Gln Pro Arg Thr Pro Leu Val Leu Cys Val Leu Leu Ser Gln Val
1 5 10 15
Leu Leu Leu Thr Ser Ala Glu Asp Leu Asp Cys Thr Pro Gly Phe Gln
20 25 30
Gln Lys Val Phe His Ile Asn Gln Pro Ala Glu Phe Ile Glu Asp Gln
35 40 45
Ser Ile Leu Asn Leu Thr Phe Ser Asp Cys Lys Gly Asn Asp Lys Leu
50 55 60
Arg Tyr Glu Val Ser Ser Pro Tyr Phe Lys Val Asn Ser Asp Gly Gly
65 70 75 80
Leu Val Ala Leu Arg Asn Ile Thr Ala Val Gly Lys Thr Leu Phe Val
85 90 95
His Ala Arg Thr Pro His Ala Glu Asp Met Ala Glu Leu Val Ile Val
100 105 110
Gly Gly Lys Asp Ile Gln Gly Ser Leu Gln Asp Ile Phe Lys Phe Ala
115 120 125
Arg Thr Ser Pro Val Pro Arg Gln Lys Arg Ser Ile Val Val Ser Pro
130 135 140
Ile Leu Ile Pro Glu Asn Gln Arg Gln Pro Phe Pro Arg Asp Val Gly
145 150 155 160
Lys Val Val Asp Ser Asp Arg Pro Glu Arg Ser Lys Phe Arg Leu Thr
165 170 175
Gly Lys Gly Val Asp Gln Glu Pro Lys Gly Ile Phe Arg Ile Asn Glu
180 185 190
Asn Thr Gly Ser Val Ser Val Thr Arg Thr Leu Asp Arg Glu Val Ile
195 200 205
Ala Val Tyr Gln Leu Phe Val Glu Thr Thr Asp Val Asn Gly Lys Thr
210 215 220
Leu Glu Gly Pro Val Pro Leu Glu Val Ile Val Ile Asp Gln Asn Asp
225 230 235 240
Asn Arg Pro Ile Phe Arg Glu Gly Pro Tyr Ile Gly His Val Met Glu
245 250 255
Gly Ser Pro Thr Gly Thr Thr Val Met Arg Met Thr Ala Phe Asp Ala
260 265 270
Asp Asp Pro Ala Thr Asp Asn Ala Leu Leu Arg Tyr Asn Ile Arg Gln
275 280 285
Gln Thr Pro Asp Lys Pro Ser Pro Asn Met Phe Tyr Ile Asp Pro Glu
290 295 300
Lys Gly Asp Ile Val Thr Val Val Ser Pro Ala Leu Leu Asp Arg Glu
305 310 315 320
Thr Leu Glu Asn Pro Lys Tyr Glu Leu Ile Ile Glu Ala Gln Asp Met
325 330 335
Ala Gly Leu Asp Val Gly Leu Thr Gly Thr Ala Thr Ala Thr Ile Met
340 345 350
Ile Asp Asp Lys Asn Asp His Ser Pro Lys Phe Thr Lys Lys Glu Phe
355 360 365
Gln Ala Thr Val Glu Glu Gly Ala Val Gly Val Ile Val Asn Leu Thr
370 375 380
Val Glu Asp Lys Asp Asp Pro Thr Thr Gly Ala Trp Arg Ala Ala Tyr
385 390 395 400
Thr Ile Ile Asn Gly Asn Pro Gly Gln Ser Phe Glu Ile His Thr Asn
405 410 415
Pro Gln Thr Asn Glu Gly Met Leu Ser Val Val Lys Pro Leu Asp Tyr
420 425 430
Glu Ile Ser Ala Phe His Thr Leu Leu Ile Lys Val Glu Asn Glu Asp
435 440 445
Pro Leu Val Pro Asp Val Ser Tyr Gly Pro Ser Ser Thr Ala Thr Val
450 455 460
His Ile Thr Val Leu Asp Val Asn Glu Gly Pro Val Phe Tyr Pro Asp
465 470 475 480
Pro Met Met Val Thr Arg Gln Glu Asp Leu Ser Val Gly Ser Val Leu
485 490 495
Leu Thr Val Asn Ala Thr Asp Pro Asp Ser Leu Gln His Gln Thr Ile
500 505 510
Arg Tyr Ser Val Tyr Lys Asp Pro Ala Gly Trp Leu Asn Ile Asn Pro
515 520 525
Ile Asn Gly Thr Val Asp Thr Thr Ala Val Leu Asp Arg Glu Ser Pro
530 535 540
Phe Val Asp Asn Ser Val Tyr Thr Ala Leu Phe Leu Ala Ile Asp Ser
545 550 555 560
Gly Asn Pro Pro Ala Thr Gly Thr Gly Thr Leu Leu Ile Thr Leu Glu
565 570 575
Asp Val Asn Asp Asn Ala Pro Phe Ile Tyr Pro Thr Val Ala Glu Val
580 585 590
Cys Asp Asp Ala Lys Asn Leu Ser Val Val Ile Leu Gly Ala Ser Asp
595 600 605
Lys Asp Leu His Pro Asn Thr Asp Pro Phe Lys Phe Glu Ile His Lys
610 615 620
Gln Ala Val Pro Asp Lys Val Trp Lys Ile Ser Lys Ile Asn Asn Thr
625 630 635 640
His Ala Leu Val Ser Leu Leu Gln Asn Leu Asn Lys Ala Asn Tyr Asn
645 650 655
Leu Pro Ile Met Val Thr Asp Ser Gly Lys Pro Pro Met Thr Asn Ile
660 665 670
Thr Asp Leu Arg Val Gln Val Cys Ser Cys Arg Asn Ser Lys Val Asp
675 680 685
Cys Asn Ala Ala Gly Ala Leu Arg Phe Ser Leu Pro Ser Val Leu Leu
690 695 700
Leu Ser Leu Phe Ser Leu Ala Cys Leu
705 710
<210> 107
<211> 591
<212> PRT
<213> Artificial Sequence
<220>
<223> CDH13 protein fragment
<400> 107
Asp Ile Phe Lys Phe Ala Arg Thr Ser Pro Val Pro Arg Gln Lys Arg
1 5 10 15
Ser Ile Val Val Ser Pro Ile Leu Ile Pro Glu Asn Gln Arg Gln Pro
20 25 30
Phe Pro Arg Asp Val Gly Lys Val Val Asp Ser Asp Arg Pro Glu Arg
35 40 45
Ser Lys Phe Arg Leu Thr Gly Lys Gly Val Asp Gln Glu Pro Lys Gly
50 55 60
Ile Phe Arg Ile Asn Glu Asn Thr Gly Ser Val Ser Val Thr Arg Thr
65 70 75 80
Leu Asp Arg Glu Val Ile Ala Val Tyr Gln Leu Phe Val Glu Thr Thr
85 90 95
Asp Val Asn Gly Lys Thr Leu Glu Gly Pro Val Pro Leu Glu Val Ile
100 105 110
Val Ile Asp Gln Asn Asp Asn Arg Pro Ile Phe Arg Glu Gly Pro Tyr
115 120 125
Ile Gly His Val Met Glu Gly Ser Pro Thr Gly Thr Thr Val Met Arg
130 135 140
Met Thr Ala Phe Asp Ala Asp Asp Pro Ala Thr Asp Asn Ala Leu Leu
145 150 155 160
Arg Tyr Asn Ile Arg Gln Gln Thr Pro Asp Lys Pro Ser Pro Asn Met
165 170 175
Phe Tyr Ile Asp Pro Glu Lys Gly Asp Ile Val Thr Val Val Ser Pro
180 185 190
Ala Leu Leu Asp Arg Glu Thr Leu Glu Asn Pro Lys Tyr Glu Leu Ile
195 200 205
Ile Glu Ala Gln Asp Met Ala Gly Leu Asp Val Gly Leu Thr Gly Thr
210 215 220
Ala Thr Ala Thr Ile Met Ile Asp Asp Lys Asn Asp His Ser Pro Lys
225 230 235 240
Phe Thr Lys Lys Glu Phe Gln Ala Thr Val Glu Glu Gly Ala Val Gly
245 250 255
Val Ile Val Asn Leu Thr Val Glu Asp Lys Asp Asp Pro Thr Thr Gly
260 265 270
Ala Trp Arg Ala Ala Tyr Thr Ile Ile Asn Gly Asn Pro Gly Gln Ser
275 280 285
Phe Glu Ile His Thr Asn Pro Gln Thr Asn Glu Gly Met Leu Ser Val
290 295 300
Val Lys Pro Leu Asp Tyr Glu Ile Ser Ala Phe His Thr Leu Leu Ile
305 310 315 320
Lys Val Glu Asn Glu Asp Pro Leu Val Pro Asp Val Ser Tyr Gly Pro
325 330 335
Ser Ser Thr Ala Thr Val His Ile Thr Val Leu Asp Val Asn Glu Gly
340 345 350
Pro Val Phe Tyr Pro Asp Pro Met Met Val Thr Arg Gln Glu Asp Leu
355 360 365
Ser Val Gly Ser Val Leu Leu Thr Val Asn Ala Thr Asp Pro Asp Ser
370 375 380
Leu Gln His Gln Thr Ile Arg Tyr Ser Val Tyr Lys Asp Pro Ala Gly
385 390 395 400
Trp Leu Asn Ile Asn Pro Ile Asn Gly Thr Val Asp Thr Thr Ala Val
405 410 415
Leu Asp Arg Glu Ser Pro Phe Val Asp Asn Ser Val Tyr Thr Ala Leu
420 425 430
Phe Leu Ala Ile Asp Ser Gly Asn Pro Pro Ala Thr Gly Thr Gly Thr
435 440 445
Leu Leu Ile Thr Leu Glu Asp Val Asn Asp Asn Ala Pro Phe Ile Tyr
450 455 460
Pro Thr Val Ala Glu Val Cys Asp Asp Ala Lys Asn Leu Ser Val Val
465 470 475 480
Ile Leu Gly Ala Ser Asp Lys Asp Leu His Pro Asn Thr Asp Pro Phe
485 490 495
Lys Phe Glu Ile His Lys Gln Ala Val Pro Asp Lys Val Trp Lys Ile
500 505 510
Ser Lys Ile Asn Asn Thr His Ala Leu Val Ser Leu Leu Gln Asn Leu
515 520 525
Asn Lys Ala Asn Tyr Asn Leu Pro Ile Met Val Thr Asp Ser Gly Lys
530 535 540
Pro Pro Met Thr Asn Ile Thr Asp Leu Arg Val Gln Val Cys Ser Cys
545 550 555 560
Arg Asn Ser Lys Val Asp Cys Asn Ala Ala Gly Ala Leu Arg Phe Ser
565 570 575
Leu Pro Ser Val Leu Leu Leu Ser Leu Phe Ser Leu Ala Cys Leu
580 585 590
<210> 108
<211> 7
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of CDH13 protein fragment
<400> 108
Asp Ile Phe Lys Phe Ala Arg
1 5
<210> 109
<211> 2226
<212> DNA
<213> Artificial Sequence
<220>
<223> KRAS-CDH13 fusion gene
<400> 109
atgactgaat ataaacttgt ggtagttgga gctggtggcg taggcaagag tgccttgacg 60
atacagctaa ttcagaatca ttttgtggac gaatatgatc caacaataga ggattcctac 120
aggaagcaag tagtaattga tggagaaacc tgtctcttgg atattctcga cacagcaggt 180
caagaggagt acagtgcaat gagggaccag tacatgagga ctggggaggg ctttctttgt 240
gtatttgcca taaataatac taaatcattt gaagatattc accattatag agaacaaatt 300
aaaagagtta aggactctga agatgtacct atggtcctag taggaaataa atgtgatttg 360
ccttctagaa cagtagacac aaaacaggct caggacttag caagaagtta tggaattcct 420
tttattgaaa catcagcaaa gacaagacag gatatattta aatttgcaag aacttctcct 480
gtcccaagac aaaagaggtc cattgtggta tctcccattt taattccaga gaatcagaga 540
cagcctttcc caagagatgt tggcaaggta gtcgatagtg acaggccaga aaggtccaag 600
ttccggctca ctggaaaggg agtggatcaa gagcctaaag gaattttcag aatcaatgag 660
aacacaggga gcgtctccgt gacacggacc ttggacagag aagtaatcgc tgtttatcaa 720
ctatttgtgg agaccactga tgtcaatggc aaaactctcg aggggccggt gcctctggaa 780
gtcattgtga ttgatcagaa tgacaaccga ccgatctttc gggaaggccc ctacatcggc 840
cacgtcatgg aagggtcacc cacaggcacc acagtgatgc ggatgacagc ctttgatgca 900
gatgacccag ccaccgataa tgccctcctg cggtataata tccgtcagca gacgcctgac 960
aagccatctc ccaacatgtt ctacatcgat cctgagaaag gagacattgt cactgttgtg 1020
tcacctgcgc tgctggaccg agagactctg gaaaatccca agtatgaact gatcatcgag 1080
gctcaagata tggctggact ggatgttgga ttaacaggca cggccacagc cacgatcatg 1140
atcgatgaca aaaatgatca ctcaccaaaa ttcaccaaga aagagtttca agccacagtc 1200
gaggaaggag ctgtgggagt tattgtcaat ttgacagttg aagataagga tgaccccacc 1260
acaggtgcat ggagggctgc ctacaccatc atcaacggaa accccgggca gagctttgaa 1320
atccacacca accctcaaac caacgaaggg atgctttctg ttgtcaaacc attggactat 1380
gaaatttctg ccttccacac cctgctgatc aaagtggaaa atgaagaccc actcgtaccc 1440
gacgtctcct acggccccag ctccacagcc accgtccaca tcactgtcct ggatgtcaac 1500
gagggcccag tcttctaccc agaccccatg atggtgacca ggcaggagga cctctctgtg 1560
ggcagcgtgc tgctgacagt gaatgccacg gaccccgact ccctgcagca tcaaaccatc 1620
aggtattctg tttacaagga cccagcaggt tggctgaata ttaaccccat caatgggact 1680
gttgacacca cagctgtgct ggaccgtgag tccccatttg tcgacaacag cgtgtacact 1740
gctctcttcc tggcaattga cagtggcaac cctcccgcta cgggcactgg gactttgctg 1800
ataaccctgg aggacgtgaa tgacaatgcc ccgttcattt accccacagt agctgaagtc 1860
tgtgatgatg ccaaaaacct cagtgtagtc attttgggag catcagataa ggatcttcac 1920
ccgaatacag atcctttcaa atttgaaatc cacaaacaag ctgttcctga taaagtctgg 1980
aagatctcca agatcaacaa tacacacgcc ctggtaagcc ttcttcaaaa tctgaacaaa 2040
gcaaactaca acctgcccat catggtgaca gattcaggga aaccacccat gacgaatatc 2100
acagatctca gggtacaagt gtgctcctgc aggaattcca aagtggactg caacgcggca 2160
ggggccctgc gcttcagcct gccctcagtc ctgctcctca gcctcttcag cttagcttgt 2220
ctgtga 2226
<210> 110
<211> 42
<212> DNA
<213> Artificial Sequence
<220>
<223> Fused region of KRAS-CDH13 fusion gene
<400> 110
acatcagcaa agacaagaca ggatatattt aaatttgcaa ga 42
<210> 111
<211> 741
<212> PRT
<213> Artificial Sequence
<220>
<223> KRAS-CDH13 fusion protein
<400> 111
Met Thr Glu Tyr Lys Leu Val Val Val Gly Ala Gly Gly Val Gly Lys
1 5 10 15
Ser Ala Leu Thr Ile Gln Leu Ile Gln Asn His Phe Val Asp Glu Tyr
20 25 30
Asp Pro Thr Ile Glu Asp Ser Tyr Arg Lys Gln Val Val Ile Asp Gly
35 40 45
Glu Thr Cys Leu Leu Asp Ile Leu Asp Thr Ala Gly Gln Glu Glu Tyr
50 55 60
Ser Ala Met Arg Asp Gln Tyr Met Arg Thr Gly Glu Gly Phe Leu Cys
65 70 75 80
Val Phe Ala Ile Asn Asn Thr Lys Ser Phe Glu Asp Ile His His Tyr
85 90 95
Arg Glu Gln Ile Lys Arg Val Lys Asp Ser Glu Asp Val Pro Met Val
100 105 110
Leu Val Gly Asn Lys Cys Asp Leu Pro Ser Arg Thr Val Asp Thr Lys
115 120 125
Gln Ala Gln Asp Leu Ala Arg Ser Tyr Gly Ile Pro Phe Ile Glu Thr
130 135 140
Ser Ala Lys Thr Arg Gln Asp Ile Phe Lys Phe Ala Arg Thr Ser Pro
145 150 155 160
Val Pro Arg Gln Lys Arg Ser Ile Val Val Ser Pro Ile Leu Ile Pro
165 170 175
Glu Asn Gln Arg Gln Pro Phe Pro Arg Asp Val Gly Lys Val Val Asp
180 185 190
Ser Asp Arg Pro Glu Arg Ser Lys Phe Arg Leu Thr Gly Lys Gly Val
195 200 205
Asp Gln Glu Pro Lys Gly Ile Phe Arg Ile Asn Glu Asn Thr Gly Ser
210 215 220
Val Ser Val Thr Arg Thr Leu Asp Arg Glu Val Ile Ala Val Tyr Gln
225 230 235 240
Leu Phe Val Glu Thr Thr Asp Val Asn Gly Lys Thr Leu Glu Gly Pro
245 250 255
Val Pro Leu Glu Val Ile Val Ile Asp Gln Asn Asp Asn Arg Pro Ile
260 265 270
Phe Arg Glu Gly Pro Tyr Ile Gly His Val Met Glu Gly Ser Pro Thr
275 280 285
Gly Thr Thr Val Met Arg Met Thr Ala Phe Asp Ala Asp Asp Pro Ala
290 295 300
Thr Asp Asn Ala Leu Leu Arg Tyr Asn Ile Arg Gln Gln Thr Pro Asp
305 310 315 320
Lys Pro Ser Pro Asn Met Phe Tyr Ile Asp Pro Glu Lys Gly Asp Ile
325 330 335
Val Thr Val Val Ser Pro Ala Leu Leu Asp Arg Glu Thr Leu Glu Asn
340 345 350
Pro Lys Tyr Glu Leu Ile Ile Glu Ala Gln Asp Met Ala Gly Leu Asp
355 360 365
Val Gly Leu Thr Gly Thr Ala Thr Ala Thr Ile Met Ile Asp Asp Lys
370 375 380
Asn Asp His Ser Pro Lys Phe Thr Lys Lys Glu Phe Gln Ala Thr Val
385 390 395 400
Glu Glu Gly Ala Val Gly Val Ile Val Asn Leu Thr Val Glu Asp Lys
405 410 415
Asp Asp Pro Thr Thr Gly Ala Trp Arg Ala Ala Tyr Thr Ile Ile Asn
420 425 430
Gly Asn Pro Gly Gln Ser Phe Glu Ile His Thr Asn Pro Gln Thr Asn
435 440 445
Glu Gly Met Leu Ser Val Val Lys Pro Leu Asp Tyr Glu Ile Ser Ala
450 455 460
Phe His Thr Leu Leu Ile Lys Val Glu Asn Glu Asp Pro Leu Val Pro
465 470 475 480
Asp Val Ser Tyr Gly Pro Ser Ser Thr Ala Thr Val His Ile Thr Val
485 490 495
Leu Asp Val Asn Glu Gly Pro Val Phe Tyr Pro Asp Pro Met Met Val
500 505 510
Thr Arg Gln Glu Asp Leu Ser Val Gly Ser Val Leu Leu Thr Val Asn
515 520 525
Ala Thr Asp Pro Asp Ser Leu Gln His Gln Thr Ile Arg Tyr Ser Val
530 535 540
Tyr Lys Asp Pro Ala Gly Trp Leu Asn Ile Asn Pro Ile Asn Gly Thr
545 550 555 560
Val Asp Thr Thr Ala Val Leu Asp Arg Glu Ser Pro Phe Val Asp Asn
565 570 575
Ser Val Tyr Thr Ala Leu Phe Leu Ala Ile Asp Ser Gly Asn Pro Pro
580 585 590
Ala Thr Gly Thr Gly Thr Leu Leu Ile Thr Leu Glu Asp Val Asn Asp
595 600 605
Asn Ala Pro Phe Ile Tyr Pro Thr Val Ala Glu Val Cys Asp Asp Ala
610 615 620
Lys Asn Leu Ser Val Val Ile Leu Gly Ala Ser Asp Lys Asp Leu His
625 630 635 640
Pro Asn Thr Asp Pro Phe Lys Phe Glu Ile His Lys Gln Ala Val Pro
645 650 655
Asp Lys Val Trp Lys Ile Ser Lys Ile Asn Asn Thr His Ala Leu Val
660 665 670
Ser Leu Leu Gln Asn Leu Asn Lys Ala Asn Tyr Asn Leu Pro Ile Met
675 680 685
Val Thr Asp Ser Gly Lys Pro Pro Met Thr Asn Ile Thr Asp Leu Arg
690 695 700
Val Gln Val Cys Ser Cys Arg Asn Ser Lys Val Asp Cys Asn Ala Ala
705 710 715 720
Gly Ala Leu Arg Phe Ser Leu Pro Ser Val Leu Leu Leu Ser Leu Phe
725 730 735
Ser Leu Ala Cys Leu
740
<210> 112
<211> 14
<212> PRT
<213> Artificial Sequence
<220>
<223> Fused region of KRAS-CDH13 fusion protein
<400> 112
Thr Ser Ala Lys Thr Arg Gln Asp Ile Phe Lys Phe Ala Arg
1 5 10
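The KRAS-CDH13 records above (SEQ ID NOs: 99, 105, 110 and 112) can be cross-checked against one another. The following is a minimal sketch, not part of the filed listing: it assumes the standard genetic code (NCBI translation table 1), and the helper name `translate_three_letter` is hypothetical. It checks that the 42-nt fused region is the KRAS break-point followed by the CDH13 break-point, and that translating it in frame gives the 14-residue fused region.

```python
# Sequences copied from the listing, with spaces and position counts removed.
KRAS_BREAKPOINT_NT = "acatcagcaaagacaagacag"    # SEQ ID NO: 99
CDH13_BREAKPOINT_NT = "gatatatttaaatttgcaaga"   # SEQ ID NO: 105
FUSED_REGION_NT = "acatcagcaaagacaagacaggatatatttaaatttgcaaga"  # SEQ ID NO: 110
FUSED_REGION_AA = "Thr Ser Ala Lys Thr Arg Gln Asp Ile Phe Lys Phe Ala Arg"  # SEQ ID NO: 112

# Standard genetic code (assumption: NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate_three_letter(nucleotides: str) -> str:
    """Translate complete codons from position 1 into three-letter residue codes."""
    seq = nucleotides.upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    return " ".join(ONE_TO_THREE[r] for r in residues)


# The fused region is the two break-point sequences joined end to end.
junction_matches = KRAS_BREAKPOINT_NT + CDH13_BREAKPOINT_NT == FUSED_REGION_NT
# Translating the 42-nt junction in frame reproduces the 14-residue fused region.
translation_matches = translate_three_letter(FUSED_REGION_NT) == FUSED_REGION_AA
# The declared <211> lengths also add up: 450 + 1776 nt and 150 + 591 residues.
lengths_match = (450 + 1776 == 2226) and (150 + 591 == 741)

print(junction_matches, translation_matches, lengths_match)
```

The same check can be repeated for any other fusion pair in the listing by substituting its break-point and fused-region records.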
<210> 113
<211> 4101
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of ZFYVE9 gene (NM_007324)
<400> 113
atggagaatt acttccaagc agaagcttac aacctggaca aggtgttaga tgaatttgaa 60
caaaacgaag atgaaacagt ttcttctact ttattggata caaagtggaa taagattcta 120
gatccccctt ctcaccggct gtcatttaac cctactttgg ccagtgtgaa tgaatctgca 180
gtttctaatg agtcacaacc acaactgaaa gtcttctccc tggctcattc agctcccctg 240
accacagagg aagaggatca ctgtgctaat ggacaggact gtaatctaaa tccagagatt 300
gccacaatgt ggattgatga aaatgctgtt gcagaagacc agttaattaa gagaaactat 360
agttgggatg atcaatgcag tgctgttgaa gtgggagaga agaaatgtgg aaacctggct 420
tgtctgccag atgagaagaa tgttcttgtt gtagccgtca tgcataactg tgataaaagg 480
acattacaaa acgatttaca ggattgtaat aattataata gtcaatccct tatggatgct 540
tttagctgtt cactggataa tgaaaacaga caaactgatc aatttagttt tagtataaat 600
gagtccactg aaaaagatat gaattcagag aaacaaatgg atccattgaa tagaccgaaa 660
acagagggga gatctgttaa ccatctgtgt cctacttcat ctgatagtct agccagtgtc 720
tgttcccctt cacaattaaa ggatgacgga agtataggta gagacccctc catgtctgcg 780
attacaagtt taacggttga ttcagtaatc tcatcccagg gaacagatgg atgtcctgct 840
gttaaaaagc aagagaacta tataccagat gaggacctca ctggcaaaat cagctctcct 900
aggacagatc tagggagtcc aaattccttt tcccacatga gtgaggggat tttgatgaaa 960
aaagagccag cagaggagag caccactgaa gaatccctcc ggtctggttt acctttgctt 1020
ctcaaaccag acatgcctaa tgggtctgga aggaataatg actgtgaacg gtgttcagat 1080
tgccttgtgc ctaatgaagt tagggctgat gaaaatgaag gttatgaaca tgaagaaact 1140
cttggcacta cagaattcct taatatgaca gagcatttct ctgaatctca ggacatgact 1200
aattggaagt tgactaaact aaatgagatg aatgatagcc aagtaaacga agaaaaggaa 1260
aagtttctac agattagtca gcctgaggac actaatggtg atagtggagg acagtgtgtt 1320
ggattggcag atgcaggtct agatttaaaa ggaacttgca ttagtgaaag tgaagaatgt 1380
gatttctcca ctgttataga cacaccagca gcaaattatc tatctaatgg ttgtgattcc 1440
tatggaatgc aagacccagg tgtttctttt gttccaaaga ctttaccctc caaagaagat 1500
tcagtaacag aagaaaaaga aatagaggaa agcaagtcag aatgctactc aaatatttat 1560
gaacagagag gaaatgaggc cacagaaggg agtggactac ttttaaacag cactggtgac 1620
ctaatgaaga aaaattattt acataatttc tgtagtcaag ttccatcagt gcttgggcaa 1680
tcttccccca aggtagtagc aagcctgcca tctatcagtg ttccttttgg tggtgcaaga 1740
cccaagcaac cttctaatct taaacttcaa attccaaagc cattatcaga ccatttacaa 1800
aatgactttc ctgcaaacag tggaaataat actaaaaata aaaatgatat tcttgggaaa 1860
gcaaaattag gggaaaactc agcaaccaat gtatgcagtc catctttggg aaacatctct 1920
aatgtcgata caaatgggga acatttagaa agttatgagg ctgagatctc cactagacca 1980
tgccttgcat tagctccaga tagcccagat aatgatctca gagctggtca gtttggaatt 2040
tctgccagaa agccattcac cactctgggt gaggtggctc cagtatgggt accggattct 2100
caggctccaa attgcatgaa atgtgaagcc aggtttacat tcaccaaaag gaggcatcac 2160
tgcagagcat gtgggaaggt tttctgtgct tcctgctgta gcctgaaatg taaactgtta 2220
tacatggaca gaaaggaagc tagagtgtgt gtaatctgcc attcagtgct aatgaatgtg 2280
gctcagccca gagagcagag gcgagtttgg tttgctgatg ggatcttgcc caatggagaa 2340
gttgctgatg cagccaaatt aacaatgaat ggaacttcct ctgcaggaac cctggctgtg 2400
tcacacgacc cagtcaagcc agtaactacc agtcctctac cagcagagac ggatatttgt 2460
ctattctctg ggagtataac tcaggttgga agtcctgttg gaagtgcaat gaatcttatt 2520
cctgaagatg gccttcctcc cattctcatc tccactggtg taaaaggaga ctatgctgtg 2580
gaagagaaac catcacagat ttcagtaatg cagcagttgg aggatggtgg ccctgaccca 2640
cttgtatttg ttttaaatgc aaatttgttg tcaatggtta aaattgtaaa ttatgtgaac 2700
aggaagtgct ggtgtttcac aaccaaggga atgcatgcag tgggtcagtc tgagatagtc 2760
attcttctac agtgtttacc ggatgaaaag tgtttgccaa aggatatctt taatcacttt 2820
gtgcagcttt atcgggatgc tctggcaggg aatgtggtga gcaacttggg acattccttc 2880
ttcagtcaaa gtttccttgg cagtaaagaa catggtggat tcttatatgt gacatctacc 2940
taccagtcac tgcaagacct agtactccca accccacctt acttgtttgg gattcttatc 3000
cagaaatggg aaactccttg ggctaaagta tttcctatcc gtctgatgtt gagacttgga 3060
gctgaatatc gactttatcc atgcccacta ttcagtgtca gatttcggaa gccattgttt 3120
ggagagacgg ggcataccat catgaatctt cttgcagact tcagaaatta ccagtatacc 3180
ttgccagtag ttcaaggttt ggtggttgat atggaagttc ggaaaactag catcaaaatt 3240
cccagcaaca gatacaatga gatgatgaaa gccatgaaca agtccaatga gcatgtcctg 3300
gcaggaggtg cctgcttcaa tgaaaaggca gactctcatc ttgtgtgtgt acagaatgat 3360
gatggaaact atcagaccca ggctatcagt attcacaatc agcccagaaa agtgactggt 3420
gccagtttct ttgtgttcag tggcgctctg aaatcctctt ctggatacct tgccaagtcc 3480
agtattgtgg aagatggtgt tatggtccag attactgcag agaacatgga ttccttgagg 3540
caggcactgc gagagatgaa ggacttcacc atcacctgtg ggaaggcgga cgcggaggaa 3600
ccccaggagc acatccacat ccagtgggtg gatgatgaca agaacgttag caagggtgtc 3660
gtaagtccta tagatgggaa gtccatggag actataacaa atgtgaagat attccatgga 3720
tcagaatata aagcaaatgg aaaagtaatc agatggacag aggtgttttt cctagaaaac 3780
gatgaccagc acaattgcct cagtgatcct gcagatcaca gtagattgac tgagcatgtt 3840
gccaaagctt tttgccttgc tctctgtcct cacctgaaac ttctgaagga agatggaatg 3900
accaaactgg gactacgtgt gacacttgac tcagatcagg ttggctatca agcagggagc 3960
aatggccagc cccttccctc gcagtacatg aatgatctgg acagcgcctt ggtgccggtg 4020
atccatggag gggcctgcca gcttagtgag ggccccgttg tcatggaact catcttttat 4080
attctggaaa acatcgtata a 4101
<210> 114
<211> 3656
<212> DNA
<213> Artificial Sequence
<220>
<223> ZFYVE9 gene fragment
<400> 114
atggagaatt acttccaagc agaagcttac aacctggaca aggtgttaga tgaatttgaa 60
caaaacgaag atgaaacagt ttcttctact ttattggata caaagtggaa taagattcta 120
gatccccctt ctcaccggct gtcatttaac cctactttgg ccagtgtgaa tgaatctgca 180
gtttctaatg agtcacaacc acaactgaaa gtcttctccc tggctcattc agctcccctg 240
accacagagg aagaggatca ctgtgctaat ggacaggact gtaatctaaa tccagagatt 300
gccacaatgt ggattgatga aaatgctgtt gcagaagacc agttaattaa gagaaactat 360
agttgggatg atcaatgcag tgctgttgaa gtgggagaga agaaatgtgg aaacctggct 420
tgtctgccag atgagaagaa tgttcttgtt gtagccgtca tgcataactg tgataaaagg 480
acattacaaa acgatttaca ggattgtaat aattataata gtcaatccct tatggatgct 540
tttagctgtt cactggataa tgaaaacaga caaactgatc aatttagttt tagtataaat 600
gagtccactg aaaaagatat gaattcagag aaacaaatgg atccattgaa tagaccgaaa 660
acagagggga gatctgttaa ccatctgtgt cctacttcat ctgatagtct agccagtgtc 720
tgttcccctt cacaattaaa ggatgacgga agtataggta gagacccctc catgtctgcg 780
attacaagtt taacggttga ttcagtaatc tcatcccagg gaacagatgg atgtcctgct 840
gttaaaaagc aagagaacta tataccagat gaggacctca ctggcaaaat cagctctcct 900
aggacagatc tagggagtcc aaattccttt tcccacatga gtgaggggat tttgatgaaa 960
aaagagccag cagaggagag caccactgaa gaatccctcc ggtctggttt acctttgctt 1020
ctcaaaccag acatgcctaa tgggtctgga aggaataatg actgtgaacg gtgttcagat 1080
tgccttgtgc ctaatgaagt tagggctgat gaaaatgaag gttatgaaca tgaagaaact 1140
cttggcacta cagaattcct taatatgaca gagcatttct ctgaatctca ggacatgact 1200
aattggaagt tgactaaact aaatgagatg aatgatagcc aagtaaacga agaaaaggaa 1260
aagtttctac agattagtca gcctgaggac actaatggtg atagtggagg acagtgtgtt 1320
ggattggcag atgcaggtct agatttaaaa ggaacttgca ttagtgaaag tgaagaatgt 1380
gatttctcca ctgttataga cacaccagca gcaaattatc tatctaatgg ttgtgattcc 1440
tatggaatgc aagacccagg tgtttctttt gttccaaaga ctttaccctc caaagaagat 1500
tcagtaacag aagaaaaaga aatagaggaa agcaagtcag aatgctactc aaatatttat 1560
gaacagagag gaaatgaggc cacagaaggg agtggactac ttttaaacag cactggtgac 1620
ctaatgaaga aaaattattt acataatttc tgtagtcaag ttccatcagt gcttgggcaa 1680
tcttccccca aggtagtagc aagcctgcca tctatcagtg ttccttttgg tggtgcaaga 1740
cccaagcaac cttctaatct taaacttcaa attccaaagc cattatcaga ccatttacaa 1800
aatgactttc ctgcaaacag tggaaataat actaaaaata aaaatgatat tcttgggaaa 1860
gcaaaattag gggaaaactc agcaaccaat gtatgcagtc catctttggg aaacatctct 1920
aatgtcgata caaatgggga acatttagaa agttatgagg ctgagatctc cactagacca 1980
tgccttgcat tagctccaga tagcccagat aatgatctca gagctggtca gtttggaatt 2040
tctgccagaa agccattcac cactctgggt gaggtggctc cagtatgggt accggattct 2100
caggctccaa attgcatgaa atgtgaagcc aggtttacat tcaccaaaag gaggcatcac 2160
tgcagagcat gtgggaaggt tttctgtgct tcctgctgta gcctgaaatg taaactgtta 2220
tacatggaca gaaaggaagc tagagtgtgt gtaatctgcc attcagtgct aatgaatgtg 2280
gctcagccca gagagcagag gcgagtttgg tttgctgatg ggatcttgcc caatggagaa 2340
gttgctgatg cagccaaatt aacaatgaat ggaacttcct ctgcaggaac cctggctgtg 2400
tcacacgacc cagtcaagcc agtaactacc agtcctctac cagcagagac ggatatttgt 2460
ctattctctg ggagtataac tcaggttgga agtcctgttg gaagtgcaat gaatcttatt 2520
cctgaagatg gccttcctcc cattctcatc tccactggtg taaaaggaga ctatgctgtg 2580
gaagagaaac catcacagat ttcagtaatg cagcagttgg aggatggtgg ccctgaccca 2640
cttgtatttg ttttaaatgc aaatttgttg tcaatggtta aaattgtaaa ttatgtgaac 2700
aggaagtgct ggtgtttcac aaccaaggga atgcatgcag tgggtcagtc tgagatagtc 2760
attcttctac agtgtttacc ggatgaaaag tgtttgccaa aggatatctt taatcacttt 2820
gtgcagcttt atcgggatgc tctggcaggg aatgtggtga gcaacttggg acattccttc 2880
ttcagtcaaa gtttccttgg cagtaaagaa catggtggat tcttatatgt gacatctacc 2940
taccagtcac tgcaagacct agtactccca accccacctt acttgtttgg gattcttatc 3000
cagaaatggg aaactccttg ggctaaagta tttcctatcc gtctgatgtt gagacttgga 3060
gctgaatatc gactttatcc atgcccacta ttcagtgtca gatttcggaa gccattgttt 3120
ggagagacgg ggcataccat catgaatctt cttgcagact tcagaaatta ccagtatacc 3180
ttgccagtag ttcaaggttt ggtggttgat atggaagttc ggaaaactag catcaaaatt 3240
cccagcaaca gatacaatga gatgatgaaa gccatgaaca agtccaatga gcatgtcctg 3300
gcaggaggtg cctgcttcaa tgaaaaggca gactctcatc ttgtgtgtgt acagaatgat 3360
gatggaaact atcagaccca ggctatcagt attcacaatc agcccagaaa agtgactggt 3420
gccagtttct ttgtgttcag tggcgctctg aaatcctctt ctggatacct tgccaagtcc 3480
agtattgtgg aagatggtgt tatggtccag attactgcag agaacatgga ttccttgagg 3540
caggcactgc gagagatgaa ggacttcacc atcacctgtg ggaaggcgga cgcggaggaa 3600
ccccaggagc acatccacat ccagtgggtg gatgatgaca agaacgttag caaggg 3656
<210> 115
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of ZFYVE9 gene fragment
<400> 115
gacaagaacg ttagcaaggg 20
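The ZFYVE9 fragment (SEQ ID NO: 114) is 3656 nt long, which is not a multiple of three. The sketch below is illustrative only and not part of the filed listing. It assumes the reading frame starts at the initial atg, as in the full CDS (SEQ ID NO: 113), and the function name `describe_fragment_end` is hypothetical. It checks that the 20-nt break-point (SEQ ID NO: 115) is the 3' end of the fragment and reports how many nucleotides are left over after the last complete codon.

```python
# Values copied from the listing.
ZFYVE9_FRAGMENT_LENGTH = 3656  # <211> of SEQ ID NO: 114
# Final row of SEQ ID NO: 114 (nucleotides 3601-3656), spaces removed.
ZFYVE9_FRAGMENT_TAIL = "ccccaggagcacatccacatccagtgggtggatgatgacaagaacgttagcaaggg"
ZFYVE9_BREAKPOINT_NT = "gacaagaacgttagcaaggg"  # SEQ ID NO: 115


def describe_fragment_end(length: int, tail: str, breakpoint: str) -> dict:
    """Summarise how a 5' fragment ends relative to a frame starting at nt 1."""
    return {
        "breakpoint_is_3prime_end": tail.endswith(breakpoint),
        "complete_codons": length // 3,
        "leftover_nt": length % 3,
    }


summary = describe_fragment_end(
    ZFYVE9_FRAGMENT_LENGTH, ZFYVE9_FRAGMENT_TAIL, ZFYVE9_BREAKPOINT_NT
)
print(summary)
```

Under that frame assumption, two leftover nucleotides mean the fragment ends inside a codon, so the residue at the junction depends on the first nucleotide contributed by the fusion partner.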
<210> 116
<211> 1366
<212> PRT
<213> Artificial Sequence
<220>
<223> ZFYVE9 protein
<400> 116
Met Glu Asn Tyr Phe Gln Ala Glu Ala Tyr Asn Leu Asp Lys Val Leu
1 5 10 15
Asp Glu Phe Glu Gln Asn Glu Asp Glu Thr Val Ser Ser Thr Leu Leu
20 25 30
Asp Thr Lys Trp Asn Lys Ile Leu Asp Pro Pro Ser His Arg Leu Ser
35 40 45
Phe Asn Pro Thr Leu Ala Ser Val Asn Glu Ser Ala Val Ser Asn Glu
50 55 60
Ser Gln Pro Gln Leu Lys Val Phe Ser Leu Ala His Ser Ala Pro Leu
65 70 75 80
Thr Thr Glu Glu Glu Asp His Cys Ala Asn Gly Gln Asp Cys Asn Leu
85 90 95
Asn Pro Glu Ile Ala Thr Met Trp Ile Asp Glu Asn Ala Val Ala Glu
100 105 110
Asp Gln Leu Ile Lys Arg Asn Tyr Ser Trp Asp Asp Gln Cys Ser Ala
115 120 125
Val Glu Val Gly Glu Lys Lys Cys Gly Asn Leu Ala Cys Leu Pro Asp
130 135 140
Glu Lys Asn Val Leu Val Val Ala Val Met His Asn Cys Asp Lys Arg
145 150 155 160
Thr Leu Gln Asn Asp Leu Gln Asp Cys Asn Asn Tyr Asn Ser Gln Ser
165 170 175
Leu Met Asp Ala Phe Ser Cys Ser Leu Asp Asn Glu Asn Arg Gln Thr
180 185 190
Asp Gln Phe Ser Phe Ser Ile Asn Glu Ser Thr Glu Lys Asp Met Asn
195 200 205
Ser Glu Lys Gln Met Asp Pro Leu Asn Arg Pro Lys Thr Glu Gly Arg
210 215 220
Ser Val Asn His Leu Cys Pro Thr Ser Ser Asp Ser Leu Ala Ser Val
225 230 235 240
Cys Ser Pro Ser Gln Leu Lys Asp Asp Gly Ser Ile Gly Arg Asp Pro
245 250 255
Ser Met Ser Ala Ile Thr Ser Leu Thr Val Asp Ser Val Ile Ser Ser
260 265 270
Gln Gly Thr Asp Gly Cys Pro Ala Val Lys Lys Gln Glu Asn Tyr Ile
275 280 285
Pro Asp Glu Asp Leu Thr Gly Lys Ile Ser Ser Pro Arg Thr Asp Leu
290 295 300
Gly Ser Pro Asn Ser Phe Ser His Met Ser Glu Gly Ile Leu Met Lys
305 310 315 320
Lys Glu Pro Ala Glu Glu Ser Thr Thr Glu Glu Ser Leu Arg Ser Gly
325 330 335
Leu Pro Leu Leu Leu Lys Pro Asp Met Pro Asn Gly Ser Gly Arg Asn
340 345 350
Asn Asp Cys Glu Arg Cys Ser Asp Cys Leu Val Pro Asn Glu Val Arg
355 360 365
Ala Asp Glu Asn Glu Gly Tyr Glu His Glu Glu Thr Leu Gly Thr Thr
370 375 380
Glu Phe Leu Asn Met Thr Glu His Phe Ser Glu Ser Gln Asp Met Thr
385 390 395 400
Asn Trp Lys Leu Thr Lys Leu Asn Glu Met Asn Asp Ser Gln Val Asn
405 410 415
Glu Glu Lys Glu Lys Phe Leu Gln Ile Ser Gln Pro Glu Asp Thr Asn
420 425 430
Gly Asp Ser Gly Gly Gln Cys Val Gly Leu Ala Asp Ala Gly Leu Asp
435 440 445
Leu Lys Gly Thr Cys Ile Ser Glu Ser Glu Glu Cys Asp Phe Ser Thr
450 455 460
Val Ile Asp Thr Pro Ala Ala Asn Tyr Leu Ser Asn Gly Cys Asp Ser
465 470 475 480
Tyr Gly Met Gln Asp Pro Gly Val Ser Phe Val Pro Lys Thr Leu Pro
485 490 495
Ser Lys Glu Asp Ser Val Thr Glu Glu Lys Glu Ile Glu Glu Ser Lys
500 505 510
Ser Glu Cys Tyr Ser Asn Ile Tyr Glu Gln Arg Gly Asn Glu Ala Thr
515 520 525
Glu Gly Ser Gly Leu Leu Leu Asn Ser Thr Gly Asp Leu Met Lys Lys
530 535 540
Asn Tyr Leu His Asn Phe Cys Ser Gln Val Pro Ser Val Leu Gly Gln
545 550 555 560
Ser Ser Pro Lys Val Val Ala Ser Leu Pro Ser Ile Ser Val Pro Phe
565 570 575
Gly Gly Ala Arg Pro Lys Gln Pro Ser Asn Leu Lys Leu Gln Ile Pro
580 585 590
Lys Pro Leu Ser Asp His Leu Gln Asn Asp Phe Pro Ala Asn Ser Gly
595 600 605
Asn Asn Thr Lys Asn Lys Asn Asp Ile Leu Gly Lys Ala Lys Leu Gly
610 615 620
Glu Asn Ser Ala Thr Asn Val Cys Ser Pro Ser Leu Gly Asn Ile Ser
625 630 635 640
Asn Val Asp Thr Asn Gly Glu His Leu Glu Ser Tyr Glu Ala Glu Ile
645 650 655
Ser Thr Arg Pro Cys Leu Ala Leu Ala Pro Asp Ser Pro Asp Asn Asp
660 665 670
Leu Arg Ala Gly Gln Phe Gly Ile Ser Ala Arg Lys Pro Phe Thr Thr
675 680 685
Leu Gly Glu Val Ala Pro Val Trp Val Pro Asp Ser Gln Ala Pro Asn
690 695 700
Cys Met Lys Cys Glu Ala Arg Phe Thr Phe Thr Lys Arg Arg His His
705 710 715 720
Cys Arg Ala Cys Gly Lys Val Phe Cys Ala Ser Cys Cys Ser Leu Lys
725 730 735
Cys Lys Leu Leu Tyr Met Asp Arg Lys Glu Ala Arg Val Cys Val Ile
740 745 750
Cys His Ser Val Leu Met Asn Val Ala Gln Pro Arg Glu Gln Arg Arg
755 760 765
Val Trp Phe Ala Asp Gly Ile Leu Pro Asn Gly Glu Val Ala Asp Ala
770 775 780
Ala Lys Leu Thr Met Asn Gly Thr Ser Ser Ala Gly Thr Leu Ala Val
785 790 795 800
Ser His Asp Pro Val Lys Pro Val Thr Thr Ser Pro Leu Pro Ala Glu
805 810 815
Thr Asp Ile Cys Leu Phe Ser Gly Ser Ile Thr Gln Val Gly Ser Pro
820 825 830
Val Gly Ser Ala Met Asn Leu Ile Pro Glu Asp Gly Leu Pro Pro Ile
835 840 845
Leu Ile Ser Thr Gly Val Lys Gly Asp Tyr Ala Val Glu Glu Lys Pro
850 855 860
Ser Gln Ile Ser Val Met Gln Gln Leu Glu Asp Gly Gly Pro Asp Pro
865 870 875 880
Leu Val Phe Val Leu Asn Ala Asn Leu Leu Ser Met Val Lys Ile Val
885 890 895
Asn Tyr Val Asn Arg Lys Cys Trp Cys Phe Thr Thr Lys Gly Met His
900 905 910
Ala Val Gly Gln Ser Glu Ile Val Ile Leu Leu Gln Cys Leu Pro Asp
915 920 925
Glu Lys Cys Leu Pro Lys Asp Ile Phe Asn His Phe Val Gln Leu Tyr
930 935 940
Arg Asp Ala Leu Ala Gly Asn Val Val Ser Asn Leu Gly His Ser Phe
945 950 955 960
Phe Ser Gln Ser Phe Leu Gly Ser Lys Glu His Gly Gly Phe Leu Tyr
965 970 975
Val Thr Ser Thr Tyr Gln Ser Leu Gln Asp Leu Val Leu Pro Thr Pro
980 985 990
Pro Tyr Leu Phe Gly Ile Leu Ile Gln Lys Trp Glu Thr Pro Trp Ala
995 1000 1005
Lys Val Phe Pro Ile Arg Leu Met Leu Arg Leu Gly Ala Glu Tyr Arg
1010 1015 1020
Leu Tyr Pro Cys Pro Leu Phe Ser Val Arg Phe Arg Lys Pro Leu Phe
1025 1030 1035 1040
Gly Glu Thr Gly His Thr Ile Met Asn Leu Leu Ala Asp Phe Arg Asn
1045 1050 1055
Tyr Gln Tyr Thr Leu Pro Val Val Gln Gly Leu Val Val Asp Met Glu
1060 1065 1070
Val Arg Lys Thr Ser Ile Lys Ile Pro Ser Asn Arg Tyr Asn Glu Met
1075 1080 1085
Met Lys Ala Met Asn Lys Ser Asn Glu His Val Leu Ala Gly Gly Ala
1090 1095 1100
Cys Phe Asn Glu Lys Ala Asp Ser His Leu Val Cys Val Gln Asn Asp
1105 1110 1115 1120
Asp Gly Asn Tyr Gln Thr Gln Ala Ile Ser Ile His Asn Gln Pro Arg
1125 1130 1135
Lys Val Thr Gly Ala Ser Phe Phe Val Phe Ser Gly Ala Leu Lys Ser
1140 1145 1150
Ser Ser Gly Tyr Leu Ala Lys Ser Ser Ile Val Glu Asp Gly Val Met
1155 1160 1165
Val Gln Ile Thr Ala Glu Asn Met Asp Ser Leu Arg Gln Ala Leu Arg
1170 1175 1180
Glu Met Lys Asp Phe Thr Ile Thr Cys Gly Lys Ala Asp Ala Glu Glu
1185 1190 1195 1200
Pro Gln Glu His Ile His Ile Gln Trp Val Asp Asp Asp Lys Asn Val
1205 1210 1215
Ser Lys Gly Val Val Ser Pro Ile Asp Gly Lys Ser Met Glu Thr Ile
1220 1225 1230
Thr Asn Val Lys Ile Phe His Gly Ser Glu Tyr Lys Ala Asn Gly Lys
1235 1240 1245
Val Ile Arg Trp Thr Glu Val Phe Phe Leu Glu Asn Asp Asp Gln His
1250 1255 1260
Asn Cys Leu Ser Asp Pro Ala Asp His Ser Arg Leu Thr Glu His Val
1265 1270 1275 1280
Ala Lys Ala Phe Cys Leu Ala Leu Cys Pro His Leu Lys Leu Leu Lys
1285 1290 1295
Glu Asp Gly Met Thr Lys Leu Gly Leu Arg Val Thr Leu Asp Ser Asp
1300 1305 1310
Gln Val Gly Tyr Gln Ala Gly Ser Asn Gly Gln Pro Leu Pro Ser Gln
1315 1320 1325
Tyr Met Asn Asp Leu Asp Ser Ala Leu Val Pro Val Ile His Gly Gly
1330 1335 1340
Ala Cys Gln Leu Ser Glu Gly Pro Val Val Met Glu Leu Ile Phe Tyr
1345 1350 1355 1360
Ile Leu Glu Asn Ile Val
1365
<210> 117
<211> 1218
<212> PRT
<213> Artificial Sequence
<220>
<223> ZFYVE9 protein fragment
<400> 117
Met Glu Asn Tyr Phe Gln Ala Glu Ala Tyr Asn Leu Asp Lys Val Leu
1 5 10 15
Asp Glu Phe Glu Gln Asn Glu Asp Glu Thr Val Ser Ser Thr Leu Leu
20 25 30
Asp Thr Lys Trp Asn Lys Ile Leu Asp Pro Pro Ser His Arg Leu Ser
35 40 45
Phe Asn Pro Thr Leu Ala Ser Val Asn Glu Ser Ala Val Ser Asn Glu
50 55 60
Ser Gln Pro Gln Leu Lys Val Phe Ser Leu Ala His Ser Ala Pro Leu
65 70 75 80
Thr Thr Glu Glu Glu Asp His Cys Ala Asn Gly Gln Asp Cys Asn Leu
85 90 95
Asn Pro Glu Ile Ala Thr Met Trp Ile Asp Glu Asn Ala Val Ala Glu
100 105 110
Asp Gln Leu Ile Lys Arg Asn Tyr Ser Trp Asp Asp Gln Cys Ser Ala
115 120 125
Val Glu Val Gly Glu Lys Lys Cys Gly Asn Leu Ala Cys Leu Pro Asp
130 135 140
Glu Lys Asn Val Leu Val Val Ala Val Met His Asn Cys Asp Lys Arg
145 150 155 160
Thr Leu Gln Asn Asp Leu Gln Asp Cys Asn Asn Tyr Asn Ser Gln Ser
165 170 175
Leu Met Asp Ala Phe Ser Cys Ser Leu Asp Asn Glu Asn Arg Gln Thr
180 185 190
Asp Gln Phe Ser Phe Ser Ile Asn Glu Ser Thr Glu Lys Asp Met Asn
195 200 205
Ser Glu Lys Gln Met Asp Pro Leu Asn Arg Pro Lys Thr Glu Gly Arg
210 215 220
Ser Val Asn His Leu Cys Pro Thr Ser Ser Asp Ser Leu Ala Ser Val
225 230 235 240
Cys Ser Pro Ser Gln Leu Lys Asp Asp Gly Ser Ile Gly Arg Asp Pro
245 250 255
Ser Met Ser Ala Ile Thr Ser Leu Thr Val Asp Ser Val Ile Ser Ser
260 265 270
Gln Gly Thr Asp Gly Cys Pro Ala Val Lys Lys Gln Glu Asn Tyr Ile
275 280 285
Pro Asp Glu Asp Leu Thr Gly Lys Ile Ser Ser Pro Arg Thr Asp Leu
290 295 300
Gly Ser Pro Asn Ser Phe Ser His Met Ser Glu Gly Ile Leu Met Lys
305 310 315 320
Lys Glu Pro Ala Glu Glu Ser Thr Thr Glu Glu Ser Leu Arg Ser Gly
325 330 335
Leu Pro Leu Leu Leu Lys Pro Asp Met Pro Asn Gly Ser Gly Arg Asn
340 345 350
Asn Asp Cys Glu Arg Cys Ser Asp Cys Leu Val Pro Asn Glu Val Arg
355 360 365
Ala Asp Glu Asn Glu Gly Tyr Glu His Glu Glu Thr Leu Gly Thr Thr
370 375 380
Glu Phe Leu Asn Met Thr Glu His Phe Ser Glu Ser Gln Asp Met Thr
385 390 395 400
Asn Trp Lys Leu Thr Lys Leu Asn Glu Met Asn Asp Ser Gln Val Asn
405 410 415
Glu Glu Lys Glu Lys Phe Leu Gln Ile Ser Gln Pro Glu Asp Thr Asn
420 425 430
Gly Asp Ser Gly Gly Gln Cys Val Gly Leu Ala Asp Ala Gly Leu Asp
435 440 445
Leu Lys Gly Thr Cys Ile Ser Glu Ser Glu Glu Cys Asp Phe Ser Thr
450 455 460
Val Ile Asp Thr Pro Ala Ala Asn Tyr Leu Ser Asn Gly Cys Asp Ser
465 470 475 480
Tyr Gly Met Gln Asp Pro Gly Val Ser Phe Val Pro Lys Thr Leu Pro
485 490 495
Ser Lys Glu Asp Ser Val Thr Glu Glu Lys Glu Ile Glu Glu Ser Lys
500 505 510
Ser Glu Cys Tyr Ser Asn Ile Tyr Glu Gln Arg Gly Asn Glu Ala Thr
515 520 525
Glu Gly Ser Gly Leu Leu Leu Asn Ser Thr Gly Asp Leu Met Lys Lys
530 535 540
Asn Tyr Leu His Asn Phe Cys Ser Gln Val Pro Ser Val Leu Gly Gln
545 550 555 560
Ser Ser Pro Lys Val Val Ala Ser Leu Pro Ser Ile Ser Val Pro Phe
565 570 575
Gly Gly Ala Arg Pro Lys Gln Pro Ser Asn Leu Lys Leu Gln Ile Pro
580 585 590
Lys Pro Leu Ser Asp His Leu Gln Asn Asp Phe Pro Ala Asn Ser Gly
595 600 605
Asn Asn Thr Lys Asn Lys Asn Asp Ile Leu Gly Lys Ala Lys Leu Gly
610 615 620
Glu Asn Ser Ala Thr Asn Val Cys Ser Pro Ser Leu Gly Asn Ile Ser
625 630 635 640
Asn Val Asp Thr Asn Gly Glu His Leu Glu Ser Tyr Glu Ala Glu Ile
645 650 655
Ser Thr Arg Pro Cys Leu Ala Leu Ala Pro Asp Ser Pro Asp Asn Asp
660 665 670
Leu Arg Ala Gly Gln Phe Gly Ile Ser Ala Arg Lys Pro Phe Thr Thr
675 680 685
Leu Gly Glu Val Ala Pro Val Trp Val Pro Asp Ser Gln Ala Pro Asn
690 695 700
Cys Met Lys Cys Glu Ala Arg Phe Thr Phe Thr Lys Arg Arg His His
705 710 715 720
Cys Arg Ala Cys Gly Lys Val Phe Cys Ala Ser Cys Cys Ser Leu Lys
725 730 735
Cys Lys Leu Leu Tyr Met Asp Arg Lys Glu Ala Arg Val Cys Val Ile
740 745 750
Cys His Ser Val Leu Met Asn Val Ala Gln Pro Arg Glu Gln Arg Arg
755 760 765
Val Trp Phe Ala Asp Gly Ile Leu Pro Asn Gly Glu Val Ala Asp Ala
770 775 780
Ala Lys Leu Thr Met Asn Gly Thr Ser Ser Ala Gly Thr Leu Ala Val
785 790 795 800
Ser His Asp Pro Val Lys Pro Val Thr Thr Ser Pro Leu Pro Ala Glu
805 810 815
Thr Asp Ile Cys Leu Phe Ser Gly Ser Ile Thr Gln Val Gly Ser Pro
820 825 830
Val Gly Ser Ala Met Asn Leu Ile Pro Glu Asp Gly Leu Pro Pro Ile
835 840 845
Leu Ile Ser Thr Gly Val Lys Gly Asp Tyr Ala Val Glu Glu Lys Pro
850 855 860
Ser Gln Ile Ser Val Met Gln Gln Leu Glu Asp Gly Gly Pro Asp Pro
865 870 875 880
Leu Val Phe Val Leu Asn Ala Asn Leu Leu Ser Met Val Lys Ile Val
885 890 895
Asn Tyr Val Asn Arg Lys Cys Trp Cys Phe Thr Thr Lys Gly Met His
900 905 910
Ala Val Gly Gln Ser Glu Ile Val Ile Leu Leu Gln Cys Leu Pro Asp
915 920 925
Glu Lys Cys Leu Pro Lys Asp Ile Phe Asn His Phe Val Gln Leu Tyr
930 935 940
Arg Asp Ala Leu Ala Gly Asn Val Val Ser Asn Leu Gly His Ser Phe
945 950 955 960
Phe Ser Gln Ser Phe Leu Gly Ser Lys Glu His Gly Gly Phe Leu Tyr
965 970 975
Val Thr Ser Thr Tyr Gln Ser Leu Gln Asp Leu Val Leu Pro Thr Pro
980 985 990
Pro Tyr Leu Phe Gly Ile Leu Ile Gln Lys Trp Glu Thr Pro Trp Ala
995 1000 1005
Lys Val Phe Pro Ile Arg Leu Met Leu Arg Leu Gly Ala Glu Tyr Arg
1010 1015 1020
Leu Tyr Pro Cys Pro Leu Phe Ser Val Arg Phe Arg Lys Pro Leu Phe
1025 1030 1035 1040
Gly Glu Thr Gly His Thr Ile Met Asn Leu Leu Ala Asp Phe Arg Asn
1045 1050 1055
Tyr Gln Tyr Thr Leu Pro Val Val Gln Gly Leu Val Val Asp Met Glu
1060 1065 1070
Val Arg Lys Thr Ser Ile Lys Ile Pro Ser Asn Arg Tyr Asn Glu Met
1075 1080 1085
Met Lys Ala Met Asn Lys Ser Asn Glu His Val Leu Ala Gly Gly Ala
1090 1095 1100
Cys Phe Asn Glu Lys Ala Asp Ser His Leu Val Cys Val Gln Asn Asp
1105 1110 1115 1120
Asp Gly Asn Tyr Gln Thr Gln Ala Ile Ser Ile His Asn Gln Pro Arg
1125 1130 1135
Lys Val Thr Gly Ala Ser Phe Phe Val Phe Ser Gly Ala Leu Lys Ser
1140 1145 1150
Ser Ser Gly Tyr Leu Ala Lys Ser Ser Ile Val Glu Asp Gly Val Met
1155 1160 1165
Val Gln Ile Thr Ala Glu Asn Met Asp Ser Leu Arg Gln Ala Leu Arg
1170 1175 1180
Glu Met Lys Asp Phe Thr Ile Thr Cys Gly Lys Ala Asp Ala Glu Glu
1185 1190 1195 1200
Pro Gln Glu His Ile His Ile Gln Trp Val Asp Asp Asp Lys Asn Val
1205 1210 1215
Ser Lys
<210> 118
<211> 6
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of ZFYVE9 protein fragment
<400> 118
Asp Lys Asn Val Ser Lys
1 5
<210> 119
<211> 493
<212> DNA
<213> Artificial Sequence
<220>
<223> CGA gene (NM_000735)
<400> 119
acactctgct ggtataaaag caggtgagga cttcattaac tgcagttact gagaactcat 60
aagacgaagc taaaatccct cttcggatcc acagtcaacc gccctgaaca catcctgcaa 120
aaagcccaga gaaaggagcg ccatggatta ctacagaaaa tatgcagcta tctttctggt 180
cacattgtcg gtgtttctgc atgttctcca ttccgctcct gatgtgcagg attgcccaga 240
atgcacgcta caggaaaacc cattcttctc ccagccgggt gccccaatac ttcagtgcat 300
gggctgctgc ttctctagag catatcccac tccactaagg tccaagaaga cgatgttggt 360
ccaaaagaac gtcacctcag agtccacttg ctgtgtagct aaatcatata acagggtcac 420
agtaatgggg ggtttcaaag tggagaacca cacggcgtgc cactgcagta cttgttatta 480
tcacaaatct taa 493
<210> 120
<211> 358
<212> DNA
<213> Artificial Sequence
<220>
<223> CGA gene fragment
<400> 120
gagcgccatg gattactaca gaaaatatgc agctatcttt ctggtcacat tgtcggtgtt 60
tctgcatgtt ctccattccg ctcctgatgt gcaggattgc ccagaatgca cgctacagga 120
aaacccattc ttctcccagc cgggtgcccc aatacttcag tgcatgggct gctgcttctc 180
tagagcatat cccactccac taaggtccaa gaagacgatg ttggtccaaa agaacgtcac 240
ctcagagtcc acttgctgtg tagctaaatc atataacagg gtcacagtaa tggggggttt 300
caaagtggag aaccacacgg cgtgccactg cagtacttgt tattatcaca aatcttaa 358
<210> 121
<211> 7
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of CGA gene fragment
<400> 121
gagcgcc 7
<210> 122
<211> 116
<212> PRT
<213> Artificial Sequence
<220>
<223> CGA protein
<400> 122
Met Asp Tyr Tyr Arg Lys Tyr Ala Ala Ile Phe Leu Val Thr Leu Ser
1 5 10 15
Val Phe Leu His Val Leu His Ser Ala Pro Asp Val Gln Asp Cys Pro
20 25 30
Glu Cys Thr Leu Gln Glu Asn Pro Phe Phe Ser Gln Pro Gly Ala Pro
35 40 45
Ile Leu Gln Cys Met Gly Cys Cys Phe Ser Arg Ala Tyr Pro Thr Pro
50 55 60
Leu Arg Ser Lys Lys Thr Met Leu Val Gln Lys Asn Val Thr Ser Glu
65 70 75 80
Ser Thr Cys Cys Val Ala Lys Ser Tyr Asn Arg Val Thr Val Met Gly
85 90 95
Gly Phe Lys Val Glu Asn His Thr Ala Cys His Cys Ser Thr Cys Tyr
100 105 110
Tyr His Lys Ser
115
<210> 123
<211> 116
<212> PRT
<213> Artificial Sequence
<220>
<223> CGA protein fragment
<400> 123
Met Asp Tyr Tyr Arg Lys Tyr Ala Ala Ile Phe Leu Val Thr Leu Ser
1 5 10 15
Val Phe Leu His Val Leu His Ser Ala Pro Asp Val Gln Asp Cys Pro
20 25 30
Glu Cys Thr Leu Gln Glu Asn Pro Phe Phe Ser Gln Pro Gly Ala Pro
35 40 45
Ile Leu Gln Cys Met Gly Cys Cys Phe Ser Arg Ala Tyr Pro Thr Pro
50 55 60
Leu Arg Ser Lys Lys Thr Met Leu Val Gln Lys Asn Val Thr Ser Glu
65 70 75 80
Ser Thr Cys Cys Val Ala Lys Ser Tyr Asn Arg Val Thr Val Met Gly
85 90 95
Gly Phe Lys Val Glu Asn His Thr Ala Cys His Cys Ser Thr Cys Tyr
100 105 110
Tyr His Lys Ser
115
<210> 124
<211> 4014
<212> DNA
<213> Artificial Sequence
<220>
<223> ZFYVE9-CGA fusion gene
<400> 124
atggagaatt acttccaagc agaagcttac aacctggaca aggtgttaga tgaatttgaa 60
caaaacgaag atgaaacagt ttcttctact ttattggata caaagtggaa taagattcta 120
gatccccctt ctcaccggct gtcatttaac cctactttgg ccagtgtgaa tgaatctgca 180
gtttctaatg agtcacaacc acaactgaaa gtcttctccc tggctcattc agctcccctg 240
accacagagg aagaggatca ctgtgctaat ggacaggact gtaatctaaa tccagagatt 300
gccacaatgt ggattgatga aaatgctgtt gcagaagacc agttaattaa gagaaactat 360
agttgggatg atcaatgcag tgctgttgaa gtgggagaga agaaatgtgg aaacctggct 420
tgtctgccag atgagaagaa tgttcttgtt gtagccgtca tgcataactg tgataaaagg 480
acattacaaa acgatttaca ggattgtaat aattataata gtcaatccct tatggatgct 540
tttagctgtt cactggataa tgaaaacaga caaactgatc aatttagttt tagtataaat 600
gagtccactg aaaaagatat gaattcagag aaacaaatgg atccattgaa tagaccgaaa 660
acagagggga gatctgttaa ccatctgtgt cctacttcat ctgatagtct agccagtgtc 720
tgttcccctt cacaattaaa ggatgacgga agtataggta gagacccctc catgtctgcg 780
attacaagtt taacggttga ttcagtaatc tcatcccagg gaacagatgg atgtcctgct 840
gttaaaaagc aagagaacta tataccagat gaggacctca ctggcaaaat cagctctcct 900
aggacagatc tagggagtcc aaattccttt tcccacatga gtgaggggat tttgatgaaa 960
aaagagccag cagaggagag caccactgaa gaatccctcc ggtctggttt acctttgctt 1020
ctcaaaccag acatgcctaa tgggtctgga aggaataatg actgtgaacg gtgttcagat 1080
tgccttgtgc ctaatgaagt tagggctgat gaaaatgaag gttatgaaca tgaagaaact 1140
cttggcacta cagaattcct taatatgaca gagcatttct ctgaatctca ggacatgact 1200
aattggaagt tgactaaact aaatgagatg aatgatagcc aagtaaacga agaaaaggaa 1260
aagtttctac agattagtca gcctgaggac actaatggtg atagtggagg acagtgtgtt 1320
ggattggcag atgcaggtct agatttaaaa ggaacttgca ttagtgaaag tgaagaatgt 1380
gatttctcca ctgttataga cacaccagca gcaaattatc tatctaatgg ttgtgattcc 1440
tatggaatgc aagacccagg tgtttctttt gttccaaaga ctttaccctc caaagaagat 1500
tcagtaacag aagaaaaaga aatagaggaa agcaagtcag aatgctactc aaatatttat 1560
gaacagagag gaaatgaggc cacagaaggg agtggactac ttttaaacag cactggtgac 1620
ctaatgaaga aaaattattt acataatttc tgtagtcaag ttccatcagt gcttgggcaa 1680
tcttccccca aggtagtagc aagcctgcca tctatcagtg ttccttttgg tggtgcaaga 1740
cccaagcaac cttctaatct taaacttcaa attccaaagc cattatcaga ccatttacaa 1800
aatgactttc ctgcaaacag tggaaataat actaaaaata aaaatgatat tcttgggaaa 1860
gcaaaattag gggaaaactc agcaaccaat gtatgcagtc catctttggg aaacatctct 1920
aatgtcgata caaatgggga acatttagaa agttatgagg ctgagatctc cactagacca 1980
tgccttgcat tagctccaga tagcccagat aatgatctca gagctggtca gtttggaatt 2040
tctgccagaa agccattcac cactctgggt gaggtggctc cagtatgggt accggattct 2100
caggctccaa attgcatgaa atgtgaagcc aggtttacat tcaccaaaag gaggcatcac 2160
tgcagagcat gtgggaaggt tttctgtgct tcctgctgta gcctgaaatg taaactgtta 2220
tacatggaca gaaaggaagc tagagtgtgt gtaatctgcc attcagtgct aatgaatgtg 2280
gctcagccca gagagcagag gcgagtttgg tttgctgatg ggatcttgcc caatggagaa 2340
gttgctgatg cagccaaatt aacaatgaat ggaacttcct ctgcaggaac cctggctgtg 2400
tcacacgacc cagtcaagcc agtaactacc agtcctctac cagcagagac ggatatttgt 2460
ctattctctg ggagtataac tcaggttgga agtcctgttg gaagtgcaat gaatcttatt 2520
cctgaagatg gccttcctcc cattctcatc tccactggtg taaaaggaga ctatgctgtg 2580
gaagagaaac catcacagat ttcagtaatg cagcagttgg aggatggtgg ccctgaccca 2640
cttgtatttg ttttaaatgc aaatttgttg tcaatggtta aaattgtaaa ttatgtgaac 2700
aggaagtgct ggtgtttcac aaccaaggga atgcatgcag tgggtcagtc tgagatagtc 2760
attcttctac agtgtttacc ggatgaaaag tgtttgccaa aggatatctt taatcacttt 2820
gtgcagcttt atcgggatgc tctggcaggg aatgtggtga gcaacttggg acattccttc 2880
ttcagtcaaa gtttccttgg cagtaaagaa catggtggat tcttatatgt gacatctacc 2940
taccagtcac tgcaagacct agtactccca accccacctt acttgtttgg gattcttatc 3000
cagaaatggg aaactccttg ggctaaagta tttcctatcc gtctgatgtt gagacttgga 3060
gctgaatatc gactttatcc atgcccacta ttcagtgtca gatttcggaa gccattgttt 3120
ggagagacgg ggcataccat catgaatctt cttgcagact tcagaaatta ccagtatacc 3180
ttgccagtag ttcaaggttt ggtggttgat atggaagttc ggaaaactag catcaaaatt 3240
cccagcaaca gatacaatga gatgatgaaa gccatgaaca agtccaatga gcatgtcctg 3300
gcaggaggtg cctgcttcaa tgaaaaggca gactctcatc ttgtgtgtgt acagaatgat 3360
gatggaaact atcagaccca ggctatcagt attcacaatc agcccagaaa agtgactggt 3420
gccagtttct ttgtgttcag tggcgctctg aaatcctctt ctggatacct tgccaagtcc 3480
agtattgtgg aagatggtgt tatggtccag attactgcag agaacatgga ttccttgagg 3540
caggcactgc gagagatgaa ggacttcacc atcacctgtg ggaaggcgga cgcggaggaa 3600
ccccaggagc acatccacat ccagtgggtg gatgatgaca agaacgttag caaggggagc 3660
gccatggatt actacagaaa atatgcagct atctttctgg tcacattgtc ggtgtttctg 3720
catgttctcc attccgctcc tgatgtgcag gattgcccag aatgcacgct acaggaaaac 3780
ccattcttct cccagccggg tgccccaata cttcagtgca tgggctgctg cttctctaga 3840
gcatatccca ctccactaag gtccaagaag acgatgttgg tccaaaagaa cgtcacctca 3900
gagtccactt gctgtgtagc taaatcatat aacagggtca cagtaatggg gggtttcaaa 3960
gtggagaacc acacggcgtg ccactgcagt acttgttatt atcacaaatc ttaa 4014
<210> 125
<211> 27
<212> DNA
<213> Artificial Sequence
<220>
<223> Fused region of ZFYVE9-CGA fusion gene
<400> 125
gacaagaacg ttagcaaggg gagcgcc 27
<210> 126
<211> 1337
<212> PRT
<213> Artificial Sequence
<220>
<223> ZFYVE9-CGA fusion protein
<400> 126
Met Glu Asn Tyr Phe Gln Ala Glu Ala Tyr Asn Leu Asp Lys Val Leu
1 5 10 15
Asp Glu Phe Glu Gln Asn Glu Asp Glu Thr Val Ser Ser Thr Leu Leu
20 25 30
Asp Thr Lys Trp Asn Lys Ile Leu Asp Pro Pro Ser His Arg Leu Ser
35 40 45
Phe Asn Pro Thr Leu Ala Ser Val Asn Glu Ser Ala Val Ser Asn Glu
50 55 60
Ser Gln Pro Gln Leu Lys Val Phe Ser Leu Ala His Ser Ala Pro Leu
65 70 75 80
Thr Thr Glu Glu Glu Asp His Cys Ala Asn Gly Gln Asp Cys Asn Leu
85 90 95
Asn Pro Glu Ile Ala Thr Met Trp Ile Asp Glu Asn Ala Val Ala Glu
100 105 110
Asp Gln Leu Ile Lys Arg Asn Tyr Ser Trp Asp Asp Gln Cys Ser Ala
115 120 125
Val Glu Val Gly Glu Lys Lys Cys Gly Asn Leu Ala Cys Leu Pro Asp
130 135 140
Glu Lys Asn Val Leu Val Val Ala Val Met His Asn Cys Asp Lys Arg
145 150 155 160
Thr Leu Gln Asn Asp Leu Gln Asp Cys Asn Asn Tyr Asn Ser Gln Ser
165 170 175
Leu Met Asp Ala Phe Ser Cys Ser Leu Asp Asn Glu Asn Arg Gln Thr
180 185 190
Asp Gln Phe Ser Phe Ser Ile Asn Glu Ser Thr Glu Lys Asp Met Asn
195 200 205
Ser Glu Lys Gln Met Asp Pro Leu Asn Arg Pro Lys Thr Glu Gly Arg
210 215 220
Ser Val Asn His Leu Cys Pro Thr Ser Ser Asp Ser Leu Ala Ser Val
225 230 235 240
Cys Ser Pro Ser Gln Leu Lys Asp Asp Gly Ser Ile Gly Arg Asp Pro
245 250 255
Ser Met Ser Ala Ile Thr Ser Leu Thr Val Asp Ser Val Ile Ser Ser
260 265 270
Gln Gly Thr Asp Gly Cys Pro Ala Val Lys Lys Gln Glu Asn Tyr Ile
275 280 285
Pro Asp Glu Asp Leu Thr Gly Lys Ile Ser Ser Pro Arg Thr Asp Leu
290 295 300
Gly Ser Pro Asn Ser Phe Ser His Met Ser Glu Gly Ile Leu Met Lys
305 310 315 320
Lys Glu Pro Ala Glu Glu Ser Thr Thr Glu Glu Ser Leu Arg Ser Gly
325 330 335
Leu Pro Leu Leu Leu Lys Pro Asp Met Pro Asn Gly Ser Gly Arg Asn
340 345 350
Asn Asp Cys Glu Arg Cys Ser Asp Cys Leu Val Pro Asn Glu Val Arg
355 360 365
Ala Asp Glu Asn Glu Gly Tyr Glu His Glu Glu Thr Leu Gly Thr Thr
370 375 380
Glu Phe Leu Asn Met Thr Glu His Phe Ser Glu Ser Gln Asp Met Thr
385 390 395 400
Asn Trp Lys Leu Thr Lys Leu Asn Glu Met Asn Asp Ser Gln Val Asn
405 410 415
Glu Glu Lys Glu Lys Phe Leu Gln Ile Ser Gln Pro Glu Asp Thr Asn
420 425 430
Gly Asp Ser Gly Gly Gln Cys Val Gly Leu Ala Asp Ala Gly Leu Asp
435 440 445
Leu Lys Gly Thr Cys Ile Ser Glu Ser Glu Glu Cys Asp Phe Ser Thr
450 455 460
Val Ile Asp Thr Pro Ala Ala Asn Tyr Leu Ser Asn Gly Cys Asp Ser
465 470 475 480
Tyr Gly Met Gln Asp Pro Gly Val Ser Phe Val Pro Lys Thr Leu Pro
485 490 495
Ser Lys Glu Asp Ser Val Thr Glu Glu Lys Glu Ile Glu Glu Ser Lys
500 505 510
Ser Glu Cys Tyr Ser Asn Ile Tyr Glu Gln Arg Gly Asn Glu Ala Thr
515 520 525
Glu Gly Ser Gly Leu Leu Leu Asn Ser Thr Gly Asp Leu Met Lys Lys
530 535 540
Asn Tyr Leu His Asn Phe Cys Ser Gln Val Pro Ser Val Leu Gly Gln
545 550 555 560
Ser Ser Pro Lys Val Val Ala Ser Leu Pro Ser Ile Ser Val Pro Phe
565 570 575
Gly Gly Ala Arg Pro Lys Gln Pro Ser Asn Leu Lys Leu Gln Ile Pro
580 585 590
Lys Pro Leu Ser Asp His Leu Gln Asn Asp Phe Pro Ala Asn Ser Gly
595 600 605
Asn Asn Thr Lys Asn Lys Asn Asp Ile Leu Gly Lys Ala Lys Leu Gly
610 615 620
Glu Asn Ser Ala Thr Asn Val Cys Ser Pro Ser Leu Gly Asn Ile Ser
625 630 635 640
Asn Val Asp Thr Asn Gly Glu His Leu Glu Ser Tyr Glu Ala Glu Ile
645 650 655
Ser Thr Arg Pro Cys Leu Ala Leu Ala Pro Asp Ser Pro Asp Asn Asp
660 665 670
Leu Arg Ala Gly Gln Phe Gly Ile Ser Ala Arg Lys Pro Phe Thr Thr
675 680 685
Leu Gly Glu Val Ala Pro Val Trp Val Pro Asp Ser Gln Ala Pro Asn
690 695 700
Cys Met Lys Cys Glu Ala Arg Phe Thr Phe Thr Lys Arg Arg His His
705 710 715 720
Cys Arg Ala Cys Gly Lys Val Phe Cys Ala Ser Cys Cys Ser Leu Lys
725 730 735
Cys Lys Leu Leu Tyr Met Asp Arg Lys Glu Ala Arg Val Cys Val Ile
740 745 750
Cys His Ser Val Leu Met Asn Val Ala Gln Pro Arg Glu Gln Arg Arg
755 760 765
Val Trp Phe Ala Asp Gly Ile Leu Pro Asn Gly Glu Val Ala Asp Ala
770 775 780
Ala Lys Leu Thr Met Asn Gly Thr Ser Ser Ala Gly Thr Leu Ala Val
785 790 795 800
Ser His Asp Pro Val Lys Pro Val Thr Thr Ser Pro Leu Pro Ala Glu
805 810 815
Thr Asp Ile Cys Leu Phe Ser Gly Ser Ile Thr Gln Val Gly Ser Pro
820 825 830
Val Gly Ser Ala Met Asn Leu Ile Pro Glu Asp Gly Leu Pro Pro Ile
835 840 845
Leu Ile Ser Thr Gly Val Lys Gly Asp Tyr Ala Val Glu Glu Lys Pro
850 855 860
Ser Gln Ile Ser Val Met Gln Gln Leu Glu Asp Gly Gly Pro Asp Pro
865 870 875 880
Leu Val Phe Val Leu Asn Ala Asn Leu Leu Ser Met Val Lys Ile Val
885 890 895
Asn Tyr Val Asn Arg Lys Cys Trp Cys Phe Thr Thr Lys Gly Met His
900 905 910
Ala Val Gly Gln Ser Glu Ile Val Ile Leu Leu Gln Cys Leu Pro Asp
915 920 925
Glu Lys Cys Leu Pro Lys Asp Ile Phe Asn His Phe Val Gln Leu Tyr
930 935 940
Arg Asp Ala Leu Ala Gly Asn Val Val Ser Asn Leu Gly His Ser Phe
945 950 955 960
Phe Ser Gln Ser Phe Leu Gly Ser Lys Glu His Gly Gly Phe Leu Tyr
965 970 975
Val Thr Ser Thr Tyr Gln Ser Leu Gln Asp Leu Val Leu Pro Thr Pro
980 985 990
Pro Tyr Leu Phe Gly Ile Leu Ile Gln Lys Trp Glu Thr Pro Trp Ala
995 1000 1005
Lys Val Phe Pro Ile Arg Leu Met Leu Arg Leu Gly Ala Glu Tyr Arg
1010 1015 1020
Leu Tyr Pro Cys Pro Leu Phe Ser Val Arg Phe Arg Lys Pro Leu Phe
1025 1030 1035 1040
Gly Glu Thr Gly His Thr Ile Met Asn Leu Leu Ala Asp Phe Arg Asn
1045 1050 1055
Tyr Gln Tyr Thr Leu Pro Val Val Gln Gly Leu Val Val Asp Met Glu
1060 1065 1070
Val Arg Lys Thr Ser Ile Lys Ile Pro Ser Asn Arg Tyr Asn Glu Met
1075 1080 1085
Met Lys Ala Met Asn Lys Ser Asn Glu His Val Leu Ala Gly Gly Ala
1090 1095 1100
Cys Phe Asn Glu Lys Ala Asp Ser His Leu Val Cys Val Gln Asn Asp
1105 1110 1115 1120
Asp Gly Asn Tyr Gln Thr Gln Ala Ile Ser Ile His Asn Gln Pro Arg
1125 1130 1135
Lys Val Thr Gly Ala Ser Phe Phe Val Phe Ser Gly Ala Leu Lys Ser
1140 1145 1150
Ser Ser Gly Tyr Leu Ala Lys Ser Ser Ile Val Glu Asp Gly Val Met
1155 1160 1165
Val Gln Ile Thr Ala Glu Asn Met Asp Ser Leu Arg Gln Ala Leu Arg
1170 1175 1180
Glu Met Lys Asp Phe Thr Ile Thr Cys Gly Lys Ala Asp Ala Glu Glu
1185 1190 1195 1200
Pro Gln Glu His Ile His Ile Gln Trp Val Asp Asp Asp Lys Asn Val
1205 1210 1215
Ser Lys Gly Ser Ala Met Asp Tyr Tyr Arg Lys Tyr Ala Ala Ile Phe
1220 1225 1230
Leu Val Thr Leu Ser Val Phe Leu His Val Leu His Ser Ala Pro Asp
1235 1240 1245
Val Gln Asp Cys Pro Glu Cys Thr Leu Gln Glu Asn Pro Phe Phe Ser
1250 1255 1260
Gln Pro Gly Ala Pro Ile Leu Gln Cys Met Gly Cys Cys Phe Ser Arg
1265 1270 1275 1280
Ala Tyr Pro Thr Pro Leu Arg Ser Lys Lys Thr Met Leu Val Gln Lys
1285 1290 1295
Asn Val Thr Ser Glu Ser Thr Cys Cys Val Ala Lys Ser Tyr Asn Arg
1300 1305 1310
Val Thr Val Met Gly Gly Phe Lys Val Glu Asn His Thr Ala Cys His
1315 1320 1325
Cys Ser Thr Cys Tyr Tyr His Lys Ser
1330 1335
<210> 127
<211> 9
<212> PRT
<213> Artificial Sequence
<220>
<223> Fused region of ZFYVE9-CGA fusion protein
<400> 127
Asp Lys Asn Val Ser Lys Gly Ser Ala
1 5
<210> 128
<211> 3909
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of ERBB2IP gene (NM_001006600)
<400> 128
atgactacaa aacgaagttt gtttgtgcgg ttggtaccat gtcgctgtct acgaggggaa 60
gaggagactg tcactactct tgattattct cattgcagct tagaacaagt tccgaaagag 120
atttttactt ttgaaaaaac cttggaggaa ctctatttag atgctaatca gattgaagag 180
cttccaaagc aactttttaa ctgtcagtct ttacacaaac tgagtttgcc agacaatgat 240
ttaacaacgt taccagcatc cattgcaaac cttattaatc tcagggaact ggatgtcagc 300
aagaatggaa tacaggagtt tccagaaaat ataaaaaatt gtaaagtttt gacaattgtg 360
gaggccagtg taaaccctat ttccaagctc cctgatggat tttctcagct gttaaaccta 420
acccagttgt atctgaatga tgcttttctt gagttcttgc cagcaaattt tggcagatta 480
actaaactcc aaatattaga gcttagagaa aaccagttaa aaatgttgcc taaaactatg 540
aatagactga cccagctgga aagactggat ttgggaagta acgaattcac ggaagtgcct 600
gaagtacttg agcaactaag tggattgaaa gagttttgga tggatgctaa tagactgact 660
tttattccag ggtttattgg tagtttgaaa cagctcacat atttggatgt ttctaaaaat 720
aatattgaaa tggttgaaga aggaatttca acatgtgaaa accttcaaga cctcctatta 780
tcaagcaatt cacttcagca gcttcctgag actattggtt cgttgaagaa tataacaacg 840
cttaaaatag atgaaaacca gttaatgtat ctgccagact ctataggagg gttaatatca 900
gtagaagaac tggattgtag tttcaatgaa gttgaagctt tgccttcatc tattgggcag 960
cttactaact taagaacttt tgctgctgat cataattact tacagcagtt gcccccagag 1020
attggaagct ggaaaaatat aactgtgctg tttctccatt ccaataaact tgagacactt 1080
ccagaggaaa tgggtgatat gcaaaaatta aaagtcatta atttaagtga taatagatta 1140
aagaatttac cctttagctt tacaaagcta cagcaattga cagctatgtg gctctcagat 1200
aatcagtcca aacccctgat acctcttcaa aaagaaactg attcagagac ccagaaaatg 1260
gtgcttacca actacatgtt ccctcaacag ccaaggactg aggatgttat gtttatatca 1320
gataatgaaa gttttaaccc ttcattgtgg gaggaacaga ggaaacagcg ggctcaagtt 1380
gcatttgaat gtgatgaaga caaagatgaa agggaggcac ctcccaggga gggaaattta 1440
aaaagatatc caacaccata cccagatgag cttaagaata tggtcaaaac tgttcaaacc 1500
attgtacata gattaaaaga tgaagagacc aatgaagact caggaagaga tttgaaacca 1560
catgaagatc aacaagatat aaataaagat gtgggtgtga agacctcaga aagtactact 1620
acagtaaaaa gcaaagttga tgaaagagaa aaatatatga taggaaactc tgtacagaag 1680
atcagtgaac ctgaagctga gattagtcct gggagtttac cagtgactgc aaatatgaaa 1740
gcctctgaga acttgaagca tattgttaac catgatgatg tttttgagga atctgaagaa 1800
ctttcttctg atgaagagat gaaaatggcg gagatgcgac caccattaat tgaaacctct 1860
attaaccagc caaaagtcgt agcacttagt aataacaaaa aagatgatac aaaggaaaca 1920
gattctttat cagatgaagt tacacacaat agcaatcaga ataacagcaa ttgttcttct 1980
ccatctcgga tgtctgattc agtttctctt aatactgata gtagtcaaga cacctcactc 2040
tgctctccag tgaaacaaac tcatattgat attaattcca aaatcaggca agaagatgaa 2100
aattttaaca gccttttaca aaatggagat attttaaaca gttcaacaga ggaaaagttc 2160
aaagctcatg ataaaaaaga ttttaactta cctgaatatg atttgaatgt tgaagagcga 2220
ttagttctaa ttgagaaaag tgttgactca acagccacag ctgatgacac tcacaaatta 2280
gatcatatca atatgaatct taataaactt ataactaatg atacatttca accagagatc 2340
atggaaagat caaaaacaca ggatattgtg cttggaacaa gctttttaag cattaattct 2400
aaagaggaaa ctgagcactt ggaaaatgga aacaagtatc ctaatttgga atccgtaaat 2460
aaggtaaatg gacattctga ggaaacttcc cagtctccta ataggactga accacatgac 2520
agtgattgtt ctgttgactt aggtatttcc aaaagcactg aagatctctc ccctcagaaa 2580
agtggtccag ttggatctgt tgtgaaatct catagcataa ctaatatgga gattggaggg 2640
ctaaaaatct atgatattct tagtgataat ggacctcagc agccaagtac aaccgttaaa 2700
atcacatctg ctgttgatgg aaaaaatata gtcaggagca agtctgccac actgttgtat 2760
gatcaaccat tgcaggtatt tactggttct tcctcatctt ctgatttaat atcaggaaca 2820
aaggcaattt tcaagtttga ttcaaatcat aatcccgaag agccaaatat aataagaggc 2880
cccacaagtg gcccacaatc tgcacctcaa atatatggtc ctccacagta taatatccaa 2940
tacagtagca gtgctgcagt caaagacact ttgtggcact ccaaacaaaa tccccaaata 3000
gaccatgcca gttttcctcc tcagctcctt cctagatcag agagcacaga aaatcaaagt 3060
tatgctaaac attctgccaa tatgaatttc tctaatcata acaatgttcg agctaatact 3120
gcataccatt tacatcagag acttggccca gcaagacatg gggaaatgtg ggccatctca 3180
ccaaacgacc gacttattcc tgcagtaact cgaagtacaa tccagcgaca aagtagtgtg 3240
tcctccacag cctctgtaaa tcttggtgat ccaggctcta caaggcgggc tcagattcct 3300
gaaggagatt atttatcata cagagagttc cactcagcgg gaagaactcc tccaatgatg 3360
ccaggatcac agagacccct ttctgcacga acatacagca tagatggtcc aaatgcatca 3420
agacctcaga gtgctcgacc ctctattaat gaaataccag agagaactat gtcagttagt 3480
gatttcaatt attcacggac tagtccttca aaaagaccaa atgcaagggt tggttctgag 3540
cattctttat tagatcctcc aggaaaaagt aaagttcctc gtgactggag agaacaagta 3600
cttcgacata ttgaagccaa aaagttagaa aagattcgag tgagggttga aaaggatcca 3660
gaacttggat ttagcatatc aggtggtgtc gggggtagag gaaacccatt cagacctgat 3720
gatgatggta tatttgtaac aagggtacaa cctgaaggac cagcatcaaa attactgcag 3780
ccaggtgata aaattattca ggctaatggc tacagtttta taaatattga acatggacaa 3840
gcagtgtcct tgctaaaaac tttccagaat acagttgaac tcatcattgt acgagaagtt 3900
tcctcataa 3909
<210> 129
<211> 3801
<212> DNA
<213> Artificial Sequence
<220>
<223> ERBB2IP gene fragment
<400> 129
atgactacaa aacgaagttt gtttgtgcgg ttggtaccat gtcgctgtct acgaggggaa 60
gaggagactg tcactactct tgattattct cattgcagct tagaacaagt tccgaaagag 120
atttttactt ttgaaaaaac cttggaggaa ctctatttag atgctaatca gattgaagag 180
cttccaaagc aactttttaa ctgtcagtct ttacacaaac tgagtttgcc agacaatgat 240
ttaacaacgt taccagcatc cattgcaaac cttattaatc tcagggaact ggatgtcagc 300
aagaatggaa tacaggagtt tccagaaaat ataaaaaatt gtaaagtttt gacaattgtg 360
gaggccagtg taaaccctat ttccaagctc cctgatggat tttctcagct gttaaaccta 420
acccagttgt atctgaatga tgcttttctt gagttcttgc cagcaaattt tggcagatta 480
actaaactcc aaatattaga gcttagagaa aaccagttaa aaatgttgcc taaaactatg 540
aatagactga cccagctgga aagactggat ttgggaagta acgaattcac ggaagtgcct 600
gaagtacttg agcaactaag tggattgaaa gagttttgga tggatgctaa tagactgact 660
tttattccag ggtttattgg tagtttgaaa cagctcacat atttggatgt ttctaaaaat 720
aatattgaaa tggttgaaga aggaatttca acatgtgaaa accttcaaga cctcctatta 780
tcaagcaatt cacttcagca gcttcctgag actattggtt cgttgaagaa tataacaacg 840
cttaaaatag atgaaaacca gttaatgtat ctgccagact ctataggagg gttaatatca 900
gtagaagaac tggattgtag tttcaatgaa gttgaagctt tgccttcatc tattgggcag 960
cttactaact taagaacttt tgctgctgat cataattact tacagcagtt gcccccagag 1020
attggaagct ggaaaaatat aactgtgctg tttctccatt ccaataaact tgagacactt 1080
ccagaggaaa tgggtgatat gcaaaaatta aaagtcatta atttaagtga taatagatta 1140
aagaatttac cctttagctt tacaaagcta cagcaattga cagctatgtg gctctcagat 1200
aatcagtcca aacccctgat acctcttcaa aaagaaactg attcagagac ccagaaaatg 1260
gtgcttacca actacatgtt ccctcaacag ccaaggactg aggatgttat gtttatatca 1320
gataatgaaa gttttaaccc ttcattgtgg gaggaacaga ggaaacagcg ggctcaagtt 1380
gcatttgaat gtgatgaaga caaagatgaa agggaggcac ctcccaggga gggaaattta 1440
aaaagatatc caacaccata cccagatgag cttaagaata tggtcaaaac tgttcaaacc 1500
attgtacata gattaaaaga tgaagagacc aatgaagact caggaagaga tttgaaacca 1560
catgaagatc aacaagatat aaataaagat gtgggtgtga agacctcaga aagtactact 1620
acagtaaaaa gcaaagttga tgaaagagaa aaatatatga taggaaactc tgtacagaag 1680
atcagtgaac ctgaagctga gattagtcct gggagtttac cagtgactgc aaatatgaaa 1740
gcctctgaga acttgaagca tattgttaac catgatgatg tttttgagga atctgaagaa 1800
ctttcttctg atgaagagat gaaaatggcg gagatgcgac caccattaat tgaaacctct 1860
attaaccagc caaaagtcgt agcacttagt aataacaaaa aagatgatac aaaggaaaca 1920
gattctttat cagatgaagt tacacacaat agcaatcaga ataacagcaa ttgttcttct 1980
ccatctcgga tgtctgattc agtttctctt aatactgata gtagtcaaga cacctcactc 2040
tgctctccag tgaaacaaac tcatattgat attaattcca aaatcaggca agaagatgaa 2100
aattttaaca gccttttaca aaatggagat attttaaaca gttcaacaga ggaaaagttc 2160
aaagctcatg ataaaaaaga ttttaactta cctgaatatg atttgaatgt tgaagagcga 2220
ttagttctaa ttgagaaaag tgttgactca acagccacag ctgatgacac tcacaaatta 2280
gatcatatca atatgaatct taataaactt ataactaatg atacatttca accagagatc 2340
atggaaagat caaaaacaca ggatattgtg cttggaacaa gctttttaag cattaattct 2400
aaagaggaaa ctgagcactt ggaaaatgga aacaagtatc ctaatttgga atccgtaaat 2460
aaggtaaatg gacattctga ggaaacttcc cagtctccta ataggactga accacatgac 2520
agtgattgtt ctgttgactt aggtatttcc aaaagcactg aagatctctc ccctcagaaa 2580
agtggtccag ttggatctgt tgtgaaatct catagcataa ctaatatgga gattggaggg 2640
ctaaaaatct atgatattct tagtgataat ggacctcagc agccaagtac aaccgttaaa 2700
atcacatctg ctgttgatgg aaaaaatata gtcaggagca agtctgccac actgttgtat 2760
gatcaaccat tgcaggtatt tactggttct tcctcatctt ctgatttaat atcaggaaca 2820
aaggcaattt tcaagtttga ttcaaatcat aatcccgaag agccaaatat aataagaggc 2880
cccacaagtg gcccacaatc tgcacctcaa atatatggtc ctccacagta taatatccaa 2940
tacagtagca gtgctgcagt caaagacact ttgtggcact ccaaacaaaa tccccaaata 3000
gaccatgcca gttttcctcc tcagctcctt cctagatcag agagcacaga aaatcaaagt 3060
tatgctaaac attctgccaa tatgaatttc tctaatcata acaatgttcg agctaatact 3120
gcataccatt tacatcagag acttggccca gcaagacatg gggaaatgtg ggccatctca 3180
ccaaacgacc gacttattcc tgcagtaact cgaagtacaa tccagcgaca aagtagtgtg 3240
tcctccacag cctctgtaaa tcttggtgat ccaggctcta caaggcgggc tcagattcct 3300
gaaggagatt atttatcata cagagagttc cactcagcgg gaagaactcc tccaatgatg 3360
ccaggatcac agagacccct ttctgcacga acatacagca tagatggtcc aaatgcatca 3420
agacctcaga gtgctcgacc ctctattaat gaaataccag agagaactat gtcagttagt 3480
gatttcaatt attcacggac tagtccttca aaaagaccaa atgcaagggt tggttctgag 3540
cattctttat tagatcctcc aggaaaaagt aaagttcctc gtgactggag agaacaagta 3600
cttcgacata ttgaagccaa aaagttagaa aagattcgag tgagggttga aaaggatcca 3660
gaacttggat ttagcatatc aggtggtgtc gggggtagag gaaacccatt cagacctgat 3720
gatgatggta tatttgtaac aagggtacaa cctgaaggac cagcatcaaa attactgcag 3780
ccaggtgata aaattattca g 3801
<210> 130
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of ERBB2IP gene fragment
<400> 130
cagccaggtg ataaaattat tcag 24
<210> 131
<211> 1302
<212> PRT
<213> Artificial Sequence
<220>
<223> ERBB2IP protein
<400> 131
Met Thr Thr Lys Arg Ser Leu Phe Val Arg Leu Val Pro Cys Arg Cys
1 5 10 15
Leu Arg Gly Glu Glu Glu Thr Val Thr Thr Leu Asp Tyr Ser His Cys
20 25 30
Ser Leu Glu Gln Val Pro Lys Glu Ile Phe Thr Phe Glu Lys Thr Leu
35 40 45
Glu Glu Leu Tyr Leu Asp Ala Asn Gln Ile Glu Glu Leu Pro Lys Gln
50 55 60
Leu Phe Asn Cys Gln Ser Leu His Lys Leu Ser Leu Pro Asp Asn Asp
65 70 75 80
Leu Thr Thr Leu Pro Ala Ser Ile Ala Asn Leu Ile Asn Leu Arg Glu
85 90 95
Leu Asp Val Ser Lys Asn Gly Ile Gln Glu Phe Pro Glu Asn Ile Lys
100 105 110
Asn Cys Lys Val Leu Thr Ile Val Glu Ala Ser Val Asn Pro Ile Ser
115 120 125
Lys Leu Pro Asp Gly Phe Ser Gln Leu Leu Asn Leu Thr Gln Leu Tyr
130 135 140
Leu Asn Asp Ala Phe Leu Glu Phe Leu Pro Ala Asn Phe Gly Arg Leu
145 150 155 160
Thr Lys Leu Gln Ile Leu Glu Leu Arg Glu Asn Gln Leu Lys Met Leu
165 170 175
Pro Lys Thr Met Asn Arg Leu Thr Gln Leu Glu Arg Leu Asp Leu Gly
180 185 190
Ser Asn Glu Phe Thr Glu Val Pro Glu Val Leu Glu Gln Leu Ser Gly
195 200 205
Leu Lys Glu Phe Trp Met Asp Ala Asn Arg Leu Thr Phe Ile Pro Gly
210 215 220
Phe Ile Gly Ser Leu Lys Gln Leu Thr Tyr Leu Asp Val Ser Lys Asn
225 230 235 240
Asn Ile Glu Met Val Glu Glu Gly Ile Ser Thr Cys Glu Asn Leu Gln
245 250 255
Asp Leu Leu Leu Ser Ser Asn Ser Leu Gln Gln Leu Pro Glu Thr Ile
260 265 270
Gly Ser Leu Lys Asn Ile Thr Thr Leu Lys Ile Asp Glu Asn Gln Leu
275 280 285
Met Tyr Leu Pro Asp Ser Ile Gly Gly Leu Ile Ser Val Glu Glu Leu
290 295 300
Asp Cys Ser Phe Asn Glu Val Glu Ala Leu Pro Ser Ser Ile Gly Gln
305 310 315 320
Leu Thr Asn Leu Arg Thr Phe Ala Ala Asp His Asn Tyr Leu Gln Gln
325 330 335
Leu Pro Pro Glu Ile Gly Ser Trp Lys Asn Ile Thr Val Leu Phe Leu
340 345 350
His Ser Asn Lys Leu Glu Thr Leu Pro Glu Glu Met Gly Asp Met Gln
355 360 365
Lys Leu Lys Val Ile Asn Leu Ser Asp Asn Arg Leu Lys Asn Leu Pro
370 375 380
Phe Ser Phe Thr Lys Leu Gln Gln Leu Thr Ala Met Trp Leu Ser Asp
385 390 395 400
Asn Gln Ser Lys Pro Leu Ile Pro Leu Gln Lys Glu Thr Asp Ser Glu
405 410 415
Thr Gln Lys Met Val Leu Thr Asn Tyr Met Phe Pro Gln Gln Pro Arg
420 425 430
Thr Glu Asp Val Met Phe Ile Ser Asp Asn Glu Ser Phe Asn Pro Ser
435 440 445
Leu Trp Glu Glu Gln Arg Lys Gln Arg Ala Gln Val Ala Phe Glu Cys
450 455 460
Asp Glu Asp Lys Asp Glu Arg Glu Ala Pro Pro Arg Glu Gly Asn Leu
465 470 475 480
Lys Arg Tyr Pro Thr Pro Tyr Pro Asp Glu Leu Lys Asn Met Val Lys
485 490 495
Thr Val Gln Thr Ile Val His Arg Leu Lys Asp Glu Glu Thr Asn Glu
500 505 510
Asp Ser Gly Arg Asp Leu Lys Pro His Glu Asp Gln Gln Asp Ile Asn
515 520 525
Lys Asp Val Gly Val Lys Thr Ser Glu Ser Thr Thr Thr Val Lys Ser
530 535 540
Lys Val Asp Glu Arg Glu Lys Tyr Met Ile Gly Asn Ser Val Gln Lys
545 550 555 560
Ile Ser Glu Pro Glu Ala Glu Ile Ser Pro Gly Ser Leu Pro Val Thr
565 570 575
Ala Asn Met Lys Ala Ser Glu Asn Leu Lys His Ile Val Asn His Asp
580 585 590
Asp Val Phe Glu Glu Ser Glu Glu Leu Ser Ser Asp Glu Glu Met Lys
595 600 605
Met Ala Glu Met Arg Pro Pro Leu Ile Glu Thr Ser Ile Asn Gln Pro
610 615 620
Lys Val Val Ala Leu Ser Asn Asn Lys Lys Asp Asp Thr Lys Glu Thr
625 630 635 640
Asp Ser Leu Ser Asp Glu Val Thr His Asn Ser Asn Gln Asn Asn Ser
645 650 655
Asn Cys Ser Ser Pro Ser Arg Met Ser Asp Ser Val Ser Leu Asn Thr
660 665 670
Asp Ser Ser Gln Asp Thr Ser Leu Cys Ser Pro Val Lys Gln Thr His
675 680 685
Ile Asp Ile Asn Ser Lys Ile Arg Gln Glu Asp Glu Asn Phe Asn Ser
690 695 700
Leu Leu Gln Asn Gly Asp Ile Leu Asn Ser Ser Thr Glu Glu Lys Phe
705 710 715 720
Lys Ala His Asp Lys Lys Asp Phe Asn Leu Pro Glu Tyr Asp Leu Asn
725 730 735
Val Glu Glu Arg Leu Val Leu Ile Glu Lys Ser Val Asp Ser Thr Ala
740 745 750
Thr Ala Asp Asp Thr His Lys Leu Asp His Ile Asn Met Asn Leu Asn
755 760 765
Lys Leu Ile Thr Asn Asp Thr Phe Gln Pro Glu Ile Met Glu Arg Ser
770 775 780
Lys Thr Gln Asp Ile Val Leu Gly Thr Ser Phe Leu Ser Ile Asn Ser
785 790 795 800
Lys Glu Glu Thr Glu His Leu Glu Asn Gly Asn Lys Tyr Pro Asn Leu
805 810 815
Glu Ser Val Asn Lys Val Asn Gly His Ser Glu Glu Thr Ser Gln Ser
820 825 830
Pro Asn Arg Thr Glu Pro His Asp Ser Asp Cys Ser Val Asp Leu Gly
835 840 845
Ile Ser Lys Ser Thr Glu Asp Leu Ser Pro Gln Lys Ser Gly Pro Val
850 855 860
Gly Ser Val Val Lys Ser His Ser Ile Thr Asn Met Glu Ile Gly Gly
865 870 875 880
Leu Lys Ile Tyr Asp Ile Leu Ser Asp Asn Gly Pro Gln Gln Pro Ser
885 890 895
Thr Thr Val Lys Ile Thr Ser Ala Val Asp Gly Lys Asn Ile Val Arg
900 905 910
Ser Lys Ser Ala Thr Leu Leu Tyr Asp Gln Pro Leu Gln Val Phe Thr
915 920 925
Gly Ser Ser Ser Ser Ser Asp Leu Ile Ser Gly Thr Lys Ala Ile Phe
930 935 940
Lys Phe Asp Ser Asn His Asn Pro Glu Glu Pro Asn Ile Ile Arg Gly
945 950 955 960
Pro Thr Ser Gly Pro Gln Ser Ala Pro Gln Ile Tyr Gly Pro Pro Gln
965 970 975
Tyr Asn Ile Gln Tyr Ser Ser Ser Ala Ala Val Lys Asp Thr Leu Trp
980 985 990
His Ser Lys Gln Asn Pro Gln Ile Asp His Ala Ser Phe Pro Pro Gln
995 1000 1005
Leu Leu Pro Arg Ser Glu Ser Thr Glu Asn Gln Ser Tyr Ala Lys His
1010 1015 1020
Ser Ala Asn Met Asn Phe Ser Asn His Asn Asn Val Arg Ala Asn Thr
1025 1030 1035 1040
Ala Tyr His Leu His Gln Arg Leu Gly Pro Ala Arg His Gly Glu Met
1045 1050 1055
Trp Ala Ile Ser Pro Asn Asp Arg Leu Ile Pro Ala Val Thr Arg Ser
1060 1065 1070
Thr Ile Gln Arg Gln Ser Ser Val Ser Ser Thr Ala Ser Val Asn Leu
1075 1080 1085
Gly Asp Pro Gly Ser Thr Arg Arg Ala Gln Ile Pro Glu Gly Asp Tyr
1090 1095 1100
Leu Ser Tyr Arg Glu Phe His Ser Ala Gly Arg Thr Pro Pro Met Met
1105 1110 1115 1120
Pro Gly Ser Gln Arg Pro Leu Ser Ala Arg Thr Tyr Ser Ile Asp Gly
1125 1130 1135
Pro Asn Ala Ser Arg Pro Gln Ser Ala Arg Pro Ser Ile Asn Glu Ile
1140 1145 1150
Pro Glu Arg Thr Met Ser Val Ser Asp Phe Asn Tyr Ser Arg Thr Ser
1155 1160 1165
Pro Ser Lys Arg Pro Asn Ala Arg Val Gly Ser Glu His Ser Leu Leu
1170 1175 1180
Asp Pro Pro Gly Lys Ser Lys Val Pro Arg Asp Trp Arg Glu Gln Val
1185 1190 1195 1200
Leu Arg His Ile Glu Ala Lys Lys Leu Glu Lys Ile Arg Val Arg Val
1205 1210 1215
Glu Lys Asp Pro Glu Leu Gly Phe Ser Ile Ser Gly Gly Val Gly Gly
1220 1225 1230
Arg Gly Asn Pro Phe Arg Pro Asp Asp Asp Gly Ile Phe Val Thr Arg
1235 1240 1245
Val Gln Pro Glu Gly Pro Ala Ser Lys Leu Leu Gln Pro Gly Asp Lys
1250 1255 1260
Ile Ile Gln Ala Asn Gly Tyr Ser Phe Ile Asn Ile Glu His Gly Gln
1265 1270 1275 1280
Ala Val Ser Leu Leu Lys Thr Phe Gln Asn Thr Val Glu Leu Ile Ile
1285 1290 1295
Val Arg Glu Val Ser Ser
1300
<210> 132
<211> 1267
<212> PRT
<213> Artificial Sequence
<220>
<223> ERBB2IP protein fragment
<400> 132
Met Thr Thr Lys Arg Ser Leu Phe Val Arg Leu Val Pro Cys Arg Cys
1 5 10 15
Leu Arg Gly Glu Glu Glu Thr Val Thr Thr Leu Asp Tyr Ser His Cys
20 25 30
Ser Leu Glu Gln Val Pro Lys Glu Ile Phe Thr Phe Glu Lys Thr Leu
35 40 45
Glu Glu Leu Tyr Leu Asp Ala Asn Gln Ile Glu Glu Leu Pro Lys Gln
50 55 60
Leu Phe Asn Cys Gln Ser Leu His Lys Leu Ser Leu Pro Asp Asn Asp
65 70 75 80
Leu Thr Thr Leu Pro Ala Ser Ile Ala Asn Leu Ile Asn Leu Arg Glu
85 90 95
Leu Asp Val Ser Lys Asn Gly Ile Gln Glu Phe Pro Glu Asn Ile Lys
100 105 110
Asn Cys Lys Val Leu Thr Ile Val Glu Ala Ser Val Asn Pro Ile Ser
115 120 125
Lys Leu Pro Asp Gly Phe Ser Gln Leu Leu Asn Leu Thr Gln Leu Tyr
130 135 140
Leu Asn Asp Ala Phe Leu Glu Phe Leu Pro Ala Asn Phe Gly Arg Leu
145 150 155 160
Thr Lys Leu Gln Ile Leu Glu Leu Arg Glu Asn Gln Leu Lys Met Leu
165 170 175
Pro Lys Thr Met Asn Arg Leu Thr Gln Leu Glu Arg Leu Asp Leu Gly
180 185 190
Ser Asn Glu Phe Thr Glu Val Pro Glu Val Leu Glu Gln Leu Ser Gly
195 200 205
Leu Lys Glu Phe Trp Met Asp Ala Asn Arg Leu Thr Phe Ile Pro Gly
210 215 220
Phe Ile Gly Ser Leu Lys Gln Leu Thr Tyr Leu Asp Val Ser Lys Asn
225 230 235 240
Asn Ile Glu Met Val Glu Glu Gly Ile Ser Thr Cys Glu Asn Leu Gln
245 250 255
Asp Leu Leu Leu Ser Ser Asn Ser Leu Gln Gln Leu Pro Glu Thr Ile
260 265 270
Gly Ser Leu Lys Asn Ile Thr Thr Leu Lys Ile Asp Glu Asn Gln Leu
275 280 285
Met Tyr Leu Pro Asp Ser Ile Gly Gly Leu Ile Ser Val Glu Glu Leu
290 295 300
Asp Cys Ser Phe Asn Glu Val Glu Ala Leu Pro Ser Ser Ile Gly Gln
305 310 315 320
Leu Thr Asn Leu Arg Thr Phe Ala Ala Asp His Asn Tyr Leu Gln Gln
325 330 335
Leu Pro Pro Glu Ile Gly Ser Trp Lys Asn Ile Thr Val Leu Phe Leu
340 345 350
His Ser Asn Lys Leu Glu Thr Leu Pro Glu Glu Met Gly Asp Met Gln
355 360 365
Lys Leu Lys Val Ile Asn Leu Ser Asp Asn Arg Leu Lys Asn Leu Pro
370 375 380
Phe Ser Phe Thr Lys Leu Gln Gln Leu Thr Ala Met Trp Leu Ser Asp
385 390 395 400
Asn Gln Ser Lys Pro Leu Ile Pro Leu Gln Lys Glu Thr Asp Ser Glu
405 410 415
Thr Gln Lys Met Val Leu Thr Asn Tyr Met Phe Pro Gln Gln Pro Arg
420 425 430
Thr Glu Asp Val Met Phe Ile Ser Asp Asn Glu Ser Phe Asn Pro Ser
435 440 445
Leu Trp Glu Glu Gln Arg Lys Gln Arg Ala Gln Val Ala Phe Glu Cys
450 455 460
Asp Glu Asp Lys Asp Glu Arg Glu Ala Pro Pro Arg Glu Gly Asn Leu
465 470 475 480
Lys Arg Tyr Pro Thr Pro Tyr Pro Asp Glu Leu Lys Asn Met Val Lys
485 490 495
Thr Val Gln Thr Ile Val His Arg Leu Lys Asp Glu Glu Thr Asn Glu
500 505 510
Asp Ser Gly Arg Asp Leu Lys Pro His Glu Asp Gln Gln Asp Ile Asn
515 520 525
Lys Asp Val Gly Val Lys Thr Ser Glu Ser Thr Thr Thr Val Lys Ser
530 535 540
Lys Val Asp Glu Arg Glu Lys Tyr Met Ile Gly Asn Ser Val Gln Lys
545 550 555 560
Ile Ser Glu Pro Glu Ala Glu Ile Ser Pro Gly Ser Leu Pro Val Thr
565 570 575
Ala Asn Met Lys Ala Ser Glu Asn Leu Lys His Ile Val Asn His Asp
580 585 590
Asp Val Phe Glu Glu Ser Glu Glu Leu Ser Ser Asp Glu Glu Met Lys
595 600 605
Met Ala Glu Met Arg Pro Pro Leu Ile Glu Thr Ser Ile Asn Gln Pro
610 615 620
Lys Val Val Ala Leu Ser Asn Asn Lys Lys Asp Asp Thr Lys Glu Thr
625 630 635 640
Asp Ser Leu Ser Asp Glu Val Thr His Asn Ser Asn Gln Asn Asn Ser
645 650 655
Asn Cys Ser Ser Pro Ser Arg Met Ser Asp Ser Val Ser Leu Asn Thr
660 665 670
Asp Ser Ser Gln Asp Thr Ser Leu Cys Ser Pro Val Lys Gln Thr His
675 680 685
Ile Asp Ile Asn Ser Lys Ile Arg Gln Glu Asp Glu Asn Phe Asn Ser
690 695 700
Leu Leu Gln Asn Gly Asp Ile Leu Asn Ser Ser Thr Glu Glu Lys Phe
705 710 715 720
Lys Ala His Asp Lys Lys Asp Phe Asn Leu Pro Glu Tyr Asp Leu Asn
725 730 735
Val Glu Glu Arg Leu Val Leu Ile Glu Lys Ser Val Asp Ser Thr Ala
740 745 750
Thr Ala Asp Asp Thr His Lys Leu Asp His Ile Asn Met Asn Leu Asn
755 760 765
Lys Leu Ile Thr Asn Asp Thr Phe Gln Pro Glu Ile Met Glu Arg Ser
770 775 780
Lys Thr Gln Asp Ile Val Leu Gly Thr Ser Phe Leu Ser Ile Asn Ser
785 790 795 800
Lys Glu Glu Thr Glu His Leu Glu Asn Gly Asn Lys Tyr Pro Asn Leu
805 810 815
Glu Ser Val Asn Lys Val Asn Gly His Ser Glu Glu Thr Ser Gln Ser
820 825 830
Pro Asn Arg Thr Glu Pro His Asp Ser Asp Cys Ser Val Asp Leu Gly
835 840 845
Ile Ser Lys Ser Thr Glu Asp Leu Ser Pro Gln Lys Ser Gly Pro Val
850 855 860
Gly Ser Val Val Lys Ser His Ser Ile Thr Asn Met Glu Ile Gly Gly
865 870 875 880
Leu Lys Ile Tyr Asp Ile Leu Ser Asp Asn Gly Pro Gln Gln Pro Ser
885 890 895
Thr Thr Val Lys Ile Thr Ser Ala Val Asp Gly Lys Asn Ile Val Arg
900 905 910
Ser Lys Ser Ala Thr Leu Leu Tyr Asp Gln Pro Leu Gln Val Phe Thr
915 920 925
Gly Ser Ser Ser Ser Ser Asp Leu Ile Ser Gly Thr Lys Ala Ile Phe
930 935 940
Lys Phe Asp Ser Asn His Asn Pro Glu Glu Pro Asn Ile Ile Arg Gly
945 950 955 960
Pro Thr Ser Gly Pro Gln Ser Ala Pro Gln Ile Tyr Gly Pro Pro Gln
965 970 975
Tyr Asn Ile Gln Tyr Ser Ser Ser Ala Ala Val Lys Asp Thr Leu Trp
980 985 990
His Ser Lys Gln Asn Pro Gln Ile Asp His Ala Ser Phe Pro Pro Gln
995 1000 1005
Leu Leu Pro Arg Ser Glu Ser Thr Glu Asn Gln Ser Tyr Ala Lys His
1010 1015 1020
Ser Ala Asn Met Asn Phe Ser Asn His Asn Asn Val Arg Ala Asn Thr
1025 1030 1035 1040
Ala Tyr His Leu His Gln Arg Leu Gly Pro Ala Arg His Gly Glu Met
1045 1050 1055
Trp Ala Ile Ser Pro Asn Asp Arg Leu Ile Pro Ala Val Thr Arg Ser
1060 1065 1070
Thr Ile Gln Arg Gln Ser Ser Val Ser Ser Thr Ala Ser Val Asn Leu
1075 1080 1085
Gly Asp Pro Gly Ser Thr Arg Arg Ala Gln Ile Pro Glu Gly Asp Tyr
1090 1095 1100
Leu Ser Tyr Arg Glu Phe His Ser Ala Gly Arg Thr Pro Pro Met Met
1105 1110 1115 1120
Pro Gly Ser Gln Arg Pro Leu Ser Ala Arg Thr Tyr Ser Ile Asp Gly
1125 1130 1135
Pro Asn Ala Ser Arg Pro Gln Ser Ala Arg Pro Ser Ile Asn Glu Ile
1140 1145 1150
Pro Glu Arg Thr Met Ser Val Ser Asp Phe Asn Tyr Ser Arg Thr Ser
1155 1160 1165
Pro Ser Lys Arg Pro Asn Ala Arg Val Gly Ser Glu His Ser Leu Leu
1170 1175 1180
Asp Pro Pro Gly Lys Ser Lys Val Pro Arg Asp Trp Arg Glu Gln Val
1185 1190 1195 1200
Leu Arg His Ile Glu Ala Lys Lys Leu Glu Lys Ile Arg Val Arg Val
1205 1210 1215
Glu Lys Asp Pro Glu Leu Gly Phe Ser Ile Ser Gly Gly Val Gly Gly
1220 1225 1230
Arg Gly Asn Pro Phe Arg Pro Asp Asp Asp Gly Ile Phe Val Thr Arg
1235 1240 1245
Val Gln Pro Glu Gly Pro Ala Ser Lys Leu Leu Gln Pro Gly Asp Lys
1250 1255 1260
Ile Ile Gln
1265
<210> 133
<211> 8
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of ERBB2IP protein fragment
<400> 133
Gln Pro Gly Asp Lys Ile Ile Gln
1 5
<210> 134
<211> 7872
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of MAST4 gene (NM_001164664)
<400> 134
atgggggaga aagtttcgga ggcgccagag ccggtgcccc gcggctgcag tggccacggc 60
agccggactc cagcctctgc gctggtcgcc gcgtcctctc cgggtgcttc ctcggccgag 120
tcctcctcgg gctcagaaac tctgtcggag gaaggggagc ccggcggctt ctccagagag 180
catcagccgc cgccgccgcc gccgttggga ggcaccctgg gcgcccgggc gcccgccgcg 240
tgggctccgg caagcgtgct gctggagcgc ggagtccttg cgctgccgcc gccgcttccc 300
ggaggagctg tgccgcccgc gccccggggc agcagcgcgt cccaggagga gcaggacgag 360
gagcttgacc acatattatc ccctccaccc atgccgtttc ggaaatgcag caacccagat 420
gtggcttctg gccctggaaa atcactgaag tataaaagac agctgagtga ggatggaaga 480
cagctaaggc gagggagcct gggaggagcc ctgactggga ggtaccttct tccaaacccg 540
gtggcgggac aggcctggcc ggcctctgca gagacgtcca acctcgtgcg catgcgcagc 600
caggccctgg gccagtcggc gccctcgctc accgccagcc tgaaggagct gagtctcccc 660
agaagaggaa gtttttgccg aacaagcaac cggaaaagct taataggcaa tgggcagtca 720
ccagcattgc ctcgaccaca ctcacctctc tctgctcatg caggaaatag ccctcaagat 780
agtccaagaa atttctcccc cagtgcctca gcccattttt catttgcacg gaggactgat 840
ggacgccgct ggtcgttggc ttctctccct tcctctggct atgggacaaa cacacccagc 900
tctacggtct cttcatcctg ttcctcccag gagaagttgc atcagttacc ataccaacca 960
acaccagacg agttacactt cttatcaaaa catttctgta ccaccgaaag catcgccact 1020
gagaacagat gcaggaacac gccgatgcgc ccccgttccc gaagtctgag ccctggacgt 1080
tctcccgcct gctgtgacca tgaaataatt atgatgaacc atgtctacaa agaaaggttc 1140
ccaaaggcta cagctcagat ggaagaacgt ctaaaggaaa ttatcaccag ctactctcct 1200
gacaacgttc tacccttagc agatggagtg cttagtttca ctcaccacca gattattgaa 1260
ctggctcgag attgcttgga taaatcccac cagggcctca tcacctcacg atacttcctt 1320
gaattacagc acaaattaga taagttgcta caggaggctc atgatcgttc agaaagtgga 1380
gaattggcat ttattaaaca actagttcga aagatcctaa ttgttattgc ccgccctgct 1440
cggttattag agtgcctgga atttgatccg gaagaatttt actacctatt ggaagcagca 1500
gaaggccatg ccaaagaagg acagggtatt aaaaccgaca ttcccaggta catcattagc 1560
caactgggac tcaataagga tcccttggaa gaaatggctc atttgggaaa ctacgatagt 1620
gggacagcag aaacaccaga aacagatgaa tcagtgagta gctctaatgc ctccctgaaa 1680
cttcgaagga aacctcggga aagtgatttt gaaacgatta aattgattag caatggagcc 1740
tatggggcag tctactttgt tcggcataaa gaatcccggc agaggtttgc catgaagaag 1800
attaataaac agaacctcat ccttcgaaac cagatccagc aggcctttgt ggagcgggat 1860
atcctgactt ttgcagaaaa cccctttgtt gtcagcatgt attgctcctt tgaaacaagg 1920
cgccacttgt gcatggtcat ggaatatgtg gaagggggag actgtgctac tttaatgaaa 1980
aacatgggtc ctctccctgt tgatatggcc agaatgtact ttgctgagac ggtcttggcc 2040
ttggaatatt tacataatta tggaattgta cacagggatt tgaaaccaga caacttgttg 2100
gttacctcca tggggcacat aaagctgaca gattttggat tatctaaggt gggactaatg 2160
agcatgacta ccaaccttta cgagggtcat attgagaagg atgctagaga gttcctggat 2220
aaacaggtct gtggcacacc tgaatacatt gcaccagaag tgattctgag gcagggttat 2280
ggaaagccgg tggactggtg ggccatgggg attatcctct atgaatttct ggttggatgc 2340
gtgccattct ttggggatac tccagaggag ctatttggac aagtcatcag tgatgagatc 2400
aactggcctg agaaggatga ggcaccccca cctgatgccc aggatctgat taccttactc 2460
ctcaggcaga atcccctgga gaggctggga acaggtggtg catatgaagt caaacagcat 2520
cgattcttcc gttctttaga ctggaacagt ttgctgagac agaaggcaga atttattccc 2580
caactggaat ctgaggatga cacaagttat tttgatactc ggtctgagaa gtatcatcat 2640
atggaaacgg aggaagaaga tgacacaaat gatgaagact ttaatgtgga aataaggcag 2700
ttttcttcat gttcacacag gttttcaaaa gttttcagca gtatagatcg aatcactcag 2760
aattcagcag aagagaagga agactctgtg gacaaaacca aaagcaccac cttgccatcc 2820
acagaaacac tgagctggag ttcagaatat tctgaaatgc aacagctatc aacatccaac 2880
tcttcagata ctgaaagcaa cagacataaa ctcagttctg gcctacttcc caaactggct 2940
atttcaacag agggagagca agatgaagct gcctcctgcc ctggagaccc ccatgaggag 3000
ccaggaaagc cagcccttcc tcctgaagag tgtgcccagg aggagcctga ggtcaccacc 3060
ccagccagca ccatcagcag ctccaccctg tcagttggca gtttttcaga gcacttggat 3120
cagataaatg gacgaagcga gtgtgtggac agtacagata attcctcaaa gccatccagt 3180
gaacccgctt ctcacatggc tcggcagcga ttagaaagca cagaaaaaaa gaaaatctcg 3240
gggaaagtca caaagtccct ctctgccagt gctctttccc tcatgatccc aggagatatg 3300
tttgctgttt cccctctggg aagtccaatg tctccccatt ccctgtcctc ggacccttct 3360
tcttcacgag attcctctcc cagccgagat tcctcagcag cttctgccag tccacatcag 3420
ccgattgtga tccacagttc ggggaagaac tacggcttta ccatccgagc catccgggtg 3480
tatgtgggag acagtgacat ctatacagtg caccatatcg tctggaatgt agaagaagga 3540
agtccggcat gccaggcagg actgaaggct ggagatctta tcactcacat caatggagaa 3600
ccagtgcatg gacttgtcca cacagaagtt atagaactcc tactgaagag tgggaataag 3660
gtgtcaatca ctactacccc atttgaaaac acatcaatca aaactggacc agccaggaga 3720
aacagctata agagccggat ggtgaggcgg agcaagaaat ccaagaagaa agaaagtctc 3780
gaaaggagga gatctctttt caaaaagcta gccaagcagc cttctccttt actccacacc 3840
agccgaagtt tctcctgctt gaacagatcc ctgtcatcgg gtgagagcct cccaggttcc 3900
cccactcata gcttgtctcc ccggtctcca acaccaagct accgctccac ccctgacttc 3960
ccatctggta ctaattcctc ccagagcagc tcccctagtt ctagtgcccc caattcccca 4020
gcagggtccg ggcacatccg gcccagcact ctccacggtc ttgcacccaa actcggcggg 4080
cagcggtacc ggtccggaag gcgaaagtcc gccggcaaca tcccactgtc cccgctggcc 4140
cggacgccct ctccaacccc gcaacccacc tccccgcagc ggtcaccatc ccctcttctg 4200
ggacactcac tgggcaattc caagatcgcg caagcctttc ccagcaagat gcactccccg 4260
cccaccatcg tcagacacat cgtgaggccc aagagtgcgg agccccccag gtccccgctg 4320
ctcaagcgcg tgcagtccga ggagaagctg tcgccctctt acggcagtga caagaagcac 4380
ctgtgctccc gcaagcacag cctggaggtg acccaagagg aggtgcagcg ggagcagtcc 4440
cagcgggagg cgccgctgca gagcctggat gagaacgtgt gcgacgtgcc gccgctcagc 4500
cgcgcccggc cagtggagca aggctgcctg aaacgcccag tctcccggaa ggtgggccgc 4560
caggagtctg tggacgacct ggaccgcgac aagctgaagg ccaaggtggt ggtgaagaaa 4620
gcagacggct tcccagagaa acaggaatcc caccagaaat cccatggacc cgggagtgat 4680
ttggaaaact ttgctctgtt taagctggaa gagagagaga agaaagtcta tccgaaggct 4740
gtggaaaggt caagtacttt tgaaaacaaa gcgtctatgc aggaggcgcc accgctgggc 4800
agcctgctga aggatgctct tcacaagcag gccagcgtgc gcgccagcga gggtgcgatg 4860
tcggatggcc gggtgcctgc ggagcaccgc cagggtggcg gggacttcag acgggccccc 4920
gctcctggca ccctccagga tggtctctgc cactccctcg acaggggcat ctctgggaag 4980
ggggaaggca cggagaagtc ctcccaggcc aaggagcttc tccgatgtga aaagttagac 5040
agcaagctgg ccaacatcga ttacctccga aagaaaatgt cacttgagga caaagaggac 5100
aacctctgcc ctgtgctgaa gcccaagatg acagctggct cccacgaatg cctgccaggg 5160
aacccagtcc gacccacggg tgggcagcag gagcccccgc cggcttctga gagccgagct 5220
tttgtcagca gcacccatgc agctcagatg agtgccgtct cttttgttcc cctcaaggcc 5280
ttaacaggcc gggtggacag tggaacggag aagcctggct tggttgctcc tgagtcccct 5340
gttaggaaga gcccctccga gtataagctg gaaggtaggt ctgtctcatg cctgaagccg 5400
atcgagggca ctctggacat tgctctcctg tccggacctc aggcctccaa gacagaactg 5460
ccttccccag agtctgcaca gagccccagc ccaagtggtg acgtgagggc ctctgtgcca 5520
ccagttctcc ccagcagcag tgggaaaaag aacgatacca ccagtgcaag agagctttct 5580
ccttccagct taaagatgaa taaatcctac ctgctggagc cttggttcct gccccccagc 5640
cgaggtctcc agaattcacc agcagtttcc ctgcctgacc cagagttcaa gagggacagg 5700
aaaggtcccc atcctactgc caggagccct ggaacagtca tggaaagcaa tccccaacag 5760
agagagggca gctcccctaa acaccaagac cacaccactg accccaagct tctgacctgc 5820
ctggggcaga acctccacag ccctgacctg gccaggccac gctgcccgct cccacctgaa 5880
gcttccccct caagggagaa gccaggcctg agggaatcgt ctgaaagagg ccctcccaca 5940
gccagaagcg agcgctctgc tgcgagggct gacacatgca gagagccctc catggaactg 6000
tgctttccag aaactgcgaa aaccagtgac aactccaaaa atctcctctc tgtgggaagg 6060
acccacccag atttctatac acagacccag gccatggaga aagcatgggc gccgggtggg 6120
aaaacgaacc acaaagatgg cccaggtgag gcgaggcccc cgcccagaga caactcctct 6180
ctgcactcag ctggaattcc ctgtgagaag gagctgggca aggtgaggcg tggcgtggaa 6240
cccaagcccg aagcgcttct tgccaggcgg tctctgcagc cacctggaat tgagagtgag 6300
aagagtgaaa agctctccag tttcccatct ttgcagaaag atggtgccaa ggaacctgaa 6360
aggaaggagc agcctctaca aaggcatccc agcagcatcc ctccgccccc tctgacggcc 6420
aaagacctgt ccagcccggc tgccaggcag cattgcagtt ccccaagcca cgcttctggc 6480
agagagccgg gggccaagcc cagcactgca gagcccagct cgagccccca ggaccctccc 6540
aagcctgttg ctgcgcacag tgaaagcagc agccacaagc cccggcctgg ccctgacccg 6600
ggccctccaa agactaagca ccccgaccgg tccctctcct ctcagaaacc aagtgtcggg 6660
gccacaaagg gcaaagagcc tgccactcag tccctcggtg gctctagcag agaggggaag 6720
ggccacagta agagtgggcc ggatgtgttt cctgctaccc caggctccca gaacaaagcc 6780
agcgatggga ttggccaggg agaaggtggg ccctctgtcc cactgcacac tgacagggct 6840
cctctagacg ccaagccaca acccaccagt ggtgggcggc ccctggaggt gctggagaag 6900
cctgtgcatt tgccaaggcc gggacaccca gggcctagtg agccagcgga ccagaaactg 6960
tccgctgttg gtgaaaagca aaccctgtct ccaaagcacc ccaaaccatc cactgtgaaa 7020
gattgcccca ccctgtgcaa acagacagac aacagacaga cagacaaaag cccgagtcag 7080
ccggccgcca acaccgacag aagggcggaa gggaagaaat gcactgaagc actttatgct 7140
ccagcagagg gcgacaagct cgaggccggc ctttcctttg tgcatagcga gaaccggttg 7200
aaaggcgcgg agcggccagc cgcgggggtg gggaagggct tccctgaggc cagagggaaa 7260
gggcccggtc cccagaagcc accgacggag gcagacaagc ccaatggcat gaaacggtcc 7320
ccctcagcca ctgggcagag ttctttccga tccacggccc tcccggaaaa gtctctgagc 7380
tgctcctcca gcttccctga aaccagggcc ggagttagag aggcctctgc agccagcagc 7440
gacacctctt ctgccaaggc cgccgggggc atgctggagc ttccagcccc cagcaacagg 7500
gaccatagga aggctcagcc tgccggggag ggccgaaccc acatgacaaa gagtgactcc 7560
ctgccctcct tccgggtctc caccctgcct ctggagtcac accaccccga cccaaacacc 7620
atgggcgggg ccagccaccg ggacagggct ctctcggtga ctgccaccgt aggggaaacc 7680
aaagggaagg accctgcccc agcccagcct cccccagcta ggaaacagaa cgtgggcaga 7740
gacgtgacca agccatcccc agccccaaac actgaccgcc ccatctctct ttctaatgag 7800
aaggactttg tggtacggca gaggcggggg aaagagagtt tgcgtagcag ccctcacaaa 7860
aaggccttgt aa 7872
<210> 135
<211> 6726
<212> DNA
<213> Artificial Sequence
<220>
<223> MAST4 gene fragment
<400> 135
gctacagctc agatggaaga acgtctaaag gaaattatca ccagctactc tcctgacaac 60
gttctaccct tagcagatgg agtgcttagt ttcactcacc accagattat tgaactggct 120
cgagattgct tggataaatc ccaccagggc ctcatcacct cacgatactt ccttgaatta 180
cagcacaaat tagataagtt gctacaggag gctcatgatc gttcagaaag tggagaattg 240
gcatttatta aacaactagt tcgaaagatc ctaattgtta ttgcccgccc tgctcggtta 300
ttagagtgcc tggaatttga tccggaagaa ttttactacc tattggaagc agcagaaggc 360
catgccaaag aaggacaggg tattaaaacc gacattccca ggtacatcat tagccaactg 420
ggactcaata aggatccctt ggaagaaatg gctcatttgg gaaactacga tagtgggaca 480
gcagaaacac cagaaacaga tgaatcagtg agtagctcta atgcctccct gaaacttcga 540
aggaaacctc gggaaagtga ttttgaaacg attaaattga ttagcaatgg agcctatggg 600
gcagtctact ttgttcggca taaagaatcc cggcagaggt ttgccatgaa gaagattaat 660
aaacagaacc tcatccttcg aaaccagatc cagcaggcct ttgtggagcg ggatatcctg 720
acttttgcag aaaacccctt tgttgtcagc atgtattgct cctttgaaac aaggcgccac 780
ttgtgcatgg tcatggaata tgtggaaggg ggagactgtg ctactttaat gaaaaacatg 840
ggtcctctcc ctgttgatat ggccagaatg tactttgctg agacggtctt ggccttggaa 900
tatttacata attatggaat tgtacacagg gatttgaaac cagacaactt gttggttacc 960
tccatggggc acataaagct gacagatttt ggattatcta aggtgggact aatgagcatg 1020
actaccaacc tttacgaggg tcatattgag aaggatgcta gagagttcct ggataaacag 1080
gtctgtggca cacctgaata cattgcacca gaagtgattc tgaggcaggg ttatggaaag 1140
ccggtggact ggtgggccat ggggattatc ctctatgaat ttctggttgg atgcgtgcca 1200
ttctttgggg atactccaga ggagctattt ggacaagtca tcagtgatga gatcaactgg 1260
cctgagaagg atgaggcacc cccacctgat gcccaggatc tgattacctt actcctcagg 1320
cagaatcccc tggagaggct gggaacaggt ggtgcatatg aagtcaaaca gcatcgattc 1380
ttccgttctt tagactggaa cagtttgctg agacagaagg cagaatttat tccccaactg 1440
gaatctgagg atgacacaag ttattttgat actcggtctg agaagtatca tcatatggaa 1500
acggaggaag aagatgacac aaatgatgaa gactttaatg tggaaataag gcagttttct 1560
tcatgttcac acaggttttc aaaagttttc agcagtatag atcgaatcac tcagaattca 1620
gcagaagaga aggaagactc tgtggacaaa accaaaagca ccaccttgcc atccacagaa 1680
acactgagct ggagttcaga atattctgaa atgcaacagc tatcaacatc caactcttca 1740
gatactgaaa gcaacagaca taaactcagt tctggcctac ttcccaaact ggctatttca 1800
acagagggag agcaagatga agctgcctcc tgccctggag acccccatga ggagccagga 1860
aagccagccc ttcctcctga agagtgtgcc caggaggagc ctgaggtcac caccccagcc 1920
agcaccatca gcagctccac cctgtcagtt ggcagttttt cagagcactt ggatcagata 1980
aatggacgaa gcgagtgtgt ggacagtaca gataattcct caaagccatc cagtgaaccc 2040
gcttctcaca tggctcggca gcgattagaa agcacagaaa aaaagaaaat ctcggggaaa 2100
gtcacaaagt ccctctctgc cagtgctctt tccctcatga tcccaggaga tatgtttgct 2160
gtttcccctc tgggaagtcc aatgtctccc cattccctgt cctcggaccc ttcttcttca 2220
cgagattcct ctcccagccg agattcctca gcagcttctg ccagtccaca tcagccgatt 2280
gtgatccaca gttcggggaa gaactacggc tttaccatcc gagccatccg ggtgtatgtg 2340
ggagacagtg acatctatac agtgcaccat atcgtctgga atgtagaaga aggaagtccg 2400
gcatgccagg caggactgaa ggctggagat cttatcactc acatcaatgg agaaccagtg 2460
catggacttg tccacacaga agttatagaa ctcctactga agagtgggaa taaggtgtca 2520
atcactacta ccccatttga aaacacatca atcaaaactg gaccagccag gagaaacagc 2580
tataagagcc ggatggtgag gcggagcaag aaatccaaga agaaagaaag tctcgaaagg 2640
aggagatctc ttttcaaaaa gctagccaag cagccttctc ctttactcca caccagccga 2700
agtttctcct gcttgaacag atccctgtca tcgggtgaga gcctcccagg ttcccccact 2760
catagcttgt ctccccggtc tccaacacca agctaccgct ccacccctga cttcccatct 2820
ggtactaatt cctcccagag cagctcccct agttctagtg cccccaattc cccagcaggg 2880
tccgggcaca tccggcccag cactctccac ggtcttgcac ccaaactcgg cgggcagcgg 2940
taccggtccg gaaggcgaaa gtccgccggc aacatcccac tgtccccgct ggcccggacg 3000
ccctctccaa ccccgcaacc cacctccccg cagcggtcac catcccctct tctgggacac 3060
tcactgggca attccaagat cgcgcaagcc tttcccagca agatgcactc cccgcccacc 3120
atcgtcagac acatcgtgag gcccaagagt gcggagcccc ccaggtcccc gctgctcaag 3180
cgcgtgcagt ccgaggagaa gctgtcgccc tcttacggca gtgacaagaa gcacctgtgc 3240
tcccgcaagc acagcctgga ggtgacccaa gaggaggtgc agcgggagca gtcccagcgg 3300
gaggcgccgc tgcagagcct ggatgagaac gtgtgcgacg tgccgccgct cagccgcgcc 3360
cggccagtgg agcaaggctg cctgaaacgc ccagtctccc ggaaggtggg ccgccaggag 3420
tctgtggacg acctggaccg cgacaagctg aaggccaagg tggtggtgaa gaaagcagac 3480
ggcttcccag agaaacagga atcccaccag aaatcccatg gacccgggag tgatttggaa 3540
aactttgctc tgtttaagct ggaagagaga gagaagaaag tctatccgaa ggctgtggaa 3600
aggtcaagta cttttgaaaa caaagcgtct atgcaggagg cgccaccgct gggcagcctg 3660
ctgaaggatg ctcttcacaa gcaggccagc gtgcgcgcca gcgagggtgc gatgtcggat 3720
ggccgggtgc ctgcggagca ccgccagggt ggcggggact tcagacgggc ccccgctcct 3780
ggcaccctcc aggatggtct ctgccactcc ctcgacaggg gcatctctgg gaagggggaa 3840
ggcacggaga agtcctccca ggccaaggag cttctccgat gtgaaaagtt agacagcaag 3900
ctggccaaca tcgattacct ccgaaagaaa atgtcacttg aggacaaaga ggacaacctc 3960
tgccctgtgc tgaagcccaa gatgacagct ggctcccacg aatgcctgcc agggaaccca 4020
gtccgaccca cgggtgggca gcaggagccc ccgccggctt ctgagagccg agcttttgtc 4080
agcagcaccc atgcagctca gatgagtgcc gtctcttttg ttcccctcaa ggccttaaca 4140
ggccgggtgg acagtggaac ggagaagcct ggcttggttg ctcctgagtc ccctgttagg 4200
aagagcccct ccgagtataa gctggaaggt aggtctgtct catgcctgaa gccgatcgag 4260
ggcactctgg acattgctct cctgtccgga cctcaggcct ccaagacaga actgccttcc 4320
ccagagtctg cacagagccc cagcccaagt ggtgacgtga gggcctctgt gccaccagtt 4380
ctccccagca gcagtgggaa aaagaacgat accaccagtg caagagagct ttctccttcc 4440
agcttaaaga tgaataaatc ctacctgctg gagccttggt tcctgccccc cagccgaggt 4500
ctccagaatt caccagcagt ttccctgcct gacccagagt tcaagaggga caggaaaggt 4560
ccccatccta ctgccaggag ccctggaaca gtcatggaaa gcaatcccca acagagagag 4620
ggcagctccc ctaaacacca agaccacacc actgacccca agcttctgac ctgcctgggg 4680
cagaacctcc acagccctga cctggccagg ccacgctgcc cgctcccacc tgaagcttcc 4740
ccctcaaggg agaagccagg cctgagggaa tcgtctgaaa gaggccctcc cacagccaga 4800
agcgagcgct ctgctgcgag ggctgacaca tgcagagagc cctccatgga actgtgcttt 4860
ccagaaactg cgaaaaccag tgacaactcc aaaaatctcc tctctgtggg aaggacccac 4920
ccagatttct atacacagac ccaggccatg gagaaagcat gggcgccggg tgggaaaacg 4980
aaccacaaag atggcccagg tgaggcgagg cccccgccca gagacaactc ctctctgcac 5040
tcagctggaa ttccctgtga gaaggagctg ggcaaggtga ggcgtggcgt ggaacccaag 5100
cccgaagcgc ttcttgccag gcggtctctg cagccacctg gaattgagag tgagaagagt 5160
gaaaagctct ccagtttccc atctttgcag aaagatggtg ccaaggaacc tgaaaggaag 5220
gagcagcctc tacaaaggca tcccagcagc atccctccgc cccctctgac ggccaaagac 5280
ctgtccagcc cggctgccag gcagcattgc agttccccaa gccacgcttc tggcagagag 5340
ccgggggcca agcccagcac tgcagagccc agctcgagcc cccaggaccc tcccaagcct 5400
gttgctgcgc acagtgaaag cagcagccac aagccccggc ctggccctga cccgggccct 5460
ccaaagacta agcaccccga ccggtccctc tcctctcaga aaccaagtgt cggggccaca 5520
aagggcaaag agcctgccac tcagtccctc ggtggctcta gcagagaggg gaagggccac 5580
agtaagagtg ggccggatgt gtttcctgct accccaggct cccagaacaa agccagcgat 5640
gggattggcc agggagaagg tgggccctct gtcccactgc acactgacag ggctcctcta 5700
gacgccaagc cacaacccac cagtggtggg cggcccctgg aggtgctgga gaagcctgtg 5760
catttgccaa ggccgggaca cccagggcct agtgagccag cggaccagaa actgtccgct 5820
gttggtgaaa agcaaaccct gtctccaaag caccccaaac catccactgt gaaagattgc 5880
cccaccctgt gcaaacagac agacaacaga cagacagaca aaagcccgag tcagccggcc 5940
gccaacaccg acagaagggc ggaagggaag aaatgcactg aagcacttta tgctccagca 6000
gagggcgaca agctcgaggc cggcctttcc tttgtgcata gcgagaaccg gttgaaaggc 6060
gcggagcggc cagccgcggg ggtggggaag ggcttccctg aggccagagg gaaagggccc 6120
ggtccccaga agccaccgac ggaggcagac aagcccaatg gcatgaaacg gtccccctca 6180
gccactgggc agagttcttt ccgatccacg gccctcccgg aaaagtctct gagctgctcc 6240
tccagcttcc ctgaaaccag ggccggagtt agagaggcct ctgcagccag cagcgacacc 6300
tcttctgcca aggccgccgg gggcatgctg gagcttccag cccccagcaa cagggaccat 6360
aggaaggctc agcctgccgg ggagggccga acccacatga caaagagtga ctccctgccc 6420
tccttccggg tctccaccct gcctctggag tcacaccacc ccgacccaaa caccatgggc 6480
ggggccagcc accgggacag ggctctctcg gtgactgcca ccgtagggga aaccaaaggg 6540
aaggaccctg ccccagccca gcctccccca gctaggaaac agaacgtggg cagagacgtg 6600
accaagccat ccccagcccc aaacactgac cgccccatct ctctttctaa tgagaaggac 6660
tttgtggtac ggcagaggcg ggggaaagag agtttgcgta gcagccctca caaaaaggcc 6720
ttgtaa 6726
<210> 136
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of MAST4 gene fragment
<400> 136
gctacagctc agatggaaga acgt 24
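The 24-nucleotide break-point sequence above can be read as eight codons. The following is a minimal sketch, assuming the standard genetic code and reading frame 1. The helper names (`CODON_TABLE`, `translate`) are illustrative and not part of the listing. It shows how the nucleotides map to three-letter residue codes, so that the result can be compared by eye with the break-point peptide given under SEQ ID NO: 139.

```python
# Standard genetic code, built from the conventional t/c/a/g codon ordering.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter residue codes ("Ter" marks a stop codon).
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces allowed) in frame 1 to three-letter codes."""
    seq = "".join(dna.lower().split())
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    return " ".join(THREE_LETTER[CODON_TABLE[codon]] for codon in codons)


# SEQ ID NO: 136 exactly as printed in the listing (blocks of ten bases).
SEQ_136 = "gctacagctc agatggaaga acgt"
breakpoint_peptide = translate(SEQ_136)
print(breakpoint_peptide)
```

Run as written, this prints `Ala Thr Ala Gln Met Glu Glu Arg`, the same eight residues listed under SEQ ID NO: 139.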
<210> 137
<211> 2623
<212> PRT
<213> Artificial Sequence
<220>
<223> MAST4 protein
<400> 137
Met Gly Glu Lys Val Ser Glu Ala Pro Glu Pro Val Pro Arg Gly Cys
1 5 10 15
Ser Gly His Gly Ser Arg Thr Pro Ala Ser Ala Leu Val Ala Ala Ser
20 25 30
Ser Pro Gly Ala Ser Ser Ala Glu Ser Ser Ser Gly Ser Glu Thr Leu
35 40 45
Ser Glu Glu Gly Glu Pro Gly Gly Phe Ser Arg Glu His Gln Pro Pro
50 55 60
Pro Pro Pro Pro Leu Gly Gly Thr Leu Gly Ala Arg Ala Pro Ala Ala
65 70 75 80
Trp Ala Pro Ala Ser Val Leu Leu Glu Arg Gly Val Leu Ala Leu Pro
85 90 95
Pro Pro Leu Pro Gly Gly Ala Val Pro Pro Ala Pro Arg Gly Ser Ser
100 105 110
Ala Ser Gln Glu Glu Gln Asp Glu Glu Leu Asp His Ile Leu Ser Pro
115 120 125
Pro Pro Met Pro Phe Arg Lys Cys Ser Asn Pro Asp Val Ala Ser Gly
130 135 140
Pro Gly Lys Ser Leu Lys Tyr Lys Arg Gln Leu Ser Glu Asp Gly Arg
145 150 155 160
Gln Leu Arg Arg Gly Ser Leu Gly Gly Ala Leu Thr Gly Arg Tyr Leu
165 170 175
Leu Pro Asn Pro Val Ala Gly Gln Ala Trp Pro Ala Ser Ala Glu Thr
180 185 190
Ser Asn Leu Val Arg Met Arg Ser Gln Ala Leu Gly Gln Ser Ala Pro
195 200 205
Ser Leu Thr Ala Ser Leu Lys Glu Leu Ser Leu Pro Arg Arg Gly Ser
210 215 220
Phe Cys Arg Thr Ser Asn Arg Lys Ser Leu Ile Gly Asn Gly Gln Ser
225 230 235 240
Pro Ala Leu Pro Arg Pro His Ser Pro Leu Ser Ala His Ala Gly Asn
245 250 255
Ser Pro Gln Asp Ser Pro Arg Asn Phe Ser Pro Ser Ala Ser Ala His
260 265 270
Phe Ser Phe Ala Arg Arg Thr Asp Gly Arg Arg Trp Ser Leu Ala Ser
275 280 285
Leu Pro Ser Ser Gly Tyr Gly Thr Asn Thr Pro Ser Ser Thr Val Ser
290 295 300
Ser Ser Cys Ser Ser Gln Glu Lys Leu His Gln Leu Pro Tyr Gln Pro
305 310 315 320
Thr Pro Asp Glu Leu His Phe Leu Ser Lys His Phe Cys Thr Thr Glu
325 330 335
Ser Ile Ala Thr Glu Asn Arg Cys Arg Asn Thr Pro Met Arg Pro Arg
340 345 350
Ser Arg Ser Leu Ser Pro Gly Arg Ser Pro Ala Cys Cys Asp His Glu
355 360 365
Ile Ile Met Met Asn His Val Tyr Lys Glu Arg Phe Pro Lys Ala Thr
370 375 380
Ala Gln Met Glu Glu Arg Leu Lys Glu Ile Ile Thr Ser Tyr Ser Pro
385 390 395 400
Asp Asn Val Leu Pro Leu Ala Asp Gly Val Leu Ser Phe Thr His His
405 410 415
Gln Ile Ile Glu Leu Ala Arg Asp Cys Leu Asp Lys Ser His Gln Gly
420 425 430
Leu Ile Thr Ser Arg Tyr Phe Leu Glu Leu Gln His Lys Leu Asp Lys
435 440 445
Leu Leu Gln Glu Ala His Asp Arg Ser Glu Ser Gly Glu Leu Ala Phe
450 455 460
Ile Lys Gln Leu Val Arg Lys Ile Leu Ile Val Ile Ala Arg Pro Ala
465 470 475 480
Arg Leu Leu Glu Cys Leu Glu Phe Asp Pro Glu Glu Phe Tyr Tyr Leu
485 490 495
Leu Glu Ala Ala Glu Gly His Ala Lys Glu Gly Gln Gly Ile Lys Thr
500 505 510
Asp Ile Pro Arg Tyr Ile Ile Ser Gln Leu Gly Leu Asn Lys Asp Pro
515 520 525
Leu Glu Glu Met Ala His Leu Gly Asn Tyr Asp Ser Gly Thr Ala Glu
530 535 540
Thr Pro Glu Thr Asp Glu Ser Val Ser Ser Ser Asn Ala Ser Leu Lys
545 550 555 560
Leu Arg Arg Lys Pro Arg Glu Ser Asp Phe Glu Thr Ile Lys Leu Ile
565 570 575
Ser Asn Gly Ala Tyr Gly Ala Val Tyr Phe Val Arg His Lys Glu Ser
580 585 590
Arg Gln Arg Phe Ala Met Lys Lys Ile Asn Lys Gln Asn Leu Ile Leu
595 600 605
Arg Asn Gln Ile Gln Gln Ala Phe Val Glu Arg Asp Ile Leu Thr Phe
610 615 620
Ala Glu Asn Pro Phe Val Val Ser Met Tyr Cys Ser Phe Glu Thr Arg
625 630 635 640
Arg His Leu Cys Met Val Met Glu Tyr Val Glu Gly Gly Asp Cys Ala
645 650 655
Thr Leu Met Lys Asn Met Gly Pro Leu Pro Val Asp Met Ala Arg Met
660 665 670
Tyr Phe Ala Glu Thr Val Leu Ala Leu Glu Tyr Leu His Asn Tyr Gly
675 680 685
Ile Val His Arg Asp Leu Lys Pro Asp Asn Leu Leu Val Thr Ser Met
690 695 700
Gly His Ile Lys Leu Thr Asp Phe Gly Leu Ser Lys Val Gly Leu Met
705 710 715 720
Ser Met Thr Thr Asn Leu Tyr Glu Gly His Ile Glu Lys Asp Ala Arg
725 730 735
Glu Phe Leu Asp Lys Gln Val Cys Gly Thr Pro Glu Tyr Ile Ala Pro
740 745 750
Glu Val Ile Leu Arg Gln Gly Tyr Gly Lys Pro Val Asp Trp Trp Ala
755 760 765
Met Gly Ile Ile Leu Tyr Glu Phe Leu Val Gly Cys Val Pro Phe Phe
770 775 780
Gly Asp Thr Pro Glu Glu Leu Phe Gly Gln Val Ile Ser Asp Glu Ile
785 790 795 800
Asn Trp Pro Glu Lys Asp Glu Ala Pro Pro Pro Asp Ala Gln Asp Leu
805 810 815
Ile Thr Leu Leu Leu Arg Gln Asn Pro Leu Glu Arg Leu Gly Thr Gly
820 825 830
Gly Ala Tyr Glu Val Lys Gln His Arg Phe Phe Arg Ser Leu Asp Trp
835 840 845
Asn Ser Leu Leu Arg Gln Lys Ala Glu Phe Ile Pro Gln Leu Glu Ser
850 855 860
Glu Asp Asp Thr Ser Tyr Phe Asp Thr Arg Ser Glu Lys Tyr His His
865 870 875 880
Met Glu Thr Glu Glu Glu Asp Asp Thr Asn Asp Glu Asp Phe Asn Val
885 890 895
Glu Ile Arg Gln Phe Ser Ser Cys Ser His Arg Phe Ser Lys Val Phe
900 905 910
Ser Ser Ile Asp Arg Ile Thr Gln Asn Ser Ala Glu Glu Lys Glu Asp
915 920 925
Ser Val Asp Lys Thr Lys Ser Thr Thr Leu Pro Ser Thr Glu Thr Leu
930 935 940
Ser Trp Ser Ser Glu Tyr Ser Glu Met Gln Gln Leu Ser Thr Ser Asn
945 950 955 960
Ser Ser Asp Thr Glu Ser Asn Arg His Lys Leu Ser Ser Gly Leu Leu
965 970 975
Pro Lys Leu Ala Ile Ser Thr Glu Gly Glu Gln Asp Glu Ala Ala Ser
980 985 990
Cys Pro Gly Asp Pro His Glu Glu Pro Gly Lys Pro Ala Leu Pro Pro
995 1000 1005
Glu Glu Cys Ala Gln Glu Glu Pro Glu Val Thr Thr Pro Ala Ser Thr
1010 1015 1020
Ile Ser Ser Ser Thr Leu Ser Val Gly Ser Phe Ser Glu His Leu Asp
1025 1030 1035 1040
Gln Ile Asn Gly Arg Ser Glu Cys Val Asp Ser Thr Asp Asn Ser Ser
1045 1050 1055
Lys Pro Ser Ser Glu Pro Ala Ser His Met Ala Arg Gln Arg Leu Glu
1060 1065 1070
Ser Thr Glu Lys Lys Lys Ile Ser Gly Lys Val Thr Lys Ser Leu Ser
1075 1080 1085
Ala Ser Ala Leu Ser Leu Met Ile Pro Gly Asp Met Phe Ala Val Ser
1090 1095 1100
Pro Leu Gly Ser Pro Met Ser Pro His Ser Leu Ser Ser Asp Pro Ser
1105 1110 1115 1120
Ser Ser Arg Asp Ser Ser Pro Ser Arg Asp Ser Ser Ala Ala Ser Ala
1125 1130 1135
Ser Pro His Gln Pro Ile Val Ile His Ser Ser Gly Lys Asn Tyr Gly
1140 1145 1150
Phe Thr Ile Arg Ala Ile Arg Val Tyr Val Gly Asp Ser Asp Ile Tyr
1155 1160 1165
Thr Val His His Ile Val Trp Asn Val Glu Glu Gly Ser Pro Ala Cys
1170 1175 1180
Gln Ala Gly Leu Lys Ala Gly Asp Leu Ile Thr His Ile Asn Gly Glu
1185 1190 1195 1200
Pro Val His Gly Leu Val His Thr Glu Val Ile Glu Leu Leu Leu Lys
1205 1210 1215
Ser Gly Asn Lys Val Ser Ile Thr Thr Thr Pro Phe Glu Asn Thr Ser
1220 1225 1230
Ile Lys Thr Gly Pro Ala Arg Arg Asn Ser Tyr Lys Ser Arg Met Val
1235 1240 1245
Arg Arg Ser Lys Lys Ser Lys Lys Lys Glu Ser Leu Glu Arg Arg Arg
1250 1255 1260
Ser Leu Phe Lys Lys Leu Ala Lys Gln Pro Ser Pro Leu Leu His Thr
1265 1270 1275 1280
Ser Arg Ser Phe Ser Cys Leu Asn Arg Ser Leu Ser Ser Gly Glu Ser
1285 1290 1295
Leu Pro Gly Ser Pro Thr His Ser Leu Ser Pro Arg Ser Pro Thr Pro
1300 1305 1310
Ser Tyr Arg Ser Thr Pro Asp Phe Pro Ser Gly Thr Asn Ser Ser Gln
1315 1320 1325
Ser Ser Ser Pro Ser Ser Ser Ala Pro Asn Ser Pro Ala Gly Ser Gly
1330 1335 1340
His Ile Arg Pro Ser Thr Leu His Gly Leu Ala Pro Lys Leu Gly Gly
1345 1350 1355 1360
Gln Arg Tyr Arg Ser Gly Arg Arg Lys Ser Ala Gly Asn Ile Pro Leu
1365 1370 1375
Ser Pro Leu Ala Arg Thr Pro Ser Pro Thr Pro Gln Pro Thr Ser Pro
1380 1385 1390
Gln Arg Ser Pro Ser Pro Leu Leu Gly His Ser Leu Gly Asn Ser Lys
1395 1400 1405
Ile Ala Gln Ala Phe Pro Ser Lys Met His Ser Pro Pro Thr Ile Val
1410 1415 1420
Arg His Ile Val Arg Pro Lys Ser Ala Glu Pro Pro Arg Ser Pro Leu
1425 1430 1435 1440
Leu Lys Arg Val Gln Ser Glu Glu Lys Leu Ser Pro Ser Tyr Gly Ser
1445 1450 1455
Asp Lys Lys His Leu Cys Ser Arg Lys His Ser Leu Glu Val Thr Gln
1460 1465 1470
Glu Glu Val Gln Arg Glu Gln Ser Gln Arg Glu Ala Pro Leu Gln Ser
1475 1480 1485
Leu Asp Glu Asn Val Cys Asp Val Pro Pro Leu Ser Arg Ala Arg Pro
1490 1495 1500
Val Glu Gln Gly Cys Leu Lys Arg Pro Val Ser Arg Lys Val Gly Arg
1505 1510 1515 1520
Gln Glu Ser Val Asp Asp Leu Asp Arg Asp Lys Leu Lys Ala Lys Val
1525 1530 1535
Val Val Lys Lys Ala Asp Gly Phe Pro Glu Lys Gln Glu Ser His Gln
1540 1545 1550
Lys Ser His Gly Pro Gly Ser Asp Leu Glu Asn Phe Ala Leu Phe Lys
1555 1560 1565
Leu Glu Glu Arg Glu Lys Lys Val Tyr Pro Lys Ala Val Glu Arg Ser
1570 1575 1580
Ser Thr Phe Glu Asn Lys Ala Ser Met Gln Glu Ala Pro Pro Leu Gly
1585 1590 1595 1600
Ser Leu Leu Lys Asp Ala Leu His Lys Gln Ala Ser Val Arg Ala Ser
1605 1610 1615
Glu Gly Ala Met Ser Asp Gly Arg Val Pro Ala Glu His Arg Gln Gly
1620 1625 1630
Gly Gly Asp Phe Arg Arg Ala Pro Ala Pro Gly Thr Leu Gln Asp Gly
1635 1640 1645
Leu Cys His Ser Leu Asp Arg Gly Ile Ser Gly Lys Gly Glu Gly Thr
1650 1655 1660
Glu Lys Ser Ser Gln Ala Lys Glu Leu Leu Arg Cys Glu Lys Leu Asp
1665 1670 1675 1680
Ser Lys Leu Ala Asn Ile Asp Tyr Leu Arg Lys Lys Met Ser Leu Glu
1685 1690 1695
Asp Lys Glu Asp Asn Leu Cys Pro Val Leu Lys Pro Lys Met Thr Ala
1700 1705 1710
Gly Ser His Glu Cys Leu Pro Gly Asn Pro Val Arg Pro Thr Gly Gly
1715 1720 1725
Gln Gln Glu Pro Pro Pro Ala Ser Glu Ser Arg Ala Phe Val Ser Ser
1730 1735 1740
Thr His Ala Ala Gln Met Ser Ala Val Ser Phe Val Pro Leu Lys Ala
1745 1750 1755 1760
Leu Thr Gly Arg Val Asp Ser Gly Thr Glu Lys Pro Gly Leu Val Ala
1765 1770 1775
Pro Glu Ser Pro Val Arg Lys Ser Pro Ser Glu Tyr Lys Leu Glu Gly
1780 1785 1790
Arg Ser Val Ser Cys Leu Lys Pro Ile Glu Gly Thr Leu Asp Ile Ala
1795 1800 1805
Leu Leu Ser Gly Pro Gln Ala Ser Lys Thr Glu Leu Pro Ser Pro Glu
1810 1815 1820
Ser Ala Gln Ser Pro Ser Pro Ser Gly Asp Val Arg Ala Ser Val Pro
1825 1830 1835 1840
Pro Val Leu Pro Ser Ser Ser Gly Lys Lys Asn Asp Thr Thr Ser Ala
1845 1850 1855
Arg Glu Leu Ser Pro Ser Ser Leu Lys Met Asn Lys Ser Tyr Leu Leu
1860 1865 1870
Glu Pro Trp Phe Leu Pro Pro Ser Arg Gly Leu Gln Asn Ser Pro Ala
1875 1880 1885
Val Ser Leu Pro Asp Pro Glu Phe Lys Arg Asp Arg Lys Gly Pro His
1890 1895 1900
Pro Thr Ala Arg Ser Pro Gly Thr Val Met Glu Ser Asn Pro Gln Gln
1905 1910 1915 1920
Arg Glu Gly Ser Ser Pro Lys His Gln Asp His Thr Thr Asp Pro Lys
1925 1930 1935
Leu Leu Thr Cys Leu Gly Gln Asn Leu His Ser Pro Asp Leu Ala Arg
1940 1945 1950
Pro Arg Cys Pro Leu Pro Pro Glu Ala Ser Pro Ser Arg Glu Lys Pro
1955 1960 1965
Gly Leu Arg Glu Ser Ser Glu Arg Gly Pro Pro Thr Ala Arg Ser Glu
1970 1975 1980
Arg Ser Ala Ala Arg Ala Asp Thr Cys Arg Glu Pro Ser Met Glu Leu
1985 1990 1995 2000
Cys Phe Pro Glu Thr Ala Lys Thr Ser Asp Asn Ser Lys Asn Leu Leu
2005 2010 2015
Ser Val Gly Arg Thr His Pro Asp Phe Tyr Thr Gln Thr Gln Ala Met
2020 2025 2030
Glu Lys Ala Trp Ala Pro Gly Gly Lys Thr Asn His Lys Asp Gly Pro
2035 2040 2045
Gly Glu Ala Arg Pro Pro Pro Arg Asp Asn Ser Ser Leu His Ser Ala
2050 2055 2060
Gly Ile Pro Cys Glu Lys Glu Leu Gly Lys Val Arg Arg Gly Val Glu
2065 2070 2075 2080
Pro Lys Pro Glu Ala Leu Leu Ala Arg Arg Ser Leu Gln Pro Pro Gly
2085 2090 2095
Ile Glu Ser Glu Lys Ser Glu Lys Leu Ser Ser Phe Pro Ser Leu Gln
2100 2105 2110
Lys Asp Gly Ala Lys Glu Pro Glu Arg Lys Glu Gln Pro Leu Gln Arg
2115 2120 2125
His Pro Ser Ser Ile Pro Pro Pro Pro Leu Thr Ala Lys Asp Leu Ser
2130 2135 2140
Ser Pro Ala Ala Arg Gln His Cys Ser Ser Pro Ser His Ala Ser Gly
2145 2150 2155 2160
Arg Glu Pro Gly Ala Lys Pro Ser Thr Ala Glu Pro Ser Ser Ser Pro
2165 2170 2175
Gln Asp Pro Pro Lys Pro Val Ala Ala His Ser Glu Ser Ser Ser His
2180 2185 2190
Lys Pro Arg Pro Gly Pro Asp Pro Gly Pro Pro Lys Thr Lys His Pro
2195 2200 2205
Asp Arg Ser Leu Ser Ser Gln Lys Pro Ser Val Gly Ala Thr Lys Gly
2210 2215 2220
Lys Glu Pro Ala Thr Gln Ser Leu Gly Gly Ser Ser Arg Glu Gly Lys
2225 2230 2235 2240
Gly His Ser Lys Ser Gly Pro Asp Val Phe Pro Ala Thr Pro Gly Ser
2245 2250 2255
Gln Asn Lys Ala Ser Asp Gly Ile Gly Gln Gly Glu Gly Gly Pro Ser
2260 2265 2270
Val Pro Leu His Thr Asp Arg Ala Pro Leu Asp Ala Lys Pro Gln Pro
2275 2280 2285
Thr Ser Gly Gly Arg Pro Leu Glu Val Leu Glu Lys Pro Val His Leu
2290 2295 2300
Pro Arg Pro Gly His Pro Gly Pro Ser Glu Pro Ala Asp Gln Lys Leu
2305 2310 2315 2320
Ser Ala Val Gly Glu Lys Gln Thr Leu Ser Pro Lys His Pro Lys Pro
2325 2330 2335
Ser Thr Val Lys Asp Cys Pro Thr Leu Cys Lys Gln Thr Asp Asn Arg
2340 2345 2350
Gln Thr Asp Lys Ser Pro Ser Gln Pro Ala Ala Asn Thr Asp Arg Arg
2355 2360 2365
Ala Glu Gly Lys Lys Cys Thr Glu Ala Leu Tyr Ala Pro Ala Glu Gly
2370 2375 2380
Asp Lys Leu Glu Ala Gly Leu Ser Phe Val His Ser Glu Asn Arg Leu
2385 2390 2395 2400
Lys Gly Ala Glu Arg Pro Ala Ala Gly Val Gly Lys Gly Phe Pro Glu
2405 2410 2415
Ala Arg Gly Lys Gly Pro Gly Pro Gln Lys Pro Pro Thr Glu Ala Asp
2420 2425 2430
Lys Pro Asn Gly Met Lys Arg Ser Pro Ser Ala Thr Gly Gln Ser Ser
2435 2440 2445
Phe Arg Ser Thr Ala Leu Pro Glu Lys Ser Leu Ser Cys Ser Ser Ser
2450 2455 2460
Phe Pro Glu Thr Arg Ala Gly Val Arg Glu Ala Ser Ala Ala Ser Ser
2465 2470 2475 2480
Asp Thr Ser Ser Ala Lys Ala Ala Gly Gly Met Leu Glu Leu Pro Ala
2485 2490 2495
Pro Ser Asn Arg Asp His Arg Lys Ala Gln Pro Ala Gly Glu Gly Arg
2500 2505 2510
Thr His Met Thr Lys Ser Asp Ser Leu Pro Ser Phe Arg Val Ser Thr
2515 2520 2525
Leu Pro Leu Glu Ser His His Pro Asp Pro Asn Thr Met Gly Gly Ala
2530 2535 2540
Ser His Arg Asp Arg Ala Leu Ser Val Thr Ala Thr Val Gly Glu Thr
2545 2550 2555 2560
Lys Gly Lys Asp Pro Ala Pro Ala Gln Pro Pro Pro Ala Arg Lys Gln
2565 2570 2575
Asn Val Gly Arg Asp Val Thr Lys Pro Ser Pro Ala Pro Asn Thr Asp
2580 2585 2590
Arg Pro Ile Ser Leu Ser Asn Glu Lys Asp Phe Val Val Arg Gln Arg
2595 2600 2605
Arg Gly Lys Glu Ser Leu Arg Ser Ser Pro His Lys Lys Ala Leu
2610 2615 2620
<210> 138
<211> 2241
<212> PRT
<213> Artificial Sequence
<220>
<223> MAST4 protein fragment
<400> 138
Ala Thr Ala Gln Met Glu Glu Arg Leu Lys Glu Ile Ile Thr Ser Tyr
1 5 10 15
Ser Pro Asp Asn Val Leu Pro Leu Ala Asp Gly Val Leu Ser Phe Thr
20 25 30
His His Gln Ile Ile Glu Leu Ala Arg Asp Cys Leu Asp Lys Ser His
35 40 45
Gln Gly Leu Ile Thr Ser Arg Tyr Phe Leu Glu Leu Gln His Lys Leu
50 55 60
Asp Lys Leu Leu Gln Glu Ala His Asp Arg Ser Glu Ser Gly Glu Leu
65 70 75 80
Ala Phe Ile Lys Gln Leu Val Arg Lys Ile Leu Ile Val Ile Ala Arg
85 90 95
Pro Ala Arg Leu Leu Glu Cys Leu Glu Phe Asp Pro Glu Glu Phe Tyr
100 105 110
Tyr Leu Leu Glu Ala Ala Glu Gly His Ala Lys Glu Gly Gln Gly Ile
115 120 125
Lys Thr Asp Ile Pro Arg Tyr Ile Ile Ser Gln Leu Gly Leu Asn Lys
130 135 140
Asp Pro Leu Glu Glu Met Ala His Leu Gly Asn Tyr Asp Ser Gly Thr
145 150 155 160
Ala Glu Thr Pro Glu Thr Asp Glu Ser Val Ser Ser Ser Asn Ala Ser
165 170 175
Leu Lys Leu Arg Arg Lys Pro Arg Glu Ser Asp Phe Glu Thr Ile Lys
180 185 190
Leu Ile Ser Asn Gly Ala Tyr Gly Ala Val Tyr Phe Val Arg His Lys
195 200 205
Glu Ser Arg Gln Arg Phe Ala Met Lys Lys Ile Asn Lys Gln Asn Leu
210 215 220
Ile Leu Arg Asn Gln Ile Gln Gln Ala Phe Val Glu Arg Asp Ile Leu
225 230 235 240
Thr Phe Ala Glu Asn Pro Phe Val Val Ser Met Tyr Cys Ser Phe Glu
245 250 255
Thr Arg Arg His Leu Cys Met Val Met Glu Tyr Val Glu Gly Gly Asp
260 265 270
Cys Ala Thr Leu Met Lys Asn Met Gly Pro Leu Pro Val Asp Met Ala
275 280 285
Arg Met Tyr Phe Ala Glu Thr Val Leu Ala Leu Glu Tyr Leu His Asn
290 295 300
Tyr Gly Ile Val His Arg Asp Leu Lys Pro Asp Asn Leu Leu Val Thr
305 310 315 320
Ser Met Gly His Ile Lys Leu Thr Asp Phe Gly Leu Ser Lys Val Gly
325 330 335
Leu Met Ser Met Thr Thr Asn Leu Tyr Glu Gly His Ile Glu Lys Asp
340 345 350
Ala Arg Glu Phe Leu Asp Lys Gln Val Cys Gly Thr Pro Glu Tyr Ile
355 360 365
Ala Pro Glu Val Ile Leu Arg Gln Gly Tyr Gly Lys Pro Val Asp Trp
370 375 380
Trp Ala Met Gly Ile Ile Leu Tyr Glu Phe Leu Val Gly Cys Val Pro
385 390 395 400
Phe Phe Gly Asp Thr Pro Glu Glu Leu Phe Gly Gln Val Ile Ser Asp
405 410 415
Glu Ile Asn Trp Pro Glu Lys Asp Glu Ala Pro Pro Pro Asp Ala Gln
420 425 430
Asp Leu Ile Thr Leu Leu Leu Arg Gln Asn Pro Leu Glu Arg Leu Gly
435 440 445
Thr Gly Gly Ala Tyr Glu Val Lys Gln His Arg Phe Phe Arg Ser Leu
450 455 460
Asp Trp Asn Ser Leu Leu Arg Gln Lys Ala Glu Phe Ile Pro Gln Leu
465 470 475 480
Glu Ser Glu Asp Asp Thr Ser Tyr Phe Asp Thr Arg Ser Glu Lys Tyr
485 490 495
His His Met Glu Thr Glu Glu Glu Asp Asp Thr Asn Asp Glu Asp Phe
500 505 510
Asn Val Glu Ile Arg Gln Phe Ser Ser Cys Ser His Arg Phe Ser Lys
515 520 525
Val Phe Ser Ser Ile Asp Arg Ile Thr Gln Asn Ser Ala Glu Glu Lys
530 535 540
Glu Asp Ser Val Asp Lys Thr Lys Ser Thr Thr Leu Pro Ser Thr Glu
545 550 555 560
Thr Leu Ser Trp Ser Ser Glu Tyr Ser Glu Met Gln Gln Leu Ser Thr
565 570 575
Ser Asn Ser Ser Asp Thr Glu Ser Asn Arg His Lys Leu Ser Ser Gly
580 585 590
Leu Leu Pro Lys Leu Ala Ile Ser Thr Glu Gly Glu Gln Asp Glu Ala
595 600 605
Ala Ser Cys Pro Gly Asp Pro His Glu Glu Pro Gly Lys Pro Ala Leu
610 615 620
Pro Pro Glu Glu Cys Ala Gln Glu Glu Pro Glu Val Thr Thr Pro Ala
625 630 635 640
Ser Thr Ile Ser Ser Ser Thr Leu Ser Val Gly Ser Phe Ser Glu His
645 650 655
Leu Asp Gln Ile Asn Gly Arg Ser Glu Cys Val Asp Ser Thr Asp Asn
660 665 670
Ser Ser Lys Pro Ser Ser Glu Pro Ala Ser His Met Ala Arg Gln Arg
675 680 685
Leu Glu Ser Thr Glu Lys Lys Lys Ile Ser Gly Lys Val Thr Lys Ser
690 695 700
Leu Ser Ala Ser Ala Leu Ser Leu Met Ile Pro Gly Asp Met Phe Ala
705 710 715 720
Val Ser Pro Leu Gly Ser Pro Met Ser Pro His Ser Leu Ser Ser Asp
725 730 735
Pro Ser Ser Ser Arg Asp Ser Ser Pro Ser Arg Asp Ser Ser Ala Ala
740 745 750
Ser Ala Ser Pro His Gln Pro Ile Val Ile His Ser Ser Gly Lys Asn
755 760 765
Tyr Gly Phe Thr Ile Arg Ala Ile Arg Val Tyr Val Gly Asp Ser Asp
770 775 780
Ile Tyr Thr Val His His Ile Val Trp Asn Val Glu Glu Gly Ser Pro
785 790 795 800
Ala Cys Gln Ala Gly Leu Lys Ala Gly Asp Leu Ile Thr His Ile Asn
805 810 815
Gly Glu Pro Val His Gly Leu Val His Thr Glu Val Ile Glu Leu Leu
820 825 830
Leu Lys Ser Gly Asn Lys Val Ser Ile Thr Thr Thr Pro Phe Glu Asn
835 840 845
Thr Ser Ile Lys Thr Gly Pro Ala Arg Arg Asn Ser Tyr Lys Ser Arg
850 855 860
Met Val Arg Arg Ser Lys Lys Ser Lys Lys Lys Glu Ser Leu Glu Arg
865 870 875 880
Arg Arg Ser Leu Phe Lys Lys Leu Ala Lys Gln Pro Ser Pro Leu Leu
885 890 895
His Thr Ser Arg Ser Phe Ser Cys Leu Asn Arg Ser Leu Ser Ser Gly
900 905 910
Glu Ser Leu Pro Gly Ser Pro Thr His Ser Leu Ser Pro Arg Ser Pro
915 920 925
Thr Pro Ser Tyr Arg Ser Thr Pro Asp Phe Pro Ser Gly Thr Asn Ser
930 935 940
Ser Gln Ser Ser Ser Pro Ser Ser Ser Ala Pro Asn Ser Pro Ala Gly
945 950 955 960
Ser Gly His Ile Arg Pro Ser Thr Leu His Gly Leu Ala Pro Lys Leu
965 970 975
Gly Gly Gln Arg Tyr Arg Ser Gly Arg Arg Lys Ser Ala Gly Asn Ile
980 985 990
Pro Leu Ser Pro Leu Ala Arg Thr Pro Ser Pro Thr Pro Gln Pro Thr
995 1000 1005
Ser Pro Gln Arg Ser Pro Ser Pro Leu Leu Gly His Ser Leu Gly Asn
1010 1015 1020
Ser Lys Ile Ala Gln Ala Phe Pro Ser Lys Met His Ser Pro Pro Thr
1025 1030 1035 1040
Ile Val Arg His Ile Val Arg Pro Lys Ser Ala Glu Pro Pro Arg Ser
1045 1050 1055
Pro Leu Leu Lys Arg Val Gln Ser Glu Glu Lys Leu Ser Pro Ser Tyr
1060 1065 1070
Gly Ser Asp Lys Lys His Leu Cys Ser Arg Lys His Ser Leu Glu Val
1075 1080 1085
Thr Gln Glu Glu Val Gln Arg Glu Gln Ser Gln Arg Glu Ala Pro Leu
1090 1095 1100
Gln Ser Leu Asp Glu Asn Val Cys Asp Val Pro Pro Leu Ser Arg Ala
1105 1110 1115 1120
Arg Pro Val Glu Gln Gly Cys Leu Lys Arg Pro Val Ser Arg Lys Val
1125 1130 1135
Gly Arg Gln Glu Ser Val Asp Asp Leu Asp Arg Asp Lys Leu Lys Ala
1140 1145 1150
Lys Val Val Val Lys Lys Ala Asp Gly Phe Pro Glu Lys Gln Glu Ser
1155 1160 1165
His Gln Lys Ser His Gly Pro Gly Ser Asp Leu Glu Asn Phe Ala Leu
1170 1175 1180
Phe Lys Leu Glu Glu Arg Glu Lys Lys Val Tyr Pro Lys Ala Val Glu
1185 1190 1195 1200
Arg Ser Ser Thr Phe Glu Asn Lys Ala Ser Met Gln Glu Ala Pro Pro
1205 1210 1215
Leu Gly Ser Leu Leu Lys Asp Ala Leu His Lys Gln Ala Ser Val Arg
1220 1225 1230
Ala Ser Glu Gly Ala Met Ser Asp Gly Arg Val Pro Ala Glu His Arg
1235 1240 1245
Gln Gly Gly Gly Asp Phe Arg Arg Ala Pro Ala Pro Gly Thr Leu Gln
1250 1255 1260
Asp Gly Leu Cys His Ser Leu Asp Arg Gly Ile Ser Gly Lys Gly Glu
1265 1270 1275 1280
Gly Thr Glu Lys Ser Ser Gln Ala Lys Glu Leu Leu Arg Cys Glu Lys
1285 1290 1295
Leu Asp Ser Lys Leu Ala Asn Ile Asp Tyr Leu Arg Lys Lys Met Ser
1300 1305 1310
Leu Glu Asp Lys Glu Asp Asn Leu Cys Pro Val Leu Lys Pro Lys Met
1315 1320 1325
Thr Ala Gly Ser His Glu Cys Leu Pro Gly Asn Pro Val Arg Pro Thr
1330 1335 1340
Gly Gly Gln Gln Glu Pro Pro Pro Ala Ser Glu Ser Arg Ala Phe Val
1345 1350 1355 1360
Ser Ser Thr His Ala Ala Gln Met Ser Ala Val Ser Phe Val Pro Leu
1365 1370 1375
Lys Ala Leu Thr Gly Arg Val Asp Ser Gly Thr Glu Lys Pro Gly Leu
1380 1385 1390
Val Ala Pro Glu Ser Pro Val Arg Lys Ser Pro Ser Glu Tyr Lys Leu
1395 1400 1405
Glu Gly Arg Ser Val Ser Cys Leu Lys Pro Ile Glu Gly Thr Leu Asp
1410 1415 1420
Ile Ala Leu Leu Ser Gly Pro Gln Ala Ser Lys Thr Glu Leu Pro Ser
1425 1430 1435 1440
Pro Glu Ser Ala Gln Ser Pro Ser Pro Ser Gly Asp Val Arg Ala Ser
1445 1450 1455
Val Pro Pro Val Leu Pro Ser Ser Ser Gly Lys Lys Asn Asp Thr Thr
1460 1465 1470
Ser Ala Arg Glu Leu Ser Pro Ser Ser Leu Lys Met Asn Lys Ser Tyr
1475 1480 1485
Leu Leu Glu Pro Trp Phe Leu Pro Pro Ser Arg Gly Leu Gln Asn Ser
1490 1495 1500
Pro Ala Val Ser Leu Pro Asp Pro Glu Phe Lys Arg Asp Arg Lys Gly
1505 1510 1515 1520
Pro His Pro Thr Ala Arg Ser Pro Gly Thr Val Met Glu Ser Asn Pro
1525 1530 1535
Gln Gln Arg Glu Gly Ser Ser Pro Lys His Gln Asp His Thr Thr Asp
1540 1545 1550
Pro Lys Leu Leu Thr Cys Leu Gly Gln Asn Leu His Ser Pro Asp Leu
1555 1560 1565
Ala Arg Pro Arg Cys Pro Leu Pro Pro Glu Ala Ser Pro Ser Arg Glu
1570 1575 1580
Lys Pro Gly Leu Arg Glu Ser Ser Glu Arg Gly Pro Pro Thr Ala Arg
1585 1590 1595 1600
Ser Glu Arg Ser Ala Ala Arg Ala Asp Thr Cys Arg Glu Pro Ser Met
1605 1610 1615
Glu Leu Cys Phe Pro Glu Thr Ala Lys Thr Ser Asp Asn Ser Lys Asn
1620 1625 1630
Leu Leu Ser Val Gly Arg Thr His Pro Asp Phe Tyr Thr Gln Thr Gln
1635 1640 1645
Ala Met Glu Lys Ala Trp Ala Pro Gly Gly Lys Thr Asn His Lys Asp
1650 1655 1660
Gly Pro Gly Glu Ala Arg Pro Pro Pro Arg Asp Asn Ser Ser Leu His
1665 1670 1675 1680
Ser Ala Gly Ile Pro Cys Glu Lys Glu Leu Gly Lys Val Arg Arg Gly
1685 1690 1695
Val Glu Pro Lys Pro Glu Ala Leu Leu Ala Arg Arg Ser Leu Gln Pro
1700 1705 1710
Pro Gly Ile Glu Ser Glu Lys Ser Glu Lys Leu Ser Ser Phe Pro Ser
1715 1720 1725
Leu Gln Lys Asp Gly Ala Lys Glu Pro Glu Arg Lys Glu Gln Pro Leu
1730 1735 1740
Gln Arg His Pro Ser Ser Ile Pro Pro Pro Pro Leu Thr Ala Lys Asp
1745 1750 1755 1760
Leu Ser Ser Pro Ala Ala Arg Gln His Cys Ser Ser Pro Ser His Ala
1765 1770 1775
Ser Gly Arg Glu Pro Gly Ala Lys Pro Ser Thr Ala Glu Pro Ser Ser
1780 1785 1790
Ser Pro Gln Asp Pro Pro Lys Pro Val Ala Ala His Ser Glu Ser Ser
1795 1800 1805
Ser His Lys Pro Arg Pro Gly Pro Asp Pro Gly Pro Pro Lys Thr Lys
1810 1815 1820
His Pro Asp Arg Ser Leu Ser Ser Gln Lys Pro Ser Val Gly Ala Thr
1825 1830 1835 1840
Lys Gly Lys Glu Pro Ala Thr Gln Ser Leu Gly Gly Ser Ser Arg Glu
1845 1850 1855
Gly Lys Gly His Ser Lys Ser Gly Pro Asp Val Phe Pro Ala Thr Pro
1860 1865 1870
Gly Ser Gln Asn Lys Ala Ser Asp Gly Ile Gly Gln Gly Glu Gly Gly
1875 1880 1885
Pro Ser Val Pro Leu His Thr Asp Arg Ala Pro Leu Asp Ala Lys Pro
1890 1895 1900
Gln Pro Thr Ser Gly Gly Arg Pro Leu Glu Val Leu Glu Lys Pro Val
1905 1910 1915 1920
His Leu Pro Arg Pro Gly His Pro Gly Pro Ser Glu Pro Ala Asp Gln
1925 1930 1935
Lys Leu Ser Ala Val Gly Glu Lys Gln Thr Leu Ser Pro Lys His Pro
1940 1945 1950
Lys Pro Ser Thr Val Lys Asp Cys Pro Thr Leu Cys Lys Gln Thr Asp
1955 1960 1965
Asn Arg Gln Thr Asp Lys Ser Pro Ser Gln Pro Ala Ala Asn Thr Asp
1970 1975 1980
Arg Arg Ala Glu Gly Lys Lys Cys Thr Glu Ala Leu Tyr Ala Pro Ala
1985 1990 1995 2000
Glu Gly Asp Lys Leu Glu Ala Gly Leu Ser Phe Val His Ser Glu Asn
2005 2010 2015
Arg Leu Lys Gly Ala Glu Arg Pro Ala Ala Gly Val Gly Lys Gly Phe
2020 2025 2030
Pro Glu Ala Arg Gly Lys Gly Pro Gly Pro Gln Lys Pro Pro Thr Glu
2035 2040 2045
Ala Asp Lys Pro Asn Gly Met Lys Arg Ser Pro Ser Ala Thr Gly Gln
2050 2055 2060
Ser Ser Phe Arg Ser Thr Ala Leu Pro Glu Lys Ser Leu Ser Cys Ser
2065 2070 2075 2080
Ser Ser Phe Pro Glu Thr Arg Ala Gly Val Arg Glu Ala Ser Ala Ala
2085 2090 2095
Ser Ser Asp Thr Ser Ser Ala Lys Ala Ala Gly Gly Met Leu Glu Leu
2100 2105 2110
Pro Ala Pro Ser Asn Arg Asp His Arg Lys Ala Gln Pro Ala Gly Glu
2115 2120 2125
Gly Arg Thr His Met Thr Lys Ser Asp Ser Leu Pro Ser Phe Arg Val
2130 2135 2140
Ser Thr Leu Pro Leu Glu Ser His His Pro Asp Pro Asn Thr Met Gly
2145 2150 2155 2160
Gly Ala Ser His Arg Asp Arg Ala Leu Ser Val Thr Ala Thr Val Gly
2165 2170 2175
Glu Thr Lys Gly Lys Asp Pro Ala Pro Ala Gln Pro Pro Pro Ala Arg
2180 2185 2190
Lys Gln Asn Val Gly Arg Asp Val Thr Lys Pro Ser Pro Ala Pro Asn
2195 2200 2205
Thr Asp Arg Pro Ile Ser Leu Ser Asn Glu Lys Asp Phe Val Val Arg
2210 2215 2220
Gln Arg Arg Gly Lys Glu Ser Leu Arg Ser Ser Pro His Lys Lys Ala
2225 2230 2235 2240
Leu
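The stated lengths of SEQ ID NO: 137 (2623 residues) and SEQ ID NO: 138 (2241 residues) can be cross-checked against the position at which the fragment begins in the full-length protein. The sketch below is an illustration only. It copies residues 369 to 400 of SEQ ID NO: 137 from the listing and locates the first eight residues of SEQ ID NO: 138 within that window. The function name `locate` and the choice of window are assumptions made for the example.

```python
# Residues 369-400 of SEQ ID NO: 137, copied from two rows of the listing.
WINDOW_START = 369
WINDOW = (
    "Ile Ile Met Met Asn His Val Tyr Lys Glu Arg Phe Pro Lys Ala Thr "
    "Ala Gln Met Glu Glu Arg Leu Lys Glu Ile Ile Thr Ser Tyr Ser Pro"
).split()

# First eight residues of SEQ ID NO: 138.
FRAGMENT_HEAD = "Ala Thr Ala Gln Met Glu Glu Arg".split()

FULL_LENGTH = 2623      # <211> of SEQ ID NO: 137
FRAGMENT_LENGTH = 2241  # <211> of SEQ ID NO: 138


def locate(window, head, window_start):
    """Return the 1-based position of `head` in the parent sequence, or None."""
    for offset in range(len(window) - len(head) + 1):
        if window[offset:offset + len(head)] == head:
            return window_start + offset
    return None


fragment_start = locate(WINDOW, FRAGMENT_HEAD, WINDOW_START)
# If the fragment runs to the C-terminus, its length is FULL_LENGTH - start + 1.
implied_length = FULL_LENGTH - fragment_start + 1
print(fragment_start, implied_length, implied_length == FRAGMENT_LENGTH)
```

Under these assumptions the fragment begins at residue 383 of SEQ ID NO: 137, and a C-terminal fragment starting there is 2241 residues long, which agrees with the `<211>` value of SEQ ID NO: 138.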
<210> 139
<211> 8
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of MAST4 protein fragment
<400> 139
Ala Thr Ala Gln Met Glu Glu Arg
1 5
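A fusion gene encodes the downstream partner correctly only if the junction keeps the reading frame. The sketch below is a hedged consistency check rather than the applicants' procedure. It takes the row of SEQ ID NO: 140 whose running count ends at 3840, finds the 24-nucleotide break-point sequence of SEQ ID NO: 136 in it, and tests whether that position falls on a codon boundary. It assumes that the coding sequence starts at nucleotide 1 and that the MAST4 part ends with a single stop codon. All variable names are illustrative.

```python
# Row of SEQ ID NO: 140 whose running count ends at 3840, as printed.
ROW = "ccaggtgata aaattattca ggctacagct cagatggaag aacgtctaaa ggaaattatc"
ROW_END = 3840

# SEQ ID NO: 136, the break-point of the MAST4 gene fragment.
BREAKPOINT = "gctacagctc agatggaaga acgt"

row = ROW.replace(" ", "")
breakpoint_seq = BREAKPOINT.replace(" ", "")

row_start = ROW_END - len(row) + 1           # 1-based position of the row's first base
offset = row.find(breakpoint_seq)            # 0-based offset inside the row, -1 if absent
junction_start = row_start + offset          # 1-based position of the first MAST4 base

upstream_length = junction_start - 1         # bases contributed by the 5' partner
in_frame = upstream_length % 3 == 0          # True when the junction sits on a codon boundary

# Expected total length if the 2241-residue fragment (SEQ ID NO: 138)
# plus one stop codon follows the upstream part.
expected_total = upstream_length + 2241 * 3 + 3

print(junction_start, in_frame, expected_total)
```

With the values in the listing, the break-point sequence starts at nucleotide 3802, the 3801 upstream bases form 1267 whole codons, and the expected total of 10527 nucleotides agrees with the `<211>` value of SEQ ID NO: 140.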
<210> 140
<211> 10527
<212> DNA
<213> Artificial Sequence
<220>
<223> ERBB2IP-MAST4 fusion gene
<400> 140
atgactacaa aacgaagttt gtttgtgcgg ttggtaccat gtcgctgtct acgaggggaa 60
gaggagactg tcactactct tgattattct cattgcagct tagaacaagt tccgaaagag 120
atttttactt ttgaaaaaac cttggaggaa ctctatttag atgctaatca gattgaagag 180
cttccaaagc aactttttaa ctgtcagtct ttacacaaac tgagtttgcc agacaatgat 240
ttaacaacgt taccagcatc cattgcaaac cttattaatc tcagggaact ggatgtcagc 300
aagaatggaa tacaggagtt tccagaaaat ataaaaaatt gtaaagtttt gacaattgtg 360
gaggccagtg taaaccctat ttccaagctc cctgatggat tttctcagct gttaaaccta 420
acccagttgt atctgaatga tgcttttctt gagttcttgc cagcaaattt tggcagatta 480
actaaactcc aaatattaga gcttagagaa aaccagttaa aaatgttgcc taaaactatg 540
aatagactga cccagctgga aagactggat ttgggaagta acgaattcac ggaagtgcct 600
gaagtacttg agcaactaag tggattgaaa gagttttgga tggatgctaa tagactgact 660
tttattccag ggtttattgg tagtttgaaa cagctcacat atttggatgt ttctaaaaat 720
aatattgaaa tggttgaaga aggaatttca acatgtgaaa accttcaaga cctcctatta 780
tcaagcaatt cacttcagca gcttcctgag actattggtt cgttgaagaa tataacaacg 840
cttaaaatag atgaaaacca gttaatgtat ctgccagact ctataggagg gttaatatca 900
gtagaagaac tggattgtag tttcaatgaa gttgaagctt tgccttcatc tattgggcag 960
cttactaact taagaacttt tgctgctgat cataattact tacagcagtt gcccccagag 1020
attggaagct ggaaaaatat aactgtgctg tttctccatt ccaataaact tgagacactt 1080
ccagaggaaa tgggtgatat gcaaaaatta aaagtcatta atttaagtga taatagatta 1140
aagaatttac cctttagctt tacaaagcta cagcaattga cagctatgtg gctctcagat 1200
aatcagtcca aacccctgat acctcttcaa aaagaaactg attcagagac ccagaaaatg 1260
gtgcttacca actacatgtt ccctcaacag ccaaggactg aggatgttat gtttatatca 1320
gataatgaaa gttttaaccc ttcattgtgg gaggaacaga ggaaacagcg ggctcaagtt 1380
gcatttgaat gtgatgaaga caaagatgaa agggaggcac ctcccaggga gggaaattta 1440
aaaagatatc caacaccata cccagatgag cttaagaata tggtcaaaac tgttcaaacc 1500
attgtacata gattaaaaga tgaagagacc aatgaagact caggaagaga tttgaaacca 1560
catgaagatc aacaagatat aaataaagat gtgggtgtga agacctcaga aagtactact 1620
acagtaaaaa gcaaagttga tgaaagagaa aaatatatga taggaaactc tgtacagaag 1680
atcagtgaac ctgaagctga gattagtcct gggagtttac cagtgactgc aaatatgaaa 1740
gcctctgaga acttgaagca tattgttaac catgatgatg tttttgagga atctgaagaa 1800
ctttcttctg atgaagagat gaaaatggcg gagatgcgac caccattaat tgaaacctct 1860
attaaccagc caaaagtcgt agcacttagt aataacaaaa aagatgatac aaaggaaaca 1920
gattctttat cagatgaagt tacacacaat agcaatcaga ataacagcaa ttgttcttct 1980
ccatctcgga tgtctgattc agtttctctt aatactgata gtagtcaaga cacctcactc 2040
tgctctccag tgaaacaaac tcatattgat attaattcca aaatcaggca agaagatgaa 2100
aattttaaca gccttttaca aaatggagat attttaaaca gttcaacaga ggaaaagttc 2160
aaagctcatg ataaaaaaga ttttaactta cctgaatatg atttgaatgt tgaagagcga 2220
ttagttctaa ttgagaaaag tgttgactca acagccacag ctgatgacac tcacaaatta 2280
gatcatatca atatgaatct taataaactt ataactaatg atacatttca accagagatc 2340
atggaaagat caaaaacaca ggatattgtg cttggaacaa gctttttaag cattaattct 2400
aaagaggaaa ctgagcactt ggaaaatgga aacaagtatc ctaatttgga atccgtaaat 2460
aaggtaaatg gacattctga ggaaacttcc cagtctccta ataggactga accacatgac 2520
agtgattgtt ctgttgactt aggtatttcc aaaagcactg aagatctctc ccctcagaaa 2580
agtggtccag ttggatctgt tgtgaaatct catagcataa ctaatatgga gattggaggg 2640
ctaaaaatct atgatattct tagtgataat ggacctcagc agccaagtac aaccgttaaa 2700
atcacatctg ctgttgatgg aaaaaatata gtcaggagca agtctgccac actgttgtat 2760
gatcaaccat tgcaggtatt tactggttct tcctcatctt ctgatttaat atcaggaaca 2820
aaggcaattt tcaagtttga ttcaaatcat aatcccgaag agccaaatat aataagaggc 2880
cccacaagtg gcccacaatc tgcacctcaa atatatggtc ctccacagta taatatccaa 2940
tacagtagca gtgctgcagt caaagacact ttgtggcact ccaaacaaaa tccccaaata 3000
gaccatgcca gttttcctcc tcagctcctt cctagatcag agagcacaga aaatcaaagt 3060
tatgctaaac attctgccaa tatgaatttc tctaatcata acaatgttcg agctaatact 3120
gcataccatt tacatcagag acttggccca gcaagacatg gggaaatgtg ggccatctca 3180
ccaaacgacc gacttattcc tgcagtaact cgaagtacaa tccagcgaca aagtagtgtg 3240
tcctccacag cctctgtaaa tcttggtgat ccaggctcta caaggcgggc tcagattcct 3300
gaaggagatt atttatcata cagagagttc cactcagcgg gaagaactcc tccaatgatg 3360
ccaggatcac agagacccct ttctgcacga acatacagca tagatggtcc aaatgcatca 3420
agacctcaga gtgctcgacc ctctattaat gaaataccag agagaactat gtcagttagt 3480
gatttcaatt attcacggac tagtccttca aaaagaccaa atgcaagggt tggttctgag 3540
cattctttat tagatcctcc aggaaaaagt aaagttcctc gtgactggag agaacaagta 3600
cttcgacata ttgaagccaa aaagttagaa aagattcgag tgagggttga aaaggatcca 3660
gaacttggat ttagcatatc aggtggtgtc gggggtagag gaaacccatt cagacctgat 3720
gatgatggta tatttgtaac aagggtacaa cctgaaggac cagcatcaaa attactgcag 3780
ccaggtgata aaattattca ggctacagct cagatggaag aacgtctaaa ggaaattatc 3840
accagctact ctcctgacaa cgttctaccc ttagcagatg gagtgcttag tttcactcac 3900
caccagatta ttgaactggc tcgagattgc ttggataaat cccaccaggg cctcatcacc 3960
tcacgatact tccttgaatt acagcacaaa ttagataagt tgctacagga ggctcatgat 4020
cgttcagaaa gtggagaatt ggcatttatt aaacaactag ttcgaaagat cctaattgtt 4080
attgcccgcc ctgctcggtt attagagtgc ctggaatttg atccggaaga attttactac 4140
ctattggaag cagcagaagg ccatgccaaa gaaggacagg gtattaaaac cgacattccc 4200
aggtacatca ttagccaact gggactcaat aaggatccct tggaagaaat ggctcatttg 4260
ggaaactacg atagtgggac agcagaaaca ccagaaacag atgaatcagt gagtagctct 4320
aatgcctccc tgaaacttcg aaggaaacct cgggaaagtg attttgaaac gattaaattg 4380
attagcaatg gagcctatgg ggcagtctac tttgttcggc ataaagaatc ccggcagagg 4440
tttgccatga agaagattaa taaacagaac ctcatccttc gaaaccagat ccagcaggcc 4500
tttgtggagc gggatatcct gacttttgca gaaaacccct ttgttgtcag catgtattgc 4560
tcctttgaaa caaggcgcca cttgtgcatg gtcatggaat atgtggaagg gggagactgt 4620
gctactttaa tgaaaaacat gggtcctctc cctgttgata tggccagaat gtactttgct 4680
gagacggtct tggccttgga atatttacat aattatggaa ttgtacacag ggatttgaaa 4740
ccagacaact tgttggttac ctccatgggg cacataaagc tgacagattt tggattatct 4800
aaggtgggac taatgagcat gactaccaac ctttacgagg gtcatattga gaaggatgct 4860
agagagttcc tggataaaca ggtctgtggc acacctgaat acattgcacc agaagtgatt 4920
ctgaggcagg gttatggaaa gccggtggac tggtgggcca tggggattat cctctatgaa 4980
tttctggttg gatgcgtgcc attctttggg gatactccag aggagctatt tggacaagtc 5040
atcagtgatg agatcaactg gcctgagaag gatgaggcac ccccacctga tgcccaggat 5100
ctgattacct tactcctcag gcagaatccc ctggagaggc tgggaacagg tggtgcatat 5160
gaagtcaaac agcatcgatt cttccgttct ttagactgga acagtttgct gagacagaag 5220
gcagaattta ttccccaact ggaatctgag gatgacacaa gttattttga tactcggtct 5280
gagaagtatc atcatatgga aacggaggaa gaagatgaca caaatgatga agactttaat 5340
gtggaaataa ggcagttttc ttcatgttca cacaggtttt caaaagtttt cagcagtata 5400
gatcgaatca ctcagaattc agcagaagag aaggaagact ctgtggacaa aaccaaaagc 5460
accaccttgc catccacaga aacactgagc tggagttcag aatattctga aatgcaacag 5520
ctatcaacat ccaactcttc agatactgaa agcaacagac ataaactcag ttctggccta 5580
cttcccaaac tggctatttc aacagaggga gagcaagatg aagctgcctc ctgccctgga 5640
gacccccatg aggagccagg aaagccagcc cttcctcctg aagagtgtgc ccaggaggag 5700
cctgaggtca ccaccccagc cagcaccatc agcagctcca ccctgtcagt tggcagtttt 5760
tcagagcact tggatcagat aaatggacga agcgagtgtg tggacagtac agataattcc 5820
tcaaagccat ccagtgaacc cgcttctcac atggctcggc agcgattaga aagcacagaa 5880
aaaaagaaaa tctcggggaa agtcacaaag tccctctctg ccagtgctct ttccctcatg 5940
atcccaggag atatgtttgc tgtttcccct ctgggaagtc caatgtctcc ccattccctg 6000
tcctcggacc cttcttcttc acgagattcc tctcccagcc gagattcctc agcagcttct 6060
gccagtccac atcagccgat tgtgatccac agttcgggga agaactacgg ctttaccatc 6120
cgagccatcc gggtgtatgt gggagacagt gacatctata cagtgcacca tatcgtctgg 6180
aatgtagaag aaggaagtcc ggcatgccag gcaggactga aggctggaga tcttatcact 6240
cacatcaatg gagaaccagt gcatggactt gtccacacag aagttataga actcctactg 6300
aagagtggga ataaggtgtc aatcactact accccatttg aaaacacatc aatcaaaact 6360
ggaccagcca ggagaaacag ctataagagc cggatggtga ggcggagcaa gaaatccaag 6420
aagaaagaaa gtctcgaaag gaggagatct cttttcaaaa agctagccaa gcagccttct 6480
cctttactcc acaccagccg aagtttctcc tgcttgaaca gatccctgtc atcgggtgag 6540
agcctcccag gttcccccac tcatagcttg tctccccggt ctccaacacc aagctaccgc 6600
tccacccctg acttcccatc tggtactaat tcctcccaga gcagctcccc tagttctagt 6660
gcccccaatt ccccagcagg gtccgggcac atccggccca gcactctcca cggtcttgca 6720
cccaaactcg gcgggcagcg gtaccggtcc ggaaggcgaa agtccgccgg caacatccca 6780
ctgtccccgc tggcccggac gccctctcca accccgcaac ccacctcccc gcagcggtca 6840
ccatcccctc ttctgggaca ctcactgggc aattccaaga tcgcgcaagc ctttcccagc 6900
aagatgcact ccccgcccac catcgtcaga cacatcgtga ggcccaagag tgcggagccc 6960
cccaggtccc cgctgctcaa gcgcgtgcag tccgaggaga agctgtcgcc ctcttacggc 7020
agtgacaaga agcacctgtg ctcccgcaag cacagcctgg aggtgaccca agaggaggtg 7080
cagcgggagc agtcccagcg ggaggcgccg ctgcagagcc tggatgagaa cgtgtgcgac 7140
gtgccgccgc tcagccgcgc ccggccagtg gagcaaggct gcctgaaacg cccagtctcc 7200
cggaaggtgg gccgccagga gtctgtggac gacctggacc gcgacaagct gaaggccaag 7260
gtggtggtga agaaagcaga cggcttccca gagaaacagg aatcccacca gaaatcccat 7320
ggacccggga gtgatttgga aaactttgct ctgtttaagc tggaagagag agagaagaaa 7380
gtctatccga aggctgtgga aaggtcaagt acttttgaaa acaaagcgtc tatgcaggag 7440
gcgccaccgc tgggcagcct gctgaaggat gctcttcaca agcaggccag cgtgcgcgcc 7500
agcgagggtg cgatgtcgga tggccgggtg cctgcggagc accgccaggg tggcggggac 7560
ttcagacggg cccccgctcc tggcaccctc caggatggtc tctgccactc cctcgacagg 7620
ggcatctctg ggaaggggga aggcacggag aagtcctccc aggccaagga gcttctccga 7680
tgtgaaaagt tagacagcaa gctggccaac atcgattacc tccgaaagaa aatgtcactt 7740
gaggacaaag aggacaacct ctgccctgtg ctgaagccca agatgacagc tggctcccac 7800
gaatgcctgc cagggaaccc agtccgaccc acgggtgggc agcaggagcc cccgccggct 7860
tctgagagcc gagcttttgt cagcagcacc catgcagctc agatgagtgc cgtctctttt 7920
gttcccctca aggccttaac aggccgggtg gacagtggaa cggagaagcc tggcttggtt 7980
gctcctgagt cccctgttag gaagagcccc tccgagtata agctggaagg taggtctgtc 8040
tcatgcctga agccgatcga gggcactctg gacattgctc tcctgtccgg acctcaggcc 8100
tccaagacag aactgccttc cccagagtct gcacagagcc ccagcccaag tggtgacgtg 8160
agggcctctg tgccaccagt tctccccagc agcagtggga aaaagaacga taccaccagt 8220
gcaagagagc tttctccttc cagcttaaag atgaataaat cctacctgct ggagccttgg 8280
ttcctgcccc ccagccgagg tctccagaat tcaccagcag tttccctgcc tgacccagag 8340
ttcaagaggg acaggaaagg tccccatcct actgccagga gccctggaac agtcatggaa 8400
agcaatcccc aacagagaga gggcagctcc cctaaacacc aagaccacac cactgacccc 8460
aagcttctga cctgcctggg gcagaacctc cacagccctg acctggccag gccacgctgc 8520
ccgctcccac ctgaagcttc cccctcaagg gagaagccag gcctgaggga atcgtctgaa 8580
agaggccctc ccacagccag aagcgagcgc tctgctgcga gggctgacac atgcagagag 8640
ccctccatgg aactgtgctt tccagaaact gcgaaaacca gtgacaactc caaaaatctc 8700
ctctctgtgg gaaggaccca cccagatttc tatacacaga cccaggccat ggagaaagca 8760
tgggcgccgg gtgggaaaac gaaccacaaa gatggcccag gtgaggcgag gcccccgccc 8820
agagacaact cctctctgca ctcagctgga attccctgtg agaaggagct gggcaaggtg 8880
aggcgtggcg tggaacccaa gcccgaagcg cttcttgcca ggcggtctct gcagccacct 8940
ggaattgaga gtgagaagag tgaaaagctc tccagtttcc catctttgca gaaagatggt 9000
gccaaggaac ctgaaaggaa ggagcagcct ctacaaaggc atcccagcag catccctccg 9060
ccccctctga cggccaaaga cctgtccagc ccggctgcca ggcagcattg cagttcccca 9120
agccacgctt ctggcagaga gccgggggcc aagcccagca ctgcagagcc cagctcgagc 9180
ccccaggacc ctcccaagcc tgttgctgcg cacagtgaaa gcagcagcca caagccccgg 9240
cctggccctg acccgggccc tccaaagact aagcaccccg accggtccct ctcctctcag 9300
aaaccaagtg tcggggccac aaagggcaaa gagcctgcca ctcagtccct cggtggctct 9360
agcagagagg ggaagggcca cagtaagagt gggccggatg tgtttcctgc taccccaggc 9420
tcccagaaca aagccagcga tgggattggc cagggagaag gtgggccctc tgtcccactg 9480
cacactgaca gggctcctct agacgccaag ccacaaccca ccagtggtgg gcggcccctg 9540
gaggtgctgg agaagcctgt gcatttgcca aggccgggac acccagggcc tagtgagcca 9600
gcggaccaga aactgtccgc tgttggtgaa aagcaaaccc tgtctccaaa gcaccccaaa 9660
ccatccactg tgaaagattg ccccaccctg tgcaaacaga cagacaacag acagacagac 9720
aaaagcccga gtcagccggc cgccaacacc gacagaaggg cggaagggaa gaaatgcact 9780
gaagcacttt atgctccagc agagggcgac aagctcgagg ccggcctttc ctttgtgcat 9840
agcgagaacc ggttgaaagg cgcggagcgg ccagccgcgg gggtggggaa gggcttccct 9900
gaggccagag ggaaagggcc cggtccccag aagccaccga cggaggcaga caagcccaat 9960
ggcatgaaac ggtccccctc agccactggg cagagttctt tccgatccac ggccctcccg 10020
gaaaagtctc tgagctgctc ctccagcttc cctgaaacca gggccggagt tagagaggcc 10080
tctgcagcca gcagcgacac ctcttctgcc aaggccgccg ggggcatgct ggagcttcca 10140
gcccccagca acagggacca taggaaggct cagcctgccg gggagggccg aacccacatg 10200
acaaagagtg actccctgcc ctccttccgg gtctccaccc tgcctctgga gtcacaccac 10260
cccgacccaa acaccatggg cggggccagc caccgggaca gggctctctc ggtgactgcc 10320
accgtagggg aaaccaaagg gaaggaccct gccccagccc agcctccccc agctaggaaa 10380
cagaacgtgg gcagagacgt gaccaagcca tccccagccc caaacactga ccgccccatc 10440
tctctttcta atgagaagga ctttgtggta cggcagaggc gggggaaaga gagtttgcgt 10500
agcagccctc acaaaaaggc cttgtaa 10527
<210> 141
<211> 48
<212> DNA
<213> Artificial Sequence
<220>
<223> Fused region of ERBB2IP-MAST4 fusion gene
<400> 141
cagccaggtg ataaaattat tcaggctaca gctcagatgg aagaacgt 48
<210> 142
<211> 3508
<212> PRT
<213> Artificial Sequence
<220>
<223> ERBB2IP-MAST4 fusion protein
<400> 142
Met Thr Thr Lys Arg Ser Leu Phe Val Arg Leu Val Pro Cys Arg Cys
1 5 10 15
Leu Arg Gly Glu Glu Glu Thr Val Thr Thr Leu Asp Tyr Ser His Cys
20 25 30
Ser Leu Glu Gln Val Pro Lys Glu Ile Phe Thr Phe Glu Lys Thr Leu
35 40 45
Glu Glu Leu Tyr Leu Asp Ala Asn Gln Ile Glu Glu Leu Pro Lys Gln
50 55 60
Leu Phe Asn Cys Gln Ser Leu His Lys Leu Ser Leu Pro Asp Asn Asp
65 70 75 80
Leu Thr Thr Leu Pro Ala Ser Ile Ala Asn Leu Ile Asn Leu Arg Glu
85 90 95
Leu Asp Val Ser Lys Asn Gly Ile Gln Glu Phe Pro Glu Asn Ile Lys
100 105 110
Asn Cys Lys Val Leu Thr Ile Val Glu Ala Ser Val Asn Pro Ile Ser
115 120 125
Lys Leu Pro Asp Gly Phe Ser Gln Leu Leu Asn Leu Thr Gln Leu Tyr
130 135 140
Leu Asn Asp Ala Phe Leu Glu Phe Leu Pro Ala Asn Phe Gly Arg Leu
145 150 155 160
Thr Lys Leu Gln Ile Leu Glu Leu Arg Glu Asn Gln Leu Lys Met Leu
165 170 175
Pro Lys Thr Met Asn Arg Leu Thr Gln Leu Glu Arg Leu Asp Leu Gly
180 185 190
Ser Asn Glu Phe Thr Glu Val Pro Glu Val Leu Glu Gln Leu Ser Gly
195 200 205
Leu Lys Glu Phe Trp Met Asp Ala Asn Arg Leu Thr Phe Ile Pro Gly
210 215 220
Phe Ile Gly Ser Leu Lys Gln Leu Thr Tyr Leu Asp Val Ser Lys Asn
225 230 235 240
Asn Ile Glu Met Val Glu Glu Gly Ile Ser Thr Cys Glu Asn Leu Gln
245 250 255
Asp Leu Leu Leu Ser Ser Asn Ser Leu Gln Gln Leu Pro Glu Thr Ile
260 265 270
Gly Ser Leu Lys Asn Ile Thr Thr Leu Lys Ile Asp Glu Asn Gln Leu
275 280 285
Met Tyr Leu Pro Asp Ser Ile Gly Gly Leu Ile Ser Val Glu Glu Leu
290 295 300
Asp Cys Ser Phe Asn Glu Val Glu Ala Leu Pro Ser Ser Ile Gly Gln
305 310 315 320
Leu Thr Asn Leu Arg Thr Phe Ala Ala Asp His Asn Tyr Leu Gln Gln
325 330 335
Leu Pro Pro Glu Ile Gly Ser Trp Lys Asn Ile Thr Val Leu Phe Leu
340 345 350
His Ser Asn Lys Leu Glu Thr Leu Pro Glu Glu Met Gly Asp Met Gln
355 360 365
Lys Leu Lys Val Ile Asn Leu Ser Asp Asn Arg Leu Lys Asn Leu Pro
370 375 380
Phe Ser Phe Thr Lys Leu Gln Gln Leu Thr Ala Met Trp Leu Ser Asp
385 390 395 400
Asn Gln Ser Lys Pro Leu Ile Pro Leu Gln Lys Glu Thr Asp Ser Glu
405 410 415
Thr Gln Lys Met Val Leu Thr Asn Tyr Met Phe Pro Gln Gln Pro Arg
420 425 430
Thr Glu Asp Val Met Phe Ile Ser Asp Asn Glu Ser Phe Asn Pro Ser
435 440 445
Leu Trp Glu Glu Gln Arg Lys Gln Arg Ala Gln Val Ala Phe Glu Cys
450 455 460
Asp Glu Asp Lys Asp Glu Arg Glu Ala Pro Pro Arg Glu Gly Asn Leu
465 470 475 480
Lys Arg Tyr Pro Thr Pro Tyr Pro Asp Glu Leu Lys Asn Met Val Lys
485 490 495
Thr Val Gln Thr Ile Val His Arg Leu Lys Asp Glu Glu Thr Asn Glu
500 505 510
Asp Ser Gly Arg Asp Leu Lys Pro His Glu Asp Gln Gln Asp Ile Asn
515 520 525
Lys Asp Val Gly Val Lys Thr Ser Glu Ser Thr Thr Thr Val Lys Ser
530 535 540
Lys Val Asp Glu Arg Glu Lys Tyr Met Ile Gly Asn Ser Val Gln Lys
545 550 555 560
Ile Ser Glu Pro Glu Ala Glu Ile Ser Pro Gly Ser Leu Pro Val Thr
565 570 575
Ala Asn Met Lys Ala Ser Glu Asn Leu Lys His Ile Val Asn His Asp
580 585 590
Asp Val Phe Glu Glu Ser Glu Glu Leu Ser Ser Asp Glu Glu Met Lys
595 600 605
Met Ala Glu Met Arg Pro Pro Leu Ile Glu Thr Ser Ile Asn Gln Pro
610 615 620
Lys Val Val Ala Leu Ser Asn Asn Lys Lys Asp Asp Thr Lys Glu Thr
625 630 635 640
Asp Ser Leu Ser Asp Glu Val Thr His Asn Ser Asn Gln Asn Asn Ser
645 650 655
Asn Cys Ser Ser Pro Ser Arg Met Ser Asp Ser Val Ser Leu Asn Thr
660 665 670
Asp Ser Ser Gln Asp Thr Ser Leu Cys Ser Pro Val Lys Gln Thr His
675 680 685
Ile Asp Ile Asn Ser Lys Ile Arg Gln Glu Asp Glu Asn Phe Asn Ser
690 695 700
Leu Leu Gln Asn Gly Asp Ile Leu Asn Ser Ser Thr Glu Glu Lys Phe
705 710 715 720
Lys Ala His Asp Lys Lys Asp Phe Asn Leu Pro Glu Tyr Asp Leu Asn
725 730 735
Val Glu Glu Arg Leu Val Leu Ile Glu Lys Ser Val Asp Ser Thr Ala
740 745 750
Thr Ala Asp Asp Thr His Lys Leu Asp His Ile Asn Met Asn Leu Asn
755 760 765
Lys Leu Ile Thr Asn Asp Thr Phe Gln Pro Glu Ile Met Glu Arg Ser
770 775 780
Lys Thr Gln Asp Ile Val Leu Gly Thr Ser Phe Leu Ser Ile Asn Ser
785 790 795 800
Lys Glu Glu Thr Glu His Leu Glu Asn Gly Asn Lys Tyr Pro Asn Leu
805 810 815
Glu Ser Val Asn Lys Val Asn Gly His Ser Glu Glu Thr Ser Gln Ser
820 825 830
Pro Asn Arg Thr Glu Pro His Asp Ser Asp Cys Ser Val Asp Leu Gly
835 840 845
Ile Ser Lys Ser Thr Glu Asp Leu Ser Pro Gln Lys Ser Gly Pro Val
850 855 860
Gly Ser Val Val Lys Ser His Ser Ile Thr Asn Met Glu Ile Gly Gly
865 870 875 880
Leu Lys Ile Tyr Asp Ile Leu Ser Asp Asn Gly Pro Gln Gln Pro Ser
885 890 895
Thr Thr Val Lys Ile Thr Ser Ala Val Asp Gly Lys Asn Ile Val Arg
900 905 910
Ser Lys Ser Ala Thr Leu Leu Tyr Asp Gln Pro Leu Gln Val Phe Thr
915 920 925
Gly Ser Ser Ser Ser Ser Asp Leu Ile Ser Gly Thr Lys Ala Ile Phe
930 935 940
Lys Phe Asp Ser Asn His Asn Pro Glu Glu Pro Asn Ile Ile Arg Gly
945 950 955 960
Pro Thr Ser Gly Pro Gln Ser Ala Pro Gln Ile Tyr Gly Pro Pro Gln
965 970 975
Tyr Asn Ile Gln Tyr Ser Ser Ser Ala Ala Val Lys Asp Thr Leu Trp
980 985 990
His Ser Lys Gln Asn Pro Gln Ile Asp His Ala Ser Phe Pro Pro Gln
995 1000 1005
Leu Leu Pro Arg Ser Glu Ser Thr Glu Asn Gln Ser Tyr Ala Lys His
1010 1015 1020
Ser Ala Asn Met Asn Phe Ser Asn His Asn Asn Val Arg Ala Asn Thr
1025 1030 1035 1040
Ala Tyr His Leu His Gln Arg Leu Gly Pro Ala Arg His Gly Glu Met
1045 1050 1055
Trp Ala Ile Ser Pro Asn Asp Arg Leu Ile Pro Ala Val Thr Arg Ser
1060 1065 1070
Thr Ile Gln Arg Gln Ser Ser Val Ser Ser Thr Ala Ser Val Asn Leu
1075 1080 1085
Gly Asp Pro Gly Ser Thr Arg Arg Ala Gln Ile Pro Glu Gly Asp Tyr
1090 1095 1100
Leu Ser Tyr Arg Glu Phe His Ser Ala Gly Arg Thr Pro Pro Met Met
1105 1110 1115 1120
Pro Gly Ser Gln Arg Pro Leu Ser Ala Arg Thr Tyr Ser Ile Asp Gly
1125 1130 1135
Pro Asn Ala Ser Arg Pro Gln Ser Ala Arg Pro Ser Ile Asn Glu Ile
1140 1145 1150
Pro Glu Arg Thr Met Ser Val Ser Asp Phe Asn Tyr Ser Arg Thr Ser
1155 1160 1165
Pro Ser Lys Arg Pro Asn Ala Arg Val Gly Ser Glu His Ser Leu Leu
1170 1175 1180
Asp Pro Pro Gly Lys Ser Lys Val Pro Arg Asp Trp Arg Glu Gln Val
1185 1190 1195 1200
Leu Arg His Ile Glu Ala Lys Lys Leu Glu Lys Ile Arg Val Arg Val
1205 1210 1215
Glu Lys Asp Pro Glu Leu Gly Phe Ser Ile Ser Gly Gly Val Gly Gly
1220 1225 1230
Arg Gly Asn Pro Phe Arg Pro Asp Asp Asp Gly Ile Phe Val Thr Arg
1235 1240 1245
Val Gln Pro Glu Gly Pro Ala Ser Lys Leu Leu Gln Pro Gly Asp Lys
1250 1255 1260
Ile Ile Gln Ala Thr Ala Gln Met Glu Glu Arg Leu Lys Glu Ile Ile
1265 1270 1275 1280
Thr Ser Tyr Ser Pro Asp Asn Val Leu Pro Leu Ala Asp Gly Val Leu
1285 1290 1295
Ser Phe Thr His His Gln Ile Ile Glu Leu Ala Arg Asp Cys Leu Asp
1300 1305 1310
Lys Ser His Gln Gly Leu Ile Thr Ser Arg Tyr Phe Leu Glu Leu Gln
1315 1320 1325
His Lys Leu Asp Lys Leu Leu Gln Glu Ala His Asp Arg Ser Glu Ser
1330 1335 1340
Gly Glu Leu Ala Phe Ile Lys Gln Leu Val Arg Lys Ile Leu Ile Val
1345 1350 1355 1360
Ile Ala Arg Pro Ala Arg Leu Leu Glu Cys Leu Glu Phe Asp Pro Glu
1365 1370 1375
Glu Phe Tyr Tyr Leu Leu Glu Ala Ala Glu Gly His Ala Lys Glu Gly
1380 1385 1390
Gln Gly Ile Lys Thr Asp Ile Pro Arg Tyr Ile Ile Ser Gln Leu Gly
1395 1400 1405
Leu Asn Lys Asp Pro Leu Glu Glu Met Ala His Leu Gly Asn Tyr Asp
1410 1415 1420
Ser Gly Thr Ala Glu Thr Pro Glu Thr Asp Glu Ser Val Ser Ser Ser
1425 1430 1435 1440
Asn Ala Ser Leu Lys Leu Arg Arg Lys Pro Arg Glu Ser Asp Phe Glu
1445 1450 1455
Thr Ile Lys Leu Ile Ser Asn Gly Ala Tyr Gly Ala Val Tyr Phe Val
1460 1465 1470
Arg His Lys Glu Ser Arg Gln Arg Phe Ala Met Lys Lys Ile Asn Lys
1475 1480 1485
Gln Asn Leu Ile Leu Arg Asn Gln Ile Gln Gln Ala Phe Val Glu Arg
1490 1495 1500
Asp Ile Leu Thr Phe Ala Glu Asn Pro Phe Val Val Ser Met Tyr Cys
1505 1510 1515 1520
Ser Phe Glu Thr Arg Arg His Leu Cys Met Val Met Glu Tyr Val Glu
1525 1530 1535
Gly Gly Asp Cys Ala Thr Leu Met Lys Asn Met Gly Pro Leu Pro Val
1540 1545 1550
Asp Met Ala Arg Met Tyr Phe Ala Glu Thr Val Leu Ala Leu Glu Tyr
1555 1560 1565
Leu His Asn Tyr Gly Ile Val His Arg Asp Leu Lys Pro Asp Asn Leu
1570 1575 1580
Leu Val Thr Ser Met Gly His Ile Lys Leu Thr Asp Phe Gly Leu Ser
1585 1590 1595 1600
Lys Val Gly Leu Met Ser Met Thr Thr Asn Leu Tyr Glu Gly His Ile
1605 1610 1615
Glu Lys Asp Ala Arg Glu Phe Leu Asp Lys Gln Val Cys Gly Thr Pro
1620 1625 1630
Glu Tyr Ile Ala Pro Glu Val Ile Leu Arg Gln Gly Tyr Gly Lys Pro
1635 1640 1645
Val Asp Trp Trp Ala Met Gly Ile Ile Leu Tyr Glu Phe Leu Val Gly
1650 1655 1660
Cys Val Pro Phe Phe Gly Asp Thr Pro Glu Glu Leu Phe Gly Gln Val
1665 1670 1675 1680
Ile Ser Asp Glu Ile Asn Trp Pro Glu Lys Asp Glu Ala Pro Pro Pro
1685 1690 1695
Asp Ala Gln Asp Leu Ile Thr Leu Leu Leu Arg Gln Asn Pro Leu Glu
1700 1705 1710
Arg Leu Gly Thr Gly Gly Ala Tyr Glu Val Lys Gln His Arg Phe Phe
1715 1720 1725
Arg Ser Leu Asp Trp Asn Ser Leu Leu Arg Gln Lys Ala Glu Phe Ile
1730 1735 1740
Pro Gln Leu Glu Ser Glu Asp Asp Thr Ser Tyr Phe Asp Thr Arg Ser
1745 1750 1755 1760
Glu Lys Tyr His His Met Glu Thr Glu Glu Glu Asp Asp Thr Asn Asp
1765 1770 1775
Glu Asp Phe Asn Val Glu Ile Arg Gln Phe Ser Ser Cys Ser His Arg
1780 1785 1790
Phe Ser Lys Val Phe Ser Ser Ile Asp Arg Ile Thr Gln Asn Ser Ala
1795 1800 1805
Glu Glu Lys Glu Asp Ser Val Asp Lys Thr Lys Ser Thr Thr Leu Pro
1810 1815 1820
Ser Thr Glu Thr Leu Ser Trp Ser Ser Glu Tyr Ser Glu Met Gln Gln
1825 1830 1835 1840
Leu Ser Thr Ser Asn Ser Ser Asp Thr Glu Ser Asn Arg His Lys Leu
1845 1850 1855
Ser Ser Gly Leu Leu Pro Lys Leu Ala Ile Ser Thr Glu Gly Glu Gln
1860 1865 1870
Asp Glu Ala Ala Ser Cys Pro Gly Asp Pro His Glu Glu Pro Gly Lys
1875 1880 1885
Pro Ala Leu Pro Pro Glu Glu Cys Ala Gln Glu Glu Pro Glu Val Thr
1890 1895 1900
Thr Pro Ala Ser Thr Ile Ser Ser Ser Thr Leu Ser Val Gly Ser Phe
1905 1910 1915 1920
Ser Glu His Leu Asp Gln Ile Asn Gly Arg Ser Glu Cys Val Asp Ser
1925 1930 1935
Thr Asp Asn Ser Ser Lys Pro Ser Ser Glu Pro Ala Ser His Met Ala
1940 1945 1950
Arg Gln Arg Leu Glu Ser Thr Glu Lys Lys Lys Ile Ser Gly Lys Val
1955 1960 1965
Thr Lys Ser Leu Ser Ala Ser Ala Leu Ser Leu Met Ile Pro Gly Asp
1970 1975 1980
Met Phe Ala Val Ser Pro Leu Gly Ser Pro Met Ser Pro His Ser Leu
1985 1990 1995 2000
Ser Ser Asp Pro Ser Ser Ser Arg Asp Ser Ser Pro Ser Arg Asp Ser
2005 2010 2015
Ser Ala Ala Ser Ala Ser Pro His Gln Pro Ile Val Ile His Ser Ser
2020 2025 2030
Gly Lys Asn Tyr Gly Phe Thr Ile Arg Ala Ile Arg Val Tyr Val Gly
2035 2040 2045
Asp Ser Asp Ile Tyr Thr Val His His Ile Val Trp Asn Val Glu Glu
2050 2055 2060
Gly Ser Pro Ala Cys Gln Ala Gly Leu Lys Ala Gly Asp Leu Ile Thr
2065 2070 2075 2080
His Ile Asn Gly Glu Pro Val His Gly Leu Val His Thr Glu Val Ile
2085 2090 2095
Glu Leu Leu Leu Lys Ser Gly Asn Lys Val Ser Ile Thr Thr Thr Pro
2100 2105 2110
Phe Glu Asn Thr Ser Ile Lys Thr Gly Pro Ala Arg Arg Asn Ser Tyr
2115 2120 2125
Lys Ser Arg Met Val Arg Arg Ser Lys Lys Ser Lys Lys Lys Glu Ser
2130 2135 2140
Leu Glu Arg Arg Arg Ser Leu Phe Lys Lys Leu Ala Lys Gln Pro Ser
2145 2150 2155 2160
Pro Leu Leu His Thr Ser Arg Ser Phe Ser Cys Leu Asn Arg Ser Leu
2165 2170 2175
Ser Ser Gly Glu Ser Leu Pro Gly Ser Pro Thr His Ser Leu Ser Pro
2180 2185 2190
Arg Ser Pro Thr Pro Ser Tyr Arg Ser Thr Pro Asp Phe Pro Ser Gly
2195 2200 2205
Thr Asn Ser Ser Gln Ser Ser Ser Pro Ser Ser Ser Ala Pro Asn Ser
2210 2215 2220
Pro Ala Gly Ser Gly His Ile Arg Pro Ser Thr Leu His Gly Leu Ala
2225 2230 2235 2240
Pro Lys Leu Gly Gly Gln Arg Tyr Arg Ser Gly Arg Arg Lys Ser Ala
2245 2250 2255
Gly Asn Ile Pro Leu Ser Pro Leu Ala Arg Thr Pro Ser Pro Thr Pro
2260 2265 2270
Gln Pro Thr Ser Pro Gln Arg Ser Pro Ser Pro Leu Leu Gly His Ser
2275 2280 2285
Leu Gly Asn Ser Lys Ile Ala Gln Ala Phe Pro Ser Lys Met His Ser
2290 2295 2300
Pro Pro Thr Ile Val Arg His Ile Val Arg Pro Lys Ser Ala Glu Pro
2305 2310 2315 2320
Pro Arg Ser Pro Leu Leu Lys Arg Val Gln Ser Glu Glu Lys Leu Ser
2325 2330 2335
Pro Ser Tyr Gly Ser Asp Lys Lys His Leu Cys Ser Arg Lys His Ser
2340 2345 2350
Leu Glu Val Thr Gln Glu Glu Val Gln Arg Glu Gln Ser Gln Arg Glu
2355 2360 2365
Ala Pro Leu Gln Ser Leu Asp Glu Asn Val Cys Asp Val Pro Pro Leu
2370 2375 2380
Ser Arg Ala Arg Pro Val Glu Gln Gly Cys Leu Lys Arg Pro Val Ser
2385 2390 2395 2400
Arg Lys Val Gly Arg Gln Glu Ser Val Asp Asp Leu Asp Arg Asp Lys
2405 2410 2415
Leu Lys Ala Lys Val Val Val Lys Lys Ala Asp Gly Phe Pro Glu Lys
2420 2425 2430
Gln Glu Ser His Gln Lys Ser His Gly Pro Gly Ser Asp Leu Glu Asn
2435 2440 2445
Phe Ala Leu Phe Lys Leu Glu Glu Arg Glu Lys Lys Val Tyr Pro Lys
2450 2455 2460
Ala Val Glu Arg Ser Ser Thr Phe Glu Asn Lys Ala Ser Met Gln Glu
2465 2470 2475 2480
Ala Pro Pro Leu Gly Ser Leu Leu Lys Asp Ala Leu His Lys Gln Ala
2485 2490 2495
Ser Val Arg Ala Ser Glu Gly Ala Met Ser Asp Gly Arg Val Pro Ala
2500 2505 2510
Glu His Arg Gln Gly Gly Gly Asp Phe Arg Arg Ala Pro Ala Pro Gly
2515 2520 2525
Thr Leu Gln Asp Gly Leu Cys His Ser Leu Asp Arg Gly Ile Ser Gly
2530 2535 2540
Lys Gly Glu Gly Thr Glu Lys Ser Ser Gln Ala Lys Glu Leu Leu Arg
2545 2550 2555 2560
Cys Glu Lys Leu Asp Ser Lys Leu Ala Asn Ile Asp Tyr Leu Arg Lys
2565 2570 2575
Lys Met Ser Leu Glu Asp Lys Glu Asp Asn Leu Cys Pro Val Leu Lys
2580 2585 2590
Pro Lys Met Thr Ala Gly Ser His Glu Cys Leu Pro Gly Asn Pro Val
2595 2600 2605
Arg Pro Thr Gly Gly Gln Gln Glu Pro Pro Pro Ala Ser Glu Ser Arg
2610 2615 2620
Ala Phe Val Ser Ser Thr His Ala Ala Gln Met Ser Ala Val Ser Phe
2625 2630 2635 2640
Val Pro Leu Lys Ala Leu Thr Gly Arg Val Asp Ser Gly Thr Glu Lys
2645 2650 2655
Pro Gly Leu Val Ala Pro Glu Ser Pro Val Arg Lys Ser Pro Ser Glu
2660 2665 2670
Tyr Lys Leu Glu Gly Arg Ser Val Ser Cys Leu Lys Pro Ile Glu Gly
2675 2680 2685
Thr Leu Asp Ile Ala Leu Leu Ser Gly Pro Gln Ala Ser Lys Thr Glu
2690 2695 2700
Leu Pro Ser Pro Glu Ser Ala Gln Ser Pro Ser Pro Ser Gly Asp Val
2705 2710 2715 2720
Arg Ala Ser Val Pro Pro Val Leu Pro Ser Ser Ser Gly Lys Lys Asn
2725 2730 2735
Asp Thr Thr Ser Ala Arg Glu Leu Ser Pro Ser Ser Leu Lys Met Asn
2740 2745 2750
Lys Ser Tyr Leu Leu Glu Pro Trp Phe Leu Pro Pro Ser Arg Gly Leu
2755 2760 2765
Gln Asn Ser Pro Ala Val Ser Leu Pro Asp Pro Glu Phe Lys Arg Asp
2770 2775 2780
Arg Lys Gly Pro His Pro Thr Ala Arg Ser Pro Gly Thr Val Met Glu
2785 2790 2795 2800
Ser Asn Pro Gln Gln Arg Glu Gly Ser Ser Pro Lys His Gln Asp His
2805 2810 2815
Thr Thr Asp Pro Lys Leu Leu Thr Cys Leu Gly Gln Asn Leu His Ser
2820 2825 2830
Pro Asp Leu Ala Arg Pro Arg Cys Pro Leu Pro Pro Glu Ala Ser Pro
2835 2840 2845
Ser Arg Glu Lys Pro Gly Leu Arg Glu Ser Ser Glu Arg Gly Pro Pro
2850 2855 2860
Thr Ala Arg Ser Glu Arg Ser Ala Ala Arg Ala Asp Thr Cys Arg Glu
2865 2870 2875 2880
Pro Ser Met Glu Leu Cys Phe Pro Glu Thr Ala Lys Thr Ser Asp Asn
2885 2890 2895
Ser Lys Asn Leu Leu Ser Val Gly Arg Thr His Pro Asp Phe Tyr Thr
2900 2905 2910
Gln Thr Gln Ala Met Glu Lys Ala Trp Ala Pro Gly Gly Lys Thr Asn
2915 2920 2925
His Lys Asp Gly Pro Gly Glu Ala Arg Pro Pro Pro Arg Asp Asn Ser
2930 2935 2940
Ser Leu His Ser Ala Gly Ile Pro Cys Glu Lys Glu Leu Gly Lys Val
2945 2950 2955 2960
Arg Arg Gly Val Glu Pro Lys Pro Glu Ala Leu Leu Ala Arg Arg Ser
2965 2970 2975
Leu Gln Pro Pro Gly Ile Glu Ser Glu Lys Ser Glu Lys Leu Ser Ser
2980 2985 2990
Phe Pro Ser Leu Gln Lys Asp Gly Ala Lys Glu Pro Glu Arg Lys Glu
2995 3000 3005
Gln Pro Leu Gln Arg His Pro Ser Ser Ile Pro Pro Pro Pro Leu Thr
3010 3015 3020
Ala Lys Asp Leu Ser Ser Pro Ala Ala Arg Gln His Cys Ser Ser Pro
3025 3030 3035 3040
Ser His Ala Ser Gly Arg Glu Pro Gly Ala Lys Pro Ser Thr Ala Glu
3045 3050 3055
Pro Ser Ser Ser Pro Gln Asp Pro Pro Lys Pro Val Ala Ala His Ser
3060 3065 3070
Glu Ser Ser Ser His Lys Pro Arg Pro Gly Pro Asp Pro Gly Pro Pro
3075 3080 3085
Lys Thr Lys His Pro Asp Arg Ser Leu Ser Ser Gln Lys Pro Ser Val
3090 3095 3100
Gly Ala Thr Lys Gly Lys Glu Pro Ala Thr Gln Ser Leu Gly Gly Ser
3105 3110 3115 3120
Ser Arg Glu Gly Lys Gly His Ser Lys Ser Gly Pro Asp Val Phe Pro
3125 3130 3135
Ala Thr Pro Gly Ser Gln Asn Lys Ala Ser Asp Gly Ile Gly Gln Gly
3140 3145 3150
Glu Gly Gly Pro Ser Val Pro Leu His Thr Asp Arg Ala Pro Leu Asp
3155 3160 3165
Ala Lys Pro Gln Pro Thr Ser Gly Gly Arg Pro Leu Glu Val Leu Glu
3170 3175 3180
Lys Pro Val His Leu Pro Arg Pro Gly His Pro Gly Pro Ser Glu Pro
3185 3190 3195 3200
Ala Asp Gln Lys Leu Ser Ala Val Gly Glu Lys Gln Thr Leu Ser Pro
3205 3210 3215
Lys His Pro Lys Pro Ser Thr Val Lys Asp Cys Pro Thr Leu Cys Lys
3220 3225 3230
Gln Thr Asp Asn Arg Gln Thr Asp Lys Ser Pro Ser Gln Pro Ala Ala
3235 3240 3245
Asn Thr Asp Arg Arg Ala Glu Gly Lys Lys Cys Thr Glu Ala Leu Tyr
3250 3255 3260
Ala Pro Ala Glu Gly Asp Lys Leu Glu Ala Gly Leu Ser Phe Val His
3265 3270 3275 3280
Ser Glu Asn Arg Leu Lys Gly Ala Glu Arg Pro Ala Ala Gly Val Gly
3285 3290 3295
Lys Gly Phe Pro Glu Ala Arg Gly Lys Gly Pro Gly Pro Gln Lys Pro
3300 3305 3310
Pro Thr Glu Ala Asp Lys Pro Asn Gly Met Lys Arg Ser Pro Ser Ala
3315 3320 3325
Thr Gly Gln Ser Ser Phe Arg Ser Thr Ala Leu Pro Glu Lys Ser Leu
3330 3335 3340
Ser Cys Ser Ser Ser Phe Pro Glu Thr Arg Ala Gly Val Arg Glu Ala
3345 3350 3355 3360
Ser Ala Ala Ser Ser Asp Thr Ser Ser Ala Lys Ala Ala Gly Gly Met
3365 3370 3375
Leu Glu Leu Pro Ala Pro Ser Asn Arg Asp His Arg Lys Ala Gln Pro
3380 3385 3390
Ala Gly Glu Gly Arg Thr His Met Thr Lys Ser Asp Ser Leu Pro Ser
3395 3400 3405
Phe Arg Val Ser Thr Leu Pro Leu Glu Ser His His Pro Asp Pro Asn
3410 3415 3420
Thr Met Gly Gly Ala Ser His Arg Asp Arg Ala Leu Ser Val Thr Ala
3425 3430 3435 3440
Thr Val Gly Glu Thr Lys Gly Lys Asp Pro Ala Pro Ala Gln Pro Pro
3445 3450 3455
Pro Ala Arg Lys Gln Asn Val Gly Arg Asp Val Thr Lys Pro Ser Pro
3460 3465 3470
Ala Pro Asn Thr Asp Arg Pro Ile Ser Leu Ser Asn Glu Lys Asp Phe
3475 3480 3485
Val Val Arg Gln Arg Arg Gly Lys Glu Ser Leu Arg Ser Ser Pro His
3490 3495 3500
Lys Lys Ala Leu
3505
<210> 143
<211> 16
<212> PRT
<213> Artificial Sequence
<220>
<223> Fused region of ERBB2IP-MAST4 fusion protein
<400> 143
Gln Pro Gly Asp Lys Ile Ile Gln Ala Thr Ala Gln Met Glu Glu Arg
1 5 10 15
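The 48-nucleotide fused region of SEQ ID NO: 141 and the 16-residue fused region of SEQ ID NO: 143 can be cross-checked by translating the former in frame. The sketch below is only an illustration and not part of the listing: it assumes the standard genetic code and reading frame 1, and the helper names (`CODON_TABLE`, `translate`) are hypothetical.

```python
# Illustrative check only: assumes the standard genetic code and frame 1.
BASES = "tcag"
# One-letter amino acids in the conventional t/c/a/g codon order (ttt, ttc, tta, ...).
AMINO_1 = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_1[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# One-letter to three-letter codes, as used in the PRT records of the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate(dna):
    """Translate a DNA string codon by codon, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").lower()
    usable = len(dna) - len(dna) % 3
    return [THREE_LETTER[CODON_TABLE[dna[i:i + 3]]] for i in range(0, usable, 3)]


# SEQ ID NO: 141 and SEQ ID NO: 143, copied from the records above.
SEQ_141 = "cagccaggtg ataaaattat tcaggctaca gctcagatgg aagaacgt"
SEQ_143 = "Gln Pro Gly Asp Lys Ile Ile Gln Ala Thr Ala Gln Met Glu Glu Arg"

if __name__ == "__main__":
    print(" ".join(translate(SEQ_141)))
    print(translate(SEQ_141) == SEQ_143.split())
```

Run as a script, this prints the translated residues followed by whether they match SEQ ID NO: 143.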
<210> 144
<211> 530
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of TPD52L1 gene (NM_001003395)
<400> 144
atgctctctg aggaggaaaa ggaagagtta aaagcagagt tagttcagct agaagacgaa 60
attacaacac tacgacaagt tttgtcagcg aaagaaaggc atctagttga gataaaacaa 120
aaactcggca tgaacctgat gaatgaatta aaacagaact tcagcaaaag ctggcatgac 180
atgcagacta ccactgccta caagaaaaca catgaaaccc tgagtcacgc agggcaaaag 240
gcaactgcag ctttcagcaa cgttggaacg gccatcagca agaagttcgg agacatgaga 300
gttactccat tcgccattcc ataagtatgc ctgctatgag gaattctcct actttcaaat 360
catttgagga gagggttgag acaactgtca caagcctcaa gacgaaagta ggcggtacga 420
accctaatgg aggcagtttt gaggaggtcc tcagctccac ggcccatgcc agtgcccaga 480
gcttggcagg aggctcccgg cggaccaagg aggaggagct gcagtgctaa 530
<210> 145
<211> 299
<212> DNA
<213> Artificial Sequence
<220>
<223> TPD52L1 gene fragment
<400> 145
atgctctctg aggaggaaaa ggaagagtta aaagcagagt tagttcagct agaagacgaa 60
attacaacac tacgacaagt tttgtcagcg aaagaaaggc atctagttga gataaaacaa 120
aaactcggca tgaacctgat gaatgaatta aaacagaact tcagcaaaag ctggcatgac 180
atgcagacta ccactgccta caagaaaaca catgaaaccc tgagtcacgc agggcaaaag 240
gcaactgcag ctttcagcaa cgttggaacg gccatcagca agaagttcgg agacatgag 299
<210> 146
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of TPD52L1 gene fragment
<400> 146
tcagcaagaa gttcggagac atgag 25
<210> 147
<211> 175
<212> PRT
<213> Artificial Sequence
<220>
<223> TPD52L1 protein
<400> 147
Met Leu Ser Glu Glu Glu Lys Glu Glu Leu Lys Ala Glu Leu Val Gln
1 5 10 15
Leu Glu Asp Glu Ile Thr Thr Leu Arg Gln Val Leu Ser Ala Lys Glu
20 25 30
Arg His Leu Val Glu Ile Lys Gln Lys Leu Gly Met Asn Leu Met Asn
35 40 45
Glu Leu Lys Gln Asn Phe Ser Lys Ser Trp His Asp Met Gln Thr Thr
50 55 60
Thr Ala Tyr Lys Lys Thr His Glu Thr Leu Ser His Ala Gly Gln Lys
65 70 75 80
Ala Thr Ala Ala Phe Ser Asn Val Gly Thr Ala Ile Ser Lys Lys Phe
85 90 95
Gly Asp Met Ser Tyr Ser Ile Arg His Ser Ile Ser Met Pro Ala Met
100 105 110
Arg Asn Ser Pro Thr Phe Lys Ser Phe Glu Glu Arg Val Glu Thr Thr
115 120 125
Val Thr Ser Leu Lys Thr Lys Val Gly Gly Thr Asn Pro Asn Gly Gly
130 135 140
Ser Phe Glu Glu Val Leu Ser Ser Thr Ala His Ala Ser Ala Gln Ser
145 150 155 160
Leu Ala Gly Gly Ser Arg Arg Thr Lys Glu Glu Glu Leu Gln Cys
165 170 175
<210> 148
<211> 99
<212> PRT
<213> Artificial Sequence
<220>
<223> TPD52L1 protein fragment
<400> 148
Met Leu Ser Glu Glu Glu Lys Glu Glu Leu Lys Ala Glu Leu Val Gln
1 5 10 15
Leu Glu Asp Glu Ile Thr Thr Leu Arg Gln Val Leu Ser Ala Lys Glu
20 25 30
Arg His Leu Val Glu Ile Lys Gln Lys Leu Gly Met Asn Leu Met Asn
35 40 45
Glu Leu Lys Gln Asn Phe Ser Lys Ser Trp His Asp Met Gln Thr Thr
50 55 60
Thr Ala Tyr Lys Lys Thr His Glu Thr Leu Ser His Ala Gly Gln Lys
65 70 75 80
Ala Thr Ala Ala Phe Ser Asn Val Gly Thr Ala Ile Ser Lys Lys Phe
85 90 95
Gly Asp Met
<210> 149
<211> 7
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of TPD52L1 protein fragment
<400> 149
Ser Lys Lys Phe Gly Asp Met
1 5
<210> 150
<211> 1393
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of TRMT11 gene
<400> 150
atggcgctgt cgtgtaccct taacaggtat ctgctcctca tggcgcagga gcatctggag 60
ttccgcctgc cggaaataaa gtctttgctt ttgctttttg gaggtcagtt tgccagcagt 120
caagaaactt atggaaagtc accattttgg attcttagca ttccctctga agatattgca 180
agaaatttga tgaaacggac agtgtgtgcc aagtctatat ttgaactatg gggtcatgga 240
caatctcctg aggagctgta cagttctctt aaaaactacc ctgtggagaa gatggttcca 300
tttctacatt cggactctac atataaaata aagattcaca cttttaataa gacattgaca 360
caagaagaga aaatcaagcg aatagatgca cttgaatttc tgccatttga aggaaaagtg 420
aatttaaaga aaccgcaaca tgtattttct gttttggagg attatggttt agacccaaac 480
tgcatccctg agaatccaca taatatttat tttggtagat ggattgcaga tggacagaga 540
gagcttattg agtcatacag tgtcaaaaag agacacttta ttggaaatac aagtatggat 600
gctggtttgt cattcattat ggctaaccat ggaaaagtga aagaaaatga tattgtcttt 660
gatccatttg ttggaacagg tggcctgctg atagcatgtg ctcattttgg tgcatatgtg 720
tatgggacag acatagacta caacacagtt catggcttgg gaaaggctac taggaaaaac 780
cagaagtgga gaggaccaga tgaaaacatt agggccaatc ttcgtcaata tggtttagag 840
aagtattacc ttgatgtcct ggtttcagat gcatctaaac cttcctggag gaagggcaca 900
tattttgatg caatcattac tgatcctcca tatggtatca gagaatctac aagaagaaca 960
ggttcacaga aggagatacc aaaggggata gaaaaatggg aaaaatgtcc agaaagccat 1020
gttcctgttt ccttgagtta tcatctgagt gatatgtttc ttgacctgtt aaacttcgca 1080
gctgagaccc tcgttttagg tggaagacta gtctattggt taccggtgta tacgccagaa 1140
atacactgaa gagatggtgc cttggcaccc ttgcctggaa ctcgttagca actgcgagca 1200
gaagctttcc agtcacacat caaggcgctt gatcacaatg gaaaaggtga agaaatttga 1260
gaatcgggac cagtattcac atctgctaag tgatcatttt ctgccatacc aaggtcataa 1320
ttccttccgt gagaaatatt ttagtggggt aacaaaaaga attgccaagg aagaaaaatc 1380
cacccaggaa tga 1393
<210> 151
<211> 253
<212> DNA
<213> Artificial Sequence
<220>
<223> TRMT11 gene fragment
<400> 151
atacactgaa gagatggtgc cttggcaccc ttgcctggaa ctcgttagca actgcgagca 60
gaagctttcc agtcacacat caaggcgctt gatcacaatg gaaaaggtga agaaatttga 120
gaatcgggac cagtattcac atctgctaag tgatcatttt ctgccatacc aaggtcataa 180
ttccttccgt gagaaatatt ttagtggggt aacaaaaaga attgccaagg aagaaaaatc 240
cacccaggaa tga 253
<210> 152
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of TRMT11 gene fragment
<400> 152
atacactgaa gagatggtgc ct 22
<210> 153
<211> 463
<212> PRT
<213> Artificial Sequence
<220>
<223> TRMT11 protein
<400> 153
Met Ala Leu Ser Cys Thr Leu Asn Arg Tyr Leu Leu Leu Met Ala Gln
1 5 10 15
Glu His Leu Glu Phe Arg Leu Pro Glu Ile Lys Ser Leu Leu Leu Leu
20 25 30
Phe Gly Gly Gln Phe Ala Ser Ser Gln Glu Thr Tyr Gly Lys Ser Pro
35 40 45
Phe Trp Ile Leu Ser Ile Pro Ser Glu Asp Ile Ala Arg Asn Leu Met
50 55 60
Lys Arg Thr Val Cys Ala Lys Ser Ile Phe Glu Leu Trp Gly His Gly
65 70 75 80
Gln Ser Pro Glu Glu Leu Tyr Ser Ser Leu Lys Asn Tyr Pro Val Glu
85 90 95
Lys Met Val Pro Phe Leu His Ser Asp Ser Thr Tyr Lys Ile Lys Ile
100 105 110
His Thr Phe Asn Lys Thr Leu Thr Gln Glu Glu Lys Ile Lys Arg Ile
115 120 125
Asp Ala Leu Glu Phe Leu Pro Phe Glu Gly Lys Val Asn Leu Lys Lys
130 135 140
Pro Gln His Val Phe Ser Val Leu Glu Asp Tyr Gly Leu Asp Pro Asn
145 150 155 160
Cys Ile Pro Glu Asn Pro His Asn Ile Tyr Phe Gly Arg Trp Ile Ala
165 170 175
Asp Gly Gln Arg Glu Leu Ile Glu Ser Tyr Ser Val Lys Lys Arg His
180 185 190
Phe Ile Gly Asn Thr Ser Met Asp Ala Gly Leu Ser Phe Ile Met Ala
195 200 205
Asn His Gly Lys Val Lys Glu Asn Asp Ile Val Phe Asp Pro Phe Val
210 215 220
Gly Thr Gly Gly Leu Leu Ile Ala Cys Ala His Phe Gly Ala Tyr Val
225 230 235 240
Tyr Gly Thr Asp Ile Asp Tyr Asn Thr Val His Gly Leu Gly Lys Ala
245 250 255
Thr Arg Lys Asn Gln Lys Trp Arg Gly Pro Asp Glu Asn Ile Arg Ala
260 265 270
Asn Leu Arg Gln Tyr Gly Leu Glu Lys Tyr Tyr Leu Asp Val Leu Val
275 280 285
Ser Asp Ala Ser Lys Pro Ser Trp Arg Lys Gly Thr Tyr Phe Asp Ala
290 295 300
Ile Ile Thr Asp Pro Pro Tyr Gly Ile Arg Glu Ser Thr Arg Arg Thr
305 310 315 320
Gly Ser Gln Lys Glu Ile Pro Lys Gly Ile Glu Lys Trp Glu Lys Cys
325 330 335
Pro Glu Ser His Val Pro Val Ser Leu Ser Tyr His Leu Ser Asp Met
340 345 350
Phe Leu Asp Leu Leu Asn Phe Ala Ala Glu Thr Leu Val Leu Gly Gly
355 360 365
Arg Leu Val Tyr Trp Leu Pro Val Tyr Thr Pro Glu Tyr Thr Glu Glu
370 375 380
Met Val Pro Trp His Pro Cys Leu Glu Leu Val Ser Asn Cys Glu Gln
385 390 395 400
Lys Leu Ser Ser His Thr Ser Arg Arg Leu Ile Thr Met Glu Lys Val
405 410 415
Lys Lys Phe Glu Asn Arg Asp Gln Tyr Ser His Leu Leu Ser Asp His
420 425 430
Phe Leu Pro Tyr Gln Gly His Asn Ser Phe Arg Glu Lys Tyr Phe Ser
435 440 445
Gly Val Thr Lys Arg Ile Ala Lys Glu Glu Lys Ser Thr Gln Glu
450 455 460
<210> 154
<211> 83
<212> PRT
<213> Artificial Sequence
<220>
<223> TRMT11 protein fragment
<400> 154
Tyr Thr Glu Glu Met Val Pro Trp His Pro Cys Leu Glu Leu Val Ser
1 5 10 15
Asn Cys Glu Gln Lys Leu Ser Ser His Thr Ser Arg Arg Leu Ile Thr
20 25 30
Met Glu Lys Val Lys Lys Phe Glu Asn Arg Asp Gln Tyr Ser His Leu
35 40 45
Leu Ser Asp His Phe Leu Pro Tyr Gln Gly His Asn Ser Phe Arg Glu
50 55 60
Lys Tyr Phe Ser Gly Val Thr Lys Arg Ile Ala Lys Glu Glu Lys Ser
65 70 75 80
Thr Gln Glu
<210> 155
<211> 7
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of TRMT11 protein fragment
<400> 155
Tyr Thr Glu Glu Met Val Pro
1 5
<210> 156
<211> 552
<212> DNA
<213> Artificial Sequence
<220>
<223> TPD52L1-TRMT11 fusion gene
<400> 156
atgctctctg aggaggaaaa ggaagagtta aaagcagagt tagttcagct agaagacgaa 60
attacaacac tacgacaagt tttgtcagcg aaagaaaggc atctagttga gataaaacaa 120
aaactcggca tgaacctgat gaatgaatta aaacagaact tcagcaaaag ctggcatgac 180
atgcagacta ccactgccta caagaaaaca catgaaaccc tgagtcacgc agggcaaaag 240
gcaactgcag ctttcagcaa cgttggaacg gccatcagca agaagttcgg agacatgaga 300
tacactgaag agatggtgcc ttggcaccct tgcctggaac tcgttagcaa ctgcgagcag 360
aagctttcca gtcacacatc aaggcgcttg atcacaatgg aaaaggtgaa gaaatttgag 420
aatcgggacc agtattcaca tctgctaagt gatcattttc tgccatacca aggtcataat 480
tccttccgtg agaaatattt tagtggggta acaaaaagaa ttgccaagga agaaaaatcc 540
acccaggaat ga 552
<210> 157
<211> 45
<212> DNA
<213> Artificial Sequence
<220>
<223> Fused region of TPD52L1-TRMT11 fusion gene
<400> 157
agcaagaagt tcggagacat gagatacact gaagagatgg tgcct 45
<210> 158
<211> 183
<212> PRT
<213> Artificial Sequence
<220>
<223> TPD52L1-TRMT11 fusion protein
<400> 158
Met Leu Ser Glu Glu Glu Lys Glu Glu Leu Lys Ala Glu Leu Val Gln
1 5 10 15
Leu Glu Asp Glu Ile Thr Thr Leu Arg Gln Val Leu Ser Ala Lys Glu
20 25 30
Arg His Leu Val Glu Ile Lys Gln Lys Leu Gly Met Asn Leu Met Asn
35 40 45
Glu Leu Lys Gln Asn Phe Ser Lys Ser Trp His Asp Met Gln Thr Thr
50 55 60
Thr Ala Tyr Lys Lys Thr His Glu Thr Leu Ser His Ala Gly Gln Lys
65 70 75 80
Ala Thr Ala Ala Phe Ser Asn Val Gly Thr Ala Ile Ser Lys Lys Phe
85 90 95
Gly Asp Met Arg Tyr Thr Glu Glu Met Val Pro Trp His Pro Cys Leu
100 105 110
Glu Leu Val Ser Asn Cys Glu Gln Lys Leu Ser Ser His Thr Ser Arg
115 120 125
Arg Leu Ile Thr Met Glu Lys Val Lys Lys Phe Glu Asn Arg Asp Gln
130 135 140
Tyr Ser His Leu Leu Ser Asp His Phe Leu Pro Tyr Gln Gly His Asn
145 150 155 160
Ser Phe Arg Glu Lys Tyr Phe Ser Gly Val Thr Lys Arg Ile Ala Lys
165 170 175
Glu Glu Lys Ser Thr Gln Glu
180
<210> 159
<211> 15
<212> PRT
<213> Artificial Sequence
<220>
<223> Fused region of TPD52L1-TRMT11 fusion protein
<400> 159
Ser Lys Lys Phe Gly Asp Met Arg Tyr Thr Glu Glu Met Val Pro
1 5 10 15
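The relationship between SEQ ID NO: 157 (fused region of the fusion gene) and SEQ ID NO: 159 (fused region of the fusion protein) can be checked by translating the 45 nucleotides in frame. The following is a minimal sketch, not part of the listing: the names `CODON_TABLE`, `THREE_TO_ONE`, and `translate` are illustrative, and it assumes the standard genetic code and that SEQ ID NO: 157 begins at a codon boundary.

```python
from itertools import product

# Standard genetic code, codons enumerated in the conventional t/c/a/g order.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Three-letter to one-letter codes for the residues that occur in SEQ ID NO: 159.
THREE_TO_ONE = {
    "Ser": "S", "Lys": "K", "Phe": "F", "Gly": "G", "Asp": "D", "Met": "M",
    "Arg": "R", "Tyr": "Y", "Thr": "T", "Glu": "E", "Val": "V", "Pro": "P",
}

def translate(dna: str) -> str:
    """Translate a lowercase DNA string codon by codon, starting at position 1."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))

# SEQ ID NO: 157 as printed in the listing (spaces separate blocks of 10 nt).
SEQ157 = "agcaagaagt tcggagacat gagatacact gaagagatgg tgcct".replace(" ", "")
# SEQ ID NO: 159 as printed in the listing.
SEQ159 = "Ser Lys Lys Phe Gly Asp Met Arg Tyr Thr Glu Glu Met Val Pro".split()

translated = translate(SEQ157)
expected = "".join(THREE_TO_ONE[res] for res in SEQ159)
print(translated, translated == expected)
```

Under these assumptions the 15 codons of SEQ ID NO: 157 give the 15 residues of SEQ ID NO: 159, and the last 22 nucleotides of SEQ ID NO: 157 are the TRMT11 break-point sequence of SEQ ID NO: 152.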
<210> 160
<211> 1656
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of TXNRD1 gene (NM_003330)
<400> 160
atgtcatgtg aggacggtcg ggccctggaa ggaacgctct cggaattggc cgcggaaacc 60
gatctgcccg ttgtgtttgt gaaacagaga aagataggcg gccatggtcc aaccttgaag 120
gcttatcagg agggcagact tcaaaagcta ctaaaaatga acggccctga agatcttccc 180
aagtcctatg actatgacct tatcatcatt ggaggtggct caggaggtct ggcagctgct 240
aaggaggcag cccaatatgg caagaaggtg atggtcctgg actttgtcac tcccacccct 300
cttggaacta gatggggtct cggaggaaca tgtgtgaatg tgggttgcat acctaaaaaa 360
ctgatgcatc aagcagcttt gttaggacaa gccctgcaag actctcgaaa ttatggatgg 420
aaagtcgagg agacagttaa gcatgattgg gacagaatga tagaagctgt acagaatcac 480
attggctctt tgaattgggg ctaccgagta gctctgcggg agaaaaaagt cgtctatgag 540
aatgcttatg ggcaatttat tggtcctcac aggattaagg caacaaataa taaaggcaaa 600
gaaaaaattt attcagcaga gagatttctc attgccactg gtgaaagacc acgttacttg 660
ggcatccctg gtgacaaaga atactgcatc agcagtgatg atcttttctc cttgccttac 720
tgcccgggta agaccctggt tgttggagca tcctatgtcg ctttggagtg cgctggattt 780
cttgctggta ttggtttaga cgtcactgtt atggttaggt ccattcttct tagaggattt 840
gaccaggaca tggccaacaa aattggtgaa cacatggaag aacatggcat caagtttata 900
agacagttcg taccaattaa agttgaacaa attgaagcag ggacaccagg ccgactcaga 960
gtagtagctc agtccaccaa tagtgaggaa atcattgaag gagaatataa tacggtgatg 1020
ctggcaatag gaagagatgc ttgcacaaga aaaattggct tagaaaccgt aggggtgaag 1080
ataaatgaaa agactggaaa aatacctgtc acagatgaag aacagaccaa tgtgccttac 1140
atctatgcca ttggcgatat attggaggat aaggtggagc tcaccccagt tgcaatccag 1200
gcaggaagat tgctggctca gaggctctat gcaggttcca ctgtcaagtg tgactatgaa 1260
aatgttccaa ccactgtatt tactcctttg gaatatggtg cttgtggcct ttctgaggag 1320
aaagctgtgg agaagtttgg ggaagaaaat attgaggttt accatagtta cttttggcca 1380
ttggaatgga cgattccgtc aagagataac aacaaatgtt atgcaaaaat aatctgtaat 1440
actaaagaca atgaacgtgt tgtgggcttt cacgtactgg gtccaaatgc tggagaagtt 1500
acacaaggct ttgcagctgc gctcaaatgt ggactgacca aaaagcagct ggacagcaca 1560
attggaatcc accctgtctg tgcagaggta ttcacaacat tgtctgtgac caagcgctct 1620
ggggcaagca tcctccaggc tggctgctga ggttaa 1656
<210> 161
<211> 1587
<212> DNA
<213> Artificial Sequence
<220>
<223> TXNRD1 gene fragment
<400> 161
atgtcatgtg aggacggtcg ggccctggaa ggaacgctct cggaattggc cgcggaaacc 60
gatctgcccg ttgtgtttgt gaaacagaga aagataggcg gccatggtcc aaccttgaag 120
gcttatcagg agggcagact tcaaaagcta ctaaaaatga acggccctga agatcttccc 180
aagtcctatg actatgacct tatcatcatt ggaggtggct caggaggtct ggcagctgct 240
aaggaggcag cccaatatgg caagaaggtg atggtcctgg actttgtcac tcccacccct 300
cttggaacta gatggggtct cggaggaaca tgtgtgaatg tgggttgcat acctaaaaaa 360
ctgatgcatc aagcagcttt gttaggacaa gccctgcaag actctcgaaa ttatggatgg 420
aaagtcgagg agacagttaa gcatgattgg gacagaatga tagaagctgt acagaatcac 480
attggctctt tgaattgggg ctaccgagta gctctgcggg agaaaaaagt cgtctatgag 540
aatgcttatg ggcaatttat tggtcctcac aggattaagg caacaaataa taaaggcaaa 600
gaaaaaattt attcagcaga gagatttctc attgccactg gtgaaagacc acgttacttg 660
ggcatccctg gtgacaaaga atactgcatc agcagtgatg atcttttctc cttgccttac 720
tgcccgggta agaccctggt tgttggagca tcctatgtcg ctttggagtg cgctggattt 780
cttgctggta ttggtttaga cgtcactgtt atggttaggt ccattcttct tagaggattt 840
gaccaggaca tggccaacaa aattggtgaa cacatggaag aacatggcat caagtttata 900
agacagttcg taccaattaa agttgaacaa attgaagcag ggacaccagg ccgactcaga 960
gtagtagctc agtccaccaa tagtgaggaa atcattgaag gagaatataa tacggtgatg 1020
ctggcaatag gaagagatgc ttgcacaaga aaaattggct tagaaaccgt aggggtgaag 1080
ataaatgaaa agactggaaa aatacctgtc acagatgaag aacagaccaa tgtgccttac 1140
atctatgcca ttggcgatat attggaggat aaggtggagc tcaccccagt tgcaatccag 1200
gcaggaagat tgctggctca gaggctctat gcaggttcca ctgtcaagtg tgactatgaa 1260
aatgttccaa ccactgtatt tactcctttg gaatatggtg cttgtggcct ttctgaggag 1320
aaagctgtgg agaagtttgg ggaagaaaat attgaggttt accatagtta cttttggcca 1380
ttggaatgga cgattccgtc aagagataac aacaaatgtt atgcaaaaat aatctgtaat 1440
actaaagaca atgaacgtgt tgtgggcttt cacgtactgg gtccaaatgc tggagaagtt 1500
acacaaggct ttgcagctgc gctcaaatgt ggactgacca aaaagcagct ggacagcaca 1560
attggaatcc accctgtctg tgcagag 1587
<210> 162
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of TXNRD1 gene fragment
<400> 162
aatccaccct gtctgtgcag ag 22
<210> 163
<211> 549
<212> PRT
<213> Artificial Sequence
<220>
<223> TXNRD1 protein, wherein "U" and "G" are additionally added at the C-terminus
<400> 163
Met Ser Cys Glu Asp Gly Arg Ala Leu Glu Gly Thr Leu Ser Glu Leu
1 5 10 15
Ala Ala Glu Thr Asp Leu Pro Val Val Phe Val Lys Gln Arg Lys Ile
20 25 30
Gly Gly His Gly Pro Thr Leu Lys Ala Tyr Gln Glu Gly Arg Leu Gln
35 40 45
Lys Leu Leu Lys Met Asn Gly Pro Glu Asp Leu Pro Lys Ser Tyr Asp
50 55 60
Tyr Asp Leu Ile Ile Ile Gly Gly Gly Ser Gly Gly Leu Ala Ala Ala
65 70 75 80
Lys Glu Ala Ala Gln Tyr Gly Lys Lys Val Met Val Leu Asp Phe Val
85 90 95
Thr Pro Thr Pro Leu Gly Thr Arg Trp Gly Leu Gly Gly Thr Cys Val
100 105 110
Asn Val Gly Cys Ile Pro Lys Lys Leu Met His Gln Ala Ala Leu Leu
115 120 125
Gly Gln Ala Leu Gln Asp Ser Arg Asn Tyr Gly Trp Lys Val Glu Glu
130 135 140
Thr Val Lys His Asp Trp Asp Arg Met Ile Glu Ala Val Gln Asn His
145 150 155 160
Ile Gly Ser Leu Asn Trp Gly Tyr Arg Val Ala Leu Arg Glu Lys Lys
165 170 175
Val Val Tyr Glu Asn Ala Tyr Gly Gln Phe Ile Gly Pro His Arg Ile
180 185 190
Lys Ala Thr Asn Asn Lys Gly Lys Glu Lys Ile Tyr Ser Ala Glu Arg
195 200 205
Phe Leu Ile Ala Thr Gly Glu Arg Pro Arg Tyr Leu Gly Ile Pro Gly
210 215 220
Asp Lys Glu Tyr Cys Ile Ser Ser Asp Asp Leu Phe Ser Leu Pro Tyr
225 230 235 240
Cys Pro Gly Lys Thr Leu Val Val Gly Ala Ser Tyr Val Ala Leu Glu
245 250 255
Cys Ala Gly Phe Leu Ala Gly Ile Gly Leu Asp Val Thr Val Met Val
260 265 270
Arg Ser Ile Leu Leu Arg Gly Phe Asp Gln Asp Met Ala Asn Lys Ile
275 280 285
Gly Glu His Met Glu Glu His Gly Ile Lys Phe Ile Arg Gln Phe Val
290 295 300
Pro Ile Lys Val Glu Gln Ile Glu Ala Gly Thr Pro Gly Arg Leu Arg
305 310 315 320
Val Val Ala Gln Ser Thr Asn Ser Glu Glu Ile Ile Glu Gly Glu Tyr
325 330 335
Asn Thr Val Met Leu Ala Ile Gly Arg Asp Ala Cys Thr Arg Lys Ile
340 345 350
Gly Leu Glu Thr Val Gly Val Lys Ile Asn Glu Lys Thr Gly Lys Ile
355 360 365
Pro Val Thr Asp Glu Glu Gln Thr Asn Val Pro Tyr Ile Tyr Ala Ile
370 375 380
Gly Asp Ile Leu Glu Asp Lys Val Glu Leu Thr Pro Val Ala Ile Gln
385 390 395 400
Ala Gly Arg Leu Leu Ala Gln Arg Leu Tyr Ala Gly Ser Thr Val Lys
405 410 415
Cys Asp Tyr Glu Asn Val Pro Thr Thr Val Phe Thr Pro Leu Glu Tyr
420 425 430
Gly Ala Cys Gly Leu Ser Glu Glu Lys Ala Val Glu Lys Phe Gly Glu
435 440 445
Glu Asn Ile Glu Val Tyr His Ser Tyr Phe Trp Pro Leu Glu Trp Thr
450 455 460
Ile Pro Ser Arg Asp Asn Asn Lys Cys Tyr Ala Lys Ile Ile Cys Asn
465 470 475 480
Thr Lys Asp Asn Glu Arg Val Val Gly Phe His Val Leu Gly Pro Asn
485 490 495
Ala Gly Glu Val Thr Gln Gly Phe Ala Ala Ala Leu Lys Cys Gly Leu
500 505 510
Thr Lys Lys Gln Leu Asp Ser Thr Ile Gly Ile His Pro Val Cys Ala
515 520 525
Glu Val Phe Thr Thr Leu Ser Val Thr Lys Arg Ser Gly Ala Ser Ile
530 535 540
Leu Gln Ala Gly Cys
545
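The <223> note for SEQ ID NO: 163 can be related to the 3' end of the CDS in SEQ ID NO: 160. The sketch below is illustrative only and assumes the CDS is read in frame from its first nucleotide; the variable names are not part of the listing. It compares the number of codons in the 1656-nt CDS with the 549 listed residues and shows the final codons.

```python
# Last printed line of SEQ ID NO: 160 (positions 1621-1656 of the CDS).
CDS_TAIL = "ggggcaagca tcctccaggc tggctgctga ggttaa".replace(" ", "")

CDS_LENGTH = 1656      # <211> of SEQ ID NO: 160
PROTEIN_LENGTH = 549   # <211> of SEQ ID NO: 163

total_codons = CDS_LENGTH // 3
extra_codons = total_codons - PROTEIN_LENGTH

# Position 1621 is the first base of codon 541 (1620 = 3 * 540),
# so the tail can be split into codons directly.
tail_codons = [CDS_TAIL[i:i + 3] for i in range(0, len(CDS_TAIL), 3)]
last_four = tail_codons[-4:]

print(total_codons, extra_codons, last_four)
```

The CDS holds three more codons than the listed protein has residues. The listed sequence ends at the tgc codon (Cys 549), followed by tga, ggt, and taa. TXNRD1 is a selenoprotein in which tga is read as selenocysteine ("U") rather than as a stop, so these codons correspond to the "U" and "G" mentioned in the note and the terminating taa.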
<210> 164
<211> 529
<212> PRT
<213> Artificial Sequence
<220>
<223> TXNRD1 protein fragment
<400> 164
Met Ser Cys Glu Asp Gly Arg Ala Leu Glu Gly Thr Leu Ser Glu Leu
1 5 10 15
Ala Ala Glu Thr Asp Leu Pro Val Val Phe Val Lys Gln Arg Lys Ile
20 25 30
Gly Gly His Gly Pro Thr Leu Lys Ala Tyr Gln Glu Gly Arg Leu Gln
35 40 45
Lys Leu Leu Lys Met Asn Gly Pro Glu Asp Leu Pro Lys Ser Tyr Asp
50 55 60
Tyr Asp Leu Ile Ile Ile Gly Gly Gly Ser Gly Gly Leu Ala Ala Ala
65 70 75 80
Lys Glu Ala Ala Gln Tyr Gly Lys Lys Val Met Val Leu Asp Phe Val
85 90 95
Thr Pro Thr Pro Leu Gly Thr Arg Trp Gly Leu Gly Gly Thr Cys Val
100 105 110
Asn Val Gly Cys Ile Pro Lys Lys Leu Met His Gln Ala Ala Leu Leu
115 120 125
Gly Gln Ala Leu Gln Asp Ser Arg Asn Tyr Gly Trp Lys Val Glu Glu
130 135 140
Thr Val Lys His Asp Trp Asp Arg Met Ile Glu Ala Val Gln Asn His
145 150 155 160
Ile Gly Ser Leu Asn Trp Gly Tyr Arg Val Ala Leu Arg Glu Lys Lys
165 170 175
Val Val Tyr Glu Asn Ala Tyr Gly Gln Phe Ile Gly Pro His Arg Ile
180 185 190
Lys Ala Thr Asn Asn Lys Gly Lys Glu Lys Ile Tyr Ser Ala Glu Arg
195 200 205
Phe Leu Ile Ala Thr Gly Glu Arg Pro Arg Tyr Leu Gly Ile Pro Gly
210 215 220
Asp Lys Glu Tyr Cys Ile Ser Ser Asp Asp Leu Phe Ser Leu Pro Tyr
225 230 235 240
Cys Pro Gly Lys Thr Leu Val Val Gly Ala Ser Tyr Val Ala Leu Glu
245 250 255
Cys Ala Gly Phe Leu Ala Gly Ile Gly Leu Asp Val Thr Val Met Val
260 265 270
Arg Ser Ile Leu Leu Arg Gly Phe Asp Gln Asp Met Ala Asn Lys Ile
275 280 285
Gly Glu His Met Glu Glu His Gly Ile Lys Phe Ile Arg Gln Phe Val
290 295 300
Pro Ile Lys Val Glu Gln Ile Glu Ala Gly Thr Pro Gly Arg Leu Arg
305 310 315 320
Val Val Ala Gln Ser Thr Asn Ser Glu Glu Ile Ile Glu Gly Glu Tyr
325 330 335
Asn Thr Val Met Leu Ala Ile Gly Arg Asp Ala Cys Thr Arg Lys Ile
340 345 350
Gly Leu Glu Thr Val Gly Val Lys Ile Asn Glu Lys Thr Gly Lys Ile
355 360 365
Pro Val Thr Asp Glu Glu Gln Thr Asn Val Pro Tyr Ile Tyr Ala Ile
370 375 380
Gly Asp Ile Leu Glu Asp Lys Val Glu Leu Thr Pro Val Ala Ile Gln
385 390 395 400
Ala Gly Arg Leu Leu Ala Gln Arg Leu Tyr Ala Gly Ser Thr Val Lys
405 410 415
Cys Asp Tyr Glu Asn Val Pro Thr Thr Val Phe Thr Pro Leu Glu Tyr
420 425 430
Gly Ala Cys Gly Leu Ser Glu Glu Lys Ala Val Glu Lys Phe Gly Glu
435 440 445
Glu Asn Ile Glu Val Tyr His Ser Tyr Phe Trp Pro Leu Glu Trp Thr
450 455 460
Ile Pro Ser Arg Asp Asn Asn Lys Cys Tyr Ala Lys Ile Ile Cys Asn
465 470 475 480
Thr Lys Asp Asn Glu Arg Val Val Gly Phe His Val Leu Gly Pro Asn
485 490 495
Ala Gly Glu Val Thr Gln Gly Phe Ala Ala Ala Leu Lys Cys Gly Leu
500 505 510
Thr Lys Lys Gln Leu Asp Ser Thr Ile Gly Ile His Pro Val Cys Ala
515 520 525
Glu
<210> 165
<211> 7
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of TXNRD1 protein fragment
<400> 165
Ile His Pro Val Cys Ala Glu
1 5
<210> 166
<211> 2625
<212> DNA
<213> Artificial Sequence
<220>
<223> CDS of GPR133 gene (NM_198827)
<400> 166
atggaaaagc tgctgcggct gtgctgctgg tactcctggc tgctgctatt ttattacaac 60
tttcaggtgc gtggcgtcta ctccagatcg caggaccatc caggatttca ggtgttggcg 120
tctgcttccc attactggcc actggagaat gtggatggga tccatgaact tcaggataca 180
actggagata ttgtggaagg gaaggtcaac aaaggcattt acctgaaaga ggaaaaggga 240
gtcacgcttc tctattacgg caggtacaac agctcctgca tcagcaagcc agagcagtgt 300
ggccctgaag gggtcacgtt ttcttttttc tggaagacac aaggagaaca gtctagacca 360
atcccttctg cgtatggggg acaggtcatc tccaatgggt tcaaagtctg ctccagcggt 420
ggcagaggct ctgtggagct gtatacgcgg gacaattcca tgacatggga ggcctccttc 480
agccccccag gcccctattg gactcatgtc ctatttacat ggaaatccaa ggagggcctg 540
aaagtctacg tcaacgggac cctgagcacc tctgatccga gtggaaaagt gtctcgtgac 600
tatggagagt ccaacgtcaa cctcgtgata gggtctgagc aggaccaggc caagtgttat 660
gagaacggtg ctttcgatga gttcatcatc tgggagcggg ctctgactcc ggatgagatc 720
gccatgtact tcactgctgc cattggaaag catgctttat tgtcttcaac gctgccaagc 780
ctcttcatga catccacagc aagccccgtg atgcccacag atgcctacca tcccatcata 840
accaacctga cagaagagag aaaaaccttc caaagtcccg gagtgatact gagttacctc 900
caaaatgtat ccctcagctt acccagtaag tccctctcgg agcagacagc cttgaatctc 960
accaagacct tcttaaaagc cgtgggagag atccttctac tgcctggttg gattgctctg 1020
tcagaggaca gcgccgtggt actgagtctc atcgacacta ttgacaccgt catgggccat 1080
gtatcctcca acctgcacgg cagcacgccc caggtcaccg tggagggctc ctctgccatg 1140
gcagagtttt ccgtggccaa aatcctgccc aagaccgtga attcctccca ttaccgcttc 1200
ccggcccacg ggcagagctt catccagatc ccccacgagg ccttccacag gcacgcctgg 1260
agcaccgtcg tgggtctgct gtaccacagc atgcactact acctgaacaa catctggccc 1320
gcccacacca agatcgcgga ggccatgcat caccaggact gcctgctgtt cgccaccagc 1380
cacctgattt ccctggaggt gtccccacca cccaccctgt ctcagaacct gtcgggctct 1440
ccactcatta cggtccacct caagcacaga ttgacacgta agcagcacag tgaggccacc 1500
aacagcagca accgagtctt cgtgtactgc gccttcctgg acttcagctc cggagaaggg 1560
gtctggtcga accacggctg tgcgctcacg agaggaaacc tcacctactc cgtctgccgc 1620
tgcactcacc tcaccaactt tgccatcctc atgcaggtgg tcccgctgga gcttgcacgc 1680
ggacaccagg tggcgctgtc gtctatcagc tatgtgggct gctccctctc cgtgctctgc 1740
ctggtggcca cgctggtcac cttcgccgtg ctgtcctccg tgagcaccat ccggaaccag 1800
cgctaccaca tccacgccaa cctgtccttc gccgtgctgg tggcccaggt cctgctgctc 1860
attagtttcc gcctcgagcc gggcacgacc ccctgccaag tgatggccgt gctcctacac 1920
tacttcttcc tgagtgcctt cgcatggatg ctggtggagg ggctgcacct ctacagcatg 1980
gtgatcaagg tctttgggtc ggaggacagc aagcaccgtt actactatgg gatgggatgg 2040
ggttttcctc ttctgatctg catcatttca ctgtcatttg ccatggacag ttacggaaca 2100
agcaacaatt gctggctgtc gttggcgagt ggcgccatct gggcctttgt agcccctgcc 2160
ctgtttgtca tcgtggtcaa cattggcatc ctcatcgctg tgaccagagt catctcacag 2220
atcagcgccg acaactacaa gatccatgga gaccccagtg ccttcaagtt gacagccaag 2280
gcagtggccg tgctgctgcc catcctgggt acctcgtggg tctttggcgt gcttgctgtc 2340
aacggttgtg ctgtggtttt ccagtacatg tttgccacgc tcaactccct gcagggactg 2400
ttcatattcc tctttcattg tctcctgaat tcagaggtga gagccgcctt caagcacaaa 2460
accaaggtct ggtcgctcac gagcagctct gcccgcacct ccaacgcgaa gcccttccac 2520
tcggacctca tgaatgggac ccggccaggc atggcctcca ccaagctcag cccttgggac 2580
aagagcagcc actctgccca ccgcgtcgac ctgtcagccg tgtga 2625
<210> 167
<211> 1152
<212> DNA
<213> Artificial Sequence
<220>
<223> GPR133 gene fragment
<400> 167
acacgtaagc agcacagtga ggccaccaac agcagcaacc gagtcttcgt gtactgcgcc 60
ttcctggact tcagctccgg agaaggggtc tggtcgaacc acggctgtgc gctcacgaga 120
ggaaacctca cctactccgt ctgccgctgc actcacctca ccaactttgc catcctcatg 180
caggtggtcc cgctggagct tgcacgcgga caccaggtgg cgctgtcgtc tatcagctat 240
gtgggctgct ccctctccgt gctctgcctg gtggccacgc tggtcacctt cgccgtgctg 300
tcctccgtga gcaccatccg gaaccagcgc taccacatcc acgccaacct gtccttcgcc 360
gtgctggtgg cccaggtcct gctgctcatt agtttccgcc tcgagccggg cacgaccccc 420
tgccaagtga tggccgtgct cctacactac ttcttcctga gtgccttcgc atggatgctg 480
gtggaggggc tgcacctcta cagcatggtg atcaaggtct ttgggtcgga ggacagcaag 540
caccgttact actatgggat gggatggggt tttcctcttc tgatctgcat catttcactg 600
tcatttgcca tggacagtta cggaacaagc aacaattgct ggctgtcgtt ggcgagtggc 660
gccatctggg cctttgtagc ccctgccctg tttgtcatcg tggtcaacat tggcatcctc 720
atcgctgtga ccagagtcat ctcacagatc agcgccgaca actacaagat ccatggagac 780
cccagtgcct tcaagttgac agccaaggca gtggccgtgc tgctgcccat cctgggtacc 840
tcgtgggtct ttggcgtgct tgctgtcaac ggttgtgctg tggttttcca gtacatgttt 900
gccacgctca actccctgca gggactgttc atattcctct ttcattgtct cctgaattca 960
gaggtgagag ccgccttcaa gcacaaaacc aaggtctggt cgctcacgag cagctctgcc 1020
cgcacctcca acgcgaagcc cttccactcg gacctcatga atgggacccg gccaggcatg 1080
gcctccacca agctcagccc ttgggacaag agcagccact ctgcccaccg cgtcgacctg 1140
tcagccgtgt ga 1152
<210> 168
<211> 15
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of GPR133 gene fragment
<400> 168
acacgtaagc agcac 15
<210> 169
<211> 874
<212> PRT
<213> Artificial Sequence
<220>
<223> GPR133 protein
<400> 169
Met Glu Lys Leu Leu Arg Leu Cys Cys Trp Tyr Ser Trp Leu Leu Leu
1 5 10 15
Phe Tyr Tyr Asn Phe Gln Val Arg Gly Val Tyr Ser Arg Ser Gln Asp
20 25 30
His Pro Gly Phe Gln Val Leu Ala Ser Ala Ser His Tyr Trp Pro Leu
35 40 45
Glu Asn Val Asp Gly Ile His Glu Leu Gln Asp Thr Thr Gly Asp Ile
50 55 60
Val Glu Gly Lys Val Asn Lys Gly Ile Tyr Leu Lys Glu Glu Lys Gly
65 70 75 80
Val Thr Leu Leu Tyr Tyr Gly Arg Tyr Asn Ser Ser Cys Ile Ser Lys
85 90 95
Pro Glu Gln Cys Gly Pro Glu Gly Val Thr Phe Ser Phe Phe Trp Lys
100 105 110
Thr Gln Gly Glu Gln Ser Arg Pro Ile Pro Ser Ala Tyr Gly Gly Gln
115 120 125
Val Ile Ser Asn Gly Phe Lys Val Cys Ser Ser Gly Gly Arg Gly Ser
130 135 140
Val Glu Leu Tyr Thr Arg Asp Asn Ser Met Thr Trp Glu Ala Ser Phe
145 150 155 160
Ser Pro Pro Gly Pro Tyr Trp Thr His Val Leu Phe Thr Trp Lys Ser
165 170 175
Lys Glu Gly Leu Lys Val Tyr Val Asn Gly Thr Leu Ser Thr Ser Asp
180 185 190
Pro Ser Gly Lys Val Ser Arg Asp Tyr Gly Glu Ser Asn Val Asn Leu
195 200 205
Val Ile Gly Ser Glu Gln Asp Gln Ala Lys Cys Tyr Glu Asn Gly Ala
210 215 220
Phe Asp Glu Phe Ile Ile Trp Glu Arg Ala Leu Thr Pro Asp Glu Ile
225 230 235 240
Ala Met Tyr Phe Thr Ala Ala Ile Gly Lys His Ala Leu Leu Ser Ser
245 250 255
Thr Leu Pro Ser Leu Phe Met Thr Ser Thr Ala Ser Pro Val Met Pro
260 265 270
Thr Asp Ala Tyr His Pro Ile Ile Thr Asn Leu Thr Glu Glu Arg Lys
275 280 285
Thr Phe Gln Ser Pro Gly Val Ile Leu Ser Tyr Leu Gln Asn Val Ser
290 295 300
Leu Ser Leu Pro Ser Lys Ser Leu Ser Glu Gln Thr Ala Leu Asn Leu
305 310 315 320
Thr Lys Thr Phe Leu Lys Ala Val Gly Glu Ile Leu Leu Leu Pro Gly
325 330 335
Trp Ile Ala Leu Ser Glu Asp Ser Ala Val Val Leu Ser Leu Ile Asp
340 345 350
Thr Ile Asp Thr Val Met Gly His Val Ser Ser Asn Leu His Gly Ser
355 360 365
Thr Pro Gln Val Thr Val Glu Gly Ser Ser Ala Met Ala Glu Phe Ser
370 375 380
Val Ala Lys Ile Leu Pro Lys Thr Val Asn Ser Ser His Tyr Arg Phe
385 390 395 400
Pro Ala His Gly Gln Ser Phe Ile Gln Ile Pro His Glu Ala Phe His
405 410 415
Arg His Ala Trp Ser Thr Val Val Gly Leu Leu Tyr His Ser Met His
420 425 430
Tyr Tyr Leu Asn Asn Ile Trp Pro Ala His Thr Lys Ile Ala Glu Ala
435 440 445
Met His His Gln Asp Cys Leu Leu Phe Ala Thr Ser His Leu Ile Ser
450 455 460
Leu Glu Val Ser Pro Pro Pro Thr Leu Ser Gln Asn Leu Ser Gly Ser
465 470 475 480
Pro Leu Ile Thr Val His Leu Lys His Arg Leu Thr Arg Lys Gln His
485 490 495
Ser Glu Ala Thr Asn Ser Ser Asn Arg Val Phe Val Tyr Cys Ala Phe
500 505 510
Leu Asp Phe Ser Ser Gly Glu Gly Val Trp Ser Asn His Gly Cys Ala
515 520 525
Leu Thr Arg Gly Asn Leu Thr Tyr Ser Val Cys Arg Cys Thr His Leu
530 535 540
Thr Asn Phe Ala Ile Leu Met Gln Val Val Pro Leu Glu Leu Ala Arg
545 550 555 560
Gly His Gln Val Ala Leu Ser Ser Ile Ser Tyr Val Gly Cys Ser Leu
565 570 575
Ser Val Leu Cys Leu Val Ala Thr Leu Val Thr Phe Ala Val Leu Ser
580 585 590
Ser Val Ser Thr Ile Arg Asn Gln Arg Tyr His Ile His Ala Asn Leu
595 600 605
Ser Phe Ala Val Leu Val Ala Gln Val Leu Leu Leu Ile Ser Phe Arg
610 615 620
Leu Glu Pro Gly Thr Thr Pro Cys Gln Val Met Ala Val Leu Leu His
625 630 635 640
Tyr Phe Phe Leu Ser Ala Phe Ala Trp Met Leu Val Glu Gly Leu His
645 650 655
Leu Tyr Ser Met Val Ile Lys Val Phe Gly Ser Glu Asp Ser Lys His
660 665 670
Arg Tyr Tyr Tyr Gly Met Gly Trp Gly Phe Pro Leu Leu Ile Cys Ile
675 680 685
Ile Ser Leu Ser Phe Ala Met Asp Ser Tyr Gly Thr Ser Asn Asn Cys
690 695 700
Trp Leu Ser Leu Ala Ser Gly Ala Ile Trp Ala Phe Val Ala Pro Ala
705 710 715 720
Leu Phe Val Ile Val Val Asn Ile Gly Ile Leu Ile Ala Val Thr Arg
725 730 735
Val Ile Ser Gln Ile Ser Ala Asp Asn Tyr Lys Ile His Gly Asp Pro
740 745 750
Ser Ala Phe Lys Leu Thr Ala Lys Ala Val Ala Val Leu Leu Pro Ile
755 760 765
Leu Gly Thr Ser Trp Val Phe Gly Val Leu Ala Val Asn Gly Cys Ala
770 775 780
Val Val Phe Gln Tyr Met Phe Ala Thr Leu Asn Ser Leu Gln Gly Leu
785 790 795 800
Phe Ile Phe Leu Phe His Cys Leu Leu Asn Ser Glu Val Arg Ala Ala
805 810 815
Phe Lys His Lys Thr Lys Val Trp Ser Leu Thr Ser Ser Ser Ala Arg
820 825 830
Thr Ser Asn Ala Lys Pro Phe His Ser Asp Leu Met Asn Gly Thr Arg
835 840 845
Pro Gly Met Ala Ser Thr Lys Leu Ser Pro Trp Asp Lys Ser Ser His
850 855 860
Ser Ala His Arg Val Asp Leu Ser Ala Val
865 870
<210> 170
<211> 383
<212> PRT
<213> Artificial Sequence
<220>
<223> GPR133 protein fragment
<400> 170
Thr Arg Lys Gln His Ser Glu Ala Thr Asn Ser Ser Asn Arg Val Phe
1 5 10 15
Val Tyr Cys Ala Phe Leu Asp Phe Ser Ser Gly Glu Gly Val Trp Ser
20 25 30
Asn His Gly Cys Ala Leu Thr Arg Gly Asn Leu Thr Tyr Ser Val Cys
35 40 45
Arg Cys Thr His Leu Thr Asn Phe Ala Ile Leu Met Gln Val Val Pro
50 55 60
Leu Glu Leu Ala Arg Gly His Gln Val Ala Leu Ser Ser Ile Ser Tyr
65 70 75 80
Val Gly Cys Ser Leu Ser Val Leu Cys Leu Val Ala Thr Leu Val Thr
85 90 95
Phe Ala Val Leu Ser Ser Val Ser Thr Ile Arg Asn Gln Arg Tyr His
100 105 110
Ile His Ala Asn Leu Ser Phe Ala Val Leu Val Ala Gln Val Leu Leu
115 120 125
Leu Ile Ser Phe Arg Leu Glu Pro Gly Thr Thr Pro Cys Gln Val Met
130 135 140
Ala Val Leu Leu His Tyr Phe Phe Leu Ser Ala Phe Ala Trp Met Leu
145 150 155 160
Val Glu Gly Leu His Leu Tyr Ser Met Val Ile Lys Val Phe Gly Ser
165 170 175
Glu Asp Ser Lys His Arg Tyr Tyr Tyr Gly Met Gly Trp Gly Phe Pro
180 185 190
Leu Leu Ile Cys Ile Ile Ser Leu Ser Phe Ala Met Asp Ser Tyr Gly
195 200 205
Thr Ser Asn Asn Cys Trp Leu Ser Leu Ala Ser Gly Ala Ile Trp Ala
210 215 220
Phe Val Ala Pro Ala Leu Phe Val Ile Val Val Asn Ile Gly Ile Leu
225 230 235 240
Ile Ala Val Thr Arg Val Ile Ser Gln Ile Ser Ala Asp Asn Tyr Lys
245 250 255
Ile His Gly Asp Pro Ser Ala Phe Lys Leu Thr Ala Lys Ala Val Ala
260 265 270
Val Leu Leu Pro Ile Leu Gly Thr Ser Trp Val Phe Gly Val Leu Ala
275 280 285
Val Asn Gly Cys Ala Val Val Phe Gln Tyr Met Phe Ala Thr Leu Asn
290 295 300
Ser Leu Gln Gly Leu Phe Ile Phe Leu Phe His Cys Leu Leu Asn Ser
305 310 315 320
Glu Val Arg Ala Ala Phe Lys His Lys Thr Lys Val Trp Ser Leu Thr
325 330 335
Ser Ser Ser Ala Arg Thr Ser Asn Ala Lys Pro Phe His Ser Asp Leu
340 345 350
Met Asn Gly Thr Arg Pro Gly Met Ala Ser Thr Lys Leu Ser Pro Trp
355 360 365
Asp Lys Ser Ser His Ser Ala His Arg Val Asp Leu Ser Ala Val
370 375 380
<210> 171
<211> 6
<212> PRT
<213> Artificial Sequence
<220>
<223> Break-point of GPR133 protein fragment
<400> 171
Thr Arg Lys Gln His Ser
1 5
<210> 172
<211> 2739
<212> DNA
<213> Artificial Sequence
<220>
<223> TXNRD1-GPR133 fusion gene
<400> 172
atgtcatgtg aggacggtcg ggccctggaa ggaacgctct cggaattggc cgcggaaacc 60
gatctgcccg ttgtgtttgt gaaacagaga aagataggcg gccatggtcc aaccttgaag 120
gcttatcagg agggcagact tcaaaagcta ctaaaaatga acggccctga agatcttccc 180
aagtcctatg actatgacct tatcatcatt ggaggtggct caggaggtct ggcagctgct 240
aaggaggcag cccaatatgg caagaaggtg atggtcctgg actttgtcac tcccacccct 300
cttggaacta gatggggtct cggaggaaca tgtgtgaatg tgggttgcat acctaaaaaa 360
ctgatgcatc aagcagcttt gttaggacaa gccctgcaag actctcgaaa ttatggatgg 420
aaagtcgagg agacagttaa gcatgattgg gacagaatga tagaagctgt acagaatcac 480
attggctctt tgaattgggg ctaccgagta gctctgcggg agaaaaaagt cgtctatgag 540
aatgcttatg ggcaatttat tggtcctcac aggattaagg caacaaataa taaaggcaaa 600
gaaaaaattt attcagcaga gagatttctc attgccactg gtgaaagacc acgttacttg 660
ggcatccctg gtgacaaaga atactgcatc agcagtgatg atcttttctc cttgccttac 720
tgcccgggta agaccctggt tgttggagca tcctatgtcg ctttggagtg cgctggattt 780
cttgctggta ttggtttaga cgtcactgtt atggttaggt ccattcttct tagaggattt 840
gaccaggaca tggccaacaa aattggtgaa cacatggaag aacatggcat caagtttata 900
agacagttcg taccaattaa agttgaacaa attgaagcag ggacaccagg ccgactcaga 960
gtagtagctc agtccaccaa tagtgaggaa atcattgaag gagaatataa tacggtgatg 1020
ctggcaatag gaagagatgc ttgcacaaga aaaattggct tagaaaccgt aggggtgaag 1080
ataaatgaaa agactggaaa aatacctgtc acagatgaag aacagaccaa tgtgccttac 1140
atctatgcca ttggcgatat attggaggat aaggtggagc tcaccccagt tgcaatccag 1200
gcaggaagat tgctggctca gaggctctat gcaggttcca ctgtcaagtg tgactatgaa 1260
aatgttccaa ccactgtatt tactcctttg gaatatggtg cttgtggcct ttctgaggag 1320
aaagctgtgg agaagtttgg ggaagaaaat attgaggttt accatagtta cttttggcca 1380
ttggaatgga cgattccgtc aagagataac aacaaatgtt atgcaaaaat aatctgtaat 1440
actaaagaca atgaacgtgt tgtgggcttt cacgtactgg gtccaaatgc tggagaagtt 1500
acacaaggct ttgcagctgc gctcaaatgt ggactgacca aaaagcagct ggacagcaca 1560
attggaatcc accctgtctg tgcagagaca cgtaagcagc acagtgaggc caccaacagc 1620
agcaaccgag tcttcgtgta ctgcgccttc ctggacttca gctccggaga aggggtctgg 1680
tcgaaccacg gctgtgcgct cacgagagga aacctcacct actccgtctg ccgctgcact 1740
cacctcacca actttgccat cctcatgcag gtggtcccgc tggagcttgc acgcggacac 1800
caggtggcgc tgtcgtctat cagctatgtg ggctgctccc tctccgtgct ctgcctggtg 1860
gccacgctgg tcaccttcgc cgtgctgtcc tccgtgagca ccatccggaa ccagcgctac 1920
cacatccacg ccaacctgtc cttcgccgtg ctggtggccc aggtcctgct gctcattagt 1980
ttccgcctcg agccgggcac gaccccctgc caagtgatgg ccgtgctcct acactacttc 2040
ttcctgagtg ccttcgcatg gatgctggtg gaggggctgc acctctacag catggtgatc 2100
aaggtctttg ggtcggagga cagcaagcac cgttactact atgggatggg atggggtttt 2160
cctcttctga tctgcatcat ttcactgtca tttgccatgg acagttacgg aacaagcaac 2220
aattgctggc tgtcgttggc gagtggcgcc atctgggcct ttgtagcccc tgccctgttt 2280
gtcatcgtgg tcaacattgg catcctcatc gctgtgacca gagtcatctc acagatcagc 2340
gccgacaact acaagatcca tggagacccc agtgccttca agttgacagc caaggcagtg 2400
gccgtgctgc tgcccatcct gggtacctcg tgggtctttg gcgtgcttgc tgtcaacggt 2460
tgtgctgtgg ttttccagta catgtttgcc acgctcaact ccctgcaggg actgttcata 2520
ttcctctttc attgtctcct gaattcagag gtgagagccg ccttcaagca caaaaccaag 2580
gtctggtcgc tcacgagcag ctctgcccgc acctccaacg cgaagccctt ccactcggac 2640
ctcatgaatg ggacccggcc aggcatggcc tccaccaagc tcagcccttg ggacaagagc 2700
agccactctg cccaccgcgt cgacctgtca gccgtgtga 2739
<210> 173
<211> 36
<212> DNA
<213> Artificial Sequence
<220>
<223> Fused region of TXNRD1-GPR133 fusion gene
<400> 173
atccaccctg tctgtgcaga gacacgtaag cagcac 36
<210> 174
<211> 912
<212> PRT
<213> Artificial Sequence
<220>
<223> TXNRD1-GPR133 fusion protein
<400> 174
Met Ser Cys Glu Asp Gly Arg Ala Leu Glu Gly Thr Leu Ser Glu Leu
1 5 10 15
Ala Ala Glu Thr Asp Leu Pro Val Val Phe Val Lys Gln Arg Lys Ile
20 25 30
Gly Gly His Gly Pro Thr Leu Lys Ala Tyr Gln Glu Gly Arg Leu Gln
35 40 45
Lys Leu Leu Lys Met Asn Gly Pro Glu Asp Leu Pro Lys Ser Tyr Asp
50 55 60
Tyr Asp Leu Ile Ile Ile Gly Gly Gly Ser Gly Gly Leu Ala Ala Ala
65 70 75 80
Lys Glu Ala Ala Gln Tyr Gly Lys Lys Val Met Val Leu Asp Phe Val
85 90 95
Thr Pro Thr Pro Leu Gly Thr Arg Trp Gly Leu Gly Gly Thr Cys Val
100 105 110
Asn Val Gly Cys Ile Pro Lys Lys Leu Met His Gln Ala Ala Leu Leu
115 120 125
Gly Gln Ala Leu Gln Asp Ser Arg Asn Tyr Gly Trp Lys Val Glu Glu
130 135 140
Thr Val Lys His Asp Trp Asp Arg Met Ile Glu Ala Val Gln Asn His
145 150 155 160
Ile Gly Ser Leu Asn Trp Gly Tyr Arg Val Ala Leu Arg Glu Lys Lys
165 170 175
Val Val Tyr Glu Asn Ala Tyr Gly Gln Phe Ile Gly Pro His Arg Ile
180 185 190
Lys Ala Thr Asn Asn Lys Gly Lys Glu Lys Ile Tyr Ser Ala Glu Arg
195 200 205
Phe Leu Ile Ala Thr Gly Glu Arg Pro Arg Tyr Leu Gly Ile Pro Gly
210 215 220
Asp Lys Glu Tyr Cys Ile Ser Ser Asp Asp Leu Phe Ser Leu Pro Tyr
225 230 235 240
Cys Pro Gly Lys Thr Leu Val Val Gly Ala Ser Tyr Val Ala Leu Glu
245 250 255
Cys Ala Gly Phe Leu Ala Gly Ile Gly Leu Asp Val Thr Val Met Val
260 265 270
Arg Ser Ile Leu Leu Arg Gly Phe Asp Gln Asp Met Ala Asn Lys Ile
275 280 285
Gly Glu His Met Glu Glu His Gly Ile Lys Phe Ile Arg Gln Phe Val
290 295 300
Pro Ile Lys Val Glu Gln Ile Glu Ala Gly Thr Pro Gly Arg Leu Arg
305 310 315 320
Val Val Ala Gln Ser Thr Asn Ser Glu Glu Ile Ile Glu Gly Glu Tyr
325 330 335
Asn Thr Val Met Leu Ala Ile Gly Arg Asp Ala Cys Thr Arg Lys Ile
340 345 350
Gly Leu Glu Thr Val Gly Val Lys Ile Asn Glu Lys Thr Gly Lys Ile
355 360 365
Pro Val Thr Asp Glu Glu Gln Thr Asn Val Pro Tyr Ile Tyr Ala Ile
370 375 380
Gly Asp Ile Leu Glu Asp Lys Val Glu Leu Thr Pro Val Ala Ile Gln
385 390 395 400
Ala Gly Arg Leu Leu Ala Gln Arg Leu Tyr Ala Gly Ser Thr Val Lys
405 410 415
Cys Asp Tyr Glu Asn Val Pro Thr Thr Val Phe Thr Pro Leu Glu Tyr
420 425 430
Gly Ala Cys Gly Leu Ser Glu Glu Lys Ala Val Glu Lys Phe Gly Glu
435 440 445
Glu Asn Ile Glu Val Tyr His Ser Tyr Phe Trp Pro Leu Glu Trp Thr
450 455 460
Ile Pro Ser Arg Asp Asn Asn Lys Cys Tyr Ala Lys Ile Ile Cys Asn
465 470 475 480
Thr Lys Asp Asn Glu Arg Val Val Gly Phe His Val Leu Gly Pro Asn
485 490 495
Ala Gly Glu Val Thr Gln Gly Phe Ala Ala Ala Leu Lys Cys Gly Leu
500 505 510
Thr Lys Lys Gln Leu Asp Ser Thr Ile Gly Ile His Pro Val Cys Ala
515 520 525
Glu Thr Arg Lys Gln His Ser Glu Ala Thr Asn Ser Ser Asn Arg Val
530 535 540
Phe Val Tyr Cys Ala Phe Leu Asp Phe Ser Ser Gly Glu Gly Val Trp
545 550 555 560
Ser Asn His Gly Cys Ala Leu Thr Arg Gly Asn Leu Thr Tyr Ser Val
565 570 575
Cys Arg Cys Thr His Leu Thr Asn Phe Ala Ile Leu Met Gln Val Val
580 585 590
Pro Leu Glu Leu Ala Arg Gly His Gln Val Ala Leu Ser Ser Ile Ser
595 600 605
Tyr Val Gly Cys Ser Leu Ser Val Leu Cys Leu Val Ala Thr Leu Val
610 615 620
Thr Phe Ala Val Leu Ser Ser Val Ser Thr Ile Arg Asn Gln Arg Tyr
625 630 635 640
His Ile His Ala Asn Leu Ser Phe Ala Val Leu Val Ala Gln Val Leu
645 650 655
Leu Leu Ile Ser Phe Arg Leu Glu Pro Gly Thr Thr Pro Cys Gln Val
660 665 670
Met Ala Val Leu Leu His Tyr Phe Phe Leu Ser Ala Phe Ala Trp Met
675 680 685
Leu Val Glu Gly Leu His Leu Tyr Ser Met Val Ile Lys Val Phe Gly
690 695 700
Ser Glu Asp Ser Lys His Arg Tyr Tyr Tyr Gly Met Gly Trp Gly Phe
705 710 715 720
Pro Leu Leu Ile Cys Ile Ile Ser Leu Ser Phe Ala Met Asp Ser Tyr
725 730 735
Gly Thr Ser Asn Asn Cys Trp Leu Ser Leu Ala Ser Gly Ala Ile Trp
740 745 750
Ala Phe Val Ala Pro Ala Leu Phe Val Ile Val Val Asn Ile Gly Ile
755 760 765
Leu Ile Ala Val Thr Arg Val Ile Ser Gln Ile Ser Ala Asp Asn Tyr
770 775 780
Lys Ile His Gly Asp Pro Ser Ala Phe Lys Leu Thr Ala Lys Ala Val
785 790 795 800
Ala Val Leu Leu Pro Ile Leu Gly Thr Ser Trp Val Phe Gly Val Leu
805 810 815
Ala Val Asn Gly Cys Ala Val Val Phe Gln Tyr Met Phe Ala Thr Leu
820 825 830
Asn Ser Leu Gln Gly Leu Phe Ile Phe Leu Phe His Cys Leu Leu Asn
835 840 845
Ser Glu Val Arg Ala Ala Phe Lys His Lys Thr Lys Val Trp Ser Leu
850 855 860
Thr Ser Ser Ser Ala Arg Thr Ser Asn Ala Lys Pro Phe His Ser Asp
865 870 875 880
Leu Met Asn Gly Thr Arg Pro Gly Met Ala Ser Thr Lys Leu Ser Pro
885 890 895
Trp Asp Lys Ser Ser His Ser Ala His Arg Val Asp Leu Ser Ala Val
900 905 910
<210> 175
<211> 13
<212> PRT
<213> Artificial Sequence
<220>
<223> Fused region of TXNRD1-GPR133 fusion protein
<400> 175
Ile His Pro Val Cys Ala Glu Thr Arg Lys Gln His Ser
1 5 10
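The 13-residue fused region above can be cross-checked against the fusion protein record that immediately precedes it. The following is a minimal sketch, not part of the filing: it converts the three-letter residues to one-letter codes and locates the fused region inside a 32-residue window copied from that preceding record. The window start (residue 513) is read off the position markers printed under the listing rows, and the helper name `to_one_letter` is hypothetical.

```python
# Standard three-letter to one-letter amino acid codes (20 common residues).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter: str) -> str:
    """Convert a whitespace-separated three-letter sequence to one-letter codes."""
    return "".join(THREE_TO_ONE[residue] for residue in three_letter.split())


# SEQ ID NO: 175 as printed in the listing.
FUSED_REGION = "Ile His Pro Val Cys Ala Glu Thr Arg Lys Gln His Ser"

# Two rows copied from the preceding fusion protein record.
# Assumption: the first residue of this window is position 513,
# inferred from the "515 520 525" markers under the row.
WINDOW_START = 513
WINDOW = (
    "Thr Lys Lys Gln Leu Asp Ser Thr Ile Gly Ile His Pro Val Cys Ala "
    "Glu Thr Arg Lys Gln His Ser Glu Ala Thr Asn Ser Ser Asn Arg Val"
)

fused = to_one_letter(FUSED_REGION)
window = to_one_letter(WINDOW)

# Locate the fused region inside the window and map it back to residue numbers.
offset = window.find(fused)
first_residue = WINDOW_START + offset
last_residue = first_residue + len(fused) - 1

print(fused, len(fused))
print(f"fused region spans residues {first_residue}-{last_residue}")
```

Under the stated numbering assumption, the fused region maps to a contiguous stretch of the fusion protein, which is what a junction-spanning peptide should do.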
<210> 176
<211> 266
<212> DNA
<213> Artificial Sequence
<220>
<223> SCAF11 gene (NM_004719) fragment
<400> 176
ggaggggagg ggagggagga ggctagacaa ggcgggggaa gggggagtag cggtggctta 60
agccgcgcgg agcagcgcaa cctgggtcgc tccctgcttc gccgccgcct ccggaccgag 120
ccagcggagt cagtgtccta gagaccctgt aacaccacaa agcggacgaa ggagtccatg 180
ttggggaact tggcagcgga gtgactggga cctgggaacc tactgtgggg ccgcggccgg 240
accgagcgcc tcgacctcgg tctgag 266
<210> 177
<211> 11
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of SCAF11 gene fragment
<400> 177
ctcggtctga g 11
<210> 178
<211> 3282
<212> DNA
<213> Artificial Sequence
<220>
<223> PDGFRA gene (NM_006206) fragment
<400> 178
tttcccagag ctatggggac ttcccatccg gcgttcctgg tcttaggctg tcttctcaca 60
gggctgagcc taatcctctg ccagctttca ttaccctcta tccttccaaa tgaaaatgaa 120
aaggttgtgc agctgaattc atccttttct ctgagatgct ttggggagag tgaagtgagc 180
tggcagtacc ccatgtctga agaagagagc tccgatgtgg aaatcagaaa tgaagaaaac 240
aacagcggcc tttttgtgac ggtcttggaa gtgagcagtg cctcggcggc ccacacaggg 300
ttgtacactt gctattacaa ccacactcag acagaagaga atgagcttga aggcaggcac 360
atttacatct atgtgccaga cccagatgta gcctttgtac ctctaggaat gacggattat 420
ttagtcatcg tggaggatga tgattctgcc attatacctt gtcgcacaac tgatcccgag 480
actcctgtaa ccttacacaa cagtgagggg gtggtacctg cctcctacga cagcagacag 540
ggctttaatg ggaccttcac tgtagggccc tatatctgtg aggccaccgt caaaggaaag 600
aagttccaga ccatcccatt taatgtttat gctttaaaag caacatcaga gctggatcta 660
gaaatggaag ctcttaaaac cgtgtataag tcaggggaaa cgattgtggt cacctgtgct 720
gtttttaaca atgaggtggt tgaccttcaa tggacttacc ctggagaagt gaaaggcaaa 780
ggcatcacaa tgctggaaga aatcaaagtc ccatccatca aattggtgta cactttgacg 840
gtccccgagg ccacggtgaa agacagtgga gattacgaat gtgctgcccg ccaggctacc 900
agggaggtca aagaaatgaa gaaagtcact atttctgtcc atgagaaagg tttcattgaa 960
atcaaaccca ccttcagcca gttggaagct gtcaacctgc atgaagtcaa acattttgtt 1020
gtagaggtgc gggcctaccc acctcccagg atatcctggc tgaaaaacaa tctgactctg 1080
attgaaaatc tcactgagat caccactgat gtggaaaaga ttcaggaaat aaggtatcga 1140
agcaaattaa agctgatccg tgctaaggaa gaagacagtg gccattatac tattgtagct 1200
caaaatgaag atgctgtgaa gagctatact tttgaactgt taactcaagt tccttcatcc 1260
attctggact tggtcgatga tcaccatggc tcaactgggg gacagacggt gaggtgcaca 1320
gctgaaggca cgccgcttcc tgatattgag tggatgatat gcaaagatat taagaaatgt 1380
aataatgaaa cttcctggac tattttggcc aacaatgtct caaacatcat cacggagatc 1440
cactcccgag acaggagtac cgtggagggc cgtgtgactt tcgccaaagt ggaggagacc 1500
atcgccgtgc gatgcctggc taagaatctc cttggagctg agaaccgaga gctgaagctg 1560
gtggctccca ccctgcgttc tgaactcacg gtggctgctg cagtcctggt gctgttggtg 1620
attgtgatca tctcacttat tgtcctggtt gtcatttgga aacagaaacc gaggtatgaa 1680
attcgctgga gggtcattga atcaatcagc ccagatggac atgaatatat ttatgtggac 1740
ccgatgcagc tgccttatga ctcaagatgg gagtttccaa gagatggact agtgcttggt 1800
cgggtcttgg ggtctggagc gtttgggaag gtggttgaag gaacagccta tggattaagc 1860
cggtcccaac ctgtcatgaa agttgcagtg aagatgctaa aacccacggc cagatccagt 1920
gaaaaacaag ctctcatgtc tgaactgaag ataatgactc acctggggcc acatttgaac 1980
attgtaaact tgctgggagc ctgcaccaag tcaggcccca tttacatcat cacagagtat 2040
tgcttctatg gagatttggt caactatttg cataagaata gggatagctt cctgagccac 2100
cacccagaga agccaaagaa agagctggat atctttggat tgaaccctgc tgatgaaagc 2160
acacggagct atgttatttt atcttttgaa aacaatggtg actacatgga catgaagcag 2220
gctgatacta cacagtatgt ccccatgcta gaaaggaaag aggtttctaa atattccgac 2280
atccagagat cactctatga tcgtccagcc tcatataaga agaaatctat gttagactca 2340
gaagtcaaaa acctcctttc agatgataac tcagaaggcc ttactttatt ggatttgttg 2400
agcttcacct atcaagttgc ccgaggaatg gagtttttgg cttcaaaaaa ttgtgtccac 2460
cgtgatctgg ctgctcgcaa cgtcctcctg gcacaaggaa aaattgtgaa gatctgtgac 2520
tttggcctgg ccagagacat catgcatgat tcgaactatg tgtcgaaagg cagtaccttt 2580
ctgcccgtga agtggatggc tcctgagagc atctttgaca acctctacac cacactgagt 2640
gatgtctggt cttatggcat tctgctctgg gagatctttt cccttggtgg caccccttac 2700
cccggcatga tggtggattc tactttctac aataagatca agagtgggta ccggatggcc 2760
aagcctgacc acgctaccag tgaagtctac gagatcatgg tgaaatgctg gaacagtgag 2820
ccggagaaga gaccctcctt ttaccacctg agtgagattg tggagaatct gctgcctgga 2880
caatataaaa agagttatga aaaaattcac ctggacttcc tgaagagtga ccatcctgct 2940
gtggcacgca tgcgtgtgga ctcagacaat gcatacattg gtgtcaccta caaaaacgag 3000
gaagacaagc tgaaggactg ggagggtggt ctggatgagc agagactgag cgctgacagt 3060
ggctacatca ttcctctgcc tgacattgac cctgtccctg aggaggagga cctgggcaag 3120
aggaacagac acagctcgca gacctctgaa gagagtgcca ttgagacggg ttccagcagt 3180
tccaccttca tcaagagaga ggacgagacc attgaagaca tcgacatgat ggatgacatc 3240
ggcatagact cttcagacct ggtggaagac agcttcctgt aa 3282
<210> 179
<211> 12
<212> DNA
<213> Artificial Sequence
<220>
<223> Break-point of PDGFRA gene fragment
<400> 179
tttcccagag ct 12
<210> 180
<211> 3548
<212> DNA
<213> Artificial Sequence
<220>
<223> SCAF11-PDGFRA fusion gene
<400> 180
ggaggggagg ggagggagga ggctagacaa ggcgggggaa gggggagtag cggtggctta 60
agccgcgcgg agcagcgcaa cctgggtcgc tccctgcttc gccgccgcct ccggaccgag 120
ccagcggagt cagtgtccta gagaccctgt aacaccacaa agcggacgaa ggagtccatg 180
ttggggaact tggcagcgga gtgactggga cctgggaacc tactgtgggg ccgcggccgg 240
accgagcgcc tcgacctcgg tctgagtttc ccagagctat ggggacttcc catccggcgt 300
tcctggtctt aggctgtctt ctcacagggc tgagcctaat cctctgccag ctttcattac 360
cctctatcct tccaaatgaa aatgaaaagg ttgtgcagct gaattcatcc ttttctctga 420
gatgctttgg ggagagtgaa gtgagctggc agtaccccat gtctgaagaa gagagctccg 480
atgtggaaat cagaaatgaa gaaaacaaca gcggcctttt tgtgacggtc ttggaagtga 540
gcagtgcctc ggcggcccac acagggttgt acacttgcta ttacaaccac actcagacag 600
aagagaatga gcttgaaggc aggcacattt acatctatgt gccagaccca gatgtagcct 660
ttgtacctct aggaatgacg gattatttag tcatcgtgga ggatgatgat tctgccatta 720
taccttgtcg cacaactgat cccgagactc ctgtaacctt acacaacagt gagggggtgg 780
tacctgcctc ctacgacagc agacagggct ttaatgggac cttcactgta gggccctata 840
tctgtgaggc caccgtcaaa ggaaagaagt tccagaccat cccatttaat gtttatgctt 900
taaaagcaac atcagagctg gatctagaaa tggaagctct taaaaccgtg tataagtcag 960
gggaaacgat tgtggtcacc tgtgctgttt ttaacaatga ggtggttgac cttcaatgga 1020
cttaccctgg agaagtgaaa ggcaaaggca tcacaatgct ggaagaaatc aaagtcccat 1080
ccatcaaatt ggtgtacact ttgacggtcc ccgaggccac ggtgaaagac agtggagatt 1140
acgaatgtgc tgcccgccag gctaccaggg aggtcaaaga aatgaagaaa gtcactattt 1200
ctgtccatga gaaaggtttc attgaaatca aacccacctt cagccagttg gaagctgtca 1260
acctgcatga agtcaaacat tttgttgtag aggtgcgggc ctacccacct cccaggatat 1320
cctggctgaa aaacaatctg actctgattg aaaatctcac tgagatcacc actgatgtgg 1380
aaaagattca ggaaataagg tatcgaagca aattaaagct gatccgtgct aaggaagaag 1440
acagtggcca ttatactatt gtagctcaaa atgaagatgc tgtgaagagc tatacttttg 1500
aactgttaac tcaagttcct tcatccattc tggacttggt cgatgatcac catggctcaa 1560
ctgggggaca gacggtgagg tgcacagctg aaggcacgcc gcttcctgat attgagtgga 1620
tgatatgcaa agatattaag aaatgtaata atgaaacttc ctggactatt ttggccaaca 1680
atgtctcaaa catcatcacg gagatccact cccgagacag gagtaccgtg gagggccgtg 1740
tgactttcgc caaagtggag gagaccatcg ccgtgcgatg cctggctaag aatctccttg 1800
gagctgagaa ccgagagctg aagctggtgg ctcccaccct gcgttctgaa ctcacggtgg 1860
ctgctgcagt cctggtgctg ttggtgattg tgatcatctc acttattgtc ctggttgtca 1920
tttggaaaca gaaaccgagg tatgaaattc gctggagggt cattgaatca atcagcccag 1980
atggacatga atatatttat gtggacccga tgcagctgcc ttatgactca agatgggagt 2040
ttccaagaga tggactagtg cttggtcggg tcttggggtc tggagcgttt gggaaggtgg 2100
ttgaaggaac agcctatgga ttaagccggt cccaacctgt catgaaagtt gcagtgaaga 2160
tgctaaaacc cacggccaga tccagtgaaa aacaagctct catgtctgaa ctgaagataa 2220
tgactcacct ggggccacat ttgaacattg taaacttgct gggagcctgc accaagtcag 2280
gccccattta catcatcaca gagtattgct tctatggaga tttggtcaac tatttgcata 2340
agaataggga tagcttcctg agccaccacc cagagaagcc aaagaaagag ctggatatct 2400
ttggattgaa ccctgctgat gaaagcacac ggagctatgt tattttatct tttgaaaaca 2460
atggtgacta catggacatg aagcaggctg atactacaca gtatgtcccc atgctagaaa 2520
ggaaagaggt ttctaaatat tccgacatcc agagatcact ctatgatcgt ccagcctcat 2580
ataagaagaa atctatgtta gactcagaag tcaaaaacct cctttcagat gataactcag 2640
aaggccttac tttattggat ttgttgagct tcacctatca agttgcccga ggaatggagt 2700
ttttggcttc aaaaaattgt gtccaccgtg atctggctgc tcgcaacgtc ctcctggcac 2760
aaggaaaaat tgtgaagatc tgtgactttg gcctggccag agacatcatg catgattcga 2820
actatgtgtc gaaaggcagt acctttctgc ccgtgaagtg gatggctcct gagagcatct 2880
ttgacaacct ctacaccaca ctgagtgatg tctggtctta tggcattctg ctctgggaga 2940
tcttttccct tggtggcacc ccttaccccg gcatgatggt ggattctact ttctacaata 3000
agatcaagag tgggtaccgg atggccaagc ctgaccacgc taccagtgaa gtctacgaga 3060
tcatggtgaa atgctggaac agtgagccgg agaagagacc ctccttttac cacctgagtg 3120
agattgtgga gaatctgctg cctggacaat ataaaaagag ttatgaaaaa attcacctgg 3180
acttcctgaa gagtgaccat cctgctgtgg cacgcatgcg tgtggactca gacaatgcat 3240
acattggtgt cacctacaaa aacgaggaag acaagctgaa ggactgggag ggtggtctgg 3300
atgagcagag actgagcgct gacagtggct acatcattcc tctgcctgac attgaccctg 3360
tccctgagga ggaggacctg ggcaagagga acagacacag ctcgcagacc tctgaagaga 3420
gtgccattga gacgggttcc agcagttcca ccttcatcaa gagagaggac gagaccattg 3480
aagacatcga catgatggat gacatcggca tagactcttc agacctggtg gaagacagct 3540
tcctgtaa 3548
<210> 181
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Fused region of SCAF11-PDGFRA fusion gene
<400> 181
ctcggtctga gtttcccaga gc 22
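The relationship between SEQ ID NOs: 176 to 181 can be checked mechanically. The sketch below is an illustration under stated assumptions, not the applicant's procedure: it copies the full SCAF11 fragment (SEQ ID NO: 176) and only the first 60 nucleotides of the PDGFRA fragment (SEQ ID NO: 178) from the listing, then confirms that the two break-point sequences and the 22-nt fused region are consistent with a simple head-to-tail join. The total length check uses the declared `<211>` lengths rather than the full PDGFRA sequence.

```python
def clean(block: str) -> str:
    """Strip the spaces and newlines used for layout in the sequence listing."""
    return "".join(block.split())


# SEQ ID NO: 176, SCAF11 gene fragment (266 nt), copied from the listing.
SCAF11 = clean("""
ggaggggagg ggagggagga ggctagacaa ggcgggggaa gggggagtag cggtggctta
agccgcgcgg agcagcgcaa cctgggtcgc tccctgcttc gccgccgcct ccggaccgag
ccagcggagt cagtgtccta gagaccctgt aacaccacaa agcggacgaa ggagtccatg
ttggggaact tggcagcgga gtgactggga cctgggaacc tactgtgggg ccgcggccgg
accgagcgcc tcgacctcgg tctgag
""")

# First 60 nt of SEQ ID NO: 178 (PDGFRA fragment); the rest is omitted here.
PDGFRA_HEAD = clean(
    "tttcccagag ctatggggac ttcccatccg gcgttcctgg tcttaggctg tcttctcaca"
)

SEQ_177 = clean("ctcggtctga g")               # break-point of SCAF11 fragment
SEQ_179 = clean("tttcccagag ct")              # break-point of PDGFRA fragment
SEQ_181 = clean("ctcggtctga gtttcccaga gc")   # fused region of the fusion gene

# The SCAF11 break-point is the 3' end of the SCAF11 fragment.
scaf11_ok = SCAF11.endswith(SEQ_177)
# The PDGFRA break-point is the 5' end of the PDGFRA fragment.
pdgfra_ok = PDGFRA_HEAD.startswith(SEQ_179)

# The fused region is the last 11 nt of SCAF11 followed by the first 11 nt of PDGFRA.
junction = SCAF11[-11:] + PDGFRA_HEAD[:11]
junction_ok = junction == SEQ_181

# Declared lengths from the <211> fields: 266 + 3282 should equal 3548.
declared_total = len(SCAF11) + 3282

print(scaf11_ok, pdgfra_ok, junction_ok, declared_total)
```

Note that the fused region takes 11 nucleotides from each side, whereas the PDGFRA break-point record (SEQ ID NO: 179) is 12 nucleotides long.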
<210> 182
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Forward Primer of CCDC6-ROS1
<400> 182
cctgcaggaa aaattagacc ag 22
<210> 183
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Reverse Primer of CCDC6-ROS1
<400> 183
agctcagcca actctttgtc tt 22
<210> 184
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Forward Primer of SCAF11-PDGFRA
<400> 184
cagcggagtc agtgtcctag ag 22
<210> 185
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Reverse Primer of SCAF11-PDGFRA
<400> 185
tgagaagaca gcctaagacc ag 22
<210> 186
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Forward Primer of FGFR2-CIT
<400> 186
acatgatgat gagggactgt tg 22
<210> 187
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Reverse Primer of FGFR2-CIT
<400> 187
acagctgtta cgaagagcat ca 22
<210> 188
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Forward Primer of AXL-MBIP
<400> 188
gcctgacgaa atcctctatg tc 22
<210> 189
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Reverse Primer of AXL-MBIP
<400> 189
caaaattccc tgacgttgtt tt 22
<210> 190
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Forward Primer of APLP2-TNFSF11
<400> 190
tgctgagaac aaagatcgct ta 22
<210> 191
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Reverse Primer of APLP2-TNFSF11
<400> 191
tgtcggtggc attaatagtg ag 22
<210> 192
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Forward Primer of MAP4K3-PRKCE
<400> 192
aggaggactt cgagctgatt c 21
<210> 193
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Reverse Primer of MAP4K3-PRKCE
<400> 193
acgaccctga gagatcgatg a 21
<210> 194
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Forward Primer of BCAS3-MAP3K3
<400> 194
catcccgtcc agtctctgat 20
<210> 195
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Reverse Primer of BCAS3-MAP3K3
<400> 195
ctgcctattt gagtgacctg tg 22
<210> 196
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Forward Primer of KRAS-CDH13
<400> 196
ggaaataaat gtgatttgcc ttc 23
<210> 197
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Reverse Primer of KRAS-CDH13
<400> 197
aaggctgtct ctgattctct gg 22
<210> 198
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Forward Primer of ZFYVE9-CGA
<400> 198
actgcagaga acatggattc ct 22
<210> 199
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Reverse Primer of ZFYVE9-CGA
<400> 199
gaatggagaa catgcagaaa ca 22
<210> 200
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Forward Primer of ERBB2IP-MAST4
<400> 200
aacaagggta caacctgaag ga 22
<210> 201
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Reverse Primer of ERBB2IP-MAST4
<400> 201
tcaaggaagt atcgtgaggt ga 22
<210> 202
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Forward Primer of TPD52L1-TRMT11
<400> 202
gaaaacacat gaaaccctga gtc 23
<210> 203
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Reverse Primer of TPD52L1-TRMT11
<400> 203
atgtgtgact ggaaagcttc tg 22
<210> 204
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Forward Primer of TXNRD1-GPR133
<400> 204
tccaaatgct ggagaagtta ca 22
<210> 205
<211> 22
<212> DNA
<213> Artificial Sequence
<220>
<223> Reverse Primer of TXNRD1-GPR133
<400> 205
agtacacgaa gactcggttg ct 22
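As an illustration of how a primer pair from this listing relates to its fusion gene, the sketch below locates the SCAF11-PDGFRA primers (SEQ ID NOs: 184 and 185) on the start of the fusion gene (SEQ ID NO: 180) and computes the expected amplicon length. It is a plain string search under the assumption of perfect primer matching, not a PCR simulation; the template is rebuilt from SEQ ID NO: 176 plus the first 60 nucleotides of SEQ ID NO: 178, and the helper names are hypothetical.

```python
def clean(block: str) -> str:
    """Strip layout whitespace from a sequence-listing block."""
    return "".join(block.split())


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a lowercase DNA sequence."""
    return seq.translate(str.maketrans("acgt", "tgca"))[::-1]


# SEQ ID NO: 176 (SCAF11 fragment, 266 nt).
SCAF11 = clean("""
ggaggggagg ggagggagga ggctagacaa ggcgggggaa gggggagtag cggtggctta
agccgcgcgg agcagcgcaa cctgggtcgc tccctgcttc gccgccgcct ccggaccgag
ccagcggagt cagtgtccta gagaccctgt aacaccacaa agcggacgaa ggagtccatg
ttggggaact tggcagcgga gtgactggga cctgggaacc tactgtgggg ccgcggccgg
accgagcgcc tcgacctcgg tctgag
""")

# First 60 nt of SEQ ID NO: 178 (PDGFRA fragment).
PDGFRA_HEAD = clean(
    "tttcccagag ctatggggac ttcccatccg gcgttcctgg tcttaggctg tcttctcaca"
)

# Start of the fusion gene: SCAF11 fragment joined directly to PDGFRA.
template = SCAF11 + PDGFRA_HEAD
junction_index = len(SCAF11)  # 0-based index of the first PDGFRA base

FORWARD = clean("cagcggagtc agtgtcctag ag")  # SEQ ID NO: 184
REVERSE = clean("tgagaagaca gcctaagacc ag")  # SEQ ID NO: 185

# The forward primer matches the template strand directly; the reverse primer
# anneals to the opposite strand, so search for its reverse complement.
fwd_start = template.find(FORWARD)
rev_site = reverse_complement(REVERSE)
rev_start = template.find(rev_site)
amplicon_end = rev_start + len(rev_site)

amplicon_len = amplicon_end - fwd_start
spans_junction = fwd_start < junction_index < amplicon_end

print(fwd_start, rev_start, amplicon_len, spans_junction)
```

Because the forward primer sits in the SCAF11 part and the reverse primer in the PDGFRA part, a product of this length is only expected when the two fragments are joined.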
Claims (23)
- A ZFYVE9-CGA fusion protein in which a ZFYVE9 (zinc finger, FYVE domain containing 9) protein or a fragment thereof is fused to a CGA (glycoprotein hormones, alpha polypeptide) protein or a fragment thereof at the C-terminal side.
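- The fusion protein of claim 1, wherein
the fragment of the ZFYVE9 protein has an amino acid sequence encoded by the nucleotide sequence from the first exon through exon 16 of NM_007324 or NM_004799, and
the fragment of the CGA protein has an amino acid sequence encoded by the nucleotide sequence from exon 2 through the last exon of NM_000735.
- The fusion protein of claim 1,
which is encoded by a fusion gene in which the nucleotide sequence from position 173 to position 3828 of NM_007324, at the 5' end, is linked to the nucleotide sequence from position 136 to position 493 of NM_000735, at the 3' end.
- The fusion protein of claim 3,
wherein the fusion protein has the amino acid sequence of SEQ ID NO: 126.
- A fusion gene encoding the fusion protein of any one of claims 1 to 4.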
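- The fusion gene of claim 5,
wherein the fusion gene has the nucleotide sequence of SEQ ID NO: 124.
- A composition for diagnosing cancer, comprising at least one selected from the group consisting of
a molecule that specifically binds to the fusion protein of any one of claims 1 to 4, and
a molecule that specifically binds to a fusion gene encoding the fusion protein.
- The composition for diagnosing cancer of claim 7, wherein
the molecule that specifically binds to the fusion protein is selected from the group consisting of an antibody and an aptamer, and
the molecule that specifically binds to the fusion gene encoding the fusion protein is a polynucleotide capable of hybridizing with a sequence of 20 to 100 bases adjacent to either end of a DNA molecule, or with a sequence complementary thereto, so that the DNA molecule can be amplified, the DNA molecule consisting of 50 to 250 consecutive bases that include the fusion region (SEQ ID NO: 125) within the fusion gene.
- The composition for diagnosing cancer of claim 8,
wherein the hybridizable polynucleotide is the primer pair of SEQ ID NO: 198 and SEQ ID NO: 199.
- The composition for diagnosing cancer of claim 7, wherein the cancer is a solid cancer.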
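- The composition for diagnosing cancer of claim 10, wherein the cancer is lung cancer.
- The composition for diagnosing cancer of claim 11, wherein the cancer is non-small cell lung cancer (NSCLC).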
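- The composition for diagnosing cancer of claim 7,
further comprising a molecule that specifically binds to at least one selected from the group consisting of the following:
a CCDC6-ROS1 fusion protein in which a CCDC6 protein or a fragment thereof is fused to a ROS1 protein or a fragment thereof;
an FGFR2-CIT fusion protein in which an FGFR2 protein or a fragment thereof is fused to a CIT protein or a fragment thereof;
an AXL-MBIP fusion protein in which an AXL protein or a fragment thereof is fused to an MBIP protein or a fragment thereof;
an APLP2-TNFSF11 fusion protein in which an APLP2 protein or a fragment thereof is fused to a TNFSF11 protein or a fragment thereof;
a MAP4K3-PRKCE fusion protein in which a MAP4K3 protein or a fragment thereof is fused to a PRKCE protein or a fragment thereof;
a BCAS3-MAP3K3 fusion protein in which a BCAS3 protein or a fragment thereof is fused to a MAP3K3 protein or a fragment thereof;
a KRAS-CDH13 fusion protein in which a KRAS protein or a fragment thereof is fused to a CDH13 protein or a fragment thereof;
an ERBB2IP-MAST4 fusion protein in which an ERBB2IP protein or a fragment thereof is fused to a MAST4 protein or a fragment thereof;
a TPD52L1-TRMT11 fusion protein in which a TPD52L1 protein or a fragment thereof is fused to a TRMT11 protein or a fragment thereof;
a TXNRD1-GPR133 fusion protein in which a TXNRD1 protein or a fragment thereof is fused to a GPR133 protein or a fragment thereof;
a fusion gene encoding any of the above fusion proteins; and
a SCAF11-PDGFRA fusion gene in which the 5' UTR region of SCAF11, consisting of the nucleotide sequence of exon 1 of NM_004719, is fused to a PDGFRA gene fragment consisting of the nucleotide sequence from exon 2 through the last exon of NM_006206.
- The composition for diagnosing cancer of claim 13,
wherein the specifically binding molecule is
at least one selected from the group consisting of the primer pair of SEQ ID NO: 182 and SEQ ID NO: 183, the primer pair of SEQ ID NO: 184 and SEQ ID NO: 185, the primer pair of SEQ ID NO: 186 and SEQ ID NO: 187, the primer pair of SEQ ID NO: 188 and SEQ ID NO: 189, the primer pair of SEQ ID NO: 190 and SEQ ID NO: 191, the primer pair of SEQ ID NO: 192 and SEQ ID NO: 193, the primer pair of SEQ ID NO: 194 and SEQ ID NO: 195, the primer pair of SEQ ID NO: 196 and SEQ ID NO: 197, the primer pair of SEQ ID NO: 200 and SEQ ID NO: 201, the primer pair of SEQ ID NO: 202 and SEQ ID NO: 203, and the primer pair of SEQ ID NO: 204 and SEQ ID NO: 205.
- A method of providing information for cancer diagnosis, comprising detecting, in a biological sample isolated from a patient, the fusion protein of any one of claims 1 to 4, a fusion gene encoding the fusion protein, or an mRNA corresponding to the fusion gene,
wherein the patient is determined to be a cancer patient when the fusion protein, fusion gene, or mRNA is detected.
- The method of providing information for cancer diagnosis of claim 15, wherein the cancer is a solid cancer.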
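- The method of providing information for cancer diagnosis of claim 16, wherein the cancer is lung cancer.
- The method of providing information for cancer diagnosis of claim 17, wherein the cancer is non-small cell lung cancer (NSCLC).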
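- The method of claim 15,
further comprising detecting at least one selected from the group consisting of the following, the method providing information for lung cancer diagnosis:
a CCDC6-ROS1 fusion protein in which a CCDC6 protein or a fragment thereof is fused to a ROS1 protein or a fragment thereof;
an FGFR2-CIT fusion protein in which an FGFR2 protein or a fragment thereof is fused to a CIT protein or a fragment thereof;
an AXL-MBIP fusion protein in which an AXL protein or a fragment thereof is fused to an MBIP protein or a fragment thereof;
an APLP2-TNFSF11 fusion protein in which an APLP2 protein or a fragment thereof is fused to a TNFSF11 protein or a fragment thereof;
a MAP4K3-PRKCE fusion protein in which a MAP4K3 protein or a fragment thereof is fused to a PRKCE protein or a fragment thereof;
a BCAS3-MAP3K3 fusion protein in which a BCAS3 protein or a fragment thereof is fused to a MAP3K3 protein or a fragment thereof;
a KRAS-CDH13 fusion protein in which a KRAS protein or a fragment thereof is fused to a CDH13 protein or a fragment thereof;
an ERBB2IP-MAST4 fusion protein in which an ERBB2IP protein or a fragment thereof is fused to a MAST4 protein or a fragment thereof;
a TPD52L1-TRMT11 fusion protein in which a TPD52L1 protein or a fragment thereof is fused to a TRMT11 protein or a fragment thereof;
a TXNRD1-GPR133 fusion protein in which a TXNRD1 protein or a fragment thereof is fused to a GPR133 protein or a fragment thereof;
a fusion gene encoding any of the above fusion proteins, or an mRNA corresponding thereto; and
a SCAF11-PDGFRA fusion gene in which the 5' UTR region of SCAF11, consisting of the nucleotide sequence of exon 1 of NM_004719, is fused to a PDGFRA gene fragment consisting of the nucleotide sequence from exon 2 through the last exon of NM_006206, or an mRNA corresponding thereto.
- A method of screening for an anticancer agent, comprising: treating a cell expressing the fusion protein of any one of claims 1 to 4 with a candidate substance; and
measuring the expression level of the fusion protein in the cell,
wherein the candidate substance is determined to be an anticancer agent when the expression level of the fusion protein in the cell treated with the candidate substance is decreased compared with the level before treatment with the candidate substance or with the level in a cell not treated with the candidate substance.
- The method of screening for an anticancer agent of claim 20,
wherein the cell additionally expresses or comprises at least one selected from the group consisting of the following:
a CCDC6-ROS1 fusion protein in which a CCDC6 protein or a fragment thereof is fused to a ROS1 protein or a fragment thereof;
an FGFR2-CIT fusion protein in which an FGFR2 protein or a fragment thereof is fused to a CIT protein or a fragment thereof;
an AXL-MBIP fusion protein in which an AXL protein or a fragment thereof is fused to an MBIP protein or a fragment thereof;
an APLP2-TNFSF11 fusion protein in which an APLP2 protein or a fragment thereof is fused to a TNFSF11 protein or a fragment thereof;
a MAP4K3-PRKCE fusion protein in which a MAP4K3 protein or a fragment thereof is fused to a PRKCE protein or a fragment thereof;
a BCAS3-MAP3K3 fusion protein in which a BCAS3 protein or a fragment thereof is fused to a MAP3K3 protein or a fragment thereof;
a KRAS-CDH13 fusion protein in which a KRAS protein or a fragment thereof is fused to a CDH13 protein or a fragment thereof;
an ERBB2IP-MAST4 fusion protein in which an ERBB2IP protein or a fragment thereof is fused to a MAST4 protein or a fragment thereof;
a TPD52L1-TRMT11 fusion protein in which a TPD52L1 protein or a fragment thereof is fused to a TRMT11 protein or a fragment thereof;
a TXNRD1-GPR133 fusion protein in which a TXNRD1 protein or a fragment thereof is fused to a GPR133 protein or a fragment thereof;
a fusion gene encoding any of the above fusion proteins; and
a SCAF11-PDGFRA fusion gene in which the 5' UTR region of SCAF11, consisting of the nucleotide sequence of exon 1 of NM_004719, is fused to a PDGFRA gene fragment consisting of the nucleotide sequence from exon 2 through the last exon of NM_006206.
- A composition for preventing or treating cancer, comprising as an active ingredient at least one selected from the group consisting of an inhibitor of the fusion protein of any one of claims 1 to 4 and an inhibitor of a polynucleotide molecule encoding the fusion protein.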
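- The composition for preventing or treating cancer of claim 22, wherein
the inhibitor of the fusion protein is at least one selected from the group consisting of an antibody against the fusion protein, an aptamer, a kinase inhibitor, and a signal transduction inhibitor, and
the inhibitor of the polynucleotide molecule encoding the fusion protein is at least one selected from the group consisting of an siRNA, an shRNA, and an aptamer that specifically bind to the polynucleotide molecule.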
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020120099613A KR20140033619A (ko) | 2012-09-07 | 2012-09-07 | Fusion protein comprising ZFYVE9 and composition for diagnosing cancer comprising same |
PCT/KR2013/008066 WO2014038884A1 (ko) | 2012-09-07 | 2013-09-06 | Fusion protein and composition for diagnosing cancer comprising same |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020120099613A KR20140033619A (ko) | 2012-09-07 | 2012-09-07 | Fusion protein comprising ZFYVE9 and composition for diagnosing cancer comprising same |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20140033619A (ko) | 2014-03-19 |
Family
ID=50644399
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020120099613A KR20140033619A (ko) | 2012-09-07 | 2012-09-07 | Fusion protein comprising ZFYVE9 and composition for diagnosing cancer comprising same |
Country Status (1)
Country | Link |
---|---|
KR (1) | KR20140033619A (ko) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2023220688A1 (en) * | 2022-05-11 | 2023-11-16 | The Johns Hopkins University | Engineering of multifunctional peptides for controlled drug delivery |
- 2012-09-07: KR application KR1020120099613A, published as KR20140033619A (ko); status: not active, application discontinued
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2020270508B2 (en) | C/EBP alpha short activating RNA compositions and methods of use | |
AU2017267184B2 (en) | Method for assessing a prognosis and predicting the response of patients with malignant diseases to immunotherapy | |
US11103538B2 (en) | Targeting epigenetic regulators using a bacterial delivery system | |
US6262333B1 (en) | Human genes and gene expression products | |
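KR20180020125A (ko) | Modified T cells and methods of making and using same | |
KR20150129847A (ko) | Fusion proteins and methods thereof | |
KR20120082906A (ko) | Method for regulating autophagy by modulating autophagy-enhancing gene products | |
KR20080043892A (ko) | Single-copy genomic hybridization probes and method of generating same | |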
US20090305284A1 (en) | Methods for Identifying Risk of Breast Cancer and Treatments Thereof | |
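JP2003088388A (ja) | Novel full-length cDNA | |
JP2003135075A (ja) | Novel full-length cDNA | |
KR20060135945A (ko) | Cancer cell-specific apoptosis inducer targeting genes involved in chromosome stabilization | |
KR20220054401A (ko) | Systems, methods and compositions for rapid early detection of host RNA biomarkers of infection and early identification of COVID-19 coronavirus infection in humans | |
CN1704478A (zh) | Method for assessing patients with acute myeloid leukemia | |
KR102039311B1 (ko) | Fusion protein comprising AXL and composition for diagnosing cancer comprising same | |
JP2003159059A (ja) | Identification and use of pain-related molecules | |
KR102661616B1 (ko) | GPR156 variants and uses thereof | |
CN115151558A (zh) | Targeted integration in mammalian sequences enhances gene expression | |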
US20020137077A1 (en) | Genes regulated in activated T cells | |
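KR20140033619A (ko) | Fusion protein comprising ZFYVE9 and composition for diagnosing cancer comprising same | |
KR20140033618A (ko) | Fusion protein comprising APLP2 and composition for diagnosing cancer comprising same | |
KR20140033617A (ko) | Fusion protein comprising KRAS and composition for diagnosing cancer comprising same | |
KR20140033282A (ko) | Fusion protein comprising ROS1 and composition for diagnosing cancer comprising same | |
KR20140033283A (ko) | Fusion protein comprising FGFR2 and composition for diagnosing cancer comprising same | |
JP2002017375A (ja) | Primers for full-length cDNA synthesis, and uses thereof |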
Legal Events
Date | Code | Title | Description |
---|---|---|---|
N231 | Notification of change of applicant | ||
WITN | Withdrawal due to no request for examination |