US5734016A - Fuse binding protein and method for inhibiting expression of fuse binding protein - Google Patents
Fuse binding protein and method for inhibiting expression of fuse binding protein Download PDFInfo
- Publication number
- US5734016A US5734016A US08/726,160 US72616096A US5734016A US 5734016 A US5734016 A US 5734016A US 72616096 A US72616096 A US 72616096A US 5734016 A US5734016 A US 5734016A
- Authority
- US
- United States
- Prior art keywords
- gly
- pro
- gln
- ala
- ile
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 108091008324 binding proteins Proteins 0.000 title claims abstract description 29
- 238000000034 method Methods 0.000 title description 4
- 230000002401 inhibitory effect Effects 0.000 title description 2
- 102000014914 Carrier Proteins Human genes 0.000 title 2
- 102000023732 binding proteins Human genes 0.000 claims abstract description 27
- 230000004568 DNA-binding Effects 0.000 claims abstract description 25
- 101001030211 Homo sapiens Myc proto-oncogene protein Proteins 0.000 claims abstract description 10
- 108090000623 proteins and genes Proteins 0.000 claims description 98
- 102000004169 proteins and genes Human genes 0.000 claims description 88
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 claims description 63
- 150000001413 amino acids Chemical group 0.000 claims description 47
- 239000012634 fragment Substances 0.000 claims description 34
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 claims description 22
- 108010014614 prolyl-glycyl-proline Proteins 0.000 claims description 18
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 claims description 16
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 claims description 13
- 108010047495 alanylglycine Proteins 0.000 claims description 13
- 108010087924 alanylproline Proteins 0.000 claims description 12
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 claims description 12
- 108010055341 glutamyl-glutamic acid Proteins 0.000 claims description 12
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 claims description 11
- 108010010430 asparagine-proline-alanine Proteins 0.000 claims description 11
- 239000004471 Glycine Substances 0.000 claims description 10
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 claims description 10
- 108010089804 glycyl-threonine Proteins 0.000 claims description 10
- 108010036413 histidylglycine Proteins 0.000 claims description 10
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 claims description 9
- 108010077245 asparaginyl-proline Proteins 0.000 claims description 9
- 108010047857 aspartylglycine Proteins 0.000 claims description 9
- 108010051307 glycyl-glycyl-proline Proteins 0.000 claims description 9
- 108010061238 threonyl-glycine Proteins 0.000 claims description 9
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 8
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 claims description 8
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 claims description 8
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 claims description 8
- 108010068265 aspartyltyrosine Proteins 0.000 claims description 8
- 108010077112 prolyl-proline Proteins 0.000 claims description 8
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 claims description 8
- 108010079317 prolyl-tyrosine Proteins 0.000 claims description 8
- RRYLMJWPWBJFPZ-ACZMJKKPSA-N Gln-Asn-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RRYLMJWPWBJFPZ-ACZMJKKPSA-N 0.000 claims description 7
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 claims description 7
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 claims description 7
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 claims description 7
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 claims description 7
- 239000012190 activator Substances 0.000 claims description 7
- 108010010147 glycylglutamine Proteins 0.000 claims description 7
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 claims description 7
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 claims description 6
- 241000880493 Leptailurus serval Species 0.000 claims description 6
- MVBZBRKNZVJEKK-DTWKUNHWSA-N Met-Gly-Pro Chemical compound CSCC[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N MVBZBRKNZVJEKK-DTWKUNHWSA-N 0.000 claims description 6
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 claims description 6
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 claims description 6
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 claims description 6
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 claims description 6
- 108010008355 arginyl-glutamine Proteins 0.000 claims description 6
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 claims description 6
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 claims description 6
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 claims description 5
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 claims description 5
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 claims description 5
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 claims description 5
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 claims description 5
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 claims description 5
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 claims description 5
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 claims description 5
- 238000009007 Diagnostic Kit Methods 0.000 claims description 5
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 claims description 5
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 claims description 5
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 claims description 5
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 claims description 5
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 claims description 5
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 claims description 5
- 241000169446 Promethis Species 0.000 claims description 5
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 claims description 5
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 claims description 5
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 claims description 5
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 claims description 5
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 claims description 5
- 108010043293 glycyl-prolyl-glycyl-glycine Proteins 0.000 claims description 5
- 108010054155 lysyllysine Proteins 0.000 claims description 5
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 claims description 5
- 108010070643 prolylglutamic acid Proteins 0.000 claims description 5
- 108010029020 prolylglycine Proteins 0.000 claims description 5
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 claims description 4
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 claims description 4
- XAXHGSOBFPIRFG-LSJOCFKGSA-N Ala-Pro-His Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XAXHGSOBFPIRFG-LSJOCFKGSA-N 0.000 claims description 4
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 claims description 4
- ZVWXMTTZJKBJCI-BHDSKKPTSA-N Ala-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 ZVWXMTTZJKBJCI-BHDSKKPTSA-N 0.000 claims description 4
- PXAFZDXYEIIUTF-LKTVYLICSA-N Ala-Trp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXAFZDXYEIIUTF-LKTVYLICSA-N 0.000 claims description 4
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 claims description 4
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 claims description 4
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 claims description 4
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 claims description 4
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 claims description 4
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 claims description 4
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 claims description 4
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 claims description 4
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 claims description 4
- LFIVHGMKWFGUGK-IHRRRGAJSA-N Gln-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LFIVHGMKWFGUGK-IHRRRGAJSA-N 0.000 claims description 4
- JXBZEDIQFFCHPZ-PEFMBERDSA-N Gln-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JXBZEDIQFFCHPZ-PEFMBERDSA-N 0.000 claims description 4
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 claims description 4
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 claims description 4
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 claims description 4
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 claims description 4
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 claims description 4
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 claims description 4
- ZUPVLBAXUUGKKN-VHSXEESVSA-N His-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CN=CN2)N)C(=O)O ZUPVLBAXUUGKKN-VHSXEESVSA-N 0.000 claims description 4
- YKRYHWJRQUSTKG-KBIXCLLPSA-N Ile-Ala-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKRYHWJRQUSTKG-KBIXCLLPSA-N 0.000 claims description 4
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 claims description 4
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 claims description 4
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 claims description 4
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 claims description 4
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 claims description 4
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 claims description 4
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 claims description 4
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 claims description 4
- DAHQKYYIXPBESV-UWVGGRQHSA-N Lys-Met-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O DAHQKYYIXPBESV-UWVGGRQHSA-N 0.000 claims description 4
- KFSALEZVQJYHCE-AVGNSLFASA-N Lys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N KFSALEZVQJYHCE-AVGNSLFASA-N 0.000 claims description 4
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 claims description 4
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 claims description 4
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 claims description 4
- RVYDCISQIGHAFC-ZPFDUUQYSA-N Met-Ile-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O RVYDCISQIGHAFC-ZPFDUUQYSA-N 0.000 claims description 4
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 claims description 4
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 claims description 4
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 claims description 4
- 108010002311 N-glycylglutamic acid Proteins 0.000 claims description 4
- 108010065395 Neuropep-1 Proteins 0.000 claims description 4
- GXDPQJUBLBZKDY-IAVJCBSLSA-N Phe-Ile-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GXDPQJUBLBZKDY-IAVJCBSLSA-N 0.000 claims description 4
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 claims description 4
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 claims description 4
- DLZBBDSPTJBOOD-BPNCWPANSA-N Pro-Tyr-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O DLZBBDSPTJBOOD-BPNCWPANSA-N 0.000 claims description 4
- QHSSUIHLAIWXEE-IHRRRGAJSA-N Pro-Tyr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O QHSSUIHLAIWXEE-IHRRRGAJSA-N 0.000 claims description 4
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 claims description 4
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 claims description 4
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 claims description 4
- DQDXHYIEITXNJY-BPUTZDHNSA-N Trp-Gln-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N DQDXHYIEITXNJY-BPUTZDHNSA-N 0.000 claims description 4
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 claims description 4
- QVYFTFIBKCDHIE-ACRUOGEOSA-N Tyr-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O QVYFTFIBKCDHIE-ACRUOGEOSA-N 0.000 claims description 4
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 claims description 4
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 claims description 4
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 claims description 4
- 108010040443 aspartyl-aspartic acid Proteins 0.000 claims description 4
- 108010093581 aspartyl-proline Proteins 0.000 claims description 4
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 claims description 4
- 108010015792 glycyllysine Proteins 0.000 claims description 4
- 102000053563 human MYC Human genes 0.000 claims description 4
- 108010000761 leucylarginine Proteins 0.000 claims description 4
- 108010017391 lysylvaline Proteins 0.000 claims description 4
- 108010005942 methionylglycine Proteins 0.000 claims description 4
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 claims description 4
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 claims description 3
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 claims description 3
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 claims description 3
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 claims description 3
- CQJHFKKGZXKZBC-BPNCWPANSA-N Ala-Pro-Tyr Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CQJHFKKGZXKZBC-BPNCWPANSA-N 0.000 claims description 3
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 claims description 3
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 claims description 3
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 claims description 3
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 claims description 3
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 claims description 3
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 claims description 3
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 claims description 3
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 claims description 3
- QTKYFZCMSQLYHI-UBHSHLNASA-N Asn-Trp-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O QTKYFZCMSQLYHI-UBHSHLNASA-N 0.000 claims description 3
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 claims description 3
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 claims description 3
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 claims description 3
- MWZSCEAYQCMROW-GUBZILKMSA-N Cys-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N MWZSCEAYQCMROW-GUBZILKMSA-N 0.000 claims description 3
- DLOHWQXXGMEZDW-CIUDSAMLSA-N Gln-Arg-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DLOHWQXXGMEZDW-CIUDSAMLSA-N 0.000 claims description 3
- CKNUKHBRCSMKMO-XHNCKOQMSA-N Gln-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O CKNUKHBRCSMKMO-XHNCKOQMSA-N 0.000 claims description 3
- GMGKDVVBSVVKCT-NUMRIWBASA-N Gln-Asn-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GMGKDVVBSVVKCT-NUMRIWBASA-N 0.000 claims description 3
- XFKUFUJECJUQTQ-CIUDSAMLSA-N Gln-Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XFKUFUJECJUQTQ-CIUDSAMLSA-N 0.000 claims description 3
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 claims description 3
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 claims description 3
- DOQUICBEISTQHE-CIUDSAMLSA-N Gln-Pro-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O DOQUICBEISTQHE-CIUDSAMLSA-N 0.000 claims description 3
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 claims description 3
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 claims description 3
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 claims description 3
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 claims description 3
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 claims description 3
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 claims description 3
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 claims description 3
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 claims description 3
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 claims description 3
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 claims description 3
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 claims description 3
- HFXJIZNEXNIZIJ-BQBZGAKWSA-N Gly-Glu-Gln Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFXJIZNEXNIZIJ-BQBZGAKWSA-N 0.000 claims description 3
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 claims description 3
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 claims description 3
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 claims description 3
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 claims description 3
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 claims description 3
- OEROYDLRVAYIMQ-YUMQZZPRSA-N His-Gly-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O OEROYDLRVAYIMQ-YUMQZZPRSA-N 0.000 claims description 3
- HIJIJPFILYPTFR-ACRUOGEOSA-N His-Tyr-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HIJIJPFILYPTFR-ACRUOGEOSA-N 0.000 claims description 3
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 claims description 3
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 claims description 3
- KMBPQYKVZBMRMH-PEFMBERDSA-N Ile-Gln-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KMBPQYKVZBMRMH-PEFMBERDSA-N 0.000 claims description 3
- BALLIXFZYSECCF-QEWYBTABSA-N Ile-Gln-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N BALLIXFZYSECCF-QEWYBTABSA-N 0.000 claims description 3
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 claims description 3
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 claims description 3
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 claims description 3
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 claims description 3
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 claims description 3
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 claims description 3
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 claims description 3
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 claims description 3
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 claims description 3
- RRIHXWPHQSXHAQ-XUXIUFHCSA-N Met-Ile-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O RRIHXWPHQSXHAQ-XUXIUFHCSA-N 0.000 claims description 3
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 claims description 3
- 108010079364 N-glycylalanine Proteins 0.000 claims description 3
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 claims description 3
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 claims description 3
- KDIIENQUNVNWHR-JYJNAYRXSA-N Pro-Arg-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KDIIENQUNVNWHR-JYJNAYRXSA-N 0.000 claims description 3
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 claims description 3
- KQCCDMFIALWGTL-GUBZILKMSA-N Pro-Asn-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 KQCCDMFIALWGTL-GUBZILKMSA-N 0.000 claims description 3
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 claims description 3
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 claims description 3
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 claims description 3
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 claims description 3
- MZNUJZBYRWXWLQ-AVGNSLFASA-N Pro-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 MZNUJZBYRWXWLQ-AVGNSLFASA-N 0.000 claims description 3
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 claims description 3
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 claims description 3
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 claims description 3
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 claims description 3
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 claims description 3
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 claims description 3
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 claims description 3
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 claims description 3
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 claims description 3
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 claims description 3
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 claims description 3
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 claims description 3
- FNOQJVHFVLVMOS-AAEUAGOBSA-N Trp-Gly-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N FNOQJVHFVLVMOS-AAEUAGOBSA-N 0.000 claims description 3
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 claims description 3
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 claims description 3
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 claims description 3
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 claims description 3
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 claims description 3
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 claims description 3
- 108010013835 arginine glutamate Proteins 0.000 claims description 3
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 claims description 3
- 108010082286 glycyl-seryl-alanine Proteins 0.000 claims description 3
- 108010040030 histidinoalanine Proteins 0.000 claims description 3
- 108010092114 histidylphenylalanine Proteins 0.000 claims description 3
- 108010073472 leucyl-prolyl-proline Proteins 0.000 claims description 3
- 108010024607 phenylalanylalanine Proteins 0.000 claims description 3
- 108010004914 prolylarginine Proteins 0.000 claims description 3
- 108010015385 valyl-prolyl-proline Proteins 0.000 claims description 3
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 claims description 2
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 claims description 2
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 claims description 2
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 claims description 2
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 claims description 2
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 claims description 2
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 claims description 2
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 claims description 2
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 claims description 2
- PAXHINASXXXILC-SRVKXCTJSA-N Asn-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)O PAXHINASXXXILC-SRVKXCTJSA-N 0.000 claims description 2
- MYVBTYXSWILFCG-BQBZGAKWSA-N Asn-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N MYVBTYXSWILFCG-BQBZGAKWSA-N 0.000 claims description 2
- ICAYWNTWHRRAQP-FXQIFTODSA-N Asp-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N ICAYWNTWHRRAQP-FXQIFTODSA-N 0.000 claims description 2
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 claims description 2
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 claims description 2
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 claims description 2
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 claims description 2
- YXPNKXFOBHRUBL-BJDJZHNGSA-N Cys-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N YXPNKXFOBHRUBL-BJDJZHNGSA-N 0.000 claims description 2
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 claims description 2
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 claims description 2
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 claims description 2
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 claims description 2
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 claims description 2
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 claims description 2
- GNBMOZPQUXTCRW-STQMWFEESA-N Gly-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)CN)C(O)=O)=CNC2=C1 GNBMOZPQUXTCRW-STQMWFEESA-N 0.000 claims description 2
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 claims description 2
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 claims description 2
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 claims description 2
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 claims description 2
- IMRNSEPSPFQNHF-STQMWFEESA-N Gly-Ser-Trp Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O IMRNSEPSPFQNHF-STQMWFEESA-N 0.000 claims description 2
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 claims description 2
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 claims description 2
- ZNNNYCXPCKACHX-DCAQKATOSA-N His-Gln-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNNNYCXPCKACHX-DCAQKATOSA-N 0.000 claims description 2
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 claims description 2
- LNDVNHOSZQPJGI-AVGNSLFASA-N His-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 LNDVNHOSZQPJGI-AVGNSLFASA-N 0.000 claims description 2
- YSMZBYPVVYSGOT-SZMVWBNQSA-N His-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N YSMZBYPVVYSGOT-SZMVWBNQSA-N 0.000 claims description 2
- FRDFAWHTPDKRHG-ULQDDVLXSA-N His-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 FRDFAWHTPDKRHG-ULQDDVLXSA-N 0.000 claims description 2
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 claims description 2
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 claims description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 claims description 2
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 claims description 2
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 claims description 2
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 claims description 2
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 claims description 2
- HIIZIQUUHIXUJY-GUBZILKMSA-N Lys-Asp-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HIIZIQUUHIXUJY-GUBZILKMSA-N 0.000 claims description 2
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 claims description 2
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 claims description 2
- OJDFAABAHBPVTH-MNXVOIDGSA-N Lys-Ile-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O OJDFAABAHBPVTH-MNXVOIDGSA-N 0.000 claims description 2
- QEDGNYFHLXXIDC-DCAQKATOSA-N Met-Pro-Gln Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O QEDGNYFHLXXIDC-DCAQKATOSA-N 0.000 claims description 2
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 claims description 2
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 claims description 2
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 claims description 2
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 claims description 2
- ODPIUQVTULPQEP-CIUDSAMLSA-N Pro-Gln-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ODPIUQVTULPQEP-CIUDSAMLSA-N 0.000 claims description 2
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 claims description 2
- QAAYIXYLEMRULP-SRVKXCTJSA-N Pro-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 QAAYIXYLEMRULP-SRVKXCTJSA-N 0.000 claims description 2
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 claims description 2
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 claims description 2
- UIUWGMRJTWHIJZ-ULQDDVLXSA-N Pro-Tyr-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O UIUWGMRJTWHIJZ-ULQDDVLXSA-N 0.000 claims description 2
- SVWQEIRZHHNBIO-WHFBIAKZSA-N Ser-Gly-Cys Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CS)C(O)=O SVWQEIRZHHNBIO-WHFBIAKZSA-N 0.000 claims description 2
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 claims description 2
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 claims description 2
- BDENGIGFTNYZSJ-RCWTZXSCSA-N Thr-Pro-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O BDENGIGFTNYZSJ-RCWTZXSCSA-N 0.000 claims description 2
- RYXOUTORDIUWNI-BPUTZDHNSA-N Trp-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RYXOUTORDIUWNI-BPUTZDHNSA-N 0.000 claims description 2
- QHONGSVIVOFKAC-ULQDDVLXSA-N Tyr-Pro-His Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QHONGSVIVOFKAC-ULQDDVLXSA-N 0.000 claims description 2
- VYQQQIRHIFALGE-UWJYBYFXSA-N Tyr-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VYQQQIRHIFALGE-UWJYBYFXSA-N 0.000 claims description 2
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 claims description 2
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 claims description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 claims description 2
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 claims description 2
- 108010031719 prolyl-serine Proteins 0.000 claims description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 claims description 2
- 108010073969 valyllysine Proteins 0.000 claims description 2
- 108010044940 alanylglutamine Proteins 0.000 claims 4
- ZZJVYSAQQMDIRD-UWVGGRQHSA-N Gly-Pro-His Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ZZJVYSAQQMDIRD-UWVGGRQHSA-N 0.000 claims 3
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 claims 3
- XJFPXLWGZWAWRQ-UHFFFAOYSA-N 2-[[2-[[2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(=O)NCC(O)=O XJFPXLWGZWAWRQ-UHFFFAOYSA-N 0.000 claims 2
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 claims 2
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 claims 2
- QIWYWCYNUMJBTC-CIUDSAMLSA-N Arg-Cys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIWYWCYNUMJBTC-CIUDSAMLSA-N 0.000 claims 2
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 claims 2
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 claims 2
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 claims 2
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 claims 2
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 claims 2
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 claims 2
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 claims 2
- RBWKVOSARCFSQQ-FXQIFTODSA-N Gln-Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O RBWKVOSARCFSQQ-FXQIFTODSA-N 0.000 claims 2
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 claims 2
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 claims 2
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 claims 2
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 claims 2
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 claims 2
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 claims 2
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 claims 2
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 claims 2
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 claims 2
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 claims 2
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 claims 2
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 claims 2
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 claims 2
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 claims 2
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 claims 2
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 claims 2
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 claims 2
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 claims 2
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 claims 2
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 claims 2
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 claims 2
- VSCIANXXVZOYOC-AVGNSLFASA-N Val-Pro-His Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VSCIANXXVZOYOC-AVGNSLFASA-N 0.000 claims 2
- 108010070783 alanyltyrosine Proteins 0.000 claims 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 claims 2
- 108010078326 glycyl-glycyl-valine Proteins 0.000 claims 2
- 108010050848 glycylleucine Proteins 0.000 claims 2
- 108010087823 glycyltyrosine Proteins 0.000 claims 2
- 108010057821 leucylproline Proteins 0.000 claims 2
- PEZMQPADLFXCJJ-ZETCQYMHSA-N 2-[[2-[[(2s)-1-(2-aminoacetyl)pyrrolidine-2-carbonyl]amino]acetyl]amino]acetic acid Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(=O)NCC(O)=O PEZMQPADLFXCJJ-ZETCQYMHSA-N 0.000 claims 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 claims 1
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 claims 1
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 claims 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 claims 1
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 claims 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 claims 1
- LBFXVAXPDOBRKU-LKTVYLICSA-N Ala-His-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LBFXVAXPDOBRKU-LKTVYLICSA-N 0.000 claims 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 claims 1
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 claims 1
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 claims 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 claims 1
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 claims 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 claims 1
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 claims 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 claims 1
- AKEBUSZTMQLNIX-UWJYBYFXSA-N Asn-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N AKEBUSZTMQLNIX-UWJYBYFXSA-N 0.000 claims 1
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 claims 1
- JZDZLBJVYWIIQU-AVGNSLFASA-N Asn-Glu-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JZDZLBJVYWIIQU-AVGNSLFASA-N 0.000 claims 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 claims 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 claims 1
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 claims 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 claims 1
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 claims 1
- CITDWMLWXNUQKD-FXQIFTODSA-N Gln-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CITDWMLWXNUQKD-FXQIFTODSA-N 0.000 claims 1
- BLOXULLYFRGYKZ-GUBZILKMSA-N Gln-Glu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BLOXULLYFRGYKZ-GUBZILKMSA-N 0.000 claims 1
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 claims 1
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 claims 1
- XKBASPWPBXNVLQ-WDSKDSINSA-N Gln-Gly-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XKBASPWPBXNVLQ-WDSKDSINSA-N 0.000 claims 1
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 claims 1
- CLPQUWHBWXFJOX-BQBZGAKWSA-N Gln-Gly-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O CLPQUWHBWXFJOX-BQBZGAKWSA-N 0.000 claims 1
- QQAPDATZKKTBIY-YUMQZZPRSA-N Gln-Gly-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O QQAPDATZKKTBIY-YUMQZZPRSA-N 0.000 claims 1
- NXPXQIZKDOXIHH-JSGCOSHPSA-N Gln-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N NXPXQIZKDOXIHH-JSGCOSHPSA-N 0.000 claims 1
- BVELAHPZLYLZDJ-HGNGGELXSA-N Gln-His-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O BVELAHPZLYLZDJ-HGNGGELXSA-N 0.000 claims 1
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 claims 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 claims 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 claims 1
- UXXIVIQGOODKQC-NUMRIWBASA-N Gln-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UXXIVIQGOODKQC-NUMRIWBASA-N 0.000 claims 1
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 claims 1
- UBRQJXFDVZNYJP-AVGNSLFASA-N Gln-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UBRQJXFDVZNYJP-AVGNSLFASA-N 0.000 claims 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 claims 1
- QBLCUWAGTGRXAY-UHFFFAOYSA-N Glu-Glu-Tyr-Tyr Chemical compound C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(NC(=O)C(CCC(O)=O)NC(=O)C(CCC(O)=O)N)CC1=CC=C(O)C=C1 QBLCUWAGTGRXAY-UHFFFAOYSA-N 0.000 claims 1
- ZMVCLTGPGWJAEE-JYJNAYRXSA-N Glu-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)O ZMVCLTGPGWJAEE-JYJNAYRXSA-N 0.000 claims 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 claims 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 claims 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 claims 1
- UERORLSAFUHDGU-AVGNSLFASA-N Glu-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UERORLSAFUHDGU-AVGNSLFASA-N 0.000 claims 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 claims 1
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 claims 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 claims 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 claims 1
- LGQZOQRDEUIZJY-YUMQZZPRSA-N Gly-Cys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CS)NC(=O)CN)C(O)=O LGQZOQRDEUIZJY-YUMQZZPRSA-N 0.000 claims 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 claims 1
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 claims 1
- NTOWAXLMQFKJPT-YUMQZZPRSA-N Gly-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN NTOWAXLMQFKJPT-YUMQZZPRSA-N 0.000 claims 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 claims 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 claims 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 claims 1
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 claims 1
- JNGHLWWFPGIJER-STQMWFEESA-N Gly-Pro-Tyr Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JNGHLWWFPGIJER-STQMWFEESA-N 0.000 claims 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 claims 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 claims 1
- JIUYRPFQJJRSJB-QWRGUYRKSA-N His-His-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JIUYRPFQJJRSJB-QWRGUYRKSA-N 0.000 claims 1
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 claims 1
- SYVMEYAPXRRXAN-MXAVVETBSA-N Ile-Cys-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N SYVMEYAPXRRXAN-MXAVVETBSA-N 0.000 claims 1
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 claims 1
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 claims 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 claims 1
- SVZFKLBRCYCIIY-CYDGBPFRSA-N Ile-Pro-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVZFKLBRCYCIIY-CYDGBPFRSA-N 0.000 claims 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 claims 1
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 claims 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 claims 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 claims 1
- LRKCBIUDWAXNEG-CSMHCCOUSA-N Leu-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRKCBIUDWAXNEG-CSMHCCOUSA-N 0.000 claims 1
- YRWCPXOFBKTCFY-NUTKFTJISA-N Lys-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N YRWCPXOFBKTCFY-NUTKFTJISA-N 0.000 claims 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 claims 1
- VQXAVLQBQJMENB-SRVKXCTJSA-N Lys-Glu-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O VQXAVLQBQJMENB-SRVKXCTJSA-N 0.000 claims 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 claims 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 claims 1
- HZLSUXCMSIBCRV-RVMXOQNASA-N Met-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N HZLSUXCMSIBCRV-RVMXOQNASA-N 0.000 claims 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 claims 1
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 claims 1
- MPGJIHFJCXTVEX-KKUMJFAQSA-N Phe-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O MPGJIHFJCXTVEX-KKUMJFAQSA-N 0.000 claims 1
- NAOVYENZCWFBDG-BZSNNMDCSA-N Phe-His-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 NAOVYENZCWFBDG-BZSNNMDCSA-N 0.000 claims 1
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 claims 1
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 claims 1
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 claims 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 claims 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 claims 1
- CMOIIANLNNYUTP-SRVKXCTJSA-N Pro-Gln-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CMOIIANLNNYUTP-SRVKXCTJSA-N 0.000 claims 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 claims 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 claims 1
- GSPPWVHVBBSPSY-FHWLQOOXSA-N Pro-His-Trp Chemical compound OC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](Cc1cnc[nH]1)NC(=O)[C@@H]1CCCN1 GSPPWVHVBBSPSY-FHWLQOOXSA-N 0.000 claims 1
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 claims 1
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 claims 1
- UCOYFSCEIWQYNL-FXQIFTODSA-N Ser-Cys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(O)=O UCOYFSCEIWQYNL-FXQIFTODSA-N 0.000 claims 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 claims 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 claims 1
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 claims 1
- MQVGIFJSFFVGFW-XEGUGMAKSA-N Trp-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MQVGIFJSFFVGFW-XEGUGMAKSA-N 0.000 claims 1
- SDNVRAKIJVKAGS-LKTVYLICSA-N Tyr-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N SDNVRAKIJVKAGS-LKTVYLICSA-N 0.000 claims 1
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 claims 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 claims 1
- DJSYPCWZPNHQQE-FHWLQOOXSA-N Tyr-Tyr-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=C(O)C=C1 DJSYPCWZPNHQQE-FHWLQOOXSA-N 0.000 claims 1
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 claims 1
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 claims 1
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 claims 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 claims 1
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 claims 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 claims 1
- IEBGHUMBJXIXHM-AVGNSLFASA-N Val-Lys-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N IEBGHUMBJXIXHM-AVGNSLFASA-N 0.000 claims 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 claims 1
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 claims 1
- 108010005233 alanylglutamic acid Proteins 0.000 claims 1
- 230000001747 exhibiting effect Effects 0.000 claims 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 claims 1
- 108010085325 histidylproline Proteins 0.000 claims 1
- 108010064235 lysylglycine Proteins 0.000 claims 1
- 108010026333 seryl-proline Proteins 0.000 claims 1
- 108010038745 tryptophylglycine Proteins 0.000 claims 1
- 108010020532 tyrosyl-proline Proteins 0.000 claims 1
- 108010003137 tyrosyltyrosine Proteins 0.000 claims 1
- 230000027455 binding Effects 0.000 abstract description 24
- 108091034117 Oligonucleotide Proteins 0.000 abstract description 23
- 101710135898 Myc proto-oncogene protein Proteins 0.000 abstract description 15
- 102100038895 Myc proto-oncogene protein Human genes 0.000 abstract description 15
- 101710150448 Transcriptional regulator Myc Proteins 0.000 abstract description 15
- 230000004069 differentiation Effects 0.000 abstract description 8
- 108020004999 messenger RNA Proteins 0.000 abstract description 8
- 238000011144 upstream manufacturing Methods 0.000 abstract description 6
- 239000000284 extract Substances 0.000 abstract description 5
- 230000001419 dependent effect Effects 0.000 abstract description 2
- 230000009870 specific binding Effects 0.000 abstract description 2
- 102000003839 Human Proteins Human genes 0.000 abstract 1
- 108090000144 Human Proteins Proteins 0.000 abstract 1
- 230000003247 decreasing effect Effects 0.000 abstract 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 abstract 1
- 235000018102 proteins Nutrition 0.000 description 56
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 description 43
- 239000002299 complementary DNA Substances 0.000 description 41
- 235000001014 amino acid Nutrition 0.000 description 34
- 229940024606 amino acid Drugs 0.000 description 32
- 210000004027 cell Anatomy 0.000 description 26
- 108090000765 processed proteins & peptides Proteins 0.000 description 23
- 239000013612 plasmid Substances 0.000 description 19
- 150000007523 nucleic acids Chemical class 0.000 description 18
- 102000039446 nucleic acids Human genes 0.000 description 17
- 108020004707 nucleic acids Proteins 0.000 description 17
- 108020004414 DNA Proteins 0.000 description 15
- 238000003780 insertion Methods 0.000 description 13
- 230000037431 insertion Effects 0.000 description 13
- 239000000523 sample Substances 0.000 description 13
- 239000002773 nucleotide Substances 0.000 description 12
- 125000003729 nucleotide group Chemical group 0.000 description 12
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 11
- 125000003275 alpha amino acid group Chemical group 0.000 description 10
- 230000000692 anti-sense effect Effects 0.000 description 10
- 102000005720 Glutathione transferase Human genes 0.000 description 9
- 108010070675 Glutathione transferase Proteins 0.000 description 9
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 8
- 108700026244 Open Reading Frames Proteins 0.000 description 8
- 108020004705 Codon Proteins 0.000 description 7
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 7
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 7
- 238000012217 deletion Methods 0.000 description 7
- 230000037430 deletion Effects 0.000 description 7
- 102000037865 fusion proteins Human genes 0.000 description 7
- 108020001507 fusion proteins Proteins 0.000 description 7
- 102000004196 processed proteins & peptides Human genes 0.000 description 7
- 230000001105 regulatory effect Effects 0.000 description 7
- 239000013598 vector Substances 0.000 description 7
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 6
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 6
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 6
- 239000013604 expression vector Substances 0.000 description 6
- 239000013615 primer Substances 0.000 description 6
- 108091028043 Nucleic acid sequence Proteins 0.000 description 5
- 101000930770 Homo sapiens Far upstream element-binding protein 1 Proteins 0.000 description 4
- 101001023230 Homo sapiens Folate receptor alpha Proteins 0.000 description 4
- 101001028852 Homo sapiens Fructose-1,6-bisphosphatase 1 Proteins 0.000 description 4
- 230000005856 abnormality Effects 0.000 description 4
- 210000004899 c-terminal region Anatomy 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 4
- 230000002068 genetic effect Effects 0.000 description 4
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 4
- 229960002743 glutamine Drugs 0.000 description 4
- 102000047994 human FBP1 Human genes 0.000 description 4
- 230000000977 initiatory effect Effects 0.000 description 4
- 108091027963 non-coding RNA Proteins 0.000 description 4
- 102000042567 non-coding RNA Human genes 0.000 description 4
- 239000000047 product Substances 0.000 description 4
- 230000002285 radioactive effect Effects 0.000 description 4
- 230000002829 reductive effect Effects 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical group O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 3
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 3
- 206010028980 Neoplasm Diseases 0.000 description 3
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 3
- 210000001744 T-lymphocyte Anatomy 0.000 description 3
- 230000033228 biological regulation Effects 0.000 description 3
- 230000010261 cell growth Effects 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 239000003623 enhancer Substances 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 210000005104 human peripheral blood lymphocyte Anatomy 0.000 description 3
- 210000004698 lymphocyte Anatomy 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 108700024542 myc Genes Proteins 0.000 description 3
- 230000001575 pathological effect Effects 0.000 description 3
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 3
- 230000000638 stimulation Effects 0.000 description 3
- 108091035539 telomere Proteins 0.000 description 3
- 102000055501 telomere Human genes 0.000 description 3
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 3
- QMOQBVOBWVNSNO-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QMOQBVOBWVNSNO-UHFFFAOYSA-N 0.000 description 2
- 108020005029 5' Flanking Region Proteins 0.000 description 2
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 2
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 2
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 2
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- 108700010070 Codon Usage Proteins 0.000 description 2
- 238000012287 DNA Binding Assay Methods 0.000 description 2
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 2
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 2
- KMSGYZQRXPUKGI-BYPYZUCNSA-N Gly-Gly-Asn Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(N)=O KMSGYZQRXPUKGI-BYPYZUCNSA-N 0.000 description 2
- 229930010555 Inosine Natural products 0.000 description 2
- RNKSNIBMTUYWSH-YFKPBYRVSA-N L-prolylglycine Chemical group [O-]C(=O)CNC(=O)[C@@H]1CCC[NH2+]1 RNKSNIBMTUYWSH-YFKPBYRVSA-N 0.000 description 2
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 2
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- 238000010240 RT-PCR analysis Methods 0.000 description 2
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 2
- 108091027981 Response element Proteins 0.000 description 2
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 2
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 2
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 238000000376 autoradiography Methods 0.000 description 2
- 210000003719 b-lymphocyte Anatomy 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 201000011510 cancer Diseases 0.000 description 2
- 230000011712 cell development Effects 0.000 description 2
- 238000012512 characterization method Methods 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 239000002131 composite material Substances 0.000 description 2
- 238000005520 cutting process Methods 0.000 description 2
- 238000003745 diagnosis Methods 0.000 description 2
- 230000008034 disappearance Effects 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 238000002337 electrophoretic mobility shift assay Methods 0.000 description 2
- 239000013613 expression plasmid Substances 0.000 description 2
- 239000012894 fetal calf serum Substances 0.000 description 2
- 102000034238 globular proteins Human genes 0.000 description 2
- 108091005896 globular proteins Proteins 0.000 description 2
- RWSXRVCMGQZWBV-WDSKDSINSA-N glutathione Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@@H](CS)C(=O)NCC(O)=O RWSXRVCMGQZWBV-WDSKDSINSA-N 0.000 description 2
- 150000002333 glycines Chemical class 0.000 description 2
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 229960003786 inosine Drugs 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 230000002427 irreversible effect Effects 0.000 description 2
- 208000032839 leukemia Diseases 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 239000002853 nucleic acid probe Substances 0.000 description 2
- 230000026731 phosphorylation Effects 0.000 description 2
- 238000006366 phosphorylation reaction Methods 0.000 description 2
- 229920001184 polypeptide Polymers 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 230000001225 therapeutic effect Effects 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 1
- 108020000948 Antisense Oligonucleotides Proteins 0.000 description 1
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 1
- AUIJUTGLPVHIRT-FXQIFTODSA-N Arg-Ser-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AUIJUTGLPVHIRT-FXQIFTODSA-N 0.000 description 1
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 1
- QGABLMITFKUQDF-DCAQKATOSA-N Asn-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGABLMITFKUQDF-DCAQKATOSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 238000009010 Bradford assay Methods 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 241000701022 Cytomegalovirus Species 0.000 description 1
- 108010017826 DNA Polymerase I Proteins 0.000 description 1
- 102000004594 DNA Polymerase I Human genes 0.000 description 1
- 230000008301 DNA looping mechanism Effects 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 241000388186 Deltapapillomavirus 4 Species 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- GURIQZQSTBBHRV-SRVKXCTJSA-N Gln-Lys-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GURIQZQSTBBHRV-SRVKXCTJSA-N 0.000 description 1
- UKKNTTCNGZLJEX-WHFBIAKZSA-N Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(O)=O UKKNTTCNGZLJEX-WHFBIAKZSA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 description 1
- GMAGZGCAYLQBKF-NHCYSSNCSA-N Glu-Met-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GMAGZGCAYLQBKF-NHCYSSNCSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- 108010024636 Glutathione Proteins 0.000 description 1
- JLJLBWDKDRYOPA-RYUDHWBXSA-N Gly-Gln-Tyr Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JLJLBWDKDRYOPA-RYUDHWBXSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- LJKDGRWXYUTRSH-YVNDNENWSA-N Ile-Gln-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LJKDGRWXYUTRSH-YVNDNENWSA-N 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- 101710125507 Integrase/recombinase Proteins 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- 102000003855 L-lactate dehydrogenase Human genes 0.000 description 1
- 108700023483 L-lactate dehydrogenases Proteins 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- 206010025323 Lymphomas Diseases 0.000 description 1
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 1
- 102000008300 Mutant Proteins Human genes 0.000 description 1
- 108010021466 Mutant Proteins Proteins 0.000 description 1
- 101710150912 Myc protein Proteins 0.000 description 1
- 108700001591 MyoD Proteins 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- 239000000020 Nitrocellulose Substances 0.000 description 1
- 241001631646 Papillomaviridae Species 0.000 description 1
- 229930182555 Penicillin Natural products 0.000 description 1
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- 102000012288 Phosphopyruvate Hydratase Human genes 0.000 description 1
- 108010022181 Phosphopyruvate Hydratase Proteins 0.000 description 1
- 208000007452 Plasmacytoma Diseases 0.000 description 1
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 1
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- FELJDCNGZFDUNR-WDSKDSINSA-N Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FELJDCNGZFDUNR-WDSKDSINSA-N 0.000 description 1
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 1
- 102000001708 Protein Isoforms Human genes 0.000 description 1
- 108010029485 Protein Isoforms Proteins 0.000 description 1
- 102000004022 Protein-Tyrosine Kinases Human genes 0.000 description 1
- 108090000412 Protein-Tyrosine Kinases Proteins 0.000 description 1
- 108020005067 RNA Splice Sites Proteins 0.000 description 1
- 230000026279 RNA modification Effects 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 239000012979 RPMI medium Substances 0.000 description 1
- 101000702488 Rattus norvegicus High affinity cationic amino acid transporter 1 Proteins 0.000 description 1
- 239000006146 Roswell Park Memorial Institute medium Substances 0.000 description 1
- RZEQTVHJZCIUBT-WDSKDSINSA-N Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RZEQTVHJZCIUBT-WDSKDSINSA-N 0.000 description 1
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 1
- DLPXTCTVNDTYGJ-JBDRJPRFSA-N Ser-Ile-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O DLPXTCTVNDTYGJ-JBDRJPRFSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- 108020005038 Terminator Codon Proteins 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- HPYDSVWYXXKHRD-VIFPVBQESA-N Tyr-Gly Chemical compound [O-]C(=O)CNC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 HPYDSVWYXXKHRD-VIFPVBQESA-N 0.000 description 1
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- XXDVDTMEVBYRPK-XPUUQOCRSA-N Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O XXDVDTMEVBYRPK-XPUUQOCRSA-N 0.000 description 1
- 240000004922 Vigna radiata Species 0.000 description 1
- 235000010721 Vigna radiata var radiata Nutrition 0.000 description 1
- 235000011469 Vigna radiata var sublobata Nutrition 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 239000000074 antisense oligonucleotide Substances 0.000 description 1
- 238000012230 antisense oligonucleotides Methods 0.000 description 1
- 238000003782 apoptosis assay Methods 0.000 description 1
- 230000006907 apoptotic process Effects 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 238000003149 assay kit Methods 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 101150055766 cat gene Proteins 0.000 description 1
- 230000032823 cell division Effects 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 230000004663 cell proliferation Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000033077 cellular process Effects 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000003831 deregulation Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 238000006471 dimerization reaction Methods 0.000 description 1
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 1
- 230000003828 downregulation Effects 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- WSFSSNUMVMOOMR-UHFFFAOYSA-N formaldehyde Substances O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 1
- 238000001415 gene therapy Methods 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 229960003180 glutathione Drugs 0.000 description 1
- KZNQNBZMBZJQJO-YFKPBYRVSA-N glyclproline Chemical compound NCC(=O)N1CCC[C@H]1C(O)=O KZNQNBZMBZJQJO-YFKPBYRVSA-N 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 210000004408 hybridoma Anatomy 0.000 description 1
- 125000001165 hydrophobic group Chemical group 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 230000002163 immunogen Effects 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 239000003999 initiator Substances 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- 229920001220 nitrocellulos Polymers 0.000 description 1
- 230000009701 normal cell proliferation Effects 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 229940049954 penicillin Drugs 0.000 description 1
- 210000005105 peripheral blood lymphocyte Anatomy 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 1
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000005522 programmed cell death Effects 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 230000000069 prophylactic effect Effects 0.000 description 1
- 239000011535 reaction buffer Substances 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 125000006850 spacer group Chemical group 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 230000004936 stimulating effect Effects 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 238000012916 structural analysis Methods 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- 239000001226 triphosphate Substances 0.000 description 1
- 235000011178 triphosphate Nutrition 0.000 description 1
- UNXRWKVEANCORM-UHFFFAOYSA-N triphosphoric acid Chemical compound OP(O)(=O)OP(O)(=O)OP(O)(O)=O UNXRWKVEANCORM-UHFFFAOYSA-N 0.000 description 1
- 230000004565 tumor cell growth Effects 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
- C07K14/4701—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals not used
- C07K14/4702—Regulators; Modulating activity
- C07K14/4705—Regulators; Modulating activity stimulating, promoting or activating activity
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Definitions
- the present invention relates to a novel human cDNA and the encoded protein which interacts with a cis-element activator, known as the FUSE binding protein.
- a novel DNA-binding regulator of c-myc expression (“DROME”) was purified, and then proteolytically and chemically degraded to peptides. These peptides were separated by HPLC and the sequences of multiple peptides were determined. Using the information from the peptide sequence, specific oligonucleotide primers were synthesized and then used as primers for the polymerase chain reaction employing human cDNA as a template. The resulting products were cloned and sequenced, and shown to encode additional peptides of the DROME protein.
- the c-myc proto-oncogene plays a central role in normal cell proliferation and programmed cell death (Y. Shi, J. Glynn, L. Guilbert, T. Cotter, R. Bissonnette, and D. Green, "Role for c-myc in activation-induced apoptotic cell death in T cell hybridomas," Science, 257:212-214 (1992)) and its deregulation contributes to the formation of a variety of tumors. (J. M. Bishop, Annu. Rev. Biochem. 52,301-354 (1983); M. D. Cole, "The myc oncogene: its role in transformation and differentiation.” Annu. Rev. Genet. 20, 361-384 (1986); S. Cory, Adv. Cancer Res. 47, 189-234 (1986)).
- FUSE Far Upstream Element
- DROME or FUSE binding protein FBP
- FBP FUSE binding protein
- the disappearance of this binding activity occurs 24 hours after addition of the differentiation agent coinciding with the loss of initiation of c-myc transcription.
- the FUSE site differs from other described positive regulatory elements for myc in a number of ways.
- the FUSE element Despite its placement a long distance from the transcription start site (-1500 bp relative to the myc P1 promoter), the FUSE element will not act as a traditional enhancer; multiple copies inserted upstream of a heterologous promoter do not stimulate transcription in transection experiments. However, when the FUSE site is present with additional c-myc regulatory sequences, specific stimulation of the c-myc promoter is observed, indicating that FUSE works in concert with other myc elements. These features suggest that the factor binding to this site may act to stimulate transcription by an unusual mechanism.
- the nucleotide sequence for a novel DNA-binding regulator of c-myc expression is provided, together with the amino acid sequence for the encoded protein.
- the protein interacts with an activator cis-element approximately 1500 basepairs 5' of the human c-myc gene promoter P1.
- the cDNA and variations thereof have the nucleotide sequences of SEQ ID NO:1, SEQ ID NO:5, SEQ ID NO:7 and SEQ ID NO:9, and the proteins have the amino acid sequences of SEQ ID NO:2, SEQ ID NO:6, SEQ ID NO:8 and SEQ ID NO:10.
- the gene encoding the DROME or FUSE binding protein is useful in the diagnosis of disease states.
- Specific nucleic acid probes derived from knowledge of the DROME sequence and genetic map for PCR or hybridization are useful to analyze mutations, translocations and other genetic derangements that are associated with abnormalities of DROME or c-myc expression. Because DROME is highly activated during T-cell stimulation, these probes are useful to analyze immune system abnormalities. The DROME probes are also useful to analyze a variety of physiologic and pathologic conditions.
- DROME knowledge of the cDNA and genomic structures of DROME allows the construction of vectors which express DROME or which express anti-sense DROME sequence. Oligonucleotide and expression vector anti-sense approaches to block DROME expression are useful to modulate DROME expression in vivo, which results in therapeutic modification of the levels of expression of genes regulated by DROME. Such strategies are of therapeutic value to certain pathologic conditions or can be used to provide prophylactic or beneficial changes in DROME expression in pre-pathologic conditions. If genetic disorders can be ascribed to abnormalities of the DROME protein or its expression, then gene therapy for such disorders will be heavily dependent on the information and materials derived from the characterization of the DROME gene and its cDNA.
- the DROME protein itself is useful as an immunogen to generate polyclonal and monoclonal antibodies which can then be used to detect and quantitate the DROME protein.
- the DROME protein itself is useful as a probe to identify and quantify proteins which interact with or modify DROME; similarly, if the DROME protein is immobilized as a ligand for larger scale operations, the DROME interacting or modifying molecules can be purified. All of these proteins can be employed singly or in combination together with existing technologies as assay kits to detect, quantitate and analyze DROME protein.
- DROME protein Structural analysis of the DROME protein has defined a new DNA binding motif composed of a unique repeating element followed by an amphipathic helix. Knowledge of these structural elements together with knowledge of the complete cDNA sequence permits the identification, cloning and characterization of the genes for related proteins and DROME homologs using hybridization based or PCR based methods.
- DROME homologs and other proteins with the newly defined DNA binding motif are also important regulators of cell growth and other cellular processes.
- the same approaches used to modify DROME expression by sense and anti-sense vectors and anti-sense oligonucleotides is applied to DROME homologs and proteins sharing the DROME DNA binding motif in order to alter cell growth and cellular properties.
- These proteins or fragments thereof, which are either DROME homologs or recognizable as DNA binding proteins because they possess the DNA binding motif described herein, are useful for the generation of antibodies and diagnostic kits to relate these molecules to cancer, genetic and other human maladies. These proteins, antibodies and kits are useful in the diagnosis, treatment and study of human diseases.
- altered forms of the protein are identified and characterized utilizing the protein, antibody and nucleic acid probes outlined above.
- Each of these new proteins and/or genes all related to DROME are then subjected to similar analyses.
- FIGS. 1A and 1B depict radiographs showing reduced expression over time of FBP mRNA.
- FIG. 2 gives the primary structure of DROME or FUSE binding protein (FBP) cDNA sequences.
- FIGS. 3A and 3B depict DNA binding assay radiographs showing that recombinant FBP binds specifically to the far upstream element.
- FIG. 4 shows assays depicting recombinant FBP activated expression of a chloramphenicol acetyltransferase (CAT) gene under the control of human c-myc regulatory sequence.
- CAT chloramphenicol acetyltransferase
- FIGS. 5A-5C show the structural features of the primary amino acid sequence of FBP.
- FIGS. 6A-6C illustrates a mutational analysis of the minimum DNA binding domain of recombinant FBP.
- FBP FUSE binding protein
- RT-PCR Reverse transcriptase-polymerase chain reaction
- Two amplified products contained open reading frames encoding segments identical to peptide sequences from purified FBP confirming that the authentic cDNA had been obtained.
- FIG. 1A shows RNA prepared from HL60 cells grown 48 hrs.
- FIG. 1B shows RNA prepared from HL60 cells cultured as indicated with or without 50 ng/ml 12-o-tetradecanohlphorbol 13-acetate (TPA) in the same medium as above.
- TPA 12-o-tetradecanohlphorbol 13-acetate
- a full length FBP cDNA sequence was assembled from overlapping clones obtained from three libraries using the PCR fragments as probes. All three cDNA libraries were prepared from poly A selected RNA and ligated into the lambda Zap II vector (Stratagene). Source RNAs were from undifferentiated HL60 cells, the B lymphoma cell line, BJAB, and PMA/PHA stimulated pooled human peripheral blood lymphocytes (PBLs).
- the composite 2,384 bp cDNA contains 26 bp of 5' untranslated sequence, 1932 bp of open reading frame, and 426 bp of 3' untranslated including a poly A addition signal and 60 bp of poly A tail, as shown in FIG. 2.
- the open box (FIG. 2) in the composite cDNA indicates the coding region, A n the poly A tail.
- the position of PCR products and clones are shown, solid lines represent vertically aligned identical sequence, dashed lines represent non-homologous sequence.
- Clone names at left indicate source or library of origin.
- the nonhomologous sequences in the HL60 clones, BJAB-1, and PBL-3 most likely are from reverse transcribed, unspliced pre-mRNA or extraneous cDNA fragments ligated together during the preparation of the libraries.
- the open circle on clone BJAB-2 indicates the position of the 3 base pair deletion. Dideoxy sequencing (Sequenase, U.S. Biochemical) followed the manufacturer's protocol.
- the nucleic acid sequence (SEQ ID NO:1) of the full length cDNA is as follows:
- Three clones from a BJAB cDNA library and three clones from an activated, human peripheral blood lymphocyte cDNA library contain the above sequence.
- Three clones from the same BJAB library and two clones from the activated lymphocyte library are lacking nucleotides 316, 317 and 318 (SEQ ID NO:9).
- the mRNA lacking these three nucleotides would encode a variant protein lacking serine 97 (SEQ ID NO:10).
- the deduced polypeptide 644 amino acids in length, has a calculated molecular weight of 67.5 kd consistent with the molecular weight of the purified protein.
- the amino acid sequence can be found in SEQ ID NO:2.
- oligo sequences SEQ ID NO:3
- Degenerate oligonucleotides were also synthesized to correspond to the peptide encoded by nucleotides 957 to 971 and used together with the above primer (SEQ ID NO:3) in a PCR reaction which amplified the PCR product encoded by nucleotides 861 to 956 from human cDNA.
- the sequences (SEQ ID NO:4) of this degenerate oligo pool are:
- the full length cDNA sequence has features which suggest multiple modes of FBP regulation. There is an unusual codon bias. For every amino acid with a degenerate codon, except glutamine, the FBP sequence avoids the codons preferred in a survey of 2,681 human genes (K-n. Wada, Y. Wada, F. Ishibashi, T. Gojobori and T. Ikemura. Codon usage tabulated from the GenBank genetic sequence data. Nucleic Acids Res. 20(supplement), 2111-2118 (1992)) suggesting translational regulation.
- the clone from an HL60 cDNA library labeled "3-1" contains 10 basepairs of 5' untranslated sequence, followed by an open reading frame comprised of the nucleotides from position 858 to position 1561 shown above in SEQ ID NO:1 and a unique 3' end cDNA sequence of 25 base pairs, followed by two adjacent stop codons and additional 3' untranslated sequence.
- the sequence of clone 3-1 is as follows (SEQ ID NO: 5):
- amino acid sequence for the protein encoded by the cDNA sequence for clone 3-1 can be found in SEQ ID NO:6.
- the clone from an activated, human peripheral blood lymphocyte cDNA library labeled "31-10" contains 24 basepairs of 5' untranslated sequence, followed by an open reading frame comprised of the basepairs from position 135 to position 1991 of the DROME sequence given in SEQ ID NO:1 with two exceptions.
- Clone 31-10 contains 63 nucleotides inserted at position 238 in the DROME cDNA sequence which probably result from an intron which had not been spliced out in the DROME mRNA molecule that gave rise to the 31-10 clone. The inserted nucleotides remain in the open reading frame and are translated into an alternate form of the DROME protein.
- the 31-10 clone also deviates in that nucleotides from position 1807 to 1952 in the DROME cDNA sequence are deleted. This deletion shifts out of frame the stop codons which would terminate translation in the other clones.
- additional amino acids must be added to the C-terminal end of the DROME protein.
- sequence of clone 31-10 is as follows (SEQ ID NO:7):
- amino acid sequence for the protein encoded by the cDNA sequence for clone 31-10 can be found in SEQ ID NO:8.
- the cloned cDNA encodes a protein with FUSE binding activity, referred to as DROME or FUSE binding protein ("FBP").
- FBP FUSE binding protein
- the coding region contained in the HL60 clones was expressed as a fusion protein in bacteria, purified from extracts and tested with electrophoretic mobility shift assays (EMSAs) for binding to an oligonucleotide containing the FUSE site.
- ESAs electrophoretic mobility shift assays
- GST-FBP Recombinant fusion protein bound effectively to the FUSE oligonucleotide.
- the open reading frame present in the HL60-1 and HL60-2 clones (amino acid residues 145 to 511) was expressed as a Glutathione-S-transferase fusion protein (GST-FBP) in the pGEX system.
- GST-FBP Glutathione-S-transferase fusion protein
- the inserts from the HL60-1 and HL60-2 clones were spliced together and the open reading frame region subcloned into the Sma I site of the pGEX-2T plasmid (AMARAD Corp.; D. B. Smith and K. S. Johnson, Gene. 67,31 (1988)) to express a GST-FBP fusion protein.
- Recombinant protein was purified from E.
- DNA binding assays were performed with an equivalent amount of GST-FBP or GST alone incubated in 25 mM Tris, 200 mM Glycine, 1mM EDTA, 0.5 mg/ml BSA, 0.1% Tween20, 10% glycerol, 100 ⁇ g/ml poly(dI:dC), and 0.2 ng of labeled probe.
- the probe was prepared by annealing two complementary synthetic oligonucleotides and 5' end labeling with T 4 polynucleotide kinase in the presence of ⁇ - 32 P-ATP.
- the top strand of the FUSE oligonucleotide was 5'-GATCACAAAATAAAAATCCCGAGGGAATATAG-3' (SEQ ID NO:11).
- the top strand of Mut A was 5'-GATCACAAAAAATCCGAGGAATATAG-3' (SEQ ID NO:12) (lower case indicates changes from FUSE oligomer sequence).
- the top strand of CRE was 5'-GATCTGACGTCATGACTGACGTCATGACTGACGTCATCA-3' (SEQ ID NO:13).
- the top strand of CTE was 5'-AATTCTCCTCCCCACCTTCCCCACCTCCCCA-3' (SEQ ID NO:14). Reaction mixtures were incubated 30 minutes at room temperature and protein-DNA complexes resolved by electrophoresis on a 4.2% acrylamide gel in 25 mM Tris, 200 mM Glycine, 1 mM EDTA buffer.
- Glutathione-S-transferase alone did not bind to the probe.
- Competitor oligonucleotides were as follows: Mut A, a mutant FUSE oligonucleotide with 17 residues covering the binding site changed; CRE, CAMP response element; CTE, CT element in c-myc 5' flanking region.
- Recombinant FBP and purified human FBP were also demonstrated to display similar DNA binding specificity.
- Full length GST-FBP and human FBP purified from HL60 cells by oligo-affinity chromatography were assayed for FUSE binding using EMSA.
- the DNA-protein complexes formed by human FBP in the presence of 25ng of the indicated non-radioactive competitors, visualized by autoradiography, are shown in the upper panel of FIG. 3B.
- the lower panel of FIG. 3B shows recombinant GST-FBP binding the FUSE probe in the presence of 25 ng of the same competitors.
- Competitor oligonucleotides are the same as in (A) with the addition of Mut B and Mut C, each with a different mutation in the FUSE sequence.
- the top strand of Mut B was 5'-GATCACAAAATAAAAAATggacgccGAATATAG-3' (SEQ ID NO:15) and the top strand of Mut C was 5' AATTCTCCTCCCACCTCCCACCTCCCA-GATCACAActacgtgctaggCGAGGGAATATAG-3' (SEQ ID NO:16).
- Recombinant fusion protein also failed to bind to the nonhomologous oligonucleotides when they were used as radioactive probes.
- the full length recombinant protein, and purified human FBP shared the same DNA binding specificity as indicated by challenging their binding to FUSE with a panel of competitors (FIG. 3B).
- An excess of cold FUSE significantly reduced binding of human and recombinant FBP to the radioactive probe; the same amount of the Mut B oligo which has minor changes in the FUSE sequence, bound both as well; but more divergent oligonucleotides did not significantly compete for binding to either protein.
- Full length FBP was inserted into an expression vector downstream from the cytomegalovirus enhancer/promoter in both the sense and anti-sense orientations.
- These plasmids or the expression vector alone were transfected into U937 cells with another vector containing a CAT gene under the control of myc regulator sequence (myc-CAT) or the reporter plasmid with the FUSE site deleted ( ⁇ FUSE myc-CAT).
- myc-CAT and ⁇ FUSE myc-CAT reporter plasmids are derivatives of pMP CAT (M. Avigan, B. Strober, and D. Levens, J. Biol.
- the ⁇ FUSE myc-CAT plasmid was produced from pMP CAT by cutting the parent plasmid at the Ava I site located in the FUSE element, partial digestion with mung bean exonuclease (Bethesda Research Labs) and religation. The DNA sequence of the deleted plasmid revealed that 68 nucleotides between position -1493 and -1561 relative to myc P1 were removed, completely deleting the FUSE element.
- U937 cells (5 ⁇ 10 6 ) were electroporated (Cell-porator, BRL, 200V, 1180 ⁇ F) with 10 ⁇ g of each plasmid in 250 ⁇ l of RPMI supplemented with 10% fetal calf serum. Transfected cells were added to 8 ml additional medium and incubated 48 hours before harvesting for CAT assays. (C. M. Gorman, L. F. Moffat, B. H. Howard, Mol. Cell. Biol. 2, 1044 (1982)). The same quantity of protein (Bradford method) was assayed for each extract. Two independent transfections are shown for each plasmid combination.
- the FUSE containing myc-CAT plasmid gave a 5-fold higher level of CAT activity than in the presence of the expression vector alone.
- this level of stimulation did not occur when the reporter plasmid had a 68 bp deletion that eliminated the FUSE site ( ⁇ FUSE myc-CAT) nor when the expression plasmid contained the FBP cDNA in the reverse orientation (FBP-antisense).
- the minor increase in the ⁇ FUSE myc-CAT expression with FBP co-transection could be due to secondary binding sites present in the myc regulatory sequence.
- FBP shows no significant homology to known DNA binding motifs in a search against the GenBank databases, however the primary amino acid sequence of FBP has distinct structural features which could constitute a DNA binding domain, as well as other features with potential functional significance.
- a Pustell matrix self-comparison of FBP revealed three domains in the primary amino acid sequence, each containing internally repeated sequences.
- the Pustell matrix (J. Pustell, F. C. Kafatos, N. A. R. 10, 4765 (1982); W. R. Pearson, Meth. Enzymol. 183, 63 (1990); MacVector software, International Biotechnologies, Inc.) revealed the repeated sequences in the three domains of FBP.
- a window size of 20 residues and a minimum score of 35% were used for this analysis.
- the diagram above the box in FIG. 5A symbolizes the structures in FBP: striped fill indicates glycine rich segments, the solid boxes indicate the FBP repeats, and the shaded boxes represent amphipathic helices.
- the amino-terminal domain is comprised of 106 amino acids featuring a string of 11 repeated glycine residues and the first of 5 predicted amphipathic alpha helices found in FBP. Following the helix, residues 63 to 106 are enriched for glutamine (16%) relative to the whole protein (9%).
- the central and largest FBP domain contains structures sufficient for sequence specific binding because a recombinant protein comprised of only this domain bound DNA specifically.
- the central domain is made up of four evenly spaced units each unit containing (1) a highly conserved 30 residue segment termed here the FBP repeat (FIG. 5B), followed by (2) a potential amphipathic alpha helix (FIG. 5C).
- FBP repeat is comprised of three structures in sequential order: a region of predicted ⁇ -sheet, separated by a turn at conserved glycines 13 and 14, from a segment of alpha-helix as predicted by Chou-Fasman and Robson-Garnier analysis. (P. Y. Chou and G. D.
- the invariant glycine at position 25 may not disrupt helix formation due to the hydrophilic character of neighboring residues in the predicted helix.
- This sheet-turn-helix pattern evokes the DNA-binding domain of the papilloma virus E2 protein.
- the C-terminal domain is separated from the central domain by a highly flexible, proline-glycine rich segment.
- J. A. Cooper, F. S. Esch, S. S. Taylor, and T. Hunter "Phosphorylation sites in enolase and lactate dehydrogenase utilized by tyrosine protein kinases in vivo and in vitro.” J. Biol. Chem. 259:7835-7841 (1984)).
- the plasmid which encodes the 278-511 mutant was constructed by inserting an EcoR I-Sac I fragment from the HL60-2 clone into the pGEX-1 vector.
- the fusion protein contains amino acids 278-511 of FBP plus the residues KEIEQKVQE (SEQ ID NO:17) (where K is Lys, E is Glu, I is Ile, Q is Gln, and V is Val) at the carboxyterminal end stopping at a termination codon unique to the HL60-2 clone.
- the 278-474 encoding plasmid was constructed from the 278-511 plasmid by inserting a 12 bp double stranded oligonucleotide with the sequence TTAGTTAACTAA (SEQ ID NO:18) into an Sfi I site. This oligonucleotide encodes stop codons in all 3 reading frames so that a truncated protein is produced.
- the 278-372 encoding plasmid was similarly constructed by inserting the termination oligonucleotide into a Dra III site.
- the 298-511 encoding plasmid was constructed from the 278-511 plasmid by deleting a fragment between the BamH I site in the pGEX vector and a Bcl I in the FBP cDNA.
- the removal of FBP repeat 4 and the rest of the C-terminus in a mutant with a larger truncation (278-372) or removal of the NH2-terminal portion of repeat 3 (298-511) abrogated binding.
- Insertion mutants further demonstrated the importance of two repeat-helix units for DNA binding. Three to six amino acids were introduced at eight positions throughout the 278-511 construct (FIG. 6B) without altering the reading frame.
- Insertion mutants were constructed by cutting the 278-511 plasmid at restriction sites, treating the ends with the large fragment of DNA polymerase I (if not already blunt) and inserting linkers of 8, 10 or 12 base pairs. The appropriate length was chosen to insert a small number of amino acids but restore the original reading frame leaving the rest of the sequence unchanged. Insertion mutant 1 had the amino acids ArgIleArg added between GST and the FBP polypeptide. Mutant 2 had residues IleGlySerArgIleArg (SEQ ID NO:19) added after the Met at position 297 in FBP, mutant 3 had IleArgIleArg (SEQ ID NO:20) added after Met297.
- Mutant 4 had ProArgIleArgGlu (SEQ ID NO:21) added while deleting Gln at amino acid 333.
- Mutant 5 had GlyIlePro added after amino acid 336.
- Mutant 6 had GlyIleProArg (SEQ ID NO:22) added after amino acid 343.
- Mutant 7 ArgIleArg added after amino acid 373.
- Mutant 8 had ArgAspProAla (SEQ ID NO:23) added while deleting GlnSer after amino acid 404.
- Mutant 9 had ArgGlySerGly (SEQ ID NO:24) inserted while deleting Pro after amino acid 475. All constructs were confirmed by DNA sequencing. The DNA binding property of each insertion mutant is indicated by a +, - or R (reduced binding) below the insertion site.
- insertion mutant number 1 An insertion at the junction between GST and the FBP sequence (insertion mutant number 1) does not effect binding. Neither do insertions into the region between helix 4 and repeat 4 (number 7) or the Pro-Gly region (number 9). In contrast, insertions 2 and 3 in repeat three, insertions 4 and 5 in helix four, and insertion 8 in repeat four all resulted in mutant proteins failing to bind DNA. Insertion number 6, near the end of helix four, reduced but did not eliminate binding.
- FBP human FUSE binding protein
- DROME human FUSE binding protein
- the purified FUSE binding protein is also useful in the production of monoclonal antibodies.
- a mouse is injected with purified FUSE binding protein, or a fragment thereof, which activates a number of B-lymphocytes in the mouse which produce antibodies against the protein.
- the mouse is sacrificed and spleen lymphoid cells, containing large quantities of these B-lymphocytes, are isolated and tested to assure production of antibody to FUSE binding protein.
- the lymphocytes producing antibody to FUSE binding protein are then fused with mouse plasmacytoma cells, ensuring a reproducible source of monoclonal antibody (Kohler, G. and Milstein, C. Nature, 256:495-97 (1975)). These antibodies, or fragments thereof, can then be used to detect and quantitate the FUSE binding protein.
- the anti-sense sequence for the FUSE binding protein cDNA is useful therapeutically to arrest cell development in a target area of uncontrolled cell growth.
- An expression vector or oligonucleotide is constructed incorporating the anti-sense sequence of the FUSE binding protein cDNA. The oligonucleotide or vector is then incorporated into the cells of the target area, and acts to antagonize or block expression of the FUSE binding protein, inhibiting cell division and proliferation.
- This is effective, for example, in arresting tumor cell growth, or an unwarranted immune response (i.e., arresting B- or T-lymphocytes).
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Gastroenterology & Hepatology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Zoology (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Toxicology (AREA)
- Peptides Or Proteins (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
The Far Upstream Element (FUSE) of the human c-myc gene stimulates expression in undifferentiated cells. A FUSE binding protein (FBP), also referred to as DROME (DNA-binding regulator of c-myc expression), is active in undifferentiated but not differentiated cell extracts. Cloned FBP exhibits the same DNA-binding specificity as the purified human protein and can trans-activate in a FUSE dependent manner. Sequence-specific binding to the FUSE oligonucleotide requires at least two copies of a repeat-helix unit which defines a new DNA-binding motif. Expression of FBP mRNA declined in parallel with decreased FUSE binding activity upon differentiation suggesting transcriptional regulation of FBP.
Description
This application is a continuation of application Ser. No. 08/021,608, filed Feb. 22, 1993 now U.S. Pat. No. 5,580,760.
The present invention relates to a novel human cDNA and the encoded protein which interacts with a cis-element activator, known as the FUSE binding protein. A novel DNA-binding regulator of c-myc expression ("DROME") was purified, and then proteolytically and chemically degraded to peptides. These peptides were separated by HPLC and the sequences of multiple peptides were determined. Using the information from the peptide sequence, specific oligonucleotide primers were synthesized and then used as primers for the polymerase chain reaction employing human cDNA as a template. The resulting products were cloned and sequenced, and shown to encode additional peptides of the DROME protein. These DNA segments were then employed as probes to screen multiple phage libraries in order to reconstruct a full length reading frame from several overlapping clones. This information allows the expression of a full length protein. As used herein, the terms DROME and FUSE binding protein ("FBP") are synonymous.
The c-myc proto-oncogene plays a central role in normal cell proliferation and programmed cell death (Y. Shi, J. Glynn, L. Guilbert, T. Cotter, R. Bissonnette, and D. Green, "Role for c-myc in activation-induced apoptotic cell death in T cell hybridomas," Science, 257:212-214 (1992)) and its deregulation contributes to the formation of a variety of tumors. (J. M. Bishop, Annu. Rev. Biochem. 52,301-354 (1983); M. D. Cole, "The myc oncogene: its role in transformation and differentiation." Annu. Rev. Genet. 20, 361-384 (1986); S. Cory, Adv. Cancer Res. 47, 189-234 (1986)).
Down regulation of the c-myc proto-oncogene occurs in the human promonomyelocytic leukemia cell line HL60 and human monoblastic line, U937, upon induction of differentiation. (C. Dony, M. Kessel, and P. Gruss, Nature. 317, 636-639 (1985); L. E. Grosso, and H. C. Pitot, Cancer Res. 45, 847-850 (1985); T. Watanabe, E. Sariban, T. Mitchell, and D. Kufe, Biochem. Biophys. Res. Commun. 126, 999-1005 (1985); D. L. Bentley, and M. Groudine, Nature, 321, 702-706 (1986); D. Eick and G. W. Bornkamm, Nucleic Acids Res. 14, 8331-8346 (1986); T. Endo, and B. Nadal-Ginard, Mol. Cell. Biol. 6, 1412-1421 (1986)). This suppression of c-myc expression occurs by two mechanisms; within three hours there is a block to elongation which can be reversed by removal of the differentiation agent. Subsequently, transcriptional initiation ceases, coinciding with irreversible commitment to the differentiation pathway. (U. Siebenlist, P. Bressler, and K. Kelly, Mol. Cel. Biol. 8, 867-874 (1988)).
A Far Upstream Element ("FUSE") which is required for maximal transcription of c-myc, binds a factor (DROME or FUSE binding protein ("FBP")) which is present in extracts of undifferentiated cells, but disappears upon differentiation. (M. I. Avigan, B. Strober, and D. Levens, "A Far Upstream Element Stimulates c-myc Expression In Undifferentiated Leukemia Cells." J. Biol. Chem. 265, 18538-18545 (1990)). The disappearance of this binding activity occurs 24 hours after addition of the differentiation agent coinciding with the loss of initiation of c-myc transcription. The FUSE site differs from other described positive regulatory elements for myc in a number of ways. Despite its placement a long distance from the transcription start site (-1500 bp relative to the myc P1 promoter), the FUSE element will not act as a traditional enhancer; multiple copies inserted upstream of a heterologous promoter do not stimulate transcription in transection experiments. However, when the FUSE site is present with additional c-myc regulatory sequences, specific stimulation of the c-myc promoter is observed, indicating that FUSE works in concert with other myc elements. These features suggest that the factor binding to this site may act to stimulate transcription by an unusual mechanism.
The nucleotide sequence for a novel DNA-binding regulator of c-myc expression is provided, together with the amino acid sequence for the encoded protein. The protein interacts with an activator cis-element approximately 1500 basepairs 5' of the human c-myc gene promoter P1. The cDNA and variations thereof have the nucleotide sequences of SEQ ID NO:1, SEQ ID NO:5, SEQ ID NO:7 and SEQ ID NO:9, and the proteins have the amino acid sequences of SEQ ID NO:2, SEQ ID NO:6, SEQ ID NO:8 and SEQ ID NO:10.
With the given sequence for the cDNA and protein of the present invention, one may now study genetic abnormalities of c-myc expression, in addition to numerous other uses of the gene and encoded protein.
The gene encoding the DROME or FUSE binding protein is useful in the diagnosis of disease states. Specific nucleic acid probes derived from knowledge of the DROME sequence and genetic map for PCR or hybridization are useful to analyze mutations, translocations and other genetic derangements that are associated with abnormalities of DROME or c-myc expression. Because DROME is highly activated during T-cell stimulation, these probes are useful to analyze immune system abnormalities. The DROME probes are also useful to analyze a variety of physiologic and pathologic conditions.
Knowledge of the cDNA and genomic structures of DROME allows the construction of vectors which express DROME or which express anti-sense DROME sequence. Oligonucleotide and expression vector anti-sense approaches to block DROME expression are useful to modulate DROME expression in vivo, which results in therapeutic modification of the levels of expression of genes regulated by DROME. Such strategies are of therapeutic value to certain pathologic conditions or can be used to provide prophylactic or beneficial changes in DROME expression in pre-pathologic conditions. If genetic disorders can be ascribed to abnormalities of the DROME protein or its expression, then gene therapy for such disorders will be heavily dependent on the information and materials derived from the characterization of the DROME gene and its cDNA.
Expression of the DROME protein itself in prokaryotic and eukaryotic expression vectors is useful in several regards. The DROME protein or fragments thereof are useful as an immunogen to generate polyclonal and monoclonal antibodies which can then be used to detect and quantitate the DROME protein. The DROME protein itself is useful as a probe to identify and quantify proteins which interact with or modify DROME; similarly, if the DROME protein is immobilized as a ligand for larger scale operations, the DROME interacting or modifying molecules can be purified. All of these proteins can be employed singly or in combination together with existing technologies as assay kits to detect, quantitate and analyze DROME protein.
Structural analysis of the DROME protein has defined a new DNA binding motif composed of a unique repeating element followed by an amphipathic helix. Knowledge of these structural elements together with knowledge of the complete cDNA sequence permits the identification, cloning and characterization of the genes for related proteins and DROME homologs using hybridization based or PCR based methods.
Because proteins which share structural motifs often share biochemical properties as well as functional and physiological roles, it is expected that DROME homologs and other proteins with the newly defined DNA binding motif are also important regulators of cell growth and other cellular processes. The same approaches used to modify DROME expression by sense and anti-sense vectors and anti-sense oligonucleotides is applied to DROME homologs and proteins sharing the DROME DNA binding motif in order to alter cell growth and cellular properties. These proteins or fragments thereof, which are either DROME homologs or recognizable as DNA binding proteins because they possess the DNA binding motif described herein, are useful for the generation of antibodies and diagnostic kits to relate these molecules to cancer, genetic and other human maladies. These proteins, antibodies and kits are useful in the diagnosis, treatment and study of human diseases.
Similarly, altered forms of the protein, either due to post-translational modification or altered RNA processing resulting from altered splicing or other RNA modifications, are identified and characterized utilizing the protein, antibody and nucleic acid probes outlined above. Each of these new proteins and/or genes all related to DROME are then subjected to similar analyses.
FIGS. 1A and 1B depict radiographs showing reduced expression over time of FBP mRNA.
FIG. 2 gives the primary structure of DROME or FUSE binding protein (FBP) cDNA sequences.
FIGS. 3A and 3B depict DNA binding assay radiographs showing that recombinant FBP binds specifically to the far upstream element.
FIG. 4 shows assays depicting recombinant FBP activated expression of a chloramphenicol acetyltransferase (CAT) gene under the control of human c-myc regulatory sequence.
FIGS. 5A-5C show the structural features of the primary amino acid sequence of FBP.
FIGS. 6A-6C illustrates a mutational analysis of the minimum DNA binding domain of recombinant FBP.
A 70 kilodalton (kd) FUSE binding protein (FBP) has been purified from undifferentiated HL60 cells, the protein eluted from an SDS acrylamide gel and the amino acid sequence of internal peptides determined. The peptide sequences enabled the design of oligonucleotide PCR primers used to amplify cDNA template prepared from undifferentiated U937 cell total RNA.
Reverse transcriptase-polymerase chain reaction (RT-PCR) was performed as described in D. Rappolee, A. Wang, D. Mark, and Z. Werb, J. Cell. Biochem. 39,1 (1989). In the RT step 5 μg total U937 RNA, 1.5 μg of random hexamer primers (Promega) and 20 μMoles of each deoxynucleotide triphosphate (dNTP) were incubated in a 40 μl reaction with Superscript Reverse Transcriptase (BRL) and the manufacturer's reaction buffer adjusted to 10 mM dithiothreitol at 45° C. for 1 hour. After ethanol precipitation and reconstitution in 40 μl dH2 O, 2 μl of the RT reaction products were used as template in a 50 μl PCR reaction with 20 pMoles of each degenerate primer, 20 nMoles each dNTP, 1.5 units Amplitaq polymerase (Perkin Elmer Cetus) and the manufacturer's buffer adjusted to 7.5 mM MgCL2. Samples were incubated through 25 cycles of 94° C., 55° C., and 72° C., 2 minutes at each temperature.
Two amplified products contained open reading frames encoding segments identical to peptide sequences from purified FBP confirming that the authentic cDNA had been obtained.
The PCR fragments hybridized to a single 2.6 kilobase (kb) RNA which disappeared after DMSO induced differentiation (FIG. 1A) consistent with the loss of FUSE binding activity after DMSO treatment. Polyadenylated RNA (10 μg) from cultured cells (J. M. Chirgwin et al., Biochemistry, 18, 5294 (1979)) was separated on a 1% agarose-formaldehyde gel, transferred to nitrocellulose and hybridized with a 32 P probe from FBP, B-actin, or human c-myc cDNAs. FIG. 1A shows RNA prepared from HL60 cells grown 48 hrs. in the presence (+) or absence (-) of 1.3% Dimethyl Sulfoxide (DMSO) in RPMI medium supplemented with 10% fetal calf serum, glutamine, penicillin and streptomycin. FIG. 1B shows RNA prepared from HL60 cells cultured as indicated with or without 50 ng/ml 12-o-tetradecanohlphorbol 13-acetate (TPA) in the same medium as above.
As depicted in FIG. 1B, expression of the 2.6 kb transcript sharply declined after 24 hours of treatment with TPA and was undetectable at 48 hours paralleling: 1) the disappearance of the FUSE binding activity, 2) a dramatic decrease in c-myc transcriptional initiation and, 3) irreversible commitment to terminal differentiation. Hence, FBP mRNA and binding activity are shut off synchronously during differention.
A full length FBP cDNA sequence was assembled from overlapping clones obtained from three libraries using the PCR fragments as probes. All three cDNA libraries were prepared from poly A selected RNA and ligated into the lambda Zap II vector (Stratagene). Source RNAs were from undifferentiated HL60 cells, the B lymphoma cell line, BJAB, and PMA/PHA stimulated pooled human peripheral blood lymphocytes (PBLs).
The composite 2,384 bp cDNA contains 26 bp of 5' untranslated sequence, 1932 bp of open reading frame, and 426 bp of 3' untranslated including a poly A addition signal and 60 bp of poly A tail, as shown in FIG. 2.
The open box (FIG. 2) in the composite cDNA indicates the coding region, An the poly A tail. The position of PCR products and clones are shown, solid lines represent vertically aligned identical sequence, dashed lines represent non-homologous sequence. Clone names at left indicate source or library of origin. The nonhomologous sequences in the HL60 clones, BJAB-1, and PBL-3 most likely are from reverse transcribed, unspliced pre-mRNA or extraneous cDNA fragments ligated together during the preparation of the libraries. The open circle on clone BJAB-2 indicates the position of the 3 base pair deletion. Dideoxy sequencing (Sequenase, U.S. Biochemical) followed the manufacturer's protocol.
The nucleic acid sequence (SEQ ID NO:1) of the full length cDNA is as follows:
__________________________________________________________________________
GCGGCAGCGG CTCTTATAGT GCAACC
ATG
GCA
GAC
TAT
TCA
ACA
44
CGCCGTCGCC GAGAATATCA CGTTGG
TAC
CGT
CTG
ATA
AGT
TGT
Met Ala
Asp
Tyr
Ser
Thr
1 5
GTG
CCT
CCC
CCC
TCT
TCT
GGC
TCA
GCT
GGT
GGC
GGT
GGT
83
CAC
GGA
GGG
GGG
AGA
AGA
CCG
AGT
CGA
CCA
CCG
CCA
CCA
Val
Pro
Pro
Pro
Ser
Ser
Gly
Ser
Ala
Gly
Gly
Gly
Gly
10 15
GGC
GGC
GGT
GGT
GGT
GGA
GGA
GTT
AAC
GAC
GCT
TTC
AAA
122
CCG
CCG
CCA
CCA
CCA
CCT
CCT
CAA
TTG
CTG
CGA
AAG
TTT
Gly
Gly
Gly
Gly
Gly
Gly
Gly
Val
Asn
Asp
Ala
Phe
Lys
20 25 30
GAT
GCA
CTG
CAG
AGA
GCC
CGG
CAG
ATT
GCA
GCA
AAA
ATT
161
CTA
CGT
GAC
GTC
TCT
CGG
GCC
GTC
TAA
CGT
CGT
TTT
TAA
Asp
Ala
Leu
Gln
Arg
Ala
Arg
Gln
Ile
Ala
Ala
Lys
Ile
35 40 45
GGA
GGT
GAT
GCA
GGG
ACA
TCA
CTG
AAT
TCA
AAT
GAC
TAT
200
CCT
CCA
CTA
CGT
CCC
TGT
AGT
GAC
TTA
AGT
TTA
CTG
ATA
Gly
Gly
Asp
Ala
Gly
Thr
Ser
Leu
Asn
Ser
Asn
Asp
Tyr
50 55
GGT
TAT
GGG
GGA
CAA
AAA
AGA
CCT
TTA
GAA
GAT
GGA
GAT
239
CCA
ATA
CCC
CCT
GTT
TTT
TCT
GGA
AAT
CTT
CTA
CCT
CTA
Gly
Tyr
Gly
Gly
Gln
Lys
Arg
Pro
Leu
Glu
Asp
Gly
Asp
60 65 70
CAA
CCA
GAT
GCT
AAG
AAA
GTT
GCT
CCT
CAA
AAT
GAC
TCT
278
GTT
GGT
CTA
CGA
TTC
TTT
CAA
CGA
GGA
GTT
TTA
CTG
AGA
Gln
Pro
Asp
Ala
Lys
Lys
Val
Ala
Pro
Gln
Asn
Asp
Ser
75 80
TTT
GGA
ACA
CAG
TTA
CCA
CCG
ATG
CAT
CAG
CAG
CAA
AGC
317
AAA
CCT
TGT
GTC
AAT
GGT
GGC
TAC
GTA
GTC
GTC
GTT
TCG
Phe
Gly
Thr
Gln
Leu
Pro
Pro
Met
His
Gln
Gln
Gln
Ser
85 90 95
AGA
TCT
GTA
ATG
ACA
GAA
GAA
TAC
AAA
GTT
CCA
GAT
GGA
356
TCT
AGA
CAT
TAC
TGT
CTT
CTT
ATG
TTT
CAA
GGT
CTA
CCT
Arg
Ser
Val
Met
Thr
Glu
Glu
Tyr
Lys
Val
Pro
Asp
Gly
100 105 110
ATG
GTT
GGA
TTC
ATA
ATT
GGC
AGA
GGA
GGT
GAA
CAG
ATC
395
TAC
CAA
CCT
AAG
TAT
TAA
CCG
TCT
CCT
CCA
CTT
GTC
TAG
Met
Val
Gly
Phe
Ile
Ile
Gly
Arg
Gly
Gly
Glu
Gln
Ile
115 120
TCA
CGC
ATA
CAA
CAG
GAA
TCT
GGA
TGC
AAA
ATA
CAG
ATA
434
AGT
GCG
TAT
GTT
GTC
CTT
AGA
CCT
ACG
TTT
TAT
GTC
TAT
Ser
Arg
Ile
Gln
Gln
Glu
Ser
Gly
Cys
Lys
Ile
Gln
Ile
125 130 135
GCT
CCT
GAC
AGT
GGT
GGC
CTT
CCA
GAA
AGG
TCC
TGT
ATG
473
CGA
GGA
CTG
TCA
CCA
CCG
GAA
GGT
CTT
TCC
AGG
ACA
TAC
Ala
Pro
Asp
Ser
Gly
Gly
Leu
Pro
Glu
Arg
Ser
Cys
Met
140 145
TTA
ACT
GGA
ACA
CCT
GAA
TCT
GTC
CAG
TCA
GCA
AAA
CGG
512
AAT
TGA
CCT
TGT
GGA
CTT
AGA
CAG
GTC
AGT
CGT
TTT
GCC
Leu
Thr
Gly
Thr
Pro
Glu
Ser
Val
Gln
Ser
Ala
Lys
Arg
150 155 160
TTA
CTG
GAC
CAG
ATT
GTT
GAA
AAA
GGA
AGA
CCA
GCT
CCT
551
AAT
GAC
CTG
GTC
TAA
CAA
CTT
TTT
CCT
TCT
GGT
CGA
GGA
Leu
Leu
Asp
Gln
Ile
Val
Glu
Lys
Gly
Arg
Pro
Ala
Pro
165 170 175
GGC
TTC
CAT
CAT
GGC
GAT
GGA
CCG
GGA
AAT
GCA
GTT
CAA
590
CCG
AAG
GTA
GTA
CCG
CTA
CCT
GGC
CCT
TTA
CGT
CAA
GTT
Gly
Phe
His
His
Gly
Asp
Gly
Pro
Gly
Asn
Ala
Val
Gln
180 185
GAA
ATC
ATG
ATT
CCA
GCT
AGC
AAG
GCA
GGA
TTA
GTC
ATT
629
CTT
TAG
TAC
TAA
GGT
CGA
TCG
TTC
CGT
CCT
AAT
CAG
TAA
Glu
Ile
Met
Ile
Pro
Ala
Ser
Lys
Ala
Gly
Leu
Val
Ile
190 195 200
GGA
AAA
GGG
GGA
GAA
ACT
ATT
AAA
CAG
CTT
CAG
GAA
CGG
668
CCT
TTT
CCC
CCT
CTT
TGA
TAA
TTT
GTC
GAA
GTC
CTT
GCC
Gly
Lys
Gly
Gly
Glu
Thr
Ile
Lys
Gln
Leu
Gln
Glu
Arg
205 210
GCT
GGA
GTT
AAA
ATG
GTT
ATG
ATT
CAA
GAC
GGG
CCG
CAG
707
CGA
CCT
CAA
TTT
TAC
CAA
TAC
TAA
GTT
CTG
CCC
GGC
GTC
Ala
Gly
Val
Lys
Met
Val
Met
Ile
Gln
Asp
Gly
Pro
Gln
215 220 225
AAC
ACT
GGT
GCT
GAC
AAA
CCT
CTT
AGG
ATT
ACA
GGA
GAC
746
TTG
TGA
CCA
CGA
CTG
TTT
GGA
GAA
TCC
TAA
TGT
CCT
CTG
Asn
Thr
Gly
Ala
Asp
Lys
Pro
Leu
Arg
Ile
Thr
Gly
Asp
230 235 240
CCA
TAT
AAA
GTT
CAA
CAA
GCC
AAG
GAA
ATG
GTG
TTA
GAG
785
GGT
ATA
TTT
CAA
GTT
GTT
CGG
TTC
CTT
TAC
CAC
AAT
CTC
Pro
Tyr
Lys
Val
Gln
Gln
Ala
Lys
Glu
Met
Val
Leu
Glu
245 250
TTA
ATT
CGT
GAT
CAA
GGC
GGT
TTC
AGA
GAA
GTT
CGG
AAT
824
AAT
TAA
GCA
CTA
GTT
CCG
CCA
AAG
TCT
CTT
CAA
GCC
TTA
Leu
Ile
Arg
Asp
Gln
Gly
Gly
Phe
Arg
Glu
Val
Arg
Asn
255 260 265
GAG
TAT
GGG
TCA
AGA
ATA
GGA
GGA
AAT
GAA
GGG
ATA
GAT
863
CTC
ATA
CCC
AGT
TCT
TAT
CCT
CCT
TTA
CTT
CCC
TAT
CTA
Glu
Tyr
Gly
Ser
Arg
Ile
Gly
Gly
Asn
Glu
Gly
Ile
Asp
270 275
GTC
CCC
ATT
CCA
AGA
TTT
GCT
GTT
GGC
ATT
GTA
ATA
GGA
902
CAG
GGG
TAA
GGT
TCT
AAA
CGA
CAA
CCG
TAA
CAT
TAT
CCT
Val
Pro
Ile
Pro
Arg
Phe
Ala
Val
Gly
Ile
Val
Ile
Gly
280 285 290
AGA
AAT
GGA
GAG
ATG
ATC
AAA
AAA
ATA
CAA
AAT
GAT
GCT
941
TCT
TTA
CCT
CTC
TAC
TAG
TTT
TTT
TAT
GTT
TTA
CTA
CGA
Arg
Asn
Gly
Glu
Met
Ile
Lys
Lys
Ile
Gln
Asn
Asp
Ala
295 300 305
GGT
GTT
CGC
ATT
CAG
TTT
AAG
CCA
GAT
GAT
GGG
ACA
ACA
980
CCA
CAA
GCG
TAA
GTC
AAA
TTC
GGT
CTA
CTA
CCC
TGT
TGT
Gly
Val
Arg
Ile
Gln
Phe
Lys
Pro
Asp
Asp
Gly
Thr
Thr
310 315
CCC
GAA
AGG
ATA
GCA
CAA
ATA
ACA
GGA
CCT
CCA
GAC
CGA
1019
GGG
CTT
TCC
TAT
CGT
GTT
TAT
TGT
CCT
GGA
GGT
CTG
GCT
Pro
Glu
Arg
Ile
Ala
Gln
Ile
Thr
Gly
Pro
Pro
Asp
Arg
320 325 330
TGT
CAA
CAT
GCT
GCA
GAA
ATT
ATT
ACA
GAC
CTT
CTT
CGA
1058
ACA
GTT
GTA
CGA
CGT
CTT
TAA
TAA
TGT
CTG
GAA
GAA
GCT
Cys
Gln
His
Ala
Ala
Glu
Ile
Ile
Thr
Asp
Leu
Leu
Arg
335 340
AGT
GTT
CAG
GCT
GGT
AAT
CCT
GGT
GGA
CCT
GGA
CCT
GGT
1097
TCA
CAA
GTC
CGA
CCA
TTA
GGA
CCA
CCT
GGA
CCT
GGA
CCA
Ser
Val
Gln
Ala
Gly
Asn
Pro
Gly
Gly
Pro
Gly
Pro
Gly
345 350 355
GGT
CGA
GGA
AGA
GGT
AGA
GGT
CAA
GGC
AAC
TGG
AAC
ATG
1136
CCA
GCT
CCT
TCT
CCA
TCT
CCA
GTT
CCG
TTG
ACC
TTG
TAC
Gly
Arg
Gly
Arg
Gly
Arg
Gly
Gln
Gly
Asn
Trp
Asn
Met
360 365 370
GGA
CCA
CCT
GGT
GGA
TTA
CAG
GAA
TTT
AAT
TTT
ATT
GTG
1175
CCT
GGT
GGA
CCA
CCT
AAT
GTC
CTT
AAA
TTA
AAA
TAA
CAC
Gly
Pro
Pro
Gly
Gly
Leu
Gln
Glu
Phe
Asn
Phe
Ile
Val
375 380
CCA
ACT
GGG
AAA
ACT
GGA
TTA
ATA
ATA
GGA
AAA
GGA
GGT
1214
GGT
TGA
CCC
TTT
TGA
CCT
AAT
TAT
TAT
CCT
TTT
CCT
CCA
Pro
Thr
Gly
Lys
Thr
Gly
Leu
Ile
Ile
Gly
Lys
Gly
Gly
385 390 395
GAA
ACC
ATA
AAA
AGC
ATA
AGC
CAG
CAG
TCT
GGT
GCA
AGA
1253
CTT
TGG
TGT
TTT
TCG
TAT
TCG
GTC
GTC
AGA
CCA
CGT
TCT
Glu
Thr
Ile
Lys
Ser
Ile
Ser
Gln
Gln
Ser
Gly
Ala
Arg
400 405
ATA
GAA
CTT
CAG
AGA
AAT
CCT
CCA
CCA
AAT
GCA
GAT
CCT
1292
TAT
CTT
GAA
GTC
TCT
TTA
GGA
GGT
GGT
TTA
CGT
CTA
GGA
Ile
Glu
Leu
Gln
Arg
Asn
Pro
Pro
Pro
Asn
Ala
Asp
Pro
410 415 420
AAT
ATG
AAG
TTA
TTT
ACA
ATT
CGT
GGC
ACT
CCA
CAA
CAG
1331
TTA
TAC
TTC
AAT
AAA
TGT
TAA
GCA
CCG
TGA
GGT
GTT
GTC
Asn
Met
Lys
Leu
Phe
Thr
Ile
Arg
Gly
Thr
Pro
Gln
Gln
425 430 435
ATA
GAC
TAT
GCT
CGG
CAA
CTC
ATA
GAA
GAA
AAG
ATT
GGT
1370
TAT
CTG
ATA
CGA
GCC
GTT
GAG
TAT
CTT
CTT
TTC
TAA
CCA
Ile
Asp
Tyr
Ala
Arg
Gln
Leu
Ile
Glu
Glu
Lys
Ile
Gly
440 445
GGC
CCA
GTA
AAT
CCT
TTA
GGG
CCA
CCT
GTA
CCC
CAT
GGG
1409
CCG
GGT
CAT
TTA
GGA
AAT
CCC
GGT
GGA
CAT
GGG
GTA
CCC
Gly
Pro
Val
Asn
Pro
Leu
Gly
Pro
Pro
Val
Pro
His
Gly
450 455 460
CCC
CAT
GGT
GTC
CCA
GGC
CCC
CAT
GGA
CCT
CCT
GGG
CCT
1448
GGG
GTA
CCA
CAG
GGT
CCG
GGG
GTA
CCT
GGA
GGA
CCC
GGA
Pro
His
Gly
Val
Pro
Gly
Pro
His
Gly
Pro
Pro
Gly
Pro
465 470
CCA
GGG
CCT
GGA
ACT
CCA
ATG
GGA
CCA
TAC
AAC
CCT
GCA
1487
GGT
CCC
GGA
CCT
TGA
GGT
TAC
CCT
GGT
ATG
TTG
GGA
CGT
Pro
Gly
Pro
Gly
Thr
Pro
Met
Gly
Pro
Tyr
Asn
Pro
Ala
475 480 485
CCT
TAT
AAT
CCT
GGA
CCA
CCA
GGC
CCG
GCT
CCT
CAT
GGT
1526
GGA
ATA
TTA
GGA
CCT
GGT
GGT
CCG
GGC
CGA
GGA
GTA
CCA
Pro
Tyr
Asn
Pro
Gly
Pro
Pro
Gly
Pro
Ala
Pro
His
Gly
490 495 500
CCT
CCA
GCC
CCA
TAT
GCT
CCC
CAG
GGA
TGG
GGA
AAT
GCA
1565
GGA
GGT
CGG
GGT
ATA
CGA
GGG
GTC
CCT
ACC
CCT
TTA
CGT
Pro
Pro
Ala
Pro
Tyr
Ala
Pro
Gln
Gly
Trp
Gly
Asn
Ala
505 510
TAT
CCA
CAC
TGG
CAG
CAG
CAG
GCT
CCT
CCT
GAT
CCA
GCT
1604
ATA
GGT
GTG
ACC
GTC
GTC
GTC
CGA
GGA
GGA
CTA
GGT
CGA
Tyr
Pro
His
Trp
Gln
Gln
Gln
Ala
Pro
Pro
Asp
Pro
Ala
515 520 525
AAG
GCA
GGA
ACG
GAT
CCA
AAT
TCA
GCA
GCT
TGG
GCT
GCT
1643
TTC
CGT
CCT
TGC
CTA
GGT
TTA
AGT
CGT
CGA
ACC
CGA
CGA
Lys
Ala
Gly
Thr
Asp
Pro
Asn
Ser
Ala
Ala
Trp
Ala
Ala
530 535
TAT
TAC
GCT
CAC
TAT
TAT
CAA
CAG
CAA
GCA
CAG
CCA
CCA
1682
ATA
ATG
CGA
GTG
ATA
ATA
GTT
GTC
GTT
CGT
GTC
GGT
GGT
Tyr
Tyr
Ala
His
Tyr
Tyr
Gln
Gln
Gln
Ala
Gln
Pro
Pro
540 545 550
CCA
GCA
GCC
CCT
GCA
GGT
GCA
CCA
ACT
ACA
ACT
CAA
ACT
1721
GGT
CGT
CGG
GGA
CGT
CCA
CGT
GGT
TGA
TGT
TGA
GTT
TGA
Pro
Ala
Ala
Pro
Ala
Gly
Ala
Pro
Thr
Thr
Thr
Gln
Thr
555 560 565
AAT
GGA
CAA
GGA
GAT
CAG
CAG
AAT
CCA
GCC
CCA
GCT
GGA
1760
TTA
CCT
GTT
CCT
CTA
GTC
GTC
TTA
GGT
CGG
GGT
CGA
CCT
Asn
Gly
Gln
Gly
Asp
Gln
Gln
Asn
Pro
Ala
Pro
Ala
Gly
570 575
CAG
GTT
GAT
TAT
ACC
AAG
GCT
TGG
GAA
GAG
TAC
TAC
AAG
1799
GTC
CAA
CTA
ATA
TGG
TTC
CGA
ACC
CTT
CTC
ATG
ATG
TTC
Gln
Val
Asp
Tyr
Thr
Lys
Ala
Trp
Glu
Glu
Tyr
Tyr
Lys
580 585 590
AAA
ATG
GGT
CAG
GCA
GTT
CCT
GCT
CCG
ACT
GGG
GCT
CCT
1838
TTT
TAC
CCA
GTC
CGT
CAA
GGA
CGA
GGC
TGA
CCC
CGA
GGA
Lys
Met
Gly
Gln
Ala
Val
Pro
Ala
Pro
Thr
Gly
Ala
Pro
595 600
CCA
GGT
GGT
CAG
CCA
GAT
TAT
AGT
GCA
GCC
TGG
GCT
GAG
1877
GGT
CCA
CCA
GTC
GGT
CTA
ATA
TCA
CGT
CGG
ACC
CGA
CTC
Pro
Gly
Gly
Gln
Pro
Asp
Tyr
Ser
Ala
Ala
Trp
Ala
Glu
605 610 615
CAT
TAT
AGA
CAA
CAA
GCA
GCC
TAT
TAT
GCC
CAG
ACA
AGT
1916
GTA
ATA
TCT
GTT
GTT
CGT
CGG
ATA
ATA
CGG
GTC
TGT
TCA
His
Tyr
Arg
Gln
Gln
Ala
Ala
Tyr
Tyr
Ala
Gln
Thr
Ser
620 625 630
CCC
CAG
GGA
ATG
CCA
CAG
CAT
CCT
CCA
GCA
CCT
CAG
GGC
1955
GGG
GTC
CCT
TAC
GGT
GTC
GTA
GGA
GGT
CGT
GGA
GTC
CCG
Pro
Gln
Gly
Met
Pro
Gln
His
Pro
Pro
Ala
Pro
Gln
Gly
635 640
CAA
TAA
TAA
GAAGTGGACA
ATACAGTATT
TGCTTCATTG 1994
GTT
ATT
ATT
CTTCACCTGT
TATGTCATAA
ACGAAGTAAC
Gln
TGTGGGGGAA
AAAAACCTTT
GTTAAATATA
TGGATGCAGA 2034
ACACCCCCTT
TTTTTGGAAA
CAATTTATAT
ACCTACGTCT
CGACTTGATG
AAGATCTTAA
TTTTGTTTTT
GGTTTAAAAT 2074
GCTGAACTAC
TTCTAGAATT
AAAACAAAAA
CCAAATTTTA
AGTGTTTCCT
TTTTTTTTTT
TTTTTTTTTG
AAAATGTACA 2114
TCACAAAGGA
AAAAAAAAAA
AAAAAAAAAC
TTTTACATGT
AAATATCTAT
CACTACTGAT
AGGAGGTTAA
TATTTCTGTG 2154
TTTATAGATA
GTGATGACTA
TCCTCCAATT
ATAAAGACAC
TAGAAATGAA
AATTGGTTTG
TTTTTAGTAT
TTAGTGTAGA 2194
ATCTTTACTT
TTAACCAAAC
AAAAATCATA
AATCACATCT
TGTACACATT
CCAGCAAATG
TATTTGCAAT
TATGTGGTTG 2234
ACATGTGTAA
GGTCGTTTAC
ATAAACGTTA
ATACACCAAC
ATGCTTTGTG
ATATAAATGT
ACTTTTTCAA
TGTATACTTT 2274
TACGAAACAC
TATATTTACA
TGAAAAAGTT
ACATATGAAA
CACTTTCCAA
ATGCCTGTTT
TGTGCTTTAC
AATAAATGAT 2314
GTGAAAGGTT
TACGGACAAA
ACACGAAATG
TTATTTACTA
ATGAAACCTC
AAAAAAAAAA
AAAAAAAAAA
AAAAAAAAAA 2354
TACTTTGGAG
TTTTTTTTTT
TTTTTTTTTT
TTTTTTTTTT
AAAAAAAAAA
AAAAAAAAAA
AAAAAAAAAA 2384
TTTTTTTTTT
TTTTTTTTTT
TTTTTTTTTT
__________________________________________________________________________
Three clones from a BJAB cDNA library and three clones from an activated, human peripheral blood lymphocyte cDNA library contain the above sequence. Three clones from the same BJAB library and two clones from the activated lymphocyte library are lacking nucleotides 316, 317 and 318 (SEQ ID NO:9). The mRNA lacking these three nucleotides would encode a variant protein lacking serine 97 (SEQ ID NO:10).
Though the 5' untranslated region of the message appears to be unusually short and contains no in-frame stop codons, the initiator methionine is contained in a nine out of ten match with the Kozak consensus. (M. Kozak, "Structural features in eukaryotic mRNAs that modulate the initiation of translation". J. Biol. Chem. 266(30), 19867-19870 (1991)). The deduced polypeptide, 644 amino acids in length, has a calculated molecular weight of 67.5 kd consistent with the molecular weight of the purified protein. The amino acid sequence can be found in SEQ ID NO:2.
To create the full length cDNA, pools of degenerate oligonucleotides were synthesized to correspond to the amino acid sequence of the peptide encoded by nucleotides 843 to 860 and used as primers in a PCR reaction. The oligo sequences (SEQ ID NO:3) are:
5'-CAGAATTCGGIGGIAAYGARGGIANCG-3' where I indicates an inosine residue, and Y is either T or C, R is either A or G, and N is either I or T.
Degenerate oligonucleotides were also synthesized to correspond to the peptide encoded by nucleotides 957 to 971 and used together with the above primer (SEQ ID NO:3) in a PCR reaction which amplified the PCR product encoded by nucleotides 861 to 956 from human cDNA. The sequences (SEQ ID NO:4) of this degenerate oligo pool are:
5'-GAGTCGACRTCRTCRTCIGGYTTRAA-3' where R is either A or G, and Y is either C or T.
The full length cDNA sequence has features which suggest multiple modes of FBP regulation. There is an unusual codon bias. For every amino acid with a degenerate codon, except glutamine, the FBP sequence avoids the codons preferred in a survey of 2,681 human genes (K-n. Wada, Y. Wada, F. Ishibashi, T. Gojobori and T. Ikemura. Codon usage tabulated from the GenBank genetic sequence data. Nucleic Acids Res. 20(supplement), 2111-2118 (1992)) suggesting translational regulation. The existence of alternate isoforms of FBP generated by regulation of RNA processing is suggested by multiple independent clones, half of which exhibit a precise deletion of 3 nucleotides at basepair 316, 317 and 318, removing serine 97 (see FIG. 2B). The surrounding sequence is consistent with an alternate splice acceptor site (S. M. Mount, Nuc. Acids Res. 10,459 (1982)) which could generate this deletion.
Other variations of the cDNA sequence and encoded protein were found in separate clones. In a clone labeled "25-1" an A residue was found at nucleic acid position 473, which resulted in a complementary base T and a codon specifying an isoleucine at amino acid 149.
The clone from an HL60 cDNA library labeled "3-1" contains 10 basepairs of 5' untranslated sequence, followed by an open reading frame comprised of the nucleotides from position 858 to position 1561 shown above in SEQ ID NO:1 and a unique 3' end cDNA sequence of 25 base pairs, followed by two adjacent stop codons and additional 3' untranslated sequence. The sequence of clone 3-1 is as follows (SEQ ID NO: 5):
__________________________________________________________________________
GGAATTCCGG
ATA
GAT
GTC
CCC
ATT
CCA
AGA
TTT
GCT
GTT
40
CCTTAAGGCC
TAT
CTA
CAG
GGG
TAA
GGT
TCT
AAA
CGA
CAA
Ile
Asp
Val
Pro
Ile
Pro
Arg
Phe
Ala
Val
1 5 10
GGC
ATT
GTA
ATA
GGA
AGA
AAT
GGA
GAG
ATG
ATC
AAA
AAA
79
CCG
TAA
CAT
TAT
CCT
TCT
TTA
CCT
CTC
TAC
TAG
TTT
TTT
Gly
Ile
Val
Ile
Gly
Arg
Asn
Gly
Glu
Met
Ile
Lys
Lys
15 20
ATA
CAA
AAT
GAT
GCT
GGT
GTT
CGC
ATT
CAG
TTT
AAG
CCA
118
TAT
GTT
TTA
CTA
CGA
CCA
CAA
GCG
TAA
GTC
AAA
TTC
GGT
Ile
Gln
Asn
Asp
Ala
Gly
Val
Arg
Ile
Gln
Phe
Lys
Pro
25 30 35
GAT
GAT
GGG
ACA
ACA
CCC
GAA
AGG
ATA
GCA
CAA
ATA
ACA
157
CTA
CTA
CCC
TGT
TGT
GGG
CTT
TCC
TAT
CGT
GTT
TAT
TGT
Asp
Asp
Gly
Thr
Thr
Pro
Glu
Arg
Ile
Ala
Gln
Ile
Thr
40 45
GGA
CCT
CCA
GAC
CGA
TGT
CAA
CAT
GCT
GCA
GAA
ATT
ATT
196
CCT
GGA
GGT
CTG
GCT
ACA
GTT
GTA
CGA
CGT
CTT
TAA
TAA
Gly
Pro
Pro
Asp
Arg
Cys
Gln
His
Ala
Ala
Glu
Ile
Ile
50 55 60
ACA
GAC
CTT
CTT
CGA
AGT
GTT
CAG
GCT
GGT
AAT
CCT
GGT
235
TGT
CTG
GAA
GAA
GCT
TCA
CAA
GTC
CGA
CCA
TTA
GGA
CCA
Thr
Asp
Leu
Leu
Arg
Ser
Val
Gln
Ala
Gly
Asn
Pro
Gly
65 70 75
GGA
CCT
GGA
CCT
GGT
GGT
CGA
GGA
AGA
GGT
AGA
GGT
CAA
274
CCT
GGA
CCT
GGA
CCA
CCA
GCT
CCT
TCT
CCA
TCT
CCA
GTT
Gly
Pro
Gly
Pro
Gly
Gly
Arg
Gly
Arg
Gly
Arg
Gly
Gln
80 85
GGC
AAC
TGG
AAC
ATG
GGA
CCA
CCT
GGT
GGA
TTA
CAG
GAA
313
CCG
TTG
ACC
TTG
TAC
CCT
GGT
GGA
CCA
CCT
AAT
GTC
CTT
Gly
Asn
Trp
Asn
Met
Gly
Pro
Pro
Gly
Gly
Leu
Gln
Glu
90 95 100
TTT
AAT
TTT
ATT
GTG
CCA
ACT
GGG
AAA
ACT
GGA
TTA
ATA
352
AAA
TTA
AAA
TAA
CAC
GGT
TGA
CCC
TTT
TGA
CCT
AAT
TAT
Phe
Asn
Phe
Ile
Val
Pro
Thr
Gly
Lys
Thr
Gly
Leu
Ile
105 110
ATA
GGA
AAA
GGA
GGT
GAA
ACC
ATA
AAA
AGC
ATA
AGC
CAG
391
TAT
CCT
TTT
CCT
CCA
CTT
TGG
TGT
TTT
TCG
TAT
TCG
GTC
Ile
Gly
Lys
Gly
Gly
Glu
Thr
Ile
Lys
Ser
Ile
Ser
Gln
115 120 125
CAG
TCT
GGT
GCA
AGA
ATA
GAA
CTT
CAG
AGA
AAT
CCT
CCA
430
GTC
AGA
CCA
CGT
TCT
TAT
CTT
GAA
GTC
TCT
TTA
GGA
GGT
Gln
Ser
Gly
Ala
Arg
Ile
Glu
Leu
Gln
Arg
Asn
Pro
Pro
130 135 140
CCA
AAT
GCA
GAT
CCT
AAT
ATG
AAG
TTA
TTT
ACA
ATT
CGT
469
GGT
TTA
CGT
CTA
GGA
TTA
TAC
TTC
AAT
AAA
TGT
TAA
GCA
Pro
Asn
Ala
Asp
Pro
Asn
Met
Lys
Leu
Phe
Thr
Ile
Arg
145 150
GGC
ACT
CCA
CAA
CAG
ATA
GAC
TAT
GCT
CGG
CAA
CTC
ATA
508
CCG
TGA
GGT
GTT
GTC
TAT
CTG
ATA
CGA
GCC
GTT
GAG
TAT
Gly
Thr
Pro
Gln
Gln
Ile
Asp
Tyr
Ala
Arg
Gln
Leu
Ile
155 160 165
GAA
GAA
AAG
ATT
GGT
GGC
CCA
GTA
AAT
CCT
TTA
GGG
CCA
547
CTT
CTT
TTC
TAA
CCA
CCG
GGT
CAT
TTA
GGA
AAT
CCC
GGT
Glu
Glu
Lys
Ile
Gly
Gly
Pro
Val
Asn
Pro
Leu
Gly
Pro
170 175
CCT
GTA
CCC
CAT
GGG
CCC
CAT
GGT
GTC
CCA
GGC
CCC
CAT
586
GGA
CAT
GGG
GTA
CCC
GGG
GTA
CCA
CAG
GGT
CCG
GGG
GTA
Pro
Val
Pro
His
Gly
Pro
His
Gly
Val
Pro
Gly
Pro
His
180 185 190
GGA
CCT
CCT
GGG
CCT
CCA
GGG
CCT
GGA
ACT
CCA
ATG
GGA
625
CCT
GGA
GGA
CCC
GGA
GGT
CCC
GGA
CCT
TGA
GGT
TAC
CCT
Gly
Pro
Pro
Gly
Pro
Pro
Gly
Pro
Gly
Thr
Pro
Met
Gly
195 200 205
CCA
TAC
AAC
CCT
GCA
CCT
TAT
AAT
CCT
GGA
CCA
CCA
GGC
664
GGT
ATG
TTG
GGA
CGT
GGA
ATA
TTA
GGA
CCT
GGT
GGT
CCG
Pro
Tyr
Asn
Pro
Ala
Pro
Tyr
Asn
Pro
Gly
Pro
Pro
Gly
210 215
CCG
GCT
CCT
CAT
GGT
CCT
CCA
GCC
CCA
TAT
GCT
CCC
CAG
703
GGC
CGA
GGA
GTA
CCA
GGA
GGT
CGG
GGT
ATA
CGA
GGG
GTC
Pro
Ala
Pro
His
Gly
Pro
Pro
Ala
Pro
Tyr
Ala
Pro
Gln
220 225 230
GGA
TGG
GGA
AAG
GAA
ATT
GAG
CAG
AAG
GTA
CAG
GAG
TAA
742
CCT
ACC
CCT
TTC
CTT
TAA
CTC
GTC
TTC
CAT
GTC
CTC
ATT
Gly
Trp
Gly
Lys
Glu
Ile
Glu
Gln
Lys
Val
Gln
Glu
235 240
TAG
CAATTCCCTG
TAGCTCTCAA
AGCAAATTTT
GAGCTCATTT
785
ATC
GTTAAGGGAC
ATCGAGAGTT
TCGTTTAAAA
CTCGAGTAAA
TTCTTTTTCT
GCAAGCTCAG
CAGCAGAATG
CCCAGAGTCT 825
AAGAAAAAGA
CGTTCGAGTC
GTCGTCTTAC
GGGTCTCAGA
TCCCTGGTAG
ATGCAGGTTC
CATAGCGACG
TTCTCCTGCA 865
AGGGACCATC
TACGTCCAAG
GTATCGCTGC
AAGAGGACGT
ATGCACGCTG
GTATTCTGCA
ATAGCAGGCC
ATGTTTTCCT 905
TACGTGCGAC
CATAAGACGT
TATCGTCCGG
TACAAAAGGA
TGAGCCTGGA
TGCTTTGGAG
CCAAGCTTTC
GTCCCATGCA 945
ACTCGGACCT
ACGAAACCTC
GGTTCGAAAG
CAGGGTACGT
AGGGAAACAA
CCACTTCTGG
GATGTCCGCT
GCAATCTGCT 985
TCCCTTTGTT
GGTGAAGACC
CTACAGGCGA
CGTTAGACGA
CCGGGGCTGC
AGCAACCTCA
TCAGCTCTCT
TGCCTGGAGT 1025
GGCCCCGACG
TCGTTGGAGT
AGTCGAGAGA
ACGGACCTCA
GGCTCAGCCT
GGCCTGCAGG
GCCACCAGGA
GAATGGCAGC 1065
CCGAGTCGGA
CCGGACGTCC
CGGTGGTCCT
CTTACCGTCG
AAGGATGGCG
AGGGTCCTCA
TGGCTGGAAT
TC 1097
TTCCTACCGC
TCCCAGGAGT
ACCGACCTTA
AG
__________________________________________________________________________
The amino acid sequence for the protein encoded by the cDNA sequence for clone 3-1 can be found in SEQ ID NO:6.
In addition, the clone from an activated, human peripheral blood lymphocyte cDNA library labeled "31-10" contains 24 basepairs of 5' untranslated sequence, followed by an open reading frame comprised of the basepairs from position 135 to position 1991 of the DROME sequence given in SEQ ID NO:1 with two exceptions. Clone 31-10 contains 63 nucleotides inserted at position 238 in the DROME cDNA sequence which probably result from an intron which had not been spliced out in the DROME mRNA molecule that gave rise to the 31-10 clone. The inserted nucleotides remain in the open reading frame and are translated into an alternate form of the DROME protein.
The 31-10 clone also deviates in that nucleotides from position 1807 to 1952 in the DROME cDNA sequence are deleted. This deletion shifts out of frame the stop codons which would terminate translation in the other clones. When the mRNA represented by the 31-10 cDNA is translated, additional amino acids must be added to the C-terminal end of the DROME protein.
The sequence of clone 31-10 is as follows (SEQ ID NO:7):
__________________________________________________________________________
GAATTCCGGA
CGACAGCGGC
TCTG
AGA
GCC
CGG
CAG
ATT
GCA
42
CTTAAGGCCT
GCTGTCGCCG
AGAC
TCT
CGG
GCC
GTC
TAA
CGT
Arg
Ala
Arg
Gln
Ile
Ala
1 5
GCA
AAA
ATT
GGA
GGT
GAT
GCA GGG
ACA
TCA
CTG
AAT
TCA
81
CGT
TTT
TAA
CCT
CCA
CTA
CGT CCC
TGT
AGT
GAC
TTA
AGT
Ala
Lys
Ile
Gly
Gly
Asp
Ala Gly
Thr
Ser
Leu
Asn
Ser
10 15
AAT
GAC
TAT
GGT
TAT
GGG
GGA CAA
AAA
AGA
CCT
TTA
GAA
120
TTA
CTG
ATA
CCA
ATA
CCC
CCT GTT
TTT
TCT
GGA
AAT
CTT
Asn
Asp
Tyr
Gly
Tyr
Gly
Gly Gln
Lys
Arg
Pro
Leu
Glu
20 25 30
GAT
GGA
GAT
GGC
TCT
TGG
ACA AGT
CCG
AGC
AGT
ACA
ACA
159
CTA
CCT
CTA
CCG
AGA
ACC
TGT TCA
GGC
TCG
TCA
TGT
TGT
Asp
Gly
Asp
Gly
Ser
Trp
Thr Ser
Pro
Ser
Ser
Thr
Thr
35 40 45
CAC
TGG
GAG
GGA
ATG
CCC
TCT CCT
TTT
AAA
GAT
CAA
CCA
198
GTG
ACC
CTC
CCT
TAC
GGG
AGA GGA
AAA
TTT
CTA
GTT
GGT
His
Trp
Glu
Gly
Met
Pro
Ser Pro
Phe
Lys
Asp
Gln
Pro
50 55
GAT
GCT
AAG
AAA
GTT
GCT
CCT CAA
AAT
GAC
TCT
TTT
GGA
237
CTA
CGA
TTC
TTT
CAA
CGA
GGA GTT
TTA
CTG
AGA
AAA
CCT
Asp
Ala
Lys
Lys
Val
Ala
Pro Gln
Asn
Asp
Ser
Phe
Gly
60 65 70
ACA
CAG
TTA
CCA
CCG
ATG
CAT CAG
CAG
CAA
AGA
TCT
GTA
276
TGT
GTC
AAT
GGT
GGC
TAC
GTA GTC
GTC
GTT
TCT
AGA
CAT
Thr
Gln
Leu
Pro
Pro
Met
His Gln
Gln
Gln
Arg
Ser
Val
75 80
ATG
ACA
GAA
GAA
TAC
AAA
GTT CCA
GAT
GGA
ATG
GTT
GGA
315
TAC
TGT
CTT
CTT
ATG
TTT
CAA GGT
CTA
CCT
TAC
CAA
CCT
Met
Thr
Glu
Glu
Tyr
Lys
Val Pro
Asp
Gly
Met
Val
Gly
85 90 95
TTC
ATA
ATT
GGC
AGA
GGA
GGT GAA
CAG
ATC
TCA
CGC
ATA
354
AAG
TAT
TAA
CCG
TCT
CCT
CCA CTT
GTC
TAG
AGT
GCG
TAT
Phe
Ile
Ile
Gly
Arg
Gly
Gly Glu
Gln
Ile
Ser
Arg
Ile
100 105 110
CAA
CAG
GAA
TCT
GGA
TGC
AAA ATA
CAG
ATA
GCT
CCT
GAC
393
GTT
GTC
CTT
AGA
CCT
ACG
TTT TAT
GTC
TAT
CGA
GGA
CTG
Gln
Gln
Glu
Ser
Gly
Cys
Lys Ile
Gln
Ile
Ala
Pro
Asp
115 120
AGT
GGT
GGC
CTT
CCA
GAA
AGG TCC
TGT
ATG
TTA
ACT
GGA
432
TCA
CCA
CCG
GAA
GGT
CTT
TCC AGG
ACA
TAC
AAT
TGA
CCT
Ser
Gly
Gly
Leu
Pro
Glu
Arg Ser
Cys
Met
Leu
Thr
Gly
125 130 135
ACA
CCT
GAA
TCT
GTC
CAG
TCA GCA
AAA
CGG
TTA
CTG
GAC
471
TGT
GGA
CTT
AGA
CAG
GTC
AGT CGT
TTT
GCC
AAT
GAC
CTG
Thr
Pro
Glu
Ser
Val
Gln
Ser Ala
Lys
Arg
Leu
Leu
Asp
140 145
CAG
ATT
GTT
GAA
AAA
GGA
AGA CCA
GCT
CCT
GGC
TTC
CAT
510
GTC
TAA
CAA
CTT
TTT
CCT
TCT GGT
CGA
GGA
CCG
AAG
GTA
Gln
Ile
Val
Glu
Lys
Gly
Arg Pro
Ala
Pro
Gly
Phe
His
150 155 160
CAT
GGC
GAT
GGA
CCG
GGA
AAT GCA
GTT
CAA
GAA
ATC
ATG
549
GTA
CCG
CTA
CCT
GGC
CCT
TTA CGT
CAA
GTT
CTT
TAG
TAC
His
Gly
Asp
Gly
Pro
Gly
Asn Ala
Val
Gln
Glu
Ile
Met
165 170 175
ATT
CCA
GCT
AGC
AAG
GCA
GGA TTA
GTC
ATT
GGA
AAA
GGG
588
TAA
GGT
CGA
TCG
TTC
CGT
CCT AAT
CAG
TAA
CCT
TTT
CCC
Ile
Pro
Ala
Ser
Lys
Ala
Gly Leu
Val
Ile
Gly
Lys
Gly
180 185
GGA
GAA
ACT
ATT
AAA
CAG
CTT CAG
GAA
CGG
GCT
GGA
GTT
627
CCT
CTT
TGA
TAA
TTT
GTC
GAA GTC
CTT
GCC
CGA
CCT
CAA
Gly
Glu
Thr
Ile
Lys
Gln
Leu Gln
Glu
Arg
Ala
Gly
Val
190 195 200
AAA
ATG
GTT
ATG
ATT
CAA
GAC GGG
CCG
CAG
AAC
ACT
GGT
666
TTT
TAC
CAA
TAC
TAA
GTT
CTG CCC
GGC
GTC
TTG
TGA
CCA
Lys
Met
Val
Met
Ile
Gln
Asp Gly
Pro
Gln
Asn
Thr
Gly
205 210
GCT
GAC
AAA
CCT
CTT
AGG
ATT ACA
GGA
GAC
CCA
TAT
AAA
705
CGA
CTG
TTT
GGA
GAA
TCC
TAA TGT
CCT
CTG
GGT
ATA
TTT
Ala
Asp
Lys
Pro
Leu
Arg
Ile Thr
Gly
Asp
Pro
Tyr
Lys
215 220 225
GTT
CAA
CAA
GCC
AAG
GAA
ATG GTG
TTA
GAG
TTA
ATT
CGT
744
CAA
GTT
GTT
CGG
TTC
CTT
TAC CAC
AAT
CTC
AAT
TAA
GCA
Val
Gln
Gln
Ala
Lys
Glu
Met Val
Leu
Glu
Leu
Ile
Arg
230 235 240
GAT
CAA
GGC
GGT
TTC
AGA
GAA GTT
CGG
AAT
GAG
TAT
GGG
783
CTA
GTT
CCG
CCA
AAG
TCT
CTT CAA
GCC
TTA
CTC
ATA
CCC
Asp
Gln
Gly
Gly
Phe
Arg
Glu Val
Arg
Asn
Glu
Tyr
Gly
245 250
TCA
AGA
ATA
GGA
GGA
AAT
GAA GGG
ATA
GAT
GTC
CCC
ATT
822
AGT
TCT
TAT
CCT
CCT
TTA
CTT CCC
TAT
CTA
CAG
GGG
TAA
Ser
Arg
Ile
Gly
Gly
Asn
Glu Gly
Ile
Asp
Val
Pro
Ile
255 260 265
CCA
AGA
TTT
GCT
GTT
GGC
ATT GTA
ATA
GGA
AGA
AAT
GGA
861
GGT
TCT
AAA
CGA
CAA
CCG
TAA CAT
TAT
CCT
TCT
TTA
CCT
Pro
Arg
Phe
Ala
Val
Gly
Ile Val
Ile
Gly
Arg
Asn
Gly
270 275
GAG
ATG
ATC
AAA
AAA
ATA
CAA AAT
GAT
GCT
GGT
GTT
CGC
900
CTC
TAC
TAG
TTT
TTT
TAT
GTT TTA
CTA
CGA
CCA
CAA
GCG
Glu
Met
Ile
Lys
Lys
Ile
Gln Asn
Asp
Ala
Gly
Val
Arg
280 285 290
ATT
CAG
TTT
AAG
CCA
GAT
GAT GGG
ACA
ACA
CCC
GAA
AGG
939
TAA
GTC
AAA
TTC
GGT
CTA
CTA CCC
TGT
TGT
GGG
CTT
TCC
Ile
Gln
Phe
Lys
Pro
Asp
Asp Gly
Thr
Thr
Pro
Glu
Arg
295 300 305
ATA
GCA
CAA
ATA
ACA
GGA
CCT CCA
GAC
CGA
TGT
CAA
CAT
978
TAT
CGT
GTT
TAT
TGT
CCT
GGA GGT
CTG
GCT
ACA
GTT
GTA
Ile
Ala
Gln
Ile
Thr
Gly
Pro Pro
Asp
Arg
Cys
Gln
His
310 315
GCT
GCA
GAA
ATT
ATT
ACA
GAC CTT
CTT
CGA
AGT
GTT
CAG
1017
CGA
CGT
CTT
TAA
TAA
TGT
CTG GAA
GAA
GCT
TCA
CAA
GTC
Ala
Ala
Glu
Ile
Ile
Thr
Asp Leu
Leu
Arg
Ser
Val
Gln
320 325 330
GCT
GGT
AAT
CCT
GGT
GGA
CCT GGA
CCT
GGT
GGT
CGA
GGA
1056
CGA
CCA
TTA
GGA
CCA
CCT
GGA CCT
GGA
CCA
CCA
GCT
CCT
Ala
Gly
Asn
Pro
Gly
Gly
Pro Gly
Pro
Gly
Gly
Arg
Gly
335 340
AGA
GGT
AGA
GGT
CAA
GGC
AAC TGG
AAC
ATG
GGA
CCA
CCT
1095
TCT
CCA
TCT
CCA
GTT
CCG
TTG ACC
TTG
TAC
CCT
GGT
GGA
Arg
Gly
Arg
Gly
Gln
Gly
Asn Trp
Asn
Met
Gly
Pro
Pro
345 350 355
GGT
GGA
TTA
CAG
GAA
TTT
AAT TTT
ATT
GTG
CCA
ACT
GGG
1134
CCA
CCT
AAT
GTC
CTT
AAA
TTA AAA
TAA
CAC
GGT
TGA
CCC
Gly
Gly
Leu
Gln
Glu
Phe
Asn Phe
Ile
Val
Pro
Thr
Gly
360 365 370
AAA
ACT
GGA
TTA
ATA
ATA
GGA AAA
GGA
GGT
GAA
ACC
ATA
1173
TTT
TGA
CCT
AAT
TAT
TAT
CCT TTT
CCT
CCA
CTT
TGG
TGT
Lys
Thr
Gly
Leu
Ile
Ile
Gly Lys
Gly
Gly
Glu
Thr
Ile
375 380
AAA
AGC
ATA
AGC
CAG
CAG
TCT GGT
GCA
AGA
ATA
GAA
CTT
1212
TTT
TCG
TAT
TCG
GTC
GTC
AGA CCA
CGT
TCT
TAT
CTT
GAA
Lys
Ser
Ile
Ser
Gln
Gln
Ser Gly
Ala
Arg
Ile
Glu
Leu
385 390 395
CAG
AGA
AAT
CCT
CCA
CCA
AAT GCA
GAT
CCT
AAT
ATG
AAG
1251
GTC
TCT
TTA
GGA
GGT
GGT
TTA CGT
CTA
GGA
TTA
TAC
TTC
Gln
Arg
Asn
Pro
Pro
Pro
Asn Ala
Asp
Pro
Asn
Met
Lys
400 405
TTA
TTT
ACA
ATT
CGT
GGC
ACT CCA
CAA
CAG
ATA
GAC
TAT
1290
AAT
AAA
TGT
TAA
GCA
CCG
TGA GGT
GTT
GTC
TAT
CTG
ATA
Leu
Phe
Thr
Ile
Arg
Gly
Thr Pro
Gln
Gln
Ile
Asp
Tyr
410 415 420
GCT
CGG
CAA
CTC
ATA
GAA
GAA AAG
ATT
GGT
GGC
CCA
GTA
1329
CGA
GCC
GTT
GAG
TAT
CTT
CTT TTC
TAA
CCA
CCG
GGT
CAT
Ala
Arg
Gln
Leu
Ile
Glu
Glu Lys
Ile
Gly
Gly
Pro
Val
425 430 435
AAT
CCT
TTA
GGG
CCA
CCT
GTA CCC
CAT
GGG
CCC
CAT
GGT
1368
TTA
GGA
AAT
CCC
GGT
GGA
CAT GGG
GTA
CCC
GGG
GTA
CCA
Asn
Pro
Leu
Gly
Pro
Pro
Val Pro
His
Gly
Pro
His
Gly
440 445
GTC
CCA
GGC
CCC
CAT
GGA
CCT CCT
GGG
CCT
CCA
GGG
CCT
1407
CAG
GGT
CCG
GGG
GTA
CCT
GGA GGA
CCC
GGA
GGT
CCC
GGA
Val
Pro
Gly
Pro
His
Gly
Pro Pro
Gly
Pro
Pro
Gly
Pro
450 455 460
GGA
ACT
CCA
ATG
GGA
CCA
TAC AAC
CCT
GCA
CCT
TAT
AAT
1446
CCT
TGA
GGT
TAC
CCT
GGT
ATG TTG
GGA
CGT
GGA
ATA
TTA
Gly
Thr
Pro
Met
Gly
Pro
Tyr Asn
Pro
Ala
Pro
Tyr
Asn
465 470
CCT
GGA
CCA
CCA
GGC
CCG
GCT CCT
CAT
GGT
CCT
CCA
GCC
1485
GGA
CCT
GGT
GGT
CCG
GGC
CGA GGA
GTA
CCA
GGA
GGT
CGG
Pro
Gly
Pro
Pro
Gly
Pro
Ala Pro
His
Gly
Pro
Pro
Ala
475 480 485
CCA
TAT
GCT
CCC
CAG
GGA
TGG GGA
AAT
GCA
TAT
CCA
CAC
1524
GGT
ATA
CGA
GGG
GTC
CCT
ACC CCT
TTA
CGT
ATA
GGT
GTG
Pro
Tyr
Ala
Pro
Gln
Gly
Trp Gly
Asn
Ala
Tyr
Pro
His
490 495 500
TGG
CAG
CAG
CAG
GCT
CCT
CCT GAT
CCA
GCT
AAG
GCA
GGA
1563
ACC
GTC
GTC
GTC
CGA
GGA
GGA CTA
GGT
CGA
TTC
CGT
CCT
Trp
Gln
Gln
Gln
Ala
Pro
Pro Asp
Pro
Ala
Lys
Ala
Gly
505 510
ACG
GAT
CCA
AAT
TCA
GCA
GCT TGG
GCT
GCT
TAT
TAC
GCT
1602
TGC
CTA
GGT
TTA
AGT
CGT
CGA ACC
CGA
CGA
ATA
ATG
CGA
Thr
Asp
Pro
Asn
Ser
Ala
Ala Trp
Ala
Ala
Tyr
Tyr
Ala
515 520 525
CAC
TAT
TAT
CAA
CAG
CAA
GCA CAG
CCA
CCA
CCA
GCA
GCC
1641
GTG
ATA
ATA
GTT
GTC
GTT
CGT GTC
GGT
GGT
GGT
CGT
CGG
His
Tyr
Tyr
Gln
Gln
Gln
Ala Gln
Pro
Pro
Pro
Ala
Ala
530 535
CCT
GCA
GGT
GCA
CCA
ACT
ACA ACT
CAA
ACT
AAT
GGA
CAA
1680
GGA
CGT
CCA
CGT
GGT
TGA
TGT TGA
GTT
TGA
TTA
CCT
GTT
Pro
Ala
Gly
Ala
Pro
Thr
Thr Thr
Gln
Thr
Asn
Gly
Gln
540 545 550
GGA
GAT
CAG
CAG
AAT
CCA
GCC CCA
GCT
GGA
CAG
GTT
GAT
1719
CCT
CTA
GTC
GTC
TTA
GGT
CGG GGT
CGA
CCT
GTC
CAA
CTA
Gly
Asp
Gln
Gln
Asn
Pro
Ala Pro
Ala
Gly
Gln
Val
Asp
555 560 565
TAT
ACC
AAG
GCT
TGG
GAA
GAG TAC
TAC
AAG
AAA
ATG
GGG
1758
ATA
TGG
TTC
CGA
ACC
CTT
CTC ATG
ATG
TTC
TTT
TAC
CCC
Tyr
Thr
Lys
Ala
Trp
Glu
Glu Tyr
Tyr
Lys
Lys
Met
Gly
570 575
CCA
ATA
ATA
AGA
AGT
GGA
CAA TAC
AGT
ATT
TGC
TTC 1794
GGT
TAT
TAT
TCT
TCA
CCT
GGT ATG
TCA
TAA
ACG
AAG
Pro
Ile
Ile
Arg
Ser
Gly
Gln Tyr
Ser
Ile
Cys
Phe
580 585 590
AGGAATTCC 1803
TCCTTAAGG
__________________________________________________________________________
The amino acid sequence for the protein encoded by the cDNA sequence for clone 31-10 can be found in SEQ ID NO:8.
The following examples are for illustration only, and should not be used to limit the scope of the invention.
The cloned cDNA encodes a protein with FUSE binding activity, referred to as DROME or FUSE binding protein ("FBP"). The coding region contained in the HL60 clones was expressed as a fusion protein in bacteria, purified from extracts and tested with electrophoretic mobility shift assays (EMSAs) for binding to an oligonucleotide containing the FUSE site. Recombinant fusion protein (GST-FBP) bound effectively to the FUSE oligonucleotide.
The open reading frame present in the HL60-1 and HL60-2 clones (amino acid residues 145 to 511) was expressed as a Glutathione-S-transferase fusion protein (GST-FBP) in the pGEX system. The inserts from the HL60-1 and HL60-2 clones were spliced together and the open reading frame region subcloned into the Sma I site of the pGEX-2T plasmid (AMARAD Corp.; D. B. Smith and K. S. Johnson, Gene. 67,31 (1988)) to express a GST-FBP fusion protein. Recombinant protein was purified from E. coli extracts on a glutathione-agarose matrix (Sigma Chemical Co.), GST alone was prepared from a pGEX-2T plasmid with no insert in a similar manner. Fusion proteins were eluted with 20 mM glutathione, checked for purity, correct size and concentration with SDS PAGE.
Purified recombinant proteins were incubated with double stranded, 32 P labeled, oligonucleotide (Probe) in the presence or absence of the indicated quantity of unlabeled, double stranded oligonucleotide as competitor, and subjected to EMSA. (M. Fried and D. M. Crothers, Nucleic Acids Res. 9, 6505 (1981)).
DNA binding assays were performed with an equivalent amount of GST-FBP or GST alone incubated in 25 mM Tris, 200 mM Glycine, 1mM EDTA, 0.5 mg/ml BSA, 0.1% Tween20, 10% glycerol, 100 μg/ml poly(dI:dC), and 0.2 ng of labeled probe. The probe was prepared by annealing two complementary synthetic oligonucleotides and 5' end labeling with T4 polynucleotide kinase in the presence of γ-32 P-ATP. The top strand of the FUSE oligonucleotide was 5'-GATCACAAAATAAAAATCCCGAGGGAATATAG-3' (SEQ ID NO:11). The top strand of Mut A was 5'-GATCACAAAAAATCCGAGGAATATAG-3' (SEQ ID NO:12) (lower case indicates changes from FUSE oligomer sequence). The top strand of CRE was 5'-GATCTGACGTCATGACTGACGTCATGACTGACGTCATCA-3' (SEQ ID NO:13). The top strand of CTE was 5'-AATTCTCCTCCCCACCTTCCCCACCTCCCCA-3' (SEQ ID NO:14). Reaction mixtures were incubated 30 minutes at room temperature and protein-DNA complexes resolved by electrophoresis on a 4.2% acrylamide gel in 25 mM Tris, 200 mM Glycine, 1 mM EDTA buffer.
Radioactive DNA and DNA-protein complexes were visualized by autoradiography (FIG. 3A). Glutathione-S-transferase (GST) alone did not bind to the probe. Competitor oligonucleotides were as follows: Mut A, a mutant FUSE oligonucleotide with 17 residues covering the binding site changed; CRE, CAMP response element; CTE, CT element in c-myc 5' flanking region.
Recombinant FBP and purified human FBP were also demonstrated to display similar DNA binding specificity. Full length GST-FBP and human FBP purified from HL60 cells by oligo-affinity chromatography were assayed for FUSE binding using EMSA. The DNA-protein complexes formed by human FBP in the presence of 25ng of the indicated non-radioactive competitors, visualized by autoradiography, are shown in the upper panel of FIG. 3B. The lower panel of FIG. 3B shows recombinant GST-FBP binding the FUSE probe in the presence of 25 ng of the same competitors. Competitor oligonucleotides are the same as in (A) with the addition of Mut B and Mut C, each with a different mutation in the FUSE sequence. The top strand of Mut B was 5'-GATCACAAAATAAAAAATggacgccGAATATAG-3' (SEQ ID NO:15) and the top strand of Mut C was 5' AATTCTCCTCCCACCTCCCACCTCCCA-GATCACAActacgtgctaggCGAGGGAATATAG-3' (SEQ ID NO:16).
Recombinant fusion protein also failed to bind to the nonhomologous oligonucleotides when they were used as radioactive probes. The full length recombinant protein, and purified human FBP shared the same DNA binding specificity as indicated by challenging their binding to FUSE with a panel of competitors (FIG. 3B). An excess of cold FUSE significantly reduced binding of human and recombinant FBP to the radioactive probe; the same amount of the Mut B oligo which has minor changes in the FUSE sequence, bound both as well; but more divergent oligonucleotides did not significantly compete for binding to either protein.
Recombinant FBP activated expression of a chloramphenicol acetyltransferase (CAT) gene under the control of human c-myc regulatory sequence was tested in co-transection experiments (FIG. 4).
Full length FBP was inserted into an expression vector downstream from the cytomegalovirus enhancer/promoter in both the sense and anti-sense orientations. These plasmids or the expression vector alone were transfected into U937 cells with another vector containing a CAT gene under the control of myc regulator sequence (myc-CAT) or the reporter plasmid with the FUSE site deleted (ΔFUSE myc-CAT). The myc-CAT and ΔFUSE myc-CAT reporter plasmids are derivatives of pMP CAT (M. Avigan, B. Strober, and D. Levens, J. Biol. Chem., 265,18538 (1990)) with the deletion of a 580 bp Nsi I fragment from position -669 to -1249 relative to the myc P1 promoter. The Nsi I deletion results in more consistent CAT expression without disrupting FUSE mediated activation. The ΔFUSE myc-CAT plasmid was produced from pMP CAT by cutting the parent plasmid at the Ava I site located in the FUSE element, partial digestion with mung bean exonuclease (Bethesda Research Labs) and religation. The DNA sequence of the deleted plasmid revealed that 68 nucleotides between position -1493 and -1561 relative to myc P1 were removed, completely deleting the FUSE element.
U937 cells (5×106) were electroporated (Cell-porator, BRL, 200V, 1180 μF) with 10 μg of each plasmid in 250 μl of RPMI supplemented with 10% fetal calf serum. Transfected cells were added to 8 ml additional medium and incubated 48 hours before harvesting for CAT assays. (C. M. Gorman, L. F. Moffat, B. H. Howard, Mol. Cell. Biol. 2, 1044 (1982)). The same quantity of protein (Bradford method) was assayed for each extract. Two independent transfections are shown for each plasmid combination.
In the presence of the FBP expression plasmid (FBP-sense), the FUSE containing myc-CAT plasmid (myc-CAT) gave a 5-fold higher level of CAT activity than in the presence of the expression vector alone. In contrast, this level of stimulation did not occur when the reporter plasmid had a 68 bp deletion that eliminated the FUSE site (ΔFUSE myc-CAT) nor when the expression plasmid contained the FBP cDNA in the reverse orientation (FBP-antisense). The minor increase in the ΔFUSE myc-CAT expression with FBP co-transection could be due to secondary binding sites present in the myc regulatory sequence.
FBP shows no significant homology to known DNA binding motifs in a search against the GenBank databases, however the primary amino acid sequence of FBP has distinct structural features which could constitute a DNA binding domain, as well as other features with potential functional significance.
A Pustell matrix self-comparison of FBP revealed three domains in the primary amino acid sequence, each containing internally repeated sequences.
The Pustell matrix (J. Pustell, F. C. Kafatos, N. A. R. 10, 4765 (1982); W. R. Pearson, Meth. Enzymol. 183, 63 (1990); MacVector software, International Biotechnologies, Inc.) revealed the repeated sequences in the three domains of FBP. The repeated glycines in the N-terminal domain, the four copies of the FBP repeat in the central domain, the proline/glycine rich segment and the WAAYY (where W is Trp, A is Ala and Y is Tyr) repeat in the C-terminal domain all score as diagonals. A window size of 20 residues and a minimum score of 35% were used for this analysis. The diagram above the box in FIG. 5A symbolizes the structures in FBP: striped fill indicates glycine rich segments, the solid boxes indicate the FBP repeats, and the shaded boxes represent amphipathic helices.
The amino-terminal domain is comprised of 106 amino acids featuring a string of 11 repeated glycine residues and the first of 5 predicted amphipathic alpha helices found in FBP. Following the helix, residues 63 to 106 are enriched for glutamine (16%) relative to the whole protein (9%).
The central and largest FBP domain contains structures sufficient for sequence specific binding because a recombinant protein comprised of only this domain bound DNA specifically. The central domain is made up of four evenly spaced units each unit containing (1) a highly conserved 30 residue segment termed here the FBP repeat (FIG. 5B), followed by (2) a potential amphipathic alpha helix (FIG. 5C). Each FBP repeat is comprised of three structures in sequential order: a region of predicted β-sheet, separated by a turn at conserved glycines 13 and 14, from a segment of alpha-helix as predicted by Chou-Fasman and Robson-Garnier analysis. (P. Y. Chou and G. D. Fasman, "Prediction of the secondary structure of proteins from their amino acid sequence." Adv. Enzymol. Relat. Areas Mol. Biol. 47:45-148 (1978); B. Robson and E. Suzuki, "Conformational properties of amino acid residues in globular proteins." J. Mol. Biol. 107:327-356 (1976); J. Garnier, D. J. Osguthorpe, and B. Robson, "Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins." J. Mol. Biol. 120:97-120 (1978). Analysis was performed with MacVector Software (International Biotechnologies, Inc.)). The FBP repeats are found at amino acid positions 107-136, 192-221, 282-311 and 383-412.
The invariant glycine at position 25 may not disrupt helix formation due to the hydrophilic character of neighboring residues in the predicted helix. (L. Serrano, J. L. Neira, J. Sancho, and A. R. Fersht, "Effect of alanine versus glycine in alpha-helices on protein stability." Nature, 356:453-455 (1992)). This sheet-turn-helix pattern evokes the DNA-binding domain of the papilloma virus E2 protein. (R. S. Hegde, S. R. Grossman, L. A. Laimins, and P. Sigler, "Crystal structure at 1.7 A of the bovine papillomavirus-1 E2 DNA-binding domain bound to its DNA target." Nature 359:505-512 (1992)). An amphipathic helix follows each repeat after a spacer of 18-21 residues. The amphipathic helices are found at amino acid positions 155-170, 242-258, 330-347 and 434-447. The amphipathic character of the four central domain helices extends uniformly over their 16-18 residue lengths with the exception of hydrophilic residues at position 2 (FIG. 5C). The lack of a hydrophobic residue at this position deviates from the heptad repeat pattern shown to be important in coiled-coiled interactions. (F. H. C. Crick, Nature. 170, 882 (1952)). The helices are not flanked by basic regions as in basic-coiled-coil (S. C. Harrison, "Structural taxonomy of DNA-binding proteins." Nature, 353:715-719 (1991); W. H. Landschulz, P. F. Johnson, and S. L. McKnight. Science, 240:1759-1764 (1988)) or helix-loop-helix motifs. (C. Murre, P. Schonleber McCaw, and D. Baltimore, "A new DNA binding and dimerization motif in immunoglobulin enhancer binding, daughterless, MyoD, and myc proteins." Cell 56:777-783 (1989)). Thus the repeat-helix unit found in the central domain of FBP does not resemble known DNA binding motifs.
The C-terminal domain is separated from the central domain by a highly flexible, proline-glycine rich segment. This domain is also glutamine rich (22/140=15.7% from amino acid 505 to the C-terminal) and tyrosine rich (13/140=9.3%) including repeated tyrosine dyad motifs some of which conform to a tyrosine phosphorylation recognition site. (J. A. Cooper, F. S. Esch, S. S. Taylor, and T. Hunter, "Phosphorylation sites in enolase and lactate dehydrogenase utilized by tyrosine protein kinases in vivo and in vitro." J. Biol. Chem. 259:7835-7841 (1984)).
To define a minimum DNA binding motif within the central domain, further truncations were constructed as well as short insertions to disrupt the predicted structures. Mutant constructs were expressed as bacterial fusion proteins, purified and tested for DNA binding to the FUSE oligonucleotide.
The plasmid which encodes the 278-511 mutant was constructed by inserting an EcoR I-Sac I fragment from the HL60-2 clone into the pGEX-1 vector. The fusion protein contains amino acids 278-511 of FBP plus the residues KEIEQKVQE (SEQ ID NO:17) (where K is Lys, E is Glu, I is Ile, Q is Gln, and V is Val) at the carboxyterminal end stopping at a termination codon unique to the HL60-2 clone. The 278-474 encoding plasmid was constructed from the 278-511 plasmid by inserting a 12 bp double stranded oligonucleotide with the sequence TTAGTTAACTAA (SEQ ID NO:18) into an Sfi I site. This oligonucleotide encodes stop codons in all 3 reading frames so that a truncated protein is produced. The 278-372 encoding plasmid was similarly constructed by inserting the termination oligonucleotide into a Dra III site. The 298-511 encoding plasmid was constructed from the 278-511 plasmid by deleting a fragment between the BamH I site in the pGEX vector and a Bcl I in the FBP cDNA.
The ability of each construct to bind (+) or failure to bind (-) the FUSE DNA sequence is indicated at the right in FIG. 6A. The open boxes labeled "GST" indicate the glutathione-S-transferase fragment present in the fusion proteins. Other symbols are the same as in FIG. 5A.
The truncated protein containing the third and fourth units of the central domain and the proline/glycine rich region (278-511, FIG. 6A) bound DNA with the same specificity as full length DROME or FPB (FIG. 6C). A truncation mutant lacking a portion of the proline-glycine rich region (278-474) still bound DNA. In contrast, the removal of FBP repeat 4 and the rest of the C-terminus in a mutant with a larger truncation (278-372) or removal of the NH2-terminal portion of repeat 3 (298-511) abrogated binding. These truncations suggest that at least two intact FBP repeat-helix units are required to constitute a DNA binding domain and that at least half of the Pro-Gly region is not required.
Insertion mutants further demonstrated the importance of two repeat-helix units for DNA binding. Three to six amino acids were introduced at eight positions throughout the 278-511 construct (FIG. 6B) without altering the reading frame.
The insertion mutants were constructed by cutting the 278-511 plasmid at restriction sites, treating the ends with the large fragment of DNA polymerase I (if not already blunt) and inserting linkers of 8, 10 or 12 base pairs. The appropriate length was chosen to insert a small number of amino acids but restore the original reading frame leaving the rest of the sequence unchanged. Insertion mutant 1 had the amino acids ArgIleArg added between GST and the FBP polypeptide. Mutant 2 had residues IleGlySerArgIleArg (SEQ ID NO:19) added after the Met at position 297 in FBP, mutant 3 had IleArgIleArg (SEQ ID NO:20) added after Met297. Mutant 4 had ProArgIleArgGlu (SEQ ID NO:21) added while deleting Gln at amino acid 333. Mutant 5 had GlyIlePro added after amino acid 336. Mutant 6 had GlyIleProArg (SEQ ID NO:22) added after amino acid 343. Mutant 7 ArgIleArg added after amino acid 373. Mutant 8 had ArgAspProAla (SEQ ID NO:23) added while deleting GlnSer after amino acid 404. Mutant 9 had ArgGlySerGly (SEQ ID NO:24) inserted while deleting Pro after amino acid 475. All constructs were confirmed by DNA sequencing. The DNA binding property of each insertion mutant is indicated by a +, - or R (reduced binding) below the insertion site.
An insertion at the junction between GST and the FBP sequence (insertion mutant number 1) does not effect binding. Neither do insertions into the region between helix 4 and repeat 4 (number 7) or the Pro-Gly region (number 9). In contrast, insertions 2 and 3 in repeat three, insertions 4 and 5 in helix four, and insertion 8 in repeat four all resulted in mutant proteins failing to bind DNA. Insertion number 6, near the end of helix four, reduced but did not eliminate binding.
The combined evidence from RNA expression, DNA binding specificity and transection experiments indicate the presence of a human FUSE binding protein (referred to as FBP or DROME) that activates myc expression. FBP binds DNA through a novel, repeated motif. The presence of four sets of the repeat-helix unit in FBP, when only two are required for FUSE binding, suggests that this protein has the potential to form at least two binding sites. A dual binding capability may be important for its cellular function. For example, binding of FBP could facilitate DNA looping which stimulates the interaction of distant elements, a mechanism consistent with the far upstream location and the requirement for additional regulatory sequence to observe the stimulatory effect of the FUSE site.
The purified FUSE binding protein is also useful in the production of monoclonal antibodies. Thus, a mouse is injected with purified FUSE binding protein, or a fragment thereof, which activates a number of B-lymphocytes in the mouse which produce antibodies against the protein. The mouse is sacrificed and spleen lymphoid cells, containing large quantities of these B-lymphocytes, are isolated and tested to assure production of antibody to FUSE binding protein. The lymphocytes producing antibody to FUSE binding protein are then fused with mouse plasmacytoma cells, ensuring a reproducible source of monoclonal antibody (Kohler, G. and Milstein, C. Nature, 256:495-97 (1975)). These antibodies, or fragments thereof, can then be used to detect and quantitate the FUSE binding protein.
The anti-sense sequence for the FUSE binding protein cDNA is useful therapeutically to arrest cell development in a target area of uncontrolled cell growth. An expression vector or oligonucleotide is constructed incorporating the anti-sense sequence of the FUSE binding protein cDNA. The oligonucleotide or vector is then incorporated into the cells of the target area, and acts to antagonize or block expression of the FUSE binding protein, inhibiting cell division and proliferation.
This is effective, for example, in arresting tumor cell growth, or an unwarranted immune response (i.e., arresting B- or T-lymphocytes).
The invention has been described in detail with particular reference to a preferred embodiment thereof, but it will be understood that the invention is capable of other and different embodiments. As is readily apparent to those skilled in the art, variations and modifications can be effected within the spirit and scope of the invention. Accordingly, the foregoing disclosure, description, and figures are for illustrative purposes only, and do not in any way limit the invention, which is defined only by the claims.
__________________________________________________________________________
SEQUENCE LISTING
(1) GENERAL INFORMATION:
(iii) NUMBER OF SEQUENCES: 24
(2) INFORMATION FOR SEQ ID NO: 1:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2384
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: Double
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: No
(iv) ANTI-SENSE: YES
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Human
(H) CELL LINE: HL60
(ix) FEATURE:
(D) OTHER INFORMATION: 473 bp
variable region where R is A or G.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
GCGGCAGCGGCTCTTATAGTGCAACCATGGCAGACTATTCAACA44
GTGCCTCCCCCCTCTTCTGGCTCAGCTGGTGGCGGTGGT83
GGCGGCGGTGGTGGTGGAGGAGTTAACGACGCTTTCAAA122
GATGCACTGCAGAGAGCCCGGCAGATTGCAGCAAAAATT161
GGAGGTGATGCAGGGACATCACTGAATTCAAATGACTAT200
GGTTATGGGGGACAAAAAAGACCTTTAGAAGATGGAGAT239
CAACCAGATGCTAAGAAAGTTGCTCCTCAAAATGACTCT278
TTTGGAACACAGTTACCACCGATGCATCAGCAGCAAAGC317
AGATCTGTAATGACAGAAGAATACAAAGTTCCAGATGGA356
ATGGTTGGATTCATAATTGGCAGAGGAGGTGAACAGATC395
TCACGCATACAACAGGAATCTGGATGCAAAATACAGATA434
GCTCCTGACAGTGGTGGCCTTCCAGAAAGGTCCTGTATR473
TTAACTGGAACACCTGAATCTGTCCAGTCAGCAAAACGG512
TTACTGGACCAGATTGTTGAAAAAGGAAGACCAGCTCCT551
GGCTTCCATCATGGCGATGGACCGGGAAATGCAGTTCAA590
GAAATCATGATTCCAGCTAGCAAGGCAGGATTAGTCATT629
GGAAAAGGGGGAGAAACTATTAAACAGCTTCAGGAACGG668
GCTGGAGTTAAAATGGTTATGATTCAAGACGGGCCGCAG707
AACACTGGTGCTGACAAACCTCTTAGGATTACAGGAGAC746
CCATATAAAGTTCAACAAGCCAAGGAAATGGTGTTAGAG785
TTAATTCGTGATCAAGGCGGTTTCAGAGAAGTTCGGAAT824
GAGTATGGGTCAAGAATAGGAGGAAATGAAGGGATAGAT863
GTCCCCATTCCAAGATTTGCTGTTGGCATTGTAATAGGA902
AGAAATGGAGAGATGATCAAAAAAATACAAAATGATGCT941
GGTGTTCGCATTCAGTTTAAGCCAGATGATGGGACAACA980
CCCGAAAGGATAGCACAAATAACAGGACCTCCAGACCGA1019
TGTCAACATGCTGCAGAAATTATTACAGACCTTCTTCGA1058
AGTGTTCAGGCTGGTAATCCTGGTGGACCTGGACCTGGT1097
GGTCGAGGAAGAGGTAGAGGTCAAGGCAACTGGAACATG1136
GGACCACCTGGTGGATTACAGGAATTTAATTTTATTGTG1175
CCAACTGGGAAAACTGGATTAATAATAGGAAAAGGAGGT1214
GAAACCATAAAAAGCATAAGCCAGCAGTCTGGTGCAAGA1253
ATAGAACTTCAGAGAAATCCTCCACCAAATGCAGATCCT1292
AATATGAAGTTATTTACAATTCGTGGCACTCCACAACAG1331
ATAGACTATGCTCGGCAACTCATAGAAGAAAAGATTGGT1370
GGCCCAGTAAATCCTTTAGGGCCACCTGTACCCCATGGG1409
CCCCATGGTGTCCCAGGCCCCCATGGACCTCCTGGGCCT1448
CCAGGGCCTGGAACTCCAATGGGACCATACAACCCTGCA1487
CCTTATAATCCTGGACCACCAGGCCCGGCTCCTCATGGT1526
CCTCCAGCCCCATATGCTCCCCAGGGATGGGGAAATGCA1565
TATCCACACTGGCAGCAGCAGGCTCCTCCTGATCCAGCT1604
AAGGCAGGAACGGATCCAAATTCAGCAGCTTGGGCTGCT1643
TATTACGCTCACTATTATCAACAGCAAGCACAGCCACCA1682
CCAGCAGCCCCTGCAGGTGCACCAACTACAACTCAAACT1721
AATGGACAAGGAGATCAGCAGAATCCAGCCCCAGCTGGA1760
CAGGTTGATTATACCAAGGCTTGGGAAGAGTACTACAAG1799
AAAATGGGTCAGGCAGTTCCTGCTCCGACTGGGGCTCCT1838
CCAGGTGGTCAGCCAGATTATAGTGCAGCCTGGGCTGAG1877
CATTATAGACAACAAGCAGCCTATTATGCCCAGACAAGT1916
CCCCAGGGAATGCCACAGCATCCTCCAGCACCTCAGGGC1955
CAATAATAAGAAGTGGACAATACAGTATTTGCTTCATTG1994
TGTGGGGGAAAAAAACCTTTGTTAAATATATGGATGCAGA2034
CGACTTGATGAAGATCTTAATTTTGTTTTTGGTTTAAAAT2074
AGTGTTTCCTTTTTTTTTTTTTTTTTTTTGAAAATGTACA2114
AAATATCTATCACTACTGATAGGAGGTTAATATTTCTGTG2154
TAGAAATGAAAATTGGTTTGTTTTTAGTATTTAGTGTAGA2194
TGTACACATTCCAGCAAATGTATTTGCAATTATGTGGTTG2234
ATGCTTTGTGATATAAATGTACTTTTTCAATGTATACTTT2274
CACTTTCCAAATGCCTGTTTTGTGCTTTACAATAAATGAT2314
ATGAAACCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA2354
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA2384
(2) INFORMATION FOR SEQ ID NO: 2:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 644
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: Peptide/Protein
(iii) HYPOTHETICAL: No
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Human
(H) CELL LINE: HL60
(ix) FEATURE:
(D) OTHER INFORMATION:
Amino Acid 149 (Xaa) is Met or Ile
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
MetAlaAspTyrSerThrValProProProSerSerGly
1510
SerAlaGlyGlyGlyGlyGlyGlyGlyGlyGlyGlyGly
152025
ValAsnAspAlaPheLysAspAlaLeuGlnArgAlaArg
3035
GlnIleAlaAlaLysIleGlyGlyAspAlaGlyThrSer
404550
LeuAsnSerAsnAspTyrGlyTyrGlyGlyGlnLysArg
556065
ProLeuGluAspGlyAspGlnProAspAlaLysLysVal
7075
AlaProGlnAsnAspSerPheGlyThrGlnLeuProPro
808590
MetHisGlnGlnGlnSerArgSerValMetThrGluGlu
95100
TyrLysValProAspGlyMetValGlyPheIleIleGly
105110115
ArgGlyGlyGluGlnIleSerArgIleGlnGlnGluSer
120125130
GlyCysLysIleGlnIleAlaProAspSerGlyGlyLeu
135140
ProGluArgSerCysXaaLeuThrGlyThrProGluSer
145150155
ValGlnSerAlaLysArgLeuLeuAspGlnIleValGlu
160165
LysGlyArgProAlaProGlyPheHisHisGlyAspGly
170175180
ProGlyAsnAlaValGlnGluIleMetIleProAlaSer
185190195
LysAlaGlyLeuValIleGlyLysGlyGlyGluThrIle
200205
LysGlnLeuGlnGluArgAlaGlyValLysMetValMet
210215220
IleGlnAspGlyProGlnAsnThrGlyAlaAspLysPro
225230
LeuArgIleThrGlyAspProTyrLysValGlnGlnAla
235240245
LysGluMetValLeuGluLeuIleArgAspGlnGlyGly
250255260
PheArgGluValArgAsnGluTyrGlySerArgIleGly
265270
GlyAsnGluGlyIleAspValProIleProArgPheAla
275280285
ValGlyIleValIleGlyArgAsnGlyGluMetIleLys
290295
LysIleGlnAsnAspAlaGlyValArgIleGlnPheLys
300305310
ProAspAspGlyThrThrProGluArgIleAlaGlnIle
315320325
ThrGlyProProAspArgCysGlnHisAlaAlaGluIle
330335
IleThrAspLeuLeuArgSerValGlnAlaGlyAsnPro
340345350
GlyGlyProGlyProGlyGlyArgGlyArgGlyArgGly
355360
GlnGlyAsnTrpAsnMetGlyProProGlyGlyLeuGln
365370375
GluPheAsnPheIleValProThrGlyLysThrGlyLeu
380385390
IleIleGlyLysGlyGlyGluThrIleLysSerIleSer
395400
GlnGlnSerGlyAlaArgIleGluLeuGlnArgAsnPro
405410415
ProProAsnAlaAspProAsnMetLysLeuPheThrIle
420425
ArgGlyThrProGlnGlnIleAspTyrAlaArgGlnLeu
430435440
IleGluGluLysIleGlyGlyProValAsnProLeuGly
445450455
ProProValProHisGlyProHisGlyValProGlyPro
460465
HisGlyProProGlyProProGlyProGlyThrProMet
470475480
GlyProTyrAsnProAlaProTyrAsnProGlyProPro
485490
GlyProAlaProHisGlyProProAlaProTyrAlaPro
495500505
GlnGlyTrpGlyAsnAlaTyrProHisTrpGlnGlnGln
510515520
AlaProProAspProAlaLysAlaGlyThrAspProAsn
525530
SerAlaAlaTrpAlaAlaTyrTyrAlaHisTyrTyrGln
535540545
GlnGlnAlaGlnProProProAlaAlaProAlaGlyAla
550555
ProThrThrThrGlnThrAsnGlyGlnGlyAspGlnGln
560565570
AsnProAlaProAlaGlyGlnValAspTyrThrLysAla
575580585
TrpGluGluTyrTyrLysLysMetGlyGlnAlaValPro
590595
AlaProThrGlyAlaProProGlyGlyGlnProAspTyr
600605610
SerAlaAlaTrpAlaGluHisTyrArgGlnGlnAlaAla
615620
TyrTyrAlaGlnThrSerProGlnGlyMetProGlnHis
625630635
ProProAlaProGlnGlyGln
640
(2) INFORMATION FOR SEQ ID NO:3:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 27
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: No
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Human
(ix) FEATURE:
(A) NAME/KEY:
(B) LOCATION:
(C) IDENTIFICATION METHOD:
(D) OTHER INFORMATION: N at positions
11, 14, and 23 is inosine (I);
Y is either T or C; R is either
A or G; N at position 25 is either
I or T.
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
CAGAATTCGGNGGNAAYGARGGNANCG27
(2) INFORMATION FOR SEQ ID NO:4:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: No
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Human
(ix) FEATURE:
(D) OTHER INFORMATION: N is inosine;
R is either A or G; Y is
either C or T.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
GAGTCGACRTCRTCRTCNGGYTTRAA26
(2) INFORMATION FOR SEQ ID NO: 5:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1097
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: Double
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: No
(iv) ANTI-SENSE: YES
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Human
(H) CELL LINE: HL60
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
GGAATTCCGGATAGATGTCCCCATTCCAAGATTTGCTGTT40
GGCATTGTAATAGGAAGAAATGGAGAGATGATCAAAAAA79
ATACAAAATGATGCTGGTGTTCGCATTCAGTTTAAGCCA118
GATGATGGGACAACACCCGAAAGGATAGCACAAATAACA157
GGACCTCCAGACCGATGTCAACATGCTGCAGAAATTATT196
ACAGACCTTCTTCGAAGTGTTCAGGCTGGTAATCCTGGT235
GGACCTGGACCTGGTGGTCGAGGAAGAGGTAGAGGTCAA274
GGCAACTGGAACATGGGACCACCTGGTGGATTACAGGAA313
TTTAATTTTATTGTGCCAACTGGGAAAACTGGATTAATA352
ATAGGAAAAGGAGGTGAAACCATAAAAAGCATAAGCCAG391
CAGTCTGGTGCAAGAATAGAACTTCAGAGAAATCCTCCA430
CCAAATGCAGATCCTAATATGAAGTTATTTACAATTCGT469
GGCACTCCACAACAGATAGACTATGCTCGGCAACTCATA508
GAAGAAAAGATTGGTGGCCCAGTAAATCCTTTAGGGCCA547
CCTGTACCCCATGGGCCCCATGGTGTCCCAGGCCCCCAT586
GGACCTCCTGGGCCTCCAGGGCCTGGAACTCCAATGGGA625
CCATACAACCCTGCACCTTATAATCCTGGACCACCAGGC664
CCGGCTCCTCATGGTCCTCCAGCCCCATATGCTCCCCAG703
GGATGGGGAAAGGAAATTGAGCAGAAGGTACAGGAGTAA742
TAGCAATTCCCTGTAGCTCTCAAAGCAAATTTTGAGCTCATTT785
TTCTTTTTCTGCAAGCTCAGCAGCAGAATGCCCAGAGTCT825
TCCCTGGTAGATGCAGGTTCCATAGCGACGTTCTCCTGCA865
ATGCACGCTGGTATTCTGCAATAGCAGGCCATGTTTTCCT905
TGAGCCTGGATGCTTTGGAGCCAAGCTTTCGTCCCATGCA945
AGGGAAACAACCACTTCTGGGATGTCCGCTGCAATCTGCT985
CCGGGGCTGCAGCAACCTCATCAGCTCTCTTGCCTGGAGT1025
GGCTCAGCCTGGCCTGCAGGGCCACCAGGAGAATGGCAGC1065
AAGGATGGCGAGGGTCCTCATGGCTGGAATTC1097
(2) INFORMATION FOR SEQ ID NO: 6:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 243
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: Peptide/Protein
(iii) HYPOTHETICAL: No
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Human
(G) CELL TYPE: HL60
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
IleAspValProIleProArgPheAlaVal
1510
GlyIleValIleGlyArgAsnGlyGluMetIleLysLys
1520
IleGlnAsnAspAlaGlyValArgIleGlnPheLysPro
253035
AspAspGlyThrThrProGluArgIleAlaGlnIleThr
4045
GlyProProAspArgCysGlnHisAlaAlaGluIleIle
505560
ThrAspLeuLeuArgSerValGlnAlaGlyAsnProGly
657075
GlyProGlyProGlyGlyArgGlyArgGlyArgGlyGln
8085
GlyAsnTrpAsnMetGlyProProGlyGlyLeuGlnGlu
9095100
PheAsnPheIleValProThrGlyLysThrGlyLeuIle
105110
IleGlyLysGlyGlyGluThrIleLysSerIleSerGln
115120125
GlnSerGlyAlaArgIleGluLeuGlnArgAsnProPro
130135140
ProAsnAlaAspProAsnMetLysLeuPheThrIleArg
145150
GlyThrProGlnGlnIleAspTyrAlaArgGlnLeuIle
155160165
GluGluLysIleGlyGlyProValAsnProLeuGlyPro
170175
ProValProHisGlyProHisGlyValProGlyProHis
180185190
GlyProProGlyProProGlyProGlyThrProMetGly
195200205
ProTyrAsnProAlaProTyrAsnProGlyProProGly
210215
ProAlaProHisGlyProProAlaProTyrAlaProGln
220225230
GlyTrpGlyLysGluIleGluGlnLysValGlnGlu
235240
(2) INFORMATION FOR SEQ ID NO: 7:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1803
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: Double
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: No
(iv) ANTI-SENSE: YES
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Human
(H) CELL LINE: HL60
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
GAATTCCGGACGACAGCGGCTCTGAGAGCCCGGCAGATTGCA42
GCAAAAATTGGAGGTGATGCAGGGACATCACTGAATTCA81
AATGACTATGGTTATGGGGGACAAAAAAGACCTTTAGAA120
GATGGAGATGGCTCTTGGACAAGTCCGAGCAGTACAACA159
CACTGGGAGGGAATGCCCTCTCCTTTTAAAGATCAACCA198
GATGCTAAGAAAGTTGCTCCTCAAAATGACTCTTTTGGA237
ACACAGTTACCACCGATGCATCAGCAGCAAAGATCTGTA276
ATGACAGAAGAATACAAAGTTCCAGATGGAATGGTTGGA315
TTCATAATTGGCAGAGGAGGTGAACAGATCTCACGCATA354
CAACAGGAATCTGGATGCAAAATACAGATAGCTCCTGAC393
AGTGGTGGCCTTCCAGAAAGGTCCTGTATGTTAACTGGA432
ACACCTGAATCTGTCCAGTCAGCAAAACGGTTACTGGAC471
CAGATTGTTGAAAAAGGAAGACCAGCTCCTGGCTTCCAT510
CATGGCGATGGACCGGGAAATGCAGTTCAAGAAATCATG549
ATTCCAGCTAGCAAGGCAGGATTAGTCATTGGAAAAGGG588
GGAGAAACTATTAAACAGCTTCAGGAACGGGCTGGAGTT627
AAAATGGTTATGATTCAAGACGGGCCGCAGAACACTGGT666
GCTGACAAACCTCTTAGGATTACAGGAGACCCATATAAA705
GTTCAACAAGCCAAGGAAATGGTGTTAGAGTTAATTCGT744
GATCAAGGCGGTTTCAGAGAAGTTCGGAATGAGTATGGG783
TCAAGAATAGGAGGAAATGAAGGGATAGATGTCCCCATT822
CCAAGATTTGCTGTTGGCATTGTAATAGGAAGAAATGGA861
GAGATGATCAAAAAAATACAAAATGATGCTGGTGTTCGC900
ATTCAGTTTAAGCCAGATGATGGGACAACACCCGAAAGG939
ATAGCACAAATAACAGGACCTCCAGACCGATGTCAACAT978
GCTGCAGAAATTATTACAGACCTTCTTCGAAGTGTTCAG1017
GCTGGTAATCCTGGTGGACCTGGACCTGGTGGTCGAGGA1056
AGAGGTAGAGGTCAAGGCAACTGGAACATGGGACCACCT1095
GGTGGATTACAGGAATTTAATTTTATTGTGCCAACTGGG1134
AAAACTGGATTAATAATAGGAAAAGGAGGTGAAACCATA1173
AAAAGCATAAGCCAGCAGTCTGGTGCAAGAATAGAACTT1212
CAGAGAAATCCTCCACCAAATGCAGATCCTAATATGAAG1251
TTATTTACAATTCGTGGCACTCCACAACAGATAGACTAT1290
GCTCGGCAACTCATAGAAGAAAAGATTGGTGGCCCAGTA1329
AATCCTTTAGGGCCACCTGTACCCCATGGGCCCCATGGT1368
GTCCCAGGCCCCCATGGACCTCCTGGGCCTCCAGGGCCT1407
GGAACTCCAATGGGACCATACAACCCTGCACCTTATAAT1446
CCTGGACCACCAGGCCCGGCTCCTCATGGTCCTCCAGCC1485
CCATATGCTCCCCAGGGATGGGGAAATGCATATCCACAC1524
TGGCAGCAGCAGGCTCCTCCTGATCCAGCTAAGGCAGGA1563
ACGGATCCAAATTCAGCAGCTTGGGCTGCTTATTACGCT1602
CACTATTATCAACAGCAAGCACAGCCACCACCAGCAGCC1641
CCTGCAGGTGCACCAACTACAACTCAAACTAATGGACAA1680
GGAGATCAGCAGAATCCAGCCCCAGCTGGACAGGTTGAT1719
TATACCAAGGCTTGGGAAGAGTACTACAAGAAAATGGGG1758
CCAATAATAAGAAGTGGACAATACAGTATTTGCTTC1794
AGGAATTCC1803
(2) INFORMATION FOR SEQ ID NO: 8:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 590
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: Peptide/Protein
(iii) HYPOTHETICAL: No
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Human
(H) CELL LINE: HL60
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
ArgAlaArgGlnIleAla
15
AlaLysIleGlyGlyAspAlaGlyThrSerLeuAsnSer
1015
AsnAspTyrGlyTyrGlyGlyGlnLysArgProLeuGlu
202530
AspGlyAspGlySerTrpThrSerProSerSerThrThr
354045
HisTrpGluGlyMetProSerProPheLysAspGlnPro
5055
AspAlaLysLysValAlaProGlnAsnAspSerPheGly
606570
ThrGlnLeuProProMetHisGlnGlnGlnArgSerVal
7580
MetThrGluGluTyrLysValProAspGlyMetValGly
859095
PheIleIleGlyArgGlyGlyGluGlnIleSerArgIle
100105110
GlnGlnGluSerGlyCysLysIleGlnIleAlaProAsp
115120
SerGlyGlyLeuProGluArgSerCysMetLeuThrGly
125130135
ThrProGluSerValGlnSerAlaLysArgLeuLeuAsp
140145
GlnIleValGluLysGlyArgProAlaProGlyPheHis
150155160
HisGlyAspGlyProGlyAsnAlaValGlnGluIleMet
165170175
IleProAlaSerLysAlaGlyLeuValIleGlyLysGly
180185
GlyGluThrIleLysGlnLeuGlnGluArgAlaGlyVal
190195200
LysMetValMetIleGlnAspGlyProGlnAsnThrGly
205210
AlaAspLysProLeuArgIleThrGlyAspProTyrLys
215220225
ValGlnGlnAlaLysGluMetValLeuGluLeuIleArg
230235240
AspGlnGlyGlyPheArgGluValArgAsnGluTyrGly
245250
SerArgIleGlyGlyAsnGluGlyIleAspValProIle
255260265
ProArgPheAlaValGlyIleValIleGlyArgAsnGly
270275
GluMetIleLysLysIleGlnAsnAspAlaGlyValArg
280285290
IleGlnPheLysProAspAspGlyThrThrProGluArg
295300305
IleAlaGlnIleThrGlyProProAspArgCysGlnHis
310315
AlaAlaGluIleIleThrAspLeuLeuArgSerValGln
320325330
AlaGlyAsnProGlyGlyProGlyProGlyGlyArgGly
335340
ArgGlyArgGlyGlnGlyAsnTrpAsnMetGlyProPro
345350355
GlyGlyLeuGlnGluPheAsnPheIleValProThrGly
360365370
LysThrGlyLeuIleIleGlyLysGlyGlyGluThrIle
375380
LysSerIleSerGlnGlnSerGlyAlaArgIleGluLeu
385390395
GlnArgAsnProProProAsnAlaAspProAsnMetLys
400405
LeuPheThrIleArgGlyThrProGlnGlnIleAspTyr
410415420
AlaArgGlnLeuIleGluGluLysIleGlyGlyProVal
425430435
AsnProLeuGlyProProValProHisGlyProHisGly
440445
ValProGlyProHisGlyProProGlyProProGlyPro
450455460
GlyThrProMetGlyProTyrAsnProAlaProTyrAsn
465470
ProGlyProProGlyProAlaProHisGlyProProAla
475480485
ProTyrAlaProGlnGlyTrpGlyAsnAlaTyrProHis
490495500
TrpGlnGlnGlnAlaProProAspProAlaLysAlaGly
505510
ThrAspProAsnSerAlaAlaTrpAlaAlaTyrTyrAla
515520525
HisTyrTyrGlnGlnGlnAlaGlnProProProAlaAla
530535
ProAlaGlyAlaProThrThrThrGlnThrAsnGlyGln
540545550
GlyAspGlnGlnAsnProAlaProAlaGlyGlnValAsp
555560565
TyrThrLysAlaTrpGluGluTyrTyrLysLysMetGly
570575
ProIleIleArgSerGlyGlnTyrSerIleCysPhe
580585590
(2) INFORMATION FOR SEQ ID NO: 9:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2381
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: Double
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: No
(iv) ANTI-SENSE: YES
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Human
(H) CELL LINE: HL60
(ix) FEATURE:
(D) OTHER INFORMATION: 470 bp variable
region where R is A or G.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
GCGGCAGCGGCTCTTATAGTGCAACCATGGCAGACTATTCAACA44
GTGCCTCCCCCCTCTTCTGGCTCAGCTGGTGGCGGTGGT83
GGCGGCGGTGGTGGTGGAGGAGTTAACGACGCTTTCAAA122
GATGCACTGCAGAGAGCCCGGCAGATTGCAGCAAAAATT161
GGAGGTGATGCAGGGACATCACTGAATTCAAATGACTAT200
GGTTATGGGGGACAAAAAAGACCTTTAGAAGATGGAGAT239
CAACCAGATGCTAAGAAAGTTGCTCCTCAAAATGACTCT278
TTTGGAACACAGTTACCACCGATGCATCAGCAGCAA314
AGATCTGTAATGACAGAAGAATACAAAGTTCCAGATGGA353
ATGGTTGGATTCATAATTGGCAGAGGAGGTGAACAGATC392
TCACGCATACAACAGGAATCTGGATGCAAAATACAGATA431
GCTCCTGACAGTGGTGGCCTTCCAGAAAGGTCCTGTATR470
TTAACTGGAACACCTGAATCTGTCCAGTCAGCAAAACGG509
TTACTGGACCAGATTGTTGAAAAAGGAAGACCAGCTCCT548
GGCTTCCATCATGGCGATGGACCGGGAAATGCAGTTCAA587
GAAATCATGATTCCAGCTAGCAAGGCAGGATTAGTCATT626
GGAAAAGGGGGAGAAACTATTAAACAGCTTCAGGAACGG665
GCTGGAGTTAAAATGGTTATGATTCAAGACGGGCCGCAG704
AACACTGGTGCTGACAAACCTCTTAGGATTACAGGAGAC743
CCATATAAAGTTCAACAAGCCAAGGAAATGGTGTTAGAG782
TTAATTCGTGATCAAGGCGGTTTCAGAGAAGTTCGGAAT821
GAGTATGGGTCAAGAATAGGAGGAAATGAAGGGATAGAT860
GTCCCCATTCCAAGATTTGCTGTTGGCATTGTAATAGGA899
AGAAATGGAGAGATGATCAAAAAAATACAAAATGATGCT938
GGTGTTCGCATTCAGTTTAAGCCAGATGATGGGACAACA977
CCCGAAAGGATAGCACAAATAACAGGACCTCCAGACCGA1016
TGTCAACATGCTGCAGAAATTATTACAGACCTTCTTCGA1055
AGTGTTCAGGCTGGTAATCCTGGTGGACCTGGACCTGGT1094
GGTCGAGGAAGAGGTAGAGGTCAAGGCAACTGGAACATG1133
GGACCACCTGGTGGATTACAGGAATTTAATTTTATTGTG1172
CCAACTGGGAAAACTGGATTAATAATAGGAAAAGGAGGT1211
GAAACCATAAAAAGCATAAGCCAGCAGTCTGGTGCAAGA1250
ATAGAACTTCAGAGAAATCCTCCACCAAATGCAGATCCT1289
AATATGAAGTTATTTACAATTCGTGGCACTCCACAACAG1328
ATAGACTATGCTCGGCAACTCATAGAAGAAAAGATTGGT1367
GGCCCAGTAAATCCTTTAGGGCCACCTGTACCCCATGGG1406
CCCCATGGTGTCCCAGGCCCCCATGGACCTCCTGGGCCT1445
CCAGGGCCTGGAACTCCAATGGGACCATACAACCCTGCA1484
CCTTATAATCCTGGACCACCAGGCCCGGCTCCTCATGGT1523
CCTCCAGCCCCATATGCTCCCCAGGGATGGGGAAATGCA1562
TATCCACACTGGCAGCAGCAGGCTCCTCCTGATCCAGCT1601
AAGGCAGGAACGGATCCAAATTCAGCAGCTTGGGCTGCT1640
TATTACGCTCACTATTATCAACAGCAAGCACAGCCACCA1679
CCAGCAGCCCCTGCAGGTGCACCAACTACAACTCAAACT1718
AATGGACAAGGAGATCAGCAGAATCCAGCCCCAGCTGGA1757
CAGGTTGATTATACCAAGGCTTGGGAAGAGTACTACAAG1796
AAAATGGGTCAGGCAGTTCCTGCTCCGACTGGGGCTCCT1835
CCAGGTGGTCAGCCAGATTATAGTGCAGCCTGGGCTGAG1874
CATTATAGACAACAAGCAGCCTATTATGCCCAGACAAGT1913
CCCCAGGGAATGCCACAGCATCCTCCAGCACCTCAGGGC1952
CAATAATAAGAAGTGGACAATACAGTATTTGCTTCATTG1991
TGTGGGGGAAAAAAACCTTTGTTAAATATATGGATGCAGA2031
CGACTTGATGAAGATCTTAATTTTGTTTTTGGTTTAAAAT2071
AGTGTTTCCTTTTTTTTTTTTTTTTTTTTGAAAATGTACA2111
AAATATCTATCACTACTGATAGGAGGTTAATATTTCTGTG2151
TAGAAATGAAAATTGGTTTGTTTTTAGTATTTAGTGTAGA2191
TGTACACATTCCAGCAAATGTATTTGCAATTATGTGGTTG2231
ATGCTTTGTGATATAAATGTACTTTTTCAATGTATACTTT2271
CACTTTCCAAATGCCTGTTTTGTGCTTTACAATAAATGAT2311
ATGAAACCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA2351
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA2381
(2) INFORMATION FOR SEQ ID NO: 10:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 643
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: Peptide/Protein
(iii) HYPOTHETICAL: No
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Human
(H) CELL LINE: HL60
(ix) FEATURE:
(D) OTHER INFORMATION:
Amino Acid 148 (Xaa) is Met or Ile
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
MetAlaAspTyrSerThrValProProProSerSerGly
1510
SerAlaGlyGlyGlyGlyGlyGlyGlyGlyGlyGlyGly
152025
ValAsnAspAlaPheLysAspAlaLeuGlnArgAlaArg
3035
GlnIleAlaAlaLysIleGlyGlyAspAlaGlyThrSer
404550
LeuAsnSerAsnAspTyrGlyTyrGlyGlyGlnLysArg
556065
ProLeuGluAspGlyAspGlnProAspAlaLysLysVal
7075
AlaProGlnAsnAspSerPheGlyThrGlnLeuProPro
808590
MetHisGlnGlnGlnArgSerValMetThrGluGlu
95100
TyrLysValProAspGlyMetValGlyPheIleIleGly
105110115
ArgGlyGlyGluGlnIleSerArgIleGlnGlnGluSer
120125
GlyCysLysIleGlnIleAlaProAspSerGlyGlyLeu
130135140
ProGluArgSerCysXaaLeuThrGlyThrProGluSer
145150155
ValGlnSerAlaLysArgLeuLeuAspGlnIleValGlu
160165
LysGlyArgProAlaProGlyPheHisHisGlyAspGly
170175180
ProGlyAsnAlaValGlnGluIleMetIleProAlaSer
185190
LysAlaGlyLeuValIleGlyLysGlyGlyGluThrIle
195200205
LysGlnLeuGlnGluArgAlaGlyValLysMetValMet
210215220
IleGlnAspGlyProGlnAsnThrGlyAlaAspLysPro
225230
LeuArgIleThrGlyAspProTyrLysValGlnGlnAla
235240245
LysGluMetValLeuGluLeuIleArgAspGlnGlyGly
250255
PheArgGluValArgAsnGluTyrGlySerArgIleGly
260265270
GlyAsnGluGlyIleAspValProIleProArgPheAla
275280285
ValGlyIleValIleGlyArgAsnGlyGluMetIleLys
290295
LysIleGlnAsnAspAlaGlyValArgIleGlnPheLys
300305310
ProAspAspGlyThrThrProGluArgIleAlaGlnIle
315320
ThrGlyProProAspArgCysGlnHisAlaAlaGluIle
325330335
IleThrAspLeuLeuArgSerValGlnAlaGlyAsnPro
340345350
GlyGlyProGlyProGlyGlyArgGlyArgGlyArgGly
355360
GlnGlyAsnTrpAsnMetGlyProProGlyGlyLeuGln
365370375
GluPheAsnPheIleValProThrGlyLysThrGlyLeu
380385
IleIleGlyLysGlyGlyGluThrIleLysSerIleSer
390395400
GlnGlnSerGlyAlaArgIleGluLeuGlnArgAsnPro
405410415
ProProAsnAlaAspProAsnMetLysLeuPheThrIle
420425
ArgGlyThrProGlnGlnIleAspTyrAlaArgGlnLeu
430435440
IleGluGluLysIleGlyGlyProValAsnProLeuGly
445450
ProProValProHisGlyProHisGlyValProGlyPro
455460465
HisGlyProProGlyProProGlyProGlyThrProMet
470475480
GlyProTyrAsnProAlaProTyrAsnProGlyProPro
485490
GlyProAlaProHisGlyProProAlaProTyrAlaPro
495500505
GlnGlyTrpGlyAsnAlaTyrProHisTrpGlnGlnGln
510515
AlaProProAspProAlaLysAlaGlyThrAspProAsn
520525530
SerAlaAlaTrpAlaAlaTyrTyrAlaHisTyrTyrGln
535540545
GlnGlnAlaGlnProProProAlaAlaProAlaGlyAla
550555
ProThrThrThrGlnThrAsnGlyGlnGlyAspGlnGln
560565570
AsnProAlaProAlaGlyGlnValAspTyrThrLysAla
575580
TrpGluGluTyrTyrLysLysMetGlyGlnAlaValPro
585590595
AlaProThrGlyAlaProProGlyGlyGlnProAspTyr
600605610
SerAlaAlaTrpAlaGluHisTyrArgGlnGlnAlaAla
615620
TyrTyrAlaGlnThrSerProGlnGlyMetProGlnHis
625630635
ProProAlaProGlnGlyGln
640
(2) INFORMATION FOR SEQ ID NO: 11:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: Double
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: DNA
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Human
(ix) FEATURE:
(D) OTHER INFORMATION: FUSE
oligonucleotide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
GATCACAAAATAAAAAATCCCGAGGGAATATAG33
(2) INFORMATION FOR SEQ ID NO: 12:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: Double
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: DNA
(iii) HYPOTHETICAL: YES
(ix) FEATURE:
(D) OTHER INFORMATION: Mut A
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
GATCACAACTACGTGCTAGGACGCCGAATATAG33
(2) INFORMATION FOR SEQ ID NO: 13:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 39
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: Double
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: DNA
(iii) HYPOTHETICAL: No
(ix) FEATURE:
(D) OTHER INFORMATION: cAMP response
element
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
GATCTGACGTCATGACTGACGTCATGACTGACGTCATCA39
(2) INFORMATION FOR SEQ ID NO: 14:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 32
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: Double
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: DNA
(iii) HYPOTHETICAL: NO
(ix) FEATURE:
(D) OTHER INFORMATION: CT element in c-myc
5'- flanking region ("CTE")
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
AATTCTCCTCCCCACCTTCCCCACCCTCCCCA32
(2) INFORMATION FOR SEQ ID NO: 15:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: Double
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: DNA
(iii) HYPOTHETICAL: YES
(ix) FEATURE:
(D) OTHER INFORMATION: Mut B
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
GATCACAAAATAAAAAATGGACGCCGAATATAG33
(2) INFORMATION FOR SEQ ID NO: 16:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: Double
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: DNA
(iii) HYPOTHETICAL: YES
(ix) FEATURE:
(D) OTHER INFORMATION: Mut C
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
GATCACAACTACGTGCTAGGCGAGGGAATATAG33
(2) INFORMATION FOR SEQ ID NO: 17:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 9
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: Peptide/Protein
(iii) HYPOTHETICAL: No
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
LysGluIleGluGlnLysValGlnGlu
15
(2) INFORMATION FOR SEQ ID NO: 18:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 12
(B) TYPE: Nucleic acid
(C) STRANDEDNESS: Double
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
TTAGTTAACTAA12
(2) INFORMATION FOR SEQ ID NO: 19:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: Peptide/Protein
(ix) FEATURE:
(D) OTHER INFORMATION: Mutant 2
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
IleGlySerArgIleArg
15
(2) INFORMATION FOR SEQ ID NO: 20:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: Peptide/Protein
(ix) FEATURE:
(D) OTHER INFORMATION: Mutant 3
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
IleArgIleArg
15
(2) INFORMATION FOR SEQ ID NO: 21:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 5
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: Peptide/Protein
(ix) FEATURE:
(D) OTHER INFORMATION: Mutant 4
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
ProArgIleArgGlu
15
(2) INFORMATION FOR SEQ ID NO: 22:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: Peptide/Protein
(ix) FEATURE:
(D) OTHER INFORMATION: Mutant 6
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:
GlyIleProArg
15
(2) INFORMATION FOR SEQ ID NO: 23:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: Peptide/Protein
(ix) FEATURE:
(D) OTHER INFORMATION: Mutant 8
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:
ArgAspProAla
15
(2) INFORMATION FOR SEQ ID NO: 24:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4
(B) TYPE: Amino Acid
(C) STRANDEDNESS: Single
(D) TOPOLOGY: Unknown
(ii) MOLECULE TYPE: Peptide/Protein
(ix) FEATURE:
(D) OTHER INFORMATION: Mutant 9
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:
ArgGlySerGly
15
__________________________________________________________________________
Claims (30)
1. A protein which binds specifically to an activator cis-element approximately 1500 basepairs 5' of the human c-myc gene promoter P1, said protein comprised of the amino acid sequence (SEQ ID NO: 2):
______________________________________
Met Ala Asp Tyr Ser Thr Val Pro Pro Pro
1 5 10
Ser Ser Gly Ser Ala Gly Gly Gly Gly Gly
15 20
Gly Gly Gly Gly Gly Gly Val Asn Asp Ala
25 30
Phe Lys Asp Ala Leu Gln Arg Ala Arg Gln
35 40
Ile Ala Ala Lys Ile Gly Gly Asp Ala Gly
45 50
Thr Ser Leu Asn Ser Asn Asp Tyr Gly Tyr
55 60
Gly Gly Gln Lys Arg Pro Leu Glu Asp Gly
65 70
Asp Gln Pro Asp Ala Lys Lys Val Ala Pro
75 80
Gln Asn Asp Ser Phe Gly Thr Gln Leu Pro
85 90
Pro Met His Gln Gln Gln Ser Arg Ser Val
95 100
Met Thr Glu Glu Tyr Lys Val Pro Asp Gly
105 110
Met Val Gly Phe Ile Ile Gly Arg Gly Gly
115 120
Glu Gln Ile Ser Arg Ile Gln Gln Glu Ser
125 130
Gly Cys Lys Ile Gln Ile Ala Pro Asp Ser
135 140
Gly Gly Leu Pro Glu Arg Ser Cys Xaa Leu
145 150
Thr Gly Thr Pro Glu Ser Val Gln Ser Ala
155 160
Lys Arg Leu Leu Asp Gln Ile Val Glu Lys
165 170
Gly Arg Pro Ala Pro Gly Phe His His Gly
175 180
Asp Gly Pro Gly Asn Ala Val Gln Glu Ile
185 190
Met Ile Pro Ala Ser Lys Ala Gly Leu Val
195 200
Ile Gly Lys Gly Gly Glu Thr Ile Lys Gln
205 210
Leu Gln Glu Arg Ala Gly Val Lys Met Val
215 220
Met Ile Gln Asp Gly Pro Gln Asn Thr Gly
225 230
Ala Asp Lys Pro Leu Arg Ile Thr Gly Asp
235 240
Pro Tyr Lys Val Gln Gln Ala Lys Glu Met
245 250
Val Leu Glu Leu Ile Arq Asp Gln Gly Gly
255 260
Phe Arg Glu Val Arg Asn Glu Tyr Gly Ser
265 270
Arg Ile Gly Gly Asn Glu Gly Ile Asp Val
275 280
Pro Ile Pro Arg Phe Ala Val Gly Ile Val
285 290
Ile Gly Arg Asn Gly Glu Met Ile Lys Lys
295 300
Ile Gln Asn Asp Ala Gly Val Arg Ile Gln
305 310
Phe Lys Pro Asp Asp Gly Thr Thr Pro Glu
315 320
Arg Ile Ala Gln Ile Thr Gly Pro Pro Asp
325 330
Arg Cys Gln His Ala Ala Glu Ile Ile Thr
335 340
Asp Leu Leu Arg Ser Val Gln Ala Gly Asn
345 350
Pro Gly Gly Pro Gly Pro Gly Gly Arg Gly
355 360
Arg Gly Arg Gly Gln Gly Asn Trp Asn Met
365 370
Gly Pro Pro Gly Gly Leu Gln Glu Phe Asn
375 380
Phe Ile Val Pro Thr Gly Lys Thr Gly Leu
385 390
Ile Ile Gly Lys Gly Gly Glu Thr Ile Lys
395 400
Ser Ile Ser Gln Gln Ser Gly Ala Arg Ile
405 410
Glu Leu Gln Arg Asn Pro Pro Pro Asn Ala
415 420
Asp Pro Asn Met Lys Leu Phe Thr Ile Arg
425 430
Gly Thr Pro Gln Gln Ile Asp Tyr Ala Arg
435 440
Gln Leu Ile Glu Glu Lys Ile Gly Gly Pro
445 450
Val Asn Pro Leu Gly Pro Pro Val Pro His
455 460
Gly Pro His Gly Val Pro Gly Pro His Gly
465 470
Pro Pro Gly Pro Pro Gly Pro Gly Thr Pro
475 480
Met Gly Pro Tyr Asn Pro Ala Pro Tyr Asn
485 490
Pro Gly Pro Pro Gly Pro Ala Pro His Gly
495 500
Pro Pro Ala Pro Tyr Ala Pro Gln Gly Trp
505 510
Gly Asn Ala Tyr Pro His Trp Gln Gln Gln
515 520
Ala Pro Pro Asp Pro Ala Lys Ala Gly Thr
525 530
Asp Pro Asn Ser Ala Ala Trp Ala Ala Tyr
535 540
Tyr Ala His Tyr Tyr Gln Gln Gln Ala Gln
545 550
Pro Pro Pro Ala Ala Pro Ala Gly Ala Pro
555 560
Thr Thr Thr Gln Thr Asn Gly Gln Gly Asp
565 570
Gln Gln Asn Pro Ala Pro Ala Gly Gln Val
575 580
Asp Tyr Thr Lys Ala Trp Glu Glu Tyr Tyr
585 590
Lys Lys Met Gly Gln Ala Val Pro Ala Pro
595 600
Thr Gly Ala Pro Pro Gly Gly Gln Pro Asp
605 610
Tyr Ser Ala Ala Trp Ala Glu His Tyr Arg
615 620
Gln Gln Ala Ala Tyr Tyr Ala Gln Thr Ser
625 630
Pro Gln Gly Met Pro Gln His Pro Pro Ala
635 640
Pro Gln Gly Gln
______________________________________
wherein Xaa is Ile or Met.
2. The protein of claim 1, wherein Xaa is Met.
3. The protein of claim 1, wherein Xaa is Ile.
4. A protein which binds specifically to an activator cis-element approximately 1500 basepairs 5' of the human c-myc gene promoter P1, said protein comprised of the amino acid sequence (SEQ ID NO: 10):
______________________________________
Met Ala Asp Tyr Ser Thr Val Pro Pro Pro
1 5 10
Ser Ser Gly Ser Ala Gly Gly Gly Gly Gly
15 20
Gly Gly Gly Gly Gly Gly Val Asn Asp Ala
25 30
Phe Lys Asp Ala Leu Gln Arg Ala Arg Gln
35 40
Ile Ala Ala Lys Ile Gly Gly Asp Ala Gly
45 50
Thr Ser Leu Asn Ser Asn Asp Tyr Gly Tyr
55 60
Gly Gly Gln Lys Arg Pro Leu Glu Asp Gly
65 70
Asp Gln Pro Asp Ala Lys Lys Val Ala Pro
75 80
Gln Asn Asp Ser Phe Gly Thr Gln Leu Pro
85 90
Pro Met His Gln Gln Gln Arg Ser Val Met
95 100
Thr Glu Glu Tyr Lys Val Pro Asp Gly Met
105 110
Val Gly Phe Ile Ile Gly Arg Gly Gly Glu
115 120
Gln Ile Ser Arg Ile Gln Gln Glu Ser Gly
125 130
Cys Lys Ile Gln Ile Ala Pro Asp Ser Gly
135 140
Gly Leu Pro Glu Arg Ser Cys Xaa Leu Thr
145 150
Gly Thr Pro Glu Ser Val Gln Ser Ala Lys
155 160
Arg Leu Leu Asp Gln Ile Val Glu Lys Gly
165 170
Arg Pro Ala Pro Gly Phe His His Gly Asp
175 180
Gly Pro Gly Asn Ala Val Gln Glu Ile Met
185 190
Ile Pro Ala Ser Lys Ala Gly Leu Val Ile
195 200
Gly Lys Gly Gly Glu Thr Ile Lys Gln Leu
205 210
Gln Glu Arg Ala Gly Val Lys Met Val Met
215 220
Ile Gln Asp Gly Pro Gln Asn Thr Gly Ala
225 230
Asp Lys Pro Leu Arg Ile Thr Gly Asp Pro
235 240
Tyr Lys Val Gln Gln Ala Lys Glu Met Val
245 250
Leu Glu Leu Ile Arg Asp Gln Gly Gly Phe
255 260
Arg Glu Val Arg Asn Glu Tyr Gly Ser Arg
265 270
Ile Gly Gly Asn Glu Gly Ile Asp Val Pro
275 280
Ile Pro Arg Phe Ala Val Gly Ile Val Ile
285 290
Gly Arg Asn Gly Glu Met Ile Lys Lys Ile
295 300
Gln Asn Asp Ala Gly Val Arg Ile Gln Phe
305 310
Lys Pro Asp Asp Gly Thr Thr Pro Glu Arg
315 320
Ile Ala Gln Ile Thr Gly Pro Pro Asp Arg
325 330
Cys Gln His Ala Ala Glu Ile Ile Thr Asp
335 340
Leu Leu Arg Ser Val Gln Ala Gly Asn Pro
345 350
Gly Gly Pro Gly Pro Gly Gly Arg Gly Arg
355 360
Gly Arg Gly Gln Gly Asn Trp Asn Met Gly
365 370
Pro Pro Gly Gly Leu Gln Glu Phe Asn Phe
375 380
Ile Val Pro Thr Gly Lys Thr Gly Leu Ile
385 390
Ile Gly Lys Gly Gly Glu Thr Ile Lys Ser
395 400
Ile Ser Gln Gln Ser Gly Ala Arg Ile Glu
405 410
Leu Gln Arg Asn Pro Pro Pro Asn Ala Asp
415 420
Pro Asn Met Lys Leu Phe Thr Ile Arg Gly
425 430
Thr Pro Gln Gln Ile Asp Tyr Ala Arg Gln
435 440
Leu Ile Glu Glu Lys Ile Gly Gly Pro Val
445 450
Asn Pro Leu Gly Pro Pro Val Pro His Gly
455 460
Pro His Gly Val Pro Gly Pro His Gly Pro
465 470
Pro Gly Pro Pro Gly Pro Gly Thr Pro Met
475 480
Gly Pro Tyr Asn Pro Ala Pro Tyr Asn Pro
485 490
Gly Pro Pro Gly Pro Ala Pro His Gly Pro
495 500
Pro Ala Pro Tyr Ala Pro Gln Gly Trp Gly
505 510
Asn Ala Tyr Pro His Trp Gln Gln Gln Ala
515 520
Pro Pro Asp Pro Ala Lys Ala Gly Thr Asp
525 530
Pro Asn Ser Ala Ala Trp Ala Ala Tyr Tyr
535 540
Ala His Tyr Tyr Gln Gln Gln Ala Gln Pro
545 550
Pro Pro Ala Ala Pro Ala Gly Ala Pro Thr
555 560
Thr Thr Gln Thr Asn Gly Gln Gly Asp Gln
565 570
Gln Asn Pro Ala Pro Ala Gly Gln Val Asp
575 580
Tyr Thr Lys Ala Trp Glu Glu Tyr Tyr Lys
585 590
Lys Met Gly Gln Ala Val Pro Ala Pro Thr
595 600
Gly Ala Pro Pro Gly Gly Gln Pro Asp Tyr
605 610
Ser Ala Ala Trp Ala Glu His Tyr Arg Gln
615 620
Gln Ala Ala Tyr Tyr Ala Gln Thr Ser Pro
625 630
Gln Gly Met Pro Gln His Pro Pro Ala Pro
635 640
Gln Gly Gln
______________________________________
wherein Xaa is Ile or Met.
5. The protein of claim 4, wherein Xaa is Met.
6. The protein of claim 4, wherein Xaa is Ile.
7. A protein which binds specifically to an activator cis-element approximately 1500 basepairs 5' of the human c-myc gene promoter P1 comprised of the amino acid sequence (SEQ NO ID: 6):
__________________________________________________________________________
Ile
Asp
Val
Pro
Ile
Pro
Arg
Phe
Ala
Val
1 5 10
Gly
Ile
Val
Ile
Gly
Arg
Asn
Gly
Glu
Met
Ile
Lys
Lys
15 20
Ile
Gln
Asn
Asp
Ala
Gly
Val
Arg
Ile
Gln
Phe
Lys
Pro
25 30 35
Asp
Asp
Gly
Thr
Thr
Pro
Glu
Arg
Ile
Ala
Gln
Ile
Thr
40 45
Gly
Pro
Pro
Asp
Arg
Cys
Gln
His
Ala
Ala
Glu
Ile
Ile
50 55 60
Thr
Asp
Leu
Leu
Arg
Ser
Val
Gln
Ala
Gly
Asn
Pro
Gly
65 70 75
Gly
Pro
Gly
Pro
Gly
Gly
Arg
Gly
Arg
Gly
Arg
Gly
Gln
80 85
Gly
Asn
Trp
Asn
Met
Gly
Pro
Pro
Gly
Gly
Leu
Gln
Glu
90 95 100
Phe
Asn
Phe
Ile
Val
Pro
Thr
Gly
Lys
Thr
Gly
Leu
Ile
105 110
Ile
Gly
Lys
Gly
Gly
Glu
Thr
Ile
Lys
Ser
Ile
Ser
Gln
115 120 125
Gln
Ser
Gly
Ala
Arg
Ile
Glu
Leu
Gln
Arg
Asn
Pro
Pro
130 135 140
Pro
Asn
Ala
Asp
Pro
Asn
Met
Lys
Leu
Phe
Thr
Ile
Arg
145 150
Gly
Thr
Pro
Gln
Gln
Ile
Asp
Tyr
Ala
Arg
Gln
Leu
Ile
155 160 165
Glu
Glu
Lys
Ile
Gly
Gly
Pro
Val
Asn
Pro
Leu
Gly
Pro
170 175
Pro
Val
Pro
His
Gly
Pro
His
Gly
Val
Pro
Gly
Pro
His
180 185 190
Gly
Pro
Pro
Gly
Pro
Pro
Gly
Pro
Gly
Thr
Pro
Met
Gly
195 200 205
Pro
Tyr
Asn
Pro
Ala
Pro
Tyr
Asn
Pro
Gly
Pro
Pro
Gly
210 215
Pro
Ala
Pro
His
Gly
Pro
Pro
Ala
Pro
Tyr
Ala
Pro
Gln
220 225 230
Gly
Trp
Gly
Lys
Glu
Ile
Glu
Gln
Lys
Val
Gln
Glu.
235 240
__________________________________________________________________________
8. A protein which binds specifically to an activator cis-element approximately 1500 basepairs 5' of the human c-myc promoter P1 comprised of the amino acid sequence (SEQ ID NO: 8):
__________________________________________________________________________
Arg
Ala
Arg
Gln
Ile
Ala
1 5
Ala
Lys
Ile
Gly
Gly
Asp
Ala
Gly
Thr
Ser
Leu
Asn
Ser
10 15
Asn
Asp
Tyr
Gly
Tyr
Gly
Gly
Gln
Lys
Arg
Pro
Leu
Glu
20 25 30
Asp
Gly
Asp
Gly
Ser
Trp
Thr
Ser
Pro
Ser
Ser
Thr
Thr
35 40 45
His
Trp
Glu
Gly
Met
Pro
Ser
Pro
Phe
Lys
Asp
Gln
Pro
50 55
Asp
Ala
Lys
Lys
Val
Ala
Pro
Gln
Asn
Asp
Ser
Phe
Gly
60 65 70
Thr
Gln
Leu
Pro
Pro
Met
His
Gln
Gln
Gln
Arg
Ser
Val
75 80
Met
Thr
Glu
Glu
Tyr
Lys
Val
Pro
Asp
Gly
Met
Val
Gly
85 90 95
Phe
Ile
Ile
Gly
Arg
Gly
Gly
Glu
Gln
Ile
Ser
Arg
Ile
100 105 110
Gln
Gln
Glu
Ser
Gly
Cys
Lys
Ile
Gln
Ile
Ala
Pro
Asp
115 120
Ser
Gly
Gly
Leu
Pro
Glu
Arg
Ser
Cys
Met
Leu
Thr
Gly
125 130 135
Thr
Pro
Glu
Ser
Val
Gln
Ser
Ala
Lys
Arg
Leu
Leu
Asp
140 145
Gln
Ile
Val
Glu
Lys
Gly
Arg
Pro
Ala
Pro
Gly
Phe
His
150 155 160
His
Gly
Asp
Gly
Pro
Gly
Asn
Ala
Val
Gln
Glu
Ile
Met
165 170 175
Ile
Pro
Ala
Ser
Lys
Ala
Gly
Leu
Val
Ile
Gly
Lys
Gly
180 185
Gly
Glu
Thr
Ile
Lys
Gln
Leu
Gln
Glu
Arg
Ala
Gly
Val
190 195 200
Lys
Met
Val
Met
Ile
Gln
Asp
Gly
Pro
Gln
Asn
Thr
Gly
205 210
Ala
Asp
Lys
Pro
Leu
Arg
Ile
Thr
Gly
Asp
Pro
Tyr
Lys
215 220 225
Val
Gln
Gln
Ala
Lys
Glu
Met
Val
Leu
Glu
Leu
Ile
Arg
230 235 240
Asp
Gln
Gly
Gly
Phe
Arg
Glu
Val
Arg
Asn
Glu
Tyr
Gly
245 250
Ser
Arg
Ile
Gly
Gly
Asn
Glu
Gly
Ile
Asp
Val
Pro
Ile
255 260 265
Pro
Arg
Phe
Ala
Val
Gly
Ile
Val
Ile
Gly
Arg
Asn
Gly
270 275
Glu
Met
Ile
Lys
Lys
Ile
Gln
Asn
Asp
Ala
Gly
Val
Arg
280 285 290
Ile
Gln
Phe
Lys
Pro
Asp
Asp
Gly
Thr
Thr
Pro
Glu
Arg
295 300 305
Ile
Ala
Gln
Ile
Thr
Gly
Pro
Pro
Asp
Arg
Cys
Gln
His
310 315
Ala
Ala
Glu
Ile
Ile
Thr
Asp
Leu
Leu
Arg
Ser
Val
Gln
320 325 330
Ala
Gly
Asn
Pro
Gly
Gly
Pro
Gly
Pro
Gly
Gly
Arg
Gly
335 340
Arg
Gly
Arg
Gly
Gln
Gly
Asn
Trp
Asn
Met
Gly
Pro
Pro
345 350 355
Gly
Gly
Leu
Gln
Glu
Phe
Asn
Phe
Ile
Val
Pro
Thr
Gly
360 365 370
Lys
Thr
Gly
Leu
Ile
Ile
Gly
Lys
Gly
Gly
Glu
Thr
Ile
375 380
Lys
Ser
Ile
Ser
Gln
Gln
Ser
Gly
Ala
Arg
Ile
Glu
Leu
385 390 395
Gln
Arg
Asn
Pro
Pro
Pro
Asn
Ala
Asp
Pro
Asn
Met
Lys
400 405
Leu
Phe
Thr
Ile
Arg
Gly
Thr
Pro
Gln
Gln
Ile
Asp
Tyr
410 415 420
Ala
Arg
Gln
Leu
Ile
Glu
Glu
Lys
Ile
Gly
Gly
Pro
Val
425 430 435
Asn
Pro
Leu
Gly
Pro
Pro
Val
Pro
His
Gly
Pro
His
Gly
440 445
Val
Pro
Gly
Pro
His
Gly
Pro
Pro
Gly
Pro
Pro
Gly
Pro
450 455 460
Gly
Thr
Pro
Met
Gly
Pro
Tyr
Asn
Pro
Ala
Pro
Tyr
Asn
465 470
Pro
Gly
Pro
Pro
Gly
Pro
Ala
Pro
His
Gly
Pro
Pro
Ala
475 480 485
Pro
Tyr
Ala
Pro
Gln
Gly
Trp
Gly
Asn
Ala
Tyr
Pro
His
490 495 500
Trp
Gln
Gln
Gln
Ala
Pro
Pro
Asp
Pro
Ala
Lys
Ala
Gly
505 510
Thr
Asp
Pro
Asn
Ser
Ala
Ala
Trp
Ala
Ala
Tyr
Tyr
Ala
515 520 525
His
Tyr
Tyr
Gln
Gln
Gln
Ala
Gln
Pro
Pro
Pro
Ala
Ala
530 535
Pro
Ala
Gly
Ala
Pro
Thr
Thr
Thr
Gln
Thr
Asn
Gly
Gln
540 545 550
Gly
Asp
Gln
Gln
Asn
Pro
Ala
Pro
Ala
Gly
Gln
Val
Asp
555 560 565
Tyr
Thr
Lys
Ala
Trp
Glu
Glu
Tyr
Tyr
Lys
Lys
Met
Gly
570 575
Pro
Ile
Ile
Arg
Ser
Gly
Gln
Tyr
Ser
Ile
Cys
Phe
580 585 590.
__________________________________________________________________________
oe a fragment thereof.
9. A protein exhibiting a DNA binding motif, containing at least two repeating helix regions, each of said regions comprised of the following:
(a) a region comprised of about 20% glycine, followed by
(b) a short helix comprised of about 10-12 amino acids, followed by
(c) a long amphipathic helix comprised of about 16-18 amino acids.
10. A protein of claim 9 which binds specifically to an activator cis-element approximately 1500 basepairs 5' of the human c-myc gene promoter P1.
11. A protein of claim 10 selected from the group consisting of SEQ ID NO: 2, SEQ ID NO: 6, SEQ ID NO: 8 and SEQ ID NO: 10.
12. A diagnostic kit for FUSE binding protein including at least one of the proteins shown in SEQ ID NO: 2, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10.
13. A fragment of the protein of claim 1 comprising at least one intact fuse binding protein repeat unit.
14. A fragment of the protein of claim 4 comprising at least one intact fuse binding protein repeat unit.
15. A fragment of the protein of claim 7 comprising at least one intact fuse binding protein repeat unit.
16. A fragment of the protein of claim 8 comprising at least one intact fuse binding protein repeat unit.
17. A fragment of the protein of claim 9 comprising at least one intact fuse binding protein repeat unit.
18. A diagnostic kit for FUSE binding protein including a fragment of at least one of the proteins shown in SEQ ID NO: 2, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10, which fragment comprises at least one intact fuse binding protein repeat unit.
19. A fragment of the protein of claim 1 comprising at least 30 amino acids.
20. A fragment of the protein of claim 4 comprising at least 30 amino acids.
21. A fragment of the protein of claim 7 comprising at least 30 amino acids.
22. A fragment of the protein of claim 8 comprising at least 30 amino acids.
23. A fragment of the protein of claim 9 comprising at least 30 amino acids.
24. A diagnostic kit for FUSE binding protein including a fragment of at least one of the proteins shown in SEQ ID NO: 2, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10, which fragment comprises at least 30 amino acids.
25. A fragment of the protein of claim 1 comprising at least 16 amino acids.
26. A fragment of the protein of claim 4 comprising at least 16 amino acids.
27. A fragment of the protein of claim 7 comprising at least 16 amino acids.
28. A fragment of the protein of claim 8 comprising at least 16 amino acids.
29. A fragment of the protein of claim 9 comprising at least 16 amino acids.
30. A diagnostic kit for FUSE binding protein including a fragment of at least one of the proteins shown in SEQ ID NO: 2, SEQ ID NO: 6, SEQ ID NO: 8, or SEQ ID NO: 10, which fragment comprises at least 16 amino acids.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US08/726,160 US5734016A (en) | 1993-02-22 | 1996-10-04 | Fuse binding protein and method for inhibiting expression of fuse binding protein |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US08/021,608 US5580760A (en) | 1993-02-22 | 1993-02-22 | FUSE binding protein and cDNA therefor |
| US08/726,160 US5734016A (en) | 1993-02-22 | 1996-10-04 | Fuse binding protein and method for inhibiting expression of fuse binding protein |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US08/021,608 Continuation US5580760A (en) | 1993-02-22 | 1993-02-22 | FUSE binding protein and cDNA therefor |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| US5734016A true US5734016A (en) | 1998-03-31 |
Family
ID=21805160
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US08/021,608 Expired - Lifetime US5580760A (en) | 1993-02-22 | 1993-02-22 | FUSE binding protein and cDNA therefor |
| US08/726,160 Expired - Lifetime US5734016A (en) | 1993-02-22 | 1996-10-04 | Fuse binding protein and method for inhibiting expression of fuse binding protein |
Family Applications Before (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US08/021,608 Expired - Lifetime US5580760A (en) | 1993-02-22 | 1993-02-22 | FUSE binding protein and cDNA therefor |
Country Status (3)
| Country | Link |
|---|---|
| US (2) | US5580760A (en) |
| AU (1) | AU6244694A (en) |
| WO (1) | WO1994019465A2 (en) |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20030220487A1 (en) * | 1999-04-09 | 2003-11-27 | Gaines Patrick J. | Flea peritrophin nucleic acid molecules, proteins and uses thereof |
| WO2004013289A3 (en) * | 2002-08-02 | 2005-02-03 | Univ Ohio | Diagnosis of kidney damage and protection against same |
| US20090042230A1 (en) * | 1999-04-09 | 2009-02-12 | Gaines Patrick J | Flea allantoinase proteins and uses thereof |
Families Citing this family (18)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5580760A (en) * | 1993-02-22 | 1996-12-03 | The United States Of America As Represented By The Department Of Health And Human Services | FUSE binding protein and cDNA therefor |
| WO2004027061A1 (en) * | 2002-09-17 | 2004-04-01 | Yamanouchi Pharmaceutical Co., Ltd. | Method of screening remedy for proliferative disease |
| WO2005015225A1 (en) * | 2003-08-06 | 2005-02-17 | Roche Diagnostics Gmbh | Use of the far upstream element (fuse) binding protein (fubp) as a marker for colorectal cancer |
| WO2005015222A1 (en) * | 2003-08-07 | 2005-02-17 | Roche Diagnostics Gmbh | Use of the far upstream element (fuse) binding protein (fubp) as a marker for colorectal cancer |
| US7659426B2 (en) * | 2003-12-01 | 2010-02-09 | Reverse Proteomics Research Institute Co., Ltd. | Target protein of anticancer agent and novel anticancer agent (spnal) corresponding thereto |
| KR101441693B1 (en) | 2005-10-18 | 2014-09-25 | 내셔날 쥬이쉬 헬스 | Conditionally immortalized long-term stem cells and methods of making and using such cells |
| US8551968B2 (en) | 2007-03-13 | 2013-10-08 | National Jewish Health | Methods for generation of antibodies |
| SG190661A1 (en) | 2008-05-16 | 2013-06-28 | Taiga Biotechnologies Inc | Antibodies and processes for preparing the same |
| ES2525411T3 (en) | 2008-07-21 | 2014-12-22 | Taiga Biotechnologies, Inc. | Differentiated anucleated cells and method to prepare them |
| JP5812861B2 (en) | 2008-08-28 | 2015-11-17 | タイガ バイオテクノロジーズ,インク. | MYC modifier, method of using the MYC modifier, and method of identifying agents that modulate MYC |
| SG11201500429QA (en) | 2012-07-20 | 2015-03-30 | Taiga Biotechnologies Inc | Enhanced reconstitution and autoreconstitution of the hematopoietic compartment |
| US9365825B2 (en) | 2013-03-11 | 2016-06-14 | Taiga Biotechnologies, Inc. | Expansion of adult stem cells in vitro |
| US10272115B2 (en) | 2013-03-11 | 2019-04-30 | Taiga Biotechnologies, Inc. | Production and use of red blood cells |
| AU2017367730A1 (en) | 2016-12-02 | 2019-06-06 | Taiga Biotechnologies, Inc. | Nanoparticle formulations |
| US10149898B2 (en) | 2017-08-03 | 2018-12-11 | Taiga Biotechnologies, Inc. | Methods and compositions for the treatment of melanoma |
| SG11202009680XA (en) * | 2018-04-05 | 2020-10-29 | Hoffmann La Roche | Use of fubp1 inhibitors for treating hepatitis b virus infection |
| WO2020210231A1 (en) | 2019-04-08 | 2020-10-15 | Taiga Biotechnologies, Inc. | Compositions and methods for the cry opreservation of immune cells |
| CN114072166A (en) | 2019-05-14 | 2022-02-18 | 泰加生物工艺学公司 | Compositions and methods for treating T cell depletion |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5580760A (en) * | 1993-02-22 | 1996-12-03 | The United States Of America As Represented By The Department Of Health And Human Services | FUSE binding protein and cDNA therefor |
| US5585479A (en) * | 1992-07-24 | 1996-12-17 | The United States Of America As Represented By The Secretary Of The Navy | Antisense oligonucleotides directed against human ELAM-I RNA |
-
1993
- 1993-02-22 US US08/021,608 patent/US5580760A/en not_active Expired - Lifetime
-
1994
- 1994-02-22 AU AU62446/94A patent/AU6244694A/en not_active Abandoned
- 1994-02-22 WO PCT/US1994/001782 patent/WO1994019465A2/en not_active Ceased
-
1996
- 1996-10-04 US US08/726,160 patent/US5734016A/en not_active Expired - Lifetime
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5585479A (en) * | 1992-07-24 | 1996-12-17 | The United States Of America As Represented By The Secretary Of The Navy | Antisense oligonucleotides directed against human ELAM-I RNA |
| US5580760A (en) * | 1993-02-22 | 1996-12-03 | The United States Of America As Represented By The Department Of Health And Human Services | FUSE binding protein and cDNA therefor |
Non-Patent Citations (22)
| Title |
|---|
| Avigan, M.I., et al. "Cloning of the cellular factor which binds the far upstream element (FUSE) of the c-myc gene," J. Cell. Biochem., vol. 50, Suppl. 0 (16 part A), 1992, p. 83, abstract No. B 211, which is an abstract of an oral presentation given at the Keystone Symposium on Transcription Regulation, Tamorron, Colorado, USA, Jan. 13-20, 1992. |
| Avigan, M.I., et al. A Far Upstream Element Stimulates c myc Expression in Undifferentiated Leukemia Cells, J. Biol. Chem., 265, 18538 18545 (1990). * |
| Avigan, M.I., et al. Cloning of the cellular factor which binds the far upstream element (FUSE) of the c myc gene, J. Cell. Biochem., vol. 50, Suppl. 0 (16 part A), 1992, p. 83, abstract No. B 211, which is an abstract of an oral presentation given at the Keystone Symposium on Transcription Regulation, Tamorron, Colorado, USA, Jan. 13 20, 1992. * |
| Avigan, M.I., et al."A Far Upstream Element Stimulates c-myc Expression in Undifferentiated Leukemia Cells," J. Biol. Chem., 265, 18538-18545 (1990). |
| Barinaga, M. 1994 Science vol. 266, p. 1326. * |
| Crystal, R. 1995 Science vol. 270, pp. 404 410. * |
| Crystal, R. 1995 Science vol. 270, pp. 404-410. |
| Duncan, R., et al. "The human c-myc FUSE element binds a differentiation regulated protein with a novel structure," Biomed. Pharmacother., vol. 46, 5-7 (1992), p. 243, abstract No. 11, which is an abstract of an oral presentation given at the 5th International Congress on Differentiation Therapy, Villasimius, Italy, 2-5 Sep., 1992. |
| Duncan, R., et al. The human c myc FUSE element binds a differentiation regulated protein with a novel structure, Biomed. Pharmacother., vol. 46, 5 7 (1992), p. 243, abstract No. 11, which is an abstract of an oral presentation given at the 5th International Congress on Differentiation Therapy, Villasimius, Italy, 2 5 Sep., 1992. * |
| Gura, T. 1995 Science vol. 270, pp. 575 577. * |
| Gura, T. 1995 Science vol. 270, pp. 575-577. |
| Jolly, D. 1994 Cancer Gene Therapy vol. 1(1) pp. 51 64. * |
| Jolly, D. 1994 Cancer Gene Therapy vol. 1(1) pp. 51-64. |
| Marshall, E.M. 1995 Science vol. 269, pp. 1050 1055. * |
| Marshall, E.M. 1995 Science vol. 269, pp. 1050-1055. |
| Orkin et al. Dec. 1995 Report and Recommendations of the Panel to Assess the NIH Investment in Research on Gene Therapy. * |
| Roush, W. Science May 23, 1997 vol. 276, pp. 1192 1193. * |
| Roush, W. Science May 23, 1997 vol. 276, pp. 1192-1193. |
| Sigma Molecular Biology, published by Sigma Chemical Company St. Louis, MO; 51 52 (1989). * |
| Sigma Molecular Biology, published by Sigma Chemical Company St. Louis, MO; 51-52 (1989). |
| Wagner, R.W. 1994 Nature, vol. 372, pp. 333 335. * |
| Wagner, R.W. 1994 Nature, vol. 372, pp. 333-335. |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20030220487A1 (en) * | 1999-04-09 | 2003-11-27 | Gaines Patrick J. | Flea peritrophin nucleic acid molecules, proteins and uses thereof |
| US7247447B2 (en) * | 1999-04-09 | 2007-07-24 | Heska Corporation | Flea peritrophin nucleic acid molecules, proteins and uses thereof |
| US20080014624A1 (en) * | 1999-04-09 | 2008-01-17 | Gaines Patrick J | Flea peritrophin nucleic acid molecules |
| US20090042230A1 (en) * | 1999-04-09 | 2009-02-12 | Gaines Patrick J | Flea allantoinase proteins and uses thereof |
| US7662622B2 (en) | 1999-04-09 | 2010-02-16 | Heska Corporation | Flea peritrophin nucleic acid molecules |
| WO2004013289A3 (en) * | 2002-08-02 | 2005-02-03 | Univ Ohio | Diagnosis of kidney damage and protection against same |
Also Published As
| Publication number | Publication date |
|---|---|
| WO1994019465A2 (en) | 1994-09-01 |
| WO1994019465A3 (en) | 1994-10-27 |
| AU6244694A (en) | 1994-09-14 |
| US5580760A (en) | 1996-12-03 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US5734016A (en) | Fuse binding protein and method for inhibiting expression of fuse binding protein | |
| Ohkuma et al. | Structural motifs and potential a homologies in the large subunit of human general transcription factor TFIIE | |
| Tomasetto et al. | Lasp-1 (MLN 50) defines a new LIM protein subfamily characterized by the association of LIM and SH3 domains | |
| Kageyama et al. | Molecular characterization of transcription factors that bind to the cAMP responsive region of the substance P precursor gene. cDNA cloning of a novel C/EBP-related factor | |
| Aso et al. | Characterization of cDNA for the large subunit of the transcription initiation factor TFIIF | |
| Dwornczak et al. | Structure and expression of a human gene coding for a 71 kd heat shock ‘cognate’protein | |
| Phelan et al. | Cooperative interactions between HOX and PBX proteins mediated by a conserved peptide motif | |
| Bunz et al. | cDNAs encoding the large subunit of human replication factor C. | |
| Nata et al. | Molecular cloning, expression and chromosomal localization of a novel human REG family gene, REG III | |
| de la Brousse et al. | Molecular and genetic characterization of GABP beta. | |
| Pan et al. | Cloning, mapping, and characterization of a human homologue of the yeast longevity assurance gene LAG1 | |
| US5302519A (en) | Method of producing a Mad polypeptide | |
| Hemmerich et al. | Structural and functional properties of ribosomal protein L7 from humans and rodents | |
| Gecz et al. | Gene structure and subcellular localization of FMR2, a member of a new family of putative transcription activators | |
| Wenk et al. | Molecular cloning and primary structure of a protein phosphatase 2C isoform | |
| JPH0690764A (en) | Method of preparing human nerve growth factor by gene recombination | |
| Huh et al. | Presence of a bi-directional S phase-specific transcription regulatory element in the promoter shared by testis-specific TH2A and TH2B histone genes | |
| EP0681585A1 (en) | Tata-binding protein associated factors, nucleic acids encoding tafs, and methods of use | |
| DE69430121T2 (en) | METHOD AND COMPOSITIONS SIMILAR TO THE BINDING PROTEIN OF THE STEROL REGULATION ELEMENT | |
| JP2001514264A (en) | Human defensin polypeptide Def-X, genomic DNA and cDNA, compositions containing them and applications in diagnostic and therapeutic treatment | |
| Okano et al. | Cloning of CRP2, a novel member of the cysteine-rich protein family with two repeats of an unusual LIM/double zinc-finger motif | |
| Kalousová et al. | DNA binding and transactivating properties of the paired and homeobox protein Pax4 | |
| Weitzel et al. | The HRAS1 gene cluster: two upstream regions recognizing transcripts and a third encoding a gene with a leucine zipper domain | |
| CA2330228A1 (en) | Angiostatin-binding protein | |
| WO1997039105A1 (en) | Mad-related genes in the human |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
| FPAY | Fee payment |
Year of fee payment: 4 |
|
| FPAY | Fee payment |
Year of fee payment: 8 |
|
| FPAY | Fee payment |
Year of fee payment: 12 |