US20020077309A1 - Diagnostics and therapeutics for pancreatic disorders - Google Patents
Diagnostics and therapeutics for pancreatic disorders Download PDFInfo
- Publication number
- US20020077309A1 US20020077309A1 US09/864,711 US86471101A US2002077309A1 US 20020077309 A1 US20020077309 A1 US 20020077309A1 US 86471101 A US86471101 A US 86471101A US 2002077309 A1 US2002077309 A1 US 2002077309A1
- Authority
- US
- United States
- Prior art keywords
- protein
- polynucleotide
- molecules
- ligand
- expression
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 208000016222 Pancreatic disease Diseases 0.000 title abstract description 28
- 239000003814 drug Substances 0.000 title description 13
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 218
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 146
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 146
- 239000002157 polynucleotide Substances 0.000 claims abstract description 146
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 128
- 238000000034 method Methods 0.000 claims abstract description 94
- 239000000203 mixture Substances 0.000 claims abstract description 31
- 230000014509 gene expression Effects 0.000 claims description 53
- 238000009396 hybridization Methods 0.000 claims description 40
- 150000007523 nucleic acids Chemical group 0.000 claims description 30
- 108020004414 DNA Proteins 0.000 claims description 29
- 239000013598 vector Substances 0.000 claims description 29
- 239000003446 ligand Substances 0.000 claims description 18
- 206010067584 Type 1 diabetes mellitus Diseases 0.000 claims description 16
- 239000000758 substrate Substances 0.000 claims description 16
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 15
- -1 mimetics Proteins 0.000 claims description 13
- 230000009870 specific binding Effects 0.000 claims description 13
- 230000009918 complex formation Effects 0.000 claims description 11
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 10
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 9
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 8
- 108091093037 Peptide nucleic acid Proteins 0.000 claims description 6
- 239000005557 antagonist Substances 0.000 claims description 6
- 238000002372 labelling Methods 0.000 claims description 6
- 239000000556 agonist Substances 0.000 claims description 5
- 239000003937 drug carrier Substances 0.000 claims description 5
- 238000004113 cell culture Methods 0.000 claims description 4
- 210000004923 pancreatic tissue Anatomy 0.000 claims description 3
- 229920001184 polypeptide Polymers 0.000 claims description 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 238000012258 culturing Methods 0.000 claims 1
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 abstract description 63
- 102000004877 Insulin Human genes 0.000 abstract description 31
- 108090001061 Insulin Proteins 0.000 abstract description 31
- 229940125396 insulin Drugs 0.000 abstract description 31
- 230000015572 biosynthetic process Effects 0.000 abstract description 19
- 238000003786 synthesis reaction Methods 0.000 abstract description 17
- 238000011282 treatment Methods 0.000 abstract description 13
- 239000013604 expression vector Substances 0.000 abstract description 12
- 238000002560 therapeutic procedure Methods 0.000 abstract description 11
- 238000003745 diagnosis Methods 0.000 abstract description 9
- 238000004393 prognosis Methods 0.000 abstract description 8
- 238000011156 evaluation Methods 0.000 abstract description 7
- 235000018102 proteins Nutrition 0.000 description 100
- 210000004027 cell Anatomy 0.000 description 60
- 239000000523 sample Substances 0.000 description 51
- 239000002299 complementary DNA Substances 0.000 description 33
- 239000002773 nucleotide Substances 0.000 description 31
- 125000003729 nucleotide group Chemical group 0.000 description 31
- 108020004999 messenger RNA Proteins 0.000 description 29
- 230000004186 co-expression Effects 0.000 description 28
- 241000282414 Homo sapiens Species 0.000 description 24
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 24
- 210000001519 tissue Anatomy 0.000 description 24
- 239000012528 membrane Substances 0.000 description 21
- 230000000295 complement effect Effects 0.000 description 20
- 102000039446 nucleic acids Human genes 0.000 description 19
- 108020004707 nucleic acids Proteins 0.000 description 19
- 210000004379 membrane Anatomy 0.000 description 18
- 108020004635 Complementary DNA Proteins 0.000 description 16
- 238000004458 analytical method Methods 0.000 description 16
- 150000001413 amino acids Chemical group 0.000 description 15
- 206010012601 diabetes mellitus Diseases 0.000 description 15
- 210000000496 pancreas Anatomy 0.000 description 15
- 150000001875 compounds Chemical class 0.000 description 14
- 208000035475 disorder Diseases 0.000 description 14
- 102000005311 colipase Human genes 0.000 description 13
- 108020002632 colipase Proteins 0.000 description 13
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 12
- 239000000243 solution Substances 0.000 description 12
- 101100289203 Rattus norvegicus Reg1 gene Proteins 0.000 description 11
- MASNOZXLGMXCHN-ZLPAWPGGSA-N glucagon Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)C(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C1=CC=CC=C1 MASNOZXLGMXCHN-ZLPAWPGGSA-N 0.000 description 11
- 239000003550 marker Substances 0.000 description 11
- 239000013612 plasmid Substances 0.000 description 11
- 210000000130 stem cell Anatomy 0.000 description 11
- 101001081479 Homo sapiens Islet amyloid polypeptide Proteins 0.000 description 10
- 230000027455 binding Effects 0.000 description 10
- 201000010099 disease Diseases 0.000 description 10
- 230000000694 effects Effects 0.000 description 10
- 230000001225 therapeutic effect Effects 0.000 description 10
- 102000051325 Glucagon Human genes 0.000 description 9
- 108060003199 Glucagon Proteins 0.000 description 9
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 9
- 238000003556 assay Methods 0.000 description 9
- 230000001580 bacterial effect Effects 0.000 description 9
- 239000012634 fragment Substances 0.000 description 9
- 229960004666 glucagon Drugs 0.000 description 9
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 9
- 102000053642 Catalytic RNA Human genes 0.000 description 8
- 108090000994 Catalytic RNA Proteins 0.000 description 8
- 102000004882 Lipase Human genes 0.000 description 8
- 108090001060 Lipase Proteins 0.000 description 8
- 239000004367 Lipase Substances 0.000 description 8
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 8
- 230000006870 function Effects 0.000 description 8
- 229940040461 lipase Drugs 0.000 description 8
- 235000019421 lipase Nutrition 0.000 description 8
- 108091092562 ribozyme Proteins 0.000 description 8
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 7
- 238000003491 array Methods 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 7
- 238000010195 expression analysis Methods 0.000 description 7
- 238000004519 manufacturing process Methods 0.000 description 7
- 230000004048 modification Effects 0.000 description 7
- 238000012986 modification Methods 0.000 description 7
- 230000001105 regulatory effect Effects 0.000 description 7
- 241000283973 Oryctolagus cuniculus Species 0.000 description 6
- 241000700159 Rattus Species 0.000 description 6
- 239000000872 buffer Substances 0.000 description 6
- 210000004153 islets of langerhan Anatomy 0.000 description 6
- 210000004185 liver Anatomy 0.000 description 6
- 208000024691 pancreas disease Diseases 0.000 description 6
- 229920000642 polymer Polymers 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 238000000746 purification Methods 0.000 description 6
- 238000012216 screening Methods 0.000 description 6
- 238000012360 testing method Methods 0.000 description 6
- 108091035707 Consensus sequence Proteins 0.000 description 5
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 5
- 235000001014 amino acid Nutrition 0.000 description 5
- 125000000539 amino acid group Chemical group 0.000 description 5
- 238000010367 cloning Methods 0.000 description 5
- 230000002596 correlated effect Effects 0.000 description 5
- 229940079593 drug Drugs 0.000 description 5
- 108020001507 fusion proteins Proteins 0.000 description 5
- 102000037865 fusion proteins Human genes 0.000 description 5
- 210000000056 organ Anatomy 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 4
- 238000000729 Fisher's exact test Methods 0.000 description 4
- 108091023040 Transcription factor Proteins 0.000 description 4
- 102000040945 Transcription factor Human genes 0.000 description 4
- 210000000601 blood cell Anatomy 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 4
- 230000000875 corresponding effect Effects 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 239000012153 distilled water Substances 0.000 description 4
- 239000003623 enhancer Substances 0.000 description 4
- 230000002068 genetic effect Effects 0.000 description 4
- 239000008103 glucose Substances 0.000 description 4
- 108010050848 glycylleucine Proteins 0.000 description 4
- 208000033066 hyperinsulinemic hypoglycemia Diseases 0.000 description 4
- 238000003384 imaging method Methods 0.000 description 4
- 238000000338 in vitro Methods 0.000 description 4
- 210000003205 muscle Anatomy 0.000 description 4
- 230000037361 pathway Effects 0.000 description 4
- 230000026731 phosphorylation Effects 0.000 description 4
- 238000006366 phosphorylation reaction Methods 0.000 description 4
- 239000011541 reaction mixture Substances 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 238000013519 translation Methods 0.000 description 4
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 description 3
- 206010002383 Angina Pectoris Diseases 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 108091026890 Coding region Proteins 0.000 description 3
- 208000007342 Diabetic Nephropathies Diseases 0.000 description 3
- 208000032131 Diabetic Neuropathies Diseases 0.000 description 3
- 206010012689 Diabetic retinopathy Diseases 0.000 description 3
- 238000002965 ELISA Methods 0.000 description 3
- 102000010911 Enzyme Precursors Human genes 0.000 description 3
- 108010062466 Enzyme Precursors Proteins 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- 108091060211 Expressed sequence tag Proteins 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 241000238631 Hexapoda Species 0.000 description 3
- 101000976580 Homo sapiens Zinc finger protein 133 Proteins 0.000 description 3
- 206010020772 Hypertension Diseases 0.000 description 3
- 241000880493 Leptailurus serval Species 0.000 description 3
- 102000011364 Major intrinsic proteins Human genes 0.000 description 3
- 108050001696 Major intrinsic proteins Proteins 0.000 description 3
- 241000699670 Mus sp. Species 0.000 description 3
- 206010028851 Necrosis Diseases 0.000 description 3
- 239000004677 Nylon Substances 0.000 description 3
- 108091034117 Oligonucleotide Proteins 0.000 description 3
- 208000018262 Peripheral vascular disease Diseases 0.000 description 3
- 108010029485 Protein Isoforms Proteins 0.000 description 3
- 102000001708 Protein Isoforms Human genes 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 208000025865 Ulcer Diseases 0.000 description 3
- 102100023575 Zinc finger protein 133 Human genes 0.000 description 3
- 238000010171 animal model Methods 0.000 description 3
- 210000001185 bone marrow Anatomy 0.000 description 3
- 210000004899 c-terminal region Anatomy 0.000 description 3
- 210000000845 cartilage Anatomy 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 108010016616 cysteinylglycine Proteins 0.000 description 3
- 230000003247 decreasing effect Effects 0.000 description 3
- 208000033679 diabetic kidney disease Diseases 0.000 description 3
- 238000007865 diluting Methods 0.000 description 3
- 230000005284 excitation Effects 0.000 description 3
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 3
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 3
- 239000008187 granular material Substances 0.000 description 3
- 230000012010 growth Effects 0.000 description 3
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 239000003112 inhibitor Substances 0.000 description 3
- 230000005764 inhibitory process Effects 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 210000000936 intestine Anatomy 0.000 description 3
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 3
- 108010017391 lysylvaline Proteins 0.000 description 3
- 208000010125 myocardial infarction Diseases 0.000 description 3
- 230000017074 necrotic cell death Effects 0.000 description 3
- 230000003472 neutralizing effect Effects 0.000 description 3
- 229920001778 nylon Polymers 0.000 description 3
- 239000008194 pharmaceutical composition Substances 0.000 description 3
- 102000054765 polymorphisms of proteins Human genes 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 239000000047 product Substances 0.000 description 3
- 238000003127 radioimmunoassay Methods 0.000 description 3
- 230000001172 regenerating effect Effects 0.000 description 3
- 230000008439 repair process Effects 0.000 description 3
- 210000002966 serum Anatomy 0.000 description 3
- 229940124597 therapeutic agent Drugs 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 230000009261 transgenic effect Effects 0.000 description 3
- 208000001072 type 2 diabetes mellitus Diseases 0.000 description 3
- 230000036269 ulceration Effects 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- IAKHMKGGTNLKSZ-INIZCTEOSA-N (S)-colchicine Chemical compound C1([C@@H](NC(C)=O)CC2)=CC(=O)C(OC)=CC=C1C1=C2C=C(OC)C(OC)=C1OC IAKHMKGGTNLKSZ-INIZCTEOSA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- 108091023043 Alu Element Proteins 0.000 description 2
- 108091023037 Aptamer Proteins 0.000 description 2
- 102100029463 Aquaporin-8 Human genes 0.000 description 2
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 2
- 208000002381 Brain Hypoxia Diseases 0.000 description 2
- 241000282472 Canis lupus familiaris Species 0.000 description 2
- 239000004971 Cross linker Substances 0.000 description 2
- AOJJSUZBOXZQNB-TZSSRYMLSA-N Doxorubicin Chemical compound O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(=O)CO)[C@H]1C[C@H](N)[C@H](O)[C@H](C)O1 AOJJSUZBOXZQNB-TZSSRYMLSA-N 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 241000206602 Eukaryota Species 0.000 description 2
- KRHYYFGTRYWZRS-UHFFFAOYSA-N Fluorane Chemical compound F KRHYYFGTRYWZRS-UHFFFAOYSA-N 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- VNCLJDOTEPPBBD-GUBZILKMSA-N Gln-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N VNCLJDOTEPPBBD-GUBZILKMSA-N 0.000 description 2
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 101000771417 Homo sapiens Aquaporin-8 Proteins 0.000 description 2
- 108060003951 Immunoglobulin Proteins 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 2
- 102000018697 Membrane Proteins Human genes 0.000 description 2
- 108010052285 Membrane Proteins Proteins 0.000 description 2
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 108091005461 Nucleic proteins Proteins 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 240000007594 Oryza sativa Species 0.000 description 2
- 235000007164 Oryza sativa Nutrition 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- 206010061902 Pancreatic neoplasm Diseases 0.000 description 2
- 206010033645 Pancreatitis Diseases 0.000 description 2
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- 101710182846 Polyhedrin Proteins 0.000 description 2
- 108020004518 RNA Probes Proteins 0.000 description 2
- 239000003391 RNA probe Substances 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- 241000283984 Rodentia Species 0.000 description 2
- 229920002684 Sepharose Polymers 0.000 description 2
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 2
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 2
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 2
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 2
- 108091036066 Three prime untranslated region Proteins 0.000 description 2
- 239000007984 Tris EDTA buffer Substances 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- RJURFGZVJUQBHK-UHFFFAOYSA-N actinomycin D Natural products CC1OC(=O)C(C(C)C)N(C)C(=O)CN(C)C(=O)C2CCCN2C(=O)C(C(C)C)NC(=O)C1NC(=O)C1=C(N)C(=O)C(C)=C2OC(C(C)=CC=C3C(=O)NC4C(=O)NC(C(N5CCCC5C(=O)N(C)CC(=O)N(C)C(C(C)C)C(=O)OC4C)=O)C(C)C)=C3N=C21 RJURFGZVJUQBHK-UHFFFAOYSA-N 0.000 description 2
- 239000004480 active ingredient Substances 0.000 description 2
- 239000002671 adjuvant Substances 0.000 description 2
- 210000004504 adult stem cell Anatomy 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 230000000890 antigenic effect Effects 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 239000011324 bead Substances 0.000 description 2
- 239000012472 biological sample Substances 0.000 description 2
- AIYUHDOJVYHVIT-UHFFFAOYSA-M caesium chloride Chemical compound [Cl-].[Cs+] AIYUHDOJVYHVIT-UHFFFAOYSA-M 0.000 description 2
- 229960003669 carbenicillin Drugs 0.000 description 2
- FPPNZSSZRUTDAP-UWFZAAFLSA-N carbenicillin Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)C(C(O)=O)C1=CC=CC=C1 FPPNZSSZRUTDAP-UWFZAAFLSA-N 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 238000012790 confirmation Methods 0.000 description 2
- 210000002808 connective tissue Anatomy 0.000 description 2
- 238000012937 correction Methods 0.000 description 2
- 238000007405 data analysis Methods 0.000 description 2
- 239000003599 detergent Substances 0.000 description 2
- 238000000295 emission spectrum Methods 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 210000003754 fetus Anatomy 0.000 description 2
- 238000002509 fluorescent in situ hybridization Methods 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 238000001415 gene therapy Methods 0.000 description 2
- 210000004602 germ cell Anatomy 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 210000002216 heart Anatomy 0.000 description 2
- 210000003958 hematopoietic stem cell Anatomy 0.000 description 2
- 210000001551 hemic and immune system Anatomy 0.000 description 2
- 238000002744 homologous recombination Methods 0.000 description 2
- 230000006801 homologous recombination Effects 0.000 description 2
- 239000005556 hormone Substances 0.000 description 2
- 229940088597 hormone Drugs 0.000 description 2
- 210000004408 hybridoma Anatomy 0.000 description 2
- 210000002865 immune cell Anatomy 0.000 description 2
- 230000001900 immune effect Effects 0.000 description 2
- 210000000987 immune system Anatomy 0.000 description 2
- 238000003018 immunoassay Methods 0.000 description 2
- 102000018358 immunoglobulin Human genes 0.000 description 2
- 229940072221 immunoglobulins Drugs 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 230000007774 longterm Effects 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 210000002901 mesenchymal stem cell Anatomy 0.000 description 2
- 238000002493 microarray Methods 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 201000006417 multiple sclerosis Diseases 0.000 description 2
- 210000000663 muscle cell Anatomy 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 210000001178 neural stem cell Anatomy 0.000 description 2
- 210000002569 neuron Anatomy 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 201000002528 pancreatic cancer Diseases 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 239000000813 peptide hormone Substances 0.000 description 2
- 239000008177 pharmaceutical agent Substances 0.000 description 2
- 239000002953 phosphate buffered saline Substances 0.000 description 2
- 230000001323 posttranslational effect Effects 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 230000000644 propagated effect Effects 0.000 description 2
- 210000002307 prostate Anatomy 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 239000011347 resin Substances 0.000 description 2
- 229920005989 resin Polymers 0.000 description 2
- 238000010839 reverse transcription Methods 0.000 description 2
- 210000003705 ribosome Anatomy 0.000 description 2
- 235000009566 rice Nutrition 0.000 description 2
- 150000003839 salts Chemical class 0.000 description 2
- 229940016590 sarkosyl Drugs 0.000 description 2
- 108700004121 sarkosyl Proteins 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 239000001632 sodium acetate Substances 0.000 description 2
- 235000017281 sodium acetate Nutrition 0.000 description 2
- KSAVQLQVUXSOCR-UHFFFAOYSA-M sodium lauroyl sarcosinate Chemical compound [Na+].CCCCCCCCCCCC(=O)N(C)CC([O-])=O KSAVQLQVUXSOCR-UHFFFAOYSA-M 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 210000001550 testis Anatomy 0.000 description 2
- DSNBHJFQCNUKMA-SCKDECHMSA-N thromboxane A2 Chemical compound OC(=O)CCC\C=C/C[C@@H]1[C@@H](/C=C/[C@@H](O)CCCCC)O[C@@H]2O[C@H]1C2 DSNBHJFQCNUKMA-SCKDECHMSA-N 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 108091005703 transmembrane proteins Proteins 0.000 description 2
- 102000035160 transmembrane proteins Human genes 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 238000010396 two-hybrid screening Methods 0.000 description 2
- 238000009281 ultraviolet germicidal irradiation Methods 0.000 description 2
- 241000701447 unidentified baculovirus Species 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 230000002792 vascular Effects 0.000 description 2
- 230000003612 virological effect Effects 0.000 description 2
- 210000005253 yeast cell Anatomy 0.000 description 2
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 1
- 101150072531 10 gene Proteins 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- ZPZDIFSPRVHGIF-UHFFFAOYSA-N 3-aminopropylsilicon Chemical compound NCCC[Si] ZPZDIFSPRVHGIF-UHFFFAOYSA-N 0.000 description 1
- SGOOQMRIPALTEL-UHFFFAOYSA-N 4-hydroxy-N,1-dimethyl-2-oxo-N-phenyl-3-quinolinecarboxamide Chemical compound OC=1C2=CC=CC=C2N(C)C(=O)C=1C(=O)N(C)C1=CC=CC=C1 SGOOQMRIPALTEL-UHFFFAOYSA-N 0.000 description 1
- 108020003589 5' Untranslated Regions Proteins 0.000 description 1
- STQGQHZAVUOBTE-UHFFFAOYSA-N 7-Cyan-hept-2t-en-4,6-diinsaeure Natural products C1=2C(O)=C3C(=O)C=4C(OC)=CC=CC=4C(=O)C3=C(O)C=2CC(O)(C(C)=O)CC1OC1CC(N)C(O)C(C)O1 STQGQHZAVUOBTE-UHFFFAOYSA-N 0.000 description 1
- 108010066676 Abrin Proteins 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 1
- IHRGVZXPTIQNIP-NAKRPEOUSA-N Ala-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C)N IHRGVZXPTIQNIP-NAKRPEOUSA-N 0.000 description 1
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- 201000011374 Alagille syndrome Diseases 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 208000024827 Alzheimer disease Diseases 0.000 description 1
- 102000010637 Aquaporins Human genes 0.000 description 1
- 108010063290 Aquaporins Proteins 0.000 description 1
- MCYJBCKCAPERSE-FXQIFTODSA-N Arg-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N MCYJBCKCAPERSE-FXQIFTODSA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- IRRMIGDCPOPZJW-ULQDDVLXSA-N Arg-His-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IRRMIGDCPOPZJW-ULQDDVLXSA-N 0.000 description 1
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- OMKZPCPZEFMBIT-SRVKXCTJSA-N Arg-Met-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OMKZPCPZEFMBIT-SRVKXCTJSA-N 0.000 description 1
- VEAIMHJZTIDCIH-KKUMJFAQSA-N Arg-Phe-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEAIMHJZTIDCIH-KKUMJFAQSA-N 0.000 description 1
- XFXZKCRBBOVJKS-BVSLBCMMSA-N Arg-Phe-Trp Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 XFXZKCRBBOVJKS-BVSLBCMMSA-N 0.000 description 1
- QUBKBPZGMZWOKQ-SZMVWBNQSA-N Arg-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QUBKBPZGMZWOKQ-SZMVWBNQSA-N 0.000 description 1
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 1
- PAXHINASXXXILC-SRVKXCTJSA-N Asn-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)O PAXHINASXXXILC-SRVKXCTJSA-N 0.000 description 1
- FUHFYEKSGWOWGZ-XHNCKOQMSA-N Asn-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O FUHFYEKSGWOWGZ-XHNCKOQMSA-N 0.000 description 1
- OKZOABJQOMAYEC-NUMRIWBASA-N Asn-Gln-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OKZOABJQOMAYEC-NUMRIWBASA-N 0.000 description 1
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 1
- PLVAAIPKSGUXDV-WHFBIAKZSA-N Asn-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)N PLVAAIPKSGUXDV-WHFBIAKZSA-N 0.000 description 1
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 1
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 1
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- IQCJOIHDVFJQFV-LKXGYXEUSA-N Asp-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O IQCJOIHDVFJQFV-LKXGYXEUSA-N 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 241001203868 Autographa californica Species 0.000 description 1
- 208000023275 Autoimmune disease Diseases 0.000 description 1
- 108090001008 Avidin Proteins 0.000 description 1
- 102100026189 Beta-galactosidase Human genes 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 101150111062 C gene Proteins 0.000 description 1
- 102000000584 Calmodulin Human genes 0.000 description 1
- 108010041952 Calmodulin Proteins 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 102000052052 Casein Kinase II Human genes 0.000 description 1
- 108010010919 Casein Kinase II Proteins 0.000 description 1
- 102000011632 Caseins Human genes 0.000 description 1
- 108010076119 Caseins Proteins 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 108091006146 Channels Proteins 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 108020004394 Complementary RNA Proteins 0.000 description 1
- QLCPDGRAEJSYQM-LPEHRKFASA-N Cys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)C(=O)O QLCPDGRAEJSYQM-LPEHRKFASA-N 0.000 description 1
- UISYPAHPLXGLNH-ACZMJKKPSA-N Cys-Asn-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UISYPAHPLXGLNH-ACZMJKKPSA-N 0.000 description 1
- YZFCGHIBLBDZDA-ZLUOBGJFSA-N Cys-Asp-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YZFCGHIBLBDZDA-ZLUOBGJFSA-N 0.000 description 1
- SDXQKJAWASHMIZ-CIUDSAMLSA-N Cys-Glu-Met Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O SDXQKJAWASHMIZ-CIUDSAMLSA-N 0.000 description 1
- QVLKXRMFNGHDRO-FXQIFTODSA-N Cys-Met-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O QVLKXRMFNGHDRO-FXQIFTODSA-N 0.000 description 1
- MFMDKTLJCUBQIC-MXAVVETBSA-N Cys-Phe-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MFMDKTLJCUBQIC-MXAVVETBSA-N 0.000 description 1
- WZJLBUPPZRZNTO-CIUDSAMLSA-N Cys-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N WZJLBUPPZRZNTO-CIUDSAMLSA-N 0.000 description 1
- KFYPRIGJTICABD-XGEHTFHBSA-N Cys-Thr-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N)O KFYPRIGJTICABD-XGEHTFHBSA-N 0.000 description 1
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 108010092160 Dactinomycin Proteins 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 241000283086 Equidae Species 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- LLQPHQFNMLZJMP-UHFFFAOYSA-N Fentrazamide Chemical compound N1=NN(C=2C(=CC=CC=2)Cl)C(=O)N1C(=O)N(CC)C1CCCCC1 LLQPHQFNMLZJMP-UHFFFAOYSA-N 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- 206010071602 Genetic polymorphism Diseases 0.000 description 1
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 1
- CLPQUWHBWXFJOX-BQBZGAKWSA-N Gln-Gly-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O CLPQUWHBWXFJOX-BQBZGAKWSA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 1
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 1
- KHHDJQRWIFHXHS-NRPADANISA-N Gln-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHHDJQRWIFHXHS-NRPADANISA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- NKSGKPWXSWBRRX-ACZMJKKPSA-N Glu-Asn-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N NKSGKPWXSWBRRX-ACZMJKKPSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 1
- 101710173678 Glucagon-5 Proteins 0.000 description 1
- 102000005720 Glutathione transferase Human genes 0.000 description 1
- 108010070675 Glutathione transferase Proteins 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- AIJAPFVDBFYNKN-WHFBIAKZSA-N Gly-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN)C(=O)N AIJAPFVDBFYNKN-WHFBIAKZSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 1
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 1
- PEZZSFLFXXFUQD-XPUUQOCRSA-N Gly-Cys-Val Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O PEZZSFLFXXFUQD-XPUUQOCRSA-N 0.000 description 1
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 1
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 1
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 1
- AYBKPDHHVADEDA-YUMQZZPRSA-N Gly-His-Asn Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O AYBKPDHHVADEDA-YUMQZZPRSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 1
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- 229920002527 Glycogen Polymers 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- PQKCQZHAGILVIM-NKIYYHGXSA-N His-Glu-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O PQKCQZHAGILVIM-NKIYYHGXSA-N 0.000 description 1
- FONIDUOGWNWEAX-XIRDDKMYSA-N His-Trp-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O FONIDUOGWNWEAX-XIRDDKMYSA-N 0.000 description 1
- PZUZIHRPOVVHOT-KBPBESRZSA-N His-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CN=CN1 PZUZIHRPOVVHOT-KBPBESRZSA-N 0.000 description 1
- 101001064774 Homo sapiens Peroxidasin-like protein Proteins 0.000 description 1
- XQFRJNBWHJMXHO-RRKCRQDMSA-N IDUR Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(I)=C1 XQFRJNBWHJMXHO-RRKCRQDMSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 1
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 1
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 1
- WYUHAXJAMDTOAU-IAVJCBSLSA-N Ile-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WYUHAXJAMDTOAU-IAVJCBSLSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- JERJIYYCOGBAIJ-OBAATPRFSA-N Ile-Tyr-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JERJIYYCOGBAIJ-OBAATPRFSA-N 0.000 description 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 1
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 1
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 1
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 1
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- QQXJROOJCMIHIV-AVGNSLFASA-N Leu-Val-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O QQXJROOJCMIHIV-AVGNSLFASA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 101710084378 Lipase 2 Proteins 0.000 description 1
- 101710084371 Lipase 7 Proteins 0.000 description 1
- 102000016997 Lithostathine Human genes 0.000 description 1
- 108010014691 Lithostathine Proteins 0.000 description 1
- 206010067125 Liver injury Diseases 0.000 description 1
- 239000006142 Luria-Bertani Agar Substances 0.000 description 1
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 1
- GQUDMNDPQTXZRV-DCAQKATOSA-N Lys-Arg-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GQUDMNDPQTXZRV-DCAQKATOSA-N 0.000 description 1
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 1
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 1
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 1
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 1
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- CNUPMMXDISGXMU-CIUDSAMLSA-N Met-Cys-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O CNUPMMXDISGXMU-CIUDSAMLSA-N 0.000 description 1
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 1
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 1
- 229930192392 Mitomycin Natural products 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 1
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 1
- NWIBSHFKIJFRCO-WUDYKRTCSA-N Mytomycin Chemical compound C1N2C(C(C(C)=C(N)C3=O)=O)=C3[C@@H](COC(N)=O)[C@@]2(OC)[C@@H]2[C@H]1N2 NWIBSHFKIJFRCO-WUDYKRTCSA-N 0.000 description 1
- 230000004988 N-glycosylation Effects 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 208000012902 Nervous system disease Diseases 0.000 description 1
- 208000025966 Neurological disease Diseases 0.000 description 1
- 102100037624 Nuclear transition protein 2 Human genes 0.000 description 1
- 229930012538 Paclitaxel Natural products 0.000 description 1
- 108050006759 Pancreatic lipases Proteins 0.000 description 1
- 102000019280 Pancreatic lipases Human genes 0.000 description 1
- 208000018737 Parkinson disease Diseases 0.000 description 1
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 1
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- DJPXNKUDJKGQEE-BZSNNMDCSA-N Phe-Asp-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DJPXNKUDJKGQEE-BZSNNMDCSA-N 0.000 description 1
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 1
- QEFHBVDWKFFKQI-PMVMPFDFSA-N Phe-His-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QEFHBVDWKFFKQI-PMVMPFDFSA-N 0.000 description 1
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 1
- KLXQWABNAWDRAY-ACRUOGEOSA-N Phe-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 KLXQWABNAWDRAY-ACRUOGEOSA-N 0.000 description 1
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 1
- GLJZDMZJHFXJQG-BZSNNMDCSA-N Phe-Ser-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLJZDMZJHFXJQG-BZSNNMDCSA-N 0.000 description 1
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 1
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- WECYCNFPGZLOOU-FXQIFTODSA-N Pro-Asn-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O WECYCNFPGZLOOU-FXQIFTODSA-N 0.000 description 1
- NOXSEHJOXCWRHK-DCAQKATOSA-N Pro-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 NOXSEHJOXCWRHK-DCAQKATOSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 1
- YTWNSIDWAFSEEI-RWMBFGLXSA-N Pro-His-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N3CCC[C@@H]3C(=O)O YTWNSIDWAFSEEI-RWMBFGLXSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 1
- QMABBZHZMDXHKU-FKBYEOEOSA-N Pro-Tyr-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QMABBZHZMDXHKU-FKBYEOEOSA-N 0.000 description 1
- QDDJNKWPTJHROJ-UFYCRDLUSA-N Pro-Tyr-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 QDDJNKWPTJHROJ-UFYCRDLUSA-N 0.000 description 1
- 102000035554 Proglucagon Human genes 0.000 description 1
- 108010058003 Proglucagon Proteins 0.000 description 1
- 102100034750 Protamine-2 Human genes 0.000 description 1
- 102000003923 Protein Kinase C Human genes 0.000 description 1
- 108090000315 Protein Kinase C Proteins 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 102000004022 Protein-Tyrosine Kinases Human genes 0.000 description 1
- 108090000412 Protein-Tyrosine Kinases Proteins 0.000 description 1
- 108700033844 Pseudomonas aeruginosa toxA Proteins 0.000 description 1
- 108010010469 Qa-SNARE Proteins Proteins 0.000 description 1
- 238000002123 RNA extraction Methods 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- MUPFEKGTMRGPLJ-RMMQSMQOSA-N Raffinose Natural products O(C[C@H]1[C@@H](O)[C@H](O)[C@@H](O)[C@@H](O[C@@]2(CO)[C@H](O)[C@@H](O)[C@@H](CO)O2)O1)[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 MUPFEKGTMRGPLJ-RMMQSMQOSA-N 0.000 description 1
- 108090000244 Rat Proteins Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 108700008625 Reporter Genes Proteins 0.000 description 1
- 108010039491 Ricin Proteins 0.000 description 1
- 206010039710 Scleroderma Diseases 0.000 description 1
- 206010040070 Septic Shock Diseases 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 1
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 1
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 1
- 241000256251 Spodoptera frugiperda Species 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- 102100026185 Syncollin Human genes 0.000 description 1
- 101710168213 Syncollin Proteins 0.000 description 1
- 102000050389 Syntaxin Human genes 0.000 description 1
- 108010017842 Telomerase Proteins 0.000 description 1
- ZMZDMBWJUHKJPS-UHFFFAOYSA-M Thiocyanate anion Chemical compound [S-]C#N ZMZDMBWJUHKJPS-UHFFFAOYSA-M 0.000 description 1
- 102000002933 Thioredoxin Human genes 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 1
- YAAPRMFURSENOZ-KATARQTJSA-N Thr-Cys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N)O YAAPRMFURSENOZ-KATARQTJSA-N 0.000 description 1
- FKIGTIXHSRNKJU-IXOXFDKPSA-N Thr-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CN=CN1 FKIGTIXHSRNKJU-IXOXFDKPSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 1
- XZUBGOYOGDRYFC-XGEHTFHBSA-N Thr-Ser-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O XZUBGOYOGDRYFC-XGEHTFHBSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- ZEJBJDHSQPOVJV-UAXMHLISSA-N Thr-Trp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZEJBJDHSQPOVJV-UAXMHLISSA-N 0.000 description 1
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 1
- 206010044248 Toxic shock syndrome Diseases 0.000 description 1
- 231100000650 Toxic shock syndrome Toxicity 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- HNIWONZFMIPCCT-SIXJUCDHSA-N Trp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N HNIWONZFMIPCCT-SIXJUCDHSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- JONPRIHUYSPIMA-UWJYBYFXSA-N Tyr-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JONPRIHUYSPIMA-UWJYBYFXSA-N 0.000 description 1
- IELISNUVHBKYBX-XDTLVQLUSA-N Tyr-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IELISNUVHBKYBX-XDTLVQLUSA-N 0.000 description 1
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 1
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 1
- QOEZFICGUZTRFX-IHRRRGAJSA-N Tyr-Cys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O QOEZFICGUZTRFX-IHRRRGAJSA-N 0.000 description 1
- XQYHLZNPOTXRMQ-KKUMJFAQSA-N Tyr-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XQYHLZNPOTXRMQ-KKUMJFAQSA-N 0.000 description 1
- BXPOOVDVGWEXDU-WZLNRYEVSA-N Tyr-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXPOOVDVGWEXDU-WZLNRYEVSA-N 0.000 description 1
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 1
- VUVVMFSDLYKHPA-PMVMPFDFSA-N Tyr-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC3=CC=C(C=C3)O)N VUVVMFSDLYKHPA-PMVMPFDFSA-N 0.000 description 1
- CNNVVEPJTFOGHI-ACRUOGEOSA-N Tyr-Lys-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNNVVEPJTFOGHI-ACRUOGEOSA-N 0.000 description 1
- JXGUUJMPCRXMSO-HJOGWXRNSA-N Tyr-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 JXGUUJMPCRXMSO-HJOGWXRNSA-N 0.000 description 1
- MNWINJDPGBNOED-ULQDDVLXSA-N Tyr-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 MNWINJDPGBNOED-ULQDDVLXSA-N 0.000 description 1
- BIVIUZRBCAUNPW-JRQIVUDYSA-N Tyr-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O BIVIUZRBCAUNPW-JRQIVUDYSA-N 0.000 description 1
- MUPFEKGTMRGPLJ-UHFFFAOYSA-N UNPD196149 Natural products OC1C(O)C(CO)OC1(CO)OC1C(O)C(O)C(O)C(COC2C(C(O)C(O)C(CO)O2)O)O1 MUPFEKGTMRGPLJ-UHFFFAOYSA-N 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- WBUOKGBHGDPYMH-GUBZILKMSA-N Val-Cys-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)C(C)C WBUOKGBHGDPYMH-GUBZILKMSA-N 0.000 description 1
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 1
- KVRLNEILGGVBJX-IHRRRGAJSA-N Val-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CN=CN1 KVRLNEILGGVBJX-IHRRRGAJSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 1
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 1
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- SDHZOOIGIUEPDY-JYJNAYRXSA-N Val-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 SDHZOOIGIUEPDY-JYJNAYRXSA-N 0.000 description 1
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 1
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 1
- PMKQKNBISAOSRI-XHSDSOJGSA-N Val-Tyr-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N PMKQKNBISAOSRI-XHSDSOJGSA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- JXLYSJRDGCGARV-WWYNWVTFSA-N Vinblastine Natural products O=C(O[C@H]1[C@](O)(C(=O)OC)[C@@H]2N(C)c3c(cc(c(OC)c3)[C@]3(C(=O)OC)c4[nH]c5c(c4CCN4C[C@](O)(CC)C[C@H](C3)C4)cccc5)[C@@]32[C@H]2[C@@]1(CC)C=CCN2CC3)C JXLYSJRDGCGARV-WWYNWVTFSA-N 0.000 description 1
- 208000027418 Wounds and injury Diseases 0.000 description 1
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 1
- 101710185494 Zinc finger protein Proteins 0.000 description 1
- 102100023597 Zinc finger protein 816 Human genes 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- RJURFGZVJUQBHK-IIXSONLDSA-N actinomycin D Chemical compound C[C@H]1OC(=O)[C@H](C(C)C)N(C)C(=O)CN(C)C(=O)[C@@H]2CCCN2C(=O)[C@@H](C(C)C)NC(=O)[C@H]1NC(=O)C1=C(N)C(=O)C(C)=C2OC(C(C)=CC=C3C(=O)N[C@@H]4C(=O)N[C@@H](C(N5CCC[C@H]5C(=O)N(C)CC(=O)N(C)[C@@H](C(C)C)C(=O)O[C@@H]4C)=O)C(C)C)=C3N=C21 RJURFGZVJUQBHK-IIXSONLDSA-N 0.000 description 1
- 230000010933 acylation Effects 0.000 description 1
- 238000005917 acylation reaction Methods 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 210000004100 adrenal gland Anatomy 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- MWPLVEDNUUSJAV-UHFFFAOYSA-N anthracene Chemical compound C1=CC=CC2=CC3=CC=CC=C3C=C21 MWPLVEDNUUSJAV-UHFFFAOYSA-N 0.000 description 1
- 230000002788 anti-peptide Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- YZXBAPSDXZZRGB-DOFZRALJSA-N arachidonic acid Chemical class CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O YZXBAPSDXZZRGB-DOFZRALJSA-N 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 206010003246 arthritis Diseases 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 238000002869 basic local alignment search tool Methods 0.000 description 1
- 210000000227 basophil cell of anterior lobe of hypophysis Anatomy 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- 210000003445 biliary tract Anatomy 0.000 description 1
- 238000004166 bioassay Methods 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 229930189065 blasticidin Natural products 0.000 description 1
- 210000002459 blastocyst Anatomy 0.000 description 1
- 239000002981 blocking agent Substances 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000004204 blood vessel Anatomy 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 229940098773 bovine serum albumin Drugs 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 210000004958 brain cell Anatomy 0.000 description 1
- 210000000621 bronchi Anatomy 0.000 description 1
- 101150046240 bsd gene Proteins 0.000 description 1
- 239000007975 buffered saline Substances 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 230000021523 carboxylation Effects 0.000 description 1
- 238000006473 carboxylation reaction Methods 0.000 description 1
- 210000004413 cardiac myocyte Anatomy 0.000 description 1
- 210000000748 cardiovascular system Anatomy 0.000 description 1
- 239000005018 casein Substances 0.000 description 1
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 1
- 235000021240 caseins Nutrition 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000000546 chi-square test Methods 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 210000002477 chromaffin system Anatomy 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000003200 chromosome mapping Methods 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 230000001427 coherent effect Effects 0.000 description 1
- 229960001338 colchicine Drugs 0.000 description 1
- 210000001072 colon Anatomy 0.000 description 1
- 230000009137 competitive binding Effects 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 239000013068 control sample Substances 0.000 description 1
- IDLFZVILOHSSID-OVLDLUHVSA-N corticotropin Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)NC(=O)[C@@H](N)CO)C1=CC=C(O)C=C1 IDLFZVILOHSSID-OVLDLUHVSA-N 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 1
- 229940127089 cytotoxic agent Drugs 0.000 description 1
- 239000002254 cytotoxic agent Substances 0.000 description 1
- 231100000599 cytotoxic agent Toxicity 0.000 description 1
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 1
- 229960000640 dactinomycin Drugs 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- STQGQHZAVUOBTE-VGBVRHCVSA-N daunorubicin Chemical compound O([C@H]1C[C@@](O)(CC=2C(O)=C3C(=O)C=4C=CC=C(C=4C(=O)C3=C(O)C=21)OC)C(C)=O)[C@H]1C[C@H](N)[C@H](O)[C@H](C)O1 STQGQHZAVUOBTE-VGBVRHCVSA-N 0.000 description 1
- 229960000975 daunorubicin Drugs 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 210000002249 digestive system Anatomy 0.000 description 1
- 108010009297 diglycyl-histidine Proteins 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- SLPJGDQJLTYWCI-UHFFFAOYSA-N dimethyl-(4,5,6,7-tetrabromo-1h-benzoimidazol-2-yl)-amine Chemical compound BrC1=C(Br)C(Br)=C2NC(N(C)C)=NC2=C1Br SLPJGDQJLTYWCI-UHFFFAOYSA-N 0.000 description 1
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- BNIILDVGGAEEIG-UHFFFAOYSA-L disodium hydrogen phosphate Chemical compound [Na+].[Na+].OP([O-])([O-])=O BNIILDVGGAEEIG-UHFFFAOYSA-L 0.000 description 1
- 229910000397 disodium phosphate Inorganic materials 0.000 description 1
- 229960004679 doxorubicin Drugs 0.000 description 1
- 238000007878 drug screening assay Methods 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 230000013020 embryo development Effects 0.000 description 1
- 210000001671 embryonic stem cell Anatomy 0.000 description 1
- 210000002257 embryonic structure Anatomy 0.000 description 1
- 210000003372 endocrine gland Anatomy 0.000 description 1
- 210000000750 endocrine system Anatomy 0.000 description 1
- 230000007247 enzymatic mechanism Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 229940088598 enzyme Drugs 0.000 description 1
- 210000000981 epithelium Anatomy 0.000 description 1
- 210000003238 esophagus Anatomy 0.000 description 1
- 229940011871 estrogen Drugs 0.000 description 1
- 239000000262 estrogen Substances 0.000 description 1
- 238000005530 etching Methods 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- VJJPUSNTGOMMGY-MRVIYFEKSA-N etoposide Chemical compound COC1=C(O)C(OC)=CC([C@@H]2C3=CC=4OCOC=4C=C3[C@@H](O[C@H]3[C@@H]([C@@H](O)[C@@H]4O[C@H](C)OC[C@H]4O3)O)[C@@H]3[C@@H]2C(OC3)=O)=C1 VJJPUSNTGOMMGY-MRVIYFEKSA-N 0.000 description 1
- 229960005420 etoposide Drugs 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 210000003499 exocrine gland Anatomy 0.000 description 1
- 230000028023 exocytosis Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000001605 fetal effect Effects 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 230000037433 frameshift Effects 0.000 description 1
- 238000010230 functional analysis Methods 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 229930182830 galactose Natural products 0.000 description 1
- 210000000609 ganglia Anatomy 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 239000003862 glucocorticoid Substances 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 229940096919 glycogen Drugs 0.000 description 1
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010084264 glycyl-glycyl-cysteine Proteins 0.000 description 1
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 230000002710 gonadal effect Effects 0.000 description 1
- ZJYYHGLJYGJLLN-UHFFFAOYSA-N guanidinium thiocyanate Chemical compound SC#N.NC(N)=N ZJYYHGLJYGJLLN-UHFFFAOYSA-N 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 231100000234 hepatic damage Toxicity 0.000 description 1
- 238000013537 high throughput screening Methods 0.000 description 1
- 230000003054 hormonal effect Effects 0.000 description 1
- 102000051533 human PRM1 Human genes 0.000 description 1
- 210000003016 hypothalamus Anatomy 0.000 description 1
- 230000036737 immune function Effects 0.000 description 1
- 230000003053 immunization Effects 0.000 description 1
- 238000002649 immunization Methods 0.000 description 1
- 230000000984 immunochemical effect Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 208000014674 injury Diseases 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 229910052500 inorganic mineral Inorganic materials 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 238000001361 intraarterial administration Methods 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 238000007913 intrathecal administration Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 238000007914 intraventricular administration Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 210000000867 larynx Anatomy 0.000 description 1
- 231100000518 lethal Toxicity 0.000 description 1
- 230000001665 lethal effect Effects 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 1
- 230000029226 lipidation Effects 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- XIXADJRWDQXREU-UHFFFAOYSA-M lithium acetate Chemical compound [Li+].CC([O-])=O XIXADJRWDQXREU-UHFFFAOYSA-M 0.000 description 1
- 210000005229 liver cell Anatomy 0.000 description 1
- 230000008818 liver damage Effects 0.000 description 1
- 210000004072 lung Anatomy 0.000 description 1
- 210000002751 lymph Anatomy 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 239000012139 lysis buffer Substances 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 210000000713 mesentery Anatomy 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 238000012775 microarray technology Methods 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- 239000011707 mineral Substances 0.000 description 1
- 235000010755 mineral Nutrition 0.000 description 1
- 230000002438 mitochondrial effect Effects 0.000 description 1
- 229960004857 mitomycin Drugs 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 239000003068 molecular probe Substances 0.000 description 1
- 210000002894 multi-fate stem cell Anatomy 0.000 description 1
- 210000001665 muscle stem cell Anatomy 0.000 description 1
- 201000006938 muscular dystrophy Diseases 0.000 description 1
- 210000002346 musculoskeletal system Anatomy 0.000 description 1
- 210000000653 nervous system Anatomy 0.000 description 1
- 230000000955 neuroendocrine Effects 0.000 description 1
- 210000004498 neuroglial cell Anatomy 0.000 description 1
- 238000006386 neutralization reaction Methods 0.000 description 1
- 230000009871 nonspecific binding Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 201000008482 osteoarthritis Diseases 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 229960001592 paclitaxel Drugs 0.000 description 1
- 229940116369 pancreatic lipase Drugs 0.000 description 1
- 210000003899 penis Anatomy 0.000 description 1
- 238000010647 peptide synthesis reaction Methods 0.000 description 1
- 210000001539 phagocyte Anatomy 0.000 description 1
- 239000000546 pharmaceutical excipient Substances 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 150000004713 phosphodiesters Chemical class 0.000 description 1
- 230000001817 pituitary effect Effects 0.000 description 1
- 210000002826 placenta Anatomy 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 210000004224 pleura Anatomy 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 229920002704 polyhistidine Polymers 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 108010066381 preproinsulin Proteins 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 108010076339 protamine 2 Proteins 0.000 description 1
- 230000012846 protein folding Effects 0.000 description 1
- 230000018883 protein targeting Effects 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000004445 quantitative analysis Methods 0.000 description 1
- MUPFEKGTMRGPLJ-ZQSKZDJDSA-N raffinose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO[C@@H]2[C@@H]([C@@H](O)[C@@H](O)[C@@H](CO)O2)O)O1 MUPFEKGTMRGPLJ-ZQSKZDJDSA-N 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 125000006853 reporter group Chemical group 0.000 description 1
- 230000001850 reproductive effect Effects 0.000 description 1
- 230000000241 respiratory effect Effects 0.000 description 1
- 210000002345 respiratory system Anatomy 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 206010039073 rheumatoid arthritis Diseases 0.000 description 1
- 239000003161 ribonuclease inhibitor Substances 0.000 description 1
- 229960003522 roquinimex Drugs 0.000 description 1
- 210000003079 salivary gland Anatomy 0.000 description 1
- 235000019515 salmon Nutrition 0.000 description 1
- 238000007423 screening assay Methods 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 210000004739 secretory vesicle Anatomy 0.000 description 1
- 210000001625 seminal vesicle Anatomy 0.000 description 1
- 230000009758 senescence Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 210000000697 sensory organ Anatomy 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 238000012154 short term therapy Methods 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 210000002363 skeletal muscle cell Anatomy 0.000 description 1
- 210000002356 skeleton Anatomy 0.000 description 1
- 210000003491 skin Anatomy 0.000 description 1
- 210000002460 smooth muscle Anatomy 0.000 description 1
- 210000000329 smooth muscle myocyte Anatomy 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 210000001082 somatic cell Anatomy 0.000 description 1
- 210000001988 somatic stem cell Anatomy 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 108010044129 spermatid transition proteins Proteins 0.000 description 1
- 208000020431 spinal cord injury Diseases 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 208000010110 spontaneous platelet aggregation Diseases 0.000 description 1
- 230000000087 stabilizing effect Effects 0.000 description 1
- 230000010473 stable expression Effects 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 238000000528 statistical test Methods 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 210000002784 stomach Anatomy 0.000 description 1
- 210000001548 stomatognathic system Anatomy 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 125000000446 sulfanediyl group Chemical group *S* 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 201000000596 systemic lupus erythematosus Diseases 0.000 description 1
- RCINICONZNJXQF-MZXODVADSA-N taxol Chemical compound O([C@@H]1[C@@]2(C[C@@H](C(C)=C(C2(C)C)[C@H](C([C@]2(C)[C@@H](O)C[C@H]3OC[C@]3([C@H]21)OC(C)=O)=O)OC(=O)C)OC(=O)[C@H](O)[C@@H](NC(=O)C=1C=CC=CC=1)C=1C=CC=CC=1)O)C(=O)C1=CC=CC=C1 RCINICONZNJXQF-MZXODVADSA-N 0.000 description 1
- 210000002435 tendon Anatomy 0.000 description 1
- NRUKOCRGYNPUPR-QBPJDGROSA-N teniposide Chemical compound COC1=C(O)C(OC)=CC([C@@H]2C3=CC=4OCOC=4C=C3[C@@H](O[C@H]3[C@@H]([C@@H](O)[C@@H]4O[C@@H](OC[C@H]4O3)C=3SC=CC=3)O)[C@@H]3[C@@H]2C(OC3)=O)=C1 NRUKOCRGYNPUPR-QBPJDGROSA-N 0.000 description 1
- 229960001278 teniposide Drugs 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 1
- 108060008226 thioredoxin Proteins 0.000 description 1
- 229940094937 thioredoxin Drugs 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 210000001541 thymus gland Anatomy 0.000 description 1
- 230000017423 tissue regeneration Effects 0.000 description 1
- 210000002105 tongue Anatomy 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- 231100000419 toxicity Toxicity 0.000 description 1
- 230000001988 toxicity Effects 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 230000005758 transcription activity Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- 238000011269 treatment regimen Methods 0.000 description 1
- GPRLSGONYQIRFK-MNYXATJNSA-N triton Chemical compound [3H+] GPRLSGONYQIRFK-MNYXATJNSA-N 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 238000002604 ultrasonography Methods 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241000701366 unidentified nuclear polyhedrosis viruses Species 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 210000000626 ureter Anatomy 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- 210000003932 urinary bladder Anatomy 0.000 description 1
- 210000001635 urinary tract Anatomy 0.000 description 1
- VBEQCZHXXJYVRD-GACYYNSASA-N uroanthelone Chemical group C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C(C)C)[C@@H](C)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)NC(=O)[C@@H](NC(=O)CNC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CS)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CS)NC(=O)CNC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O)C(C)C)[C@@H](C)CC)C1=CC=C(O)C=C1 VBEQCZHXXJYVRD-GACYYNSASA-N 0.000 description 1
- 210000004291 uterus Anatomy 0.000 description 1
- 201000010653 vesiculitis Diseases 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 229960003048 vinblastine Drugs 0.000 description 1
- JXLYSJRDGCGARV-XQKSVPLYSA-N vincaleukoblastine Chemical compound C([C@@H](C[C@]1(C(=O)OC)C=2C(=CC3=C([C@]45[C@H]([C@@]([C@H](OC(C)=O)[C@]6(CC)C=CCN([C@H]56)CC4)(O)C(=O)OC)N3C)C=2)OC)C[C@@](C2)(O)CC)N2CCC2=C1NC1=CC=CC=C21 JXLYSJRDGCGARV-XQKSVPLYSA-N 0.000 description 1
- 229960004528 vincristine Drugs 0.000 description 1
- OGWKCGZFUXNPDA-XQKSVPLYSA-N vincristine Chemical compound C([N@]1C[C@@H](C[C@]2(C(=O)OC)C=3C(=CC4=C([C@]56[C@H]([C@@]([C@H](OC(C)=O)[C@]7(CC)C=CCN([C@H]67)CC5)(O)C(=O)OC)N4C=O)C=3)OC)C[C@@](C1)(O)CC)CC1=C2NC2=CC=CC=C12 OGWKCGZFUXNPDA-XQKSVPLYSA-N 0.000 description 1
- OGWKCGZFUXNPDA-UHFFFAOYSA-N vincristine Natural products C1C(CC)(O)CC(CC2(C(=O)OC)C=3C(=CC4=C(C56C(C(C(OC(C)=O)C7(CC)C=CCN(C67)CC5)(O)C(=O)OC)N4C=O)C=3)OC)CN1CCC1=C2NC2=CC=CC=C12 OGWKCGZFUXNPDA-UHFFFAOYSA-N 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 235000012431 wafers Nutrition 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- QAOHCFGKCWTBGC-QHOAOGIMSA-N wybutosine Chemical compound C1=NC=2C(=O)N3C(CC[C@H](NC(=O)OC)C(=O)OC)=C(C)N=C3N(C)C=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O QAOHCFGKCWTBGC-QHOAOGIMSA-N 0.000 description 1
- QAOHCFGKCWTBGC-UHFFFAOYSA-N wybutosine Natural products C1=NC=2C(=O)N3C(CCC(NC(=O)OC)C(=O)OC)=C(C)N=C3N(C)C=2N1C1OC(CO)C(O)C1O QAOHCFGKCWTBGC-UHFFFAOYSA-N 0.000 description 1
- 238000001086 yeast two-hybrid system Methods 0.000 description 1
- 239000011701 zinc Substances 0.000 description 1
- 229910052725 zinc Inorganic materials 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/46—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
- C07K14/47—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P3/00—Drugs for disorders of the metabolism
- A61P3/08—Drugs for disorders of the metabolism for glucose homeostasis
- A61P3/10—Drugs for disorders of the metabolism for glucose homeostasis for hyperglycaemia, e.g. antidiabetics
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
Definitions
- the invention relates to discovery of thirteen isolated polynucleotides and their encoded proteins that are highly co-expressed with genes known to be involved in insulin synthesis and useful for diagnosis, prognosis, and treatment of pancreatic disorders.
- Insulin is a hormone produced in the beta islet cells of the pancreas.
- Patients with diabetes have serum glucose levels that are chronically elevated above normal because they either produce insufficient insulin (type I diabetes) or are resistant to insulin (type II diabetes).
- Complications of diabetes include angina, hypertension, myocardial infarctions, peripheral vascular disease, diabetic retinopathy, diabetic nephropathy, diabetic necrosis, ulceration, and diabetic neuropathy (Davidson (1998) Diabetes Mellitus, W B Saunders, Philadelphia Pa.).
- the present invention satisfies a need in the art by providing new compositions that are useful for diagnosis, prognosis, treatment, and evaluation of therapies for pancreatic disorders, especially diabetes.
- a method for analyzing gene expression patterns has been used to identity thirteen polynucleotides that have highly significant co-expression with genes known to be involved with insulin-synthesis.
- the invention provides a composition comprising a plurality of polynucleotides having the nucleic acid sequences of SEQ ID NOs: 1-13 or the complements thereof that are highly significantly co-expressed with genes such as insulin, glucagon, lipase, colipase, human islet amyloid polypeptide (HiAPP) and Reg-1 alpha, Reg-1 beta, and Reg-related regenerating genes (Reg), known to involved in insulin synthesis.
- the invention also provides an isolated polynucleotide comprising a nucleic acid sequence selected from SEQ ID NOs: 1-13 or the complement thereof.
- the polynucleotide is used as a surrogate marker, as a probe, in an expression vector, and in the diagnosis, prognosis, evaluation of therapies and treatment of pancreatic disorders.
- the invention further provides a composition comprising a polynucleotide and a labeling moiety.
- the invention provides a method for using a composition or a polynucleotide of the invention to screen a plurality of molecules and compounds to identify ligands which specifically bind to the composition or the polynucleotide.
- the molecules are selected from DNA molecules, RNA molecules, peptide nucleic acids, peptides, mimetics, ribozymes, transcription factors, enhancers, and repressors.
- the invention also provides a method of using a composition or a polynucleotide to purify a ligand.
- the invention provides a method for using a composition or an isolated polynucleotide to detect gene expression in a sample by hybridizing the composition or polynucleotide to nucleic acids of the sample under conditions for formation of one or more hybridization complexes and detecting hybridization complex formation, wherein complex formation indicates gene expression in the sample.
- the composition or polynucleotide is attached to a substrate.
- the nucleic acids of the sample are amplified prior to hybridization.
- complex formation is compared with at least one standard and indicates the presence of a pancreatic disorder.
- the invention provides a method of using a protein to make an antibody that specifically binds to the protein of the invention, and methods for using the antibody to diagnose or treat a pancreatic disorder.
- the invention also provides a composition comprising a polynucleotide, a protein, or an antibody that specifically binds a protein and a pharmaceutical carrier.
- Sequence Listing provides exemplary polynucleotides comprising the nucleic acid sequences of SEQ ID NOs:1-13 some of which encode the proteins comprising the amino acid sequences of SEQ ID NOs:14 and 15. Each sequence is identified by a sequence identification number (SEQ ID NO) and by the Incyte clone number with which the sequence was first identified.
- Markers for pancreatic disorders refers to polynucleotides, proteins, and antibodies which are useful in the diagnosis, prognosis, evaluation of therapies and treatment of pancreatic disorders. Typically, this means that the marker gene is differentially expressed in samples from subjects predisposed to, manifesting, or diagnosed with a pancreatic disorder.
- “Differential expression” refers to an increased or up-regulated or a decreased or down-regulated expression as detected by presence, absence or at least about a two-fold change in the amount of transcribed messenger RNA or protein in a sample.
- Pantix disorders specifically include, but are not limited to, the following conditions, diseases, and disorders: type I and type II diabetes; complications of diabetes including angina, hypertension, myocardial infarctions, peripheral vascular disease, diabetic retinopathy, diabetic nephropathy, diabetic necrosis, ulceration, and diabetic neuropathy; islet cell hyperplasia; pancreatitis; and pancreatic tumor.
- isolated or purified refers to a polynucleotide or protein that is removed from its natural environment and that is separated from other components with which it is naturally present.
- Genes known to be highly expressed in insulin synthesis pathways which were used in the co-expression analysis included insulin, glucagon, lipase, colipase, human islet amyloid polypeptide (HiAPP) and Reg-1 alpha, Reg-1 beta, and Reg-related regenerating genes (Reg).
- Polynucleotide refers to an isolated cDNA. It can be of genomic or synthetic origin, double-stranded or single-stranded, and combined with vitamins, minerals, carbohydrates, lipids, proteins, or other nucleic acids to perform a particular activity or form a useful composition.
- Protein refers to a purified polypeptide whether naturally occurring or synthetic.
- sample is used in its broadest sense.
- a sample containing nucleic acids can comprise a bodily fluid; an extract from a cell; a chromosome, organelle, or membrane isolated from a cell; genomic DNA, RNA, or cDNA in solution or bound to a substrate; a cell; a tissue; a tissue print; and the like.
- Substrate refers to any rigid or semi-rigid support to which polynucleotides or proteins are bound and includes membranes, filters, chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, capillaries or other tubing, plates, polymers, and microparticles with a variety of surface forms including wells, trenches, pins, channels and pores.
- a “transcript image” is a profile of gene transcription activity in a particular tissue at a particular time.
- a “variant” refers to a polynucleotide or protein whose sequence diverges from about 5% to about 30% from the nucleic acid or amino acid sequences of the Sequence Listing.
- the present invention employed “guilt by association or GBA”, a method for using marker genes known to be associated with a particular condition, disease or disorder to identify surrogate markers, polynucleotides and their encoded proteins, that are similarly associated or co-expressed in the same condition, disease, or disorder (Walker and Volkmuth (1999) Prediction of gene function by genome-scale expression analysis: prostate-associated genes. Genome Res 9:1198-1203, incorporated herein by reference).
- the method identifies cDNAs cloned from mRNA transcripts which are active in tissues known to have been removed from subjects with pancreatic disorders.
- the polynucleotides, their encoded proteins and antibodies which specifically bind to the encoded proteins are useful for diagnosis, prognosis, evaluation of therapies, and treatment of pancreatic disorders.
- Guilt by association provides for the identification of polynucleotides that are expressed in a plurality of libraries.
- the polynucleotides represent genes of unknown function which are expressed in a specific signaling pathway, disease process, subcellular compartment, cell type, tissue, or species.
- the expression patterns of the genes known to be highly expressed during insulin synthesis; insulin, glucagon, lipase, colipase, HiAPP, and Reg; are compared with those of polynucleotides with unknown function to determine whether a specified co-expression probability threshold is met. Through this comparison, a subset of the polynucleotides having a high co-expression probability with the known marker genes can be identified.
- the polynucleotides originate from human cDNA libraries. These polynucleotides can also be selected from a variety of sequence types including, but not limited to, expressed sequence tags (ESTs), assembled polynucleotides, full length coding regions, and 3′ untranslated regions. To be considered in GBA or co-expression analysis, the polynucleotides had to have been expressed in at least five cDNA libraries. In this application, GBA was applied to a total of 41,419 assembled polynucleotide bins that met the criteria of having been expressed in at least five libraries.
- ESTs expressed sequence tags
- assembled polynucleotides full length coding regions
- 3′ untranslated regions To be considered in GBA or co-expression analysis, the polynucleotides had to have been expressed in at least five cDNA libraries. In this application, GBA was applied to a total of 41,419 assembled polynucleotide bins that met the criteria of having been expressed in at least five libraries.
- the polynucleotides are assembled from related sequences, such as sequence fragments derived from a single transcript. Assembly of the polynucleotide can be performed using sequences of various types including, but not limited to, ESTs, extension of the EST, shotgun sequences from a cloned insert, or full length cDNAs. In a most preferred embodiment, the polynucleotides are derived from human sequences that have been assembled using the algorithm disclosed in U.S. Ser. No. 9,276,534, filed Mar. 25, 1999, and used in U.S. Ser. No. 09/226,994, filed Jan. 7, 1999, both incorporated herein by reference.
- differential expression of the polynucleotides can be evaluated by methods including, but not limited to, differential display by spatial immobilization or by gel electrophoresis, genome mismatch scanning, representational difference analysis, and transcript imaging.
- the results of transcript imaging for SEQ ID NO:2 are shown in Example IX .
- Differential expression of SEQ ID NO:2 is highly specifically correlated with type I diabetes.
- the transcript image provided direct confirmation of the strength of co-expression analysis—the use of known genes to identify unknown polynucleotides and their encoded proteins which are highly significantly associated with insulin synthesis and pancreatic disorders. Additionally, differential expression can be assessed by microarray technology. These methods can be used alone or in combination.
- the procedure for identifying novel polynucleotides that exhibit a statistically significant co-expression pattern with known genes is as follows. First, the presence or absence of a polynucleotide in a cDNA library is defined: a polynucleotide is present in a cDNA library when at least one cDNA fragment corresponding to the polynucleotide is detected in a cDNA from that library, and a polynucleotide is absent from a library when no corresponding cDNA fragment is detected.
- the significance of co-expression is evaluated using a probability method to measure a due-to-chance probability of the co-expression.
- the probability method can be the Fisher exact test, the chi-squared test, or the kappa test. These tests and examples of their applications are well known in the art and can be found in standard statistics texts (Agresti (1990) Categorical Data Analysis , John Wiley & Sons, New York N.Y.; Rice (1988) Mathematical Statistics and Data Analysis , Duxbury Press, Pacific Grove Calif.).
- a Bonferroni correction (Rice, supra, p. 384) can also be applied in combination with one of the probability methods for correcting statistical results of one polynucleotide versus multiple other polynucleotides.
- the due-to-chance probability is measured by a Fisher exact test, and the threshold of the due-to-chance probability is set preferably to less than 0.001, more preferably to less than 0.00001.
- occurrence data vectors can be generated as illustrated in the table below. The presence of a gene occurring at least once in a library is indicated by a one, and its absence from the library, by a zero.
- Library 1 Library 2 Library 3 . . . Library N Gene A 1 1 0 . . . 0 Gene B 1 0 1 . . . 0
- the second table summarizes and presents: 1) the number of times gene A and B are both present in a library; 2) the number of times gene A and B are both absent in a library; 3) the number of times gene A is present, and gene B is absent; and 4) the number of times gene B is present, and gene A is absent.
- the upper left entry is the number of times the two genes co-occur in a library, and the middle right entry is the number of times neither gene occurs in a library.
- the off diagonal entries are the number of times one gene occurs, and the other does not.
- Both A and B are present eight times and absent 18 times. Gene A is present, and gene B is absent, two times; and gene B is present, and gene A is absent, two times.
- the probability (“p-value”) that the above association occurs due to chance as calculated using a Fisher exact test is 0.0003.
- This method of estimating the probability for co-expression makes several assumptions. The method assumes that the libraries are independent and are identically sampled. However, in practical situations, the selected cDNA libraries are not entirely independent, because more than one library can be obtained from a single subject or tissue. Nor are they entirely identically sampled, because different numbers of cDNAs can have been sequenced from each library. The number of cDNAs sequenced typically ranges from 5,000 to 10,000 cDNAs per library. After the Fisher exact co-expression probability is calculated for each polynucleotide versus all other assembled polynucleotides that occur, a Bonferroni correction for multiple statistical tests is applied.
- polynucleotides SEQ ID NOs: 1-13 and their encoded proteins, SEQ ID NOs: 14 and 15, that exhibit highly significant co-expression probability with known marker genes for pancreatic disorders.
- the results presented in Example VI show the direct (known gene to unknown polynucleotide) or indirect (known gene to unknown polynucleotide to a second unknown polynucleotide) associations among the novel polynucleotides and the known marker genes for pancreatic disorders. Therefore, by these associations, the novel polynucleotides are useful as surrogate markers for the co-expressed known marker genes in diagnosis, prognosis, evaluation of therapies and treatment of pancreatic disorders. Further, the proteins or peptides expressed from the novel polynucleotides are either potential therapeutics or targets for the identification and/or development of therapeutics.
- the present invention encompasses a composition comprising a plurality of polynucleotides having the nucleic acid sequences of SEQ ID NOs:1-13 or the complements thereof. These thirteen polynucleotides are shown by the method of the present invention to have significant co-expression with known genes associated with pancreatic disorders.
- the invention also provides a polynucleotide, its complement, a probe comprising the polynucleotide or the complement thereof selected from SEQ ID NOs: 1-13 and variants thereof.
- the polynucleotide can be used to search against the GenBank primate (pri), rodent (rod), mammalian (mam), vertebrate (vrtp), and eukaryote (eukp) databases; the encoded protein, against GenPept, SwissProt, BLOCKS (Bairoch et al. (1997) Nucleic Acids Res 25:217-221), PFAM, and other databases that contain previously identified and annotated protein sequences, motifs, and gene functions. Methods that search for primary sequence patterns with secondary structure gap penalties (Smith et al.
- polynucleotides that are capable of hybridizing to SEQ ID NOs:1-13 and the complements thereof under highly stringent conditions.
- Stringency can be defined by salt concentration, temperature, and other chemicals and conditions well known in the art. Conditions can be selected, for example, by varying the concentrations of salt in the prehybridization, hybridization, and wash solutions or by varying the hybridization and wash temperatures. With some substrates, the temperature can be decreased by adding a solvent such as formamide to the prehybridization and hybridization solutions.
- Hybridization can be performed at low stringency, with buffers such as 5 ⁇ SSC (saline sodium citrate) with 1% sodium dodecyl sulfate (SDS) at 60 C, which permits complex formation between two nucleic acid sequences that contain some mismatches. Subsequent washes are performed at higher stringency with buffers such as 0.2 ⁇ SSC with 0.1% SDS at either 45 C (medium stringency) or 68 C (high stringency), to maintain hybridization of only those complexes that contain completely complementary sequences. Background signals can be reduced by the use of detergents such as SDS, sarcosyl, or TRITON X-100 (Sigma-Aldrich, St.
- a polynucleotide can be extended utilizing primers and employing various PCR-based methods known in the art to detect upstream sequences such as promoters and other regulatory elements.
- PCR-based methods known in the art to detect upstream sequences such as promoters and other regulatory elements.
- Upstream sequences such as promoters and other regulatory elements.
- kits such as XL-PCR (Applied Biosystems, Foster City Calif.), cDNA libraries (Life Technologies, Rockville Md.) or genomic libraries (Clontech, Palo Alto Calif.) and nested primers can be used to extend the sequence.
- primers can be designed using commercially available software (LASERGENE software, DNASTAR, Madison Wis.) or another program, to be about 15 to 30 nucleotides in length, to have a GC content of about 50%, and to form a hybridization complex at temperatures of about 68 C to 72 C.
- the polynucleotide in another aspect of the invention, can be cloned into a recombinant vector that directs the expression of the protein, or structural or functional portions thereof, in host cells. Due to the inherent degeneracy of the genetic code, other DNA sequences which encode functionally equivalent amino acid sequence can be produced and used to express the protein encoded by the polynucleotide.
- the nucleotide sequences of the present invention can be engineered using methods generally known in the art in order to alter the nucleotide sequences for a variety of purposes including, but not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA shuffling by random fragmentation, as described in U.S. Pat. No.
- oligonucleotide-mediated site-directed mutagenesis can be used to introduce mutations that create new restriction sites, alter glycosylation patterns, change codon preference, produce splice variants, and so forth.
- the polynucleotide or derivatives thereof can be inserted into an expression vector with elements for transcriptional and translational control of the inserted coding sequence in a particular host.
- elements include regulatory sequences, such as enhancers, constitutive and inducible promoters, and 5′ and 3′ untranslated regions.
- Methods which are well known to those skilled in the art can be used to construct such expression vectors. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in vivo genetic recombination (Ausubel, supra, unit 16).
- a variety of expression vector/host cell systems can be utilized to express the polynucleotide. These include, but are not limited to, microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid, or cosmid expression vectors; yeast transformed with yeast expression vectors; insect cell systems infected with baculovirus vectors; plant cell systems transformed with viral or bacterial expression vectors; or animal cell systems. For long term production of recombinant proteins in mammalian systems, stable expression in cell lines is preferred.
- the polynucleotide can be transformed into cell lines using expression vectors which can contain viral origins of replication and/or endogenous expression elements and a selectable or visible marker gene on the same or on a separate vector.
- expression vectors which can contain viral origins of replication and/or endogenous expression elements and a selectable or visible marker gene on the same or on a separate vector.
- the invention is not to be limited by the vector or host cell employed.
- host cells that contain the polynucleotide and that express the protein can be identified by a variety of procedures known to those of skill in the art. These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or chip-based technologies for the detection and/or quantification of nucleic acid or amino acid sequences. Immunological methods for detecting and measuring the expression of the protein using either specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence activated cell sorting (FACS).
- ELISAs enzyme-linked immunosorbent assays
- RIAs radioimmunoassays
- FACS fluorescence activated cell sorting
- Host cells transformed with the polynucleotide can be cultured under conditions for the expression and recovery of the protein from cell culture.
- the protein produced by a transgenic cell can be secreted or retained intracellularly depending on the sequence and/or the vector used.
- expression vectors containing the polynucleotide can be designed to contain signal sequences which direct secretion of the protein through a prokaryotic cell wall or eukaryotic cell membrane.
- a host cell strain can be chosen for its ability to modulate expression of the inserted sequences or to process the expressed protein in the desired fashion.
- modifications of the protein include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipidation, and acylation.
- Post-translational processing which cleaves a “prepro” form of the protein can also be used to specify protein targeting, folding, and/or activity.
- Different host cells which have specific cellular machinery and characteristic mechanisms for post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and W138) are available from the ATCC (Manassas VA) and can be chosen to ensure the correct modification and processing of the expressed protein.
- natural, modified, or recombinant polynucleotides are ligated to a heterologous sequence resulting in translation of a fusion protein containing heterologous protein moieties in any of the aforementioned host systems.
- heterologous protein moieties facilitate purification of fusion proteins using commercially available affinity matrices.
- moieties include, but are not limited to, glutathione S-transferase, maltose binding protein, thioredoxin, calmodulin binding peptide, 6-His, FLAG, c-myc, hemaglutinin, and monoclonal antibody epitopes.
- the polynucleotides are synthesized using chemical or enzymatic methods well known in the art (Caruthers et al. (1980) Nucl Acids Symp Ser (7) 215-233; Ausubel, supra, units 10.4 and 10.16). Peptide synthesis can be performed using various solid-phase techniques (Roberge et al. (1995) Science 269:202-204), and machines such as the ABI 431A peptide synthesizer (Applied Biosystems) can be used to automate synthesis. If desired, the amino acid sequence can be altered during synthesis to produce a more stable variant for therapeutic use.
- the polynucleotides can be used as surrogate markers in diagnosis, prognosis, evaluation of therapies and treatment of pancreatic disorders including, but not limited to, type I and type II diabetes; complications of diabetes including angina, hypertension, myocardial infarctions, peripheral vascular disease, diabetic retinopathy, diabetic nephropathy, diabetic necrosis, ulceration, and diabetic neuropathy; islet cell hyperplasia; pancreatitis; and pancreatic tumor.
- the polynucleotide can be used to screen a plurality or library of molecules and compounds for specific binding affinity.
- the assay can be used to screen DNA molecules, RNA molecules, peptide nucleic acids, peptides, mimetics, ribozymes, or proteins including transcription factors, enhancers, repressors, and the like which regulate the activity of the polynucleotide in the biological system.
- the assay involves providing a plurality of molecules and compounds, combining a polynucleotide or a composition of the invention with the plurality of molecules and compounds under conditions to allow specific binding, and detecting specific binding to identify at least one molecule or compound which specifically binds at least one polynucleotides of the invention.
- the proteins, or portions thereof can be used to screen a plurality or library of molecules or compounds in any of a variety of screening assays to identify a ligand.
- the protein employed in such screening can be free in solution, affixed to an abiotic substrate or expressed on the external, or a particular internal surface, of a bacterial, or other, cell. Specific binding between the protein and the ligand can be measured.
- the assay can be used to screen aptamers, DNA molecules, RNA molecules, peptide nucleic acids, peptides, mimetics, ribozymes, proteins, antibodies, agonists, antagonists, immunoglobulins, inhibitors, pharmaceutical agents or drug compounds and the like, which specifically bind the protein.
- One method for high throughput screening using very small assay volumes and very small amounts of test compound is described in Burbaum et al. U.S. Pat. No. 5,876,946, incorporated herein by reference, which screens large numbers of molecules for enzyme inhibition or receptor binding.
- the polynucleotides are used for diagnostic purposes to determine the differential expression of a gene in a sample.
- the polynucleotide consists of complementary RNA and DNA molecules, branched nucleic acids, and/or PNAs.
- the polynucleotides are used to detect and quantify gene expression in biopsied samples in which differential expression of the polynucleotide indicates the presence of a disorder.
- the polynucleotide can be used to detect genetic polymorphisms associated with a disease or disorder. In a preferred embodiment, these polymorphisms are detected in an mRNA transcribed from an endogenous gene.
- the polynucleotide is used as a probe. Specificity of the probe is determined by whether it is made from a unique region, a regulatory region, or from a region encoding a conserved motif. Both probe specificity and the stringency of the diagnostic hybridization or amplification will determine whether the probe identifies only naturally occurring, exactly complementary sequences, allelic variants, or related sequences. Probes designed to detect related sequences should preferably have at least 50% sequence identity to at least a fragment of a polynucleotide of the invention.
- Methods for producing hybridization probes include the cloning of nucleic acid sequences into vectors for the production of RNA probes.
- Such vectors are known in the art, are commercially available, and can be used to synthesize RNA probes in vitro by adding RNA polymerases and labeled nucleotides.
- Probes can incorporate nucleotides labeled by a variety of reporter groups including, but not limited to, radionuclides such as 32 P or 35S, enzymatic labels such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, fluorescent labels such as Cy3 and Cy5, and the like.
- the labeled polynucleotides can be used in Southern or northern analysis, dot blot, or other membrane-based technologies, on chips or other substrates, and in PCR technologies. Hybridization probes are also useful in mapping the naturally occurring genomic sequence. Fluorescent in situ hybridization (FISH) can be correlated with other physical chromosome mapping techniques and genetic map data as described in Heinz-Ulrich et al. (In: Meyers, supra, pp. 965-968). In many cases, genomic context helps identify genes that encode a particular protein family. (See, e.g., Kirschning et al. (1997) Genomics 46:416-25.).
- FISH Fluorescent in situ hybridization
- the polynucleotide can be labeled using standard methods and added to a sample from a subject under conditions for the formation and detection of hybridization complexes. After incubation the sample is washed, and the signal associated with complex formation is quantitated and compared with at least one standard value. Standard values are derived from any control sample, typically one that is free of the suspect disorder and from one that represents a single, specific and preferably, staged disorder. If the amount of signal in the subject sample is distinguishable from the standards, then differential expression in the subject sample indicates the presence of the disorder. Qualitative and quantitative methods for comparing complex formation in subject samples with previously established standards are well known in the art.
- Such assays can also be used to evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the treatment of an individual subject. Once the presence of the disorder has been established and a treatment protocol is initiated, hybridization, amplification, or antibody assays can be repeated on a regular basis to determine when gene or protein expression in the patient begins to approximate that which is observed in a healthy subject. The results obtained from successive assays can be used to show the efficacy of treatment over a period ranging from several hours, e.g. in the case of toxic shock, to many years, e.g. in the case of osteoarthritis.
- the polynucleotides can be used on a substrate such as a microarray to monitor gene expression, to identify splice variants, mutations, and polymorphisms. Information derived from analyses of expression patterns can be used to determine gene function, to understand the genetic basis of a disease, to diagnose a disorder, and to develop and monitor the activities of therapeutic agents used to treat a disorder. Microarrays can also be used to detect genetic diversity, single nucleotide polymorphisms, which may characterize a particular population, at the genomic level.
- antibodies of the present invention can be used for treatment, delivery of therapeutics, or monitoring therapy for pancreatic disorders.
- the polynucleotide, or its complement can be used therapeutically for the purpose of expressing mRNA and protein, or conversely to block transcription or translation of the mRNA.
- Expression vectors can be constructed using elements from retroviruses, adenoviruses, herpes or vaccinia viruses, or bacterial plasmids, and the like. These vectors can be used for delivery of nucleotide sequences to a particular target cell population, tissue, or organ. Methods well known to those skilled in the art can be used to construct vectors to express the polynucleotides or their complements. (See, e.g., Maulik et al.
- the polynucleotide or its complement can be used for somatic cell or stem cell gene therapy.
- Vectors can be introduced in vivo, in vitro, and ex vivo.
- vectors are introduced into stem cells taken from the subject, and the resulting transgenic cells are clonally propagated for autologous transplant back into that same subject.
- Delivery of the polynucleotide by transfection, liposome injections, or polycationic amino polymers can be achieved using methods which are well known in the art. (See, e.g., Goldman et al.
- endogenous gene expression can be inactivated using homologous recombination methods which insert an inactive gene sequence into the coding region or other targeted region of the genome. (See, e.g. Thomas et al. (1987) Cell 51: 503-512.).
- Vectors containing the polynucleotide can be transformed into a cell or tissue to express a missing protein or to replace a nonfunctional protein.
- a vector constructed to express the complement of the polynucleotide can be transformed into a cell to down-regulate protein expression.
- Complementary or antisense sequences can consist of an oligonucleotide derived from the transcription initiation site; nucleotides between about positions ⁇ 10 and +10 from the ATG are preferred.
- inhibition can be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it causes inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, transcription factors, or regulatory molecules.
- Ribozymes enzymatic RNA molecules
- Ribozymes can also be used to catalyze the cleavage of mRNA and decrease the levels of particular mRNAs, such as those comprising the polynucleotides of the invention.
- Ribozymes can cleave mRNA at specific cleavage sites.
- ribozymes can cleave mRNAs at locations dictated by flanking regions that form complementary base pairs with the target mRNA. The construction and production of ribozymes is well known in the art and is described in Meyers (supra).
- RNA molecules can be modified to increase intracellular stability and half-life. Possible modifications include, but are not limited to, the addition of flanking sequences at the 5′ and/or 3′ ends of the molecule, or the use of phosphorothioate or 2′ O-methyl rather than phosphodiester linkages within the backbone of the molecule.
- nontraditional bases such as inosine, queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases, can be included.
- an antagonist, or an antibody that binds specifically to the protein can be administered to a subject to treat a pancreatic disorder.
- the antagonist, antibody, or fragment can be used directly to inhibit the activity of the protein or indirectly to deliver a therapeutic agent to cells or tissues which express the protein.
- the therapeutic agent can be a cytotoxic agent selected from a group including, but not limited to, abrin, ricin, doxorubicin, daunorubicin, taxol, ethidium bromide, mitomycin, etoposide, tenoposide, vincristine, vinblastine, colchicine, dihydroxy anthracin dione, actinomycin D, diphteria toxin, Pseudomonas exotoxin A and 40, radioisotopes, and glucocorticoid.
- a cytotoxic agent selected from a group including, but not limited to, abrin, ricin, doxorubicin, daunorubicin, taxol, ethidium bromide, mitomycin, etoposide, tenoposide, vincristine, vinblastine, colchicine, dihydroxy anthracin dione, actinomycin D, diphteria toxin, Pseudom
- Antibodies to the protein can be generated using methods that are well known in the art. Such antibodies can include, but are not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and fragments produced by a Fab expression library. Neutralizing antibodies, such as those which inhibit dimer formation, are especially preferred for therapeutic use. Monoclonal antibodies to the protein can be prepared using any technique which provides for the production of antibody molecules by continuous cell lines in culture. These include, but are not limited to, the hybridoma, the human B-cell hybridoma, and the EBV-hybridoma techniques. In addition, techniques developed for the production of chimeric antibodies can be used.
- an agonist of the protein can be administered to a subject to treat a disorder associated with decreased expression, longevity or activity of the protein.
- An additional aspect of the invention relates to the administration of a pharmaceutical or sterile composition, in conjunction with a pharmaceutically acceptable carrier, for any of the therapeutic applications discussed above.
- Such pharmaceutical compositions can consist of the protein or antibodies, mimetics, agonists, antagonists, or inhibitors of the protein.
- the compositions can be administered alone or in combination with at least one other agent, such as a stabilizing compound, which can be administered in any sterile, biocompatible pharmaceutical carrier including, but not limited to, saline, buffered saline, dextrose, and water.
- the compositions can be administered to a subject alone or in combination with other agents, drugs, or hormones.
- compositions utilized in this invention can be administered by any number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, sublingual, or rectal means.
- these pharmaceutical compositions can contain pharmaceutically-acceptable carriers comprising excipients and auxiliaries which facilitate processing of the active compounds into preparations which can be used pharmaceutically. Further details on techniques for formulation and administration can be found in the latest edition of Remington's Pharmaceutical Sciences (Mack Publishing, Easton Pa.).
- the therapeutically effective dose can be estimated initially either in cell culture assays or in animal models such as mice, rats, rabbits, dogs, or pigs.
- animal models such as mice, rats, rabbits, dogs, or pigs.
- An animal model can also be used to determine the concentration range and route of administration. Such information can then be used to determine useful doses and routes for administration in humans.
- a therapeutically effective dose refers to that amount of active ingredient which ameliorates the symptoms or condition.
- Therapeutic efficacy and toxicity can be determined by standard pharmaceutical procedures in cell cultures or with experimental animals, such as by calculating and contrasting the ED 50 (the dose therapeutically effective in 50% of the population) and LD 50 (the dose lethal to 50% of the population) statistics. Any of the therapeutic compositions described above can be applied to any subject in need of such therapy, including, but not limited to, mammals such as dogs, cats, cows, horses, rabbits, monkeys, and most preferably, humans.
- SEQ ID NOs:1-13 can be useful in the differentiation of stem cells.
- Eukaryotic stem cells are able to differentiate into the multiple cell types of various tissues and organs and to play roles in embryogenesis and adult tissue regeneration (Gearhart (1998) Science 282:1061-1062; Watt and Hogan (2000) Science 287:1427-1430).
- stem cells can be totipotent with the potential to create every cell type in an organism and to generate a new organism, pluripotent with the potential to give rise to most cell types and tissues, but not a whole organism; or multipotent cells with the potential to differentiate into a limited number of cell types.
- Stem cells can be transfected with polynucleotides which can be transiently expressed or can be integrated within the cell as transgenes.
- Embryonic stem (ES) cell lines are derived from the inner cell masses of human blastocysts and are pluripotent (Thomson et al. (1998) Science 282:1145-1147). They have normal karyotypes and express high levels of telomerase which prevent senescence and allow the cells to replicate indefinitely. ES cells produce derivatives that give rise to embryonic epidermal, mesodermal and endodermal cells. Embryonic germ (EG) cell lines, which are produced from primordial germ cells isolated from gonadal ridges and mesenteries, also show stem cell behavior (Shamblott et al. (1998) Proc Natl Acad Sci 95:13726-13731). EG cells have normal karyotypes and appear to be pluripotent.
- Organ-specific adult stem cells differentiate into the cell types of the tissues from which they were isolated. They maintain their original tissues by replacing cells destroyed from disease or injury.
- Adult stem cells are multipotent and under proper stimulation can be used to generate cell types of various other tissues (Vogel (2000) Science 287:1418-1419).
- Hematopoietic stem cells from bone marrow provide not only blood and immune cells, but can also be induced to transdifferentiate to form brain, liver, heart, skeletal muscle and smooth muscle cells.
- mesenchymal stem cells can be used to produce bone marrow, cartilage, muscle cells, and some neuron-like cells, and stem cells from muscle have the ability to differentiate into muscle and blood cells (Jackson et al.
- Neural stem cells which produce neurons and glia, can also be induced to differentiate into heart, muscle, liver, intestine, and blood cells (Kuhn and Svendsen (1999) BioEssays 21:625-630); Clarke et al. (2000) Science 288:1660-1663; Gage (2000) Science 287:1433-1438; and Galli et al. (2000) Nature Neurosci 3:986-991).
- Neural stem cells can be used to treat neurological disorders such as Alzheimer's disease, Parkinson's disease, and multiple sclerosis and to repair tissue damaged by strokes and spinal cord injuries.
- Hematopoietic stem cells can be used to restore immune function in immunodeficient patients or to treat autoimmune disorders by replacing autoreactive immune cells with normal cells to treat diseases such as multiple sclerosis, scleroderma, rheumatoid arthritis, and systemic lupus erythematosus.
- Mesenchymal stem cells can be used to repair tendons or to regenerate cartilage to treat arthritis.
- Liver stem cells can be used to repair liver damage.
- Pancreatic stem cells can be used to replace islet cells to treat diabetes.
- Muscle stem cells can be used to regenerate muscle to treat muscular dystrophies (Fontes and Thomson (1999) BMJ 319:1-3; Weissman (2000) Science 287:1442-1446 Marshall (2000) Science 287:1419-1421; and Marmont (2000) Ann Rev Med 51:115-134).
- the cDNA library, PANCNOT05 was selected as an example to demonstrate the construction of the cDNA libraries from which the sequences used to identify genes associated with pancreatic disorders were derived.
- the PANCNOT05 cDNA library was constructed from cytologically normal pancreas tissue obtained from a 2-year-old Hispanic male who died of cerebral anoxia.
- the frozen tissue was homogenized and lysed using a POLYTRON homogenizer (Brinkmann Instruments, Westbury N.J.) in guanidinium isothiocyanate solution.
- the lysate was centrifuged over a 5.7 M CsCl cushion using an SW28 rotor in an L8-70M ultracentrifuge (BeckmanCoulter, Fullerton Calif.) for 18 hours at 25,000 rpm at ambient temperature.
- the RNA was extracted with acid phenol, pH 4.0, precipitated using 0.3 sodium acetate and 2.5 volumes of ethanol, resuspended in RNAse-free water, and DNAse treated at 37 C. RNA extraction and precipitation were repeated as before.
- the mRNA was isolated using the OLIGOTEX kit (Qiagen, Chatsworth Calif.) and used to construct the cDNA library.
- mRNA was handled according to the recommended protocols in the SUPERSCRIPT plasmid system (Life Technologies). cDNAs were fractionated on a SEPHAROSE CL4B column (Amersham Pharmacia Biotech), and those cDNAs exceeding 400 bp were ligated into pSport I plasmid. The plasmid was subsequently transformed into DH5a competent cells (Life Technologies).
- Plasmid DNA was released from the bacterial cells and purified using the REAL PREP 96 plasmid kit (Qiagen). This kit enabled the simultaneous purification of 96 samples in a 96-well block using multi-channel reagent dispensers.
- the recommended protocol was employed except for the following changes: 1) the bacteria were cultured in 1 ml of sterile TERRIFIC BROTH (BD Biosciences, San Jose Calif.) with carbenicillin at 25 mg/L and glycerol at 0.4%; 2) the cultures were incubated for 19 hours after inoculation and the cells were lysed in 0.3 ml of lysis buffer; and 3) the plasmid DNA pellet was precipitated in isopropanol and then resuspended in 0.1 ml of distilled water. After the last step in the protocol, samples were transferred to a 96-well block for storage at 4 C.
- the cDNAs were prepared using a MICROLAB 2200 system (Hamilton, Reno Nev.) in combination with DNA ENGINE thermal cyclers (MJ Research, Watertown Mass.). The cDNAs were sequenced by the method of Sanger and Coulson (1975; J Mol Biol 94:441-448) using ABI PRISM 377 DNA sequencing systems (Applied Biosystems). Most of the cDNAs were sequenced using standard ABI protocols and kits at solution volumes of 0.25 ⁇ -1.0 ⁇ . In the alternative, some of the cDNAs were sequenced using solutions and dyes from APB.
- polynucleotides used for co-expression analysis were assembled from EST sequences, 5′ and 3′ long read sequences, and full length coding sequences. Of the 41,419 assembled sequences used in the analysis, each was expressed in at least five cDNA libraries.
- Bins were annotated by screening the consensus sequence in each bin against public databases, such as GBpri and GenPept from NCBI.
- the annotation process involved a FASTn screen against the GBpri database in GenBank. Those hits with a percent identity of greater than or equal to 75% and an alignment length of greater than or equal to 100 base pairs were recorded as homolog hits.
- the residual unannotated sequences were screened by FASTx against GenPept. Those hits with an E value of less than or equal to 10 ⁇ 8 were recorded as homolog hits.
- Sequences were then reclustered using BLASTn and Cross-Match, a program for rapid amino acid and nucleic acid sequence comparison and database search (Green, supra), sequentially. Any BLAST alignment between a sequence and a consensus sequence with a score greater than 150 was realigned using cross-match. The sequence was added to the bin whose consensus sequence gave the highest Smith-Waterman score (Smith et al. (1992) Protein Engineering 5:35-51) amongst local alignments with at least 82% identity. Non-matching sequences were moved into new bins, and assembly processes were repeated.
- polynucleotides of the Sequence Listing or their encoded proteins were used to query databases such as GenBank, SwissProt, BLOCKS, and the like. These databases that contain previously identified and annotated sequences or domains were searched using BLAST or BLAST 2 (Altschul et al. supra; Altschul, supra) to produce alignments and to determine which sequences were exact matches or homologs. The alignments were to sequences of prokaryotic (bacterial) or eukaryotic (animal, fungal, or plant) origin. Alternatively, algorithms such as the one described in Smith and Smith (1992, Protein Engineering 5:35-51) could have been used to deal with primary sequence patterns and secondary structure gap penalties. All of the sequences disclosed in this application have lengths of at least 49 nucleotides, and no more than 12% uncalled bases (where N is recorded rather than A, C, G, or T).
- BLAST matches between a query sequence and a database sequence were evaluated statistically and only reported when they satisfied the threshold of 10 ⁇ 25 for nucleotides and 10 ⁇ 14 for peptides. Homology was also evaluated by product score calculated as follows: the % nucleotide or amino acid identity [between the query and reference sequences] in BLAST is multiplied by the % maximum possible BLAST score [based on the lengths of query and reference sequences] and then divided by 100. In comparison with hybridization procedures used in the laboratory, the electronic stringency for an exact match was set at 70, and the conservative lower limit for an exact match was set at approximately 40 (with 1-2% error due to uncalled bases).
- the BLAST software suite includes various sequence analysis programs including “blastn” that is used to align nucleic acid molecules and BLAST 2 that is used for direct pairwise comparison of either nucleic or amino acid molecules.
- BLAST programs are commonly used with gap and other parameters set to default settings, e.g.: Matrix: BLOSUM62; Reward for match: 1; Penalty for mismatch: ⁇ 2; Open Gap: 5 and Extension Gap: 2 penalties; Gap x drop-off: 50; Expect: 10; Word Size: 11; and Filter: on.
- polynucleotides of this application were compared with assembled consensus sequences or templates found in the LIFESEQ GOLD database.
- Component sequences from cDNA, extension, full length, and shotgun sequencing projects were subjected to PHRED analysis and assigned a quality score. All sequences with an acceptable quality score were subjected to various pre-processing and editing pathways to remove low quality 3′ ends, vector and linker sequences, polyA tails, Alu repeats, mitochondrial and ribosomal sequences, and bacterial contamination sequences.
- Edited sequences had to be at least 50 bp in length, and low-information sequences and repetitive elements such as dinucleotide repeats, Alu repeats, and the like, were replaced by “Ns” or masked.
- Edited sequences were subjected to assembly procedures in which the sequences were assigned to polynucleotide bins. Each sequence could only belong to one bin, and sequences in each bin were assembled to produce a template. Newly sequenced components were added to existing bins using BLAST and CROSSMATCH. To be added to a bin, the component sequences had to have a BLAST quality score greater than or equal to 150 and an alignment of at least 82% local identity. The sequences in each bin were assembled using PHRAP. Bins with several overlapping component sequences were assembled using DEEP PHRAP. The orientation of each template was determined based on the number and orientation of its component sequences.
- Bins were compared to one another and those having local similarity of at least 82% were combined and reassembled. Bins having templates with less than 95% local identity were split. Templates were subjected to analysis by STTCHER/EXON MAPPER algorithms that analyze the probabilities of the presence of splice variants, alternatively spliced exons, splice junctions, differential expression of alternative spliced genes across tissue types or disease states, and the like. Assembly procedures were repeated periodically, and templates were annotated using BLAST against GenBank databases such as GBpri.
- templates were subjected to BLAST, motif, and other functional analyses and categorized in protein hierarchies using methods described in U.S. Ser. No. 08/812,290 and U.S. Ser. No. 08/811,758, both filed Mar. 6, 1997; in U.S. Ser. No. 08/947,845, filed Oct. 9, 1997; and in U.S. Ser. No. 09/034,807, filed Mar. 4, 1998.
- templates were analyzed by translating each template in all three forward reading frames and searching each translation against the PFAM database of hidden Markov model-based protein families and domains using the HMMER software package (Washington University School of Medicine, St. Louis Mo.; http://pfam.wustl.edu/).
- polynucleotide was further analyzed using MACDNASIS PRO software (Hitachi Software Engineering), and LASERGENE software (DNASTAR) and queried against public databases such as the GenBank rodent, mammalian, vertebrate, prokaryote, and eukaryote databases, SwissProt, BLOCKS, PRINTS, PFAM, and Prosite.
- Reg Regenerating (Reg) gene family (Alternate name: lithostathine) whose members include Reg-1 alpha, Reg-1 beta, and Reg-related protein.
- Reg stimulates growth of the beta islet cells; and its expression is correlated with insulin expression (Baeza et al. (1996) Diabetes Metab 22:229-34).
- Reg-1 alpha is an effective therapy for diabetes in mice, in combination with the inimunoregulator drug linomide. (Gross et al. (1998) Endocrinology 139: 2369-74).
- Colipase is a pancreatic exocrine protein whose synthesis increases in diabetic rats; synthesis of colipase is inhibited by insulin (Duan et al. (1991) Pancreas 6:595-602; Duan and Erlanson-Albertsson (1992) Pancreas 7:465-71).
- HiAPP Human islet amyloid polypeptide is a hormone-like peptide expressed in the insulin-producing beta cells of the endocrine pancreas (Nishi et al. (1989) Mol Endocrinol 3:1775-81).
- the p-value is the probability that the observed co-expression is due to chance, using the Fisher Exact Test.
- the highest co-expression value is obtained when the highest p-value found along the horizontal line following each SEQ ID NO (clone number) is correlated with a known marker gene (numbers 1-8 along the top line of the table). For example, clone number 2383628 (number 15), has a p-value of 14 as it co-expresses with lipase (number 1) and a p-value of 16 as it co-expresses with colipase (number 2); these values greatly exceed the threshold p-value for this experiment and are very highly significant.
- the data above can be summarized by reducing it to a single highest co-expression ( ⁇ log p) value for each intersecting known gene and unknown polynucleotide and naming at least one pancreatic disorder associated with expression of the known gene.
- Polynucleotides comprising the nucleic acid sequences of SEQ ID NOs: 1-13 of the present invention were first identified as Incyte Clones 223163, 884692, 888246, 888309, 951335, 2091133, 2383628, 2774542, 2777115, 3664676, 3833667, 3835361, and 3836037, respectively; and assembled according to Example III. As described in Example IV, BLAST and other motif searches were performed for each sequence. SEQ ID NOs:1-13 were translated, and sequence identity with known sequences was sought. SEQ ID NOs:14 and 15 of the present invention were encoded by SEQ ID NOs: 1 and 8, respectively. SEQ ID NOs: 14 and 15 were also analyzed using BLAST and motif search tools, and the results of these analyses are described below.
- SEQ ID NO:2 is 924 nucleic acids in length and has about 92% identity from about nucleotide 211 to about nucleotide 923 with a gene that encodes human pancreatic zymogen granule membrane protein, GP-2 (gl2445 11) and about 96% match from about nucleotide 923 to about nucleotide 594 with a gene that encodes a human zinc finger protein, ZNF133 (g487782).
- GP-2 is a 75 kDa glycoprotein released from the membrane of mature zymogen granules by an enzymatic mechanism. The C-terminal region of GP-2 exhibit 26 conserved cysteine residues and includes one epidermal growth factor motif.
- ZNF133 is a protein that belongs to the human zinc finger Kruppel family and contains a Kruppel-associated box segment. ZNF133 was localized to chromosome 20p 11.2 that is close to the deleted region that characterizes Alagille syndrome.
- SEQ ID NO:3 is about 845 nucleotides in length; it shows about 80% identity from about nucleotide 560 to about nucleotide 840 with a complete coding sequence for human protamine 1, protamine 2 and transition protein 2 (g642458) and about 86% identity with a gene that encodes TXA2 gene (EP 490410).
- TXA2 is a unstable arachidonate metabolite that functions as a potent stimulator of platelet aggregation and a constrictor of vascular and respiratory smooth muscle.
- SEQ ID NO:7 is 646 nucleotides in length and shows 77% identity from about nucleotide 1 to about nucleotide 402 with a rat mRNA that encodes syncollin, a secretory granule protein that binds to syntaxin in a Ca ++ -sensitive manner and functions as a regulator of exocytosis in exocrine tissues (g2258437).
- SEQ ID NO:12 is 874 nucleotides in length and shows 98% identity from about nucleotide 363 to about nucleotide 873 with a gene that encodes human pancreatic zymogen granule membrane protein, GP-2 mRNA (gl244511). SEQ ID NO:12 also exhibits 99% identity from about nucleotide 432 to about nucleotide 924 with SEQ ID NO:2. Therefore, SEQ ID NO:2 and SEQ ID NO: 12 are potential splice variants with related cellular functions.
- SEQ ID NO: 1 is 1966 nucleotides in length and shows 77% identity from nucleotide 1 to about nucleotide 1930 with an mRNA that encodes a rat uterus-ovary specific trans-membrane protein (g2460315). This uterus-ovary specific rat protein is expressed upon induction by estrogen.
- SEQ ID NO: 14 an amino acid sequence encoded by SEQ ID NO:1, is 585 amino acid residues in length and shows about 74% identity from about amino acid residue 22 to about amino acid residue 608 with the rat uterus-ovary specific trans-membrane protein (g2460316).
- SEQ ID NO:14 also exhibits a transmembrane domain encompassing amino acid residues 576 to 593.
- SEQ ID NO:14 has eight potential N-glycosylation sites at N30, N58, N68, N149, N272, N371, N395, and N420; twelve potential casein kinase II phosphorylation sites at T23, S109, S290, S349, S372, T380, T409, S464, S521, T557, T613, and T632; three N-myristoylation sites at G21, G29, and G39; thirteen potential protein kinase C phosphorylation sites at T45, S70, S132, S255, S280, T308, T328, T442, T468, S521, S527, T589, and T643; and three potential tyrosine kinase phosphorylation sites at Y180, Y415, and Y528.
- SEQ ID NO:8 is 1354 nucleotides in length and shows 99% identity with the human mRNA that codes for AQP8 (g2346968), a member of a family of water channel proteins identified from rat testis that contains the conserved transmembrane domains of the major intrinsic protein (MIP) family.
- SEQ ID NO:15 the amino acid sequence encoded by SEQ ID NO:8, is 255 amino acids in length and shows 100% sequence identity with AQP8.
- BLIMPS analysis shows that SEQ ID NO: 15 has six conserved amino acid segments that match the conserved transmembrane domains of the MIP family proteins. These segments encompass amino acid residues 30 to 49, 66 to 90, 103 to 122, 154 to 172, 185 to 207, and 222 to 242.
- the polynucleotides are applied to a substrate by one of the following methods.
- a mixture of polynucleotides is fractionated by gel electrophoresis and transferred to a nylon membrane by capillary transfer.
- the polynucleotides are individually ligated to a vector and inserted into bacterial host cells to form a library.
- the polynucleotides are then arranged on a substrate by one of the following methods. In the first method, bacterial cells containing individual clones are robotically picked and arranged on a nylon membrane.
- the membrane is placed on LB agar containing selective agent (carbenicillin, kanamycin, ampicillin, or chloramphenicol depending on the vector used) and incubated at 37 C for 16 hr.
- the membrane is removed from the agar and consecutively placed colony side up in 10% SDS, denaturing solution (1.5 M NaCl, 0.5 M NaOH), neutralizing solution (1.5 M NaCl, 1 M Tris-HCl, pH 8.0), and twice in 2 ⁇ SSC for 10 min each.
- the membrane is then UV irradiated in a STRATALINKER UV-crosslinker (Stratagene).
- polynucleotides are amplified from bacterial vectors by thirty cycles of PCR using primers complementary to vector sequences flanking the insert. PCR amplification increases a starting concentration of 1-2 ng nucleic acid to a final quantity greater than 5 ⁇ g.
- Amplified nucleic acids from about 400 bp to about 5000 bp in length are purified using SEPHACRYL-400 beads (APB). Purified nucleic acids are arranged on a nylon membrane manually or using a dot/slot blotting manifold and suction device and are immobilized by denaturation, neutralization, and UV irradiation as described above.
- Purified nucleic acids are robotically arranged and immobilized on polymer-coated glass slides using the procedure described in U.S. Pat. No. 5,807,522.
- Polymer-coated slides are prepared by cleaning glass microscope slides (Corning, Acton Mass.) by ultrasound in 0.1% SDS and acetone, etching in 4% hydrofluoric acid (VWR Scientific Products, West Chester Pa.), coating with 0.05% aminopropyl silane (Sigma-Aldrich) in 95% ethanol, and curing in a 110 C oven. The slides are washed extensively with distilled water between and after treatments.
- the nucleic acids are arranged on the slide and then immobilized by exposing the array to UV irradiation using a STRATALINKER UV-crosslinker (Stratagene). Arrays are then washed at room temperature in 0.2% SDS and rinsed three times in distilled water. Non-specific binding sites are blocked by incubation of arrays in 0.2% casein in phosphate buffered saline (PBS; Tropix, Bedford Mass.) for 30 min at 60 C; then the arrays are washed in 0.2% SDS and rinsed in distilled water as before.
- PBS phosphate buffered saline
- Hybridization probes derived from the polynucleotides of the Sequence Listing are employed for screening cDNAs, mRNAs, or genomic DNA in membrane-based hybridizations. Probes are prepared by diluting the polynucleotides to a concentration of 40-50 ng in 45 ⁇ l TE buffer, denaturing by heating to 100 C for five min, and briefly centrifuging. The denatured polynucleotide is then added to a REDIPRIME tube (APB), gently mixed until blue color is evenly distributed, and briefly centrifuged. Five ⁇ l of [ 3 P]dCTP is added to the tube, and the contents are incubated at 37 C for 10 min.
- APB REDIPRIME tube
- the labeling reaction is stopped by adding 5 ⁇ l of 0.2M EDTA, and probe is purified from unincorporated nucleotides using a PROBEQUANT G-50 microcolumn (APB).
- the purified probe is heated to 100 C for five min, snap cooled for two min on ice, and used in membrane-based hybridizations as described below.
- Hybridization probes derived from mRNA isolated from samples are employed for screening polynucleotides of the Sequence Listing in array-based hybridizations.
- Probe is prepared using the GEMbright kit (Incyte Genomics) by diluting mRNA to a concentration of 200 ng in 9 ⁇ l TE buffer and adding 5 ⁇ l 5 ⁇ buffer, 1 ⁇ l 0.1 M DTT, 3 ⁇ l Cy3 or Cy5 labeling mix, 1 ⁇ l RNAse inhibitor, 1 ⁇ l reverse transcriptase, and 5 ⁇ l 1 ⁇ yeast control mRNAs.
- Yeast control mRNAs are synthesized by in vitro transcription from noncoding yeast genomic DNA (W. Lei, unpublished).
- one set of control mRNAs at 0.002 ng, 0.02 ng, 0.2 ng, and 2 ng are diluted into reverse transcription reaction mixture at ratios of 1:100,000, 1:10,000, 1:1000, and 1:100 (w/w) to sample mRNA respectively.
- a second set of control mRNAs are diluted into reverse transcription reaction mixture at ratios of 1:3, 3:1, 1:10, 10:1, 1:25, and 25:1 (w/w).
- the reaction mixture is mixed and incubated at 37 C for two hr.
- the reaction mixture is then incubated for 20 min at 85 C, and probes are purified using two successive CHROMA SPIN+TE 30 columns (Clontech, Palo Alto Calif.).
- Purified probe is ethanol precipitated by diluting probe to 90 ⁇ l in DEPC-treated water, adding 2 ⁇ l lmg/ml glycogen, 60 ⁇ l 5 M sodium acetate, and 300 ⁇ l 100% ethanol.
- the probe is centrifuged for 20 min at 20,800 ⁇ g, and the pellet is resuspended in 12 ⁇ l resuspension buffer, heated to 65 C for five min, and mixed thoroughly. The probe is heated and mixed as before and then stored on ice. Probe is used in high density array-based hybridizations as described below.
- Membranes are pre-hybridized in hybridization solution containing 1% Sarkosyl and lx high phosphate buffer (0.5 M NaCl, 0.1 M Na 2 HPO 4 , 5 mM EDTA, pH 7) at 55 C for two hr.
- the probe diluted in 15 ml fresh hybridization solution, is then added to the membrane.
- the membrane is hybridized with the probe at 55 C for 16 hr.
- the membrane is washed for 15 min at 25 C in 1 mM Tris (pH 8.0), 1% Sarkosyl, and four times for 15 min each at 25 C in lmM Tris (pH 8.0).
- XOMAT-AR film Eastman Kodak, Rochester N.Y.
- XOMAT-AR film Eastman Kodak, Rochester N.Y.
- Probe is heated to 65 C for five min, centrifuged five min at 9400 rpm in a 5415C microcentrifuge (Eppendorf Scientific, Westbury N.Y.), and then 18 ⁇ l are aliquoted onto the array surface and covered with a coverslip.
- the arrays are transferred to a waterproof chamber having a cavity just slightly larger than a microscope slide.
- the chamber is kept at 100% humidity internally by the addition of 140 ⁇ l of 5 ⁇ SSC in a corner of the chamber.
- the chamber containing the arrays is incubated for about 6.5 hr at 60 C.
- the arrays are washed for 10 min at 45 C in 1 ⁇ SSC, 0.1% SDS, and three times for 10 min each at 45 C in 0.1 ⁇ SSC, and dried.
- Hybridization reactions are performed in absolute or differential hybridization formats.
- absolute hybridization format probe from one sample is hybridized to array elements, and signals are detected after hybridization complexes form. Signal strength correlates with probe mRNA levels in the sample.
- differential hybridization format differential expression of a set of genes in two biological samples is analyzed. Probes from the two samples are prepared and labeled with different labeling moieties. A mixture of the two labeled probes is hybridized to the array elements, and signals are examined under conditions in which the emissions from the two different labels are individually detectable. Elements on the array that are hybridized to equal numbers of probes derived from both biological samples give a distinct combined fluorescence (Shalon WO95/35505).
- Hybridization complexes are detected with a microscope equipped with an INNOVA 70 mixed gas 10 W laser (Coherent, Santa Clara CA) capable of generating spectral lines at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5.
- the excitation laser light is focused on the array using a 20 ⁇ microscope objective (Nikon, Melville N.Y.).
- the slide containing the array is placed on a computer-controlled X-Y stage on the microscope and raster-scanned past the objective with a resolution of 20 micrometers.
- the two fluorophores are sequentially excited by the laser.
- Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, Hamamatsu Photonics Systems, Bridgewater N.J.) corresponding to the two fluorophores.
- PMT R1477 Hamamatsu Photonics Systems, Bridgewater N.J.
- Appropriate filters positioned between the array and the photomultiplier tubes are used to filter the signals.
- the emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for CyS.
- the sensitivity of the scans is calibrated using the signal intensity generated by the yeast control mRNAs added to the probe mix.
- a specific location on the array contains a complementary DNA sequence, allowing the intensity of the signal at that location to be correlated with a weight ratio of hybridizing species of 1:100,000.
- the output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital (A/D) conversion board (Analog Devices, Norwood Mass.) installed in an IBM-compatible PC computer.
- the digitized data are displayed as an image where the signal intensity is mapped using a linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high signal).
- the data is also analyzed quantitatively. Where two different fluorophores are excited and measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping emission spectra) between the fluorophores using the emission spectrum for each fluorophore.
- a grid is superimposed over the fluorescence signal image such that the signal from each spot is centered in each element of the grid.
- the fluorescence signal within each element is then integrated to obtain a numerical value corresponding to the average intensity of the signal.
- the software used for signal analysis is the GEMTOOLS program (Incyte Genomics).
- All sequences and cDNA libraries in the LIFESEQ database were categorized by system, organ/tissue and cell type.
- the categories included cardiovascular system, connective tissue, digestive system, embryonic structures, endocrine system, exocrine glands, female and male reproductive, germ cells, hemic/immune system, liver, musculoskeletal system, nervous system, pancreas, respiratory system, sense organs, skin, stomatognathic system, unclassified/mixed, and the urinary tract.
- the number of libraries in which the sequence was expressed were counted and shown over the total number of libraries in that category.
- all normalized or pooled libraries which have high copy number sequences removed prior to processing, and all mixed or pooled tissues, which are considered non-specific in that they contain more than one tissue type or more than one subject's tissue, can be excluded from the analysis.
- Cell lines and/or fetal tissue data can also be disregarded unless the elucidation of inherited disorders would be furthered by their inclusion in the analysis.
- transcript image for SEQ ID NO:2 is shown below. No libraries were excluded from the analysis. SEQ ID NO:2 was only expressed in pancreatic tissues, which agrees with the 100% specificity shown in Example VI above, and the transcript image both shows independent confirmation of the results of the co-expression analysis and demonstrates differential expression of SEQ ID NO:2 in type I diabetes. Expression exceeded that of any other diseased pancreas library, including tumor and cytologically normal tissue, by greater than five-fold.
- SEQ ID NO:2 (Category: Pancreas) Library cDNAs Description Abundance % Abundance PANCNOT23 3920 pancreas, type 9 0.2296 I diabetes, 43F PANCNOT17 4037 pancreas, 2 0.0495 mw/mets neuroendocrine CA of liver, 65F PANCNOT16 2812 pancreas, 1 0.0356 aw/Patau's, fetal, 20wM PANCNOT05 6805 pancreas, 2M 2 0.0294 PANCNOT19 3775 pancreas, 8M 1 0.0265 PANCNOT21 3846 pancreas, 8M 1 0.0260
- Complementary molecules include genomic sequences (such as enhancers or introns) and are used in “triple helix” base pairing to compromise the ability of the double helix to open sufficiently for the binding of polymerases, transcription factors, or regulatory molecules.
- a complementary molecule is designed to prevent ribosomal binding to the mRNA encoding the protein.
- Complementary molecules are placed in expression vectors and used to transform a cell line to test efficacy; into an organ, tumor, synovial cavity, or the vascular system for transient or short term therapy; or into a stem cell, zygote, or other reproducing lineage for long term or stable gene therapy.
- Transient expression lasts for a month or more with a non-replicating vector and for three months or more if appropriate elements for inducing vector replication are used in the transformation/expression system.
- Expression and purification of the protein are achieved using either a cell expression system or an insect cell expression system.
- the pUB6/V5-His vector system (Invitrogen, Carlsbad Calif.) is used to express protein in CHO cells.
- the vector contains the selectable bsd gene, multiple cloning sites, the promoter/enhancer sequence from the human ubiquitin C gene, a C-terminal V5 epitope for antibody detection with anti-V5 antibodies, and a C-terminal polyhistidine (6 ⁇ His) sequence for rapid purification on PROBOND resin (Invitrogen). Transformed cells are selected on media containing blasticidin.
- Spodoptera frugiperda (Sf9) insect cells are infected with recombinant Autographica californica nuclear polyhedrosis virus (baculovirus).
- the polyhedrin gene is replaced with the polynucleotide by homologous recombination and the polyhedrin promoter drives transcription.
- the protein is synthesized as a fusion protein with 6 ⁇ his which enables purification as described above. Purified protein is used in the following activity and to make antibodies.
- the protein is purified using polyacrylamide gel electrophoresis and used to immunize mice or rabbits. Antibodies are produced using the protocols below. Alternatively, the amino acid sequence of the expressed protein is analyzed using LASERGENE software (DNASTAR) to determine regions of high antigenicity. An antigenic epitope, usually found near the C-terminus or in a hydrophilic region is selected, synthesized, and used to raise antibodies.
- Naturally occurring or recombinant protein is purified by immunoaffinity chromatography using antibodies which specifically bind the protein.
- An immunoaffinity column is constructed by covalently coupling the antibody to CNBr-activated SEPHAROSE resin (APB). Media containing the protein is passed over the immunoaffinity column, and the column is washed using high ionic strength buffers in the presence of detergent to allow preferential absorbance of the protein. After coupling, the protein is eluted from the column using a buffer of pH 2-3 or a high concentration of urea or thiocyanate ion to disrupt antibody/protein binding, and the protein is collected.
- APB CNBr-activated SEPHAROSE resin
- the polynucleotide, or fragments thereof, or the protein, or portions thereof, are labeled with 32 P-dCTP, Cy3-dCTP, or Cy5-dCTP (APB), or with BIODIPY or FITC (Molecular Probes, Eugene Oreg.), respectively.
- Libraries of candidate molecules or compounds previously arranged on a substrate are incubated in the presence of composition, a labeled polynucleotide or protein. After incubation under conditions for either a nucleic acid or amino acid sequence, the substrate is washed, and any position on the substrate retaining label, which indicates specific binding or complex formation, is assayed, and the ligand is identified. Data obtained using different concentrations of the nucleic acid or protein are used to calculate affinity between the labeled nucleic acid or protein and the bound molecule.
- a yeast two-hybrid system MATCHMAKER LexA Two-Hybrid system (Clontech Laboratories, Palo Alto Calif.), is used to screen for peptides that bind the protein of the invention.
- a polynucleotide encoding the protein is inserted into the multiple cloning site of a pLexA vector, ligated, and transformed into E. coli .
- cDNA, prepared from mRNA is inserted into the multiple cloning site of a pB42AD vector, ligated, and transformed into E. coli to construct a cDNA library.
- the pLexA plasmid and pB42AD-cDNA library constructs are isolated from E.
- Transformed yeast cells are plated on synthetic dropout (SD) media lacking histidine (-His), tryptophan (-Trp), and uracil (-Ura), and incubated at 30 C until the colonies have grown up and are counted.
- SD synthetic dropout
- the colonies are pooled in a minimal volume of lx TE (pH 7.5), replated on SD/-His/-Leu/-Trp/-Ura media supplemented with 2% galactose (Gal), 1% raffinose (Raf), and 80 mg/ml 5-bromo-4-chloro-3-indolyl ⁇ -d-galactopyranoside (X-Gal), and subsequently examined for growth of blue colonies.
- Interaction between expressed protein and cDNA fusion proteins activates expression of a LEU2 reporter gene in EGY48 and produces colony growth on media lacking leucine (-Leu).
- Interaction also activates expression of ⁇ -galactosidase from the p8op-lacZ reporter construct that produces blue color in colonies grown on X-Gal.
- Histidine-requiring colonies are grown on SD/Gal/Raf/X-Gal/-Trp/-Ura, and white colonies are isolated and propagated.
- the pB42AD-cDNA plasmid which contains a polynucleotide encoding a protein that physically interacts with the protein, is isolated from the yeast cells and characterized.
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Diabetes (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Genetics & Genomics (AREA)
- Biochemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- Zoology (AREA)
- Toxicology (AREA)
- Hematology (AREA)
- Animal Behavior & Ethology (AREA)
- Engineering & Computer Science (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Pharmacology & Pharmacy (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Endocrinology (AREA)
- Emergency Medicine (AREA)
- Obesity (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Peptides Or Proteins (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
Abstract
The invention provides compositions and novel polynucleotides and their encoded proteins that co-express with genes involved in insulin synthesis and known to be associated with pancreatic disorders. The invention also provides expression vectors, host cells, proteins encoded by the polynucleotides and antibodies which specifically bind the proteins. The invention also provides methods for the diagnosis, prognosis, evaluation of therapies and treatment of pancreatic disorders.
Description
- This application is a continuation-in-part of U.S. Ser. No. 09/226,994, filed Jan. 7, 1999.
- The invention relates to discovery of thirteen isolated polynucleotides and their encoded proteins that are highly co-expressed with genes known to be involved in insulin synthesis and useful for diagnosis, prognosis, and treatment of pancreatic disorders.
- Insulin is a hormone produced in the beta islet cells of the pancreas. Patients with diabetes have serum glucose levels that are chronically elevated above normal because they either produce insufficient insulin (type I diabetes) or are resistant to insulin (type II diabetes). Complications of diabetes include angina, hypertension, myocardial infarctions, peripheral vascular disease, diabetic retinopathy, diabetic nephropathy, diabetic necrosis, ulceration, and diabetic neuropathy (Davidson (1998)Diabetes Mellitus, W B Saunders, Philadelphia Pa.).
- While some genes that participate in or regulate insulin synthesis and release are known, many genes that function in these critical pathways remain to be identified. Identification of currently unknown genes will provide surrogate diagnostic markers and new therapeutic targets.
- Thus the present invention satisfies a need in the art by providing new compositions that are useful for diagnosis, prognosis, treatment, and evaluation of therapies for pancreatic disorders, especially diabetes. A method for analyzing gene expression patterns has been used to identity thirteen polynucleotides that have highly significant co-expression with genes known to be involved with insulin-synthesis.
- The invention provides a composition comprising a plurality of polynucleotides having the nucleic acid sequences of SEQ ID NOs: 1-13 or the complements thereof that are highly significantly co-expressed with genes such as insulin, glucagon, lipase, colipase, human islet amyloid polypeptide (HiAPP) and Reg-1 alpha, Reg-1 beta, and Reg-related regenerating genes (Reg), known to involved in insulin synthesis. The invention also provides an isolated polynucleotide comprising a nucleic acid sequence selected from SEQ ID NOs: 1-13 or the complement thereof. In different aspects, the polynucleotide is used as a surrogate marker, as a probe, in an expression vector, and in the diagnosis, prognosis, evaluation of therapies and treatment of pancreatic disorders. The invention further provides a composition comprising a polynucleotide and a labeling moiety.
- The invention provides a method for using a composition or a polynucleotide of the invention to screen a plurality of molecules and compounds to identify ligands which specifically bind to the composition or the polynucleotide. The molecules are selected from DNA molecules, RNA molecules, peptide nucleic acids, peptides, mimetics, ribozymes, transcription factors, enhancers, and repressors. The invention also provides a method of using a composition or a polynucleotide to purify a ligand.
- The invention provides a method for using a composition or an isolated polynucleotide to detect gene expression in a sample by hybridizing the composition or polynucleotide to nucleic acids of the sample under conditions for formation of one or more hybridization complexes and detecting hybridization complex formation, wherein complex formation indicates gene expression in the sample. In one aspect, the composition or polynucleotide is attached to a substrate. In another aspect, the nucleic acids of the sample are amplified prior to hybridization. In yet another aspect, complex formation is compared with at least one standard and indicates the presence of a pancreatic disorder.
- The invention provides a purified protein or a portion thereof selected from SEQ ID NOs: 14 and 15, which is encoded by a polynucleotide that is highly significantly co-expressed with genes known to involved in insulin synthesis and whose expression is associated with pancreatic disorders. The invention also provides a method for using a protein to screen a plurality of molecules to identify at least one ligand which specifically binds the protein. The molecules are selected from aptamers, DNA molecules, RNA molecules, peptide nucleic acids, peptides, mimetics, ribozymes, proteins, antibodies, agonists, antagonists, immunoglobulins, inhibitors, pharmaceutical agents or drug compounds. The invention further provides a method of using a protein to purify a ligand.
- The invention provides a method of using a protein to make an antibody that specifically binds to the protein of the invention, and methods for using the antibody to diagnose or treat a pancreatic disorder. The invention also provides a composition comprising a polynucleotide, a protein, or an antibody that specifically binds a protein and a pharmaceutical carrier.
- The Sequence Listing provides exemplary polynucleotides comprising the nucleic acid sequences of SEQ ID NOs:1-13 some of which encode the proteins comprising the amino acid sequences of SEQ ID NOs:14 and 15. Each sequence is identified by a sequence identification number (SEQ ID NO) and by the Incyte clone number with which the sequence was first identified.
- It must be noted that as used herein and in the appended claims, the singular forms “a”, “an”, and “the” include the plural reference unless the context clearly dictates otherwise. Thus, for example, a reference to “a host cell” includes a plurality of such host cells, and a reference to “an antibody” is a reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so forth.
- Definitions
- “Markers for pancreatic disorders” refers to polynucleotides, proteins, and antibodies which are useful in the diagnosis, prognosis, evaluation of therapies and treatment of pancreatic disorders. Typically, this means that the marker gene is differentially expressed in samples from subjects predisposed to, manifesting, or diagnosed with a pancreatic disorder.
- “Differential expression” refers to an increased or up-regulated or a decreased or down-regulated expression as detected by presence, absence or at least about a two-fold change in the amount of transcribed messenger RNA or protein in a sample.
- “Pancreatic disorders” specifically include, but are not limited to, the following conditions, diseases, and disorders: type I and type II diabetes; complications of diabetes including angina, hypertension, myocardial infarctions, peripheral vascular disease, diabetic retinopathy, diabetic nephropathy, diabetic necrosis, ulceration, and diabetic neuropathy; islet cell hyperplasia; pancreatitis; and pancreatic tumor.
- “Isolated or purified” refers to a polynucleotide or protein that is removed from its natural environment and that is separated from other components with which it is naturally present.
- “Genes known to be highly expressed in insulin synthesis pathways” which were used in the co-expression analysis included insulin, glucagon, lipase, colipase, human islet amyloid polypeptide (HiAPP) and Reg-1 alpha, Reg-1 beta, and Reg-related regenerating genes (Reg).
- “Polynucleotide” refers to an isolated cDNA. It can be of genomic or synthetic origin, double-stranded or single-stranded, and combined with vitamins, minerals, carbohydrates, lipids, proteins, or other nucleic acids to perform a particular activity or form a useful composition.
- “Protein” refers to a purified polypeptide whether naturally occurring or synthetic.
- “Sample” is used in its broadest sense. A sample containing nucleic acids can comprise a bodily fluid; an extract from a cell; a chromosome, organelle, or membrane isolated from a cell; genomic DNA, RNA, or cDNA in solution or bound to a substrate; a cell; a tissue; a tissue print; and the like.
- “Substrate” refers to any rigid or semi-rigid support to which polynucleotides or proteins are bound and includes membranes, filters, chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, capillaries or other tubing, plates, polymers, and microparticles with a variety of surface forms including wells, trenches, pins, channels and pores.
- A “transcript image” is a profile of gene transcription activity in a particular tissue at a particular time.
- A “variant” refers to a polynucleotide or protein whose sequence diverges from about 5% to about 30% from the nucleic acid or amino acid sequences of the Sequence Listing.
- The Invention
- The present invention employed “guilt by association or GBA”, a method for using marker genes known to be associated with a particular condition, disease or disorder to identify surrogate markers, polynucleotides and their encoded proteins, that are similarly associated or co-expressed in the same condition, disease, or disorder (Walker and Volkmuth (1999) Prediction of gene function by genome-scale expression analysis: prostate-associated genes. Genome Res 9:1198-1203, incorporated herein by reference). In particular, the method identifies cDNAs cloned from mRNA transcripts which are active in tissues known to have been removed from subjects with pancreatic disorders. The polynucleotides, their encoded proteins and antibodies which specifically bind to the encoded proteins are useful for diagnosis, prognosis, evaluation of therapies, and treatment of pancreatic disorders.
- Guilt by association provides for the identification of polynucleotides that are expressed in a plurality of libraries. The polynucleotides represent genes of unknown function which are expressed in a specific signaling pathway, disease process, subcellular compartment, cell type, tissue, or species. The expression patterns of the genes known to be highly expressed during insulin synthesis; insulin, glucagon, lipase, colipase, HiAPP, and Reg; are compared with those of polynucleotides with unknown function to determine whether a specified co-expression probability threshold is met. Through this comparison, a subset of the polynucleotides having a high co-expression probability with the known marker genes can be identified.
- The polynucleotides originate from human cDNA libraries. These polynucleotides can also be selected from a variety of sequence types including, but not limited to, expressed sequence tags (ESTs), assembled polynucleotides, full length coding regions, and 3′ untranslated regions. To be considered in GBA or co-expression analysis, the polynucleotides had to have been expressed in at least five cDNA libraries. In this application, GBA was applied to a total of 41,419 assembled polynucleotide bins that met the criteria of having been expressed in at least five libraries.
- The cDNA libraries used in the co-expression analysis were obtained from adrenal gland, biliary tract, bladder, blood cells, blood vessels, bone marrow, brain, bronchus, cartilage, chromaffin system, colon, connective tissue, cultured cells, embryonic stem cells, endocrine glands, epithelium, esophagus, fetus, ganglia, heart, hypothalamus, hemic/immune system, intestine, islets of Langerhans, kidney, larynx, liver, lung, lymph, muscles, neurons, ovary, pancreas, penis, phagocytes, pituitary, placenta, pleura, prostate, salivary glands, seminal vesicles, skeleton, spleen, stomach, testis, thymus, tongue, ureter, uterus, and the like. The number of cDNA libraries analyzed can range from as few as three to greater than 10,000 and preferably, the number of the cDNA libraries is greater than 500.
- In a preferred embodiment, the polynucleotides are assembled from related sequences, such as sequence fragments derived from a single transcript. Assembly of the polynucleotide can be performed using sequences of various types including, but not limited to, ESTs, extension of the EST, shotgun sequences from a cloned insert, or full length cDNAs. In a most preferred embodiment, the polynucleotides are derived from human sequences that have been assembled using the algorithm disclosed in U.S. Ser. No. 9,276,534, filed Mar. 25, 1999, and used in U.S. Ser. No. 09/226,994, filed Jan. 7, 1999, both incorporated herein by reference.
- Experimentally, differential expression of the polynucleotides can be evaluated by methods including, but not limited to, differential display by spatial immobilization or by gel electrophoresis, genome mismatch scanning, representational difference analysis, and transcript imaging. The results of transcript imaging for SEQ ID NO:2 are shown in Example IX . Differential expression of SEQ ID NO:2 is highly specifically correlated with type I diabetes. The transcript image provided direct confirmation of the strength of co-expression analysis—the use of known genes to identify unknown polynucleotides and their encoded proteins which are highly significantly associated with insulin synthesis and pancreatic disorders. Additionally, differential expression can be assessed by microarray technology. These methods can be used alone or in combination.
- Genes known to be highly expressed in pancreatic disorders can be selected based on research in which the genes are found to be key elements of insulin synthesis pathways or on the known use of the genes as diagnostic or prognostic markers or therapeutic targets for pancreatic disorders. Preferably, the known genes are insulin, glucagon, lipase, colipase, HiAPP, and Reg.
- The procedure for identifying novel polynucleotides that exhibit a statistically significant co-expression pattern with known genes is as follows. First, the presence or absence of a polynucleotide in a cDNA library is defined: a polynucleotide is present in a cDNA library when at least one cDNA fragment corresponding to the polynucleotide is detected in a cDNA from that library, and a polynucleotide is absent from a library when no corresponding cDNA fragment is detected.
- Second, the significance of co-expression is evaluated using a probability method to measure a due-to-chance probability of the co-expression. The probability method can be the Fisher exact test, the chi-squared test, or the kappa test. These tests and examples of their applications are well known in the art and can be found in standard statistics texts (Agresti (1990)Categorical Data Analysis, John Wiley & Sons, New York N.Y.; Rice (1988) Mathematical Statistics and Data Analysis, Duxbury Press, Pacific Grove Calif.). A Bonferroni correction (Rice, supra, p. 384) can also be applied in combination with one of the probability methods for correcting statistical results of one polynucleotide versus multiple other polynucleotides. In a preferred embodiment, the due-to-chance probability is measured by a Fisher exact test, and the threshold of the due-to-chance probability is set preferably to less than 0.001, more preferably to less than 0.00001.
- For example, to determine whether two genes, A and B, have similar co-expression patterns, occurrence data vectors can be generated as illustrated in the table below. The presence of a gene occurring at least once in a library is indicated by a one, and its absence from the library, by a zero.
Library 1 Library 2 Library 3 . . . Library N Gene A 1 1 0 . . . 0 Gene B 1 0 1 . . . 0 - For a given pair of genes, the occurrence data in the table above can be summarized in a 2×2 contingency table. The second table (below) presents co-occurrence data for gene A and gene B in a total of 30 libraries. Both gene A and gene B occur 10 times in the libraries.
Gene A Present Gene A Absent Total Gene B Present 8 2 10 Gene B Absent 2 18 20 Total 10 20 30 - The second table summarizes and presents: 1) the number of times gene A and B are both present in a library; 2) the number of times gene A and B are both absent in a library; 3) the number of times gene A is present, and gene B is absent; and 4) the number of times gene B is present, and gene A is absent. The upper left entry is the number of times the two genes co-occur in a library, and the middle right entry is the number of times neither gene occurs in a library. The off diagonal entries are the number of times one gene occurs, and the other does not. Both A and B are present eight times and absent 18 times. Gene A is present, and gene B is absent, two times; and gene B is present, and gene A is absent, two times. The probability (“p-value”) that the above association occurs due to chance as calculated using a Fisher exact test is 0.0003.
- This method of estimating the probability for co-expression makes several assumptions. The method assumes that the libraries are independent and are identically sampled. However, in practical situations, the selected cDNA libraries are not entirely independent, because more than one library can be obtained from a single subject or tissue. Nor are they entirely identically sampled, because different numbers of cDNAs can have been sequenced from each library. The number of cDNAs sequenced typically ranges from 5,000 to 10,000 cDNAs per library. After the Fisher exact co-expression probability is calculated for each polynucleotide versus all other assembled polynucleotides that occur, a Bonferroni correction for multiple statistical tests is applied.
- Using the method of the present invention, we have identified polynucleotides, SEQ ID NOs: 1-13 and their encoded proteins, SEQ ID NOs: 14 and 15, that exhibit highly significant co-expression probability with known marker genes for pancreatic disorders. The results presented in Example VI show the direct (known gene to unknown polynucleotide) or indirect (known gene to unknown polynucleotide to a second unknown polynucleotide) associations among the novel polynucleotides and the known marker genes for pancreatic disorders. Therefore, by these associations, the novel polynucleotides are useful as surrogate markers for the co-expressed known marker genes in diagnosis, prognosis, evaluation of therapies and treatment of pancreatic disorders. Further, the proteins or peptides expressed from the novel polynucleotides are either potential therapeutics or targets for the identification and/or development of therapeutics.
- In one embodiment, the present invention encompasses a composition comprising a plurality of polynucleotides having the nucleic acid sequences of SEQ ID NOs:1-13 or the complements thereof. These thirteen polynucleotides are shown by the method of the present invention to have significant co-expression with known genes associated with pancreatic disorders. The invention also provides a polynucleotide, its complement, a probe comprising the polynucleotide or the complement thereof selected from SEQ ID NOs: 1-13 and variants thereof.
- The polynucleotide can be used to search against the GenBank primate (pri), rodent (rod), mammalian (mam), vertebrate (vrtp), and eukaryote (eukp) databases; the encoded protein, against GenPept, SwissProt, BLOCKS (Bairoch et al. (1997) Nucleic Acids Res 25:217-221), PFAM, and other databases that contain previously identified and annotated protein sequences, motifs, and gene functions. Methods that search for primary sequence patterns with secondary structure gap penalties (Smith et al. (1992) Protein Engineering 5:35-51) as well as algorithms such as Basic Local Alignment Search Tool (BLAST; Altschul (1993) J Mol Evol 36:290-300; Altschul et al. (1990) J Mol Biol 215:403-410), BLOCKS (Henikoff and Henikoff (1991) Nucleic Acids Res 19:6565-6572), Hidden Markov Models (HMM; Eddy (1996) Cur Opin Str Biol 6:361-365; Sonnhammer et al. (1997) Proteins 28:405-420), and the like, can be used to manipulate and analyze nucleotide and amino acid sequences. These databases, algorithms and other methods are well known in the art and are described in Ausubel et al. (1997; Short Protocols in Molecular Biology, John Wiley & Sons, New York N.Y., unit 7.7) and in Meyers (1995; Molecular Biology and Biotechnology, Wiley VCH, New York N.Y., p 856-853).
- Also encompassed by the invention are polynucleotides that are capable of hybridizing to SEQ ID NOs:1-13 and the complements thereof under highly stringent conditions. Stringency can be defined by salt concentration, temperature, and other chemicals and conditions well known in the art. Conditions can be selected, for example, by varying the concentrations of salt in the prehybridization, hybridization, and wash solutions or by varying the hybridization and wash temperatures. With some substrates, the temperature can be decreased by adding a solvent such as formamide to the prehybridization and hybridization solutions.
- Hybridization can be performed at low stringency, with buffers such as 5× SSC (saline sodium citrate) with 1% sodium dodecyl sulfate (SDS) at 60 C, which permits complex formation between two nucleic acid sequences that contain some mismatches. Subsequent washes are performed at higher stringency with buffers such as 0.2× SSC with 0.1% SDS at either 45 C (medium stringency) or 68 C (high stringency), to maintain hybridization of only those complexes that contain completely complementary sequences. Background signals can be reduced by the use of detergents such as SDS, sarcosyl, or TRITON X-100 (Sigma-Aldrich, St. Louis Mo.), and/or a blocking agent, such as salmon sperm DNA. Hybridization methods are described in detail in Ausubel (supra, units 2.8-2.11, 3.18-3.19 and 4-6-4.9) and Sambrook et al. (1989; Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview N.Y.).
- A polynucleotide can be extended utilizing primers and employing various PCR-based methods known in the art to detect upstream sequences such as promoters and other regulatory elements. (See, e.g., Dieffenbach and Dveksler (1995)PCR Primer, a Laboratory Manual, Cold Spring Harbor Press, Plainview N.Y.) Commercially available kits such as XL-PCR (Applied Biosystems, Foster City Calif.), cDNA libraries (Life Technologies, Rockville Md.) or genomic libraries (Clontech, Palo Alto Calif.) and nested primers can be used to extend the sequence. For all PCR-based methods, primers can be designed using commercially available software (LASERGENE software, DNASTAR, Madison Wis.) or another program, to be about 15 to 30 nucleotides in length, to have a GC content of about 50%, and to form a hybridization complex at temperatures of about 68 C to 72 C.
- In another aspect of the invention, the polynucleotide can be cloned into a recombinant vector that directs the expression of the protein, or structural or functional portions thereof, in host cells. Due to the inherent degeneracy of the genetic code, other DNA sequences which encode functionally equivalent amino acid sequence can be produced and used to express the protein encoded by the polynucleotide. The nucleotide sequences of the present invention can be engineered using methods generally known in the art in order to alter the nucleotide sequences for a variety of purposes including, but not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA shuffling by random fragmentation, as described in U.S. Pat. No. 5,830,721, and PCR reassembly of gene fragments and synthetic oligonucleotides can be used to engineer the nucleotide sequences. For example. oligonucleotide-mediated site-directed mutagenesis can be used to introduce mutations that create new restriction sites, alter glycosylation patterns, change codon preference, produce splice variants, and so forth.
- In order to express a biologically active protein, the polynucleotide or derivatives thereof, can be inserted into an expression vector with elements for transcriptional and translational control of the inserted coding sequence in a particular host. These elements include regulatory sequences, such as enhancers, constitutive and inducible promoters, and 5′ and 3′ untranslated regions. Methods which are well known to those skilled in the art can be used to construct such expression vectors. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in vivo genetic recombination (Ausubel, supra, unit 16).
- A variety of expression vector/host cell systems can be utilized to express the polynucleotide. These include, but are not limited to, microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid, or cosmid expression vectors; yeast transformed with yeast expression vectors; insect cell systems infected with baculovirus vectors; plant cell systems transformed with viral or bacterial expression vectors; or animal cell systems. For long term production of recombinant proteins in mammalian systems, stable expression in cell lines is preferred. For example, the polynucleotide can be transformed into cell lines using expression vectors which can contain viral origins of replication and/or endogenous expression elements and a selectable or visible marker gene on the same or on a separate vector. The invention is not to be limited by the vector or host cell employed.
- In general, host cells that contain the polynucleotide and that express the protein can be identified by a variety of procedures known to those of skill in the art. These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or chip-based technologies for the detection and/or quantification of nucleic acid or amino acid sequences. Immunological methods for detecting and measuring the expression of the protein using either specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence activated cell sorting (FACS).
- Host cells transformed with the polynucleotide can be cultured under conditions for the expression and recovery of the protein from cell culture. The protein produced by a transgenic cell can be secreted or retained intracellularly depending on the sequence and/or the vector used. As will be understood by those of skill in the art, expression vectors containing the polynucleotide can be designed to contain signal sequences which direct secretion of the protein through a prokaryotic cell wall or eukaryotic cell membrane.
- In addition, a host cell strain can be chosen for its ability to modulate expression of the inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the protein include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a “prepro” form of the protein can also be used to specify protein targeting, folding, and/or activity. Different host cells which have specific cellular machinery and characteristic mechanisms for post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and W138) are available from the ATCC (Manassas VA) and can be chosen to ensure the correct modification and processing of the expressed protein.
- In another embodiment of the invention, natural, modified, or recombinant polynucleotides are ligated to a heterologous sequence resulting in translation of a fusion protein containing heterologous protein moieties in any of the aforementioned host systems. Such heterologous protein moieties facilitate purification of fusion proteins using commercially available affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase, maltose binding protein, thioredoxin, calmodulin binding peptide, 6-His, FLAG, c-myc, hemaglutinin, and monoclonal antibody epitopes.
- In another embodiment, the polynucleotides, wholly or in part, are synthesized using chemical or enzymatic methods well known in the art (Caruthers et al. (1980) Nucl Acids Symp Ser (7) 215-233; Ausubel, supra, units 10.4 and 10.16). Peptide synthesis can be performed using various solid-phase techniques (Roberge et al. (1995) Science 269:202-204), and machines such as the ABI 431A peptide synthesizer (Applied Biosystems) can be used to automate synthesis. If desired, the amino acid sequence can be altered during synthesis to produce a more stable variant for therapeutic use.
- Screening, Diagnostics and Therapeutics
- The polynucleotides can be used as surrogate markers in diagnosis, prognosis, evaluation of therapies and treatment of pancreatic disorders including, but not limited to, type I and type II diabetes; complications of diabetes including angina, hypertension, myocardial infarctions, peripheral vascular disease, diabetic retinopathy, diabetic nephropathy, diabetic necrosis, ulceration, and diabetic neuropathy; islet cell hyperplasia; pancreatitis; and pancreatic tumor.
- The polynucleotide can be used to screen a plurality or library of molecules and compounds for specific binding affinity. The assay can be used to screen DNA molecules, RNA molecules, peptide nucleic acids, peptides, mimetics, ribozymes, or proteins including transcription factors, enhancers, repressors, and the like which regulate the activity of the polynucleotide in the biological system. The assay involves providing a plurality of molecules and compounds, combining a polynucleotide or a composition of the invention with the plurality of molecules and compounds under conditions to allow specific binding, and detecting specific binding to identify at least one molecule or compound which specifically binds at least one polynucleotides of the invention.
- Similarly the proteins, or portions thereof, can be used to screen a plurality or library of molecules or compounds in any of a variety of screening assays to identify a ligand. The protein employed in such screening can be free in solution, affixed to an abiotic substrate or expressed on the external, or a particular internal surface, of a bacterial, or other, cell. Specific binding between the protein and the ligand can be measured. The assay can be used to screen aptamers, DNA molecules, RNA molecules, peptide nucleic acids, peptides, mimetics, ribozymes, proteins, antibodies, agonists, antagonists, immunoglobulins, inhibitors, pharmaceutical agents or drug compounds and the like, which specifically bind the protein. One method for high throughput screening using very small assay volumes and very small amounts of test compound is described in Burbaum et al. U.S. Pat. No. 5,876,946, incorporated herein by reference, which screens large numbers of molecules for enzyme inhibition or receptor binding.
- In one preferred embodiment, the polynucleotides are used for diagnostic purposes to determine the differential expression of a gene in a sample. The polynucleotide consists of complementary RNA and DNA molecules, branched nucleic acids, and/or PNAs. In one alternative, the polynucleotides are used to detect and quantify gene expression in biopsied samples in which differential expression of the polynucleotide indicates the presence of a disorder. In another alternative, the polynucleotide can be used to detect genetic polymorphisms associated with a disease or disorder. In a preferred embodiment, these polymorphisms are detected in an mRNA transcribed from an endogenous gene.
- In another preferred embodiment, the polynucleotide is used as a probe. Specificity of the probe is determined by whether it is made from a unique region, a regulatory region, or from a region encoding a conserved motif. Both probe specificity and the stringency of the diagnostic hybridization or amplification will determine whether the probe identifies only naturally occurring, exactly complementary sequences, allelic variants, or related sequences. Probes designed to detect related sequences should preferably have at least 50% sequence identity to at least a fragment of a polynucleotide of the invention.
- Methods for producing hybridization probes include the cloning of nucleic acid sequences into vectors for the production of RNA probes. Such vectors are known in the art, are commercially available, and can be used to synthesize RNA probes in vitro by adding RNA polymerases and labeled nucleotides. Probes can incorporate nucleotides labeled by a variety of reporter groups including, but not limited to, radionuclides such as32P or 35S, enzymatic labels such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, fluorescent labels such as Cy3 and Cy5, and the like. The labeled polynucleotides can be used in Southern or northern analysis, dot blot, or other membrane-based technologies, on chips or other substrates, and in PCR technologies. Hybridization probes are also useful in mapping the naturally occurring genomic sequence. Fluorescent in situ hybridization (FISH) can be correlated with other physical chromosome mapping techniques and genetic map data as described in Heinz-Ulrich et al. (In: Meyers, supra, pp. 965-968). In many cases, genomic context helps identify genes that encode a particular protein family. (See, e.g., Kirschning et al. (1997) Genomics 46:416-25.).
- The polynucleotide can be labeled using standard methods and added to a sample from a subject under conditions for the formation and detection of hybridization complexes. After incubation the sample is washed, and the signal associated with complex formation is quantitated and compared with at least one standard value. Standard values are derived from any control sample, typically one that is free of the suspect disorder and from one that represents a single, specific and preferably, staged disorder. If the amount of signal in the subject sample is distinguishable from the standards, then differential expression in the subject sample indicates the presence of the disorder. Qualitative and quantitative methods for comparing complex formation in subject samples with previously established standards are well known in the art.
- Such assays can also be used to evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the treatment of an individual subject. Once the presence of the disorder has been established and a treatment protocol is initiated, hybridization, amplification, or antibody assays can be repeated on a regular basis to determine when gene or protein expression in the patient begins to approximate that which is observed in a healthy subject. The results obtained from successive assays can be used to show the efficacy of treatment over a period ranging from several hours, e.g. in the case of toxic shock, to many years, e.g. in the case of osteoarthritis.
- The polynucleotides can be used on a substrate such as a microarray to monitor gene expression, to identify splice variants, mutations, and polymorphisms. Information derived from analyses of expression patterns can be used to determine gene function, to understand the genetic basis of a disease, to diagnose a disorder, and to develop and monitor the activities of therapeutic agents used to treat a disorder. Microarrays can also be used to detect genetic diversity, single nucleotide polymorphisms, which may characterize a particular population, at the genomic level.
- In another embodiment, antibodies or Fabs comprising an antigen binding site that specifically binds the protein can be used for the diagnosis of diseases characterized by the differential expression of the protein. A variety of protocols for measuring protein expression, including ELISAs, RIAs, FACS and antibody arrays, are well known in the art and provide a basis for diagnosing differential or abnormal levels of expression. Standard values for protein expression parallel those reviewed above for nucleotide expression. The amount of complex formation can be quantitated by various methods, preferably by photometric means. Quantities of the protein expressed in subject samples are compared with standard values. Deviation between standard and subject values establishes the parameters for diagnosing or monitoring a particular disorder. Alternatively, one can use competitive drug screening assays in which neutralizing antibodies capable of binding specifically with the protein compete with a test compound. Antibodies can be used to detect the presence of any peptide which shares one or more epitopes or antigenic determinants with the protein. In one aspect, the antibodies of the present invention can be used for treatment, delivery of therapeutics, or monitoring therapy for pancreatic disorders.
- In another aspect, the polynucleotide, or its complement, can be used therapeutically for the purpose of expressing mRNA and protein, or conversely to block transcription or translation of the mRNA. Expression vectors can be constructed using elements from retroviruses, adenoviruses, herpes or vaccinia viruses, or bacterial plasmids, and the like. These vectors can be used for delivery of nucleotide sequences to a particular target cell population, tissue, or organ. Methods well known to those skilled in the art can be used to construct vectors to express the polynucleotides or their complements. (See, e.g., Maulik et al. (1997)Molecular Biotechnology, Therapeutic Applications and Strategies, Wiley-Liss, New York N.Y.) Alternatively, the polynucleotide or its complement, can be used for somatic cell or stem cell gene therapy. Vectors can be introduced in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors are introduced into stem cells taken from the subject, and the resulting transgenic cells are clonally propagated for autologous transplant back into that same subject. Delivery of the polynucleotide by transfection, liposome injections, or polycationic amino polymers can be achieved using methods which are well known in the art. (See, e.g., Goldman et al. (1997) Nature Biotechnology 15:462-466.) Additionally, endogenous gene expression can be inactivated using homologous recombination methods which insert an inactive gene sequence into the coding region or other targeted region of the genome. (See, e.g. Thomas et al. (1987) Cell 51: 503-512.).
- Vectors containing the polynucleotide can be transformed into a cell or tissue to express a missing protein or to replace a nonfunctional protein. Similarly a vector constructed to express the complement of the polynucleotide can be transformed into a cell to down-regulate protein expression. Complementary or antisense sequences can consist of an oligonucleotide derived from the transcription initiation site; nucleotides between about positions −10 and +10 from the ATG are preferred. Similarly, inhibition can be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it causes inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have been described in the literature. (See, e.g., Gee et al. In: Huber and Carr (1994)Molecular and Immunologic Approaches, Futura Publishing, Mt. Kisco N.Y., pp. 163-177.).
- Ribozymes, enzymatic RNA molecules, can also be used to catalyze the cleavage of mRNA and decrease the levels of particular mRNAs, such as those comprising the polynucleotides of the invention. (See, e.g., Rossi (1994) Current Biology 4: 469-471.) Ribozymes can cleave mRNA at specific cleavage sites. Alternatively, ribozymes can cleave mRNAs at locations dictated by flanking regions that form complementary base pairs with the target mRNA. The construction and production of ribozymes is well known in the art and is described in Meyers (supra).
- RNA molecules can be modified to increase intracellular stability and half-life. Possible modifications include, but are not limited to, the addition of flanking sequences at the 5′ and/or 3′ ends of the molecule, or the use of phosphorothioate or 2′ O-methyl rather than phosphodiester linkages within the backbone of the molecule. Alternatively, nontraditional bases such as inosine, queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases, can be included.
- Further, an antagonist, or an antibody that binds specifically to the protein can be administered to a subject to treat a pancreatic disorder. The antagonist, antibody, or fragment can be used directly to inhibit the activity of the protein or indirectly to deliver a therapeutic agent to cells or tissues which express the protein. The therapeutic agent can be a cytotoxic agent selected from a group including, but not limited to, abrin, ricin, doxorubicin, daunorubicin, taxol, ethidium bromide, mitomycin, etoposide, tenoposide, vincristine, vinblastine, colchicine, dihydroxy anthracin dione, actinomycin D, diphteria toxin, Pseudomonas exotoxin A and 40, radioisotopes, and glucocorticoid.
- Antibodies to the protein can be generated using methods that are well known in the art. Such antibodies can include, but are not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and fragments produced by a Fab expression library. Neutralizing antibodies, such as those which inhibit dimer formation, are especially preferred for therapeutic use. Monoclonal antibodies to the protein can be prepared using any technique which provides for the production of antibody molecules by continuous cell lines in culture. These include, but are not limited to, the hybridoma, the human B-cell hybridoma, and the EBV-hybridoma techniques. In addition, techniques developed for the production of chimeric antibodies can be used. (See, e.g., Pound (1998)Immunochemical Protocols, Methods Mol Biol Vol. 80.) Alternatively, techniques described for the production of single chain antibodies can be employed. Fabs which contain specific binding sites for the protein can also be generated. Various immunoassays can be used to identify antibodies having the desired specificity. Numerous protocols for competitive binding or immunoradiometric assays using either polyclonal or monoclonal antibodies with established specificities are well known in the art.
- Yet further, an agonist of the protein can be administered to a subject to treat a disorder associated with decreased expression, longevity or activity of the protein.
- An additional aspect of the invention relates to the administration of a pharmaceutical or sterile composition, in conjunction with a pharmaceutically acceptable carrier, for any of the therapeutic applications discussed above. Such pharmaceutical compositions can consist of the protein or antibodies, mimetics, agonists, antagonists, or inhibitors of the protein. The compositions can be administered alone or in combination with at least one other agent, such as a stabilizing compound, which can be administered in any sterile, biocompatible pharmaceutical carrier including, but not limited to, saline, buffered saline, dextrose, and water. The compositions can be administered to a subject alone or in combination with other agents, drugs, or hormones.
- The pharmaceutical compositions utilized in this invention can be administered by any number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, sublingual, or rectal means.
- In addition to the active ingredients, these pharmaceutical compositions can contain pharmaceutically-acceptable carriers comprising excipients and auxiliaries which facilitate processing of the active compounds into preparations which can be used pharmaceutically. Further details on techniques for formulation and administration can be found in the latest edition ofRemington's Pharmaceutical Sciences (Mack Publishing, Easton Pa.).
- For any compound, the therapeutically effective dose can be estimated initially either in cell culture assays or in animal models such as mice, rats, rabbits, dogs, or pigs. An animal model can also be used to determine the concentration range and route of administration. Such information can then be used to determine useful doses and routes for administration in humans.
- A therapeutically effective dose refers to that amount of active ingredient which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity can be determined by standard pharmaceutical procedures in cell cultures or with experimental animals, such as by calculating and contrasting the ED50 (the dose therapeutically effective in 50% of the population) and LD50 (the dose lethal to 50% of the population) statistics. Any of the therapeutic compositions described above can be applied to any subject in need of such therapy, including, but not limited to, mammals such as dogs, cats, cows, horses, rabbits, monkeys, and most preferably, humans.
- Stem Cells and Their Use
- SEQ ID NOs:1-13 can be useful in the differentiation of stem cells. Eukaryotic stem cells are able to differentiate into the multiple cell types of various tissues and organs and to play roles in embryogenesis and adult tissue regeneration (Gearhart (1998) Science 282:1061-1062; Watt and Hogan (2000) Science 287:1427-1430). Depending on their source and developmental stage, stem cells can be totipotent with the potential to create every cell type in an organism and to generate a new organism, pluripotent with the potential to give rise to most cell types and tissues, but not a whole organism; or multipotent cells with the potential to differentiate into a limited number of cell types. Stem cells can be transfected with polynucleotides which can be transiently expressed or can be integrated within the cell as transgenes.
- Embryonic stem (ES) cell lines are derived from the inner cell masses of human blastocysts and are pluripotent (Thomson et al. (1998) Science 282:1145-1147). They have normal karyotypes and express high levels of telomerase which prevent senescence and allow the cells to replicate indefinitely. ES cells produce derivatives that give rise to embryonic epidermal, mesodermal and endodermal cells. Embryonic germ (EG) cell lines, which are produced from primordial germ cells isolated from gonadal ridges and mesenteries, also show stem cell behavior (Shamblott et al. (1998) Proc Natl Acad Sci 95:13726-13731). EG cells have normal karyotypes and appear to be pluripotent.
- Organ-specific adult stem cells differentiate into the cell types of the tissues from which they were isolated. They maintain their original tissues by replacing cells destroyed from disease or injury. Adult stem cells are multipotent and under proper stimulation can be used to generate cell types of various other tissues (Vogel (2000) Science 287:1418-1419). Hematopoietic stem cells from bone marrow provide not only blood and immune cells, but can also be induced to transdifferentiate to form brain, liver, heart, skeletal muscle and smooth muscle cells. Similarly mesenchymal stem cells can be used to produce bone marrow, cartilage, muscle cells, and some neuron-like cells, and stem cells from muscle have the ability to differentiate into muscle and blood cells (Jackson et al. (1999) Proc Natl Acad Sci 96:14482-14486). Neural stem cells, which produce neurons and glia, can also be induced to differentiate into heart, muscle, liver, intestine, and blood cells (Kuhn and Svendsen (1999) BioEssays 21:625-630); Clarke et al. (2000) Science 288:1660-1663; Gage (2000) Science 287:1433-1438; and Galli et al. (2000) Nature Neurosci 3:986-991).
- Neural stem cells can be used to treat neurological disorders such as Alzheimer's disease, Parkinson's disease, and multiple sclerosis and to repair tissue damaged by strokes and spinal cord injuries. Hematopoietic stem cells can be used to restore immune function in immunodeficient patients or to treat autoimmune disorders by replacing autoreactive immune cells with normal cells to treat diseases such as multiple sclerosis, scleroderma, rheumatoid arthritis, and systemic lupus erythematosus. Mesenchymal stem cells can be used to repair tendons or to regenerate cartilage to treat arthritis. Liver stem cells can be used to repair liver damage. Pancreatic stem cells can be used to replace islet cells to treat diabetes. Muscle stem cells can be used to regenerate muscle to treat muscular dystrophies (Fontes and Thomson (1999) BMJ 319:1-3; Weissman (2000) Science 287:1442-1446 Marshall (2000) Science 287:1419-1421; and Marmont (2000) Ann Rev Med 51:115-134).
- It is to be understood that this invention is not limited to the particular devices, machines, materials and methods described. Although particular embodiments are described, equivalent embodiments can be used to practice the invention. The described embodiments are provided to illustrate the invention and are not intended to limit the scope of the invention which is limited only by the appended claims.
- I cDNA Library Construction
- The cDNA library, PANCNOT05, was selected as an example to demonstrate the construction of the cDNA libraries from which the sequences used to identify genes associated with pancreatic disorders were derived. The PANCNOT05 cDNA library was constructed from cytologically normal pancreas tissue obtained from a 2-year-old Hispanic male who died of cerebral anoxia.
- The frozen tissue was homogenized and lysed using a POLYTRON homogenizer (Brinkmann Instruments, Westbury N.J.) in guanidinium isothiocyanate solution. The lysate was centrifuged over a 5.7 M CsCl cushion using an SW28 rotor in an L8-70M ultracentrifuge (BeckmanCoulter, Fullerton Calif.) for 18 hours at 25,000 rpm at ambient temperature. The RNA was extracted with acid phenol, pH 4.0, precipitated using 0.3 sodium acetate and 2.5 volumes of ethanol, resuspended in RNAse-free water, and DNAse treated at 37 C. RNA extraction and precipitation were repeated as before. The mRNA was isolated using the OLIGOTEX kit (Qiagen, Chatsworth Calif.) and used to construct the cDNA library.
- The mRNA was handled according to the recommended protocols in the SUPERSCRIPT plasmid system (Life Technologies). cDNAs were fractionated on a SEPHAROSE CL4B column (Amersham Pharmacia Biotech), and those cDNAs exceeding 400 bp were ligated into pSport I plasmid. The plasmid was subsequently transformed into DH5a competent cells (Life Technologies).
- II Isolation and Sequencing of cDNA Clones
- Plasmid DNA was released from the bacterial cells and purified using the REAL PREP 96 plasmid kit (Qiagen). This kit enabled the simultaneous purification of 96 samples in a 96-well block using multi-channel reagent dispensers. The recommended protocol was employed except for the following changes: 1) the bacteria were cultured in 1 ml of sterile TERRIFIC BROTH (BD Biosciences, San Jose Calif.) with carbenicillin at 25 mg/L and glycerol at 0.4%; 2) the cultures were incubated for 19 hours after inoculation and the cells were lysed in 0.3 ml of lysis buffer; and 3) the plasmid DNA pellet was precipitated in isopropanol and then resuspended in 0.1 ml of distilled water. After the last step in the protocol, samples were transferred to a 96-well block for storage at 4 C.
- The cDNAs were prepared using a MICROLAB 2200 system (Hamilton, Reno Nev.) in combination with DNA ENGINE thermal cyclers (MJ Research, Watertown Mass.). The cDNAs were sequenced by the method of Sanger and Coulson (1975; J Mol Biol 94:441-448) using ABI PRISM 377 DNA sequencing systems (Applied Biosystems). Most of the cDNAs were sequenced using standard ABI protocols and kits at solution volumes of 0.25×-1.0×. In the alternative, some of the cDNAs were sequenced using solutions and dyes from APB.
- III Selection, Assembly, and Characterization of Sequences
- The polynucleotides used for co-expression analysis were assembled from EST sequences, 5′ and 3′ long read sequences, and full length coding sequences. Of the 41,419 assembled sequences used in the analysis, each was expressed in at least five cDNA libraries.
- The assembly process is described as follows. EST sequence chromatograms were processed and verified. Quality scores were obtained using PHRED (Ewing et al. (1998) Genome Res 8:175-185; Ewing and Green (1998) Genome Res 8:186-194), and edited sequences were loaded into a relational database management system (RDBMS). The sequences were clustered using BLAST with a product score of 50. All clusters of two or more sequences created a bin which represents one transcribed gene.
- Assembly of the component sequences within each bin was performed using a modification of Phrap, a publicly available program for assembling DNA fragments (Green, P. University of Washington, Seattle Wash.). Bins that showed 82% identity from a local pair-wise alignment between any of the consensus sequences were merged.
- Bins were annotated by screening the consensus sequence in each bin against public databases, such as GBpri and GenPept from NCBI. The annotation process involved a FASTn screen against the GBpri database in GenBank. Those hits with a percent identity of greater than or equal to 75% and an alignment length of greater than or equal to 100 base pairs were recorded as homolog hits. The residual unannotated sequences were screened by FASTx against GenPept. Those hits with an E value of less than or equal to 10−8 were recorded as homolog hits.
- Sequences were then reclustered using BLASTn and Cross-Match, a program for rapid amino acid and nucleic acid sequence comparison and database search (Green, supra), sequentially. Any BLAST alignment between a sequence and a consensus sequence with a score greater than 150 was realigned using cross-match. The sequence was added to the bin whose consensus sequence gave the highest Smith-Waterman score (Smith et al. (1992) Protein Engineering 5:35-51) amongst local alignments with at least 82% identity. Non-matching sequences were moved into new bins, and assembly processes were repeated.
- IV Homology Searching of Polynucleotides and their Encoded Proteins
- The polynucleotides of the Sequence Listing or their encoded proteins were used to query databases such as GenBank, SwissProt, BLOCKS, and the like. These databases that contain previously identified and annotated sequences or domains were searched using BLAST or BLAST 2 (Altschul et al. supra; Altschul, supra) to produce alignments and to determine which sequences were exact matches or homologs. The alignments were to sequences of prokaryotic (bacterial) or eukaryotic (animal, fungal, or plant) origin. Alternatively, algorithms such as the one described in Smith and Smith (1992, Protein Engineering 5:35-51) could have been used to deal with primary sequence patterns and secondary structure gap penalties. All of the sequences disclosed in this application have lengths of at least 49 nucleotides, and no more than 12% uncalled bases (where N is recorded rather than A, C, G, or T).
- As detailed in Karlin and Altschul (1993; Proc Natl Acad Sci 90:5873-5877), BLAST matches between a query sequence and a database sequence were evaluated statistically and only reported when they satisfied the threshold of 10−25 for nucleotides and 10−14 for peptides. Homology was also evaluated by product score calculated as follows: the % nucleotide or amino acid identity [between the query and reference sequences] in BLAST is multiplied by the % maximum possible BLAST score [based on the lengths of query and reference sequences] and then divided by 100. In comparison with hybridization procedures used in the laboratory, the electronic stringency for an exact match was set at 70, and the conservative lower limit for an exact match was set at approximately 40 (with 1-2% error due to uncalled bases).
- The BLAST software suite, freely available sequence comparison algorithms (NCBI, Bethesda Md.; http://www.ncbi.nlm.nih.gov/gorf/bl2.html), includes various sequence analysis programs including “blastn” that is used to align nucleic acid molecules and BLAST 2 that is used for direct pairwise comparison of either nucleic or amino acid molecules. BLAST programs are commonly used with gap and other parameters set to default settings, e.g.: Matrix: BLOSUM62; Reward for match: 1; Penalty for mismatch: −2; Open Gap: 5 and Extension Gap: 2 penalties; Gap x drop-off: 50; Expect: 10; Word Size: 11; and Filter: on. Identity or similarity is measured over the entire length of a sequence or some smaller portion thereof. Brenner et al. (1998; Proc Natl Acad Sci 95:6073-6078, incorporated herein by reference) analyzed the BLAST for its ability to identify structural homologs by sequence identity and found 30% identity is a reliable threshold for sequence alignments of at least 150 residues and 40%, for alignments of at least 70 residues.
- The polynucleotides of this application were compared with assembled consensus sequences or templates found in the LIFESEQ GOLD database. Component sequences from cDNA, extension, full length, and shotgun sequencing projects were subjected to PHRED analysis and assigned a quality score. All sequences with an acceptable quality score were subjected to various pre-processing and editing pathways to remove low quality 3′ ends, vector and linker sequences, polyA tails, Alu repeats, mitochondrial and ribosomal sequences, and bacterial contamination sequences. Edited sequences had to be at least 50 bp in length, and low-information sequences and repetitive elements such as dinucleotide repeats, Alu repeats, and the like, were replaced by “Ns” or masked.
- Edited sequences were subjected to assembly procedures in which the sequences were assigned to polynucleotide bins. Each sequence could only belong to one bin, and sequences in each bin were assembled to produce a template. Newly sequenced components were added to existing bins using BLAST and CROSSMATCH. To be added to a bin, the component sequences had to have a BLAST quality score greater than or equal to 150 and an alignment of at least 82% local identity. The sequences in each bin were assembled using PHRAP. Bins with several overlapping component sequences were assembled using DEEP PHRAP. The orientation of each template was determined based on the number and orientation of its component sequences.
- Bins were compared to one another and those having local similarity of at least 82% were combined and reassembled. Bins having templates with less than 95% local identity were split. Templates were subjected to analysis by STTCHER/EXON MAPPER algorithms that analyze the probabilities of the presence of splice variants, alternatively spliced exons, splice junctions, differential expression of alternative spliced genes across tissue types or disease states, and the like. Assembly procedures were repeated periodically, and templates were annotated using BLAST against GenBank databases such as GBpri. An exact match was defined as having from 95% local identity over 200 base pairs through 100% local identity over 100 base pairs and a homolog match as having an E-value (or probability score) of ≦1×10−8. The templates were also subjected to frameshift FASTx against GENPEPT, and homolog match was defined as having an E-value of ≦1×10−8. Template analysis and assembly was described in U.S. Ser. No. 09/276,534, filed Mar. 25, 1999.
- Following assembly, templates were subjected to BLAST, motif, and other functional analyses and categorized in protein hierarchies using methods described in U.S. Ser. No. 08/812,290 and U.S. Ser. No. 08/811,758, both filed Mar. 6, 1997; in U.S. Ser. No. 08/947,845, filed Oct. 9, 1997; and in U.S. Ser. No. 09/034,807, filed Mar. 4, 1998. Then templates were analyzed by translating each template in all three forward reading frames and searching each translation against the PFAM database of hidden Markov model-based protein families and domains using the HMMER software package (Washington University School of Medicine, St. Louis Mo.; http://pfam.wustl.edu/).
- The polynucleotide was further analyzed using MACDNASIS PRO software (Hitachi Software Engineering), and LASERGENE software (DNASTAR) and queried against public databases such as the GenBank rodent, mammalian, vertebrate, prokaryote, and eukaryote databases, SwissProt, BLOCKS, PRINTS, PFAM, and Prosite.
- V Description of Genes Known to be Associated with insulin Synthesis
- Eight genes known to be associated with insulin synthesis were selected to identify co-expressing novel polynucleotides. They are described below.
Gene Description & references Preproinsulin Precursor for insulin, a peptide hormone synthesized in the beta islet cells of the pancreas. Insulin regulates serum glucose (Darnell et al. (1990) Molecular Cell Biology, WH Freeman, New York NY, p. 743). Proglucagon Precursor for glucagon, a peptide hormone synthesized in the pancreas and intestines. Glucagon increases serum glucose levels by inducing the liver to produce and release glucose, thus counter-acting the effects of insulin. (Darnell et al. (supra) p. 743). Reg Regenerating (Reg) gene family (Alternate name: lithostathine) whose members include Reg-1 alpha, Reg-1 beta, and Reg-related protein. (Miyashita et al. (1995) FEBS Lett 377:429-33). Reg stimulates growth of the beta islet cells; and its expression is correlated with insulin expression (Baeza et al. (1996) Diabetes Metab 22:229-34). Reg-1 alpha is an effective therapy for diabetes in mice, in combination with the inimunoregulator drug linomide. (Gross et al. (1998) Endocrinology 139: 2369-74). Lipase Pancreatic lipase expression is elevated in diabetes and restored to normal levels by insulin (Tsai et al. (1994) Am J Physiol 267:G575-83; Sztalryd and Kraemer (1995) Metabolism 44:1391-6). Colipase Colipase is a pancreatic exocrine protein whose synthesis increases in diabetic rats; synthesis of colipase is inhibited by insulin (Duan et al. (1991) Pancreas 6:595-602; Duan and Erlanson-Albertsson (1992) Pancreas 7:465-71). HiAPP Human islet amyloid polypeptide (HiAPP) is a hormone-like peptide expressed in the insulin-producing beta cells of the endocrine pancreas (Nishi et al. (1989) Mol Endocrinol 3:1775-81). - VI Co-expression Among Known Marker Genes and Novel Polynucleotides
- The co-expression of the eight known genes, designated 1-8 on both the horizontal and vertical axes, with each other is shown below. The numbers in the table are the negative log of the p-value (−log p) for the co-expression between two genes. For example, reading the values at the intersection of the horizontal and vertical designations for each set, the co-expression between insulin (3) and colipase (2) at a p-value of 17, and between glucagon (7) and colipase (2), at a p-value of 11, are both very highly significant. The fact that co-expression analysis successfully identified the strong associations among the known genes validates the GBA or co-expression method for identifying polynucleotides that are co-expressed with the known genes. The degree of association was measured by probability values, and the threshold probability used in this analysis was less than 0.0001.
- Using the LIFESEQ GOLD database (Incyte Genomics), the method identified novel polynucleotides from among a total of 41,419 assembled polynucleotides that showed highly significant association with the known genes. The process was reiterated until the number of polynucleotides was reduced to the final thirteen polynucleotides shown below. The tabular entries show the p-value (−log p) for the co-expression between each known marker gene and each novel polynucleotide. The novel polynucleotides are identified in the table by their Incyte clone numbers and the known genes their abbreviated names as shown in Example IV above. For each polynucleotide, the p-value is the probability that the observed co-expression is due to chance, using the Fisher Exact Test.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 1 Lipase 2 Colipase 11 3 Insulin 11 17 4 Reg-1 beta 5 5 5 5 Reg-1 alpha 9 10 12 5 6 Reg-related 7 6 6 7 6 7 Glucagon 9 11 16 5 10 6 8 HiAPP 5 4 4 7 4 6 4 9 2091133 5 4 4 4 2 4 2 10 3836037 5 5 5 4 5 4 5 4 4 11 3833667 5 5 5 4 5 4 5 4 4 7 12 3664676 3 5 5 0 5 0 5 0 0 2 2 13 3835361 5 5 5 2 5 2 5 2 2 4 4 4 14 884692 3 5 5 2 5 2 5 2 2 2 2 4 4 15 2383628 14 16 16 5 10 7 12 5 5 5 5 3 5 3 16 888246 7 6 6 4 4 4 4 4 6 7 7 2 4 2 7 17 2774542 8 7 7 4 7 6 8 4 4 4 4 2 4 2 9 6 18 888309 5 5 5 4 5 4 5 4 4 7 7 2 4 2 5 7 4 19 951335 12 11 11 5 10 7 8 4 4 5 5 3 5 3 13 7 8 5 20 2777115 11 10 10 3 7 3 7 3 5 6 6 3 6 3 12 8 7 6 10 21 2075919 11 12 12 5 7 7 7 7 7 5 5 3 5 3 12 7 9 5 13 8 - The highest co-expression value is obtained when the highest p-value found along the horizontal line following each SEQ ID NO (clone number) is correlated with a known marker gene (numbers 1-8 along the top line of the table). For example, clone number 2383628 (number 15), has a p-value of 14 as it co-expresses with lipase (number 1) and a p-value of 16 as it co-expresses with colipase (number 2); these values greatly exceed the threshold p-value for this experiment and are very highly significant. The data above can be summarized by reducing it to a single highest co-expression (−log p) value for each intersecting known gene and unknown polynucleotide and naming at least one pancreatic disorder associated with expression of the known gene. The summary table shown below:
% p- SEQ Incyte specif- Gene value* ID clone Pancreatic Disorder icity** colipase 11 1 223163CT1 type 1 diabetes 77 insulin 5 2 884692CB1 type 1 diabetes 100 lipase 7 3 888246CB1 type 1 diabetes 99 insulin 5 4 888309CB1 type 1 diabetes 100 lipase 12 5 951335CB1 type 1 diabetes 99 HiAPP 6 6 2091133CT1 type 1 diabetes 92 colipase 16 7 2383628CB1 type 1 diabetes 95 glucagon 8 8 2774542CB1 islet cell hyperplasia 47 lipase 11 9 2777115CB1 type 1 diabetes 100 glucagon 5 10 3664676CB1 islet cell hyperplasia 100 insulin 5 11 3833667CB1 type 1 diabetes 96 colipase 5 12 3835361CB1 type 1 diabetes 100 reg-1 5 13 3836037CB1 cerebral anoxia 97 alpha - VII Novel Polynucleotides Identified Using GBA
- Using the method of Walker (supra), thirteen polynucleotides that exhibit strong association, or co-expression, with known genes that regulate, respond to, or participate in insulin synthesis have been identified.
- Polynucleotides comprising the nucleic acid sequences of SEQ ID NOs: 1-13 of the present invention were first identified as Incyte Clones 223163, 884692, 888246, 888309, 951335, 2091133, 2383628, 2774542, 2777115, 3664676, 3833667, 3835361, and 3836037, respectively; and assembled according to Example III. As described in Example IV, BLAST and other motif searches were performed for each sequence. SEQ ID NOs:1-13 were translated, and sequence identity with known sequences was sought. SEQ ID NOs:14 and 15 of the present invention were encoded by SEQ ID NOs: 1 and 8, respectively. SEQ ID NOs: 14 and 15 were also analyzed using BLAST and motif search tools, and the results of these analyses are described below.
- SEQ ID NO:2 is 924 nucleic acids in length and has about 92% identity from about nucleotide 211 to about nucleotide 923 with a gene that encodes human pancreatic zymogen granule membrane protein, GP-2 (gl2445 11) and about 96% match from about nucleotide 923 to about nucleotide 594 with a gene that encodes a human zinc finger protein, ZNF133 (g487782). GP-2 is a 75 kDa glycoprotein released from the membrane of mature zymogen granules by an enzymatic mechanism. The C-terminal region of GP-2 exhibit 26 conserved cysteine residues and includes one epidermal growth factor motif. ZNF133 is a protein that belongs to the human zinc finger Kruppel family and contains a Kruppel-associated box segment. ZNF133 was localized to chromosome 20p 11.2 that is close to the deleted region that characterizes Alagille syndrome.
- SEQ ID NO:3 is about 845 nucleotides in length; it shows about 80% identity from about nucleotide 560 to about nucleotide 840 with a complete coding sequence for human protamine 1, protamine 2 and transition protein 2 (g642458) and about 86% identity with a gene that encodes TXA2 gene (EP 490410). TXA2 is a unstable arachidonate metabolite that functions as a potent stimulator of platelet aggregation and a constrictor of vascular and respiratory smooth muscle.
- SEQ ID NO:7 is 646 nucleotides in length and shows 77% identity from about nucleotide 1 to about nucleotide 402 with a rat mRNA that encodes syncollin, a secretory granule protein that binds to syntaxin in a Ca++-sensitive manner and functions as a regulator of exocytosis in exocrine tissues (g2258437).
- SEQ ID NO:12 is 874 nucleotides in length and shows 98% identity from about nucleotide 363 to about nucleotide 873 with a gene that encodes human pancreatic zymogen granule membrane protein, GP-2 mRNA (gl244511). SEQ ID NO:12 also exhibits 99% identity from about nucleotide 432 to about nucleotide 924 with SEQ ID NO:2. Therefore, SEQ ID NO:2 and SEQ ID NO: 12 are potential splice variants with related cellular functions.
- SEQ ID NO: 1 is 1966 nucleotides in length and shows 77% identity from nucleotide 1 to about nucleotide 1930 with an mRNA that encodes a rat uterus-ovary specific trans-membrane protein (g2460315). This uterus-ovary specific rat protein is expressed upon induction by estrogen. SEQ ID NO: 14, an amino acid sequence encoded by SEQ ID NO:1, is 585 amino acid residues in length and shows about 74% identity from about amino acid residue 22 to about amino acid residue 608 with the rat uterus-ovary specific trans-membrane protein (g2460316). SEQ ID NO:14 also exhibits a transmembrane domain encompassing amino acid residues 576 to 593. Motif analysis shows that SEQ ID NO:14 has eight potential N-glycosylation sites at N30, N58, N68, N149, N272, N371, N395, and N420; twelve potential casein kinase II phosphorylation sites at T23, S109, S290, S349, S372, T380, T409, S464, S521, T557, T613, and T632; three N-myristoylation sites at G21, G29, and G39; thirteen potential protein kinase C phosphorylation sites at T45, S70, S132, S255, S280, T308, T328, T442, T468, S521, S527, T589, and T643; and three potential tyrosine kinase phosphorylation sites at Y180, Y415, and Y528.
- SEQ ID NO:8 is 1354 nucleotides in length and shows 99% identity with the human mRNA that codes for AQP8 (g2346968), a member of a family of water channel proteins identified from rat testis that contains the conserved transmembrane domains of the major intrinsic protein (MIP) family. SEQ ID NO:15, the amino acid sequence encoded by SEQ ID NO:8, is 255 amino acids in length and shows 100% sequence identity with AQP8. BLIMPS analysis shows that SEQ ID NO: 15 has six conserved amino acid segments that match the conserved transmembrane domains of the MIP family proteins. These segments encompass amino acid residues 30 to 49, 66 to 90, 103 to 122, 154 to 172, 185 to 207, and 222 to 242.
- VIII Hybridization Technologies and Analyses
- Immobilization of Polynucleotides on a Substrate
- The polynucleotides are applied to a substrate by one of the following methods. A mixture of polynucleotides is fractionated by gel electrophoresis and transferred to a nylon membrane by capillary transfer. Alternatively, the polynucleotides are individually ligated to a vector and inserted into bacterial host cells to form a library. The polynucleotides are then arranged on a substrate by one of the following methods. In the first method, bacterial cells containing individual clones are robotically picked and arranged on a nylon membrane. The membrane is placed on LB agar containing selective agent (carbenicillin, kanamycin, ampicillin, or chloramphenicol depending on the vector used) and incubated at 37 C for 16 hr. The membrane is removed from the agar and consecutively placed colony side up in 10% SDS, denaturing solution (1.5 M NaCl, 0.5 M NaOH), neutralizing solution (1.5 M NaCl, 1 M Tris-HCl, pH 8.0), and twice in 2× SSC for 10 min each. The membrane is then UV irradiated in a STRATALINKER UV-crosslinker (Stratagene).
- In the second method, polynucleotides are amplified from bacterial vectors by thirty cycles of PCR using primers complementary to vector sequences flanking the insert. PCR amplification increases a starting concentration of 1-2 ng nucleic acid to a final quantity greater than 5 μg. Amplified nucleic acids from about 400 bp to about 5000 bp in length are purified using SEPHACRYL-400 beads (APB). Purified nucleic acids are arranged on a nylon membrane manually or using a dot/slot blotting manifold and suction device and are immobilized by denaturation, neutralization, and UV irradiation as described above. Purified nucleic acids are robotically arranged and immobilized on polymer-coated glass slides using the procedure described in U.S. Pat. No. 5,807,522. Polymer-coated slides are prepared by cleaning glass microscope slides (Corning, Acton Mass.) by ultrasound in 0.1% SDS and acetone, etching in 4% hydrofluoric acid (VWR Scientific Products, West Chester Pa.), coating with 0.05% aminopropyl silane (Sigma-Aldrich) in 95% ethanol, and curing in a 110 C oven. The slides are washed extensively with distilled water between and after treatments. The nucleic acids are arranged on the slide and then immobilized by exposing the array to UV irradiation using a STRATALINKER UV-crosslinker (Stratagene). Arrays are then washed at room temperature in 0.2% SDS and rinsed three times in distilled water. Non-specific binding sites are blocked by incubation of arrays in 0.2% casein in phosphate buffered saline (PBS; Tropix, Bedford Mass.) for 30 min at 60 C; then the arrays are washed in 0.2% SDS and rinsed in distilled water as before.
- Probe Preparation for Membrane Hybridization
- Hybridization probes derived from the polynucleotides of the Sequence Listing are employed for screening cDNAs, mRNAs, or genomic DNA in membrane-based hybridizations. Probes are prepared by diluting the polynucleotides to a concentration of 40-50 ng in 45 μl TE buffer, denaturing by heating to 100 C for five min, and briefly centrifuging. The denatured polynucleotide is then added to a REDIPRIME tube (APB), gently mixed until blue color is evenly distributed, and briefly centrifuged. Five μl of [3P]dCTP is added to the tube, and the contents are incubated at 37 C for 10 min. The labeling reaction is stopped by adding 5 μl of 0.2M EDTA, and probe is purified from unincorporated nucleotides using a PROBEQUANT G-50 microcolumn (APB). The purified probe is heated to 100 C for five min, snap cooled for two min on ice, and used in membrane-based hybridizations as described below.
- Probe Preparation for Polymer Coated Slide Hybridization
- Hybridization probes derived from mRNA isolated from samples are employed for screening polynucleotides of the Sequence Listing in array-based hybridizations. Probe is prepared using the GEMbright kit (Incyte Genomics) by diluting mRNA to a concentration of 200 ng in 9 μl TE buffer and adding 5 μl 5× buffer, 1 μl 0.1 M DTT, 3 μl Cy3 or Cy5 labeling mix, 1 μl RNAse inhibitor, 1 μl reverse transcriptase, and 5 μl 1× yeast control mRNAs. Yeast control mRNAs are synthesized by in vitro transcription from noncoding yeast genomic DNA (W. Lei, unpublished). As quantitative controls, one set of control mRNAs at 0.002 ng, 0.02 ng, 0.2 ng, and 2 ng are diluted into reverse transcription reaction mixture at ratios of 1:100,000, 1:10,000, 1:1000, and 1:100 (w/w) to sample mRNA respectively. To examine mRNA differential expression patterns, a second set of control mRNAs are diluted into reverse transcription reaction mixture at ratios of 1:3, 3:1, 1:10, 10:1, 1:25, and 25:1 (w/w). The reaction mixture is mixed and incubated at 37 C for two hr. The reaction mixture is then incubated for 20 min at 85 C, and probes are purified using two successive CHROMA SPIN+TE 30 columns (Clontech, Palo Alto Calif.). Purified probe is ethanol precipitated by diluting probe to 90 μl in DEPC-treated water, adding 2 μl lmg/ml glycogen, 60 μl 5 M sodium acetate, and 300 μl 100% ethanol. The probe is centrifuged for 20 min at 20,800× g, and the pellet is resuspended in 12 μl resuspension buffer, heated to 65 C for five min, and mixed thoroughly. The probe is heated and mixed as before and then stored on ice. Probe is used in high density array-based hybridizations as described below.
- Membrane-based Hybridization
- Membranes are pre-hybridized in hybridization solution containing 1% Sarkosyl and lx high phosphate buffer (0.5 M NaCl, 0.1 M Na2HPO4, 5 mM EDTA, pH 7) at 55 C for two hr. The probe, diluted in 15 ml fresh hybridization solution, is then added to the membrane. The membrane is hybridized with the probe at 55 C for 16 hr. Following hybridization, the membrane is washed for 15 min at 25 C in 1 mM Tris (pH 8.0), 1% Sarkosyl, and four times for 15 min each at 25 C in lmM Tris (pH 8.0). To detect hybridization complexes, XOMAT-AR film (Eastman Kodak, Rochester N.Y.) is exposed to the membrane overnight at −70 C, developed, and examined visually.
- Polymer Coated Slide-based Hybridization
- Probe is heated to 65 C for five min, centrifuged five min at 9400 rpm in a 5415C microcentrifuge (Eppendorf Scientific, Westbury N.Y.), and then 18 μl are aliquoted onto the array surface and covered with a coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly larger than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 140 μl of 5× SSC in a corner of the chamber. The chamber containing the arrays is incubated for about 6.5 hr at 60 C. The arrays are washed for 10 min at 45 C in 1× SSC, 0.1% SDS, and three times for 10 min each at 45 C in 0.1× SSC, and dried.
- Hybridization reactions are performed in absolute or differential hybridization formats. In the absolute hybridization format, probe from one sample is hybridized to array elements, and signals are detected after hybridization complexes form. Signal strength correlates with probe mRNA levels in the sample. In the differential hybridization format, differential expression of a set of genes in two biological samples is analyzed. Probes from the two samples are prepared and labeled with different labeling moieties. A mixture of the two labeled probes is hybridized to the array elements, and signals are examined under conditions in which the emissions from the two different labels are individually detectable. Elements on the array that are hybridized to equal numbers of probes derived from both biological samples give a distinct combined fluorescence (Shalon WO95/35505).
- Hybridization complexes are detected with a microscope equipped with an INNOVA 70 mixed gas 10 W laser (Coherent, Santa Clara CA) capable of generating spectral lines at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is focused on the array using a 20× microscope objective (Nikon, Melville N.Y.). The slide containing the array is placed on a computer-controlled X-Y stage on the microscope and raster-scanned past the objective with a resolution of 20 micrometers. In the differential hybridization format, the two fluorophores are sequentially excited by the laser. Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, Hamamatsu Photonics Systems, Bridgewater N.J.) corresponding to the two fluorophores. Appropriate filters positioned between the array and the photomultiplier tubes are used to filter the signals. The emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for CyS. The sensitivity of the scans is calibrated using the signal intensity generated by the yeast control mRNAs added to the probe mix. A specific location on the array contains a complementary DNA sequence, allowing the intensity of the signal at that location to be correlated with a weight ratio of hybridizing species of 1:100,000.
- The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital (A/D) conversion board (Analog Devices, Norwood Mass.) installed in an IBM-compatible PC computer. The digitized data are displayed as an image where the signal intensity is mapped using a linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping emission spectra) between the fluorophores using the emission spectrum for each fluorophore. A grid is superimposed over the fluorescence signal image such that the signal from each spot is centered in each element of the grid. The fluorescence signal within each element is then integrated to obtain a numerical value corresponding to the average intensity of the signal. The software used for signal analysis is the GEMTOOLS program (Incyte Genomics).
- IX Transcript Imaging
- The transcript image performed using the LIFESEQ GOLD database (AugOOrel, Incyte Genomics) allows assessment of the relative abundance of expressed genes in one or more cDNA libraries. Criteria for transcript imaging include category, number of cDNAs per library, description of the library, and the like
- All sequences and cDNA libraries in the LIFESEQ database were categorized by system, organ/tissue and cell type. The categories included cardiovascular system, connective tissue, digestive system, embryonic structures, endocrine system, exocrine glands, female and male reproductive, germ cells, hemic/immune system, liver, musculoskeletal system, nervous system, pancreas, respiratory system, sense organs, skin, stomatognathic system, unclassified/mixed, and the urinary tract. For each category, the number of libraries in which the sequence was expressed were counted and shown over the total number of libraries in that category. In some transcript images, all normalized or pooled libraries, which have high copy number sequences removed prior to processing, and all mixed or pooled tissues, which are considered non-specific in that they contain more than one tissue type or more than one subject's tissue, can be excluded from the analysis. Cell lines and/or fetal tissue data can also be disregarded unless the elucidation of inherited disorders would be furthered by their inclusion in the analysis.
- For purposes of example, the transcript image for SEQ ID NO:2 is shown below. No libraries were excluded from the analysis. SEQ ID NO:2 was only expressed in pancreatic tissues, which agrees with the 100% specificity shown in Example VI above, and the transcript image both shows independent confirmation of the results of the co-expression analysis and demonstrates differential expression of SEQ ID NO:2 in type I diabetes. Expression exceeded that of any other diseased pancreas library, including tumor and cytologically normal tissue, by greater than five-fold.
- SEQ ID NO:2 (Category: Pancreas)
Library cDNAs Description Abundance % Abundance PANCNOT23 3920 pancreas, type 9 0.2296 I diabetes, 43F PANCNOT17 4037 pancreas, 2 0.0495 mw/mets neuroendocrine CA of liver, 65F PANCNOT16 2812 pancreas, 1 0.0356 aw/Patau's, fetal, 20wM PANCNOT05 6805 pancreas, 2M 2 0.0294 PANCNOT19 3775 pancreas, 8M 1 0.0265 PANCNOT21 3846 pancreas, 8M 1 0.0260 - X Complementary Molecules
- The complement of the novel polynucleotide, from about 5 bp (e.g., a PNA) to about 5000 bp (e.g., the complement of a cDNA insert), are used to detect or inhibit gene expression. These molecules are selected using LASERGENE software (DNASTAR). Detection is described in Example VIII. To inhibit transcription by preventing promoter binding, the complementary molecule is designed to bind to the most unique 5′ sequence and includes nucleotides of the 5′ UTR upstream of the initiation codon of the open reading frame. Complementary molecules include genomic sequences (such as enhancers or introns) and are used in “triple helix” base pairing to compromise the ability of the double helix to open sufficiently for the binding of polymerases, transcription factors, or regulatory molecules. To inhibit translation, a complementary molecule is designed to prevent ribosomal binding to the mRNA encoding the protein.
- Complementary molecules are placed in expression vectors and used to transform a cell line to test efficacy; into an organ, tumor, synovial cavity, or the vascular system for transient or short term therapy; or into a stem cell, zygote, or other reproducing lineage for long term or stable gene therapy. Transient expression lasts for a month or more with a non-replicating vector and for three months or more if appropriate elements for inducing vector replication are used in the transformation/expression system.
- Stable transformation of appropriate dividing cells with a vector encoding the complementary molecule produces a transgenic cell line, tissue, or organism (U.S. Pat. No. 4,736,866). Those cells that assimilate and replicate sufficient quantities of the vector to allow stable integration also produce enough complementary molecules to compromise or entirely eliminate activity of the polynucleotide encoding the protein.
- XI Protein Expression
- Expression and purification of the protein are achieved using either a cell expression system or an insect cell expression system. The pUB6/V5-His vector system (Invitrogen, Carlsbad Calif.) is used to express protein in CHO cells. The vector contains the selectable bsd gene, multiple cloning sites, the promoter/enhancer sequence from the human ubiquitin C gene, a C-terminal V5 epitope for antibody detection with anti-V5 antibodies, and a C-terminal polyhistidine (6× His) sequence for rapid purification on PROBOND resin (Invitrogen). Transformed cells are selected on media containing blasticidin.
-
- XII Production of Antibodies
- The protein is purified using polyacrylamide gel electrophoresis and used to immunize mice or rabbits. Antibodies are produced using the protocols below. Alternatively, the amino acid sequence of the expressed protein is analyzed using LASERGENE software (DNASTAR) to determine regions of high antigenicity. An antigenic epitope, usually found near the C-terminus or in a hydrophilic region is selected, synthesized, and used to raise antibodies. Typically, epitopes of about 15 residues in length are produced using an ABI 431A peptide synthesizer (Applied Biosystems) using Fmoc-chemistry and coupled to KLH (Sigma-Aldrich) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester to increase antigenicity.
- Rabbits are immunized with the epitope-KLH complex in complete Freund's adjuvant. Immunizations are repeated at intervals thereafter in incomplete Freund's adjuvant. After a minimum of seven weeks for mouse or twelve weeks for rabbit, antisera are drawn and tested for antipeptide activity. Testing involves binding the peptide to plastic, blocking with 1% bovine serum albumin, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. Methods well known in the art are used to determine antibody titer and the amount of complex formation.
- XIII Purification of Naturally Occurring Protein Using Specific Antibodies
- Naturally occurring or recombinant protein is purified by immunoaffinity chromatography using antibodies which specifically bind the protein. An immunoaffinity column is constructed by covalently coupling the antibody to CNBr-activated SEPHAROSE resin (APB). Media containing the protein is passed over the immunoaffinity column, and the column is washed using high ionic strength buffers in the presence of detergent to allow preferential absorbance of the protein. After coupling, the protein is eluted from the column using a buffer of pH 2-3 or a high concentration of urea or thiocyanate ion to disrupt antibody/protein binding, and the protein is collected.
- XIV Screening Molecules FOR Specific Binding Using Polynucleotide or Protein
- The polynucleotide, or fragments thereof, or the protein, or portions thereof, are labeled with32P-dCTP, Cy3-dCTP, or Cy5-dCTP (APB), or with BIODIPY or FITC (Molecular Probes, Eugene Oreg.), respectively. Libraries of candidate molecules or compounds previously arranged on a substrate are incubated in the presence of composition, a labeled polynucleotide or protein. After incubation under conditions for either a nucleic acid or amino acid sequence, the substrate is washed, and any position on the substrate retaining label, which indicates specific binding or complex formation, is assayed, and the ligand is identified. Data obtained using different concentrations of the nucleic acid or protein are used to calculate affinity between the labeled nucleic acid or protein and the bound molecule.
- xv Two-hybrid Screen
- A yeast two-hybrid system, MATCHMAKER LexA Two-Hybrid system (Clontech Laboratories, Palo Alto Calif.), is used to screen for peptides that bind the protein of the invention. A polynucleotide encoding the protein is inserted into the multiple cloning site of a pLexA vector, ligated, and transformed intoE. coli. cDNA, prepared from mRNA, is inserted into the multiple cloning site of a pB42AD vector, ligated, and transformed into E. coli to construct a cDNA library. The pLexA plasmid and pB42AD-cDNA library constructs are isolated from E. coli and used in a 2:1 ratio to co-transform competent yeast EGY48[p8op-lacZ] cells using a polyethylene glycol/lithium acetate protocol. Transformed yeast cells are plated on synthetic dropout (SD) media lacking histidine (-His), tryptophan (-Trp), and uracil (-Ura), and incubated at 30 C until the colonies have grown up and are counted. The colonies are pooled in a minimal volume of lx TE (pH 7.5), replated on SD/-His/-Leu/-Trp/-Ura media supplemented with 2% galactose (Gal), 1% raffinose (Raf), and 80 mg/ml 5-bromo-4-chloro-3-indolyl β-d-galactopyranoside (X-Gal), and subsequently examined for growth of blue colonies. Interaction between expressed protein and cDNA fusion proteins activates expression of a LEU2 reporter gene in EGY48 and produces colony growth on media lacking leucine (-Leu). Interaction also activates expression of β-galactosidase from the p8op-lacZ reporter construct that produces blue color in colonies grown on X-Gal.
- Positive interactions between expressed protein and cDNA fusion proteins are verified by isolating individual positive colonies and growing them in SD/-Trp/-Ura liquid medium for 1 to 2 days at 30 C. A sample of the culture is plated on SD/-Trp/-Ura media and incubated at 30 C until colonies appear. The sample is replica-plated on SD/-Trp/-Ura and SD/-His/-Trp/-Ura plates. Colonies that grow on SD containing histidine but not on media lacking histidine have lost the pLexA plasmid. Histidine-requiring colonies are grown on SD/Gal/Raf/X-Gal/-Trp/-Ura, and white colonies are isolated and propagated. The pB42AD-cDNA plasmid, which contains a polynucleotide encoding a protein that physically interacts with the protein, is isolated from the yeast cells and characterized.
- All patents and publications mentioned in the specification are incorporated by reference herein. Various modifications and variations of the described method and system of the invention will be apparent to those skilled in the art without departing from the scope and spirit of the invention. Although the invention has been described in connection with specific preferred embodiments, it should be understood that the invention as claimed should not be unduly limited to such specific embodiments. Indeed, various modifications of the described modes for carrying out the invention that are obvious to those skilled in the field of molecular biology or related fields are intended to be within the scope of the following claims.
-
1 15 1 1966 DNA Homo sapiens 223163CT1 1 caaaatggag cttgtaagaa ggctcatgcc attgaccctc ttaattctct cctgtttggc 60 ggactgacaa tggcggaggc tgaaggcaat gcaagctgca cagtcagtct agggggtgcc 120 aatatggcag agacccacaa agccatgatc ctgcaactca atcccagtga gaactgcacc 180 tggacaatag aaagaccaga aaacaaaagc atcagaatta tcttttccta tgtccagctt 240 gatccagatg gaagctgtga aagtgaaaac attaaagtct ttgacggaac ctccagcaat 300 gggcctctgc tagggcaagt ctgcagtaaa aacgactatg ttcctgtatt tgaatcatca 360 tccagtacat tgacgtttca aatagttact gactcagcaa gaattcaaag aactgtcttt 420 gtcttctact acttcttctc tcctaacatc tctattccaa actgtggcgg ttacctggat 480 accttggaag gatccttcac cagccccaat tacccaaagc cgcatcctga gctggcttat 540 tgtgtgtggc acatacaagt ggagaaagat tacaagataa aactaaactt caaagagatt 600 ttcctagaaa tagacaaaca gtgcaaattt gattttcttg ccatctatga tggcccctcc 660 accaactctg gcctgattgg acaagtctgt ggccgtgtga ctcccacctt cgaatcgtca 720 tcaaactctc tgactgtcgt gttgtctaca gattatgcca attcttaccg gggattttct 780 gcttcctaca cctcaattta tgcagaaaac atcaacacta catctttaac ttgctcttct 840 gacaggatga gagttattat aagcaaatcc tacctagagg cttttaactc taatgggaat 900 aacttgcaac taaaagaccc aacttgcaga ccaaaattat caaatgttgt ggaattttct 960 gtccctctta atggatgtgg tacaatcaga aaggtagaag atcagtcaat tacttacacc 1020 aatataatca ccttttctgc atcctcaact tctgaagtga tcacccgtca gaaacaactc 1080 cagattattg tgaagtgtga aatgggacat aattctacag tggagataat atacataaca 1140 gaagatgatg taatacaaag tcaaaatgca ctgggcaaat ataacaccag catggctctt 1200 tttgaatcca attcatttga aaagactata cttgaatcac catattatgt ggatttgaac 1260 caaactcttt ttgttcaagt tagtctgcac acctcagatc caaatttggt ggtgtttctt 1320 gatacctgta gagcctctcc cacctctgac tttgcatctc caacctacga cctaatcaag 1380 agtggatgta gtcgagatga aacttgtaag gtgtatccct tatttggaca ctatgggaga 1440 ttccagttta atgcctttaa attcttgaga agtatgagct ctgtgtatct gcagtgtaaa 1500 gttttgatat gtgatagcag tgaccaccag tctcgctgca atcaaggttg tgtctccaga 1560 agcaaacgag acatttcttc atataaatgg aaaacagatt ccatcatagg acccattcgt 1620 ctgaaaaggg atcgaagtgc aagtggcaat tcaggatttc agcatgaaac acatgcggaa 1680 gaaactccaa accagccttt caacagtgtg catctgtttt ccttcatggt tctagctctg 1740 aatgtggtga ctgtagcgac aatcacagtg aggcattttg taaatcaacg ggcagactac 1800 aaataccaga agctgcagaa ctattaacta acaggtccaa ccctaagtga gacatgtttc 1860 tccaggatgc caaaggaaat gctacctcgt ggctacacat attatgaata aatgaggaag 1920 ggcctgaaag tgacacacag gcctgcatgt caaaaaaaaa aaaaaa 1966 2 924 DNA Homo sapiens 884692CB1 2 acacatctca ttttcatctt cacaaccagg taggtattat ttagttattg tagaaaggca 60 aagtcattgg ccccaaatta tatagctaaa agaaagtctc tacttgatga gattcaaacc 120 cagatttgtt tggcatgaca gtgataattt tctagattga gataaccaca gcatcggaat 180 tagggccata gcgtgaacca gttctggaca cagttcttgg tccagagctg cccattgtag 240 gagcagtcta gatagaatcc aggcatttaa attttgatat aataaaagtt catcatccct 300 acagtcttgc tcaagaagtc aagtccgcag tgaagtaccg gccatcgacc tagcccgggt 360 tctagatttg gggcccatca ctcggagagg tgcacagtct cccggtgtca tgaatggaac 420 ccctagcact gcagggttcc tggtggcctg gcctatggtc ctcctgactg tcctcctggc 480 ttggctgttc tgagagctcc gctgagcatc tggccttgaa gtttgtgttc ttccctctgg 540 caatggctcc cttcagcact tctgctttcc actccaattc acacaggctt ggtattaaca 600 gaatcaaggc caggctaggt taggaaaagg gaagagcttt caccttcttt aaaactctcg 660 gctgggcgca gtggctcatg cctgtaatcc cagcattttg ggaggctgag gcaggtggat 720 cacctgaggt cagcagttca aaatcagcct ggccaaaatg ctgaaactct gtctctacta 780 aaaatacaaa aattagccag gcatggtggc aggcgcctgt aatcccagct actcgggagg 840 ccaaggcagg agaattgctc gaactcaggg ggtggaggtt gcagtgagtt gagattgtgc 900 cattgcactc cagcctggca acat 924 3 845 DNA Homo sapiens 888246CB1 3 ttgcaatgag ccaatattgt gccactacac tccagcctgg gcaacagagt gagactccat 60 ctcaaaaaaa aaaaaaaaaa aaagaaaact aagattaagt tactacaatg acagaataga 120 aagtgtcacc tacatgtaat ataggtcaga aggagagcaa cagaagaata cacacatgtg 180 cacacacaca catacataca tggacatgtg tgcaacttgt gcatacacac acaaacacac 240 acacatgtgc gtgcaatata ccacaatata ccatcatcct ttctatttat gtggagacta 300 gttcaatcga tttttctgtc acctaagaat ttacctaccc caggagcctg ccttccacac 360 atacattaat aacaccaacc agtaatgtca aaaggaaaaa ttacaaaccc agaaaattaa 420 agtcattctg cacttgccct tggtttaaca ggcatttcac tcttggcacc tttcctgtcc 480 tatcattaat aagcatctta ttgatacagt ttatactcca aattctccag gcttgtgaaa 540 gtttcctcag gattgcttga aaatgaaagt cctggccagg tgcgcagtgg ctcatgcctg 600 taatcccagc actttgagag gccgaggcgg gtggatcacc cgaggtcagg agttcaagac 660 cagcgtagcc aacatggtga aaccctgtct ctactaaaag tacaaaaatt agccaggtgt 720 ggtcgcaggc gcctgtagtc ctagctactc aggaggctga ggcaggagaa ttgcttaaat 780 tcggaggcag aggttgcagt gagctgagat cgcgcctctg cactccagcc tggcgacaga 840 atggg 845 4 1739 DNA Homo sapiens 888309CB1 4 cccacgcgtc cgggggcatg gacctgaggt caagggaatg tgggctctcc aatccatttg 60 ctgtaaagcc agtgggtttg caaggatagg agggcagggt tggagcaaat ttccaggtca 120 gctgctgggc cgtggcctca ggaaatggtt ctgacatggg caggcttgac ccctgaggga 180 tgaagacact gaagatgata attctgctaa tgtaggagct atgttttcat agccacaggg 240 tcttcatgtc agggacatgg gcagacttct ggggacaagt cactactgtc tctgagcctg 300 aatatcctca tctgtaaaat gaggataagg taataataat acccaccata cagggctatt 360 gtgagaacta aatcagagca gtccaattgg gcaggctcag gaggtgatga atttctcgtc 420 ccaggaggta agcaagcaga gtgagatgtc ccatgggtag ggatgtcata gacaaacaag 480 cactaagccc tggacagggg atggatgagc ctcccactga gattatttcc ctccatcact 540 gaactctaac aagggccttt gatcttgcct ttggcacaag catgccttcc tctgagcaca 600 ctacaagtcc ctatggaaga gagagtgttc taggcagcag gacaagaagg agcatgacac 660 atttggaaaa cggagccaca gtgtgaacag ggcgatgctt agatgtgccc agcagaagca 720 ccctgggaaa tgaggggtag ggaacaacca acaaccttga tctccttgaa gactctttct 780 gctcattgag tggataaggc cccagagatt cagtgtggtt ttctggggtt tgggcccatc 840 acagagtcag attttgggct ttaaggaggc cctccctgta cctggatggg ctccaaggac 900 agtctcagct gactgagtga gcaggtggcc tgcctcaagt cttcatcagt ggccagcaca 960 atgatgagtg tccagtgggc cccattgctt gcagacacat ccctctgtgc tctgactttc 1020 acttccatct ccttctccca caccctgctc tcattttagg ttcctgcgcc tctgaactct 1080 gaaattccac aaatgcacca ttccctctat cccatctcca tgcttttgcc tctcctgttc 1140 ccttagcctg ggatgcgttc acttgcttta ctgacttgca aaactcctac ccacgtttca 1200 aatttcatac cactgtgaat ccttccctga cttcaccaag agactcagat agaccttctt 1260 ctctgctccc cctgcatctg tacatacttc tgtctgtatc tttatcatat tgaagtataa 1320 taaactgttg atatgttggt gtttacacaa gaccaagaaa tcctcatggg ccaagtccat 1380 gccttattta cttcatgttg aatgcaccta gcatttgaga aggtggttgg taaagtggct 1440 catgcctgta atcccaacag tttgggaggc tgaggccggc agatcgcttg aggtcaggag 1500 tttgaaacca gcctggccaa tatggcaaaa ccccatcttt ataaaaatac agaaattagc 1560 caggtgtggt ggctcatgcc tgtaatccca tgcctgtaat cccagccttg ggaggctgag 1620 gcaggagaat cacttgaatc caggaggcag aggttgcagt gaactgagat tggaccactg 1680 cactccagcc tgggcaacac tgagcaaaac tgcctgtcgt gaaaaaaaaa aaaaaaaaa 1739 5 438 DNA Homo sapiens 951335CB1 5 gcgcctgtaa tcccagctac tcgggaggcc aaggcaggag aattgctcga actcaggggg 60 tggaggttgc agtgagttga gattgtgcca ttgcactcca gcctgggcaa cagagcaaga 120 ctctgtctca ggaaaaaaaa aaaaaaaaaa aagaaaagca acatagtggg gtttctgtca 180 atctgtcctc ggctgccctt ctcatttgtt gatgggacct tgaaagcaag cttgctaggt 240 gccctctgtg gctccagcct ttaccggaag tgtggtgcat gtttttaact tcagggaagc 300 ggtatcctgt cactggggta tgggatgagc atggagaaga ggcaccagcc acgattcctt 360 cctaagcatc tcctgttctg actgctcatg aattgaagaa actgacaaaa aaaaaaatta 420 aaaaaaaaaa aaaaaaaa 438 6 483 DNA Homo sapiens 2091133CT1 6 tgtagcgtct gcatctgaaa ttgtttttac atctgtccca cctgcaccct tcaccccagg 60 ctgttagttt cttgaggaca aggacttcat cattttcaaa cattattggt caaataaatg 120 aagaaatagg ctgcatcctt tctctttatc ctttgacctc ctctatcatc ctgctgttat 180 cttccagaag gagaagaaac agcttcacag gaaaagtaga ggagattttc ccattatggt 240 gaaagtgcca aatcagaatg tgaaatagga attctgggct ctgtaccagg catttactcc 300 tatgctgtta gctgatgtta aagagggtgg atttcttttc ccttaggtct caccttctgt 360 gccttcaggg gaagttggtt ggaagtttga atggtttgtt gttgtcgtca ttgttttgta 420 ttaaggaggg ctgtaatgga acgaatacaa tggttattga tggagagtaa aaaaaaaaaa 480 aaa 483 7 646 DNA Homo sapiens 2383628CB1 7 tccccgctgc gcccgctgct gctggccctg gcccttgcct ccgtgccttg cgcccagggc 60 gcctgccccg cctccgccga cctcaagcac tcggacggga cgcgcacttg cgccaagctc 120 tatgacaaga gcgaccccta ctatgagaac tgctgcgggg gcgccgagct gtcgctggag 180 tcgggcgcag acctgcccta cctgccctcc aactgggcca acaccgcctc ctcacttgtg 240 gtggccccgc gctgcgagct caccgtgtgg tcccggcaag gcaaggcggg caagacgcac 300 aagttctctg ccggcaccta cccgcgcctg gaggagtacc gccggggcat cttaggagac 360 tggtccaacg ctatctccgc gctctactgc aggtgcagct gatgcattgc tggtctctca 420 tctgcagctt ccacagagtg ccaagcccct cactcagccc atccctgggc tctgctccgg 480 ggccccaaga cccaggagga ggagcgttct gcctgccccc tcccacctcc cctgcaatac 540 agcctttgtg cagttgaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa ataaaaaaaa 600 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa cgaaaaaaaa aaaaaa 646 8 1354 DNA Homo sapiens 2774542CB1 8 ggtgagccct ctgtcggcat cttcctctcc aggctggcag agcaaggggg gctgtgaatt 60 aattcaaggt tgggggtcgg ggccttctat atctggactt gcctcccacc cgtgtcctct 120 gtcccttttt ccctacggca gatagccatg tgtgagcctg aatttggcaa tgacaaggcc 180 agggagccga gcgtgggtgg caggtggcga gtgtcctggt acgaacggtt tgtgcagcca 240 tgtctggtcg aactgctggg ctctgctctc ttcatcttca tcgggtgcct gtcggtcatt 300 gagaatggga cggacactgg gctgctgcag ccggccctgg cccacgggct ggctttgggg 360 ctcgtgattg ccacgctggg gaatatcagt ggtggacact tcaaccctgc ggtgtccctg 420 gcagccatgc tgatcggagg cctcaacctg gtgatgctcc tcccgtactg ggtctcacag 480 ctgctcgggg ggatgctcgg ggctgccttg gccaaggcgg tgagtcctga ggagaggttc 540 tggaatgcat ctggggcggc ctttgtgaca gtccaggagc aggggcaggt ggcaggggcg 600 ttggtggcag agatcatcct gacgacgctg ctggccctgg ctgtatgcat gggtgccatc 660 aatgagaaga caaagggccc tctggccccg ttctccatcg gctttgccgt caccgtggat 720 atcctggctg ggggccctgt gtctggaggc tgcatgaatc ccgcccgtgc ttttggacct 780 gcggtggtgg ccaaccactg gaacttccac tggatctact ggctgggccc actcctggct 840 ggcctgcttg ttggactgct cattaggtgc ttcattggag atgggaagac ccgcctcatc 900 ctgaaggctc ggtgaagcag agctcgtggg attcctgctg ctccaggtgt cctcagctca 960 cctgtcccag actgaggaca ggggagttcc tgcatttcct gccagggcag aggcccagag 1020 gagcgacccc ctgcttccac tgcttgggcc tgctttctca gatagactga ctgctgagga 1080 ggctctaggt tcttggaatt cctttgtgct catcagagac cccagcctgg ggaacacgct 1140 gcccgcactg cccagagagc agtgcaaaca ccacaacacg agcgtgtttc ttgagaggaa 1200 tgtccccgag ttggacaagg aggctgtttc tgcacatcag ctcatttccc gcaccccatt 1260 tcttgcttga ttgctttgtt gggggcctgg ccacttcctt gcttctcaag ctgacaattc 1320 tcactttgca ataaatagtc cagtgtttcc ttcc 1354 9 681 DNA Homo sapiens 2777115 9 ccttacacat acaggaagac aagacctgag tggtgctgtc ttggtgtccg tcgtgtatgc 60 tcctccctgt cttcatttct tctcactctg tctctaaacc tctctctctc tccttgaccc 120 atcagtactt agtctacaga cctatgtgcg tgtccctatc cttctgtcct tttctctctt 180 cagctctccc tgcctctcac acacaatttt acatgccccg aggagccaag tttgggacat 240 ttaccctcca ggcatctgtg tcccctcttg aagagaaaac acacagcttc acacatccag 300 gcataggggg caagctcttg gggcatcagg accctggagc accaggtcct tcctggaata 360 ttagatccac ctggagcacc gggtctctct aagtctcacc tggggaattc ggtcccacct 420 ggggcaccag ttcccaccta gagcactgtg tcctgcccta gagcacaaag acctgctcct 480 cccgagactc tctctgactg cagccaggca tagtaccctt gcctgtgttt gctccctggt 540 ccacagattt ggtggctggg caggtgcctg gacagtgatg aggtcttgcc gccttaactg 600 tcccccccag tcacttctcc cacaggccca gcaggacgca gtcctgagga tcagggattc 660 tacagctgca ttaaaatcaa c 681 10 287 DNA Homo sapiens unsure 182, 186 a or g or c or t, unknown, or other 10 gcagggttcc agcgacagca gcactggact cgtccagagg gcggcgggtg agcggctggg 60 gccccgtgga gccaccatgg accccgcagg cagcagaccc ctcagtgcct cccaatcctt 120 tgactcacct gagcctgcag gacagatcag agatgcagct gcagagcgaa gccgacaggc 180 gnactncccg ggcacttgga ccaggtaacg gcggcgtggc agcgtgccct aggtggggac 240 tgccaggcag ctggagcaca cagaggcaac ggccgcattt aaccagg 287 11 449 DNA Homo sapiens 3833667CB1 11 taaatactga atgaatgaat gaagcactaa actgaatgca tataaggcaa agacacaaat 60 aacttaattt tgtgcagcca aatcagtttg taacttcacc aaacagttca catcaacatt 120 taatgagcgt ccctttgccc aaggcactgg gtgaaggatg agggggtatt ggtttgtgtt 180 tatgtagaat tttgcagttt gcaaagtccc ttctcttaca tctcttcatg agggtttcac 240 aacgactctg taaggtaggg gttgtcatta ttcctgcttt cccgataagg atacagaagc 300 tcagagaggg cagacatttg acctggagta gaactagggc aagaatacag gccactgtgt 360 gccccctcct cccacgctct gtttctctct gaagatgacc tggggacagc ataatacaaa 420 gtggatggaa tgggctgaga aaggagagg 449 12 874 DNA Homo sapiens 3835361CB1 12 ggaggcttta aggatcagac ctagatggtt gatgagagag caacaggata tataggaggc 60 tttaaggatc agacctagaa atggcacaga tgacttctat gcacatttta ttgaccagat 120 tcggtcacat ggccccacct agttgcaaag gacactggga aaaattgcat tcctgtgtgt 180 ccagaggaaa atgaaaaaaa tggttggtga ttagattgcc tctaccatgt gagtcccaga 240 gactataact aggccagata tcaaagatgc tttgcctttc tcatccttgt gttgtgaaag 300 acaaagaggc caacttatgt ttgctcctga ctcccaaagc ccaacacttg acagtcatat 360 ttcttgtatt tcagggttcc tggtggcctg gcctatggtc ctccctgact gtcctcctgg 420 cttggctgtt ctgagagctc cgctgagcat ctggccttga agtttgtgtt cttccctctg 480 gcaatggctc ccttcagcac ttctgctttc cactccaatt cacacaggct tggtattaac 540 agaatcaagg ccaggctagg ttaggaaaag ggaagagctt tcaccttctt taaaactctc 600 ggctgggcgc agtggctcat gcctgtaatc ccagcatttt gggaggctga ggcaggtgga 660 tcacctgagg tcagcagttc aaaatcagcc tggccaaaat gctgaaactc cgtctctact 720 aaaaatacaa aaattagcca ggcatggtgg caggcgcctg taatcccagc tactcgggag 780 gccaaggcag gagaattgct cgaactcagg gggtggaggt tgcagtgagt tgagattgtg 840 ccattgcact ccagcctggg caacagagca agat 874 13 1135 DNA Homo sapiens 3836037CB1 13 cttcatatag ggacaccagt catcgaattg gaggttcact ctactcaagt atgacgtcac 60 cgtgatttca ctgattttat gtcccaggcc gtattctaac aagggcacat cctgtgttct 120 gggaagggcg tgtcgctggg gaaatactct tcacccggct gcaacctctc actgtagaac 180 tgcctctgtg gagaagccca aagggcattt gcggcttcta ggagccaagt aggaggaggc 240 tgggatccgt gtttcaggcg ggactccagg cttgggcggg cctgatactc gagtccacat 300 gccccctcta gagaggaacc tgtctcctgc cagggccagg gaggggggca ctggctgctt 360 ctgtattttg gggtttgggg ccctggagct tcccatgcgg aattgccgtc cctcctccta 420 ggcgagtccc agggccaccc catcccacag ggacccgggc gccagcttct gaaagcatgg 480 ggcatctgcg gaagaactgg gttgtttccc agctttcgtc cctgcggagg ggcgatccgg 540 cccctccatg tcagcagtgt ttggtcgtcc acatgcttgt cagccccacg ctgtgctcct 600 gcgtctcttc ccgtctcatc catctggatg cttgacacct ctgacagcat ccctttcctg 660 tcatcttagg gcagcttcag gaaaccgaaa aacaggcttg tgtccttcca ttaacccctt 720 tatccacaag ttcagtatca gcatgagccc tggggagctc caaggctgca gccaggagcc 780 ccgtagccag ggatggtcct ggctgtgctg ctgcaccagg gccgccttcc ccaccttttc 840 cagaggaacc tgttctacgg ccagaagaac aagtaccgag caccccgagg gaagccggcc 900 ccggcctcag gggacaccca gacccctgca aaggggtcca gtgtccggga gcctgggcgc 960 agtggtgttg aggggccaca ttccagctga gtggccttgc tctgtgtgag ccccgtgcga 1020 gggccctgct tgtagctgga ccctggaacc ttctgtagct aagagggaat cctggccccc 1080 tccccagaag ccatttgtca ataaaccatt tctaagaaaa aaaaaaaaaa aaaaa 1135 14 585 PRT Homo sapiens 223163CD1 14 Met Ala Glu Ala Glu Gly Asn Ala Ser Cys Thr Val Ser Leu Gly 1 5 10 15 Gly Ala Asn Met Ala Glu Thr His Lys Ala Met Ile Leu Gln Leu 20 25 30 Asn Pro Ser Glu Asn Cys Thr Trp Thr Ile Glu Arg Pro Glu Asn 35 40 45 Lys Ser Ile Arg Ile Ile Phe Ser Tyr Val Gln Leu Asp Pro Asp 50 55 60 Gly Ser Cys Glu Ser Glu Asn Ile Lys Val Phe Asp Gly Thr Ser 65 70 75 Ser Asn Gly Pro Leu Leu Gly Gln Val Cys Ser Lys Asn Asp Tyr 80 85 90 Val Pro Val Phe Glu Ser Ser Ser Ser Thr Leu Thr Phe Gln Ile 95 100 105 Val Thr Asp Ser Ala Arg Ile Gln Arg Thr Val Phe Val Phe Tyr 110 115 120 Tyr Phe Phe Ser Pro Asn Ile Ser Ile Pro Asn Cys Gly Gly Tyr 125 130 135 Leu Asp Thr Leu Glu Gly Ser Phe Thr Ser Pro Asn Tyr Pro Lys 140 145 150 Pro His Pro Glu Leu Ala Tyr Cys Val Trp His Ile Gln Val Glu 155 160 165 Lys Asp Tyr Lys Ile Lys Leu Asn Phe Lys Glu Ile Phe Leu Glu 170 175 180 Ile Asp Lys Gln Cys Lys Phe Asp Phe Leu Ala Ile Tyr Asp Gly 185 190 195 Pro Ser Thr Asn Ser Gly Leu Ile Gly Gln Val Cys Gly Arg Val 200 205 210 Thr Pro Thr Phe Glu Ser Ser Ser Asn Ser Leu Thr Val Val Leu 215 220 225 Ser Thr Asp Tyr Ala Asn Ser Tyr Arg Gly Phe Ser Ala Ser Tyr 230 235 240 Thr Ser Ile Tyr Ala Glu Asn Ile Asn Thr Thr Ser Leu Thr Cys 245 250 255 Ser Ser Asp Arg Met Arg Val Ile Ile Ser Lys Ser Tyr Leu Glu 260 265 270 Ala Phe Asn Ser Asn Gly Asn Asn Leu Gln Leu Lys Asp Pro Thr 275 280 285 Cys Arg Pro Lys Leu Ser Asn Val Val Glu Phe Ser Val Pro Leu 290 295 300 Asn Gly Cys Gly Thr Ile Arg Lys Val Glu Asp Gln Ser Ile Thr 305 310 315 Tyr Thr Asn Ile Ile Thr Phe Ser Ala Ser Ser Thr Ser Glu Val 320 325 330 Ile Thr Arg Gln Lys Gln Leu Gln Ile Ile Val Lys Cys Glu Met 335 340 345 Gly His Asn Ser Thr Val Glu Ile Ile Tyr Ile Thr Glu Asp Asp 350 355 360 Val Ile Gln Ser Gln Asn Ala Leu Gly Lys Tyr Asn Thr Ser Met 365 370 375 Ala Leu Phe Glu Ser Asn Ser Phe Glu Lys Thr Ile Leu Glu Ser 380 385 390 Pro Tyr Tyr Val Asp Leu Asn Gln Thr Leu Phe Val Gln Val Ser 395 400 405 Leu His Thr Ser Asp Pro Asn Leu Val Val Phe Leu Asp Thr Cys 410 415 420 Arg Ala Ser Pro Thr Ser Asp Phe Ala Ser Pro Thr Tyr Asp Leu 425 430 435 Ile Lys Ser Gly Cys Ser Arg Asp Glu Thr Cys Lys Val Tyr Pro 440 445 450 Leu Phe Gly His Tyr Gly Arg Phe Gln Phe Asn Ala Phe Lys Phe 455 460 465 Leu Arg Ser Met Ser Ser Val Tyr Leu Gln Cys Lys Val Leu Ile 470 475 480 Cys Asp Ser Ser Asp His Gln Ser Arg Cys Asn Gln Gly Cys Val 485 490 495 Ser Arg Ser Lys Arg Asp Ile Ser Ser Tyr Lys Trp Lys Thr Asp 500 505 510 Ser Ile Ile Gly Pro Ile Arg Leu Lys Arg Asp Arg Ser Ala Ser 515 520 525 Gly Asn Ser Gly Phe Gln His Glu Thr His Ala Glu Glu Thr Pro 530 535 540 Asn Gln Pro Phe Asn Ser Val His Leu Phe Ser Phe Met Val Leu 545 550 555 Ala Leu Asn Val Val Thr Val Ala Thr Ile Thr Val Arg His Phe 560 565 570 Val Asn Gln Arg Ala Asp Tyr Lys Tyr Gln Lys Leu Gln Asn Tyr 575 580 585 15 255 PRT Homo sapiens 2774542CD1 15 Met Cys Glu Pro Glu Phe Gly Asn Asp Lys Ala Arg Glu Pro Ser 1 5 10 15 Val Gly Gly Arg Trp Arg Val Ser Trp Tyr Glu Arg Phe Val Gln 20 25 30 Pro Cys Leu Val Glu Leu Leu Gly Ser Ala Leu Phe Ile Phe Ile 35 40 45 Gly Cys Leu Ser Val Ile Glu Asn Gly Thr Asp Thr Gly Leu Leu 50 55 60 Gln Pro Ala Leu Ala His Gly Leu Ala Leu Gly Leu Val Ile Ala 65 70 75 Thr Leu Gly Asn Ile Ser Gly Gly His Phe Asn Pro Ala Val Ser 80 85 90 Leu Ala Ala Met Leu Ile Gly Gly Leu Asn Leu Val Met Leu Leu 95 100 105 Pro Tyr Trp Val Ser Gln Leu Leu Gly Gly Met Leu Gly Ala Ala 110 115 120 Leu Ala Lys Ala Val Ser Pro Glu Glu Arg Phe Trp Asn Ala Ser 125 130 135 Gly Ala Ala Phe Val Thr Val Gln Glu Gln Gly Gln Val Ala Gly 140 145 150 Ala Leu Val Ala Glu Ile Ile Leu Thr Thr Leu Leu Ala Leu Ala 155 160 165 Val Cys Met Gly Ala Ile Asn Glu Lys Thr Lys Gly Pro Leu Ala 170 175 180 Pro Phe Ser Ile Gly Phe Ala Val Thr Val Asp Ile Leu Ala Gly 185 190 195 Gly Pro Val Ser Gly Gly Cys Met Asn Pro Ala Arg Ala Phe Gly 200 205 210 Pro Ala Val Val Ala Asn His Trp Asn Phe His Trp Ile Tyr Trp 215 220 225 Leu Gly Pro Leu Leu Ala Gly Leu Leu Val Gly Leu Leu Ile Arg 230 235 240 Cys Phe Ile Gly Asp Gly Lys Thr Arg Leu Ile Leu Lys Ala Arg 245 250 255
Claims (20)
1. A composition comprising a plurality of polynucleotides having the nucleic acid sequences of SEQ ID NOs: 1-13 or the complements thereof.
2. An isolated polynucleotide comprising a nucleic acid sequence selected from SEQ ID NOs: 1-13 and the complements thereof.
3. A composition comprising a polynucleotide of claim 2 and a labeling moiety.
4. A method of using a polynucleotide to screen a plurality of molecules to identify at least one ligand which specifically binds the polynucleotide, the method comprising:
a) combining the composition of claim 1 with a plurality of molecules under conditions to allow specific binding; and
b) detecting specific binding, thereby identifying a ligand which specifically binds a polynucleotide.
5. The method of claim 4 wherein the composition is attached to a substrate.
6. The method of claim 4 wherein the molecules to be screened are selected from DNA molecules, RNA molecules, peptide nucleic acids, mimetics, and proteins.
7. A method of using a polynucleotide to purify a ligand, the method comprising:
a) combining the polynucleotide of claim 2 with a sample under conditions to allow specific binding;
b) recovering the bound polynucleotide; and
c) separating the ligand from the bound polynucleotide, thereby obtaining purified ligand.
8. The method of claim 7 wherein the polynucleotide is attached to a substrate.
9. A method for using a polynucleotide to detect gene expression in a sample, the method comprising:
a) hybridizing the composition of claim 1 to a sample thereby forming at least one hybridization complex;
b) detecting complex formation, wherein complex formation indicates gene expression in the sample.
10. The method of claim 9 wherein the polynucleotides of the composition are attached to a substrate.
11. The method of claim 9 wherein the sample is from pancreatic tissue.
12. The method of claim 9 wherein gene expression is compared to standards and indicates the presence of type I diabetes.
13. A vector comprising a polynucleotide of claim 2 .
14. A host cell comprising the vector of claim 13 .
15. A method for using a host cell to produce a protein, the method comprising:
a) culturing the host cell of claim 14 under conditions for expression of the protein; and
b) recovering the protein from cell culture.
16. A purified protein or a portion thereof comprising an amino acid sequence of SEQ ID NO: 14 or SEQ ID NO:15.
17. A composition comprising the protein of claim 16 and a pharmaceutical carrier or a labeling moiety.
18. A method for using a protein to screen a plurality of molecules to identify at least one ligand which specifically binds the protein, the method comprising:
a) combining the protein of claim 16 with the plurality of molecules under conditions to allow specific binding; and
b) detecting specific binding between the protein and ligand, thereby identifying a ligand which specifically binds the polypeptide.
19. The method of claim 18 wherein the plurality of molecules is selected from DNA molecules, RNA molecules, peptide nucleic acids, mimetics, proteins, agonists, antagonists, and antibodies.
20. A method of using a protein to purify a ligand from a sample, the method comprising:
a) combining the protein of claim 16 with a sample under conditions to allow specific binding;
b) recovering the bound protein; and
c) separating the ligand from the bound protein, thereby obtaining purified ligand.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/864,711 US20020077309A1 (en) | 1999-01-07 | 2001-05-23 | Diagnostics and therapeutics for pancreatic disorders |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US22699499A | 1999-01-07 | 1999-01-07 | |
US09/864,711 US20020077309A1 (en) | 1999-01-07 | 2001-05-23 | Diagnostics and therapeutics for pancreatic disorders |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US22699499A Continuation-In-Part | 1999-01-07 | 1999-01-07 |
Publications (1)
Publication Number | Publication Date |
---|---|
US20020077309A1 true US20020077309A1 (en) | 2002-06-20 |
Family
ID=22851316
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/610,906 Expired - Fee Related US6566066B1 (en) | 1999-01-07 | 2000-07-06 | Aquaporin-8 variant |
US09/864,711 Abandoned US20020077309A1 (en) | 1999-01-07 | 2001-05-23 | Diagnostics and therapeutics for pancreatic disorders |
US10/396,943 Abandoned US20030158085A1 (en) | 1999-01-07 | 2003-03-24 | Aquaporin-8 variant |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/610,906 Expired - Fee Related US6566066B1 (en) | 1999-01-07 | 2000-07-06 | Aquaporin-8 variant |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/396,943 Abandoned US20030158085A1 (en) | 1999-01-07 | 2003-03-24 | Aquaporin-8 variant |
Country Status (6)
Country | Link |
---|---|
US (3) | US6566066B1 (en) |
EP (1) | EP1144631A3 (en) |
JP (1) | JP2002534088A (en) |
AU (1) | AU2376200A (en) |
CA (1) | CA2357677A1 (en) |
WO (1) | WO2000040722A2 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040114202A1 (en) * | 2002-09-30 | 2004-06-17 | Canon Kabushiki Kaisha | Image scanning apparatus |
EP1431763A1 (en) * | 2002-12-20 | 2004-06-23 | BioVisioN AG | Method for detecting a metabolic disease |
US9474772B2 (en) * | 2012-06-26 | 2016-10-25 | Seraxis, Inc. | Method for generating non-pluripotent progenitors of surrogate pancreatic cells |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2357677A1 (en) * | 1999-01-07 | 2000-07-13 | Incyte Pharmaceuticals, Inc. | Insulin-synthesis genes |
US7034132B2 (en) | 2001-06-04 | 2006-04-25 | Anderson David W | Therapeutic polypeptides, nucleic acids encoding same, and methods of use |
JP2003508087A (en) * | 1999-09-03 | 2003-03-04 | ヒューマン ジノーム サイエンシーズ, インコーポレイテッド | 29 human cancer-related proteins |
ES2679282T3 (en) | 2004-10-22 | 2018-08-23 | Revivicor Inc. | Transgenic pigs that lack endogenous immunoglobulin light chain |
ES2548377T3 (en) | 2008-10-27 | 2015-10-16 | Revivicor, Inc. | Immunosuppressed ungulates |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6566066B1 (en) * | 1999-01-07 | 2003-05-20 | Incyte Genomics, Inc. | Aquaporin-8 variant |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5817479A (en) * | 1996-08-07 | 1998-10-06 | Incyte Pharmaceuticals, Inc. | Human kinase homologs |
JP2002504806A (en) * | 1996-10-25 | 2002-02-12 | ラルセン,ペーテル モーゼ | Diabetes mediating proteins and their therapeutic use |
CA2303834A1 (en) * | 1997-09-17 | 1999-03-25 | Genentech, Inc. | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
WO1999031274A2 (en) * | 1997-12-15 | 1999-06-24 | Abbott Laboratories | Reagents and methods useful for detecting diseases of the pancreas |
-
1999
- 1999-12-20 CA CA002357677A patent/CA2357677A1/en not_active Abandoned
- 1999-12-20 AU AU23762/00A patent/AU2376200A/en not_active Abandoned
- 1999-12-20 JP JP2000592418A patent/JP2002534088A/en active Pending
- 1999-12-20 WO PCT/US1999/030537 patent/WO2000040722A2/en not_active Application Discontinuation
- 1999-12-20 EP EP99967492A patent/EP1144631A3/en not_active Withdrawn
-
2000
- 2000-07-06 US US09/610,906 patent/US6566066B1/en not_active Expired - Fee Related
-
2001
- 2001-05-23 US US09/864,711 patent/US20020077309A1/en not_active Abandoned
-
2003
- 2003-03-24 US US10/396,943 patent/US20030158085A1/en not_active Abandoned
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6566066B1 (en) * | 1999-01-07 | 2003-05-20 | Incyte Genomics, Inc. | Aquaporin-8 variant |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040114202A1 (en) * | 2002-09-30 | 2004-06-17 | Canon Kabushiki Kaisha | Image scanning apparatus |
EP1431763A1 (en) * | 2002-12-20 | 2004-06-23 | BioVisioN AG | Method for detecting a metabolic disease |
US9474772B2 (en) * | 2012-06-26 | 2016-10-25 | Seraxis, Inc. | Method for generating non-pluripotent progenitors of surrogate pancreatic cells |
US9968639B2 (en) | 2012-06-26 | 2018-05-15 | Seraxis, Inc. | Stem cells and pancreatic cells useful for the treatment of insulin-dependent diabetes mellitus |
Also Published As
Publication number | Publication date |
---|---|
EP1144631A3 (en) | 2002-01-30 |
EP1144631A2 (en) | 2001-10-17 |
AU2376200A (en) | 2000-07-24 |
CA2357677A1 (en) | 2000-07-13 |
US6566066B1 (en) | 2003-05-20 |
US20030158085A1 (en) | 2003-08-21 |
WO2000040722A2 (en) | 2000-07-13 |
WO2000040722A3 (en) | 2001-11-29 |
JP2002534088A (en) | 2002-10-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6602667B1 (en) | Inflammation-associated polynucleotides | |
US6524799B1 (en) | DNA encoding sparc-related proteins | |
US20020187472A1 (en) | Steap-related protein | |
US20030175795A1 (en) | Polynucleotides associated with cardiac muscle function | |
JP2010047588A (en) | Gene encoding new transmembrane protein | |
US20160282350A1 (en) | Methods of diagnosing cancer | |
US20020077309A1 (en) | Diagnostics and therapeutics for pancreatic disorders | |
US6262247B1 (en) | Polycyclic aromatic hydrocarbon induced molecules | |
US20030118579A1 (en) | Sparc-related proteins | |
US20030186333A1 (en) | Down syndrome critical region 1-like protein | |
US6590089B1 (en) | RVP-1 variant differentially expressed in Crohn's disease | |
US6222027B1 (en) | Molecules expressed in hippocampus | |
US6692923B2 (en) | Tapasin-like protein | |
US20030044812A1 (en) | Cell differentiation cDNAs induced by retinoic acid | |
US20030104418A1 (en) | Diagnostic markers for breast cancer | |
US20030170627A1 (en) | cDNAs co-expressed with placental steroid synthesis genes | |
US20030175787A1 (en) | Vesicle membrane proteins | |
US20030087253A1 (en) | Polynucleotide markers for ovarian cancer | |
US20020132238A1 (en) | Progesterone receptor complex p23-like protein | |
US20030124609A1 (en) | Ankyrin repeat domain 2 protein variant | |
US20030138835A1 (en) | Tumor suppressors | |
EP1319021A2 (en) | Atp-binding cassette protein | |
US20020055108A1 (en) | Human Sec6 vesicle transport protein | |
JP2002223778A (en) | Crfg-1 as target and marker for chronic renal failure | |
US20030113317A1 (en) | Molecules associated with apoptosis |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: INCYTE GENOMICS, INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:WALKER, MICHAEL G.;VOLKMUTH, WAYNE;KLINGLER, TOD M.;REEL/FRAME:012222/0145;SIGNING DATES FROM 20010904 TO 20010906 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO PAY ISSUE FEE |