WO2001083507A1 - Identification of polypeptides and nucleic acid molecules using linkage between DNA and polypeptide - Google Patents
- Publication number
- WO2001083507A1 (PCT/US2001/014671)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- nucleic acid
- interest
- acid molecule
- sequence
- substance
- Prior art date
- 150000007523 nucleic acids Chemical class 0.000 title claims abstract description 627
- 102000039446 nucleic acids Human genes 0.000 title claims abstract description 572
- 108020004707 nucleic acids Proteins 0.000 title claims abstract description 572
- 108090000765 processed proteins & peptides Proteins 0.000 title claims abstract description 334
- 102000004196 processed proteins & peptides Human genes 0.000 title claims abstract description 296
- 229920001184 polypeptide Polymers 0.000 title claims abstract description 291
- 238000000034 method Methods 0.000 claims abstract description 305
- 125000005647 linker group Chemical group 0.000 claims abstract description 106
- 150000001875 compounds Chemical class 0.000 claims abstract description 99
- 238000012360 testing method Methods 0.000 claims abstract description 82
- 230000035897 transcription Effects 0.000 claims abstract description 45
- 238000013518 transcription Methods 0.000 claims abstract description 45
- 108700026244 Open Reading Frames Proteins 0.000 claims abstract description 39
- 239000000126 substance Substances 0.000 claims description 233
- 210000004027 cell Anatomy 0.000 claims description 195
- 108020004414 DNA Proteins 0.000 claims description 174
- 102000053602 DNA Human genes 0.000 claims description 117
- 230000027455 binding Effects 0.000 claims description 73
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 claims description 73
- 239000007787 solid Substances 0.000 claims description 64
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 56
- 239000013598 vector Substances 0.000 claims description 55
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 50
- 239000003795 chemical substances by application Substances 0.000 claims description 47
- 229950010131 puromycin Drugs 0.000 claims description 39
- 241000700605 Viruses Species 0.000 claims description 36
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 claims description 26
- 239000008194 pharmaceutical composition Substances 0.000 claims description 26
- 230000002159 abnormal effect Effects 0.000 claims description 23
- 238000001727 in vivo Methods 0.000 claims description 23
- 230000005030 transcription termination Effects 0.000 claims description 23
- 125000006850 spacer group Chemical group 0.000 claims description 19
- 239000003814 drug Substances 0.000 claims description 17
- 230000001105 regulatory effect Effects 0.000 claims description 17
- 241000894006 Bacteria Species 0.000 claims description 15
- 229940079593 drug Drugs 0.000 claims description 15
- 239000002502 liposome Substances 0.000 claims description 15
- 125000003729 nucleotide group Chemical group 0.000 claims description 15
- 108091081024 Start codon Proteins 0.000 claims description 14
- 239000002773 nucleotide Substances 0.000 claims description 14
- 229960002685 biotin Drugs 0.000 claims description 13
- 235000020958 biotin Nutrition 0.000 claims description 13
- 239000011616 biotin Substances 0.000 claims description 13
- 102000052510 DNA-Binding Proteins Human genes 0.000 claims description 11
- 229920000642 polymer Polymers 0.000 claims description 11
- 102000053642 Catalytic RNA Human genes 0.000 claims description 10
- 108090000994 Catalytic RNA Proteins 0.000 claims description 10
- 101710096438 DNA-binding protein Proteins 0.000 claims description 10
- 244000045947 parasite Species 0.000 claims description 10
- 108091092562 ribozyme Proteins 0.000 claims description 10
- 108091000054 Prion Proteins 0.000 claims description 9
- 102000029797 Prion Human genes 0.000 claims description 9
- 210000005170 neoplastic cell Anatomy 0.000 claims description 9
- 239000013612 plasmid Substances 0.000 claims description 9
- 102000040650 (ribonucleotides)n+m Human genes 0.000 claims description 8
- 150000001720 carbohydrates Chemical class 0.000 claims description 8
- 239000012528 membrane Substances 0.000 claims description 8
- 239000011324 bead Substances 0.000 claims description 7
- 150000002632 lipids Chemical class 0.000 claims description 7
- 150000003384 small molecules Chemical class 0.000 claims description 7
- 239000013603 viral vector Substances 0.000 claims description 7
- 239000004005 microsphere Substances 0.000 claims description 6
- MGFYIUFZLHCRTH-UHFFFAOYSA-N nitrilotriacetic acid Chemical compound OC(=O)CN(CC(O)=O)CC(O)=O MGFYIUFZLHCRTH-UHFFFAOYSA-N 0.000 claims description 6
- 108091093037 Peptide nucleic acid Proteins 0.000 claims description 5
- VCORFLZFSPUNDN-UAGCYRGNSA-N [(2r,3s,5r)-5-(6-aminopurin-9-yl)-3-[[(2r,3s,5r)-5-(6-aminopurin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methyl [(2r,3s,5r)-5-(6-aminopurin-9-yl)-2-(phosphonooxymethyl)oxolan-3-yl] hydrogen phosphate Chemical compound C1=NC2=C(N)N=CN=C2N1[C@H](O[C@@H]1COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C3=NC=NC(N)=C3N=C2)COP(O)(O)=O)C[C@@H]1OP(O)(=O)OC[C@@H](O1)[C@@H](O)C[C@@H]1N1C(N=CN=C2N)=C2N=C1 VCORFLZFSPUNDN-UAGCYRGNSA-N 0.000 claims description 5
- SHIBSTMRCDJXLN-UHFFFAOYSA-N Digoxigenin Natural products C1CC(C2C(C3(C)CCC(O)CC3CC2)CC2O)(O)C2(C)C1C1=CC(=O)OC1 SHIBSTMRCDJXLN-UHFFFAOYSA-N 0.000 claims description 4
- QONQRTHLHBTMGP-UHFFFAOYSA-N digitoxigenin Natural products CC12CCC(C3(CCC(O)CC3CC3)C)C3C11OC1CC2C1=CC(=O)OC1 QONQRTHLHBTMGP-UHFFFAOYSA-N 0.000 claims description 4
- SHIBSTMRCDJXLN-KCZCNTNESA-N digoxigenin Chemical compound C1([C@@H]2[C@@]3([C@@](CC2)(O)[C@H]2[C@@H]([C@@]4(C)CC[C@H](O)C[C@H]4CC2)C[C@H]3O)C)=CC(=O)OC1 SHIBSTMRCDJXLN-KCZCNTNESA-N 0.000 claims description 4
- 238000011049 filling Methods 0.000 claims description 3
- 238000012163 sequencing technique Methods 0.000 claims 6
- 239000011159 matrix material Substances 0.000 claims 2
- 238000013519 translation Methods 0.000 abstract description 47
- 230000014616 translation Effects 0.000 description 56
- 230000000694 effects Effects 0.000 description 49
- 108090000623 proteins and genes Proteins 0.000 description 46
- 108020004682 Single-Stranded DNA Proteins 0.000 description 45
- 102000004169 proteins and genes Human genes 0.000 description 38
- 238000006243 chemical reaction Methods 0.000 description 36
- 235000018102 proteins Nutrition 0.000 description 36
- 239000000203 mixture Substances 0.000 description 32
- 150000001413 amino acids Chemical class 0.000 description 30
- 239000000243 solution Substances 0.000 description 28
- 238000003752 polymerase chain reaction Methods 0.000 description 26
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 23
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 23
- 210000001519 tissue Anatomy 0.000 description 21
- 108091034117 Oligonucleotide Proteins 0.000 description 19
- 235000001014 amino acid Nutrition 0.000 description 19
- 230000003321 amplification Effects 0.000 description 19
- 238000003199 nucleic acid amplification method Methods 0.000 description 19
- 241000894007 species Species 0.000 description 19
- 241000282414 Homo sapiens Species 0.000 description 18
- 229940024606 amino acid Drugs 0.000 description 17
- 238000000338 in vitro Methods 0.000 description 17
- 238000003556 assay Methods 0.000 description 16
- 210000003705 ribosome Anatomy 0.000 description 16
- 238000012216 screening Methods 0.000 description 16
- 239000003550 marker Substances 0.000 description 15
- 230000006870 function Effects 0.000 description 14
- 150000003839 salts Chemical class 0.000 description 14
- 102000040430 polynucleotide Human genes 0.000 description 13
- 108091033319 polynucleotide Proteins 0.000 description 13
- 239000002157 polynucleotide Substances 0.000 description 13
- 238000002360 preparation method Methods 0.000 description 13
- 239000000047 product Substances 0.000 description 13
- 238000003786 synthesis reaction Methods 0.000 description 13
- 102000004190 Enzymes Human genes 0.000 description 12
- 108090000790 Enzymes Proteins 0.000 description 12
- 230000015572 biosynthetic process Effects 0.000 description 12
- 230000001413 cellular effect Effects 0.000 description 12
- 239000003153 chemical reaction reagent Substances 0.000 description 12
- 239000003446 ligand Substances 0.000 description 11
- 108020004999 messenger RNA Proteins 0.000 description 11
- 241001465754 Metazoa Species 0.000 description 10
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 10
- 230000002255 enzymatic effect Effects 0.000 description 10
- 230000008569 process Effects 0.000 description 10
- 239000000523 sample Substances 0.000 description 10
- 230000001225 therapeutic effect Effects 0.000 description 10
- 230000031018 biological processes and functions Effects 0.000 description 9
- 230000019491 signal transduction Effects 0.000 description 9
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 8
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 8
- 238000010171 animal model Methods 0.000 description 8
- 238000010367 cloning Methods 0.000 description 8
- 244000005700 microbiome Species 0.000 description 8
- 239000011541 reaction mixture Substances 0.000 description 8
- 239000000758 substrate Substances 0.000 description 8
- 239000000725 suspension Substances 0.000 description 8
- 238000010189 synthetic method Methods 0.000 description 8
- 230000001988 toxicity Effects 0.000 description 8
- 231100000419 toxicity Toxicity 0.000 description 8
- 239000012634 fragment Substances 0.000 description 7
- 230000009870 specific binding Effects 0.000 description 7
- 101710163270 Nuclease Proteins 0.000 description 6
- 102000035195 Peptidases Human genes 0.000 description 6
- 108091005804 Peptidases Proteins 0.000 description 6
- 239000004365 Protease Substances 0.000 description 6
- 108700008625 Reporter Genes Proteins 0.000 description 6
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 6
- 230000004071 biological effect Effects 0.000 description 6
- 239000000872 buffer Substances 0.000 description 6
- 235000014633 carbohydrates Nutrition 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 239000003599 detergent Substances 0.000 description 6
- 239000000284 extract Substances 0.000 description 6
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 6
- 230000002209 hydrophobic effect Effects 0.000 description 6
- 238000002347 injection Methods 0.000 description 6
- 239000007924 injection Substances 0.000 description 6
- 230000003993 interaction Effects 0.000 description 6
- 238000001243 protein synthesis Methods 0.000 description 6
- 238000000746 purification Methods 0.000 description 6
- 238000010532 solid phase synthesis reaction Methods 0.000 description 6
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical group [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 5
- 108020004705 Codon Proteins 0.000 description 5
- 206010028980 Neoplasm Diseases 0.000 description 5
- 239000004202 carbamide Substances 0.000 description 5
- 229910052799 carbon Inorganic materials 0.000 description 5
- 230000000295 complement effect Effects 0.000 description 5
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 5
- 239000003937 drug carrier Substances 0.000 description 5
- 239000012530 fluid Substances 0.000 description 5
- 239000000499 gel Substances 0.000 description 5
- 238000001415 gene therapy Methods 0.000 description 5
- 210000000056 organ Anatomy 0.000 description 5
- 102000005962 receptors Human genes 0.000 description 5
- 108020003175 receptors Proteins 0.000 description 5
- 238000011084 recovery Methods 0.000 description 5
- 238000010187 selection method Methods 0.000 description 5
- 230000002110 toxicologic effect Effects 0.000 description 5
- 231100000027 toxicology Toxicity 0.000 description 5
- 210000004881 tumor cell Anatomy 0.000 description 5
- 241001430294 unidentified retrovirus Species 0.000 description 5
- 241000282412 Homo Species 0.000 description 4
- 241000124008 Mammalia Species 0.000 description 4
- 210000004666 bacterial spore Anatomy 0.000 description 4
- 230000003197 catalytic effect Effects 0.000 description 4
- 230000003196 chaotropic effect Effects 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- 201000010099 disease Diseases 0.000 description 4
- 238000010494 dissociation reaction Methods 0.000 description 4
- 230000005593 dissociations Effects 0.000 description 4
- 239000008298 dragée Substances 0.000 description 4
- 238000012377 drug delivery Methods 0.000 description 4
- 239000003623 enhancer Substances 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 230000004927 fusion Effects 0.000 description 4
- ZRALSGWEFCBTJO-UHFFFAOYSA-O guanidinium Chemical compound NC(N)=[NH2+] ZRALSGWEFCBTJO-UHFFFAOYSA-O 0.000 description 4
- 238000004128 high performance liquid chromatography Methods 0.000 description 4
- 238000003018 immunoassay Methods 0.000 description 4
- 238000010348 incorporation Methods 0.000 description 4
- 230000003834 intracellular effect Effects 0.000 description 4
- 238000013508 migration Methods 0.000 description 4
- 230000005012 migration Effects 0.000 description 4
- 230000000144 pharmacologic effect Effects 0.000 description 4
- 210000002729 polyribosome Anatomy 0.000 description 4
- 231100000723 toxicological property Toxicity 0.000 description 4
- 241001515965 unidentified phage Species 0.000 description 4
- 230000003612 virological effect Effects 0.000 description 4
- 108090001008 Avidin Proteins 0.000 description 3
- 230000004568 DNA-binding Effects 0.000 description 3
- 241000702421 Dependoparvovirus Species 0.000 description 3
- 108010054278 Lac Repressors Proteins 0.000 description 3
- 239000004677 Nylon Substances 0.000 description 3
- 241000283973 Oryctolagus cuniculus Species 0.000 description 3
- 102000009661 Repressor Proteins Human genes 0.000 description 3
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 3
- 238000010521 absorption reaction Methods 0.000 description 3
- 239000000556 agonist Substances 0.000 description 3
- 239000007864 aqueous solution Substances 0.000 description 3
- 239000008280 blood Substances 0.000 description 3
- 239000002775 capsule Substances 0.000 description 3
- 238000000423 cell based assay Methods 0.000 description 3
- 230000021615 conjugation Effects 0.000 description 3
- 238000004132 cross linking Methods 0.000 description 3
- 230000003247 decreasing effect Effects 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 238000009472 formulation Methods 0.000 description 3
- 239000011521 glass Substances 0.000 description 3
- 230000000977 initiatory effect Effects 0.000 description 3
- 210000004185 liver Anatomy 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 210000003470 mitochondria Anatomy 0.000 description 3
- 229920001778 nylon Polymers 0.000 description 3
- 229940124276 oligodeoxyribonucleotide Drugs 0.000 description 3
- 239000000546 pharmaceutical excipient Substances 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 230000002285 radioactive effect Effects 0.000 description 3
- 235000008001 rakum palm Nutrition 0.000 description 3
- 230000002829 reductive effect Effects 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 239000000377 silicon dioxide Substances 0.000 description 3
- 230000009261 transgenic effect Effects 0.000 description 3
- 230000014621 translational initiation Effects 0.000 description 3
- 238000011282 treatment Methods 0.000 description 3
- 241000701161 unidentified adenovirus Species 0.000 description 3
- 238000011144 upstream manufacturing Methods 0.000 description 3
- FJKROLUGYXJWQN-UHFFFAOYSA-N 4-hydroxybenzoic acid Chemical compound OC(=O)C1=CC=C(O)C=C1 FJKROLUGYXJWQN-UHFFFAOYSA-N 0.000 description 2
- LRFVTYWOQMYALW-UHFFFAOYSA-N 9H-xanthine Chemical compound O=C1NC(=O)NC2=C1NC=N2 LRFVTYWOQMYALW-UHFFFAOYSA-N 0.000 description 2
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- 241000282693 Cercopithecidae Species 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 229930105110 Cyclosporin A Natural products 0.000 description 2
- PMATZTZNYRCHOR-CGLBZJNRSA-N Cyclosporin A Chemical compound CC[C@@H]1NC(=O)[C@H]([C@H](O)[C@H](C)C\C=C\C)N(C)C(=O)[C@H](C(C)C)N(C)C(=O)[C@H](CC(C)C)N(C)C(=O)[C@H](CC(C)C)N(C)C(=O)[C@@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)N(C)C(=O)[C@H](C(C)C)NC(=O)[C@H](CC(C)C)N(C)C(=O)CN(C)C1=O PMATZTZNYRCHOR-CGLBZJNRSA-N 0.000 description 2
- 108010036949 Cyclosporine Proteins 0.000 description 2
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 2
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 2
- FBPFZTCFMRRESA-JGWLITMVSA-N D-glucitol Chemical compound OC[C@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-JGWLITMVSA-N 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- 108090000204 Dipeptidase 1 Proteins 0.000 description 2
- 108700006830 Drosophila Antp Proteins 0.000 description 2
- 102000016359 Fibronectins Human genes 0.000 description 2
- 108010067306 Fibronectins Proteins 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 2
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 2
- 229930195725 Mannitol Natural products 0.000 description 2
- 239000000020 Nitrocellulose Substances 0.000 description 2
- 101710149951 Protein Tat Proteins 0.000 description 2
- 241000125945 Protoparvovirus Species 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- 241000702670 Rotavirus Species 0.000 description 2
- 229920002472 Starch Polymers 0.000 description 2
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 2
- DPXJVFZANSGRMM-UHFFFAOYSA-N acetic acid;2,3,4,5,6-pentahydroxyhexanal;sodium Chemical compound [Na].CC(O)=O.OCC(O)C(O)C(O)C(O)C=O DPXJVFZANSGRMM-UHFFFAOYSA-N 0.000 description 2
- 239000004480 active ingredient Substances 0.000 description 2
- 230000002411 adverse Effects 0.000 description 2
- 239000012491 analyte Substances 0.000 description 2
- 239000005557 antagonist Substances 0.000 description 2
- 230000001093 anti-cancer Effects 0.000 description 2
- 206010003246 arthritis Diseases 0.000 description 2
- 102000006635 beta-lactamase Human genes 0.000 description 2
- 238000005842 biochemical reaction Methods 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 210000004556 brain Anatomy 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 201000011510 cancer Diseases 0.000 description 2
- 239000001768 carboxy methyl cellulose Substances 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 210000000170 cell membrane Anatomy 0.000 description 2
- 125000003636 chemical group Chemical group 0.000 description 2
- 229960001265 ciclosporin Drugs 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 238000012411 cloning technique Methods 0.000 description 2
- 238000000576 coating method Methods 0.000 description 2
- 238000005094 computer simulation Methods 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 238000005859 coupling reaction Methods 0.000 description 2
- 210000000805 cytoplasm Anatomy 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 239000003085 diluting agent Substances 0.000 description 2
- 239000000975 dye Substances 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 230000009881 electrostatic interaction Effects 0.000 description 2
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 2
- 210000003608 fece Anatomy 0.000 description 2
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 235000014304 histidine Nutrition 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- JYGXADMDTFJGBT-VWUMJDOOSA-N hydrocortisone Chemical compound O=C1CC[C@]2(C)[C@H]3[C@@H](O)C[C@](C)([C@@](CC4)(O)C(=O)CO)[C@@H]4[C@@H]3CCC2=C1 JYGXADMDTFJGBT-VWUMJDOOSA-N 0.000 description 2
- 230000001900 immune effect Effects 0.000 description 2
- 238000000099 in vitro assay Methods 0.000 description 2
- 238000010874 in vitro model Methods 0.000 description 2
- 230000002427 irreversible effect Effects 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- 239000008101 lactose Substances 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 210000005265 lung cell Anatomy 0.000 description 2
- 239000000594 mannitol Substances 0.000 description 2
- 235000010355 mannitol Nutrition 0.000 description 2
- 238000004949 mass spectrometry Methods 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 230000003228 microsomal effect Effects 0.000 description 2
- 229920001220 nitrocellulos Polymers 0.000 description 2
- 239000004031 partial agonist Substances 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- 230000035699 permeability Effects 0.000 description 2
- 150000004713 phosphodiesters Chemical class 0.000 description 2
- 239000013600 plasmid vector Substances 0.000 description 2
- 239000003755 preservative agent Substances 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 230000035755 proliferation Effects 0.000 description 2
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 238000010839 reverse transcription Methods 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 238000007423 screening assay Methods 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 235000019812 sodium carboxymethyl cellulose Nutrition 0.000 description 2
- 229920001027 sodium carboxymethylcellulose Polymers 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 239000000600 sorbitol Substances 0.000 description 2
- 239000003381 stabilizer Substances 0.000 description 2
- 238000012289 standard assay Methods 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 235000000346 sugar Nutrition 0.000 description 2
- 150000008163 sugars Chemical class 0.000 description 2
- 108700004027 tat Genes Proteins 0.000 description 2
- 101150098170 tat gene Proteins 0.000 description 2
- 231100000041 toxicology testing Toxicity 0.000 description 2
- 230000005029 transcription elongation Effects 0.000 description 2
- 210000002700 urine Anatomy 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- QIJRTFXNRTXDIP-UHFFFAOYSA-N (1-carboxy-2-sulfanylethyl)azanium;chloride;hydrate Chemical compound O.Cl.SCC(N)C(O)=O QIJRTFXNRTXDIP-UHFFFAOYSA-N 0.000 description 1
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- LNAZSHAWQACDHT-XIYTZBAFSA-N (2r,3r,4s,5r,6s)-4,5-dimethoxy-2-(methoxymethyl)-3-[(2s,3r,4s,5r,6r)-3,4,5-trimethoxy-6-(methoxymethyl)oxan-2-yl]oxy-6-[(2r,3r,4s,5r,6r)-4,5,6-trimethoxy-2-(methoxymethyl)oxan-3-yl]oxyoxane Chemical compound CO[C@@H]1[C@@H](OC)[C@H](OC)[C@@H](COC)O[C@H]1O[C@H]1[C@H](OC)[C@@H](OC)[C@H](O[C@H]2[C@@H]([C@@H](OC)[C@H](OC)O[C@@H]2COC)OC)O[C@@H]1COC LNAZSHAWQACDHT-XIYTZBAFSA-N 0.000 description 1
- BEJKOYIMCGMNRB-GRHHLOCNSA-N (2s)-2-amino-3-(4-hydroxyphenyl)propanoic acid;(2s)-2-amino-3-phenylpropanoic acid Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1.OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BEJKOYIMCGMNRB-GRHHLOCNSA-N 0.000 description 1
- IXPNQXFRVYWDDI-UHFFFAOYSA-N 1-methyl-2,4-dioxo-1,3-diazinane-5-carboximidamide Chemical compound CN1CC(C(N)=N)C(=O)NC1=O IXPNQXFRVYWDDI-UHFFFAOYSA-N 0.000 description 1
- IIZPXYDJLKNOIY-JXPKJXOSSA-N 1-palmitoyl-2-arachidonoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCC\C=C/C\C=C/C\C=C/C\C=C/CCCCC IIZPXYDJLKNOIY-JXPKJXOSSA-N 0.000 description 1
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical compound OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 description 1
- 229940090248 4-hydroxybenzoic acid Drugs 0.000 description 1
- MSSXOMSJDRHRMC-UHFFFAOYSA-N 9H-purine-2,6-diamine Chemical compound NC1=NC(N)=C2NC=NC2=N1 MSSXOMSJDRHRMC-UHFFFAOYSA-N 0.000 description 1
- 208000030507 AIDS Diseases 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 102100027211 Albumin Human genes 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000416162 Astragalus gummifer Species 0.000 description 1
- 241000282472 Canis lupus familiaris Species 0.000 description 1
- 201000009030 Carcinoma Diseases 0.000 description 1
- 108010001857 Cell Surface Receptors Proteins 0.000 description 1
- 229920002261 Corn starch Polymers 0.000 description 1
- 241000938605 Crocodylia Species 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
Classifications
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2/00—Peptides of undefined number of amino acids; Derivatives thereof
Definitions
- The present invention relates generally to the field of molecular biology, and in particular to methods for identifying nucleic acid molecules and polypeptides.
- Efforts to identify polypeptides that have biological activity such as enzymatic activity or binding activity and the nucleic acid molecules that encode the polypeptide have utilized a variety of methods. Such methods include genomics and combinatorial biology.
- Genomics generally identifies nucleic acid molecules within a genome, often without regard to the function of the nucleic acid molecule or the polypeptide encoded thereby. Genomics tends to provide information as to the sequence or partial sequence of a nucleic acid molecule but does not provide significant information as to the function of the nucleic acid sequence or the polypeptide encoded thereby.
- The outcome of genomics is generally the identification of expressed sequence tags (ESTs) or the trapping of promoters or genes.
- Functional genomics generally attempts to contemporaneously identify a gene and its function. Functional genomics relies on the use of cell-based or organism-based assay systems or comparative analyses, which tend to be cumbersome, complicated, time-consuming and expensive.
- Combinatorial biology generally identifies nucleic acid sequences and polypeptides encoded thereby that are not isolated from a biological source but nonetheless have a biological activity.
- Combinatorial biology provides random or semi-random groups (or libraries) of nucleic acid molecules or polypeptides. The libraries are screened for an activity, such as a binding activity.
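The idea of building a random library and screening it for an activity can be sketched in a few lines of Python. This is only a toy illustration, not a method from this document: the library size, the insert length, and the use of a peptide motif as a stand-in for a binding assay are all assumptions, and the function names `random_library`, `translate`, and `screen` are hypothetical. The codon table is the standard genetic code.

```python
import random

# Standard genetic code, built from the compact TCAG-ordered table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate from the first base, in frame, stopping at the first stop codon."""
    peptide = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        peptide.append(amino_acid)
    return "".join(peptide)


def random_library(size, codons, seed=0):
    """Return `size` random DNA inserts, each `codons` codons long (assumed sizes)."""
    rng = random.Random(seed)
    return [
        "".join(rng.choice(BASES) for _ in range(3 * codons))
        for _ in range(size)
    ]


def screen(library, motif):
    """Keep (dna, peptide) pairs whose peptide contains `motif`.

    The motif test is a hypothetical stand-in for a real binding assay.
    """
    hits = []
    for dna in library:
        peptide = translate(dna)
        if motif in peptide:
            hits.append((dna, peptide))
    return hits


if __name__ == "__main__":
    library = random_library(size=1000, codons=8, seed=1)
    for dna, peptide in screen(library, "RG")[:5]:
        print(dna, peptide)
```

Each hit keeps the DNA together with its peptide, which mirrors the point of a screen: the selected activity must remain traceable to the sequence that encodes it.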
- One method of combinatorial biology, known as SELEX, relies on the folding of RNA molecules to provide an RNA molecule that has receptor-ligand binding capabilities. This type of receptor-ligand binding, though interesting, is a rather rare event in cellular processes.
- Another method of combinatorial biology provides a library of bacteriophages that display a variety of random polypeptides on their surface.
- The genome of the bacteriophage includes the nucleic acid molecule that encodes the random polypeptide displayed on the surface.
- The binding of a phage to a receptor during a "panning" procedure results in the isolation of a bacteriophage that includes the random polypeptide and the encoding nucleic acid molecule.
- This type of combinatorial biology results in the identification of interesting polypeptides and nucleic acid sequences, but the methods used rely on complex in vivo biological processes to produce bacteriophage. These complex processes tend to reduce the complexity of the combinatorial biology libraries and make these methods not particularly suitable for automation.
- A random nucleic acid sequence can be made part of an RNA molecule that is translated by a plurality of ribosomes to form a polysome.
- The polysome structure can be "stalled" such that the RNA molecule is attached to the ribosomes and a partially translated polypeptide. This stalling can be accomplished using a variety of methods, such as lowering the temperature and adding chemicals to stabilize the polypeptide-ribosome-RNA ternary structure.
- The stalled polysome structures can then be panned for binding to a ligand to identify polysomes that include random nucleic acid molecules that encode polypeptides that can bind with a ligand.
- Another method for in vitro combinatorial biology uses RNA-protein fusions. This method relies on a ribosome incorporating puromycin, which is ligated to the 3'-end of a messenger RNA, into the C-terminus of a polypeptide. The RNA and polypeptide are thereby linked, or "fused," by a covalent bond. The ribosome is then dissociated from the translational machinery and the RNA-polypeptide fusion can be screened against target molecules. The RNA of the selected RNA-polypeptide fusion can be enriched by reverse-transcription polymerase chain reaction (RT-PCR).
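Why repeated rounds of screening and amplification enrich a rare binder can be shown with simple arithmetic. The sketch below is a toy model under stated assumptions, not data or a procedure from this document: each round is assumed to recover a fixed fraction of binders (`binder_recovery`) and a smaller fixed fraction of non-binders (`background_recovery`), and amplification is assumed to preserve the proportions of what was recovered. All names and numbers are hypothetical.

```python
def enrich(binder_fraction, binder_recovery, background_recovery, rounds):
    """Return the binder fraction before selection and after each round.

    Toy model: a round keeps `binder_recovery` of the binders and
    `background_recovery` of everything else; amplification afterwards is
    assumed not to change the proportions.
    """
    if not 0.0 < binder_fraction <= 1.0:
        raise ValueError("binder_fraction must be in (0, 1]")
    if binder_recovery <= 0.0 or background_recovery < 0.0:
        raise ValueError("recoveries must be positive (background may be 0)")

    history = [binder_fraction]
    fraction = binder_fraction
    for _ in range(rounds):
        kept_binders = fraction * binder_recovery
        kept_background = (1.0 - fraction) * background_recovery
        fraction = kept_binders / (kept_binders + kept_background)
        history.append(fraction)
    return history


if __name__ == "__main__":
    # Hypothetical numbers: one binder per million, 100-fold selectivity per round.
    for round_number, fraction in enumerate(enrich(1e-6, 0.5, 0.005, 4)):
        print(round_number, f"{fraction:.3g}")
```

In this model, the ratio of binders to non-binders is multiplied each round by the ratio of the two recoveries, so even a modest per-round selectivity compounds quickly.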
- Ligating puromycin to the 3'-end of RNA is not efficient, and RNA is easily degraded during experiments. More importantly, both of these methods require a separate step to transcribe the DNA template and subsequently purify the RNA transcript for the translation reaction. Therefore, it is cumbersome to perform the experiment and the entire procedures
- The present invention provides methods and articles of manufacture that address the problems associated with combinatorial biology.
- The present invention provides related benefits as well.
- FIG. 1 illustrates one aspect of structural features of the DNA construct in a transcription ternary complex.
- FIG. 2 illustrates one aspect of structural features of the DNA construct in a transcription ternary complex.
- FIG. 3 illustrates one aspect of structural features of the DNA construct in a transcription ternary complex.
- FIG. 4 illustrates one aspect of structural features of the DNA construct in a transcription ternary complex.
- FIG. 5 illustrates one method of linking a DNA molecule to its own encoded peptide and selecting the desired DNA-polypeptide complexes.
- FIG. 6 illustrates one aspect of making a DNA-polypeptide complex from natural mRNA and selecting desired DNA-polypeptide complexes.
- FIG. 7 depicts one structural aspect of a single-stranded DNA molecule of the present invention.
- FIG. 8 depicts one structural aspect of a single-stranded DNA molecule of the present invention.
- FIG. 9 depicts a schematic diagram of one aspect of the present invention.
- FIG. 10 depicts a schematic diagram of one aspect of the present invention.
- FIG. 11 illustrates one aspect of making a DNA-polypeptide complex from natural mRNA and selecting desired DNA-polypeptide complexes.
- The present invention provides compositions and efficient methods for identifying nucleic acids and the peptides and polypeptides encoded thereby. These methods are performed using translation systems and methods, or using transcription and translation systems or methods. When transcription and translation methods are used, the transcription and translation reactions may or may not be coupled. The result of these methods is a complex that includes a polypeptide that is covalently or non-covalently linked with its own encoding nucleic acid molecule.
- The nucleic acid molecule can comprise a moiety that links the nucleic acid molecule to its own encoded polypeptide.
- A first aspect of the present invention is a nucleic acid molecule that comprises a transcription regulatory region, a transcription termination moiety, and, preferably, a linking moiety that can directly or indirectly link the nucleic acid molecule with a peptide, and that encodes an open reading frame for a peptide or polypeptide.
- The nucleic acid molecule can optionally encode a ribosome binding RNA sequence.
- The nucleic acid molecule can be provided in a vector.
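The elements named for the nucleic acid molecule of the first aspect can be captured in a small record type, which is handy when tracking library members in software. The sketch below is a bookkeeping illustration only. The class name `Construct`, its field names, and the placeholder values are hypothetical, and the in-frame stop codon check is an assumed convenience for inspecting an open reading frame, not a requirement stated in this document.

```python
from dataclasses import dataclass
from typing import List, Optional

STOP_CODONS = {"TAA", "TAG", "TGA"}


@dataclass
class Construct:
    """Bookkeeping record for one nucleic acid molecule (hypothetical layout)."""

    transcription_regulatory_region: str
    open_reading_frame: str
    transcription_termination_moiety: str
    linking_moiety: Optional[str] = None             # described as preferable
    ribosome_binding_sequence: Optional[str] = None  # described as optional

    def optional_elements(self) -> List[str]:
        """Names of the optional elements that this record actually carries."""
        present = []
        if self.linking_moiety is not None:
            present.append("linking_moiety")
        if self.ribosome_binding_sequence is not None:
            present.append("ribosome_binding_sequence")
        return present

    def in_frame_stops(self) -> List[int]:
        """0-based positions of stop codons in the reading frame of the ORF."""
        orf = self.open_reading_frame.upper()
        return [
            pos
            for pos in range(0, len(orf) - 2, 3)
            if orf[pos:pos + 3] in STOP_CODONS
        ]


if __name__ == "__main__":
    # All values below are placeholders, not sequences from this document.
    example = Construct(
        transcription_regulatory_region="promoter (placeholder)",
        open_reading_frame="ATGGCCTGAAAA",
        transcription_termination_moiety="terminator (placeholder)",
        linking_moiety="linker (placeholder)",
    )
    print(example.optional_elements(), example.in_frame_stops())
```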
- A second aspect of the present invention is a library of nucleic acid molecules of the first aspect of the present invention, either alone, linked with their own encoded polypeptides or with a substance of interest, or as part of a vector.
- A third aspect of the present invention is a method of linking a nucleic acid molecule of the present invention to a peptide that is encoded by the nucleic acid molecule.
- The method includes: transcribing at least a portion of a nucleic acid molecule of the present invention using at least one RNA polymerase, such that at least one RNA polymerase terminates at a transcription termination site so that transcription elongation ternary complexes, which comprise a nucleic acid molecule template, an RNA transcript, and RNA polymerase, are formed.
- Translation systems are employed that can translate, at least in part, the RNA template in the complex.
- The peptide or polypeptide translated from the RNA template can preferably bind or be coupled to a linking moiety that is bound by the nucleic acid molecule.
- The result of these procedures is a peptide or polypeptide that is directly or indirectly bound to the nucleic acid molecule that encodes the polypeptide.
- A fourth aspect of the present invention is a method for identifying a nucleic acid molecule using the methods of the second aspect of the present invention.
- A fifth aspect of the present invention is a method for identifying a polypeptide using the methods of the second aspect of the present invention.
- A sixth aspect of the present invention is a DNA molecule that 1) comprises a linking moiety and 2) encodes an open reading frame for a peptide or polypeptide. Also, the DNA molecule can contain a ribosome binding sequence (RBS).
- a seventh aspect of the present invention is a library of DNA molecules of the sixth aspect of the present invention, either alone, linked with their own encoded polypeptides, or with a substance of interest or as part of a vector.
- An eighth aspect of the present invention is a method of linking a DNA molecule of the seventh aspect of the present invention to a peptide that is encoded by at least a portion of the nucleic acid molecule.
- the method includes: translating a DNA that comprises an open reading frame and a linking moiety or binding region so that complexes that comprise the polypeptide encoded by the open reading frame become bound to the DNA molecule.
- the result of these procedures is a peptide or polypeptide that is directly or indirectly bound to the nucleic acid molecule that encodes the polypeptide.
- a ninth aspect of the present invention is a method for identifying a DNA molecule using the methods of the eighth aspect of the present invention.
- a tenth aspect of the present invention is a method for identifying a polypeptide using the methods of the eighth aspect of the present invention.
- An eleventh aspect of the present invention is methods of identifying test compounds using the methods of the present invention, and test compounds and pharmaceutical compositions identified by such methods.
- A twelfth aspect of the present invention is methods of identifying targets using the methods of the present invention, and targets identified by such methods.
- Membrane permeable derivative refers to a chemical derivative of a compound that increases membrane permeability of the compound. These derivatives are made better able to cross cell membranes because hydrophilic groups are masked to provide more hydrophobic derivatives. Also, the masking groups can be designed to be cleaved from the compound within a cell to make the compound more hydrophilic once within the cell. Because the substrate is more hydrophilic than the membrane permeant derivative, it preferentially localizes within the cell (U.S. Patent No. 5,741,657 to Tsien et al., issued April 21, 1998).
- isolated polynucleotide refers to a polynucleotide of genomic, cDNA, or synthetic origin, or some combination thereof, which by virtue of its origin, the isolated polynucleotide (1) is not associated with the cell in which the isolated polynucleotide is found in nature, or (2) is operably linked to a polynucleotide that it is not linked to in nature.
- the isolated polynucleotide can optionally be linked to promoters, enhancers, or other regulatory sequences.
- isolated protein refers to a protein of cDNA, RNA derived from cDNA, DNA, RNA or synthetic origin, or some combination thereof, which by virtue of its origin the isolated protein (1) is not associated with proteins normally found within nature, or (2) is isolated from the cell in which it normally occurs, or (3) is isolated free of other proteins from the same cellular source (for example, free of cellular proteins), or (4) is expressed by a cell from a different species, or (5) does not occur in nature.
- Peptide is a sequence of two or more amino acids joined by peptide bonds. Peptides can include other moieties, such as chemical groups, drug molecules, detectable labels, or specific binding members that are reversibly or irreversibly bound to one or more amino acids of the peptide.
- Polypeptide is used herein as a generic term to refer to protein or fragments or analogs of a protein.
- Active fragment refers to a fragment of a parent molecule, such as an organic molecule, nucleic acid molecule, or protein or polypeptide, or combinations thereof, that retains at least one activity of the parent molecule.
- Naturally occurring refers to the fact that an object can be found in nature.
- a polypeptide or polynucleotide sequence that is present in an organism, including viruses, that can be isolated from a source in nature and which has not been intentionally modified by man in the laboratory is naturally occurring.
- Operably linked refers to a juxtaposition wherein the components so described are in a relationship permitting them to function in their intended manner.
- a control sequence operably linked to a coding sequence is ligated in such a way that expression of the coding sequence is achieved under conditions compatible with the control sequences.
- Control sequences refers to polynucleotide sequences that effect the expression of coding and non-coding sequences to which they are operably linked. When the control sequences are used to control the transcription of DNA template, it is also called a transcription regulatory region. The nature of such control sequences differs depending upon the host organisms and enzymes; in prokaryotes, such control sequences generally include promoter, ribosomal binding site, and translation initiation and termination codons; in eukaryotes, generally, such control sequences include promoters and translation initiation and termination sequences.
- the term control sequences is intended to include components whose presence can influence expression, and can also include additional components whose presence is advantageous, for example, leader sequences and fusion partner sequences.
- A “transcription regulatory region” is a region of a nucleic acid that controls the transcription of a nucleic acid sequence to which it is operably linked.
- Ribosome binding site or “ribosome binding sequence” or “RBS” is a nucleotide sequence that allows the binding of the ribosome to a nucleic acid molecule.
- Ribosome binding sites known in the art that allow for ribosome binding and the initiation of translation are, for example, Shine-Dalgarno sequences, Kozak sequences, and IRES sequences.
- Shine-Dalgarno sequences and Kozak sequences can be identified canonical sequences, or substantially homologous sequences that can be bound by ribosomes and thereby initiate translation.
- IRES sequences can be those already identified, or any identified in the future, such as by functional assay.
- A “ribosome RNA binding sequence” specifies that the nucleotide sequence consists of or essentially consists of an RNA sequence.
- Nucleic acid or “nucleic acid molecule” or “polynucleotide” refers to a polymeric form of nucleotides of at least ten bases in length.
- a nucleic acid molecule can be DNA, RNA, or a combination of both.
- a nucleic acid molecule can also include sugars other than ribose and deoxyribose incorporated into the backbone, and thus can be other than DNA or RNA.
- a nucleic acid can comprise nucleobases that are naturally occurring or that do not occur in nature, such as xanthine, derivatives of nucleobases such as 2-aminoadenine and the like.
- a nucleic acid molecule of the present invention can have linkages other than phosphodiester linkages.
- a nucleic acid molecule can also be a peptide nucleic acid molecule (PNA) or can comprise PNA residues.
- A nucleic acid molecule can be of any length, and can be single-stranded or double-stranded, or partially single-stranded and partially double-stranded.
- a nucleic acid molecule can comprise other entities, such as drug molecules, detectable labels, linking moieties, or specific binding members.
- Directly, in the context of a biological process or processes, refers to direct causation of a process that does not require intermediate steps, usually caused by one molecule contacting or binding to another molecule (the same type or different type of molecule). For example, molecule A contacts molecule B, which can cause molecule B to exert effect X that is part of a biological process. In terms of binding, “directly” means that molecule A contacts and binds molecule B without intermediate molecules that mediate the binding.
- Indirectly in the context of a biological process or processes, refers to indirect causation that requires intermediate steps, usually caused by two or more direct steps. For example, molecule A contacts molecule B to exert effect X which in turn causes effect Y. In terms of binding, “indirectly” means that molecule A binds molecule B by contacting at least one intermediate molecule that mediates the binding.
- Sequence homology refers to the proportion of base matches between two nucleic acid sequences or the proportion of amino acid matches between two amino acid sequences. When sequence homology is expressed as a percentage, for example 50%, the percentage denotes the proportion of matches over the length of sequence from a desired sequence that is compared to some other sequence. Gaps (in either of the two sequences) are permitted to maximize matching; gap lengths of 15 bases or less are usually used, with 6 bases or less preferred and 2 bases or less more preferred.
- the sequence homology between the target nucleic acid and the oligonucleotide sequence is generally not less than 17 target base matches out of 20 possible oligonucleotide base pair matches (85%); preferably not less than 9 matches out of 10 possible base pair matches (90%), and most preferably not less than 19 matches out of 20 possible base pair matches (95%).
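- The percentage thresholds above (17 of 20, 9 of 10, 19 of 20) can be illustrated with a minimal Python sketch. It assumes the two sequences are already aligned to equal length and ignores gaps; the function names and tier labels are hypothetical and are not taken from the specification.

```python
def percent_matches(target: str, oligo: str) -> float:
    """Percentage of aligned positions at which two equal-length sequences match.

    Assumption: the sequences are already aligned; gap handling is not modeled.
    """
    if len(target) != len(oligo):
        raise ValueError("sequences must be aligned to the same length")
    if not target:
        raise ValueError("sequences must be non-empty")
    matches = sum(1 for a, b in zip(target.upper(), oligo.upper()) if a == b)
    return 100.0 * matches / len(target)


def homology_tier(pct: float) -> str:
    """Map a percentage onto the three thresholds stated in the text."""
    if pct >= 95.0:
        return "most preferred"   # not less than 19 of 20
    if pct >= 90.0:
        return "preferred"        # not less than 9 of 10
    if pct >= 85.0:
        return "general"          # not less than 17 of 20
    return "below threshold"


target = "ACGTACGTACGTACGTACGT"
oligo = "TCGTAGGTACATACGTACGT"   # differs from target at three positions
print(percent_matches(target, oligo), homology_tier(percent_matches(target, oligo)))
```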
- Corresponds to refers to a polynucleotide sequence that is homologous (for example is identical, not strictly evolutionarily related) to all or a portion of a reference polynucleotide sequence, or to a polypeptide sequence that is identical to all or a portion of a reference polypeptide sequence.
- the term “complementary to” is used herein to mean that the complementary sequence will base pair with all or a portion of a reference polynucleotide sequence.
- the nucleotide sequence TATAC corresponds to a reference sequence TATAC and is complementary to a reference sequence GTATA.
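- The TATAC example can be checked with a short Python sketch. It assumes a plain DNA alphabet (A, C, G, T) and Watson-Crick pairing only; the function names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the strand that base pairs with seq, written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def corresponds_to(query: str, reference: str) -> bool:
    """True if query is identical to all or a portion of reference."""
    return query.upper() in reference.upper()


def is_complementary_to(query: str, reference: str) -> bool:
    """True if query will base pair with all or a portion of reference."""
    return reverse_complement(query) in reference.upper()


print(corresponds_to("TATAC", "TATAC"))        # identity check
print(is_complementary_to("TATAC", "GTATA"))   # base-pairing check
```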
- Conservative amino acid substitutions refer to the interchangeability of residues having similar side chains.
- A group of amino acids having aliphatic side chains is glycine, alanine, valine, leucine, and isoleucine; a group of amino acids having aliphatic-hydroxyl side chains is serine and threonine; a group of amino acids having amide-containing side chains is asparagine and glutamine; a group of amino acids having aromatic side chains is phenylalanine, tyrosine and tryptophan; a group of amino acids having basic side chains is lysine, arginine and histidine; a group of amino acids having acidic side chains is aspartic acid and glutamic acid; and a group of amino acids having sulfur-containing side chains is cysteine and methionine.
- Preferred conservative amino acid substitution groups are: valine-leucine-isoleucine; phenylalanine-tyrosine; lysine-arginine; alanine-valine; glutamic acid-aspartic acid; and asparagine-glutamine.
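- The side-chain groups listed above can be expressed as a lookup, sketched here in Python. The standard one-letter amino acid codes and the function names are assumptions introduced for illustration; the group memberships are those stated in the text.

```python
# Side-chain groups as listed in the text, using standard one-letter codes.
SIDE_CHAIN_GROUPS = {
    "aliphatic": {"G", "A", "V", "L", "I"},
    "aliphatic-hydroxyl": {"S", "T"},
    "amide-containing": {"N", "Q"},
    "aromatic": {"F", "Y", "W"},
    "basic": {"K", "R", "H"},
    "acidic": {"D", "E"},
    "sulfur-containing": {"C", "M"},
}

# Preferred conservative substitution groups as listed in the text.
PREFERRED_GROUPS = [
    {"V", "L", "I"},
    {"F", "Y"},
    {"K", "R"},
    {"A", "V"},
    {"E", "D"},
    {"N", "Q"},
]


def is_conservative(a: str, b: str) -> bool:
    """True if two different residues share one of the side-chain groups."""
    a, b = a.upper(), b.upper()
    return a != b and any(a in g and b in g for g in SIDE_CHAIN_GROUPS.values())


def is_preferred_conservative(a: str, b: str) -> bool:
    """True if two different residues share one of the preferred groups."""
    a, b = a.upper(), b.upper()
    return a != b and any(a in g and b in g for g in PREFERRED_GROUPS)


print(is_conservative("G", "A"), is_preferred_conservative("G", "A"))
```

- In this sketch a glycine-to-alanine change counts as conservative (both are aliphatic) but not as one of the preferred substitutions.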
- Test compound refers to a chemical, compound, composition or extract to be tested by at least one method of the present invention for at least one activity, such as putative modulation of a biological process or specific binding capability.
- Test compounds can include small molecules, drugs, proteins or peptides or active fragments thereof, such as antibodies or fragments or active fragments thereof, nucleic acid molecules such as DNA, RNA or combinations thereof, antisense molecules or ribozymes, or other organic or inorganic molecules, such as lipids, carbohydrates, or any combinations thereof.
- Test compounds that include nucleic acid molecules can be provided in a vector, such as a viral vector, such as a retrovirus, adenovirus or adeno-associated virus, a liposome, a plasmid or with a lipofection agent.
- Test compounds, once identified, can be agonists, antagonists, partial agonists or inverse agonists of a target.
- a test compound is usually not known to bind to the target of interest.
- Control test compound refers to a compound known to bind to the target (for example, a known agonist, antagonist, partial agonist or inverse agonist). Test compound does not typically include a compound added to a mixture as a control condition that alters the function of the target to determine signal specificity in an assay.
- Control compounds or conditions include chemicals that (1) non-specifically or substantially disrupt protein structure (for example denaturing agents such as urea or guanidinium, sulfhydryl reagents such as dithiothreitol and beta-mercaptoethanol), (2) generally inhibit cell metabolism (for example mitochondrial uncouplers) and (3) non-specifically disrupt electrostatic or hydrophobic interactions of a protein (for example, high salt concentrations or detergents at concentrations sufficient to non-specifically disrupt hydrophobic or electrostatic interactions).
- Test compound also does not typically include compounds known to be unsuitable for a therapeutic use for a particular indication due to toxicity to the subject. Usually, various predetermined concentrations of test compounds are used for determining their activity.
- The concentration of test compound used can be expressed on a weight to volume basis. Under these circumstances, the following ranges of concentrations can be used: between about 0.001 micrograms/ml and about 1 milligram/ml, preferably between about 0.01 micrograms/ml and about 100 micrograms/ml, and more preferably between about 0.1 micrograms/ml and about 10 micrograms/ml.
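- The nested concentration ranges above can be written as a small Python lookup. This is only a sketch: the text gives the bounds as approximate ("about"), whereas the code treats them as exact, and the function name and labels are hypothetical.

```python
# Ranges in micrograms/ml, narrowest first. 1 milligram/ml = 1000 micrograms/ml.
RANGES_UG_PER_ML = [
    ("more preferred", 0.1, 10.0),
    ("preferred", 0.01, 100.0),
    ("general", 0.001, 1000.0),
]


def classify_concentration(ug_per_ml: float) -> str:
    """Return the narrowest stated range that contains the concentration."""
    for name, low, high in RANGES_UG_PER_ML:
        if low <= ug_per_ml <= high:
            return name
    return "outside stated ranges"


for value in (1.0, 50.0, 500.0, 5000.0):
    print(value, classify_concentration(value))
```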
- Target refers to a biochemical entity involved in a biological process. Targets are typically proteins that play a useful role in the physiology or biology of an organism. A therapeutic composition or compound typically binds to a target to alter or modulate its function. As used herein, targets can include, but are not limited to, cell surface receptors, G-proteins, G-protein coupled receptors, kinases, phosphatases, ion channels, lipases, phospholipases, nuclear receptors, transcription factors, intracellular structures, tubules, tubulin, antibodies and the like.
- a “therapeutic target” or a “pharmaceutical target” is a target that when modulated can have a therapeutic effect.
- a “purification target” is a target that is useful in purification schemes, such as, for example, regions of antibodies such as the Fc region.
- a “diagnostic target” is a target that is useful in diagnostics, such as cell surface epitopes or markers on etiological agents.
- Label refers to incorporation of a marker which may or may not be used for detection purposes, for example by incorporation of a radiolabeled compound or attachment to a polypeptide of moieties such as biotin that can be detected by the binding of a second moiety, such as marked avidin. On the other hand, if a protein is labeled with biotin, the protein can be attached to a nucleic acid that is labeled with avidin. Thus, the protein and nucleic acid can form a complex.
- Various methods of labeling polypeptides, nucleic acids, carbohydrates, and other biological or organic molecules are known in the art.
- Such labels can have a variety of readouts, such as radioactivity, fluorescence, color, chemiluminescence or other readouts known in the art or later developed.
- The readouts can be based on enzymatic activity, such as beta-galactosidase, beta-lactamase, horseradish peroxidase, alkaline phosphatase, or luciferase; radioisotopes (such as 3H, 14C, 35S, 32P, 125I or 131I); fluorescent proteins, such as green fluorescent proteins; or other fluorescent labels, such as FITC, rhodamine, and lanthanides. Where appropriate, these labels can be the product of the expression of reporter genes, as that term is understood in the art.
- reporter genes are beta-lactamase (U.S. Patent No. 5,741,657 to Tsien et al., issued April 21, 1998) and green fluorescent protein (U.S. Patent No. 5,777,079 to Tsien et al., issued July 7, 1998; U.S. Patent No. 5,804,387 to Cormack et al., issued September 8, 1998).
- Specific binding member is one of two different molecules having an area on the surface or in a cavity which specifically binds to and is thereby defined as complementary with a particular spatial and polar organization of the other molecule.
- a specific binding member can be a member of an immunological pair such as antigen-antibody, biotin-avidin, hormone-hormone receptor, nucleic acid duplexes, IgG-protein A, DNA-DNA, DNA-RNA, and the like.
- substantially pure refers to an object species or activity that is the predominant species or activity present (for example on a molar basis it is more abundant than any other individual species or activities in the composition) and preferably a substantially purified fraction is a composition wherein the object species or activity comprises at least about 50 percent (on a molar, weight or activity basis) of all macromolecules or activities present.
- A substantially pure composition will comprise more than about 80 percent of all macromolecular species or activities present in a composition, more preferably more than about 85%, 90%, 95% and 99%.
- The object species or activity is purified to essential homogeneity (contaminant species or activities cannot be detected by conventional detection methods), wherein the composition consists essentially of a single macromolecular species or activity.
- an activity may be caused, directly or indirectly, by a single species or a plurality of species within a composition, particularly with extracts.
- “Pharmaceutical agent or drug” refers to a chemical, composition or activity capable of inducing a desired therapeutic effect when properly administered by an appropriate dose, regime, route of administration, time and delivery modality.
- Sample means any biological sample, preferably derived from a test animal, such as a mouse, rat, rabbit or monkey, or a patient, such as a human. Samples can be from any tissue or fluid, such as neural tissues, central nervous tissues, internal organs such as pancreas, liver, lung, kidney, muscle, skeletal muscle, urine, feces, blood, fluids from body cavities or the central nervous system, or samples from various body cavities such as the mouth or nose. Samples derived from urine and feces contain cells of the immunological, urinary or digestive tract and can be a rich source of sample. Such samples can be obtained using methods known in the art, such as biopsies, aspirations, scrapings or simple collection. A sample can be taken from a test animal or patient that is either living or dead.
- Ribozyme means enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA.
- the mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme molecule to complementary target RNA, followed by endonucleolytic cleavage.
- A “DNA” molecule refers to either single- or double-stranded deoxyribonucleic acid.
- A DNA molecule can include nucleotide analogues or derivatives, such as dideoxynucleotides, or nucleotides comprising non-naturally occurring bases, such as inosine, and can comprise one or more linkages other than phosphodiester linkages, such as, for example, phosphoramide or phosphorothioate linkages.
- a DNA molecule can also comprise other chemical moieties, such as labels, specific binding members, and linking moieties.
- a “moiety” refers to any chemical or biochemical structure.
- a moiety is a structure that can be incorporated into, or covalently or noncovalently, reversibly or irreversibly bound to nucleic acid, polypeptide, or both.
- A “transcription termination moiety” refers to a region or structure of a nucleic acid molecule, a protein or polypeptide, or a compound or any other entity or moiety, that is directly or indirectly linked to a nucleic acid molecule and impedes the progress of, stalls, or stops the functional migration of an RNA polymerase along the nucleic acid template.
- linking moiety refers to an entity that is capable of directly or indirectly linking a peptide or polypeptide to a nucleic acid molecule, such as its own encoding nucleic acid molecule.
- a linking moiety can be a compound or an entity directly or indirectly linked to a nucleic acid.
- A puromycin molecule bound to a DNA molecule can be a linking moiety that can be incorporated into the polypeptide encoded by the DNA molecule by a ribosome.
- Examples of linking moieties are peptide nucleic acid sequences, peptide sequences, nucleic acid sequences, nitrilotriacetic acid, digoxigenin, and biotin, all of which can optionally but preferably be covalently bound to a nucleic acid molecule of the present invention.
- Link refers to any direct or indirect interactions between two molecules, which includes covalent bonds, coordinate bonds, hydrophobic interaction, electrostatic interaction, or a combination thereof.
- Random sequence refers to a fully random, partially random, or semi-random sequence of nucleic acid bases that forms a nucleic acid molecule or amino acids that form a polypeptide. Random sequences can be made using synthetic methods as they are known in the art, such as solid phase nucleic acid or solid phase polypeptide synthesis, or by enzymatic methods, such as polymerase reactions or digesting polypeptides or nucleic acids of natural or synthetic origin to obtain fragments thereof, or by any combination of these methods.
- Fully random refers to 1) sequences that have been made without statistical weight to the probability of inserting any one of the set of naturally-occurring bases or amino acids at a given position of the random sequence, or 2) sequences that have been made by fragmentation of at least one nucleic acid molecule.
- Semi-random refers to sequences that have been made with statistical weight as bases/amino acids and/or their sequence and can be made using synthetic methods known in the art or by digesting polypeptides or nucleic acid molecules (see, U.S. Patent No. 5,270,163 to Gold et al., issued December 14, 1993; and U.S. Patent No. 5,747,253 to Ecker et al., issued May 5, 1998).
- Semi-random sequences can be nucleic acid or amino acid sequences that have been synthesized such that particular sequence combinations are preferred over other sequence combinations.
- a semi-random nucleic acid sequence can be biased to preferentially include only a subset of the nucleic acid codons that encode particular amino acids, or can be biased such that the frequency of stop codons in the sequence is reduced.
- A semi-random nucleic acid sequence can be synthesized such that, for example, codons for hydrophobic amino acids are less abundant in the sequence than would occur if the sequence were totally random.
- Semi-random sequences can be made by directed chemical synthesis, and can, for example, be based on the synthesis of preferred codons that can be built into a multi-codon sequence as disclosed in PCT application US99/22436 (WO 00/18778) to Lohse et al., published April 6, 2000, which is herein incorporated by reference.
- Partially random sequences are sequences that are in part known or identified sequences and are in part fully random or partially random sequences, and can also be made by modifying or adding to identified or fixed sequences (Pasqualini and Ruoslahti, Nature 380:364-366 (1996); and U.S. Patent No. 5,270,163 to Gold et al., issued December 14, 1993).
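- The contrast between a fully random sequence and a semi-random sequence biased against stop codons can be sketched in Python as follows. This is an illustrative simulation, not the synthesis chemistry referenced above: it assumes the standard DNA stop codons TAA, TAG and TGA, takes the limiting case in which stop codons are excluded entirely rather than merely reduced, and uses hypothetical function names.

```python
import random
from itertools import product

STOP_CODONS = {"TAA", "TAG", "TGA"}
ALL_CODONS = ["".join(c) for c in product("ACGT", repeat=3)]
SENSE_CODONS = [c for c in ALL_CODONS if c not in STOP_CODONS]


def fully_random_orf(n_codons: int, rng: random.Random) -> str:
    """Each base is drawn with equal, unweighted probability."""
    return "".join(rng.choice("ACGT") for _ in range(3 * n_codons))


def semi_random_orf(n_codons: int, rng: random.Random) -> str:
    """Codons are drawn only from sense codons, so no in-frame stop occurs."""
    return "".join(rng.choice(SENSE_CODONS) for _ in range(n_codons))


def count_in_frame_stops(seq: str) -> int:
    """Count stop codons in reading frame 0."""
    return sum(1 for i in range(0, len(seq) - 2, 3) if seq[i:i + 3] in STOP_CODONS)


rng = random.Random(0)  # fixed seed so the sketch is repeatable
print(count_in_frame_stops(fully_random_orf(200, rng)))
print(count_in_frame_stops(semi_random_orf(200, rng)))
```

- A weighted draw (for example `rng.choices` with per-codon weights) would model a bias that reduces, rather than eliminates, particular codons.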
- sequence of interest refers to a nucleic acid sequence or nucleic acid molecule that has been selected for by screening or otherwise identified.
- a sequence of interest can also be at least a portion of a known nucleic acid molecule or nucleic acid sequence.
- an activity, such as an enzymatic activity or binding activity, of the amino acid sequence that can be partially or entirely encoded by the sequence of interest is known (but that need not be the case), and the sequence of interest includes sequences encoding at least one such activity or a portion of such activity.
- Secondary structure refers to a structure in a nucleic acid molecule that is more than the primary linear structure of the sequence of bases. Secondary structures can include a variety of configurations based at least in part on base-pairings, such as stem-loop configurations or hairpin configurations. (See, U.S. Patent No. 5,270,163 to Gold et al., issued December 14, 1993; and U.S. Patent No. 5,747,253 to Ecker et al., issued May 5, 1998).
- “Stem-forming sequence” is a sequence of bases in a nucleic acid molecule that is comprised of two half-stem sequences wherein the first half of the stem-forming sequence is able to base pair with the second half stem-forming sequence when the first and second stem-forming sequences are in single-stranded form.
- Stem-forming sequences may base-pair to form a double-stranded structure in a continuous strand of nucleic acids.
- Such double-stranded structures may be of any length, and may be a part of larger secondary structures of the nucleic acid molecule, such as stem-loop structures and hairpin structures, as they are known in the art.
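- Whether two half-stem sequences on one strand can base pair can be checked with a minimal Python sketch. It assumes a DNA alphabet, perfect Watson-Crick pairing over the full length of the half-stems, and no mismatches or wobble pairs; the function names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the sequence that pairs with seq, written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def can_form_stem(first_half: str, second_half: str) -> bool:
    """True if the second half-stem pairs perfectly with the first."""
    return second_half.upper() == reverse_complement(first_half)


def build_hairpin(first_half: str, loop: str) -> str:
    """Assemble a continuous strand: half-stem, loop, pairing half-stem."""
    return first_half.upper() + loop.upper() + reverse_complement(first_half)


print(can_form_stem("GGCAC", "GTGCC"))
print(build_hairpin("GGCAC", "TTTT"))
```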
- “Substance of interest” refers to a compound that has been selected for screening peptides or complexes of the present invention, or has been identified using the methods of the present invention.
- Solid support refers to any solid support that can be used in a method of the present invention.
- a solid support is used to immobilize a nucleic acid molecule of the present invention or a complex of the present invention.
- A solid support can be used to immobilize a substance of interest, a cell, an etiological agent, or other moiety.
- Solid substrates can take any form, such as sheets, membranes (such as nitrocellulose or nylon), polymeric surfaces, including wells (such as microtiter wells), beads, or chips, such as glass, nylon, or silica sheets that comprise arrays of nucleic acids, proteins, or other molecules.
- Solid supports can be of any appropriate material, such as polymers, metals, glass, or silica and can be magnetic in nature.
- Preferred solid substrates include polystyrene, polycarbonate, latex, polyacrylamide, sepharose, nylon, nitrocellulose, glass, silica, and magnetite.
- On or within a cell refers to a moiety, such as a receptor or biomolecule that resides on the surface of a cell, within the outer membrane of a cell, or within a cell.
- Within a cell refers to any locus within a cell, such as in the cytoplasm or within or associated with an organelle, such as, for example, a mitochondrion, nucleus or Golgi apparatus.
- A “cell” refers to any cell, such as a cell of prokaryotic (such as bacterial) or eukaryotic origin.
- Eukaryotic cells include, for example, single cell organisms such as yeast and multicellular organisms such as invertebrates, plants and vertebrates.
- Invertebrates include parasites such as worms, and vertebrates include cold-blooded organisms (such as reptiles and amphibians) and warm-blooded organisms, such as mammals, including humans.
- a cell can be part of a sample of tissue, fluid or organ of a multicellular organism, or can be part of a multicellular organism itself.
- In vitro refers to procedures that are performed outside of a cell.
- purified enzymes or extracts of cells can be used to perform procedures in a vessel, such as a test tube.
- Ex vivo refers to procedures that are performed outside of a multicellular organism, but use whole cells.
- live cells from a subject, such as a human can be cultured outside of the body and these cells can be used in testing procedures.
- In vivo refers to procedures that are performed on a whole organism, such as a subject, including a human, such as in clinical trials. In vivo procedures can also be performed on non-human subjects, such as animal models.
- A “normal cell” refers to a cell whose processes and characteristics are in conformance with an average cell of that type. For example, a normal lung cell does not exhibit the proliferation and metastatic capabilities of a cancerous lung cell.
- Abnormal cell refers to a cell whose processes and characteristics are not in conformance with an average cell of that type.
- A normal CD4+ cell does not exhibit the lifespan of a CD4+ cell infected with a virus, such as HIV.
- a “neoplastic cell” refers to a cell that exhibits the processes and characteristics of a neoplasm, such as tumors, cancers, carcinomas and the like.
- virus infected cell refers to a cell that has been infected with a viable virus and exhibits or will exhibit characteristics of that infection.
- an “etiological agent” refers to any etiological agent, such as bacteria, parasites, fungi, viruses, prions and the like.
- a “library” refers to a group of two or more compounds or compositions.
- the members of a library can be mixed into a single population, such as in a single container.
- the members of a library can be provided separately in different containers, such as in microtiter plates or separate containers in a larger container, such as vials in a box.
- such separate containers can include one or more members of a library.
- the present invention includes several general and useful aspects, including:
- nucleic acid molecule comprising a transcription regulatory region, an open reading frame comprising a random sequence or sequence of interest, and a transcription termination moiety.
- the nucleic acid molecule preferably comprises a linking moiety that can link a polypeptide encoded by the open reading frame to the nucleic acid molecule, so that transcription elongation ternary complexes can form.
- Such nucleic acid molecules can be provided in vectors.
- nucleic acid molecules of the present invention are linked to peptides or polypeptides that are encoded by the nucleic acid molecules.
- A library of nucleic acid molecules of the present invention can comprise nucleic acid molecules in vectors.
- A library of nucleic acid molecules of the present invention can be with or without a substance of interest.
- the present invention includes nucleic acid molecules that are useful for a variety of purposes, including methods of the present invention.
- the nucleic acid molecules can be provided in vectors.
- The nucleic acid molecule can be any nucleic acid molecule that can be transcribed by RNA polymerase. It preferably comprises double-stranded DNA, single-stranded DNA, or partially double-stranded or single-stranded oligonucleotides, and is most preferably DNA that is at least partially double-stranded.
- a nucleic acid molecule of the present invention preferably comprises a transcription regulatory region, at least one random sequence or sequence of interest, and at least one transcription termination moiety, and preferably comprises or is bound to a linking moiety.
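- The elements of such a nucleic acid molecule can be summarized as a simple record, sketched here in Python. The class, field names and example values are hypothetical and serve only to show which elements the text treats as expected (transcription regulatory region, random sequence or sequence of interest within an open reading frame, transcription termination moiety) and which as optional (linking moiety).

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class NucleicAcidConstruct:
    """Illustrative record of the construct elements named in the text."""
    transcription_regulatory_region: str
    open_reading_frame: str            # random sequence or sequence of interest
    transcription_termination_moiety: str
    linking_moiety: Optional[str] = None  # preferred, but not required

    def missing_elements(self) -> List[str]:
        """Names of the expected elements that were left empty."""
        expected = {
            "transcription_regulatory_region": self.transcription_regulatory_region,
            "open_reading_frame": self.open_reading_frame,
            "transcription_termination_moiety": self.transcription_termination_moiety,
        }
        return [name for name, value in expected.items() if not value]

    def has_linking_moiety(self) -> bool:
        return bool(self.linking_moiety)


# Placeholder values, chosen for illustration only.
construct = NucleicAcidConstruct(
    transcription_regulatory_region="promoter (placeholder)",
    open_reading_frame="ATGGCCAAA",
    transcription_termination_moiety="terminator (placeholder)",
    linking_moiety="puromycin",
)
print(construct.missing_elements(), construct.has_linking_moiety())
```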
- a nucleic acid molecule of the present invention can encode a linking moiety that is a region or domain or a polypeptide.
- Linking moieties can be any compounds that can directly or indirectly link a polypeptide to a nucleic acid molecule of the present invention that encodes it.
- A linking moiety can be covalently bound to a nucleic acid molecule of the present invention, but this is not a requirement of the present invention.
- A linking moiety such as puromycin, or another tRNA mimetic, that is bound to a nucleic acid molecule of the present invention can be incorporated into a polypeptide by a ribosome.
- Methods of binding puromycin to the 3' and 5' ends of nucleic acid molecules are known in the art, see for example, PCT applications WO 00/72869, WO 01/07657, WO 01/04265, and WO 00/32823, herein incorporated by reference.
- A linking moiety such as puromycin can also be bound to a DNA binding molecule, such as lac repressor protein or the RNA polymerase itself.
- the linkage between a DNA molecule of the present invention that includes a binding site for the DNA binding protein and a polypeptide encoded by the DNA molecule is via the bridge of DNA-DNA binding molecule-puromycin-polypeptide.
- puromycin that is bound to a protein is bound to the protein by a linker (such as a carbon chain) that can allow the puromycin access to a ribosome that translates the RNA transcribed by the RNA polymerase.
- the puromycin can be incorporated into a polypeptide by a ribosome, and thereby linked, via RNA polymerase, to a DNA molecule that encodes the polypeptide.
- Nucleic acid molecules can comprise, at or near a 3' or 5' terminus, linking moieties such as, but not limited to, biotin, digoxigenin, nitrilotriacetic acid, nucleic acid sequences, peptide nucleic acid sequences, or peptide sequences.
- These nucleic acid molecules can optionally be oligonucleotides that can be used as primers that become incorporated into the 5' ends of nucleic acid molecules of the present invention in polymerase reactions.
- Moieties can also be attached to nucleic acid molecules, such as oligomers, that can optionally be hybridized and then ligated to the 3' ends of nucleic acid molecules of the present invention.
- a portion of the nascent polypeptide synthesized by the ribosome can be a binding domain that can specifically bind a linking moiety.
- This region or domain of the polypeptide encoded by a nucleic acid molecule of the present invention can bind to the nucleic acid molecule or to a linking moiety that is directly or indirectly linked to the nucleic acid molecule, such as those listed above.
- the nucleic acid construct, in addition to comprising a random sequence or sequence of interest, also comprises a sequence encoding the amino acid sequence of the polypeptide linking domain.
- the sequence of interest or random sequence and the sequence encoding the polypeptide linking domain are both part of the same open reading frame.
- a nucleic acid molecule of the present invention does not comprise a linking moiety.
- a ribosome can be a linking moiety, and a DNA molecule of the present invention can be linked to the polypeptide it encodes via the following linkages: DNA-RNA polymerase-RNA-ribosome-polypeptide.
- linking moieties such as the types of compound and sequence and length of nucleic acid or peptide that relate to the function of these structures can be selected based on reports in the literature, or by screening compounds for the desired activity using standard assay methods as they are known in the art.
- Experiments and assays that test for the linkage of a polypeptide to its own encoding nucleic acid molecule can be designed using, for example, labels that can be incorporated into nucleic acid molecules and polypeptides, and by using separation techniques such as gel electrophoresis and enzymes such as nucleases and proteases to demonstrate the coupling of a nucleic acid molecule to the polypeptide it encodes. See, for example, PCT application number WO 98/31700 and PCT application number WO 00/26511.
- the transcription regulatory region can be a prokaryotic promoter that controls the transcription of the DNA to RNA by DNA-dependent RNA polymerase, such as, but not limited to, E. coli, T7, T3, or SP6 RNA polymerase.
- the transcription regulatory region can also be a eukaryotic promoter and, optionally, enhancer, that controls the transcription of the DNA by an RNA polymerase of a eukaryotic source, for example, RNA polymerase II from HeLa cells.
- Transcription regulatory regions or "promoters" and "enhancers" are well known in the art, as are assays for determining the promoter and enhancer activity of DNA sequences. Accordingly, transcription regulatory control regions can be those known in the art, modified versions of those known in the art, or later determined.
- a random sequence or sequence of interest can comprise either complete random sequence or partially random sequence or semi-random sequence, or a specified sequence combined with a complete random sequence or partially random sequence or semi-random sequence, or can be any sequences, known or unknown, for example of cellular or viral origin.
- a random sequence or sequence of interest can be of any length, but is preferably from about twelve to about 10,000, more preferably from about twenty to about 5,000, and most preferably from about thirty to about 1,000 bases in length.
- a transcription termination moiety is a nucleic acid sequence or structure, a compound covalently or non-covalently attached to the DNA, or a molecule or any entity that binds to the DNA, so that the migration of RNA polymerase along the DNA template is impeded at the transcription termination moiety site.
- the migration of RNA polymerase can stop, can pause, or can be slowed at the transcription termination site with respect to its migration in the absence of a transcription termination moiety site.
- transcription termination moieties are drug molecules (Shi et al, (1988) J. Mol. Biol. 199, 277-293; Shi et al., (1988) J. Biol. Chem. 263, 527-534); White R. J. & Philips, D.
- nucleic acid sequences or secondary structures that can be used for transcription termination moieties are also known in the art (Arndt and Chamberlin (1990) J. Mol. Biol. 213: 79-108) and can be selected according to the published literature or from nucleic acid libraries using appropriate methods.
- the transcription regulatory region, the ORF sequence, the transcription termination moiety and the linking moiety need not be directly linked together, immediately adjacent to each other or be part of the same nucleic acid molecule, but are preferably operably linked.
- These elements of the nucleic acid molecule of the present invention are preferably provided on a nucleic acid construct in the order of transcription control region, random sequence or sequence of interest, transcription termination region and linking moiety. However, this order can be completely or partially altered in some cases.
- nucleic acid molecules of the present invention can be made using any appropriate method, including synthetic methods or cloning methods as they are known in the art (Sambrook et al., supra, (1989)).
- the nucleic acid molecule comprises double-stranded DNA and the linking moiety is puromycin that is covalently bound to the DNA.
- the DNA also encodes a ribosome binding RNA sequence that promotes translational initiation and a translation start codon, such as dAdTdG.
- the linking moiety is preferably covalently linked to the DNA downstream of the transcription termination moiety (TTM) (a), but it may also be linked upstream of the transcription termination moiety (b), or anywhere along the DNA (c).
- the nucleic acid molecule comprises double-stranded DNA.
- the double-stranded DNA also encodes a ribosome binding RNA sequence and a downstream translation start codon.
- the DNA also comprises a specific protein binding region that can be recognized and specifically bound by a DNA binding protein (DBP).
- the protein that specifically binds to the protein binding region of the DNA can be directly or indirectly bound to a linking moiety, such as puromycin.
- the protein binding region of the DNA molecule can be located either downstream of the transcription termination moiety (a) or upstream of the transcription regulatory region (b), or anywhere along the DNA (c).
- the nucleic acid molecule comprises double-stranded DNA that preferably encodes a ribosome binding RNA sequence and a translation start codon.
- the DNA molecule also comprises a specific protein binding region that can be recognized and specifically bound by a DNA binding protein (DBP).
- the DNA molecule further comprises a sequence that encodes a peptide that can bind to a domain of the protein or compound attached to the DNA binding protein.
- the protein binding region of the DNA molecule can be located either downstream of the transcription termination moiety (TTM) (a), upstream of the transcription regulatory region (b), or anywhere along the DNA (c).
- Nucleic acid molecules of the present invention can be of any length in total, but are preferably between about twenty bases and about 10,000 bases, more preferably between about forty bases and about 1,000 bases.
- the nucleic acid molecule of the present invention can further include at least one random sequence or at least one sequence of interest or a combination of at least one random sequence and at least one sequence of interest.
- the random sequence or sequence of interest can be made using appropriate methods in the art, such as by cloning techniques, including PCR techniques and other enzymatic techniques such as, but not limited to, reverse-transcription from cellular mRNA; by solid phase synthesis; by fragmenting nucleic acid molecules using a variety of methods, such as shear forces, vibrational energy or restriction enzymes; or by any combination of these methods.
- the polynucleotides can be of any length, but are preferably between about twenty bases and about 500 bases, more preferably between about forty bases and about 150 bases in length.
- the nucleic acid molecule can be of any length, preferably between about fifty bases and 10,000 bases, more preferably between about 100 bases and 1,000 bases.
- Random sequences made by chemical synthesis can be completely random, in which case no bias is given to the incorporation of particular nucleotides (preferably A,C,G, and T) in any position in the synthesized nucleic acid.
- semi-random sequences can be synthesized by specifying that certain subsets of one or more nucleotides can be employed for one or more positions in the sequence.
- a random sequence can also be partially random, in which some positions in the sequence are specified, and others are completely random or semi-random.
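The three kinds of random sequence described above can be sketched in code. The following is a minimal illustration rather than part of the disclosed methods. It assumes that each position of a template is written as an IUPAC nucleotide code; the function names `synthesize` and `diversity`, the fixed seed, and the `NNK` codon scheme are hypothetical choices made here for illustration only.

```python
import random

# IUPAC nucleotide codes: each symbol names the subset of bases allowed at a position.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}


def synthesize(template, rng):
    """Draw one sequence from a template, choosing uniformly among allowed bases."""
    return "".join(rng.choice(IUPAC[symbol]) for symbol in template)


def diversity(template):
    """Count the distinct sequences that the template can produce."""
    count = 1
    for symbol in template:
        count *= len(IUPAC[symbol])
    return count


rng = random.Random(0)  # fixed seed (an assumption) so the sketch is reproducible

completely_random = "N" * 12          # no bias at any position
semi_random = "NNK" * 4               # every third position restricted to G or T
partially_random = "ATG" + "NNK" * 3  # specified positions followed by semi-random ones

for name, template in [
    ("completely random", completely_random),
    ("semi-random", semi_random),
    ("partially random", partially_random),
]:
    print(name, synthesize(template, rng), diversity(template))
```

Under these assumptions a completely random 12-mer admits 4^12 = 16,777,216 sequences, whereas each `NNK` codon admits 32, which shows how restricting positions narrows the library.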
- the linking moiety and the nucleic acid molecule of the present invention can be operably linked.
- the linking moiety and the nucleic acid molecule of the present invention are covalently linked, but this is not a requirement of the present invention.
- a linking moiety that is bound to a nucleic acid molecule can be 5' to the sequence encoding the random sequence or sequence of interest or 3' to the random sequence or sequence of interest.
- the nucleic acid molecules of the present invention can be made using any appropriate method, including synthetic methods or cloning methods as they are known in the art (Sambrook et al., supra, (1989)).
- the nucleic acid molecule of the present invention can further include sequences or other polymers that function as at least one spacer region.
- a spacer region can include nucleic acid sequences and long chain polymers that preferably do not interact with nucleic acids.
- a spacer region can be made using appropriate methods in the art, such as solid phase synthesis or cloning methods known in the art.
- a spacer region can be of any length, but is preferably between about ten bases and about 100 bases, more preferably between about twenty bases and about fifty bases in length, and of equivalent length if the spacer is a non-nucleic acid molecule.
- Preferred spacer regions include secondary structure-free DNA sequences, long chain carbon molecules, or a combination of both.
- a spacer region can be located in any portion of a nucleic acid molecule of the present invention. Preferably, though, a spacer region occurs between an open reading frame and a linking moiety.
- the transcription control region, ORF, transcription termination region, and linking moiety need not be directly linked together, immediately adjacent to each other or provided on the same nucleic acid molecule, but are preferably operably linked.
- These elements of the nucleic acid molecule of the present invention can be provided on a nucleic acid construct in any order or orientation.
- the linking moiety can be anywhere in the nucleic acid, preferably at or near the 5' or 3' end of the nucleic acid molecule (see FIGs. 1, 2, and 3).
- the nucleic acid molecules of the present invention can be made using any appropriate method, including synthetic methods or cloning methods as they are known in the art (Sambrook et al., supra, (1989)) and as described previously.
- the nucleic acid molecules of the present invention may further include sequences that encode peptides that mediate the entry of peptides and other molecules into cells.
- sequences include sequences that encode portions of the tat gene of HIV (Anderson et al. Biochem. Biophys. Res. Commun. 194: 876-884 (1993); Fawell et al., Proc. Natl. Acad. Sci. USA 91: 664-668 (1994); Kim et al., J. Immunol. 159: 1666-1668 (1997); Vives et al. J. Biol. Chem. 272: 16010-16017 (1997); Vocero-Akbani et al. Nat. Med.
- nucleic acids of the present inventions may further include sequences that encode peptides that direct molecules to particular cellular compartments, for example, the endoplasmic reticulum or the mitochondria, as such sequences are known in the art or are later identified or developed.
- the nucleic acid molecule of the present invention preferably includes at least one control sequence, such as an expression control sequence, that drives or regulates the transcription and/or translation of the nucleic acid molecule of the present invention.
- control sequences are "Shine-Dalgarno" sequences or "Kozak" sequences, and at least one start codon, with or without a stop codon.
- the nucleic acid may not include such a control sequence.
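As an illustration of how a translation control sequence and start codon might be located in a DNA construct, the following minimal sketch searches for a Shine-Dalgarno-like motif followed by a start codon. It is not taken from the disclosure: the consensus motif `AGGAGG`, the spacer window of four to ten bases, and the function name `find_start_codon` are assumptions chosen for illustration, and real ribosome binding sites vary in both sequence and spacing.

```python
import re

# Assumed pattern: the Shine-Dalgarno consensus AGGAGG, a short spacer, then ATG.
# The 4-10 base spacer window is an illustrative assumption, not a fixed rule.
SD_THEN_START = re.compile(r"AGGAGG[ACGT]{4,10}?(ATG)")


def find_start_codon(dna):
    """Return the index of the first ATG preceded by the assumed motif, or None."""
    match = SD_THEN_START.search(dna.upper())
    return match.start(1) if match else None


construct = "TTAGGAGGACAGCTATGAAA"  # hypothetical example sequence
print(find_start_codon(construct))
```

A construct lacking the motif returns `None`, which corresponds to the case in which the nucleic acid does not include such a control sequence.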
- the nucleic acid molecules of the present invention may be directly or indirectly labeled with a detectable marker.
- the detectable marker may be a radioisotope or a nonradioactive detectable molecule such as biotin or fluorescein, or other detectable markers as they are known or developed in the art.
- the marker may be directly or indirectly bound to the nucleic acid.
- the nucleic acid molecule of the present invention and complexes that include such nucleic acid molecules can also be provided in a cell.
- the nucleic acid molecule can be introduced into a cell using methods known in the art, such as lipofection or electroporation.
- nucleic acid molecules can be introduced into cells using vectors, such as viruses or phages.
- the cells can be any cell, including prokaryotic or eukaryotic cells, and can be ex vivo or in vivo, including within a whole organism, including a mammal, including a human.
- the nucleic acid molecule can be transcribed and/or translated to produce a complex of the present invention.
- the nucleic acid molecules of the present invention can also be provided in a vector.
- Vectors can be viral vectors, liposomes, microspheres, plasmids, phages or linear dsDNA molecules.
- Vectors preferably include double-stranded DNA molecules of the present invention, but the invention is not limited to such vectors.
- various viral vectors include double- or single-stranded DNA (parvoviruses), single-stranded RNA (retroviruses) or double-stranded RNA (rotaviruses).
- Vectors are useful for making libraries of nucleic acid molecules of the present invention, particularly libraries that include nucleic acid molecules that have different random sequences or sequences of interest. Furthermore, such vectors are convenient for making, storing and transporting nucleic acid molecules of the present invention. Vectors can be made or modified using methods in molecular biology as they are known in the art (Sambrook et al., supra, (1989)). The vector of the present invention can be any vector as that term is known in the art.
- Vectors can be, for example, retroviruses (U.S. Patent No. 5,399,346 to Anderson et al., issued March 21, 1995; Bandara et al., DNA and Cell Biol. 11:227-231 (1992)); adenoviruses (Berkner, BioTechniques 6: 616-629 (1989)); adeno-associated viruses (Larrick and Burck, Gene Therapy. Application of Molecular Biology, Elsevier, New York (1991)); plasmid vectors (U.S. Patent No. 5,240,846 to Collins et al., issued August 31, 1993); liposomes (Holmberg et al., J. Liposome Res.
- the present invention also includes a cell that includes or has been transfected or transformed by a vector of the present invention.
- a cell can be ex vivo or in vivo in a subject, including a test animal or a human.
- the nucleic acid molecule of the present invention in the vector can be expressed in the cell.
- the nucleic acid molecule includes a sequence of interest such that the translated polypeptide retains at least one activity of the polypeptide encoded by the sequence of interest.
- the vector includes at least one random sequence. The activity of the polypeptide encoded by the random sequence in the cell can be monitored by observing or interrogating the cell using methods known in the art, including the use of reporter genes to report changes in signal transduction within a cell.
- the nucleic acid molecule of the present invention can be operably linked to the polypeptide it encodes.
- the operable link between a nucleic acid molecule and a polypeptide of the present invention occurs with covalent bonding, but this is not a requirement of the present invention.
- a ribosome can translate the RNA into a polypeptide.
- a linking moiety is covalently bound to DNA and is incorporated into the peptide, providing a nucleic acid molecule bound to the polypeptide it encodes.
- Complexes comprising nucleic acid molecules of the present invention operably linked to polypeptides may also be labeled with a detectable label.
- the detectable label may be a radioisotope or a nonradioactive detectable molecule such as biotin, or other detectable moieties as they are known or developed in the art.
- the label may be directly or indirectly bound to the nucleic acid of the complex or to a polypeptide of the complex, or both.
- the polypeptide of the complex labeled with a detectable marker need not be the polypeptide encoded by or partially encoded by the random sequence or sequence of interest.
- a nucleic acid molecule of the present invention can be linked to a polypeptide encoded by a random sequence or sequence of interest.
- the polypeptide can bind with a substance of interest.
- This aspect of the invention allows the selection of polypeptides that bind with a substance of interest, while at the same time selecting the nucleic acid molecule that encodes the polypeptide that binds with a substance of interest.
- the substance of interest can be on a solid support, and can be immobilized on such a solid support using methods known in the art such as absorption, chemical conjugation or cross-linking.
- the structure formed by a nucleic acid of the present invention and a polypeptide encoded by a random sequence or sequence of interest can be immobilized on a solid support and one or more substances of interest may be bound with or capable of reacting with the fixed complex or complexes.
- the substance of interest can be on or within a cell, and the cell can optionally be immobilized on a solid support using appropriate methods, such as entrapment on solid supports covered with fibronectin or other adhesion molecules.
- the cell can be ex vivo and can be provided as a cell, culture of cells, or part of a sample of tissue, fluid or organ. Alternatively, the cell can be in vivo in a subject.
- the substance of interest may be a cell-type-specific or tissue-specific molecule.
- Nucleic acids or peptides of the present invention that specifically bind to the substance of interest can be identified.
- Peptides that bind such cell-type-specific and tissue-specific molecules can be used to target drug delivery to specific cells or tissues.
- the cell can be any cell, including a normal cell or an abnormal cell, such as, for example, a neoplastic cell or a virus infected cell.
- the substance of interest can also be on or within an etiological agent, such as, for example, a virus, a bacterium, a bacterial spore, a parasite or a prion.
- the substance of interest may be one or a plurality of molecules on or within an etiological agent, virus, bacterium, protozoan, tumor cell or abnormal cell.
- the substance of interest used for selection may be whole cells, viruses, or microorganisms fixed to a solid support or in solution, or may be a portion or fractionated preparation of one or more cells, viruses, or microorganisms fixed to a solid support or in solution.
- the substance of interest can also include at least one organic molecule, an inorganic molecule, a polymer, a polypeptide, a lipid, a carbohydrate, a small molecule, a nucleic acid molecule, a ribozyme, a biomacromolecule or a drug.
- the present invention also includes a library of nucleic acid molecules of the present invention.
- libraries include nucleic acid molecules that contain linking moieties so that they are able to covalently or noncovalently bind to the polypeptides they encode.
- the library of nucleic acid molecules can include at least two different random sequences, at least two different sequences of interest or a combination of at least one random sequence and at least one sequence of interest.
- a library of nucleic acid molecules can include two or more such nucleic acid molecules.
- Each nucleic acid molecule can have a different random sequence or a different sequence of interest.
- one or more nucleic acid molecules can have a random sequence and the other nucleic acid molecules can have a sequence of interest.
- the library can optionally be fixed to a solid support.
- libraries containing random sequences which may include sequences in which one or a few sequence positions have been randomly or semi-randomly varied, or libraries containing sequences of interest, can be fixed to a chip or array for screening with one or more substances of interest.
- a translated library of complexes can also be fixed to a solid support.
- libraries of complexes containing random sequences which may include sequences in which one or a few sequence positions have been randomly or semi-randomly varied, or libraries of complexes containing sequences of interest, can be fixed to a chip or array for screening with one or more substances of interest.
- the detectable label may be a radioisotope or a nonradioactive detectable molecule such as biotin, or other detectable moieties as they are known or developed in the art.
- the label may be directly or indirectly bound to the members of the library.
- Library members may be labeled by direct or indirect binding of a detectable marker to the nucleic acid or to a polypeptide of the library member.
- a library of nucleic acid molecules can also include at least one substance of interest.
- the substance of interest can be bound with a nucleic acid molecule, a polypeptide, or complex of the present invention, can be unbound, or can be bound to some members of the library and not bound to other members of the library.
- the substance of interest can be directly bound or indirectly bound to a nucleic acid molecule of the present invention, but is preferably indirectly bound to a nucleic acid molecule of the present invention or directly bound to a complex of the present invention (particularly the polypeptide encoded by the random sequence or sequence of interest).
- the substance of interest can be a substrate, such as an enzymatic substrate, with which a nucleic acid molecule, polypeptide, or complex of the present invention interacts.
- Reactions of the nucleic acid molecule, polypeptide, or complex of the present invention with the substance of interest may be monitored and quantitated using appropriate assays, for example spectrophotometric assays or assays that measure the release of a radioactive moiety.
- the substance of interest can be directly or indirectly bound on a solid support or in solution.
- the structure formed by the construct of the present invention and the polypeptide encoded by a random sequence or sequence of interest can be immobilized on a solid support and one or more substances of interest may be unbound, may be bound with the fixed one or more complexes of the library, or may be acted upon in a biochemical reaction catalyzed by or modulated by one or more complexes of the library.
- the substance of interest can be on or within a cell, wherein the cell can be ex vivo or in vivo, such as in a subject.
- the cell can be any cell, including a normal cell or an abnormal cell, such as, for example, a neoplastic cell or a virus infected cell.
- the substance of interest can also be one or a plurality of molecules on or within an abnormal or normal cell or on or within an etiological agent, such as, for example, a virus, a bacterium, a bacterial spore, a parasite or a prion.
- the substance of interest may be one or a plurality of molecules on or within an etiological agent, virus, bacterium, protozoan, tumor cell or abnormal cell.
- the substance of interest used for selection may be whole cells, viruses, or microorganisms fixed to a solid support or in solution, or may be a portion or fractionated preparation of one or more cells, viruses, or microorganisms fixed to a solid support or in solution.
- the substance of interest may be a cell-type-specific or tissue-specific molecule, such that nucleic acids or peptides of the present invention that specifically bind to the substance of interest can be identified.
- Such cell-type-specific and tissue-specific molecules can be used to target drug delivery to specific cells or tissues.
- the substance of interest can also include at least one organic molecule, an inorganic molecule, a polymer, a polypeptide, a lipid, a carbohydrate, a small molecule, a nucleic acid molecule, a ribozyme, a biomacromolecule or a drug.
- the present invention also includes a library of vectors of the present invention.
- the library of vectors includes at least two different random sequences, at least two different sequences of interest, or a combination of one or more random sequences and one or more sequences of interest.
- the present invention includes methods for identifying nucleic acid molecules, particularly from a library of random sequences or sequences of interest that encode polypeptides that can bind with a substance of interest.
- This method includes: providing at least one nucleic acid molecule of the present invention that comprises at least one open reading frame, where the open reading frame comprises at least one random sequence or at least one sequence of interest; transcribing the nucleic acid molecule to form at least one transcription complex, wherein the transcription complex comprises a nucleic acid molecule operably linked to its own transcribed RNA; translating the RNA to form a nucleic acid-polypeptide complex, wherein the nucleic acid-polypeptide complex comprises a nucleic acid operably linked to a polypeptide that the nucleic acid encodes; contacting at least one nucleic acid-polypeptide complex with at least one substance of interest; selecting at least one complex that binds with the at least one substance of interest; and identifying the nucleic acid molecule that comprises at least one random sequence or sequence of interest.
- the nucleic acid molecule including the random sequence or sequence of interest can be sequenced, or can be detected by hybridization with probe nucleic acid molecules.
- a nucleic acid molecule can be amplified before it is identified.
- a nucleic acid molecule can be cloned before it is identified.
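Detection by hybridization relies on base complementarity between a probe and the selected nucleic acid molecule. The sketch below illustrates only that principle under a simplifying assumption of perfect complementarity, with no mismatches and no thermodynamic model. The function names `reverse_complement` and `probe_sites` and the example sequences are hypothetical.

```python
# Watson-Crick pairing for DNA; input is assumed to contain only A, C, G and T.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence):
    """Return the reverse complement of a DNA sequence."""
    return sequence.upper().translate(COMPLEMENT)[::-1]


def probe_sites(target, probe):
    """List positions in the target strand where the probe is perfectly complementary."""
    target = target.upper()
    site = reverse_complement(probe)
    return [
        index
        for index in range(len(target) - len(site) + 1)
        if target[index:index + len(site)] == site
    ]


selected_strand = "TTATGCAA"  # hypothetical strand recovered from a bound complex
probe = "GCAT"                # hypothetical probe, complementary to ATGC
print(probe_sites(selected_strand, probe))
```

An empty list indicates that the probe has no perfectly complementary site on the strand examined.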
- a DNA molecule of the present invention or a library thereof can be transcribed to RNA by a DNA-dependent RNA polymerase.
- the transcription reaction is arrested by a transcription termination moiety on the template so that the RNA, RNA polymerase and DNA form a ternary complex.
- One or more ribosomes can translate the RNA into a peptide or polypeptide.
- the linking moiety such as puromycin, which can be attached to either the 3' (a) or 5' (b) end of the DNA molecule, can therefore be linked to the nascent polypeptide.
- the transcription ternary complex is dissociated and the DNA-polypeptide hybrid molecule is released.
- prokaryotic and eukaryotic in vitro transcription and translation systems are well known in the art. It is also possible to perform transcription and translation in vivo, by introducing nucleic acid molecules of the present invention into cells, but this is not preferred. Transcription and translation systems can be used that are compatible with the transcription and translation regulatory sequences of the nucleic acid molecule used in the methods of the present invention. Transcription and translation reactions can be coupled, occurring simultaneously, or can be performed sequentially.
- the DNA-polypeptide hybrid is added to a mixture of one or more target compounds and the polypeptide portion of the complex can bind with a target molecule. Bound complexes are separated from the unbound complexes.
- the nucleic acid molecules on the bound complexes can be eluted from the complex or be used directly as the template for nucleic acid amplification reactions such as PCR.
- the amplified nucleic acid can be sequenced or expressed in living organisms. If one of the primers used in amplification reactions is 5'-puromycin linked, the amplified DNA can be used as the template for another round of selection by the same procedures as described above. Where the linking moiety is other than puromycin or puromycin-like molecules as illustrated in FIGs. 2 and 3, or a ribosome as in FIG. 4, the same basic procedures can also be applied.
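The effect of repeating selection and amplification over several rounds can be illustrated with a simple numerical model. This sketch is not part of the disclosure: it assumes that each round retains a fixed fraction of binding complexes and a smaller fixed fraction of non-binding complexes, and that amplification restores the pool without bias. The function name and all numerical values are hypothetical.

```python
def binder_fraction_by_round(rounds, initial_fraction, binder_recovery, background_recovery):
    """Track the fraction of binders in the pool across rounds of selection.

    binder_recovery and background_recovery are the assumed fractions of binding
    and non-binding complexes that survive one round of washing.
    """
    fraction = initial_fraction
    history = [fraction]
    for _ in range(rounds):
        kept_binders = fraction * binder_recovery
        kept_background = (1.0 - fraction) * background_recovery
        total = kept_binders + kept_background
        if total == 0:
            raise ValueError("nothing was recovered in this round")
        fraction = kept_binders / total  # unbiased amplification keeps this ratio
        history.append(fraction)
    return history


# Hypothetical numbers: one binder per million molecules, 50% binder recovery,
# 0.05% non-specific carry-over per round.
for round_number, fraction in enumerate(binder_fraction_by_round(3, 1e-6, 0.5, 0.0005)):
    print(round_number, fraction)
```

With these illustrative values the pool is enriched roughly a thousandfold per round, which is why a rare binder can come to dominate after a few rounds. Actual recoveries depend on the binding and washing conditions used.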
- the complex may optionally be depleted of ribosomes by treating the mixture of translated constructs with reagents that are known to cause the dissociation of ribosomes from RNA.
- EDTA may be added to deplete free Mg2+ in the reaction mixture. This may be desirable for screening applications where the ribosome may impede binding to the substance of interest, or impede the entry of complexes into cells.
- Complexes can be purified or substantially purified from a translation reaction mixture using reagents that bind parts of the complex.
- translated polypeptides can contain a stretch of six adjacent histidine residues so that the DNA-polypeptide complex may be purified from the translation reaction using nitrilotriacetic acid-linked beads.
- histidines can be encoded by nucleic acid molecules used in the methods of the present invention.
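A stretch of six histidines can be encoded by appending six histidine codons in frame with the open reading frame. The following minimal sketch assumes the standard genetic code, in which CAT and CAC both encode histidine. Alternating the two codons is a design choice made here to avoid a purely repetitive sequence. The function names and the example open reading frame are hypothetical.

```python
HIS_CODONS = ("CAT", "CAC")  # both encode histidine in the standard genetic code


def his_tag_coding_sequence(residues=6):
    """Return DNA encoding the given number of adjacent histidine residues."""
    return "".join(HIS_CODONS[index % 2] for index in range(residues))


def append_in_frame(open_reading_frame, tag):
    """Append a tag to an open reading frame, checking that the frame is preserved."""
    if len(open_reading_frame) % 3 != 0:
        raise ValueError("open reading frame length must be a multiple of three")
    return open_reading_frame + tag


tag = his_tag_coding_sequence()
print(tag)
print(append_in_frame("ATGGCT", tag))  # hypothetical two-codon open reading frame
```

The length check guards against a frameshift, which would prevent the histidine stretch from being translated as intended.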
- Purified or substantially purified complexes may be stored under conditions that promote the stability of nucleic acids and polypeptides, for example, at 4°C in a buffer that contains BSA and EDTA.
- the complex can be contacted with one or more substances of interest under conditions that promote the binding of the complex, or reaction of the complex, particularly the polypeptide encoded by the random sequence or sequence of interest, with the substance of interest.
- the substance of interest can be on a solid support or in solution.
- a solid support may be a chip or array.
- the substance of interest can be on or within a cell and can be on or within an etiological agent.
- a substance of interest can bind with a complex that includes a polypeptide encoded by a random sequence or a sequence of interest and the random sequence or sequence of interest itself.
- Complexes that are not bound to a substance of interest can be separated from bound complexes using methods known in the art.
- the complexes that are free in solution can be washed away using methods known in the art for receptor-ligand reactions, such as immunoassay methods.
- the complexes of the present invention may be fixed to a chip or array, and the substance or substances of interest may be contacted with the chip or array to allow the substance of interest to bind or react with complexes for which the substance of interest has affinity.
- the substance of interest may be labeled with a detectable marker, or may be detected with a reagent specific for the substance of interest.
- Nonspecifically bound substance of interest may be washed off using appropriate methods as they are known in the art prior to detection of the bound substance of interest.
- the nucleic acid molecule encoding a peptide that binds with or reacts with a substance of interest has been selected using this method.
- the selected complex or portions thereof can be isolated by recovery using a variety of methods. For example, changes in pH, detergents, denaturing agents (such as phenol, urea or guanidinium), concentration and types of salts, such as chaotropic or anti-chaotropic salts, or combinations thereof can be used to elute the complex or portions thereof.
- the complex can be digested using enzymes, such as proteases or nucleases to free portions of the complex such as the polypeptide or the nucleic acid molecule.
- the bound nucleic acid is recovered and enriched using appropriate nucleic acid amplification reactions, such as polymerase chain reaction (PCR).
- the enriched nucleic acid can be sequenced using appropriate methods known to the art.
- the recovered nucleic acid molecules that contain a random sequence or sequence of interest can also optionally be cloned into appropriate vectors, such as plasmids, which can be amplified in an appropriate host.
- the recovered nucleic acid, which may contain several DNA species, may also be separated using methods that exploit the sequence and conformation of the nucleic acid. It may be necessary to separate the double-stranded PCR product into single strands in order to use such methods. These methods can be capillary affinity gel for nucleic acid and HPLC, or a combination thereof.
- the individual species of the nucleic acid molecules can then be sequenced.
- the amino acid sequence of the polypeptide that is able to bind to the substance of interest can be deduced from the nucleic acid sequence.
- the entire selection procedure, or portions thereof, may be automated. Translated complexes can be contacted with targets and unbound complexes can be washed away by a programmable machine. Another component of the automated machine may perform amplification reactions on the nucleic acid molecules of the bound complexes. Several rounds of selection and amplification may be automated in a linked process, and the final PCR products can be separated on a column that utilizes differences in sequence and conformation. Individual nucleic acid molecules may be transferred directly to an automated sequencer and sequenced, for example using fluorescently tagged nucleotides that may be read spectrophotometrically.
- the present invention includes nucleic acid molecules that comprise at least a portion of a random sequence or selected nucleic acid sequence identified by this method.
- the present invention also includes polypeptides that include at least a portion of a polypeptide encoded by an identified random sequence or sequence of interest.
- the present invention also includes a method for identifying a nucleic acid molecule or sequence in other forms, which include double-stranded DNA, single- or double-stranded RNA or RNA-DNA duplex.
- the sources for these forms of nucleic acid can be either synthetic or biological, such as viral, prokaryotic and eukaryotic. All of these nucleic acids must be converted to DNA as the template for the synthesis of RNA, which is in turn used as the template for protein translation. For example (FIG. 6), total eukaryotic cellular mRNA may be converted to DNA using methods known in the art. The DNA is linked with a linking moiety. Then, the selection and enrichment of the nucleic acid sequences can be carried out using the procedures described in the previous section.
- the present invention includes nucleic acid molecules that include at least a portion of a random sequence or selected nucleic acid sequence identified by this method.
- the present invention also includes polypeptides that include at least a portion of a polypeptide encoded by an identified random sequence or sequence of interest.
- the present invention includes methods for identifying polypeptides, particularly polypeptides encoded by random sequences or sequences of interest that bind with a substance of interest.
- This method includes: providing at least one nucleic acid molecule of the present invention that comprises at least one open reading frame, where the open reading frame comprises at least one random sequence or at least one sequence of interest; transcribing the nucleic acid molecule to form at least one transcription complex, wherein the transcription complex comprises a nucleic acid molecule operably linked to its own transcribed RNA; translating the RNA to form a nucleic acid-polypeptide complex, wherein the nucleic acid-polypeptide complex comprises a nucleic acid operably linked to a polypeptide that the nucleic acid encodes; contacting at least one nucleic acid-polypeptide complex with at least one substance of interest; selecting at least one complex that binds with said at least one substance of interest; and identifying said random sequence or said DNA sequence of interest.
- the nucleic acid molecule including the random sequence, sequence of interest, or selected sequence, can be sequenced.
- the amino acid sequences of polypeptides that bind to the target molecules can be deduced from the DNA sequence.
- the polypeptides can be synthesized chemically or expressed in living organisms.
- a DNA molecule of the present invention or a library thereof can be transcribed to RNA by a DNA-dependent RNA polymerase.
- the transcription reaction is arrested by a transcription termination moiety on the template so that the RNA, RNA polymerase and DNA form a ternary complex in which the RNA can be translated to protein.
- a linking moiety such as puromycin, which can be attached to either the 3'-end (a) or the 5'-end (b) of the DNA molecule, can therefore be linked to the nascent polypeptide.
- the transcription ternary complex is dissociated and the DNA-polypeptide hybrid molecule is released.
- the DNA-polypeptide hybrid is put in the mixture of target molecules or molecule of interest and the polypeptide portion of the complex binds with the target molecule or the molecule of interest.
- the bound complex is separated from the unbound complexes.
- the nucleic acid on the bound complex can be eluted from the complex or be used directly as the template for nucleic acid amplification reactions such as PCR.
- the amplified nucleic acid can be sequenced or expressed in living organisms. If one of the primers is 5'-puromycin linked, the amplified DNA can be used as the template for another round of selection by the same procedures as described above.
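The effect of repeating the selection and amplification steps can be illustrated with a toy enrichment model. This is only a sketch under stated assumptions: the function name `enrich` and the per-round retention probabilities are hypothetical and are not values given in the source. It assumes that amplification preserves the ratio of binders to non-binders recovered in each round.

```python
def enrich(binder_fraction, p_binder, p_background, rounds):
    """Track the fraction of binding complexes over repeated selection rounds.

    binder_fraction: starting fraction of the library that binds the target
    p_binder:        assumed probability that a binder survives one wash
    p_background:    assumed probability that a non-binder survives one wash
    """
    history = [binder_fraction]
    fraction = binder_fraction
    for _ in range(rounds):
        kept_binders = fraction * p_binder
        kept_background = (1.0 - fraction) * p_background
        # Amplification (e.g. PCR) is assumed to copy both pools equally,
        # so only the ratio after washing matters.
        fraction = kept_binders / (kept_binders + kept_background)
        history.append(fraction)
    return history


if __name__ == "__main__":
    for round_number, f in enumerate(enrich(1e-6, 0.5, 0.001, 3)):
        print(round_number, f)
```

Under these assumed numbers, a binder that starts at one part per million dominates the pool after three rounds, which is why several rounds of selection and amplification are typically chained.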
- if the linking moiety is other than puromycin or puromycin-like molecules, as illustrated in FIGs. 2 and 3, or the ribosome in FIG. 4, the same procedures can also be applied.
- the complex may optionally be depleted of ribosomes by treating the mixture of translated constructs with reagents that are known to cause the dissociation of ribosomes from RNA.
- EDTA may be added to deplete free Mg 2+ in the reaction mixture. This may be desirable for screening applications where the ribosome may impede binding to the substance of interest, or impede the entry of complexes into cells.
- Complexes, including libraries of complexes, may be purified or substantially purified from the reaction mixture using reagents that bind parts of the complex. For example, if the polypeptide contains a stretch of six adjacent histidine residues, the DNA-polypeptide complex may be purified from the translation reaction using nitrilotriacetic acid-linked beads. Purified or substantially purified complexes may be stored under conditions that promote the stability of nucleic acids and polypeptides, for example, at 4°C in a buffer that contains BSA and EDTA.
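Whether a deduced polypeptide carries the six-histidine stretch mentioned above can be checked with a short scan. This is a minimal sketch; the function name `find_his_tag` and the one-letter amino acid input format are assumptions made for illustration.

```python
import re


def find_his_tag(polypeptide, min_run=6):
    """Return the 0-based index of the first run of >= min_run histidines, or None.

    The polypeptide is given in one-letter amino acid code (H = histidine).
    """
    match = re.search("H{%d,}" % min_run, polypeptide.upper())
    return match.start() if match else None


if __name__ == "__main__":
    print(find_his_tag("MKTHHHHHHGS"))
    print(find_his_tag("MKTHHHGSHHH"))
```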
- the complex can be contacted with one or more substances of interest under conditions that promote the binding of the complex, or reaction of the complex, particularly the polypeptide encoded by the random sequence or sequence of interest, with the substance of interest.
- the substance of interest can be on a solid support or in solution.
- a solid support may be a chip or array.
- the substance of interest can be on or within a cell and can be on or within an etiological agent.
- a substance of interest is bound with a complex that includes a polypeptide encoded by a random sequence or a sequence of interest and the random sequence or sequence of interest itself.
- Complexes that are not bound to a substance of interest can be separated from bound complexes using methods known in the art.
- the complexes that are free in solution can be washed away using methods known in the art for receptor-ligand reactions, such as immunoassay methods.
- the complexes of the present invention may be fixed to a chip or array, and the substance or substances of interest may be contacted with the chip or array to allow the substance of interest to bind or react with complexes for which the substance of interest has affinity.
- the substance of interest may be labeled with a detectable marker, or may be detected with a reagent specific for the substance of interest.
- Nonspecifically bound substance of interest may be washed off, using appropriate methods known in the art, prior to detection of the bound substance of interest.
- the nucleic acid molecule encoding a peptide that binds with or reacts with a substance of interest has been selected using this method.
- the selected complex or portions thereof, such as the polypeptide or the nucleic acid molecule encoding the polypeptide, can be isolated by recovery using a variety of methods. For example, changes in pH, detergents, denaturing agents (such as phenol, urea or guanidinium), concentration and types of salts, such as chaotropic or anti-chaotropic salts, or combinations thereof can be used to elute the complex or portions thereof.
- the complex can be digested using enzymes, such as proteases or nucleases to free portions of the complex such as the polypeptide or the nucleic acid molecule.
- the bound nucleic acid is recovered and enriched using appropriate nucleic acid amplification reactions, such as polymerase chain reaction (PCR).
- the enriched nucleic acid can be sequenced using appropriate methods known to the art.
- the recovered nucleic acid molecules that contain a random sequence or sequence of interest can be cloned into appropriate vectors, such as plasmids, which can be amplified in an appropriate host.
- the recovered nucleic acid which may contain several DNA species, may also be separated using the methods that exploit the sequence and conformation of the nucleic acid. It may be necessary to separate the double-stranded PCR product to single-stranded in order to use such methods. These methods can be capillary affinity gel for nucleic acid and HPLC, or the combination thereof.
- the individual species of the nucleic acid molecules can then be sequenced.
- the amino acid sequences of polypeptides that bind to the target molecules can be deduced from the DNA sequence. If desired, the selected polypeptides can be either synthesized chemically or expressed in living organisms.
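Deducing the polypeptide from a recovered DNA sequence amounts to reading codons from the start codon onward. The sketch below uses the standard genetic code; the function name `deduce_polypeptide` and the choice to begin at the first ATG (dAdTdG) and stop at the first stop codon are assumptions for illustration, not a procedure specified in the source.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, listed in TCAG order for each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def deduce_polypeptide(dna):
    """Translate the coding-strand DNA from the first ATG to the first stop codon."""
    dna = dna.upper()
    start = dna.find("ATG")
    if start < 0:
        return ""  # no start codon found
    residues = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    print(deduce_polypeptide("AAGGAGGTATGCATCACTGGTAAAAA"))
```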
- the entire selection procedure, or portions thereof, may be automated. Translated complexes can be contacted with targets and unbound complexes can be washed away by a programmable machine. Another component of the automated machine may perform amplification reactions on the nucleic acid molecules of the bound complexes. Several rounds of selection and amplification may be automated in a linked process, and the final PCR products can be separated on a column that utilizes differences in sequence and conformation. Individual nucleic acid molecules may be transferred directly to an automated sequencer and sequenced, for example using fluorescently tagged nucleotides that may be read spectrophotometrically.
- the present invention includes nucleic acid molecules that comprise at least a portion of a random sequence or selected nucleic acid sequence identified by this method.
- the present invention also includes polypeptides that include at least a portion of a polypeptide encoded by an identified random sequence or sequence of interest.
- the present invention also includes a method for identifying a nucleic acid molecule or sequence in other forms, which include double-stranded DNA, single- or double-stranded RNA or RNA-DNA duplex.
- the sources for these forms of nucleic acid can be either synthetic or biological, such as viral, prokaryotic and eukaryotic. All of these nucleic acids must be converted to DNA as the template for the synthesis of RNA, which is in turn used as the template for protein translation.
- total eukaryotic cellular mRNA may be converted to DNA using the methods known to the art.
- the DNA is then tagged with a linking moiety.
- the selection and enrichment of the nucleic acid sequences can be carried out using the procedures described in the previous section.
- the present invention includes nucleic acid molecules that include at least a portion of a random sequence or selected nucleic acid sequence identified by this method.
- the present invention also includes polypeptides that include at least a portion of a polypeptide encoded by an identified random sequence or sequence of interest.
- the present invention includes nucleic acid constructs that are useful for a variety of purposes, including methods of the present invention.
- a nucleic acid molecule of the present invention preferably comprises a linking moiety and an open reading frame (ORF).
- Linking moieties are domains of a nucleic acid molecule, or chemical compounds intrinsic to or bound to nucleic acid molecules, that can link with a polypeptide encoded by the ORF.
- the linking moiety is covalently bound to the nucleic acid.
- this is not a requirement of the invention.
- Linking moieties can be any compounds that can directly or indirectly link a polypeptide to a nucleic acid molecule of the present invention that encodes it.
- puromycin, or other tRNA mimetics or amino acid analogs bound to a nucleic acid molecule of the present invention, can be incorporated into a polypeptide in a reaction catalyzed by the ribosome. Methods of binding puromycin to the 3' and 5' ends of nucleic acid molecules are known in the art; see, for example, PCT applications WO 00/72869, WO 01/07657, WO 01/04265, and WO 00/32823, all herein incorporated by reference.
- a linking moiety such as puromycin can also be bound to a DNA binding molecule, such as lac repressor protein or the RNA polymerase itself.
- the linkage between a DNA molecule of the present invention that includes a binding site for the DNA binding protein and a polypeptide encoded by the DNA molecule is via the bridge of DNA-DNA binding molecule-puromycin-polypeptide.
- puromycin that is bound to a protein is bound to the protein by a linker (such as a carbon chain) that can allow the puromycin access to a ribosome that translates the RNA transcribed by the RNA polymerase.
- the puromycin can be incorporated into a polypeptide by a ribosome, and thereby linked, via RNA polymerase, to a DNA molecule that encodes the polypeptide.
- a portion of the nascent polypeptide synthesized by the ribosome can be a linking moiety, and can bind to the nucleic acid molecule or to a compound or any other entity that is directly or indirectly linked to the nucleic acid molecule.
- nucleic acid molecules that can comprise, at or near a 3' or 5' terminus, linking moieties such as, but not limited to, biotin, digoxigenin, nitrilotriacetic acid, nucleic acid sequences, peptide nucleic acid sequences, or peptide sequences. All of these compounds can be bound by binding domains encoded by the ORF of the nucleic acid molecule.
- the binding domain can comprise at least a portion of an avidin or streptavidin protein, a sequence of consecutive histidine residues that can bind nitrilotriacetic acid, or at least a portion of an antibody or other specific binding member that binds a linking moiety such as digoxigenin or a peptide coupled to the nucleic acid molecule.
- the nucleic acid construct, in addition to comprising a random sequence or sequence of interest, also comprises a sequence encoding the amino acid sequence of the binding domain.
- the sequence of interest or random sequence and the sequence encoding the polypeptide binding domain are both part of the same open reading frame.
- linking moieties such as the types of compound and sequence and length of nucleic acid or peptide that relate to the function of these structures can be selected based on reports in the literature, or by screening compounds for the desired activity using standard assay methods as they are known in the art.
- Experiments and assays that test for the linkage of a polypeptide to its own encoding nucleic acid molecule can be designed using, for example, labels that can be incorporated into nucleic acid molecules and polypeptides, and by using separation techniques such as gel electrophoresis and enzymes such as nucleases and proteases to demonstrate the coupling of a nucleic acid molecule to the polypeptide it encodes. See, for example, PCT application number WO 98/31700 and PCT application number WO 00/26511, both herein incorporated by reference.
- the ORF of the nucleic acid molecule of the present invention can include at least one random sequence or at least one sequence of interest or a combination of at least one random sequence and at least one sequence of interest.
- the random sequence or sequence of interest can be made using appropriate methods in the art, such as cloning techniques, including PCR techniques and other enzymatic techniques such as reverse transcription from cellular mRNA, solid phase synthesis, or fragmenting nucleic acid molecules using a variety of methods, such as shear forces, vibrational energy or restriction enzymes, or a combination of these methods.
- the polynucleotides can be of any length, but are preferably between about twenty bases and about 500 bases, more preferably between about forty bases and about 150 bases in length.
- the nucleic acid can be of any length, preferably between about 100 bases and 10,000 bases, more preferably between about eighty bases and 1,000 bases.
- a sequence of interest can be any sequence of interest, and can be known or unknown, for example, it can be known sequences of one or more proteins, or it can be one or more sequences of an unfractionated, fractionated, or partially fractionated population of nucleic acid molecules.
- the ORF can also comprise random sequences combined with sequences of interest in any way.
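A library of ORFs that combine a fixed sequence of interest with a random region can be sketched in silico as follows. The names `random_sequence` and `build_library`, the uniform base composition, and the seeded generator are assumptions for illustration; a synthesized library would be produced chemically or enzymatically as described above.

```python
import random


def random_sequence(length, rng):
    """Return a random DNA sequence of the given length (uniform over A, C, G, T)."""
    return "".join(rng.choice("ACGT") for _ in range(length))


def build_library(size, random_length, fixed_prefix="", seed=0):
    """Build `size` ORFs, each a fixed prefix followed by a random region."""
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    return [fixed_prefix + random_sequence(random_length, rng) for _ in range(size)]


if __name__ == "__main__":
    # Sixty random bases falls within the preferred forty-to-150-base range.
    for orf in build_library(3, 60, fixed_prefix="ATG"):
        print(orf)
```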
- the nucleic acid molecule can be any nucleic acid molecule, but is preferably single-stranded DNA or double-stranded DNA, and can also be a DNA/RNA duplex molecule.
- the nucleic acid constructs can be provided in vectors with linking moieties. Linking moieties can be chemical groups or compounds that are bound to or preferably incorporated into a nucleic acid molecule and can bind or be incorporated into the encoded polypeptide by the ribosome. For example, tRNA analogues such as, but not limited to, puromycin, incorporated into the nucleic acid can be linked to a polypeptide by the ribosome. Puromycin can also be bound to a DNA binding molecule, such as lac repressor protein.
- the linkage between a DNA molecule of the present invention that includes a binding site for the DNA binding protein and a polypeptide encoded by the DNA molecule is via the bridge of DNA-DNA binding molecule-puromycin-polypeptide.
- the puromycin can be incorporated into a polypeptide by a ribosome, and thereby linked to a DNA molecule that encodes the polypeptide.
- the linking moiety and the ORF sequence encoding a sequence of interest or random sequence need not be directly linked together or immediately adjacent to each other, but are preferably operably linked.
- These elements of the nucleic acid molecule of the present invention can be provided on a nucleic acid construct in any order or orientation.
- the nucleic acid molecules of the present invention can be made using any appropriate method, including synthetic methods or cloning methods as they are known in the art (Sambrook et al., supra, (1989)).
- Nucleic acid molecules of the present invention can be of any length in total, but are preferably between about twenty bases and about 10,000 bases, more preferably between about forty bases and about 1,000 bases.
- the linking moiety and the nucleic acid molecule are operably linked, preferably covalently linked together.
- the linking moiety can be 5' to the sequence encoding the interacting domain or 3' to the random sequence or sequence of interest.
- the nucleic acid molecule of the present invention preferably includes at least one ribosome binding sequence (RBS) that promotes the initiation of translation of a nucleic acid molecule of the present invention.
- a ribosome binding sequence, or translation initiation sequence can be, for example, a Shine-Dalgarno sequence, a Kozak sequence, or an IRES sequence.
- preferred sequences for the initiation of translation are "Shine-Dalgarno" sequences and at least one start codon.
- a preferred start codon is dAdTdG.
- the nucleic acid may not include such control sequences.
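Locating a Shine-Dalgarno sequence and its downstream start codon in a candidate construct can be sketched with a regular expression. The motif `AGGAGG` and the 4 to 10 base spacing window are assumptions chosen for illustration (the source does not specify them), and `find_translation_start` is a hypothetical helper name.

```python
import re

# Assumed Shine-Dalgarno-like motif, an assumed spacer of 4 to 10 bases,
# then the start codon ATG (dAdTdG in the DNA template).
SD_START = re.compile(r"AGGAGG[ACGT]{4,10}?(ATG)")


def find_translation_start(dna):
    """Return the 0-based index of the ATG that follows the motif, or None."""
    match = SD_START.search(dna.upper())
    return match.start(1) if match else None


if __name__ == "__main__":
    print(find_translation_start("TTAGGAGGTTACATATGAAA"))
    print(find_translation_start("TTTTATGAAA"))
```

A return value of `None` corresponds to a nucleic acid that lacks these control sequences under the assumed motif and spacing.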
- a nucleic acid molecule of the present invention also includes a ribosome stalling sequence, where translation is halted or dramatically slowed while the ribosome remains bound to the nucleic acid template.
- a ribosome stalling sequence is positioned 3' of an open reading frame of a nucleic acid molecule of the present invention.
- Preferred ribosome stalling sequences for nucleic acid molecules of the present invention include polydA and sequences having stable secondary structure, such as hairpin structure.
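The two kinds of stalling sequence named above, a poly(dA) run and a hairpin, can be recognized in a sequence with a simple scan. This is a sketch under assumptions: the helper names, the default stem length of six base pairs, and the loop lengths of 3 to 8 bases are illustrative choices, and the hairpin test only looks for perfectly complementary arms rather than predicting thermodynamic stability.

```python
import re

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def longest_poly_da(seq):
    """Return the length of the longest uninterrupted run of A."""
    runs = re.findall(r"A+", seq.upper())
    return max((len(run) for run in runs), default=0)


def has_hairpin(seq, stem=6, min_loop=3, max_loop=8):
    """True if some stem-length arm is followed, after a loop, by its reverse complement."""
    seq = seq.upper()
    for i in range(len(seq) - 2 * stem - min_loop + 1):
        partner = reverse_complement(seq[i:i + stem])
        for loop in range(min_loop, max_loop + 1):
            j = i + stem + loop
            if seq[j:j + stem] == partner:
                return True
    return False


if __name__ == "__main__":
    print(longest_poly_da("ATGCC" + "A" * 12))
    print(has_hairpin("GGCACCTTTTGGTGCC"))
```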
- the nucleic acid molecule of the present invention can further include sequences or other polymers that function as at least one spacer region.
- the spacer region includes nucleic acid sequences or long chain polymers that preferably do not interact with nucleic acids.
- the spacer region can be made using appropriate methods in the art, such as solid phase synthesis or cloning methods known in the art.
- the spacer region can be of any length, but is preferably between about five bases and about 100 bases, more preferably between about ten bases and about fifty bases in length, or of equivalent length if the spacer is a non-nucleic acid molecule.
- Preferred spacer regions include secondary structure-free DNA sequences, long chain carbon molecules, or a combination of both (see FIGs. 7 and 8).
- a spacer region separates a linking moiety from the attached nucleic acid molecule.
- the linking moiety, the ribosome binding sequence, sequence encoding polypeptide or the random sequence or sequence of interest, ribosome stalling sequence and spacer region need not be directly linked together, immediately adjacent to each other or provided on the same nucleic acid molecule, but are preferably operably linked.
- These elements of the nucleic acid molecule of the present invention can be provided on a nucleic acid construct in any order or orientation.
- the linking moiety can be anywhere in the nucleic acid, preferably at the 5'- or 3'-end of the nucleic acid (see FIGs. 7 and 8).
- the nucleic acid molecules of the present invention may further include sequences that encode peptides that mediate the entry of peptides and other molecules into cells.
- sequences include sequences that encode portions of the tat gene of HIV (Anderson et al. Biochem. Biophys. Res. Commun. 194: 876-884 (1993); Fawell et al., Proc. Natl. Acad. Sci. USA 91: 664-668 (1994); Kim et al., J. Immunol. 159: 1666-1668 (1997); Vives et al. J. Biol. Chem. 272: 16010-16017 (1997); Vocero-Akbani et al. Nat. Med.
- nucleic acids of the present inventions may further include sequences that encode peptides that direct molecules to particular cellular compartments, for example, the endoplasmic reticulum or the mitochondria, as such sequences are known in the art or are later identified or developed.
- the nucleic acid molecules of the present invention may be directly or indirectly labeled with a detectable marker.
- the detectable marker may be a radioisotope or a nonradioactive detectable molecule such as biotin or fluorescein, or other detectable markers as they are known or developed in the art.
- the marker may be directly or indirectly bound to the nucleic acid.
- the nucleic acid molecule is a single-stranded DNA and comprises a puromycin at its 5'-end that serves as the linking moiety.
- the single-stranded DNA further comprises a ribosome binding sequence and a downstream translation start codon, such as dAdTdG, and a ribosome stalling sequence such as poly(dA)n or a region with strong secondary structure at the 3'-end, where poly(dA)n is preferably between five and fifty dAs, more preferably between ten and forty dAs, or the secondary structure region is preferably longer than five base pairs, or more preferably, longer than ten base pairs.
- the linking moiety is preferably at the 5' end of the nucleic acid, and the sequences encoding the polypeptide are located between the ribosome binding sequence and ribosome stalling sequence.
- the nucleic acid molecule is single-stranded DNA and is labeled with a puromycin at its 3'-end that serves as the linking moiety.
- the single-stranded DNA further comprises a ribosome binding sequence and a downstream translation start codon, such as dAdTdG, and a ribosome stalling sequence such as poly(dA)n or a region with strong secondary structure at the 3'-end.
- the linking moiety is preferably at the 3' end of the nucleic acid, and the sequences encoding the polypeptide are located between the ribosome binding sequence and ribosome stalling sequence.
- the ribosome stalling sequence can also serve as a spacer so that the linking moiety can be incorporated into the nascent polypeptide.
- Such length can be between five and 300, preferably ten to 100, or more preferably fifteen to fifty nucleotides.
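The element order of the single-stranded DNA constructs described above (ribosome binding sequence, start codon, coding sequence, poly(dA)n stalling sequence, with puromycin at one end) can be captured in a small data structure. This is a sketch only: the class `Construct`, the `AGGAGG` motif, and the `TTACAT` gap are hypothetical placeholders, while the five-to-fifty bound on poly(dA)n follows the preferred range stated in the text.

```python
from dataclasses import dataclass


@dataclass
class Construct:
    orf: str                # random sequence or sequence of interest
    linker_end: str = "5'"  # end carrying the puromycin linking moiety
    rbs: str = "AGGAGG"     # assumed Shine-Dalgarno-like motif
    gap: str = "TTACAT"     # assumed spacing between RBS and start codon
    poly_da: int = 20       # length n of the poly(dA)n stalling sequence

    def __post_init__(self):
        if self.linker_end not in ("5'", "3'"):
            raise ValueError("linker_end must be 5' or 3'")
        if not 5 <= self.poly_da <= 50:
            raise ValueError("poly(dA)n outside the preferred range of 5 to 50")

    def sequence(self):
        """Return the DNA sequence, 5' to 3', without the linking moiety."""
        return self.rbs + self.gap + "ATG" + self.orf.upper() + "A" * self.poly_da

    def layout(self):
        """Return a schematic of the element order, including the puromycin."""
        body = "5'-RBS-ATG-ORF-poly(dA)%d-3'" % self.poly_da
        if self.linker_end == "5'":
            return "puromycin-" + body
        return body + "-puromycin"


if __name__ == "__main__":
    construct = Construct("CATCAC", linker_end="3'")
    print(construct.layout())
    print(construct.sequence())
```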
- the nucleic acid molecule of the present invention and complexes that include such nucleic acid molecules can also be provided in a cell.
- the nucleic acid molecule can be introduced into a cell using methods known in the art, such as lipofection or electroporation.
- nucleic acid molecules can be introduced into cells using vectors, such as viruses or phages.
- the cells can be any cell, including prokaryotic or eukaryotic cells, and can be ex vivo or in vivo, including within a whole organism, including a mammal, including a human.
- the nucleic acid molecule can be transcribed and/or translated to produce a complex of the present invention.
- nucleic acid molecules of the present invention can also be provided in other linear forms of nucleic acid molecule, such as double-stranded DNA, RNA, and DNA/RNA duplex. If the nucleic acids are provided in these forms, they all have to be converted to single-stranded DNA so that the nucleic acid can be translated to polypeptide.
- the nucleic acid molecules of the present invention can also be provided in a vector.
- Vectors can be viral vectors, liposomes, microspheres, plasmids, phages or linear dsDNA molecules.
- Vectors preferably include double-stranded DNA molecules of the present invention, but the invention is not limited to such vectors.
- various viral vectors include single-stranded DNA (parvoviruses), single-stranded RNA (retroviruses) or double-stranded RNA (rotaviruses).
- Vectors are useful for making libraries of nucleic acid molecules of the present invention, particularly libraries that include nucleic acid molecules that have different random sequences or sequences of interest.
- vectors are convenient for making, storing and transporting nucleic acid molecules of the present invention.
- Vectors can be made or modified using methods in molecular biology as they are known in the art (Sambrook et al., supra, (1989)).
- the vector of the present invention can be any vector as that term is known in the art.
- Vectors can be any vector known in the art, for example, retroviruses (U.S. Patent No. 5,399,346 to Anderson et al., issued March 21, 1995; Bandara et al., DNA and Cell Biol. 11:227-231 (1992)); adenoviruses (Berkner, BioTechniques 6: 616-629 (1989)); adeno-associated viruses (Larrick and Burck, Gene Therapy. Application of Molecular Biology, Elsevier, New York (1991)); plasmid vectors (U.S. Patent No. 5,240,846 to Collins et al., issued August 31, 1993); liposomes (Holmberg et al., J. Liposome Res.
- the present invention includes a cell that includes or has been transfected or transformed by a vector of the present invention.
- a cell can be ex vivo or in vivo in a subject, including a test animal or a human.
- the nucleic acid molecule of the present invention in the vector can be expressed in the cell.
- the nucleic acid molecule includes a sequence of interest such that when translated retains at least one activity of the polypeptide encoded by the sequence of interest. In that way, the number of translated polypeptides encoded by the sequence of interest is reduced, which results in a dosing effect of the polypeptide of interest within the cell.
- the vector includes at least one random sequence. The activity of the polypeptide encoded by the random sequence in the cell can be monitored by observing or interrogating the cell using methods known in the art, including the use of reporter genes to report changes in signal transduction within a cell.
- the nucleic acid molecule of the present invention is operably linked to its encoding polypeptide.
- the operable link between the nucleic acid molecule and the polypeptide of the present invention occurs with covalent bonding.
- a ribosome translates the ssDNA sequence directly into a polypeptide.
- the linking moiety is incorporated into the growing peptide at its C-terminal.
- the interactions involved in the direct linking of the linking moiety to the polypeptide can be any interactions that result in irreversible binding. Irreversible binding is characterized by covalent bonds as they are known in the art.
- a nucleic acid molecule of the present invention can also be operably and covalently linked to a polypeptide encoded by a random sequence or a sequence of interest.
- Operably linked in this instance refers to the case where the nucleic acid can directly or indirectly bind with the linking moiety and the polypeptide encoded by the random sequence or sequence of interest is capable of binding with a substance of interest, such as a ligand.
- Complexes comprising nucleic acid molecules of the present invention operably linked to polypeptides may also be labeled with a detectable label.
- the detectable label may be a radioisotope or a nonradioactive detectable molecule such as biotin, or other detectable moieties as they are known or developed in the art.
- the label may be directly or indirectly bound to the nucleic acid of the complex or to a polypeptide of the complex, or both.
- the polypeptide of the complex labeled with a detectable marker need not be the polypeptide encoded by or partially encoded by the random sequence or sequence of interest.
- a nucleic acid molecule of the present invention can link to a polypeptide encoded by said sequence or sequence of interest.
- the polypeptide can form a structure to bind with a substance of interest.
- This aspect of the invention allows the selection of polypeptides that bind with a substance of interest, while at the same time selecting the nucleic acid molecule that encodes the polypeptide that binds with a substance of interest.
- the substance of interest can be on a solid support, and can be immobilized on such a solid support using methods known in the art such as absorption, chemical conjugation or cross-linking.
- the structure formed by a nucleic acid of the present invention and a polypeptide encoded by a random sequence or sequence of interest can be immobilized on a solid support and one or more substances of interest may be bound with or capable of reacting with the fixed complex or complexes.
- the substance of interest can be on or within a cell, and the cell can be immobilized on a solid support using appropriate methods, such as solid supports covered with fibronectin or other adhesion molecules, entrapped thereon.
- the cell can be ex vivo and can be provided as a cell, culture of cells, or part of a sample of tissue, fluid or organ. Alternatively, the cell can be in vivo in a subject.
- the substance of interest may be a cell-type-specific or tissue-specific molecule, such that nucleic acids or peptides of the present invention that specifically bind to the substance of interest can be identified.
- Such cell-type-specific and tissue-specific molecules can be used to target drug delivery to specific cells or tissues.
- the cell can be any cell, including a normal cell or an abnormal cell, such as, for example, a neoplastic cell or a virus infected cell.
- the substance of interest can also be on or within an etiological agent, such as, for example, a virus, a bacterium, a bacterial spore, a parasite or a prion.
- the substance of interest may be one or a plurality of molecules on or within an etiological agent, virus, bacterium, protozoan, tumor cell or abnormal cell.
- the substance of interest used for selection may be whole cells, viruses, or microorganisms fixed to a solid support or in solution, or may be a portion or fractionated preparation of one or more cells, viruses, or microorganisms fixed to a solid support or in solution.
- the substance of interest can also include at least one organic molecule, an inorganic molecule, a polymer, a polypeptide, a lipid, a carbohydrate, a small molecule, a nucleic acid molecule, a ribozyme, a biomacromolecule or a drug.
- the present invention also includes a library of nucleic acid molecules.
- libraries include nucleic acid molecules that contain linking moieties so that they are able to covalently link to the respective polypeptides they encode.
- the library of nucleic acid molecules can include at least two different random sequences, at least two different sequences of interest or a combination of at least one random sequence and at least one sequence of interest.
- a library of nucleic acid molecules can include two such nucleic acid molecules.
- Each nucleic acid molecule can have a different random sequence or a different sequence of interest.
- one nucleic acid molecule can have a random sequence and the other nucleic acid molecule can have a sequence of interest.
- the library can be fixed to a solid support.
- libraries containing random sequences which may include sequences in which one or a few sequence positions have been randomly or semi-randomly varied, or libraries containing sequences of interest, can be fixed to a chip or array for screening with one or more substances of interest.
- a translated library of complexes may also be fixed to a solid support.
- libraries of complexes containing random sequences which may include sequences in which one or a few sequence positions have been randomly or semi-randomly varied, or libraries of complexes containing sequences of interest, can be fixed to a chip or array for screening with one or more substances of interest.
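As a minimal illustrative sketch (not part of the disclosed method), a library in which one or a few sequence positions are varied can be enumerated computationally. The function name `semi_random_library`, the parent sequence and the choice of varied positions are hypothetical and are used here only for illustration.

```python
from itertools import product


def semi_random_library(parent, positions, alphabet="ACGT"):
    """Enumerate every variant of `parent` in which only `positions` vary.

    parent:    parent nucleotide sequence (hypothetical example input)
    positions: zero-based indices of the positions to randomize
    """
    variants = []
    for combo in product(alphabet, repeat=len(positions)):
        seq = list(parent)
        for pos, base in zip(positions, combo):
            seq[pos] = base
        variants.append("".join(seq))
    return variants


# Hypothetical parent sequence: ATG start codon followed by two codons.
library = semi_random_library("ATGGCTAAA", positions=[3, 4])
print(len(library))
```

Varying n positions over four bases gives 4^n members, so this kind of exhaustive enumeration is only practical when a few positions are varied; a fully random library would instead be sampled.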
- the detectable label may be a radioisotope or a nonradioactive detectable molecule such as biotin, or other detectable moieties as they are known or developed in the art.
- the label may be directly or indirectly bound to the members of the library.
- Library members may be labeled by direct or indirect binding of a detectable marker to the nucleic acid or to a polypeptide of the library member.
- a library of nucleic acid molecules can also include at least one substance of interest.
- the substance of interest can be bound with a nucleic acid molecule, a polypeptide, or complex of the present invention, can be unbound, or can be bound to some members of the library and not bound to other members of the library.
- the substance of interest can be directly bound or indirectly bound to a nucleic acid molecule of the present invention, but is preferably indirectly bound to a nucleic acid molecule of the present invention or directly bound to a complex of the present invention (particularly the polypeptide encoded by the random sequence or sequence of interest).
- the substance of interest can be a substrate, such as an enzymatic substrate, with which a nucleic acid molecule, polypeptide, or complex of the present invention interacts.
- Reactions of the nucleic acid molecule, polypeptide, or complex of the present invention with the substance of interest may be monitored and quantitated using appropriate assays, for example spectrophotometric assays or assays that measure the release of a radioactive moiety.
- the substance of interest can be directly or indirectly bound on a solid support or in solution.
- the structure formed by the construct of the present invention and the polypeptide encoded by a random sequence or sequence of interest can be immobilized on a solid support and one or more substances of interest may be unbound, may be bound with the fixed one or more complexes of the library, or may be acted upon in a biochemical reaction catalyzed by or modulated by one or more complexes of the library.
- the substance of interest can be on or within a cell, wherein the cell can be ex vivo or in vivo, such as in a subject.
- the cell can be any cell, including a normal cell or an abnormal cell, such as, for example, a neoplastic cell or a virus infected cell.
- the substance of interest can also be one or a plurality of molecules on or within an abnormal or normal cell or on or within an etiological agent, such as, for example, a virus, a bacterium, a bacterial spore, a parasite or a prion.
- the substance of interest may be one or a plurality of molecules on or within an etiological agent, virus, bacterium, protozoan, tumor cell or abnormal cell.
- the substance of interest used for selection may be whole cells, viruses, or microorganisms fixed to a solid support or in solution, or may be a portion or fractionated preparation of one or more cells, viruses, or microorganisms fixed to a solid support or in solution.
- the substance of interest may be a cell-type-specific or tissue-specific molecule, such that nucleic acids or peptides of the present invention that specifically bind to the substance of interest can be identified.
- Such cell-type-specific and tissue-specific molecules can be used to target drug delivery to specific cells or tissues.
- the substance of interest can also include at least one organic molecule, an inorganic molecule, a polymer, a polypeptide, a lipid, a carbohydrate, a small molecule, a nucleic acid molecule, a ribozyme, a biomacromolecule or a drug.
- Library of vectors
- the present invention also includes a library of vectors of the present invention.
- the library of vectors includes at least two different random sequences, at least two different sequences of interest, or a combination of random sequences and sequences of interest.
- the present invention includes methods for identifying nucleic acid molecules, particularly from a library of random sequences or sequences of interest that encode polypeptides that bind with a substance of interest.
- This method includes: providing at least one single-stranded nucleic acid molecule of the present invention that includes at least one random sequence or at least one sequence of interest; translating the single-stranded DNA molecule to form at least one complex, wherein the complex comprises a single-stranded DNA operably linked to the polypeptide it encodes; contacting at least one complex with at least one substance of interest; selecting at least one complex that binds with said at least one substance of interest; and identifying said random sequence or said nucleic acid sequence of interest or nucleic acid molecule of interest.
- the nucleic acid molecule, including the random sequence or selected sequence can be sequenced.
- a nucleic acid molecule of the present invention or a library thereof in the form of a complex can be made by directly translating single-stranded DNA.
- Single-stranded DNA can be translated under certain conditions by ribosomes that use single-stranded DNA as a template and d(ATG) as a start codon (see, for example, Morgan et al. (1967) J Mol Biol 26: 477-497; Hulen et al. (1977) Biochimie 59:179-188; Salas and Bollum, J. (1969) Biol. Chem.
- the single-stranded DNA may be made by a variety of methods that are known in the art (for example Ellington, A.D. and Szostak, J.W. (1992) Nature 355, 850; Cui, Y. et al. (1995) J. Bacteriol. 177, 4872; Kujau, M.J. and Wolfl, S. (1997) Mol. Biotech. 7, 333; Williams, K.P. and Bartel, D.P. (1995) Nucleic Acids Res. 23, 4220-4221; Guo, L.H. and Wu, R. (1982) Nucleic Acids Res. 10, 2065-2084).
- the linking moiety can be attached to either the 3'- or 5'-end of the single-stranded DNA molecule using chemical and/or enzymatic synthesis.
- a puromycin molecule can be chemically linked to an oligonucleotide at its 5'-end using appropriate chemical synthesis.
- the oligonucleotide can be used as one of a pair of PCR primers to synthesize double-stranded DNA.
- the double-stranded DNA duplex can then be dissociated or degraded to single-stranded using the methods known to the art (for example Ellington, A.D.
- the chemically synthesized 5'-puromycin-labeled oligodeoxyribonucleotide can be used directly as the template for protein synthesis if the oligodeoxyribonucleotide contains the features of a regular messenger for protein synthesis.
- the single-stranded DNA is used as the template for protein translation by the ribosome, and the ribosome incorporates the linking moiety, such as puromycin, into the growing peptide chain and thus forms a ssDNA-polypeptide complex.
- the complex is placed in a mixture containing the target molecule or molecule of interest, and the polypeptide portion of the complex binds with the target molecule or the molecule of interest.
- the bound complex is separated from the unbound complexes.
- the nucleic acid on the bound complex can be eluted from the complex or be used directly as the template for nucleic acid amplification reactions such as PCR.
- the amplified nucleic acid can be sequenced or expressed in living organisms. If one of the primers is labeled with the linking moiety, the amplified nucleic acid will also be labeled with the linking moiety. Linking moiety-labeled single-stranded DNA can therefore be generated again, and the entire selection procedure can be repeated.
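The effect of repeating the selection procedure can be illustrated with a simple numerical model. This is only a sketch under stated assumptions: the retention fractions are hypothetical values chosen for illustration, amplification is assumed to copy all recovered species equally, and the names `selection_round` and `run_selection` do not come from the disclosure.

```python
def selection_round(freqs, retention):
    """Return species frequencies after one bind/wash/amplify round.

    freqs:     {sequence: fraction of the pool before selection}
    retention: {sequence: assumed fraction of that species retained on the target}
    Amplification is assumed to be unbiased, so it only renormalizes the pool.
    """
    recovered = {seq: freqs[seq] * retention[seq] for seq in freqs}
    total = sum(recovered.values())
    return {seq: amount / total for seq, amount in recovered.items()}


def run_selection(freqs, retention, rounds):
    """Apply `rounds` successive selection rounds and keep every intermediate pool."""
    history = [dict(freqs)]
    for _ in range(rounds):
        freqs = selection_round(freqs, retention)
        history.append(freqs)
    return history


# Hypothetical starting pool: one binder in a thousand molecules.
pool = {"binder": 0.001, "background": 0.999}
# Hypothetical retention: half of the binder survives washing, 1% of the rest.
retention = {"binder": 0.5, "background": 0.01}

history = run_selection(pool, retention, rounds=3)
for round_number, state in enumerate(history):
    print(round_number, round(state["binder"], 4))
```

In this model the ratio of binder to background is multiplied by the ratio of their retention fractions in every round, which is why a small number of rounds may be sufficient when binding is reasonably specific.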
- the linking moiety can also be at the 3'-end of the single-stranded DNA.
- a puromycin moiety can be coupled to the 3'-end of an oligonucleotide using appropriate methods such as solid phase synthesis.
- the single-stranded oligonucleotide can be annealed to a complementary sequence to form a double-stranded oligonucleotide adapter.
- the adapter can be ligated to a double-stranded DNA with an appropriate end.
- the adapter-ligated double-stranded DNA is converted to single-stranded DNA using methods known to the art (for example Ellington, A.D.
- the single-stranded DNA can be used as the template for protein translation by ribosomes, and the ribosome can incorporate the linking moiety, such as puromycin, into the growing peptide chain and thus form a ssDNA-polypeptide complex.
- the complex can be contacted with one or more substances of interest such that the polypeptide portion of one or more complexes can bind with one or more substances of interest.
- Bound complexes are separated from the unbound complexes.
- Nucleic acid molecules of the bound complex can be eluted from the complexes or can be used directly as templates for amplification reactions such as PCR.
- the amplified nucleic acid molecules can be sequenced or expressed in living organisms.
- the PCR product can also be ligated to the double-stranded oligonucleotide adapter for a subsequent round of selection.
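The double-stranded adapter described above is formed by annealing the linking moiety-labeled oligonucleotide to a complementary sequence. As a minimal sketch, the complementary strand can be computed as the reverse complement of the labeled oligonucleotide. The adapter sequence below is hypothetical, and the sketch assumes unmodified A, C, G and T bases only.

```python
# Watson-Crick pairing for unmodified DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(oligo):
    """Return the strand that anneals to `oligo`, written 5' to 3'."""
    return oligo.upper().translate(_COMPLEMENT)[::-1]


# Hypothetical adapter strand; the real sequence would depend on the
# end of the double-stranded DNA to which the adapter is ligated.
adapter_top = "GATCCGTTAAC"
adapter_bottom = reverse_complement(adapter_top)
print(adapter_top, adapter_bottom)
```

A practical adapter would usually also carry an overhang or blunt end matched to the DNA being ligated, which this sketch does not model.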
- the complex may optionally be depleted of ribosomes by treating the mixture of translated constructs with reagents that are known to cause the dissociation of ribosomes from single-stranded DNA.
- EDTA may be added to deplete free Mg2+ in the reaction mixture. This may be desirable for screening applications where the ribosome may impede binding to the substance of interest, or impede the entry of complexes into cells.
- Complexes, including libraries of complexes, may be purified or substantially purified from the reaction mixture using reagents that bind parts of the complex. For example, if the polypeptide contains a stretch of six adjacent histidine residues, the single-stranded DNA-polypeptide complex may be purified from the translation reaction using nitrilotriacetic acid-linked beads. Purified or substantially purified complexes may be stored under conditions that promote the stability of nucleic acids and polypeptides, for example, at 4°C in a buffer that contains BSA and EDTA.
- the single-stranded nucleic acid in said complex can be converted to double-stranded DNA using methods known in the art, such as using T4 DNA polymerase.
- the complex is contacted with one or more substances of interest under conditions that promote the binding of the complex, or reaction of the complex, particularly the polypeptide encoded by the random sequence or sequence of interest, with the substance of interest.
- the substance of interest can be on a solid support or in solution.
- a solid support may be a chip or array.
- the substance of interest can be on or within a cell and can be on or within an etiological agent.
- a substance of interest is bound with a complex that includes a polypeptide encoded by a random sequence or a sequence of interest and the random sequence or sequence of interest itself.
- Complexes that are not bound to a substance of interest can be separated from bound complexes using methods known in the art.
- the complexes that are free in solution can be washed away using methods known in the art for receptor-ligand reactions, such as immunoassay methods.
- the complexes of the present invention may be fixed to a chip or array, and the substance or substances of interest may be contacted with the chip or array to allow the substance of interest to bind or react with complexes for which the substance of interest has affinity.
- the substance of interest may be labeled with a detectable marker, or may be detected with a reagent specific for the substance of interest.
- Nonspecifically bound substance of interest may be washed off using appropriate methods as they are known in the art prior to detection of the bound substance of interest.
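When complexes are fixed to a chip or array and a labeled substance of interest is detected after washing, the readout is a signal per array position. The following is a minimal sketch of how positions could be called as bound. The fold-over-background threshold, the spot names and the signal values are assumptions made for illustration and are not specified in the disclosure.

```python
def call_hits(signals, background, fold=3.0):
    """Return array positions whose signal is at least `fold` times background.

    signals:    {spot identifier: measured label signal}
    background: signal measured for a spot with no specific binding
    fold:       assumed cut-off; it would need to be set empirically
    """
    threshold = fold * background
    return sorted(spot for spot, value in signals.items() if value >= threshold)


# Hypothetical readout from four array positions.
signals = {"A1": 120.0, "A2": 950.0, "B1": 80.0, "B2": 400.0}
hits = call_hits(signals, background=100.0)
print(hits)
```

Because each array position corresponds to a known library member, the positions returned identify the complexes for which the substance of interest has affinity.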
- the nucleic acid molecule encoding a peptide that binds with or reacts with a substance of interest has been selected using this method.
- the selected complex or portions thereof can be isolated by recovery using a variety of methods. For example, changes in pH, detergents, denaturing agents (such as phenol, urea or guanidinium), concentration and types of salts, such as chaotropic or anti-chaotropic salts, or combinations thereof can be used to elute the complex or portions thereof.
- the complex can be digested using enzymes, such as proteases or nucleases to free portions of the complex such as the polypeptide or the nucleic acid molecule.
- the bound nucleic acid is recovered and enriched using appropriate nucleic acid amplification reactions, such as the polymerase chain reaction.
- the enriched nucleic acid can be sequenced using appropriate methods known to the art.
- the enriched nucleic acid can be converted to single-stranded DNA again and used as the template for protein synthesis. If the linking moiety is at the 5' end of the translation template, the single-stranded DNA can be generated directly from the double-stranded DNA. If the linking moiety needs to be at the 3' end of the translation template, a linking moiety-attached adapter nucleic acid is needed so that the linking moiety can be ligated to the double-stranded DNA via the adapter. Then the adapter-linked double-stranded DNA can be converted to single-stranded DNA using the methods known in the art (for example Ellington, A.D.
- the recovered nucleic acid, which may contain several DNA species, may also be separated using methods that exploit the sequence and conformation of the nucleic acid. It may be necessary to convert the double-stranded PCR product to single-stranded DNA in order to use such methods. These methods include capillary affinity gel for nucleic acids and HPLC, or a combination thereof. The individual species of the nucleic acid molecules can then be sequenced. The amino acid sequence of the polypeptide that is able to bind to the substance of interest can be deduced from the nucleic acid sequence.
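Deducing the amino acid sequence from a recovered nucleic acid sequence can be sketched as follows. The sketch assumes the standard genetic code, reads the sequence as the coding strand, starts at the first ATG (consistent with the d(ATG) start codon mentioned above) and stops at the first in-frame stop codon. The function name `deduce_polypeptide` and the example sequences are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product("TCAG", repeat=3), _AMINO)
}


def deduce_polypeptide(dna):
    """Translate `dna` from the first ATG to the first in-frame stop codon.

    Returns an empty string when no ATG is present. Only the bases
    A, C, G and T are supported; any other character raises KeyError.
    """
    dna = dna.upper()
    start = dna.find("ATG")
    if start == -1:
        return ""
    residues = []
    for i in range(start, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        residues.append(amino_acid)
    return "".join(residues)


# Hypothetical recovered sequence with two bases upstream of the start codon.
print(deduce_polypeptide("GGATGGCTAAATAA"))
```

The same function can be used to check a deduced sequence for expected features, for example a run of six histidine codons used for purification.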
- the entire selection procedure, or portions thereof, may be automated. Translated complexes can be contacted with targets and unbound complexes may be washed away by a programmable machine. Another component of the automated machine may perform amplification reactions on the nucleic acid molecules of the bound complexes. Several rounds of selection and amplification may be automated in a linked process, and the final PCR products can be separated on a column that utilizes differences in sequence and conformation. Individual nucleic acid molecules may be transmitted directly to an automated sequencer and sequenced, for example using fluorescently tagged nucleotides that may be read spectrophotometrically.
- the present invention includes nucleic acid molecules that comprise at least a portion of a random sequence or selected nucleic acid sequence identified by this method.
- the present invention also includes polypeptides that include at least a portion of a polypeptide encoded by an identified random sequence or sequence of interest.
- the present invention also includes a method for identifying a nucleic acid molecule or sequence in other forms, which include double-stranded DNA, single- or double-stranded RNA or RNA-DNA duplex.
- the sources for these forms of nucleic acid can be either synthetic or biological, such as viral, prokaryotic and eukaryotic. All of these nucleic acids must be directly or indirectly converted to single-stranded DNA as the template for protein synthesis in order to form a polypeptide-single-stranded DNA complex in this invention.
- total eukaryotic cellular mRNA may be converted to double-stranded DNA using methods known in the art.
- the single-stranded DNA corresponding to the mRNA sequences can be ligated with a linking moiety. Then, the selection and enrichment of the nucleic acid sequences can be carried out using the procedures described in the previous section.
- the present invention includes nucleic acid molecules that include at least a portion of a random sequence or selected nucleic acid sequence identified by this method.
- the present invention also includes polypeptides that include at least a portion of a polypeptide encoded by an identified random sequence or sequence of interest.
- the present invention includes methods for identifying polypeptides, particularly polypeptides encoded by random sequences or sequences of interest that bind with a substance of interest.
- This method includes: providing at least one nucleic acid molecule of the present invention as a single-stranded DNA that includes at least one random sequence or at least one sequence of interest; translating the single-stranded DNA molecule to form at least one complex, wherein the complex comprises a single-stranded DNA operably linked to the polypeptide it encodes; contacting at least one complex with at least one substance of interest; selecting at least one complex that binds with said at least one substance of interest; and identifying said random sequence or said nucleic acid sequence of interest or nucleic acid molecule of interest.
- the nucleic acid molecule, including the random sequence or selected sequence can be sequenced.
- a nucleic acid molecule of the present invention or a library thereof in the form of a complex can be made by directly translating single-stranded DNA.
- Single-stranded DNA can be translated under certain conditions by ribosomes that use single-stranded DNA as a template and d(ATG) as a start codon (see, for example, Morgan et al. (1967) J Mol Biol 26: 477-497; Hulen et al. (1977) Biochimie 59:179-188; Salas and Bollum, J. (1969) Biol. Chem.
- the single-stranded DNA may be made by a variety of methods that are known in the art (for example Ellington, A.D. and Szostak, J.W. (1992) Nature 355, 850; Cui, Y. et al. (1995) J. Bacteriol. 177, 4872; Kujau, M.J. and Wolfl, S. (1997) Mol. Biotech. 7, 333; Williams, K.P. and Bartel, D.P. (1995) Nucleic Acids Res. 23, 4220-4221; Guo, L.H. and Wu, R. (1982) Nucleic Acids Res. 10, 2065-2084).
- the linking moiety can be attached to either the 3'- or 5'-end of the single-stranded DNA molecule using chemical and/or enzymatic synthesis.
- a puromycin molecule can be chemically linked to an oligonucleotide at its 5'-end using appropriate chemical synthesis.
- the oligonucleotide can be used as one of a pair of PCR primers to synthesize double-stranded DNA.
- the double-stranded DNA duplex can then be dissociated or degraded to single-stranded using the methods known to the art (for example Ellington, A.D.
- a puromycin moiety can be attached to the 3'-end of an oligonucleotide using an appropriate chemical synthesis method.
- the single-stranded oligonucleotide can be annealed to a complementary sequence to form a double-stranded oligonucleotide adapter.
- the adapter can be ligated to a double-stranded DNA with an appropriate end.
- the adapter-ligated double-stranded DNA is converted to single-stranded DNA using methods known in the art (for example Ellington, A.D. and Szostak, J.W. (1992) Nature 355, 850; Cui, Y. et al. (1995) J. Bacteriol.
- the complex may optionally be depleted of ribosomes by treating the mixture of translated constructs with reagents that are known to cause the dissociation of ribosomes from single-stranded DNA. For example, following translation, EDTA may be added to deplete free Mg2+ in the reaction mixture. This may be desirable for screening applications where the ribosome may impede binding to the substance of interest, or impede the entry of complexes into cells.
- Complexes, including libraries of complexes, may be purified or substantially purified from the reaction mixture using reagents that bind parts of the complex. For example, if the polypeptide contains a stretch of six adjacent histidine residues, the single-stranded DNA-polypeptide complex may be purified from the translation reaction using nitrilotriacetic acid-linked beads. Purified or substantially purified complexes may be stored under conditions that promote the stability of nucleic acids and polypeptides, for example, at 4°C in a buffer that contains BSA and EDTA.
- the single-stranded nucleic acid in said complex can be converted to double-stranded DNA using methods known in the art, such as using T4 DNA polymerase.
- the complex is contacted with one or more substances of interest under conditions that promote the binding of the complex, or reaction of the complex, particularly the polypeptide encoded by the random sequence or sequence of interest, with the substance of interest.
- the substance of interest can be on a solid support or in solution.
- a solid support may be a chip or array.
- the substance of interest can be on or within a cell and can be on or within an etiological agent.
- a substance of interest is bound with a complex that includes a polypeptide encoded by a random sequence or a sequence of interest and the random sequence or sequence of interest itself.
- Complexes that are not bound to a substance of interest can be separated from bound complexes using methods known in the art.
- the complexes that are free in solution can be washed away using methods known in the art for receptor-ligand reactions, such as immunoassay methods.
- the complexes of the present invention may be fixed to a chip or array, and the substance or substances of interest may be contacted with the chip or array to allow the substance of interest to bind or react with complexes for which the substance of interest has affinity.
- the substance of interest may be labeled with a detectable marker, or may be detected with a reagent specific for the substance of interest.
- Nonspecifically bound substance of interest may be washed off using appropriate methods as they are known in the art prior to detection of the bound substance of interest.
- the nucleic acid molecule encoding a peptide that binds with or reacts with a substance of interest has been selected using this method.
- the selected complex or portions thereof can be isolated by recovery using a variety of methods. For example, changes in pH, detergents, denaturing agents (such as phenol, urea or guanidinium), concentration and types of salts, such as chaotropic or anti-chaotropic salts, or combinations thereof can be used to elute the complex or portions thereof.
- the complex can be digested using enzymes, such as proteases or nucleases to free portions of the complex such as the polypeptide or the nucleic acid molecule.
- the bound nucleic acid is recovered and enriched using appropriate nucleic acid amplification reactions, such as the polymerase chain reaction.
- the enriched nucleic acid can be sequenced using appropriate methods known in the art. In other applications where multiple rounds of selection are required, the enriched nucleic acid can be converted to single-stranded DNA again and used as the template for protein synthesis. If the linking moiety is at the 5' end of the translation template, the single-stranded DNA can be generated directly from the double-stranded DNA. If the linking moiety needs to be at the 3' end of the translation template, a linking moiety-attached adapter nucleic acid is needed so that the linking moiety can be ligated to the double-stranded DNA via the adapter.
- the adapter-linked double-stranded DNA can be converted to single-stranded DNA using methods known in the art (for example Ellington, A.D. and Szostak, J.W. (1992) Nature 355, 850; Cui, Y. et al. (1995) J. Bacteriol. 177, 4872; Kujau, M.J. and Wolfl, S. (1997) Mol. Biotech. 7, 333; Williams, K.P. and Bartel, D.P. (1995) Nucleic Acids Res. 23, 4220-4221; Guo, L.H. and Wu, R. (1982) Nucleic Acids Res. 10, 2065-2084).
- the recovered nucleic acid molecules that contain a random sequence or sequence of interest can be cloned into appropriate vectors, such as plasmids, which can be amplified in an appropriate host.
- the recovered nucleic acid, which may contain several DNA species, may also be separated using methods that exploit the sequence and conformation of the nucleic acid. It may be necessary to convert the double-stranded PCR product to single-stranded DNA in order to use such methods. These methods include capillary affinity gel for nucleic acids and HPLC, or a combination thereof.
- the individual species of the nucleic acid molecules can then be sequenced.
- the amino acid sequence of the polypeptide that is able to bind to the substance of interest can be deduced from the nucleic acid sequence.
- the entire selection procedure, or portions thereof, may be automated. Translated complexes can be contacted with targets and unbound complexes may be washed away by a programmable machine. Another component of the automated machine may perform amplification reactions on the nucleic acid molecules of the bound complexes. Several rounds of selection and amplification may be automated in a linked process, and the final PCR products can be separated on a column that utilizes differences in sequence and conformation. Individual nucleic acid molecules may be transmitted directly to an automated sequencer and sequenced, for example using fluorescently tagged nucleotides that may be read spectrophotometrically.
- the present invention includes nucleic acid molecules that comprise at least a portion of a random sequence or selected nucleic acid sequence identified by this method.
- the present invention also includes polypeptides that include at least a portion of a polypeptide encoded by an identified random sequence or sequence of interest.
- the sequence of the polypeptides that bind to the target molecule or molecule of interest can be deduced from the nucleic acid sequence.
- the polypeptide may be obtained by expressing the genes in appropriate organisms or chemical synthesis.
- the obtained polypeptide may be assayed for its binding or biological activity.
- the present invention also includes a method for identifying a nucleic acid molecule or sequence in other forms, which include double-stranded DNA, single- or double-stranded RNA or RNA-DNA duplex.
- all of these nucleic acids must be directly or indirectly converted to single-stranded DNA in order to form a polypeptide-single-stranded DNA complex in this invention.
- the sequence of the polypeptides that bind to the target molecule or molecule of interest may be deduced from the nucleic acid sequence.
- the polypeptide may be obtained by expressing the genes in appropriate organisms or chemical synthesis.
- the obtained polypeptide may be assayed for its binding or biological activity.
- the present invention includes methods for identifying test compounds, test compounds identified by this method and pharmaceutical compositions identified by this method.
- One aspect of the present invention is a method for identifying a test compound, including: 1) contacting a target with a complex that comprises a nucleic acid molecule that comprises an open reading frame, a linking moiety, and a polypeptide encoded, at least in part, by the open reading frame, wherein the open reading frame comprises a random sequence or sequence of interest, and wherein the linking moiety is directly or indirectly bound to the nucleic acid molecule and to the polypeptide; 2) identifying complexes bound with said target, or identifying complexes on the basis of catalytic function or the results of cellular assays; determining the structure of the polypeptide encoded by the random sequence or sequence of interest; and 3) identifying moieties that have structures that have space filling shapes that are similar to at least a portion of said identified moiety.
- the present invention also includes a test compound identified by this method and a pharmaceutical composition identified by this method.
- nucleic acid molecules and polypeptides of the present invention that bind with a substance of interest, such as a target, including a pharmaceutical target, or complexes that comprise peptides or nucleic acids with desirable catalytic properties, can be identified using methods of the present invention.
- the structure of the identified nucleic acid molecule or amino acid can be determined using methods such as NMR and mass spectroscopy.
- the identified nucleic acid molecule sequences or amino acid sequences can be provided to a processing unit and appropriate computer models and software to model the three dimensional configuration of the peptide that binds the target encoded therein.
- Appropriate computer models and software can also provide structures of chemical libraries that correspond to at least a portion of the three dimensional configuration. These chemical libraries can be synthesized in whole or in part by combinatorial chemistry methodologies. These libraries can then be screened for activity, such as pharmaceutical activity, using methods known in the art and described herein. Pharmacology and toxicity of test compounds
- The structure of a test compound can be determined or confirmed by methods known in the art, such as mass spectroscopy. For test compounds stored for extended periods of time under a variety of conditions, the structure, activity and potency thereof can be confirmed.
- Identified test compounds can be evaluated for a particular activity using art-recognized methods and those disclosed herein. For example, if an identified test compound is found to have anticancer cell activity in vitro, then the test compound would have presumptive pharmacological properties as a chemotherapeutic to treat cancer. Such nexuses are known in the art for several disease states, and more are expected to be discovered over time. Based on such nexuses, appropriate confirmatory in vitro and in vivo tests of pharmacological activity and toxicology can be selected and performed. The methods described herein can also be used to assess pharmacological selectivity and specificity, and toxicity.
- Test compounds can be evaluated for toxicological effects using known methods (see, Lu, Basic Toxicology, Fundamentals, Target Organs, and Risk Assessment, Hemisphere Publishing Corp., Washington (1985); U.S. Patent Nos. 5,196,313 to Culbreth (issued March 23, 1993) and 5,567,952 to Benet (issued October 22, 1996)).
- The toxicology of a test compound can be established by determining in vitro toxicity towards a cell line, such as a mammalian, for example human, cell line.
- Test compounds can be treated with, for example, tissue extracts, such as preparations of liver, such as microsomal preparations, to determine increased or decreased toxicological properties of the test compound after being metabolized by a whole organism.
- The toxicological properties of a test compound in an animal model, such as mice, rats, rabbits, dogs or monkeys, can be determined using established methods (see, Lu, supra (1985); and Creasey, Drug Disposition in Humans, The Basis of Clinical Pharmacology, Oxford University Press, Oxford (1979)).
- The skilled artisan would not be burdened to determine appropriate doses, LD50 values, routes of administration and regimes that would be appropriate to determine the toxicological properties of the test compound.
- The efficacy of a test compound can be established using several art-recognized methods, such as in vitro methods, animal models or human clinical trials (see, Creasey, supra (1979)). Recognized in vitro models exist for several diseases or conditions. For example, the ability of a test compound to extend the life-span of HIV-infected cells in vitro is recognized as an acceptable model to identify chemicals expected to be efficacious to treat HIV infection or AIDS (see, Daluge et al., Antimicro. Agents Chemother. 41:1082-1093 (1995)).
- Acceptable animal models can be used to establish efficacy of test compounds to treat various diseases or conditions.
- The rabbit knee is an accepted model for testing agents for efficacy in treating arthritis (see, Shaw and Lacy, J. Bone Joint Surg. (Br.) 55:197-205 (1973)).
- Hydrocortisone, which is approved for use in humans to treat arthritis, is efficacious in this model, which confirms the validity of this model (see, McDonough, Phys. Ther. 62:835-839 (1982)).
- The selectivity of a test compound can be established in vitro by testing the toxicity and effect of a test compound on a plurality of cell lines that exhibit a variety of cellular pathways and sensitivities.
- The data obtained from these in vitro toxicity studies can be extended to animal model studies, including human clinical trials, to determine toxicity, efficacy and selectivity of a test compound.
- Test compounds can often be improved by generating additional test compounds based on the structure/property relationship of a test compound originally identified as having activity.
- Test compounds can be modified to improve various properties, such as affinity, life-time in blood, toxicology, specificity and membrane permeability.
- Such refined test compounds can be subjected to additional assays as they are known in the art or described herein. Methods for generating and analyzing such compounds or compositions are known in the art, such as U.S. Patent No. 5,574,656 to Agrafiotis et al.
Pharmaceutical compositions
- The present invention also encompasses a test compound in a pharmaceutical composition comprising a pharmaceutically acceptable carrier prepared for storage and preferably subsequent administration, which has a pharmaceutically effective amount of the test compound in a pharmaceutically acceptable carrier or diluent.
- Acceptable carriers or diluents for therapeutic use are well known in the pharmaceutical art, and are described, for example, in Remington's Pharmaceutical Sciences, Mack Publishing Co., (A.R. Gennaro edit. (1985)).
- Preservatives, stabilizers, dyes and even flavoring agents can be provided in the pharmaceutical composition.
- Sodium benzoate, sorbic acid and esters of p-hydroxybenzoic acid can be added as preservatives. In addition, antioxidants and suspending agents can be used.
- Test compounds of the present invention can be formulated and used as tablets, capsules or elixirs for oral administration; suppositories for rectal administration; sterile solutions or suspensions for injectable administration; and the like.
- Injectables can be prepared in conventional forms either as liquid solutions or suspensions, solid forms suitable for solution or suspension in liquid prior to injection, or as emulsions. Suitable excipients are, for example, water, saline, dextrose, mannitol, lactose, lecithin, albumin, sodium glutamate, cysteine hydrochloride and the like.
- The injectable pharmaceutical compositions can contain minor amounts of nontoxic auxiliary substances, such as wetting agents, pH buffering agents and the like. If desired, absorption enhancing preparations, such as liposomes, can be used.
- The pharmaceutically effective amount of a test compound required as a dose will depend on the route of administration, the type of animal or patient being treated, and the physical characteristics of the specific animal under consideration.
- The dose can be tailored to achieve a desired effect, but will depend on such factors as weight, diet, concurrent medication and other factors which those skilled in the medical arts will recognize.
- The pharmaceutical compositions can be used alone or in combination with one another, or in combination with other therapeutic or diagnostic agents. These products can be utilized in vivo, preferably in a mammalian patient, preferably in a human, or in vitro.
- The pharmaceutical compositions can be administered to the patient in a variety of ways, including parenterally, intravenously, subcutaneously, intramuscularly, colonically, rectally, nasally or intraperitoneally, employing a variety of dosage forms. Such methods can also be used in testing the activity of test compounds in vivo.
- The useful in vivo dosage to be administered and the particular mode of administration will vary depending upon the age, weight and type of patient being treated, the particular pharmaceutical composition employed, and the specific use for which the pharmaceutical composition is employed.
- The determination of effective dosage levels can be accomplished by one skilled in the art using routine methods as discussed above, and can be guided by agencies such as the USFDA or NIH.
- Human clinical applications of products are commenced at lower dosage levels, with the dosage level being increased until the desired effect is achieved.
- Acceptable in vitro studies can be used to establish useful doses and routes of administration of the test compounds.
- The dosage for the test compounds of the present invention can range broadly depending upon the desired effects, the therapeutic indication, route of administration and purity and activity of the test compound.
- Dosages can be between about 1 ng/kg and about 10 mg/kg, preferably between about 10 ng/kg and about 1 mg/kg, more preferably between about 100 ng/kg and about 100 micrograms/kg, and most preferably between about 1 microgram/kg and about 10 micrograms/kg.
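- The per-kilogram ranges above can be converted to total amounts by simple unit arithmetic. The following Python sketch is illustrative only: the range labels, the function name `total_dose_ng` and the 70 kg body weight are assumptions introduced for the example and do not appear in the specification.

```python
# Unit conversion factors, everything expressed in nanograms.
NG_PER_UG = 1_000
NG_PER_MG = 1_000_000

# Dose ranges quoted in the text, in ng per kg of body weight.
# The dictionary keys are hypothetical labels chosen for this sketch.
DOSE_RANGES_NG_PER_KG = {
    "broad": (1, 10 * NG_PER_MG),
    "preferred": (10, 1 * NG_PER_MG),
    "more preferred": (100, 100 * NG_PER_UG),
    "most preferred": (1 * NG_PER_UG, 10 * NG_PER_UG),
}


def total_dose_ng(range_name, body_weight_kg):
    """Return the (low, high) total dose in ng for a given body weight."""
    low, high = DOSE_RANGES_NG_PER_KG[range_name]
    return low * body_weight_kg, high * body_weight_kg


# Hypothetical 70 kg subject, most preferred range.
low, high = total_dose_ng("most preferred", 70)
print(f"{low / NG_PER_UG:g} to {high / NG_PER_UG:g} micrograms")
```

For the assumed 70 kg body weight, the most preferred range corresponds to a total of 70 to 700 micrograms.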
- The exact formulation, route of administration and dosage can be chosen by the individual physician in view of the patient's condition (see, Fingl et al., in The Pharmacological Basis of Therapeutics (1975)). It should be noted that the attending physician would know how to and when to terminate, interrupt or adjust administration due to toxicity, organ dysfunction or other adverse effects. Conversely, the attending physician would also know to adjust treatment to higher levels if the clinical response were not adequate.
- The magnitude of an administered dose in the management of the disorder of interest will vary with the severity of the condition to be treated and with the route of administration. The severity of the condition may, for example, be evaluated, in part, by standard prognostic evaluation methods. Further, the dose, and perhaps the dose frequency, will also vary according to the age, body weight and response of the individual patient, including those for veterinary applications.
- Compositions can be formulated and administered systemically or locally.
- Techniques for formulation and administration can be found in Remington's Pharmaceutical Sciences, 18th Ed., Mack Publishing Co., Easton, PA (1990). Suitable routes of administration can include oral, nasal, rectal, transdermal, otic, ocular, vaginal, transmucosal or intestinal administration; parenteral delivery, including intramuscular, subcutaneous, intramedullary injections, as well as intrathecal, direct intraventricular, intravenous, intraperitoneal, intranasal, or intraocular injections.
- The pharmaceutical compositions of the present invention can be formulated in aqueous solutions, preferably in physiologically compatible buffers such as Hanks' solution, Ringer's solution or physiological saline buffer.
- Penetrants appropriate to the barrier to be permeated are used in the formulation.
- Such penetrants are generally known in the art.
- Use of pharmaceutically acceptable carriers to formulate the pharmaceutical compositions herein disclosed for the practice of the invention into dosages suitable for systemic administration is within the scope of the invention.
- The compositions of the present invention, in particular those formulated as solutions, can be administered parenterally, such as by intravenous injection.
- Compositions can be formulated readily using pharmaceutically acceptable carriers well known in the art into dosages suitable for oral administration.
- Such carriers enable the test compounds of the invention to be formulated as tablets, pills, capsules, liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a patient to be treated.
- Agents intended to be administered intracellularly may be administered using techniques well known to those of ordinary skill in the art. For example, such agents may be encapsulated into liposomes, then administered as described above. Intracellular delivery of drugs may be achieved by linking peptides such as the translocating domain of the tat protein of HIV to the agent. Linkage of hydrophobic molecules such as biotin to the attached tat peptide or similar translocating peptides may improve intracellular delivery further (Chen et al. Analyt. Biochem. 227: 168-175 (1995)). Substantially all molecules present in an aqueous solution at the time of liposome formation are incorporated into or within the liposomes thus formed. The liposomal contents are both protected from the external micro-environment and, because liposomes fuse with cell membranes, are efficiently delivered into the cell cytoplasm. Additionally, due to their hydrophobicity, small organic molecules can be directly administered intracellularly.
- Compositions suitable for use in the present invention include compositions wherein the active ingredients are contained in an effective amount to achieve their intended purpose. Determination of the effective amount of a pharmaceutical composition is well within the capability of those skilled in the art, especially in light of the detailed disclosure provided herein.
- These pharmaceutical compositions can contain suitable pharmaceutically acceptable carriers comprising excipients and auxiliaries which facilitate processing of the active chemicals into preparations which can be used pharmaceutically.
- The preparations formulated for oral administration may be in the form of tablets, dragees, capsules or solutions.
- Compositions of the present invention can be manufactured in a manner that is itself known, for example by means of conventional mixing, dissolving, granulating, dragee-making, emulsifying, encapsulating, entrapping or lyophilizing processes.
- Pharmaceutical formulations for parenteral administration include aqueous solutions of active chemicals in water-soluble form.
- Suspensions of the active chemicals may be prepared as appropriate oily injection suspensions.
- Suitable lipophilic solvents or vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or triglycerides or liposomes.
- Aqueous injection suspensions may contain substances that increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol or dextran.
- The suspension can also contain suitable stabilizers or agents that increase the solubility of the chemicals to allow for the preparation of highly concentrated solutions.
- Compositions for oral use can be obtained by combining the active chemicals with solid excipient, optionally grinding a resulting mixture, and processing the mixture of granules, after adding suitable auxiliaries, if desired, to obtain tablets or dragee cores.
- Suitable excipients are, in particular, fillers such as sugars, including lactose, sucrose, mannitol or sorbitol; cellulose preparations such as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium carboxymethylcellulose and/or polyvinylpyrrolidone.
- Disintegrating agents can be added, such as cross-linked polyvinyl pyrrolidone, agar, alginic acid or a salt thereof such as sodium alginate.
- Dragee cores can be provided with suitable coatings. Dyes or pigments can be added to the tablets or dragee coatings for identification or to characterize different combinations of active doses.
- Test compounds of the present invention and pharmaceutical compositions that include such test compounds are useful for treating a variety of ailments in a patient, including a human.
- The test compounds of the present invention have antibacterial, antimicrobial, antiviral, anticancer cell, antitumor and cytotoxic activity.
- A patient in need of such treatment can be provided a test compound of the present invention, preferably in a pharmacological composition in an effective amount to reduce the number or growth rate of bacteria, microbes, cancer cells or tumor cells in said patient, or to reduce the infectivity of viruses in said patient.
- The amount, dosage, route of administration, regime and endpoint can all be determined using the procedures described herein or by appropriate government agencies, such as the United States Food and Drug Administration.
- The present invention includes methods for identifying targets such as pharmaceutical targets, purification targets or diagnostic targets.
- The present invention also includes targets and pharmaceutical targets identified by such methods.
- Another aspect of the present invention is a method for identifying a target, such as a pharmaceutical target, that includes: 1) contacting a substance of interest with a complex that comprises a nucleic acid molecule that comprises an open reading frame, a linking moiety, and a polypeptide encoded, at least in part, by the open reading frame, wherein the open reading frame comprises a random sequence or sequence of interest, and wherein the linking moiety is directly or indirectly bound to the nucleic acid molecule and to the polypeptide; 2) identifying complexes bound with said target, or identifying complexes on the basis of catalytic function or the results of cellular assays; and 3) determining the sequence of the polypeptide encoded by the random sequence or sequence of interest.
- The present invention also includes a target identified by this method, including pharmaceutical targets.
- Complexes comprising polypeptides that bind the etiological agent are selected, and the nucleic acids of the complexes are recovered and amplified.
- The individual species of the PCR product can be sequenced and the polypeptide sequence deduced from the nucleic acid sequence. All the polypeptides can be synthesized using the deduced sequences, preferably by solid phase synthesis. Each peptide, alone or in combination, is assayed to determine whether the peptide or peptides have a desired biological effect, such as inhibiting infectivity, on the etiological agent.
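- Deducing a polypeptide sequence from a sequenced nucleic acid can be sketched in a few lines of Python. The example below is a minimal illustration, assuming the standard genetic code, a DNA coding strand read 5' to 3', and translation from the first ATG to the first in-frame stop codon; the function name `translate_orf` and the example sequence are hypothetical and not taken from the specification.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_orf(dna):
    """Translate from the first ATG to the first in-frame stop codon.

    Returns an empty string when no ATG is present. Characters other
    than A, C, G and T inside the reading frame raise a KeyError.
    """
    dna = dna.upper()
    start = dna.find("ATG")
    if start == -1:
        return ""
    peptide = []
    for i in range(start, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        peptide.append(amino_acid)
    return "".join(peptide)


# Hypothetical sequenced PCR product.
print(translate_orf("ATGGCCAAATGA"))  # MAK
```

A deduced sequence of this kind is what would be passed on to solid phase synthesis in the workflow described above.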
- The selected peptides that show a desired biological effect on the etiological agent can be used as probes to screen a phage cDNA library derived from the RNA of the etiological agent.
- The phages that are selected by the probes may contain the genes or parts of the genes that are responsible for the infectivity of the etiological agent, which can be used as potential drug targets.
- The ability of a complex to modulate signal transduction pathways can be determined.
- The ability of a complex to modulate an identified signal transduction pathway identifies such signal transduction pathway as a therapeutic target.
- a variety of cells that comprise reporter genes that report an increased or decreased activity of a signal transduction pathway in response to a compound are known in the art. Such cells can also be made using methods known in the art (see, WO 98/13353 to Whitney, published April 2, 1999; U.S. Patent No. 5,298,429 to Evans et al., issued March 29, 1994; and Skarnes et al., Genes and Development 6:903-918 (1992)).
- Complexes of the present invention can be contacted with such cells and the expression of the reporter gene monitored to identify signal transduction pathways modulated by the complex.
- Such identified signal transduction pathways are themselves pharmaceutical targets, as are the individual components of the identified signal transduction pathway.
- Peptides encoded by random sequences or sequences of interest may also be selected for desirable catalytic functions.
- Assays may be developed in which enhanced or altered function of peptides of the present invention is detectable, for example colorimetric assays or assays that measure the release of radioactive moieties from substrates.
- Intracellular and in vitro assays may be done in appropriate formats, such as in microtiter dishes and using plate readers.
- The complexes selected by such assays or portions thereof can be isolated using various purification methods and amplification methods as they are known in the art.
- Complexes may be recovered from positively screening assay wells using antibodies, or nucleic acids of complexes may be recovered from positively screening assay wells by amplification reactions using specific primers. Detergents, denaturing agents, and partial purification steps such as centrifugation may be used prior to recovery of the complexes or their components.
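- The specification does not state how a "positively screening" well is called from plate reader data. One common convention, assumed here purely for illustration, is to flag wells whose signal exceeds the mean of negative control wells by a set number of standard deviations. The function name `positive_wells`, the well labels, the readings and the three standard deviation cutoff are all hypothetical.

```python
from statistics import mean, stdev


def positive_wells(readings, control_wells, n_sd=3.0):
    """Return sorted labels of test wells above mean + n_sd * SD of controls.

    readings: dict mapping well label to plate reader signal.
    control_wells: labels of negative control wells (at least two).
    """
    controls = [readings[well] for well in control_wells]
    threshold = mean(controls) + n_sd * stdev(controls)
    return sorted(
        well
        for well, signal in readings.items()
        if well not in control_wells and signal > threshold
    )


# Hypothetical absorbance readings from a colorimetric assay.
readings = {
    "A1": 0.10, "A2": 0.12, "A3": 0.11,  # negative controls
    "B1": 0.13, "B2": 0.95, "B3": 0.09,  # test wells
}
print(positive_wells(readings, {"A1", "A2", "A3"}))  # ['B2']
```

Wells flagged in this way would be the ones from which complexes or their nucleic acids are recovered and amplified.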
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2001261233A AU2001261233A1 (en) | 2000-05-05 | 2001-05-04 | Identification of polypeptides and nucleic acid molecules using linkage between dna and polypeptide |
Applications Claiming Priority (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US20206600P | 2000-05-05 | 2000-05-05 | |
US60/202,066 | 2000-05-05 | ||
US22653500P | 2000-08-16 | 2000-08-16 | |
US60/226,535 | 2000-08-16 | ||
USPCT/US00/26511 | 2000-09-27 | ||
PCT/US2000/026511 WO2001025249A1 (fr) | 1999-10-01 | 2000-09-27 | Compositions et procedes permettant d'identifier des polypetides et des molecules d'acide nucleique |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2001083507A1 true WO2001083507A1 (fr) | 2001-11-08 |
Family
ID=27359007
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2001/014671 WO2001083507A1 (fr) | 2000-05-05 | 2001-05-04 | Identification de polypeptides et de molecules d'acide nucleique au moyen d'une liaison entre adn et polypeptide |
Country Status (2)
Country | Link |
---|---|
AU (1) | AU2001261233A1 (fr) |
WO (1) | WO2001083507A1 (fr) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5270163A (en) * | 1990-06-11 | 1993-12-14 | University Research Corporation | Methods for identifying nucleic acid ligands |
WO1998031700A1 (fr) * | 1997-01-21 | 1998-07-23 | The General Hospital Corporation | Selection de proteines a l'aide de fusions arn-proteine |
-
2001
- 2001-05-04 WO PCT/US2001/014671 patent/WO2001083507A1/fr active Search and Examination
- 2001-05-04 AU AU2001261233A patent/AU2001261233A1/en not_active Withdrawn
Non-Patent Citations (2)
Title |
---|
ROBERTS ET AL.: "RNA-peptide fusions for the in vitro selection of peptides and proteins", PROC. NATL. ACAD. SCI. USA, vol. 94, no. 23, November 1997 (1997-11-01), pages 12297 - 12302, XP002944663 * |
TOMONAGA ET AL.: "Activating transcription from single stranded DNA", PROC. NATL. ACAD. SCI. USA, vol. 93, June 1996 (1996-06-01), pages 5830 - 5835, XP002944664 * |
Also Published As
Publication number | Publication date |
---|---|
AU2001261233A1 (en) | 2001-11-12 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |