EP1051519A2 - Method for identifying polynucleotide and polypeptide sequences associated with physiological and medical conditions
Info
- Publication number
- EP1051519A2 (EP99904442A)
- Authority
- EP
- European Patent Office
- Prior art keywords
- human
- sequence
- sequences
- protein
- polypeptide
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
- 238000000034 method Methods 0.000 title claims abstract description 243
- 108091033319 polynucleotide Proteins 0.000 title claims abstract description 168
- 102000040430 polynucleotide Human genes 0.000 title claims abstract description 168
- 239000002157 polynucleotide Substances 0.000 title claims abstract description 168
- 108090000765 processed proteins & peptides Proteins 0.000 title claims abstract description 151
- 229920001184 polypeptide Polymers 0.000 title claims abstract description 147
- 102000004196 processed proteins & peptides Human genes 0.000 title claims abstract description 147
- 241000282414 Homo sapiens Species 0.000 claims abstract description 477
- 241000282577 Pan troglodytes Species 0.000 claims abstract description 110
- 208000030507 AIDS Diseases 0.000 claims abstract description 76
- 238000011161 development Methods 0.000 claims abstract description 62
- 230000004962 physiological condition Effects 0.000 claims abstract description 48
- 201000010099 disease Diseases 0.000 claims abstract description 45
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims abstract description 45
- 230000001225 therapeutic effect Effects 0.000 claims abstract description 17
- 108090000623 proteins and genes Proteins 0.000 claims description 223
- 230000008859 change Effects 0.000 claims description 111
- 239000003795 chemical substances by application Substances 0.000 claims description 111
- 239000002773 nucleotide Substances 0.000 claims description 99
- 125000003729 nucleotide group Chemical group 0.000 claims description 99
- 102000004169 proteins and genes Human genes 0.000 claims description 88
- 230000006870 function Effects 0.000 claims description 81
- 241000282412 Homo Species 0.000 claims description 66
- 238000006467 substitution reaction Methods 0.000 claims description 46
- 210000004556 brain Anatomy 0.000 claims description 45
- 230000000694 effects Effects 0.000 claims description 41
- 241000282575 Gorilla Species 0.000 claims description 29
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 26
- 241000282405 Pongo abelii Species 0.000 claims description 26
- 208000035473 Communicable disease Diseases 0.000 claims description 25
- 208000015181 infectious disease Diseases 0.000 claims description 25
- 102000003839 Human Proteins Human genes 0.000 claims description 19
- 108090000144 Human Proteins Proteins 0.000 claims description 19
- 230000003925 brain function Effects 0.000 claims description 18
- 108091026890 Coding region Proteins 0.000 claims description 17
- 206010028980 Neoplasm Diseases 0.000 claims description 17
- 108020004635 Complementary DNA Proteins 0.000 claims description 11
- 241000282576 Pan paniscus Species 0.000 claims description 11
- 201000011510 cancer Diseases 0.000 claims description 10
- 230000003612 virological effect Effects 0.000 claims description 7
- 239000000203 mixture Substances 0.000 claims description 5
- 230000003920 cognitive function Effects 0.000 claims description 4
- 238000007619 statistical method Methods 0.000 abstract description 3
- 238000007423 screening assay Methods 0.000 abstract 1
- 210000004027 cell Anatomy 0.000 description 99
- 230000018109 developmental process Effects 0.000 description 59
- 239000002299 complementary DNA Substances 0.000 description 52
- 102100037871 Intercellular adhesion molecule 3 Human genes 0.000 description 44
- 108010064600 Intercellular Adhesion Molecule-3 Proteins 0.000 description 42
- 241000282579 Pan Species 0.000 description 40
- 108010064593 Intercellular Adhesion Molecule-1 Proteins 0.000 description 39
- 102100037872 Intercellular adhesion molecule 2 Human genes 0.000 description 37
- 101710148794 Intercellular adhesion molecule 2 Proteins 0.000 description 36
- 102000015271 Intercellular Adhesion Molecule-1 Human genes 0.000 description 35
- 230000027455 binding Effects 0.000 description 35
- 241000725303 Human immunodeficiency virus Species 0.000 description 34
- 241000288906 Primates Species 0.000 description 33
- 238000004458 analytical method Methods 0.000 description 33
- 230000003044 adaptive effect Effects 0.000 description 32
- 241000894007 species Species 0.000 description 29
- 150000001413 amino acids Chemical class 0.000 description 28
- 230000014509 gene expression Effects 0.000 description 28
- 108020004999 messenger RNA Proteins 0.000 description 25
- 101100452015 Pan troglodytes ICAM1 gene Proteins 0.000 description 24
- 238000012163 sequencing technique Methods 0.000 description 22
- 239000003446 ligand Substances 0.000 description 18
- 238000012216 screening Methods 0.000 description 17
- 210000001519 tissue Anatomy 0.000 description 17
- 238000003556 assay Methods 0.000 description 15
- 239000000047 product Substances 0.000 description 15
- 238000013519 translation Methods 0.000 description 15
- 238000011282 treatment Methods 0.000 description 15
- 230000002068 genetic effect Effects 0.000 description 14
- 230000015572 biosynthetic process Effects 0.000 description 13
- 150000001875 compounds Chemical class 0.000 description 13
- 230000003993 interaction Effects 0.000 description 13
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 12
- 239000003814 drug Substances 0.000 description 12
- 238000000338 in vitro Methods 0.000 description 12
- 208000010648 susceptibility to HIV infection Diseases 0.000 description 12
- 238000012360 testing method Methods 0.000 description 12
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 11
- 241001465754 Metazoa Species 0.000 description 11
- 238000013518 transcription Methods 0.000 description 11
- 230000035897 transcription Effects 0.000 description 11
- 108020004414 DNA Proteins 0.000 description 10
- 101000599852 Homo sapiens Intercellular adhesion molecule 1 Proteins 0.000 description 10
- 230000010076 replication Effects 0.000 description 10
- 230000008901 benefit Effects 0.000 description 9
- 239000000523 sample Substances 0.000 description 9
- 208000002874 Acne Vulgaris Diseases 0.000 description 8
- 108700008625 Reporter Genes Proteins 0.000 description 8
- 241000700605 Viruses Species 0.000 description 8
- 206010000496 acne Diseases 0.000 description 8
- 238000013459 approach Methods 0.000 description 8
- 229940079593 drug Drugs 0.000 description 8
- 208000031886 HIV Infections Diseases 0.000 description 7
- 208000037357 HIV infectious disease Diseases 0.000 description 7
- 238000000692 Student's t-test Methods 0.000 description 7
- 238000006243 chemical reaction Methods 0.000 description 7
- 102000043559 human ICAM1 Human genes 0.000 description 7
- 208000033519 human immunodeficiency virus infectious disease Diseases 0.000 description 7
- 210000000987 immune system Anatomy 0.000 description 7
- 210000003205 muscle Anatomy 0.000 description 7
- 238000002864 sequence alignment Methods 0.000 description 7
- 241000699666 Mus <mouse, genus> Species 0.000 description 6
- 101100452023 Pan troglodytes ICAM3 gene Proteins 0.000 description 6
- 108091023040 Transcription factor Proteins 0.000 description 6
- 102000040945 Transcription factor Human genes 0.000 description 6
- 210000005013 brain tissue Anatomy 0.000 description 6
- 238000002474 experimental method Methods 0.000 description 6
- 230000003779 hair growth Effects 0.000 description 6
- 210000005260 human cell Anatomy 0.000 description 6
- 210000000265 leukocyte Anatomy 0.000 description 6
- 230000007246 mechanism Effects 0.000 description 6
- 230000035772 mutation Effects 0.000 description 6
- 230000007935 neutral effect Effects 0.000 description 6
- 238000002360 preparation method Methods 0.000 description 6
- 238000011160 research Methods 0.000 description 6
- 238000012353 t test Methods 0.000 description 6
- 239000013598 vector Substances 0.000 description 6
- 101000599862 Homo sapiens Intercellular adhesion molecule 3 Proteins 0.000 description 5
- 108091092195 Intron Proteins 0.000 description 5
- 108090000412 Protein-Tyrosine Kinases Proteins 0.000 description 5
- 238000012512 characterization method Methods 0.000 description 5
- 230000000052 comparative effect Effects 0.000 description 5
- 238000009396 hybridization Methods 0.000 description 5
- 230000009456 molecular mechanism Effects 0.000 description 5
- 230000037361 pathway Effects 0.000 description 5
- 210000002845 virion Anatomy 0.000 description 5
- 201000004384 Alopecia Diseases 0.000 description 4
- 102000019034 Chemokines Human genes 0.000 description 4
- 108010012236 Chemokines Proteins 0.000 description 4
- 108700018351 Major Histocompatibility Complex Proteins 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- 108700026244 Open Reading Frames Proteins 0.000 description 4
- 210000001744 T-lymphocyte Anatomy 0.000 description 4
- 108010067390 Viral Proteins Proteins 0.000 description 4
- 208000036142 Viral infection Diseases 0.000 description 4
- 230000003321 amplification Effects 0.000 description 4
- 230000008827 biological function Effects 0.000 description 4
- 230000005540 biological transmission Effects 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 230000003930 cognitive ability Effects 0.000 description 4
- 238000004590 computer program Methods 0.000 description 4
- 230000034994 death Effects 0.000 description 4
- 229940011871 estrogen Drugs 0.000 description 4
- 239000000262 estrogen Substances 0.000 description 4
- 238000007429 general method Methods 0.000 description 4
- 238000003199 nucleic acid amplification method Methods 0.000 description 4
- 238000004806 packaging method and process Methods 0.000 description 4
- 230000036961 partial effect Effects 0.000 description 4
- 238000013081 phylogenetic analysis Methods 0.000 description 4
- 238000002818 protein evolution Methods 0.000 description 4
- 108091006024 signal transducing proteins Proteins 0.000 description 4
- 102000034285 signal transducing proteins Human genes 0.000 description 4
- 238000010561 standard procedure Methods 0.000 description 4
- 230000020382 suppression by virus of host antigen processing and presentation of peptide antigen via MHC class I Effects 0.000 description 4
- 238000001890 transfection Methods 0.000 description 4
- 230000009385 viral infection Effects 0.000 description 4
- 108010070743 3(or 17)-beta-hydroxysteroid dehydrogenase Proteins 0.000 description 3
- 241000282693 Cercopithecidae Species 0.000 description 3
- 102000004127 Cytokines Human genes 0.000 description 3
- 108090000695 Cytokines Proteins 0.000 description 3
- 102100034067 Dehydrogenase/reductase SDR family member 11 Human genes 0.000 description 3
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 3
- 208000005176 Hepatitis C Diseases 0.000 description 3
- 102000000588 Interleukin-2 Human genes 0.000 description 3
- 108010002350 Interleukin-2 Proteins 0.000 description 3
- 241000699670 Mus sp. Species 0.000 description 3
- 241000244206 Nematoda Species 0.000 description 3
- 101100452020 Pan troglodytes ICAM2 gene Proteins 0.000 description 3
- 241000288935 Platyrrhini Species 0.000 description 3
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 3
- 102000004022 Protein-Tyrosine Kinases Human genes 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 239000000427 antigen Substances 0.000 description 3
- 108091007433 antigens Proteins 0.000 description 3
- 102000036639 antigens Human genes 0.000 description 3
- 230000006907 apoptotic process Effects 0.000 description 3
- 210000004369 blood Anatomy 0.000 description 3
- 239000008280 blood Substances 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 238000009510 drug design Methods 0.000 description 3
- 238000004520 electroporation Methods 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 239000013604 expression vector Substances 0.000 description 3
- 230000036541 health Effects 0.000 description 3
- 230000028993 immune response Effects 0.000 description 3
- 238000000099 in vitro assay Methods 0.000 description 3
- 238000001727 in vivo Methods 0.000 description 3
- 238000005462 in vivo assay Methods 0.000 description 3
- 238000012750 in vivo screening Methods 0.000 description 3
- 230000010365 information processing Effects 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 230000008449 language Effects 0.000 description 3
- 230000000670 limiting effect Effects 0.000 description 3
- 238000012423 maintenance Methods 0.000 description 3
- 210000004962 mammalian cell Anatomy 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 230000003278 mimic effect Effects 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 210000000056 organ Anatomy 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- 150000003384 small molecules Chemical class 0.000 description 3
- 238000003860 storage Methods 0.000 description 3
- 238000012451 transgenic animal system Methods 0.000 description 3
- 230000009261 transgenic effect Effects 0.000 description 3
- 230000036642 wellbeing Effects 0.000 description 3
- 102000001556 1-Phosphatidylinositol 4-Kinase Human genes 0.000 description 2
- 108010029190 1-Phosphatidylinositol 4-Kinase Proteins 0.000 description 2
- VOXZDWNPVJITMN-ZBRFXRBCSA-N 17β-estradiol Chemical compound OC1=CC=C2[C@H]3CC[C@](C)([C@H](CC4)O)[C@@H]4[C@@H]3CCC2=C1 VOXZDWNPVJITMN-ZBRFXRBCSA-N 0.000 description 2
- 108020005065 3' Flanking Region Proteins 0.000 description 2
- 108020005029 5' Flanking Region Proteins 0.000 description 2
- 241000837181 Andina Species 0.000 description 2
- 102100037904 CD9 antigen Human genes 0.000 description 2
- 101100289995 Caenorhabditis elegans mac-1 gene Proteins 0.000 description 2
- 101710205625 Capsid protein p24 Proteins 0.000 description 2
- 108050000299 Chemokine receptor Proteins 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- 230000004568 DNA-binding Effects 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 102000003816 Interleukin-13 Human genes 0.000 description 2
- 108090000176 Interleukin-13 Proteins 0.000 description 2
- 102000004388 Interleukin-4 Human genes 0.000 description 2
- 108090000978 Interleukin-4 Proteins 0.000 description 2
- 102000004889 Interleukin-6 Human genes 0.000 description 2
- 108090001005 Interleukin-6 Proteins 0.000 description 2
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 2
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 2
- 102000007651 Macrophage Colony-Stimulating Factor Human genes 0.000 description 2
- 108010046938 Macrophage Colony-Stimulating Factor Proteins 0.000 description 2
- 108010038807 Oligopeptides Proteins 0.000 description 2
- 102000015636 Oligopeptides Human genes 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- 238000010222 PCR analysis Methods 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- 102000003993 Phosphatidylinositol 3-kinases Human genes 0.000 description 2
- 108090000430 Phosphatidylinositol 3-kinases Proteins 0.000 description 2
- 101710096328 Phospholipase A2 Proteins 0.000 description 2
- 101710177166 Phosphoprotein Proteins 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 2
- 241000283984 Rodentia Species 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 101710149279 Small delta antigen Proteins 0.000 description 2
- 241000255588 Tephritidae Species 0.000 description 2
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- 231100000360 alopecia Toxicity 0.000 description 2
- 230000004075 alteration Effects 0.000 description 2
- 238000010171 animal model Methods 0.000 description 2
- 238000002306 biochemical method Methods 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 229920002678 cellulose Polymers 0.000 description 2
- 239000001913 cellulose Substances 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 239000013599 cloning vector Substances 0.000 description 2
- 230000001149 cognitive effect Effects 0.000 description 2
- 230000002153 concerted effect Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000003745 diagnosis Methods 0.000 description 2
- 230000006397 emotional response Effects 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 230000001667 episodic effect Effects 0.000 description 2
- 230000001747 exhibiting effect Effects 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 238000002825 functional assay Methods 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 2
- 230000003676 hair loss Effects 0.000 description 2
- 238000001114 immunoprecipitation Methods 0.000 description 2
- 238000011065 in-situ storage Methods 0.000 description 2
- 230000001965 increasing effect Effects 0.000 description 2
- 230000004054 inflammatory process Effects 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 150000002484 inorganic compounds Chemical class 0.000 description 2
- 229910010272 inorganic material Inorganic materials 0.000 description 2
- 230000033001 locomotion Effects 0.000 description 2
- 230000027928 long-term synaptic potentiation Effects 0.000 description 2
- 229920002521 macromolecule Polymers 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000000877 morphologic effect Effects 0.000 description 2
- 150000002894 organic compounds Chemical class 0.000 description 2
- 230000036407 pain Effects 0.000 description 2
- 238000000053 physical method Methods 0.000 description 2
- 230000002265 prevention Effects 0.000 description 2
- 230000002250 progressing effect Effects 0.000 description 2
- 230000000644 propagated effect Effects 0.000 description 2
- 102000005962 receptors Human genes 0.000 description 2
- 108020003175 receptors Proteins 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 239000000790 retinal pigment Substances 0.000 description 2
- 238000010839 reverse transcription Methods 0.000 description 2
- 230000035807 sensation Effects 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 230000008786 sensory perception of smell Effects 0.000 description 2
- 230000019491 signal transduction Effects 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 239000013589 supplement Substances 0.000 description 2
- 229940124597 therapeutic agent Drugs 0.000 description 2
- 238000002560 therapeutic procedure Methods 0.000 description 2
- 229960005486 vaccine Drugs 0.000 description 2
- 230000029812 viral genome replication Effects 0.000 description 2
- 101150084750 1 gene Proteins 0.000 description 1
- GOZMBJCYMQQACI-UHFFFAOYSA-N 6,7-dimethyl-3-[[methyl-[2-[methyl-[[1-[3-(trifluoromethyl)phenyl]indol-3-yl]methyl]amino]ethyl]amino]methyl]chromen-4-one;dihydrochloride Chemical compound Cl.Cl.C=1OC2=CC(C)=C(C)C=C2C(=O)C=1CN(C)CCN(C)CC(C1=CC=CC=C11)=CN1C1=CC=CC(C(F)(F)F)=C1 GOZMBJCYMQQACI-UHFFFAOYSA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 108700028369 Alleles Proteins 0.000 description 1
- 241000270728 Alligator Species 0.000 description 1
- 208000024827 Alzheimer disease Diseases 0.000 description 1
- 229940088872 Apoptosis inhibitor Drugs 0.000 description 1
- 102100021569 Apoptosis regulator Bcl-2 Human genes 0.000 description 1
- 201000001320 Atherosclerosis Diseases 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- 102100026189 Beta-galactosidase Human genes 0.000 description 1
- 101710125089 Bindin Proteins 0.000 description 1
- 206010006187 Breast cancer Diseases 0.000 description 1
- 208000026310 Breast neoplasm Diseases 0.000 description 1
- 102100035875 C-C chemokine receptor type 5 Human genes 0.000 description 1
- 101710149870 C-C chemokine receptor type 5 Proteins 0.000 description 1
- 101710155856 C-C motif chemokine 3 Proteins 0.000 description 1
- 102100031092 C-C motif chemokine 3 Human genes 0.000 description 1
- 102100031650 C-X-C chemokine receptor type 4 Human genes 0.000 description 1
- 102000002110 C2 domains Human genes 0.000 description 1
- 108050009459 C2 domains Proteins 0.000 description 1
- 102100032912 CD44 antigen Human genes 0.000 description 1
- 108010009575 CD55 Antigens Proteins 0.000 description 1
- 102100021868 Calnexin Human genes 0.000 description 1
- 108010056891 Calnexin Proteins 0.000 description 1
- 208000005623 Carcinogenesis Diseases 0.000 description 1
- 102000000844 Cell Surface Receptors Human genes 0.000 description 1
- 108010001857 Cell Surface Receptors Proteins 0.000 description 1
- 108010055166 Chemokine CCL5 Proteins 0.000 description 1
- 102000009410 Chemokine receptor Human genes 0.000 description 1
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 description 1
- 102100025680 Complement decay-accelerating factor Human genes 0.000 description 1
- 102000000634 Cytochrome c oxidase subunit IV Human genes 0.000 description 1
- 108090000365 Cytochrome-c oxidases Proteins 0.000 description 1
- 241000252212 Danio rerio Species 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 108700002304 Drosophila can Proteins 0.000 description 1
- 238000012286 ELISA Assay Methods 0.000 description 1
- 241000257465 Echinoidea Species 0.000 description 1
- 244000148064 Enicostema verticillatum Species 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 108700039887 Essential Genes Proteins 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 102100031547 HLA class II histocompatibility antigen, DO alpha chain Human genes 0.000 description 1
- 108010054147 Hemoglobins Proteins 0.000 description 1
- 102000001554 Hemoglobins Human genes 0.000 description 1
- 102000009331 Homeodomain Proteins Human genes 0.000 description 1
- 108010048671 Homeodomain Proteins Proteins 0.000 description 1
- 101000971171 Homo sapiens Apoptosis regulator Bcl-2 Proteins 0.000 description 1
- 101000777387 Homo sapiens C-C motif chemokine 3 Proteins 0.000 description 1
- 101000922348 Homo sapiens C-X-C chemokine receptor type 4 Proteins 0.000 description 1
- 101000868273 Homo sapiens CD44 antigen Proteins 0.000 description 1
- 101000856022 Homo sapiens Complement decay-accelerating factor Proteins 0.000 description 1
- 101000866278 Homo sapiens HLA class II histocompatibility antigen, DO alpha chain Proteins 0.000 description 1
- 101000599858 Homo sapiens Intercellular adhesion molecule 2 Proteins 0.000 description 1
- 101000608935 Homo sapiens Leukosialin Proteins 0.000 description 1
- 101000835093 Homo sapiens Transferrin receptor protein 1 Proteins 0.000 description 1
- 241000714260 Human T-lymphotropic virus 1 Species 0.000 description 1
- 241000282620 Hylobates sp. Species 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 102000000521 Immunophilins Human genes 0.000 description 1
- 108010016648 Immunophilins Proteins 0.000 description 1
- 206010061218 Inflammation Diseases 0.000 description 1
- 102000006992 Interferon-alpha Human genes 0.000 description 1
- 108010047761 Interferon-alpha Proteins 0.000 description 1
- 102000008070 Interferon-gamma Human genes 0.000 description 1
- 108010074328 Interferon-gamma Proteins 0.000 description 1
- 102000000589 Interleukin-1 Human genes 0.000 description 1
- 108010002352 Interleukin-1 Proteins 0.000 description 1
- 102000003814 Interleukin-10 Human genes 0.000 description 1
- 108090000174 Interleukin-10 Proteins 0.000 description 1
- 102000015696 Interleukins Human genes 0.000 description 1
- 108010063738 Interleukins Proteins 0.000 description 1
- 206010024229 Leprosy Diseases 0.000 description 1
- 238000003657 Likelihood-ratio test Methods 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- 108700005089 MHC Class I Genes Proteins 0.000 description 1
- 102000043129 MHC class I family Human genes 0.000 description 1
- 108091054437 MHC class I family Proteins 0.000 description 1
- 231100000002 MTT assay Toxicity 0.000 description 1
- 238000000134 MTT assay Methods 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 108020005196 Mitochondrial DNA Proteins 0.000 description 1
- 101710151805 Mitochondrial intermediate peptidase 1 Proteins 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 241001529936 Murinae Species 0.000 description 1
- 101100452019 Mus musculus Icam2 gene Proteins 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- 230000004988 N-glycosylation Effects 0.000 description 1
- 108700020796 Oncogene Proteins 0.000 description 1
- 208000001388 Opportunistic Infections Diseases 0.000 description 1
- 108091007960 PI3Ks Proteins 0.000 description 1
- 241001504519 Papio ursinus Species 0.000 description 1
- 102000003992 Peroxidases Human genes 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 238000012181 QIAquick gel extraction kit Methods 0.000 description 1
- 244000088415 Raphanus sativus Species 0.000 description 1
- 235000006140 Raphanus sativus var sativus Nutrition 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 241000555745 Sciuridae Species 0.000 description 1
- 102100040293 Serine/threonine-protein kinase LMTK1 Human genes 0.000 description 1
- 101710118516 Serine/threonine-protein kinase LMTK1 Proteins 0.000 description 1
- 241000270295 Serpentes Species 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 102000018679 Tacrolimus Binding Proteins Human genes 0.000 description 1
- 108010027179 Tacrolimus Binding Proteins Proteins 0.000 description 1
- 102100026144 Transferrin receptor protein 1 Human genes 0.000 description 1
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 1
- 108060008683 Tumor Necrosis Factor Receptor Proteins 0.000 description 1
- 102000000852 Tumor Necrosis Factor-alpha Human genes 0.000 description 1
- 101150044453 Y gene Proteins 0.000 description 1
- 230000004308 accommodation Effects 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 239000008186 active pharmaceutical agent Substances 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 210000000577 adipose tissue Anatomy 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 230000036436 anti-hiv Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 239000000158 apoptosis inhibitor Substances 0.000 description 1
- MXWJVTOOROXGIU-UHFFFAOYSA-N atrazine Chemical compound CCNC1=NC(Cl)=NC(NC(C)C)=N1 MXWJVTOOROXGIU-UHFFFAOYSA-N 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 230000037429 base substitution Effects 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 230000003542 behavioural effect Effects 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 210000000481 breast Anatomy 0.000 description 1
- 244000309464 bull Species 0.000 description 1
- 238000010805 cDNA synthesis kit Methods 0.000 description 1
- 230000036952 cancer formation Effects 0.000 description 1
- 231100000504 carcinogenesis Toxicity 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 230000021164 cell adhesion Effects 0.000 description 1
- 230000022131 cell cycle Effects 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000037410 cognitive enhancement Effects 0.000 description 1
- 238000010835 comparative analysis Methods 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 239000013068 control sample Substances 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 239000003431 cross linking reagent Substances 0.000 description 1
- 230000005574 cross-species transmission Effects 0.000 description 1
- 238000011461 current therapy Methods 0.000 description 1
- 231100000135 cytotoxicity Toxicity 0.000 description 1
- 230000003013 cytotoxicity Effects 0.000 description 1
- 238000002784 cytotoxicity assay Methods 0.000 description 1
- 231100000263 cytotoxicity test Toxicity 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000007123 defense Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000006471 dimerization reaction Methods 0.000 description 1
- 238000007878 drug screening assay Methods 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000007824 enzymatic assay Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 229940088598 enzyme Drugs 0.000 description 1
- 229960005309 estradiol Drugs 0.000 description 1
- 108010085279 eukaryotic translation initiation factor 5A Proteins 0.000 description 1
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 1
- 238000000198 fluorescence anisotropy Methods 0.000 description 1
- 238000000799 fluorescence microscopy Methods 0.000 description 1
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 230000007849 functional defect Effects 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 230000004545 gene duplication Effects 0.000 description 1
- 238000011331 genomic analysis Methods 0.000 description 1
- 108060003196 globin Proteins 0.000 description 1
- 102000018146 globin Human genes 0.000 description 1
- 125000000404 glutamine group Chemical group N[C@@H](CCC(N)=O)C(=O)* 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 239000001056 green pigment Substances 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 238000010562 histological examination Methods 0.000 description 1
- 238000000265 homogenisation Methods 0.000 description 1
- 102000056475 human ICAM2 Human genes 0.000 description 1
- 102000058166 human ICAM3 Human genes 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 239000012678 infectious agent Substances 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 102000006495 integrins Human genes 0.000 description 1
- 108010044426 integrins Proteins 0.000 description 1
- 229960003130 interferon gamma Drugs 0.000 description 1
- 229940028885 interleukin-4 Drugs 0.000 description 1
- 229940100601 interleukin-6 Drugs 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 230000002147 killing effect Effects 0.000 description 1
- 108020001756 ligand binding domains Proteins 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 210000001165 lymph node Anatomy 0.000 description 1
- 201000004792 malaria Diseases 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 229940127554 medical product Drugs 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 238000007431 microscopic evaluation Methods 0.000 description 1
- 238000000302 molecular modelling Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 210000005088 multinucleated cell Anatomy 0.000 description 1
- 231100000219 mutagenic Toxicity 0.000 description 1
- 230000003505 mutagenic effect Effects 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 210000003061 neural cell Anatomy 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 230000009871 nonspecific binding Effects 0.000 description 1
- 150000007523 nucleic acids Chemical group 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 238000010397 one-hybrid screening Methods 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- 239000008177 pharmaceutical agent Substances 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 102000054765 polymorphisms of proteins Human genes 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 244000062645 predators Species 0.000 description 1
- 230000037452 priming Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 125000001500 prolyl group Chemical group [H]N1C([H])(C(=O)[*])C([H])([H])C([H])([H])C1([H])[H] 0.000 description 1
- 230000020978 protein processing Effects 0.000 description 1
- 230000004844 protein turnover Effects 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 239000001054 red pigment Substances 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 238000003571 reporter gene assay Methods 0.000 description 1
- 230000001850 reproductive effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 238000012289 standard assay Methods 0.000 description 1
- 238000000528 statistical test Methods 0.000 description 1
- 238000001356 surgical procedure Methods 0.000 description 1
- 208000011580 syndromic disease Diseases 0.000 description 1
- 230000009897 systematic effect Effects 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 239000003104 tissue culture media Substances 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000011830 transgenic mouse model Methods 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- 102000003298 tumor necrosis factor receptor Human genes 0.000 description 1
- 230000007306 turnover Effects 0.000 description 1
- 238000010396 two-hybrid screening Methods 0.000 description 1
- 230000035899 viability Effects 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1072—Differential gene expression library synthesis, e.g. subtracted libraries, differential screening
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1079—Screening libraries by altering the phenotype or phenotypic trait of the host
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A90/00—Technologies having an indirect contribution to adaptation to climate change
- Y02A90/10—Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation
Definitions
- TECHNICAL FIELD This invention relates to using molecular and evolutionary techniques to identify polynucleotide and polypeptide sequences corresponding to evolved traits that may be relevant to human diseases or conditions, such as unique or enhanced human brain functions, longer human life spans, susceptibility or resistance to development of infectious disease (such as AIDS and hepatitis C), susceptibility or resistance to development of cancer, and aesthetic traits, such as hair growth, susceptibility or resistance to acne, or enhanced muscle mass.
- the homeodomain with DNA-binding activity, first discovered in the fruit fly Drosophila, was used to identify human homologues that possess similar activities.
- Although comparison of homologous genes or proteins between human and a lower model organism may provide useful information with respect to evolutionarily conserved molecular sequences and functional features, this approach is of limited use in identifying genes whose sequences have changed due to natural selection.
- KA/KS involves pairwise comparisons between aligned protein-coding nucleotide sequences of the ratios of
- The term "KA/KS-type methods" includes this and similar methods. These methods have been used to demonstrate the occurrence of Darwinian molecular-level positive selection, resulting in amino acid differences in homologous proteins. Several groups have used such methods to document that a particular protein has evolved more rapidly than the neutral substitution rate, thus supporting the existence of Darwinian molecular-level positive selection. For example, McDonald and Kreitman (1991) Nature 351:652-654 propose a statistical test of the neutral protein evolution hypothesis based on comparison of the number of amino acid replacement substitutions to synonymous substitutions in the coding region of a locus.
- Edwards et al. (1995) use degenerate primers to pull out MHC loci from various species of birds and an alligator species, which are then analyzed by the Nei and Gojobori methods (dN:dS ratios) to extend MHC studies to nonmammalian vertebrates.
- Whitfield et al. (1993) Nature 364:713-715 use KA/KS analysis to look for directional selection in the regions flanking a conserved region in the SRY gene (which determines male sex). They suggest that the rapid evolution of SRY could be a significant cause of reproductive isolation, leading to new species.
- Wettstein et al. (1996) Mol. Biol. Evol. 13(1):56-66 apply the MEGA program of Kumar, Tamura and Nei and phylogenetic analysis to investigate the diversification of MHC class I genes in squirrels and related rodents.
- Parham and Ohta (1996) Science 272:67-74 state that a population biology approach, including tests for selection as well as for gene conversion and neutral drift, is required to analyze the generation and maintenance of human MHC class I polymorphism. Hughes (1997) Mol. Biol. Evol.
- KA/KS-type methods: analytical methods of molecular evolution to identify rapidly evolving genes
- KA/KS-type methods can be applied to achieve many different purposes, most commonly to confirm the existence of Darwinian molecular-level positive selection, but also to assess the frequency of Darwinian molecular-level positive selection, to understand phylogenetic relationships, to elucidate mechanisms by which new species are formed, or to establish single or multiple origins for specific gene polymorphisms.
- KA/KS-type methods to identify evolutionary solutions, specific evolved changes, that could be mimicked or used in the development of treatments to prevent or cure human conditions or diseases or to modulate unique or enhanced human functions. They have not used KA/KS-type analysis as a systematic tool for identifying human or non-human primate genes that contain evolutionarily significant sequence changes and exploiting such genes and the identified changes in the development of treatments for human conditions or diseases.
- the identification of human genes that have evolved to confer unique or enhanced human functions compared to homologous chimpanzee genes could be applied to developing agents to modulate these unique human functions or to restore function when the gene is defective.
- the identification of the underlying chimpanzee (or other non-human primate) genes and the specific nucleotide changes that have evolved, and the further characterization of the physical and biochemical changes in the proteins encoded by these evolved genes, could provide valuable information, for example, on what determines susceptibility and resistance to infectious diseases, such as AIDS, what determines susceptibility or resistance to the development of certain cancers, what determines susceptibility or resistance to acne, how hair growth can be controlled, and how to control the formation of muscle versus fat.
- the present invention provides methods for identifying polynucleotide and polypeptide sequences having evolutionarily significant changes, which are associated with physiological conditions, including medical conditions.
- the invention applies comparative primate genomics to identify specific gene changes which may be associated with, and thus responsible for, physiological conditions, such as medically or commercially relevant evolved traits, and uses the information obtained from these evolved genes to develop human treatments.
- the non-human primate sequences employed in the methods described herein may be from any non-human primate, preferably a member of the hominoid group, more preferably a chimpanzee, bonobo, gorilla and/or orangutan, and most preferably a chimpanzee.
- a non-human primate polynucleotide or polypeptide has undergone natural selection that resulted in a positive evolutionarily significant change (i.e., the non-human primate polynucleotide or polypeptide has a positive attribute not present in humans).
- the positively selected polynucleotide or polypeptide may be associated with susceptibility or resistance to certain diseases or with other commercially relevant traits. Examples of this embodiment include, but are not limited to, polynucleotides and polypeptides that are positively selected in non-human primates, preferably chimpanzees, that may be associated with susceptibility or resistance to infectious diseases and cancer.
- Examples of commercially relevant traits include aesthetic traits such as hair growth, muscle mass, and susceptibility or resistance to acne.
- An example of this embodiment includes polynucleotides and polypeptides associated with the susceptibility or resistance to HIV dissemination, propagation and/or development of AIDS.
- the present invention can thus be useful in gaining insight into the molecular mechanisms that underlie resistance to HIV dissemination, propagation and/or development of AIDS, providing information that can also be useful in discovering and/or designing agents such as drugs that prevent and/or delay development of AIDS.
- Specific genes that have been positively selected in chimpanzees and that may relate to AIDS or other infectious diseases are ICAM-1, ICAM-2, ICAM-3 and MIP-1-α.
- 17-β-hydroxysteroid dehydrogenase Type IV is a specific gene that has been positively selected in chimpanzees and that may relate to cancer.
- a human polynucleotide or polypeptide has undergone natural selection that resulted in a positive evolutionarily significant change (i.e., the human polynucleotide or polypeptide has a positive attribute not present in non-human primates).
- the polynucleotide or polypeptide may be associated with unique or enhanced functional capabilities of the human brain compared to non-human primates. Another example is the longer life span of humans compared to non-human primates.
- the present invention can thus be useful in gaining insight into the molecular mechanisms that underlie unique or enhanced human functions, providing information which can also be useful in designing agents such as drugs that modulate such unique or enhanced human functions, and in designing treatment of diseases or conditions related to humans.
- the present invention can thus be useful in gaining insight into the molecular mechanisms that underlie human cognitive function, providing information which can also be useful in designing agents such as drugs that enhance human brain function, and in designing treatment of diseases related to human brain.
- a specific example of a human gene that has positive evolutionarily significant changes when compared to non-human primates is a tyrosine kinase gene, KIAA 641.
- the invention provides methods for identifying a polynucleotide sequence encoding a polypeptide, wherein said polypeptide may be associated with a physiological condition (such as a medically or commercially relevant positive evolutionarily significant change).
- the positive evolutionarily significant change can be found in humans or in non-human primates.
- methods are provided for identifying a non-human polynucleotide sequence encoding a polypeptide, wherein said polypeptide may be associated with a physiological condition in the non-human primate, including but not limited to those physiological conditions listed (and throughout the specification), such as susceptibility or resistance to the development of a medically relevant disease state, such as an infectious disease (including viral disease, such as AIDS) or cancer.
- methods comprise the steps of a) comparing protein-coding polynucleotide sequences of a non-human primate, preferably a chimpanzee, to protein-coding polynucleotide sequences of a human, wherein said human does not have the physiological condition; and b) selecting a non-human polynucleotide sequence that contains a nucleotide change as compared to corresponding sequence of the human, wherein said change is evolutionarily significant.
- the non-human protein-coding sequences correspond to cDNA.
- the sequences compared are from brain.
- the non-human protein coding sequence (and/or the polypeptide encoded therein) may be associated with development and/or maintenance of a physiological trait.
- the physiological condition may be any physiological condition, including those listed herein, such as, for example, disease (including susceptibility or resistance to disease) such as cancer or infectious disease (including viral diseases such as AIDS); life span; and brain function, including cognitive function.
- methods for identifying a polynucleotide sequence encoding a human polypeptide, wherein said polypeptide may be associated with a physiological condition that is present in human(s), comprising the steps of: a) comparing human protein-coding polynucleotide sequences to protein-coding polynucleotide sequences of a non-human primate, wherein the non-human primate does not have the physiological condition; and b) selecting a human polynucleotide sequence that contains a nucleotide change as compared to corresponding sequence of the non-human primate, wherein said change is evolutionarily significant.
- the human protein coding sequence (and/or the polypeptide encoded therein) may be associated with development and/or maintenance of a physiological condition.
- the human protein-coding sequences correspond to cDNA.
- the sequences compared are from brain.
- the physiological condition is life span.
- the physiological condition is a brain function.
- the brain function is cognitive function.
- methods comprise the steps of: (a) comparing human protein-coding nucleotide sequences to protein-coding nucleotide sequences of a non-human primate, preferably a chimpanzee, that is resistant to a particular medically relevant disease state, wherein the human protein coding sequence is associated with development of the disease; and (b) selecting a non-human polynucleotide sequence that contains at least one nucleotide change as compared to the corresponding sequence of the human, wherein the change is evolutionarily significant.
- the sequences identified by these methods may be further characterized and/or analyzed to confirm that they are associated with the development of the disease state or condition.
- the invention provides methods for identifying a polynucleotide sequence encoding a polypeptide, wherein said polypeptide may be associated with resistance to development of AIDS, comprising the steps of: (a) comparing AIDS resistant non-human primate protein coding sequences to human protein coding sequences, wherein the human protein coding sequences are associated with development of AIDS; and (b) selecting an AIDS resistant non-human primate sequence that contains at least one nucleotide change as compared to the corresponding human sequence, wherein the nucleotide change is evolutionarily significant.
- these methods can be accomplished, for example, by aligning sequences according to their sequence homology and identifying a human polynucleotide sequence that comprises at least one unique nucleotide change over the corresponding polynucleotide sequence of the non-human primate, wherein the unique nucleotide change is positively selected according to an evolutionary analysis (as described herein).
- methods for identifying an evolutionarily significant change in a human brain protein-coding polynucleotide sequence, comprising the steps of a) comparing human brain protein-coding polynucleotide sequences to corresponding sequences of a non-human primate; and b) selecting a human polynucleotide sequence that contains a nucleotide change as compared to corresponding sequence of the non-human primate, wherein said change is evolutionarily significant.
- the human brain protein coding nucleotide sequences correspond to human brain cDNAs.
- Another aspect of the invention includes methods for identifying a positively selected human evolutionarily significant change.
- These methods comprise the steps of: (a) comparing human protein-coding nucleotide sequences to protein-coding nucleotide sequences of a non-human primate; and (b) selecting a human polynucleotide sequence that contains at least one (i.e., one or more) nucleotide change as compared to corresponding sequence of the non-human primate, wherein said change is evolutionarily significant.
- the sequences identified by this method may be further characterized and/or analyzed for their possible association with biologically or medically relevant functions unique or enhanced in humans.
- Another embodiment of the present invention is a method for large scale sequence comparison between human protein-coding polynucleotide sequences and the protein-coding polynucleotide sequences from a non-human primate, e.g., chimpanzee, comprising:
- the protein coding sequences are from brain.
- a nucleotide change identified by any of the methods described herein is a non-synonymous substitution.
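The distinction between synonymous and non-synonymous substitutions can be illustrated with a minimal Python sketch. It assumes two coding sequences that are already aligned, in frame, gap-free and of equal length, and it uses the standard genetic code. The names `CODON_TABLE` and `classify_codon_changes` are hypothetical and introduced only for illustration; they do not come from the specification.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def classify_codon_changes(seq_a, seq_b):
    """Compare two aligned, gap-free coding sequences codon by codon.

    Returns one tuple per differing codon:
    (codon_number, codon_a, codon_b, aa_a, aa_b, kind),
    where kind is "synonymous" or "nonsynonymous".
    Codon numbers are 1-based.
    """
    if len(seq_a) != len(seq_b) or len(seq_a) % 3 != 0:
        raise ValueError("sequences must be aligned, in frame and equal in length")
    changes = []
    for start in range(0, len(seq_a), 3):
        codon_a = seq_a[start:start + 3].upper()
        codon_b = seq_b[start:start + 3].upper()
        if codon_a == codon_b:
            continue
        aa_a = CODON_TABLE[codon_a]
        aa_b = CODON_TABLE[codon_b]
        kind = "synonymous" if aa_a == aa_b else "nonsynonymous"
        changes.append((start // 3 + 1, codon_a, codon_b, aa_a, aa_b, kind))
    return changes


# Toy sequences (not real primate genes): codon 2 changes silently, codon 3 changes K to R.
for change in classify_codon_changes("ATGGCTAAA", "ATGGCCAGA"):
    print(change)
```

This sketch only labels individual codon differences; it does not estimate substitution rates per site, which is what a KA/KS-type analysis requires.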
- the evolutionary significance of the nucleotide change is determined according to the nonsynonymous substitution rate (KA) of the nucleotide sequence.
- the evolutionarily significant changes are assessed by determining the KA/KS ratio between the human gene and the homologous gene from non-human primate (such as chimpanzee), and preferably that ratio is at least about 0.75, more preferably greater than about 1 (unity) (i.e., at least about 1), more preferably at least about 1.25, more preferably at least about 1.50, and more preferably at least about 2.00.
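The graded preference for the KA/KS ratio stated above can be sketched as a small Python helper. It assumes that KA and KS have already been estimated by some KA/KS-type method; the estimation itself is not shown. The names `PREFERENCE_TIERS`, `ka_ks_ratio` and `highest_tier_met` are hypothetical, and the sketch treats "at least about" as a plain `>=` comparison, which is a simplifying assumption because the text does not quantify "about".

```python
# Ratio thresholds taken from the text: 0.75, 1 (unity), 1.25, 1.50 and 2.00.
PREFERENCE_TIERS = (0.75, 1.0, 1.25, 1.5, 2.0)


def ka_ks_ratio(ka, ks):
    """Return KA/KS, or None when KS is not positive and the ratio is undefined."""
    if ks <= 0:
        return None
    return ka / ks


def highest_tier_met(ka, ks, tiers=PREFERENCE_TIERS):
    """Return the largest threshold the ratio reaches, or None if it reaches none."""
    ratio = ka_ks_ratio(ka, ks)
    if ratio is None:
        return None
    met = [tier for tier in tiers if ratio >= tier]
    return max(met) if met else None


# Illustrative values only, not measurements from any gene.
print(highest_tier_met(0.375, 0.25))  # ratio 1.5 reaches the 1.5 tier
print(highest_tier_met(0.125, 0.25))  # ratio 0.5 reaches no tier
```

Returning `None` for a non-positive KS is a design choice of this sketch; how such cases are handled in practice depends on the estimation method used.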
- once a positively selected gene has been identified between human and a non-human primate (such as chimpanzee), further comparisons are performed with other non-human primates to confirm whether the human or the non-human primate (such as chimpanzee) gene has undergone positive selection.
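One simple way to picture how additional primates can indicate which lineage changed is a parsimony-style comparison at a single aligned site. The Python sketch below is an assumption made for illustration, not the procedure prescribed by the specification: if the outgroup species (for example gorilla or orangutan) share the chimpanzee state, the change is attributed to the human lineage, and vice versa. The function name `assign_lineage` and the return labels are hypothetical.

```python
def assign_lineage(human, chimp, outgroups):
    """Attribute a human/chimpanzee difference at one aligned site to a lineage.

    human, chimp: the residue (or codon) observed in each species.
    outgroups: residues observed at the same site in other non-human primates.
    Returns "no change", "human lineage", "chimpanzee lineage" or "ambiguous".
    """
    if human == chimp:
        return "no change"
    # Outgroups matching the chimpanzee imply the human state is the derived one.
    support_human_change = sum(1 for state in outgroups if state == chimp)
    # Outgroups matching the human imply the chimpanzee state is the derived one.
    support_chimp_change = sum(1 for state in outgroups if state == human)
    if support_human_change and not support_chimp_change:
        return "human lineage"
    if support_chimp_change and not support_human_change:
        return "chimpanzee lineage"
    return "ambiguous"


# Toy residues only: gorilla and orangutan both agree with the chimpanzee.
print(assign_lineage("K", "R", ["R", "R"]))
```

A site-by-site rule like this ignores back-mutation and conflicting outgroups, which is why it reports "ambiguous" rather than guessing when the evidence is mixed.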
- the invention provides methods for correlating an evolutionarily significant human nucleotide change to a physiological condition in a human (or humans), which comprise analyzing a functional effect (which includes determining the presence of a functional effect), if any, of (the presence or absence of) a polynucleotide sequence identified by any of the methods described herein, wherein presence of a functional effect indicates a correlation between the evolutionarily significant nucleotide change and the physiological condition.
- a functional effect if any may be assessed using a polypeptide sequence (or a portion of the polypeptide sequence) encoded by a nucleotide sequence identified by any of the methods described herein.
- the present invention also provides comparison of the identified polypeptides by physical and biochemical methods widely used in the art to determine the structural or biochemical consequences of the evolutionarily significant changes.
- Physical methods are meant to include methods that are used to examine structural changes to proteins encoded by genes found to have undergone adaptive evolution.
- Side-by-side comparison of the three-dimensional structures of a protein (either human or non-human primate) and the evolved homologous protein (either non-human primate or human, respectively) will provide valuable information for developing treatments for related human conditions and diseases.
- the invention provides methods for identifying a target site (which includes one or more target sites) which may be suitable for therapeutic intervention, comprising comparing a human polypeptide (or a portion of the polypeptide) encoded in a sequence identified by any of the methods described herein, with a corresponding non-human polypeptide (or a portion of the polypeptide), wherein a location of a molecular difference, if any, indicates a target site.
- the invention provides methods for identifying a target site (which includes one or more target sites) which may be suitable for therapeutic intervention, comprising comparing a non-human polypeptide (or a portion of the polypeptide) encoded in a sequence identified by any of the methods described herein, with a corresponding human polypeptide (or a portion of the polypeptide), wherein a location of a molecular difference, such as an amino acid difference, if any, indicates a target site.
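Locating amino acid differences between two corresponding polypeptides can be sketched in Python as follows. The sketch assumes the two sequences are already aligned to equal length, with `-` marking alignment gaps, and it reports only substitutions, not insertions or deletions. The function name `candidate_target_sites` is hypothetical, and the sequences shown are toy strings rather than real ICAM or other protein sequences.

```python
def candidate_target_sites(human_pep, other_pep, gap="-"):
    """List alignment columns where two aligned polypeptides differ.

    Returns (column, human_residue, other_residue) tuples with 1-based
    alignment columns. Columns containing a gap in either sequence are skipped.
    """
    if len(human_pep) != len(other_pep):
        raise ValueError("polypeptides must be aligned to the same length")
    sites = []
    for column, (h, o) in enumerate(zip(human_pep, other_pep), start=1):
        if h != o and gap not in (h, o):
            sites.append((column, h, o))
    return sites


# Toy example: differences at alignment columns 2 and 5.
print(candidate_target_sites("MKTAY", "MRTAF"))
```

Each reported column is only a candidate; whether a difference marks a useful target site would still have to be established by the structural and biochemical comparisons the text describes.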
- Biochemical methods are meant to include methods that are used to examine functional differences, such as binding specificity, binding strength, or optimal binding conditions, for a protein encoded by a gene that has undergone adaptive evolution. Side-by-side comparison of biochemical characteristics of a protein (either human or non-human primate) and the evolved homologous protein (either non-human primate or human, respectively) will reveal valuable information for developing treatments for related human conditions and diseases.
- the invention provides methods of identifying an agent which may modulate a physiological condition, said method comprising contacting an agent (i.e., at least one agent to be tested) with a cell that has been transfected with a polynucleotide sequence identified by any of the methods described herein, wherein an agent is identified by its ability to modulate function of the polynucleotide sequence.
- the invention provides methods of identifying an agent which may modulate a physiological condition, said method comprising contacting an agent (i.e., at least one agent) to be tested with a polypeptide (or a fragment of a polypeptide and/or a composition comprising a polypeptide or fragment of a polypeptide) encoded in or within a polynucleotide identified by any of the methods described herein, wherein an agent is identified by its ability to modulate function of the polypeptide.
- the invention also provides agents which are identified using the screening methods described herein.
- the invention provides methods of screening agents which may modulate the activity of the human polynucleotide or polypeptide to either modulate a unique or enhanced human function or to mimic the non-human primate trait of interest, such as susceptibility or resistance to development of a disease, such as AIDS.
- These methods comprise contacting a cell which has been transfected with a polynucleotide sequence with an agent to be tested, and identifying agents based on their ability to modulate function of the polynucleotide or contacting a polypeptide preparation with an agent to be tested and identifying agents based upon their ability to modulate function of the polypeptide.
- Figure 1 depicts a phylogenetic tree for primates within the hominoid group. The branching orders are based on well-supported mitochondrial DNA phylogenies. Messier and Stewart (1997) Nature 385:151-154.
- Figure 2 is a nucleotide sequence alignment between human and chimpanzee ICAM-1 sequences (GenBank accession numbers X06990 and X86848, respectively). The amino acid translation of the chimpanzee sequence is shown below the alignment.
- Figure 3 shows the nucleotide sequence of gorilla ICAM-1 (SEQ ID NO:4).
- Figure 4 shows the nucleotide sequence of orangutan ICAM-1 (SEQ ID NO:5).
- Figures 5(A)-(E) show the polypeptide sequence alignment of ICAM-1 from several primate species.
- Figures 6(A)-(B) show the polypeptide sequence alignment of ICAM-2 from several primate species.
- Figures 7(A)-(P) show the polypeptide sequence alignment of ICAM-3 from several primate species.
- Figure 8 depicts a schematic representation of a procedure for comparing human/primate brain polynucleotides, selecting sequences with evolutionarily significant changes, and further characterizing the selected sequences.
- the diagram of Figure 8 illustrates a preferred embodiment of the invention and together with the description serves to explain the principles of the invention, along with elaboration and optional additional steps. It is understood that any human/primate polynucleotide sequence can be compared by a similar procedure and that the procedure is not limited to brain polynucleotides.
- the present invention applies comparative genomics to identify specific gene changes which are associated with, and thus may contribute to or be responsible for, physiological conditions, such as medically or commercially relevant evolved traits.
- the invention comprises a comparative genomics approach to identify specific gene changes responsible for differences in functions and diseases distinguishing humans from non-humans, particularly primates, and most preferably chimpanzees, including the two known species, common chimpanzees and bonobos (pygmy chimpanzees).
- chimpanzees and humans are 98.5% identical at the DNA sequence level and the present invention can identify the adaptive molecular changes underlying differences between the species in a number of areas, including unique or enhanced human cognitive abilities and chimpanzee resistance to AIDS and certain cancers.
- the present invention provides exact information on evolutionary solutions that eliminate disease or provide unique functions.
- the present invention identifies genes that have evolved to confer an evolutionary advantage and the specific evolved changes.
- the present invention results from the observation that human protein-coding polynucleotides may contain sequence changes that are found in humans but not in other evolutionarily closely related species such as non-human primates, as a result of adaptive selection during evolution.
- the present invention further results from the observation that the genetic information of non-human primates may contain changes that are found in a particular non-human primate but not in humans, as a result of adaptive selection during evolution.
- a non-human primate polynucleotide or polypeptide has undergone natural selection that resulted in a positive evolutionarily significant change (i.e., the non-human primate polynucleotide or polypeptide has a positive attribute not present in humans).
- the positively selected polynucleotide or polypeptide may be associated with susceptibility or resistance to certain diseases or other commercially relevant traits.
- Medically relevant examples of this embodiment include, but are not limited to, polynucleotides and polypeptides that are positively selected in non-human primates, preferably chimpanzees, that may be associated with susceptibility or resistance to infectious diseases and cancer.
- An example of this embodiment includes polynucleotides and polypeptides associated with the susceptibility or resistance to progression from HIV infection to development of AIDS.
- the present invention can thus be useful in gaining insight into the molecular mechanisms that underlie resistance to progression from HIV infection to development of AIDS, providing information that can also be useful in discovering and/or designing agents such as drugs that prevent and/or delay development of AIDS.
- Commercially relevant examples include, but are not limited to, polynucleotides and polypeptides that are positively selected in non-human primates that may be associated with aesthetic traits, such as hair growth, acne or muscle mass.
- Positively selected human evolutionarily significant changes in polynucleotide and polypeptide sequences may be attributed to human capabilities that provide humans with competitive advantages, particularly when compared to the closest evolutionary relative, chimpanzee, such as unique or enhanced human brain functions.
- the present invention identifies human genes that evolved to provide unique or enhanced human cognitive abilities and the actual protein changes that confer functional differences will be quite useful in therapeutic approaches to treat cognitive deficiencies as well as cognitive enhancement for the general population.
- a "polynucleotide” refers to a polymeric form of nucleotides of any length, either ribonucleotides or deoxyribonucleotides, or analogs thereof.
- “polynucleotide” and “nucleotide sequence” are used interchangeably.
- a “gene” refers to a polynucleotide or portion of a polynucleotide comprising a sequence that encodes a protein. It is well understood in the art that a gene also comprises non-coding sequences, such as 5' and 3' flanking sequences (such as promoters, enhancers, repressors, and other regulatory sequences) as well as introns.
- the terms "polypeptide,” “peptide,” and “protein” are used interchangeably herein to refer to polymers of amino acids of any length. These terms also include proteins that are post-translationally modified through reactions that include glycosylation, acetylation and phosphorylation.
- a “physiological condition” is a term well-understood in the art and means any condition or state that can be measured and/or observed.
- a “physiological condition” includes, but is not limited to, a physical condition, such as degree of body fat, alopecia (baldness), acne; life-expectancy; disease states (which include susceptibility and/or resistance to diseases), such as cancer or infectious diseases. Examples of physiological conditions are provided below (see, e.g., definitions of "human medically relevant medical condition”, “human commercially relevant condition”, “medically relevant evolved trait”, and “commercially relevant evolved trait”) and throughout the specification, and it is understood that these terms and examples refer to a physiological condition.
- a physiological condition may be, but is not necessarily, the result of multiple factors, any of which in turn may be considered a physiological condition.
- a physiological condition which is "present" in a human or non-human primate occurs within a given population, and includes those physiological conditions which are unique and/or enhanced in a given population when compared to another population.
- “human medically relevant condition” or “human commercially relevant condition” are used herein to refer to human conditions for which medical or non-medical (respectively) intervention is desired.
- “medically relevant evolved trait” is used herein to refer to traits that have evolved in humans or non-human primates whose analysis could provide information (e.g., physical or biochemical data) relevant to the development of a human medical treatment.
- “commercially relevant evolved trait” is used herein to refer to traits that have evolved in humans or non-human primates whose analysis could provide information (e.g., physical or biochemical data) relevant to the development of a non-medical product or treatment for human use.
- “KA/KS-type methods” means methods that evaluate differences, frequently (but not always) shown as a ratio, between the number of nonsynonymous substitutions and synonymous substitutions in homologous genes (including the more rigorous methods that determine non-synonymous and synonymous sites). These methods are designated using several systems of nomenclature, including but not limited to KA/KS, dN/dS, DN/DS.
- “evolutionarily significant change” or “adaptive evolutionary change” refers to one or more nucleotide or peptide sequence change(s) between two species that may be attributed to a positive selective pressure.
- One method for determining the presence of an evolutionarily significant change is to apply a KA/KS-type analytical method, such as to measure a KA/KS ratio.
- a KA/KS ratio of at least about 0.75, more preferably at least about 1.0, more preferably at least about 1.25, more preferably at least about 1.5 and most preferably at least about 2.0 indicates the action of positive selection and is considered to be an evolutionarily significant change.
- positive evolutionarily significant change means an evolutionarily significant change in a particular species that results in an adaptive change that is positive as compared to other related species.
- positive evolutionarily significant changes are changes that have resulted in enhanced cognitive abilities in humans and adaptive changes in chimpanzees that have resulted in the ability of the chimpanzees infected with HIV to be resistant to progression to full-blown AIDS.
- “resistant” means that an organism, such as a chimpanzee, exhibits an ability to avoid, or diminish the extent of, a disease condition and/or development of the disease, preferably when compared to non-resistant organisms, typically humans.
- a chimpanzee is resistant to certain impacts of HIV and other viral infections, and/or it does not develop the ultimate disease - AIDS.
- “susceptibility” means that an organism, such as a human, fails to avoid, or diminish the extent of, a disease condition and/or development of the disease condition, preferably when compared to an organism that is known to be resistant, such as a non-human primate, such as chimpanzee.
- a human is susceptible to certain impacts of HIV and other viral infections and/or development of the ultimate disease - AIDS.
- resistance and susceptibility vary from individual to individual, and that, for purposes of this invention, these terms also apply to a group of individuals within a species, and comparisons of resistance and susceptibility generally refer to overall, average differences between species, although intra-specific comparisons may be used.
- the term "homologous” or “homologue” or “ortholog” is known and well understood in the art and refers to related sequences that share a common ancestor and is determined based on degree of sequence identity. These terms describe the relationship between a gene found in one species and the corresponding or equivalent gene in another species. For purposes of this invention homologous sequences are compared.
- “Homologous sequences” or “homologues” or “orthologs” are thought, believed, or known to be functionally related.
- a functional relationship may be indicated in any one of a number of ways, including, but not limited to, (a) degree of sequence identity; (b) same or similar biological function. Preferably, both (a) and (b) are indicated.
- the degree of sequence identity may vary, but is preferably at least 50% (when using standard sequence alignment programs known in the art), more preferably at least 60%, more preferably at least about 75%, more preferably at least about 85%.
- Homology can be determined using software programs readily available in the art, such as those discussed in Current Protocols in Molecular Biology (F.M.
- nucleotide change refers to nucleotide substitution, deletion, and/or insertion, as is well understood in the art.
- human protein-coding nucleotide sequence which is "associated with susceptibility to AIDS” as used herein refers to a human nucleotide sequence that encodes a protein that is associated with HIV dissemination (within the organism, i.e., intra-organism infectivity), propagation and/or development of AIDS. Due to the extensive research in the mechanisms underlying progression from HIV infection to the development of AIDS, a number of candidate human genes are believed or known to be associated with one or more of these phenomena.
- a polynucleotide (including any polypeptide encoded therein) sequence associated with susceptibility to AIDS is one which is either known or implicated to play a role in HIV dissemination, replication, and/or subsequent progression to full-blown AIDS.
- “AIDS resistant” means that an organism, such as a chimpanzee, exhibits an ability to avoid, or diminish the extent of, the result of HIV infection (such as propagation and dissemination) and/or development of AIDS, preferably when compared to AIDS-susceptible humans.
- susceptibility to AIDS means that an organism, such as a human, fails to avoid, or diminish the extent of, the result of HIV infection (such as propagation and dissemination) and/or development of AIDS, preferably when compared to an organism that is known to be AIDS resistant, such as a non-human primate, such as chimpanzee.
- the term "brain protein-coding nucleotide sequence” as used herein refers to a nucleotide sequence expressed in the brain that encodes a protein.
- brain protein-coding nucleotide sequence is a brain cDNA sequence.
- brain functions unique or enhanced in humans or “unique functional capabilities of the human brain” or “brain functional capability that is unique or enhanced in humans” refers to any brain function, either in kind or in degree, that is identified and/or observed to be enhanced in humans compared to other non-human primates.
- Such brain functions include, but are not limited to, high capacity information processing, storage and retrieval capabilities, creativity, memory, language abilities, brain-mediated emotional response, locomotion, pain/pleasure sensation, olfaction, and temperament.
- “Housekeeping genes” is a term well understood in the art and means those genes associated with general cell function, including but not limited to growth, division, stasis, metabolism, and/or death.
- “Housekeeping” genes generally perform functions found in more than one cell type. In contrast, cell-specific genes generally perform functions in a particular cell type (such as neurons) and/or class (such as neural cells).
- the term "agent”, as used herein, means a biological or chemical compound such as a simple or complex organic or inorganic molecule, a peptide, a protein or an oligonucleotide. A vast array of compounds can be synthesized, for example oligomers, such as oligopeptides and oligonucleotides, and synthetic organic and inorganic compounds based on various core structures, and these are also included in the term "agent".
- various natural sources can provide compounds for screening, such as plant or animal extracts, and the like. Compounds can be tested singly or in combination with one another.
- to modulate function of a polynucleotide or a polypeptide means that the function of the polynucleotide or polypeptide is altered when compared to not adding an agent. Modulation may occur on any level that affects function.
- a polynucleotide or polypeptide function may be direct or indirect, and measured directly or indirectly.
- a "function of a polynucleotide” includes, but is not limited to, replication; translation; expression pattern(s).
- a polynucleotide function also includes functions associated with a polypeptide encoded within the polynucleotide.
- an agent which acts on a polynucleotide and affects protein expression, conformation, folding (or other physical characteristics), binding to other moieties (such as ligands), activity (or other functional characteristics), regulation and/or other aspects of protein structure or function is considered to have modulated polynucleotide function.
- a "function of a polypeptide” includes, but is not limited to, conformation, folding (or other physical characteristics), binding to other moieties (such as ligands), activity (or other functional characteristics), and/or other aspects of protein structure or functions.
- an agent that acts on a polypeptide and affects its conformation, folding (or other physical characteristics), binding to other moieties (such as ligands), activity (or other functional characteristics), and/or other aspects of protein structure or functions is considered to have modulated polypeptide function.
- modulate susceptibility to development of AIDS and “modulate resistance to development of AIDS”, as used herein, include modulating intra-organism cell-to-cell transmission or infectivity of HIV.
- the terms further include reducing susceptibility to development of AIDS and/or cell-to-cell transmission or infectivity of HIV.
- the terms further include increasing resistance to development of AIDS and/or cell-to-cell transmission or infectivity of HIV.
- One means of assessing whether an agent is one that modulates susceptibility or resistance to development of AIDS is to determine whether at least one index of HIV susceptibility is affected, using a cell-based system as described herein, as compared with an appropriate control.
- Indicia of HIV susceptibility include, but are not limited to, cell-to-cell transmission of the virus, as measured by total number of cells infected with HIV and syncytia formation.
- the term “target site” means a location in a polypeptide which can be a single amino acid and/or is a part of a structural and/or functional motif, e.g., a binding site, a dimerization domain, or a catalytic active site. Target sites may be useful for direct or indirect interaction with an agent, such as a therapeutic agent.
- molecular difference includes any structural and/or functional difference. Methods to detect such differences, as well as examples of such differences, are described herein.
- a "functional effect” is a term well known in the art, and means any effect which is exhibited on any level of activity, whether direct or indirect.
- the source of the human and non-human polynucleotide can be any suitable source, e.g., genomic sequences or cDNA sequences.
- Preferably, cDNA sequences from human and a non-human primate are compared.
- Human protein-coding sequences can be obtained from public databases such as the Genome
- human protein-coding sequences may be obtained from, for example, sequencing of cDNA reverse transcribed from mRNA expressed in human cells, or after PCR amplification, according to methods well known in the art.
- human genomic sequences may be used for sequence comparison. Human genomic sequences can be obtained from public databases or from a sequencing of commercially available human genomic DNA libraries or from genomic DNA, after PCR.
- the non-human primate protein-coding sequences can be obtained by, for example, sequencing cDNA clones that are randomly selected from a non-human primate cDNA library.
- the non-human primate cDNA library can be constructed from total mRNA expressed in a primate cell using standard techniques in the art.
- the cDNA is prepared from mRNA obtained from a tissue at a determined developmental stage, or a tissue obtained after the primate has been subjected to certain environmental conditions.
- cDNA libraries used for the sequence comparison of the present invention can be constructed using conventional cDNA library construction techniques that are explained fully in the literature of the art. Total mRNAs are used as templates to reverse-transcribe cDNAs.
- Transcribed cDNAs are subcloned into appropriate vectors to establish a cDNA library.
- the established cDNA library can be maximized for full-length cDNA contents, although less than full-length cDNAs may be used.
- sequence frequency can be normalized according to, for example, Bonaldo et al. (1996) Genome Research
- cDNA clones randomly selected from the constructed cDNA library can be sequenced using standard automated sequencing techniques. Preferably, full-length cDNA clones are used for sequencing. Either the entire or a large portion of cDNA clones from a cDNA library may be sequenced, although it is also possible to practice some embodiments of the invention by sequencing as little as a single cDNA, or several cDNA clones.
- non-human primate cDNA clones to be sequenced can be pre-selected according to their expression specificity.
- the cDNAs can be subject to subtraction hybridization using mRNAs obtained from other organs, tissues or cells of the same animal. Under certain hybridization conditions with appropriate stringency and concentration, those cDNAs that hybridize with non-tissue specific mRNAs and thus likely represent "housekeeping" genes will be excluded from the cDNA pool. Accordingly, remaining cDNAs to be sequenced are more likely to be associated with tissue-specific functions.
- non-tissue-specific mRNAs can be obtained from one organ, or preferably from a combination of different organs and cells. The amount of non-tissue-specific mRNAs is maximized to saturate the tissue-specific cDNAs.
- information from online public databases can be used to select or give priority to cDNAs that are more likely to be associated with specific functions.
- the non-human primate cDNA candidates for sequencing can be selected by PCR using primers designed from candidate human cDNA sequence.
- Candidate human cDNA sequences are, for example, those that are only found in a specific tissue, such as brain, or that correspond to genes likely to be important in the specific function, such as brain function.
- Such human tissue-specific cDNA sequences can be obtained by searching online human sequence databases such as GenBank, in which information with respect to the expression profile and/or biological activity for cDNA sequences are specified.
- Sequences of non-human primate (for example, from an AIDS-resistant non-human primate) homologue(s) to a known human gene may be obtained using methods standard in the art, such as from public databases such as GenBank or PCR methods (using, for example, GeneAmp PCR System 9700 thermocyclers (Applied Biosystems, Inc.)).
- non-human primate cDNA candidates for sequencing can be selected by PCR using primers designed from candidate human cDNA sequences.
- primers may be made from the human sequences using standard methods in the art, including publicly available primer design programs such as PRIMER® (Whitehead Institute).
- the sequence amplified may then be sequenced using standard methods and equipment in the art, such as automated sequencers (Applied Biosystems, Inc.).
- nucleotide sequences are obtained from a human source and a non-human source.
- the human and non-human nucleotide sequences are compared to one another to identify sequences that are homologous.
- the homologous sequences are analyzed to identify those that have nucleic acid sequence differences between the two species.
- molecular evolution analysis is conducted to evaluate quantitatively and qualitatively the evolutionary significance of the differences. For genes that have been positively selected between two species, e.g., human and chimp, it is useful to determine whether the difference occurs in other non-human primates.
- the sequence is characterized in terms of molecular/genetic identity and biological function.
- the information can be used to identify agents useful in diagnosis and treatment of human medically or commercially relevant conditions.
- the general methods of the invention entail comparing human protein-coding nucleotide sequences to protein-coding nucleotide sequences of a non-human, preferably a primate, and most preferably a chimpanzee.
- examples of other non-human primates are bonobo, gorilla, orangutan, gibbon, Old World monkeys, and New World monkeys.
- a phylogenetic tree for primates within the hominoid group is depicted in FIG. 1.
- Bioinformatics is applied to the comparison and sequences are selected that contain a nucleotide change or changes that is/are evolutionarily significant change(s).
- the invention enables the identification of genes that have evolved to confer some evolutionary advantage and the identification of the specific evolved changes.
- Protein-coding sequences of human and another non-human primate are compared to identify homologous sequences. Any appropriate mechanism for completing this comparison is contemplated by this invention. Alignment may be performed manually or by software (examples of suitable alignment programs are known in the art). Preferably, protein-coding sequences from a non-human primate are compared to human sequences via database searches, e.g., BLAST searches. The high scoring "hits," i.e., sequences that show a significant similarity after BLAST analysis, will be retrieved and analyzed. Sequences showing a significant similarity can be those having at least about 60%, at least about 75%, at least about 80%, at least about 85%, or at least about 90% sequence identity.
- sequences showing greater than about 80% identity are further analyzed.
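- The identity cut-off described above can be sketched as follows. This is a hedged illustration only: it assumes each hit has already been retrieved and pairwise aligned (gaps written as '-'), and the names `percent_identity` and `select_hits`, as well as the toy sequences, are hypothetical rather than part of any BLAST tool.

```python
def percent_identity(a, b):
    """Percent identity over aligned columns in which neither sequence has a gap."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to equal length")
    columns = [(x, y) for x, y in zip(a.upper(), b.upper()) if x != "-" and y != "-"]
    if not columns:
        return 0.0
    matches = sum(x == y for x, y in columns)
    return 100.0 * matches / len(columns)


def select_hits(hits, threshold=80.0):
    """Keep the names of hits whose identity exceeds the threshold (in percent)."""
    return [
        name
        for name, (query, subject) in hits.items()
        if percent_identity(query, subject) > threshold
    ]


# Toy aligned pairs (query, subject); not real database hits.
hits = {
    "hitA": ("ATGGCCAAGT", "ATGGCTAAGT"),  # 9 of 10 columns match
    "hitB": ("ATGGCCAAGT", "TTGACTACGA"),  # 5 of 10 columns match
}
print(select_hits(hits))  # ['hitA']
```

The default of 80% mirrors the figure given in the text; the other cut-offs listed above (60% to 90%) can be passed through the `threshold` argument.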
- the homologous sequences identified via database searching can be aligned in their entirety using sequence alignment methods and programs that are known and available in the art, such as the commonly used simple alignment program CLUSTAL V by Higgins et al. (1992) CABIOS 8:189-191.
- the sequencing and homologous comparison of protein-coding sequences between human and a non-human primate may be performed simultaneously by using the newly developed sequencing chip technology. See, for example, Rava et al. US Patent 5,545,531.
- the aligned protein-coding sequences of human and another non-human primate are analyzed to identify nucleotide sequence differences at particular sites. Again, any suitable method for achieving this analysis is contemplated by this invention. If there are no nucleotide sequence differences, the non-human primate protein coding sequence is not usually further analyzed.
- the detected sequence changes are generally, and preferably, initially checked for accuracy.
- the initial checking comprises performing one or more of the following steps, any and all of which are known in the art: (a) finding the points where there are changes between the non-human primate and human sequences; (b) checking the sequence fluorogram (chromatogram) to determine if the bases that appear unique to non-human primate correspond to strong, clear signals specific for the called base; (c) checking the human hits to see if there is more than one human sequence that corresponds to a sequence change. Multiple human sequence entries for the same gene that have the same nucleotide at a position where there is a different nucleotide in a non-human primate sequence provide independent support that the human sequence is accurate, and that the change is significant. Such changes are examined using public database information and the genetic code to determine whether these nucleotide sequence changes result in a change in the amino acid sequence of the encoded protein. As the definition of “nucleotide change” makes clear, the present invention encompasses at least one nucleotide change, either a substitution, a deletion or an insertion, in a human protein-coding polynucleotide sequence as compared to corresponding sequence from a non-human primate.
- the change is a nucleotide substitution. More preferably, more than one substitution is present in the identified human sequence and is subjected to molecular evolution analysis.
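- The step of using the genetic code to decide whether a nucleotide change alters the encoded amino acid can be sketched as below. This is a minimal illustration assuming two in-frame, gap-free, equal-length coding sequences and the standard genetic code; `classify_codon_changes` and the toy sequences are hypothetical names and data.

```python
from itertools import product

BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
# Standard genetic code; '*' marks stop codons.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def classify_codon_changes(human_cds, other_cds):
    """List (codon number, human codon, other codon, kind) for each differing codon."""
    if len(human_cds) != len(other_cds) or len(human_cds) % 3:
        raise ValueError("sequences must be in-frame and of equal length")
    changes = []
    for i in range(0, len(human_cds), 3):
        h = human_cds[i:i + 3].upper()
        o = other_cds[i:i + 3].upper()
        if h == o:
            continue
        if h not in CODON_TABLE or o not in CODON_TABLE:
            kind = "unclassified"  # ambiguous base or gap character
        elif CODON_TABLE[h] == CODON_TABLE[o]:
            kind = "synonymous"
        else:
            kind = "non-synonymous"
        changes.append((i // 3 + 1, h, o, kind))
    return changes


# Toy coding sequences: codon 2 differs silently, codon 3 changes Lys to Arg.
print(classify_codon_changes("ATGGCCAAGCTG", "ATGGCTAGGCTG"))
# [(2, 'GCC', 'GCT', 'synonymous'), (3, 'AAG', 'AGG', 'non-synonymous')]
```

Insertions and deletions are not handled here; they would have to be resolved in the alignment before codons are compared.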
- KA/KS-type methods can be employed to evaluate quantitatively and qualitatively the evolutionary significance of the identified nucleotide changes between human gene sequences and that of a non-human primate.
- the KA/KS analysis by Li et al. is used to carry out the present invention, although other analysis programs that can detect positively selected genes between species can also be used.
- the KA/KS method, which comprises a comparison of the rate of non-synonymous substitutions per non-synonymous site with the rate of synonymous substitutions per synonymous site between homologous protein-coding regions of genes in terms of a ratio, is used to identify sequence substitutions that may be driven by adaptive selections as opposed to neutral selections during evolution.
- a synonymous (“silent") substitution is one that, owing to the degeneracy of the genetic code, makes no change to the amino acid sequence encoded; a non-synonymous substitution results in an amino acid replacement.
- the extent of each type of change can be estimated as KA and KS, respectively, the numbers of non-synonymous substitutions per non-synonymous site and synonymous substitutions per synonymous site.
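- To make the two quantities concrete, the sketch below estimates KA and KS for a pair of aligned coding sequences. It is not the Li (1993) procedure named in this description; it substitutes the simpler unweighted-pathway counting of Nei and Gojobori (1986) with a Jukes-Cantor correction, and it makes two simplifying assumptions: mutations to stop codons are counted as non-synonymous when sites are tallied, and codons containing gaps, ambiguous bases or stops are skipped. All function names and the toy sequences are hypothetical.

```python
from itertools import permutations, product
from math import log

BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
# Standard genetic code; '*' marks stop codons.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def syn_sites(codon):
    """Number of synonymous sites in one codon (between 0 and 3)."""
    s = 0.0
    for pos in range(3):
        for base in BASES:
            if base == codon[pos]:
                continue
            mutant = codon[:pos] + base + codon[pos + 1:]
            if CODON_TABLE[mutant] == CODON_TABLE[codon]:
                s += 1 / 3
    return s


def count_diffs(c1, c2):
    """Average (synonymous, non-synonymous) differences over all mutation orders."""
    diff_pos = [i for i in range(3) if c1[i] != c2[i]]
    if not diff_pos:
        return 0.0, 0.0
    paths = []
    for order in permutations(diff_pos):
        cur, sd, nd, valid = c1, 0, 0, True
        for pos in order:
            nxt = cur[:pos] + c2[pos] + cur[pos + 1:]
            if CODON_TABLE[nxt] == "*":  # discard pathways through a stop codon
                valid = False
                break
            if CODON_TABLE[nxt] == CODON_TABLE[cur]:
                sd += 1
            else:
                nd += 1
            cur = nxt
        if valid:
            paths.append((sd, nd))
    if not paths:  # every pathway hit a stop: treat all changes as non-synonymous
        return 0.0, float(len(diff_pos))
    return (sum(p[0] for p in paths) / len(paths),
            sum(p[1] for p in paths) / len(paths))


def jukes_cantor(p):
    """Correct a proportion of differences for multiple hits at the same site."""
    if p >= 0.75:
        raise ValueError("sequences too divergent for the Jukes-Cantor correction")
    return -0.75 * log(1 - 4 * p / 3)


def ka_ks(seq1, seq2):
    """Return (KA, KS) for two in-frame, equal-length coding sequences."""
    if len(seq1) != len(seq2) or len(seq1) % 3:
        raise ValueError("sequences must be in-frame and of equal length")
    S = N = sd = nd = 0.0
    for i in range(0, len(seq1), 3):
        c1, c2 = seq1[i:i + 3].upper(), seq2[i:i + 3].upper()
        if CODON_TABLE.get(c1, "*") == "*" or CODON_TABLE.get(c2, "*") == "*":
            continue  # skip stops, gaps and ambiguous codons
        s = (syn_sites(c1) + syn_sites(c2)) / 2
        S += s
        N += 3 - s
        d_s, d_n = count_diffs(c1, c2)
        sd += d_s
        nd += d_n
    if S == 0 or N == 0:
        raise ValueError("no usable synonymous or non-synonymous sites")
    return jukes_cantor(nd / N), jukes_cantor(sd / S)


# Toy sequences: one silent change (GCC/GCT) and one replacement (AAG/AGG).
ka, ks = ka_ks("ATGGCCAAGCTGGAA", "ATGGCTAGGCTGGAA")
print(f"KA={ka:.3f} KS={ks:.3f} KA/KS={ka / ks:.3f}")
```

With real data, KS can be zero for very similar sequences, in which case the ratio is undefined and should be reported as such rather than divided through.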
- Calculations of KA/KS may be performed manually or by using software.
- An example of a suitable program is MEGA (Molecular Genetics Institute, Pennsylvania State University).
- to estimate KA and KS, either complete or partial human protein-coding sequences are used to calculate total numbers of synonymous and non-synonymous substitutions, as well as non-synonymous and synonymous sites.
- the length of the polynucleotide sequence analyzed can be any appropriate length.
- the entire coding sequence is compared, in order to determine any and all significant changes.
- Publicly available computer programs such as Li93 (Li (1993) J. Mol. Evol. 36:96-99) or LNA, can be used to calculate the KA and KS values for all pairwise comparisons.
- This analysis can be further adapted to examine sequences in a "sliding window” fashion such that small numbers of important changes are not masked by the whole sequence.
- “Sliding window” refers to examination of consecutive, overlapping subsections of the gene (the subsections can be of any length).
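- A sliding-window scan of this kind can be sketched as follows. The window and step sizes are arbitrary illustrative values (the text allows subsections of any length), the windows are kept codon-aligned so that a KA/KS-type function could be applied to each one, and the names `sliding_windows`, `scan` and `codon_differences` are hypothetical. For brevity the example scores each window by counting differing codons rather than by a full KA/KS calculation.

```python
def sliding_windows(seq1, seq2, window_codons=30, step_codons=10):
    """Yield (1-based start codon, sub1, sub2) for overlapping codon-aligned windows."""
    n_codons = min(len(seq1), len(seq2)) // 3
    last_start = max(n_codons - window_codons, 0)
    for start in range(0, last_start + 1, step_codons):
        lo, hi = start * 3, (start + window_codons) * 3
        yield start + 1, seq1[lo:hi], seq2[lo:hi]


def scan(seq1, seq2, score, window_codons=30, step_codons=10):
    """Apply a scoring function (e.g. a KA/KS routine) to every window."""
    return [
        (start, score(a, b))
        for start, a, b in sliding_windows(seq1, seq2, window_codons, step_codons)
    ]


def codon_differences(a, b):
    """Stand-in score: number of codons that differ between two windows."""
    return sum(a[i:i + 3] != b[i:i + 3] for i in range(0, len(a), 3))


# Toy sequences of 12 codons that differ only at codon 6.
human = "ATG" * 12
chimp = "ATG" * 5 + "ACG" + "ATG" * 6
print(scan(human, chimp, codon_differences, window_codons=4, step_codons=2))
# [(1, 0), (3, 1), (5, 1), (7, 0), (9, 0)]
```

Only the windows that overlap codon 6 report a difference, which is the localization that a whole-sequence ratio would average away.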
- the comparison of non-synonymous and synonymous substitution rates is represented by the KA/KS ratio.
- KA/KS has been shown to be a reflection of the degree to which adaptive evolution has been at work in the sequence under study. Full length or partial segments of a coding sequence can be used for the KA/KS analysis. The higher the KA/KS ratio, the more likely that a sequence has undergone adaptive evolution and the non-synonymous substitutions are evolutionarily significant.
- the KA/KS ratio is at least about 0.75, more preferably at least about 1.0, more preferably at least about 1.25, more preferably at least about 1.50, or more preferably at least about 2.00.
- statistical analysis is performed on all elevated KA/KS ratios, including, but not limited to, standard methods such as Student's t-test and likelihood ratio tests described by Yang (1998) Mol. Biol. Evol. 37:441-456.
- KA/KS ratios significantly greater than unity strongly suggest that positive selection has fixed greater numbers of amino acid replacements than can be expected as a result of chance alone, which is in contrast to the commonly observed pattern in which the ratio is less than or equal to one.
- Ratios less than one generally signify the role of negative, or purifying selection: there is strong pressure on the primary structure of functional, effective proteins to remain unchanged.
- All methods for calculating K_A/K_S ratios are based on a pairwise comparison of the number of non-synonymous substitutions per non-synonymous site to the number of synonymous substitutions per synonymous site for the protein-coding regions of homologous genes from related species.
- Each method implements different corrections for estimating "multiple hits" (i.e., more than one nucleotide substitution at the same site).
- Each method also uses different models for how DNA sequences change over evolutionary time.
- a combination of results from different algorithms is used to increase the level of sensitivity for detection of positively-selected genes and confidence in the result.
- K_A/K_S ratios should be calculated for orthologous gene pairs (i.e., genes that result from speciation), as opposed to paralogous gene pairs (i.e., genes that result from gene duplication) (Messier and Stewart (1997)).
- This distinction may be made by performing additional comparisons with other non-human primates, such as gorilla and orangutan, which allows for phylogenetic tree-building.
- Orthologous genes when used in tree-building will yield the known "species tree", i.e., will produce a tree that recovers the known biological tree.
- paralogous genes will yield trees which will violate the known biological tree.
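The orthology check described above can be sketched as a comparison of tree topologies. The sketch assumes the gene tree has already been built by some tree-building program and is supplied as a rooted tree of nested tuples. The representation, the function names, and the fixed four-species tree are illustrative assumptions.

```python
def clades(tree):
    """Return every clade of a nested-tuple tree as a frozenset of leaf names."""
    found = set()

    def leaves(node):
        if isinstance(node, str):  # a leaf is a species name
            return frozenset([node])
        members = frozenset().union(*(leaves(child) for child in node))
        found.add(members)
        return members

    leaves(tree)
    return found


# Known branching order among the species named in the text.
SPECIES_TREE = ((("human", "chimpanzee"), "gorilla"), "orangutan")


def consistent_with_species_tree(gene_tree, species_tree=SPECIES_TREE):
    """True if the gene tree recovers exactly the clades of the species tree."""
    return clades(gene_tree) == clades(species_tree)
```

A gene tree such as `("orangutan", ("gorilla", ("chimpanzee", "human")))` passes the check regardless of the order in which children are written, whereas `((("human", "gorilla"), "chimpanzee"), "orangutan")` fails it and would flag the gene pair as possibly paralogous.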
- sequences that are functionally related to human protein-coding sequences.
- sequences may include, but are not limited to, non-coding sequences or coding sequences that do not encode human proteins.
- These related sequences can be, for example, physically adjacent to the human protein-coding sequences in the human genome, such as introns or 5'- and 3'- flanking sequences (including control elements such as promoters and enhancers).
- These related sequences may be obtained via searching a public human genome database such as GenBank or, alternatively, by screening and sequencing a human genomic library with a protein-coding sequence as probe.
- the evolutionarily significant nucleotide changes which are detected by molecular evolution analysis, such as the K_A/K_S analysis, can be further assessed for their unique occurrence in humans (or the non-human primate) or the extent to which these changes are unique in humans (or the non-human primate). For example, the identified changes can be tested for presence/absence in other non-human primate sequences.
- sequences with at least one evolutionarily significant change between human and one non-human primate can be used as primers for PCR analysis of other non-human primate protein-coding sequences, and the resulting polynucleotides are sequenced to see whether the same change is present in other non-human primates. These comparisons allow further discrimination as to whether the adaptive evolutionary changes are unique to the human lineage as compared to other non-human primates, or whether the adaptive change is unique to the non-human primate (such as chimpanzee) as compared to humans and other non-human primates.
- a nucleotide change that is detected in a non-human primate (i.e., chimpanzee) that is not detected in humans or other non-human primates likely represents a chimpanzee adaptive evolutionary change.
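The lineage assignment described above can be sketched for a single aligned site. The function name, the dictionary input, and the returned labels are hypothetical, and the residues in the usage note are made-up placeholders rather than data from any gene. The logic simply asks which of human or chimpanzee agrees with the other primates sampled.

```python
def classify_site(residues):
    """Assign a human/chimpanzee difference at one aligned site to a lineage.

    `residues` maps species name to the nucleotide or residue observed there.
    It must contain "human" and "chimpanzee"; every other entry is treated as
    a comparison primate.
    """
    human, chimp = residues["human"], residues["chimpanzee"]
    others = {
        state for species, state in residues.items()
        if species not in ("human", "chimpanzee")
    }
    if human == chimp:
        return "no human-chimpanzee difference"
    if not others:
        return "undetermined (no comparison primates)"
    if len(others) > 1:
        return "ambiguous (comparison primates disagree)"
    (shared_state,) = others
    if chimp == shared_state:
        return "human-specific change"
    if human == shared_state:
        return "chimpanzee-specific change"
    return "ambiguous (neither matches comparison primates)"
```

For example, `classify_site({"human": "A", "chimpanzee": "V", "gorilla": "A", "orangutan": "A"})` returns `"chimpanzee-specific change"`, since only chimpanzee departs from the state shared by the other species.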
- Other non-human primates used for comparison can be selected based on their phylogenetic relationships with human. Closely related primates can be those within the hominoid sublineage, such as chimpanzee, bonobo, gorilla, and orangutan.
- Non-human primates can also be those that are outside the hominoid group and thus not so closely related to human, such as the Old World monkeys and New World monkeys. Statistical significance of such comparisons may be determined using established available programs, e.g., the t-test as used by Messier and Stewart (1997) Nature 385:151-154. Those genes showing statistically high K_A/K_S ratios are very likely to have undergone adaptive evolution.
- Sequences with significant changes can be used as probes in genomes from different human populations to see whether the sequence changes are shared by more than one human population.
- Gene sequences from different human populations can be obtained from databases made available by, for example, the Human Genome Project, the human genome diversity project or, alternatively, from direct sequencing of PCR-amplified DNA from a number of unrelated, diverse human populations. The presence of the identified changes in different human populations would further indicate the evolutionary significance of the changes.
- Chimpanzee sequences with significant changes can be obtained and evaluated using similar methods to determine whether the sequence changes are shared among many chimpanzees.
- Sequences with significant changes between species can be further characterized in terms of their molecular/genetic identities and biological functions, using methods and techniques known to those of ordinary skill in the art.
- the sequences can be located genetically and physically within the human genome using publicly available bioinformatics programs.
- the newly identified significant changes within the nucleotide sequence may suggest a potential role of the gene in human evolution and a potential association with human-unique functional capabilities.
- the putative gene with the identified sequences may be further characterized by, for example, homologue searching.
- Shared homology of the putative gene with a known gene may indicate a similar biological role or function.
- Another exemplary method of characterizing a putative gene sequence is on the basis of known sequence motifs. Certain sequence patterns are known to code for regions of proteins having specific biological characteristics such as signal sequences, DNA binding domains, or transmembrane domains.
- the identified human sequences with significant changes can also be further evaluated by looking at where the gene is expressed in terms of tissue or cell-type specificity.
- the identified coding sequences can be used as probes to perform in situ mRNA hybridization that will reveal the expression patterns of the sequences.
- Genes that are expressed in certain tissues may be better candidates as being associated with important human functions associated with that tissue, for example brain tissue.
- the timing of the gene expression during each stage of human development can also be determined.
- the functional roles of the identified nucleotide sequences with significant changes can be assessed by conducting functional assays for different alleles of an identified gene in a model system, such as yeast, nematode, Drosophila, and mouse.
- Model systems may be cell-based or in vivo, such as transgenic animals.
- the transgenic mouse system is used. Methods of making cell-based systems and/or transgenic animal systems are known in the art and need not be described in detail herein.
- the use of computer programs allows modeling and visualizing the three-dimensional structure of the homologous proteins from human and chimpanzee.
- chimpanzee ICAM-3 contains a glutamine residue (Q101) at the site in which human ICAM-3 contains a proline (P101).
- the human protein is known to bend sharply at this point. Replacement of the proline by glutamine in the chimpanzee protein is likely to result in a much less sharp bend at this point. This has clear implications for packaging of the ICAM-3 chimpanzee protein into HIV virions.
- the present invention provides methods for identifying agents that are useful in modulating human-unique or human-enhanced functional capabilities and/or correcting defects in these capabilities using these sequences. These methods employ, for example, screening techniques known in the art, such as in vitro systems, cell-based expression systems and transgenic animal systems.
- the approach provided by the present invention not only identifies rapidly evolved genes, but indicates modulations that can be made to the protein that may not be too toxic because they exist in another species.
Screening methods
- the present invention also provides screening methods using the polynucleotides and polypeptides identified and characterized using the above-described methods. These screening methods are useful for identifying agents which may modulate the function(s) of the polynucleotides or polypeptides in a manner that would be useful for a human treatment.
- the methods entail contacting at least one agent to be tested with either a cell that has been transfected with a polynucleotide sequence identified by the methods described above, or a preparation of the polypeptide encoded by such polynucleotide sequence, wherein an agent is identified by its ability to modulate function of either the polynucleotide sequence or the polypeptide.
- the term "agent" means a biological or chemical compound such as a simple or complex organic or inorganic molecule, a peptide, a protein or an oligonucleotide.
- a vast array of compounds can be synthesized, for example oligomers, such as oligopeptides and oligonucleotides, and synthetic organic and inorganic compounds based on various core structures, and these are also included in the term "agent".
- various natural sources can provide compounds for screening, such as plant or animal extracts, and the like. Compounds can be tested singly or in combination with one another.
- modulate function of a polynucleotide or a polypeptide means that the function of the polynucleotide or polypeptide is altered when compared to not adding an agent. Modulation may occur on any level that affects function.
- a polynucleotide or polypeptide function may be direct or indirect, and measured directly or indirectly.
- a "function" of a polynucleotide includes, but is not limited to, replication, translation, and expression pattern(s).
- a polynucleotide function also includes functions associated with a polypeptide encoded within the polynucleotide.
- an agent which acts on a polynucleotide and affects protein expression, conformation, folding (or other physical characteristics), binding to other moieties (such as ligands), activity (or other functional characteristics), regulation and/or other aspects of protein structure or function is considered to have modulated polynucleotide function.
- a "function" of a polypeptide includes, but is not limited to, conformation, folding (or other physical characteristics), binding to other moieties (such as ligands), activity (or other functional characteristics), and/or other aspects of protein structure or functions.
- an agent that acts on a polypeptide and affects its conformation, folding (or other physical characteristics), binding to other moieties (such as ligands), activity (or other functional characteristics), and/or other aspects of protein structure or functions is considered to have modulated polypeptide function.
- the choice of agents to be screened is governed by several parameters, such as the particular polynucleotide or polypeptide target, its perceived function, its three-dimensional structure (if known or surmised), and other aspects of rational drug design.
- an in vivo screening assay may have several advantages over conventional drug screening assays: 1) if an agent must enter a cell to achieve a desired therapeutic effect, an in vivo assay can give an indication as to whether the agent can enter a cell; 2) an in vivo screening assay can identify agents that, in the state in which they are added to the assay system, are ineffective to elicit at least one characteristic which is associated with modulation of polynucleotide or polypeptide function, but that are modified by cellular components once inside a cell in such a way that they become effective agents; 3) most importantly, an in vivo assay system allows identification of agents affecting any component of a pathway that ultimately results in characteristics that are associated with polynucleotide or polypeptide function.
- screening can be performed by adding an agent to a sample of appropriate cells which have been transfected with a polynucleotide identified using the methods of the present invention, and monitoring the effect, i.e., modulation of a function of the polynucleotide or the polypeptide encoded within the polynucleotide.
- the experiment preferably includes a control sample which does not receive the candidate agent.
- the treated and untreated cells are then compared by any suitable phenotypic criteria, including but not limited to microscopic analysis, viability testing, ability to replicate, histological examination, the level of a particular RNA or polypeptide associated with the cells, the level of enzymatic activity expressed by the cells or cell lysates, the interactions of the cells when exposed to infectious agents, such as HIV, and the ability of the cells to interact with other cells or compounds.
- the transfected cells can be exposed to the agent to be tested and, before, during, or after treatment with the agent, the cells can be infected with a virus, such as HIV, and tested for any indication of susceptibility of the cells to viral infection, including, for example, susceptibility of the cells to cell-to-cell viral infection, replication of the virus, production of a viral protein, and/or syncytia formation following infection with the virus. Differences between treated and untreated cells indicate effects attributable to the candidate agent. Optimally, the agent has a greater effect on experimental cells than on control cells.
- Appropriate host cells include, but are not limited to, eukaryotic cells, preferably mammalian cells. The choice of cell will at least partially depend on the nature of the assay contemplated.
- a suitable host cell, transfected with a polynucleotide of interest such that the polynucleotide is expressed, is contacted with an agent to be tested.
- An agent would be tested for its ability to result in increased expression of mRNA and/or polypeptide.
- Methods of making vectors and transfection are well known in the art. "Transfection" encompasses any method of introducing the exogenous sequence, including, for example, lipofection, transduction, infection or electroporation.
- the exogenous polynucleotide may be maintained as a non-integrated vector (such as a plasmid) or may be integrated into the host genome.
- transcription regulatory regions could be linked to a reporter gene and the construct added to an appropriate host cell.
- "reporter gene" means a gene that encodes a gene product that can be identified (i.e., a reporter protein). Reporter genes include, but are not limited to, alkaline phosphatase, chloramphenicol acetyltransferase, β-galactosidase, luciferase and green fluorescent protein (GFP).
- reporter gene assays include, but are not limited to, enzymatic assays and fluorimetric assays. Reporter genes and assays to detect their products are well known in the art and are described, for example, in Ausubel et al. (1987) and periodic updates. Reporter genes, reporter gene assays, and reagent kits are also readily available from commercial sources. Examples of appropriate cells include, but are not limited to, fungal, yeast, mammalian, and other eukaryotic cells.
- a practitioner of ordinary skill will be well acquainted with techniques for transfecting eukaryotic cells, including the preparation of a suitable vector, such as a viral vector; conveying the vector into the cell, such as by electroporation; and selecting cells that have been transformed, such as by using a reporter or drug sensitivity element. The effect of an agent on transcription from the regulatory region in these constructs would be assessed through the activity of the reporter gene product.
- expression could be decreased when it would normally be expressed.
- An agent could accomplish this through a decrease in transcription rate and the reporter gene system described above would be a means to assay for this.
- the host cells used to assess such agents would need to be permissive for expression.
- Cells transcribing mRNA could be used to identify agents that specifically modulate the half-life of mRNA and/or the translation of mRNA. Such cells would also be used to assess the effect of an agent on the processing and/or post-translational modification of the polypeptide.
- An agent could modulate the amount of polypeptide in a cell by modifying the turn-over (i.e., increase or decrease the half-life) of the polypeptide.
- the specificity of the agent with regard to the mRNA and polypeptide would be determined by examining the products in the absence of the agent and by examining the products of unrelated mRNAs and polypeptides. Methods to examine mRNA half-life, protein processing, and protein turn-over are well known to those skilled in the art.
- the screening methods could also be useful in the identification of agents that modulate polypeptide function through direct interaction with the polypeptide. Such agents could block normal polypeptide-ligand interactions, if any, or could enhance or stabilize such interactions. Such agents could also alter a conformation of the polypeptide. The effect of the agent could be determined using immunoprecipitation reactions.
- Appropriate antibodies would be used to precipitate the polypeptide and any protein tightly associated with it.
- an agent could be identified that would augment or inhibit polypeptide-ligand interactions, if any.
- Polypeptide-ligand interactions could also be assessed using cross-linking reagents that convert a close, but noncovalent interaction between polypeptides into a covalent interaction. Techniques to examine protein-protein interactions are well known to those skilled in the art. Techniques to assess protein conformation are also well known to those skilled in the art.
- screening methods can involve in vitro methods, such as cell-free transcription or translation systems.
- transcription or translation is allowed to occur, and an agent is tested for its ability to modulate function.
- an in vitro transcription/translation system may be used for an assay that determines whether an agent modulates the translation of mRNA or a polynucleotide.
- these systems are available commercially and provide an in vitro means to produce mRNA corresponding to a polynucleotide sequence of interest. After mRNA is made, it can be translated in vitro and the translation products compared. Comparison of translation products between an in vitro expression system that does not contain any agent (negative control) with an in vitro expression system that does contain an agent indicates whether the agent is affecting translation.
- Comparison of translation products between control and test polynucleotides indicates whether the agent, if acting on this level, is selectively affecting translation (as opposed to affecting translation in a general, non-selective or non-specific fashion).
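The two comparisons described above (with versus without agent, and test versus control polynucleotide) can be sketched as a small decision rule. The function name, the use of a log2 fold change, and the two-fold default cutoff are illustrative assumptions and not values from the text. The inputs are any positive measurements of translation product, such as band intensity or reporter signal.

```python
from math import log2


def classify_agent(test_untreated, test_treated,
                   control_untreated, control_treated,
                   min_log2_change=1.0):
    """Classify an agent's effect on translation as absent, selective, or general.

    Each argument is a positive amount of translation product measured in an
    in vitro system. `min_log2_change` is a hypothetical cutoff: 1.0 means a
    two-fold change in either direction.
    """
    test_change = log2(test_treated / test_untreated)
    control_change = log2(control_treated / control_untreated)
    if abs(test_change) < min_log2_change:
        return "no effect on test polynucleotide"
    if abs(control_change) < min_log2_change:
        return "selective effect"
    return "non-selective effect"
```

For instance, `classify_agent(100, 20, 100, 95)` returns `"selective effect"`, because translation of the test polynucleotide drops five-fold while the control is essentially unchanged. In practice the cutoff would be replaced by a statistical test on replicate measurements.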
- the modulation of polypeptide function can be accomplished in many ways including, but not limited to, the in vivo and in vitro assays listed above as well as in in vitro assays using protein preparations.
- Polypeptides can be extracted and/or purified from natural or recombinant sources to create protein preparations.
- An agent can be added to a sample of a protein preparation and the effect monitored, that is, whether and how the agent acts on a polypeptide and affects its conformation, folding (or other physical characteristics), binding to other moieties (such as ligands), activity (or other functional characteristics), and/or other aspects of protein structure or function.
- a polypeptide is first recombinantly expressed in a prokaryotic or eukaryotic expression system as a native or as a fusion protein in which a polypeptide (encoded by a polynucleotide identified as described above) is conjugated with a well-characterized epitope or protein. Recombinant polypeptide is then purified by, for instance, immunoprecipitation using appropriate antibodies or anti-epitope antibodies or by binding to immobilized ligand of the conjugate.
- An affinity column made of polypeptide or fusion protein is then used to screen a mixture of compounds which have been appropriately labeled.
- Suitable labels include, but are not limited to fluorochromes, radioisotopes, enzymes and chemiluminescent compounds.
- the unbound and bound compounds can be separated by washes using various conditions (e.g., high salt, detergent) that are routinely employed by those skilled in the art.
- Non-specific binding to the affinity column can be minimized by pre-clearing the compound mixture using an affinity column containing merely the conjugate or the epitope. Similar methods can be used for screening for an agent(s) that competes for binding to polypeptides.
- in addition to affinity chromatography, there are other techniques, such as measuring the change of melting temperature or the fluorescence anisotropy of a protein, which will change upon binding another molecule.
- a BIAcore assay using a sensor chip (supplied by Pharmacia Biosensor; Stitt et al. (1995) Cell 80:661-670) that is covalently coupled to polypeptide may be performed to determine the binding activity of different agents.
- the in vitro screening methods of this invention include structural, or rational, drug design, in which the amino acid sequence, three-dimensional atomic structure or other property (or properties) of a polypeptide provides a basis for designing an agent which is expected to bind to a polypeptide.
- the design and/or choice of agents in this context is governed by several parameters, such as side-by-side comparison of the structures of human and homologous non-human primate polypeptides, the perceived function of the polypeptide target, its three-dimensional structure (if known or surmised), and other aspects of rational drug design. Techniques of combinatorial chemistry can also be used to generate numerous permutations of candidate agents. Also contemplated in screening methods of the invention are transgenic animal systems, which are known in the art.
- a secondary screen may comprise testing the agent(s) in an infectivity assay using mice and other animal models (such as rat), which are known in the art.
- a cytotoxicity assay would be performed as a further corroboration that an agent which tested positive in a primary screen would be suitable for use in living organisms. Any assay for cytotoxicity would be suitable for this purpose, including, for example the MTT assay (Promega).
- the invention also includes agents identified by the screening methods described herein.
- a non-human primate polynucleotide or polypeptide has undergone natural selection that resulted in a positive evolutionarily significant change (i.e., the non-human primate polynucleotide or polypeptide has a positive attribute not present in humans).
- the positively selected polynucleotide or polypeptide may be associated with susceptibility or resistance to certain diseases or with other commercially relevant traits.
- Examples of this embodiment include, but are not limited to, polynucleotides and polypeptides that have been positively selected in non-human primates, preferably chimpanzees, that may be associated with susceptibility or resistance to infectious diseases, cancer, or acne or may be associated with aesthetic conditions of interest to humans, such as hair growth or muscle mass.
- An example of this embodiment includes polynucleotides and polypeptides associated with the susceptibility or resistance to HIV progression to AIDS. The present invention can thus be useful in gaining insight into the molecular mechanisms that underlie resistance to HIV infection progressing to development of AIDS, providing information that can also be useful in discovering and/or designing agents such as drugs that prevent and/or delay development of AIDS.
- Commercially relevant examples include, but are not limited to, polynucleotides and polypeptides that are positively selected in non-human primates that may be associated with aesthetic traits, such as hair growth, acne, or muscle mass.
- the invention provides methods for identifying a polynucleotide sequence encoding a polypeptide, wherein said polypeptide may be associated with a medically relevant positive evolutionarily significant change.
- the positive evolutionarily significant change can be found in humans or in non-human primates, but the positively selected non-human primate evolutionarily significant change will be described first herein.
- the method comprises the steps of: (a) comparing human protein-coding nucleotide sequences to protein-coding nucleotide sequences of a non-human primate; and (b) selecting a human polynucleotide sequence that contains at least one nucleotide change as compared to the corresponding sequence of the non-human primate, wherein said change is evolutionarily significant.
- sequences identified by this method may be further characterized and/or analyzed for their possible association with biologically or medically relevant functions unique or enhanced in humans.
- a method for identifying a positive evolutionarily significant change within human protein-coding nucleotide sequences comprising the steps of: (a) comparing human protein-coding nucleotide sequences to corresponding sequences of a non-human primate; and (b) selecting a human polynucleotide sequence that contains at least one nucleotide change as compared to the corresponding sequence of the non-human primate, wherein said change is evolutionarily significant.
- This invention specifically provides methods for identifying human polynucleotide and polypeptide sequences that may be associated with unique or enhanced functional capabilities of the human, for example, brain function or longer life span. More particularly, these methods identify those genetic sequences that may be associated with capabilities that are unique or enhanced in humans, including, but not limited to, brain functions such as high capacity information processing, storage and retrieval capabilities, creativity, and language abilities. Moreover, these methods identify those sequences that may be associated with other brain functional features with respect to which the human brain performs at enhanced levels as compared to non-human primates; these differences may include brain-mediated emotional response, locomotion, pain/pleasure sensation, olfaction, temperament and longer life span. In this method, the general methods of the invention are applied as described above.
- the methods described herein entail (a) comparing human protein-coding polynucleotide sequences to those of a non-human primate; and (b) selecting those human protein-coding polynucleotide sequences having evolutionarily significant changes that may be associated with unique or enhanced functional capabilities of the human as compared to those of the non-human primate.
- the human sequence includes the evolutionarily significant change (i.e., the human sequence differs from more than one non-human primate species sequence in a manner that suggests that such a change is in response to a selective pressure).
- the identity and function of the protein encoded by the gene that contains the evolutionarily significant change is characterized and a determination is made whether or not the protein can be involved in a unique or enhanced human function. If the protein is involved in a unique or enhanced human function, the information is used in a manner to identify agents that can supplement or otherwise modulate the unique or enhanced human function.
- identifying the genetic (i.e., nucleotide sequence) differences underlying the functional uniqueness of human brain may provide a basis for designing agents that can modulate human brain functions and/or help correct functional defects. These sequences could also be used in developing diagnostic reagents and/or biomedical research tools.
- the invention also provides methods for a large-scale comparison of human brain protein-coding sequences with those from a non-human primate.
- the identified human sequence changes can be used in establishing a database of candidate human genes that may be involved in human brain function. Candidates are ranked as to the likelihood that the gene is responsible for the unique or enhanced functional capabilities found in the human brain compared to chimpanzee or other non-human primates. Moreover, the database not only provides an ordered collection of candidate genes, it also provides the precise molecular sequence differences that exist between human and chimpanzee (and other non-human primates), and thus defines the changes that underlie the functional differences. This information can be useful in the identification of potential sites on the protein that may serve as useful targets for pharmaceutical agents.
- the present invention also provides methods for correlating an evolutionarily significant nucleotide change to a brain functional capability that is unique or enhanced in humans, comprising (a) identifying a human nucleotide sequence according to the methods described above; and (b) analyzing the functional effect of the presence or absence of the identified sequence in a model system.
- the putative function can be assayed in appropriate in vitro assays using transiently or stably transfected mammalian cells in culture, or using mammalian cells transfected with an antisense clone to inhibit expression of the identified polynucleotide to assess the effect of the absence of expression of its encoded polypeptide.
- Studies such as one-hybrid and two-hybrid studies can be conducted to determine, for example, what other macromolecules the polypeptide interacts with.
- Transgenic nematodes or Drosophila can be used for various functional assays, including behavioral studies.
- protein coding polynucleotides may contain sequence changes that are found in chimpanzees (as well as other AIDS-resistant primates) but not in humans, likely as a result of positive adaptive selection during evolution.
- Such polynucleotide and polypeptide sequences may account for the ability of an AIDS-resistant non-human primate (such as chimpanzee) to resist development of AIDS.
- the methods of this invention employ selective comparative analysis to identify candidate genes which may be associated with susceptibility or resistance to AIDS, which may provide new host targets for therapeutic intervention as well as specific information on the changes that evolved to confer resistance. Development of therapeutic approaches that involve host proteins (as opposed to viral proteins and/or mechanisms) may delay or even avoid the emergence of resistant viral mutants.
- the invention also provides screening methods using the sequences and structural differences identified.
- This invention provides methods for identifying human polynucleotide and polypeptide sequences that may be associated with susceptibility to post-infection development of AIDS.
- the invention also provides methods for identifying polynucleotide and polypeptide sequences from an AIDS-resistant non-human primate (such as chimpanzee) that may be associated with resistance to development of AIDS. Identifying the genetic (i.e., nucleotide sequence) and the resulting protein structural and biochemical differences underlying susceptibility or resistance to development of AIDS will likely provide a basis for discovering and/or designing agents that can provide prevention and/or therapy for HIV infection progressing to AIDS. These differences could also be used in developing diagnostic reagents and/or biomedical research tools. For example, identification of proteins which confer resistance may allow development of diagnostic reagents or biomedical research tools based upon the disruption of the disease pathway of which the resistant protein plays a part.
- The methods described herein entail (a) comparing human protein-coding polynucleotide sequences to those of an AIDS-resistant non-human primate (such as chimpanzee), wherein the human protein-coding polynucleotide sequence is associated with development of AIDS; and (b) selecting those human protein-coding polynucleotide sequences having evolutionarily significant changes that may be associated with susceptibility to development of AIDS.
- Alternatively, the methods entail (a) comparing human protein-coding polynucleotide sequences to those of an AIDS-resistant non-human primate (such as chimpanzee), wherein the human protein-coding polynucleotide sequence is associated with development of AIDS; and (b) selecting those non-human primate protein-coding polynucleotide sequences having evolutionarily significant changes that may be associated with resistance to development of AIDS.
- the methods could be used in a situation in which a non-human primate is known or believed to have harbored the infectious disease for a significant period (i.e., a sufficient time to have allowed positive selection) and is resistant to development of the disease.
- the invention provides methods for identifying a polynucleotide sequence encoding a polypeptide, wherein said polypeptide may be associated with resistance to development of an infectious disease, comprising the steps of: (a) comparing infectious disease-resistant non-human primate protein coding sequences to human protein coding sequences, wherein the human protein coding sequence is associated with development of the infectious disease; and (b) selecting an infectious disease-resistant non-human primate sequence that contains at least one nucleotide change as compared to the corresponding human sequence, wherein the nucleotide change is evolutionarily significant.
- the invention provides methods for identifying a human polynucleotide sequence encoding a polypeptide, wherein said polypeptide may be associated with susceptibility to development of an infectious disease, comprising the steps of: (a) comparing human protein coding sequences to protein-coding polynucleotide sequences of an infectious disease-resistant non-human primate, wherein the human protein coding sequence is associated with development of the infectious disease; and (b) selecting a human polynucleotide sequence that contains at least one nucleotide change as compared to the corresponding sequence of an infectious disease-resistant non-human primate, wherein the nucleotide change is evolutionarily significant.
- Human sequences to be compared with a homologue from an AIDS-resistant non-human primate are selected based on their known or implicated association with HIV propagation (i.e., replication), dissemination and/or subsequent progression to AIDS.
- Such knowledge is obtained, for example, from published literature and/or public databases (including sequence databases such as GenBank).
- Table 1 contains an exemplary list of genes to be examined. The sequences are generally known in the art.
- PCD promoter bcl-2 apoptosis inhibitor lck tyrosine kinase MAPK (mitogen activated protein kinase) protein kinase
- TNF-receptor II receptor interferon ⁇
- IFN-⁇ cytokine interleukin 1
- IL-1⁇ cytokine interleukin 1
- IL-1⁇ cytokine interleukin 1⁇ (IL-1⁇)
- IL-4 cytokine interleukin 4
- IL-6 cytokine interleukin 6
- IL-10 cytokine interleukin 10
- IL-13 cytokine interleukin 13
- M-CSF macrophage colony-stimulating factor
- PI 3-kinase cytokine phosphatidylinositol 3-kinase
- PI 4-kinase PI 4-kinase
- HLA class I ⁇ chain histocompatibility antigen ⁇ 2 microglobulin lymphocyte antigen
- CD55 decay-accelerating factor CD59 complement protein CD63 glycoprotein antigen CD71 interferon ⁇ (IFN- ⁇ ) cytokine CD44 cell adhesion CD8 glycoprotein
- Aligned protein-coding sequences of human and an AIDS-resistant non-human primate such as chimpanzee are analyzed to identify nucleotide sequence differences at particular sites.
- the detected sequence changes are generally, and preferably, initially checked for accuracy as described above.
- The evolutionarily significant nucleotide changes, which are detected by molecular evolution analysis such as the K_A/K_S analysis, can be further assessed to determine whether the non-human primate gene or the human gene has been subjected to positive selection. For example, the identified changes can be tested for presence/absence in other AIDS-resistant non-human primate sequences.
- Sequences with at least one evolutionarily significant change between human and one AIDS-resistant non-human primate can be used as primers for PCR analysis of other non-human primate protein-coding sequences, and resulting polynucleotides are sequenced to see whether the same change is present in other non-human primates.
- These comparisons allow further discrimination as to whether the adaptive evolutionary changes are unique to the AIDS-resistant non-human primate (such as chimpanzee) as compared to other non-human primates. For example, a nucleotide change that is detected in chimpanzee but not other primates more likely represents positive selection on the chimpanzee gene.
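The lineage reasoning above can be illustrated with a minimal sketch. It assumes that the bases observed in other primates (for example gorilla and orangutan) approximate the ancestral state, and it uses a simple parsimony rule; the function name `lineage_of_change` and the return labels are hypothetical and are not taken from the disclosure.

```python
def lineage_of_change(human, chimp, outgroups):
    """Assign a human/chimpanzee difference at one site to a lineage.

    Assumption: the outgroup bases stand in for the ancestral state.
    """
    if human == chimp:
        return "no difference"
    observed = set(outgroups)
    if observed == {human}:
        # All outgroups match human, so the change arose on the chimpanzee lineage.
        return "chimpanzee"
    if observed == {chimp}:
        # All outgroups match chimpanzee, so the change arose on the human lineage.
        return "human"
    # Outgroups missing or in disagreement: the site cannot be polarized.
    return "ambiguous"


print(lineage_of_change("A", "G", ["A", "A"]))  # prints "chimpanzee"
```

A site reported as "ambiguous" would need additional primate sequences before it could be attributed to either lineage.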
- Other non-human primates used for comparison can be selected based on their phylogenetic relationships with human.
- Closely related primates can be those within the hominoid sublineage, such as chimpanzee, bonobo, gorilla, and orangutan.
- Non-human primates can also be those that are outside the hominoid group and thus not so closely related to human, such as the Old World monkeys and New World monkeys.
- Statistical significance of such comparisons may be determined using established available programs, e.g., t-test as used by Messier and Stewart (1997) Nature 385:151-154.
- Sequences with significant changes can be used as probes in genomes from different humans to see whether the sequence changes are shared by more than one individual. For example, certain individuals are slower to progress to AIDS ("slow progressers") and comparison (a) between a chimpanzee sequence and the homologous sequence from the slow-progresser human individual and/or (b) between an AIDS-susceptible individual and a slow-progresser individual would be of interest.
- Gene sequences from different human populations can be obtained from databases made available by, for example, the human genome diversity project or, alternatively, from direct sequencing of PCR-amplified DNA from a number of unrelated, diverse human populations. The presence of the identified changes in human slow progressers would further indicate the evolutionary significance of the changes.
- a chimpanzee cDNA library is constructed using chimpanzee tissue.
- RNA is extracted from the tissue (RNeasy kit, Qiagen; RNase-free Rapid RNA kit, 5 Prime-3 Prime, Inc.).
- integrity and purity of the RNA are determined according to conventional molecular cloning methods.
- Poly A+ RNA is isolated (Mini-Oligo(dT) Cellulose Spin Columns, 5 Prime-3 Prime, Inc.) and used as template for the reverse-transcription of cDNA with oligo (dT) as a primer.
- the synthesized cDNA is treated and modified for cloning using commercially available kits.
- Recombinants are then packaged and propagated in a host cell line. Portions of the packaging mixes are amplified and the remainder retained prior to amplification.
- The library can be normalized and the number of independent recombinants in the library is determined.
- Suitable primers based on a candidate human gene are prepared and used for PCR amplification of chimpanzee cDNA either from a cDNA library or from cDNA prepared from mRNA. Selected chimpanzee cDNA clones from the cDNA library are sequenced using an automated sequencer, such as an ABI 377. Commonly used primers on the cloning vector such as the M13 Universal and Reverse primers are used to carry out the sequencing. For inserts that are not completely sequenced by end sequencing, dye-labeled terminators are used to fill in remaining gaps.
- the detected sequence differences are initially checked for accuracy, for example by finding the points where there are differences between the chimpanzee and human sequences; checking the sequence fluorogram (chromatogram) to determine if the bases that appear unique to human correspond to strong, clear signals specific for the called base; checking the human hits to see if there is more than one human sequence that corresponds to a sequence change; and other methods known in the art, as needed.
- Multiple human sequence entries for the same gene that have the same nucleotide at a position where there is a different chimpanzee nucleotide provide independent support that the human sequence is accurate, and that the chimpanzee/human difference is real.
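The independent-support check can be sketched as follows, assuming the human database entries and the chimpanzee sequence have already been aligned to a common coordinate system. The function name `support_for_difference` and the result keys are hypothetical.

```python
from collections import Counter


def support_for_difference(human_entries, chimp_seq, pos):
    """Summarize how many human entries agree at `pos` (0-based index)."""
    human_bases = Counter(seq[pos] for seq in human_entries)
    consensus, n_support = human_bases.most_common(1)[0]
    return {
        "human_consensus": consensus,
        "supporting_entries": n_support,
        "total_entries": len(human_entries),
        "differs_from_chimp": consensus != chimp_seq[pos],
    }


# Made-up sequences for illustration only.
report = support_for_difference(["ATGCCA", "ATGCCA", "ATGCTA"], "ATGCGA", 4)
print(report)
```

A difference supported by only one human entry would be a candidate for re-checking against the chromatogram before further analysis.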
- Such changes are examined using public database information and the genetic code to determine whether these DNA sequence changes result in a change in the amino acid sequence of the encoded protein.
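As a minimal sketch of this step, the standard genetic code can be applied codon by codon to decide whether a nucleotide difference changes the encoded amino acid. The sketch assumes uppercase, gap-free, in-frame coding sequences of equal length; the function name `classify_codon_changes` is hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def classify_codon_changes(human_cds, chimp_cds):
    """Return (codon number, human codon, chimp codon, human aa, chimp aa, kind)."""
    changes = []
    usable = min(len(human_cds), len(chimp_cds))
    for i in range(0, usable - 2, 3):
        h, c = human_cds[i:i + 3], chimp_cds[i:i + 3]
        if h != c:
            aa_h, aa_c = CODON_TABLE[h], CODON_TABLE[c]
            kind = "synonymous" if aa_h == aa_c else "non-synonymous"
            changes.append((i // 3 + 1, h, c, aa_h, aa_c, kind))
    return changes


# Made-up sequences: codon 2 is a silent change, codon 3 replaces E with D.
for change in classify_codon_changes("ATGCTTGAA", "ATGCTCGAT"):
    print(change)
```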
- the sequences can also be examined by direct sequencing of the encoded protein.
- The chimpanzee and human sequences under comparison are subjected to K_A/K_S analysis.
- Publicly available computer programs, such as Li 93 and INA, are used to determine the number of non-synonymous changes per site (K_A) divided by the number of synonymous changes per site (K_S) for each sequence under study, as described above.
- Full-length coding regions or partial segments of a coding region can be used.
- The higher the K_A/K_S ratio, the more likely that a sequence has undergone adaptive evolution.
- Statistical significance of K_A/K_S values is determined using established statistical methods and available programs such as the t-test.
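To make the ratio concrete, the following sketch estimates K_A and K_S for two aligned coding sequences. It does not reproduce the Li 93 or INA programs named above; it swaps in the simpler unweighted counting method of Nei and Gojobori with a Jukes-Cantor correction. It assumes uppercase, gap-free, in-frame sequences without stop codons, counts changes to stop codons as non-synonymous when tallying sites, and is only meaningful for closely related sequences. All function names are hypothetical.

```python
from itertools import permutations, product
from math import log

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def syn_sites(codon):
    """Synonymous sites: per position, the fraction of silent single-base changes."""
    s = 0.0
    for i in range(3):
        for b in BASES:
            mutant = codon[:i] + b + codon[i + 1:]
            if b != codon[i] and CODE[mutant] == CODE[codon]:
                s += 1 / 3
    return s


def count_diffs(c1, c2):
    """(synonymous, non-synonymous) differences, averaged over mutation orders."""
    positions = [i for i in range(3) if c1[i] != c2[i]]
    syn_total = non_total = 0.0
    n_paths = 0
    for order in permutations(positions):
        cur, syn, non = c1, 0, 0
        for i in order:
            nxt = cur[:i] + c2[i] + cur[i + 1:]
            if CODE[nxt] == "*":
                break  # skip pathways that pass through a stop codon
            if CODE[nxt] == CODE[cur]:
                syn += 1
            else:
                non += 1
            cur = nxt
        else:
            syn_total += syn
            non_total += non
            n_paths += 1
    if n_paths == 0:
        # Assumption: if every pathway hits a stop, count all as non-synonymous.
        return 0.0, float(len(positions))
    return syn_total / n_paths, non_total / n_paths


def ka_ks(seq1, seq2):
    """Return (K_A, K_S, K_A/K_S); the ratio is None when K_S is zero."""
    if len(seq1) != len(seq2) or len(seq1) % 3:
        raise ValueError("sequences must be aligned, gap-free and in frame")
    S = sd = nd = 0.0
    for i in range(0, len(seq1), 3):
        c1, c2 = seq1[i:i + 3], seq2[i:i + 3]
        S += (syn_sites(c1) + syn_sites(c2)) / 2
        s, n = count_diffs(c1, c2)
        sd += s
        nd += n
    N = len(seq1) - S  # every codon contributes three sites in total

    def jukes_cantor(p):
        return -0.75 * log(1 - 4 * p / 3)

    ka, ks = jukes_cantor(nd / N), jukes_cantor(sd / S)
    return ka, ks, (ka / ks if ks > 0 else None)
```

A ratio above 1 would point toward positive selection under this simplified model, while a ratio well below 1 would be consistent with purifying selection.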
- The sequence under study can be compared in multiple chimpanzee individuals and in other non-human primates, e.g., gorilla, orangutan, bonobo. These comparisons allow further discrimination as to whether the adaptive evolutionary changes are unique to the human lineage compared to other non-human primates.
- sequences can also be examined by direct sequencing of the gene of interest from representatives of several diverse human populations to assess to what degree the sequence is conserved in the human species.
- the intercellular adhesion molecules ICAM-1, ICAM-2 and ICAM-3 have been shown to have been strongly positively selected.
- the ICAM molecules are involved in several immune response interactions and are known to play a role in progression to AIDS in HIV infected humans.
- The ICAM proteins, members of the Ig superfamily, are ligands for the integrin leukocyte associated function 1 molecule (LFA-1). Makgoba et al. (1988) Nature 331:86-88. LFA-1 is expressed on the surface of most leukocytes, while ICAMs are expressed on the surface of both leukocytes and other cell types. Larson et al. (1989) J Cell Biol. 108:703-712. ICAM and LFA-1 proteins are involved in several immune response interactions, including
- mRNA was isolated from total RNA using the Mini-Oligo(dT) Cellulose Spin Columns (5 Prime-3 Prime, Inc.).
- cDNA was synthesized from mRNA with oligo dT and/or random priming using the cDNA Synthesis Kit (Stratagene).
- The protein-coding region of the primate ICAM-1 gene was amplified from cDNA using primers (concentration ⁇ 00 nmole/µl) designed by hand from the published human sequence.
- PCR conditions for ICAM-1 amplification were 94°C initial pre-melt (4 min), followed by 35 cycles of 94°C (15 sec), 58°C (1 min 15 sec), 72°C (1 min 15 sec), and a final 72°C extension for 10 minutes.
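The cycling program can be written down as data and its total hold time worked out, as in the sketch below. The temperatures, durations and cycle count come from the text; the step labels (denature, anneal, extend) are an assumed interpretation of the three cycled temperatures, and ramp times between steps are ignored.

```python
# (label, temperature in deg C, hold time in seconds)
PRE_MELT = ("pre-melt", 94, 4 * 60)
CYCLE = [
    ("denature", 94, 15),  # label assumed
    ("anneal", 58, 75),    # label assumed
    ("extend", 72, 75),    # label assumed
]
N_CYCLES = 35
FINAL_EXTENSION = ("final extension", 72, 10 * 60)


def total_program_seconds():
    """Sum of all hold times in the program, excluding temperature ramps."""
    per_cycle = sum(seconds for _, _, seconds in CYCLE)
    return PRE_MELT[2] + N_CYCLES * per_cycle + FINAL_EXTENSION[2]


total = total_program_seconds()
print(f"{total} s = {total / 60:.2f} min")  # prints "6615 s = 110.25 min"
```

That is, 240 s + 35 x 165 s + 600 s = 6615 s, or a little over 110 minutes of hold time.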
- PCR was accomplished using Ready-to-Go PCR beads (Amersham Pharmacia Biotech) in a 50 microliter total reaction volume. Appropriately-sized products were purified from agarose gels using the QiaQuick Gel Extraction kit
- a sequence identified by the methods of this invention may be further tested and characterized by cell transfection experiments.
- human cells in culture when transfected with a chimpanzee polynucleotide identified by the methods described herein
- Other indicia may also be measured, depending on the perceived or apparent functional nature of the polynucleotide/polypeptide to be tested.
- syncytia formation may be measured and compared to control (untransfected) cells. This would test whether the resistance arises from prevention of syncytia formation in infected cells.
- Cells which are useful in characterizing sequences identified by the methods of this invention and their effects on cell-to-cell infection by HIV-1 are human T-cell lines which are permissive for infection with HIV-1, including, e.g., H9 and HUT78 cell lines, which are available from the ATCC.
- An ICAM-1 (or ICAM-2 or ICAM-3) cDNA (or any cDNA identified by the methods described herein) can be cloned into an appropriate expression vector.
- the cloned ICAM-1 (or ICAM-2 or ICAM-3) coding region is operably linked to a promoter which is active in human T cells, such as, for example, an IL-2 promoter.
- an ICAM-1 (or ICAM-2 or ICAM-3) cDNA can be placed under transcriptional control of a strong constitutive promoter, or an inducible promoter.
- Expression systems are well known in the art, as are methods for introducing an expression vector into cells.
- an expression vector comprising an ICAM-1 (or ICAM-2 or ICAM-3) cDNA can be introduced into cells by DEAE-dextran or by electroporation, or any other known method. The cloned ICAM-1 (or ICAM-2 or ICAM-3) molecule is then expressed on the surface of the cell.
- Determination of whether an ICAM-1 (or ICAM-2 or ICAM-3) cDNA is expressed on the cell surface can be accomplished using antibody(ies) specific for ICAM-1 (or ICAM-2 or ICAM-3).
- An antibody which distinguishes between chimpanzee and human ICAM-1 (or ICAM-2 or ICAM-3) can be used.
- This antibody can be labeled with a detectable label, such as a fluorescent dye.
- Cells expressing chimpanzee ICAM-1 (or ICAM-2 or ICAM-3) on their surfaces can be detected using fluorescence-activated cell sorting and the anti-ICAM-1 (or ICAM-2 or ICAM-3) antibody appropriately labeled, using well-established techniques.
- Transfected human cells expressing chimpanzee ICAM-1 (or ICAM-2 or ICAM-3) on their cell surface can then be tested for syncytia formation, and/or for HIV replication, and/or for number of cells infected as an index of cell-to-cell infectivity.
- ICAM-1 (or ICAM-2 or ICAM-3)-expressing cells can be infected with HIV-1 at an appropriate dose, for example tissue culture infectious dose 50, i.e., a dose which can infect 50% of the cells.
- Cells can be plated at a density of about 5 x 10^5 cells/ml in appropriate tissue culture medium, and, after infection, monitored for syncytia formation, and/or viral replication, and/or number of infected cells in comparison to control, uninfected cells.
- Cells which have not been transfected with chimpanzee ICAM-1 (or ICAM-2 or ICAM-3) also serve as controls. Syncytia formation is generally observed in HIV-1-infected cells (which are not expressing chimpanzee ICAM-1 (or ICAM-2 or ICAM-3)) approximately 10 days post-infection. To monitor HIV replication, cell supernatants can be assayed for the presence and amount of p24 antigen.
- Any assay method to detect p24 can be used, including, for example, an ELISA assay in which rabbit anti-p24 antibodies are used as capture antibody, biotinylated rabbit anti-p24 antibodies serve as detection antibody, and the assay is developed with avidin-horseradish peroxidase.
- Any known method, including indirect immunofluorescence methods, can be used.
- In indirect immunofluorescence methods, human HIV-positive serum can be used as a source of anti-HIV antibodies to bind to infected cells. The bound antibodies can be detected using FITC-conjugated anti-human IgG, and the cells visualized by fluorescence microscopy and counted.
- Another method for assessing the role of a molecule such as ICAM-1 involves successive infection of cells with HIV.
- Human cell lines, preferably those that do not express endogenous ICAM (although cell lines that do express endogenous ICAM may also be used), are transfected with either human or chimpanzee ICAM-1, ICAM-2 or ICAM-3.
- HIV is collected from the supernatant of HIV-infected human ICAM-1 (or ICAM-2 or ICAM-3)-expressing cells and used to infect chimpanzee ICAM-1 (or ICAM-2 or ICAM-3)-expressing cells or human ICAM-1 (or ICAM-2 or ICAM-3)-expressing cells.
- Initial infectivity, measured as described above, of both the chimpanzee ICAM-1 (or ICAM-2 or ICAM-3)- and the human ICAM-1 (or ICAM-2 or ICAM-3)-expressing cells would be expected to be high.
- Cell-to-cell infectivity would be expected to decrease in the chimpanzee ICAM-1 (or ICAM-2 or ICAM-3)-expressing cells if chimpanzee ICAM-1 (or ICAM-2 or ICAM-3) confers resistance.
- HIV is collected from the supernatant of HIV-infected chimpanzee ICAM-1 (or ICAM-2 or ICAM-3)-expressing cells, and used to infect human ICAM-1 (or ICAM-2 or ICAM-3)-expressing cells.
- the initial infectivity would be expected to be much lower than in the first set of experiments, if ICAM-1 (or ICAM-2 or ICAM-3) is involved in susceptibility to HIV progression. After several rounds of replication, the cell to cell infectivity would be expected to increase.
- the identified human sequences can be used in establishing a database of candidate human genes that may be involved in conferring, or contributing to, AIDS susceptibility or resistance. Moreover, the database not only provides an ordered collection of candidate genes, it also provides the precise molecular sequence differences that exist between human and an AIDS-resistant non-human primate (such as chimpanzee) and thus defines the changes that underlie the functional differences.
- the human ICAM-3 protein would be rendered resistant to packaging into HIV virions, thus mimicking (in HIV-1 infected humans) the postulated pathway by which infected chimpanzees resist progression to AIDS.
- MIP-1⁇ is a chemokine that has been shown to suppress HIV-1 replication in human cells in vitro (Cocchi, F. et al., 1995 Science 270:1811-1815).
- The chimpanzee homologue of the human MIP-1⁇ gene was PCR-amplified and sequenced. Calculation of the K_A/K_S ratio (2.1, P < 0.05) and comparison to the gorilla homologue reveal that the chimpanzee gene has been positively selected.
- the nature of the chimpanzee amino acid replacements is being examined to determine how to exploit the chimpanzee protein for therapeutic intervention.
- The human gene 17-β hydroxysteroid dehydrogenase type IV codes for a protein known to degrade the two most potent estrogens, β-estradiol and 5-diol (Adamski, J. et al. 1995 Biochem J. 311:437-443).
- Chimpanzees are resistant to tumorigenesis, especially of tumors that are estrogen-related. This protein may have been positively selected in chimpanzees to allow more efficient degradation of estrogens, thus conferring upon chimpanzees resistance to such cancers. If so, the specific amino acid replacements observed in the chimpanzee protein may supply important information for therapeutic intervention in human cancers.
- a chimpanzee brain cDNA library is constructed using chimpanzee brain tissue.
- the chimpanzee brain tissue can be obtained after natural death so that no killing of animal is necessary for this study. In order to increase the chance of obtaining intact mRNAs expressed in brain, however, the brain is obtained as soon as possible after the animal's death. Preferably, the weight and age of the animal are determined prior to death.
- the brain tissue used for constructing a cDNA library is preferably the whole brain in order to maximize the inclusion of mRNA expressed in the entire brain. Brain tissue is dissected from the animal following standard surgical procedures.
- RNA is extracted from the brain tissue and the integrity and purity of the RNA are determined according to conventional molecular cloning methods.
- Poly A+ RNA is selected and used as template for the reverse-transcription of cDNA with oligo (dT) as a primer.
- the synthesized cDNA is treated and modified for cloning using commercially available kits. Recombinants are then packaged and propagated in a host cell line. Portions of the packaging mixes are amplified and the remainder retained prior to amplification.
- The library can be normalized and the number of independent recombinants in the library is determined.
- EXAMPLE 11 Sequence Comparison
- Randomly selected chimpanzee brain cDNA clones from the cDNA library are sequenced using an automated sequencer, such as the ABI 377. Commonly used primers on the cloning vector such as the M13 Universal and Reverse primers are used to carry out the sequencing. For inserts that are not completely sequenced by end sequencing, dye-labeled terminators are used to fill in remaining gaps.
- the resulting chimpanzee sequences are compared to human sequences via database searches, e.g., BLAST searches.
- the high scoring "hits," i.e., sequences that show a significant (e.g., >80%) similarity after BLAST analysis, are retrieved and analyzed.
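The retrieval of high-scoring hits can be sketched as a filter on percent identity. The sketch does not run BLAST itself; it assumes each hit is already available as a pair of aligned strings (query, subject) with gaps written as "-", and it uses the 80% figure from the text as the default cutoff. The function names and the hit names in the example are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of gap-free alignment columns in which both bases are identical."""
    pairs = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != "-" and b != "-"]
    if not pairs:
        return 0.0
    return 100.0 * sum(a == b for a, b in pairs) / len(pairs)


def high_scoring_hits(hits, threshold=80.0):
    """Keep hits above the threshold, best first.

    `hits` maps a hit name to its (aligned query, aligned subject) pair.
    """
    scored = [(name, percent_identity(q, s)) for name, (q, s) in hits.items()]
    return sorted(
        [(name, pid) for name, pid in scored if pid > threshold],
        key=lambda item: item[1],
        reverse=True,
    )


# Made-up alignments for illustration only.
example_hits = {
    "hit_1": ("ATGCATGCAT", "ATGCATGCAA"),
    "hit_2": ("ATGCATGCAT", "TTGGATCCAA"),
}
print(high_scoring_hits(example_hits))  # prints [('hit_1', 90.0)]
```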
- the two homologous sequences are then aligned using the alignment program CLUSTAL V developed by Higgins et al. Any sequence divergence, including nucleotide substitution, insertion and deletion, can be detected and recorded by the alignment.
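Recording divergence from a pairwise alignment can be sketched as below. The sketch assumes the alignment has already been produced (for example by the alignment program named above) and is supplied as two equal-length strings with gaps written as "-". Insertions and deletions are labeled relative to the human sequence only, since their true direction cannot be known without further sequences. The function name is hypothetical.

```python
def alignment_differences(human_aln, chimp_aln):
    """List divergent columns as (column, human base, chimp base, kind)."""
    diffs = []
    for col, (h, c) in enumerate(zip(human_aln, chimp_aln), start=1):
        if h == c:
            continue
        if h == "-":
            kind = "insertion relative to human"
        elif c == "-":
            kind = "deletion relative to human"
        else:
            kind = "substitution"
        diffs.append((col, h, c, kind))
    return diffs


# Made-up alignment for illustration only.
for diff in alignment_differences("ATG-CA", "ACGTC-"):
    print(diff)
```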
- the detected sequence differences are initially checked for accuracy by finding the points where there are differences between the chimpanzee and human sequences; checking the sequence fluorogram (chromatogram) to determine if the bases that appear unique to human correspond to strong, clear signals specific for the called base; checking the human hits to see if there is more than one human sequence that corresponds to a sequence change; and other methods known in the art as needed.
- Multiple human sequence entries for the same gene that have the same nucleotide at a position where there is a different chimpanzee nucleotide provide independent support that the human sequence is accurate, and that the chimpanzee/human difference is real.
- Such changes are examined using public database information and the genetic code to determine whether these DNA sequence changes result in a change in the amino acid sequence of the encoded protein.
- the sequences can also be examined by direct sequencing of the encoded protein.
- The chimpanzee and human sequences under comparison are subjected to K_A/K_S analysis.
- Publicly available computer programs, such as Li 93 and INA, are used to determine the number of non-synonymous changes per site (K_A) divided by the number of synonymous changes per site (K_S) for each sequence under study, as described above.
- This ratio, K_A/K_S, has been shown to be a reflection of the degree to which adaptive evolution, i.e., positive selection, has been at work in the sequence under study.
- full-length coding regions have been used in these comparative analyses. However, partial segments of a coding region can also be used effectively.
- The higher the K_A/K_S ratio, the more likely that a sequence has undergone adaptive evolution.
- Statistical significance of K_A/K_S values is determined using established statistical methods and available programs such as the t-test. Those genes showing statistically high K_A/K_S ratios between chimpanzee and human genes are very likely to have undergone adaptive evolution.
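One way such a significance test might be set up is sketched below. It is an assumption-laden illustration, not the procedure of any named program: it takes the observed proportions of non-synonymous and synonymous differences and their site counts as inputs, applies the Jukes-Cantor correction with its standard large-sample variance, and replaces the t-test with a one-tailed normal approximation. The function names and the example numbers are hypothetical.

```python
from math import erfc, log, sqrt


def jc_distance_and_variance(p, n_sites):
    """Jukes-Cantor distance and its large-sample variance."""
    d = -0.75 * log(1 - 4 * p / 3)
    var = p * (1 - p) / (n_sites * (1 - 4 * p / 3) ** 2)
    return d, var


def ka_exceeds_ks(p_n, n_sites, p_s, s_sites):
    """Return (K_A/K_S, z, one-tailed p) for the hypothesis K_A > K_S.

    Assumes both proportions are above 0 and below 0.75.
    """
    ka, var_a = jc_distance_and_variance(p_n, n_sites)
    ks, var_s = jc_distance_and_variance(p_s, s_sites)
    z = (ka - ks) / sqrt(var_a + var_s)
    p_one_tailed = 0.5 * erfc(z / sqrt(2))  # upper tail of the standard normal
    return ka / ks, z, p_one_tailed


# Illustrative inputs only: 10% non-synonymous differences over 700 sites,
# 2% synonymous differences over 300 sites.
ratio, z, p = ka_exceeds_ks(0.10, 700, 0.02, 300)
print(f"K_A/K_S = {ratio:.2f}, z = {z:.2f}, one-tailed p = {p:.2g}")
```

With short coding regions the variances become large, so a ratio above 1 may still fail to reach significance.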
- The sequence under study can be compared in other non-human primates, e.g., gorilla, orangutan, bonobo.
- sequences can also be examined by direct sequencing of the gene of interest from representatives of several diverse human populations to assess to what degree the sequence is conserved in the human species.
- Human brain nucleotide sequences containing evolutionarily significant changes are further characterized in terms of their molecular and genetic properties, as well as their biological functions.
- The identified coding sequences are used as probes to perform in situ mRNA hybridization that reveals the expression pattern of the gene, in terms of the tissues and cell types in which the sequences are expressed and/or when they are expressed during the course of development or during the cell cycle. Sequences that are expressed in brain may be better candidates as being associated with important human brain functions.
- The putative gene containing the identified sequences is subjected to homologue searching in order to determine what functional classes the sequences belong to.
- the identified human sequence changes may be useful in estimating the functional consequence of the change.
- a database of candidate genes can be generated. Candidates are ranked as to the likelihood that the gene is responsible for the unique or enhanced abilities found in the human brain compared to chimpanzee or other non-human primates, such as high capacity information processing, storage and retrieval capabilities, language abilities, as well as others. In this way, this approach provides a new strategy by which such genes can be identified.
- the database not only provides an ordered collection of candidate genes, it also provides the precise molecular sequence differences that exist between human and chimpanzee (and other non-human primates), and thus defines the changes that underlie the functional differences.
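A minimal in-memory sketch of such a candidate database is given below. The record fields, the ranking rule (statistically significant candidates first, then descending K_A/K_S) and the gene names and values in the example are all assumptions made for illustration; the text does not specify a ranking formula.

```python
from dataclasses import dataclass, field


@dataclass
class Candidate:
    gene: str
    ka_ks: float
    p_value: float
    # Recorded human/chimpanzee differences, e.g. (codon, human aa, chimp aa).
    changes: list = field(default_factory=list)


def rank_candidates(candidates, alpha=0.05):
    """Order candidates: significant ones first, then by descending K_A/K_S."""
    return sorted(candidates, key=lambda c: (c.p_value >= alpha, -c.ka_ks))


# Hypothetical entries, not data from the disclosure.
database = [
    Candidate("geneA", 1.2, 0.20),
    Candidate("geneB", 2.1, 0.01, [(57, "E", "D")]),
    Candidate("geneC", 3.0, 0.03),
]
for entry in rank_candidates(database):
    print(entry.gene, entry.ka_ks, entry.p_value, entry.changes)
```

Keeping the individual sequence differences alongside each ranked gene mirrors the two roles the text assigns to the database: an ordered list of candidates and a record of the changes themselves.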
- LTP long term potentiation
- EXAMPLE 14 Identification of Positive Selection in a Human Tyrosine Kinase Gene
- A human gene (AB014541) expressed in brain has been identified that has been positively selected as compared to its chimpanzee homologue.
- This gene, which codes for a tyrosine kinase, is homologous to a well-characterized mouse gene (GenBank Acc.# AF011908) whose gene product, called AATYK, is known to trigger apoptosis (Gaozza, E. et al. 1997 Oncogene 15:3127-3135).
- the literature suggests that this protein controls apoptosis in the developing mouse brain (thus, in effect, "sculpting" the developing brain).
- the tyrosine kinase domain of this protein is highly conserved between mouse, chimpanzee, and human (as are most tyrosine kinases). Interestingly, however, the region of the protein to which signaling proteins bind has been positively-selected in humans, but strongly conserved in both chimpanzees and mice. The region of the human protein to which signaling proteins bind has not only been positively-selected as a result of point nucleotide mutation, but additionally displays duplication of several SH2 binding domains that exist only as single copies in mouse and chimpanzee.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US7326398P | 1998-01-30 | 1998-01-30 | |
US73263P | 1998-01-30 | ||
PCT/US1999/001964 WO1999039006A2 (en) | 1998-01-30 | 1999-01-29 | Methods to identify polynucleotide and polypeptide sequences which may be associated with physiological and medical conditions |
Publications (1)
Publication Number | Publication Date |
---|---|
EP1051519A2 (de) | 2000-11-15 |
Family
ID=22112723
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP99904442A Ceased EP1051519A2 (de) | 1998-01-30 | 1999-01-29 | Methods to identify polynucleotide and polypeptide sequences which may be associated with physiological and medical conditions |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP1051519A2 (de) |
JP (1) | JP2002501761A (de) |
CA (1) | CA2318772A1 (de) |
WO (1) | WO1999039006A2 (de) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6280953B1 (en) * | 1998-01-30 | 2001-08-28 | Evolutionary Genomics, L.L.C. | Methods to identify polynucleotide and polypeptide sequences which may be associated with physiological and medical conditions |
US7247425B2 (en) | 1998-01-30 | 2007-07-24 | Evolutionary Genomics, Llc | Methods to identify polynucleotide and polypeptide sequences which may be associated with physiological and medical conditions |
US6866996B1 (en) | 1998-01-30 | 2005-03-15 | Evolutionary Genomics, Llc | Methods to identify polynucleotide and polypeptide sequences which may be associated with physiological and medical conditions |
EP1649067A4 (de) * | 2003-06-30 | 2007-01-03 | Evolutionary Genomics Llc | Methods to identify polynucleotide and polypeptide sequences which may be associated with physiological and medical conditions |
-
1999
- 1999-01-29 JP JP2000529463A patent/JP2002501761A/ja active Pending
- 1999-01-29 WO PCT/US1999/001964 patent/WO1999039006A2/en active IP Right Grant
- 1999-01-29 CA CA002318772A patent/CA2318772A1/en not_active Abandoned
- 1999-01-29 EP EP99904442A patent/EP1051519A2/de not_active Ceased
Non-Patent Citations (3)
Title |
---|
HUGHES A.L.; NEI M.: "NUCLEOTIDE SUBSTITUTION AT MAJOR HISTOCOMPATIBILITY COMPLEX CLASS II LOCI: EVIDENCE FOR OVERDOMINANT SELECTION", PROC. NATL. ACAD. SCI. USA, vol. 86, February 1989 (1989-02-01), WASHINGTON, DC, US, pages 958 - 962, XP001237260 * |
HUGHES A.L.; NEI M.: "PATTERN OF NUCLEOTIDE SUBSTITUTION AT MAJOR HISTOCOMPATIBILITY COMPLEX CLASS I LOCI REVEALS OVERDOMINANT SELECTION", NATURE, vol. 335, 8 September 1988 (1988-09-08), NATURE PUBLISHING GROUP, LONDON, GB, pages 167 - 170, XP001237263 * |
TANAKA T.; NEI M.: "POSITIVE DARWINIAN SELECTION OBSERVED AT THE VARIABLE REGION GENES OF IMMUNOGLOBULINS", MOLECULAR BIOLOGY AND EVOLUTION, vol. 6, no. 5, 1989, THE UNIVERSITY OF CHICAGO PRESS, US, pages 447 - 459, XP001237259 * |
Also Published As
Publication number | Publication date |
---|---|
WO1999039006A3 (en) | 1999-11-04 |
WO1999039006A2 (en) | 1999-08-05 |
JP2002501761A (ja) | 2002-01-22 |
CA2318772A1 (en) | 1999-08-05 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
US7462460B2 (en) | Methods for identifying agents that increase the p44 function of microtubule assembly or resistance to HCV infection | |
AU769931B2 (en) | Methods to identify polynucleotide and polypeptide sequences which may be associated with physiological and medical conditions | |
US20090304653A1 (en) | Methods to identify polynucleotide and polypeptide sequences which may be associated with physiological and medical conditions | |
AU2003245488A8 (en) | Functional sites | |
US6274319B1 (en) | Methods to identify evolutionarily significant changes in polynucleotide and polypeptide sequences in domesticated plants and animals | |
AU2001275303B2 (en) | Methods to identify polynucleotide and polypeptide sequences which may be associated with physiological and medical conditions | |
US20080003607A1 (en) | Methods to identify polynucleotide and polypeptide sequences which may be associated with physiological and medical conditions | |
AU2001275303A1 (en) | Methods to identify polynucleotide and polypeptide sequences which may be associated with physiological and medical conditions | |
WO1999039006A2 (en) | Methods to identify polynucleotide and polypeptide sequences which may be associated with physiological and medical conditions | |
US7247425B2 (en) | Methods to identify polynucleotide and polypeptide sequences which may be associated with physiological and medical conditions | |
WO2000012764A1 (en) | Methods to identify polynucleotide and polypeptide sequences which may be associated with physiological and medical conditions | |
Zee et al. | Frequencies of variants of candidate genes in different age groups of hypertensives | |
EP1250449B1 (de) | Methods to identify evolutionarily significant changes in polynucleotide and polypeptide sequences in domesticated plants and animals | |
AU2007202866A1 (en) | Methods to identify polynucleotide and polypeptide sequences which may be associated with physiological and medical conditions | |
EP2048249A1 (de) | Methods to identify polynucleotide and polypeptide sequences which may be associated with physiological and medical conditions | |
US20050234654A1 (en) | Detection of evolutionary bottlenecking by dna sequencing as a method to discover genes of value | |
Fearnley et al. | Ultrafast, alignment-free detection of repeat expansions in next-generation DNA and RNA sequencing data | |
AU2003298556A8 (en) | Functional sites | |
Pannecoucke | EVALUATION OF THE INVOLVEMENT OF NOTCH1 VARIANTS IN CAROTID AND VERTEBRAL ARTERY DISSECTION | |
EP1737975A4 (de) | Methods to identify evolutionarily significant changes in polynucleotide and polypeptide sequences in prokaryotes | |
ERA et al. | PHARMACOGENOMICS: PHARMACOLOGY AND |
Legal Events
Code | Title | Description |
---|---|---|
PUAI | Public reference made under Article 153(3) EPC to a published international application that has entered the European phase | Free format text: ORIGINAL CODE: 0009012 |
17P | Request for examination filed | Effective date: 2000-08-29 |
AK | Designated contracting states | Kind code of ref document: A2; Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
RAP1 | Party data changed (applicant data changed or rights of an application transferred) | Owner name: EVOLUTIONARY GENOMICS, LLC |
17Q | First examination report despatched | Effective date: 2002-12-20 |
RIN1 | Information on inventor provided before grant (corrected) | Inventor name: MESSIER, WALTER; Inventor name: SIKELA, JAMES, M. |
APBN | Date of receipt of notice of appeal recorded | Free format text: ORIGINAL CODE: EPIDOSNNOA2E |
APBR | Date of receipt of statement of grounds of appeal recorded | Free format text: ORIGINAL CODE: EPIDOSNNOA3E |
APAF | Appeal reference modified | Free format text: ORIGINAL CODE: EPIDOSCREFNE |
APBT | Appeal procedure closed | Free format text: ORIGINAL CODE: EPIDOSNNOA9E |
STAA | Information on the status of an EP patent application or granted EP patent | Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED |
18R | Application refused | Effective date: 2010-03-04 |