EP0996715A1 - Heterologous expression of proteins by "rescued" vector comprising an intron - Google Patents
Heterologous expression of proteins by "rescued" vector comprising an intronInfo
- Publication number
- EP0996715A1 EP0996715A1 EP98935143A EP98935143A EP0996715A1 EP 0996715 A1 EP0996715 A1 EP 0996715A1 EP 98935143 A EP98935143 A EP 98935143A EP 98935143 A EP98935143 A EP 98935143A EP 0996715 A1 EP0996715 A1 EP 0996715A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- gene
- intron
- expression
- construct
- sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 105
- 102000004169 proteins and genes Human genes 0.000 title claims description 30
- 239000013598 vector Substances 0.000 title abstract description 40
- 238000000034 method Methods 0.000 claims abstract description 36
- 108091026890 Coding region Proteins 0.000 claims abstract description 34
- 108020005065 3' Flanking Region Proteins 0.000 claims abstract description 29
- 108091026898 Leader sequence (mRNA) Proteins 0.000 claims abstract description 21
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 12
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 12
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 12
- 230000009261 transgenic effect Effects 0.000 claims description 37
- 108020004414 DNA Proteins 0.000 claims description 36
- 101800004937 Protein C Proteins 0.000 claims description 36
- 229960000856 protein c Drugs 0.000 claims description 32
- 108010076119 Caseins Proteins 0.000 claims description 29
- 101800001700 Saposin-D Proteins 0.000 claims description 28
- 241001465754 Metazoa Species 0.000 claims description 26
- 102000011632 Caseins Human genes 0.000 claims description 24
- 235000018102 proteins Nutrition 0.000 claims description 23
- 108010035532 Collagen Proteins 0.000 claims description 16
- 102000008186 Collagen Human genes 0.000 claims description 16
- 229920001436 collagen Polymers 0.000 claims description 15
- 108020004705 Codon Proteins 0.000 claims description 11
- 235000021240 caseins Nutrition 0.000 claims description 11
- 108010085238 Actins Proteins 0.000 claims description 10
- 108700019146 Transgenes Proteins 0.000 claims description 7
- 238000013519 translation Methods 0.000 claims description 7
- 239000005018 casein Substances 0.000 claims description 6
- 241000894006 Bacteria Species 0.000 claims description 4
- 241000233866 Fungi Species 0.000 claims description 4
- 108010011756 Milk Proteins Proteins 0.000 claims description 4
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 4
- 238000002360 preparation method Methods 0.000 claims description 4
- 108010049003 Fibrinogen Proteins 0.000 claims description 3
- 102000008946 Fibrinogen Human genes 0.000 claims description 3
- 102000014171 Milk Proteins Human genes 0.000 claims description 3
- 210000004962 mammalian cell Anatomy 0.000 claims description 3
- 235000021239 milk protein Nutrition 0.000 claims description 3
- 238000012545 processing Methods 0.000 claims description 3
- 229940012952 fibrinogen Drugs 0.000 claims description 2
- 108020004999 messenger RNA Proteins 0.000 claims 4
- 102100036546 Salivary acidic proline-rich phosphoprotein 1/2 Human genes 0.000 claims 1
- 210000000805 cytoplasm Anatomy 0.000 claims 1
- 230000006641 stabilisation Effects 0.000 claims 1
- 230000005030 transcription termination Effects 0.000 claims 1
- 239000012634 fragment Substances 0.000 description 82
- 239000013612 plasmid Substances 0.000 description 54
- 239000002299 complementary DNA Substances 0.000 description 42
- 108010060630 Lactoglobulins Proteins 0.000 description 32
- 102000017975 Protein C Human genes 0.000 description 27
- 235000013336 milk Nutrition 0.000 description 22
- 239000008267 milk Substances 0.000 description 22
- 210000004080 milk Anatomy 0.000 description 22
- 108091092195 Intron Proteins 0.000 description 18
- 102000008192 Lactoglobulins Human genes 0.000 description 17
- 238000003752 polymerase chain reaction Methods 0.000 description 17
- 241000699660 Mus musculus Species 0.000 description 16
- 238000011830 transgenic mouse model Methods 0.000 description 16
- 238000010367 cloning Methods 0.000 description 14
- 238000005516 engineering process Methods 0.000 description 14
- 108020004635 Complementary DNA Proteins 0.000 description 13
- 108700024394 Exon Proteins 0.000 description 13
- 108091034117 Oligonucleotide Proteins 0.000 description 12
- 229940100689 human protein c Drugs 0.000 description 11
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 10
- 238000003776 cleavage reaction Methods 0.000 description 10
- 230000007017 scission Effects 0.000 description 10
- 102000004190 Enzymes Human genes 0.000 description 9
- 108090000790 Enzymes Proteins 0.000 description 9
- 101500025568 Homo sapiens Saposin-D Proteins 0.000 description 9
- 229940088598 enzyme Drugs 0.000 description 9
- 239000000047 product Substances 0.000 description 9
- 235000021247 β-casein Nutrition 0.000 description 9
- 241000283690 Bos taurus Species 0.000 description 8
- 241000699670 Mus sp. Species 0.000 description 8
- 238000006243 chemical reaction Methods 0.000 description 8
- 239000000499 gel Substances 0.000 description 8
- 241001494479 Pecora Species 0.000 description 7
- 108010050808 Procollagen Proteins 0.000 description 7
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 7
- 238000010276 construction Methods 0.000 description 7
- 238000011144 upstream manufacturing Methods 0.000 description 7
- 102000007469 Actins Human genes 0.000 description 6
- 241000699666 Mus <mouse, genus> Species 0.000 description 6
- 102000015395 alpha 1-Antitrypsin Human genes 0.000 description 6
- 108010050122 alpha 1-Antitrypsin Proteins 0.000 description 6
- 229940024142 alpha 1-antitrypsin Drugs 0.000 description 6
- 229940021722 caseins Drugs 0.000 description 6
- 210000004027 cell Anatomy 0.000 description 6
- 108020005029 5' Flanking Region Proteins 0.000 description 5
- 108020003589 5' Untranslated Regions Proteins 0.000 description 5
- 108091028043 Nucleic acid sequence Proteins 0.000 description 5
- 230000000903 blocking effect Effects 0.000 description 5
- 239000000872 buffer Substances 0.000 description 5
- 230000000747 cardiac effect Effects 0.000 description 5
- 210000005075 mammary gland Anatomy 0.000 description 5
- 241000894007 species Species 0.000 description 5
- 101000741065 Bos taurus Beta-casein Proteins 0.000 description 4
- 108020004511 Recombinant DNA Proteins 0.000 description 4
- 235000013365 dairy product Nutrition 0.000 description 4
- 230000029087 digestion Effects 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 244000144972 livestock Species 0.000 description 4
- 102000035118 modified proteins Human genes 0.000 description 4
- 108091005573 modified proteins Proteins 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 239000002753 trypsin inhibitor Substances 0.000 description 4
- 239000011534 wash buffer Substances 0.000 description 4
- 108010077544 Chromatin Proteins 0.000 description 3
- 102000003839 Human Proteins Human genes 0.000 description 3
- 108090000144 Human Proteins Proteins 0.000 description 3
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 3
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 3
- 241000124008 Mammalia Species 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- 108091081024 Start codon Proteins 0.000 description 3
- 241000282887 Suidae Species 0.000 description 3
- 108020005038 Terminator Codon Proteins 0.000 description 3
- 210000003483 chromatin Anatomy 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 3
- 238000002347 injection Methods 0.000 description 3
- 239000007924 injection Substances 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 210000003205 muscle Anatomy 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 102000013415 peroxidase activity proteins Human genes 0.000 description 3
- 108040007629 peroxidase activity proteins Proteins 0.000 description 3
- 239000002953 phosphate buffered saline Substances 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 102000042089 Actin family Human genes 0.000 description 2
- 108091080272 Actin family Proteins 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 2
- 241000283707 Capra Species 0.000 description 2
- 206010053567 Coagulopathies Diseases 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 241000289695 Eutheria Species 0.000 description 2
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- 241000701024 Human betaherpesvirus 5 Species 0.000 description 2
- 102100037872 Intercellular adhesion molecule 2 Human genes 0.000 description 2
- 101710148794 Intercellular adhesion molecule 2 Proteins 0.000 description 2
- 241001529936 Murinae Species 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- 229920001213 Polysorbate 20 Polymers 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 102000004357 Transferases Human genes 0.000 description 2
- 108090000992 Transferases Proteins 0.000 description 2
- 102000004903 Troponin Human genes 0.000 description 2
- 108090001027 Troponin Proteins 0.000 description 2
- 108010000134 Vascular Cell Adhesion Molecule-1 Proteins 0.000 description 2
- 101710087237 Whey acidic protein Proteins 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 229940098773 bovine serum albumin Drugs 0.000 description 2
- 229910052791 calcium Inorganic materials 0.000 description 2
- 239000011575 calcium Substances 0.000 description 2
- 230000002759 chromosomal effect Effects 0.000 description 2
- 230000035602 clotting Effects 0.000 description 2
- 239000011248 coating agent Substances 0.000 description 2
- 238000000576 coating method Methods 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 239000003112 inhibitor Substances 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 2
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 2
- 125000001500 prolyl group Chemical group [H]N1C([H])(C(=O)[*])C([H])([H])C([H])([H])C1([H])[H] 0.000 description 2
- 239000000523 sample Substances 0.000 description 2
- 210000002460 smooth muscle Anatomy 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- 239000012089 stop solution Substances 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- XMTQQYYKAHVGBJ-UHFFFAOYSA-N 3-(3,4-DICHLOROPHENYL)-1,1-DIMETHYLUREA Chemical compound CN(C)C(=O)NC1=CC=C(Cl)C(Cl)=C1 XMTQQYYKAHVGBJ-UHFFFAOYSA-N 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 101710081722 Antitrypsin Proteins 0.000 description 1
- 101100437783 Arabidopsis thaliana BOB2 gene Proteins 0.000 description 1
- 102100035687 Bile salt-activated lipase Human genes 0.000 description 1
- 102000004506 Blood Proteins Human genes 0.000 description 1
- 108010017384 Blood Proteins Proteins 0.000 description 1
- 241000030939 Bubalus bubalis Species 0.000 description 1
- 101100028791 Caenorhabditis elegans pbs-5 gene Proteins 0.000 description 1
- 241000282832 Camelidae Species 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 102100022641 Coagulation factor IX Human genes 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 102100036912 Desmin Human genes 0.000 description 1
- 108010044052 Desmin Proteins 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 108010076282 Factor IX Proteins 0.000 description 1
- 102000001390 Fructose-Bisphosphate Aldolase Human genes 0.000 description 1
- 108010068561 Fructose-Bisphosphate Aldolase Proteins 0.000 description 1
- 229940121710 HMGCoA reductase inhibitor Drugs 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 101000823116 Homo sapiens Alpha-1-antitrypsin Proteins 0.000 description 1
- 101000946384 Homo sapiens Alpha-lactalbumin Proteins 0.000 description 1
- 101000715643 Homo sapiens Bile salt-activated lipase Proteins 0.000 description 1
- 101000763314 Homo sapiens Thrombomodulin Proteins 0.000 description 1
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 1
- 101150102264 IE gene Proteins 0.000 description 1
- 102000019223 Interleukin-1 receptor Human genes 0.000 description 1
- 108050006617 Interleukin-1 receptor Proteins 0.000 description 1
- 102000011782 Keratins Human genes 0.000 description 1
- 108010076876 Keratins Proteins 0.000 description 1
- 102000004407 Lactalbumin Human genes 0.000 description 1
- 108090000942 Lactalbumin Proteins 0.000 description 1
- 108091036060 Linker DNA Proteins 0.000 description 1
- 102000004882 Lipase Human genes 0.000 description 1
- 108090001060 Lipase Proteins 0.000 description 1
- 239000004367 Lipase Substances 0.000 description 1
- 102000005741 Metalloproteases Human genes 0.000 description 1
- 108010006035 Metalloproteases Proteins 0.000 description 1
- 101000929494 Mus musculus Adenosine deaminase Proteins 0.000 description 1
- 101000976048 Mus musculus Involucrin Proteins 0.000 description 1
- 102100030476 POU domain class 2-associating factor 1 Human genes 0.000 description 1
- 101710114665 POU domain class 2-associating factor 1 Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 108700001094 Plant Genes Proteins 0.000 description 1
- 108010022233 Plasminogen Activator Inhibitor 1 Proteins 0.000 description 1
- 102000004179 Plasminogen Activator Inhibitor 2 Human genes 0.000 description 1
- 108090000614 Plasminogen Activator Inhibitor 2 Proteins 0.000 description 1
- 102100039418 Plasminogen activator inhibitor 1 Human genes 0.000 description 1
- 108010069381 Platelet Endothelial Cell Adhesion Molecule-1 Proteins 0.000 description 1
- 102000037602 Platelet Endothelial Cell Adhesion Molecule-1 Human genes 0.000 description 1
- 102000003946 Prolactin Human genes 0.000 description 1
- 108010057464 Prolactin Proteins 0.000 description 1
- 102000004079 Prolyl Hydroxylases Human genes 0.000 description 1
- 108010043005 Prolyl Hydroxylases Proteins 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 101710132633 Protein C5 Proteins 0.000 description 1
- 229940096437 Protein S Drugs 0.000 description 1
- 102000029301 Protein S Human genes 0.000 description 1
- 108010066124 Protein S Proteins 0.000 description 1
- 101710089766 Ribonuclease P protein component Proteins 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 102000008847 Serpin Human genes 0.000 description 1
- 108050000761 Serpin Proteins 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- 102100030951 Tissue factor pathway inhibitor Human genes 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- 102000007544 Whey Proteins Human genes 0.000 description 1
- 108010046377 Whey Proteins Proteins 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 230000001475 anti-trypsic effect Effects 0.000 description 1
- 239000000074 antisense oligonucleotide Substances 0.000 description 1
- 238000012230 antisense oligonucleotides Methods 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000004888 barrier function Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 108010087173 bile salt-stimulated lipase Proteins 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 239000010839 body fluid Substances 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- GEHJBWKLJVFKPS-UHFFFAOYSA-N bromochloroacetic acid Chemical compound OC(=O)C(Cl)Br GEHJBWKLJVFKPS-UHFFFAOYSA-N 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 1
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- -1 deoxyribonucleotide triphosphates Chemical class 0.000 description 1
- 210000005045 desmin Anatomy 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 229940042399 direct acting antivirals protease inhibitors Drugs 0.000 description 1
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 235000013601 eggs Nutrition 0.000 description 1
- 230000003511 endothelial effect Effects 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 229960004222 factor ix Drugs 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- 230000023597 hemostasis Effects 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 102000052905 human CEL Human genes 0.000 description 1
- 102000051206 human THBD Human genes 0.000 description 1
- 239000002471 hydroxymethylglutaryl coenzyme A reductase inhibitor Substances 0.000 description 1
- 229960002591 hydroxyproline Drugs 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 235000019421 lipase Nutrition 0.000 description 1
- 108010013555 lipoprotein-associated coagulation inhibitor Proteins 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 1
- 238000002205 phenol-chloroform extraction Methods 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 229940097325 prolactin Drugs 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 230000004853 protein function Effects 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000007115 recruitment Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 239000003001 serine protease inhibitor Substances 0.000 description 1
- 210000002027 skeletal muscle Anatomy 0.000 description 1
- 210000002784 stomach Anatomy 0.000 description 1
- 239000001117 sulphuric acid Substances 0.000 description 1
- 235000011149 sulphuric acid Nutrition 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 238000004448 titration Methods 0.000 description 1
- FGMPLJWBKKVCDB-UHFFFAOYSA-N trans-L-hydroxy-proline Natural products ON1CCCC1C(O)=O FGMPLJWBKKVCDB-UHFFFAOYSA-N 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 239000001226 triphosphate Substances 0.000 description 1
- 235000011178 triphosphate Nutrition 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 235000021119 whey protein Nutrition 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- 235000021249 α-casein Nutrition 0.000 description 1
- 235000021241 α-lactalbumin Nutrition 0.000 description 1
- 235000021246 κ-casein Nutrition 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/8509—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells for producing genetically modified animals, e.g. transgenic
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K67/00—Rearing or breeding animals, not otherwise provided for; New or modified breeds of animals
- A01K67/027—New or modified breeds of vertebrates
- A01K67/0275—Genetically modified vertebrates, e.g. transgenic
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/78—Connective tissue peptides, e.g. collagen, elastin, laminin, fibronectin, vitronectin or cold insoluble globulin [CIG]
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/81—Protease inhibitors
- C07K14/8107—Endopeptidase (E.C. 3.4.21-99) inhibitors
- C07K14/811—Serine protease (E.C. 3.4.21) inhibitors
- C07K14/8121—Serpins
- C07K14/8125—Alpha-1-antitrypsin
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2217/00—Genetically modified animals
- A01K2217/05—Animals comprising random inserted nucleic acids (transgenic)
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2227/00—Animals characterised by species
- A01K2227/10—Mammal
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2227/00—Animals characterised by species
- A01K2227/10—Mammal
- A01K2227/105—Murine
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2267/00—Animals characterised by purpose
- A01K2267/01—Animal expressing industrially exogenous proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/008—Vector systems having a special element relevant for transcription cell type or tissue specific enhancer/promoter combination
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/42—Vector systems having a special element relevant for transcription being an intron or intervening sequence for splicing and/or stability of RNA
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/80—Vector systems having a special element relevant for transcription from vertebrates
- C12N2830/85—Vector systems having a special element relevant for transcription from vertebrates mammalian
Definitions
- This invention relates to the expression of proteins in heterologous host systems, particularly in, but not limited to, the mammary gland of transgenic animals.
- mice transgemc for the latter construct alone only one out of eight mice expressed and this at a level of only 3.9 ⁇ g/ml.
- an expression level of 108 ⁇ g/ml from a wild type human protein C cDNA construct was achieved (WO-A-9211358). This represents approximately 20% the expression level obtained with an equivalent genomic based construct.
- the cDNA construct alone gave no expression in 11 lines of transgenic mice.
- the "rescue" .phenomenon has been rationalised as follows. Strongly expressing genes have an innate ability to 'dominate' their chromosomal environment such that they are able to initiate and maintain a high expressing state. Intronless genes are deficient in some, as yet identified, feature which provides them with this capability. However, the dominant effect of the strong gene extends some way 5' and 3' to the gene itself and therefore by linking a 'weak' and 'strong' gene, some of the properties of the high expressing gene are conferred on the intronless gene. Clark and colleagues propose that this probably results in an open chromatin conformation associated with the actively expressing gene which encompasses adjacent intronless genes.
- the actively expressing gene may thus create a permissive domain allowing access to the intronless genes by the transcriptional machinery of the cell.
- the intronless construct may be inaccessible, probably residing in condensed chromatin.
- Other possible explanations for this phenomenon include enhancer-like sequences present in the actively expressing gene but absent from the intronless construct interacting positively with the latter or simply that the actively expressing gene insulates the intronless gene from the negative effects of adjacent chromatin.
- the two different genes may be present but not at the same locus. Subsequently they may segregate upon breeding. Finally, the physical structure of a BLG/pMAD array is not determined prior to injection and there is no control over it. The relative copy numbers of the two genes may vary especially if the DNA concentrations of the two constructs are not tightly controlled.
- an expression construct comprising:
- the intron (b) is not derived from the same gene as that from which either the promoter (a) or the coding sequence (c) is derived, and, in particular wherein the promoter (a) drives expression of the coding sequence (c) at a level which is elevated by virtue of the presence of the intron (b) .
- Elevated levels of expression include expression where previously none was measurable (or obtained). Elevated levels is optionally defined as a level higher than obtained by the construct without the intron (and optionally the 3' flanking sequence) described above.
- the expression construct is a DNA expression construct.
- the coding sequence is a protein-coding sequence although it may code non-protein substances such as ribozymes.
- the construct is effective for two particular reasons; firstly, the promoter (a) drives expression of the coding sequence (c) at a level which is elevated by virtue of the presence of the intron (b) and/or secondly the coding sequence (c) is more likely to be expressed in a transgenic host, by virtue of the presence of the intron (b).
- the second effect is particularly important when taking into account the length of time and the efforts required to produce transgemc animals useful as bioreactors for the production of useful proteins, etc. It is also important in laboratory scale trials to determine and obtain transgenic hosts.
- constructs as described in the claims have shown that an increased number of transgenic hosts express the coding sequence over use of the constructs without specific intron described herein (e.g see number of expressing founders in Table 3).
- the elevated level of expression of the coding sequence and/or the expression of the coding sequence may be by virtue of the presence of the intron (b) and the 3'- flanking sequence (d).
- the DNA expression construct may be useful for expression in any suitable host system such as, for example, prokaryotes, (e.g. E.col ⁇ ), fungi, plant and animal (including mammalian) cell lines and transgenic plants and animals (including mammals). However, it is in transgenic animal hosts that the expression constructs of the invention are most useful. In principle, the invention is applicable to all animals, including birds such as domestic fowl, amphibian species and fish species.
- the protein may be harvested from body fluids (such as milk, blood or urine) or other body products (such as eggs, where appropriate). In practice, it will be to (non-human) mammals, particularly placental mammals, that the greatest commercially useful applicability is presently envisaged.
- constructs of the invention may also be useful in genetic therapy in humans or other animals.
- the promoter can be any suitable promoter chosen from a gene different from the source of the intron (b). Within that constraint, it will be chosen having regard to its desired properties in the construct of the expression system to be used and its ability to derive expression of heterologous sequences in cell culture or in a transgenic organism.
- a promoter is any sequence which drives expression of a coding sequence.
- the BLG promoter does not express particularly highly in cells which do not respond to prolactin (such as COS cells).
- a 'cell' promoter according to the invention is the HCMV (human cytomegalovirus) IE gene promoter.
- promoters of the invention include, endothelial promoters such as vascular cell adhesion molecule (VCAM), platelet endothelial cell adhesion molecule- 1 (PEC AM), inter-cellular adhesion molecule-2 (ICAM) and smooth muscle promoters, such as Desrnin E and Desmin P.
- VCAM vascular cell adhesion molecule
- PEC AM platelet endothelial cell adhesion molecule- 1
- IAM inter-cellular adhesion molecule-2
- smooth muscle promoters such as Desrnin E and Desmin P.
- a preferred promoter is one which drives expression of the protein coding sequence in mammalian cells.
- the preferred expression system involves expression in the mammary gland of transgenic placental mammals.
- milk protein promoters will generally be used, preferably but not necessarily derived from the species chosen as an expression host.
- the promoter may be a casein promoter (such as an ⁇ -, ⁇ - or ⁇ -casein promoter), but it is preferred that it be a non-casein promoter, such as the human Bile Salt Stimulated Lipase (BSSL) promoter, more preferably a whey protein promoter, such as that of whey acidic protein (WAP), ⁇ -lactalbumin or, most preferred of all, ⁇ - lactoglobulin.
- Figure 3 is a schematic representation of the cloning of pCASLAC and obtaining transgene constructs therefrom.
- pCASLAC corresponds to pCASMAD ⁇ (see Fig. 2, 4 and 7) only using the more tightly regulated ⁇ -lac promoter.
- the present invention covers promoters, in the constructs described, which have not yet been isolated or characterized.
- One general way for isolating specific promoters (such as mammalian promoters) for use in the present invention is to isolate specific cDNAs by differential display or from subtractive cDNA libraries. These, in turn, are used to screen genomic libraries for the cognate promoters.
- the present invention encompasses the use of a modified low expressing naturally occurring promoter in vitro to an increased level of expression (eg. by addition of an enhancer) or to use a promoter with a higher level of expression in a crossed species (eg. the human ⁇ -lactalbumin promoter expresses better in mice than the endogenous mouse promoter).
- a modified low expressing naturally occurring promoter in vitro to an increased level of expression (eg. by addition of an enhancer) or to use a promoter with a higher level of expression in a crossed species (eg. the human ⁇ -lactalbumin promoter expresses better in mice than the endogenous mouse promoter).
- a promoter according to the invention may also be a viral or modified cellular promoter or a completely artificial promoter having the properties of high level expression (preferably mammalian species). Details of suitable promoters can be found in Houdibine, J-M., J. Biotech., 34: 269-287 (1994); Garner, I. & Dalrymple, M., in "Encyclopedia of Molecular Biology: Fundamentals and Applications ", Robert A. Myers (Ed.), Weinheim, NY.
- Element (b) of a construct of the invention is an intron whose natural position is within the 5 '-untranslated region (5'-UTR) of its natural gene (i.e the gene with which it is naturally associated).
- the whole intron is not necessarily required. Fragments or portions may be sufficient.
- the requirement for the present invention is that the level of protein expression, from any construct according to the invention, is elevated by virtue of the presence of the intron, or parts thereof. It has been shown that the first and third portions of an intron (which has been divided into three fairly equal parts), recombined, are often effective. Generally speaking, the intron for inclusion in a construct according to the invention will be the first intron of such a gene.
- genes with such known introns include; human and rat aldolase A, human type II IL-1 receptor, human UDP-N- acetylglucosaminyl transferase, mouse involucrin and mouse adenosine deaminase.
- Some genes have more than one intron whose natural positions are all within the 5' untranslated region of its natural gene.
- the present invention recognises this and covers, within element (b), one or more of such introns (for example in a gene with two introns naturally positioned in the 5' UTR, they may separately, together or parts of each cojoined be included in a part of a construct according to the invention).
- introns whose natural position is within the 5' untranslated region of its natural gene may be used according to the present invention. Also included are: the introns of several gene families including; the actin family (two skeletal muscle actins-alpha cardiac and alpha skeletal, two smooth muscle actins-alpha smooth and gamma smooth, and two non- muscle actins-beta and gamma cytoplasmic actin), the troponin family (cardiac, skeletal and foetal troponins) and the casein family ( ⁇ SI, ⁇ S2, ⁇ and K).
- the intron is the first intron of the family.
- the most preferred gene family from which the intron may come is the casein family or the actin family.
- the intron may preferably be from the same source of organism as the promoter and/or the expression system which it is in (e.g. mammalian, bovine, ovine, etc.).
- DNA expression constructs of the present invention are different from that of Barash et al. (Nucl. Acids Res. 24(4) 602-610 (1996)), in that Barash et ⁇ /. 's constructs include ⁇ -lactoglobulin intragenic sequences which are not within the 5'- untranslated region. Barash et al. do not refer to the possibility of using an intron whose natural position is within the 5 '-untranslated region of its natural gene.
- Caseins whose genes represent a preferred source of the 5' -UTR introns useful in the present invention, are the major mammalian milk proteins and are encoded by a small gene family, which in cows and sheep consists of four members, ⁇ sl , ⁇ , ⁇ s2 and K, and in mice and rats five, ⁇ , ⁇ , ⁇ , ⁇ and K (Yu-Lee & Rosen, J. Bio I Chem., 258 10794-10804 (1983); Jones et al, J. Biol. Chem., 206 7042- 7048(1985); Thompson et al, DNA, 4 263-271 (1985)/ reviewed by Mercier & Vilotte, J.
- the first intron of the calcium sensitive casein genes is naturally positioned in the
- the intron may be obtained by PCR amplification from genomic DNA. The resulting DNA fragment may be cloned into a suitable site of an appropriate vector, such as the pMAD6 vector described above.
- Constructs of the invention also contain a coding sequence (c), whose expression is driven by the promoter (a) under the beneficial influence of the intron (b).
- the protein-coding sequence may code for any (natural or modified) protein of interest, particularly those which may be advantageously produced in the preferred mammary gland expression systems.
- proteins involved in haemostasis including factors V, VII, VIII, IX, X, XIII, PAI-1, PAI-2, TFPI
- protein C protein C
- protein S protein S
- alpha 1-antitrypsin (AAT) details of which can be found in general from Perlino et al EMBO Journal, 6, 2767-2771, 1987 and WO90/05188
- tPA alpha 1-antitrypsin
- fibrinogen details for which may be found in WO95/23868 and the references cited therein
- other protease inhibitors such as serpins, Kazal/Kunitz inhibitors, kinninogens, stefms, cy statins or tissue inhibitors of metalloproteinases
- growth factors protein hormones
- structural proteins such as collagens (details of which may be found in
- BLG promoter + bovine ⁇ -casein intron 1 + BLG 3' sequence (particularly the 3' sequence beginning immediately 3' to the natural ⁇ - lactoglobulin stop codon and continuing to at least about 30 bases 3' of the poly-A site), optionally including ovine beta- lactoglobulin intron 6 (preferred positioned 5' to the flanking sequence and 3' to any coding sequence); in particular the construct pCASMAD ⁇ as described in Fig. 2, 4 or 7;
- BLG promoter + muscle cardiac actin intron 1 + BLG 3' sequence (particularly the 3' sequence beginning immediately 3' to the natural ⁇ - lactoglobulin stop codon and continuing to at least about 30 bases 3' of the poly-A site), optionally including ovine beta-lactogloulin intron 6 (preferred position 5' to the flanking sequence and 3' to any coding sequence); in particular the construct pACTMAD ⁇ as described in Fig. 2, 5 or 7;
- BLG promoter + ovine ⁇ -casein intron 1 -I- ovine ⁇ -casein 3' flanking sequence in particular the construct pBOB as described in Fig 2, 6 or 7.
- 3 '-sequences as may be necessary or appropriate. In the invention at its broadest, it is not thought that the nature of such 3 '-sequences is particularly limited.
- the 3'- flanking sequence may or may not include its natural intron.
- Suitable 3' flanking sequences preferably comprise functional elements which are able to direct the correct transcription, termination and 3' end processing. These can be determined, without undue burden, by the person skilled in the art.
- 3 '-flanking sequences have been found to be particularly useful. These include, but are not restricted, to: (i) a poly-A site (poly A addition site), (ii) a ⁇ -lactoglobulin gene 3 '-sequence beginning immediately 3' to the natural ⁇ -lactoglobulin stop codon and continuing to at least about 30 bases 3' of the poly-A site (as found in pMAD6 and pCASMAD ⁇ and PACTMAD6), or (iii) ⁇ -casein 3' sequences including poly A signal. These sequences (as used in pBOB) consist of 6.5Kbp of DNA incorporating ovine ⁇ -casein exons 7 to 9, introns 7 and 8, and approximately 4.8Kbp of 3 ' sequence .
- the ⁇ -casein 3' sequences may be cloned from the ⁇ -casein gene cr amplified by PCR and cloned from genomic DNA, which may be of ovine origin.
- Appropriate signal and/or secretory sequences, operably linked to the construct may be present if necessary or desirable.
- the invention is directed to:
- the process comprises linking together selected nucleotide bases and/or nucleotide sequences;
- the vector may be plasmid, phage, cosmid or other vector type, for example derived from yeast.
- the vector may be an expression vector;
- a process for the preparation of a host comprising introducing a DNA expression construct (or a vector), as described above, into a suitable organism; the process, in particular provides a host which expresses elevated levels of the coding sequence in the construct;
- a host organism preferably an expression host organism
- the host may be a eukaryotic or prokaryotic cell/organism, such as bacteria, insect or yeast cells, as well as animal tissues (cells in culture) and animals themselves.
- Such animals are transgenic and preferred transgenic animals include mammals, in particular non-human placental mammals such as pigs, sheep, cattle and goats.
- the host e.g.
- transgemc animal according to the invention has the construct (of the first aspect of the invention) integrated into its genome. It is particularly preferred that the transgenic animal transmits the construct to its progeny, thereby enabling the production of at least one subsequent generation of producer animals.
- a host organism in particular expresses elevated levels of the coding sequence in the construct;
- the protein may be a fusion protein
- nucleic acid expression construct comprising a promoter, an intron whose natural position is within the 5 '-untranslated region of a gene from which it is derived, a coding sequence and a 3' flanking sequence to obtain a transgenic host, preferably with elevated levels of the expressed coding sequence;
- nucleic acid construct comprising a promoter, an intron whose natural position is within the 5' untranslated region of a gene from which it is derived, a coding sequence and a 3' flanking sequence to increase the likelihood of expression of the coding sequence from a transgenic host which incorporates the nucleic acid construct; • a process for improving whether an individual or a number of transgenic hosts express a transgene coding sequence, the process comprising introducing into a host, a nucleic acid construct comprising a promoter, an intron whose natural position is within the 5' untranslated region of a gene from which it is derived, a coding sequence and a 3' flanking sequence.
- an empty 'cassette' including all feamres in claim 1, without the coding sequence.
- Such a "cassette” provides an easy means by which to provide a high expressing vector for other parties to use by simply introducing coding sequences of interest by restriction endonuclease cutting of the empty cassette and religation (according to standard techniques).
- the empty "cassette” is for use with an incorporated coding sequence of interest.
- Preferred feamres for each aspect of the invention are as for each other aspect mutatis mutandis.
- the present invention also provides, as a separate aspect, the novel expression of collagen cDNA (natural procollagen chains or modified collagen).
- collagen cDNA naturally procollagen chains or modified collagen
- the collagen cDNA is expressed via a construct according to the first aspect of the invention.
- Preferred feamres of and all different aspects of the invention described herein above in relation to the construct also apply to the expression of collagen cDNA.
- Particular preferred details in relation to collagen are described above under a discussion of the protein-coding sequences, including references thereto.
- expression hosts may co-express prolyl 4-hydroxylase, which is a post- trans lational enzyme important in the natural biosynthesis of procollagen.
- FIGURE 1 shows the ⁇ -lactoglobulin (BLG) sequences and the plasmids pMAD and pMAD ⁇ .
- FIGURE 2 shows the origin of sequences present in plasmids pMAD, pMAD6, pCASMAD ⁇ and pACTMAD ⁇ .
- FIGURE 3 shows the construction of pCASLAC
- FIGURE 4 shows the construction of pCASMAD ⁇ .
- FIGURE 5 shows the construction of pACTMAD ⁇ .
- FIGURE 6 shows the construction of pBOB.
- FIGURE 7 shows details of pMAD, pMAD ⁇ , pCASMAD ⁇ , pACTMAD ⁇ and pBOB.
- Preferred embodiments of the invention are based on the use of the BLG promoter, and are designed to express cDNAs from the BLG gene.
- the structure of pMAD ⁇ is indicated in Figures 1, 2 and 7. This vector contains the same 5' and 3' flanking sequences present in the ovine BLG gene which itself always gives rise to high level expression in transgenic mice.
- the resulting DNA fragment of approximately 2 Kbp, was cloned and subsequently subcloned into the EcoRV site of pMAD ⁇ in such a way that the original EcoRV site was destroyed and reformed on the 3' side of the intron.
- a cDNA encoding human protein C was inserted into the unique EcoRV site of pCASMAD ⁇ and the new construct called pCORI69.
- the cDNA utilised encodes a mutant form of the natural protein C (PC962): the mutation was designed to allow more efficient processing of the mature protein (Foster et al, Biochemistry, 29:347-354 (1990)).
- This mutant form of the human protein C cDNA has been incorporated into a construct pCORP9, exactly analogous to pCORP2 (see WO-A-9211358).
- pCORP9 expressed particularly poorly, the highest expressing line being 3 ⁇ g/ml compared to 108 ⁇ g/ml for the wild type cDNA.
- All references to the DNA sequence of the ⁇ -lactoglobulin gene utilise the numbering of the sequence allocated ⁇ MBL Accession No. X 12817 (Harris et al , NAR 16: 10379-80 91988).
- the multiple cloning site of the vector pUC18 (Yanisch-Perron et al. , (1985) Gene 33: 103-119) was removed and replaced with a synthetic, double stranded, oligonucleotide containing the new restriction sites: PvuVMluVSall/EcoRY/Xbal/ Pvull MM, and flanked by 5 '-overhangs compatible with the restriction sites EcoRI and HmdIII.
- pUC18 DNA was cleaved with both EcoRI and HmdIII and the new linker DNA was ligated into pUC18. The DNA sequence across the new multiple cloning site was confirmed. This new vector was called pUCPM.
- Plasmid pUCXS The ⁇ -lactoglobulin gene sequences from plasmid pSSltgXS (see WO-A-9201358) were excised on a SaWXbal fragment and recloned into the vector pUCPM, cut with Sail and Xbal, to give plasmid pUCXS.
- Plasmid pUCXS/RV Plasmid pUCXS/RV
- the plasmid pSSltgSE contains: ⁇ -lactoglobulin gene sequences from the Sphl site at position 754 to the EcoRI site at 2050, a region spanning a unique NotI site at position 1148.
- This insert contains a single Pvull site (832) which lies in the 5 '-untranslated region of the ⁇ -lactoglobulin mR ⁇ A.
- the D ⁇ A sequences bounded by Sphl and NotI were then excised and used to replace the equivalent fragment in the plasmid pUCXS, thus effectively introducing a unique EcoRV site into the ⁇ -lactoglobulin gene placed in such a way as to allow the insertion of any additional D ⁇ A sequences under the control of the ⁇ - lactoglobulin gene promoter and 3' to the initiation of transcription.
- the resulting plasmid was called pUCXS/RV.
- pUCXS/RV A derivative of pUCXS/RV, containing only the 4.3 Kbp of the ⁇ -lactoglobulin gene which lie 5' to the transcription initiation site (the promoter), was constructed by subcloning the S ⁇ /I-EcoRV fragment into pUCPM; this plasmid is called pUCSV.
- Plasmid pBLAClOO A fragment of the 3' flanking sequence of the ⁇ -lactoglobulin gene was subcloned in such a way as to eliminate all introns. Plasmid D ⁇ A of pUCXS/RV was partially digested with Smal by performing an enzyme titration with lower and lower concentrations of enzyme at a fixed D ⁇ A concentration. The Smal protein was removed by phenol-chloroform extraction and ethanol precipitation and the D ⁇ A resuspended in water. This D ⁇ A was subsequently digested to completion with the enzyme Xbal. DNA cut once at the Smal site, position 5286 and then cleaved with Xbal gave a characteristic band of size 2.1 Kbp. This band was purifL ⁇ from an agarose gel slice and ligated into Smal and Xbal cut pBSIISK+ (Stratagene Ltd., Cambridge Science Park, Cambridge, UK) to give the plasmid pBLAClOO.
- the ⁇ -lactoglobulin cloning vector pMAD was constructed to allow rapid insertion of cDNAs under the control of the ⁇ -lactoglobulin gene promoter and 3 '-flanking sequences. Such constructs contain no introns.
- the plasmid pBLAClOO was opened by digestion with both EcoRV and Sail, the vector fragment was gel purified. Into this was ligated the 4.3 Kbp promoter fragment from the plasmid pUCSV as a Sall-EcoRV fragment. This construct is termed pSTl and constitutes a ⁇ -lactoglobulin mini-gene encoding the 4.3 Kbp promoter and 2.1 Kbp of 3' flanking sequences.
- a unique EcoRV site is present to allow blunt-end cloning of any auditional DNA sequences.
- Mlul the entire mini-gene from pSTl was excised on a Xhol-Notl fragment, the DNA termini made flush with Klenow polymerase, under standard conditions, and blunt-end cloned into the EcoRV site of pUCPM to give pM AD.
- Two primers complementary to sequences at the 5' and 3' boundaries of the first intron of the ovine BLG gene, were used to amplify a ⁇ 650bp fragment encompassing the entire sequence of intron 1 of the BLG gene from pUCXSRV template.
- the primers introduce a 5' Smal site and a 3' EcoRV site at the ends of the PCR fragment.
- This fragment was cloned in Eco RV digested pBluescriptSK to which single 3' dATP overhangs were added, using Taq polymerase.
- This construct was named pSTIl.
- the orientation of the insert with respect of the multiple cloning site in pSTIl was determined by restriction digestion.
- the intron 1 sequence was excised from pSTIl on a 5' Smal -3'Hind l fragment, the recessed 3' terminus generated at the Hindlll end was repaired using Klenow, and the resulting blunt ended fragment was ligated with EcoRV digested pMAD to make pMADl .
- the correct orientation of the intron fragment with respect to the remainder of the BLG sequences was determined by DNA sequencmg. This step effectively moves the EcoRV site to the 3' end of the BLG intron.
- Plasmid pCORP2 (see WO-A-9211358)
- the cDNA was excised as a Kpnl fragment, the 3' overhangs made flush by treatment with T4 DNA polymerase, the fragment gel purified and blunt-end cloned into the EcoRV site of pMAD. Orientation was determined by restriction digest and confirmed by DNA sequencing.
- This construct is plasmid pCORP2 and contains the human protein C cDNA under the transcriptional control of the ⁇ -lactoglobulin gene 5' and 3' flanking sequences. There are no introns. Plasmid pCORP5
- the 1450bp protein C cDNA fragment used in the construction of pCORP2 was placed into pMADl ⁇ to make pCORP5.
- PC962 (Foster et al ., ibid), into pMAD, the plasmid was modified to incorporate EcoRV sites at the extremities of the protein C cDNA insert.
- a 769 bp Sstll-Pstl fragment encompassing the 3' end of PC962 was cloned between the Sstll and Pstl sites of pBluescript II SK+ (Stratagene, La Jolla, CA). The fragment was excised with Sstll/ EcoRV and purified.
- the 5' portion of PC962 was modified by PCR.
- the sense oligonucleotide primer for this reaction covered the 5' ATG region of the cDNA and provided an EcoRV site upstream of this in the product.
- the antisense oligonucleotide primer covered the Sstll site used to generate the Sstll - EcoRV fragment.
- the resulting PCR product was digested with EcoRV and Sstll and ligated with the Sstll-EcoRV 3' fragment and EcoRV digested pMAD.
- the resulting plasmid, designated pCORP9 effectively contained the PC962 cDNA flanked by EcoRV sites in an intronless fusion driven by the ⁇ - lactoglobulin promoter.
- This genomic construct designated GPClO-1, changed the sequence 16 base pairs upstream of the ATG from the native protein C sequence to the ⁇ -lactoglobulin sequence and introduced mutations in the propeptide cleavage site located in exon 2, and the two-chain cleavage site located in exon 6, as described below.
- the construct was assembled using four fragments designated A, B, C and D and encompassed the protein C gene sequence from the ATG to a BamHl site in exon VIII, immediately upstream of the stop codon.
- the fragments were generated from a human genomic library in Charon 4A phage which was screened with a radiolabeled cDNA probe for human protein C.
- oligonucleotides ZC6303 (5'-ATT TGC GGC CGC CTG CAG CCA TGT GGC AGC TCA CAA GCC TCC TGC-3') and ZC6337 (5'-CAG GAA GGA GTT GGC GCG CTT GCG CCG TTG CAG CAC CTG CTG GGC-3", a D ⁇ A fragment was generated by polymerase chain reaction (PCR).
- Oligonucleotide ZC6303 changed the sequence 16 based pairs 5' to the ATG sequence from the native protein C sequence to the equivalent sequence from the ⁇ -lactoglobulin gene and introduced a NotI site.
- Oligonucleotide ZC6337 changed the propetide cleavage site from Arg-Ile-Arg- Lys-Arg to Gln-Arg-Arg-Lys-Arg.
- the resulting PCR generated fragment was digested with NotI and BssHlL, and a 1402 base pair fragment was gel purified and designated Al.
- a second fragment was prepared using a ⁇ gtll clone of PC ⁇ l as a template with oligonucleotides ZC6306 (5' -CTT CTT CCT GAA TTC TGT TTC TTG C-3') and ZC6338 (5' -CGG ATC CGC AAG CGC GCC AAC TCC TTC C-3') in a polymerase chain reaction.
- the resulting D ⁇ A fragment, designated A3 was digested with 5wHII and EcoRI and gel purified, resulting in a 296 base pair fragment.
- Fragments Al and A3 were ligated into the Bluescript II KS + phagemid vector (Stratagene. La Jolla, CA).
- the resulting plasmid, designated GPC 2-2 was digested with NotI and EcoRI, gel purified and the Notl-EcoRI D ⁇ A fragment was designated Fragment A.
- pCR 2-14 is a subclone which contains an EcoRI to EcoRI DNA fragment of PC ⁇ 8 (Foster et al ., 1985, ibid.).
- the plasmid was digested with EcoRI and SstI and gel purified. The resulting figment was designated Fragment B.
- Plasmid pCR 2-14 was used as a template DNA with oligonucleotides ZC6373 (5' -AAA GTA AAA AAA GAT CTA AAA ATT TAA C-3') and ZC6305 (5' - GTG TCT CGT TTT CTT AAG TGA CTG CGC-3'), which introduced za AfTO. site and the RRKR mutation of the native (KR) two-chain cleavage site, in a polymerase chain reaction.
- the resulting PCR-generated fragment was digested with Bgl ⁇ l and Afl ⁇ and gel purified, resulting in a 1441 base pair fragment, designated ⁇ l.
- Fragment ⁇ I was used in a ligation reaction with oligonucleotides ZC6302 (5' -TTA AGA AGA AAA CGA GAC ACA GAA GAC CAA GAA GAC CAA GTA GAT CCG C-3') and ZC6304 (5' -GGA
- a fourth fragment was generated by digestion of a genomic subclone (pHCB7-l) of PC ⁇ 8.
- pHCB7-l contained a Bglll to Bglll fragment that encompassed exons VI through VIII.
- pHCB7-l was digested with Sstll and Bam ⁇ l and a 2702 base pair fragment was gel purified. The fragment was designated Fragment D.
- a five-part ligation reaction was prepared using NotI and Bam ⁇ l digested and linearized Bluescript II KS+ phagemid vector (Stratagene) with Fragment A (5' NotI to 3' EcoRI) that contained exons I and II, Fragment B (5' EcoRI to 3 'SstI) that contained exons III, IV and V, Fragment C (5' SstI to 3' Sstll) that contained the 5' portion of exon VI and Fragment D (5' Sstll to BamHl) that contained the remaining 3' portion of exon VI and exons VII and VIII.
- Fragment A 5' NotI to 3' EcoRI
- Fragment B (5' EcoRI to 3 'SstI) that contained exons III, IV and V
- Fragment C (5' SstI to 3' Sstll) that contained the 5' portion of exon VI
- Fragment D (5' Sstll to BamHl) that contained the remaining 3
- the resulting D ⁇ A was 8950 base pairs and designated GPC 10-1.
- GPClO-1 was originally generated with BLG sequences and a NotI site upstream of the ATG initiator codon and modifications to both cleavage sites.
- pPC12/BS was generated using PCR amplification of a 1 kb ⁇ Otl- Sc ⁇ l fragment that covered the 5' region of the protein C gene and contained the wild-type ATG codon environment. This introduced an EcoRV site immediately s downstream of the NotI site, adjacent to the ATG codon, and a BamRl site was incorporated 3' of the Seal site to facilitate cloning.
- the PCR product was cloned into Notl/BamUl digested Bluescript II KS+ phagemid vector (Strategene).
- the Notl-EcoRV-Scal fragment present in pPC12/BS was excised, purified and ligated to GPClO-1, which had been o linearized with NotI and partially digested with Seal (the pUC amplillicin gene has an internal Seal site).
- the resulting clone was designated GPC 10-2 and possesses an EcoRV site immedately upstream of the ATG initiator codon.
- GPC 10-1 and GPC 10-2 both terminated at the final 5 ⁇ mHI site in exon VIII of the protein C gene.
- oligonucleotides were synthesized with flanking 5 ⁇ mHI (5') and Bglll (3') restriction sites. Following annealing of the oligonucleotides, the product was cloned into 5 ⁇ mHI digested pBST + to generate plasmid pPC3 ' .
- pBST+ is a derivative of pBS (Stratagene) with a new polylinker. The addition of the polylinker added Bglll, Xhol, Narl and CZ ⁇ l restriction sites from the o vector polylinker downstream of the destroyed Bglll site of the oligonucleotide construct.
- Notl-BamHl fragment of GPC 10-1 was subcloned into NotI/ BamRl digested pPC3' to add 3' coding sequences of protein C, the TAG termination codon followed by Bglll-Xhol-Narl-Clal.
- the 3' region of the protein C gene beginning with the EcoRV site in intron V was excised from this plasmid on an EcoRV-C/ ⁇ l fragment.
- a further genomic construct was generated from pCORP13 which contained only the modified two-chain cleavage site. This was achived using PCR amplification to modify two fragments which result in restoration of the coding capabilitiy of exon 2 from the mutant Gln-Arg-Arg-Lys-Arg to the wild-type Arg-Ile-Arg-Lys- Arg. pCORP13 was used as template for these reaction. The first fragment was
- the sense primer was designed to add a HmdIII site 5' to the EcoRV site proximal to the ATG initiation codon.
- the antisense primer was designed to restore the wild-type sequences in exon 2, which included a restored 5 ⁇ m ⁇ I site.
- a 7.5kb Xhol fragment from pCORP13 was ligated to Xhol digested pGEMPC1.5 to generate a complete protein C genomic sequence covering exons 1-8 with a wild-type propeptide cleavage site and a modified two-chain cleavage site.
- the plasmid was designated pGEMPC14.
- the sequence was excised from pGEMPC14 as a Hindl ⁇ l/Sall fragment. The DNA termini was repaired using a Klenow reaction and the fragment was blunt-end ligated into EcoRV digested pMAD ⁇ to generate pCORP14.
- the modified protein C cDNA (PC962) was excited from the plasmid pCORP9 (see above) as an EcoRV fragment and ligated with EcoRV pMAD ⁇ .
- the resulting construct has been named pCORPl ⁇ .
- Bovine ⁇ -Casein intron 1 (BBCI 1; BOVCASl (5'-AGG CCT ATT CAG CTC CTC CTT CAC TTC TT-3') and BOVCAS2 (5'-GAT ATC GGC TCT CAA TTC CTG GGA ATG GG-3') approximately 2 Kbp) was PCR amplified from dairy cow DNA.
- the 5' primer incorporates a Stul site and the 3' primer incorporates an EcoRV site.
- the purified 2 Kb fragment was cloned into the pG ⁇ M-T vector (Promega) to give construct p ⁇ ' 10.
- Plasmid pCASMAD ⁇ pMAD ⁇ was modified by inserting a linker, containing Spe l/Not I/Sac II sites, into the EcoRV site. Both orientations of the linker were obtained and thus two new cloning vectors were obtained. These were called pMAD ⁇ /STOPS (5'
- the modified protein C cDNA (PC962) was excised from the plasmid pCORP9 (see above) as an EcoRV fragment and ligated with EcoRV digested pCASMAD ⁇ . This places the AUG translation start downstream with respect to the ⁇ -casein intron sequence.
- the resulting construct was named pCOR169.
- Two primers (Sequences ACTPl 5'-AGG CCT AGT GCC TGC CAC CAG CGC CAG CC-3' ACTP2 5' -GAT ATC CCT GGC AC A GCT TTG TGT GGT TC-3') complementary to the opposing strands of the 3' end of the first exon and the 5' end of the second exon of the murine cardiac actin gene respectively, were used in a PCR reaction to amplify a 0.8 Kb fragment encompassing the intervening sequences from a template of mouse genomic DNA.
- the two primers introduced a 5' SnaBl and a 3' EcoRV restriction site at the ends of the PCR product. This DNA fragment was cloned in pG ⁇ M-T to give a construct which was named pG ⁇ M-AI. DNA sequence analysis confirmed that the sequence of the amplified product beyond the primers matched that published for the murine beta actin gene.
- actin intron 1 sequence was excised from pG ⁇ M-AI on a 5' Sn ⁇ BI- 3 'EcoRV fragment which was then ligated with ⁇ coRV digested pMAD ⁇ to give vector pACTMAD ⁇ .
- This cloning step effectively moves the EcoRV site from the 3' end of the BLG promoter downstream to the 3 ' end of the actin gene intron segment.
- the modified protein C cDNA (PC962) was excised from the plasmid pCORP9 as an EcoRV fragment and ligated with EcoRV digested pACTMAD ⁇ . This places the AUG translation start downstream with respect to the actin intron sequence.
- the resulting construct was named pCOR170.
- PCR primers were designed to amplify the region of the ovine ⁇ -casein gene from exon 1 to exon 2 (BOB1: 5'-CGG GAT CCG TCG ACC ATT CAG CTT CTC CTT CAC TTC TTC TC-3'; BOB2: 5'-CGG GAT CCG GGT CCC TAC GTA GGC TCT CGA TTC CTG TGA ATG GGA-3').
- the size of this product is 2.1Kbp and has, engineered into it, the sites BamRl/Sall at the 5' end and BamRl/ Ppuml/ SnaBl at the 3' end.
- a Xhol linker had been cloned into the EcoRV site of pMAD ⁇ and the modified plasmid named pMADX.
- the ovine BLG promoter from pMAD6X was cloned into the Sail site of pBOB ⁇ prom as a SaWXhol fragment giving rise to pBOB.
- the modified protein C cDNA (PC962) was excised from the plasmid pCORP9 as an EcoRV fragment and ligated with EcoRV digested pBOB. This places the AUG translation start downstream with respect to the actin intron sequence.
- the resulting construct was named pCORB71.
- Purified human Protein C stored at 50 ⁇ g/ml in Phosphate Buffered Saline (PBS)/1 % bovine Serum Albumin (BSA) at 20 ° C. Dilute to 500ng/ml in blocking buffer for use. Standard curve range of ELISA is 3.9-125ng/ml.
- Dako Rabbit Anti-human Protein C antibody Peroxidase conjugate Dilute 1/5000 in blocking buffer.
- Reference human plasma is used as a protective control at 1/40 dilution.
- Transgenic mice were prepared as in Example 1. Analysis of AAT in the milk of transgenic mice was according to standard procedures, for example as described in Wright, G. , Carver, A., Cottom, D., Reeves, D. , Scott, A. , Simons, J.P., Wilmut, I., Garner, I., and Colman A., 1991. High level expression of active human ⁇ l antitrypsin in the milk of transgemc sheep. Bio/Technology 9: 830-834.
- Constructs pMAD ⁇ and pCASMAD ⁇ were prepared incorporating DNA encoding an antibody binding fragment to give constructs pMAD6-AB and pCASMAD ⁇ - AB.
- the constructs were used to obtain transgenic mice according to Example 1. Expression of the antibody fragment was determined by standard protocols.
- Const r ucts pMAD ⁇ and pCASMAD ⁇ were prepared incorporating DNA encoding IgG to give constructs pMAD6-IgG and pCASMAD6-IgG.
- the constructs were used to obtain transgenic mice according to Example 1. Expression of IgG in the mice milk was determined by standard ELISA protocol.
- pMAD ⁇ and pCASMAD ⁇ were prepared incorporating DNA encoding a soluble adhesion molecule (SAM) to give constructs pMAD ⁇ -SAM and pCASMAD ⁇ -SAM.
- SAM soluble adhesion molecule
- the expression level range ( ⁇ g/ml) in pCASMAD ⁇ -SAM transgenic mice was up to 500.
- the maximum level detected in the pMAD ⁇ -SAM transgenic mice was 80.
- the CASMAD6 vector was used.
- Collagen cDNA human truncated pro-collagen ⁇ 2(l) homotrimer
- This was coinjected with two transgenes expressing ⁇ and ⁇ subunits of prolyl 4- hydro yiase, an enzyme for the post-translational modification of procollagen.
- Transgenic animals were obtained as in Example 1. Determination of collagen expression in mouse milk was as described according to standard protocols and described in WO97/08311.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- Medicinal Chemistry (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- General Engineering & Computer Science (AREA)
- Gastroenterology & Hepatology (AREA)
- Immunology (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Veterinary Medicine (AREA)
- Environmental Sciences (AREA)
- Plant Pathology (AREA)
- Toxicology (AREA)
- Animal Behavior & Ethology (AREA)
- Animal Husbandry (AREA)
- Biodiversity & Conservation Biology (AREA)
- Cell Biology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
A nucleic acid expression construct comprising: (a) a promoter; (b) an intron whose natural position is within the 5'-untranslated region of a gene from which it is derived; (c) a coding sequence; and (d) a 3'-flanking sequence wherein the intron (b) is not derived from the same gene as that from which either the promoter (a) or the protein-coding sequence (c) is derived and processes, vectors, hosts and uses involving such a construct to obtain inter alia an increase in the level of expression of the coding sequence.
Description
HETEROLOGOUS EXPRESSION OF PROTEINS BY " RESCUED" VECTOR COMPRIS ING AN INTRON
This invention relates to the expression of proteins in heterologous host systems, particularly in, but not limited to, the mammary gland of transgenic animals.
5
It has been shown, using regulatory DNA elements from milk protein genes, that it is possible to express heterologous proteins in the milk of transgenic livestock. One such gene, that for ovine β-lactoglobulin (BLG), has been cloned and characterised (Ali and Clark, J. Mol. Biol, 199 145-426(1988)). The authors lo subsequently demonstrated consistent, high level, expression of ovine BLG in the milk υf mice transgenic for the entire gene (Simons et al, Nature, 328 530- 532(1987); Harris et al, Developmental Genetics, 12 299-307(1991)). Further experiments demonstrated that the BLG promoter region can direct high levels of expression of a heterologous human protein to the milk of transgenic mice is (Archibald et al, Proc. Natl Acad. Sci. USA, 87 5178-5182(1990)). The generation of sheep, expressing human proteins in their milk using BLG regulatory elements, indicated that this technology was applicable to transgenic livestock (Simons et al, Bio/Technology, 6 179-183(1988); Clark et al, Bio/Technology, 7 487-492(1989)). The commercial feasibility of this technology, as a means of
20 producing recombinant therapeutics in livestock milk, has been confirmed by the demonstration of high level expression of human αrantitrypsin in the milk of transgenic sheep (Wright et al, Bio/Technology, 9 830-834(1991); Carver et al, Cytotechnology, 9 77-84(1992); Carver et al, Bio/Technology, 11 1263- 1270(1993); Cooper and Dalrymple, The Japanese Journal of Expeήmental 25 Medicine, Developmental Biotechnology supplement, 12(2) 124-132(1994)).
This high level of expression of a heterologous protein in livestock milk was the result of using a fusion of the BLG promoter region to human genomic sequences (Wright et a , Bio/Technology, 9 830-834(1991)). Analogous cDNA based
30 constructs were poorly expressed in transgenic mice (Whitelaw et al, Transgenic
Res., 1 3-13(1991)). Despite some notable exceptions in the field as a whole, (Ebert et al, Bio /Technology, 9 835-838(1991); Velander et al, Proc. Natl. Acad. Sci. USA, 89 12003-12007(1992)) the general inefficient expression of cDNA based constructs is well documented (Brinster et al, Proc. Natl. Acad. Sci. USA, 85 836-840(1988); Palmiter et al, Proc. Natl. Acad. Sci. USA 88 478-482 (1991); Whitelaw et al, Biochem. J., 286: 31-39 (1992)). Observed problems include the influence of chromosomal position effects and distinct spatial and/or temporal expression in lines transgenic for the same construct. Such constructs can be improved by the addition of some natural or heterologous introns. However, expression levels from such constructs rarely match levels attained with constructs containing some or all natural introns in the region encoding a heterologous protein. The successful use of less than a full complement of introns is the subject of WO-A-9005188. In spite of that useful advance in the art, however, the genetic material encoding many potential target human proteins which may be produced by the transgenic mammary gland is very often, due to immediate non-availability or the size of the natural gene, limited to cDNAs. As such, a technique giving more consistent expression from transgene constructs containing intronless cDNA sequences is highly desirable.
A further advance in the expression of cDNAs is the so-called "rescue" technology, an approach developed by Clark and co-workers (Clark et al, Bio/Technology, 10 1450-1454(1992); WO-A-9211358)) to overcome cDNA- related expression problems. It makes use of the observation that co-injection of an actively expressed transgene, such as the entire ovine BLG gene, together with an intronless construct results in the expression of the second construct where no expression is achieved when it is injected alone. Clark and colleagues have demonstrated the expression of up to 800μg/ml of human α,-antitrypsin (AAT) in the milk of mice transgenic for both BLG and an intronless human AAT construct. In mice transgemc for the latter construct alone, only one out of eight mice expressed and this at a level of only 3.9μg/ml. Similarly, using this technology,
an expression level of 108μg/ml from a wild type human protein C cDNA construct was achieved (WO-A-9211358). This represents approximately 20% the expression level obtained with an equivalent genomic based construct. The cDNA construct alone gave no expression in 11 lines of transgenic mice.
The "rescue" .phenomenon has been rationalised as follows. Strongly expressing genes have an innate ability to 'dominate' their chromosomal environment such that they are able to initiate and maintain a high expressing state. Intronless genes are deficient in some, as yet identified, feature which provides them with this capability. However, the dominant effect of the strong gene extends some way 5' and 3' to the gene itself and therefore by linking a 'weak' and 'strong' gene, some of the properties of the high expressing gene are conferred on the intronless gene. Clark and colleagues propose that this probably results in an open chromatin conformation associated with the actively expressing gene which encompasses adjacent intronless genes. The actively expressing gene may thus create a permissive domain allowing access to the intronless genes by the transcriptional machinery of the cell. In the absence of adjacent actively expressing genes, the intronless construct may be inaccessible, probably residing in condensed chromatin. Other possible explanations for this phenomenon include enhancer-like sequences present in the actively expressing gene but absent from the intronless construct interacting positively with the latter or simply that the actively expressing gene insulates the intronless gene from the negative effects of adjacent chromatin.
To take advantage of "rescue" technology, we have constructed a vector, pMAD, from the ovine BLG gene for the cloning of cDNAs (Figures 1 and 2). This vector contains the same 5' and 3' flanking sequences present in the BLG gene which itself always gives rise to high level expression in transgenic mice. However, it lacks all coding sequences and introns of the intact gene. Cloning of cDNAs in the unique EcoRV site between 5' and 3' flanking sequences results in constructs suitable for expression by the "rescue" approach. However, the issue of co-
injection of two covalently unlinked genes is not without its difficulties. There is always the risk that one gene or other is not represented in the final transgenic lines. Additionally, the two different genes may be present but not at the same locus. Subsequently they may segregate upon breeding. Finally, the physical structure of a BLG/pMAD array is not determined prior to injection and there is no control over it. The relative copy numbers of the two genes may vary especially if the DNA concentrations of the two constructs are not tightly controlled.
cDNAs have been successfully expressed at high levels, in a limited number of cases. It is not clear from the literature why this should be the case. However, the fact is that a cDNA has never (to our knowledge) been expressed at high levels from a BLG construct other than by rescue.
We had noted the work of Brinster and Palmiter (ibid) and others and we sought to incorporate a BLG intron into our cDNA constructs. To this end the vector pMAD6 was constructed, containing almost all the BLG sequence 3' to the natural BLG stop codon, i.e. a portion of exon 6, intron 6, all of exon 7 and those available sequences downstream of the polyadenylation site (see Figures 1 and 2). A protein C cDNA in this vector (pCORP3) expresses at detectable levels but not nearly as well as "rescued" intronless pCORP2 (see table 2). Thus we can conclude that the mere presence of a BLG intron is insufficient to achieve high level expression.
Noting that certain genes have an intron in the 5 '-untranslated region (5'-UTR), we engineered the natural BLG first intron into the 5'-UTR of the BLG sequences in pMAD (to give pMADl) and into pMAD6 (to give pMAD16). When protein C cDNAs were put into these vectors, there was no detectable expression of protein C in the milk of lactating female transgenics (see table 2; pCORP6). This indicates that the mere presence of intronic sequences in the 5'-UTR of a gene is in general insufficient to allow expression of a cDNA.
We have now found, however, that if, instead of the BLG first intron, an intron whose natural position is within the 5 '-untranslated region of its gene is used, good expression results.
According to a first aspect of the invention, there is provided an expression construct comprising:
(a) a promoter;
(b) an intron whose natural position is within the 5 '-untranslated region of a gene from which it is derived;
(c) a coding sequence; and
(d) a 3 '-flanking sequence,
wherein the intron (b) is not derived from the same gene as that from which either the promoter (a) or the coding sequence (c) is derived, and, in particular wherein the promoter (a) drives expression of the coding sequence (c) at a level which is elevated by virtue of the presence of the intron (b) .
Elevated levels of expression include expression where previously none was measurable (or obtained). Elevated levels is optionally defined as a level higher than obtained by the construct without the intron (and optionally the 3' flanking sequence) described above.
Preferably the expression construct is a DNA expression construct. Preferably the coding sequence is a protein-coding sequence although it may code non-protein substances such as ribozymes. The construct is effective for two particular reasons; firstly, the promoter (a) drives expression of the coding sequence (c) at a
level which is elevated by virtue of the presence of the intron (b) and/or secondly the coding sequence (c) is more likely to be expressed in a transgenic host, by virtue of the presence of the intron (b). The second effect is particularly important when taking into account the length of time and the efforts required to produce transgemc animals useful as bioreactors for the production of useful proteins, etc. It is also important in laboratory scale trials to determine and obtain transgenic hosts. Use of constructs as described in the claims have shown that an increased number of transgenic hosts express the coding sequence over use of the constructs without specific intron described herein (e.g see number of expressing founders in Table 3). The elevated level of expression of the coding sequence and/or the expression of the coding sequence may be by virtue of the presence of the intron (b) and the 3'- flanking sequence (d).
The DNA expression construct may be useful for expression in any suitable host system such as, for example, prokaryotes, (e.g. E.colϊ), fungi, plant and animal (including mammalian) cell lines and transgenic plants and animals (including mammals). However, it is in transgenic animal hosts that the expression constructs of the invention are most useful. In principle, the invention is applicable to all animals, including birds such as domestic fowl, amphibian species and fish species. The protein may be harvested from body fluids (such as milk, blood or urine) or other body products (such as eggs, where appropriate). In practice, it will be to (non-human) mammals, particularly placental mammals, that the greatest commercially useful applicability is presently envisaged. This is because expression in the mammary gland, with subsequent optional recovery of the expression product from the milk, is a proven and preferred technology. It is with ungulates, particularly economically important ungulates such as cattle, sheep, goats, water buffalo, camels and pigs that the invention is likely to be most useful. The generation and usefulness of such mammalian transgenic mammary expression systems is both generally, and in certain instances specifically, disclosed in WO-A-8800239 and WO-A-9005188.
In this text, the meaning of a sequence being derived from a gene does not require that the sequence has actually been obtained from the gene in question. Rather, all and any copy, as well as the original sequence is meant. Further, any modification to the sequence which does not remove the desired end result can be used.
In addition to being useful in transgenic animal expression for non-therapeutic purposes (as far as the host is concerned), constructs of the invention may also be useful in genetic therapy in humans or other animals.
The promoter can be any suitable promoter chosen from a gene different from the source of the intron (b). Within that constraint, it will be chosen having regard to its desired properties in the construct of the expression system to be used and its ability to derive expression of heterologous sequences in cell culture or in a transgenic organism. A promoter is any sequence which drives expression of a coding sequence. For example, the BLG promoter does not express particularly highly in cells which do not respond to prolactin (such as COS cells). A 'cell' promoter according to the invention is the HCMV (human cytomegalovirus) IE gene promoter. Other promoters of the invention include, endothelial promoters such as vascular cell adhesion molecule (VCAM), platelet endothelial cell adhesion molecule- 1 (PEC AM), inter-cellular adhesion molecule-2 (ICAM) and smooth muscle promoters, such as Desrnin E and Desmin P. A preferred promoter is one which drives expression of the protein coding sequence in mammalian cells. In relation to expression in transgenic animal hosts, the preferred expression system involves expression in the mammary gland of transgenic placental mammals. For this purpose, milk protein promoters will generally be used, preferably but not necessarily derived from the species chosen as an expression host. The promoter may be a casein promoter (such as an α-, β- or κ-casein promoter), but it is preferred that it be a non-casein promoter, such as the human Bile Salt Stimulated Lipase (BSSL) promoter, more preferably a whey protein promoter, such as that of
whey acidic protein (WAP), α-lactalbumin or, most preferred of all, β- lactoglobulin. Figure 3 is a schematic representation of the cloning of pCASLAC and obtaining transgene constructs therefrom. pCASLAC corresponds to pCASMADό (see Fig. 2, 4 and 7) only using the more tightly regulated α-lac promoter. Of course, the present invention covers promoters, in the constructs described, which have not yet been isolated or characterized. One general way for isolating specific promoters (such as mammalian promoters) for use in the present invention is to isolate specific cDNAs by differential display or from subtractive cDNA libraries. These, in turn, are used to screen genomic libraries for the cognate promoters.
In addition, the present invention encompasses the use of a modified low expressing naturally occurring promoter in vitro to an increased level of expression (eg. by addition of an enhancer) or to use a promoter with a higher level of expression in a crossed species (eg. the human α-lactalbumin promoter expresses better in mice than the endogenous mouse promoter).
A promoter according to the invention may also be a viral or modified cellular promoter or a completely artificial promoter having the properties of high level expression (preferably mammalian species). Details of suitable promoters can be found in Houdibine, J-M., J. Biotech., 34: 269-287 (1994); Garner, I. & Dalrymple, M., in "Encyclopedia of Molecular Biology: Fundamentals and Applications ", Robert A. Myers (Ed.), Weinheim, NY.
Element (b) of a construct of the invention is an intron whose natural position is within the 5 '-untranslated region (5'-UTR) of its natural gene (i.e the gene with which it is naturally associated). The whole intron is not necessarily required. Fragments or portions may be sufficient. The requirement for the present invention is that the level of protein expression, from any construct according to the invention, is elevated by virtue of the presence of the intron, or parts thereof. It
has been shown that the first and third portions of an intron (which has been divided into three fairly equal parts), recombined, are often effective. Generally speaking, the intron for inclusion in a construct according to the invention will be the first intron of such a gene. Examples of genes with such known introns include; human and rat aldolase A, human type II IL-1 receptor, human UDP-N- acetylglucosaminyl transferase, mouse involucrin and mouse adenosine deaminase. Some genes have more than one intron whose natural positions are all within the 5' untranslated region of its natural gene. The present invention recognises this and covers, within element (b), one or more of such introns (for example in a gene with two introns naturally positioned in the 5' UTR, they may separately, together or parts of each cojoined be included in a part of a construct according to the invention). These and other yet unidentified introns whose natural position is within the 5' untranslated region of its natural gene may be used according to the present invention. Also included are: the introns of several gene families including; the actin family (two skeletal muscle actins-alpha cardiac and alpha skeletal, two smooth muscle actins-alpha smooth and gamma smooth, and two non- muscle actins-beta and gamma cytoplasmic actin), the troponin family (cardiac, skeletal and foetal troponins) and the casein family (α SI, α S2, β and K). Preferably the intron is the first intron of the family. In the case of transgenic mammary specific expression, the most preferred gene family from which the intron may come is the casein family or the actin family. The intron may preferably be from the same source of organism as the promoter and/or the expression system which it is in (e.g. mammalian, bovine, ovine, etc.).
DNA expression constructs of the present invention are different from that of Barash et al. (Nucl. Acids Res. 24(4) 602-610 (1996)), in that Barash et α/. 's constructs include β-lactoglobulin intragenic sequences which are not within the 5'- untranslated region. Barash et al. do not refer to the possibility of using an intron whose natural position is within the 5 '-untranslated region of its natural gene. Caseins, whose genes represent a preferred source of the 5' -UTR introns useful in
the present invention, are the major mammalian milk proteins and are encoded by a small gene family, which in cows and sheep consists of four members, αsl, β, αs2 and K, and in mice and rats five, α, β, γ, ε and K (Yu-Lee & Rosen, J. Bio I Chem., 258 10794-10804 (1983); Jones et al, J. Biol. Chem., 206 7042- 7048(1985); Thompson et al, DNA, 4 263-271 (1985)/ reviewed by Mercier & Vilotte, J. Dairy Sci., 76 3079-3098(1993)). The evolution of the calcium sensitive caseins (α and β) is believed to have occurred by recruitment of exons encoding discrete functional domains, followed by intragenic and intergenic duplication to create the present number of similar exons within a given gene, and of genes within a family (Jones et al, J. Biol. Chem., 206 7042-7048(1985); Groenen et al, Gene, 123 187(1993); reviewed by Mercier & Vilotte, J. Dairy Sci., 16 3079-3098(1993)). There is no evidence that K casein is evolutionally related to the other caseins. Both in sequence homology and protein function it appears to be related to γ fibrinogen (Jolles et al, Biochim. Biophys. Acta., 365 335(1974); Thompson et al, DNA, 4 263-271(1985); Alexander et al, Eur. J. Biochem, 178 395-401(1988)), which performs a cleavage-induced clotting function in blood similar to the clotting function of K casein in the stomach. The caseins all map to a single chromosome in rodents, sheep, cows, humans and pigs (reviewed by Mercier & Villotte, J. Dairy Sc , 16 3079-3098(1993)), all four bovine caseins have been mapped to a single 250 Kb locus (Ferretti et al, Nucleic Acids Res. , 18 6829-6833(1990); Threadgill & Womack, Nucleic Acids Res., 18 6935-6942(1990)) and all five mouse caseins to a 400 Kbp region (Tomlinson et al, Mammalian Genome, 1 542-544).
The first intron of the calcium sensitive casein genes is naturally positioned in the
5' -UTR, upstream of the start of translation. The position of this intron is conserved across species barriers, indicating that there may be some critical function for an intron in this position. The intron may be obtained by PCR amplification from genomic DNA. The resulting DNA fragment may be cloned into a suitable site of an appropriate vector, such as the pMAD6 vector described
above.
Constructs of the invention also contain a coding sequence (c), whose expression is driven by the promoter (a) under the beneficial influence of the intron (b). The protein-coding sequence may code for any (natural or modified) protein of interest, particularly those which may be advantageously produced in the preferred mammary gland expression systems. Examples of classes of such proteins, and specific instances within those classes, are as follows: blood proteins involved in haemostasis including factors V, VII, VIII, IX, X, XIII, PAI-1, PAI-2, TFPI, protein C (details of protein C according to the present invention can be found for example in EP-A-191606 and W097/20043), protein S, alpha 1-antitrypsin (AAT) (details of which can be found in general from Perlino et al EMBO Journal, 6, 2767-2771, 1987 and WO90/05188), tPA, fibrinogen (details for which may be found in WO95/23868 and the references cited therein); other protease inhibitors such as serpins, Kazal/Kunitz inhibitors, kinninogens, stefms, cy statins or tissue inhibitors of metalloproteinases; growth factors; protein hormones; structural proteins such as collagens (details of which may be found in WO93/07889, WO94/ 16570, WO97/08311 and the references cited in these publications) and keratins; enzymes such as lipases, other proteases and transferases; and antibodies. While the protein-coding sequence may in principle be any suitable sequence, such as either the full natural genomic structure, a minigene sequence consisting of some, but not all, of the introns naturally present in the gene, or a cDNA (containing no introns), it will generally be with cDNA sequences that the invention is most useful. This is because the invention may conveniently enable the expression of protein from cDNAs which may otherwise only be achievable using minigenes or full genomic sequences. Furthermore, some proteins may be expressed in nature from intronless genes (e.g. bacterial or yeast genes, human thrombomodulin) or have natural intron structures incompatible with the chosen host (e.g. invertebrate or plant genes in a mammalian cell). In these cases the 'cDNA' route is the only one available.
The intron (b) is preferably positioned upstream of the translation start site for the protein-coding sequence (c), by analogy with its position in its natural environment.
Particularly preferred constructs according to the present invention are the BLG promoter with either (i) the first intron from bovine β-casein or (ii) the first intron from muscle cardiac actin or (iii) the first intron from ovine β-casein. More preferably, the 3' flanking sequence is from BLG as described below for preferred 3' sequences under (i). Particularly preferred constucts of the present invention include the following:
(i) BLG promoter + bovine β-casein intron 1 + BLG 3' sequence (particularly the 3' sequence beginning immediately 3' to the natural β- lactoglobulin stop codon and continuing to at least about 30 bases 3' of the poly-A site), optionally including ovine beta- lactoglobulin intron 6 (preferred positioned 5' to the flanking sequence and 3' to any coding sequence); in particular the construct pCASMADό as described in Fig. 2, 4 or 7;
(ii) BLG promoter + muscle cardiac actin intron 1 + BLG 3' sequence (particularly the 3' sequence beginning immediately 3' to the natural β- lactoglobulin stop codon and continuing to at least about 30 bases 3' of the poly-A site), optionally including ovine beta-lactogloulin intron 6 (preferred position 5' to the flanking sequence and 3' to any coding sequence); in particular the construct pACTMADό as described in Fig. 2, 5 or 7;
(iii) BLG promoter + ovine β-casein intron 1 -I- ovine β-casein 3' flanking sequence; in particular the construct pBOB as described in Fig 2, 6 or 7.
Preferably, following the coding sequence will be such 3 '-sequences as may be necessary or appropriate. In the invention at its broadest, it is not thought that the nature of such 3 '-sequences is particularly limited. The 3'- flanking sequence may or may not include its natural intron. Suitable 3' flanking sequences preferably comprise functional elements which are able to direct the correct transcription, termination and 3' end processing. These can be determined, without undue burden, by the person skilled in the art. However, certain 3 '-flanking sequences have been found to be particularly useful. These include, but are not restricted, to: (i) a poly-A site (poly A addition site), (ii) a β-lactoglobulin gene 3 '-sequence beginning immediately 3' to the natural β-lactoglobulin stop codon and continuing to at least about 30 bases 3' of the poly-A site (as found in pMAD6 and pCASMADό and PACTMAD6), or (iii) β-casein 3' sequences including poly A signal. These sequences (as used in pBOB) consist of 6.5Kbp of DNA incorporating ovine β-casein exons 7 to 9, introns 7 and 8, and approximately 4.8Kbp of 3 ' sequence .
The presence of such 3 '-sequences in the construct adds stability to it. It is believed that the relative orientation of the first and last intron may contribute to this stability.
The β-lactoglobulin gene 3 '-sequence may be cloned from a β-lactoglobulin gene or amplified by PCR and cloned from genomic DNA, which may be of ovine origin. As mentioned above, it begins with the natural β-lactoglobulin gene sequence immediately 3' to the stop codon, which is a TAG codon occurring in exon VI. It extends to at least about 30 bases 3' of the poly-A site, which is in exon VII. Exons VI and VII bracket intron 6 which is present in its entirety. The preferred minimum length of the β-lactoglobulin-derived 3 '-sequences is about 2.3 Kb. For additional preference at least about 50 bases 3' to the poly A site are present. Similarly, the β-casein 3' sequences may be cloned from the β-casein gene cr amplified by PCR and cloned from genomic DNA, which may be of ovine
origin.
Appropriate signal and/or secretory sequences, operably linked to the construct may be present if necessary or desirable.
In other aspects, the invention is directed to:
• a process for the preparation of a construct according to any feature of the first aspect. The process comprises linking together selected nucleotide bases and/or nucleotide sequences;
• a vector comprising a construct according to the first aspect of the invention. The vector may be plasmid, phage, cosmid or other vector type, for example derived from yeast. The vector may be an expression vector;
• a process for the preparation of a vector described above, comprising introduction of a construct according to the first aspect of the invention into a vector construct;
• a process for the preparation of a host (preferably an expression host), the process comprising introducing a DNA expression construct (or a vector), as described above, into a suitable organism; the process, in particular provides a host which expresses elevated levels of the coding sequence in the construct;
• a host organism (preferably an expression host organism) incorporating a DNA expression construct (or a vector) as described above (and preferably capable of giving rise to expression of protein encoded by the construct, although non-expressing hosts such as Escherichia coli and other procaryotes may be useful as cloning hosts); The host may be a eukaryotic
or prokaryotic cell/organism, such as bacteria, insect or yeast cells, as well as animal tissues (cells in culture) and animals themselves. Such animals are transgenic and preferred transgenic animals include mammals, in particular non-human placental mammals such as pigs, sheep, cattle and goats. Preferably the host (e.g. transgemc animal) according to the invention has the construct (of the first aspect of the invention) integrated into its genome. It is particularly preferred that the transgenic animal transmits the construct to its progeny, thereby enabling the production of at least one subsequent generation of producer animals. Such a host organism, in particular expresses elevated levels of the coding sequence in the construct;
• a process of preparing a protein, the process comprising allowing an expression host to express a DNA expression construct as described above, and optionally subsequently purifying the protein;
• a protein when prepared by such a process. The protein may be a fusion protein;
• the use of a nucleic acid expression construct comprising a promoter, an intron whose natural position is within the 5 '-untranslated region of a gene from which it is derived, a coding sequence and a 3' flanking sequence to obtain a transgenic host, preferably with elevated levels of the expressed coding sequence;
• the use of a nucleic acid construct comprising a promoter, an intron whose natural position is within the 5' untranslated region of a gene from which it is derived, a coding sequence and a 3' flanking sequence to increase the likelihood of expression of the coding sequence from a transgenic host which incorporates the nucleic acid construct;
• a process for improving whether an individual or a number of transgenic hosts express a transgene coding sequence, the process comprising introducing into a host, a nucleic acid construct comprising a promoter, an intron whose natural position is within the 5' untranslated region of a gene from which it is derived, a coding sequence and a 3' flanking sequence.
In addition to the construct according to the first aspect of the invention, there is provided an empty 'cassette', including all feamres in claim 1, without the coding sequence. Such a "cassette" provides an easy means by which to provide a high expressing vector for other parties to use by simply introducing coding sequences of interest by restriction endonuclease cutting of the empty cassette and religation (according to standard techniques). The empty "cassette" is for use with an incorporated coding sequence of interest.
Preferred feamres for each aspect of the invention are as for each other aspect mutatis mutandis.
The present invention also provides, as a separate aspect, the novel expression of collagen cDNA (natural procollagen chains or modified collagen). Preferably the collagen cDNA is expressed via a construct according to the first aspect of the invention. Preferred feamres of and all different aspects of the invention described herein above in relation to the construct, also apply to the expression of collagen cDNA. Particular preferred details in relation to collagen are described above under a discussion of the protein-coding sequences, including references thereto. For example, for the expression of all collagen DNA (cDNA or otherwise), expression hosts may co-express prolyl 4-hydroxylase, which is a post- trans lational enzyme important in the natural biosynthesis of procollagen.
The invention will now be illustrated by the following examples. The examples refer to the drawings, in which:
FIGURE 1 shows the β-lactoglobulin (BLG) sequences and the plasmids pMAD and pMADό.
FIGURE 2 shows the origin of sequences present in plasmids pMAD, pMAD6, pCASMADό and pACTMADό.
FIGURE 3 shows the construction of pCASLAC
FIGURE 4 shows the construction of pCASMADό.
FIGURE 5 shows the construction of pACTMADό.
FIGURE 6 shows the construction of pBOB.
FIGURE 7 shows details of pMAD, pMADό, pCASMADό, pACTMADό and pBOB.
Preferred embodiments of the invention are based on the use of the BLG promoter, and are designed to express cDNAs from the BLG gene. The structure of pMADό is indicated in Figures 1, 2 and 7. This vector contains the same 5' and 3' flanking sequences present in the ovine BLG gene which itself always gives rise to high level expression in transgenic mice.
However, it lacks all protein coding sequences and introns 1 to 5 of the intact gene. The 3' non coding exons of the gene remain in this vector together with the final intron of the BLG gene. Cloning of cDNAs in the unique EcoRY site between 5' and 3' flanking sequences results in constructs suitable for expression of cDNAs. Incorporation of the BLG 3' sequences are not essential for the invention. Such BLG 3 ' sequences can be substituted by any competent 3 ' flanking sequences with or without an intron situated downstream of the last (stop) codon of such a gene.
In outline, the first intron of the bovine β-casein gene was amplified by PCR from genomic DNA. The resulting DNA fragment, of approximately 2 Kbp, was cloned and subsequently subcloned into the EcoRV site of pMADό in such a way that the original EcoRV site was destroyed and reformed on the 3' side of the intron. This gave the vector pCASMADό (Figures 1, 2, 4 and 7). A cDNA encoding human protein C was inserted into the unique EcoRV site of pCASMADό and the new construct called pCORI69. The cDNA utilised encodes a mutant form of the natural protein C (PC962): the mutation was designed to allow more efficient processing of the mature protein (Foster et al, Biochemistry, 29:347-354 (1990)).
This mutant form of the human protein C cDNA has been incorporated into a construct pCORP9, exactly analogous to pCORP2 (see WO-A-9211358). In "rescue" experiments pCORP9 expressed particularly poorly, the highest expressing line being 3μg/ml compared to 108μg/ml for the wild type cDNA. This indicates that this mutant cDNA is particularly difficult to express at high levels and therefore is a very exacting test of any cDNA expression system. All references to the DNA sequence of the β-lactoglobulin gene utilise the numbering of the sequence allocated ΕMBL Accession No. X 12817 (Harris et al , NAR 16: 10379-80 91988).
EXAMPLES
General
Where not specifically detailed, recombinant DNA and moleuclar biological procedures were after Maniatis et al ("Molecular Cloning" Cold Spring Harbor (1982)) "Recombinant DNA" Methods in Enzymology Volume 68, (edited by R. Wu), Academic Press (1979); "Recombinant DNA part B" Methods in Enzymology Volume 100, (Wu, Grossman and Moldgave, Eds), Academic Press
(1983); "Recombinant DNA part C" Methods in Enzymology Volume 101, (Wu, Grossman and Moldgave, Eds), Academic Press (1983); and "Guide to Molecular Cloning Techniques", Methods in Enzymology Volume 152 (edited by S.L. Berger & A.R. Kimmel), Academic Press (1987). Unless specifically stated, all chemicals were purchased from BDH Chemicals Ltd, Poole, Dorset, England or the Sigma Chemical Company, Poole, Dorset, England. Unless specifically stated all DNA modiiymg enzymes and restriction endonucleases were purchased from BCL, Boehringer Mannheim House, Bell Lane, Lewes, East Sussex BN7 1LG, UK.
[Abbreviations: bp = base pairs; kb - Kilobase pairs, AAT =alphal-antitrypsin; BLG = beta-lactoglobulin; FIX = factor IX; E. coli = Escherichia coli; dNTPs = deoxyribonucleotide triphosphates; restriction enonucl eases are abbreviated thus e.g. BamHI: the addition of -O after a site for a restriction endonuclease e.g. PvuII-O indicates that the recognition site has been destroyed] .
Construction of Plasmids
Vectors
Plasmid pUCPM
The multiple cloning site of the vector pUC18 (Yanisch-Perron et al. , (1985) Gene 33: 103-119) was removed and replaced with a synthetic, double stranded, oligonucleotide containing the new restriction sites: PvuVMluVSall/EcoRY/Xbal/ Pvull MM, and flanked by 5 '-overhangs compatible with the restriction sites EcoRI and HmdIII. pUC18 DNA was cleaved with both EcoRI and HmdIII and the new linker DNA was ligated into pUC18. The DNA sequence across the new multiple cloning site was confirmed. This new vector was called pUCPM.
Plasmid pUCXS The β-lactoglobulin gene sequences from plasmid pSSltgXS (see WO-A-9201358)
were excised on a SaWXbal fragment and recloned into the vector pUCPM, cut with Sail and Xbal, to give plasmid pUCXS.
Plasmid pUCXS/RV The plasmid pSSltgSE (see WO-A-8800239) contains: β-lactoglobulin gene sequences from the Sphl site at position 754 to the EcoRI site at 2050, a region spanning a unique NotI site at position 1148. This insert contains a single Pvull site (832) which lies in the 5 '-untranslated region of the β-lactoglobulin mRΝA. Into this site was blunt-end ligated a double stranded, 8bp, DΝA linker encoding the recognition site for the enzyme EcoRV, to give the plasmid pSSltgSΕ/RV. The DΝA sequences bounded by Sphl and NotI were then excised and used to replace the equivalent fragment in the plasmid pUCXS, thus effectively introducing a unique EcoRV site into the β-lactoglobulin gene placed in such a way as to allow the insertion of any additional DΝA sequences under the control of the β- lactoglobulin gene promoter and 3' to the initiation of transcription. The resulting plasmid was called pUCXS/RV.
Plasmid pUCSV
A derivative of pUCXS/RV, containing only the 4.3 Kbp of the β-lactoglobulin gene which lie 5' to the transcription initiation site (the promoter), was constructed by subcloning the Sα/I-EcoRV fragment into pUCPM; this plasmid is called pUCSV.
Plasmid pBLAClOO A fragment of the 3' flanking sequence of the β-lactoglobulin gene was subcloned in such a way as to eliminate all introns. Plasmid DΝA of pUCXS/RV was partially digested with Smal by performing an enzyme titration with lower and lower concentrations of enzyme at a fixed DΝA concentration. The Smal protein was removed by phenol-chloroform extraction and ethanol precipitation and the DΝA resuspended in water. This DΝA was subsequently digested to completion
with the enzyme Xbal. DNA cut once at the Smal site, position 5286 and then cleaved with Xbal gave a characteristic band of size 2.1 Kbp. This band was purifLά from an agarose gel slice and ligated into Smal and Xbal cut pBSIISK+ (Stratagene Ltd., Cambridge Science Park, Cambridge, UK) to give the plasmid pBLAClOO.
Plasmid pMAD
The β-lactoglobulin cloning vector pMAD was constructed to allow rapid insertion of cDNAs under the control of the β-lactoglobulin gene promoter and 3 '-flanking sequences. Such constructs contain no introns. The plasmid pBLAClOO was opened by digestion with both EcoRV and Sail, the vector fragment was gel purified. Into this was ligated the 4.3 Kbp promoter fragment from the plasmid pUCSV as a Sall-EcoRV fragment. This construct is termed pSTl and constitutes a β-lactoglobulin mini-gene encoding the 4.3 Kbp promoter and 2.1 Kbp of 3' flanking sequences. A unique EcoRV site is present to allow blunt-end cloning of any auditional DNA sequences. In order to allow excision of novel β-lactoglobulin gene constructs with the enzyme Mlul the entire mini-gene from pSTl was excised on a Xhol-Notl fragment, the DNA termini made flush with Klenow polymerase, under standard conditions, and blunt-end cloned into the EcoRV site of pUCPM to give pM AD.
Plasmid pMADό
Previously described in WO 95/23868, and shown in Figures 1 and 2.
Plasmid pMADl
Two primers, complementary to sequences at the 5' and 3' boundaries of the first intron of the ovine BLG gene, were used to amplify a ~ 650bp fragment encompassing the entire sequence of intron 1 of the BLG gene from pUCXSRV template. The primers introduce a 5' Smal site and a 3' EcoRV site at the ends of the PCR fragment. This fragment was cloned in Eco RV digested pBluescriptSK
to which single 3' dATP overhangs were added, using Taq polymerase. This construct was named pSTIl. The orientation of the insert with respect of the multiple cloning site in pSTIl was determined by restriction digestion.
The intron 1 sequence was excised from pSTIl on a 5' Smal -3'Hind l fragment, the recessed 3' terminus generated at the Hindlll end was repaired using Klenow, and the resulting blunt ended fragment was ligated with EcoRV digested pMAD to make pMADl . The correct orientation of the intron fragment with respect to the remainder of the BLG sequences was determined by DNA sequencmg. This step effectively moves the EcoRV site to the 3' end of the BLG intron.
Plasmid pMAD 16
This was constructed using essentially the same strategy as that described for pMADl, except that in the final cloning step the BLG intron was ligated with ΕcoRV cleaved pMADό (instead of pMAD) to construct pMADlό.
Expression Constructs
Plasmid pCORP2 (see WO-A-9211358)
A 1450bp cDNA of the human protein C gene, flanked by Kpή sites, was obtained in the form of plasmid pWAPC2. The cDNA was excised as a Kpnl fragment, the 3' overhangs made flush by treatment with T4 DNA polymerase, the fragment gel purified and blunt-end cloned into the EcoRV site of pMAD. Orientation was determined by restriction digest and confirmed by DNA sequencing. This construct is plasmid pCORP2 and contains the human protein C cDNA under the transcriptional control of the β-lactoglobulin gene 5' and 3' flanking sequences. There are no introns.
Plasmid pCORP5
The 1450bp protein C cDNA fragment used in the construction of pCORP2 was placed into pMADlό to make pCORP5.
Plasmid pCORP9
To facilitate the cloning of the protein C cDNA, PC962 (Foster et al ., ibid), into pMAD, the plasmid was modified to incorporate EcoRV sites at the extremities of the protein C cDNA insert. A 769 bp Sstll-Pstl fragment encompassing the 3' end of PC962 was cloned between the Sstll and Pstl sites of pBluescript II SK+ (Stratagene, La Jolla, CA). The fragment was excised with Sstll/ EcoRV and purified. The 5' portion of PC962 was modified by PCR. The sense oligonucleotide primer for this reaction covered the 5' ATG region of the cDNA and provided an EcoRV site upstream of this in the product. The antisense oligonucleotide primer covered the Sstll site used to generate the Sstll - EcoRV fragment. The resulting PCR product was digested with EcoRV and Sstll and ligated with the Sstll-EcoRV 3' fragment and EcoRV digested pMAD. The resulting plasmid, designated pCORP9 effectively contained the PC962 cDNA flanked by EcoRV sites in an intronless fusion driven by the β- lactoglobulin promoter.
Plasmid pCORP14
A genomic DNA construct, containing exons I through VIII of the human protein C gene, was made. This genomic construct, designated GPClO-1, changed the sequence 16 base pairs upstream of the ATG from the native protein C sequence to the β-lactoglobulin sequence and introduced mutations in the propeptide cleavage site located in exon 2, and the two-chain cleavage site located in exon 6, as described below. The construct was assembled using four fragments designated A, B, C and D and encompassed the protein C gene sequence from
the ATG to a BamHl site in exon VIII, immediately upstream of the stop codon. The fragments were generated from a human genomic library in Charon 4A phage which was screened with a radiolabeled cDNA probe for human protein C. The screening of the λ library produced three clones that together mapped the entire protein C5 gene (Foster et al ., 1985, Proc. Natl. Acad. Sci. USA, 82: 4673-4677). These clones were designated PC λl, PC λό and PC λ8. Fragment A was a NotI to EcoRI fragment that contained exons I and II of the genomic sequence and was 1698 bp. A subclone of PC λό contained an EcoRI to EcoRI fragment and was designated pHCR4.4-l . Using pHCR4.4-l as a template and oligonucleotides ZC6303 (5'-ATT TGC GGC CGC CTG CAG CCA TGT GGC AGC TCA CAA GCC TCC TGC-3') and ZC6337 (5'-CAG GAA GGA GTT GGC GCG CTT GCG CCG TTG CAG CAC CTG CTG GGC-3", a DΝA fragment was generated by polymerase chain reaction (PCR). Oligonucleotide ZC6303 changed the sequence 16 based pairs 5' to the ATG sequence from the native protein C sequence to the equivalent sequence from the β-lactoglobulin gene and introduced a NotI site.
Oligonucleotide ZC6337 changed the propetide cleavage site from Arg-Ile-Arg- Lys-Arg to Gln-Arg-Arg-Lys-Arg. The resulting PCR generated fragment was digested with NotI and BssHlL, and a 1402 base pair fragment was gel purified and designated Al. A second fragment was prepared using a λ gtll clone of PC λl as a template with oligonucleotides ZC6306 (5' -CTT CTT CCT GAA TTC TGT TTC TTG C-3') and ZC6338 (5' -CGG ATC CGC AAG CGC GCC AAC TCC TTC C-3') in a polymerase chain reaction. The resulting DΝA fragment, designated A3, was digested with 5wHII and EcoRI and gel purified, resulting in a 296 base pair fragment.
Fragments Al and A3 were ligated into the Bluescript II KS + phagemid vector (Stratagene. La Jolla, CA). The resulting plasmid, designated GPC 2-2, was digested with NotI and EcoRI, gel purified and the Notl-EcoRI DΝA fragment
was designated Fragment A.
pCR 2-14 is a subclone which contains an EcoRI to EcoRI DNA fragment of PC λ8 (Foster et al ., 1985, ibid.). The plasmid was digested with EcoRI and SstI and gel purified. The resulting figment was designated Fragment B.
Plasmid pCR 2-14 was used as a template DNA with oligonucleotides ZC6373 (5' -AAA GTA AAA AAA GAT CTA AAA ATT TAA C-3') and ZC6305 (5' - GTG TCT CGT TTT CTT AAG TGA CTG CGC-3'), which introduced za AfTO. site and the RRKR mutation of the native (KR) two-chain cleavage site, in a polymerase chain reaction. The resulting PCR-generated fragment was digested with Bglϊl and Aflϊ and gel purified, resulting in a 1441 base pair fragment, designated Εl. Fragment ΕI was used in a ligation reaction with oligonucleotides ZC6302 (5' -TTA AGA AGA AAA CGA GAC ACA GAA GAC CAA GAA GAC CAA GTA GAT CCG C-3') and ZC6304 (5' -GGA
TCT ACT TGG TCT TCT TGG TCT GTG TCT CGT TTT CTT C-3'). These oligonucleotides form Aβll and Sstll restriction sites when annealed and were ligated to the 3' end of fragment Εl, resulting in a fragment with a 5' Bglll site and a 3' Sstll site. This frament was used in a ligation reaction with a BamΑl- Sstll digested Bluescript II KS+ phagemid vector (Stratagene). The resulting plasmid was designated GPC 8-5 and digested with SsrI and Sstll, generating a 626 base pair fragment, designated Fragment C.
A fourth fragment was generated by digestion of a genomic subclone (pHCB7-l) of PC λ8. pHCB7-l contained a Bglll to Bglll fragment that encompassed exons VI through VIII. pHCB7-l was digested with Sstll and BamΑl and a 2702 base pair fragment was gel purified. The fragment was designated Fragment D.
A five-part ligation reaction was prepared using NotI and BamΑl digested and linearized Bluescript II KS+ phagemid vector (Stratagene) with Fragment A (5'
NotI to 3' EcoRI) that contained exons I and II, Fragment B (5' EcoRI to 3 'SstI) that contained exons III, IV and V, Fragment C (5' SstI to 3' Sstll) that contained the 5' portion of exon VI and Fragment D (5' Sstll to BamHl) that contained the remaining 3' portion of exon VI and exons VII and VIII.
5
The resulting DΝA was 8950 base pairs and designated GPC 10-1.
GPClO-1 was originally generated with BLG sequences and a NotI site upstream of the ATG initiator codon and modifications to both cleavage sites. A clone, o designated pPC12/BS, was generated to ensure that the 5' NotI site of GPClO-1 would not introduce secondary structure into mRΝA molecules that could hinder translation. pPC12/BS was generated using PCR amplification of a 1 kb ΛOtl- Scαl fragment that covered the 5' region of the protein C gene and contained the wild-type ATG codon environment. This introduced an EcoRV site immediately s downstream of the NotI site, adjacent to the ATG codon, and a BamRl site was incorporated 3' of the Seal site to facilitate cloning. Following a Notl/BamHl digestion, the PCR product was cloned into Notl/BamUl digested Bluescript II KS+ phagemid vector (Strategene). The Notl-EcoRV-Scal fragment present in pPC12/BS was excised, purified and ligated to GPClO-1, which had been o linearized with NotI and partially digested with Seal (the pUC amplillicin gene has an internal Seal site). The resulting clone was designated GPC 10-2 and possesses an EcoRV site immedately upstream of the ATG initiator codon. GPC 10-1 and GPC 10-2 both terminated at the final 5αmHI site in exon VIII of the protein C gene. To reconstitute the 56 bp of sequence, ending at the 5 termination codon, two oligonucleotides were synthesized with flanking 5αmHI (5') and Bglll (3') restriction sites. Following annealing of the oligonucleotides, the product was cloned into 5αmHI digested pBST + to generate plasmid pPC3 ' . pBST+ is a derivative of pBS (Stratagene) with a new polylinker. The addition of the polylinker added Bglll, Xhol, Narl and CZαl restriction sites from the o vector polylinker downstream of the destroyed Bglll site of the oligonucleotide
construct.
The Notl-BamHl fragment of GPC 10-1 was subcloned into NotI/ BamRl digested pPC3' to add 3' coding sequences of protein C, the TAG termination codon followed by Bglll-Xhol-Narl-Clal. The 3' region of the protein C gene beginning with the EcoRV site in intron V was excised from this plasmid on an EcoRV-C/αl fragment.
The EcoRV-EcoRV fragment from GPC 10-2, covering the 5' portion of the protein C gene, and the 5 above EcoRl-Clal fragment covering the 3' portion of the protein C gene were combined between the EcoRV and Clal sites of pMADό to generate pCORP13. This effectively placed a genomic portion of the protein C gene with modified propeptide and two-chain cleavage site under the control of the β-lactoglobulin promoter.
A further genomic construct was generated from pCORP13 which contained only the modified two-chain cleavage site. This was achived using PCR amplification to modify two fragments which result in restoration of the coding capabilitiy of exon 2 from the mutant Gln-Arg-Arg-Lys-Arg to the wild-type Arg-Ile-Arg-Lys- Arg. pCORP13 was used as template for these reaction. The first fragment was
1.3kb, which encompassed the 5' end of the protein C gene up to the 5αmHI site in excn 2. For this reason, the sense primer was designed to add a HmdIII site 5' to the EcoRV site proximal to the ATG initiation codon. The antisense primer was designed to restore the wild-type sequences in exon 2, which included a restored 5αmΗI site. A second fragment of 0.2kb from the 5αmHI site in exon 2 to the Xhol site in intron 2, was amplified. The two fragments were combined in pGΕMII (Promega, Madison, WI) to generate pGΕMOC1.5. A 7.5kb Xhol fragment from pCORP13 was ligated to Xhol digested pGEMPC1.5 to generate a complete protein C genomic sequence covering exons 1-8 with a wild-type propeptide cleavage site and a modified two-chain cleavage site. The plasmid
was designated pGEMPC14. The sequence was excised from pGEMPC14 as a Hindlϊl/Sall fragment. The DNA termini was repaired using a Klenow reaction and the fragment was blunt-end ligated into EcoRV digested pMADό to generate pCORP14.
Plasmid pCORPlό
The modified protein C cDNA (PC962) was excited from the plasmid pCORP9 (see above) as an EcoRV fragment and ligated with EcoRV pMADό. The resulting construct has been named pCORPlό.
The Vector pCASMADό
Plasmid pΕ' 10
The Bovine β-Casein intron 1 (BBCI 1; BOVCASl (5'-AGG CCT ATT CAG CTC CTC CTT CAC TTC TT-3') and BOVCAS2 (5'-GAT ATC GGC TCT CAA TTC CTG GGA ATG GG-3') approximately 2 Kbp) was PCR amplified from dairy cow DNA. The 5' primer incorporates a Stul site and the 3' primer incorporates an EcoRV site. The purified 2 Kb fragment was cloned into the pGΕM-T vector (Promega) to give construct pΕ' 10.
Plasmid pCASMADό pMADό was modified by inserting a linker, containing Spe l/Not I/Sac II sites, into the EcoRV site. Both orientations of the linker were obtained and thus two new cloning vectors were obtained. These were called pMADό/STOPS (5'
Sαdl/Notl/S el 3*) and pMADό/SPOTS (5' S el/Notl/SαcII 3').
BBCI 1 was excised from pΕ' 10 on a SαcII and Spel partial (due to an internal Spel site in the β-Casein intron) and cloned into Sacll/Spel digested pMADό/SPOTS. The new vector was called pCASMADό.
Plasmid pCORI69
The modified protein C cDNA (PC962) was excised from the plasmid pCORP9 (see above) as an EcoRV fragment and ligated with EcoRV digested pCASMADό. This places the AUG translation start downstream with respect to the β-casein intron sequence. The resulting construct was named pCOR169.
The Vector pACTMADό
Plasmid pGΕM-AI Two primers, (Sequences ACTPl 5'-AGG CCT AGT GCC TGC CAC CAG CGC CAG CC-3' ACTP2 5' -GAT ATC CCT GGC AC A GCT TTG TGT GGT TC-3') complementary to the opposing strands of the 3' end of the first exon and the 5' end of the second exon of the murine cardiac actin gene respectively, were used in a PCR reaction to amplify a 0.8 Kb fragment encompassing the intervening sequences from a template of mouse genomic DNA. The two primers introduced a 5' SnaBl and a 3' EcoRV restriction site at the ends of the PCR product. This DNA fragment was cloned in pGΕM-T to give a construct which was named pGΕM-AI. DNA sequence analysis confirmed that the sequence of the amplified product beyond the primers matched that published for the murine beta actin gene.
Plasmid pACTGMADό
The actin intron 1 sequence was excised from pGΕM-AI on a 5' SnαBI- 3 'EcoRV fragment which was then ligated with ΕcoRV digested pMADό to give vector pACTMADό. This cloning step effectively moves the EcoRV site from the 3' end of the BLG promoter downstream to the 3 ' end of the actin gene intron segment.
Plasmid pCORI70
The modified protein C cDNA (PC962) was excised from the plasmid pCORP9 as an EcoRV fragment and ligated with EcoRV digested pACTMADό. This places the AUG translation start downstream with respect to the actin intron sequence.
The resulting construct was named pCOR170.
The Vector pBOB
Plasmid pBOB
PCR primers were designed to amplify the region of the ovine β-casein gene from exon 1 to exon 2 (BOB1: 5'-CGG GAT CCG TCG ACC ATT CAG CTT CTC CTT CAC TTC TTC TC-3'; BOB2: 5'-CGG GAT CCG GGT CCC TAC GTA GGC TCT CGA TTC CTG TGA ATG GGA-3'). The size of this product is 2.1Kbp and has, engineered into it, the sites BamRl/Sall at the 5' end and BamRl/ Ppuml/ SnaBl at the 3' end.
The construction of pBOB toook place in three steps: the 2.1 Kbp PCR fragment (above) was blunt-end cloned into the EcoRV site of pBSIISK+ (Stratagene) to give the plasmid poβ- casΕxl/2. The 6.5Kbp Ppuml fragment from the ovine β- casein gene, containing exons 7 to 9 and 3' flanking sequences, was cloned into the now unique Ppuml site of pOβ- caseΕxl/2. A clone was obtained with the 6.5Kbp fragment in its natural orientation with respect to the first intron and this clone was named pBOBΔprom.
Previously, a Xhol linker had been cloned into the EcoRV site of pMADό and the modified plasmid named pMADX. Finally, the ovine BLG promoter from pMAD6X was cloned into the Sail site of pBOBΔprom as a SaWXhol fragment giving rise to pBOB.
Plasmid pCORB71
The modified protein C cDNA (PC962) was excised from the plasmid pCORP9 as an EcoRV fragment and ligated with EcoRV digested pBOB. This places the AUG translation start downstream with respect to the actin intron sequence. The resulting construct was named pCORB71.
Example 1
Analysis of constructs of the present invention in the expression of protein C in transgenic animals. Results are shown in Tables 1 and 2.
Generation of transgenic animals
Transgenic mice were generated as described in Prunkard et al (Namre Biotechnology, 14:867-871, 1996).
Protein C Assays Human protein C in the milk of transgenic animals was assayed according to the following procedure:
Protein C Standard
Purified human Protein C stored at 50μg/ml in Phosphate Buffered Saline (PBS)/1 % bovine Serum Albumin (BSA) at 20 ° C. Dilute to 500ng/ml in blocking buffer for use. Standard curve range of ELISA is 3.9-125ng/ml.
Blocking buffer
IX PBS 5% Milk powder 0.01 % Tween 20
Wash buffer
IX PBS 0.05% Tween 20
Coating Antibody
Dako Rabbit Anti-human Protein C antibody diluted to lOμg/ml in PBS
Detection Antibody
Dako Rabbit Anti-human Protein C antibody Peroxidase conjugate. Dilute 1/5000 in blocking buffer.
Substrate
TMB 1 Component Peroxidase substrate
Stop Solution
0.2M Sulphuric acid
Method
1. Coat Costar High Binding capacity 96 well plate with 150μl/well of coating antibody. Incubate in a damp box o/n in fridge.
2. Wash wells with wash buffer (3X 200μl/well). 3. Load lOOμl blocking buffer into wells.
4. Dilute samples appropriately in blocking buffer.
Reference human plasma is used as a protective control at 1/40 dilution.
5. Load standard and samples into plate by columns with doubling dilution (lOOμl per well). Standard is loaded in rows 1 and 12 in duplicate. 6. Incubate for 2 hours in damp box in fridge.
7. Wash plate with wash buffer (3X 200μl/well).
8. Load lOOμl/well peroxidase conjugate. Incubate in damp box for 2 hours in fridge.
9. Wash plate with wash buffer (3X 200μl/well). Drain plate. 10. Add lOOμl/well of substrate and leave for 5 minutes.
Stop reaction by addition of lOOμl/well of stop solution.
11. Read plate on plate reader with 650nm filter.
12. Plot standard curve using mean of duplicates (O.D v. log concentration PC) and calculate regression line equation. Use equation to calculate sample values. Data handling can be performed by PC assay programme on Dynex
Revelation software.
Example 2
Expression of AAT from cDNA in constructs according to the present invention.
Transgenic mice were prepared as in Example 1. Analysis of AAT in the milk of transgenic mice was according to standard procedures, for example as described in Wright, G. , Carver, A., Cottom, D., Reeves, D. , Scott, A. , Simons, J.P., Wilmut, I., Garner, I., and Colman A., 1991. High level expression of active human αl antitrypsin in the milk of transgemc sheep. Bio/Technology 9: 830-834.
Results are shown in Table 3.
Example 3.
Expression of antibody fragment from constructs according to the present invention.
Constructs pMADό and pCASMADό were prepared incorporating DNA encoding an antibody binding fragment to give constructs pMAD6-AB and pCASMADό- AB. The constructs were used to obtain transgenic mice according to Example 1. Expression of the antibody fragment was determined by standard protocols.
No expression of the antibody fragment was found in the transgenic mice with pMAD6-AB. Levels ranging from 0 to 129 μg/ml were found in pCASMAD6-AB mice.
The results are given in Table 4.
Example 4
Expression of IgG from constructs according to the present invention. Constructs pMADό and pCASMADό were prepared incorporating DNA encoding IgG to give constructs pMAD6-IgG and pCASMAD6-IgG. The constructs were used to obtain transgenic mice according to Example 1. Expression of IgG in the mice milk was determined by standard ELISA protocol.
Results are given in Table 5.
Example 5
Expression of adhesion molecule (soluble) from constructs according to the present invention.
Constructs pMADό and pCASMADό were prepared incorporating DNA encoding a soluble adhesion molecule (SAM) to give constructs pMADό-SAM and pCASMADό-SAM. Transgenic mice were prepared according to Example 1.
The expression level range (μg/ml) in pCASMADό-SAM transgenic mice was up to 500. The maximum level detected in the pMADό-SAM transgenic mice was 80.
Example 6
Expression of collagen cDNA from constructs according to the present invention.
The CASMAD6 vector was used. Collagen cDNA (human truncated pro-collagen α2(l) homotrimer) was inserted as the DNA for the protein of interest. This was coinjected with two transgenes expressing α and β subunits of prolyl 4- hydro yiase, an enzyme for the post-translational modification of procollagen.
Transgenic animals were obtained as in Example 1. Determination of collagen expression in mouse milk was as described according to standard protocols and described in WO97/08311.
11 transgenic lines bearing the collagen cDNA construct and the two prolyl 4- hydroxlase trans genes were analysed. Three lines were found to express procollagen α2(l) homotrimer protein. The amount of collagen present was estimated by measurement of hydroxyproline content and Western analysis in comparison with bovine collagen standard. Three independently derived mouse lines were found to express detectable amounts of collagen. The levels present in milk of these three lines was estimated as: lOμg/ml, 30μg/ml, and 120-240μg/ml. Collagen protein was absent from the milk of non- transgenic mice. Milk from the highest expressing line was analysed further and the procollagen present was found to have formed a correctly aligned triple helical molecule.
These results demonstrate secretion of relatively high levels of recombinant procollagen in the milk of transgenic mice by expression of cDNA under the control of the β-lactoglobulin promoter.
From the foregoing it will be appreciated that although specific embodiments of the invenuon have been described herein for purposes of illustration, various modifications may be made without deviating from the spirit and scope of the invention. Accordingly, the invention is not limited. All documents and papers cited or mentioned herein are fully incorporated by reference.
TABLE 1
Data supporting the improved expression from constructs according to the invention
*refers to different protein C cDNAs
Table 2: Protein C Expression data
O) x m m
H
3Ϊ
C r m- t
O)
TABLE 3
Expression of cDNA Constructs
CO
ID CO
H
H
C H 0 rπ
CO x m m
H c m t
TABLE 4
pMAD6-AB
pCASMADό- AB
TABLE 5
Expression Levels of IgG in Transgenic
Mouse Milk
Claims
1. A nucleic acid expression construct comprising: (a) a promoter; (b) an intron whose natural position is within the 5 '-untranslated region of a gene from which it is derived;
(c) a coding sequence; and
(d) a 3 '-flanking sequence wherein the intron (b) is not derived from the same gene as that from which either the promoter (a) or the protein-coding sequence (c) is derived.
2. An expression construct as claimed in claim 1 wherein the promoter is a gene promoter which drives expression of the coding sequence (c) in mammalian cells, in particular, a milk protein promoter.
3. An expression construct as claimed in claim 1 or claim 2 wherein the intron (b) is the first intron from a gene where the intron is namrally located entirely within the 5' untranslated region of the gene.
4. An expression construct as claimed in 1, 2, or 3, wherein the intron (b) is the first intron from a gene which is a member of a family of genes where the intron is namrally located entirely within the 5' untranslated region of the gene.
5. An expression construct as claimed in any one of claims 1 to 4 wherein the intron is from the casein gene family.
6. An expression construct as claimed in any one of claims 1 to 4 wherein the intron is from the actin gene family.
7. An expression construct as claimed in any one of claims 1 to 6 wherein the 3' flanking sequence is any sequence which supports the correct transcription termination, mRNA 3' end processing, mRNA stabilisation, mRNA transport from the nucleus to cytoplasm and mRNA translation.
8. An expression construct as claimed in any one of claims 1 to 7 wherein the 3 'flanking sequence is a poly-A site or a ╬▓-lactoglobulin gene 3 '-sequence beginning 3' to the namral ╬▓-lactoglobulin stop codon and continuing to at least about 50 bases 3' of the poly-A site.
9. An expression construct as claimed in any one of claims 1 to 7 wherein the 3 '-flanking sequence is a ╬▓-casein gene 3 '-sequence beginning 3' to the namral ╬▓- casein stop codon and continuing to at least about 50 bases 3' of the poly-A site.
10. A process for the preparation of a host organism the process comprising introducing an expression construct, as claimed in any one of claims 1 to 9, into a suitable organism.
11. A process as claimed in claim 10 wherein the suitable organism is a prokaryote, a fungi, a plant, an animal, or a eukaryotic cell.
12. A process as claimed in claim 11 wherein the animal is a non-human mammal.
13. A host organism incorporating a DNA expression construct as claimed in any one of claims 1 to 9.
14. A host organism as claimed in claim 13 which is a procaryote (eg. E.colϊ), a fungi, a plant, an animal or a eukaryotic cell.
15. A host organism, as claimed in claim 14, wherein the animal is a non- human mammal.
16. A process of preparing a protein, the process comprising allowing an expression host to express a DNA expression construct as claimed in any one of claims 1 to 9.
17. A process as claimed in claim 16, further including a process of purifying the pi oiein.
18. A process as claimed in claim 15 or claim 16 wherein the protein is protein C, fibrinogen, AAT or collagen.
19. A process as claimed in claim 16 or claim 17 wherein the expression host is a prokaryote, a fungi, a plant, an animal or a eukaryotic cell.
20. A process, as claimed in claimed 19, wherein the animal is a non-human mammal.
21. A protein prepared by a process as claimed in any one of claims 16 to 20.
22. The use of a nucleic acid expression construct comprising a promoter, an intron whose namral position is within the 5' untranslated region of a gene from which it is derived, a coding sequence and a 3' flanking sequence to obtain a transgenic host.
23. The use of nucleic acid construct comprising a promoter, an intron whose namral position is within the 5' untranslated region of a gene from which it is derived, a coding sequence and a 3' flanking sequence to increase the likelihood of expression of the coding sequence from a transgenic host which incorporates the nucleic acid construct.
24. A process for improving the number of transgenic hosts which express a transgene coding sequence, the process comprising introducing into the host a nucleic acid construct comprising a promoter, an intron whose namral position is within the 5' untranslated region of a gene from which it is derived, a coding sequence and a 3' flanking sequence.
25. The use as claimed in any one of claims 22 or 23 or a process as claimed in claim 24 wherein the construct is as set out in any one of claims 1 to 9.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB9715064 | 1997-07-17 | ||
GB9715064A GB9715064D0 (en) | 1997-07-17 | 1997-07-17 | Protein expression |
PCT/GB1998/002130 WO1999003981A1 (en) | 1997-07-17 | 1998-07-17 | Nucleic acid expression constructs incorporating a heterologous intron whose natural position is within the 5'untranslated region of its gene |
Publications (1)
Publication Number | Publication Date |
---|---|
EP0996715A1 true EP0996715A1 (en) | 2000-05-03 |
Family
ID=10816004
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP98935143A Withdrawn EP0996715A1 (en) | 1997-07-17 | 1998-07-17 | Heterologous expression of proteins by "rescued" vector comprising an intron |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP0996715A1 (en) |
AU (1) | AU8450298A (en) |
GB (1) | GB9715064D0 (en) |
WO (1) | WO1999003981A1 (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
PT1159415E (en) | 1999-03-04 | 2010-03-31 | Revivicor Inc | Genetic modification of somatic cells and uses thereof |
AU2004279991B2 (en) * | 2003-10-10 | 2010-11-25 | Powderject Vaccines, Inc. | Nucleic acid constructs |
AU2012348332B2 (en) * | 2011-12-08 | 2017-06-29 | Haifeng Chen | Vectors harboring toxic genes, methods and uses therefor |
US10610606B2 (en) | 2018-02-01 | 2020-04-07 | Homology Medicines, Inc. | Adeno-associated virus compositions for PAH gene transfer and methods of use thereof |
EP3755795A4 (en) | 2018-02-19 | 2022-07-20 | Homology Medicines, Inc. | Adeno-associated virus compositions for restoring f8 gene function and methods of use thereof |
TW202140791A (en) | 2020-01-13 | 2021-11-01 | 美商霍蒙拉奇醫藥公司 | Methods of treating phenylketonuria |
TW202208632A (en) | 2020-05-27 | 2022-03-01 | 美商同源醫藥公司 | Adeno-associated virus compositions for restoring pah gene function and methods of use thereof |
CN118086341B (en) * | 2024-04-25 | 2024-08-02 | 上海凌医生物科技有限公司 | Expression cassette for high expression of human glucocerebrosidase gene in liver |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0174608A1 (en) * | 1984-09-13 | 1986-03-19 | The Board Of Trustees Of The Leland Stanford Junior University | Beta-actin gene and regulatory elements, preparation and use |
GB8615942D0 (en) * | 1986-06-30 | 1986-08-06 | Animal & Food Research Council | Peptide production |
EP0832981A1 (en) * | 1987-02-17 | 1998-04-01 | Pharming B.V. | DNA sequences to target proteins to the mammary gland for efficient secretion |
GB8717430D0 (en) * | 1987-07-23 | 1987-08-26 | Celltech Ltd | Recombinant dna product |
JPH02261386A (en) * | 1989-03-31 | 1990-10-24 | Kyowa Hakko Kogyo Co Ltd | Recombinant vector |
US6270989B1 (en) * | 1991-11-05 | 2001-08-07 | Transkaryotic Therapies, Inc. | Protein production and delivery |
US5298422A (en) * | 1991-11-06 | 1994-03-29 | Baylor College Of Medicine | Myogenic vector systems |
AU672409B2 (en) * | 1992-04-30 | 1996-10-03 | Baylor College Of Medicine | Development of a vector to target gene expression to the epidermis of transgenic animals |
AU4537393A (en) * | 1992-06-15 | 1994-01-04 | Gene Pharming Europe Bv | Production of recombinant polypeptides by bovine species and transgenic methods |
US5639940A (en) * | 1994-03-03 | 1997-06-17 | Pharmaceutical Proteins Ltd. | Production of fibrinogen in transgenic animals |
IL115873A0 (en) * | 1995-11-03 | 1996-01-31 | Peri Dev Applic 1985 Ltd | Transgenic protein production |
WO1997020043A1 (en) * | 1995-11-30 | 1997-06-05 | Zymogenetics, Inc. | Protein c production in transgenic animals |
-
1997
- 1997-07-17 GB GB9715064A patent/GB9715064D0/en active Pending
-
1998
- 1998-07-17 AU AU84502/98A patent/AU8450298A/en not_active Abandoned
- 1998-07-17 WO PCT/GB1998/002130 patent/WO1999003981A1/en not_active Application Discontinuation
- 1998-07-17 EP EP98935143A patent/EP0996715A1/en not_active Withdrawn
Non-Patent Citations (1)
Title |
---|
See references of WO9903981A1 * |
Also Published As
Publication number | Publication date |
---|---|
WO1999003981A1 (en) | 1999-01-28 |
WO1999003981A8 (en) | 1999-04-22 |
GB9715064D0 (en) | 1997-09-24 |
AU8450298A (en) | 1999-02-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA1340723C (en) | Production of exogenous peptide in milk | |
US5994616A (en) | Targeted synthesis of protein in mammary gland of a non-human transgenic mammal | |
US5965789A (en) | Engineering protein posttranslational modification by PACE/furin in transgenic non-human mammals | |
JPH06506105A (en) | Homologous recombination in mammalian cells | |
JPH09506779A (en) | Transformative production of antibodies in milk | |
US20100305039A1 (en) | Production of collagen in the milk of transgenic mammals | |
US5648243A (en) | Human serum albumin expression construct | |
EP0599978A1 (en) | Gene encoding a human beta-casein process for obtaining the protein and use thereof in an infant formula | |
JP4523165B2 (en) | Transgenic and cloned mammals | |
Hennighausen et al. | Transgenic animals—production of foreign proteins in milk | |
AU661290B2 (en) | Increased expression by a second transferred sequence in transgenic organisms | |
EP0996715A1 (en) | Heterologous expression of proteins by "rescued" vector comprising an intron | |
US7057086B2 (en) | Therapeutic methods employing PAI-1 inhibitors and transgenic non-human animal for screening candidate PAI-1 inhibitors | |
Gutiérrez et al. | Expression of a bovine κ-CN cDNA in the mammary gland of transgenic mice utilizing a genomic milk protein gene as an expression cassette | |
US5714345A (en) | Increased expression of a gene by a second transferred mammary gland specific sequence transgenic | |
WO2001022810A2 (en) | Transgenic animals expressing von willebrand factor (vwf) and vwf-related polypeptides | |
EP1181351B1 (en) | Method of purifying heterologous proteins | |
US20020157127A1 (en) | Identification and purification of higher order transcription complexes from transgenic non-human animals | |
JP2017506676A (en) | Treatment of hereditary angioedema with C1 inhibitors | |
JPH11509404A (en) | Creating post-translational modifications of proteins in transgenic animals | |
MXPA97008516A (en) | Engineering protein posttranslational modification in transgenic organisms |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20000111 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 20030201 |