GB1568047A - Purification of nucleotide sequences suitable for expression in bacteria - Google Patents

Purification of nucleotide sequences suitable for expression in bacteria Download PDF

Info

Publication number
GB1568047A
GB1568047A GB22418/78A GB2241878A GB1568047A GB 1568047 A GB1568047 A GB 1568047A GB 22418/78 A GB22418/78 A GB 22418/78A GB 2241878 A GB2241878 A GB 2241878A GB 1568047 A GB1568047 A GB 1568047A
Authority
GB
United Kingdom
Prior art keywords
dna
sequence
fragments
specific
fragment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired
Application number
GB22418/78A
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of California
Original Assignee
University of California
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of California filed Critical University of California
Publication of GB1568047A publication Critical patent/GB1568047A/en
Expired legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/575Hormones
    • C07K14/61Growth hormone [GH], i.e. somatotropin
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/575Hormones
    • C07K14/57518Placental lactogen; Chorionic somatomammotropin
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/575Hormones
    • C07K14/62Insulins
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1096Processes for the isolation, preparation or purification of DNA or RNA cDNA Synthesis; Subtracted cDNA library construction, e.g. RT, RT-PCR

Landscapes

  • Health & Medical Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Genetics & Genomics (AREA)
  • Zoology (AREA)
  • Endocrinology (AREA)
  • Molecular Biology (AREA)
  • Biochemistry (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Toxicology (AREA)
  • Medicinal Chemistry (AREA)
  • Gastroenterology & Hepatology (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biotechnology (AREA)
  • Wood Science & Technology (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Microbiology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Plant Pathology (AREA)
  • Pregnancy & Childbirth (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Physics & Mathematics (AREA)
  • Diabetes (AREA)
  • Enzymes And Modification Thereof (AREA)
  • Saccharide Compounds (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Micro-Organisms Or Cultivation Processes Thereof (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Peptides Or Proteins (AREA)

Description

(54) PURIFICATION OF NUCLEOTIDE SEQUENCES SUITABLE FOR EXPRESSION IN BACTERIA (71) We, THE REGENTS OF THE UNIVERSITY OF CALIFORNIA. an educational institution chartered by the State of California, U.S.A., having a principal place of business at 2200 University Avenue, Berkeley, State of California 94720, United States of America, do hereby declare the invention for which we pray that a patent may be granted to us, and the method by which it is to be performed, to be particularly described in and by the following statement: The invention relates to a method of purifying a fragment of a specific deoxyribonucleotide sequence, to recombinant DNA transfer vectors comprising such a fragment and microorganism strains containing such a fragment or such a DNA transfer vector.
Proteins and peptides are synthesized in almost endless variety by living organisms. Many have proven to have medical, argricultural or industrial utility. Some proteins are enzymes, useful as specific catalysts for complex chemical reactions. Others function as hormones, which act to affect the growth or development of an organism or to affect the function of specific tissues in medically significant ways. Specific binding proteins may have commercial significance for the isolation and purification of trace substances and for the removal of contaminating substances. Both proteins and peptides are composed of linear chains of amino acids, the latter term being applied to short, single-chain sequences, the former referring to long-chain and multi-chain substances. The principles of the present invention apply equally to both proteins and peptides.
Proteins and peptides are generally high molecular weight substances, each having a specific sequence of amino acids. Except for the smaller peptides, chemical synthesis of peptides and proteins is frequently impractical, costly and time consuming, if not impossible. In the majority of instances, in order to make practical use of a desired protein, it must first be isolated from the organism which makes it. Frequently, the desired protein is present only in miniscule amounts. Often, the source organism cannot be obtained in quantities sufficient to provide an adequate amount of the desired protein. Consequently, many potential argircultural, industrial and medical applications for specific proteins are known, but remain undeveloped simply because an adequate supply of the desired protein or peptide does not exist.
Recently developed techniques have made it possible to employ microorganisms, capable of rapid and abundant growth, for the synthesis of commercially useful proteins and peptides, regardless of their source in nature. These techniques make it possible to genetically endow a suitable microorganism with the ability to synthesize a protein or peptide normally made by another organism. The technique makes use of a fundamental relationship which exists in all living organisms between the genetic material. usually DNA, and the proteins synthesized by the organism. This relationship is such that the amino acid sequence of the protein is reflected in the nucleotide sequence of the DNA. There are one or more trinucleotide sequence groups specifically related to each of the twenty amino acids most commonly occurring in proteins. The specific relationship between each given trinucleotide sequence and its corresponding amino acid constitutes the genetic code. The genetic code is believed to be the same or similar for all living organisms. As a consequence, the amino acid sequence of every protein or peptide is reflected by a corresponding nucleotide sequence, according to a well understood relationship. Furthermore, this sequence of nucleotides can, in principle, be translated by any living organism.
TABLE 1 Genetic code Phenylalanine(Phe) TTK Histidine(His) CAK Leucine(Leu) XTY Glutamine(Gln) CAJ Isoleucine(Ile) ATM Asparagine(Asn) AAK Methionine(Met) ATG Lysine(Lys) AAJ Valine(Val) GTL Aspartic acid(Asp) GAK Serine(Ser) QRS Glutamic acid(GLU) GAJ Proline(Pro) CCL Cysteine(Cys) TGK Threonine(Thr) ACL Tryptophan(Try) TGG Alanine(Ala) GCL Arginine(Arg) WGZ Tyrosine(Tyr) TAK Glycine(Gly) GGL Termination signal TAJ Termination signal TGA Key: Each 3-letter triplet represents a trinucleotide of DNA, having a 5' end on the left and a 3' end on the right. The letters stand for the purine or pyrimidine bases forming the nucleotide sequence.
A =adenine G = guanine C = cytosine T = thymine J = A or G K = T or C L = A, T, C or G M = A, C or T X = T or C if Y is A or G X = C if Y is C or T Y = A, G, C or T if X is C Y = A or G if X is T W = C or A if Z is A or G W= C if Z is C or T Z = A, G, C or T if W is C Z = A or G if W is A QR = TC if S is A, G, C or T QR = AG if S is T or C S = A, G, C or T if QR is TC S = T or C if QR is AG The trinucleotides of Table 1, termed codons, are presented as DNA trinucleotides, as they exist in the genetic material of a living organism. Expression of these codons in protein synthesis requires the intermediate formation of messenger RNA (mRNA), as described more fully, infra. The mRNA codons have the same sequences as the DNA codons of Table 1, except that uracil (U) is found in place of thymine. Complementary trinucleotide DNA sequences having opposite strand polarity are functionally equivalent to the codons of Table 1, as is understood in the art. An important and well known feature of the genetic code is its redundancy, whereby, for most of the amino acids used to make proteins, more than one coding nucleotide triplet may be employed. Therefore, a number of different nucleotide sequences may code for a given amino acid squence. Such nucleotide sequences are considered functionally equivalent since they can result in the production of the same amino acid sequence in all organisms, although certain strains may translate some sequences more efficiently than they do others. Occasionally, a methylated variant of a purine or pyrimidine may be found in a given nucleotide sequence. Such methylations do not affect the coding relationship in any way.
In its basic outline. a method of endowing a microorganism with the ability to synthesize a new protein involves three general steps: (1) isolation and purification of the specific gene or nucleotide sequence containing the genetically coded information for the amino acid sequence of the desired protein, (2) recombination of the isolated nucleotide sequence with an appropriate transfer vector, typically the DNA of a bacteriophage or plasmid, and (3) transfer of the vector to the appropriate microorganism and selection of a strain of the recipient microorganism containing the desired genetic information.
A fundamental difficulty encountered in attempts to exploit commercially the abovedescribed general process lies in the first step, the isolation and purification of the desired specific genetic information. DNA exists in all living cells in the form of extremely high molecular weight chains of nucleotides. A cell may contain more than 10,000 structural genes, coding for the amino acid sequences of over 10,000 specific proteins, each gene having a sequence many hundreds of nucleotides in length. For the most part, four different nucleotide bases make up all the existing sequences. These are adenine (A), guanine (G), cytosine (C), and thymine (T). The long sequences comprising the structural genes of specific proteins are consequently very similar to overall chemical composition and physical properties. The separation of one such sequence from the plethora of other sequences present in isolated DNA cannot ordinarily be accomplished by conventional physical and chemical preparative methods.
Messenger RNA functions in the process of converting the nucleotide sequence information of DNA into the amino acid sequence structure of a protein. In the firsrt step of this process, termed transcription, a local segment of DNA having a nucleotide sequence with specifies a protein to be made, is first copied into RNA. RNA is a polynucleotide similar to DNA except that ribose is substituted for deoxyribose and uracil is used in place of thymine. The nucleotide bases in RNA are capable of entering into the same kind of base pairing relationships that are known to exist between the complementary strands of DNA.
A and U (T) are complementary, and G and C are complementary. The RNA transcript of a DNA nucleotide sequence will be complementary to the copied sequence. Such RNA is termed messenger RNA (mRNA) because of its status as intermediary between the genetic apparatus of the cell and its protein synthesizing apparatus. Generally, the only mRNA sequences present in the cell at any given time are those which correspond to proteins being actively synthesized at that time. Therefore, a differentiated cell whose function is devoted primarily to the synthesis of a single protein will contain primarily the RNA species corresponding to that protein. In those instances where it is feasible, the isolation and purification of the appropriate nucleotide sequence coding for a given protein can be accomplished by isolating mRNA coding for the specialized synthesis of such protein in differentiated cells.
A major disadvantage of the foregoing procedure is that it is applicable only in the relatively rare instances where cells can be found engaged in synthesizing primarily a single protein. The majority of proteins of commercial interest are not synthesized in such a specialized way. The desired proteins may be one of a hundred or so different proteins being produced by the cells of a tissue or organism at a given time. Nevertheless, the mRNA isolation technique is potentially useful since the set of RNA species present in the cell usually represents only a fraction of the total sequences existing in the DNA, and thus provides an initial purification. In order to take advantage of such purification, however, a method is needed whereby sequences present in low frequencies, such as a few percent, can be isolated in high purity.
The present invention provides a method whereby nucleotide sequences can be isolated and purified even when present at a frequency as low as 2% of a heterogeneous population of mRNA sequences. Furthermore, the method may be combined with known methods of fractionating mRNA to isolate and purify sequences present in even lower frequency in the total RNA population as initially isolated. The method is generally applicable to mRNA species extracted from virtually any organism and is therefore expected to provide a powerful basic tool for the ultimate production of proteins of commercial and research interest, in useful quantities.
Human growth hormone has medical utility in the treatment of defective pituitary function. Animal growth hormones have commercial utility in veterinary medicine and in agriculture, particularly in the case of animals used as food sources, where large size and rapid maturation are desirable attributes. Human chorionic somatomammotropin is of medical significance because of its role in the floetal maturation process.
The method of the present invention takes advantage of certain structural features of mRNA and DNA, and makes use of certain enzyme catalyzed reactions. The nature of these reactions and structural details as they are understood in the prior art are described herewith. The symbols and abbreviations used herein are set forth in the following table: TABLE 2 DNA - deoxyribonucleic acid A - Adenine RNA - ribonucleic acid T - Thymine cDNA - complementary DNA G - Guanine (enzymatically synthesized) C - Cytosine from an mRNA sequence) U - Uracil mRNA - messenger RNA Tris - 2-Amino-2-hydroxyethyldATP - deoxyadenosine triphosphate 1,3-propanediol dGTP - deoxyguanosine triphosphate EDTA - ethylenediamine dCTP - deoxycytidine triphosphate tetraacetic acid HCS - Human Chorionic Somatomammo- ATP - adenosine triphosphate tropin dTTP - tymidine triphosphate TCA - Trichloroacetic acid HGH - Human Growth Hormone As a consequence of the known base-pairing relationships that govern DNA replication and transcription, isolating the mRNA which contains the nucleotide sequence coding for the amino acid sequence of a particular protein is equivalent to the isolation of the same sequence, or gene, from the DNA itself. If the mRNA is retranscribed to form DNA complementary thereto (cDNA), the exact DNA sequence is thereby reconstituted and can, by appropriate techniques, be inserted into the genetic material of another organism.
The two complementary versions of a given sequence are therefore interconvertible, and functionally equivalent to each other.
The nucleotide subunits of DNA and RNA are linked together by phosphodiester bonds between the 5' position of one nucleotide sugar and the 3' position of its next neighbor.
Reiteration of such linkages produces a linear polynucleotide which has polarity in the sense that one end can be distinguished from the other. The 3' end may have a free 3'-hydroxyl, or the hydroxyl may be substituted with a phosphate or a more complex structure. The same is true of the 5' end. In eucaryotic orgamsms i.e., those having a defined nucleus and mitotic apparatus, the synthesis of functional mRNA usually includes the addition of polyadenylic acid to the 3' end of the mRNA. Messenger RNA can therefore be separated from other classes of RNA isolated from an eucaryotic organism by column chromatography on cellulose to which is attached olythymidylic acid. See Aviv, H., and Leder, P., Proc. Nat. Acad. Sci. U.S.A. 69, 1408 (1972). Other chromatographic methods, exploiting the base-pairing affinity of poly A for chromatographic packing materials containing oligo dT. poly U, or combinations of poly T and poly U, for example, poly U-Sepharose, are likewise suitable ("Sepharose" is a trademark).
Reverse transcriptase catalyzes the synthesis of DNA complementary to an RNA template strand in the presence of the RNA template, a primer which may be any complementary oligo nucleotide or polynucleotide having a 3'-hydroxyl, and the four deoxynucleoside triphosphates, dATP, dGTP, dCTP, and dTTP. The reaction is initiated by the non-convalent association of the oligodeoxynucleotide primer near the 3' end of mRNA followed by stepwise addition of the appropriate deoxynucleotides, as determined by base-pairing relationships with the mRNA nucleotide sequence, to the 3' end of the growing chain. The product molecule may be described as a hairpin structure in which the original RNA is paired by hydrogen bonding with a complementary strand of DNA partly folded back upon itself at one end. The DNA and RNA strands are not covalently joined to each other. Reverse transcriptase is also capable of catalyzing a similar reaction using a single-stranded DNA hairpin having a loop of single-stranded DNA joining one set of ends.
See Avid, H. and Leder, P., Proc. Nat'l. Acad. Sci. U.S.A. 69, 1408 (1972) and Efstratiadis, A.. Kafatos, F.C., Maxam, A.M.. and Maniatis, T.. Cell 7, 279 (1976).
Restriction endonucleases are enzymes capable of hydrolyzing phosphodiester bonds in DNA, thereby creating a break in the continuity of the DNA strand. If the DNA is in the form of a closed loop, the loop is converted to a linear structure. The principal feature of a restriction enzyme is that its hydrolytic action is exerted only at a point where a specific nucleotide sequence occurs. Such a sequence herein is termed the restriction site or recognition site for the restriction endonuclease. Restriction endonucleases from a variety of sources have been isolated and characterized in terms of the nucleotide sequence of their restriction sites. When acting on doublestranded DNA, some restriction endonucleases hydrolyze the phosphodiester bonds on both strands at the same point. producing blunt ends. Others catalyze hydrolysis of bonds separated by a few nucleotides from each other, producing free single-stranded regions at each end of the cleaved molecule. Such single-stranded ends are self-complementary, hence cohesive, and may be used to rejoin the hydrolyzed DNA. Since any DNA susceptible to cleavage by such an enzyme must contain the same recognition site, the same cohesive ends will be produced, so that it is possible to join heterogeneous sequences of DNA which have been treated with restriction endonuclease to other sequences similarly treated. See Roberts, R.J., Crit. Rev. Biochem 4, 123 (1976).
It has been observed that restriction sites for a given enzyme are relatively rare and are nonuniformly distributed. Whether a specific restriction site exists within a given segment is a matter which must be empirically determined. However, there is a large and growing number of restriction endonucleases, isolated from a variety of sources with varied site specificity, so that there is a reasonable probability that a given segment of a thousand nucleotides will contain one or more restriction sites.
The present invention now provides a method for purifying a fragment of a specific deoxyribonucleotide sequence for recombination with a DNA transfer vector, and transfer to a microoganism, from a population of polyribonucleotides heterogeneous in length and sequence, comprising: (a) providing a population of cDNA transcripts of the polyribonucleotides, at least a portion of said cDNA transcripts having at least two restriction sites, (b) subjecting the cDNA transcripts to the action of a restriction endonuclease preparation capable of catalyzing the hydrolysis of the cDNA transcripts at each of the two restriction sites in order to produce a fragment of a specific deoxyribonucleotide sequence which is homogeneous in length, (c) fractionating the fragments produced by restriction endonuclease action according to their length, thereby separating the homogeneous length fragments of the specific deoxyribonucleotide sequence from cDNA transcripts of different length, whereby a fragment of the specific deoxyribonucleotide sequence is purified for recombination with a DNA transfer vector and transfer to a microorganism.
The invention also provides a method of further purifying linear DNA of specific nucleotide sequence containing a restriction site from a population of essentially homogeneous length fragments of specific deoxyribonucleotide sequence, prepared according to the aforesaid method of this invention, comprising: (d) treating the DNA in said fragments to remove any 5' phosphate end groups, (e) incubating the DNA with a restriction endonuclease capable of acting on the specific desired nucleotide sequence to produce two linear sub-fragments thereof, (f) fractionating the sub-fragments according to their length.
(g) isolating the two sub-fragments, (h) rejoining the sub-fragments covalently, in a DNA ligase catalyzed reaction, to reconstitute the original specific sequence, and (i) fractionating the rejoined DNA molecules according to their length, whereby the specific sequence is purified from a population of DNA molecules of essentially homogeneous length, for recombination with a DNA transfer vector and transfer to a microorganism.
The invention also provides a recombinant DNA transfer vector comprising a fragment of a specific deoxyribonucleotide sequence prepared by either of the aforesaid methods.
The invention also provides a microorganism strain modified to contain (a) such a DNA transfer vector or (b) a fragment of a specific deoxyribonucleotide sequence prepared by either of the aforesaid purification methods.
A particular recombinant DNA transfer vector. according to the invention, comprising codons for HCS, comprises the nucleotide sequence: 5'-GGCL24ATM25GAK26ACL27TAK28CAJ29GAJ30TTK31GAK32 GAJ33ACL34TAK35ATM36CCL37AAJ38GAK39CAJ40AAJ41TAK42 QR43S43TTK44X45TY45CAK46GAK47QR48S48CAJ49ACL50QR51 S51TTK52TGK53TTK54QR55S55GAK56QR57S57ATM58CCL59 ACL60CCL61QR62S62AAK63ATG64GAJ65GAJ66ACL67CAJ68CAJ69 AAJ70QR71S71AAK72X73TY73GAJ74X75TY75X76TY76W77GZ77 ATM78QR79S79X80TY80X81TY81X82TY82ATM83GAJ84QR85S85 TGG86X87TY87GAJ88CCL89GTL90W91GZ91TTK92X93TY93W94 GZ94QR95S95ATG96TTK97GCL98AAK99AAk100X101TY101GTL102 TAK103GAK104ACL105QR106S106GAK107QR108S108GAK109 GAK110TAK111CAK112X113TY113TY114AAJ115GAK116 X117TY117GAJ118GAJ119GGL120ATM121CAJ122ACL123X124 TY124ATG125GGL126W127GZ127X128TY128GAJ129GAK130 GGL131QR132S132W133GZ133W134GZ134ACL135GGL136CAJ137 ATM138X139TY139AAJ140CAJ141ACL142TAK143OR144S144 AAJ145TTK146GAK147ACL148AAK149QR150S150CAK151 AAK152CAK153GAK154GCL155X156TY156X157TY157AAJ158 AAK159TAK160GGL161X162TY162X163TY163TAK164TGK165 TTK166W167GZ167AAJ168GAK169ATG170GAK171AAJ172 GTL173GAJ174ACL175TTK176X177TY177W178GZ178ATG179 GTL180CAJ181TGK182W183GZ183QR184S184GTL185GAJ186 GGL187QR188S188TGK189GGL190TTK191 TAGGTGCCCGAGTAGCATCCTGTGACCCCTCCCCAGTGCCTCTCCTGGCC - 3' wherein A is deoxyadenyl, G is deoxyguanyl, C is deoxycytosyl, T is thymidyl, J is A or G; K is T or C: L is A, T, C or G; M is A, C or T; Xn is T or C, if Yn is A or G, and C if Yn is C or T; Yn is A, G, C or T, if Xn is C, and A or G if Xn is T; Wn is C or A, if Zn is G or A, and C if Zn is C or T; Zn is A, G, C or T, if Wn is C, and A or G if Wn is A; QRn is TC, if Sn is A, G, C or T, and AG if Sn is T or C; Sn is A, G, C or T, if QRn is TC, and T or C if QRn is AG and subscript numerals, n, refer to the amino acid position in human chorionic somatomammotropin, for which the nucleotide sequence corresponds, according to the genetic code, the amino acid positions being numbered frm the amino end.
This DNA transfer vector may contain a plasmid, for example pBR-322. A suitable microorganism that can be modified to contain the DNA transfer vector is a bacterium, e.g.
Escherichia coli X-1776.
By means of this invention. the purification of cDNA of desired nucleotide sequence complementary to an individual mRNA species can be effected. The method employs restriction endonucleas cleavage of cDNA transcribed from a complex mixture of mRNA.
The method does not require any extensive purification of RNA but instead makes use of transcription of RNA into cDNA. the sequence specific fragmentation of this cDNA with one or two restriction endonucleases. and the fractionation of the cDNA restriction fragments on the basis of their length. The use of restriction endonucleases eliminates size heterogeneity and produces homogeneous length DNA fragments from any cDNA species which contains at least two restriction sites. From the initially heterogeneous population of cDNA transcripts. uniform size fragments of desired sequence are produced. The fragments may be several hundred nucleotides in length and may in some instances include the entire structural gene for the desired protein. The length of the fragments depends on the number of nucleotides separating the restricton sites and will usually be different from different regions of DNA. Fractionation by length enables purification of a homogeneous population of fragments having the desired sequence. The fragments will be homogeneous in size annd highly pure in terms of nucleotide sequennce. Current separation and analysis methods enable the isolation of such fragments from a corresponding mRNA species representing at least 2% of the mass of the RNA transcribed. The use of prior art RNA fractionation methods to prepurify the mRNA before the transcription will result in lowering the actual lower limit of detection to less than 2% of the total mRRNA isolated from the organism.
Specific sequences purified by the procedure outlined above may be further purified by a second specific cleavage with a restriction endonuclease capable of cleaving the desired sequence at an internal site. This cleavage results in formation of two sub-fragments of the desired sequence, separable on the basis of their lengths. The sub-fragments are separated from uncleaved and specifically cleaved contaminating sequences havingg substantially the same original size. The method is found upon the rarity and randomness of placement of restriction endonuclease recognition sites, which results in an extremely low probability that a contaminant having the same original length will be cleaved by the same enzyme to yield fragments having the same length as those yielded by the desired sequence. After preparation from the contaminants, the sub-fragments of the desired sequence may be rejoined using techniques known in the art, to reconstitute the original sequence. The two subfragments must be prevented from joining together in the reverse order of their original sequence. A method is disclosed whereby the sub-fragments can only join to each other in the proper order.
The above-recited methods may be used in combination with the appropriate labelling techniques to obtain accurate, quantitative measurements of the purity of the isolated sequences. The combined techniques have been applied to produce a known nucleotide sequence with greater than 99% purity.
The cDNA isolated and purified by the described methods may be recombined with a suitable transfer vector and transferred to a suitable host microorganism by known methods. Novel plasmids have been produced, containing the nucleotide sequences coding for the major portions of human chorionic somatomammotropin and human growth hormone, respectively. Novel microorganisms have been produced having as part of their genetic makeup the major porton of HCS and the major portion of HGH, respectively. The disclosed techniques may be used for the isolation and purification of growth hormones from other animal species and for the construction of novel transfer vectors and microorganisms containing these genes.
The present invention may employ, as starting material, a polyadenylated, crude or partially purified messenger RNA, which may be heterogeneous in sequence and in molecular size. The selectivity of the RNA isolation procedure is enhanced by any method which results in an enrichment of the desired mRNA in the heterodisperse population of mRNA isolated. Any such prepurification method may be employed in conjunction with the method of the present invention, provided the method does not introduce endonucleolytic cleavage of the mRNA. An important initial consideration is the selection of an appropriate source tissue for the desired mRNA. Often. this choice will be dictated by the fact that the protein ultimately to be produced is only made by a certain specialized tissue of a differentiated organism. Such is the case. for example, with the peptide hormones, such as growth hormone of HCS. In other cases, it will be found that a variety of cell types or microbial species can serve as a source of tthe desired mRNA. In those cases, some preliminary experimentation will be necessary in order to determine the optimal source.
Frequently, it will be found that the proportion of desired mRNA can be increased by taking advantage of cellular responses to environmental stimuli. For example, treatment with a hormone may cause increased production of the desired mRNA. Other techniques include growth at a particular temperature and exposure to a specific nutrient or other chemical substance.
Prepurification to enrich for desired mRNA sequences may also be carried out using conventional methods for fractionating RNA, after its isolation from the cell. Any technique which does not result in degradation of the RNA may be employed. The techniques of preparative sedimentation in a sucrose gradient and gel electrophoresis are especially suitable.
The mRNA must be isolated from the source cells under conditions which preclude degradation of the mRNA. The action of RNase enzymes is particularly to be avoided because these enzymes are capable of hydrolytic cleavage of the RNA nucleotide sequence.
The hydrolysis of one bond in the sequence results in disruption of that sequence loss of the RNA fragment containing the original 5' end of the sequence. A suitable method for inhibiting RNase during extraction from cells is disclosed in our copending U.K. Patent Application No. 22417/78 (Serial No. 1565190). The method involves the use of 4M guanidinium thiocyanate and 1 M (3-mercaptoethanol during the cell disruption step. In addition, a low temperature and a pH near 5.0 are helpful in further reducing RNase degradation of the isolated RNA.
Prior to application of the method of the present invention. mRNA must be prepared essentially free of contaminating protein, DNA, polysaccharides and lipids. Standard methods are well known in the art for accomplishing such purification. RNA thus isolated contains non-messenger as well as messenger RNA. A convenient method for separating the mRNA of eucaryotes is chromatography on columns of oligo-dT cellulose, or other oligonucleotide-substituted column material such as poly U-Sepharose, taking advantage of the hydrogen bonding specificity conferred by the presence of polyadenylic acid on the 3' end of eucaryotic mRNA.
The initial step in the method of the present invention is the formation of DNA complementary to the isolated heterogeneous sequences of mRNA. The enzyme of choice for this reaction is reverse transcriptase, although in principle any enzyme capable of forming a faithful complementary DNA copy of the mRNA template could be u existence of two such sites. If the sites are identical, a single enzyme will be sufficient. The desired sequence will be cleaved at both sites, eliminating size heterogeneity as far as the desired cDNA sequence is concerned, and creating a population of molecules, termed fragments, containing the desired sequence and homogeneous in length. If the restriction sites are diffferent, two enzymes will be required in order to produce the desired homogeneous length fragments.
The choice of restriction enzyme(s) capable of producing an optimal length nucleotide sequence fragment coding for all or part of the desired protein must be made empirically. If the amino acid sequence of the desired protein is known, it is possible to compare the nucleotide sequence of uniform length nucleotide fragments produced by restriction endonuclease cleavage with the amino acid sequence for which it codes, using the known relationship of the genetic code common to all forms of life. A complete amino acid sequence for the desired protein is not necessary, however, since a reasonable accurate identification may be made on the basis of a partial sequence. Where the amino acid sequence of the desired protein is not known, the uniform length polynucleotides produced by restriction endonuclease cleavage may be used as probes capable of identifying the synthesis of the desired protein in an appropriate in vitro protein synthesizing system.
Alternatively, the mRNA may be purified by affinity chromatography. Other techniques which may be suggested to those skilled in the art will be appropriate for this purpose.
The number of restriction enzymes suitable for use depends upon whether singlestranded or double-stranded cDNA is used. The preferred enzymes are those capable of acting on single-straded DNA, which is the immediate reaction product of mRNA reverse transcription. The number of restriction enzymes now known to be capable of acting on single-stranded DNA is limited. The enzymes HaeIII, HhaI and Hin(f)I are presently known to be suitable. In addition, the enzyme MboII may act on single-stranded Where further study reveals that other restriction enzymes can act on single-stranded DNA. Where further study reveals that other restriction enzymes can act on single-stranded DNA, such other enzymes may appropriately be included in the list of preferred enzymes. Additional suitable enzymes include those specified for double-stranded cDNA. Such enzymes are not preferred since additional reactions are required in order to produce double-stranded cDNA, providiing increased opportunities for the loss of longer sequences and for other losses due to incomplete recovery. The use of double-stranded cDNA presents the additional technical disadvantage that subsequent sequence analysis is more complex and laborious. For these reasons, single-stranded cDNA is preferred, but the use of double-stranded DNA is feasible.
The cDNA prepared for restriction endonuclease treatment may be radioactively labeled so that it may be detected after subsequent separation steps. A preferred technique is to incorporate a radioactive label such as 32p in the alpha position of one of the four deoxynucleoside triphosphate precursors. Highest activity is obtained when the concentration of radioactive precursor is high relative to the concentration of the non-radioactive form. However, the total concentration of any deoxynucleoside triphosphate should be greater than 301lM, in order to maximize the length of cDNA obtained in the reverse transcriptase reaction. See Efstratiadis, A.. Maniatis. T.. Kafatos, F.C., Jeffrey, A., and Vournakis, J.N., Cell 4, 367 (1975). For the purpose of determining the nucleotide sequence of cDNA, the 5' ends may be conveniently labeled with '2p in a reaction catalyzed by the enzyme polynucleotide kinase. See Maxam. A.M. and Gilbert, W., Proc. Nat'l.
Acad. Sci. USA 74, 560 (1977).
Fragments which have been produced by the action of a restriction enzyme or combination of two restriction enzymes may be separated from each other and from heterodisperse sequences lacking recognition sites by any appropriate technique capable of separating polynucleotides on the basis of differences in length. Such methods include a variety of electrophoretic techniques and sedimentation techniques using an ultracentrifuge. Gel electrophoresis is preferred because it provides the best resolution on the basis of polynucleotide length. In addition, the method readily permits quantitative recovery of separated materials. Convenient gel electrophoresis methods have been described by Dingman, C.W., and Peacock, A.C., Biochemistly 7 659 (1968). and by Maniatis, T., Jeffrey, A. and van de Sande, H., Biochemistrv 14. 3787 (1975).
Prior to restriction endonuclease treatment, cDNA transcripts obtained from most sources will be found to be heterodisperse in length. By the action of a properly chosen restriction endonuclease, or pair of endonucleases. polynucleotide chains containing the desired sequence will be cleaved at the respective restriction sites to yield polynucleotide fragments of uniform length. Upon gel electrophoresis. these will be observed to form a distinct band. Depending on the presence or absence of restriction sites on other sequences, other discrete bands may be formed as well, which will most likely be of different length than that of the desired sequence. Therefore. as a consequence of restriction endonuclease action, the gel electrophoresis pattern will reveal the appearance of one or more discrete bands, while the remainder of the cDNA will continue to be heterodisperse. In the case where the desired cDNA sequence comprises the major polynucleotide species present, the electrophoresis pattern will reveal that most of the cDNA is present in the discrete band.
Although it is unlikely that two different sequences will be cleaved by restriction enzymes to yield fragments of essentially similar length, a method for determining the purity of the defined length fragments is desirable. Sequence analysis of the electrophoresis band may be used to detect impurities representing 10% or more of the material in the band. A method for detecting lower levels of impurities has been developed, as part of the present invention, founded upon the same general principles applied in the initial isolation method. The method requires that the desired nucleotide sequence fragment contain a recognition site for a restriction endonuclease not employed in the initial isolation. Treatment of polynucleotide material, eluted from a gel electrophoresis band, with a restriction endonuclease capable of acting internally upon the desired sequence will result in cleavage of the desired sequence into two sub-fragments, most probably of unequal length. These sub-fragments upon electrophoresis will form two discrete bands at positions corresponding to their respective lengths, the sum of which will equal the length of the polynucleotide prior to cleavage. Contaminants in the original band that are not susceptible to the restriction enzyme may be expected to migrate to the original position. Contaminants containing one or more recognition sites for the enzyme may be expected to yield two or more sub-fragments. Since the distribution of recognition sites is believed to be essentially random, the probability that a contaminant will also yield sub-fragments of the same size as those of the fragment of desired sequence is extremely low. The amount of material present in any band of radioactively labeled polynucleotide can be determined by quantitative measurement of the amount of radioactivity present in each band, or by any other appropriate method. A quantitative measure of the purity of the fragments of desired sequence can be obtained by comparing the relative amounts of material present in those bands representing sub-fragments of the desired sequence with the total amount of material.
Following the foregoing separation, the desired sequence may be reconstituted. The enzyme DNA ligase, which catalyzes the end-to-end joining of DNA fragments, may be employed for this purpose. Thee gel electrophoresis bands representing the sub-fragments of the desired sequence may be separately eluted and combined in the presence of DNA ligase, under the appropriate conditions. See Sgaramella, V., Van de Sande, J.H., and Khorana, H.G., Proc. Natl. Acad. Sci. USA 67, 1468 (1970). where the sequences to be joined are not blunt-ended, the ligase contained from E. Coli may be used, Modrich, P., and Lehman, I.R., J. Biol. Chem. 245, 3626 (1970).
The efficiency of reconstituting the original sequence from subfragments produced by restriction endonuclease treatment will be greatly enhanced by the use of a method for preventing reconstitution in improper sequence. This unwanted result is prevented by treatment of the homogenous length cDNA fragment of desired sequence with an agent capable of removing the 5'-terminal phosphate groups on the cDNA prior to cleavage of the homogenous cDNA with a restriction endonuclease. The enzyme, alkaline phosphate. is preferred. The 5'-terminal phosphate groups are a structural prerequisite for the subsequent joining action of DNA ligase used to reconstitute the cleaved sub-fragments.
Therefore, ends which lack a 5'-terminal phosphate cannot be covalently joined. The DNA sub-fragments can only be joined at the ends containing a 5'-phosphate generated by the restriction endonuclease cleavage performed on the isolated DNA fragments. The method is essentially that described in detail in U.K. Patent Application 22417/78 (Serial No.
1565190).
The majority of cDNA transcripts, under the conditions employed, are derived from the mRNA region containing the 5'-end of the mRNA template by specifically priming on the same template with a fragment obtained by restriction endonuclease cleavage. In this way, the above-described method may be used to obtain not only fragments of specific nucleotide sequence related to a desired protein. but also the entire nucleotide sequence related to a desired protein. but also the entire nucleotide sequence coding for the protein of interest.
The purification method is of a special significance in the cloning of human genes, which, under U.S. Federal Regulations. can only be put into recombinant DNA and then into bacteria after the genes have been verv carefully purified. or if the experiments are carried out in special high-risk (P4) facilities. See Federal Register. Vol. 41, No. 131, July 7, 1967, PP 27902-27943. The present method has enabled the production of sufficiently pure human genes, comprising the majority of the structure of HCS and HGH. Human genetic material, isolated and purified as described above. may be incorporated into recombinant plasmids or other transfer vectors. Double stranded chemically synthesized oligonucleotide linkers, containing the recognition sequence for a restriction endonuclease, may be attached to the ends of the isolated cDNA, to facilitate subsequent enzymatic removal of the human gene portion from the transfer vector DNA. See Scheller, R.H., et al., Science 196, 177 (1977).
The transfer vector DNA is converted from a continuous loop to a linear form by treatment with an appropriate restriction endonuclease. The ends thereby formed are treated with alkaline phosphatase to remove 5'-phosphate end groups so that the transfer vector DNA may not reform a continuous loop in a DNA ligase reaction without first incorporating a segment of the human DNA. The cDNA, with attached linker oligonucleotides, and the treated transfer vector DNA are mixed together with a DNA ligase enzyme, to join the cDNA to the vector DNA, forming a continuous loop of recombinant vector DNA having the cDNA incorported therein. Where a plasmid transfer vector is used, usually the closed loop will be the only form able to transform a bacterium. Transformation, as is understood in the art and used herein, is the term used to denote the process whereby microorganism incorporates extracellular DNA into its own genetic constitution. Plasmid DNA in the form of a closed loop may be so incorporated under appropriate environmental conditions. The incorporated closed loop plasmid undergoes replication in the transformed cell, and the replicated copies are distributed to progeny cells when cell division occurs. As a result, a new cell line is established, containing the plasmid and carrying the genetic determinants thereof. Transformation by a plasmid in this manner, where the plasmid genes are maintained in the cell line by plasmid replication, occurs at high frequency when the transforming plasmid DNA is in closed loop form, and does not or rarely occurs if linear plasmid DNA is used. Once a recombinant transfer vector has been made, transformation of a suitable microorganism is a straightforward process, and novel microorganisms strains containing the human gene may readily be isolated, using appropriate selection techniques, as understood in the art.
Using the above-described methods purification and analysis, a desired nucleotide sequence containing most of the structural gene for HCS has been isolated and shown to be greater than 99% pure. The structural gene for HGH has been isolated to a comparable degree of purity. Novel plasmids containing the isolated HCS or HGH sequences have been synthesized. Novel microorganisms containing the isolated HCS or HGH sequences as part of their genetic material have been produced.
The accompanying figures and drawing are provided to demonstrate the results obtained in the specific examples illustrating the invention.
Figure 1 is an autoradiogram of a series of gel electrophoresis runs of 32P-labeled cdna, as described in detail in exaple 1.
Figure 2 is a schematic representation of the nucleotide sequence coding for HCS, showing the relative locations of various restriction sites, as described in detail in example 1.
Figure 3 is an autoradiogram of gel electrophoresis results using 32P-labeled cDNA, as described in detail in example 2.
Figure 4 and 5 are autoradiograms of gel electrophoresis results using 32P-labeled cDNA, as described in detail in example 3.
Unless otherwise indicated or clear from the context, percentages in the examples are by volume.
Example 1 The general procedure for isolating a specific cDNA sequence has been demonstrated by isolating a sequence comprising a portion of the coding region for HCS. extracted from placental tissue. mRNA Extraction from placenta Human term placentas obtained from Caesarean section were quickfrozen in liquid nitrogen and stored at -60 C. For extraction of total RNA. 40 g of the frozen placental tissue was broken into small pieces and dissolved with the aid of a blender in 140 ml of freshly prepared 7 M guanidinium-HCl (Cox. R.A., Methods in Enzymology 12, 120 (1968), 20 mM Tris-HC1 pH 7.5, 1 mM EDTA, 1% (w/v) Sarcosyl (Trademark, Ciba-Geigy Corp., Greensboro, N.C.) at 0 C. After adding 0.5g CsCl to each ml, the dark brown solution was heated at 65"C. for 5 min., quick-cooled in ice, layered on top of a 5 ml cushion of 5.7 M CsCl, 10 mM Tris-HCl. pH 7.5, 1 mM EDTA in 1 in. x 3-1/2 in. nitrocellulose tubes and centrifuged in an SW27 rotor (Beckman Instruments Corp., Fullerton, California) at 27.000 rpm for 16 hr. at 15"C, (Glisin. V.. Crkvenjakov, R., and Ryus, C., Biochem. 13 2633 (1974). After centrifugation. the tube contents were decanted, the tubes were drained, and the bottom 1/2 cm containing the clear RNA pellet was cut off with razor blade. Pellets were transferred into a sterile erlenmeyer flask and dissolved in 20 ml 10 mM Tris-HC1, pH 7.5. 1 mM EDTA. 5% (w/v) Sarcosyl and 5% phenol. The solution was then made 0.1 M in NaCI and vigorously shaken with 40 ml of a 50% phenol-50% chloroform mixture. RNA was precipitated from the aqueous phase with ethanol in the presence of 0.2 M Na-acetate pH 5.5. RNA pellets were washed with 95% ethanol, dried, and dissolved in sterile water. Usually 40g of placental tissue yielded about 30mg of RNA from which approximately 300,ug of polyadenylated RNA was obtained after twice chromatographing on oligo-dT cellulose. See Aviv, and Leder, supra.
Synthesis of cDNA.
Analytical reactions were performed in 5 l containing 50 mM Tris-HCl, pH 8.3; 0.1 mM EDTA; 7 mM MgCl2; 20 mM KCl; 10mM -mercaptoethanol; 40 M dCTP (50,000 cpm 32P per pmole); 500 m each dCTP, dATP, and dTTP; 100 g/ml of polyadenylated RNA; 20 ug/ml oligo-dT 12-18 obtained from Collaborative Research, Waltham, Mass.; and 100 units/ml reverse transcriptase from avian myeloblastosis virus. The enzyme is available from Dr. D.J. Beard, Life Science Incorporated, St. Petersburg, Florida, who produces the enzyme under contract with the National Institutes of Health, by the procedure of Kacian, D.L. and Spiegelman, S., in Methods in Enzymology 29, L. Grossman, and K. Moldave, eds., Academic Press, N.Y. (1974), p. 150. Reactions were started by the addition of enzyme at 0 C and synthesis was for 6 min at 420C. Under these conditions approximately 106 cpm 32P were incorporated into TCA-precipitable material and each ug of RNA yielded about 50 ng of cDNA. To obtain enough cDNA for sequence analysis , the reaction volumes were increased to 100 Ill and the dCTP concentration was raised to 250 MM (specific activity of 500 cpm 32P per pmole). Under these conditions about 200,00 cpm of Z3P-labeled dCMP were incorporated into cDNA.
Restriction endonuclease treatment For restriction endonuclease digestions the analytical reactions were stopped by the addition of 20 l of ice-cold water, boiled for 2 min. quick-cooled on ice, and made 7 mM in MgCl2. Aliquots (5 l, about 2 x 105cpm) were digested using an excess amount of restriction endonuclease(s) HaeIII or HhaI or both, for 1 hr at 37 C. HaeIII was prepared according to the method of Middleton, J.H., Edgell, M.H., and Hutchinson, C.A. III, J.
Virol. 10 42 (1972). HhaI and HpaII were obtained from New England Bio-Labs, Beverly, Mass. HaeIII is also available from the latter source. The amount of enzyme used was empirically determined to be in excess of the amount needed to completely digest an equivalent amount of restriction-sensitive DNA under identical reaction conditions.
Reactions were stopped with 5 l of 20 mM EDTA, 20% (w/v) sucrose, 0.05% (w/v) bromophenol blue, heated to 100 C for 1 min and then analyzed by polyacrylamide gel electrophoresis. The products were separated on a composite 4.5% - 10% (w/v) polyacrylamide slab gel for 2.5 hr at 150V in Tris-Borate-1-EDTA buffer (Dingman, C.W. and Peacock, A.C., supra) was visualized by autoradiography of the dry gel.
Figure 1 shows the results of gel electrophoresis and autoradiography of 32P-labeled cDNA, prepared as described above. The smaples were initially spotted at the origin and migrated electrophoretically through 4.5% (w/v) acrylamide and then through 10% (w/v) acrylamide. A bar is placed on the left-hand side of the figure to indicate the position of the boundary between the two gel regions. Lane A represents the electrophoretic migration of the total cDNA transcript. Lane B shows the migration of HhaI treated cDNA. Lane C shows the migration of HaeIII treated cDNA. Lane D shows the electrophoretic migration of total cDNA treated with both HhaI and HaeIII. Lane E demonstrates the electrophoretic migration of the material isolated from the prominent band in Lane C. Lane F shows electrophoretic migration of isolated material from the prominent band of Lane C after treatment with HhaI. Lane G shows the electrophoretic migration of HneIII cleaved 5'-32P end-labeled single-stranded phage M13 DNA used as a size standard. according to Horiuchi, K., and Zinder, N.D., Proc. Nat. Acad. Sci. USA 72. 2555 (1975). The approximate lengths in nucleotides of these DNA fragments are indicated by the numbers on the right.
The result in Lane A demonstrates that the cDNA transcript from term placental mRNA is heterodisperse. Treatment with HhaI, Lane B or HaeII, Lane C results in the accumulation of polynucleotides of discrete length. The production of such discrete bands indicates the presence, in a heterogeneous population of cDNA transcripts. of at least one sequence present in multiple copies and having two restriction sites for HhaI and HaeIII. respectively. Cleavage with HhaI produces a fragment of about 470 nucleotides. and HneIII digestion produces a fragment of approximately 550 nucleotides in length. Digestion by both enzymes yields three fragments designated A. 90 nucleotides long. B. 460 nucleotides long. and C. approximately 10 nucleotides long. Due to its small size. fragment C migratcd off the gel under the conditions used in Figure 1. The band of material appearing at the interface between 10% w/v and 4.5% w/v gel represents heterogeneous material which was too large to enter the 10% gel and therefore accummulated at the interface. As judged from the simple band pattern of Lane D, fragments A and B seem to originate from the same cDNA molecule. This conclusion was confirmed by elution of the larger HaeIII fragment from the gel, migrating as shown in Lane E, followed by redigestion with HhaI. Such treatment produced two fragments comigrating with the bands released by combined HaeIII and HhaI digestion of the total cDNA, as seen by comparing Lanes D and F. In the total cDNA digest, Lane D, the autoradiographic density, which is a measure of the total radioactivity present in the band, is greater for fragment A than fragment B, although the reverse may be expected on the basis of size differences. This observation suggests that fragment A is transcribed from a region closer to the 3'-end of the mRNA than is fragment B.
Figure 2 is a schematic representation of the cDNA molecule showing the relative locations of the HaeIII and HhaI restriction sites. DNA FRAGMENTS A and B, derived from the same cDNA molecule, were ordered on the basis of their relative intensity on the autoradiogram shown in Figure 1, Lane D. The existence of DNA fragment C was inferred from the difference in the electrophoretic mobility of the band appearing in Lane B and Lane D of Figure 1. The size of DNA fragment A is known exactly from a determination of its nucleotide sequence by the method of Maxam, A. and Gilbert, W., supra. The size of DNA fragment B was determined by comparison with the M13 DNA size markers shown in Figure 1, Lane G.
The nucleotide sequences of DNA fragment A and a portion of the 5'-end of fragment B were determined by the procedure of Maxam, A. and Gilbert, W., supra. Since the amino acid sequence of HCS is known, the nucleotide sequence of the two fragments could be compared with the amino acid sequence, using the known relationships of the genetic code.
On the basis of these relationships it was demonstrated that the specific sequences did in fact code for portions of the HCS molecule, and further confirmed the ordering of these fragments shown in Figure 2.
Example 2 The ability of the process of the present invention to purify a desired nucleotide sequence that is a minority proportion of the total population of nucleotide sequences is demonstrated in the following reconstruction experiment. Defined RNA mixtures containing purified rabbit globin RNA and human polyadenylated placental RNA were used as template for reverse transcriptase in the presence of alpha-32P dCTP, final specific activity, 105 cpm per pmole. The cDNA products were cleaved with endonuclease HaeIII and the cleavage products were separated on 4.5% - 10% w/v composite polyacrylamide slab gel. The cDNA fragments were visualized by autoradiography of the dried gel.
Figure 3 shows the results of the experiments. The gels were run essentially as described in example 1. Size markers prepared by endonuclease HaeIII cleavage of phage M13 DNA and 5,~32p 3 P end labelling of the fragments thereby produced. were run in lanes A and H. The approximate lengths in nucleotides of these DNA fragments are indicated by the numbers on the left. Lanes B-G show the electrophoresis patterns produced by initiating the foregoing sequence of reactions with mixtures of globin RNA and placental RNA in varying proportions, as shown in the following table.
Globin RNA Placental RNA Lane nanograms nanograms B 300 0 C 60 240 D 30 270 E 15 285 F 7.5 292.5 G 0 300 It can be seen that a 320 nucleotide long HaelII fragment is derived from globin cDNA.
The globin cDNA transcript can still be detected if globin RNA represents as little as 2-5% by weight of the total RNA. If an RNA species is present as isolated in too low a copy number to be amenable to this mode of analysis. it can be first partially purified by any one of the known RNA purification schemes until it represents 2-5% by weight of the remaining species mixture.
Example 3 The purification of a nucleotide sequence fragment approximately 550 base-pairs in length comprising a portion of the coding region for HCS is described, together with a method of measuring the purity of the isolated sequence. The purified fragment is demonstrated to be greater than 99% pure.
Purification of HCS cDNA Polyadenylated placental RNA isolated as described in example 1 was enriched for HCS mRNA by sedimentation in a 5% to 20% (w/v) sucrose gradient at 4"C in the SW 27 rotor of a Beckman Instruments ultracentrifuge at 25,000 rpm for 16 hours. The 11S-14S region of the gradient was pooled and 100 g of this RNA used for the synthesis of double-stranded cDNA as described by Ullrich, A., et awl., supra. Synthesis of the second strand was stopped by extraction of the reaction mixture with one volume of ethanol at -700C. Digestion of the cDNA with HaeIII endonuclease was carried out in 50 Ill of 6 mM Tris-HcL, pH 7.5, 6 mM MgC12, 6mM -mercaptoethanol with 2 units of HaeIII enzyme at 37"C for two hours, following which 0.1 units of bacterial alkaline phosphatase (type BAPF, Worthington Biochemical Corp., Freehold, N.J., units as defined by manufacturer) were added and digestion continued at 60"C for ten minutes. Following extraction with one volume of phenol-chloroform, the DNA was precipitated with two volumes of ethanoll -700C, dissolved in 20 Fl of 10 mM Tris-HC1, pH 8, 1 mM EDTA, and subjected to electrophoresis on a 6% (w/v) polyacrylamide gel. Figure 4((F) shows the electrophoresis pattern of the foregoing reaction mixture, which reveals a prominent band corresponding to a nucleotide sequence approximately 550 base-pairs in length. The 550 base-pairs fragment was excised from the gel, and eluted electrophoretically, with the result shown in Figure 4(E).
The remaining material corresponding to the 550 base-pair fragment shown in Figure 4(E) was digested with 4 units of HhaI endonuclease in 50 1 of the same buffer used for digestion with HaeIII endonuclease, at 37"C for 2 hours. Following phenol-chloroform extraction and ethanol precipitation, the digestion products were separated by electrophoresis on a 6% (w/v) polyacrylamide gel. The result is shown in Figure 4(D).
The two fragments were eluted electrophoretically, combined and rejoined by incubation in 20 Ill of 66 mM Tris-HCl, pH 7.6. 6 mM MgCl. 15 mM dithiothreitol. 1 mM ATP containing 20 Fg/ml of T4 DNA ligase at 15"C for two hours. The reaction mixture was then diluted to 200 1 with 0.1 M NaCI, extracted with 1 volume of phenol-chloroform and the DNA precipitated with 2 volumes of ethanol. After resuspension in 20 1 of 10 mM Tris-HC1. pH 8, 1 mM EDTA, the ligation products were separated by electrophoresis in the 6% (w/v) polyacrylamide gel. The result is shown in Figure 1(C). It can be seen from the electrophoresis pattern of Figure 4(C) that the 550 nucleotide fragment was reconstituted by the ligation treatment. The prior treatment with alkaline phosphatase insured that the two HhaI fragments were rejoined in the original sequence relative to each other to reconstitute the 550 nucleotide segment. The additional bands seen in Figure 4(C) were the result of dimer formation between the HhaI fragments. since dimer form that the purified human HCS cDNA reconstituted HaeIII fragment was greater than 99% homogeneous.
Example 4 Synthesis of a plasmid containing a nucleotide sequence of 550 base-pairs comprising the majority of the coding region for HCS is described.
A 550 nucleotide fragment of HCS cDNA of greater than 99% purity was prepared as described in example 3. Terminal 5' phosphate end groups were restored in a reaction mixture containing 50 mM Tris-HCl, pH 8.5, 10 mM MgCl2, 0.1 mM spermidine, 5 mM frmercaptoethanol, 5% (w/v) glycerol, 333 pmole ATP, 5 units of T4 polynucleotide ki ase incubated in a final volume of 40 Ill at 37"C for two hours. DNA was separated from the reaction mixture by phenol extraction followed by ethanol precipitation. Synthetic decanucleotide linkers having restriction site specificity for EcoRI and have the sequence, 5'-CCciAATTCGci-3', prepared according to Scheller, et al., supra, were then ligated to the HCS DNA in a molar ratio of approximately 50:1 in 50 cull of 66 mM Tris-HCl, pH 7.6, 9 mM MgCl2, 15 mM dithiothreitol, 1 mM ATP and 20 ,ug/ml T4 DNA ligase. Linkers are commercially available from Collaborative Research, Waltham, Massachusetts. After incubation at 40C for 18 hours, the reaction was stopped by extraction with phenolchloroform. The ligation products were precipitated with ethanol, redissolved in 50 Ill 100 mM NaCl, 50 mM Tris-HCl, pH 7.6, 7 mM MgCl2, and digested with 50 units EcoRI endonuclease at 370C for 2 hours. Digestion with the endonuclease resulted in cleavage at the EcoRI site of the decamers giving rise to HCS cDNA with EcoRI cohesive ends as well as cleaved unreacted decanucleotides and self-ligated decanucleotides. As the cleaved decamers also contained EcoRI termini and would compete with the HCS cDNA for recombination with the similarly cleaved plasmid, the HCS cDNA was isolated by gel electrophoresis before reaction with the transfer vector. The use of the foregoing decanucleotide linker has the advantage that the HCS cDNA fragment may be reisolated from the plasmid in a form identical to that of the original fragment.
The transfer vector employed was the bacterial plasmid pMB-9, a 3.5 x 106 molecular weight molecule containing a single EcoRI site, prepared as described by Rodriguez, R.L., Bolivar, F., Goodman, H.M., Boyer, H.W. and Betlachz M. in ICN- UCLA Symposium On Molecular and Genetic Biology, D. P. Wierlich, W. J. Rutter, and C.F. Fox, E. (Academic Press, New York, 1976), pp 471-477. The plasmids pMB-9 and pBR-322 (Example 5) are commercially available from Bethesda Research Labs. Rockville, Maryland. Infection of Escherichia Coli with pMB-9 confers resistance to tetracycline. Incorporation of DNA into the EcoRI site of pMB-9 does not affect the tetracycline resistance or any other known property of the plasmid. Consequently, there are no phenotypic differences between recombinant and normal plasmids. Therefore the EcoRI cut pMB-9 was first treated with alkaline phosphatase, according to a method described in detail in U. K. Patent Application No. 22417/78 (Serial No. 1565190) See also, Ullrich. et al.. supra. Alkaline phosphatase treatment removes the 5' phosphates from the EcoRI generated ends of the plasmid and prevents self-ligation of the plasmid DNA, insuring that circle formation and hence transformation is dependent on the insertion of a DNA fragment containing 5' phosphorylated termini. The alkaline phosphatase treatment was carried out in a reaction mixture at the level of 1.0 enzyme units/mg of plasmid DNA in 25 mM Tris- HCI, pH 8, for 30 minutes at 65"C, followed by phenol extraction to remove the phosphatase, and ethanol precipitation of the DNA. Ligation of HCS cDNA to pMB-9 treated as described was carried out in 50 ul reactions containing 60 mM Tris-HCl, pH 8, 10 mM (3-mercaptoethanol, 8 mM MgCl2, between 10 and 50 ng of the purified HCS cDNA and approximately 500 ng of EcoRI-cleaved 5' dephosphorylated plasmid DNA. Reactions were begun by addition of T4 DNA ligase to 5 llg/ml, allowed to proceed at 15"C for 1 hour and mixture diluted to 0.25 ml with 120 mM NaCl, 1 mM EDTA. The diluted reaction mixture was used directly for transformation of E. Coli X-1776.
E. Coli X-1776 is a host strain especially developed for recombinant DNA work, certified by NIH as an EK-2 host under the FEDERAL GUIDELINES. The strain is available from Dr. Roy Curtiss III, University of Alabama, Department of Microbiology, Birmingham, Alabama. The bacteria were grown in 150 ml of nutrient broth supplemented with 100 Iml diaminopimelic (DAP) and 40 llg/ml thymine to a cell density of approximately 2 x 1 cells/ml. The cells were harvested by centrifugation and washed in 60 ml of 10 mm NaCl, recentrifuged and resuspended in 60 ml of transformation buffer containing 10 mM Tris-HCl, pH 8, 140 mM NaCl, 75 mM CaCl. The cell suspension was kept on ice for 15 minutes, the cells collected by centrifugation and resuspended in 1.5 ml of the same transformation buffer. The cell suspension, 0.5 ml. was added to 0.25 ml of diluted ligaticn reaction mixture and incubated on ice for 15 minutes. then transferred to 25"C for 4 minutes, then on ice again for 30 minutes. The cell suspension. 0.2 ml, was plated directly onto nutrient agar plates supplemented with 100 Ill/ml DAP and 40 Fl/ml thymine and 20 ,ug/ml tetracycline, Four transformants were obtained, all of which contained a 550 base-pair insertion which was released from the plasmid DNA by either EcoRI or HaeIII endonuclease digestion.
A transformant clone designated pHCS-1 was selected for sequence analysis. E. Coli X-1776--pHCS-1 was grown in suitable nutrient medium, plasmid DNA was isolated therefrom the cleaved with EcoRI endonuclease. The 550 base-pair insertion was isolated from linear pMB-9 by electrophoresis in a 6% polyacrylamid gel and subjected to a DNA sequence analysis using the procedure of Maxam and Gilbert, supra. Sub-fragments of the HCS DNA were prepared by incubation with HpaII restriction endonuclease and the 5' termini were labeled using y 32P-ATP and polynucleotide kinase. Following the sequence analysis procedure of Maxam and Gilbert, the nucleotide sequence of cloned HCS-DNA was determined. By comparison with the known amino acid sequence of HCS, the 557 nucleotide sequence represented that portion of the coding region of HCS mRNA from amino acids 24 to 191, plus 50 nucleotides of the 3'-untranslated region. See Niall. H. D., Hogan, M. L., Sauer, R., Rosenblum, I. Y. and Greenwood, F. C., Proc. Nat. Acad. Sci.
USA 68, 866 (1971). The primary structure of HCS mRNA as determined from the DNA sequence of cloned fragment pHCS-1 is shown in Table 3, together with the amino acid sequence predicted therefrom on the basis of the known genetic code. The amino acid sequence determined from the nucleotide sequence is identical with the previously published amino acid sequence determined by chemical means. This demonstrates that the initially isolated HCS mRNA has been copied in vitro with high fidelity and that the cloned HCS DNA fragment was replicated with high fidelity in the transformed bacteria.
TABLE 3 Nucleotide sequence of one strand of HCS DNA from cloned pHCS-1. The numbers refer to amino acid sequence beginning at the amino terminus. The DNA sequence shown corresponds to the mRNA sequence for HCS, except that U replaces T in the mRNA. The amino acid sequence from positions 1 through 23 is also shown.
1 10 20 Val-Gln-Thr-Val-Pro-Leu-Ser-Arg-Leu-Phe-Asp-His-Ala-Met-Leu-Glu-Ala-His-Arg-Ala-His-Gln-Leu24 40 Ala Ile Asp Thr Tyr Gln Glu Phe Glu Glu Thr Tyr Ile Pro Lys Asp Gln Lys Tyr Ser Phe Leu His Asp Ser Gin 5'- --G GCC ATT GAC ACC TAC CAG GAG TTT GAA GAA ACC TAT ATC CCA AAG GAC CAG AAG TAT TCG TTC CTG CAT GAC TCC CAG 60 Thr Ser Phe Cys Phe Ser Asp Ser Ile Pro Thr Pro Ser Asn Met Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu ACC TCC TTC TGC TTC TCA GAC TCT ATT CCG ACA CCC TCC AAC ATG GAG GAA ACG CAA CAG AAA TCC AAT CTA GAG CTG CTC 80 100 Arg Ile Ser Leu Leu Leu Ile Glu Ser Trp Leu Glu Pro Val Arg Phe Leu Arg Ser Met Phe Ala Asn Asn Leu Val Tyr CGC ATC TCC CTG CTG CTC ATC GAG TCG TGG CTG GAG CCC GTG CGG TTC CTC AGG AGT ATG TTC GCC AAC AAC CTG GTG TAT 120 Asp Thr Ser Asp Ser Asp Asp Tyr His Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu Glu Asp GAC ACC TCG GAC AGC GAT GAC TAT CAC CTC CTA AAG GAC CTA GAG GAA GGC ATC CAA ACG CTG ATG GGG AGG CTG GAA GAC 140 Gly Ser Arg Arg Thr Gly Gln Ile Leu Lys Gln Thr Tyr Ser Lys Phe Asp Thr Asn Ser His Asn His Asp Ala Leu Leu GGC AGC CGC CGG ACT GGG CAG ATC CUC AAG CAG ACC TAC AGC AAG TTT GAC ACA AAC TCG CAC AAC CAT GAC GCA CTG CTC 160 180 Lys Asn Tyr Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe Leu Arg Met Val Gln Cys Arg Ser AAG AAC TAC GGG CTG CUC TAC TGC TTC AGG AAG GAC ATG GAC AAG GTC GAG ACA TTC CTG CGC ATG GTG CAG TGC CGC TCT 191 Val Glu Gly Ser Cys Gly Phe GTG GAG GGC AGC TGT GGC TTC TAG GTGCCCGAGTAGCATCCTGTGACCCCTCCCCAGTGCCTCTCCTGGCC-----3' Example 5 The purification of DNA whose nucleotide sequence comprises most of the coding region for HGH is described, together with the synthesis of a plasmid transfer vector containing the purified DNA and the construction of a microorganism strain having the DNA as part of its genetic makeup. HGH was purified essentially as described for HCS in Example 3, except as noted below.
Five benign human pituitary tumors, quick-frozen in liquid nitrogen after surgical removal, weighing 0.4 g to 1.5 g each were thawed and homgenized in 4 M guanidinium thiocyanate containing 1 M mercaptoethanol buffered to pH 5.0 at 40C. The homogenate was layered over 1.2 ml 5.7 M CsCl containing 100 mM EDTA and centrifuged for 18 hours at 37,000 rpm in the SW 50.1 rotor of a Beckman (Trade Mark) ultra-centrifuge at 15"C (Beckman Instrument Company, Fullerton, California). RNA travelled to the bottom of the tube. Further purification, using an oligo-dT column and sucrose gradient sedimentation was as described previously in Examples 1 and 3. About 10% by weight of the RNA thus isolated coded for growth hormone, as judged by incorporation of a radioactive amino acid precursor into anti-growth hormone precipitable material in a cell-free translation system derived from wheat germ. See Roberts, B.E. and Patterson, B. M., Proc. Nat.
Acad. Sci. USA 70, 2330 (1973). Single-stranded cDNA and double-stranded cDNA were synthesized as described in Example 3. HGH cDNA was then treated with restriction endonuclease HaeIII and alkaline phosphatase as described in Example 3, then fractionated by gel electrophoresis. A distinct band in a position corresponding to about 550 nucleotides in length was observed. and isolated for further purification.
For further purification, the previously described technique of dividing the DNA into sub-fragments and separately purifying the recombining the sub-fragments was carried out as previously described, except that for HGH, the restriction endonuclease PvuII was used to produce two sub-fragments of approximately 490 and approximately 60 nucleotides length, respectively. All restriction enzymes used herein are commercially available from New England Biolabs, Massachusetts. The religated product. about 550 base-pairs in length, was greater than 99% as judged by sub-fractionation in four separate restriction endonuclease systems.
Synthesis of a recombinant transfer vector contaning HGH DNA was carried out essentially as described in Example 4 except that the decanucleotide linkers and plasmid employed were different. A decanucleotide linker having Hind III specificity was employed, sequence 5'-CCAAGCTTGG-3'. Treatment with HsuI, yielded HGH cDNA with cohesive ends. HsuI and Hind III have the same site specificity and may be used interchangeably. The plasmid pBR-322 was used as the transfer vector. This plasmid confers host resistance to the antibiotics ampicillin and tetracycline. DNA insertions into the Hind III site have been found to reduce or abolish tetracycline resistance.
Recombinants were therefore selected by growth on nutrient plates containing ampicillin, and by their inability to grow on 20 Fg/ml of tetracycline. HGH-cDNA was recombined with HsuI-cleaved alkaline phosphatase-treated pBR-322, under conditions essentially as described in Example 4.
Products of the ligase reaction were used to transform E. Coli X-1776 under conditions as described in Example 4. Seven colonies were isolated based upon their ability to grow in the presence of ampicillin and their inability to grow in the presence of tetracycline. Five of the seven colonies carried the recombinant plasmid containing the approximately 550 base-pair portion of HGH DNA. One of the bacterial strains, pHGH-1, carrying HGH DNA as part of its genetic makeup, was grown in quantity to provide a source of plasmid DNA from which the HGH DNA could be reisolated by treatment with Hizld III or HsuI. This isolated HGH DNA, having undergone many replications. was subjected to sequence analysis as described in Example 4. The results are shown in Table 4.
TABLE 4 Nucleotide sequence of one strand of HGH-DNA of cloned pHGH-1. The numbers refer to the amino acid sequence of HGH beginning at the amino terminus. The DNA sequence shown corresponds to the mRNA sequence for HGH, except that U replaces T in the mRNA.
24 34 40 43 Ala Phe Asp Thr Tyr Gln Glu Phe Glu Glu Ala Tyr Ile Pro Lys Glu Gln Lys Tyr Ser Phe Leu Gln Asn Pro Gin 5'- --G GCC TTT GAC ACC TAC CAG GAG TTT GAA GAA GCC TAT ATC CCA AAG GAA CAG AAG TAT TCA TTC CTG CAG AAC CCC CAG 60 Thr Ser Leu Cys Phe Ser Glu Ser Ile Pro Thr Pro Ser Asn Asg Glu Glu Thr Gln Gln Lys Ser Asn Leu Glu Leu Leu ACC TCC CTC TGT TTC TCA GAG TCT ATT CCG ACA CCC TCC AAC AGG GAG GAA ACA CAA CAG AAA TCC AAC CTA GAG CTG CTC 80 100 Arg Ile Ser Leu Leu Leu Ile Gln Ser Trp Leu Glu Pro Val Gln Phe Leu Arg Ser Val Phe Ala Asn Asn Leu Val Tyr CGC ATC TCC CTG CTG CTC ATC CAG TCG TGG CTG GAG CCC GTG CAG TTC CTC AGG AGT GTC TTC GCC AAC AAC CTG GTG TAC 120 Gly Ala Ser Asp Ser Asn Val Tyr Asp Leu Leu Lys Asp Leu Glu Glu Gly Ile Gln Thr Leu Met Gly Arg Leu Glu Asp GGC GCC TCT GAC AGC AAC GTC TAT GAC CTC CTA AAG GAC CTA GAG GAA GGC ATC CAA ACG CTG ATG GGG AGG CTG GAA GAC 140 Gly Ser Pro Arg Thr Gly Gln Ile Phe Lys Gln Thr Tyr Ser Lys Phe Asp Thr Asn Ser His Asn His Asp Ala Leu Leu GGC AGC CCC CGG ACT GGG CAG ATC TTC AAG CAG ACC TAC AGC AAG TTC GAC ACA AAC TCA CAC AAC CAT GAC GCA CTA CTC 160 180 Lys Asn Tyr Gly Leu Leu Tyr Cys Phe Arg Lys Asp Met Asp Lys Val Glu Thr Phe Leu Arg Ile Val Gln Cys Arg Ser AAG AAC TAC GGG CTG CTC TAC TGC TTC AGG AAG GAC ATG GAC AAG GTC GAG ACA TTC CTG CGC ATC GTG CAG TGC CGC TCT 191 Val Glu Gly Ser Cys Gly Phe GTG GAG GGC AGC TGT GGC TTC TAG CTGCCCGGGTGGCATCCCTGTGACCCCTCCCCAGTGCCTCTCCTGGCC-----3' The method of the present invention provides for the first time a method of general applicability for purifying desired specific nucleotide sequences. These sequences may be correlated with the production of a specific protein of commercial or medical significance.
The disclosed method results in the purification of nucleotide sequences which may be fragments of a larger sequence coding for the desired protein. The present methodmay be used in combination with known ancillary procedures to produce the entire nucleotide sequence coding or a specific protein.
In addition, a method has been disclosed whereby a nucleotide sequence of specific length, however derived, may be highly purified. A method for measuring the degree of purity of such fragments is also disclosed. By these means, a nucleotide sequence coding for a portion of human HCS has been isolated, purified and shown to be at least 99% (by weight) pure.
Transfer vectors containing most of the nucleotide sequence coding for HCS and most of the sequence coding for HGH have been synthesized. Novel microorganism strains containing the foregoing genes and portions of genes have been produced. The foregoing nucleotide sequences have been reisolated after many cycles of replication in the host microorganism and found to contain essentially the identical nucleotide sequence to that existing in the source organism.
On the basis of the genetic code, there exists a finite set of nucleotide sequences which can genetically code for a given amino acid sequence. All such equivalent nucleotide sequences are operable variants of the disclosed sequences, since all give rise to the same protein hormone, having the same amino acid sequence, during the course of in vivo transcription and translation. Consequently, all such variants are included in the scope of the present invention.

Claims (19)

WHAT WE CLAIM IS:
1. A method for purifying a fragment of a specific deoxyribonucleotide sequences for recombination with a DNA transfer vextor, and transfer to- a microorganism. from a population of polyribonucleotides heterogeneous in length and sequence, comprising: (a) providing a population of cDNA transcripts of the polyribonuucleotides. at least a portion of said CDNA transcripts having at least two restriction sites, (b) subjecting the cDNA transcripts to the action of a restriction endonuclease preparation capable of catalyzing the hydrolysis of the cDNA transcripts at each of the two restriction sites in order to produce a fragment of a specific deoxyribonucleotide sequence which is homogeneous in length, (c) fractionating the fragments produced bv restriction endonuclease action according to their length, thereby separating the homogeneous length fragments of the specific deoxyribonucleotide sequence from cDNA transcripts of different length, whereby a fragment of the specific deoxyribonucleotide sequence is purified for recombination with a DNA transfer vector and transfer to a microorganism.
2. A method of further purifying linear DNA of specific nucleotide sequence containing a restriction site, a population of essentially homogeneous length fragments of specific deoxy nibonucleotide sequence. prepared according to the method of claim 1, comprising (d) treating the DNA in said fragments to remove any 5' phosphate end groups, (e) incubating the DNA with a restriction endonuclease capable of acting on the specific desired nucleotide sequence to produce two linear subfragments thereof, (f) fractionating the sub-fragments according to their length.
(g) isolating the two sub-fragments.
(h) rejoining the sub-fragments covalently. in a DNA ligase catalyzed reaction, to reconstitute the original specific sequence, and (i) fractionating the rejoined DNA molecules according to their length, whereby the specific sequence is purified from a population of essentially homogeneous length fragments of specific deoxyribonucleotide sequence for recombination with a DNA transfer vector and transfer to a microorganism.
3. A method according to claim 2 wherein alkaline phosphatase is used to remove the 5'-phosphate groups.
4. A method according to claim 2 wherein fractionation by length is accomplished by gel electrophoresis.
5. A method according to any of claims 1 to 4 wherein the endonuclease is selected from the enzymes Hae III. Hha I and Hin (f) I.
6. A recombinant DNA transfer vector comprising a fragment of a specific deoxyribonucleotide sequence prepared by the method of claim 1.
7. A recombinant DNA transfer vector comprising a fragment of a specific deoxyribonucleotide sequence prepared by the method of claim 2.
8. A microorganism strain modified to contain a fragment of a specific deoxyribonuc leotide sequence prepared by the method of claim 1.
9. A microorganism strain modified to contain a fragment of a specific deoxyibonucleotide sequence prepared by the method of claim 2.
10. A microorganism strain modified to contain the DNA transfer vector of claim 6.
11. A microorganism strain modified to contain the DNA transfer vector of claim 7.
12. A recombinant DNA transfer vector comprising codons for human chorionic somatomammotropin comprising the nucleotide sequence: 5'- G GCL24ATM25GAK26ACL27TAK28CAJ29GAJ30TTK31GAJ32 GAJ33ACL34TAK35ATM36CCL37AAJ38GAK39CAJ40AAJ41TAK42 QR43S43TTK44X45TY45CAK46GAK47QR48S48CAJ49ACL50QR51 S51TTK52TGK53TTK54QR55S55GAK56QR57S57ATM58CCL59 ACL60CCL61QR62S62AAK63ATC64GAJ65GAJ66ACL67CAJ68CAJ69 AAJ70QR71S71AAK72X73TY73GAJ74X75TY75X76TY76W77GZ77 ATM78QR79S79X80RY80X81TY81X82TY82ATM83GAJ84QR85S85 TGG86X87TY87GAJ88CCL89GTL90W91GZ91TTK92X93TY93W94 GZ94QR95S95ATG96TTK97GCL98AAK99AAK100X101TY101 GTL102TAK103GAK104ACL105QR106S106GAK107QR108 S108GAK109GAK110TAK111CAK112X113TY113X114TY114 AAJ115GAK116X117TY117GAJ118GAJ119GGL120ATM121CAJ122 ACL123X 124TY 124ATci 125GGL126W127ciZ 127X12gTY12gGAJ12g GAK130GGL131QR132S132W133GZ133W134GZ134ACL135 GGL136CAJ137ATM138X139TY139AAJ140CAJ141ACL142TAK143 QR144S144AAJ145TTK146GAK147ACL148AAK149QR150S150 CAK151AAK152CAK153GAK154GCL155X156TY156X157TY157 AAJ158AAK159TAK160GGL161X162TY162X163TY163TAK164 TGK165TTK166W167GZ167AAJ168GAK169ATG170GAK171 AAJ172GTL173GAJ174ACL175TTK176X177TY177W178GZ178 ATGG179TL180CAJ181TGK182W183GZ183QR184S184GTL185 GAJ186GGL187QR188S188TGK189GGL190TTK191 TAGGTGCCCGAGTAGCATCCTGTGACCCCTCCCCAGTGCCTCTCCTGGCC-3' wherein A is deoxyadenyl, G is deoxyguayl, C is deoxycytosyl, T is thymidyl, J is A or G; K is T or C; L is A, T, C or GI: M is A, C or T; Xn is T or C, if Yn is A or G, and C if Y0 is C or T; Y0 is A, G, C or T, if X0 is C, and A or G if X0 is T; Wn is C or A, if Zn is G or A, and C if Z0 is C or T; Zn is A, G, C or T, if Wn is C, and A or G if Wn is A; QRn is TC, if Sn is A, G, C or T, and AG if Sn is T or C: Sn is A, G, C or T, if QRn is TC, and T or C if QRn is AG and subscriptnumerals, n, refer to the amino acid position in human chorionic somatomammotropin, for which the nucleotide sequence corresponds, according to the genetic code. the amino acid positions being numbered from the amino end.
13. A DNA transfer vector according to claim 12 comprising a plasmid.
14. The DNA transfer vector of claim 12 wherein the plasmid is pBR-322.
15. A microorganism modified to contain the DNA transfer vector of claim 12.
16. A microorganism according to claim 15. comprising a bacterium.
17. A microorganism according to claim 16. comprising Escherichia coli X-1776 containing plasmid pBR-322.
18. A method according to claim 1 of purifying a fragment of a specific deoxyribonucleotide sequence, being a method substantially as described in Example 1, 2, 3 or 5.
19. A method according to claim 2 of purifying a gragment of a specific deoxyribonucleotide sequence, being a method substantially as described in Example 3 or 5.
GB22418/78A 1977-09-23 1978-05-25 Purification of nucleotide sequences suitable for expression in bacteria Expired GB1568047A (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US83621877A 1977-09-23 1977-09-23

Publications (1)

Publication Number Publication Date
GB1568047A true GB1568047A (en) 1980-05-21

Family

ID=25271469

Family Applications (1)

Application Number Title Priority Date Filing Date
GB22418/78A Expired GB1568047A (en) 1977-09-23 1978-05-25 Purification of nucleotide sequences suitable for expression in bacteria

Country Status (25)

Country Link
JP (1) JPS5448775A (en)
AT (2) AT369386B (en)
AU (1) AU519528B2 (en)
BE (1) BE868853A (en)
CS (1) CS235069B2 (en)
DD (2) DD145917A5 (en)
DE (1) DE2825595A1 (en)
DK (1) DK233378A (en)
ES (1) ES470340A1 (en)
FI (1) FI64187C (en)
FR (1) FR2404013A1 (en)
GB (1) GB1568047A (en)
GR (1) GR73558B (en)
IE (1) IE47332B1 (en)
IL (1) IL54791A (en)
IT (1) IT1098340B (en)
LU (1) LU79938A1 (en)
NL (1) NL7806358A (en)
NZ (1) NZ187342A (en)
PL (1) PL209748A1 (en)
PT (1) PT68127A (en)
RO (2) RO78407A (en)
SE (3) SE7806087L (en)
YU (1) YU214378A (en)
ZA (1) ZA782933B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4338397A (en) 1980-04-11 1982-07-06 President And Fellows Of Harvard College Mature protein synthesis
US4530901A (en) * 1980-01-08 1985-07-23 Biogen N.V. Recombinant DNA molecules and their use in producing human interferon-like polypeptides
US5670371A (en) * 1983-07-15 1997-09-23 Bio-Technology General Corp. Bacterial expression of superoxide dismutase
US6229003B1 (en) 1980-03-24 2001-05-08 Yeda Research And Development Company Ltd. Production of bovine growth hormone by microorganisms
US6268122B1 (en) 1978-12-22 2001-07-31 Biogen, Inc. Recombinant DNA molecules and their method of production
US6835557B1 (en) 1980-01-08 2004-12-28 Biogen, Inc. DNA sequences, recombinant DNA molecules and processes for producing human interferon-like polypeptides
US7588755B1 (en) 1980-04-03 2009-09-15 Biogen Idec Ma Inc. DNA sequences, recombinant DNA molecules and processes for producing human fibroblast interferon-like polypeptides

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
BR7807290A (en) * 1977-11-08 1979-06-12 Genentech Inc PLASMIDEO, PROCESS FOR THE PRODUCTION OF A SPECIFIC POLYPEPTIDE CLONAL VEHICLE, TRANSFORMING BACTERIAL CULTURE, PROCESS TO PRODUCE AN IMMUNOGENIC SUBSTANCE, PROCESS TO PREPARE A STRUCTURAL, AND POLYDESOXIRRIBONUCLEOTIDE GENE
ZA802992B (en) * 1979-06-01 1981-10-28 Univ California Human pre-growth hormone
US4342832A (en) * 1979-07-05 1982-08-03 Genentech, Inc. Method of constructing a replicable cloning vehicle having quasi-synthetic genes
CH659826A5 (en) * 1981-06-02 1987-02-27 Wakunaga Yakuhin Kk PROCESS FOR OBTAINING A MICROORGANISM TRANSFORMED BY A PLASMID COMPRISING A GENE OF 27-DESAMIDOSECRETINE STRUCTURE.

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
NZ187300A (en) * 1977-05-27 1982-08-17 Univ California Dna transfer vector and micro-organism modified to contain a nucleotide sequence equivalent to the gene of a higher organism
AT373281B (en) * 1977-11-08 1984-01-10 Genentech Inc METHOD FOR PRODUCING A STRUCTURAL GENE

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6268122B1 (en) 1978-12-22 2001-07-31 Biogen, Inc. Recombinant DNA molecules and their method of production
US6270955B1 (en) 1978-12-22 2001-08-07 Biogen, Inc. Pharmaceutical compositions and methods for producing antibodies to hepatitis b virus and kits and methods for detecting antibodies to hepatitis b virus
US6297355B1 (en) 1978-12-22 2001-10-02 Biogen, Inc. Polypeptides displaying HBV antigenicity or hbv antigen specificity
US4530901A (en) * 1980-01-08 1985-07-23 Biogen N.V. Recombinant DNA molecules and their use in producing human interferon-like polypeptides
US6835557B1 (en) 1980-01-08 2004-12-28 Biogen, Inc. DNA sequences, recombinant DNA molecules and processes for producing human interferon-like polypeptides
US6229003B1 (en) 1980-03-24 2001-05-08 Yeda Research And Development Company Ltd. Production of bovine growth hormone by microorganisms
US7588755B1 (en) 1980-04-03 2009-09-15 Biogen Idec Ma Inc. DNA sequences, recombinant DNA molecules and processes for producing human fibroblast interferon-like polypeptides
US4338397A (en) 1980-04-11 1982-07-06 President And Fellows Of Harvard College Mature protein synthesis
US5670371A (en) * 1983-07-15 1997-09-23 Bio-Technology General Corp. Bacterial expression of superoxide dismutase

Also Published As

Publication number Publication date
IT1098340B (en) 1985-09-07
PT68127A (en) 1978-07-01
RO80006A (en) 1982-10-11
NZ187342A (en) 1983-09-02
AT369386B (en) 1982-12-27
FI64187C (en) 1983-10-10
GR73558B (en) 1984-03-15
BE868853A (en) 1978-11-03
LU79938A1 (en) 1978-12-12
DK233378A (en) 1979-03-24
JPS5448775A (en) 1979-04-17
ES470340A1 (en) 1979-09-16
NL7806358A (en) 1979-03-27
FR2404013B1 (en) 1983-07-22
SE8305163L (en) 1983-09-23
AT375673B (en) 1984-08-27
FR2404013A1 (en) 1979-04-20
PL209748A1 (en) 1979-12-03
SE8305162D0 (en) 1983-09-23
FI64187B (en) 1983-06-30
ATA456678A (en) 1982-05-15
SE8305163D0 (en) 1983-09-23
ZA782933B (en) 1979-05-30
IT7824641A0 (en) 1978-06-16
YU214378A (en) 1984-04-30
CS235069B2 (en) 1985-04-16
SE7806087L (en) 1979-03-24
ATA248881A (en) 1984-01-15
AU3648978A (en) 1979-11-29
SE8305162L (en) 1983-09-23
AU519528B2 (en) 1981-12-10
IE47332B1 (en) 1984-02-22
IE781054L (en) 1979-03-23
IL54791A (en) 1983-07-31
FI781676A (en) 1979-03-24
DD145917A5 (en) 1981-01-14
RO78407A (en) 1982-02-26
DD153393A5 (en) 1982-01-06
DE2825595A1 (en) 1979-04-12

Similar Documents

Publication Publication Date Title
US4363877A (en) Recombinant DNA transfer vectors
KR870000701B1 (en) The human growth hormone preparing method
US4440859A (en) Method for producing recombinant bacterial plasmids containing the coding sequences of higher organisms
FI64640B (en) FOERFARANDE FOER FRAMSTAELLNING AV EN KOMBINATIONS-DNA-MOLEKYLSOM INNEHAOLLER EN FOER INSULIN KODANDE NUCLEOTIDSEKVENS HOCSOM KAN ANVAENDAS VID FRAMSTAELLNING AV EN INSULIN PRODU RCENDE MICROORGANISM
EP0020147B1 (en) A dna transfer vector for human pre-growth hormone, a microorganism transformed thereby, and a method of cloning therefor
US4407948A (en) Purification of nucleotide sequences suitable for expression in bacteria
US4652525A (en) Recombinant bacterial plasmids containing the coding sequences of insulin genes
GB1568047A (en) Purification of nucleotide sequences suitable for expression in bacteria
EP0095350A2 (en) A method of producing human gamma-interferon-like polipeptide
US4283489A (en) Purification of nucleotide sequences suitable for expression in bacteria
US4447538A (en) Microorganism containing gene for human chorionic somatomammotropin
US4322499A (en) Adrenocorticotropin-lipotropin precursor gene
Wu et al. Synthetic oligodeoxynucleotides for analyses of DNA structure and function
WO1988005082A1 (en) Microbial production of peptide oligomers
FI68261C (en) FOERFARANDE FOER FRAMSTAELLNING AV EN REKOMBINATIONS-DNA-MOLEKYL VILKEN HAR EN HUMANT
Miller Use of recombinant DNA technology for the production of polypeptides
US4738928A (en) Method for cloning genes
KR840002126B1 (en) Method for purifying of nucleotide sequences suitable for expression in bacteria
CA1156166A (en) Recombinant bacterial plasmids containing the coding sequences of insulin genes
FI74732B (en) DNA-OEVERFOERINGSVEKTOR, VILKEN INNEHAOLLER EN NUCLEOTIDSEKVENS SOM KODAR FOER TILLVAEXTHORMON.
Padayatty et al. Cloning of chicken globin cDNA in bacterial plasmids
FI75185B (en) DNA-OEVERFOERINGSVEKTOR, VILKEN INNEHAOLLER EN FOER INSULIN KODANDE NUCLEOTIDSEKVENS.
FI65446C (en) FOERFARANDE FOER FRAMSTAELLNING AV EN BAKTERIE SOM INNEHAOLLEREN FOER INSULIN KODANDE NUCLEOTIDSEKVENS
HU193866B (en) Process for preparing plasmid and e.coli bakterium comprising dna sequence and coding humane growth hormone
Gaastra et al. The Determination of DNA Sequences

Legal Events

Date Code Title Description
PS Patent sealed [section 19, patents act 1949]
PCNP Patent ceased through non-payment of renewal fee