AU630888B2 - Probes for the detection of rflp in eucaryotic genomes - Google Patents

Probes for the detection of rflp in eucaryotic genomes Download PDF

Info

Publication number
AU630888B2
AU630888B2 AU42220/89A AU4222089A AU630888B2 AU 630888 B2 AU630888 B2 AU 630888B2 AU 42220/89 A AU42220/89 A AU 42220/89A AU 4222089 A AU4222089 A AU 4222089A AU 630888 B2 AU630888 B2 AU 630888B2
Authority
AU
Australia
Prior art keywords
dna
polynucleotide
probe
sequence
polynucleotide sequence
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
AU42220/89A
Other versions
AU4222089A (en
Inventor
Howard Jay Baum
Nancy Yuk-Yu Ip
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lifecodes Corp
Original Assignee
Lifecodes Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lifecodes Corp filed Critical Lifecodes Corp
Publication of AU4222089A publication Critical patent/AU4222089A/en
Application granted granted Critical
Publication of AU630888B2 publication Critical patent/AU630888B2/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • C12Q1/6827Hybridisation assays for detection of mutation or polymorphism
    • C12Q1/683Hybridisation assays for detection of mutation or polymorphism involving restriction enzymes, e.g. restriction fragment length polymorphism [RFLP]
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/156Polymorphic or mutational markers

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Zoology (AREA)
  • Wood Science & Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Analytical Chemistry (AREA)
  • Microbiology (AREA)
  • Immunology (AREA)
  • Molecular Biology (AREA)
  • Biotechnology (AREA)
  • Biophysics (AREA)
  • Physics & Mathematics (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Description

*CORRECTED
L VERSION* .t "t WORLD INTELLEC P TR GA IATI International Bureau PCr INTERNATIONAL APPLICATION Pt! JSHED UNDER THE PATENT COOPERATION TREATY (PCT) (51) International Patent Classification 5 (11) International Publication Number: WO 90/02818 C12Q 1/68 Al (43) International Publication Date: 22 March 1990 (22.03.90) (21) International Application Number: PCT/US89/03473 (81) Designated States: AT (European patent), AU, BE (European patent), BF (OAPI patent), BJ (OAPI patent), CF (22) International Filing Date: 14 August 1989 (14.08.89) (OAPI patent), CG (OAPI patent), CH (European patent), CM (OAPI patent), DE (European patent), FR (European patent), GA (OAPI patent), GB (European Priority data: patent), IT (European patent), JP, LU (European pa- 241,639 8 September 1988 (08.09.88) US tent), ML (OAPI patent), MR (OAPI patent), NL (European patent), SE (European patent), SN (OAPI patent), SU, TD (OAPI patent), TG (OAPI patent).
(71) Applicant: LIFECODES CORPORATION [US/US]; Old Saw Mill River Road, Valhalla, NY 10595 (US).
Published (72) Inventors: IP, Nancy, Yuk-Yu 23 Emery Drive, Westover ilth international search report.
Park, Stamford, CT 06902 BAUM, Howard, Jay Before the expiration of the time limit for amending the 18 Stubbe Drive, Stony Point, NY 10980 claims and to be republished in the event of the receipt of amendments.
(74) Agents: SALIWANCHIK, Roman et al.; Saliwanchik Saliwanchik, 2421 N.W. 41st Street, Suite A-1, Gainesville, FL 32606 (US).
(54)Title: PROBES FOR THE DETECTION OF RFLP IN EUCARYOTIC GENOMES pAC 365 z tn c %4- C: C Ln
II
EcoRI 0 u I BamHI 0.65 1.35 (57) Abstract A method for obtaining polynucleotide sequences which are useful for detecting Variable Tandem Repeat polymorphism at multiple genetic loci and other-genetic analyses under high stringency conditions (hence, with high specificity) is herein disclosed. Also disclosed are polynucleotide sequences and other compositions useful for DNA polymorphism and other genetic analyses.
WO 90/02818 PCT/US89/03 4 7 3 1
DESCRIPTION
Probes for the detection of RFLP in eucaryotic genomes.
FIELD OF THE INVENTION This: :invention relates to the field of molecular genetics. More specifically, this invention relates to polynucleotides useful for nucleic acid hybridizations, methods for. producing these polynucleotides, and methods for applying these polynucleotides in genetic analysis.
BACKGROUND OF THE INVENTION Double stranded DNA is the most common form of depository of genetic information of organisms. Double stranded :DNA has two complementary strands. Each strand is a polynucleotide sequence and the base sequences on the two complementary strands form Watson-Crick base pairs.
The duplex-structure of DNA can be disrupted in a number of ways, for example, by heating a duplex DNA solution in a 0.1 ,M NaCl to 100 C for a few minutes. At this temperature, the two. strands of duplex DNA separate. If the solution. is gradually cooled, the two strands of duplex DNA can re-associate to reform the duplex structure.
The process .of duplex formation from complementary polynucleotide. or oligonucleotide sequences has been used advantageously fortgenetic analysis. Typically, a labeled polynucleotide or oligonucleotide sequence is used in a reassociation process whereby it forms a duplex structure with. a substantially complementary sequence from a genetic source of interest. Because the labeled polynucleotide or oligonucleotide sequence is normally, though not necessarily, obtained from a source other than the source of interest, the .process of association between complementary sequences has been known as nucleic acid hybridization, or just hybridization for short. The WO 90/02818 IPCT/US89/03473 2 associational event provides genetic information about the source of interest through detection of the label on the labeled polynucleotide or oligonucleotide sequence. For this reason, the labeled polynucleotide or oligonucleotide sequence is called a probe. The label can be any suitable signal-generating moiety, and many such moieties are well known in the art.
Nucleic acid hybridization has been successfully applied in the study of DNA structure, gene purification, gene localization, the establishment of paternity and other familial relationship, genetic identity for forensic purposes, genetic identity of transplants, and detection and diagnosis of diseases and genetic traits.
One very powerful technique in the application of nucleic acid hybridization involves the fractionation of the complex genetic material to be analyzed prior to hybridization. E.M. Southern's procedure is the most celebrated and the most widely used of this genus. See Southern, J. Mol. Biol. 98: 503-517 (1975). Such a genetic analysis can reveal not only the presence or absence of complementary target nucleic acid sequences, but also the size of the restriction fragment(s) containing the targe sequence. Genetic variations within a species may be reflected by variations among individuals in the size of the restriction fragments containing a particular target sequence. Conversely, genetic relatedness of a group of individuals may be reflected by a deviation from random variations that exist among unrelated individuals. This aspect of genetic analysis has been called Restriction Fragment Length Polymorphism (RFLP).
"Single-copy" DNA probes have been used in this approach with success. For example, certain genetic traits and disease states have been identified this way.
See Gusella et al., Nature, 306: 234-238 (1983); Orkin, Cell 47:845-850 (1986).
WO 90/02818 PCT/US89/03473 3 The genetic information which can be adduced using "single-copy" DNA probes depends on the number of probes used, the number of genetic loci each probe is capable of detecting, the heterozygosities and the allele frequency of the relevant genetic loci. To date, "single-copy" DNA sequences are known to detect only a single locus per sequence. Moreover, heterozygosity of DNA in higher organism is low. In man, it is about 0.001 per base pair.
Finally, most polymorphic states detected are only dimorphic there are only two representational states: absence or presence of a relevant restriction site on the restriction fragment in question). As is often the case, critical-individuals in a genetic analysis are homozygous, and the genetic analysis may be uninformative.
Genetic analysis in higher organisms has been simplified considerably by the availability of probes for hypervariable regions of genomic DNA. These hypervariable regions show multi-allelic variation and high heterozygosities. These regions also appear to be widely interspersed within the genome. In each case, the hypervariable region comprises a variable number of tandem repeats of a short sequence (thus, Variable Tandem Repeats or VTR), and polymorphism results from allelic differences in the number of repeats at a given locus. This type of polymorphism, a subclass of RFLP has, been called VTR Polymorphism. It is believed that the variation in repeat number arises by mitotic or meiotic unequal exchanges or by DNA "slippage" during replication. Therefore, if genomic DNA is digested with a restriction endonuclease which does not cut within the repeat unit, and if a genetic locus encompasses a variable tandem repeat or VTR, allelic markers would exist for that locus. (It should be noted that the so-called repeat unit is a hypothetical consensus sequence, and any actual VTR sequence in the genome is really a string of short "core" sequences, each of which is I-.i WO 90/02818 PCT/US8/03473 4 very highly homologous, but usually not identical to the consensus sequence. Indeed, a "core" sequence may differ in length from the consensus sequence. The consensus sequence is derived from examining and "averaging" over a large number of "core" sequences. A "core" sequence is typically at least 70%, but often more than 70%, homologous to the consensus sequence.) Jarman et al. have described a hypervariable region of DNA located 8-kb downstream of the human alpha globin complex. The EMBO J. 5: 1857-63 (1986). This hypervariable region is composed of an array of imperfect 17-bp tandem repeats, the number of which differs considerably (70-450) from one allele to another. Thus, this locus is highly polymorphic. Genetic polymorphism which reflects variations in the number of such tandem among individuals has been called Variable Tandem Repeat Length Polymorphism.
The VTR described by Jarman et al., supra, crosshybridizes with other hypervariable genetic loci at low stringency. Thus a polynucleotide probe prepared from this region is potentially a very powerful probe, capable of probing many genetic loci in a single try.
A typical RFLP analysis involves digesting target genomic DNA with a restriction endonuclease, separating the digested DNA by gel electrophoresis, transferring the fractionated DNA in a denatured state to a binding surface, hybridizing the transferred DNA with a suitable probe, detecting the signals generated by the probe molecules which have become hybridized to the target DNA. The pattern of the signals generated would provide information about the target DNA. The pattern of signals can also be stored for later use, for instance, to determine or donfirm an individual's identification the pattern would be the individual's genetic fingerprint).
More commonly, two or more target DNA's are processed WO 90/02818 PCT/US89/03473 for RFLP analysis. Depending on the sources of the target DNA,'.the. information generated by comparison of the Spatterns .can.be used immediately as in the case of genetic identity identification of a suspect of a crime), or in the case where a high degree of genetic relatedness is present paternity testing, sib analysis and the like). In other cases, the information derived from pattern, comparison may form a part of a larger information-gathering effort. Pedigree analysis of distant relatives and correlation of a gene of genotype with a trait or medical condition are but two examples.
However the RFLP analysis is to be used, the pattern of signals: is controlled in large part by the probe or probes used in the analysis. A polynucleotide probe may be useful 'for any of a number of features.
First, a probe may be able to detect polymorphism at a locus that other probes cannot detect. The locus may be particularly, useful .for genetic analysis in the general population because it has many evenly distributed alleles.
Alternatively, the locus may be particularly useful for genetic analysis in a highly restricted segment of the population because it has a' rare allele.
Second;' a probe may be able to detect many loci simultaneously and unambiguously when a particular restriction endonuclease is used to digest the target DNA.
In this connection, it is useful to note that certain restriction, endonucleases may be preferred because of the history of the target DNA samples, e.g. forensic samples which has been exposed to the elements for an extended period of time.
Third, probes are often used in combination simultaneously because their resolving power may be compounded. Compounding is obtained when the signals produced by the several probes do not overlap and permit unambiguous. assignment of each (or substantially each) I~ C~ WO 90/02818 PCT/US89/03473 6 signal to an allele of a locus. See, "The Application Of DNA-Print For The Estimation Of Paternity", Baird et al. in Advances in Forensic Haemogenetics 2: 354- 358, Springer-Verlag, New York (1987).
How RFLP phenotypes can be practically applied for paternity and forensic determinations have been discussed in Baird et al., supra; Baird et al. "The Application Of DNA-PRINTTM For Identification From Forensic Biological Materials", in Adv. in Forensic Haemogenetics 2: 396-402, Springer-Verlag, New York (1987); and Baird et al. (III), Am. J. Hum. Genet. 39:489-501 (1986) and citations therein.
These papers are hereby incorporated by reference.
Reference has been made earlier in the instant disclosure that hybrid formation can take place even where there is a certain degree of mismatch between a probe and its substantially complementary target sequence. This process is particularly important for the utility of multilocus probes. Such probes generally form well-matched hybrids with target sequences which originate from the same genetic locus as the probe, but they form less well-matched "cross-hybrids" with target sequences from other loci. As a result, the loci that can be analyzed with a given prob.
may vary significantly with the reaction (association and washing) conditions of the hybridization test. Thus, many loci are detectable under low stringency conditions, but only a single locus is detectable under high stringency conditions. Therefore, a polynucleotide sequence which is capable of probing multiple polymorphic loci even under high stringency conditions represent an additional bonus.
Strictly speaking, stringency of conditions has two components: conditions which govern formation of the hybrids and conditions which govern the stability of (the duplex structure of) the hybrids. Typically, however, a hybridization test is performed at low (or relaxed) conditions during the hybrid formation phase to speed up WO 90/02818 PCT/US89/03473 7 the process of association. Therefore, for the purpose of this application, stringency of conditions refer solely to .conditions which govern the stability of the hybrids. (Of i course, if a hybrid is not stable under a given set of conditions, it would not be formed in the first place under those conditions.) :The-factors which govern the stability of a hybrid are many, including, but not limited to the temperature, the ionic strength, the molecular species of the salts used, the degree of modification or elimination of bases on a polynucleotide sequence, the degree and nature of mismatch, and the length and type of polynucleotides sequence.
Variation in one factor may be compensated or aggravated by variations in other factors. These and other relevant facts are well known to a person of ordinary skill in the art of molecular genetics.
For the purpose of this invention, low stringency conditions mean an aqueous environment containing about 2X SSC at about 50-65 C, or the equivalents thereof; and high stringency conditions mean an aqueous environment containing about 0.1X SSC or less at about 65°C, or the equivalents thereof. [For formulation of iX SSC, see Example 3 in Section 6 below].
For.the purpose of this invention, a "discrete polynucleotide sequence or subsequence" means a polynucleotide sequence or subsequence of greater than 15 nucleotides, but preferably greater than 50 nucleotides, and very preferably greater than 100 nucleotides; and a polynucleotide means a chain of about 15 nucleotides or more, and embraces the upper range of what sometimes passes as oligonucleotides.
Many "single copy" DNA probes are known in the art.
These probes do not relate closely to the present invention because their utility is generally limited to providing genetic information at a single locus; and i WO 90/02818 PCT/US89/03473 8 limited to detecting polymorphis: caused by alteration of a restriction site in the neighborhood of the target genomic sequences. The polymorphic probes of the present invention are of the VTR type and do not suffer from these limitations.
Polymorphic probes of the VTR type have also been described. However, the hybrids formed between a VTR probe and its target genomic sequences tend to be stable only under low to moderate stringency conditions, except for hybrids between the probe and target sequences from a single genetic locus. See Nakamura et al. Science 235: 1616-1622 (1987); Jeffreys et al., Nature 314:67-73 (1985). The exceptional hybrids are stable even under high stringency conditions, possible reflecting the fact that the probes originated from this locus.
Sometimes a genetic locus detectable with a VTR type probe may be very large, spanning several hundred kilobases. In a restriction fragment length polymorphism analysis of such a large locus, a VTR can yield many polymorphic bands under high stringency conditions.
However, the information which can be derived from such an analysis remains confined to the one locus. In fact, VTF probes for loci of this kind have disadvantages. First, recombination within the large locus (which is expected to be more frequent than a similar but smaller locus) can complicate data analysis. Second, to obtain more extensive information than is obtainable from only a single locus, two or more probes are preferably used in combination. The multiplicity of non-informational bands from the large locus may obscure bands detected by other probes used in combination, thereby making data analysis very difficult.
The alternative to using probes in combination would be more costly multiple analyses of restriction fragment length polymorphism. Therefore, the difference between a WO 90/02818 PCT/US89/03473 9 probe for a,.large single locus and a probe for multip.e loci is substantive, and not merely semantic.
BRIEF SUMMARY OF THE INVENTION The subject invention concerns the following: polynucleotide sequences useful for detecting polymorphism' in a -species of organism of interest, or a subpopulation thereof; a polynucleotide sequence useful for detecting polymorphism at multiple genetic loci, and characterized by its ability to form hybrids with restriction fragments of DNA, of 7.1, 6.2, 4.4, 4.2, 4.1, 3.7, 3.6, 2.6 and 2.2 kilobases, produced by PstI digestion of genomic DNA extracted from K562 cells; a polynucleotide sequence useful for detecting polymorphism at multiple genetic loci and capable of forming hybrids with genomic DNA fragments produced by complete digestion of Caucasoid, American Black or Hispanic genomic DNA with the restriction endonuclease PstI of approximate, allelic lengths and allelic frequencies as given in Table 2; a polynucleotide sequence useful for detecting polymorphism at multiple genetic loci with high specificity, the polynucleotide sequence forms hybrids with. genomic sequences at multiple genetic loci which remain stable even under high stringency conditions; a method for determining or obtaining a polynucleotide sequence useful for detecting polymorphism at multiple genetic loci with high specificity; the use of the above-described polynucleotide sequence as a probe for polymorphism; the use- of the above-described polynucleotide sequence as a probe for polymorphism at multiple genetic loci; a method of genetic analysis comprising: I digesting a DNA sample with a restriction endonuclease; separating the DNA restriction fragments according to size by electrophoresis; transferring the separated DNA to a binding surface; hybridizing the transferred DNA with a polynucleotide probe labeled with a signal-generating moiety, wherein the polynucleotide probe is a polynucleotide probe of the present invention; and detecting the signal generated; whereby the pattern of signals generated provides information about the composition of the DNA sample; and recombinant vectors and cells useful for producing polynucleotides of the invention.
According to a first embodiment of this invention, there is provided a polynucler'ide sequence capable of forming hybrids with genomic Variable Tancem Repeat DNA sequences at multiple, polymorphic genetic loci of a eukaryote under hybridization conditions wherein said hybrids are stable under high stringency conditions.
According to a second embodiment of this invention, there is provided a polynucleotide sequence represented by the human DNA insert in S 20 pAC329.
According to a third embodiment of this invention, there is provided a polynucleotide sequence represented by the human DNA insert in pAC344.
According to a fourth embodiment of this invention, there is provided a polynucleotide sequence represented by the restriction fragment of about 1.35 kilobase pairs produced by digestion of pAC365 with EcoRI and BamHI.
According to a fifth embodiment of this invention, there is provided a polynucleotide sequence represented by the restriction fragment of about 0.55 kilobase pairs produced by digestion of pAC365 with EcoRI and NcoI.
According to a sixth embodiment of this invention, there is provided a discrete polynucleotide subsequence of the polynucleotide sequence of any of the second, third, fourth or fifth embodiments, wherein said subsequence is capable of forming hybrids with genomic DNA at one or more polymorphic genetic loci of a eukaryote, and mutational, insertional or deletional variants which form stable hybrids under high amg/1391v Ii stringency conditions.
According to a seventh embodiment of this invention, there is provided a polynucleotide sequence capable of forming a hybrid with the polynucleotide sequence of any of the second, third, fourth or fifth embodiments, wherein the hybrid is stable under high stringency conditions.
According to a eighth embodiment of this invention, there is provided a polynucleotide sequence capable of forming a hybrid with the discrete polynucleotide subsequence or with its variants of the sixth embodiment, wherein the hybrid is stable under high stringency conditions.
According to a ninth embodiment of this invention, there is provided a polynucleotide sequence capable of forming hybrids with genomic DNA fragments, produced by complete digestion of American Blacks' and Hispanics' genomic DNAs with the restriction endonuclease PstI, of approximate lengths and approximate allele frequencies as described in Table 2.
According to a tenth embodiment of this invention, there is provided a polynucleotide sequence useful for detecting polymorphism at multiple genetic loci, and characterized by its ability to form hybrids with restriction fragments of DNA, of 7.1, 6.2, 4.4, 4.2, 4.1, 3.7, 3.6, 2.6 and 2.2 kilobases, produced by PstI digestion of genomic DNA extracted from K562 cells.
According to an eleventh embodiment of this invention, there is provided a recombinant vector comprising the polynucleotide sequence of any of the second, third, fourth or fifth embodiments.
According to a twelfth embodiment of this invention, there is provided a recombinant vector comprising the polynucleotide subsequence of the sixth embodiment.
According to a thirteenth embodiment of this invention, there is provided a recombinant vector comprising the polynucleotide sequence of the seventh embodiment.
According to a fourteenth embodiment of this invention, there is provided a recombinant vector comprising the polynucleotide sequence of the eighth embodiment.
According to a fifteenth embodiment of this invention, there is provided a recombinant vector comprising the polynucleotide sequence of the ninth or tenth embodiment.
amg/1391v L i According provided a cell embodiment.
According provided a cell embodiment.
According provided a cell embodiment.
According provided a cell embodiment.
According provided a cell embodiment.
According to a sixteenth embodiment of this invention, there is containing the recombinant vector of the eleventh to a seventeenth embodiment of this invention, there is containing the recombinant vector of the twelfth to a eighteenth embodiment of this invention, there is containing the recombinant vector of the thirteenth to a nineteenth embodiment of this invention, there is containing the recombinant vector of the fourteenth to a twentieth embodiment of this invention, there is containing the recombinant vector of the fifteenth to a twenty first embodiment of this invention, there is provided a method for obtaining the polynucleotide sequence of the first embodiment comprising: hybridizing a library of genomic sequences of an organism under hybridization conditions with a probe comprising a natural or consensus sequence of a Variable Tandem Repeat; selecting library sequences of said library which hybridize with said probe; testing and further selecting from the library sequences selected in step a library sequence which is capable of detecting Variable Tandem Repeat Length polymorphism at multiple genetic loci of said organism or a different organism under high stringency conditions.
According to a twenty second embodiment of this invention, there is provided a method for obtaining the polynucleotide sequence of the first embodiment comprising: hybridizing a library of genomic sequences of an organism a under hybridization conditions with a probe comprising a natural or consensus sequence of a Variable Tandem Repeat; selecting library sequences of said library which hybridize with said probe; producing subsequences of said selected library sequences by enzymatic means; amgl1391v
-LIC
-lOctesting and selecting from the subsequences produced in step a subsequence which is capable of detecting Variable Tandem Repeat Length polymorphism at multiple genetic loci of said organism or a different organism under high stringency conditions.
According to a twenty third embodiment of this invention, there is provided a polynucleotide probe comprising the polynucleotide sequence of any of the second, third, fourth or fifth embodiments, or mutational, insertional, or deletional variants which form stable hybrids under high stringency conditions.
According to a twenty fourth embodiment of this invention, there is provided a polynucleotide probe comprising the discrete polynucleotide subsequence of the sixth embodiment, or mutational, insertional, or deletional variants which form stable hybrids under high stringency conditions.
According to a twenty fifth embodiment of this invention, there is provided a polynucleotide probe comprising the discrete polynucleotide subsequence of the seventh embodiment, or mutational, insertional, or deletional variants which form stable hybrids under high stringency conditions.
According to a twenty sixth embodiment of this invention, there is provided a polynucleotide probe comprising the polynucleotide sequence of the eighth embodiment, or mutational, insertional, or deletional variants which form stable hybrids under high stringency conditions.
According to a twenty seventh embodiment of this invention, there is provided a polynucleotide probe comprising the polyii'icleotide sequence of the ninth or tenth embodiments, or mutational, insertional, or deletional variants which form stable hybrids under high stringency conditions.
According to a twenty eighth embodiment of this invention, there is 30 provided a method of genetic analysis comprising: digesting a DNA sample with a restriction endonuclease; separating the DNA restriction fragments according to size by electrophoresis; transferring the separated DNA to a binding surface; hybridizing the transferred DNA with polynucleotide probe labeled with a signal-generating moiety, wherein the polynucleotide probe is the polynucleotide probe of the twenty third embodiment; and r ce ii a ~-i~o a o a o a oa o or os r D amg/1391v -lOddetecting the signals generated; whereby the pattern of signals generated provides information about the composition of the DNA sample.
According to a twenty ninth embodiment of this invention, there is provided a method of genetic analysis comprising: digesting a DNA sample with a restriction endonuclease; separating the DNA restriction fragments according to size by electrophoresis; transferring the separated DNA to a binding surface; hybridizing the transferred DNA with a polynucleotide probe labeled with a signal-generating moiety, wherein the polynucleotide probe is the polynucleotide probe of the twenty fourth embodiment; and detecting the signals generated; whereby the pattern of signals generated provides information about the composition of the DNA sample.
According to a thirtieth embodiment of this invention, there is provided a method of genetic analysis comprising: digesting a DNA sample with a restriction endonuclease; .iparating the DNA restriction fragments according to size by electrophoresis; transferring the separated DNA to a binding surface; hybridizing the transferred DNA with a polynucleotide probe o labeled with a signal-generating moiety, wherein the polynucleotide probe Sis the polynucleotide probe of the twenty fifth embodiment; and detecting the signals generated; whereby the pattern of signals generated provides information about the composition of the DNA sample.
According to a thirty first embodiment of this invention, there is provided a method of genetic analysis comprising: digesting a DNA sample with a restriction endonuclease; separating the DNA restriction fragments according to size by electrophoresis; transferring the separated DNA to a binding surface; hybridizing the transferred DNA with a polynucleotide probe labeled with a signal-generating moiety, wherein the polynucleotide probe is the polynucleotide probe of the twenty sixth embodiment; and detecting the signals generated; whereby the pattern of amg/1391v -lOesignals generated provides information ahout the composition of the DNA sample.
According to a thirty second embodiment of this invention, there is provided a method of genetic analysis comprising: digesting a DNA sample with a restriction endonuclease; separating the DNA restriction fragments according to size by electrophoresis; transferring the separated DNA to a binding surface; hybridizing the transferred DNA with a polynucleotide probe labeled with a signal-generating moiety, wherein the polynucleotide probe is the polynucleotide probe of the twenty seventh embodiment; and detecting the signals generated; whereby the pattern of signals generated provides information about the composition of the DNA sample.
BRIEF DESCRIPTION OF THE DRAWINGS Figure 1 shows the electrophoretic pattern of restriction fragments of DNA's from recombinant lambda phage 19 which was digested with EcoRI (Figure 1A), and the results of a Southern hybridization blot when the DNA fragments were transferred and probed with oligo-5'-CCCCCCGTGTCGCTGTT-3' (Figure 1B).
Figure 2 shows the results of 2 hybridization blots. PstI digested human genomic DNA from a group of 4 unrelated individuals were probed with either DNA from Bands C of Figure 1A (Figure 2A), or DNA from Band D of Figure 1A (Figure 2B).
Figure 3 shows the results of a hybridization blot where a panel of PstI digested human genomic DNAs from five unrelated individuals was hybridized with either the insert of pAC329 (Figure 3A), or the insert of pAC344 (Figure 3B).
'Figure 4 shows the electrophoretic patterns of the Rsal digests of the inserts of pAC329 and pAC344.
WO 90/02818 PCT/US89/03473 11 Figure 5 shows the results of a hybridization blot where PstI digested human genomic DNA was probed with RsaI restriction',fragments of the inserts of pAC329 and pAC344.
Figures 5A, 5B,- 5C and 5D shows the results when Band 1, 2, 3 or 4 of Figure 4 respectively was used as probe.
Figure 6 shows the restriction map of pAC365.
Figure 7 shows the results of a hybridization blot where PstI digested human genomic DNA was probed with the insert of pAC365.
Figure 8 shows the electrophoretic pattern of the restriction fragments produced by digesting the 1.35 kbp human insert of pAC365 with BstNI.
Figure 9 shows the results of a hybridization blot where PstI digested human genomic DNA was probed with the EcoRI-PstI (Figure 9A) or the EcoRI-NcoI (Figure 9B) subfragment of the 1.35 kbp human insert of pAC365.
Figure 10 shows the results of hybridization blots where PstI digested human genomic DNA was hybridized with the insert of pAC365 under low associational stringency and low stringency wash (Figure 10A); low associational stringency and high stringency wash (Figure 10B); and (3) high associational stringency and high stringency wash (Figure Figure 11 shows the results of hybridization blots where PstI digested human genomic DNA was probed with the 1.35 kbp human insert of pAC256, and washed under either low stringency conditions (Figure 11A), or high stringency conditions (Figure 11B).
Figure' 12 shows the comparative banding patterns in hybridization blots where PstI digested human genomic DNA was probed with either "Band C DNA" (Figure 12A), or with 19-MSP (Figure 12B).
Figure 13 shows the results of a hybridization blot where PstI" digested genomic DNAs from a family spanning WO 90/02818 PCT/US89/03473 12 three generations were probed with the 1.35 kbp human insert of pAC365.
Figure 14 shows the allelic distributions of genetic loci detectable by pAC365 in American Blacks (Figure 14A), Caucasoids (Figure 14B), and Hispanics (Figure 14C).
Figure 15 shows the results of hybridization blots for genetic identification purposes. pAC365 insert was used as probe in a paternity test (Figure 15A), and a forensic test (Figure Figure 16 shows the relationships among the human sequences of the invention.
DETAILED DESCRIPTION OF THE INVENTION One embodiment of the instant invention is a method to obtain polynucleotide sequences useful for detecting polymorphism in a species, or a subpopulation thereof. A library of genomic DNA digested with one or more restriction endonucleases and cloned in a suitable recombinant vector is screened with a polynucleotide probe which comprises a string of "core" sequences (hereinafter "screening probe"). This string of "core" sequences can, but need not be, a monomer, an oligomer or a polymer or E mixture of oligomers and polymers of a consensus sequence or "core" sequence of a VTR. Preferably, the screening probe is a mixture of oligomers of a consensus sequence, because a short consensus sequence can be easily synthesized chemically in large amounts and ligated to form a mixture of oligomers. In a preferred embodiment, the consensus sequence is 5'-CCCCCCGTGTCGCTGTT-3'.
For the purpose of generating a genomic library, it is preferred that the restriction endonuclease digestion of genomic DNA be incomplete. One reason is that many genomic VTR sequences may otherwise evade detection. This would be so if the relevant restriction endonuclease cuts within the VTR sequences, and the bulk of the VTR sequences will be in WO 90/02818 PCT/US89/03473 13 relatively small pieces. The smaller the pieces, the greater the number of recombinant molecules which must be studied so that the human genome will be covered. For the same reason, it is preferred that a recombinant vector which can accommodate large DNA insert be used. Finally, where the recombinant vector has a restricted cloning range, incomplete digestion of the genomic DNA would also tend to avoid under-representation in the library of completely digested products which are smaller than the preferred cloning sizes.
The recombinants which react positively with the screening probe in a hybridization test (hereinafter "positive recombinants") are selected for further examination. In a preferred embodiment, the recombinants are bacteriophages. The standard method of "phage lifts" can be used to identify the recombinants containing DNA inserts which hybridizes to the probe. See Maniatis et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratories, Cold Spring Harbor, New York, (1982).
Briefly, a portion of a phage plaque is transferred to a nylon membrane where the DNA of the phage is immobilized and probed. Many plaques can be transferred in a single lift; moreover, the position of a plaque on the growth plate is in a one-to-one correspondence with the position on the membrane, thus permitting identification of the plaques which give rise to positive results in a hybridization test with the probe. Obviously, many variations of this basic technique can be designed with other cloning and/or transfer and/or identification systems.
Once the positive recombinants have been identified, they can be subjected to tests which prove or disprove their utility. They are used as probes in hybridization tests against genomic sequences of a species of organism of interest, or a relevant subpopulation thereof. While the WO 90/02818 IPCT/US89/03473 14 present invention broadly encompass eukaryotic organisms, one of the more commercially significant use is that of probing mammalian genomes, particularly, the human genome.
In any case, it is very highly preferred that the probe sequences of the present invention be derived from the same species of organisms as the genetic materials which are to be tested in a hybridization test. Thus, for applications of human genetic analysis, the starting library should preferably be a human genomic library. To avoid verbosity, the embodiments of this invention are described as if they apply to humans specifically. The present invention is not so limited, and is to be construed to be applicable generally to mammals and other eukaryotes.
The useful positive recombinants are those which can detect Variable Tandem Repeat Length Polymorphism in humans, at multiple loci, and under high stringency conditions. Genomic DNAs from family members are separately digested with a restriction endonuclease, the digests are separately subjected to size fractionation by, for example, electrophoresis, and the fractionated restriction fragments are prepared for hybridization in any standard method. Positive recombinants or the humai.
sequences or discrete polynucleotide subsequences inserted therein (jointly and severally "test sequences") are used to probe the restriction digests. Preferably, a single test sequence is used at a time. However, several sequences can be grouped together in preliminary tests to determine whether the group as a whole contains any useful sequences.
The hybridization "banding" pattern for each individual member is determined. In particular, the sizes of restriction fragments which hybridize to the test sequences are determined. The segregation scheme of each band within a family or, more commonly, a number of families will inform as to the nature of the genetic locus WO 90/02818 PCT/US89/03473 (loci) being detected. The nature of a genetic locus includes, but is not limited to, the following: 1.
Mendelian or non-Mendelian segregation; 2. phenotype and frequency of alleles (reflected by the size of restriction fragments produced by the restriction endonuclease used to digest the genomic DNA, and the frequency of occurrence in a population); 3. linkage to another locus (reflected by co-segregation with other bands). If the test sequence detects multiple bands which segregate independently, it is capable of detecting multiple genetic loci. If test sequence detects bands of a locus which bands represent different-s.zed fragments among different individuals, it is capable of detecting polymorphism in that polymorphic locus. The determination of the nature of the locus (loci) detected by a test sequence from the segregation scheme is a straight forward application of classical genetics, and is well within the command of a person of ordinary skill in the art of molecular genetics. The size of families, and the number of families needed to provide sufficient information to work out the segregation scheme would vary with the number of genetic loci being detected by the test sequence, the number of alleles in these loci, and the frequency of each allele. An ordinarily skilled artisan would also know how to determine the number and size of families to be studied.
Another embodiment of the instant invention is the test sequences which can detect polymorphism in an species of organism of interest, or a subpopulation thereof (hereinafter"useful test sequence").
In another embodiment of the instant invention, useful test sequences are cloned in recombinant vectors.
In another embodiment, the recombinant vectors comprising the useful test sequences are harbored in a cell.
Molecular cloning and transformation methods are well known in the art.
WO 90/02818 PCT/US89/03473 16 In a preferred embodiment, the useful test sequences are such that the hybrids formed between the test sequence and genomic sequences from at least two genetic loci are stably associated under high stringency conditions.
Because the segregation scheme is both lengthy and expensive to workout, it is sometimes preferably to defer the study of segregation until a test sequence has been better characterized. Thus, it may be preferable to modify the method described hereinabove, namely, to use instead of genomic DNAs from members of families, merely genomic DNAs from random, unrelated individuals. If the banding pattern appear to vary from individual to individual, the test sequence is presumptively treated as being useful as a polymorphic probe. If the number of bands detected in each of several individuals is significantly greater than one or two, the test sequence is presumptively treated as being useful as a multi-locus probe.
The presumptively useful test sequence is analyzed, and less desirable sequence(s) are removed to produce an improved test sequence. For example, a test sequence may comprise a subsequence which is polymorphic as well as a subsequence which is non-polymorphic in the relevant population. The presence of non-polymorphic bands yields no useful genetic information about a human individual, but can interfere with genetic analysis by, for example, obscuring a partially or totally informational band detected by a second probe used in combination with the test sequence. Another example is where the test sequence comprises a subsequence which is a highly repetitive sequence in the human genome. An example of a highly repetitive sequence is the "Alu sequence". See Houck et al., J. Mol. Biol. 132: 289 -306 (1979). Presence 6f such highly repetitive sequences in a probe often cause a high non-informational background signal in a hybridization blot. This background signal can be avoided by eliminating WO 90/02818 PCT/US89/03473 17 the highly repetitive sequence component from the test sequence. See, for example, Sealey et al., Nuc. Acids Res., 13: 1905-1922 (1985).
Still another example is where the test sequence comprise a first subsequence which delivers only a small signal in a hybridization blot relative to the signal delivered by a second subsequence. Here, it may be more advantageous to eliminate the first test subsequence so that a more "cost-effective" probe which delivers a higher signal on a per nucleotide basis may be produced. A more specific example of this type is where the first subsequence is a "single-copy" sequence, and the second subsequence is a Variable Tandem Repeat Sequence On a per nucleotide basis, the second sequence delivers more signal whenever there are more than a single copy at a genetic locus.
Discrete polynucleotide subsequences may be obtained from a test sequence in a number of ways, and are well within the capability of an ordinarily skilled artisan.
For example, one end of the test sequence may be progressively removed by an exonuclease or S1 enzyme, while the other end is being protected. Another example is digestion with a restriction enzyme. Other methods of obtaining subsequences are within the contemplation of the present invention.
After the less desirable subsequences have been eliminated from a test sequence, the remaining portion of the test sequence is used in familial tests for the determination of the nature of the genetic locus (loci) which it is capable of detecting as described above.
Finally, nearby genomic sequences (including nearby VTR sequences) may be reached by chromosome walk.
As discussed in the Background section of this application, duplex formation and stability depend on substantial complementarity between the two strands of a
L,
WO 90/02818 PCT/US89/03473 18 hybrid, and a certain degree of mismatch can be tolerated.
Therefore, whenever a test sequence obtained as described hereinabove has been determined to be useful in probing target polynucleotides of interest, mutations (both single and multiple), deletions, insertions of the useful test sequence, and combinations thereof, wherein said mutations, insertions and deletions permit formation of stable hybrids with said target polynucleotide of interest, are part of the present invention. Mutations, insertions and deletions can be produced in a given polynucleotide sequence in many ways, and these methods are known to an ordinarily skilled artisan. Other methods may become known in the future. The known methods include, but are not limited to: 1. determining analytically the sequence of a test sequence of the present invention, synthesize chemically or otherwise an artificial sequence which is a mutation, insertion or deletion of the test sequence; 2. using a test sequence of the present invention to obtain via hybridization a genomic sequence or otherwise which is a mutation, insertion or deletion of the test sequence; and 3. mutating, inserting or deleting a test sequence in vitro or in vivo.
It is important to note that the mutational, insertional, and deletional variants generated from a given test sequence may be more or less efficient than the test sequences, in the sense that more or fewer genetic loci may become detectable, more or fewer alleles of a particular locus may become detectable, (c) more or less stable under stringent hybridization conditions, and any combination of the above.
Notwithstanding such differences in efficiency, these variants are within the scope of the present invention.
In another embodiment of the present invention, the useful test sequences described hereinabove are used for genetic analysis, used as probes), including but c 1_ WO 90/02818 PCT/US89/03473 19 not limited to analysis of genetic identity, relatedness or alteration. In one preferred embodiment, the method of genetic analysis comprises: digesting a DNA sample with a restriction endonuclease; separating the DNA restriction fragments according to size by electrophoresis; transferring the separated DNA in a state suitable for hybridization to a binding surface; hybridizing the transferred DNA with a useful test sequence labeled with a signal-generating moiety, and detecting the signals generated; whereby the pattern of signals generated provides information about the composition of the DNA sample.
This and other embodiments of the present invention involve the use of useful test sequences as probes in methods of genetic analysis.
EXAMPLES
Following are examples which illustrate procedures, including the best mode, for practicing the invention.
These examples should not be construed as limiting. All percentages are by weight and all 'solvent mixture proportions are by volume unless otherwise noted.
Example 1 Detection Of Genomic Sequences Which Hybridize With Oligomers Of A Consensus Sequence Of A VTR Human genomic DNA incompletely digested with the restriction endonuclease EcoRI was cloned into the bacteriophage lambda Charon 30. Restriction fragments ranging from 4.5 kbp to 17.5 kbp can be cloned into this vector. Gene 12: 301-309 (1980). About 5000 phage plaques were screened according to the method of Maniatis _l WO 90/02818 PCT/US89/03473 et al., supra, at page 321. Oligomers of CCCCCCGTGTCGCTGTT-3', the 17-base consensus sequence of VTR at the 3' end of the human alpha globin complex, with an average length of 200-300 bases, were used to screen the phage "lifts".
Example 2 Analysis Of Human Sequences In Recombinant Phages Whose DNA Hybridizes With Oligomers of VTR Consensus Sequence A number of recombinant phages whose DNA hybridized with oligo-5'-CCCCCCGTGTCGCTGTT-3' were analyzed. The results obtained with Phage 19 are presented below. DNA from Phage 19 was extracted and digested with the restriction endonuclease EcoRI. The digests were subjected to electrophoresis in an agarose gel. The electrophoretic pattern obtained is shown in Figure 1A. Lane M contained bacteriophage lambda HindIII fragments as molecular weight markers. Bands A and B are the arms of the cloning phage vector. Bands C and D are human genomic sequence inserts.
The DNA fragments in Figure 1A were transferred to a nylon membrane, and probed with CCCCCCGTGTCGCTGTT-3' according to the procedure oi Southern, supra. Figure 1B shows the results of the hybridization blot. The numbers on the side of the Figure state the molecular weight in kilobase pairs. Only the larger human insert band, Band C was detected. See Figure lB. Therefore, only Band C contained human DNA which is homologous with oligo-5'-CCCCCCGTGTCGCTGTT-3'.
Example 3 Human Genomic Sequence Inserts from Phage 19 As Polymorphic Probes DNAs from Bands C and D of Figure 1A, representing the human genomic sequence insert in Phage 19, were extracted separately. They were used separately in the WO 90/02818 PCT/US89/03473 21 Southern format to probe human genomic DNAs from 4 unrelated individuals.
The target human sequences were restriction fragments produced by digestion with one of the following restriction endonucleases: PstI, HinfI, HindIII, EcoRI, TaqI, MspI, PvuII, RsaI, and BstNI.
The following experiment and results were representative. 5 ug of PstI digested DNA from each of 4 individuals were electrophoresed and blotted onto a nylon membrane. Nylon membranes were prepared in duplicate.
Either 3 2 p]-labeled DNA from Bands C of Figure 1A, or 3 2 p]-labeled DNA from Band D of Figure 1A was mixed with excess unlabeled total human genomic DNA, and used to probe the target sequences on the membranes. Radioactive labeling was achieved by random 6-mer primed enzymatic synthesis, using radioactive precursors as substrates.
However, other methods of labeling would also work as well.
Total unlabeled human genomic DNA was ded as a precautionary measure. It was known that the human genome contains widely dispersed highly repetitive sequences such as the Alu sequences. If the human insert in Phage 19 contained these and/or similar highly repetitive sequences, such repetitive sequences would produce a heavy background signal over the entire area on the blot where human target DNA could be found. The introduction of total human genomic DNA would serve to suppress this background signal.
Sealey et al., supra.
The hybridization was carried out at 65°C in ,5X SSPE, 1-2% SDS (sodium dodecylsulfate), 0.5-1 mg/ml heparin. The blot was washed in 0.1X SSC, 2.5 mM sodium phosphate, 1% SDS at 65 C. [lX SSPE 0.16 M NaCl, 0.01 M sodium phosphate, and 1 mM ethylenediaminetetraacetic acid. iX SSC 0.15 M NaC1, 0.015 M sodium citrate.] Figure 2 shows the results of these hybridization blots. Band C DNA was used in Figure 2A. Band D DNA was L- WO 90/02818 PCT/US89/03473 22 used in Figure 2B. In Figures 2A, the lane marked M contained lambda HindIII fragments as molecular weight markers, and the remaining four lanes contained PstI digested DNA from unrelated individuals. The same preparation of human DNAs was used in Figure 2B.
An average of about 7-10 bands were detected in each individual when Band C DNA was used as probe. See Figure 2A. By contrast, Band D DNA could detect only a subset of the bands. See Figure 2B. The subset, indicated by the arrows, were present in all 4 unrelated individuals and would therefore appear to be non-polymorphic. The other bands were present in some but not all individual, and therefore, polymorphic. See Figure 2A. The fact that these polymorphic bands relate to multiple loci was confirmed in Example Band C human insert DNA was also able to detect multiple polymorphic loci when human genomic DNAs were digested with Hinfl, HindIII, EcoRI, TaqI, Mspl, PvuII, and Rsal. However, polymorphism was not detected when the human genomic DNA was digested with BstNI. Because genetic variations other than the VTR type are rare in man (about 0.001 per base pair), the probability that a non-VTR type polymorphism, a polymorphism caused by alterations in restriction sites in the neighborhood of the target sequence, would be revealed by at least seven restriction enzyme digestion is vanishingly small. Correlatively, the evidence is overwhelming that Band C DNA is detecting a VTR type polymorphism. The BstNI data is consistent with the hypothesis that the VTR being detected here has an internal BstNI site. The results of Example 6 as well as partial sequencing data support this hypothesis.
Therefore, Band C DNA, but not Band D DNA is useful as a multi-locus, polymorphic probe.
i; i WO 90/02818 PCT/US89/03473 23 Example 4 Cloning Of The Human "Polymorphic Probe" Sequence From Phage 19 DNA in Band C of Figure 1A, representing part of the human sequence insert in Phage 19 was extracted, and cloned into Bluescript, which is a trademarked cloning vector of Stratagene, San Diego, CA 92121. Bluescript is a derivative of M13 as well as of pBR322, and is about 3kbp long. The EcoRI insertion site was used. Several Bluescript recombinants containing Band C human DNA were obtained.
Two clones designated pAC329 and pAC344 were used to probe a panel of human genomic DNAs. DNA from 5 unrelated individuals were digested with PstI and two duplicate filters were prepared for the Southern procedure. The hybridization conditions were the same as described in Example 3. Figures 3A and 3B show the results obtained when the inserts of pAC329 and pAC344 were used as probes respectively. It is readily seen that the same pattern (of multiple bands) were obtained. However, the pAC344 insert is slightly longer, and more sensitive, produces a stronger signal under comparable conditions.
A third Bluescript recombinant was able to detect only non-polymorphic bands, i.e. the banding patterns are identical in several unrelated individuals. (Data not shown).
Band C appears to have more than one species of DNA sequences each approximately 5 kbp long. This is consistent with the preferred cloning size of the,- Charon vector (see Example 1 above).
This Example shows that both pAC329 and pAC344 are useful as multi-..ocus, polymorphic probes.
Example 5 Analysis Of Human DNAs In pAC329 And pAC344 pAC329 DNA was digested with EcoRI and the approximately 5 kbp human DNA insert was isolated. The WO 90/02818 PCT/US89/03473 24 same was done with pAC344 DNA. The purified inserts were further separately digested with RsaI, and the products of the restriction digest was electrophoresed in a 1% agarose gel. The electrophoretic pattern is shown in Figure 4.
Lane contained molecular weight markers (lambda HindIII and phi-x HaeIII fragments). Lane contained restriction fragments from the insert of pAC329; and lane contained those from the insert of pAC344.
DNA from each of the four marked bands shown in Figure 4 was isolated and used to probe unrelated, human genomic DNAs in the Southern format. The hybridization conditions were as described in Example 3, except that unlabelled human genomic DNA was not present in the probe mix. Relevant results of hybridization blots against PstI digested human genomic DNA are shown in Figure 5. The general conclusion is that DNA from each of Bands 1, 2, 3 and 4 are useful as multi-locus, polymorphic probes.
Figure 5A shows the hybridization results when DNA from Band 1 of Figure 4 was used as the probe. Under the above-indicated hybridization conditions, if the probe contained highly repetitive sequences such as the "Alu sequences", a background signal in addition to an_ specific bands might be produced, because the probe would have hybridized to homologous highly repetitive sequences which are widely dispersed throughout the human genome.
Such background signal is indeed indicated in Figure Figure 5B shows the hybridization results when DNA from Band 2 of Figure 4 was used as the probe. Otherwise, the target and the hybridization conditions were identical.
In contrast with Figure 5A, background signals which are indicative of the presence of highly repetitive sequences in the probe are not as evident here.
Similarly, Figures 5C and 5D show the respective results when DNA from Bands 3 and 4 were used as the
LL--
WO 90/02818 PCT/US89/03473 respective probes. Figure 5D shows rather less background signal than Figure This Example shows that the subsequences of pAC329 and pAC344 represented by Bands 2 and 4 are improvements over pAC329 and pAC344 as multi-locus, polymorphic probes.
DNA from Band the second largest restriction fragment from RsaI digestion of the human DNA insert in pAC344 was cloned into the SmaI insertion site of Bluescript by "blunt end cloning". The recombinant molecule has been designated pAC365.
Example 6 Analysis Of pAC365 A restriction map of pAC365 DNA was obtained using a standard method. The results of the mapping is shown in Figure 6. The human sequence can be excised from the recombinant with EcoRI and BamHI in the form of a 1.35 kbp (approx.) fragment. The human sequence is flanked by 7 nucleotides on one end and 11 on the other both from the polylinker region of Bluescript. See Figure 6.
The 1.35 kbp fragment was excised and used to probe PstI digested human genomic DNA identically prepared as in Example 5, and under identical conditions. The results of the hybridization is shown in Figure 7. A comparison of Figure 5D with Figure 7 shows that the signal patterns in the two blots were identical. This shows that the DNA sequence from Band 4 of Figure 4 was successfully cloned into pAC365.
The 1.35 kbp fragment was digested with BstNI and the restriction digest was electrophoresed in a 3% NuSieve agarose gel. [NuSieve is a trademarked product of FMC Corporation]. The electrophoretic pattern is shown in Figure 8. Lane M of Figure 8 contained phi-x HaeIII molecular weight markers. The BstNI digest of the 1.35 kbp fragment shows a broad band at about 70-100 bp (indicated by an arrow in the Figure) and a band at a higher molecular bj i WO 90/02818 PCT/US89/03473 26 weight of approximately 0.9 kbp. The sum of the molecular weights pertaining to these two bands is below 1.35 kbp.
Therefore, the results indicate that the 1.35kbp fragment comprises several copies of an approximately 70-100 bp VTR "core" sequence.
Example 7 Analysis Of The Subfragments Of The pAC365 Insert The 1.35 kbp human sequence cloned into pAC365 was excised from the plasmid along with 18 nucleotides from the polylinker region of Bluescript by digestion with EcoRI and BamHI. The excised fragment is further digested with one of the following enzymes: PstI, Xbal, Sau3A I, Hinfl, and NcoI, thus generating five subfragments that each had the EcoRI site of 1.35kbp fragment at one end.
Each of these five subfragments were tested for its utility as multi-locus, polymorphic probes. Each is hybridized to a replicate panel of PstI digested, unrelated, human genomic DNAs. The hybridization conditions were as described in Exam-ple 5. Parts of the results are shown in Figure 9. Figures 9A, and 9B show the results where EcoRI-PstI, and EcoRI-NcoI subfragments respectively, were used as the probe. It is readily seen that each of these subfragments generated the same pattern as the full pAC365 insert. Compare Figure 7 with Figures 9A, and 9B. Furthermore, the other subfragments also yielded the identical banding pattern. Hence, only the results for the largest (EcoRI-PstI) and the smallest (EcoRI-Ncol) subfragments are shown.
Finally, the DNA in the 100 bp area of Figure 8 was used as probe, and was able to generate an identical pattern on a replicate panel of target human DNA. This is further evidence that pAC365 contains a VTR sequence.
This Example shows that each of these subfragments are useful multi-locus, polymorphic bands.
WO 90/02818 PCT/US89/03473 27 Example 8 Hybrid Formation And Stability Under High and Low Stringency Conditions The 1.35 kbp insert of pAC365 was used to probe a panel of PstI digested human genomic DNA. The results are shown in Figure 10. Figure 10A shows a Southern blot where hybrid formation took place at 50 C, in 5X SSPE, and the blot was washed at 65°C in 2X SSC. Figure 10B shows a duplicate blot except it was washed at 65 C in 0.1X SSC.
No unlabeled total human genomic DNA was added to the probe mix.
It is believed that the probe contained a small amount of Alu-like sequence, and this accounts for the background signal. More significantly, the background is dramatically lower in Figure 10B than in Figure 10A. Most importantly, the specific signals, the polymorphic bands are strong and unambiguous when the blot was washed under high stringency conditions.
When hybrid formation and washing take place under more stringent conditions, background suppression is further improved. Figure 10C shows a blot where hybrids were formed at 65 C in 5X SSPE, and where the blot was washed at 65 C in 0.1X SSC. The hybrids were actually stable at the even more stringent conditions: 0.01X SSC at These results can be juxtaposed with those obtained using pAC256 as a probe. pAC256 is a Bluescript recombinant containing a human insert sequence which contains a Variable Tandem Repeat Sequence. See McClain et al., Am. J. of Human Genetics 41(Suppl.):A259 (1987).
pAC256 behaves in conformity with prior art expectation.
Prior art teaches that even though VTR probes can detect many loci at low stringency, the probes can only identify a single locus at high stringency. See, for example, Nakamura et al. (1987), Science 235: 1616, at 1618.
WO 90/02818 PCT/US89/03473 28 Figure 11 illustrates just this point. Genomic human DNA were extracted, digested with PstI, and hybridized in the Southern format to pAC256 human insert at 65°C in SSPE. Figure 11A shows the results when the blot was washed at 65*C in 2X SSC. Figure 11B shows the results when the blot was instead washed at 65°C in 0.1X SSC. It is readily seen that many of the bands detected under the less stringent conditions were unstable under the more stringent conditions and were washed away. The fact that no more than two bands were seen in each of the lanes containing target human DNA is consistent with the notion that at most a single locus the locus from which the probe sequence originated can be detected.
Example 9 Equivalency Of Various DNA Sequences As Polymorphic Probes The following polynucleotide sequences were tested for differences, if any, in terms of the ability to detect polymorphism at multiple loci: DNA contained in Band C of Figure 1A ("Band C DNA"); 19-MSP (defined below); the insert of pAC365; and the 5 subfragments described in Example 7.
19-MSP was obtained from "Band C DNA" by digesting the latter with MspI. "Band C DNA" appears to contain more than one species of DNA sequences of about 5 kbp. One species is cut into several smaller pieces by MspI. This species does not detect polymorphism. The other species is also cut by Mspl, but the larger digestion product is only slightly smaller than 5 kbp. This approximately 5 kbp product is designated 19-MSP.
19-MSP and "Band C DNA" detected the same polymorphic bands when tested against genomic DNA from more than twenty genetically unrelated individuals. "Band C DNA" was able to detect, in addition to these polymorphic bands, several non-polymorphic bands. These results are illustrated in WO 90/02818 PCT/US89/03473 29 Figure 12. Figures 12A and 12B show Southern blots probed respectively with "Band C DNA" and 19-MSP, but are otherwise prepared in duplicate. Each blot contained PstI digested genomic DNA from genetically unrelated individuals. When one of test lanes, for example, the rightmost test lane in Figure 12A is compared with its counterpart, the rightmost test lane of Figure 12B, it is seen that the former contain extra bands which are marked with arrows. However, these are non-polymorphic bands, as evidenced by the appearance of the same sized bands in all other test lanes in Figure 12A. These results show that "Band C DNA" and 19-MSP recognize the same DNA polymorphisms.
19-MSP and the insert of pAC365 detected the same polymorphic bands when tested against genomic DNA from 107 genetically unrelated individuals. These results show that the insert of pAC365 and 19-MSP are equivalent probes.
Similarly, each of the five subfragments described in Example 7 have been shown to be equivalent to pAC365.
(Representative data shown in Figures 7 and 9).
Example 10 pAC365 Detects Multiple Loci Which Segregate Independently In the Mendelian Fashion Genomic DNAs were extracted from individuals belonging to families spanning three generations, digested with PstI, and probed with pAC365 insert in the Southern Format.
Figure 13 shows the results of one such family study. The family tree at the top of the figure indicate the source of DNA in each of the test lanes. The lanes marked (m) contained molecular weight markers.
A total of 27 bands of varying sizes were detected by hybridization with pAC365. These are partially numbered on the side of the figure for identification. Table 1 shows the phenotype of each individual.
C
WO 90/02818 PCT/US89/03473 Several conclusions can be drawn from these results.
First, each and every band which is present in any one of the eight children is also present in either the father or the mother. This result is consistent with stable chromosomal inheritance. Similarly, every band which is present in the father is present in either the paternal grandfather or grand-mother.
Second, the results are consistent with independent Mendelian segregation of alleles present on sister chromosomes. For example, Bands 1 and 4 appear to be two alleles to the same gene. The mother has both of these alleles, presumably one on each sister chromosome.
However, each of the eight children inherits one or the other allele, but never both. Thus, the mode of inheritance is consistent with independent Mendelian segregation. Similarly, Bands 10 and 17 in the father segregate in a manner which is consistent with the notion that they are two alleles on the same gene.
r j WO 90/02818 Pcr/US89/03473 31 Table 1. Inheritance And Segregation Of Allele Within A Family* Earni±ly Member 1 2 3 4 5 6 7 8 Bands 1 2 3 4 6 7 8 9 11 12 13 14 +i 16 17 18 19 21 22 23 24 26 27 *No data from maternal grandmother.
WO 90/02818 PCT/US89/03473 32 Third, the results are consistent with the notion that Band 1 is not linked to any of Band 7, 9, 13 or 19.
These bands are all present in the mother. However, child #8 inherited Band 1, but not Band 7 from the mother.
Therefore, Band 1 is not linked to Band 7. Child #1 inherited Bands 9, 13 and 19, but not Band 1 from the mother. Therefore, Band 1 is not linked to Band 9, 13 or 19.
Similar analyses lead to the conclusion that Band 7 is not linked to any of the other Bands present exclusively in the mother, nor Band 9, nor Band 13 and 19. Bands 1, 7, 9, 13 and 19 represent five genetic loci.
Example 11 Population Genetics and Allele Frequency DNAs from 423 genetically unrelated individuals were tested in this study. Each DNA sample was digested with PstI and probed in a Southern hybridization procedure. For 127 samples, the insert from pAC365 was used as the probe.
For the other 296 samples, 19-MSP was used as the probe.
However, it had been established that the two probes recognize the same DNA polymorphisms. Therefore, pooling of results obtained by using these probes is valid. Th.
pooled results have been sorted according to their ethnic origins American Blacks, Caucasoids, and Hispanics). The frequency v. allele size distributions are shown in Figure 14. Figures 14A, 14B, and 14C show the distributions in American Blacks, Caucasoids, and Hispanics respectively. The y-axis is measured in per cent, and the x-axis is measured in kilobase pairs. The frequency distributions are more fully set forth in Table 2.
fj
I
WO 90/02818 PCT/US89/03473 Table 2. Allele Frequencies For Three Racial Groups** Fragment Size (kilobases)*** 2.25 2.30 2.35 2.40 2.45 2.50 2.55 2.60 2.65 Blacks 42 Caucasoids 33 Hispanics 2.70 2.75 2.80 2.85 2.90 2.95 3.00 3.05 Blacks 13 Caucasoids 11 Hispanics 19 3.10 3.15 3.20 3.25 3.30 3.35 3.40 3.45 Blacks 1 Caucasoids 1 Hispanics 2 3.50 3.55 3.60 3.65 3.70 3.75 3.80 3.85 Blacks 5 26 Caucasoids 2 Hispanics 3 6 3.90 3.95 4.00 4.05 4.10 4.15 4.20 4.25 4.3 Blacks 17 Caucasoids 23 Hispanics 26 4.35 4.40 4.45 4.50 4.55 4.60 4.65 4.70 4.75 Blacks 3 Caucasoids 8 Hispanics 12 10 7 8 11 6 14 4.80 4.85 4.90 4.95 5.00 5.05 5.10 5.15 5.20 Blacks Caucasoids 8 Hispanics 7 WO 90/02818 WO 90/2818 cri US89/03473 34 5.25 5.30 5.35 5.40 5.45 5.50 5.55 5.60 5.65 Blacks Caucasoids Hispanics 5.70 5.75 5.80 5.85 5.90 5.95 6.0 6.1 6.2 Blacks Caucasoids Hispanics 6.3 6.4 6.5 6.6 6.7 6.8 6.9 7.0 7.1 Blacks Caucasoids Hispanics 3 4 11 12 12 9 6 6 7.2 7.3 7.4 7.5 7.6 7.7 7.8 7.9 Blacks Caucasoids His panics 8.1 8.2 8.3 8.4 8.5 8.6 8.7 8.8 8.9 Blacks Caucasoids Hispanics 9.1. 9.2 9.3 9.4 9.5 9.6 9.7 9.8 Bl1ack s Caucasoids Hispanics 9.9 10 10.2 10.4 10.6 10.8 11.0 11.2 11.4 Blacks Cau cas oid s Hispanics Ii i.
WO 90/02818 PCT/US89/03473 11.6 11.8 12.0 12.2 12.4 12.6 12.8 13.0 13.2 Blacks Caucasoids Hispanics 13.4 13.6 13.8 14.0 14.2 14.4 14.6 14.8 15.0 Blacks Caucasoids Hispanics 15.2 15.4 15.6 15.8 16.0 16.2 16.4 16.6 16.8 Blacks 3 0 1 1 2 0 1 0 0 Caucasoids 1 0 0 0 0 1 0 0 1 Hispanics 1 1 0 0 2 0 0 0 0 17.0 17.2 17.4 17.6 17.8 18.0 18.2 18.4 Blacks Caucasoids 0 Hispanics 2 18.6 18.8 19.0 19.2 19.4 19.6 19.8 20.0 Blacks 0 Caucasoids 0 Hispanics 0 Allele frequencies are stated in per cent in Table 2.
The standard error for fragment size is approximately 0.6% of the size of the fragment. Therefore, DNA fragments whose sizes are within 2% of each other (3 standard deviations) are considered indistinguishable.
WO 90/02818 PCT/US89/03473 36 Example 12 Characterization of pAC365 Genomic DNA from various human cell lines were extracted, digested with PstI, and hybridized with pAC365 insert in the Southern format. 7024, 7351, 7047, 7432, 7433 and 7015 were obtained from Centre d'Etude du Polymorphisme Humain in France. 1202 was obtained from the National Institute of General Medical Sciences (NIGMS) Human Genetic Cell Repository (Catalog Number 1202B). It is a lymphoblast cell line with 49 chromosomes (XXXXY).
CEM and Jurket are T lymphoblastoid cell lines. K562 is a erythroleukemia cell line and HL60 is a promyelocytic cell line. CEM, K562, and HL60 can be obtained from the American Type Culture Collection ("ATCC") under ATCC catalog numbers CCL119, CCL243, and CCL240, respectively.
The bands detected in the Southern blot are set forth in Table 3 below. For example, pAC365 insert detected 9 bands ranging from 7.1 kilobase pairs to 2.2 kilobase pairs when hybridized with PstI digested K562 cell DNA. The banding pattern obtained for each cell line is unique.
Therefore, when used for probe purposes polynucleotide sequences can be characterized, or "fingerprinted" by the banding pattern with known target DNA.
WO 90/02818 PCT/US89/03473 37 Table 3. "Fingerprint" of pAC365 7024 7351 CEM 1202 HL60 Jur. 7047 7432 7433 7015 K562 6.3# 6.5 20 11 10.7 10.6 9.6 9.7 9.3 6.5 7.1 5.3 5.6 7.7 6.3 6.6 5.2 9.2 9.3 5.7 5.7 6.2 4.9 5.3 6.1 5.8 5.0 5.1 7.1 6.4 5.6 5.4 4.4 4.4 5.0 5.0 4.6 4.6 4.9 5.7 5.7 5.1 4.9 4.2 4.2 4.9 4.6 4.2 4.5 4.7 5.4 5.4 4.9 4.4 4.1 3.7 4.2 4.1 2.6 4.3 4.3 5.2 5.2 4.2 4.1 3.7 3.6 4.1 4.0 2.3 3.8 4.2 4.9 5.1 3.8 3.6 3.6 2.6 3.8 3.8 2.2 3.5 4.0 4.7 4.7 3.7 2.6 2.6 2.3 2.6 3.6 2.6 2.6 4.2 4.6 2.6 2.2 2.2 2.2 2.2 2.3 2.2 2.3 3.9 3.6 2.2 2.2 2.2 3.7 2.6 2.7 2.2 2.2 Sizes of the fragments detected are stated in kbp in Table 3.
Example 13 Paternity Testing and Forensic Testing Genomic DNAs were extracted from a child, the mother of the child and the alleged father. The DNAs were digested with PstI, electrophoresed and transferred for Southern hybridization. The DNA targets were probed with 19-MSP. Figure 15A shows the results of the blot. Lanes labeled contained molecular weight markers. Lanes and contained DNA from the mother, the child, and the alleged father of the child respectively. Lane (d) contained a mixture of the child's DNA and the alleged father's DNA. The last lane often helps to resolve ambiguity whenever a band detected in the child's lane is close in size to a band detected in the alleged father's lane. In such a case, the presence of a singlet band in the relevant size region in the "child plus alleged father" WO 90/02818 PCT/US89/03473 38 lane would tend to indicate a common allele; and a doublet band would indicate distinct alleles.
Figure 15A shows that at least 5 bands in lane (b) were not inherited from the mother because they are not present in lane However, each of these bands (marked with arrows) are present in lane Therefore, the evidence supports the theory that the alleged father is indeed the biological father.
DNAs were extracted from a rape victim, semen found on the victim and from a suspect of the crime. The DNAs were digested with PstI, and subjected to the Southern hybridization procedure. 19-MSP was used as a probe.
Figure 15B shows the results of the hybridization blot.
Lane contained DNA from the victim. Lane and (c) contained DNA from semen found on the victim, and from a suspect, respectively. At least 8 bands (marked with arrows) in lane do not match the bands in lane clearly indicating that these bands did not arise from cells of the victim which some-how contaminated the semen sample. However, all 8 bands matched bands of the same sizes in lane Therefore, the evidence that the semen came from the suspect was exceedingly strong.
Example 14 Relationships Among The Polymorphic Probes Of The Invention Figure 16 shows the relationships among the various DNA sequences of the present invention, which are useful as polymorphic probes.
DEPOSIT OF MICROORGANISM Many polynucleotide sequences may be used to practice the present invention. Exemplary of such sequences are human genomic sequences which have been cloned into recombinant plasmids designated pAC329, pAC344, and pAC365.
L
WO 90/02818 PCT/US89/03473 39 Figure 16 shows the relationships among the cloned sequences of this invention.
An E. coli strain HB101 carrying the plasmid pAC329, an E. coli strain HB101 carrying the plasmid pAC344, and an E. coli strain HB101 carrying the pAC365 plasmid have been deposited with the Agricultural Research Culture Collection (NRRL), Peoria, IL, on September 2, 1988, and have been assigned accession numbers NRRL B-18403, NRRL B- 18404, and NRRL B-18405, respectively.
The subject cultures have been deposited under conditions that assure that access to the cultures will be available during the pendency of this patent application to one determined by the Commissioner of Patents and Trademarks to be entitled thereto under 37 CFR 1.14 and U.S.C. 122. The deposits are available as required by foreign patent laws in countries wherein counterparts of the subject application, or its progeny, are filed.
However, it should be understood that the availability of a deposit does not constitute a license to practice the subject invention in derogation of patent rights granted by governmental action.
Further, the subject culture deposits will be stored and made available to the public in accord with the provisions of the Budapest Treaty for the Deposit of Microorganisms, they will be stored .with all the care necessary to keep them viable and uncontaminated for a period of at least five years after the most recent request for the furnishing of a sample of the deposit, and in any case, for a period of at least thirty (30) years after the date of deposit or for the enforceable life of any patent which may issue disclosing the cultures. The depositor acknowledges the duty to replace the deposits should the depository be unable to furnish a sample when requested, due to the condition of the deposits. All restrictions on the availability to the public of the subject culture WO 90/02818 PCT/US89/03473 deposits will be irrevocably removed upon the granting of a patent dislcosing them.
E. coli HB101 is available from the NRRL repository where its accession number is NRRL B-11371. Plasmids can be isolated from the E. coli host by use of standard procedures, using cleared lysate-isopycnic density gradient procedures, and the like.
The present invention is not to be limited in scope by the microorganisms deposited, since the deposited embodiment is intended as a single illustration of one aspect of the invention. Many variations of this invention as herein set forth may be made without departing from the spirit and scope thereof. The specific embodiments described are given by way of example only, and the invention is limited only by the terms of the appended claims.
LL-~ i -7 WO 90/02818 P CT/US89/03473 :~~t~ONAL~lR ifeccd~es Corrcration Saw IMilll River Poad Vaihall, NY 10595 FLWE:?-r 11; CASE OF1' N ORC:NXL 0r'05z7 ±zsued pur-suant to Rulea 7.1 by =e x~Z7MXEu(ATNXL DElaS=7AR AUrllORI= Idrn=!Iad Z.e bornm of page NAIfM XND XDDRESS OF orPOS=7Rro Idenm.,.!icmmtlcn re~erenca q~.ver by~ tne Atennnmon numoer g-..vcn ny z. ie Escherichia coli HB1O1 (pAC3 29) NRRL B-18403 z. 5ci rl DqC-MrPZON AND/OR PROPOSED =LONOMIC 0ZS=MZAON =.c~toganllim ldcnm-.,:iad under 1 above we-% accor~anted by: a scient=lI~c dacp.:Ln a proposed r-txonoo±Lc designtion (~Axr. t~a croms where 2a-p.cable) Mmis Ifltn--nLriola± DdpomiTAX7A'thr acrep-s this miorcan-tx idnnmitIcd under I &bove, wnl.~h was received by lz onSep-. 2, 1988 Cdar O~ ct t-e =3.q3.nal deposir4l TV. ffTEIXT:[OHLM DECSZJCI aC~R e:Agriculrural Research Cailtre c: CS) so (s I h~i-c t re pos-e: Callect±on CNRRL) to reru.nrre .IntatnArzonai. Incer-macional Dejos±=ParV xuu-hcrxiy y or a, &u£srorzeota1 f±es:18153 N. Unve~si7 Scree= Peoriz, 12..1±nois 6160A~ U.S.A. D&re! Where Rule 6.4 Cdl a??1-4es, zu-% is mhe dazz n Olwhb.=Zh the S=2m. L zrnar.onal dr~s=was woe~w a deczL=. =ad& aUrz.a Bt::aoe~z mrv L±-r Mne ct zne LmR~so I.za 2.rnal± dnmcrz1~ Zy .L5 =ve=-ed m. a cS005..m =dr= sucz 1-s m-e damze =O.0~re sas:vdn l 3'/4 a:Re) SUBSTITUTE SHEET jc A/FpP WO 90/02818 P CTIUS89/03473 ~~x~:ON or. Na n C- TLi feccdes Cr~a.f Saw~. MIj -1Rver Read Valhal-l, IYY 10595 :N THE OFs M? AU o uart t ue7.1. by ta der at= bottom ce -aq or DE'oS::7' :~Tc:Nor =M e- 7tZO Rc:SM Mdan=:t!-c=. of rreeencir given by the Artessian nur-nocr qtvan by tne Escherichia coli HB1O1 (pAC3 44) NRRL B-184 0 4 moIcrorgan-' -£danz.LI~d under anove wa~s dcc mgnicd by-, a asclnt.L!Ir drpion ~Ja proposed taxonorric deiqnzsoon (Kae-k writh a cr0.8 where a=rPicahie) ==FT AnrD Ac~p,;Nr- I ntexnnatonai. D400fitary AUthOt± t accepts the o-Icorqanisn idanti :ied under I aaove, which was received by .it onSe.p-.
2 19d 8 (date of t-he *riqinal deposit) 1 IV. INERkNA'IONL DE2os=:ARY AtrTUORr-Y :-.am.ze Agricu~ltural Research Cult:ure Siqrnature(s) at =crson(s) having tne pouco: Collection (NRRL) to represent the nternationai. Deposit -:y InCernational Depositar-7 Auchority Authority or ot a ~hrizad of!Lcial(s): )Addxe,..- 1815 1N. jnivergity Street Peoria, ILl-inois 61604 U.S.A. Date: I Where Rule 6 .4 W ad) i s such dare is the daze an wh c e 3= r s of i e na: n l de s t ry autnority was acquired; where a deposit made outsIde the Budapest Treaty after the ac-qiist-.on of t-hQ Status Of interniational depositary aut-hority is converzed into a devosit under tne Budapesz Treaty, such date is the date an which the =~croarganism as rwcc*u b n inte-laational depositary aur-hority. vdbt.
FOrM BP/4 (sole page) SUBSTITUTE S0HI-E WO 90/02818 PCT/US89/03473 3=AES-- ON 2=.x R--CG~tj':ON or THEf DE2CS:- OF FOR THE PURPOSE-S OF PEi ?RO=U.E ICZO-TNX=ON FORM4 Dr. Nancy Ip Uifeccdes cor-rraticn Saw Mi11 River Poad Valhall, NY 10595 RE=E?T 1N 14 C0SE Or AN ORIC:iAr. DLOrrv- Isnsued purs3uant to Rule 7.1 by trie rERNAI~ NX.DEOS11-R tmY- AuHoRI7 ldent~.±iod Am =no botm o =3Page A.80 XODRESS OF I. DE ~cAONor ac-po~o=cNIsm Identilicarion re~erenca given by tne )XCCezsiof num±,r given by tne DZ CS=OR' ~n1-'!ADOkL DEVOSI:ARY AW-, ORn: Escherichia coli HB102. (PAC3 65 N1RRL B-18405 r.SC==jijrxC DESC:U?=OH AND/OR PPOnPOSED T;XcoNs-C ors1C7xrToH T'le =ntr=crqani.sm ident-'Ia~d under I &bove8 was acc--cpanied by: Sa scienmlfic cdescr±pt~Icn W v a proposed taxonomtc designation (Mark vitLri cross where a-pplcable) REPr7 AIM ACEp~jkN= This lnrarnat~ionaJ. Depositary Autharit a c ots the micoarazanis= ident: !Icd under: I &nove, whrich was received try it onSep- .2 ,19 5 (dlacte ol the original dCVoritjl~ TV. f1=ZH.NXEIoNAL Dr~OsiV.RY ADnoru= AgrictiJLtural Research CulureSignatue i )oa -,erson(s) having t~e power CaI-lectoa (.NFRL) to represent the craionaJ. Depsitz~y Immet-atioia1 Deposizary Auchorizy Autflociy ar C31 Au oMC):=cd ±ca() Ad.. sia 11 N. uni'versiry Screet Peoria, Ill~inois 61604 U.S.A. DtZLt'P 1Where Rule 6.4td) aps2.±.s, iruch data is theu date an- whiU.m the zzatus o! Inzer-azicnAl deposizazry aiutnc=-.ty was ac-hd;w.e a depoit made out.s:.de t±le ludape3t ::saty Ll~mr =c AnIts-cO-= s=3Lmta.- dazosi'zzrY aut~cr,-:v .s cnn-ven~ed !nto a decosit under tfle nud"L.sm suc datai Is the dame c-1 w.nn e~cax a eevd~ ~~at~-~was reerve: tat autc:t SUBSTITUTE SHEET C IA 1;=p

Claims (26)

1. A polynucleotide sequence capable of forming hybrids with genomic Variable Tandem Repeat DNA sequences at multiple, polymorphic genetic loci of a eukaryote under hybridization conditions wherein said hybrids are stable under high stringency conditions.
2. The polynucleotide sequence of claim 1 wherein eukaryote is a mammal.
3. The polynucleotide sequence of claim 2 wherein the mammal is a human.
4. A polynucleotide sequence represented by the human DNA insert in pAC329. A polynucleotide sequence represented by the human DNA insert in pAC344.
6. A polynucleotide sequence represented by the restriction fragment of about 1.35 kilobase pairs produced by digestion of pAC365 with EcoRI and BamHI.
7. A polynucleotide sequence represented by the restriction fragment of about 0.55 kilobase pairs produced by digestion of pAC365 with EcoRI and N;:oI.
8. A discrete polynucleotide subsequence of the polynucleotide sequence of any of the claims 4 to 7, wherein said subsequence is capable o of forming hybrids with genomic DNA at one or more polymorphic genetic loci of a eukaryote, and mutational, insertional or deletional variants j which form stable hybrids under high stringency conditions. amg/1391v WO 90/02818 PCT/US89/03473 42 1 9. The discrete polynucleotide subsequence of claim 8 2 wherein the eukaryote is a human. 1 10. A polynucleotide sequence capable of forming a 2 hybrid with the polynucleotide sequence of any of the 3 claims 4 to 7, wherein the hybrid is stable under high 4 stringency conditions. 1 11. A polynucleotide sequence capable of forming a 2 hybrid with the discrete polynucleotide subsequence or 3 with its variants of claim 8, wherein the hybrid is stable 4 under high stringency conditions. 1 12. A polynucleotide sequence capable of forming 2 hybrids with genomic DNA fragments, produced by complete 3 digestion of Caucasoids', American Blacks' and Hispanics' 4 genomic DNAs with the restriction endonuclease PstI, of approximate lengths and approximate allele frequencies as 6 described in Table 2. 1 13. A polynucleotide sequence useful for detecting 2 polymorphism at multiple genetic loci, and characterized by 3 its ability to form hybrids with restriction fragments of 4 DNA, of 7.1, 6.2, 4.4, 4.2, 4.1, 3.7, 3.6, 2.6 and 2.2 kilobases, produced by PstI digestion of genomic DNA 7 extracted from K562 cells. 1 14. A recombinant vector comprising the polynucleotide 2 sequence of any of the claims 4 to 7. 1 15. A recombinant vector comprising the polynucleotide 2 subsequence of claim 8. 1 16. A recombinant vector comprising the polynucleotide 2 sequence of claim WO 90/02818 PCT/US89/03473
17. A recombinant vector comprising sequence of claim 11. the polynucleotide
18. sequence A recombinant vector comprising of claim 12 or 13. the polynucleotide
19. A cell containing the recombinant vector of claim A cell containing the recombinant vector of claim
21. A cell containing the recombinant vector of claim 16. 17.
22. A cell containing the recombinant vector of claim
23. A cell containing the recombinant vector of claim
24. A method for obtaining the polynucleotide sequence 18. of claim 1 comprising: hybridizing a library of genomic sequences of an organism under hybridization conditions with a probe comprising a natural or consensus sequence of a Variable Tandem Repeat; selecting library sequences of said library which hybridize with said probe; testing and further selecting from the library sequences selected in step a library sequence which is capable of detecting Variable Tandem Repeat Length polymorphism at multiple -44- genetic loci of said organism or a different organism under high stringency conditions. A method for obtaining the Polynucleotide sequence of claim 1 comprising: hybridizing a library of genomic sequences of an organism under hybridization conditions with a probe comprising a natural or consensus sequence of a Variable Tandem Repeat; selecting library sequences of said library which hybridize with said probe; producing subsequences of said selected library sequences by enzymatic means; testing and selecting from the subsequences produced in step a subsequence which is capable of detecting Variable Tandem Repeat Length polymorphism at multiple genetic loci of said organism or a different organism under high stringency conditions.
26. A polynucleotide probe comprising the polynucleotide sequence of any of the cla.ms 4 to 7, or mutational, insertional, or deletional variants which form stable hybrids under high stringency conditions.
27. A polynucleotide probe comprising the discrete polynucleotide subsequence of claim 8, or mutational, insertional, or deletional variants which form stable hybrids under high stringency conditions.
28. A polynucleotide probe comprising the discrete polynucleotide subsequence of claim 10, or mutational, insertional, or deletional variants which form stable hybrids under high stringency conditions.
29. A polynucleotide probe comprising the polynucleotide sequence of claim 11, or mutational, insertional, or deletional variants which form stable hybrids under high stringency conditions. A polynucleotide probe comprising the polynucleotide sequence of claim 12 or 13, or mutational, insertional, or deletional variants which form stable hybrids under high stringency conditions.
31.. A method of genetic analysis comprising: digesting a DNA sample with a restriction endonuclease; separating the DNA restriction fragments according to size by electrophoresis; transferring the separated DNA to a binding surface; hybridizing the transferred DNA with polynucleotide probe labeled with a signal-generating moiety, wherein the polynucleotide probe I amg/1391v is the polynucleotide probe of claim 26; and detecting the signals generated; whereby the pattern of signals generated provides information about the composition of the DNA sample.
32. A method of genetic analysis comprising: digesting a DNA sample with a restriction endonuclease; separating the DNA restriction fragments according to size by electrophoresis; transferring the separated DNA to a binding surface; hybridizing the transferred DNA with polynucleotide probe labeled with a signal-generating moiety, wherein the polynucleotide probe is the polynucleotide probe of claim 27; and detecting the signals generated; whereby the o o c, 0 a 1 n o 1 o ca o L:, co c es a amg/1391v WO 90/02818 PCT/US89/03473 46 14 pattern of signals generated provides information about the composition of the DNA sample. 1 33. A method of genetic analysis comprising: 2 digesting a DNA sample with a restriction 3 endonuclease; 4 separating the DNA restriction fragments according to size by electrophoresis; 6 transferring the separated DNA to a binding 7 surface; 8 hybridizing the transferred DNA with a 9 polynucleotide probe labeled with a signal- generating moiety, wherein the polynucleotide 11 probe is the polynucleotide probe of claim 28; 12 and 13 detecting the signals generated; whereby the 14 pattern of signals generated provides information about the composition of the DNA sample. 1 34. A method of genetic analysis comprising: 2 digesting a DNA sample with a restriction 3 endonuclease; 4 separating the DNA restriction fragments according to size by electro-ioresis; 6 transferring the separated DNA to a binding 7 surface; 8 hybridizing the transferred DNA with a 9 polynucleotide probe labeled with a signal- generating moiety, wherein the polynucleotide 11 probe is the polynucleotide probe of claim 29; 12 and 13 detecting the signals generated; whereby the 14 pattern of signals generated provides information about the composition of the DNA sample. -47- A method of genetic analysis comprising: digesting a DNA sample with a restriction endonuclease; separating the DNA restriction fragments according to size by electrophoresis; transferrin the separated DNA to a binding surface; hybridizing the transferred DNA with a polynucleotide probe labeled with a signal-generating moiety, wherein the polynucleotide probe is the polynucleotide probe of claim 30; and detecting the signals generated; whereby the pattern of signals generated provides information about the composition of the DNA sample.
36. A polynucleotide sequence capable of forming hybrids with genomic Variable Tandem Repeat DNA sequences at multiple, polymorphic genetic loci of a eukaryote under hyrbridization conditions which polynucleotide sequence is substantially as herein described with reference to any one of Examples 1 to 14 and any one of Figures 1A to 12B.
37. A polynucleotide sequence useful for detecting polymorphism at multiple genetic loci substantially as herein described with reference to any one of Examples 1 to 14 and any one of Figures 1A to 12B.
38. A recombinant vector comprising the polynucleotide sequence of 'claims 36 or 37.
39. A cell containing the recombinant vector of claim 38. A method for obtaining the polynucleotide sequence of claim 36 which method is substantially as herein described with reference to any one of Examples 1 to 4.
41. A polynucleotide probe substantially as herein described with reference to any one of Examples 5 to
42. A method of genetic analysis which method is substantially as herein described with reference to Examples 10 or 11. DATED this SEVENTH day of SEPTEMBER 1992 Lifecodes Corporation Patent Attorneys for the Applicant SPRUSON FERGUSON amg/1391v
AU42220/89A 1988-09-08 1989-08-14 Probes for the detection of rflp in eucaryotic genomes Ceased AU630888B2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US24163988A 1988-09-08 1988-09-08
US241639 1988-09-08

Publications (2)

Publication Number Publication Date
AU4222089A AU4222089A (en) 1990-04-02
AU630888B2 true AU630888B2 (en) 1992-11-12

Family

ID=22911543

Family Applications (1)

Application Number Title Priority Date Filing Date
AU42220/89A Ceased AU630888B2 (en) 1988-09-08 1989-08-14 Probes for the detection of rflp in eucaryotic genomes

Country Status (3)

Country Link
EP (1) EP0433367A1 (en)
AU (1) AU630888B2 (en)
WO (1) WO1990002818A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5192658A (en) * 1989-03-30 1993-03-09 Lifecodes Corporation Compositions and protocols applicable to genetic analysis
US5851762A (en) * 1990-07-11 1998-12-22 Gene Type Ag Genomic mapping method by direct haplotyping using intron sequence analysis
EP0570371B1 (en) * 1990-07-11 1999-10-06 Genetype Ag Genomic mapping method by direct haplotyping using intron sequence analysis
DE69210588T2 (en) * 1991-02-05 1996-09-19 Lifecodes Corp Molecular genetic identification using probes that recognize polymorphic sites

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0238329A2 (en) * 1986-03-19 1987-09-23 Zeneca Limited Improvements in genetic probes
AU581582B2 (en) * 1984-11-12 1989-02-23 Lister Institute Of Preventive Medicine Dna probes to fingerprint genomes at hypervariable or minisatellite regions

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS62111699A (en) * 1985-08-05 1987-05-22 コラボラテイブ・リサ−チ・インコ−ポレ−テツド Determination of gene form due to pleomorphism of limit fragment length

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU581582B2 (en) * 1984-11-12 1989-02-23 Lister Institute Of Preventive Medicine Dna probes to fingerprint genomes at hypervariable or minisatellite regions
EP0238329A2 (en) * 1986-03-19 1987-09-23 Zeneca Limited Improvements in genetic probes

Also Published As

Publication number Publication date
AU4222089A (en) 1990-04-02
WO1990002818A1 (en) 1990-03-22
EP0433367A1 (en) 1991-06-26

Similar Documents

Publication Publication Date Title
US5206137A (en) Compositions and methods useful for genetic analysis
US5514547A (en) Molecular genetic identification using probes that recognize polymorphic loci
Mowatt et al. DNA sequence of the murine gamma 1 switch segment reveals novel structural elements.
EP0164054B1 (en) Method for detection of polymorphic restriction sites and nucleic acid sequences
JP3535159B2 (en) Selective approach to DNA analysis
AU637768B2 (en) Genetic identification employing dna probes of variable number tandem repeat loci
JPH08500723A (en) Genome improper scanning
US4963663A (en) Genetic identification employing DNA probes of variable number tandem repeat loci
JP2777170B2 (en) Method for selecting chromosome-specific repetitive DNA sequences using recombinant DNA clones
US6709827B2 (en) Mismatch repair detection
Iwasaki et al. A minisatellite and a microsatellite polymorphism within 1.5 kb at the human muscle glycogen phosphorylase (PYGM) locus can be amplified by PCR and have combined informativeness of PIC 0.95
JP2002537855A (en) Compositions and methods for genetic analysis
EP0175689B1 (en) Method of detecting nucleic acid sequences
Shalit et al. Physical analysis of the CYC1-sup4 interval in Saccharomyces cerevisiae
AU630888B2 (en) Probes for the detection of rflp in eucaryotic genomes
CA1313633C (en) Purification of polymorphic components of complex genomes
US5192658A (en) Compositions and protocols applicable to genetic analysis
Faham et al. A novel in vivo method to detect DNA sequence variation.
AU780056B2 (en) Mismatch repair detection
Koehler et al. Cytogenetics of the genus Leporinus (Pisces, Anostomidae). II. Molecular cytogenetics, organization and evolutionary conservation of a chromosome-specific satellite DNA from Leporinus obtusidens
Hunter et al. Toward the construction of integrated physical and genetic maps of the mouse genome using interspersed repetitive sequence PCR (IRS-PCR) genomics.
WO1997012993A1 (en) Mismatch repair detection
Dolf DNA Fingerprinting: Approaches and Applications
Arenstorf An investigation of linking clones as tools for mapping the human X chromosome
Glover et al. The physical characterization of cloned DNA segments and their counterparts within chromosomes