WO1999043855A1 - Unique identifier for biological samples - Google Patents

Unique identifier for biological samples Download PDF

Info

Publication number
WO1999043855A1
WO1999043855A1 PCT/US1999/004094 US9904094W WO9943855A1 WO 1999043855 A1 WO1999043855 A1 WO 1999043855A1 US 9904094 W US9904094 W US 9904094W WO 9943855 A1 WO9943855 A1 WO 9943855A1
Authority
WO
WIPO (PCT)
Prior art keywords
polymo
unique identifier
åhisms
sample
biological sample
Prior art date
Application number
PCT/US1999/004094
Other languages
French (fr)
Inventor
David H. Bing
Janice M. Williamson
Original Assignee
Genomics Collaborative, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Genomics Collaborative, Inc. filed Critical Genomics Collaborative, Inc.
Priority to CA002321963A priority Critical patent/CA2321963A1/en
Priority to JP2000533594A priority patent/JP2002505088A/en
Priority to EP99908461A priority patent/EP1056888A1/en
Priority to AU27892/99A priority patent/AU2789299A/en
Publication of WO1999043855A1 publication Critical patent/WO1999043855A1/en

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B50/00ICT programming tools or database systems specially adapted for bioinformatics
    • G16B50/20Heterogeneous data integration
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids

Definitions

  • sample's history and/or source Most samples and their sources are given a common alphanumeric designation, and this designation is also linked to information about the source and the sample (e.g., patient name, sample type, disease condition, etc.). A loss of this designation from its association with either the source, or the sample, or the information will often result in a complete loss of utility of all three.
  • the present invention relates to a method of creating a unique identifier for reliably identifying samples, their sources, and associated information.
  • the use of the identification system described herein substantially decreases potential mixups and misidentification of samples, their sources, and associated information.
  • the present invention provides a method for creating a unique identifier which is used to label the sample, its source, or the associated information, based on the polymo ⁇ hisms inherent in the sample and its source.
  • One or more polymorphisms in the sample is detected, and the resulting polymorphism data is used to produce a unique identifier, which is then used to identify the sample.
  • This unique identifier cairalso be linked with the source and/or any information that may be associated with either the sample or the source (i.e. the unique identifier can be used as a common designation for the sample, its source and/or other relevant information).
  • this unique identifier is separated from the sample, then the polymorphisms within the sample simply need to be re-detected to reproduce the polymorphism data which is then used to produce the unique identifier, thereby recreating the proper unique identifier, and, ultimately, its link to its source.
  • the invention features a method for producing a unique identifier for a biological sample, comprising detecting one or more polymorphisms within the biological sample, and selecting one or more polymo ⁇ hisms sufficient to form a unique identifier.
  • the biological sample can be from a vertebrate, an invertebrate, a plant, or consist of microorganisms.
  • the biological sample can also be from a mammal, particularly a human.
  • the sample can be blood, saliva, hair, body fluid, tissues, organs, one or more cells, or a whole organism.
  • the polymo ⁇ hisms can be nucleic acid polymo ⁇ hisms, protein polymo ⁇ hisms, enzyme polymo ⁇ hisms, chemical polymo ⁇ hisms, biochemical polymo ⁇ hisms, phenotypic polymo ⁇ hisms, and quantitative polymo ⁇ hisms, particularly a nucleic acid sequence polymo ⁇ hism, a nucleic acid length polymo ⁇ hism, or a short tandem repeat (STR).
  • the unique identifier can also be linked to the source of the biological sample, or relevant information about the biological sample or the source of the biological sample.
  • the unique identifier can be in the form of an alphanumeric string, or a bar code.
  • the invention also features a method for establishing a repository containing a collection of biological samples, comprising obtaining a biological sample from a source, detecting one or more polymo ⁇ hisms in the sample, selecting one or more polymo ⁇ hisms sufficient to form a unique identifier, using the unique identifier to identify the sample, storing the sample with the unique identifier, and repeating these steps for biological samples from other sources.
  • the samples are DNA-containing samples, particularly from humans, and the polymo ⁇ hisms are nucleic acid polymo ⁇ hisms, protein polymo ⁇ hisms, enzyme polymo ⁇ hisms, chemical polymo ⁇ hisms, biochemical polymo ⁇ hisms, phenotypic polymo ⁇ hisms, -4-
  • the unique identifier can be in the form of an alphanumeric string, or a bar code, and can also be linked to the source of the biological sample, or relevant information about the biological sample or the source of the biological sample.
  • the invention features a method of determining, by means of a u que identifier, if a source is represented by a sample within the repository, comprising obtaining a sample from the source, detecting one or more polymo ⁇ hisms in the sample selecting one or more polymo ⁇ hisms sufficient to form a unique identifier, and comparing the unique identifier so produced to the unique identifier of each sample in the repository, where shared identity between the two unique identifiers indicates that the source is already represented in the repository.
  • the samples are DNA-containing samples, preferably from humans.
  • the polymo ⁇ hisms are nucleic acid polymo ⁇ hisms, particularly short tandem repeats (STR), protein polymo ⁇ hisms, enzyme polymo ⁇ hisms, chemical polymo ⁇ hisms, biochemical polymo ⁇ hisms, phenotypic polymo ⁇ hisms, and quantitative polymo ⁇ hisms.
  • the unique identifier can also be linked to the source of the biological sample, or relevant information about the biological sample or the source of the biological sample.
  • the unique identifier can be in the form of an alphanumeric string, or a bar code.
  • the invention also features a method for linking, by means of a unique identifier, a first biological object lacking a unique identifier with a second object having a unique identifier, comprising detecting one or more polymo ⁇ hisms in the first biological object, selecting one or more polymo ⁇ hisms sufficient to form a unique identifier, and comparing the unique identifier so made to the unique identifier of the second object, where shared identity between the two unique identifiers links the first biological object with the second object.
  • the biological sample can be from a vertebrate, an invertebrate, a plant, or consist of microorganisms.
  • the biological sample can also be from a mammal, particularly a human.
  • the sample can be blood, saliva, hair, body fluid, tissues, organs, one or more cells, or a whole organism.
  • the polymo ⁇ hisms can be nucleic acid polymo ⁇ hisms, particularly short tandem repeats (STR), protein polymo ⁇ hisms, enzyme polymo ⁇ hisms, chemical polymo ⁇ hisms, biochemical polymo ⁇ hisms, phenotypic polymo ⁇ hisms, and quantitative polymo ⁇ hisms, particularly a nucleic acid sequence polymo ⁇ hism, or a nucleic acid length polymo ⁇ hism.
  • the unique identifier can also be linked to the source of the biological sample, or relevant information about the biological sample or the source of the biological sample.
  • the unique identifier can be in the form of an alphanumeric string, or a bar code.
  • the present invention provides a method for creating a unique identifier for identifying biological samples, their sources, or associated information based on the polymo ⁇ hisms inherent in the sample and source themselves.
  • the nucleic acid contained within the sample itself is used to produce the unique identifier for identifying and linking the sample, its source, and associated information.
  • No external material needs to be added to the sample which could dilute or alter the accuracy of other test results.
  • An advantage of the invention therefore, is that it is unnecessary to add "identifying sequences" to the samples, and that without such additions, one may conduct studies of genetics, disease associations, evolutionary relationships, etc., without the results being tainted by the added identifying sequences.
  • a “source” or “the source from which the sample is derived” refers to the originating material for a sample.
  • a source of a biological sample for example, can be a human, any animal, plant, insect, or a population or strain of microorganisms.
  • a source of a biological sample does not have to be living, and can be a deposit in a tissue repository, herbaria or museum specimens, forensic specimens, or fossils.
  • a “potential source” as used herein, means a source from which the sample may possibly have been taken in the past.
  • sample is meant a portion of source biological material that originated elsewhere, i.e., the sample was removed from its source.
  • a sample can be any biological sample, (e.g., blood, saliva, hair, organs, biopsies, bodily fluids, one or more cells), and can be taken from any vertebrate, including mammals such as humans, or plant, insect, reptile.
  • the sample can also be a strain or mixed -6-
  • Samples can also be biological materials taken from defunct or extinct organisms, e.g., samples can be taken from pressed plants in herbarium collections, or from pelts, taxidermy displays, fossils, or other materials in museum collections.
  • Information associated with the sample or source from which the sample is derived is meant to include, without limitation, any information that might be necessary or advantageous to be linked to the sample or the sample source, e.g. name, address, sex, medical history (in the case of human samples), species, collection data, provenance (in case of non-human samples), etc.
  • a biological sample is taken from its source, the sample is tested across one or more polymo ⁇ hic loci, and the polymo ⁇ hic data produced are used to create a unique identifier, which is identifiably linked to the sample, and serves as its unique designation.
  • This unique identifier can also be identifiably linked to the sample's source, and/or any information that may exist concerning the source and/or the sample.
  • the umque identifier is "identifiably linked" to the sample, sample source, or related information means that it is connected in some way with any or all of these three things, e.g.
  • the unique identifier may be on a label attached to a container holding the sample, the unique identifier may exist as a field in a database record containing medical data regarding the source, etc.
  • the genetic code of the sample itself which is unique and forms the basis of the polymo ⁇ hisms tested, serves as the unique identifier. Because the unique identifier is based on the genetic code, which is unique between individuals, the umque identifier will also be unique between samples from different source individuals.
  • a "polymo ⁇ hism" is an allelic variation between two samples.
  • the term includes differences between proteins (e.g., enzymes, blood groups, blood proteins), differences in the chemicals and biochemicals (e.g., secondary metabolites) produced by the source organism(s), differences between nucleic acids involving differences in the nucleotide sequence (e.g., restriction site maps), or differences in length of a stretch of nucleic acid (e.g., RFLPs (restriction fragment length polymo ⁇ hisms), microsatellites, STRs (short tandem repeats), SSRs (simple sequence repeats), SSLPs (simple sequence length polymo ⁇ hisms), VNTRs -7-
  • Allelic variation can also result in phenotypic (i.e., visually- apparent) polymo ⁇ hisms, or variations in quantitative characters (e.g. variation in height, length, yield of fruit, etc.) between the organisms that serve as the source of the samples.
  • phenotypic differences may be visible in the samples themselves, e.g., kernels of different types of "Indian" corn often appear very different from each other, with red, yellow, white, blue, streaked kernels, etc. With such samples, phenotypic polymo ⁇ hisms could also be used to produce the unique identifier.
  • a polymo ⁇ hism is not limited by the function or effect it may have on the organism as a whole, and can therefore include allelic differences which may also be a mutation, insertion, deletion, point mutation, or structural difference, as well as a strand break or chemical modification that results in an allelic variant.
  • allelic differences may also be a mutation, insertion, deletion, point mutation, or structural difference, as well as a strand break or chemical modification that results in an allelic variant.
  • a polymo ⁇ hism between two nucleic acids can occur naturally, or be caused intentionally by treatment (e.g., with chemicals or enzymes), or can be caused by circumstances normally associated with damage to nucleic acids (e.g., exposure to ultraviolet radiation, mutagens or carcinogens).
  • sequence polymo ⁇ hism is a difference in the sequence of two nucleic acids or two amino acids.
  • Two amino acid sequences can differ by having different residues at a particular position (i.e., and amino acid substitution), or some residues may be deleted, or new residues inserted or added to one or more ends.
  • Two nucleic acids differing in sequence may have the same number of base pairs (e.g., "AT£rC” vs. "ATT_C”), but may also include some differences in overall sequence length as well (e.g., "AT£A£ ⁇ TG” vs. "ATCACACATG”).
  • Types of commonly-studied polymo ⁇ hisms caused by sequence differences include restriction site polymo ⁇ hisms, isozymes, differences in protein conformation, and length polymo ⁇ hisms. If the nucleic acid is sequenced, then a sequence difference itself (as represented by the string of letters) serves as the polymo ⁇ hism.
  • a "length polymo ⁇ hism” is a difference in the length of two nucleic acids. Two different nucleic acids with a length polymo ⁇ hism between them also have a sequence polymo ⁇ hism, but many methods used to detect a length polymo ⁇ hism do not reveal the exact sequence polymo ⁇ hism. Commonly-used -8-
  • length polymo ⁇ hisms include RFLPs (restriction fragment length polymo ⁇ hisms), microsatellites, STRs (short tandem repeats), SSRs (simple sequence repeats), SSLPs (simple sequence length polymo ⁇ hisms), and VNTRs (variable number tandem repeats).
  • RFLPs restriction endonucleases are used to cut a nucleic acid molecule into fragments, which are then separated on an agarose gel.
  • the differences between two individuals are measured by the changes in size of the resultant nucleic acid fragments, and so are referred to as length polymo ⁇ hisms, yet those differences are caused by differences in the underlying sequence, which is the basis for the change in restriction sites, and therefore the changes in the sizes of the nucleic acid fragments. Because the method of detection/visualization can only differentiate on the basis of fragment length, the RFLPs are generally classed as length polymo ⁇ hisms..
  • a "polymo ⁇ hic locus" is a segment of nucleic acid which may contain a polymo ⁇ hism as described above. It is not required that the precise sequence of the nucleic acid be known.
  • a polymo ⁇ hic locus is not limited to those loci which are polymo ⁇ hic in all situations, e.g., a polymo ⁇ hic locus which displays an allelic variation between individuals A and B, but not between individuals A and C, remains a polymo ⁇ hic locus for purposes of comparing individuals A and B, as well as individuals B and C.
  • Nucleic acid means deoxyribonucleic acid (DNA), ribonucleic acid (RNA), nucleic acids from mammals or other animals, plants, insects, bacteria, viruses, or other organisms.
  • unique identifier is meant an identification tag, designation, or code to be linked to a sample, its source, or other information, such as patient case history, disease testing results, genetic testing results, geographic or temporal collection data, or any other information which may be useful when linked with the sample or source.
  • the unique identifier can exist in the form of an alphanumeric string, a bar -9-
  • allelic polymo ⁇ hism has been studied for decades, and there are many genetic systems which have been commonly used in assessing polymo ⁇ hism between populations or individuals. These include classical blood groups, blood proteins, isozymes, distribution of restriction endonuclease sites, restriction fragment length polymo ⁇ hisms (RFLPs), and others.
  • RFLPs restriction fragment length polymo ⁇ hisms
  • STRs are stretches of DNA that consist of repeated sequences repeats.
  • the base sequence is usually just a few base pairs long, typically two to twelve base pairs, but longer base repeats have been seen. This base sequence is then tandemly repeated, and the number of times it is repeated can vary greatly, depending on the STR locus being studied.
  • An STR can therefore be expressed as (X) n , where X is the repeated sequence, (e.g. "CA”) and n is the number of times that X is repeated.
  • the base sequence may be repeated 5 times in individual A, but may be repeated 8 times in mdividual B and 20 times in individual C.
  • tandem repeats are believed to be caused by "slippage" of the DNA polymerase enzyme as the DNA is replicated.
  • n increases over generations, and the amount of slippage varies over time and in different lines.
  • the variability of these repeated sequences is generally correlated to the length of the base repeat, with STRs composed of longer base repeats exhibiting less variability between individuals than shorter base repeats.
  • a two base pair repeat may consist of a two base pair unit being repeated hundreds of time in an individual, while a 12-base pair unit may only be repeated a few times.
  • Short repeats tend to exhibit higher rates of polymo ⁇ hism between individuals, while 10- or 12-base pair repeats may show little or no variability.
  • STRs can be amplified and detected by known procedures. For example, they can be detected by electrophoretic separation followed by radionuclide or fluorescent labeling, or silver staining. They have many advantages over other methods of detecting polymo ⁇ hisms (e.g., RFLPs) because of their small size, the ease and speed with which they can be detected and analyzed, and the fact that the process is amenable to automation.
  • RFLPs radionuclide or fluorescent labeling
  • the more recent generations of large-scale genetic maps have been made using STRs (Hudson, TJ. et al., Science 270:1945- 1954 (1995); Dietrich, W.F., et al., Nature Genetics 7:220-245 (1994); Yerle, M., et al., Ma m.
  • kits for amplifying STR loci are commercially available, and the rates of polymo ⁇ hism of these loci in different ethnic backgrounds are known. These include AmpFlSTRTM Profiler, AmpE/STRTM Profiler Plus, AmpE/STRTM Green I (P ⁇ Applied Biosystems, Foster City, California, USA), the GeneprintTM STR Systems (Promega Co ⁇ ., Madison, WI), including GeneprintTM PowerPlexTM 1.1, GeneprintTM PowerPlexTM 1.2, GeneprintTM PowerPlexTM 2, and GeneprintTM PowerPlexTM 16, Sex Determination Systems, and others. These STR systems were developed for use in humans, but microsatellite markers have been developed in other organisms, including horse, cattle, sheep, goat, dog, pig, mouse, rat, barley, corn, soybean, and others.
  • loci can be used singly, or can be combined, depending on the power of discrimination required. As the number of organisms being studied and the number of individuals from which samples are removed and archived increases, the degree of polymo ⁇ hism required to uniquely identify each sample also increases, -11-
  • Locus 2 alone could serve as the unique identifier, because by itself, it can serve to distinguish between samples from all three individuals.
  • the Power of Discrimination (P D ) of a given system of loci is defined as the probability that two individuals selected at random will differ with respect to that given system of loci.
  • the P D is related to the Probability of Identity (P ⁇ ) by the equation
  • the allelic frequencies within different ethnic populations are known for many of the polymo ⁇ hisms of the STR loci in the commercially-available kits, so a set of STRs which will provide a unique identifier for every sample can be chosen, even if the final number of samples is not known.
  • Combinations of loci can be chosen that have matching probabilities of less than 1 in several million or more (See, for example Table 1). -12-
  • loci Once the polymo ⁇ hism rates are known for a series of loci, one can choose which loci and how many will go into making up the unique identifier. If it is anticipated that the final number of samples will be relatively small, than only a few loci are sufficient to form the unique identifier, and one or more of the loci may not need to have a high P D . On the other hand, if one intends to store a very large number of samples, then it would be prudent to use more loci, each with a high P D . The loci seleected will be based on considerations such as P D , anticipated size of the repository, ease of use, applicability to the organisms being sampled, cost, and availability.
  • the polymo ⁇ hisms that can be used in the invention will vary depending on the types of samples being stored. STRs are well-studied in humans, and kits are commercially available for amplifying a number of STR loci. Genetic maps based on STRs have been built for other organisms (e.g., mouse, rat, pig). STRs appear to exist in most higher organisms, and are easy to isolate and characterize. Because the methods used to identify and assess STRs are virtually identical for different organisms, one skilled in the art can isolate STRs in an organism of choice, assess the polymo ⁇ hism rates, and choose those most useful in the present invention. For many organisms, STRs and their primer sequences have been published in the scientific literature.
  • STRs One wishing to use previously published STRs need only order those primers (e.g., custom primers can be ordered and received in 48 hours from Research Genetics, Huntsville, Alabama, USA), and then use them to amplify the STRs in the DNA of the collected samples. -13-
  • microsatellite markers are not necessary to use commercially-available primers to practice the present invention, nor is it necessary to use microsatellite markers developed by others.
  • the present invention allows one to use any polymo ⁇ hic marker that is convenient, so long as it provides a power of discrimination between individuals.
  • microsatellite markers have become so successful is that they are easy to develop for previously unstudied organisms.
  • the method described herein will be of particular use in a pathology laboratory or testing facility, or a large-scale cryogenic repository. Maintaining the integrity of the sample labels is of paramount importance in these situations, as quality control problems often result from failure of the record-keeping system. Naturally, such a method will also be of use to blood banks, tissue banks, and veterinary hospitals and testing facilities.
  • the method can also be used by large repositories to identify misplaced or misidentified samples.
  • a tissue bank may take a small piece (e.g., a sample) of a stored tissue (e.g., a source) for testing (e.g., tissue typing for a potential recipient of the tissue). If the identification were disassociated from the sample (e.g., the label fell off the test tube), those test results would normally be lost.
  • Using the unique identifier of the present invention one would simply test the sample for the polymo ⁇ hic loci, and recreate the unique identifier. The sample (and the tissue typing test results) could then be reassociated with the source in storage.
  • the method is especially useful to maintain the long-term integrity of samples and associated information, especially in tissue repositories.
  • Many biomedical studies require analysis of tissue samples from large populations of individuals with known medical, dietary, genetic, social, and cultural backgrounds. During the course of a study requiring several years to complete, it may be necessary -14-
  • a blood sample repository contained samples from 100,000 individuals, and associated medical data on those same individuals in computerized form
  • a medical study could be conducted by selecting individuals with desired characteristics (as listed in the computerized medical data), and then retrieving samples (or more likely, sub-samples) from those individuals, which are held in the repository.
  • the method of the present invention is useful in establishing such a repository, because the method greatly reduces the likelihood of samples being misidentified and allows confident re-retrieval of samples.
  • Another advantage of the invention is that if the unique identifier is de- associated from the sample within the repository (e.g., the label falls off the tube) analysis of the polymo ⁇ hisms in the sample allows re-creation of the unique identifier.
  • Use of the method of the invention also provides a method for preventing repository deposit of samples from duplicate sources, because when the unique identifier is created for a sample from a new source, one need only search the repository records for that same unique identifier to see if the source is already represented by a sample in the repository.
  • the invention also has uses outside of the medical field. Because of the increasing ease with which samples from various sources (e.g., plant, animal, microbial, fungal, viral) can be tested for polymo ⁇ hisms, the invention is applicable in any situation where a large number of biological samples may be stored.
  • sources e.g., plant, animal, microbial, fungal, viral
  • This sample identification method can be used to keep track of samples in any study or collection where there are a large number of biological samples being stored for a period of time, and where there is a chance that samples may become misplaced or mislabeled.
  • a biological sample is obtained from a human, and an aliquot is taken for polymo ⁇ hism testing.
  • DNA is isolated by methods well known in the art (Maniatis et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, New York; Ausubel, F.M. et al, eds., Current Protocols in Molecular Biology). An amount of this isolated DNA is removed, GenePrintTM primers (Promega Co ⁇ ., Madison, WI) for the CSF1PO locus are added to it, and amplification is carried out, all according to the manufacturer's instructions (supplemental information on thermocycling are well known in the art, see e.g., Innis, M.S., et al.
  • the unique identifier can be created for the sample.
  • a convenient conversion method is to simply list each locus by letter, followed by the two alleles for that locus.
  • the unique identifier would be "A35B27...HXY".
  • the precise conversion method could be varied, depending on the number of repeats in the loci, e.g., a locus with 3 - 12 repeats would require 4 digits after the locus letter.
  • Red blood cells lack DNA because they are enucleated, and must therefore be lysed to facilitate their separation from white blood cells, which contain genomic DNA.
  • the white blood cells are then lysed with an anionic detergent in the presence of a DNA stabilizer, which limits the activity of DNase.
  • Contaminating RNA is then degraded with RNase, and the RNA, proteins, and other contaminants are then removed by salt precipitation.
  • the genomic DNA is recovered by alcohol preciptation, dissolved in TE buffer, and stored. Because the genomic DNA will be used in a nucleic acid amplification -17-
  • Isolation of genomic DNA from whole blood can be accomplished by following any of a variety of protocols, including using the PUREGENE ® kit (Gentra Systems, Minneapolis, Minnesota, USA), and following the manufacturer's instructions. Place 30 ml of RBC Lysis Solution into a 50 ml tube, add 7 ml to 10 ml of whole blood, mix by inverting several times, and incubate for 10 minutes at room temperature. Invert the tube again at least once during the incubation.
  • DNA Hydration Solution (Gentra Systems, Minneapolis, Minnesota, USA), and rehydrate the DNA by incubating at 65 °C for 1 hour, overnight at room temperature, and at 65 °C 1 hour the next day. Tap the tube periodically to help disperse the DNA.
  • the DNA in solution can be stored indefinitely at 4°C.
  • a blood sample is drawn from a human. Two ⁇ l of blood are placed on a piece of FTATMpaper (FITZCO, Inc., Maple Plain, Minnesota, USA), dried, and stored until ready to be processed.
  • FTATMpaper FITZCO, Inc., Maple Plain, Minnesota, USA
  • a 1 mm disc is punched directly into a 2 ml microcentrifuge tube, and 200 ⁇ l of FT ATM purification reagent is placed on the disc.
  • the tube is capped, vortexed for 3-5 seconds, then centrifuged in a microcentrifuge at 12,000 x g for 30 seconds.
  • the wash solution is then aspirated and discarded.
  • the wash is then repeated with another 200 ⁇ l of purification reagent.
  • the disc is washed twice with TE as follows: 200 ⁇ l of TE buffer is added, and the disc vortexed for 3-5 seconds, the tube and disc are then centrifuged at 12,000 x g for 30 seconds and the filtrate removed and discarded. After the disc has been washed twice with TE, the disc is subjected to polymo ⁇ hism analysis.
  • a 1 mm punch of FT ATM paper containing a blood sample, processed as described in Example 2, supra, is placed in a 0.5 ml tube, and tested with the AmpE/STR Profiler PlusTM system (Perkin Elmer Applied Biosystems, Foster City, California, USA), according to the manufacturer's instructions.
  • AmpE/STR Profiler PlusTM system Perkin Elmer Applied Biosystems, Foster City, California, USA
  • to the tube is added 10.5 ⁇ l of Profiler Plus Reaction Mixture, 0.5 ⁇ l of Taq Gold, and 5.5 ⁇ l of Primer Mixture.
  • the tube is sealed, and placed in a thermocycler under the following conditions: 95 °C for 11 minutes, followed by 24 cycles of: 94 °C for 1 -19-
  • reaction mixture is placed at 60 °C for up to 83 minutes. After thermocycling is complete, the reaction is held at 4°C until ready for gel electrophoresis.
  • Example 5 Analysis of Amplification Products by Gel Electrophoresis
  • Five ⁇ l of amplification product (produced as described above in Example 3) are mixed with 5 ⁇ l of 2X loading buffer (0.25% Bromphenol Blue, 12.5% Ficoll 400, 50 mM EDTA, 5X TAN (10X TAN: 0.4 M Tris, 40 mM Na Acetate Trihydrate, 10 mM EDTA, pH to 7.9 with acetic acid)).
  • the 10 ⁇ l mixture is loaded into a well in a 1% agarose gel prepared with TE buffer and containing 0.5 ⁇ g of etiiidium bromide per ml of agarose gel. Appropriate size ladder is also loaded on the gel.
  • the gel is then electrophoresed in TAE buffer for 1 hour at 100 volts, and then illuminated with UV light on a fransilluminator, and photographed.
  • the bands in the photograph are then compared to the literature supplied by the manufacturer to determine the precise alleles present in the sample.
  • the polymo ⁇ hism data for a sample can be coded in a number of ways.
  • the raw data for individual #1 for example, is as follows:
  • This data can be used "raw” as the unique identifier (i.e., "as is,” as above), with no alteration. For repositories with very large numbers of samples, this may be desireable, as it is the most “foolproof method.
  • the STR loci can be "coded,” i.e., each locus represented by a combination of numbers or letters, e.g., D3S1358 can be represented by "A” or "01,” vWA by "B” or "02,” etc.
  • the raw data so coded would then be:

Landscapes

  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Bioethics (AREA)
  • Biophysics (AREA)
  • Databases & Information Systems (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Biotechnology (AREA)
  • Evolutionary Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Theoretical Computer Science (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

The present invention provides a method for internal labelling of a biological sample by which the sample is identifiably linked to its source and other relevant information, based on the polymorphisms inherent in the sample itself. A set of polymorphisms in the sample is detected, and the resulting data is used as a unique identifier which is then used to identify the sample. This unique identifier can also be used to identify the source of the sample, and any other relevant information.

Description

-1-
UNIQUE IDENTIFIER FOR BIOLOGICAL SAMPLES
RELATED APPLICAΗON(S)
This application claims priority to application 60/076,081, filed February 26, 1998, the entire teachings of which are incorporated herein by reference.
BACKGROUND OF THE INVENTION
With the advent of the Human Genome Project and the advances in technology that have resulted, biological and genetic testing have become increasingly more common. Hospitals and other health care entities are using new tests for diseases, and are processing more samples for testing than ever before. The ease and speed of many biological tests has also increased enormously, so that these tests are now being widely used outside of the health care industry. Veterinarians, of course, have always closely followed advances in human health care. But law enforcement agencies now routinely employ DNA-based methods in forensics, and even population geneticists, ecologists, and evolutionary biologists use these methods to track the evolution and variability within and between populations of organisms.
When handling large numbers of samples, accurate and reliable tracking of samples and quality control of associated information is vital. In hospital settings, aberrant test results are always a cause for concern because doubts are then cast on the state of the patient's health. In a tissue repository, it must be possible for a sample (or portions of a sample) to be reliably and repeatedly retrieved with no doubts as to the sample's identity. Mislabeling or loss of labeling of a sample may mean that the sample is rendered useless if it cannot be accurately connected back to -2-
the sample's history and/or source. Most samples and their sources are given a common alphanumeric designation, and this designation is also linked to information about the source and the sample (e.g., patient name, sample type, disease condition, etc.). A loss of this designation from its association with either the source, or the sample, or the information will often result in a complete loss of utility of all three.
This potential loss of association between the designation and the sample is especially likely in settings where very large numbers of samples are being processed. Machine errors, while problematic, generally result in the destruction of large numbers of samples, and so are noticed easily. Human error, however, has the potential to cause serious errors that go unnoticed for a period of time. These include transcription errors, misplacing or swapping of samples, destruction of labels, off-by-one errors (resulting in a series of samples where the designation or information from each sample is misassociated with the next sample). In addition, pages from lab notebooks can be obliterated or lost, and magnetic media corrupted. Databases containing all of this information can be backed up, but intervening data added to the database since the last backup is usually lost. If an error is introduced and not discovered until after a backup is made, then this error effectively replaces the "true" data. In addition, many facilities save only the most recent backup, or store backups at the same site as the current data, resulting in loss of all information in the event of a physical disaster (e.g. a fire).
SUMMARY OF THE INVENTION
The present invention relates to a method of creating a unique identifier for reliably identifying samples, their sources, and associated information. The use of the identification system described herein substantially decreases potential mixups and misidentification of samples, their sources, and associated information.
Specifically, the present invention provides a method for creating a unique identifier which is used to label the sample, its source, or the associated information, based on the polymoφhisms inherent in the sample and its source. One or more polymorphisms in the sample is detected, and the resulting polymorphism data is used to produce a unique identifier, which is then used to identify the sample. This unique identifier cairalso be linked with the source and/or any information that may be associated with either the sample or the source (i.e. the unique identifier can be used as a common designation for the sample, its source and/or other relevant information). If this unique identifier is separated from the sample, then the polymorphisms within the sample simply need to be re-detected to reproduce the polymorphism data which is then used to produce the unique identifier, thereby recreating the proper unique identifier, and, ultimately, its link to its source.
In general, the invention features a method for producing a unique identifier for a biological sample, comprising detecting one or more polymorphisms within the biological sample, and selecting one or more polymoφhisms sufficient to form a unique identifier. The biological sample can be from a vertebrate, an invertebrate, a plant, or consist of microorganisms. The biological sample can also be from a mammal, particularly a human. The sample can be blood, saliva, hair, body fluid, tissues, organs, one or more cells, or a whole organism. The polymoφhisms can be nucleic acid polymoφhisms, protein polymoφhisms, enzyme polymoφhisms, chemical polymoφhisms, biochemical polymoφhisms, phenotypic polymoφhisms, and quantitative polymoφhisms, particularly a nucleic acid sequence polymoφhism, a nucleic acid length polymoφhism, or a short tandem repeat (STR). The unique identifier can also be linked to the source of the biological sample, or relevant information about the biological sample or the source of the biological sample. The unique identifier can be in the form of an alphanumeric string, or a bar code.
The invention also features a method for establishing a repository containing a collection of biological samples, comprising obtaining a biological sample from a source, detecting one or more polymoφhisms in the sample, selecting one or more polymoφhisms sufficient to form a unique identifier, using the unique identifier to identify the sample, storing the sample with the unique identifier, and repeating these steps for biological samples from other sources. The samples, in general, are DNA-containing samples, particularly from humans, and the polymoφhisms are nucleic acid polymoφhisms, protein polymoφhisms, enzyme polymoφhisms, chemical polymoφhisms, biochemical polymoφhisms, phenotypic polymoφhisms, -4-
and quantitative polymoφhisms, or short tandem repeat (STR). The unique identifier can be in the form of an alphanumeric string, or a bar code, and can also be linked to the source of the biological sample, or relevant information about the biological sample or the source of the biological sample. In addition, the invention features a method of determining, by means of a u que identifier, if a source is represented by a sample within the repository, comprising obtaining a sample from the source, detecting one or more polymoφhisms in the sample selecting one or more polymoφhisms sufficient to form a unique identifier, and comparing the unique identifier so produced to the unique identifier of each sample in the repository, where shared identity between the two unique identifiers indicates that the source is already represented in the repository. In general, the samples are DNA-containing samples, preferably from humans. The polymoφhisms are nucleic acid polymoφhisms, particularly short tandem repeats (STR), protein polymoφhisms, enzyme polymoφhisms, chemical polymoφhisms, biochemical polymoφhisms, phenotypic polymoφhisms, and quantitative polymoφhisms. The unique identifier can also be linked to the source of the biological sample, or relevant information about the biological sample or the source of the biological sample. The unique identifier can be in the form of an alphanumeric string, or a bar code. The invention also features a method for linking, by means of a unique identifier, a first biological object lacking a unique identifier with a second object having a unique identifier, comprising detecting one or more polymoφhisms in the first biological object, selecting one or more polymoφhisms sufficient to form a unique identifier, and comparing the unique identifier so made to the unique identifier of the second object, where shared identity between the two unique identifiers links the first biological object with the second object. The biological sample can be from a vertebrate, an invertebrate, a plant, or consist of microorganisms. The biological sample can also be from a mammal, particularly a human. The sample can be blood, saliva, hair, body fluid, tissues, organs, one or more cells, or a whole organism. The polymoφhisms can be nucleic acid polymoφhisms, particularly short tandem repeats (STR), protein polymoφhisms, enzyme polymoφhisms, chemical polymoφhisms, biochemical polymoφhisms, phenotypic polymoφhisms, and quantitative polymoφhisms, particularly a nucleic acid sequence polymoφhism, or a nucleic acid length polymoφhism. The unique identifier can also be linked to the source of the biological sample, or relevant information about the biological sample or the source of the biological sample. The unique identifier can be in the form of an alphanumeric string, or a bar code.
DETAILED DESCRIPTION OF THE INVENTION
The present invention provides a method for creating a unique identifier for identifying biological samples, their sources, or associated information based on the polymoφhisms inherent in the sample and source themselves. In this method, the nucleic acid contained within the sample itself is used to produce the unique identifier for identifying and linking the sample, its source, and associated information. No external material needs to be added to the sample which could dilute or alter the accuracy of other test results. An advantage of the invention, therefore, is that it is unnecessary to add "identifying sequences" to the samples, and that without such additions, one may conduct studies of genetics, disease associations, evolutionary relationships, etc., without the results being tainted by the added identifying sequences.
A "source" or "the source from which the sample is derived" refers to the originating material for a sample. A source of a biological sample, for example, can be a human, any animal, plant, insect, or a population or strain of microorganisms. A source of a biological sample does not have to be living, and can be a deposit in a tissue repository, herbaria or museum specimens, forensic specimens, or fossils. A "potential source" as used herein, means a source from which the sample may possibly have been taken in the past.
By "sample" is meant a portion of source biological material that originated elsewhere, i.e., the sample was removed from its source. A sample can be any biological sample, (e.g., blood, saliva, hair, organs, biopsies, bodily fluids, one or more cells), and can be taken from any vertebrate, including mammals such as humans, or plant, insect, reptile. The sample can also be a strain or mixed -6-
population of microbes. Samples can also be biological materials taken from defunct or extinct organisms, e.g., samples can be taken from pressed plants in herbarium collections, or from pelts, taxidermy displays, fossils, or other materials in museum collections. "Information associated with" the sample or source from which the sample is derived is meant to include, without limitation, any information that might be necessary or advantageous to be linked to the sample or the sample source, e.g. name, address, sex, medical history (in the case of human samples), species, collection data, provenance (in case of non-human samples), etc. Once a biological sample is taken from its source, the sample is tested across one or more polymoφhic loci, and the polymoφhic data produced are used to create a unique identifier, which is identifiably linked to the sample, and serves as its unique designation. This unique identifier can also be identifiably linked to the sample's source, and/or any information that may exist concerning the source and/or the sample. By saying that the umque identifier is "identifiably linked" to the sample, sample source, or related information means that it is connected in some way with any or all of these three things, e.g. the unique identifier may be on a label attached to a container holding the sample, the unique identifier may exist as a field in a database record containing medical data regarding the source, etc. In essence, the genetic code of the sample itself, which is unique and forms the basis of the polymoφhisms tested, serves as the unique identifier. Because the unique identifier is based on the genetic code, which is unique between individuals, the umque identifier will also be unique between samples from different source individuals. A "polymoφhism" is an allelic variation between two samples. As used herein, the term includes differences between proteins (e.g., enzymes, blood groups, blood proteins), differences in the chemicals and biochemicals (e.g., secondary metabolites) produced by the source organism(s), differences between nucleic acids involving differences in the nucleotide sequence (e.g., restriction site maps), or differences in length of a stretch of nucleic acid (e.g., RFLPs (restriction fragment length polymoφhisms), microsatellites, STRs (short tandem repeats), SSRs (simple sequence repeats), SSLPs (simple sequence length polymoφhisms), VNTRs -7-
(variable number tandem repeats)). Allelic variation can also result in phenotypic (i.e., visually- apparent) polymoφhisms, or variations in quantitative characters (e.g. variation in height, length, yield of fruit, etc.) between the organisms that serve as the source of the samples. With some types of biological material, phenotypic differences may be visible in the samples themselves, e.g., kernels of different types of "Indian" corn often appear very different from each other, with red, yellow, white, blue, streaked kernels, etc. With such samples, phenotypic polymoφhisms could also be used to produce the unique identifier.
A polymoφhism is not limited by the function or effect it may have on the organism as a whole, and can therefore include allelic differences which may also be a mutation, insertion, deletion, point mutation, or structural difference, as well as a strand break or chemical modification that results in an allelic variant. A polymoφhism between two nucleic acids can occur naturally, or be caused intentionally by treatment (e.g., with chemicals or enzymes), or can be caused by circumstances normally associated with damage to nucleic acids (e.g., exposure to ultraviolet radiation, mutagens or carcinogens).
As used herein, a "sequence polymoφhism" is a difference in the sequence of two nucleic acids or two amino acids. Two amino acid sequences can differ by having different residues at a particular position (i.e., and amino acid substitution), or some residues may be deleted, or new residues inserted or added to one or more ends. Two nucleic acids differing in sequence may have the same number of base pairs (e.g., "AT£rC" vs. "ATT_C"), but may also include some differences in overall sequence length as well (e.g., "AT£A£ΔTG" vs. "ATCACACATG"). Types of commonly-studied polymoφhisms caused by sequence differences include restriction site polymoφhisms, isozymes, differences in protein conformation, and length polymoφhisms. If the nucleic acid is sequenced, then a sequence difference itself (as represented by the string of letters) serves as the polymoφhism.
As used herein, a "length polymoφhism" is a difference in the length of two nucleic acids. Two different nucleic acids with a length polymoφhism between them also have a sequence polymoφhism, but many methods used to detect a length polymoφhism do not reveal the exact sequence polymoφhism. Commonly-used -8-
types of length polymoφhisms include RFLPs (restriction fragment length polymoφhisms), microsatellites, STRs (short tandem repeats), SSRs (simple sequence repeats), SSLPs (simple sequence length polymoφhisms), and VNTRs (variable number tandem repeats). In general, the difference between "length polymoφhisms" and "sequence polymoφhisms" is generally in the methods used to detect them. With RFLPs, for example, restriction endonucleases are used to cut a nucleic acid molecule into fragments, which are then separated on an agarose gel. The differences between two individuals are measured by the changes in size of the resultant nucleic acid fragments, and so are referred to as length polymoφhisms, yet those differences are caused by differences in the underlying sequence, which is the basis for the change in restriction sites, and therefore the changes in the sizes of the nucleic acid fragments. Because the method of detection/visualization can only differentiate on the basis of fragment length, the RFLPs are generally classed as length polymoφhisms..
As used herein, a "polymoφhic locus" is a segment of nucleic acid which may contain a polymoφhism as described above. It is not required that the precise sequence of the nucleic acid be known. A polymoφhic locus is not limited to those loci which are polymoφhic in all situations, e.g., a polymoφhic locus which displays an allelic variation between individuals A and B, but not between individuals A and C, remains a polymoφhic locus for purposes of comparing individuals A and B, as well as individuals B and C.
"Nucleic acid" means deoxyribonucleic acid (DNA), ribonucleic acid (RNA), nucleic acids from mammals or other animals, plants, insects, bacteria, viruses, or other organisms.
By "unique identifier" is meant an identification tag, designation, or code to be linked to a sample, its source, or other information, such as patient case history, disease testing results, genetic testing results, geographic or temporal collection data, or any other information which may be useful when linked with the sample or source. The unique identifier can exist in the form of an alphanumeric string, a bar -9-
code, an entry in a database, or any other useful human-readable or machine- readable form.
The extent to which such an identifier will be unique depends on the loci chosen for polymoφhism testing. Allelic polymoφhism has been studied for decades, and there are many genetic systems which have been commonly used in assessing polymoφhism between populations or individuals. These include classical blood groups, blood proteins, isozymes, distribution of restriction endonuclease sites, restriction fragment length polymoφhisms (RFLPs), and others. The most successful to date, however, have been microsatellites, also known as short tandem repeats (STRs), or simple sequence length polymoφhisms (SSLPs), or variable number tandem repeats (NNTRs).
STRs are stretches of DNA that consist of repeated sequences repeats. The base sequence is usually just a few base pairs long, typically two to twelve base pairs, but longer base repeats have been seen. This base sequence is then tandemly repeated, and the number of times it is repeated can vary greatly, depending on the STR locus being studied. An STR can therefore be expressed as (X)n, where X is the repeated sequence, (e.g. "CA") and n is the number of times that X is repeated.
Most individuals in a population will have the same STR at the same location in the genome, that is different individuals will have the same base repeat at the same location, but the precise number of repeats often varies from individual to individual. For example, for a given STR mapped to a particular location on the genome, the base sequence may be repeated 5 times in individual A, but may be repeated 8 times in mdividual B and 20 times in individual C.
These tandem repeats are believed to be caused by "slippage" of the DNA polymerase enzyme as the DNA is replicated. In general, n increases over generations, and the amount of slippage varies over time and in different lines. The variability of these repeated sequences is generally correlated to the length of the base repeat, with STRs composed of longer base repeats exhibiting less variability between individuals than shorter base repeats. For example, a two base pair repeat may consist of a two base pair unit being repeated hundreds of time in an individual, while a 12-base pair unit may only be repeated a few times. In general, the amount -10-
of slippage that occurs during replication, and therefore the amount of variability in the number of repeats that results from that slippage, is also correlated to the length of the base repeat. Short repeats tend to exhibit higher rates of polymoφhism between individuals, while 10- or 12-base pair repeats may show little or no variability.
STRs can be amplified and detected by known procedures. For example, they can be detected by electrophoretic separation followed by radionuclide or fluorescent labeling, or silver staining. They have many advantages over other methods of detecting polymoφhisms (e.g., RFLPs) because of their small size, the ease and speed with which they can be detected and analyzed, and the fact that the process is amenable to automation. The more recent generations of large-scale genetic maps have been made using STRs (Hudson, TJ. et al., Science 270:1945- 1954 (1995); Dietrich, W.F., et al., Nature Genetics 7:220-245 (1994); Yerle, M., et al., Ma m. Genome 6:176-186 (1995); Jacob H.J., et al., Nature Genetics 9:63-69 (1995)). Because of the extremely high rate of polymoφhism of some of the STR loci, they are also used in forensic tests by law enforcement agencies.
A number of kits for amplifying STR loci are commercially available, and the rates of polymoφhism of these loci in different ethnic backgrounds are known. These include AmpFlSTR™ Profiler, AmpE/STR™ Profiler Plus, AmpE/STR™ Green I (PΕ Applied Biosystems, Foster City, California, USA), the Geneprint™ STR Systems (Promega Coφ., Madison, WI), including Geneprint™ PowerPlex™ 1.1, Geneprint™ PowerPlex™ 1.2, Geneprint™ PowerPlex™ 2, and Geneprint™ PowerPlex™ 16, Sex Determination Systems, and others. These STR systems were developed for use in humans, but microsatellite markers have been developed in other organisms, including horse, cattle, sheep, goat, dog, pig, mouse, rat, barley, corn, soybean, and others.
These loci can be used singly, or can be combined, depending on the power of discrimination required. As the number of organisms being studied and the number of individuals from which samples are removed and archived increases, the degree of polymoφhism required to uniquely identify each sample also increases, -11-
and the number of polymoφhic loci that need to be tested to have a sufficient number to create the unique identifier also increases.
For example, if three individuals possessed the following alleles at three different loci:
Locus 1 Locus 2 Locus 3
Individual A 1,3 1,2 3,5
Individual B 2,3 2,2 1,4
Individual C 2,3
Figure imgf000013_0001
1,5 3,5
then detection of the alleles at Locus 1 would allow a sample from Individual A to be distinguished from a sample from B or C, but samples from B and C could not be uniquely distinguished from each other, and a second locus would need to be tested.
On the other hand, Locus 2 alone could serve as the unique identifier, because by itself, it can serve to distinguish between samples from all three individuals.
The Power of Discrimination (PD) of a given system of loci is defined as the probability that two individuals selected at random will differ with respect to that given system of loci. The PD is related to the Probability of Identity (Pτ) by the equation
PD = 1 - Pι, Where Pi is determined by solving the equation Pι = ∑Xi2, where X, is the frequency in the population of the tth allele. The allelic frequencies within different ethnic populations are known for many of the polymoφhisms of the STR loci in the commercially-available kits, so a set of STRs which will provide a unique identifier for every sample can be chosen, even if the final number of samples is not known. Combinations of loci can be chosen that have matching probabilities of less than 1 in several million or more (See, for example Table 1). -12-
Table 1. Matching probabilities of various populations in the Geneprint™ STR system, using fluorescent detection (Promega Coφ., Madison, Wisconsin, USA).
Caucasian-American African-American Hispanic-American
CTTv Quadriplex 1/6623 1/25575 1/7194
FFFL Quadriplex 1/2632 1/16807 1/3279
Both combined 1/17,400,000 1/430,000,000 1/23,600,000
Figure imgf000014_0001
Once the polymoφhism rates are known for a series of loci, one can choose which loci and how many will go into making up the unique identifier. If it is anticipated that the final number of samples will be relatively small, than only a few loci are sufficient to form the unique identifier, and one or more of the loci may not need to have a high PD. On the other hand, if one intends to store a very large number of samples, then it would be prudent to use more loci, each with a high PD. The loci seleected will be based on considerations such as PD, anticipated size of the repository, ease of use, applicability to the organisms being sampled, cost, and availability.
Polymoφhisms used
The polymoφhisms that can be used in the invention will vary depending on the types of samples being stored. STRs are well-studied in humans, and kits are commercially available for amplifying a number of STR loci. Genetic maps based on STRs have been built for other organisms (e.g., mouse, rat, pig). STRs appear to exist in most higher organisms, and are easy to isolate and characterize. Because the methods used to identify and assess STRs are virtually identical for different organisms, one skilled in the art can isolate STRs in an organism of choice, assess the polymoφhism rates, and choose those most useful in the present invention. For many organisms, STRs and their primer sequences have been published in the scientific literature. One wishing to use previously published STRs need only order those primers (e.g., custom primers can be ordered and received in 48 hours from Research Genetics, Huntsville, Alabama, USA), and then use them to amplify the STRs in the DNA of the collected samples. -13-
It is not necessary to use commercially-available primers to practice the present invention, nor is it necessary to use microsatellite markers developed by others. The present invention allows one to use any polymoφhic marker that is convenient, so long as it provides a power of discrimination between individuals. There are many species worthy of study for which no genetic map exists. One of the reasons that microsatellite markers have become so successful is that they are easy to develop for previously unstudied organisms. One already familiar with an organism for which there are no microsatellite markers can develop them with relative ease using methods well-known in the art.
Uses of the invention
The method described herein will be of particular use in a pathology laboratory or testing facility, or a large-scale cryogenic repository. Maintaining the integrity of the sample labels is of paramount importance in these situations, as quality control problems often result from failure of the record-keeping system. Naturally, such a method will also be of use to blood banks, tissue banks, and veterinary hospitals and testing facilities.
The method can also be used by large repositories to identify misplaced or misidentified samples. For example, a tissue bank may take a small piece (e.g., a sample) of a stored tissue (e.g., a source) for testing (e.g., tissue typing for a potential recipient of the tissue). If the identification were disassociated from the sample (e.g., the label fell off the test tube), those test results would normally be lost. Using the unique identifier of the present invention, however, one would simply test the sample for the polymoφhic loci, and recreate the unique identifier. The sample (and the tissue typing test results) could then be reassociated with the source in storage.
The method is especially useful to maintain the long-term integrity of samples and associated information, especially in tissue repositories. Many biomedical studies require analysis of tissue samples from large populations of individuals with known medical, dietary, genetic, social, and cultural backgrounds. During the course of a study requiring several years to complete, it may be necessary -14-
to test a particular sample several times. It is therefore vital to the accuracy of the study to confidently re-retrieve that sample.
Many biomedical studies involve analysis of a group of individuals with a set of characteristics in common (e.g., cigarette smoking, ethnic background, incidence of particular cancers). At present, the amount of time and effort involved in assembling a set of individuals appropriate for a study may be greater than the effort of conducting the study itself. If samples from a large number of individuals could be collected in a repository along with associated data on the individuals, then it should be possible to "assemble" a set of individuals for a given study by selecting samples from these individuals chosen on the basis of a defined set of characteristics. For example, if a blood sample repository contained samples from 100,000 individuals, and associated medical data on those same individuals in computerized form, then a medical study could be conducted by selecting individuals with desired characteristics (as listed in the computerized medical data), and then retrieving samples (or more likely, sub-samples) from those individuals, which are held in the repository. The method of the present invention is useful in establishing such a repository, because the method greatly reduces the likelihood of samples being misidentified and allows confident re-retrieval of samples.
Another advantage of the invention is that if the unique identifier is de- associated from the sample within the repository (e.g., the label falls off the tube) analysis of the polymoφhisms in the sample allows re-creation of the unique identifier.
Use of the method of the invention also provides a method for preventing repository deposit of samples from duplicate sources, because when the unique identifier is created for a sample from a new source, one need only search the repository records for that same unique identifier to see if the source is already represented by a sample in the repository.
The invention also has uses outside of the medical field. Because of the increasing ease with which samples from various sources (e.g., plant, animal, microbial, fungal, viral) can be tested for polymoφhisms, the invention is applicable in any situation where a large number of biological samples may be stored. An -15-
example of such a situation would be a field study of biodiversity of a wild population. New tests for assessing diversity (i.e., assessing polymoφhism) are continually being created, and samples from previous collecting expeditions represent a "snapshot" of the biodiversity that existed in the past. Such previously- collected samples can be re-tested using current techniques, but the results are only useful if the integrity of the sample designations is still sound, and the samples can be linked to their original collection data. Maintenance of quality of the record keeping is especially important if the field samples are from species which are endangered or extinct. The method of the invention provided here has potential uses in studies of population genetics, evolutionary genetics, and ecology. In studies of flora and fauna from locales that are either increasing or decreasing in pollution, for example, it is necessary to both store the samples for a period of time and also maintain their identification. Such sampling at periodic intervals is also a requirement of an effective bioremediation plan. There are many situations where one would want to keep a biological sample for a period of time against the possibility of testing it again later. For example, even if one has conducted a population genetics study on a series of samples (collection of organisms), a new test developed at a future time may allow the testing of different hypotheses, and provide the answer to new questions, without necessitating collection of new samples in the field. Therefore, the method of the invention described herein would be especially useful in maintaining collections of samples from endangered species. The unique identifier and identity of each sample can be re- verified from the sample itself.
This sample identification method can be used to keep track of samples in any study or collection where there are a large number of biological samples being stored for a period of time, and where there is a chance that samples may become misplaced or mislabeled.
EXAMPLES,
Example 1 : Use of STRs to Create a Umque Identifier -16-
A biological sample is obtained from a human, and an aliquot is taken for polymoφhism testing. DNA is isolated by methods well known in the art (Maniatis et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, New York; Ausubel, F.M. et al, eds., Current Protocols in Molecular Biology). An amount of this isolated DNA is removed, GenePrint™ primers (Promega Coφ., Madison, WI) for the CSF1PO locus are added to it, and amplification is carried out, all according to the manufacturer's instructions (supplemental information on thermocycling are well known in the art, see e.g., Innis, M.S., et al. (1990) PCR Protocols: A Guide to Methods and Applications, Academic Press, Inc. San Diego, CA). After amplification, fluorescence detection is carried out, also according to the manufacturer's recommendations. The process is repeated for the other loci in the CTTv Quadriplex (TPOX, TH01, vWA), and also the four loci in the FFFL Quadriplex (F13A01, FESFPS, F13B, and LPL).
Once detection of the polymoφhism(s) is complete, the unique identifier can be created for the sample. For a system such as these eight loci, where the alleles are 3 to 7 repeats in length, a convenient conversion method is to simply list each locus by letter, followed by the two alleles for that locus. For a sample with alleles of 3 and 5 tandem repeats at the first locus, alleles of 2 and 7 repeats at the second, etc., the unique identifier would be "A35B27...HXY". The precise conversion method could be varied, depending on the number of repeats in the loci, e.g., a locus with 3 - 12 repeats would require 4 digits after the locus letter.
Example 2: Preparation of DNA from Samples of Whole Blood
Red blood cells lack DNA because they are enucleated, and must therefore be lysed to facilitate their separation from white blood cells, which contain genomic DNA. After the red blood cells are lysed and removed, the white blood cells are then lysed with an anionic detergent in the presence of a DNA stabilizer, which limits the activity of DNase. Contaminating RNA is then degraded with RNase, and the RNA, proteins, and other contaminants are then removed by salt precipitation. The genomic DNA is recovered by alcohol preciptation, dissolved in TE buffer, and stored. Because the genomic DNA will be used in a nucleic acid amplification -17-
method, it is advisable to also have a "blank" control tube (containing reagents but no blood) accompany the blood sample tube through the extraction process. After extraction, the "DNA" from the "blank" control tube would be amplified to ensure that no extraneous DNA has contaminated the extraction process Isolation of genomic DNA from whole blood can be accomplished by following any of a variety of protocols, including using the PUREGENE® kit (Gentra Systems, Minneapolis, Minnesota, USA), and following the manufacturer's instructions. Place 30 ml of RBC Lysis Solution into a 50 ml tube, add 7 ml to 10 ml of whole blood, mix by inverting several times, and incubate for 10 minutes at room temperature. Invert the tube again at least once during the incubation.
Centrifuge the tube for 10 minutes at 2,000 x g, pour off the supernatant, leaving behind the visible white cell pellet and about 200 μl of residual liquid. Vortex the tube vigorously for 20 seconds to resuspend the cells in the residual liquid. Add 10 ml of Cell Lysis + RNase A (made fresh that day), and vortex on high speed for 10 seconds. Incubate the tube at 37 °C for 15 to 30 minutes to allow digestion of the RNA.
Cool the sample to room temperature by placing in an ice bath for 10 minutes. Add 3.33 ml of the Protein Precipitation Solution (Gentra Systems, Minneapolis, Minnesota, USA) into the tube. Vortex at high speed for 20 seconds to mix uniformly. Centrifuge at 2,000 x g for 10 minutes. If a tight, dark brown pellet is not formed, repeat the 20-second vortex, followed by a 5 -minute incubation on ice, and repeat the 10-minute centrifugation at 2,000 x g.
Pour off the supernatant into a clean 50 ml tube containing 10 ml of 100% isopropanol. Mix by inverting gently 50 times (do not vortex, or the DNA will be sheared). The DNA is stable at this point, and can be stored indefinitely in the isopropanol.
Centrifuge at 2,000 x g for 3 minutes. Carefully pour off the supernatant, leaving behind the white pellet, and drain the tube upside down on clean absorbent paper. Add 10 ml of 70% ethanol, and wash the pellet by inverting gently, avoiding dislodging the pellet. Centrifuge at 2,000 x g for 1 minute. Carefully pour off the -18-
ethanol, leaving the pellet behind. Invert carefully so as to not dislodge the pellet, and drain the tube on clean absorbent paper for 10 minutes.
Add 1 ml of DNA Hydration Solution (Gentra Systems, Minneapolis, Minnesota, USA), and rehydrate the DNA by incubating at 65 °C for 1 hour, overnight at room temperature, and at 65 °C 1 hour the next day. Tap the tube periodically to help disperse the DNA. The DNA in solution can be stored indefinitely at 4°C.
Example 3: Preparation of Blood Sample With FT A™ Paper
A blood sample is drawn from a human. Two μl of blood are placed on a piece of FTA™paper (FITZCO, Inc., Maple Plain, Minnesota, USA), dried, and stored until ready to be processed.
To analyze the polymoφhisms in the sample, a 1 mm disc is punched directly into a 2 ml microcentrifuge tube, and 200 μl of FT A™ purification reagent is placed on the disc. The tube is capped, vortexed for 3-5 seconds, then centrifuged in a microcentrifuge at 12,000 x g for 30 seconds. The wash solution is then aspirated and discarded. The wash is then repeated with another 200 μl of purification reagent. After the second wash solution has been aspirated and discarded, the disc is washed twice with TE as follows: 200 μl of TE buffer is added, and the disc vortexed for 3-5 seconds, the tube and disc are then centrifuged at 12,000 x g for 30 seconds and the filtrate removed and discarded. After the disc has been washed twice with TE, the disc is subjected to polymoφhism analysis.
Example 4: Analysis of Polymoφhisms in a Blood Sample
A 1 mm punch of FT A™ paper containing a blood sample, processed as described in Example 2, supra, is placed in a 0.5 ml tube, and tested with the AmpE/STR Profiler Plus™ system (Perkin Elmer Applied Biosystems, Foster City, California, USA), according to the manufacturer's instructions. In general, to the tube is added 10.5 μl of Profiler Plus Reaction Mixture, 0.5 μl of Taq Gold, and 5.5 μl of Primer Mixture. The tube is sealed, and placed in a thermocycler under the following conditions: 95 °C for 11 minutes, followed by 24 cycles of: 94 °C for 1 -19-
minute, 59°C for 1 minute, 72°C for one minute. After the 25th cycle, the reaction mixture is placed at 60 °C for up to 83 minutes. After thermocycling is complete, the reaction is held at 4°C until ready for gel electrophoresis.
Example 5: Analysis of Amplification Products by Gel Electrophoresis Five μl of amplification product (produced as described above in Example 3) are mixed with 5 μl of 2X loading buffer (0.25% Bromphenol Blue, 12.5% Ficoll 400, 50 mM EDTA, 5X TAN (10X TAN: 0.4 M Tris, 40 mM Na Acetate Trihydrate, 10 mM EDTA, pH to 7.9 with acetic acid)). The 10 μl mixture is loaded into a well in a 1% agarose gel prepared with TE buffer and containing 0.5 μg of etiiidium bromide per ml of agarose gel. Appropriate size ladder is also loaded on the gel. The gel is then electrophoresed in TAE buffer for 1 hour at 100 volts, and then illuminated with UV light on a fransilluminator, and photographed. The bands in the photograph are then compared to the literature supplied by the manufacturer to determine the precise alleles present in the sample.
Example 6: Creation of the Unique identifier
A set of blood samples was prepared and tested with the AmpE/STR Profiler Plus™ system as described in Examples 2 through 4, and the results are shown in Table 2.
Table 2. Alleles found in seven human individuals when tested for eight STR loci in the AmpE/STR Profiler Plus™ system (PE Applied Biosystems, Foster City, California, USA).
Locus #1 #2 #3 #4 #5 #6 #7
D3S1358 15 14,15 15,18 16,17 15,17 17,18 15 vWA 15,18 15,16 13,14 16,17 15,20 17,19 14,15
FGA 19,24 20-21 22,24 23 21,22 21,23 24,28
Amelogenin χ,γ χ,γ X,Y X,Y X X X
D8S1179 12,15 13,15 12,14 13,15 14 13 14,15
D21S11 33.2 29,30 30,31.2 29,32 28,31 28,33.2 32.2,38
Figure imgf000021_0001
-20-
D5S818 12 11,13 8,12 1012 13,14 11 12,13
D13S317 12,13 11,13 11,12 9,12 12,14 9,14 12
D7S820 8,9 8,12 10,11 8,11 10,12 10,11 9,10
D18S51 11,15 9,14.2 13.2,20 18,24 9,18 10,12 10,18
Figure imgf000022_0001
The polymoφhism data for a sample can be coded in a number of ways. The raw data for individual #1, for example, is as follows:
D3S1358 15;vWA 15,18;FGA 19,24;Amelogenin XN;D8S1179 12,15; D21S 11,33.2;D5S818 12;D13S317 12,13;D7S820 8,9;D18S51 11,15
This data can be used "raw" as the unique identifier (i.e., "as is," as above), with no alteration. For repositories with very large numbers of samples, this may be desireable, as it is the most "foolproof method.
Alternatively, the STR loci can be "coded," i.e., each locus represented by a combination of numbers or letters, e.g., D3S1358 can be represented by "A" or "01," vWA by "B" or "02," etc. The raw data so coded would then be:
A,15,B,15,18,C,19,24,D,X,YJ.i,12,15,F,33.2,G,12,H,12,13,I,8,9,J,ll,15, or
01,15,02,15,18,03,19,24,04^X05,12,15,06,33.2,07,12,08,12,13,09,8,9,10,11,15
All patents, patent applications, and references cited above are hereby incoφorated by reference in their entirety. While this invention has been particularly shown and described with references to preferred embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims

-21-CLAIMSWhat is claimed is:
1. A method for producing a unique identifier for a biological sample, the method comprising: (a) detecting one or more polymoφhisms within the biological sample; and (b) selecting one or more polymoφhisms sufficient to form a unique identifier; thereby producing a unique identifier for a biological sample.
2. The method of Claim 1 , wherein the biological sample is taken from an organism selected from the group consisting of: vertebrates, invertebrates, plants, and microorganisms.
3. The method of Claim 1 , wherein the biological sample is from a mammal.
4. The method of Claim 3, wherein the mammal is a human.
5. The method of Claim 3 , wherein the biological sample is selected from a group consisting of blood, saliva, hair, body fluid, tissues, organs, and one or more cells.
6. The method of Claim 5, wherein the polymoφhism is selected from the group consisting of nucleic acid polymoφhisms, protein polymoφhisms, enzyme polymoφhisms, chemical polymoφhisms, biochemical polymoφhisms, phenotypic polymoφhisms, and quantitative polymoφhisms. -22-
7. The method of Claim 6, wherein the polymoφhism is a nucleic acid sequence polymoφhism.
8. The method of Claim 7, wherein the polymoφhism is a nucleic acid length polymoφhism.
9. The method of Claim 8, wherein the polymoφhism is a short tandem repeat (STR).
10. The method of Claim 1 , wherein the unique identifier is also linked to the source of the biological sample.
11. The method of Claim 1 , wherein the unique identifier is also linked to relevant information about the biological sample or the source of the biological sample.
12. The method of Claim 1 , wherein the unique identifier is selected from the group consisting of an alphanumeric string, and a bar code.
13. A method for estabhshing a repository containing a collection of biological samples, wherein each biological sample has a unique identifier associated with it, the method comprising:
(a) obtaining a biological sample from a source;
(b) detecting one or more polymoφhisms in the sample;
(c) selecting one or more polymoφhisms sufficient to form a unique identifier;
(d) using the unique identifier to identify the sample;
(e) storing the sample with the unique identifier;
(f) repeating steps (a) through (e) for biological samples from other sources; -23-
thereby establishing a repository containing a collection of biological samples, wherein each such biological sample has a unique identifier associated with it.
14. The method of Claim 13, wherein the samples are DNA-containing samples.
15. The method of Claim 13, wherein the sample source is a human.
16. The method of Claim 13, wherein the polymoφhism is selected from the group consisting of nucleic acid polymoφhisms, protein polymoφhisms, enzyme polymoφhisms, chemical polymoφhisms, biochemical polymoφhisms, phenotypic polymoφhisms, and quantitative polymoφhisms.
17. The method of Claim 16, wherein the polymoφhism is a short tandem repeat (STR).
18. The method of Claim 13, wherein the unique identifier is selected from the group consisting of an alphanumeric string, and a bar code.
19. The method of Claim 13, wherein the unique identifier is also linked to the source of the biological sample.
20. The method of Claim 13, wherein the unique identifier is also linked to relevant information about the biological sample or the source of the biological sample.
21. A method of determining, by means of a unique identifier, if a source is represented by a sample within the repository of Claim 15, the method comprising: (a) obtaining a sample from the source; -24-
(b) detecting one or more polymoφhisms in the sample;
(c) selecting one or more polymoφhisms sufficient to form a unique identifier, wherein the polymoφhisms selected are those used to form unique identifiers for the samples within the repository; (d) comparing the unique identifier of (c) to the unique identifier of each sample in the repository; wherein shared identity between the unique identifier of (c) to a unique identifier of a sample in the repository indicates that the source is represented by a sample within the repository.
22. The method of Claim 21, wherem the samples are DNA-containing samples.
23. The method of Claim 21 , wherein the source of the sample is a human.
24. The method of Claim 21, wherein the polymoφhism is selected fromihe group consisting of nucleic acid polymoφhisms, protein polymoφhisms, enzyme polymoφhisms, chemical polymoφhisms, biochemical polymoφhisms, phenotypic polymoφhisms, and quantitative polymoφhisms.
25. The method of Claim 24, wherein the polymoφhism is a short tandem repeat (STR).
26. The method of Claim 21 , wherein the unique identifier is also linked to the source of the biological sample.
27. The method of Claim 21, wherein the umque identifier is also linked to relevant information about the biological sample or the source of the biological sample. -25-
28. The method of Claim 21, wherein the unique identifier is selected from the group consisting of an alphanumeric string, and a bar code.
29. A method for linking, by means of a unique identifier, a member of a first group with a member of a second group, wherein the first group comprises a biological sample lacking a unique identifier, and a source of a biological sample lacking a unique identifier, and wherein the second group comprises a biological sample having a unique identifier, a source of a biological sample having a unique identifier, or information having a unique identifier, the method comprising: (a) detecting one or more polymoφhisms in the member of the first group; (b) selecting one or more polymoφhisms sufficient to form a unique identifier, wherein the polymoφhisms selected are those used to form the unique identifier for the members of the second group; (c) comparing the unique identifier of the member of the first group to the unique identifier of the member of the second group; wherein shared identity between the unique identifier of the member of the first group and the unique identifier of member of the second group, links the member of the first group with the member of the second group.
PCT/US1999/004094 1998-02-26 1999-02-25 Unique identifier for biological samples WO1999043855A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
CA002321963A CA2321963A1 (en) 1998-02-26 1999-02-25 Unique identifier for biological samples
JP2000533594A JP2002505088A (en) 1998-02-26 1999-02-25 Unique identifier of biological sample
EP99908461A EP1056888A1 (en) 1998-02-26 1999-02-25 Unique identifier for biological samples
AU27892/99A AU2789299A (en) 1998-02-26 1999-02-25 Unique identifier for biological samples

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US7608198P 1998-02-26 1998-02-26
US60/076,081 1998-02-26

Publications (1)

Publication Number Publication Date
WO1999043855A1 true WO1999043855A1 (en) 1999-09-02

Family

ID=22129826

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1999/004094 WO1999043855A1 (en) 1998-02-26 1999-02-25 Unique identifier for biological samples

Country Status (5)

Country Link
EP (1) EP1056888A1 (en)
JP (1) JP2002505088A (en)
AU (1) AU2789299A (en)
CA (1) CA2321963A1 (en)
WO (1) WO1999043855A1 (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002038804A1 (en) * 2000-11-08 2002-05-16 Agrobiogen Gmbh Biotechnologie Method for marking samples containing dna by means of oligonucleotides
US6534307B1 (en) 2001-02-08 2003-03-18 Clinomics Biosciences, Inc. Frozen tissue microarrayer
US6713258B2 (en) 2000-03-29 2004-03-30 Center For The Application Of Molecular Biology To International Agriculture (Cambia) Methods for genotyping by hybridization analysis
US6716619B1 (en) 2001-02-08 2004-04-06 Clinomics Biosciences, Inc. Stylet for use with tissue microarrayer and molds
EP1842926A1 (en) * 2006-03-10 2007-10-10 Epigenomics AG A method of identifying a biological sample for methylation analysis
US7860727B2 (en) 2003-07-17 2010-12-28 Ventana Medical Systems, Inc. Laboratory instrumentation information management and control network
EP2411933A2 (en) * 2009-03-23 2012-02-01 Intresco B.V. Methods for providing a set of symbols uniquely distinguishing an organism such as a human individual
US8278034B2 (en) 2000-06-22 2012-10-02 Nuclea Biotechnologies, Inc. Methods of making frozen tissue microarrays
US8719053B2 (en) 2003-07-17 2014-05-06 Ventana Medical Systems, Inc. Laboratory instrumentation information management and control network
WO2024003114A1 (en) 2022-06-29 2024-01-04 Actome Gmbh Detection of biomolecules in single cells

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2556158B1 (en) * 2010-04-08 2019-12-18 Qiagen GmbH Chromatographic device and method for isolating and purifying nucleic acids

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1992013969A1 (en) * 1991-01-31 1992-08-20 Baylor College Of Medicine Dna typing with short tandem repeat polymorphisms and identification of polymorphic short tandem repeats
WO1996010648A2 (en) * 1994-09-30 1996-04-11 Promega Corporation Multiplex amplification of short tandem repeat loci
WO1996019586A1 (en) * 1994-12-22 1996-06-27 Visible Genetics Inc. Method and composition for internal identification of samples
US5556955A (en) * 1991-08-22 1996-09-17 Etat Francais Represente Par Le Delegue General Pour A'armement Process for detection of new polymorphic loci in a DNA sequence, nucleotide sequences forming hybridization probes and their applications
WO1998014275A1 (en) * 1996-10-04 1998-04-09 Intronn Llc Sample collection devices and methods using markers and the use of such markers as controls in sample validation, laboratory evaluation and/or accreditation

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1992013969A1 (en) * 1991-01-31 1992-08-20 Baylor College Of Medicine Dna typing with short tandem repeat polymorphisms and identification of polymorphic short tandem repeats
US5556955A (en) * 1991-08-22 1996-09-17 Etat Francais Represente Par Le Delegue General Pour A'armement Process for detection of new polymorphic loci in a DNA sequence, nucleotide sequences forming hybridization probes and their applications
WO1996010648A2 (en) * 1994-09-30 1996-04-11 Promega Corporation Multiplex amplification of short tandem repeat loci
WO1996019586A1 (en) * 1994-12-22 1996-06-27 Visible Genetics Inc. Method and composition for internal identification of samples
WO1998014275A1 (en) * 1996-10-04 1998-04-09 Intronn Llc Sample collection devices and methods using markers and the use of such markers as controls in sample validation, laboratory evaluation and/or accreditation

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
SHUBER A P ET AL: "HIGH THROUGHPUT PARALLEL ANALYSIS OF HUNDREDS OF PATIENT SAMPLES FOR MORE THAN 100 MUTATIONS IN MULTIPLE DISEASE GENES", HUMAN MOLECULAR GENETICS, vol. 6, no. 3, 1 March 1997 (1997-03-01), pages 337 - 347, XP000673446 *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6713258B2 (en) 2000-03-29 2004-03-30 Center For The Application Of Molecular Biology To International Agriculture (Cambia) Methods for genotyping by hybridization analysis
US8278034B2 (en) 2000-06-22 2012-10-02 Nuclea Biotechnologies, Inc. Methods of making frozen tissue microarrays
WO2002038804A1 (en) * 2000-11-08 2002-05-16 Agrobiogen Gmbh Biotechnologie Method for marking samples containing dna by means of oligonucleotides
US6534307B1 (en) 2001-02-08 2003-03-18 Clinomics Biosciences, Inc. Frozen tissue microarrayer
US6582967B2 (en) 2001-02-08 2003-06-24 Clinomics Biosciences, Inc. Method of forming a microarray from frozen tissue
US6716619B1 (en) 2001-02-08 2004-04-06 Clinomics Biosciences, Inc. Stylet for use with tissue microarrayer and molds
US7860727B2 (en) 2003-07-17 2010-12-28 Ventana Medical Systems, Inc. Laboratory instrumentation information management and control network
US8719053B2 (en) 2003-07-17 2014-05-06 Ventana Medical Systems, Inc. Laboratory instrumentation information management and control network
US8812329B2 (en) 2003-07-17 2014-08-19 Ventana Medical Systems, Inc. Laboratory instrumentation information management and control network
EP1842926A1 (en) * 2006-03-10 2007-10-10 Epigenomics AG A method of identifying a biological sample for methylation analysis
EP2411933A2 (en) * 2009-03-23 2012-02-01 Intresco B.V. Methods for providing a set of symbols uniquely distinguishing an organism such as a human individual
WO2024003114A1 (en) 2022-06-29 2024-01-04 Actome Gmbh Detection of biomolecules in single cells

Also Published As

Publication number Publication date
JP2002505088A (en) 2002-02-19
AU2789299A (en) 1999-09-15
CA2321963A1 (en) 1999-09-02
EP1056888A1 (en) 2000-12-06

Similar Documents

Publication Publication Date Title
Bessey et al. Passive eDNA collection enhances aquatic biodiversity analysis
Ushio et al. Environmental DNA enables detection of terrestrial mammals from forest pond water
Butler Advanced topics in forensic DNA typing: methodology
Butler Forensic DNA typing: biology & technology behind STR markers
Parker et al. What molecules can tell us about populations: choosing andusing a molecular marker
US9809840B2 (en) Reference markers for biological samples
Visvikis et al. DNA extraction and stability for epidemiological studies
WO1999043855A1 (en) Unique identifier for biological samples
CN104769111A (en) Methods for one step nucleic acid amplification of non-eluted samples
Stevens et al. Molecular genetic methods for Forensic Entomology
Thomas et al. [30] DNA sequences from old tissue remains
Brinkmann Overview of PCR-based systems in identity testing
Coyle Nonhuman DNA typing: theory and casework applications
Amar et al. Employing of the amplified fragment length polymorphism (AFLP) methodology as an efficient population genetic tool for symbiotic cnidarians
Phillips Crime scene genetics: transforming forensic science through molecular technologies
Behl et al. Tools and techniques used in forensic DNA typing
Wells et al. Molecular methods for forensic entomology
WO2020180670A1 (en) Systems and methods for separating decoded arrays
US20080177477A1 (en) Compositions and Methods Comprising Biological Samples for Quality Controls
Association for Molecular Pathology Training and Education Committee Goals and objectives for molecular pathology education in residency programs
Weedn et al. DNA analysis
WO2010010194A1 (en) Kit and method for evaluating detection properties in amplification reactions
Weedn et al. DNA testing in the forensic laboratory
Lee Forensic Biology
Mohammed Study of Autosomal STR Markers in United Arab Emirates Population

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW SD SL SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application
ENP Entry into the national phase

Ref document number: 2321963

Country of ref document: CA

Ref country code: CA

Ref document number: 2321963

Kind code of ref document: A

Format of ref document f/p: F

ENP Entry into the national phase

Ref country code: JP

Ref document number: 2000 533594

Kind code of ref document: A

Format of ref document f/p: F

WWE Wipo information: entry into national phase

Ref document number: 1999908461

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: KR

WWP Wipo information: published in national office

Ref document number: 1999908461

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

WWW Wipo information: withdrawn in national office

Ref document number: 1999908461

Country of ref document: EP