US20180291439A1 - High throughput detection of molecular markers based on aflp and high through-put sequencing - Google Patents
High throughput detection of molecular markers based on aflp and high through-put sequencing Download PDFInfo
- Publication number
- US20180291439A1 US20180291439A1 US16/000,252 US201816000252A US2018291439A1 US 20180291439 A1 US20180291439 A1 US 20180291439A1 US 201816000252 A US201816000252 A US 201816000252A US 2018291439 A1 US2018291439 A1 US 2018291439A1
- Authority
- US
- United States
- Prior art keywords
- sequence
- sample
- aflp
- kit according
- primer
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 239000002773 nucleotides Substances 0 abstract claims description 43
- 125000003729 nucleotide group Chemical group 0 abstract claims description 42
- 150000007523 nucleic acids Chemical class 0 claims description 33
- 239000011324 beads Substances 0 claims description 31
- 108010076804 DNA Restriction Enzymes Proteins 0 claims description 24
- 238000003752 polymerase chain reaction Methods 0 claims description 22
- 230000000295 complement Effects 0 claims description 19
- 229920001850 Nucleic acid sequence Polymers 0 claims description 10
- 238000000137 annealing Methods 0 claims description 10
- 108020004707 nucleic acids Proteins 0 claims description 9
- 108020000887 polymerases Proteins 0 claims description 4
- 102000003640 polymerases Human genes 0 claims description 4
- 108020003180 Ligase family Proteins 0 claims description 2
- 239000007787 solids Substances 0 claims description 2
- 102000005965 Ligase family Human genes 0 claims 1
- 239000003147 molecular marker Substances 0 abstract 1
- 229920003013 deoxyribonucleic acids Polymers 0 description 58
- 230000003321 amplification Effects 0 description 31
- 238000003199 nucleic acid amplification method Methods 0 description 31
- 238000005516 engineering processes Methods 0 description 27
- 239000000203 mixtures Substances 0 description 22
- 239000003550 marker Substances 0 description 21
- 102000004190 enzymes Human genes 0 description 16
- 108090000790 enzymes Proteins 0 description 16
- 229940088598 Enzyme Drugs 0 description 15
- 239000000499 gels Substances 0 description 15
- 229920002287 Amplicon Polymers 0 description 11
- 230000037230 mobility Effects 0 description 9
- 238000003786 synthesis Methods 0 description 9
- 230000027455 binding Effects 0 description 8
- 238000009739 binding Methods 0 description 8
- 238000001962 electrophoresis Methods 0 description 8
- 230000002068 genetic Effects 0 description 8
- 101700082998 ENRN family Proteins 0 description 7
- 101700026665 LORF2 family Proteins 0 description 7
- 102100005410 LORF2_HUMAN Human genes 0 description 7
- 101700006494 NUCA family Proteins 0 description 7
- 229920000272 Oligonucleotide Polymers 0 description 7
- 101700009464 PO11 family Proteins 0 description 7
- 101700053215 PO12 family Proteins 0 description 7
- 101700047511 PO13 family Proteins 0 description 7
- 101700082423 PO14 family Proteins 0 description 7
- 101700014043 PO21 family Proteins 0 description 7
- 101700007740 PO22 family Proteins 0 description 7
- 101700010127 PO23 family Proteins 0 description 7
- 101700005691 PO24 family Proteins 0 description 7
- 101700030467 POL family Proteins 0 description 7
- 101700040843 POL1 family Proteins 0 description 7
- 101700021625 POL2 family Proteins 0 description 7
- 101700023985 POL3 family Proteins 0 description 7
- 101700027061 POL4 family Proteins 0 description 7
- 101700067012 POL5 family Proteins 0 description 7
- 101700063765 POLR family Proteins 0 description 7
- 101700035922 POLX family Proteins 0 description 7
- 101700015626 POLY family Proteins 0 description 7
- 108020004682 Single-Stranded DNA Proteins 0 description 7
- 238000000034 methods Methods 0 description 7
- 230000001721 combination Effects 0 description 6
- 239000003921 oil Substances 0 description 6
- 239000000047 products Substances 0 description 6
- 229920000160 (ribonucleotides)n+m Polymers 0 description 4
- 101700051619 PDLI4 family Proteins 0 description 4
- 229940081202 RNA Drugs 0 description 4
- 230000000875 corresponding Effects 0 description 4
- 230000029087 digestion Effects 0 description 4
- 238000001502 gel electrophoresis Methods 0 description 4
- 238000009396 hybridization Methods 0 description 4
- 210000001736 Capillaries Anatomy 0 description 3
- 101700035985 IDS family Proteins 0 description 3
- 101700015915 NCASE family Proteins 0 description 3
- 239000002004 ayurvedic oil Substances 0 description 3
- 238000005251 capillar electrophoresis Methods 0 description 3
- 125000002637 deoxyribonucleotide group Chemical group 0 description 3
- 239000005547 deoxyribonucleotides Substances 0 description 3
- 201000010099 diseases Diseases 0 description 3
- 230000001976 improved Effects 0 description 3
- 230000001965 increased Effects 0 description 3
- 238000005070 sampling Methods 0 description 3
- 239000007790 solid phases Substances 0 description 3
- 238000010186 staining Methods 0 description 3
- 229920002676 Complementary DNA Polymers 0 description 2
- 241000196324 Embryophyta Species 0 description 2
- 108010042407 Endonucleases Proteins 0 description 2
- 102000004533 Endonucleases Human genes 0 description 2
- 241000282414 Homo sapiens Species 0 description 2
- XPPKVPWEQAFLFU-UHFFFAOYSA-J Pyrophosphate Chemical compound   [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0 description 2
- 240000008042 Zea mays Species 0 description 2
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0 description 2
- 235000002017 Zea mays subsp mays Nutrition 0 description 2
- 238000004458 analytical methods Methods 0 description 2
- 238000004873 anchoring Methods 0 description 2
- 230000015572 biosynthetic process Effects 0 description 2
- 238000006243 chemical reaction Methods 0 description 2
- 239000003153 chemical reaction reagent Substances 0 description 2
- 239000002299 complementary DNA Substances 0 description 2
- 235000011180 diphosphates Nutrition 0 description 2
- 238000009826 distribution Methods 0 description 2
- 239000000839 emulsions Substances 0 description 2
- 239000011133 lead Substances 0 description 2
- 235000009973 maize Nutrition 0 description 2
- 238000003976 plant breeding Methods 0 description 2
- 229920000642 polymers Polymers 0 description 2
- 230000001603 reducing Effects 0 description 2
- 238000004805 robotics Methods 0 description 2
- 238000003530 single readout Methods 0 description 2
- 239000000126 substances Substances 0 description 2
- 230000002194 synthesizing Effects 0 description 2
- 229930002728 5-Methyluracil Natural products 0 description 1
- 229960000643 Adenine Drugs 0 description 1
- 229930011612 Adenine Natural products 0 description 1
- 229920000936 Agarose Polymers 0 description 1
- 206010001897 Alzheimer's diseases Diseases 0 description 1
- 108020004998 Chloroplast DNA Proteins 0 description 1
- 229920000138 Chloroplast DNA Polymers 0 description 1
- 108060001656 ClpE family Proteins 0 description 1
- OPTASPLRGRRNAP-UHFFFAOYSA-N Cytosine Chemical compound   NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0 description 1
- 229940104302 Cytosine Drugs 0 description 1
- 239000003155 DNA primer Substances 0 description 1
- 101700032932 DPO1 family Proteins 0 description 1
- 101700065272 DPO2 family Proteins 0 description 1
- 101700034367 DPOL family Proteins 0 description 1
- 101700011961 DPOM family Proteins 0 description 1
- 101700006428 EIF3A family Proteins 0 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N Ethidium bromide Chemical compound   [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0 description 1
- UYTPUPDQBNUYGX-UHFFFAOYSA-N Guanine Chemical compound   O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0 description 1
- 229940104228 Guanine Drugs 0 description 1
- 229920001681 Heteroduplex Polymers 0 description 1
- 101700002776 IKZF2 family Proteins 0 description 1
- 101700028499 LECG family Proteins 0 description 1
- 229920002320 Organellar DNA Polymers 0 description 1
- 230000035980 PAA Effects 0 description 1
- 101700049810 PAT family Proteins 0 description 1
- 229920002224 Peptide nucleic acid Polymers 0 description 1
- KDCGOANMDULRCW-UHFFFAOYSA-N Purine Chemical compound   N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0 description 1
- 229920001914 Ribonucleotide Polymers 0 description 1
- 108010006785 Taq Polymerase Proteins 0 description 1
- 229940113082 Thymine Drugs 0 description 1
- RWQNBRDOKXIBIV-UHFFFAOYSA-N Thymine Chemical compound   CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0 description 1
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound   O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0 description 1
- 229930002907 Uracil Natural products 0 description 1
- 229940035893 Uracil Drugs 0 description 1
- 238000007792 addition Methods 0 description 1
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound   C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0 description 1
- -1 animal Species 0 description 1
- 238000003975 animal breeding Methods 0 description 1
- 230000003935 attention Effects 0 description 1
- 238000005047 biotechnology Methods 0 description 1
- 238000004364 calculation methods Methods 0 description 1
- 201000011510 cancer Diseases 0 description 1
- 210000004027 cells Anatomy 0 description 1
- 238000003776 cleavage Methods 0 description 1
- 238000007796 conventional methods Methods 0 description 1
- 230000002596 correlated Effects 0 description 1
- 238000007405 data analysis Methods 0 description 1
- 230000036425 denaturation Effects 0 description 1
- 238000004925 denaturation Methods 0 description 1
- 230000001809 detectable Effects 0 description 1
- 238000003745 diagnosis Methods 0 description 1
- 238000004945 emulsification Methods 0 description 1
- 230000001804 emulsifying Effects 0 description 1
- 230000002255 enzymatic Effects 0 description 1
- 238000006911 enzymatic reaction Methods 0 description 1
- 235000013305 food Nutrition 0 description 1
- 238000005194 fractionation Methods 0 description 1
- 239000000727 fractions Substances 0 description 1
- 238000006062 fragmentation Methods 0 description 1
- 238000000338 in vitro Methods 0 description 1
- 238000003780 insertion Methods 0 description 1
- 238000005304 joining Methods 0 description 1
- 230000004301 light adaptation Effects 0 description 1
- 230000002438 mitochondrial Effects 0 description 1
- 238000002156 mixing Methods 0 description 1
- 230000004048 modification Effects 0 description 1
- 238000006011 modification Methods 0 description 1
- 238000009740 moulding (composite fabrication) Methods 0 description 1
- 235000016709 nutrition Nutrition 0 description 1
- 230000035764 nutrition Effects 0 description 1
- 229920001888 polyacrylic acid Polymers 0 description 1
- 125000002924 primary amino group Chemical group   [H]N([H])* 0 description 1
- 238000000746 purification Methods 0 description 1
- 150000003230 pyrimidines Chemical class 0 description 1
- 230000002285 radioactive Effects 0 description 1
- 238000009790 rate-determining step (RDS) Methods 0 description 1
- 230000002829 reduced Effects 0 description 1
- 238000006722 reduction reaction Methods 0 description 1
- 238000009877 rendering Methods 0 description 1
- 230000002441 reversible Effects 0 description 1
- 125000002652 ribonucleotide group Chemical group 0 description 1
- 239000002336 ribonucleotides Substances 0 description 1
- 238000005204 segregation Methods 0 description 1
- 239000004332 silver Substances 0 description 1
- 241000894007 species Species 0 description 1
- 238000006467 substitution reaction Methods 0 description 1
- 210000001519 tissues Anatomy 0 description 1
- 230000035899 viability Effects 0 description 1
- 230000004304 visual acuity Effects 0 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
- C12Q1/6853—Nucleic acid amplification reactions using modified primers or templates
- C12Q1/6855—Ligating adaptors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
- C12Q1/6853—Nucleic acid amplification reactions using modified primers or templates
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2525/00—Reactions involving modified oligonucleotides, nucleic acids, or nucleotides
- C12Q2525/10—Modifications characterised by
- C12Q2525/155—Modifications characterised by incorporating/generating a new priming site
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2525/00—Reactions involving modified oligonucleotides, nucleic acids, or nucleotides
- C12Q2525/10—Modifications characterised by
- C12Q2525/191—Modifications characterised by incorporating an adaptor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2535/00—Reactions characterised by the assay type for determining the identity of a nucleotide base or a sequence of oligonucleotides
- C12Q2535/122—Massive parallel sequencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2535/00—Reactions characterised by the assay type for determining the identity of a nucleotide base or a sequence of oligonucleotides
- C12Q2535/138—Amplified fragment length polymorphism [AFLP]
Abstract
The present invention relates to a high throughput method for the identification and detection of molecular markers wherein restriction fragments are generated and suitable adaptors comprising (sample-specific) identifiers are ligated. The adapter-ligated restriction fragments may be selectively amplified with adaptor compatible primers carrying selective nucleotides at their 3′ end. The amplified adapter-ligated restriction fragments are, at least partly, sequenced using high throughput sequencing methods and the sequence parts of the restriction fragments together with the sample-specific identifiers serve as molecular marker.
Description
- This application is a continuation of U.S. application Ser. No. 14/285,430, filed May 22, 2014, which is a continuation of U.S. application Ser. No. 13/449,629, filed Apr. 18, 2012, now abandoned, which is a continuation of U.S. application Ser. No. 13/364,799, filed Feb. 2, 2012, now abandoned, which is a continuation of U.S. application Ser. No. 12/296,009, filed Feb. 6, 2009, now abandoned, which is the U.S. National Stage of International Application No. PCT/NL2007/000094, filed Apr. 4, 2007; which claims priority to U.S. Provisional Application Nos. 60/788,706, filed Apr. 4, 2006; and 60/880,052, filed Jan. 12, 2007. Each of these applications is incorporated herein by reference in its entirety.
- The instant application contains a Sequence Listing which has been submitted in ASCII format via EFS-WEB and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Jun. 5, 2018, is named 085342-2600SequenceListing.txt and is 2 KB.
- The present invention relates to the field of molecular biology and biotechnology. In particular, the invention relates to the field of nucleic acid detection identification. More in particular the invention relates to methods for the detection and identification of markers, in particular molecular markers. The invention is concerned with the provision of high throughput methods for the detection and identification of molecular markers. The invention further relates to the application of the method in the identification of and/or detection of nucleotide sequences that are related to a wide variety of genetic traits, genes, haplotypes and combinations thereof. The invention can be used in the field of high throughput detection and identification of molecular markers from any origin, be it plant, animal, human, artificial or otherwise.
- Exploration of genomic DNA has long been desired by the scientific, in particular medical, community. Genomic DNA holds the key to identification, diagnosis and treatment of diseases such as cancer and Alzheimer's disease. In addition to disease identification and treatment, exploration of genomic DNA may provide significant advantages in plant and animal breeding efforts, which may provide answers to food and nutrition problems in the world.
- Many diseases are known to be associated with specific genetic components, in particular with polymorphisms in specific genes. The identification of polymorphisms in large samples such as genomes is at present a laborious and time-consuming task. However, such identification is of great value to areas such as biomedical research, developing pharmacy products, tissue typing, genotyping and population studies.
- Markers, i.c. genetic markers, have been used for a very long time as a genetic typing method, i.e. to connect a phenotypic trait to the presence, absence or amount of a particular part of DNA (gene). One of the most versatile genetic typing technologies is AFLP, already around for many years and widely applicable to any organism (for reviews see Savelkoul et al. J. Clin. Microbiol, 1999, 37(10), 3083-3091; Bensch et al. Molecular Ecology, 2005, 14, 2899-2914)
- The AFLP technology (Zabeau & Vos, 1993; Vos et al., 1995) has found widespread use in plant breeding and other field since its invention in the early nineties. This is due to several characteristics of AFLP, of which the most important is that no prior sequence information is needed to generate large numbers of genetic markers in a reproducible fashion. In addition, the principle of selective amplification, a cornerstone of AFLP, ensures that the number of amplified fragments can be brought in line with the resolution of the detection system, irrespective of genome size or origin.
- Detection of AFLP fragments is commonly carried out by electrophoresis on slab-gels (Vos et al., 1995) or capillary electrophoresis (van der Meulen et al., 2002). The majority of AFLP markers scored in this way represent (single nucleotide) polymorphisms occurring either in the restriction enzyme recognition sites used for AFLP template preparation or their flanking nucleotides covered by selective AFLP primers. The remainder of the AFLP markers are insertion/deletion polymorphisms occurring in the internal sequences of the restriction fragments and a very small fraction on single nucleotide substitutions occurring in small restriction fragments (<approximately 100 bp), which for these fragments cause reproducible mobility variations between both alleles which can be observed upon electrophoresis; these AFLP markers can be scored co-dominantly without having to rely on band intensities.
- In a typical AFLP fingerprint, the AFLP markers therefore constitute the minority of amplified fragments (less than 50 percent but often less than 20 percent), while the remainder are commonly referred to as constant AFLP fragments. The latter are nevertheless useful in the gel scoring procedure as they serve as anchor points to calculate fragments mobilities of AFLP markers and aid in quantifying the markers for co-dominant scoring. Co-dominant scoring (scoring for homo- or heterozygosity) of AFLP markers currently is restricted to the context of fingerprinting a segregating population. In a panel of unrelated lines, only dominant scoring is possible.
- Although the throughput of AFLP is very high due to high multiplexing levels in the amplification and detection steps, the rate limiting step is the resolving power of electrophoresis. Electrophoresis allows unique identification of the majority of amplified fragments based on the combination of restriction enzyme combinations (EC), primer combinations (PC) and mobility, but electrophoresis is only capable to distinguish the amplified fragments based on differences in mobility. Fragments of similar mobility are often found as so-called ‘stacked bands’ and with electrophoresis, no attention can be given to the information that is contained in so-called ‘constant bands’, i.e. amplified restriction fragments that do not appear to differ between compared species. Furthermore on a typical gel-based system, or on a capillary system such as a MegaBACE, samples must be run in parallel and only about 100-150 bands per lane on a gel or per capillary can be analysed. These limitations also hamper throughput.
- Ideally, the detection system should be capable of determining the entire sequence of the amplified fragments to capture all amplified restriction fragments. However, most high throughput sequencing technologies cannot yet provide sequencing reads that encompass entire AFLP fragments, which are typically 100-500 bp in length.
- So far, detection of AFLP markers/sequences by sequencing has not been economically feasible due to, among other limitations, cost limitations of Sanger dideoxy sequencing technology and other conventional sequencing technologies.
- Detection by sequencing instead of mobility determination will increase throughput because:
- 1) polymorphisms located in the internal sequences will be detected in most (or all) amplified fragments; this will increase the number of markers per PC considerably.
- 2) no loss of AFLP markers due to co-migration of AFLP markers and constant bands.
- 3) co-dominant scoring does not rely on quantification of band intensities and is independent of the relatedness of the individuals fingerprinted.
- However, detection by sequencing of the entire restriction fragment is still relatively uneconomical. Furthermore, the current state of the art sequencing technology such as disclosed herein elsewhere (from 454 Life Sciences, www.454.com and Solexa, www.solexa.com), despite their overwhelming sequencing power, can only provide sequencing fragments of limited length. Also the current methods do not allow for the simultaneous processing of many samples in one run.
- In the following description and examples a number of terms are used. In order to provide a clear and consistent understanding of the specification and claims, including the scope to be given such terms, the following definitions are provided. Unless otherwise defined herein, all technical and scientific terms used have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The disclosures of all publications, patent applications, patents and other references are incorporated herein in their entirety by reference.
- Nucleic acid: a nucleic acid according to the present invention may include any polymer or oligomer of pyrimidine and purine bases, preferably cytosine, thymine, and uracil, and adenine and guanine, respectively (See Albert L. Lehninger, Principles of Biochemistry, at 793-800 (Worth Pub. 1982) which is herein incorporated by reference in its entirety for all purposes). The present invention contemplates any deoxyribonucleotide, ribonucleotide or peptide nucleic acid component, and any chemical variants thereof, such as methylated, hydroxymethylated or glycosylated forms of these bases, and the like. The polymers or oligomers may be heterogenous or homogenous in composition, and may be isolated from naturally occurring sources or may be artificially or synthetically produced. In addition, the nucleic acids may be DNA or RNA, or a mixture thereof, and may exist permanently or transitionally in single-stranded or double-stranded form, including homoduplex, heteroduplex, and hybrid states.
- AFLP: AFLP refers to a method for selective amplification of nucleic acids based on digesting a nucleic acid with one or more restriction endonucleases to yield restriction fragments, ligating adaptors to the restriction fragments and amplifying the adaptor-ligated restriction fragments with at least one primer that is (part) complementary to the adaptor, (part) complementary to the remains of the restriction endonuclease, and that further contains at least one randomly selected nucleotide from amongst A, C, T, or G (or U as the case may be). AFLP does not require any prior sequence information and can be performed on any starting DNA. In general, AFLP comprises the steps of:
-
- (a) digesting a nucleic acid, in particular a DNA or cDNA, with one or more specific restriction endonucleases, to fragment the DNA into a corresponding series of restriction fragments;
- (b) ligating the restriction fragments thus obtained with a double-stranded synthetic oligonucleotide adaptor, one end of which is compatible with one or both of the ends of the restriction fragments, to thereby produce adaptor-ligated, preferably tagged, restriction fragments of the starting DNA;
- (c) contacting the adaptor-ligated, preferably tagged, restriction fragments under hybridizing conditions with one or more oligonucleotide primers that contain selective nucleotides at their 3′-end;
- (d) amplifying the adaptor-ligated, preferably tagged, restriction fragment hybridised with the primers by PCR or a similar technique so as to cause further elongation of the hybridised primers along the restriction fragments of the starting DNA to which the primers hybridised; and
- (e) detecting, identifying or recovering the amplified or elongated DNA fragment thus obtained.
- AFLP thus provides a reproducible subset of adaptor-ligated fragments. AFLP is described in EP 534858, U.S. Pat. No. 6,045,994 and in Vos et al. Reference is made to these publications for further details regarding AFLP. The AFLP is commonly used as a complexity reduction technique and a DNA fingerprinting technology. Within the context of the use of AFLP as a fingerprinting technology, the concept of an AFLP marker has been developed.
- AFLP marker: An AFLP marker is an amplified adaptor-ligated restriction fragment that is different between two samples that have been amplified using AFLP (fingerprinted), using the same set of primers. As such, the presence or absence of this amplified adaptor-ligated restriction fragment can be used as a marker that is linked to a trait or phenotype. In conventional gel technology, an AFLP marker shows up as a band in the gel located at a certain mobility. Other electrophoretic techniques such as capillary electrophoresis may not refer to this as a band, but the concept remains the same, i.e. a nucleic acid with a certain length and mobility. Absence or presence of the band may be indicative of (or associated with) the presence or absence of the phenotype. AFLP markers typically involve SNPs in the restriction site of the endonuclease or the selective nucleotides. Occasionally, AFLP markers may involve indels in the restriction fragment. Constant band: a constant band in the AFLP technology is an amplified adaptor-ligated restriction fragment that is relatively invariable between samples. Thus, a constant band in the AFLP technology will, over a range of samples, show up at about the same position in the gel, i.e. has the same length/mobility. In conventional AFLP these are typically used to anchor the lanes corresponding to samples on a gel or electropherograms of multiple AFLP samples detected by capillary electrophoresis. Typically, a constant band is less informative than an AFLP marker.
- Nevertheless, as AFLP markers customary involve SNPs in the selective nucleotides or the restriction site, constant bands may comprise SNPs in the restriction fragments themselves, rendering the constant bands an interesting alternative source of genetic information that is complementary to AFLP markers.
- Selective base: Located at the 3′ end of the primer that contains a part that is complementary to the adaptor and a part that is complementary to the remains of the restriction site, the selective base is randomly selected from amongst A, C, T or G. By extending a primer with a selective base, the subsequent amplification will yield only a reproducible subset of the adaptor-ligated restriction fragments, i.e. only the fragments that can be amplified using the primer carrying the selective base. Selective nucleotides can be added to the 3′end of the primer in a number varying between 1 and 10. Typically 1-4 suffice. Both primers may contain a varying number of selective bases. With each added selective base, the subset reduces the amount of amplified adaptor-ligated restriction fragments in the subset by a factor of about 4. Typically, the number of selective bases used in AFLP is indicated by +N+M, wherein one primer carries N selective nucleotides and the other primers carries M selective nucleotides. Thus, an Eco/Mse +1/+2 AFLP is shorthand for the digestion of the starting DNA with EcoRI and MseI, ligation of appropriate adaptors and amplification with one primer directed to the EcoRI restricted position carrying one selective base and the other primer directed to the MseI restricted site carrying 2 selective nucleotides. A primer used in AFLP that carries at least one selective nucleotide at its 3′ end is also depicted as an AFLP-primer. Primers that do not carry a selective nucleotide at their 3′ end and which in fact are complementary to the adaptor and the remains of the restriction site are sometimes indicated as AFLP+0 primers.
- Clustering: with the term “clustering” is meant the comparison of two or more nucleotide sequences based on the presence of short or long stretches of identical or similar nucleotides. Several methods for alignment of nucleotide sequences are known in the art, as will be further explained below. Sometimes the terms “assembly” or “alignment” are used as synonyms.
- Identifier: a short sequence that can be added to an adaptor or a primer or included in its sequence or otherwise used as label to provide a unique identifier. Such a sequence identifier can be a unique base sequence of varying but defined length uniquely used for identifying a specific nucleic acid sample. For instance 4 bp tags allow 4(exp4)=256 different tags. Typical examples are ZIP sequences, known in the art as commonly used tags for unique detection by hybridization (Iannone et al. Cytometry 39:131-140, 2000). Using such an identifier, the origin of a PCR sample can be determined upon further processing. In the case of combining processed products originating from different nucleic acid samples, the different nucleic acid samples are generally identified using different identifiers.
- Sequencing: The term sequencing refers to determining the order of nucleotides (base sequences) in a nucleic acid sample, e.g. DNA or RNA.
- High-throughput screening: High-throughput screening, often abbreviated as HTS, is a method for scientific experimentation especially relevant to the fields of biology and chemistry. Through a combination of modern robotics and other specialised laboratory hardware, it allows a researcher to effectively screen large amounts of samples simultaneously.
- Restriction endonuclease: a restriction endonuclease or restriction enzyme is an enzyme that recognizes a specific nucleotide sequence (target site) in a double-stranded DNA molecule, and will cleave both strands of the DNA molecule at or near every target site.
- Restriction fragments: the DNA molecules produced by digestion with a restriction endonuclease are referred to as restriction fragments. Any given genome (or nucleic acid, regardless of its origin) will be digested by a particular restriction endonuclease into a discrete set of restriction fragments. The DNA fragments that result from restriction endonuclease cleavage can be further used in a variety of techniques and can for instance be detected by gel electrophoresis.
- Gel electrophoresis: in order to detect restriction fragments, an analytical method for fractionating DNA molecules on the basis of size can be required. The most commonly used technique for achieving such fractionation is (capillary) gel electrophoresis. The rate at which DNA fragments move in such gels depends on their molecular weight; thus, the distances traveled decrease as the fragment lengths increase. The DNA fragments fractionated by gel electrophoresis can be visualized directly by a staining procedure e.g. silver staining or staining using ethidium bromide, if the number of fragments included in the pattern is sufficiently small. Alternatively further treatment of the DNA fragments may incorporate detectable labels in the fragments, such as fluorophores or radioactive labels, which are preferably used to label one strand of the AFLP product.
- Ligation: the enzymatic reaction catalyzed by a ligase enzyme in which two double-stranded DNA molecules are covalently joined together is referred to as ligation. In general, both DNA strands are covalently joined together, but it is also possible to prevent the ligation of one of the two strands through chemical or enzymatic modification of one of the ends of the strands. In that case the covalent joining will occur in only one of the two DNA strands.
- Synthetic oligonucleotide: single-stranded DNA molecules having preferably from about 10 to about 50 bases, which can be synthesized chemically are referred to as synthetic oligonucleotides. In general, these synthetic DNA molecules are designed to have a unique or desired nucleotide sequence, although it is possible to synthesize families of molecules having related sequences and which have different nucleotide compositions at specific positions within the nucleotide sequence. The term synthetic oligonucleotide will be used to refer to DNA molecules having a designed or desired nucleotide sequence.
- Adaptors: short double-stranded DNA molecules with a limited number of base pairs, e.g. about 10 to about 30 base pairs in length, which are designed such that they can be ligated to the ends of restriction fragments. Adaptors are generally composed of two synthetic oligonucleotides which have nucleotide sequences which are partially complementary to each other. When mixing the two synthetic oligonucleotides in solution under appropriate conditions, they will anneal to each other forming a double-stranded structure. After annealing, one end of the adaptor molecule is designed such that it is compatible with the end of a restriction fragment and can be ligated thereto; the other end of the adaptor can be designed so that it cannot be ligated, but this need not be the case (double ligated adaptors).
- Adaptor-ligated restriction fragments: restriction fragments that have been capped by adaptors.
- Primers: in general, the term primers refer to DNA strands which can prime the synthesis of DNA. DNA polymerase cannot synthesize DNA de novo without primers: it can only extend an existing DNA strand in a reaction in which the complementary strand is used as a template to direct the order of nucleotides to be assembled. We will refer to the synthetic oligonucleotide molecules which are used in a polymerase chain reaction (PCR) as primers.
- DNA amplification: the term DNA amplification will be typically used to denote the in vitro synthesis of double-stranded DNA molecules using PCR. It is noted that other amplification methods exist and they may be used in the present invention without departing from the gist.
- The present inventors have found that the above described problems and other problems in the art can be overcome by devising a generic way wherein the versatility and applicability of (AFLP) marker technology can be combined with that of state-of-the-art high throughput sequencing technology.
- Thus, the present inventors have found that by incorporation of a sample-specific identifier in the adaptor-ligated restriction fragment and/or the determination of only part of the sequence of the restriction fragment provides for a very efficient and reliable improvement of the existing technologies. It was found that by incorporation of a sample-specific identifier, multiple samples can be sequenced in a single run and by sequencing only part of the restriction fragment, adequate identification of the restriction fragment can be achieved.
-
FIG. 1 : is a schematic representation of the adaptor structure that is used in a regular AFLP-based approach for AFLP detection short tag sequencing. A typical AFLP fragment derived form a digest of a DNA sample with EcoRI and MseI and subsequent adapter ligation is shown, followed by a typical adaptor for the EcoRI site. The adaptor comprises, from the 5′ to 3′ end, a 5′ primer sequence, which is optional, and can be used to anchor amplification primers or to anchor the adapter-ligated fragment to a bead or surface. Further an identifier is shown (given as NNNNNN in a degenerate form), followed by remains of a recognition sequence of a restriction enzyme (in this EcoRI, i.e. AATTC). The last nucleotide of the identifier preferably does not comprise a G in order to destroy the EcoRI restriction site. A suitable primer is provided that comprises the optional 5′ primer sequence, an example of a specific primer (ACTGAC), remains of the recognition site and a section that may contain one or more selective nucleotides at the 3′ end. -
FIG. 2 : is a schematic representation of the embodiment wherein a recognition sequence for a type IIs restriction endonuclease is incorporated in the adaptor. After restriction with the type IIs enzyme, type IIs compatible adaptors can be ligated to one or both of the restricted fragments A and B. The type IIs adaptor comprises an optional primer binding (or anchoring) sequence, an identifier and a section containing (degenerate) nucleotides (NN) to hybridize to the overhang of the IIs restriction site. The associated primer may contain one or more selective nucleotides (XYZ) at its 3′ end. - In one aspect, the invention relates to a method for the identification of restriction fragments in a sample, comprising the steps of:
-
- (a) providing a sample nucleic acid;
- (b) digesting the sample nucleic acid with at least one restriction endonuclease to obtain a set of restriction fragments;
- (c) providing double stranded synthetic adaptors comprising
- a 5′ primer-compatible sequence,
- a sample-specific identifier section,
- a section that is complementary to the remains of the recognition sequence of the restriction endonuclease;
- (d) ligating the double stranded synthetic adaptors to the restriction fragments in the set, to provide a set of adaptor-ligated restriction fragments;
- (e) amplification of the set of adaptor-ligated restriction fragments, with one or more primers that are at least complementary to:
- the sample-specific identifier section,
- the section that is complementary to the remains of the recognition sequence of the restriction endonuclease,
- to provide for amplified adaptor-ligated restriction fragments (amplicons);
- (f) determining the sequence of at least the sample-specific identifier section, the remains of the recognition sequence of the restriction endonuclease and of part of the sequence of the restriction fragment located adjacent thereto of (part of) the amplified adaptor-ligated restriction fragments.
- (g) identifying the presence or absence of amplified adaptor-ligated restriction fragments in the sample.
- By treating a sample nucleic acid in this way, a set of amplified restriction fragments is obtained for every sample that is sequenced. Every restriction fragment can be identified as originating from a certain sample via the sample specific identifier which is different for each sample. Sequencing of the amplified adaptor-ligated restriction fragments provides sequence information on at least part of the adaptor-ligated restriction fragment. The information contained in the adaptor-derived part contains information about the sample from which the fragment is obtained, whereas sequence information from the restriction fragment itself provides information about the restriction fragment and allows for identification of the restriction fragment. This sequence information on the restriction fragment is used to identify the restriction fragment with an accuracy that depends on the number of nucleotides that is determined and the number of restriction fragments in the set of amplified adaptor-ligated restriction fragments.
- To provide a solution to the problem of sampling variation which affects the accuracy of identifying molecular markers by sequencing contained in a set of multiple fragments, the present inventors have also found that detection of markers via sequencing is preferably performed with sufficient redundancy (depth) to sample all amplified fragments at least once and accompanied by statistical means which address the issue of sampling variation in relation to the accuracy of the genotypes called. Furthermore, just as with AFLP scoring, in the context of a segregating population, the simultaneous scoring of the parent individuals in one experiment, will aid in determining the statistical threshold.
- Thus, in certain embodiments, the redundancy of the tagged amplified adaptor-ligated restriction fragments is at least 6, preferably at least 7, more preferably at least 8 and most preferably at least 9. In certain embodiments, the sequence of each adaptor-ligated restriction fragment is determined at least 6, preferably at least 7, more preferably at least 8 and most preferably at least 9 fold. In certain embodiments, the redundancy is selected such, assuming a 50/50 overall chance of identifying the locus correctly as homozygous, that the chance of correct identification of the locus is more than 95%, 96%, 97%, 98%, 99%, 99.5%.
- In this respect the following calculation may be illustrative: The sequencing technology of Solexa as described herein elsewhere, provides for 40.000.000 reads of about 25 bp each, totaling a staggering 1 billion bp in one single run. Assuming a redundancy in sampling of 10 times, 4.000.000 unique fragments can be assessed in one run. Combining 100 samples allows for 40.000 fragments to be sequences for each sample. Seen from the perspective of AFLP, this amounts to 160 primer combinations with 250 fragments each.
- This method allows for the identification of restriction fragments in way that is different from that of the conventional marker detection based on electrophoresis.
- In the first step of the method for the identification of restriction fragments a sample nucleic acid is provided. The nucleic acids in the sample will usually be in the form of DNA. However, the nucleotide sequence information contained in the sample may be from any source of nucleic acids, including e. g. RNA, polyA+RNA, cDNA, genomic DNA, organellar DNA such as mitochondrial or chloroplast DNA, synthetic nucleic acids, DNA libraries (such as BAC libraries/pooled BAC clones), clone banks or any selection or combinations thereof. The DNA in the nucleic acid sample may be double stranded, single stranded, and double stranded DNA denatured into single stranded DNA. The DNA sample can be from any organism, whether plant, animal, synthetic or human.
- The nucleic acid sample is restricted (or digested) with at least one restriction endonuclease to provide for a set of restriction fragments. In certain embodiments, two or more endonucleases can be used to obtain restriction fragments. The endonuclease can be a frequent cutter (a recognition sequence of 3-5 bp, such as MseI) or a rare cutter (recognition sequence of >5 bp, such as EcoRI). In certain preferred embodiments, a combination of a rare and a frequent cutter is preferred. In certain embodiments, in particular when the sample contains or is derived from a relative large genome, it may be preferred to use a third enzyme (rare or frequent cutter) to obtain a larger set of restriction fragments of shorter size.
- As restriction endonucleases, any endonuclease will suffice. Typically, Type II endonucleases are preferred such as EcoRI, MseI, PstI etc. In certain embodiments a type IIs endonuclease may be used, i.e. an endonuclease of which the recognition sequence is located distant from the restriction site, i.e such as AceIII, AlwI, AlwXI, Alw26I, BbvI, BbvII, BbsI, BccI, Bce83I, BcefI, BcgI, BinI, BsaI, BsgI, BsmAI, BsmFl, BspMI, EarI, EciI, Eco31I, Eco57I, Esp3I, FauI, FokI, GsuI, HgaI, HinGUII, HphI, Ksp632I, MboII, MmeI, Mn1I, NgoVIII, PleI, RleAI, SapI, SfaNI, TaqJI and Zthll 1II. The use of this type of restriction endonuclease leads to certain adaptations to the method as will be described herein elsewhere.
- Restriction fragments can be blunt-ended or have protruding ends, depending on the endonuclease used. To these ends, adaptors can be ligated. Typically, the adaptors used in the present invention have a particular design. The adaptors used in the present invention may comprise a 5′-primer compatible sequence, which may be optional to provide for sufficient length of the adaptor for subsequent primer annealing, followed by a sample-specific identifier section that may comprise from 4-16 nucleotides. Preferably the sample-specific identifier does not contain 2 or more consecutive identical bases to prevent readthroughs during the sequencing step. Furthermore, in case 2 or more sample are combined and multiple sample specific identifiers are used to distinguish the origin of the samples, there is preferably a difference between the sample-specific identifiers of at least 2, preferably 3 bp. This allows for improved discrimination between the different sample-specific identifiers within a combined pool of samples. At the 3′ end of the adaptor a section is located that is complementary to the remains of the recognition sequence of the restriction endonuclease. For instance, EcoRI recognises 5′-GAATTC-3′ and cuts between G and AATTC. For EcoRI, the section complementary to the remains of the recognition sequence of the restriction endonuclease hence is a C-nucleotide.
- The adaptor is ligated (covalently connected) with one or both sides of the restriction fragment. When digestion is performed with more than one endonuclease, different adaptors may be used which will give rise to different sets of adaptor-ligated restriction fragments.
- The adaptor-ligated restriction fragments are subsequently amplified with a set of one or more primers. The primer may be complementary to the adaptor only, i.e. non-selective amplification. The primer preferably contains a section that is complementary to the sample-specific identifier and a section that is complementary to the remains of the recognition sequence of the restriction endonuclease. In certain embodiments, the primer may contain at its 3′ end one or more selective nucleotides to provide for a subset of amplified adapter-ligated restriction fragments. The primer may at its 5′end also contain further nucleotides to aid in anchoring the primer to the adapter-ligated restriction fragments. In certain embodiments, the primer may contain nucleotides that express improved hybridisation characteristics such as LNAs or PNAs. To amplify adapter-ligated restriction fragments from combined samples in a pool it is possible to use sets of degenerated primers, i.e. primer sets wherein for each sample, the corresponding sample-identifier is incorporated in the primer. In certain embodiments, it is possible to use primer sets wherein the identifier section is completely degenerated (or at least to a large extent) i.e. (almost) every combination of nucleotides is provided in the sample specific identifier section. Combined with stringent hybridisation conditions in the amplification and the optional use of LNA or PNA-type nucleotides to increase hybridisation characteristics, this may lead to a very efficient amplification.
- The amplification of the adapter-ligated restriction fragments lead to a set of amplified adapter-ligated restriction fragments, sometimes referred to as amplicons.
- The amplicons (or at least part thereof) are subjected to a step that comprises at least the determination of the sequence of the sample specific identifier to determine the origin of the fragment and of part of the sequence of the restriction fragment. In practice this amounts also to the determination of the sections located in-between such as the remains of the recognition sequence of the restriction endonuclease. By sequencing the sample specific identifier in combination with part of the fragment located adjacent to the adapter derived sequence, it is possible to uniquely identify restriction fragments. When correlated to the presence or absence of a phenotype, these uniquely identified restriction fragments can be used as molecular markers.
- This allows for the definition of a new generation of markers and amounts hence to a novel marker technology with the proven versatility of AFLP technology, yet that is suitable for high-throughput technologies and is generally applicable amongst any type of organism or nucleic acid. Uniquely identifying restriction fragments in a sample by determination of part of their sequence by this method can be repeated for multiple samples. The presence or absence of the restriction fragments with the depicted sequence in the sample is indicative for the presence or absence of a phenotype.
- A further advantage of the presently invented marker technology based on the combination of AFLP and high throughput sequencing is the additional information that can be obtained compared to conventional AFLP technology. In AFLP, amplicons that are designated as AFLP markers typically contain polymorphism in the recognition site, the restriction site or, optionally, in the selective nucleotides. Polymorphisms located further in the restriction fragment typical do not qualify as AFLP markers (apart from perhaps indel polymorphisms). With the present sequencing step, the nucleotides adjacent to the optional selective nucleotides are also determined and this leads to the identification of an increased number of molecular markers and to an improvement in the existing marker technology.
- The high throughput sequencing used in the present invention is a method for scientific experimentation especially relevant to the fields of biology and chemistry. Through a combination of modern robotics and other specialised laboratory hardware, it allows a researcher to effectively screen large amounts of samples simultaneously.
- It is preferred that the sequencing is performed using high-throughput sequencing methods, such as the methods disclosed in WO 03/004690, WO 03/054142, WO 2004/069849, WO 2004/070005, WO 2004/070007, and WO 2005/003375 (all in the name of 454 Life Sciences), by Seo et al. (2004) Proc. Natl. Acad. Sci. USA 101:5488-93, and technologies of Helios, Solexa, US Genomics, etcetera, which are herein incorporated by reference.
- In certain embodiments, it is preferred that sequencing is performed using the apparatus and/or method disclosed in WO 03/004690, WO 03/054142, WO 2004/069849, WO 2004/070005, WO 2004/070007, and WO 2005/003375 (all in the name of 454 Life Sciences), which are herein incorporated by reference. The technology described allows sequencing of 40 million bases in a single run and is 100 times faster and cheaper than competing technology. The sequencing technology roughly consists of 5 steps: 1) fragmentation of DNA and ligation of specific adaptors to create a library of single-stranded DNA (ssDNA); 2) annealing of ssDNA to beads, emulsification of the beads in water-in-oil microreactors and performing emulsion PCR to amplify the individual ssDNA molecules on beads; 3) selection of/enrichment for beads containing amplified ssDNA molecules on their surface 4) deposition of DNA carrying beads in a PICOTITER™ Plate; and 5) simultaneous sequencing in 100,000 wells by generation of a pyrophosphate light signal. The method will be explained in more detail below.
- In a preferred embodiment, the sequencing comprises the steps of:
-
- (a) annealing adapted fragments to beads, each bead being annealed with a single adapted fragment;
- (b) emulsifying the beads in water-in-oil microreactors, each water-in-oil microreactor comprising a single bead;
- (c) loading the beads in wells, each well comprising a single bead; and generating a pyrophosphate signal.
- In the first step (a), sequencing adaptors are ligated to fragments within the combination library. Said sequencing adaptor includes at least a “key” region for annealing to a bead, a sequencing primer region and a PCR primer region. Thus, adapted fragments are obtained.
- In a first step, adapted fragments are annealed to beads, each bead annealing with a single adapted fragment. To the pool of adapted fragments, beads are added in excess as to ensure annealing of one single adapted fragment per bead for the majority of the beads (Poisson distribution).
- In a next step, the beads are emulsified in water-in-oil microreactors, each water-in-oil microreactor comprising a single bead. PCR reagents are present in the water-in-oil microreactors allowing a PCR reaction to take place within the microreactors. Subsequently, the microreactors are broken, and the beads comprising DNA (DNA positive beads) are enriched.
- In a following step, the beads are loaded in wells, each well comprising a single bead. The wells are preferably part of a PICOTITER™ Plate allowing for simultaneous sequencing of a large amount of fragments.
- After addition of enzyme-carrying beads, the sequence of the fragments is determined using pyrosequencing. In successive steps, the PICOTITER™ Plate and the beads as well as the enzyme beads therein are subjected to different deoxyribonucleotides in the presence of conventional sequencing reagents, and upon incorporation of a deoxyribonucleotide a light signal is generated which is recorded. Incorporation of the correct nucleotide will generate a pyrosequencing signal which can be detected.
- Pyrosequencing itself is known in the art and described inter alia on www.biotagebio.com; www.pyrosequencing.com/section technology. The technology is further applied in e.g. WO 03/004690, WO 03/054142, WO 2004/069849, WO 2004/070005, WO 2004/070007, and WO 2005/003375 (all in the name of 454 Life Sciences), which are herein incorporated by reference. In the present invention, the beads are preferably equipped with primer (binding) sequences or parts thereof that are capable of binding the amplicons, as the case may be. In other embodiments, the primers used in the amplification are equipped with sequences, for instance at their 5′-end, that allow binding of the amplicons to the beads in order to allow subsequent emulsion polymerisation followed by sequencing. Alternatively the amplicons may be ligated with sequencing adaptors prior to ligation to the beads or the surface. The sequenced amplicons will reveal the identity of the identifier and thus of the presence or absence of the restriction fragment in the sample.
- One of the methods for high throughput sequencing is available from Solexa, United Kingdom (www.solexa.co.uk) and described inter alia in WO0006770, WO0027521, WO0058507, WO0123610, WO0157248, WO0157249, WO02061127, WO03016565, WO03048387, WO2004018497, WO2004018493, WO2004050915, WO2004076692, WO2005021786, WO2005047301, WO2005065814, WO2005068656, WO2005068089, WO2005078130. In essence, the method start with adaptor-ligated fragments of genomic DNA. The adaptor-ligated DNA is randomly attached to a dense lawn of primers that are attached to a solid surface, typically in a flow cell. The other end of the adaptor ligated fragment hybridizes to a complementary primer on the surface. The primers are extended in the presence of nucleotides and polymerases in a so-called solid-phase bridge amplification to provide double stranded fragments. This solid phase bridge amplification may be a selective amplification. Denaturation and repetition of the solid-phase bridge amplification results in dense clusters of amplified fragments distributed over the surface. The sequencing is initiated by adding four differently labelled reversible terminator nucleotides, primers and polymerase to the flow cell. After the first round of primer extension, the labels are detected, the identity of the first incorporated bases is recorded and the blocked 3′ terminus and the fluorophore are removed from the incorporated base. Then the identity of the second base is determined in the same way and so sequencing continues.
- In the present invention, the adaptor ligated restriction fragments or the amplicons are bound to the surface via the primer binding sequence or the primer sequence. The sequence is determined as outlined, including the identifier sequence and (part of) the restriction fragment. Currently available Solexa technology allows for the sequencing of fragments of about 25 base pairs. By economical design of the adaptors and the surface bound primers, the sequencing step reads through the sample identifier, the remains of the recognition sequence of the restriction endonuclease and any optional selective bases. When a 6 bp sample identifier is used, the remains are from the rare cutter EcoRI (AACCT), the use of two selective bases yields an internal sequence of the restriction fragment of 12 bp that can be used to uniquely identify the restriction fragment in the sample.
- In a preferred embodiment based on the Solexa sequencing technology above, the amplification of the adapter ligated restriction fragments is performed with a primer that contains at most one selective nucleotide at its 3′end, preferably no selective nucleotides at its 3′ end, i.e. the primer is only complementary to the adaptor (a +0 primer).
- In alternative embodiments directed to the sequencing methods described herein, the primers used in the amplification may contain specific sections (as alternative to the herein described primer or primer binding sequences) that are used in the subsequent sequencing step to bind the adaptor-capped restriction fragments or amplicons to the surface. These are generally depicted as the key region or the 5′-primer compatible sequence.
- In one embodiment of the invention, the nucleic acid sample is digested with at least one restriction enzyme and at least one adapter is ligated that comprises a recognition sequence for a type IIs restriction endonuclease. The subsequent digestion of the adapter-ligated restriction fragment with a type IIs restriction endonuclease yields, as the distance between the recognition and restriction site of a type IIs enzyme is relatively short (up to about 30 nucleotides), a shorter and a longer restriction fragment, to which a IIs restriction site compatible adaptor can be ligated. Typically, the overhang of the IIs-restricted site is unknown such that a set of adaptors may be used that are degenerated in the overhang. After (selective) amplification, the amplicons can be sequenced. The adaptor sequence in this embodiment generally follows: 5′-primer binding site—sample identifier sequence—degenerate type IIs cohesive end sequence-3′. The associated PCR primer generally follows: primer sequence—sample identifier sequence—degenerate type IIs cohesive end sequence—selective nucleotides-3′. The primer used to initiate the sequencing-by-synthesis then generally has the structure: 5′-primer binding site-3′. A size selection step may be preferred after digesting with the IIs enzyme to remove the smaller fragments. As in this embodiment the remains of the restriction site are for this type of enzyme typically in the order of 2-4 bp, this results in combination with a 6 bp sample identifier in the sequencing of 15-17 bp of a restriction fragment.
- In a further aspect, the invention relates to kits comprising one or more primer, and/or one or more adaptors for use in the method, aside from conventional components for kits per se. Furthermore the present invention finds application in, amongst others, use of the method for the identification of molecular markers, for genotyping, bulk segregant analysis, genetic mapping, marker-assisted back-crossing, mapping of quantitative trait loci, linkage disequilibrium mapping.
- DNA was isolated from 2 parents and 88 offspring using conventional methods. Parents (2x) and offspring (=4x) were in duplex with different indices to test reproducibility. Tags used to distinguish samples from each other differed at least in 2 nucleotides from any other tag used in the experiments. Quality is being tested throughout the various steps using agarose and PAA gels.
- For each DNA sample a restriction-ligation step is performed using EcoRI and MseI as enzymes. Adaptors are based on the hybridizing sequences located on the surface of the Solexa high throughput sequencing system, more in particular the EcoRI adapter contains the P5 sequence (sequence primer part) and the MseI adaptor contains the P7 sequence (bridge PCR primer sequence). The EcoRI adaptor further contains the sample identifying tag. 96 different EcoRI adaptors and one MseI adaptor are used. It is possible to use a degenerated EcoRI adaptor. The template preparation is inclusive of a size selection step by incubation of the mixture for 10 minutes at 80 degrees Celsius after the restriction (EcoRI+MseI) step but prior to the adapter ligation step. Fragments smaller than 130 nt are removed (in a maize sample).
- The complexity of the mixture is reduced by a selective preamplification using +1 primers (i.e. containing one randomly selective nucleotide at the 3′ end, using 96 EcoRI+1 primers and one MseI+1 primer (or one tag-degenerated EcoRI+1 primer and one MseI+1 primer). Selective amplification to reduce the complexity of the mixture to the desired size is performed using EcoRI+2 (=P5 side) and MseI+3 (=P7 side) primers necessitating the use of 96 EcoRI+2 primers and one MseI+3 primer. Tail PCR is performed using an EcoRI primer with the P5 bridge PCR primer sequence as the tail. The products are purified using SEPHADEX™ columns. Concentrations are determined and normalised and pools are created. The pools are subjected to massive parallel sequencing based on Solexa technology comprising bridge PCR amplification and sequencing followed by data analysis to determine the genotypes of the parents and the offspring.
- An alternative scenario does not use tail PCR, but employs phosphorylated EcoRI+2 primers. Due to the mismatch with the original adaptor, the annealing temperature in the amplification profile is lowered by 3 degrees Celsius to 13 cycles touch-down from 62-53 degrees Celsius followed by 23 cycles at 53 degrees Celsius. After ligation of the adaptor with the P5 bridging PCR sequence, PCR is performed with P5 and P7 bridge PCR primers.
- A second alternative scenario is based on standard template preparation as outlined herein before, selective (pre)amplification to reduce the complexity. Selective amplification is performed with primers that contain the reconstituted EcoRI and MseI restriction sites. This allows for removal of the adaptor sequences prior to sequencing, thereby reducing the amount of data to be analysed. Purification of the products by SEPHADEX™ columns to remove remains of Taq DNA polymerase. Template preparation wherein (reconstituted site) adapter sequences are replaced by Solexa adaptors using ten-fold increased EcoRI adaptor and EcoRI enzyme to compensate for the increased number of EcoRI sites compared to genomic DNA. The Solexa EcoRI adaptors also contain the tags, hence 96 tagged Solexa EcoRI adaptors are needed. The bottom strand of the adaptor is blocked at the 3′ end (in this case by 3′amino) to block extension by a polymerase. PCR is performed with P5 and P7 bridge PCR primers. Products are purified by Qiagen columns.
- Sequence-based detection of AFLP fragments was performed using Solexa's Clonal Single Molecule Array (CSMATM) technology, a Sequencing-by-Synthesis platform capable of analyzing up to 40 million individual fragments in a single sequence run.
- The experimental sequence involves AFLP template preparation, selective (AFLP) amplification, single molecule bridge amplification and sequencing of millions of sequence tags from one restriction enzyme end of the AFLP fragments. Maize parental lines B73 and Mo17 and 87 Recombinant Inbred Lines (RILs) were used and sequenced over 8.9 million EcoRI AFLP fragment termini were sequenced to provide proof-of-principle for sequence-based AFLP detection.
- Parental lines B73 and Mo17 and 87 RILs were selected. AFLP templates were prepared using restriction enzyme combination EcoRI/MseI. Selective amplification was performed using +2/+3 AFLP primers.
- Template fragments for Solexa CSMA bridge amplification were prepared by performing a second restriction/ligation using EcoRI adaptors containing unique 5 bp sample identification (ID) tag sequences. Parental lines and three RIL samples were included twice using different 5 bp sample ID tags to measure within-experiment reproducibility.
- Sequence-based AFLP markers were identified by extracting 27 bp sequence tags observed at different frequencies in B73 and Mo17, segregating in the RIL offspring.
- Sequence-based AFLP marker data were compared to AFLP marker scores obtained by conventional AFLP fingerprinting using length-based detection of the four corresponding EcoRI/MseI+3/+3 primer combinations.
- # sequence tags generated 8,941,407
# sequence tags with known sample IDs 8,029,595
# different sequence tags with known sample IDs 206,758
# Mbp sequence data generated 241.4 - frequency range total # sequence tags per sample 55,374-112,527
- # sequence tag AFLP markers 125
- frequency range sequence tag AFLP markers in
- parent scoring present 90-17,218
- tabulate sequence tags representation per sample
- remove sequence tags with unknown sample IDs
- normalize sample representation based on total sequence tags per sample
- remove sequence tags with >2 fold frequency difference in parental duplos
- average tag frequencies parental duplos
- define sequence tag AFLP marker if frequency P 1/P2 exceeds threshold value
- score presence/absence of sequence tag markers in RIL offspring
-
-
EcoRI + 3 base total +A +C +G +T # sequence tag AFLP markers 125 34 37 37 17 # gel-based AFLP markers 82 29 18 17 18 - # sequence tag AFLP markers scored 125
# number of data-points in comparison 375
# data-points identical for duplos 372
% concordancy within experiment duplos 99,2% -
AFLP marker B73 Mo17 1 2 3 4 5 6 7 8 9 10 11 12 Conventional slab gel detection: E36/M50-175.9 − + + − − − − + − + − − − + E36/M50-280 + − + − − + − + + − + − − − E36/M50-405.8 − + + − − − + + + + − + − + E36/M50-243.7 + − + − − − − − + + + + + + E36/M50-124.02 + − + − + + + + − − − − + + E36/M50-379 + − + − − + + + + + + + − + E36/M50-468.9 + − + − + + − + − + + + + + Solexa-based detection CGGCGACGTACCGC − + + − − − − + − + − − − + CTAGTAATTATTCC + − + − − + − + + − + − − − CAGCGCCTTCTCCT − + + − − − + + + + − + − + CAGAACTCTGACTT + − + − − − − − + + + + + + CAAATCTGTTAGAT + − + − + + + + − + − − + + CATGAAGGATTTAT + − + − − + + + + + + + − + CAAACAGACAACCG + − + − + + − + − + + + + + - The viability sequenced-based AFLP marker detection was generated using Solexa's CSMA technology. whereby a larger number of AFLP markers is scored using sequence-based detection than on conventional slab gels, presumably due to improved resolution (fragment size) and deep sequencing which also captures low abundance fragments. Marker data vector comparisons reveal similar segregation patterns between sequence-based detection and slab gel detection: proof of concordancy awaits sequencing gel-based AFLP markers.
Claims (16)
1.-22. (canceled)
23. A kit for use in a method for detecting one or more polymorphisms in a plurality of nucleic acid samples, comprising:
one or more adaptors comprising a primer-compatible sequence and an sample-specific identifier sequence, and
one or more primers comprising sequences at its 3′-end a sequence for hybridizing to one or more sample nucleic acid sequences of the plurality of nucleic acid samples.
24. The kit according to claim 23 , wherein the kit further comprises one or more primers comprising sequences for hybridizing to the primer-compatible sequence of the one or more adaptors.
25. The kit according to claim 23 , wherein the one or more primers comprising sequences at its 3′-end a sequence for hybridizing to one or more sample nucleic acids the plurality of nucleic samples sequence further comprises sequences for hybridizing to the primer-compatible sequence of the one or more adaptors.
26. The kit according to claim 23 , wherein the one or more adapters comprise from 5′-end to 3′-end the primer-compatible sequence, the sample-specific identifier sequence, and an end that can be ligated to the blunt or protruding end of a restriction fragment.
27. The kit according to claim 23 , wherein the sample-specific identifier sequence comprises from 4-16 nucleotides.
28. The kit according to claim 23 , wherein the sample-specific identifier sequence does not contain 2 or more consecutive identical bases.
29. The kit according to claim 23 , wherein the adaptor comprises sequences for annealing to a solid support.
30. The kit according to claim 23 , wherein the adaptor comprises sequences for annealing to a bead.
31. The kit according to claim 23 , wherein at least one of the primers comprises a sequence complementary to the sample specific-identifier sequence of the one or more adaptors.
32. The kit according to claim 23 , wherein the one or more primers comprise at its 3′-end sequences for hybridizing to a subset of nucleic acids of the plurality of nucleic acid samples.
33. The kit according to claim 23 , wherein the one or more primers further comprise nucleotides selected from locked nucleic acids (LNAs), and peptide nucleic acids (PNAs).
34. The kit according to claim 23 , wherein at least one of the primers is phosphorylated.
35. The kit according to claim 23 , wherein the kit further comprises a polymerase for a polymerase chain reaction (PCR).
36. The kit according to claim 23 , wherein the kit further comprises a ligase for ligating the one or more adaptors to the nucleic acid samples.
37. The kit according to claim 23 , wherein the kit further comprises a restriction endonuclease.
Priority Applications (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US78870606P true | 2006-04-04 | 2006-04-04 | |
US88005207P true | 2007-01-12 | 2007-01-12 | |
PCT/NL2007/000094 WO2007114693A2 (en) | 2006-04-04 | 2007-04-04 | High throughput detection of molecular markers based on aflp and high throughput sequencing |
US29600909A true | 2009-02-06 | 2009-02-06 | |
US13/364,799 US20120135871A1 (en) | 2006-04-04 | 2012-02-02 | High throughput detection of molecular markers based on aflp and high through-put sequencing |
US13/449,629 US20120202698A1 (en) | 2006-04-04 | 2012-04-18 | High throughput detection of molecular markers based on aflp and high through-put sequencing |
US14/285,430 US10023907B2 (en) | 2006-04-04 | 2014-05-22 | High throughput detection of molecular markers based on AFLP and high through-put sequencing |
US16/000,252 US20180291439A1 (en) | 2006-04-04 | 2018-06-05 | High throughput detection of molecular markers based on aflp and high through-put sequencing |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US16/000,252 US20180291439A1 (en) | 2006-04-04 | 2018-06-05 | High throughput detection of molecular markers based on aflp and high through-put sequencing |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date | |
---|---|---|---|---|
US14/285,430 Continuation US10023907B2 (en) | 2006-04-04 | 2014-05-22 | High throughput detection of molecular markers based on AFLP and high through-put sequencing |
Publications (1)
Publication Number | Publication Date |
---|---|
US20180291439A1 true US20180291439A1 (en) | 2018-10-11 |
Family
ID=38508899
Family Applications (5)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/296,009 Abandoned US20090253581A1 (en) | 2006-04-04 | 2007-04-04 | High Throughput Detection of Molecular Markers Based on AFLP and High Throughput Sequencing |
US13/364,799 Abandoned US20120135871A1 (en) | 2006-04-04 | 2012-02-02 | High throughput detection of molecular markers based on aflp and high through-put sequencing |
US13/449,629 Abandoned US20120202698A1 (en) | 2006-04-04 | 2012-04-18 | High throughput detection of molecular markers based on aflp and high through-put sequencing |
US14/285,430 Active 2029-05-19 US10023907B2 (en) | 2006-04-04 | 2014-05-22 | High throughput detection of molecular markers based on AFLP and high through-put sequencing |
US16/000,252 Abandoned US20180291439A1 (en) | 2006-04-04 | 2018-06-05 | High throughput detection of molecular markers based on aflp and high through-put sequencing |
Family Applications Before (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/296,009 Abandoned US20090253581A1 (en) | 2006-04-04 | 2007-04-04 | High Throughput Detection of Molecular Markers Based on AFLP and High Throughput Sequencing |
US13/364,799 Abandoned US20120135871A1 (en) | 2006-04-04 | 2012-02-02 | High throughput detection of molecular markers based on aflp and high through-put sequencing |
US13/449,629 Abandoned US20120202698A1 (en) | 2006-04-04 | 2012-04-18 | High throughput detection of molecular markers based on aflp and high through-put sequencing |
US14/285,430 Active 2029-05-19 US10023907B2 (en) | 2006-04-04 | 2014-05-22 | High throughput detection of molecular markers based on AFLP and high through-put sequencing |
Country Status (8)
Country | Link |
---|---|
US (5) | US20090253581A1 (en) |
EP (3) | EP2963127B1 (en) |
JP (2) | JP5389638B2 (en) |
DK (1) | DK2002017T3 (en) |
ES (2) | ES2645661T3 (en) |
HK (1) | HK1219761A1 (en) |
PT (1) | PT2963127T (en) |
WO (1) | WO2007114693A2 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10266883B2 (en) | 2009-04-30 | 2019-04-23 | Prognosys Biosciences, Inc. | Nucleic acid constructs and methods of use |
US10308982B2 (en) | 2010-04-05 | 2019-06-04 | Prognosys Biosciences, Inc. | Spatially encoded biological assays |
Families Citing this family (37)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE602006019855D1 (en) | 2005-05-10 | 2011-03-10 | Oregon State | Method of mapping polymorphisms and polymorphism microarrays |
JP5220597B2 (en) | 2005-06-23 | 2013-06-26 | キージーン ナムローゼ フェンノートシャップ | Method for identifying one or more polymorphisms and methods of use thereof |
US10316364B2 (en) | 2005-09-29 | 2019-06-11 | Keygene N.V. | Method for identifying the source of an amplicon |
CA2623539C (en) | 2005-09-29 | 2015-12-15 | Keygene N.V. | High throughput screening of mutagenized populations |
EP2363504A1 (en) | 2005-12-22 | 2011-09-07 | Keygene N.V. | Method for high-throughtput AFLP-based polymorphism detection |
US8628927B2 (en) | 2008-11-07 | 2014-01-14 | Sequenta, Inc. | Monitoring health and disease status using clonotype profiles |
GB2483810B (en) | 2008-11-07 | 2012-09-05 | Sequenta Inc | Methods for correlating clonotypes with diseases in a population |
US9528160B2 (en) | 2008-11-07 | 2016-12-27 | Adaptive Biotechnolgies Corp. | Rare clonotypes and uses thereof |
US9043160B1 (en) | 2009-11-09 | 2015-05-26 | Sequenta, Inc. | Method of determining clonotypes and clonotype profiles |
US9365901B2 (en) | 2008-11-07 | 2016-06-14 | Adaptive Biotechnologies Corp. | Monitoring immunoglobulin heavy chain evolution in B-cell acute lymphoblastic leukemia |
US8748103B2 (en) | 2008-11-07 | 2014-06-10 | Sequenta, Inc. | Monitoring health and disease status using clonotype profiles |
US9506119B2 (en) | 2008-11-07 | 2016-11-29 | Adaptive Biotechnologies Corp. | Method of sequence determination using sequence tags |
ES2726702T3 (en) | 2009-01-15 | 2019-10-08 | Adaptive Biotechnologies Corp | Adaptive immunity profiling and methods for the generation of monoclonal antibodies |
GB2472371B (en) * | 2009-04-24 | 2011-10-26 | Selectamark Security Systems Plc | Synthetic nucleotide containing compositions for use in security marking of property and/or for marking a thief or attacker |
EP2248914A1 (en) * | 2009-05-05 | 2010-11-10 | Max-Planck-Gesellschaft zur Förderung der Wissenschaften e.V. | The use of class IIB restriction endonucleases in 2nd generation sequencing applications |
CN102459643B (en) | 2009-06-25 | 2016-06-01 | 弗雷德哈钦森癌症研究中心 | The method of detection acquired immunity |
US8766054B2 (en) | 2010-12-08 | 2014-07-01 | Hm.Clause | Phytophthora resistance in sweet peppers |
US10385475B2 (en) | 2011-09-12 | 2019-08-20 | Adaptive Biotechnologies Corp. | Random array sequencing of low-complexity libraries |
CA2853088C (en) | 2011-10-21 | 2018-03-13 | Adaptive Biotechnologies Corporation | Quantification of adaptive immune cell genomes in a complex mixture of cells |
WO2013086450A1 (en) | 2011-12-09 | 2013-06-13 | Adaptive Biotechnologies Corporation | Diagnosis of lymphoid malignancies and minimal residual disease detection |
US9499865B2 (en) | 2011-12-13 | 2016-11-22 | Adaptive Biotechnologies Corp. | Detection and measurement of tissue-infiltrating lymphocytes |
CN108611398A (en) | 2012-01-13 | 2018-10-02 | Data生物有限公司 | Genotyping is carried out by new-generation sequencing |
PL2814959T3 (en) | 2012-02-17 | 2018-07-31 | Fred Hutchinson Cancer Research Center | Compositions and methods for accurately identifying mutations |
ES2662128T3 (en) | 2012-03-05 | 2018-04-05 | Adaptive Biotechnologies Corporation | Determination of paired immune receptor chains from the frequency of matching subunits |
WO2013142389A1 (en) | 2012-03-20 | 2013-09-26 | University Of Washington Through Its Center For Commercialization | Methods of lowering the error rate of massively parallel dna sequencing using duplex consensus sequencing |
PT2831276T (en) | 2012-05-08 | 2016-07-26 | Adaptive Biotechnologies Corp | Compositions and method for measuring and calibrating amplification bias in multiplexed pcr reactions |
EP3330384B1 (en) | 2012-10-01 | 2019-09-25 | Adaptive Biotechnologies Corporation | Immunocompetence assessment by adaptive immune receptor diversity and clonality characterization |
US9708657B2 (en) | 2013-07-01 | 2017-07-18 | Adaptive Biotechnologies Corp. | Method for generating clonotype profiles using sequence tags |
US10066265B2 (en) | 2014-04-01 | 2018-09-04 | Adaptive Biotechnologies Corp. | Determining antigen-specific t-cells |
EP3132059A4 (en) | 2014-04-17 | 2017-09-06 | Adaptive Biotechnologies Corporation | Quantification of adaptive immune cell genomes in a complex mixture of cells |
CA2966201A1 (en) | 2014-10-29 | 2016-05-06 | Adaptive Biotechnologies Corp. | Highly-multiplexed simultaneous detection of nucleic acids encoding paired adaptive immune receptor heterodimers from many samples |
US10246701B2 (en) | 2014-11-14 | 2019-04-02 | Adaptive Biotechnologies Corp. | Multiplexed digital quantitation of rearranged lymphoid receptors in a complex mixture |
CN105441572B (en) * | 2016-01-07 | 2019-10-22 | 南京中医药大学 | Identify the DNA molecular marker and its application of Radix Angelicae Sinensis morning a kind of sedge |
JP6515884B2 (en) | 2016-06-29 | 2019-05-22 | トヨタ自動車株式会社 | Method of preparing DNA probe and genomic DNA analysis method using DNA probe |
US10428325B1 (en) | 2016-09-21 | 2019-10-01 | Adaptive Biotechnologies Corporation | Identification of antigen-specific B cell receptors |
JP2018191598A (en) | 2017-05-19 | 2018-12-06 | トヨタ自動車株式会社 | Random primer sets and methods for preparing dna libraries using the same |
WO2019121603A1 (en) | 2017-12-18 | 2019-06-27 | Keygene N.V. | Chemical mutagenesis of cassava |
Family Cites Families (85)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH04504356A (en) | 1989-01-31 | 1992-08-06 | ||
US20100267023A1 (en) * | 1992-09-24 | 2010-10-21 | Keygene N.V. | Selective restriction fragment amplification: fingerprinting |
HU223760B1 (en) | 1991-09-24 | 2005-01-28 | Keygene N.V. | Selective restriction fragmentumsokszorosítás general method of DNA fingerprinting analysis |
WO1995019697A1 (en) | 1994-01-21 | 1995-07-27 | North Carolina State University | Methods for within family selection in woody perennials using genetic markers |
EG23907A (en) | 1994-08-01 | 2007-12-30 | Delta & Pine Land Co | Control of plant gene expression |
US6013445A (en) | 1996-06-06 | 2000-01-11 | Lynx Therapeutics, Inc. | Massively parallel signature sequencing by ligation of encoded adaptors |
EP0804618B1 (en) * | 1994-11-28 | 1999-01-27 | E.I. Du Pont De Nemours And Company | Compound microsatellite primers for the detection of genetic polymorphisms |
US5565340A (en) | 1995-01-27 | 1996-10-15 | Clontech Laboratories, Inc. | Method for suppressing DNA fragment amplification during PCR |
AU6024598A (en) * | 1997-01-10 | 1998-08-03 | Pioneer Hi-Bred International, Inc. | Hybridization-based genetic amplification and analysis |
US6090556A (en) | 1997-04-07 | 2000-07-18 | Japan Science & Technology Corporation | Method for quantitatively determining the expression of a gene |
ES2163271T3 (en) | 1997-05-13 | 2002-01-16 | Azign Bioscience As | Procedure to clone differentially expressed mRNAs and display transcriptionists (dodets). |
DE69838210T2 (en) | 1997-12-15 | 2008-05-15 | Csl Behring Gmbh | Labeled primer, suitable for the detection of nucleic acids |
CA2273616A1 (en) | 1998-06-08 | 1999-12-08 | The Board Of Trustees Of The Leland Stanford Junior University | Method for parallel screening of allelic variation |
DE69816286T2 (en) | 1998-07-29 | 2004-05-27 | Keygene N.V. | A method for the detection of nucleic acid methylation by AFLP |
DE69928265T3 (en) | 1998-07-30 | 2013-11-28 | Illumina Cambridge Ltd. | Matrices of biomolecules and their use in sequencing |
WO2002061126A2 (en) | 2001-01-30 | 2002-08-08 | Solexa Ltd. | The preparation of polynucleotide arrays |
US6232067B1 (en) * | 1998-08-17 | 2001-05-15 | The Perkin-Elmer Corporation | Adapter directed expression analysis |
US6703228B1 (en) | 1998-09-25 | 2004-03-09 | Massachusetts Institute Of Technology | Methods and products related to genotyping and DNA analysis |
EP1001037A3 (en) | 1998-09-28 | 2003-10-01 | Whitehead Institute For Biomedical Research | Pre-selection and isolation of single nucleotide polymorphisms |
US6361947B1 (en) | 1998-10-27 | 2002-03-26 | Affymetrix, Inc. | Complexity management and analysis of genomic DNA |
US6958225B2 (en) | 1999-10-27 | 2005-10-25 | Affymetrix, Inc. | Complexity management of genomic DNA |
US6480791B1 (en) * | 1998-10-28 | 2002-11-12 | Michael P. Strathmann | Parallel methods for genomic analysis |
AU758630B2 (en) | 1998-11-06 | 2003-03-27 | Solexa Ltd. | A method for reproducing molecular arrays |
WO2000040755A2 (en) | 1999-01-06 | 2000-07-13 | Cornell Research Foundation, Inc. | Method for accelerating identification of single nucleotide polymorphisms and alignment of clones in genomic sequencing |
US20040029155A1 (en) | 1999-01-08 | 2004-02-12 | Curagen Corporation | Method for identifying a biomolecule |
DE19911130A1 (en) | 1999-03-12 | 2000-09-21 | Hager Joerg | A method for identifying chromosomal regions and genes |
AU3567900A (en) | 1999-03-30 | 2000-10-16 | Solexa Ltd. | Polynucleotide sequencing |
EP1190092A2 (en) * | 1999-04-06 | 2002-03-27 | Yale University | Fixed address analysis of sequence tags |
US7169552B1 (en) | 1999-04-09 | 2007-01-30 | Keygene N.V. | Detection of polymorphisms in AFLP fragments using primer extension techniques |
US20020119448A1 (en) | 1999-06-23 | 2002-08-29 | Joseph A. Sorge | Methods of enriching for and identifying polymorphisms |
US20030204075A9 (en) | 1999-08-09 | 2003-10-30 | The Snp Consortium | Identification and mapping of single nucleotide polymorphisms in the human genome |
AU7712400A (en) | 1999-09-23 | 2001-04-24 | Gene Logic, Inc. | Indexing populations |
EP1218543A2 (en) | 1999-09-29 | 2002-07-03 | Solexa Ltd. | Polynucleotide sequencing |
US6287778B1 (en) | 1999-10-19 | 2001-09-11 | Affymetrix, Inc. | Allele detection using primer extension with sequence-coded identity tags |
AU1413601A (en) | 1999-11-19 | 2001-06-04 | Takara Bio Inc. | Method of amplifying nucleic acids |
GB0002310D0 (en) | 2000-02-01 | 2000-03-22 | Solexa Ltd | Polynucleotide sequencing |
GB0002389D0 (en) | 2000-02-02 | 2000-03-22 | Solexa Ltd | Molecular arrays |
WO2001075167A1 (en) | 2000-03-31 | 2001-10-11 | Fred Hutchinson Cancer Research Center | Reverse genetic strategy for identifying functional mutations in genes of known sequence |
US20040053236A1 (en) * | 2001-03-30 | 2004-03-18 | Mccallum Claire M. | Reverse genetic strategy for identifying functional mutations in genes of known sequences |
EP1282729A2 (en) | 2000-05-15 | 2003-02-12 | Keygene N.V. | Microsatellite-aflp |
US7300751B2 (en) | 2000-06-30 | 2007-11-27 | Syngenta Participations Ag | Method for identification of genetic markers |
DE60131903T2 (en) | 2000-10-24 | 2008-11-27 | The Board of Trustees of the Leland S. Stanford Junior University, Palo Alto | Direct multiplex characterization of genomic dna |
EP1386005A1 (en) | 2001-04-20 | 2004-02-04 | Karolinska Innovations AB | Methods for high throughput genome analysis using restriction site tagged microarrays |
EP1417475A4 (en) | 2001-07-06 | 2006-06-28 | 454 Corp | Method for isolation of independent, parallel chemical micro-reactions using a porous filter |
EP1362929A3 (en) | 2002-05-17 | 2004-05-19 | Affymetrix, Inc. | Methods for genotyping |
GB0119719D0 (en) | 2001-08-13 | 2001-10-03 | Solexa Ltd | DNA sequence analysis |
US6902921B2 (en) | 2001-10-30 | 2005-06-07 | 454 Corporation | Sulfurylase-luciferase fusion proteins and thermostable sulfurylase |
AT431354T (en) | 2002-08-23 | 2009-05-15 | Illumina Cambridge Ltd | Marked nucleotide |
EP2607369B1 (en) | 2002-08-23 | 2015-09-23 | Illumina Cambridge Limited | Modified nucleotides for polynucleotide sequencing |
US7057026B2 (en) | 2001-12-04 | 2006-06-06 | Solexa Limited | Labelled nucleotides |
US6815167B2 (en) | 2002-04-25 | 2004-11-09 | Geneohm Sciences | Amplification of DNA to produce single-stranded product of defined sequence and length |
US7108976B2 (en) | 2002-06-17 | 2006-09-19 | Affymetrix, Inc. | Complexity management of genomic DNA by locus specific amplification |
WO2004001074A1 (en) * | 2002-06-21 | 2003-12-31 | Lynx Therapeutics, Inc. | Method for detecting foreign dna in a host genome |
AT358182T (en) | 2002-09-05 | 2007-04-15 | Plant Bioscience Ltd | Genome division |
US20040157238A1 (en) | 2002-09-20 | 2004-08-12 | Quinn John J. | Method for detection of multiple nucleic acid sequence variations |
EP1567669B1 (en) | 2002-12-02 | 2010-03-24 | Illumina Cambridge Limited | Determination of methylation of nucleic acid sequences |
CA2510381C (en) | 2002-12-18 | 2014-07-08 | Third Wave Technologies, Inc. | Detection of small nucleic acids |
JP2004208586A (en) | 2002-12-27 | 2004-07-29 | Wakunaga Pharmaceut Co Ltd | Detection of hla(human leukocyte antigen) |
US7970548B2 (en) | 2003-01-10 | 2011-06-28 | Keygene N.V. | AFLP-based method for integrating physical and genetic maps |
EP2145955B1 (en) | 2003-01-29 | 2012-02-22 | 454 Life Sciences Corporation | Bead emulsion nucleic acid amplification |
GB0304371D0 (en) | 2003-02-26 | 2003-04-02 | Solexa Ltd | DNA Sequence analysis |
JP4691014B2 (en) | 2003-02-26 | 2011-06-01 | カリダ ゲノミクス,インコーポレーテッド | Random array DNA analysis by hybridization |
JP4888876B2 (en) | 2003-06-13 | 2012-02-29 | 英 夫 原 | Recombinant adeno-associated virus vector for the treatment of Alzheimer's disease |
GB0320059D0 (en) | 2003-08-27 | 2003-10-01 | Solexa Ltd | A method of sequencing |
US7365179B2 (en) | 2003-09-09 | 2008-04-29 | Compass Genetics, Llc | Multiplexed analytical platform |
US20050153317A1 (en) | 2003-10-24 | 2005-07-14 | Metamorphix, Inc. | Methods and systems for inferring traits to breed and manage non-beef livestock |
GB0326073D0 (en) | 2003-11-07 | 2003-12-10 | Solexa Ltd | Improvements in or relating to polynucleotide arrays |
WO2005065814A1 (en) | 2004-01-07 | 2005-07-21 | Solexa Limited | Modified molecular arrays |
GB0400584D0 (en) | 2004-01-12 | 2004-02-11 | Solexa Ltd | Nucleic acid chacterisation |
GB0400974D0 (en) | 2004-01-16 | 2004-02-18 | Solexa Ltd | Multiple inexact matching |
US20050233354A1 (en) * | 2004-01-22 | 2005-10-20 | Affymetrix, Inc. | Genotyping degraded or mitochandrial DNA samples |
GB0402895D0 (en) | 2004-02-10 | 2004-03-17 | Solexa Ltd | Arrayed polynucleotides |
US7407757B2 (en) | 2005-02-10 | 2008-08-05 | Population Genetics Technologies | Genetic analysis by sequence-specific sorting |
US7393665B2 (en) | 2005-02-10 | 2008-07-01 | Population Genetics Technologies Ltd | Methods and compositions for tagging and identifying polynucleotides |
DE602005018166D1 (en) | 2004-02-12 | 2010-01-21 | Population Genetics Technologi | Genetic analysis by sequence-specific sorting |
US7709262B2 (en) | 2004-02-18 | 2010-05-04 | Trustees Of Boston University | Method for detecting and quantifying rare mutations/polymorphisms |
EP1574585A1 (en) | 2004-03-12 | 2005-09-14 | Plant Research International B.V. | Method for selective amplification of DNA fragments for genetic fingerprinting |
US7220549B2 (en) | 2004-12-30 | 2007-05-22 | Helicos Biosciences Corporation | Stabilizing a nucleic acid for nucleic acid sequencing |
DE602006019855D1 (en) | 2005-05-10 | 2011-03-10 | Oregon State | Method of mapping polymorphisms and polymorphism microarrays |
JP5220597B2 (en) | 2005-06-23 | 2013-06-26 | キージーン ナムローゼ フェンノートシャップ | Method for identifying one or more polymorphisms and methods of use thereof |
CN101278058A (en) | 2005-06-23 | 2008-10-01 | 科因股份有限公司 | Improved strategies for sequencing complex genomes using high throughput sequencing technologies |
US20070020640A1 (en) | 2005-07-21 | 2007-01-25 | Mccloskey Megan L | Molecular encoding of nucleic acid templates for PCR and other forms of sequence analysis |
CA2623539C (en) | 2005-09-29 | 2015-12-15 | Keygene N.V. | High throughput screening of mutagenized populations |
CN101310024B (en) | 2005-11-14 | 2012-10-03 | 科因股份有限公司 | Method for high throughput screening of transposon tagging populations and massive parallel sequence identification of insertion sites |
WO2007087312A2 (en) | 2006-01-23 | 2007-08-02 | Population Genetics Technologies Ltd. | Molecular counting |
-
2007
- 2007-04-04 WO PCT/NL2007/000094 patent/WO2007114693A2/en active Application Filing
- 2007-04-04 EP EP15158621.1A patent/EP2963127B1/en active Active
- 2007-04-04 EP EP07747276.9A patent/EP2002017B1/en active Active
- 2007-04-04 ES ES15158621.1T patent/ES2645661T3/en active Active
- 2007-04-04 JP JP2009504137A patent/JP5389638B2/en active Active
- 2007-04-04 PT PT151586211T patent/PT2963127T/en unknown
- 2007-04-04 DK DK07747276.9T patent/DK2002017T3/en active
- 2007-04-04 US US12/296,009 patent/US20090253581A1/en not_active Abandoned
- 2007-04-04 EP EP17163116.1A patent/EP3239304A1/en active Pending
- 2007-04-04 ES ES07747276.9T patent/ES2545264T3/en active Active
-
2012
- 2012-02-02 US US13/364,799 patent/US20120135871A1/en not_active Abandoned
- 2012-04-18 US US13/449,629 patent/US20120202698A1/en not_active Abandoned
-
2013
- 2013-07-29 JP JP2013156870A patent/JP2013215212A/en not_active Withdrawn
-
2014
- 2014-05-22 US US14/285,430 patent/US10023907B2/en active Active
-
2016
- 2016-07-01 HK HK16107675.9A patent/HK1219761A1/en unknown
-
2018
- 2018-06-05 US US16/000,252 patent/US20180291439A1/en not_active Abandoned
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10266883B2 (en) | 2009-04-30 | 2019-04-23 | Prognosys Biosciences, Inc. | Nucleic acid constructs and methods of use |
US10266884B2 (en) | 2009-04-30 | 2019-04-23 | Prognosys Biosciences, Inc. | Nucleic acid constructs and methods of use |
US10308982B2 (en) | 2010-04-05 | 2019-06-04 | Prognosys Biosciences, Inc. | Spatially encoded biological assays |
US10472669B2 (en) | 2010-04-05 | 2019-11-12 | Prognosys Biosciences, Inc. | Spatially encoded biological assays |
US10480022B2 (en) | 2010-04-05 | 2019-11-19 | Prognosys Biosciences, Inc. | Spatially encoded biological assays |
Also Published As
Publication number | Publication date |
---|---|
EP2963127A1 (en) | 2016-01-06 |
JP2013215212A (en) | 2013-10-24 |
PT2963127T (en) | 2017-10-06 |
EP2963127B1 (en) | 2017-08-16 |
EP3239304A1 (en) | 2017-11-01 |
DK2002017T3 (en) | 2015-09-07 |
US20090253581A1 (en) | 2009-10-08 |
WO2007114693A3 (en) | 2007-12-21 |
JP2009536817A (en) | 2009-10-22 |
HK1219761A1 (en) | 2017-04-13 |
ES2545264T3 (en) | 2015-09-09 |
US20120202698A1 (en) | 2012-08-09 |
US10023907B2 (en) | 2018-07-17 |
US20140303007A1 (en) | 2014-10-09 |
EP2002017A2 (en) | 2008-12-17 |
JP5389638B2 (en) | 2014-01-15 |
WO2007114693A2 (en) | 2007-10-11 |
US20120135871A1 (en) | 2012-05-31 |
EP2002017B1 (en) | 2015-06-10 |
ES2645661T3 (en) | 2017-12-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6291181B1 (en) | Nucleic acid adapters containing a type IIs restriction site and methods of using the same | |
US9309556B2 (en) | Direct capture, amplification and sequencing of target DNA using immobilized primers | |
US6977162B2 (en) | Rapid analysis of variations in a genome | |
EP2341151B1 (en) | Methods for determining sequence variants using ultra-deep sequencing | |
US9822395B2 (en) | Methods for producing a paired tag from a nucleic acid sequence and methods of use thereof | |
US7166429B2 (en) | Method for generating oligonucleotides, in particular for the detection of amplified restriction fragments obtained using AFLP® | |
US20120135872A1 (en) | Methods of fetal abnormality detection | |
US7745178B2 (en) | Complexity management of genomic DNA | |
US20070259357A1 (en) | Nucleic acid analysis using sequence tokens | |
AU2013382098B2 (en) | Methods and compositions for nucleic acid sequencing | |
CN103937899B (en) | Method for the high flux polymorphic detection based on AFLP | |
US7247428B2 (en) | Methods for rapid screening of polymorphisms, mutations and methylation | |
ES2338459T3 (en) | Exploration of high redemption of mutagenized populations. | |
EP0530009B1 (en) | Method of characterising genomic DNA | |
US20060088826A1 (en) | Discrimination and detection of target nucleotide sequences using mass spectrometry | |
US20040002090A1 (en) | Methods for detecting genome-wide sequence variations associated with a phenotype | |
US20170107560A1 (en) | Nucleic acid enrichment using cas9 | |
KR101862756B1 (en) | 3-D genomic region of interest sequencing strategies | |
ES2387878T3 (en) | Strategies for the identification of high performance and the detection of polymorphisms | |
CN1882703B (en) | Multiplexed nucleic acid analysis by fragmentation of double-stranded DNA | |
EP1256632A2 (en) | High throughput polymorphism screening | |
DE60219199T2 (en) | Analysis and detection of multiple target sequences using circular samples | |
AU2010330936B2 (en) | Restriction enzyme based whole genome sequencing | |
US20070287151A1 (en) | Methods and Means for Nucleic Acid Sequencing | |
US20100028873A1 (en) | Methods and means for nucleic acid sequencing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
STCB | Information on status: application discontinuation |
Free format text: EXPRESSLY ABANDONED -- DURING EXAMINATION |