NL2026403B1 - Next generation sequencing method of zoobenthos cytochrome oxidase subunit i gene and use thereof - Google Patents
Next generation sequencing method of zoobenthos cytochrome oxidase subunit i gene and use thereof Download PDFInfo
- Publication number
- NL2026403B1 NL2026403B1 NL2026403A NL2026403A NL2026403B1 NL 2026403 B1 NL2026403 B1 NL 2026403B1 NL 2026403 A NL2026403 A NL 2026403A NL 2026403 A NL2026403 A NL 2026403A NL 2026403 B1 NL2026403 B1 NL 2026403B1
- Authority
- NL
- Netherlands
- Prior art keywords
- dna
- pcr
- ngs
- gene
- library
- Prior art date
Links
- 238000007481 next generation sequencing Methods 0.000 title claims abstract description 48
- 238000000034 method Methods 0.000 title claims abstract description 38
- 101000921808 Haemophilus influenzae (strain ATCC 51907 / DSM 11121 / KW20 / Rd) Probable cytochrome oxidase subunit 1 Proteins 0.000 title claims description 13
- 238000003752 polymerase chain reaction Methods 0.000 claims abstract description 45
- 238000012163 sequencing technique Methods 0.000 claims abstract description 17
- 230000003321 amplification Effects 0.000 claims abstract description 13
- 238000003199 nucleic acid amplification method Methods 0.000 claims abstract description 13
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 9
- 101150087323 COI gene Proteins 0.000 claims description 29
- 208000035342 Isolated congenital onychodysplasia Diseases 0.000 claims description 25
- 201000010158 nonsyndromic congenital nail disorder 7 Diseases 0.000 claims description 25
- 238000012408 PCR amplification Methods 0.000 claims description 22
- 238000004925 denaturation Methods 0.000 claims description 18
- 230000036425 denaturation Effects 0.000 claims description 18
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 claims description 16
- 229910021642 ultra pure water Inorganic materials 0.000 claims description 12
- 239000012498 ultrapure water Substances 0.000 claims description 12
- 238000001514 detection method Methods 0.000 claims description 11
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 claims description 9
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 claims description 9
- 238000000137 annealing Methods 0.000 claims description 9
- 238000005138 cryopreservation Methods 0.000 claims description 9
- 238000012544 monitoring process Methods 0.000 claims description 8
- 239000011324 bead Substances 0.000 claims description 7
- 238000006243 chemical reaction Methods 0.000 claims description 6
- 108060002716 Exonuclease Proteins 0.000 claims description 4
- 238000007405 data analysis Methods 0.000 claims description 4
- 102000013165 exonuclease Human genes 0.000 claims description 4
- 238000001976 enzyme digestion Methods 0.000 claims description 3
- 108091008146 restriction endonucleases Proteins 0.000 claims description 3
- 230000002860 competitive effect Effects 0.000 claims description 2
- 239000000839 emulsion Substances 0.000 claims description 2
- 238000012175 pyrosequencing Methods 0.000 claims 2
- 102000000634 Cytochrome c oxidase subunit IV Human genes 0.000 abstract description 2
- 108050008072 Cytochrome c oxidase subunit IV Proteins 0.000 abstract description 2
- 108020004414 DNA Proteins 0.000 description 68
- 230000015572 biosynthetic process Effects 0.000 description 18
- 238000003786 synthesis reaction Methods 0.000 description 18
- 239000000523 sample Substances 0.000 description 17
- 241000894007 species Species 0.000 description 7
- 230000000877 morphologic effect Effects 0.000 description 6
- 241000566145 Otus Species 0.000 description 5
- 239000012634 fragment Substances 0.000 description 5
- 230000007613 environmental effect Effects 0.000 description 4
- 239000003344 environmental pollutant Substances 0.000 description 4
- 230000006353 environmental stress Effects 0.000 description 4
- 231100000719 pollutant Toxicity 0.000 description 4
- 238000005070 sampling Methods 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 3
- 102000053602 DNA Human genes 0.000 description 3
- 108010006785 Taq Polymerase Proteins 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 3
- 229910052760 oxygen Inorganic materials 0.000 description 3
- 239000001301 oxygen Substances 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 238000007400 DNA extraction Methods 0.000 description 2
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- 241000237852 Mollusca Species 0.000 description 2
- 230000004075 alteration Effects 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 238000012165 high-throughput sequencing Methods 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- 108091093088 Amplicon Proteins 0.000 description 1
- 241000243818 Annelida Species 0.000 description 1
- 241000254173 Coleoptera Species 0.000 description 1
- 241000238424 Crustacea Species 0.000 description 1
- 241000255925 Diptera Species 0.000 description 1
- 241001414834 Ephemeroptera Species 0.000 description 1
- 238000000342 Monte Carlo simulation Methods 0.000 description 1
- 241001277610 Oligochaeta <Asteraceae> Species 0.000 description 1
- BPQQTUXANYXVAA-UHFFFAOYSA-N Orthosilicate Chemical compound [O-][Si]([O-])([O-])[O-] BPQQTUXANYXVAA-UHFFFAOYSA-N 0.000 description 1
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 1
- 241000501478 Plecoptera <stoneflies, order> Species 0.000 description 1
- 241001414983 Trichoptera Species 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- XKMRRTOUMJRJIA-UHFFFAOYSA-N ammonia nh3 Chemical compound N.N XKMRRTOUMJRJIA-UHFFFAOYSA-N 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 238000003149 assay kit Methods 0.000 description 1
- 239000012472 biological sample Substances 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 238000007621 cluster analysis Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 239000012470 diluted sample Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000012921 fluorescence analysis Methods 0.000 description 1
- 239000013505 freshwater Substances 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 239000002773 nucleotide Substances 0.000 description 1
- 125000003729 nucleotide group Chemical group 0.000 description 1
- 238000001543 one-way ANOVA Methods 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- 239000011574 phosphorus Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000035882 stress Effects 0.000 description 1
- 238000007671 third-generation sequencing Methods 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Immunology (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Analytical Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The present invention relates to the technical field of zoobenthos identification, and in particular to a next generation sequencing (NGS) method of a zoobenthos cytochrome oxidase subunit T (COT) gene and use thereof. The method 5 includes the following steps: Sl, amplifying the COT gene by polymerase chain reaction (PCR) with a DNA sample of a mixed zoobenthos sample as a template; SZ, amplifying a PCR.product obtained in step Sl as a template by PCR, to obtain a PCR product as a DNA library, where an amplified primer pair includes a 10 pyroseguencing—NGS adapter and each sample corresponds to one pyroseguencing—NGS adapter; and S3, conducting guality detectioncm1aresultinglflwxlibraryy andselectingeagualified DNA_library to conduct emPCR.amplification.and.NGS. The method obtains a high—throughput full—length (~658 bp) COT gene 15 sequence with read length covering large benthic communities by all—at—once sequencing, providing a large taxonomic resolutionandavoidingfalsepositivesdueixnexcessivelyhigh throughput.
Description
TECHNICAL FIELD The present invention relates to the technical field of zoobenthos identification, and in particular to a next generation sequencing (NGS) method of a zoobenthos cytochrome oxidase subunit I (COI) gene and use thereof.
BACKGROUND Benthic invertebrates usually refer to invertebrate communities that live on the bottom of a water body in the whole or most of life history, principally including molluscs, aquatic insects, annelida, and crustaceans. In water ecological monitoring, benthic invertebrates are essential for evaluating ecological conditions of the freshwater ecosystem. Morphological characteristics play a very important role in species identification but are easily limited by biological sex and specific developmental stages and influenced by phenotypic plasticity and genetic variability; moreover, ubiquitous cryptic species and high demands for morphological identification lead to misidentification easily. Scientists anticipate that there are a total of more than 10-15 million biological species globally. If these species are identified by using conventional morphological methods, all taxonomists worldwide will spend thousands of years working together to achieve complete identification. Therefore, species identification urgently needs a new technical method. DNA barcoding technology developed in recent years just fulfills this need. Cytochrome oxidase subunit I (COI) is ubiquitous in mitochondria. The gene is functionally conserved but has certain evolution rate; with a substantial amount of information, the gene is widely used in research on DNA barcoding. DNA barcoding is weak in mixed samples and environmental samples, but NGS solves the problem that DNA barcoding can merely identify a single individual. However, Illumina platforms in NGS are limited by read length, most of which are sequenced based on ~313 bp COI fragments; even if a maximum read length of 300 PE is used, 658 bp COI barcode still has a gap of 100 bp, such that a full length of barcode region cannot be obtained by all-at-once sequencing; the presence of gap substantially influences data integrity, so as not to meet the requirement for species identification. Moreover, Illumina sequencing, due to excessively high throughput, detects non-target DNA amplicons with an extremely low copy number very easily. With longer read length, third generation sequencing, such as Oxford nancpore and PacBio SMAT, can reduce splicing costs; however, because of high error rate of single read length, there is a need for error correction by repeated sequencing, leading to an increase in sequencing cost, less abundant bioinformatics software, and low data accumulation.
SUMMARY Therefore, the technical problem to be solved in the present invention is to provide an NGS method of a zoobenthos COI gene and use thereof. With high throughput of the sequencing method, data for ~658 bp COI standard barcode gene is acquired by all-at-once sequencing, with high accuracy, providing a larger taxonomic resolution.
For this purpose, the technical solutions of the present invention are as follows: An NGS method of a zoobenthos COI gene includes the following steps: 51, amplifying the COI gene by polymerase chain reaction (PCR) with a DNA sample of a mixed zoobenthos sample as a template; S2, amplifying a PCR product obtained in step Sl as a template by PCR, to obtain a PCR product as a DNA library, wherein an amplified primer pair comprises a forward primer and a reverse primer, the primer pair comprises a pyrosequencing-NGS adapter, and each sample corresponds to one pyrosequencing-NGS adapter; and 53, conducting quality detection on a resulting DNA library obtained in step S2, and selecting a qualified DNA library to conduct emulsion PCR (emPCR) amplification and NGS.
For the method, in step Sl, a system of the PCRamplification is as follows: 10x Buffer, containing 20 mM Mg?** plus, 5 ul; dNTP, 2.5 mM, 4 ul; forward primer COIF, 5 uM, 2 ul; reverse primer COIR, 5 uM, 2 ul; Ex Tag DNA Polymerase, 5 U/ul, 0.25 ul; DNA template, 5 ng; and ultrapure water, 35.75 ul.
For the method, in step Sl, a reaction program of the PCR amplification is as follows: initial denaturation at 95°C for 3 min; 30-35 cycles of denaturation at 95°C for 30 s, annealing at 55°C for 30 s, and extension at 72°C for 45 s; 72°C for 7 min, and cryopreservation at 4°C.
For the method, in step S2, a system of the PCR amplification is as follows: 10x Buffer, containing 20 mM Mg?" plus, 5 ul; dNTP, 2.5 mM, 4 pl; 454AL+MID +COIF, 5 uM, 2 ul; 454B+ COIR, 5 uM, 2 ul; Ex Tag DNA Polymerase, 5 U/ul, 0.25 ul; DNA template, 1 ng; and ultrapure water, 35.75 ul.
For the method, in step S2, a reaction program of the PCR amplification is as follows: initial denaturation at 925°C for 3 min; 18 cycles of denaturation at 95°C for 30 s, annealing at 68°C for 30 s, and extension at 72°C for 45 s; 72°C for 7 min, and cryopreservation at 4°C.
For the method, in step S3, the quality detection is conducted on the library by quantitative competitive polymerase chain reaction (QC PCR), and a system of the QC PCR is as follows: 10x Buffer, containing 20 mM Mg?" plus, 5 ul;
dNTP, 2.5 mM, 8 nl; forward primer QC-F, 100 uM, 1 ul; reverse primer QC-R, 100 pM, 1 ul; Ex Taq DNA Polymerase, 5 U/ul, 1 ul; DNA library, 1 x 10% molecules/ul, 2 ul; and ultrapure water, 32 pl. For the method, in step S83, the quality detection is conducted on the library by QC PCR, and a program of the QC PCR is as follows: initial denaturation at 94°C for 11 min; 30 cycles of denaturation at 94°C for 1 min, annealing at 60°C for 1 min, and extension at 72°C for 1 min; 72°C for 10 min, and cryopreservation at 4°C. For the method, in step S3, a reaction system after QC PCR is incubated with exonuclease, reaction systems before and after restriction enzyme digestion are electrophoresed, and DNA libraries with a sequencing library length of ~750 bp are selected as qualified DNA libraries. For the method, in step 83, during the emPCR amplification, the DNA libraries are subjected to emPCR lv amplification in a ratio of the number of DNA molecules to the DNA capture beads count, 1.e., 3. For the method, step S3 further includes a step of data analysis of sequencing results. An NGS kit for a zoobenthos COI gene is provided, including a primer pair used for amplifying the zoobenthos COI gene, the system of the PCR amplification, the system of the PCR amplification, and/or the system of the QC PCR. Use of the NGS kit for a zoobenthos COI gene in obtaining a zoobenthos COI gene barcode is provided; preferably, the use includes use thereof in water ecological monitoring. The technical solutions of the present invention have the following advantages:
1. The NGS method of a zoobenthos COI gene provided by the present invention includes the following steps: Sl, amplifying the COI gene by PCR with a DNA sample of a mixed zoobenthos sample as a template; S2, amplifying a PCR product obtained in step S1 as a template by PCR, to obtain a PCR product as a DNA library, where an amplified primer pair includes a forward primer and a reverse primer, the primer pair includes a pyrosequencing-NGS adapter, and each sample corresponds to one pyrosequencing-NGS adapter; and S3, conducting quality 5 detection on a resulting DNA library obtained in step S2, and selecting a qualified DNA library to conduct emPCR amplification and NGS. The NGS method of a zoobenthos COI gene obtains a high-throughput full-length (~658 bp) COI gene sequence with read length covering large benthic communities by all-at-once sequencing, providing a large taxonomic resolution and avoiding false positives due to excessively high throughput.
2. The use of the NGS kit for a zoobenthos COI gene provided by the present invention in obtaining a zoobenthos COI gene barcode is provided; the use includes the use thereof in water ecological monitoring; environmental stress factors influencing macrobenthos in the Wei River Basin are identified efficiently as conductivity and total nitrogen by the NGS kit for a zoobenthos COI gene and/or the NGS method of a zoobenthos COI gene.
BRIEF DESCRIPTION OF THE DRAWINGS To describe the technical solutions in the examples of the present invention or in the prior art more clearly, the following briefly describes the accompanying drawings required for describing the examples or the prior art. Apparently, the accompanying drawings in the following description show some examples of the present invention, and a person of ordinary skill in the art may still derive other drawings from these accompanying drawings without creative efforts.
FIG. 1 illustrates a standard curve of precise and quantitative double-stranded deoxyribonucleic acid (dsDNA) based on fluorescence analysis in Example 1; FIG. 2 is a gel electrophoresis map of detection of COI gene libraries by Agilent 2100 Bioanalyzer in Example 1; FIG. 3 is an electrophoresis map of detection of COI gene libraries by Agilent 2100 Bioanalyzer in Example 1; FIG. 4 is a plot of dilution curves of different samples in Example 1 of the present invention.
DETAILED DESCRIPTION The following examples serve to provide further appreciation of the present invention, but are not limited to the preferred examples and do not limit the spirit and scope of the invention; any product that is the same as or similar to the inventionmade in light of the invention or by combination of the present invention with other features of the prior art shall fall within the scope of the invention.
If specific experimental procedure or conditions are not indicated in the present invention, operations or conditions of conventional experimental procedures known in the art shall be used. If manufacturers of reagents and apparatus used are not indicated, conventional reagent products may be commercially available.
Example 1 The example provided an NGS method of a zoobenthos COI gene, including the following steps: First, samples related to the example were prepared as follows: Representative sampling points were set up according to the geographical and water quality characteristics of the water body. Zoobenthos samples were collected from different habitats using a Surber net (30 x 30 cm, tuck net aperture:
0.5 mm) or a Petersen grab (open area: 1/16 m*); three parallel samples were collected at each point, which were placed in 300 mL sample bottles after cleaning and rough selection and fixed in 95% alcohol for storage.
(1) Mixed biological samples prepared above were extracted by a tissue genomic DNA extraction kit; DNA concentrations were determined and uniformly diluted to a final concentration of 5 ng/ul. The COT gene was amplified by PCR using the same concentration of diluted DNA as a template. The forward primer COIF is shown as SEQ ID NO. 1, and the reverse primer COIR is shown as SEQ ID NO. 2. A system of the PCR amplification was as follows: 10x Buffer, containing 20 mM Mg** plus, 5 ul; dNTP, 2.5 mM, 4 ul; forward primer COIF, 5 uM, 2 ul; reverse primer COIR, 5 uM, 2 ul; Ex Tag DNA Polymerase, 5 U/ul, 0.25 ul; DNA template, 5 ng; and ultrapure water, 35.75 ul. A reaction program of the PCR amplification was as follows: initial denaturation at 95°C for 3 min; 30-35 cycles of denaturation at 95°C for 30 s, annealing at 55°C for 30 s, and extension at 72°C for 45 5; 72°C for 7 min, and cryopreservation at 4°C. An amplified product was extracted and quantified by 1% agarose gel electrophoresis. (2) One nanogram of amplified product extracted in step (1) was amplified by PCR. A primer pair to be amplified included a forward primer and a reverse primer; the primer pair included a pyrosequencing-NGS adapter, and each sample corresponded to one pyrosequencing-NGS adapter, i.e., a forward pyrosequencing-NGS adapter was added at 5'-end of the primer COIF; a sequence of the forward pyrosequencing-NGS adapter (573') was composed of a 454AL sequence and an MID sequence successively. The 454AL sequence is shown as SEQ ID NO. 3. The MID sequence was composed of eight arbitrary nucleotides, and one MID sequence was designed for each sample (double~underlined sequences in Table 1) to further ensure that each sample corresponded to one pyroseguencing-NGS adapter. A reverse pyrosequencing-NGS adapter 454B sequence was added at 5'-end of the primer COIR. The 454B sequence is shown as SEQ ID NO. 4. Partial primer 454AL+MID (A1l-A12}) +COIF and primer 454B+ COIR are shown in Table 1. Table 1 Partial primer 454AL+MID{(A1-A12) +COIF and primer 454B+
454AL+A1+COIF ccatctcatccctgcgtgtctccgactcagacgagtgcggtcaacaaatcalaaagara SEQ ID NO:5 454AL+A2+COIF ccatctcatccctgcgtgtctccgactcagacgctcgaggtcaacaaatcalaaagara SEQ ID NO:6 454AL+A3+COIF ccatctcatccctgcgtgtctccgactcagagacgcacggtcaacaaatcalaaagara SEQ ID NO:7 454AL+A4+COIF ccatctcatccctgcgtgtctccgactcagagcactgtggtcaacaaatcalaaagara SEQ ID NO:3 454AL+A5+COIF ccatctcatccctgegtgtetccgactcagatcagacaggtcaacaaatrcataaagala SEQ ID NO:9 454ATL+A6+COIF ccatctcatccctgegtgtetccgactcagatetegcgggtcaacaaatrcataaagala SEQ ID NO:10
454AL+A7+COIF
Ccatctcatccctgcgtgtctccgactcagcgtgtctcggtcaacaaatcaltaaagara SEQ ID NO:11 454AL+A8+COIF ccatctcatccctgcgtgtctccgactcagctcgcgtgggtcaacaaatcalaaagara SEQ ID NO:12 454AL+A9+COIF ccatctcatccctgcgtgtctccgactcagtagtatcaggtcaacaaatcalaaagara SEQ ID NO:13 454AL+A10+COIF ccatctecatcecectgegtgtetecgactecagtctctatgggtcaacaaatcataaagata SEQ ID NO:14 454AL+A11+COIF ccatctcatccctgcgtgtctccgactcagtgatacgtggtcaacaaatcalaaagara SEQ ID NO:15 454AL+A12+COIF ccatctcatccctgcgtgtctccgactcagtactgagcggtcaacaaatcalaaagara SEQ ID NO:16
454B+COIR cctatcccectgtgtgcettggcagtctcagtaaacttcagggtgaccaaaaaatca
SEQ ID NO:17
A system of the PCR amplification was as follows: 10x Buffer, containing 20 mM Mg** plus, 5 ul; dNTP, 2.5 mM, 4 ul; 454AL+MID +COIF, 5 uM, 2 ul; 454B+ COIR, 5 uM, 2 ul; Ex Tag DNA Polymerase, 5 U/ul, 0.25 ul; DNA template, 1 ng; and ultrapure water, 35.75 ul.
A reaction program of the PCR amplification is as follows: initial denaturation at 95°C for 3 min; 18 cycles of denaturation at 95°C for 30 s, annealing at 68°C for 30 s, and extension at 72°C for 45 5; 72°C for 7 min, and cryopreservation at 4°C.
PCR amplification fragments were extracted by gel extraction.
Concentrations of extracted PCR amplification fragments were determined precisely by Promega Quant-iT dsDNA Assay Kit.
A standard curve was plotted as shown in FIG. 1, and the copy number of DNA molecules was calculated according to the following formula: copies/mL = (6,022x102: copies/mole) x concentration (ng/mL)/MW (g/mol). Each sample was diluted to 105 molecules/uL.
The above diluted samples were grouped; 10 ul each of each sample was pipetted and mixed well as a DNA library. (3) Quality detection was conducted on the DNA library by QC PCR.
Forward primer QC~-F was 5f GGTCAACAAATCATAAAGATATTGG-3 (SEQ ID NO: 18), and reverse primer QC-R was 5' -TAAACTTCAGGGTGACCAAAAAATCA-3’ (SEQ ID NO: 2). A system of the QC PCR was as follows: 10x Buffer, containing 20 mM Mg** plus, 5 ul; dNTP, 2.5 mM, 8 pul; forward primer QC-F, 100 pM, 1 ul; reverse primer QC-R, 100 pM, 1 ul; Ex Tag DNA Polymerase, 5 U/ul, 1 ul; DNA library, 1 x 10% molecules/ul, 2 ul; and ultrapure water, 32 ul.
A program of the QC PCR was as follows: initial denaturation at 94°C for 11 min; 30 cycles of denaturation at 94°C for 1 min, annealing at 60°C for 1 min, and extension at 72°C for
1 min; 72°C for 10 min, and cryopreservation at 4°C.
A reaction system after QC PCR was incubated with 1 ul of exonuclease for 1 hat 37°C, 1 ul each of reaction systems before and after restriction enzyme digestion was run on Agilent Bioanalyzer 2100 (DNA 7500 Chip). Sample libraries without small fragment amplification were qualified. Successfully constructed sequencing libraries had a length of ~750 bp, including a sequencing adapter and PCR primers.
The above qualified DNA libraries were subjected to emPCR amplification and high-throughput sequencing. The DNA libraries were subjected to emPCR lv amplification in a ratio of the number of DNA molecules to the DNA capture beads count, i.e., 3, and a given number of DNA capture bead sets were arranged in a sequencing plate for high-throughput sequencing.
Each sequencing region obtained 1,000,000 effective sequences, and each sample had a data quantity of approximately 1,000,000.
Example 2 Data analysis was conducted on sequencing results obtained in Example 1, including the following steps: 1) A list file of an MID sequence was input; Split-libraries.py in QIIME software was run to sort out MIDs of different samples, and noise sequences with degenerate bases, low-quality scores, chimeras, and a length of <150 bp were eliminated to acquire clean data.
2} The clean data acquired above were subjected to OTU clustering by using Usearch and CD-HIT software and OTU picking by using a plurality of thresholds (97%, 98%, etc.), to enhance the result reliability. Species annotation analysis was made by matching representative sequences of OTUs with 80% sequence similarity with sequences in the BOLD database, and a list of OTUs was generated.
3) Alpha-diversity indices of samples, such as Shannon index, Simpson index, and Chao index, were calculated based on relative abundances of OTUs. Sample cluster analysis was conducted by using the abundance data of sequences and the data for the presence or absence of OTUs; PCoA analysis was conducted on the dissimilarity of each sampling point using coefficients of the data of Jaccard (presence/absence) and Bray-Curtis (relative abundance). Example 3 Use in water ecological monitoring Environmental stress factors and main pollutants influencing macrobenthos communities in a river basin were determined based on the data acquired in Example 2: hydrochemical parameters were normalized by logarithmic transformation.
One-way ANOVA was conducted by analyzing the number of various environmental factors in operational taxonomic units among groups and differences in environmental factors, and then distance-based redundancy analysis (dbRDA) was conducted; meanwhile, ineffective environment variables were deleted gradually by previous picking (P>0.05); environmental stress factors and pollutants influencing the distribution of macrobenthos communities were selected by the Monte Carlo method, and an ordination plot was used to reflect the relationship between community distribution and pollutants and identify main stress factors and pollutants influencing macrobenthos communities in a river basin.
Example 4 The example provided an NGS kit for a zoobenthos COI gene, including a primer pair used for amplifying the zoobenthos COI gene, the system of the PCR amplification in step S1, the system of the PCR amplification in step S2, and/or the system of the QC PCR.
For the primer pair used for amplifying the zoobenthos COI gene, the forward primer COIF is shown as SEQ ID NO. 1, and the reverse primer COIR is shown as SEQ ID NO. 2. The system of the PCRamplification in step Sl was as follows: 10x Buffer, containing 20 mM Mg? plus, 5 ul; dNTP, 2.5 mM, 4 ul; forward primer COIF, 5 pM, 2 ul; reverse primer COIR, 5 uM, 2 ul; Ex Tag DNA Polymerase, 5 U/ul, 0.25 ul; DNA template, 5 ng; and ultrapure water, 35.75 nl.
The systemof the PCRamplification in step S2 was as follows:
10x Buffer, containing 20 mM Mg* plus, 5 ul; dNTP, 2.5 mM, 4 ul; 454AL+MID +COIF, 5 uM, 2 ul; 454B+ COIR, 5 uM, 2 ul; Ex Taq DNA Polymerase, 5 U/ul, 0.25 ul; DNA template, 1 ng; and ultrapure water, 35.75 ul.
The system of the QC PCR was as follows: 10x Buffer, containing 20 mM Mg* plus, 5 ul; dNTP, 2.5 mM, 8 ul; forward primer QC-F, 100 pM, 1 ul; reverse primer QC-R, 100 pM, 1 ul; Ex Taq DNA Polymerase, 5 U/ul, 1 ul; DNA library, 1 x 10% molecules/ul, 2 ul; and ultrapure water, 32 ul.
Example 5 The example provided use of the NGS method of a zoobenthos COI gene in Example 1 or the NGS kit for a zoobenthos COI gene in Example 3 in water ecological monitoring of the Wei River Basin, including the following steps: Sample collection and determination of physical and chemical parameters of water: Macrobenthos samples were collected from different habitats of river reaches of the Weil River Basin, fixed in 95% alcohol for storage, and taken back to the laboratory to complete DNA extraction. Water samples were collected in amber bottles, and national standard method was used to determine the total nitrogen (TN), total phosphorus (TP), ammonia-nitrogen (NH4'-N), biological oxygen demand (BODs)}), chemical oxygen demand (COD), and silicate content of the water body. A water quality meter (YSI) was used to determine the water temperature, pH, conductivity, salinity, dissolved oxygen, suspended matter, and total dissolved solids (TDS) .
COI standard barcoding and morphological identification: Macrobenthos samples were collected in duplicate from each sampling point: one was used for morphological identification, and the other was used for NGS based on COI barcoding. Herein,
morphological identification of zoobenthos refers to the Fauna Sinica and local faunas; the NGS method was the same as that in Example 1, the data analysis method was the same as that in Example 2, and the water ecological monitoring method was the same as that in Example 3.
Water ecological monitoring results of the Wei River Basin were as follows. Representative sequences of OTUs were annotated in the BOLD database. Creatures included aquatic insects (Ephemeroptera, Trichoptera, Plecoptera, Diptera, and Coleoptera), oligochaeta, and molluscs; 84.7% of sequences could be annotated at a genus level. Finally, conductivity and TN were identified as significant environmental stress factors influencing the benthic community distribution in the Wei River Basin.
Effect Example
1. Optimum ratio of the number of DNA molecules to the DNA capture beads count DNA libraries qualified in Example 1 were subjected to emPCR sv (SV EMPCR KIT (LIB-L)) assay to determine an optimum ratio of the number of DNA molecules to the DNA capture beads count. The number of DNA molecules with an enrichment ratio of 10%-20% was the optimum ratio. Table 2 shows the effects of the use of the DNA libraries in Example 1 in ratios of the number of DNA molecules to the DNA capture beads count (0.5, 1, and 4).
From the table, when CPB = 1, the enrichment ratio is 11%; when CPB = 4, the enrichment ratio is 25%. Thus, in order to ensure a high enrichment ratio and meet the optimum ratio, CPB = 3 is used for subsequent emPCR lv (LV emPCR Kit (Lib-1L)) amplification.
Table 2 emPCR sv data table Initial Ratio Library 1x107 mole/uL CPB = 4
2. QC PCR systems before and after exonuclease digestion in Example 1 were detected by Agilent Bicanalyzer 2100. A gel electrophoresis map is shown in FIG. 2, and an electrophoresis map is shown in FIG. 3. From these figures, DNA libraries prepared in Example 1 show no small fragment amplification after quality detection, indicating that the method of Example 1 is mature and reliable.
3. Dilution curves were plotted for different samples in Example 1. As shown in FIG. 4, the number of NGS sequences of the mixed sample at each sampling point reaches a plateau, and the number of sequences was reasonable, indicating that throughput conditions of the NGS method of a zoobenthos COI gene provided by the present invention fully satisfy the demands for subsequent research and use.
Obviously, the above examples are merely intended to describe the examples clearly, but not to limit the embodiment. Different forms of variations or alterations may also be made by those of ordinary skill in the art based on the above descriptions. All examples do not have to and cannot be presented herein. Obvious variations or alterations extended thereby still fall within the scope of the claimed invention.
Sequence listing-DEF
SEQUENCE LISTING <110> Chinese Research Academy of Environmental Sciences <120> NEXT GENERATION SEQUENCING METHOD OF ZOOBENTHOS CYTOCHROME OXIDASE
SUBUNIT I GENE AND USE THEREOF <130> HA201802876 <160> 18 <170> PatentIn version 3.3 <210> 1 <211> 21 <212> DNA <213> Artificial Synthesis <400> 1 ggtcaacaaa tcataaagat a 21 <210> 2 <211> 26 <212> DNA <213> Artificial Synthesis <400> 2 taaacttcag ggtgaccaaa aaatca 26 <210> 3 <211> 30 <212> DNA <213> Artificial Synthesis <400> 3 ccatctcatc cctgcgtgtc tccgactcag 30 <210> 4 <211> 30 <212> DNA <213> Artificial Synthesis <400> 4 cctatcccct gtgtgccttg gcagtctcag 30 <210> 5 <211> 59 <212> DNA <213> Artificial Synthesis Pagina 1
Sequence listing-DEF
<400> 5 ccatctcatccctgcgtgtctccgactcagacgagtgcggtcaacaaatcataaagata 59
<210> 6
<211> 59
<212> DNA
<213> Artificial Synthesis
<400> 6 ccatctcatccctgcgtgtctccgactcagacgctcgaggtcaacaaatcataaagata 59
<210> 7
<211> 59
<212> DNA
<213> Artificial Synthesis
<400> 7 ccatctcatccctgcgtgtctccgactcagacgctcgaggtcaacaaatcataaagata 59
<210> 8
<211> 59
<212> DNA
<213> Artificial Synthesis
<400> 8 ccatctcatccctgcgtgtctccgactcagagcactgtggtcaacaaatcataaagata 59
<210> 9
<211> 59
<212> DNA
<213> Artificial Synthesis
<400> 9 ccatctcatccctgcgtgtctccgactcagatcagacaggtcaacaaatcataaagata 59
<210> 10
<211> 59
<212> DNA
<213> Artificial Synthesis
<400> 10 ccatctcatccctgcgtgtctccgactcagatatcgcgggtcaacaaatcataaagata 59
Pagina 2
Sequence listing-DEF
<210> 11
<211> 59
<212> DNA
<213> Artificial Synthesis
<400> 11 ccatctcatccctgcgtgtctccgactcagcgtgtctcggtcaacaaatcataaagata
59
<210> 12
<211> 59
<212> DNA
<213> Artificial Synthesis
<400> 12 ccatctcatccctgcgtgtctccgactcagctcgcgtgggtcaacaaatcataaagata 59
<210> 13
<211> 59
<212> DNA
<213> Artificial Synthesis
<400> 13 ccatctcatccctgcgtgtctccgactcagtagtatcaggtcaacaaatcataaagata 59
<210> 14
<211> 59
<212> DNA
<213> Artificial Synthesis
<400> 14 ccatctcatccctgcgtgtctccgactcagtctctatgggtcaacaaatcataaagata 59
<210> 15
<211> 59
<212> DNA
<213> Artificial Synthesis
<400> 15 ccatctcatccctgcgtgtctccgactcagtgatacgtggtcaacaaatcataaagata
59
<210> 16
<211> 59
<212> DNA
<213> Artificial Synthesis
<400> 16 ccatctcatccctgcgtgtctccgactcagtactgagcggtcaacaaatcataaagata
Pagina 3
Sequence listing-DEF 59 <210> 17 <211> 56 <212> DNA <213> Artificial Synthesis <400> 17 cctatcccctgtgtgccttggcagtctcagtaaacttcagggtgaccaaaaaatca 56 <210> 18 <211> 25 <212> DNA <213> Artificial Synthesis <400> 18 ggtcaacaaatcataaagatattgg 25 Pagina 4
Claims (10)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910840869.XA CN110894526A (en) | 2019-09-05 | 2019-09-05 | High-throughput sequencing method for benthonic animal COI gene and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
NL2026403A NL2026403A (en) | 2021-05-04 |
NL2026403B1 true NL2026403B1 (en) | 2022-07-19 |
Family
ID=69785903
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
NL2026403A NL2026403B1 (en) | 2019-09-05 | 2020-09-03 | Next generation sequencing method of zoobenthos cytochrome oxidase subunit i gene and use thereof |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN110894526A (en) |
NL (1) | NL2026403B1 (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111593099A (en) * | 2020-06-11 | 2020-08-28 | 南京易基诺环保科技有限公司 | Method for monitoring freshwater benthonic animals by utilizing environmental sediment samples |
CN112359119A (en) * | 2020-11-18 | 2021-02-12 | 中国科学院生态环境研究中心 | Environmental DNA macro-bar code method for large-scale benthonic animal community structure research |
CN113142034B (en) * | 2021-03-26 | 2023-02-28 | 北京大学 | Method for synchronously identifying floating algae and benthic algae in aquatic ecosystem |
CN116590381A (en) * | 2023-05-12 | 2023-08-15 | 青岛理工大学 | Method for screening key water quality factors influencing river biodiversity by reclaimed water supplementing |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102719528A (en) * | 2012-04-27 | 2012-10-10 | 上海派森诺生物科技有限公司 | emPCR method of 454 high-throughput sequencing based on pyrophosphoric acid principle |
CN104531879B (en) * | 2015-01-06 | 2017-02-22 | 上海海洋大学 | Environment DNA identification method for fish community structure researching |
CN104630244A (en) * | 2015-01-26 | 2015-05-20 | 中国科学院海洋研究所 | COI gene sequence of species of coral reef benthos and species identification method utilizing COI gene sequence |
CN106560519A (en) * | 2016-06-11 | 2017-04-12 | 华中农业大学 | Amplimers for mitochondrial COI gene of Bellamya and application thereof |
CN107746889A (en) * | 2017-07-17 | 2018-03-02 | 国家海洋局北海环境监测中心 | For expanding the primer of benthic polychaeta annelid mitochondria CO I gene fragments |
-
2019
- 2019-09-05 CN CN201910840869.XA patent/CN110894526A/en active Pending
-
2020
- 2020-09-03 NL NL2026403A patent/NL2026403B1/en not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
CN110894526A (en) | 2020-03-20 |
NL2026403A (en) | 2021-05-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
NL2026403B1 (en) | Next generation sequencing method of zoobenthos cytochrome oxidase subunit i gene and use thereof | |
AU2019250200B2 (en) | Error Suppression In Sequenced DNA Fragments Using Redundant Reads With Unique Molecular Indices (UMIs) | |
Ushio et al. | Environmental DNA enables detection of terrestrial mammals from forest pond water | |
Gamble et al. | Restriction site-associated DNA sequencing (RAD-seq) reveals an extraordinary number of transitions among gecko sex-determining systems | |
Lind et al. | Genome of the Komodo dragon reveals adaptations in the cardiovascular and chemosensory systems of monitor lizards | |
Wangensteen et al. | Metabarcoding techniques for assessing biodiversity of marine animal forests | |
US10920275B2 (en) | Methods for diagnosis of tuberculosis | |
Cammen et al. | Genomic methods take the plunge: recent advances in high-throughput sequencing of marine mammals | |
Monaghan et al. | Accelerated species inventory on Madagascar using coalescent-based models of species delineation | |
Korpelainen et al. | RNA-seq data analysis: a practical approach | |
Thomson et al. | Developing markers for multilocus phylogenetics in non-model organisms: a test case with turtles | |
Nielsen et al. | ZZ/ZW sex chromosomes in the endemic Puerto Rican leaf-toed gecko (Phyllodactylus wirshingi) | |
CN109505012A (en) | A kind of kit of the mRNA bis- generations sequencing library building for FFPE sample and its application | |
Macé et al. | Evaluating bioinformatics pipelines for population‐level inference using environmental DNA | |
Cloutier et al. | First nuclear genome assembly of an extinct moa species, the little bush moa (Anomalopteryx didiformis) | |
US20240038327A1 (en) | Rapid single-cell multiomics processing using an executable file | |
Tsuji et al. | Evaluating intraspecific diversity of a fish population using environmental DNA: An approach to distinguish true haplotypes from erroneous sequences | |
CN113801945B (en) | Molecular marker C768 for identifying ammonia nitrogen tolerance character of portunus trituberculatus and application thereof | |
US20230203600A1 (en) | Method for treating cell population and method for analyzing genes included in cell population | |
JP7444488B2 (en) | Contamination detection method | |
Ogundolie et al. | Microbiome characterization and identification: key emphasis on molecular approaches | |
Mehni | Genomic sequencing of freshwater mussel unio tumidus using the minion | |
Chen et al. | Detection of fish species composition using environmental DNA in aquarium trials. | |
Barbaro | Overview of NGS platforms and technological advancements for forensic applications | |
Oliveira | Analysis of RAD sequencing data from species of Mediterranean cicadas |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
MM | Lapsed because of non-payment of the annual fee |
Effective date: 20231001 |