WO2022259995A1 - 真核微生物を定量するための内部標準核酸 - Google Patents
真核微生物を定量するための内部標準核酸 Download PDFInfo
- Publication number
- WO2022259995A1 WO2022259995A1 PCT/JP2022/022726 JP2022022726W WO2022259995A1 WO 2022259995 A1 WO2022259995 A1 WO 2022259995A1 JP 2022022726 W JP2022022726 W JP 2022022726W WO 2022259995 A1 WO2022259995 A1 WO 2022259995A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- nucleic acid
- acid sequence
- sequence
- seq
- flanking
- Prior art date
Links
- 150000007523 nucleic acids Chemical class 0.000 title claims abstract description 319
- 108020004707 nucleic acids Proteins 0.000 title claims abstract description 131
- 102000039446 nucleic acids Human genes 0.000 title claims abstract description 131
- 244000005700 microbiome Species 0.000 title description 21
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 160
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 39
- 108020005029 5' Flanking Region Proteins 0.000 claims abstract description 36
- 108020005065 3' Flanking Region Proteins 0.000 claims abstract description 35
- 230000000295 complement effect Effects 0.000 claims abstract description 16
- 239000002773 nucleotide Substances 0.000 claims description 31
- 125000003729 nucleotide group Chemical group 0.000 claims description 31
- 239000000523 sample Substances 0.000 claims description 26
- 108700022487 rRNA Genes Proteins 0.000 claims description 13
- 239000013604 expression vector Substances 0.000 claims description 12
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 2
- 108091036078 conserved sequence Proteins 0.000 description 38
- 108020004418 ribosomal RNA Proteins 0.000 description 29
- 108091023242 Internal transcribed spacer Proteins 0.000 description 22
- 108020004465 16S ribosomal RNA Proteins 0.000 description 15
- 239000002689 soil Substances 0.000 description 15
- 241000736262 Microbiota Species 0.000 description 13
- 108020004414 DNA Proteins 0.000 description 11
- 241000233866 Fungi Species 0.000 description 10
- 238000004458 analytical method Methods 0.000 description 10
- 210000004027 cell Anatomy 0.000 description 10
- 238000011002 quantification Methods 0.000 description 9
- 108020004463 18S ribosomal RNA Proteins 0.000 description 8
- 108091093088 Amplicon Proteins 0.000 description 8
- 230000003321 amplification Effects 0.000 description 8
- 238000000034 method Methods 0.000 description 8
- 238000003199 nucleic acid amplification method Methods 0.000 description 8
- 238000006243 chemical reaction Methods 0.000 description 6
- 241000894006 Bacteria Species 0.000 description 5
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 238000000605 extraction Methods 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 239000000243 solution Substances 0.000 description 4
- 238000011144 upstream manufacturing Methods 0.000 description 4
- 208000010603 vasculitis due to ADA2 deficiency Diseases 0.000 description 4
- 108020004565 5.8S Ribosomal RNA Proteins 0.000 description 3
- 102000053602 DNA Human genes 0.000 description 3
- 230000002538 fungal effect Effects 0.000 description 3
- 239000013612 plasmid Substances 0.000 description 3
- 239000013600 plasmid vector Substances 0.000 description 3
- GUAHPAJOXVYFON-ZETCQYMHSA-N (8S)-8-amino-7-oxononanoic acid zwitterion Chemical compound C[C@H](N)C(=O)CCCCCC(O)=O GUAHPAJOXVYFON-ZETCQYMHSA-N 0.000 description 2
- 108020000946 Bacterial DNA Proteins 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 108020000949 Fungal DNA Proteins 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- 239000011259 mixed solution Substances 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 238000001668 nucleic acid synthesis Methods 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- 235000001674 Agaricus brunnescens Nutrition 0.000 description 1
- 241000224489 Amoeba Species 0.000 description 1
- 241000633183 Anaerolinea Species 0.000 description 1
- 240000006439 Aspergillus oryzae Species 0.000 description 1
- 235000002247 Aspergillus oryzae Nutrition 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 108010077805 Bacterial Proteins Proteins 0.000 description 1
- 241000606215 Bacteroides vulgatus Species 0.000 description 1
- 241000222178 Candida tropicalis Species 0.000 description 1
- 241000192731 Chloroflexus aurantiacus Species 0.000 description 1
- 241000193401 Clostridium acetobutylicum Species 0.000 description 1
- 241000579714 Deinococcus grandis Species 0.000 description 1
- 241000228124 Desulfitobacterium hafniense Species 0.000 description 1
- 241000228138 Emericella Species 0.000 description 1
- 241000620209 Escherichia coli DH5[alpha] Species 0.000 description 1
- 241001131785 Escherichia coli HB101 Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108700005088 Fungal Genes Proteins 0.000 description 1
- 241000719997 Gemmatimonas aurantiaca Species 0.000 description 1
- 241001069755 Hymenoscyphus varicosporoides Species 0.000 description 1
- 241001618116 Marasmius purpureostriatus Species 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 241000605154 Nitrobacter winogradskyi Species 0.000 description 1
- 241000605121 Nitrosomonas europaea Species 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 241000223785 Paramecium Species 0.000 description 1
- 241000589776 Pseudomonas putida Species 0.000 description 1
- 241000235346 Schizosaccharomyces Species 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 241000589907 Treponema bryantii Species 0.000 description 1
- 241000499912 Trichoderma reesei Species 0.000 description 1
- 241000222126 [Candida] glabrata Species 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 238000003149 assay kit Methods 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 208000032343 candida glabrata infection Diseases 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 210000000936 intestine Anatomy 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 244000005706 microflora Species 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 239000002096 quantum dot Substances 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 239000013598 vector Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
- C12N15/81—Vectors or expression systems specially adapted for eukaryotic hosts for fungi for yeasts
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N5/00—Undifferentiated human, animal or plant cells, e.g. cell lines; Tissues; Cultivation or maintenance thereof; Culture media therefor
- C12N5/10—Cells modified by introduction of foreign genetic material
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
Definitions
- the present invention relates to an internal standard nucleic acid for quantifying eukaryotic microorganisms.
- microorganisms live in all environments, including natural environments such as soil and oceans, the intestines of animals, and human living spaces such as houses. In many cases, they are colonized in each environment with a unique configuration, and such collections of microorganisms are called microbiomes.
- rRNA ribosomal RNA
- 18S rRNA gene the 18S rRNA gene
- ITS Internal Transcribed Spacer
- 25-28S rRNA gene sequences for eukaryotes are used as indicators. Taxonomy-based metagenomic analysis methods are being actively used.
- the purpose of the present invention is to provide an internal standard nucleic acid optimized for the detection of eukaryotic microorganisms and/or prokaryotic microorganisms that make up microbiota and the accuracy control of their quantification.
- the present inventors have already developed an internal standard nucleic acid optimized for accuracy control of detection and quantification of prokaryotic microorganisms (Patent No. 6479336). Subsequently, the present inventors succeeded in producing an internal standard nucleic acid for precision control of detection and quantification of eukaryotic microorganisms, leading to the completion of the present invention.
- the present invention provides (1) a 5′ flanking sequence comprising a nucleic acid sequence derived from a eukaryotic rRNA-related gene, (2) an artificial nucleic acid sequence comprising a non-naturally occurring nucleic acid sequence, and (3) A nucleic acid comprising at least one partial nucleic acid sequence consisting of a 3' flanking sequence containing a nucleic acid sequence derived from a eukaryotic rRNA-related gene and/or a complementary sequence thereof, wherein the partial nucleic acid sequence is: Partial nucleic acid sequences (a) to (d) of: (a1) a 5′ flanking sequence containing at least 20 consecutive nucleotides in the nucleic acid sequence of SEQ ID NO: 1, (a2) an artificial nucleic acid sequence consisting of the nucleic acid sequence of any one of SEQ ID NOs: 8 to 19, and (a3) ) a partial nucleic acid sequence (a) consisting of a 3' flanking sequence comprising
- the partial nucleic acid sequence (a) includes (a1′) a 5′ flanking sequence containing the nucleic acid sequence of SEQ ID NO: 1, and (a2) an artificial nucleic acid sequence consisting of the nucleic acid sequence of any one of SEQ ID NOS: 8 to 19.
- said partial nucleic acid sequence (b) comprises (b1′) a 5′ flanking sequence comprising the nucleic acid sequence of SEQ ID NO:2, ( b2) an artificial nucleic acid sequence selected from the group consisting of SEQ ID NOs: 20 to 31, and (b3′) a 3′ flanking sequence comprising the nucleic acid sequence of SEQ ID NO: 3; ') a 5' flanking sequence comprising the nucleic acid sequence of SEQ ID NO: 3, (c2) an artificial nucleic acid sequence selected from the group consisting of SEQ ID NOS: 32-43, and (c3') a 3' comprising the nucleic acid sequence of SEQ ID NO: 4.
- said partial nucleic acid sequence (d) is selected from the group consisting of (d1′) a 5′ flanking sequence comprising the nucleic acid sequence of SEQ ID NO: 4, (d2) SEQ ID NOS: 44-55 and (d3') a 3' flanking sequence comprising the nucleic acid sequence of SEQ ID NO:5.
- the nucleic acid includes (e4) a 5' flanking sequence comprising a nucleic acid sequence derived from a prokaryotic rRNA gene, (e5) an artificial nucleic acid sequence comprising a non-naturally occurring nucleic acid sequence, and (e6) a nucleic acid sequence derived from a prokaryotic rRNA gene. It is preferred to further comprise an additional partial nucleic acid sequence (e) consisting of the 3' flanking sequence comprising and/or its complementary sequence.
- the additional partial nucleic acid sequence (e) includes (e4') a 5' flanking sequence comprising the nucleic acid sequence of SEQ ID NO: 6, (e5') the artificial nucleic acid sequence of SEQ ID NO: 56 or 57, and (e6') SEQ ID NO: It preferably consists of 3' flanking sequences comprising 7 nucleic acid sequences.
- the nucleic acid consists of a nucleic acid sequence selected from the group consisting of SEQ ID NOS: 58-69 and/or its complementary sequence.
- the present invention provides an expression vector containing the above nucleic acid.
- the present invention provides a transformed cell containing the expression vector.
- the present invention also provides, according to one embodiment, a nucleic acid sequence that is at least 90% identical to a nucleic acid sequence comprising at least 15 consecutive nucleotides in an artificial nucleic acid sequence selected from the group consisting of SEQ ID NOS:8-57. or provide a probe comprising its complementary sequence.
- the nucleic acid according to the present invention has a non-naturally occurring nucleic acid sequence and can be amplified in the same manner as eukaryotic rRNA-related genes using known universal primers for amplifying eukaryotic rRNA-related genes. can be done. Therefore, according to the nucleic acid according to the present invention, in the analysis of various microbiota samples containing eukaryotic microorganisms, it is possible to strictly control the accuracy of metagenomic analysis based on rRNA-related genes, which is generally employed at present. .
- FIG. 1 is a schematic diagram showing exemplary configurations of nucleic acids of the present invention.
- FIG. 2 is a plot evaluating the quantification of nucleic acids 1-12 as internal standards using the universal primer set for the ITS1 region.
- FIG. 3 is a plot evaluating the quantification of nucleic acids 1-12 as internal standards using a universal primer set for the 25-28S rRNA D1-D2 region.
- FIG. 4 is a plot evaluating the quantification of nucleic acids 1-12 as internal standards using a universal primer set for the 16S rRNA V4 region.
- FIG. 5 is a plot showing the correlation between the amount of soil added to the sample and the number of reads from nucleic acids 1-12.
- FIG. 1 is a schematic diagram showing exemplary configurations of nucleic acids of the present invention.
- FIG. 2 is a plot evaluating the quantification of nucleic acids 1-12 as internal standards using the universal primer set for the ITS1 region.
- FIG. 3 is a plot evaluating the
- FIG. 6 is a plot showing the correlation between the amount of soil added to the sample and the total amount of fungi estimated based on the number of reads from nucleic acids 1-12.
- FIG. 7 is a plot showing the copy number of the ITS1 region in mixed samples of fungal/bacterial DNA (actual values and estimated values based on measured values derived from internal standard nucleic acids 3-10).
- FIG. 8 is a plot showing the mixing ratio of fungal/bacterial DNA (actual values and estimated values based on measured values derived from internal standard nucleic acids 3 to 10).
- FIG. 9 is a plot showing the number of reads from nucleic acid 4 added at various copy numbers to DNA extracted from soil.
- FIG. 10 is a graph showing the abundance of microorganisms for each taxonomy estimated based on the number of reads derived from nucleic acid 4.
- FIG. 10 is a graph showing the abundance of microorganisms for each taxonomy estimated based on the number of reads derived from nucleic acid 4.
- the present invention provides (1) a 5' flanking sequence comprising a nucleic acid sequence derived from a eukaryotic rRNA-related gene, (2) an artificial nucleic acid sequence comprising a non-naturally occurring nucleic acid sequence, and (3) A nucleic acid comprising at least one partial nucleic acid sequence consisting of a 3′ flanking sequence containing a nucleic acid sequence derived from a eukaryotic rRNA-related gene and/or a complementary sequence thereof, wherein the partial nucleic acid sequence is: Partial nucleic acid sequences (a) to (d) of: (a1) a 5′ flanking sequence containing at least 20 consecutive nucleotides in the nucleic acid sequence of SEQ ID NO: 1, (a2) an artificial nucleic acid sequence consisting of the nucleic acid sequence of any one of SEQ ID NOs: 8 to 19, and (a3) ) a partial nucleic acid sequence (a) consisting of a 3' flanking sequence comprising at
- the term "eukaryotic rRNA-related gene” refers to genes encoding 18S, 5.8S, and 25-28S rRNA subunits that constitute eukaryotic ribosomes, and ITS (Internal Transcribed Spacer) region. There is an ITS1 region between the 18S rRNA gene and the 5.8S rRNA gene, and an ITS2 region between the 5.8S rRNA gene and the 25-28S rRNA gene, both of which are eukaryotic rRNAs in this embodiment. Included in related genes.
- the 5′ flanking sequences and 3′ flanking sequences in this embodiment are sequences containing at least 20 contiguous nucleotides among the following conserved sequences 1-5 that are highly conserved in eukaryotic rRNA-associated genes: (hereinafter collectively referred to as “sequences derived from conserved sequences”).
- conserveed sequences 1-5 are respectively upstream of the V9 region of the 18S rRNA gene, downstream of the V9 region of the 18S rRNA gene/upstream of the ITS1 region, 5.8S rRNA gene, downstream of the ITS2 region/D1- of the 25-28S rRNA gene Sequences upstream of the D2 region and downstream of the D1-D2 region of the 25-28S rRNA gene.
- a sequence comprising at least 20 contiguous nucleotides in the conserved sequence, used as the 5′ flanking sequence and the 3′ flanking sequence in this embodiment, is a known sequence for amplifying eukaryotic rRNA-related genes. Any position of the conserved sequence may be selected as long as it is recognized by a universal primer (see, for example, Stefanos Banos et al., 2018, BMC Microbiology, Vol. 18, Article number: 190). Sequences derived from conserved sequences used as 5′ flanking sequences and 3′ flanking sequences in this embodiment preferably contain at least 30 contiguous nucleotides in the conserved sequences, more preferably the entire length. .
- the partial nucleic acid sequences in this embodiment include a sequence derived from conserved sequence 1 and a sequence derived from conserved sequence 2, a sequence derived from conserved sequence 2 and a sequence derived from conserved sequence 3, a sequence derived from conserved sequence 3 and a sequence derived from conserved sequence 4. or a sequence derived from conserved sequence 4 and a sequence derived from conserved sequence 5 are combined as 5′ flanking sequences and 3′ flanking sequences, and an artificial nucleic acid consisting of a non-naturally occurring nucleic acid sequence between the combined sequences Contains arrays.
- the partial nucleic acid sequence in this embodiment is between the conserved sequence 1-derived sequence and the conserved sequence 2-derived sequence (i.e., 18S V9 region) in the sequence of the eukaryotic rRNA-related gene. and the sequence from conserved sequence 3 (i.e., the ITS1 region), between the sequence from conserved sequence 3 and the sequence from conserved sequence 4 (i.e., the ITS2 region), or the sequence from conserved sequence 4 and the conserved sequence 5-derived sequence (ie, the 25-28S D1-D2 region) is replaced by a non-naturally occurring nucleic acid sequence.
- a partial nucleic acid sequence (a) containing a sequence (a1) derived from conserved sequence 1 as a 5′ flanking sequence and a sequence (a3) derived from conserved sequence 2 as a 3′ flanking sequence is any of SEQ ID NOs: 8 to 19. ⁇ (a2) ⁇ :ATTGTCAGTCTAGCGAATCATTATACCGAAGAACATCCGTTTATGAGAACGTGCTACCAATTAACTGTACTAAGCTGTCC( ⁇ 8) ⁇ TTACTGATCGAACGTCGTATAATGCTGAGGCATCTGTTATTAACCGTACCTTTCAAGGATTACCATGTGGCAACATAAGT( ⁇ 9) ⁇ TTGGCCTTCAGTCGAGAACTTGTTGAAACTGTCCTGACGCACTGGAACGAGCTTCCATTGATTCGCTAGAAATGCCGACC( ⁇ 10) ⁇ CCTAGAAAGCTCGCCATTAGCCGCAGTAGTGATTGGACATCAGAGTTTCGCTCACAACGTCACCGCTCGTTATGAAACTT( ⁇ 11) ⁇ TCAGGAAGTGTGTCCCATTGCCGGA
- a partial nucleic acid sequence (b) containing a sequence (b1) derived from conserved sequence 2 as a 5′ flanking sequence and a sequence (b3) derived from conserved sequence 3 as a 3′ flanking sequence is any of SEQ ID NOS: 20 to 31. ⁇ (b2) ⁇ :TCATAAGCAGAGCCTTTATCCCATATAAGCTATTGTCACGAAGTGTCACTGTGAACGAATGTTCTCTAAACTTACTACGGCTTCAGATGTAACGGATTCAGACTACTCTATTCATAACGGACTACAGATTGCGTCAACTACGATATTCTCTCTTGAGATCACGATTAGCAAGTACCTTTGCAGCTTGAAATTAACCAGACCTTTCCTTGGAATGCCTATACAGAGATTTATCATACCAGGAGTTCTCCAGATTACCTAGATGTCTTAACGAGATACAGGACTTACACGATGACTTAGTGTGTTGTTTGCATCAACCTAACAGTAACTGAGCGAATTGTACCAACGTATTCTTTACCGGAAGT( ⁇ 20) ⁇ CATCCTTGGTC
- a partial nucleic acid sequence (c) containing a sequence (c1) derived from conserved sequence 3 as a 5′ flanking sequence and a sequence (c3) derived from conserved sequence 4 as a 3′ flanking sequence is any of SEQ ID NOS: 32 to 43. ⁇ (c2) ⁇ :AGTTGTCTGCCAGAAATCATTGAACATTCCGACGAATATCGACATGGTTGCTTATCTAAGACCTTAAACGGTACTTGGTTAGCTGATCGCAATACTTGAAAGACTTGATCCTGTACTTACCTGGACACGATGTAATAATCTCACACAGTTATGAGAAGCTGGTTGCACCTAAATAGTCAATTAGCACGTAGTAACGTAGACTTGCCACTGATGAAACATA( ⁇ 32) ⁇ CATTGAACACTTCGTAAGGTACACCTATGGATCAACGATTAAGTCTCGATACCGTAAGATGGTAACTCTAGTCAGTGATAATCAACAGCGTAGTACATTCGTAAGCAGTCTTGGACATTACTTTCTGAGTGCAACATTCAACGTCAA
- a partial nucleic acid sequence (d) containing a sequence (d1) derived from conserved sequence 4 as a 5′ flanking sequence and a sequence (d3) derived from conserved sequence 5 as a 3′ flanking sequence is any of SEQ ID NOS: 44 to 55. ⁇ (d2) ⁇ :GAACGATTGAAGATGTACTCAGATATTCATTGATGGGCCTACGTCTACTTACTATGGGAATGTAAATACTCTGTTCCAGCCTAAGGTTAGCTTTGCGAATACAAATGTTCTTATCGACGCACAGTCATACGGATTACGATCAAGTTAATGGTTACTCCCTACCGATTGCATCCAGATCATATTGAGAGGAATCACCTGTACGGTTTAGAAATCAGCTCTACTAGAAGACACTATTGCCATACGTCAAATTGCAGTGAGTTTCACCAAATCATGGAGATGTTACCCAGTTAGCATACAACTCTTTGCACAAGTGCATAATGTAGTCCCTATGTCACAAGGTTATACGAAGCATGTCAAATCATCGCCTTTAGTTACGATGT
- the nucleic acid of this embodiment comprises at least one of the partial nucleic acid sequences of (a), (b), (c) or (d) and/or its complementary sequence. That is, the nucleic acids of this embodiment may be either single-stranded or double-stranded.
- the nucleic acid in this embodiment may be DNA, RNA, modified nucleic acid, etc., and the nucleic acid in this embodiment can be prepared using one or more of these.
- nucleic acids in this embodiment may be, for example, single-stranded RNA, single-stranded DNA, double-stranded RNA/DNA hybrids, double-stranded DNA, and the like.
- nucleic acid sequence composed of DNA is shown, but it can be read as another nucleic acid sequence such as RNA as appropriate, and the nucleic acid in this embodiment also includes them.
- thymine (T) and uracil (U) can be appropriately substituted.
- the nucleic acid of this embodiment preferably comprises two or more different partial nucleic acid sequences selected from (a), (b), (c) and (d) and/or complementary sequences thereof, and (a ), (b), (c) and (d) all partial nucleic acid sequences and/or complementary sequences thereof.
- the arrangement order of the two or more partial nucleic acid sequences is not particularly limited.
- the nucleic acid of the present embodiment contains consecutive partial nucleic acid sequences (a) and (b), (b) and (c), or (c) and (d)
- the former 3' Part or all of the flanking sequence and the latter 5' flanking sequence may overlap, but preferably such overlapping sequences are not included twice in the nucleic acid.
- each conserved sequence-derived sequence is preferably unique in the nucleic acids of the present embodiments.
- the nucleic acid of this embodiment includes (e4) a 5' flanking sequence comprising a nucleic acid sequence derived from a prokaryotic rRNA gene, (e5) an artificial nucleic acid sequence comprising a non-naturally occurring nucleic acid sequence, and (e6) a prokaryotic rRNA gene-derived It may further comprise an additional partial nucleic acid sequence (e) consisting of a 3' flanking sequence comprising the nucleic acid sequence of.
- nucleic acid sequences derived from prokaryotic rRNA genes used as (e4) and (e6) in the nucleic acids of the present embodiment may be any sequence highly conserved in prokaryotic rRNA genes, but metagenome analysis of prokaryotic organisms may be used. It preferably contains a sequence recognized by the universal primer used in .
- (e4) in the nucleic acid of the present embodiment preferably contains at least 20 consecutive nucleotides in the upstream sequence of the V4 region of the 16S rRNA gene: CACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGGTGCAAGCGTTAATCGGAATTACTGGGCGTAAAGCGCACGCAGGCGGTT (SEQ ID NO: 6), and preferably contains the full length. is more preferred.
- (e6) in the nucleic acid of this embodiment preferably contains at least 20 consecutive nucleotides in the sequence downstream of the V4 region of the 16S rRNA gene: GTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGAT (SEQ ID NO: 7), more preferably the full length preferable.
- the artificial nucleic acid sequence (e5) in the nucleic acid of the present embodiment may be any sequence as long as it is a non-naturally occurring nucleic acid sequence and is different from the artificial nucleic acid sequences of SEQ ID NOs: 8 to 55, ⁇ ATAAGAGCTTTGAGCCCACCCGCATACTGATTTGACTGCCTTAACTTGGTGAAGCCCTCGGACGGAAACTTGACATCTCGTTCTATCTGAATGAGCGCGGCACAGCTTGAGTCTACTTGGAATTGCATTAGCACCGGCCTGCCTTACAACACTGTTGCGTATTGGACTAACTAGCGGCCT( ⁇ 56) ⁇ GTAGTTAGGCAACTCTAGGCGGCAACTGCTCATCAACTAGGAGTACAGTCAATCTGACGGACGCGCTACTGCATACTTAGTCATCTACTGGTTCCAGAGCCACGGGTCATCGTAAATTGGGTATTCCGAAATGGCCCACACGCCGTTCACGTTTCAAATGATTGGCATCTAGGGACACCT( ⁇ 57) ⁇
- nucleic acids of the present embodiment include the nucleic acid sequences of SEQ ID NOs: 58-69.
- the nucleic acid sequences of SEQ ID NOs:58, 59 and 62-69 include all partial nucleic acid sequences (a)-(d).
- the nucleic acid sequences of SEQ ID NOS: 60 and 61 include all partial nucleic acid sequences (a)-(d) and further include an additional partial nucleic acid sequence (e).
- Nucleic acid sequences comprising partial nucleic acid sequences (a) to (d) are eukaryotic rRNA-related gene sequences in which the 18S V9 region, ITS1 region, ITS2 region and 25-28S D1-D2 regions are replaced with non-naturally occurring nucleic acid sequences and the nucleic acid sequence comprising the partial nucleic acid sequence (e) may be a prokaryotic rRNA gene sequence in which the 16S V4 region has been replaced by a non-naturally occurring nucleic acid sequence.
- each of the partial nucleic acid sequences (a) to (e) is preferably contained in the nucleic acid molecule at a ratio of 1:1.
- the nucleic acid of the present embodiment can be incorporated into an expression vector and introduced into cells.
- the nucleic acid of this embodiment can be easily prepared by any conventionally known nucleic acid synthesis method.
- the nucleic acid of this embodiment may be used by adding it to the sample to be analyzed at an appropriate timing.
- the nucleic acid of the present embodiment can be added to a microbiota sample before nucleic acid extraction, and in this case, it is possible to control the accuracy of the entire analysis from nucleic acid extraction to amplification.
- the nucleic acid of the present embodiment can be added to a nucleic acid solution extracted from a microbiota sample, and in this case, accuracy control of only the nucleic acid amplification reaction becomes possible.
- microbiota means a collection of multiple microorganisms that exist in a specific environment.
- Microbiota can be composed of, for example, at least 100, 300, 500, 700, 1,000, or more microorganisms.
- the microorganisms that make up the microflora may be of any class of prokaryotic and/or eukaryotic microorganisms and may include known as well as unknown microorganisms.
- Eukaryotic microorganisms means any unicellular or multicellular eukaryotic organisms of a size that cannot be distinguished by the naked eye, e.g., fungi such as yeast, mushrooms, molds; protozoa such as paramecium, amoeba, and the like, but are not limited thereto.
- the present invention is an expression vector comprising the above nucleic acid.
- Expression vectors that can be used in this embodiment are not particularly limited, and may be, for example, pUC19 plasmid vector, pT7Blue plasmid vector, pGEM plasmid vector, and the like.
- the expression vector of this embodiment can be added to a sample to be analyzed in the same manner as the nucleic acid of the first embodiment.
- the expression vector of this embodiment can be used by introducing it into a microbial cell.
- a third embodiment of the present invention is a transformed cell containing the expression vector.
- Cells that can be used in this embodiment may be any microbial cells, such as E. coli DH5 ⁇ , E. coli HB101, and E. coli JM109 (Nippon Gene Co., Ltd.).
- Introduction of an expression vector into a cell can be performed by a method well known in the art, depending on the type of cell, such as chemical transformation or electroporation.
- the transformed cells of this embodiment can be added to a microbiota sample before nucleic acid extraction, which enables precision control of the entire analysis from nucleic acid extraction to amplification.
- the present invention provides a nucleic acid sequence that is at least 90% identical to a nucleic acid sequence comprising at least 15 consecutive nucleotides in an artificial nucleic acid sequence selected from the group consisting of SEQ ID NOS: 8-57. or a probe containing its complementary sequence.
- the probe of this embodiment may be an oligonucleotide that specifically hybridizes with the amplification product containing the artificial nucleic acid sequence. Therefore, the probe of this embodiment is a nucleic acid sequence comprising at least 15, preferably 20 or more consecutive nucleotides selected from any position in the artificial nucleic acid sequence, and at least 90%, preferably 95% It includes the same nucleic acid sequence as above or its complementary sequence.
- the probes of this embodiment are preferably labeled with a labeling substance (for example, a fluorescent dye such as FITC or Cy5) in order to detect the corresponding amplification product.
- a labeling substance for example, a fluorescent dye such as FITC or Cy5
- the probe of the present embodiment can be easily prepared by any conventionally known nucleic acid synthesis method, and can be labeled by a conventionally known method, if necessary.
- the accuracy control of the analysis of the microbiota sample can be improved. It becomes possible.
- nucleic acid sequences of SEQ ID NOS: 58-66 shown below were designed: In eukaryotic rRNA-related genes, the 18S V9 region, ITS1 region, ITS2 region, and 25-28S D1-D2 region were replaced by non-naturally occurring artificial nucleic acid sequences.
- nucleic acids 1, 2, 5-12 SEQ ID NOs: 58, 59 and 62-69
- a nucleic acid sequence in which a partial sequence of the prokaryotic 16S rRNA gene in which the 16S V4 region is replaced by an artificial nucleic acid sequence is added to the sequence in which the D2 region is replaced by an artificial nucleic acid sequence that does not exist in nature (nucleic acids 3 and 4 (SEQ ID NOs: 60 and 61 )); and partial sequences of the prokaryotic 16S rRNA gene in which the 16S V4 region was replaced by an artificial nucleic acid sequence (nucleic acids 13-17 (SEQ ID NOs:70-74)).
- uppercase letters indicate conserved sequences derived from rRNA-related genes
- lowercase letters indicate artificial nucleic acid sequences that do not exist in nature.
- Nucleic acid 1 (SEQ ID NO: 58) TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAattgtctagcgaatcattataccgaagaacatccgtttatgagaacgtgctaccaattaactgtactaagctgtccAAACTTGGTCATTTAGAGGAACTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCATTAtcataagcagagcctttatcccatataagctattgtcacgaagtgtcactgtgaacgaatgttctctaaacttactacggcttcagatgtaacggattcagactactctattcataacggactacagattgtgtaacggattcagactactctattcataacggactacagattgcgtcaactacgatattctctt
- Nucleic acid 2 (SEQ ID NO: 59) TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAttactgatcgaacgtcgtataatgctgaggcatctgttattaaccgtacctttcaaggattaccatgtggcaacataagtAAACTTGGTCATTTAGAGGAAGTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCAcatcctttggtctaagaaagtgcatgatttgagcataccaatcgccattacgataaagatcctttgagtctaacgtacactgtgtcatctgtaagataccattgtcactacttcagtcagaACTTTCAACAACGGATCTCTTGGCTTCCACATCGATGAAGAACGCAGCGAAATGCGATAAGTAATGTGAATTGCAGAATTCAGTGAATCATCGAATCTTTGAACGCA
- Nucleic acid 3 (SEQ ID NO: 60) TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAttggccttcagtcgagaacttgttgaaactgtcctgacgcactggaacgagcttccattgattcgctagaaatgccgaccAAACTTGGTCATTTAGAGGAACTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCATTAcacagtgtggatctgacgaattaccaaggcactccatgtgtgccatctacgtctcaggaattgtacctgctaccactaggcatcgagaacgctgcatgtattcaccgagtaaggtcttccagactccgataccgtatgtgttcccaggagaaatgtcgcttagccggttcaagccatcatgtcatgtg
- Nucleic acid 4 (SEQ ID NO: 61) TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAcctagaaagctcgccattagccgcagtagtgattggacatcagagtttcgctcacaacgtcaccgctcgttatggaacttAAACTTGGTCATTTAGAGGAACTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCATTAaagcgttggttcgttacgcaaggctctacgaaagcagtgtctacttagcgttcagtgcagcgatccacaatctcatgggtatgtcatcgaccagctacgacgcaagtttcccagatcaagattaggtgcccttcaagcacggttggaactctaccgagctacgacgcaagtttccca
- Nucleic acid 5 (SEQ ID NO: 62) TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAtcaggaagtgtgtcccattgccggaggagtcctattgaatcacggattacgtctgtaacgctggaccgaggttgtatcatAAACTTGGTCATTTAGAGGAACTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCATTAgcttcgattacgatgcccaaatacgatccgcgtagtttccacgaggtctacagtaccctattgttcgaggcagtaacctgaaccgcgtctgtcaacagttatgtgacggcaaacctgaaccgcgtctgtcaacagttatgtgacggcaagttgtccaagtccgagccat
- Nucleic acid 6 (SEQ ID NO: 63) TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAtcccgcaaatacctttggagtgcgtcactatctaggagtgtgccgatgactcgtaatctccatcctcgaagttgcacgatAAACTTGGTCATTTAGAGGAACTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCATTAataatccagggtccacgagtgaatgccctgcaaatgtaccaagttcctgaccttctggcatgtgaagccgatcttatcgctgaagagtctcgaagtcgctgacatacacccgtattgtcgatctgtggcgtaacggacacccttggcgtgtaacggacaccct
- Nucleic acid 7 (SEQ ID NO: 64) TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAgacaccctgttcagattagcgagcctcagttacaccagattccgagttcgtaagatcgagaggagccatcatggacgtttAAACTTGGTCATTTAGAGGAACTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCATTActgacggaccaatctgtatgtaaagcggctattcaggagcctatccgacgagttgatgcttacaaggcgatctatccctgaccagtgctaaccatgtgcataagagcagtctcactcacgagtctcggttccttagacgattcaatgccaagttgtgccggagaacacctgtgtgc
- Nucleic acid 8 (SEQ ID NO: 65) TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAcatgactggaaaccctctgacgtgtaactctggaagctcagttatcggaaacggcgctaagctacgtgatcgtaagcagtAAACTTGGTCATTTAGAGGAACTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCATTActctgatggacctggtgatacacggtactattttggcatggtcacatcgggcatctgtaagacctccagtttgtagtgtgtgcagagttcccagacagtctaagacggcattgactatggccttgtggttcgagaaccgaacatccaagagtttcgcgttcatggcgataacccttcaaacctgtgtggta
- Nucleic acid 9 (SEQ ID NO: 66) TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAgcacctagcctttaacgagaagaatgtagccctacgccatcggcatgtgattccatacgatgttacgaaacctgaggcagAAACTTGGTCATTTAGAGGAAGTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCActtctgaaactatgacgcgccaaccggaatcgtgtaatggattgacctacttgctcggacgacggataacgctgtatgcaaatgtgcctctgtaactcggctctgcgaactgctctgatctaACTTTCAACAACGGATCTCTTGGCTTCCACATCGATGAAGAACGCAGCGAAATGCGATAAGTAATGTGAATTGCAGAATTCAGTGA
- Nucleic acid 10 (SEQ ID NO: 67) TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAtgcggagcatcctagtacaatatccggttgcctataagcccggtatgcgcgaattaacctaactgccagagatgagttccAAACTTGGTCATTTAGAGGAAGTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCAtaggtcacgctagtaccaaggagactcagaccttacagcttgcttgcagacagatcggaatcccacagcagagtttagacgtttggagacagtcccacttcagtcgttggatgcacttagACTTTCAACAACGGATCTCTTGGCTTCCACATCGATGAAGAACGCAGCGAAATGCGATAAGTAATGTGAATTGCAGAATTCAGTGAATCATCGAATCTTTGAACGCAACT
- Nucleic acid 11 (SEQ ID NO: 68) TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAacggcactgatgttcacccgccgtcgatcatacacgcagggcgatgactctatgcgaggctccgaccagtaacaggcgctAAACTTGGTCATTTAGAGGAACTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCATTAcctggcgaatgtctaaggcgtccatatccgaggtgcagcgcgttgcctgaccattaggcccgtatagttcggcgtgaccgagatgccgctcagtacgacggtctaacaagctggcccgcacttgccaacctgtcgcgcttaacaagctggctcgcacttgccaacctgtcgc
- Nucleic acid 12 (SEQ ID NO: 69) TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAcgtacctgtcagcacgctgttgaccttagcccgtggcaacgactgtgaagcctccgacacgtactgagggcgattcccagAAACTTGGTCATTTAGAGGAAGTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCAccatactgcgaatgggagccgcggaggtaagtccttccctgatgaccttgcgcgtagggccgggtaagagcttctccactgactgtcaaccgtgggcacgccgaggatgctactcatgACTTTCAACAACGGATCTCTTGGCTTCCACATCGATGAAGAACGCAGCGAAATGCGATAAGTAATGTGAATTGCAGAATTCAGTGA
- Nucleic acid 13 (SEQ ID NO: 70) AACTTTCAACAACGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAATTGCAGAATTCCGTGAATCATCGAATCTTTGAACGCACATTGCCCCTTGGTATTCCAGGGGGCATGCCTGTTTGAGCGTCATTTgtcgggcgactgctctcatgaccagcgtgggcgtccatggctgagcctcgtggctggcgagctctggcgggagggctggtcgagctgctgccacgctctcggctcgatcaccgtgtgacgtcggcgactccaccacggcacggcgacggtgtcacgctcctgggGTTTGACCTCAAATCAGGTAGGAGTACCCGCTGAACTTAAGCATATCA
- Nucleic acid 14 (SEQ ID NO: 71) AACTTTCAACAACGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAATTGCAGAATTCCGTGAATCATCGAATCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATGCCTGTTTGAGCGTCATTTcccaggagcgcgtgacaccgtcgccgtggtggagtcgccgacgtcacacggtgatcgagccgagagcgtggcagcatttatattgcaatataaatgctgccacgctctcggctcgatcaccgtgtgacgtcggcgactccaccacggcacggcgacggtgtcacgctctgggGTTTGACCTCAAATCAGGTAGGAGTACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAA
- Nucleic acid 15 (SEQ ID NO: 72) AACTTTCAACAACGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAATTGCAGAATTCCGTGAATCATCGAATCTTTGAACGCACATTGCCCCTTGGTATTCCAGGGGGCATGCCTGTTTGAGCGTCATTTtaaggcccatgttgtaggtcgaattgctagcaattcgacctacaacatgggcctttaatgctgtgcgcaccaagaggatcaaccagtgtcggatgcatcccgacactggttgatcctcttggtgcgcacagcatttacccagaagtgtattcctctcgaggaatacacttcctcttggtgcgcacagcatttacccagaagtgtattcctcgaggaatacacttctctctc
- Nucleic acid 16 (SEQ ID NO:73) AACTTTCAACAACGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAATTGCAGAATTCCGTGAATCATCGAATCTTTGAACGCACATTGCCCCTTGGTATTCCAGGGGGCATGCCTGTTTGAGCGTCATTTgtggtggagtcgccgacgtcacggtgatcgagccgagagcgtggcagcatttatattgcaatataaatgctgccacgctctcggctcgatcaccgtgtgacgtcggcgactccaccaccaccaccaccacggcacggcggcgacggtgtcacgcgctcctgggtaccgcggctagttcggcgtggctggcacGTTTGACCTCAAATCAGGTAGGAGTACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAAACC
- Nucleic acid 17 (SEQ ID NO: 74) AACTTTCAACAACGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAATTGCAGAATTCCGTGAATCATCGAATCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATGCCTGTTTGAGCGTCATTTggggcggttaaggaaagtcaaactcccgggctgtgaaggcccagtaggttgcgtagctaagacagcacctcataggcatgctgtgcgcaccaagaggatcatgcctatgaggtgctgtcttagctacgcaacctactgggcctaccaagagacgttacccgttaccgcggcggctggcacGTTTGACCTCAAATCAGGTAGGAGTACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAAACCAAC
- GenScript Japan Co., Ltd. was commissioned to synthesize the nucleic acids 1 to 17 above.
- nucleic acids 1 to 13 could be synthesized, but nucleic acids 14 to 17 could not be synthesized within the time required for synthesizing nucleic acid 13. This indicates that randomly designed non-naturally occurring artificial sequences can contain sequences that are difficult to synthesize.
- PCR was performed using the following universal primers.
- Universal primers utilized the eukaryotic 18S rRNA V9 region, the eukaryotic ITS1 region, the eukaryotic ITS2 region, the eukaryotic 25-28S rRNA D1-D2 region, or the prokaryotic 16S rRNA V4 region.
- PCR reaction solution composition 1x KAPA HiFi and 500 nM primers.
- PCR reaction conditions ITS1/ITS2, 95°C for 3 minutes; then 95°C for 30 seconds, 52°C for 30 seconds, and 72°C for 30 seconds 25 times; 72°C for 5 minutes.
- 95°C for 3 minutes for 25-28S rRNA D1D2 and 18S rRNA V9; then 95°C for 30 seconds, 57°C for 30 seconds, 72°C for 30 seconds 25 times; 72°C for 5 minutes.
- 16S rRNA V4 95°C for 3 minutes; then 95°C for 30 seconds, 50°C for 30 seconds, 72°C for 30 seconds 25 times; 72°C for 5 minutes.
- nucleic acid 1 to 12 was amplified with appropriate efficiency using universal primers.
- nucleic acid 13 was amplified only with extremely low efficiency, and was confirmed to be unsuitable as a standard nucleic acid.
- a plasmid was prepared in which nucleic acids 1 to 12 were integrated into the pUC19 vector.
- the plasmid was linearized by cutting with BsaI or BpmI and purified by AMpure XP (Agencourt). Concentrations were determined using the Qubit Assay Kit (Thermo Fisher Scientific) and the copy number of the nucleic acid was calculated. Concentrations were adjusted to prepare mixed solutions of plasmids containing nucleic acids 1-12 (10-10 6 copies of each nucleic acid).
- a sample was prepared by adding DNA (1 ng) extracted from soil using FastDNA Spin Kit for Soil (MP Biomedicals) to the mixed solution. PCR was performed using a universal primer set for the biological 25-28S rRNA D1-D2 regions or a universal primer set for the prokaryotic 16S rRNA V4 region to obtain an amplicon library.
- Composition of PCR reaction solution 1x KAPA HiFi and 500 nM primers.
- PCR reaction conditions ITS1/ITS2, 95°C for 3 minutes; then 95°C for 30 seconds, 52°C for 30 seconds, and 72°C for 30 seconds 25 times; 72°C for 5 minutes.
- 95°C for 3 minutes for 25-28S rRNA D1D2 and 18S rRNA V9; then 95°C for 30 seconds, 57°C for 30 seconds, 72°C for 30 seconds 25 times; 72°C for 5 minutes.
- 16S rRNA V4 95°C for 3 minutes; then 95°C for 30 seconds, 50°C for 30 seconds, 72°C for 30 seconds 25 times; 72°C for 5 minutes.
- Amplicons were sequenced using MiSeq (Illumina). The results were evaluated by a DADA2-based analytical pipeline and quantitative results were calculated.
- Figure 2 shows the results of the universal primer set for the ITS1 region
- Figure 3 shows the results of the universal primer set for the 25-28S rRNA D1-D2 region
- Figure 3 shows the results of the universal primer set for the 16S rRNA V4 region.
- the horizontal axis indicates the added amount of nucleic acids 1 to 12
- the vertical axis indicates the ratio of the number of reads derived from nucleic acids 1 to 12 to the number of reads of the target sequence derived from DNA extracted from soil.
- the results are shown in Figure 5.
- the horizontal axis indicates the amount of soil added to the sample, and the vertical axis indicates the number of reads derived from nucleic acids 1 to 12 when the total number of reads in each sample is unified.
- the theoretically expected number of internal standard gene reads decreased as the amount of soil increased.
- the total amount of fungi estimated based on the number of reads derived from nucleic acids 1-12 is shown in FIG. Correlation between soil amount and fungi was confirmed.
- Nucleic acids 3-10 (5 ⁇ 10 4 copies each) were added to the resulting dilutions using a universal primer set for the prokaryotic 16S rRNA V4 region and a universal primer set for the eukaryotic ITS1 region. PCR was performed under the same conditions as in 1 above to obtain an amplicon library for each sample. Amplicons were sequenced using MiSeq (Illumina) and results were analyzed by the DADA2 pipeline.
- FIG. 7 the horizontal axis indicates the estimated copy number of the ITS1 region in the unit artificial sequence
- the vertical axis indicates the measured copy number of the ITS1 region
- the horizontal axis indicates the estimated mixture ratio of fungi/bacteria.
- the vertical axis indicates the measured fungal/bacterial mixing ratio.
- Sc5001 indicates nucleic acid 3 (SEQ ID NO: 60)
- Sc5002 indicates nucleic acid 4 (SEQ ID NO: 61).
- a sample was prepared by adding nucleic acid 4 (8.3-8.3 ⁇ 10 3 copies) to DNA (1 ng) extracted from soil, and a universal primer set for the prokaryotic 16S rRNA V4 region and a eukaryotic primer set were prepared.
- a universal primer set for the biological ITS1 region PCR was performed under the same conditions as in 1 above to obtain an amplicon library for each sample. Amplicons were sequenced using MiSeq (Illumina) and results were analyzed by the DADA2 pipeline.
- FIG. 9 shows the number of reads derived from nucleic acid 4 when the total number of reads is unified with respect to the amount of nucleic acid 4 added.
- a high correlation was observed between the amount of nucleic acid 4 added and the read count for both the universal primer set for the prokaryotic 16S rRNA V4 region and the universal primer set for the eukaryotic ITS1 region.
- FIG. 10 shows the abundance (absolute amount) of microorganisms for each taxonomy (phylum) estimated based on the number of reads derived from nucleic acid 4.
- the absolute abundance of fungi/bacteria in a sample can be estimated by using nucleic acid 4 as an internal standard nucleic acid.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Mycology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Analytical Chemistry (AREA)
- Immunology (AREA)
- Cell Biology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Description
(a1)配列番号1の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む5’フランキング配列、(a2)配列番号8~19のいずれかの核酸配列からなる人工核酸配列、および(a3)配列番号2の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む3’フランキング配列からなる部分核酸配列(a);
(b1)配列番号2の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む5’フランキング配列、(b2)配列番号20~31からなる群から選択される人工核酸配列、および(b3)配列番号3の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む3’フランキング配列からなる部分核酸配列(b);
(c1)配列番号3の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む5’フランキング配列、(c2)配列番号32~43からなる群から選択される人工核酸配列、および(c3)配列番号4の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む3’フランキング配列からなる部分核酸配列(c);ならびに
(d1)配列番号4の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む5’フランキング配列、(d2)配列番号44~55からなる群から選択される人工核酸配列、および(d3)配列番号5の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む3’フランキング配列からなる部分核酸配列(d)
からなる群から選択される、核酸を提供するものである。
(a1)配列番号1の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む5’フランキング配列、(a2)配列番号8~19のいずれかの核酸配列からなる人工核酸配列、および(a3)配列番号2の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む3’フランキング配列からなる部分核酸配列(a);
(b1)配列番号2の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む5’フランキング配列、(b2)配列番号20~31からなる群から選択される人工核酸配列、および(b3)配列番号3の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む3’フランキング配列からなる部分核酸配列(b);
(c1)配列番号3の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む5’フランキング配列、(c2)配列番号32~43からなる群から選択される人工核酸配列、および(c3)配列番号4の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む3’フランキング配列からなる部分核酸配列(c);ならびに
(d1)配列番号4の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む5’フランキング配列、(d2)配列番号44~55からなる群から選択される人工核酸配列、および(d3)配列番号5の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む3’フランキング配列からなる部分核酸配列(d)
からなる群から選択される、核酸である。
TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTA
AAACTTGGTCATTTAGAGGAASTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCA
ACTTTCAACAACGGATCTCTTGGYTYYCRCATCGATGAAGAACGCAGCGAAATGCGATAMGTAATGTGAATTGCAGAATTCMGTGAATCATCGAATCTTTGAACGCAMMTTGCGCCCYTTGGTATTCCGAAGGGCATGCCTGTTTGRG
ACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAAACYAAC
CCCGTCTTGAAACACGGACCAAGGAGTCTAAC
AGGTCCTCAGAGGCTAATGTTTCATGCAATGAGATCCCGCGTGGACACCACCAAGATTCTACTGTTGTCAAGATACGGGCGACTCGACATGGAGCTACTATTCTATCAGAAGAGCCCTGCCAGGCGTTCAATCGCATTTCCATTTAATGGCTGACTCGCGCAGACGAAGTCTCCTAGAGTTAAGTCTTACGAGCACCGCTTGTGTGAGCACGATCATACGATACTGACTAAGGCGTCACCGAGTTTCAGACCCTACGACATGACTGTCTTTAGGCCAGAGTCTACTAGACCGAGCTTTGGATGCCAACCTTTCCGAAGTGAGATTTACCCACAGCGTTCGTGTGTTCGACTAACCCGCAAAGTGTTACCATAGGCTGGTCCTATTTCGCAGTGGCTAGAGAGCAATGTTCCAGGATGTGCTACTACTTGCCGTGAGCTAGACATACCGATGGCTAAGTGGATACGTTACAGGCGCACGTAGTTCTAACCGGCTTATACGGATAACCTGACCCGAGCGTTATTCTTATGCCGCAGAGAGGTTTCTTACCCGAAGGCACTAG(配列番号52)、GTCACATGCAAGCTGTTTCCTTCTACATGACGAGCCTCTGCGATAGGTGAGTATCCCACTCATTGATAGCTGCCGCAAGTCAGGAGAATACGTCCGTTAGTAAACTGTCCCATGCCGAAGCTCAAGACCTGGAAGTCCTTGATAACTGGCACACTCTGAGCCAACTGAACGTGTACGCATTACAACTCCGGTGTTAGCCTGCTTAGCTGAACCAGCAGTAATTGTTAGGCGTCCCAACGATCCATGATCCGCGTGAAGAAATCTTTAGCGCCCATAGGCAGTAAGGTAGCCCGACATAGTGTCTATTAGGCCCGAAATCCCTTAGGGAGCCCAATACATGATCTTAGCCGAGTCGTAGGAACGTCCATCTCGAAAGTCGTTTGCTAGGGCAATCCAAGTCTCGATCCCGATAAGTTCTGGCTAGGTTGACAAAGCGTCCAGATCCGACGAGTAAATGGTCCCTGTTAATCCGATAGTCGCGCACCACGGTGAATATAGTCCGATGACATTGACCTGTACCAGACCGCGTCTCAAATTGACGAAAGCGATGTTCGTAACCG(配列番号53)、GGTGGAAAGCTCGTCTCCCAATGCCATTAGCCTCGGCGGAGCGATAGCAGCTCCTCTGGAAGCATCAGTGCGTCTGCCCAAGGCGTTCCTCGTCGGTACAACGTAGACTGCCGCTACGGACGGTGTCACCAGGGATACACTCCATAGCATCCGGGTCGCAAGGTGTGCGTGCCAACTACCCGACTTCTAACAGGGCTGGCCGATACTGCGGGCTCAAGTGACTCAGATCCTGAAGGGCGCACCACGTCGCGGACTACAGTGTTCACATGAAGCGCGGTCGTGCAGCGCATGGTCCATACCAACTGCCTAGTACGCGGGACTGGCGTCGAATCGACTCGTCCTTCGGAAACATGACGGCGCGGCCTAAGCGAGAACTCTGCTCGTGTCCATCAACGGCTGGCGGCGATATGTCCTGACCTCAGCCATAGTGCCTACCTCGGGAGCGTTCAAGCGATCCTCGGTCTTAACGGGCGAACTCGGGCTCGAAAGCGAATGCCTCCCTAAGCTCTTCGGTGGCGGACGCGGAATCATAGCTCAGCGAACTCTCACGGTTGCAGGCG(配列番号54)、およびGTCGTGACACGCTTCGACGATTGAGTCGCCGCCTACGACTGACGATCTTCCGCCTGTAGCTGGATGTGCCCGATCCGTGAGGACATTCCCACCTGGACTGACTCGCATGGAGACTGCCACGGTGATTCGCAACAGCCCGTAGAGGCTTCGTTCGACCACCCGATGCTGAAAGCTGCTGCGCTGATCTGAGACCTCGGAGGGCGTAAACTGGACACCTGCCACTCGGACTGTGTTCGCACGTCGGCTTCATAGCCACTGGCAACCGCGCTTGTGTGCAGACGGAACCCTTTAGTGCCTGGCGATGACCCTACTCCCGGTGAACGGCAATGCAATGGGCCTGGAACTGTGACGCTCCCGTACCTTCCCTTGAGAGGACCTGGCATCTGGACGCAACTCCTGGGTGTGACCTGTGAGCAACGCCTCCTACTGGGTATAGCCCGCGCTTAGACGCTGCTAGAGCCGGAGACATACGATCCCTGCGCTTACACGCACGCGATAGGTGCGCTCGATAATCTCGGCCCGGTAGTGCAACCTGACCAGCGGTAGACCTTGATGACGGC(配列番号55)。
以下に示す配列番号58~66の核酸配列を設計した:真核生物rRNA関連遺伝子において、18S V9領域、ITS1領域、ITS2領域、および25-28S D1-D2領域を天然に存在しない人工核酸配列により置換した核酸配列(核酸1、2、5~12(配列番号58、59および62~69));真核生物rRNA関連遺伝子において、18S V9領域、ITS1領域、ITS2領域、および25-28S D1-D2領域を天然に存在しない人工核酸配列により置換した配列に、16S V4領域を人工核酸配列により置換した原核生物16S rRNA遺伝子の部分配列を付加した核酸配列(核酸3および4(配列番号60および61));ならびに16S V4領域を人工核酸配列により置換した原核生物16S rRNA遺伝子の部分配列(核酸13~17(配列番号70~74))。配列中、大文字はrRNA関連遺伝子由来の保存配列を示し、小文字は天然に存在しない人工核酸配列を示す。
TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAattgtcagtctagcgaatcattataccgaagaacatccgtttatgagaacgtgctaccaattaactgtactaagctgtccAAACTTGGTCATTTAGAGGAACTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCATTAtcataagcagagcctttatcccatataagctattgtcacgaagtgtcactgtgaacgaatgttctctaaacttactacggcttcagatgtaacggattcagactactctattcataacggactacagattgcgtcaactacgatattctcttgagatcacgattagcaagtacctttgcagcttgaaattaaccagacctttccttggaatgcctatacagagatttatcataccaggagttctccagattacctagatgtcttaacgagatacaggacttacacgatgacttagtgtgttgtttgcatcaacctaacagtaactgagcgaattgtaccaacgtattctttaccggaagtAAACTTTCAACAACGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAATTGCAGAATTCCGTGAATCATCGAATCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATGCCTGTTTGAGCGTCATTTagttgtctgccagaaatcattgaacattccgacgaatatcgacatggttgcttatctaagaccttaaacggtacttggttagctgatcgcaatacttgaaagacttgatcctgtacttacctggacacgatgtaataatctcacacagttatgagaagctggttgcacctaaatagtcaattagcacgtagtaacgtagacttgccactgatgaaacataGTTTGACCTCAAATCAGGTAGGAGTACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAAACCAACgaacgattgaagatgtactcagatattcattgatgggcctacgtctacttactatgggaatgtaaatactctgttccagcctaaggttagctttgcgaatacaaatgttcttatcgacgcacagtcatacggattacgatcaagttaatggttactccctaccgattattgcatccagatcatattgagaggaatcacctgtacggtttagaaatcagctctactagaagacactattgccatacgtcaaattgcagtgagtttcaccaaatcatggagatgttacccagttagcatacaactctttgcacaagtgcataatgtagtccctatgtcacaaggttatacgaagcatgtcaaatcatcgcctttagttacgatgtagttccacaagcgaaattagtttccgaaatggtcaagcatccaagtttagctcgaatctttaaggagatactcgaagtgcctatattacggaggtattatcatgtagcaagcgttacctagcttattagtccacgaatcatgtgttagaagtcgtcaagttcatgttatcctaccagCCGCCCGTCTTGAAACACGGACCAAGGAGTCTAAC
TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAttactgatcgaacgtcgtataatgctgaggcatctgttattaaccgtacctttcaaggattaccatgtggcaacataagtAAACTTGGTCATTTAGAGGAAGTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCAcatccttggtctaagaaagtgcatgatttgagcataccaatcgccattacgataaagatcctttgagtctaacgtacactgtgtcatctgtaagataccattgtcactacttcagtcagaACTTTCAACAACGGATCTCTTGGCTTCCACATCGATGAAGAACGCAGCGAAATGCGATAAGTAATGTGAATTGCAGAATTCAGTGAATCATCGAATCTTTGAACGCAACTTGCGCCCTTTGGTATTCCGAAGGGCATGCCTGTTTGAGAGcattgaacacttcgtaaggtacacctatggatcaacgattaagtctcgataccgtaagatggtaactctagtcagtgataatcaacagcgtagtacattcgtaagcagtcttggacattactttctgagtgcaacattcaacgtctaaacgggttaaatctctcataacggaacttgtgtgcaacagatgctatatggtatgcaaatgcgatacactttgACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAAACTAACgtaaagctattaaccggagtgaatccttcattaaagtcgcacaagctgtattaccgttacgcaacgtatttgattgaccatgtgaacagaagtaccctattgacctagattatgcagcaatgcctaagactatttgcctaattcgggctatttagaccaatcctccatgatgtatatcagtcaaggctagtttggaacatacacgaaagtccttatgtagtagagtgcaattctcgtatccttcaacagtgttatcgagtatcgaacgattatcctatgggtatccacttatagaacgtgtgtagactaacctgtaaacgatgtctctgaaagcaagactacttatctgagatcggatgtttaagacgctatgacaccattaacttatgccagtgctagtcattatgaccacgatttggaatttatggctatcgccactatgaaatgctaagctacctgaacaatttgtacgcagtgacagtagatcctttgatccagaacttattaagagctgaccctatgaaacgtgatgtcctattcattattacgggaaaccgtagCGACCCGTCTTGAAACACGGACCAAGGAGTCTAAC
TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAttggccttcagtcgagaacttgttgaaactgtcctgacgcactggaacgagcttccattgattcgctagaaatgccgaccAAACTTGGTCATTTAGAGGAACTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCATTAcacagtgtggatctgacgaattaccaaggcactccatgtgtgccatctacgtctcaggaattgtacctgctaccactaggcatcgagaacgctgcatgtattcaccgagtaaggtcttccagactccgataccgtatgtgttcccaggagaaatgtcgcttagccggttcaagccatcatgtgctagactagacacgtctatcgcggtttacacgaccatcagttgagccaatgctatccttgcgggtcaaacagagcttacggatcacccatagttgtcacgccacgttaaagttccgagcgaaacgctatctcttcgagagctgtcccaatgaaactctgcacggacttgtattgcacAAACTTTCAACAACGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAATTGCAGAATTCCGTGAATCATCGAATCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATGCCTGTTTGAGCGTCATTTactatgaggcccacagttacgaacgactagaccactgtcttacgagtgtcgcaccataagatggcgagtaatccgctcaatccactggttcctgagaaagagccggaaatctgaggtcattctgcccatgatagctggaaacacccgagtctctaagtgtgagtagcctgatctactgcaaacgcccgatacatatcgtgagagtctgctaggactgatcGTTTGACCTCAAATCAGGTAGGAGTACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAAACCAACtcaggctatattgaggcaccgcctggctagtagattacgacagctataacttcgggcaagccggttgatccaactatcgaaacctcgttagagcagtgtgtggcctaatggcatactggaacctatctgttacgccgagaactcgtgagcaactcagtctcataaagtcatggtccgcactgatgctgcacaaagctaccgattgatacgttcgccgactgtgatgcgtgaatcattccgtcaaagtgtccacccgtgtaggcattggtatatcgaccgatccaagaagcgacgcttagtacgcgattacattgggcagatggtacagctcccataaacgctaggaactgttcgcaagagtcctgtgtcagagtcaaggataccgttcagaggcaaactgaccgtcattcgtgctaaacgatgtgatccgccctttcagacgctagtgttacctggaagaagattggcgctacctatgtcccatacagcgacaaggtcttgtagaaggcatgtcaagctccctaaatggctccgctaaagtacgtgttgagggtctccaaCCGCCCGTCTTGAAACACGGACCAAGGAGTCTAACaaaCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGGTGCAAGCGTTAATCGGAATTACTGGGCGTAAAGCGCACGCAGGCGGTTataagagctttgagcccacccgcatactgatttgactgccttaacttggtgaagccctcggacggaaacttgacatctcgttctatctgaatgagcgcggcacagcttgagtctacttggaattgcattagcaccggcctgccttacaacactgttgcgtattggactaactagcggcctGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGAT
TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAcctagaaagctcgccattagccgcagtagtgattggacatcagagtttcgctcacaacgtcaccgctcgttatggaacttAAACTTGGTCATTTAGAGGAACTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCATTAaagcgttggttcgttacgcaaggctctacgaaagcagtgtctacttagcgttcagtgcagcgatccacaatctcatgggtatgtcatcgaccagctacgacgcaagtttcccagatcaagattaggtgcccttcaagcacggttggaactctaccgacaattacgaggtcccaattacgggtggcaactatgctgtaccagtaagatcctgccgattcgacgcacagtcataactcagtgtacgtgtatcctggcaaggaggaagctccctttacatgctagtgcaatgtccgcagtttgcgagaggactatatccagtctaccacaggtcagaggttacaccctggctatctagtatggAAACTTTCAACAACGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAATTGCAGAATTCCGTGAATCATCGAATCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATGCCTGTTTGAGCGTCATTTaccgtaaagctaggtcaggtcttcactgggcaacgacataatgggtaactcacttccagcctacatcagcggtgtcaaaggtagatgcctatcgtaccacccacaatgctctagggtttcagagaagctgtgtcttccgatggtcaccagatggattcgactcaaggtcatacaggagtgtcgcgtaacatagcctatgcaaccgttcggttaaggacgtGTTTGACCTCAAATCAGGTAGGAGTACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAAACCAACgctgcttagcctataccgtaatcggtgtgcgtgaacactagccaggtactgaatctaggatcgctgtggatctaaccagtccgctacgacaagagtttactaggaccgcctaaatcatcggcgcttaccgttaagaaacctgtccggcgacatatacagtgccattgcgcttgagaatcatgctgtgcgagagacatacacggttccgagttgacatctacgtgaagggcatctttcgatgctgacccgaagtttatctgggaagctacgtcatttgcctaccgctgcgactaatctttgcagacgacatgctatgagcttgctggaccacgaatcgttaccagtcatctgagacacttggcatacgcttgggcttgatacacctatggatgggatacactgatcggctgccgcataatttgctacgccttacagagaagtgcagtctaccggctgttaatactccggctttacacgagaagctactgagggccatttgacacaatcgcgtgagtttgctgatctgacatgggctgaaacatgagcctccgaactatcgtCCGCCCGTCTTGAAACACGGACCAAGGAGTCTAACaaaCACCGGCTAACTCCGTGCCAGCAGCCGCGGTAATACGGAGGGTGCAAGCGTTAATCGGAATTACTGGGCGTAAAGCGCACGCAGGCGGTTgtagttaggcaactctaggcggcaactgctcatcaactaggagtacagtcaatctgacggacgcgctactgcatacttagtcatctactggttccagagccacgggtcatcgtaaattgggtattccgaaatggcccacacgccgttcacgtttcaaatgattggcatctagggacacctGTGGGGAGCAAACAGGATTAGATACCCTGGTAGTCCACGCCGTAAACGAT
TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAtcaggaagtgtgtcccattgccggaggagtcctattgaatcacggattacgtctgtaacgctggaccgaggttgtatcatAAACTTGGTCATTTAGAGGAACTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCATTAgcttcgattacgatgcccaaatacgatccgcgtagtttccacgaggtctacagtaccctattgttcgaggcagtaacctgaaccgcgtctgtcaacagttatgtgacggcaagttgtccaagtccgagccatactatcagtcgtcttagctcatgggaagctcgcagtgttaagctcagtaggcaaattccagcgtgatgccgatccagtgtacgagaatccttacatgcaagtgtcgcaggccagatcagtttcgagaaagagtacgttctatccctggcgtcctcagtgactcaagatgagattacatccacacggtctcggtccattcgcaaagtacagtgtttccttagcagcaggAAACTTTCAACAACGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAATTGCAGAATTCCGTGAATCATCGAATCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATGCCTGTTTGAGCGTCATTTaacatgctgcgtagtacgtcgatcaccaagctatgagcgttgtcaaaggagtgtcaaccgacgagtccaggtttcatcaccttgctaggtatccacaggtgcattaggcggctaagtcttccacatcgtattgccgaagtgtatcgcccagacattcaagctgtcagaactctgcgttacagaacgtgccgtcaagattcaggctatcatccgtgaaccaGTTTGACCTCAAATCAGGTAGGAGTACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAAACCAACtacgtgagatcggtccgatatgagctgtccacaatagccatagactaggagtcacccttcgagtggttctagcacatccagatgacacactaagtgccctgttcgggacttgtaaagcacgattccttggttaagacgcctcccagtcagtatcatggtcgtaaagttcgtccagtggtcaacgctcttcgtcaagcgataagttaaagccggtagctgctcaagcctgccatacggattagttcaaacgagcctgtcgtgtacgttctccgcacaatgtctaacaatggtacggtgcagatagcttccgcccaggttattaaggcaaattggcccatccattctgtcggtcggcaaacagttcctgaaattccgctgaggttgtaagacccggtctgaatagccagatcaatacgtcggtgctgatgagtgccatcacagtttctctaggatagcgcacgttcatgtcgcgtaacgcatctagcatttaggtgcaacggtactacgtccaccagtaggaagttcgcataaacggtcaccttagcctgagtagccgtcaaCCGCCCGTCTTGAAACACGGACCAAGGAGTCTAAC
TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAtcccgcaaatacctttggagtgcgtcactatctaggagtgtgccgatgactcgtaatctccatcctcgaagttgcacgatAAACTTGGTCATTTAGAGGAACTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCATTAataatccagggtccacgagtgaatgccctgcaaatgtaccaagttcctgaccttctggcatgtgaagccgatcttatcgctgaagagtctcgaagtcgctgacatacacccgtattgtcgatctgttggcgtaacggacatacgatgcactgacagcagttgcttagagcctagacacgacattgccttgaacgaccttgctactcatagggatacccgacgtagacgtttagtcctgcaagtcgaaagccctttgtgagagtcgccttatagtaccggatagtctcccagccatattggagagtccatatagccacggtagaatgctccgaggtaacctgagtcaaattgccgcactagAAACTTTCAACAACGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAATTGCAGAATTCCGTGAATCATCGAATCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATGCCTGTTTGAGCGTCATTTagtgacagttcacggtagcagctaaatcttcgggcatcacgagtacatgagtctcccatcgttaatccagcaagccgatgtggagctatttcaacgggacgtatatgtcgtccatccgagttgcggactatctacagggtgaattatgcgactgactgccttgccactacgaaacagtgcgttcaaattgcgctaagggcgtgcgaatacttatgcaggcGTTTGACCTCAAATCAGGTAGGAGTACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAAACCAACatgtccaaccgaaactcgtgatcttagtgaccgcacggatctgtcattcgagaagcgtagagacttatgcctgggccttaacttgtgctcagtagcctcaagagaactgcctcctgtctattacgggtaaactcctggtgatccagagacgtagtgtcagaacagcctagatgtgttgccacgacctgtaaacggctttcttacgacgcaatgctgatggtgactggcgattaacgaaccgaatcatcctgtgtgcatcctacggtgtgccatttgaaccagagagtatcttcgaccacgatctgcaagggtgtcatgcttgacctagagtaccacgttcagttgcctcatagggcttagcagcgtattcatgcgacttgcgataacgatgtcctgtacggacgttccatagtccgacaaacccatgtatgtctgcgagaggttagccaagagtgcttactccacctagtgagatgtagcgacaacgactgtgagtgtacgactccttagggtatagcgttgccaaacttcccaaggtagggagcctttcccattacgaaCCGCCCGTCTTGAAACACGGACCAAGGAGTCTAAC
TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAgacaccctgttcagattagcgagcctcagttacaccagattccgagttcgtaagatcgagaggagccatcatggacgtttAAACTTGGTCATTTAGAGGAACTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCATTActgacggaccaatctgtatgtaaagcggctattcaggagcctatccgacgagttgatgcttacaaggcgatctatccctgaccagtgctaaccatgtgcataagagcagtctcactcacgagtctcggttccttagacgattcaatgccaagttgtgccggagaacacctgttgatcctcgacaatgattcagtccaccgggatgtctgtagttcccaacgccaatatgtagagcttcggtccacgaaagtaccgtggtagccatgatatgacttacgcccgacaaagttcgggagtttctcgcatgtgaagtttccgcaaccatgagcaaggtcgtttgacctggaagtgtatgatccgAAACTTTCAACAACGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAATTGCAGAATTCCGTGAATCATCGAATCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATGCCTGTTTGAGCGTCATTTatctgacagccttctacgagcctgctgaatcagatgaaccacttggtcgcaatgatcgcaaggtcgggtatatcttcacggttagatccgaactgctccactgggtacaacacactgacttggtaactcggtcatacacgtcgggaacataactgcctgtgatagcacgcactcttaggacagtcgcattctctaggtcatggaatagcgcaacatcgctGTTTGACCTCAAATCAGGTAGGAGTACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAAACCAACtccacagtatcatccgatggagcgattcgcatacgacagtcaatggctattggtcaggacctagcttccaagtcaagggaaggtttcaggatcgtcgcatcgtactttcctacgaagtgcctaaagggatcactctccgaacggtttgtatcagcgtgcagatgtacctgttacgccagaggaatgacattctacccgagggatcttacagtccgggatttgtgcaatcacagttgggctctaacgtcaagcgaggtgtatgtcccatgaataaggacggctttctcaggccaagaagtctacgcagaagttacccagctcgtttacggtgtccactcaaagtctagcatgttccggtgacctagttgatggcagtagcagtaccatgacaagaggcttccgattatccagacccagttgtgggctaatatgagcagcaccctagtatttcgcgcaatgccggttatatgaaggccacgtacaagtttctccgcgcatgtgtcagatagtatccggttccacagcataagtccgccagttggttcactaagttgccgacaCCGCCCGTCTTGAAACACGGACCAAGGAGTCTAAC
TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAcatgactggaaaccctctgacgtgtaactctggaagctcagttatcggaaacggcgctaagctacgtgatcgtaagcagtAAACTTGGTCATTTAGAGGAACTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCATTActctgatggacctggtgatacacggtactatttggcatggtcacatcgggcatctgtaagacctccagttgtagtgtgcagagttcccagacagtctaagacggcattgactatggccttgtggttcgagaaccgaacatccaagagtttcgctcgttcatggcgataacccttcaacgtgtggtaacctgtaacgcagtcagctttagcgcgtgaataccttgaggcaatacaccgagttgtgctaccctagtgatgacagaatggcaccttatgctccggtacacctacggaatcatgcaagtggaatccctttcgagagcaggctcagtttagttgcgaagtgatctccgcatttccAAACTTTCAACAACGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAATTGCAGAATTCCGTGAATCATCGAATCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATGCCTGTTTGAGCGTCATTTaacttagggagtatgccgtcgaacatcgctcgtgagtaacttatcgtgcggatacacctcgtacatgccactcggtacttagaatagctggtaacctccgatgctcgcaatgcgtagttctggattccaatggaccaacggtcattcctgggtgacaaagcaatctcctgtagcaggtcacagttctcgtctcgcagtaacgaagtcctcttacgtcatgGTTTGACCTCAAATCAGGTAGGAGTACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAAACCAACtattgacgaccgttgccagagagccatcacttggtttcgactataacgacagatccgtggcctcctaaagttgcgtatgcagtatcgagatgtaccctgcgaaccgagtgtactaacgtgtctgaggaatccattcccgtatcgggcacaacagtatgtgtcttccagatagagggcctttgctgacgaagtcctagactatcgcttagagacgcctacagaccagtaatcgtgaccttctacctgagatgccgtgaacataggtgctaatccgagagcatgtgtacgaactccgaaccttgccattaagggatgagcctactgaactaccgctgatcgtgcgagtatatcctgctgctaacgtaaactcctgagggctacagctaaacagcttggacctagtgtcatatcgccgttccaactgactccttgagagactgcgtaagatttccgccgacattgccaaacgctaattgccgatggtgtaaacgacccgcattccattggttgctaaagcctcgtaagaatccgggctgactatcatgtgagcttgacgctacCCGCCCGTCTTGAAACACGGACCAAGGAGTCTAAC
TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAgcacctagcctttaacgagaagaatgtagccctacgccatcggcatgtgattccatacgatgttacgaaacctgaggcagAAACTTGGTCATTTAGAGGAAGTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCActtctgaaactatgacgcgccaaccggaatcgtgtaatggattgacctacttgctcggacgacggataacgctgtatgcaaatgtgcctgtaactcggctctgcgaactgctctgatctaACTTTCAACAACGGATCTCTTGGCTTCCACATCGATGAAGAACGCAGCGAAATGCGATAAGTAATGTGAATTGCAGAATTCAGTGAATCATCGAATCTTTGAACGCAACTTGCGCCCTTTGGTATTCCGAAGGGCATGCCTGTTTGAGAGtccacgtaaatcagcgcgttatgggtctgacgtaagcacaagggtcctatacacgctactctggttatccctgagaagtcggttaccatgtcacacagtcaggctatatgccctcacgttgattcgagcgaagttactgcaccaagtctggcgtagttagtgttccgtagagcaagtcactcaatcccgagcaaagtgtcgtgatgctgttcagcaagacACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAAACTAACaggtcctcagaggctaatgtttcatgcaatgagatcccgcgtggacaccaccaagattctactgttgtcaagatacgggcgactcgacatggagctactattctatcagaagagccctgccaggcgttcaatcgcatttccatttaatggctgactcgcgcagacgaagtctcctagagttaagtcttacgagcaccgcttgtgtgagcacgatcatacgatactgactaaggcgtcaccgagtttcagaccctacgacatgactgtctttaggccagagtctactagaccgagctttggatgccaacctttccgaagtgagatttacccacagcgttcgtgtgttcgactaacccgcaaagtgttaccataggctggtcctatttcgcagtggctagagagcaatgttccaggatgtgctactacttgccgtgagctagacataccgatggctaagtggatacgttacaggcgcacgtagttctaaccggcttatacggataacctgacccgagcgttattcttatgccgcagagaggtttcttacccgaaggcactagCGACCCGTCTTGAAACACGGACCAAGGAGTCTAAC
TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAtgcggagcatcctagtacaatatccggttgcctataagcccggtatgcgcgaattaacctaactgccagagatgagttccAAACTTGGTCATTTAGAGGAAGTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCAtaggtcacgctagtaccaaggagactcagaccttacagcttgcttgcagacagatcggaatcccacagcagagtttagacgtttggagacagtcccacttcagtcgttggatgcacttagACTTTCAACAACGGATCTCTTGGCTTCCACATCGATGAAGAACGCAGCGAAATGCGATAAGTAATGTGAATTGCAGAATTCAGTGAATCATCGAATCTTTGAACGCAACTTGCGCCCTTTGGTATTCCGAAGGGCATGCCTGTTTGAGAGcagggttccctagtaagtacgattccaatacgcgatccgaatgcggcgtttcctaagcaaggtataatctcctgacgaggagtcgggtccataaggtttccatagttcaccgtgagactgcgatggtctgccaatgttcacttcaagtccgtaagacacggcaagagcctagcatctgttcgttcagagtcatggtatcggacaactgcctgatcttcgaACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAAACTAACgtcacatgcaagctgtttccttctacatgacgagcctctgcgataggtgagtatcccactcattgatagctgccgcaagtcaggagaatacgtccgttagtaaactgtcccatgccgaagctcaagacctggaagtccttgataactggcacactctgagccaactgaacgtgtacgcattacaactccggtgttagcctgcttagctgaaccagcagtaattgttaggcgtcccaacgatccatgatccgcgtgaagaaatctttagcgcccataggcagtaaggtagcccgacatagtgtctattaggcccgaaatcccttagggagcccaatacatgatcttagccgagtcgtaggaacgtccatctcgaaagtcgtttgctagggcaatccaagtctcgatcccgataagttctggctaggttgacaaagcgtccagatccgacgagtaaatggtccctgttaatccgatagtcgcgcaccacggtgaatatagtccgatgacattgacctgtaccagaccgcgtctcaaattgacgaaagcgatgttcgtaaccgCGACCCGTCTTGAAACACGGACCAAGGAGTCTAAC
TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAacggcactgatgttcacccgccgtcgatcatacacgcagggcgatgactctatgcgaggctccgaccagtaacaggcgctAAACTTGGTCATTTAGAGGAACTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCATTAcctggcgaatgtctaaggcgtccatatccgaggtgcagcgcgttgcctgaccattaggcccgtatagttcggcgtgaccgagatgccgctcagtacgacggtctaacaagctggccgcacttgccaacctgtcgcggactgtcttaacggtggcccgacttgctaccacacccgtgggattgtgctacgaagcgtcccgaaggtcctcagcccaagagtcctgtagtgagtacccggagcctcgaccctgatgtgatccgaccagattggagccggtgaccctcagacggagtcaaggtcctacctgtgaagccctgacggcgtggattcctgctagagccaaggagagtgtcccgctacAAACTTTCAACAACGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAATTGCAGAATTCCGTGAATCATCGAATCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATGCCTGTTTGAGCGTCATTTgcggacgatgcctttgtcgataatgctcccgctgtaggccagcgccaatcggctgtgcatttagcgaggtctcacgccagtgcgagtacgagccttcctcctaagcgttcggtcggacaggacatctggatcgcggaaccctaatcccgtgggacaccgtcacttggtcgatgcgcgtagcttgtcaccgcagggactgagaggtcaacccatgcgactgGTTTGACCTCAAATCAGGTAGGAGTACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAAACCAACggtggaaagctcgtctcccaatgccattagcctcggcggagcgatagcagctcctctggaagcatcagtgcgtctgcccaaggcgttcctcgtcggtacaacgtagactgccgctacggacggtgtcaccagggatacactccatagcatccgggtcgcaaggtgtgcgtgccaactacccgacttctaacagggctggccgatactgcgggctcaagtgactcagatcctgaagggcgcaccacgtcgcggactacagtgttcacatgaagcgcggtcgtgcagcgcatggtccataccaactgcctagtacgcgggactggcgtcgaatcgactcgtccttcggaaacatgacggcgcggcctaagcgagaactctgctcgtgtccatcaacggctggcggcgatatgtcctgacctcagccatagtgcctacctcgggagcgttcaagcgatcctcggtcttaacgggcgaactcgggctcgaaagcgaatgcctccctaagctcttcggtggcggacgcggaatcatagctcagcgaactctcacggttgcaggcgCCGCCCGTCTTGAAACACGGACCAAGGAGTCTAAC
TGATTACGTCCCTGCCCTTTGTACACACCGCCCGTCGCTAcgtacctgtcagcacgctgttgaccttagcccgtggcaacgactgtgaagcctccgacacgtactgagggcgattcccagAAACTTGGTCATTTAGAGGAAGTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCAccatactgcgaatgggagccgccggaggtaagtcctttccctgatgaccttgcgcgtagggccgggtaagagcttctccactgactgtcaaccgtgggcacgccgaggatgctactcatgACTTTCAACAACGGATCTCTTGGCTTCCACATCGATGAAGAACGCAGCGAAATGCGATAAGTAATGTGAATTGCAGAATTCAGTGAATCATCGAATCTTTGAACGCAACTTGCGCCCTTTGGTATTCCGAAGGGCATGCCTGTTTGAGAGggcagctttacggttcccagtgcctaatgaggacgcctgggcggaatcgagccttcggaaagacatctgcagcacggtgcctgcaacctgtcggtgacgtatcaggacctggtgtccacccgttgtcagggcttccaaggtcaagcaagtggtgaccggccatgcgtggtcgcttcacagaacatcacggcagtcgccgtatcggcccgagtgagactagACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAAACTAACgtcgtgacacgcttcgacgattgagtcgccgcctacgactgacgatcttccgcctgtagctggatgtgcccgatccgtgaggacattcccacctggactgactcgcatggagactgccacggtgattcgcaacagcccgtagaggcttcgttcgaccacccgatgctgaaagctgctgcgctgatctgagacctcggagggcgtaaactggacacctgccactcggactgtgttcgcacgtcggcttcatagccactggcaaccgcgcttgtgtgcagacggaaccctttagtgcctggcgatgaccctactcccggtgaacggcaatgcaatgggcctggaactgtgacgctcccgtaccttcccttgagaggacctggcatctggacgcaactcctgggtgtgacctgtgagcaacgcctcctactgggtatagcccgcgcttagacgctgctagagccggagacatacgatccctgcgcttacacgcacgcgataggtgcgctcgataatctcggcccggtagtgcaacctgaccagcggtagaccttgatgacggcCGACCCGTCTTGAAACACGGACCAAGGAGTCTAAC
AACTTTCAACAACGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAATTGCAGAATTCCGTGAATCATCGAATCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATGCCTGTTTGAGCGTCATTTgtcgggcgactgctctcatgaccagcgtgggcgtccatggctgagcctcgtgtggctcgagccgacgtctggccgtgagctcgggagggctggtcgagctgctgccacgctctcggctcgatcaccgtgtgacgtcggcgactccaccacggcacggcgacggtgtcacgcgctcctgggGTTTGACCTCAAATCAGGTAGGAGTACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAAACCAAC
AACTTTCAACAACGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAATTGCAGAATTCCGTGAATCATCGAATCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATGCCTGTTTGAGCGTCATTTcccaggagcgcgtgacaccgtcgccgtgccgtggtggagtcgccgacgtcacacggtgatcgagccgagagcgtggcagcatttatattgcaatataaatgctgccacgctctcggctcgatcaccgtgtgacgtcggcgactccaccacggcacggcgacggtgtcacgcgctcctgggGTTTGACCTCAAATCAGGTAGGAGTACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAAACCAAC
AACTTTCAACAACGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAATTGCAGAATTCCGTGAATCATCGAATCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATGCCTGTTTGAGCGTCATTTtaaggcccatgttgtaggtcgaattgctagcaattcgacctacaacatgggccttaatgctgtgcgcaccaagaggatcaaccagtgtcggatgcatccgacactggttgatcctcttggtgcgcacagcatttacccagaagtgtattcctcgaggaatacacttctgggtaagcgtagGTTTGACCTCAAATCAGGTAGGAGTACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAAACCAAC
AACTTTCAACAACGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAATTGCAGAATTCCGTGAATCATCGAATCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATGCCTGTTTGAGCGTCATTTgtggtggagtcgccgacgtcacacggtgatcgagccgagagcgtggcagcatttatattgcaatataaatgctgccacgctctcggctcgatcaccgtgtgacgtcggcgactccaccacggcacggcgacggtgtcacgcgctcctgggttaccgcggctagttcggcgtggctggcacGTTTGACCTCAAATCAGGTAGGAGTACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAAACCAAC
AACTTTCAACAACGGATCTCTTGGTTCTCGCATCGATGAAGAACGCAGCGAAATGCGATACGTAATGTGAATTGCAGAATTCCGTGAATCATCGAATCTTTGAACGCACATTGCGCCCCTTGGTATTCCAGGGGGCATGCCTGTTTGAGCGTCATTTggggcggttaaggaaagtcaaactcccgggctgtgaaggcccagtaggttgcgtagctaagacagcacctcataggcatgctgtgcgcaccaagaggatcatgcctatgaggtgctgtcttagctacgcaacctactgggcctaccaagagacgttacccgttaccgcggcggctggcacGTTTGACCTCAAATCAGGTAGGAGTACCCGCTGAACTTAAGCATATCAATAAGCGGAGGAAAAGAAACCAAC
核酸1~12がpUC19ベクターに組み込まれたプラスミドを作製した。このプラスミドをBsaIまたはBpmIで切断することにより線状化した後、AMpure XP(Agencourt社)により精製した。Qubitアッセイキット(サーモフィッシャーサイエンティフィック)を用いて濃度を測定し、核酸のコピー数を算出した。濃度を調整し、核酸1~12を含むプラスミドの混合溶液(各核酸につき10~106コピー)を調製した。
核酸1~12の混合物(4×106コピー)を様々な量の土壌(300、150、75または37.5mg)に加えた試料から、FastDNA Spin Kit for Soil(MP Biomedicals)を用いてDNAを抽出した。ITS1領域のためのユニバーサルプライマーセットを用いて上記1と同様の条件によりPCRを行い、各試料ごとのアンプリコンライブラリーを得た。MiSeq(イルミナ)を用いてアンプリコンのシークエンシングを行い、結果をDADA2パイプラインにより解析した。
10種類の真菌(Aspergillus oryzae、Candida glabrata、Candida tropicalis、Saccharomyces cerevisiae、Schizosaccharomyces pompe、Trichoderma reesei、Marasmius purpureostriatus Hongo、Hymenoscyphus varicosporoides Tubaki、Emericella nidulansおよびCryptococcus neoformans)ならびに14種類の細菌(Clostridium acetobutylicum、Bacillus subtilis、Bacteroides vulgatus、Pseudomonas putida、Desulfitobacterium hafniense、Deinococcus grandis、Nitrosomonas europaea、Nitrobacter winogradskyi、Escherichia coli、Treponema bryantii、Gemmatimonas aurantiaca、Chloroflexus aurantiacus、Anaerolinea thermophilaおよびDesulfovibrio vulgaris)(個々の真菌および細菌は理化学研究所JCM等から入手)のゲノムDNAが既知量混合された標品を用いて、細菌遺伝子1コピーに対して真菌遺伝子1.5×105コピーが含まれる溶液を調製し、段階的に希釈した。得られた希釈液に対し核酸3~10(各5×104コピー)を添加し、原核生物16S rRNA V4領域のためのユニバーサルプライマーセットおよび真核生物ITS1領域のためのユニバーサルプライマーセットを用いて上記1と同様の条件によりPCRを行い、各試料ごとのアンプリコンライブラリーを得た。MiSeq(イルミナ)を用いてアンプリコンのシークエンシングを行い、結果をDADA2パイプラインにより解析した。
Claims (9)
- (1)真核生物rRNA関連遺伝子由来の核酸配列を含む5’フランキング配列、
(2)天然に存在しない核酸配列からなる人工核酸配列、および
(3)真核生物rRNA関連遺伝子由来の核酸配列を含む3’フランキング配列
からなる部分核酸配列および/またはその相補配列を少なくとも1つ含んでなる核酸であって、
前記部分核酸配列が、以下の部分核酸配列(a)~(d):
(a1)配列番号1の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む5’フランキング配列、
(a2)配列番号8~19のいずれかの核酸配列からなる人工核酸配列、および
(a3)配列番号2の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む3’フランキング配列
からなる部分核酸配列(a);
(b1)配列番号2の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む5’フランキング配列、
(b2)配列番号20~31からなる群から選択される人工核酸配列、および
(b3)配列番号3の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む3’フランキング配列
からなる部分核酸配列(b);
(c1)配列番号3の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む5’フランキング配列、
(c2)配列番号32~43からなる群から選択される人工核酸配列、および
(c3)配列番号4の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む3’フランキング配列
からなる部分核酸配列(c);ならびに
(d1)配列番号4の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む5’フランキング配列、
(d2)配列番号44~55からなる群から選択される人工核酸配列、および
(d3)配列番号5の核酸配列中の、少なくとも20個の連続するヌクレオチドを含む3’フランキング配列
からなる部分核酸配列(d)
からなる群から選択される、核酸。 - 前記部分核酸配列(a)が、
(a1’)配列番号1の核酸配列を含む5’フランキング配列、
(a2)配列番号8~19のいずれかの核酸配列からなる人工核酸配列、および
(a3’)配列番号2の核酸配列を含む3’フランキング配列
からなる;
前記部分核酸配列(b)が、
(b1’)配列番号2の核酸配列を含む5’フランキング配列、
(b2)配列番号20~31からなる群から選択される人工核酸配列、および
(b3’)配列番号3の核酸配列を含む3’フランキング配列
からなる;
前記部分核酸配列(c)が、
(c1’)配列番号3の核酸配列を含む5’フランキング配列、
(c2)配列番号32~43からなる群から選択される人工核酸配列、および
(c3’)配列番号4の核酸配列を含む3’フランキング配列
からなる;ならびに/または
前記部分核酸配列(d)が、
(d1’)配列番号4の核酸配列を含む5’フランキング配列、
(d2)配列番号44~55からなる群から選択される人工核酸配列、および
(d3’)配列番号5の核酸配列を含む3’フランキング配列
からなる、請求項1に記載の核酸。 - (e4)原核生物rRNA遺伝子由来の核酸配列を含む5’フランキング配列、
(e5)天然に存在しない核酸配列からなる人工核酸配列、および
(e6)原核生物rRNA遺伝子由来の核酸配列を含む3’フランキング配列
からなる追加の部分核酸配列(e)および/またはその相補配列をさらに含む、請求項1または2に記載の核酸。 - 前記追加の部分核酸配列(e)が、
(e4’)配列番号6の核酸配列を含む5’フランキング配列、
(e5’)配列番号56または57の人工核酸配列、および
(e6’)配列番号7の核酸配列を含む3’フランキング配列
からなる、請求項3に記載の核酸。 - 配列番号58、59および62~69からなる群から選択される核酸配列および/またはその相補配列からなる、請求項1または2に記載の核酸。
- 配列番号60または61の核酸配列および/またはその相補配列からなる、請求項3または4に記載の核酸。
- 請求項1~6のいずれか1項に記載の核酸を含む発現ベクター。
- 請求項7に記載の発現ベクターを含む形質転換細胞。
- 配列番号8~57からなる群から選択される人工核酸配列中の、少なくとも15個の連続するヌクレオチドを含む核酸配列と少なくとも90%同一の核酸配列またはその相補配列を含むプローブ。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2023527846A JPWO2022259995A1 (ja) | 2021-06-08 | 2022-06-06 |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2021096217 | 2021-06-08 | ||
JP2021-096217 | 2021-06-08 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022259995A1 true WO2022259995A1 (ja) | 2022-12-15 |
Family
ID=84425023
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2022/022726 WO2022259995A1 (ja) | 2021-06-08 | 2022-06-06 | 真核微生物を定量するための内部標準核酸 |
Country Status (2)
Country | Link |
---|---|
JP (1) | JPWO2022259995A1 (ja) |
WO (1) | WO2022259995A1 (ja) |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2015204813A (ja) * | 2014-04-23 | 2015-11-19 | 国立研究開発法人産業技術総合研究所 | 微生物の16SrRNA遺伝子定量用内部標準遺伝子 |
-
2022
- 2022-06-06 JP JP2023527846A patent/JPWO2022259995A1/ja active Pending
- 2022-06-06 WO PCT/JP2022/022726 patent/WO2022259995A1/ja active Application Filing
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2015204813A (ja) * | 2014-04-23 | 2015-11-19 | 国立研究開発法人産業技術総合研究所 | 微生物の16SrRNA遺伝子定量用内部標準遺伝子 |
Non-Patent Citations (2)
Title |
---|
DE FILIPPIS FRANCESCA, LAIOLA MANOLO, BLAIOTTA GIUSEPPE, ERCOLINI DANILO: "Different Amplicon Targets for Sequencing-Based Studies of Fungal Diversity", APPLIED AND ENVIRONMENTAL MICROBIOLOGY, AMERICAN SOCIETY FOR MICROBIOLOGY, US, vol. 83, no. 17, 1 September 2017 (2017-09-01), US , XP093013488, ISSN: 0099-2240, DOI: 10.1128/AEM.00905-17 * |
DIETER M. TOURLOUSSE, YOSHIIKE SATOWA, OHASHI AKIKO, MATSUKURA SATOKO, NODA NAOHIRO, SEKIGUCHI YUJI: "Synthetic spike-in standards for high-throughput 16S rRNA gene amplicon sequencing", NUCLEIC ACIDS RESEARCH, OXFORD UNIVERSITY PRESS, GB, vol. 45, no. 4, 15 December 2016 (2016-12-15), GB , pages 1 - 14, XP055615183, ISSN: 0305-1048, DOI: 10.1093/nar/gkw984 * |
Also Published As
Publication number | Publication date |
---|---|
JPWO2022259995A1 (ja) | 2022-12-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108350484B (zh) | Rna分析的组合物和方法 | |
CN113166797A (zh) | 基于核酸酶的rna耗尽 | |
JP6479336B2 (ja) | 微生物の16SrRNA遺伝子定量用内部標準遺伝子 | |
CN104164488A (zh) | 一种单引物引发的核酸恒温扩增方法 | |
WO2019131470A1 (en) | A primer for next generation sequencer and a method for producing the same, a dna library obtained through the use of a primer for next generation sequencer and a method for producing the same, and a dna analyzing method using a dna library | |
CN111315884A (zh) | 测序文库的归一化 | |
JP6474173B2 (ja) | 標的核酸の定量方法のためのキット | |
Thies | Molecular approaches to studying the soil biota | |
CN105907878A (zh) | 基于高分辨熔解曲线技术的结核分枝杆菌耐药性基因突变检测 | |
US10214759B2 (en) | Probability-directed isolation of nucleotide sequences (PINS) | |
JP6074036B2 (ja) | 拡大された基質範囲を有する新規のdnaポリメラーゼ | |
WO2016119448A2 (zh) | 不同种属微生物间种类和丰度比较的人工外源性参照分子 | |
WO2022259995A1 (ja) | 真核微生物を定量するための内部標準核酸 | |
JP2007060953A (ja) | 細菌叢の分析方法 | |
Nakatsu | Microbial genetics | |
CN110577981B (zh) | 一种基于人工模拟核酸的分子信标 | |
CN112852938A (zh) | 一种用于革兰氏阳性菌耐药基因高通量扩增子测序的引物组及应用 | |
WO2019163672A1 (ja) | 白癬菌の遺伝子をlamp法により検出するためのプライマーセット及びそれを含むキット並びにそれらを用いて白癬菌を検出する方法 | |
CN109355412A (zh) | 一种用于测定细菌与真菌种类和丰度比较的人工合成外源性参照分子 | |
De Maayer et al. | The current state of metagenomic analysis | |
JP6112652B2 (ja) | 標的dnaの特異的増幅方法 | |
US20020086313A1 (en) | Application of bioinformatics for direct study of unculturable microorganisms | |
Lawson et al. | Application of molecular methods for microbial identification in dairy products | |
US6528257B1 (en) | Method for the simultaneous monitoring of individual mutants in mixed populations | |
WO2008015975A1 (fr) | Procédé d'amplification d'un fragment d'adn |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 22820176 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2023527846 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 18568042 Country of ref document: US |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 22820176 Country of ref document: EP Kind code of ref document: A1 |