WO2023159817A1 - Sondes de diagnostic génétique et leur utilisation - Google Patents

Sondes de diagnostic génétique et leur utilisation Download PDF

Info

Publication number
WO2023159817A1
WO2023159817A1 PCT/CN2022/100272 CN2022100272W WO2023159817A1 WO 2023159817 A1 WO2023159817 A1 WO 2023159817A1 CN 2022100272 W CN2022100272 W CN 2022100272W WO 2023159817 A1 WO2023159817 A1 WO 2023159817A1
Authority
WO
WIPO (PCT)
Prior art keywords
nucleic acid
strand
cancer
complementary
probe
Prior art date
Application number
PCT/CN2022/100272
Other languages
English (en)
Chinese (zh)
Inventor
李冰思
宿静
邱福俊
王晨阳
李晓玲
张之宏
汉雨生
Original Assignee
广州燃石医学检验所有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 广州燃石医学检验所有限公司 filed Critical 广州燃石医学检验所有限公司
Publication of WO2023159817A1 publication Critical patent/WO2023159817A1/fr

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • C12Q1/6883Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
    • C12Q1/6886Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/154Methylation markers

Definitions

  • This application relates to the field of biomedicine, in particular to a gene diagnostic probe and its application.
  • DNA methylation is an epigenetic modification, which is catalyzed by DNA methyl-transferase (DNA methyl-transferase, DNMT) to convert S-adenosylmethionine (SAM) into a methyl group.
  • DNA methyl-transferase DNA methyl-transferase
  • DNMT DNA methyl-transferase
  • SAM S-adenosylmethionine
  • the cytosine of the CG two nucleotides of the DNA is selectively methylated, mainly forming 5-methylcytosine (5-mC) (commonly found in the 5'-CG-3' sequence of genes)
  • 5-methylcytosine (5-mC) commonly found in the 5'-CG-3' sequence of genes
  • N6-mA and 7-methylguanine (7-mG) structural genes contain many CpG structures, and the 5th carbon atom of two cytosines in 2CpG and 2GPC is usually methylated , and the two methyl groups present a specific three-dimensional structure in the DNA double-strand major groove.
  • DNA methylation plays an important role in the regulation of gene expression. Aberrant DNA methylation marks have been reported in the development of various diseases, including cancer. As a high-resolution, high-throughput technology, DNA methylation sequencing is increasingly recognized for its role in early cancer screening, diagnosis, and monitoring.
  • WGBS Whole Genome Bisulfite Sequencing
  • WGBS Whole Genome Bisulfite Sequencing
  • CG dinucleotide is the most important methylation site, it is unevenly distributed in the genome, there are hypermethylation, hypomethylation and non-methylation regions, mC accounts for about the total C in mammals 2-7%.
  • CpG islands are abundant in the genome, and these detection and analysis can be greatly assisted by massively parallel nucleic acid sequencing (also known as “high-throughput sequencing” or “next-generation sequencing” (NGS)), making it possible to predict Where and where the cancer occurs becomes possible.
  • massively parallel nucleic acid sequencing also known as “high-throughput sequencing” or “next-generation sequencing” (NGS)
  • Embodiment 3 of Chinese Patent Announcement CN112646888B mentions a kind of utilization Single Cell Kit (Qiagen, Cat#150343) and Mung Bean Nuclease (NEB, Cat#M0250L) were used to process NA12878 DNA to prepare 0% methylation standard.
  • NAB Mung Bean Nuclease
  • human tumor gene detection preparations can be prepared to achieve early detection or early screening of cancers including but not limited to: brain cancer, lung cancer, skin cancer, nasopharyngeal cancer, throat cancer Cancer, liver cancer, bone cancer, lymphoma, pancreatic cancer, skin cancer, bowel cancer, rectal cancer, thyroid cancer, bladder cancer, kidney cancer, oral cancer, stomach cancer, solid tumors, ovarian cancer, esophagus cancer, gallbladder cancer, biliary tract cancer , breast cancer, cervical cancer, uterine cancer, prostate cancer, head and neck cancer, sarcoma, thoracic malignancy (except lung), melanoma, and testicular cancer.
  • cancers including but not limited to: brain cancer, lung cancer, skin cancer, nasopharyngeal cancer, throat cancer Cancer, liver cancer, bone cancer, lymphoma, pancreatic cancer, skin cancer, bowel cancer, rectal cancer, thyroid cancer, bladder cancer, kidney cancer, oral cancer, stomach cancer, solid tumors, ovarian cancer
  • the present application provides a combination of nucleic acid molecules, the combination of nucleic acid molecules comprising at least one nucleic acid probe set covering a target region of a nucleic acid to be detected, characterized in that the set of nucleic acid probes at least comprises: (1) and A first probe complementary to the first strand, the first strand being the sequence of the target region after base substitution; (2) a second probe complementary to the second strand, the second strand being the sequence of the target region The sequence of the complementary region of the target region after base substitution; and contains any one or both of the following two probes: (3) a third probe complementary to the third strand, and the third strand is complementary to the third strand The first strand is complementary; (4) a fourth probe complementary to the fourth strand, which is the complementary sequence of the second strand.
  • the application provides an application of the nucleic acid molecule combination described in the application in the preparation of human tumor gene detection preparations.
  • the present application provides a standard nucleic acid molecule used for assessing the accuracy of the degree of base modification detected in the application of the present application
  • the standard nucleic acid molecule comprises a candidate region where the degree of base modification is about 0%, and the candidate region The total length is about 1 bp to about 10000 bp.
  • Figure 1 shows the methylation measurement results of the "20% standard” and “50% standard” of the application, as well as the “zero methylation standard” and “full methylation standard” of the application ” methylation measurements.
  • FIGS 2A-2C show the uniformity measurement results of the probes designed in this application.
  • Figure 3 shows the repeatability measurement results of the probes designed in this application.
  • FIGS 4A-4C show the bias measurement results of the probes designed in this application.
  • Fig. 5 shows an exemplary reference schematic diagram of the capture probe design of the present application.
  • Fig. 6 shows an exemplary reference schematic diagram for calculating the methylation level in the present application.
  • next-generation gene sequencing NGS
  • high-throughput sequencing or “next-generation sequencing” generally refer to the second-generation high-throughput sequencing technology and higher-throughput sequencing methods developed thereafter.
  • Next-generation sequencing platforms include but are not limited to existing sequencing platforms such as Illumina. With the continuous development of sequencing technology, those skilled in the art can understand that other sequencing methods and devices can also be used for this method. For example, two Generation gene sequencing can have the advantages of high sensitivity, high throughput, high sequencing depth, or low cost.
  • Massively Parallel Signature Sequencing Massively Parallel Signature Sequencing, MPSS
  • Polony Sequencing 454pyro sequencing
  • Illumina (Solexa) sequencing Illumina (Solexa) sequencing
  • Ion semi conductor sequencing DNA nano-ball sequencing
  • Complete Genomics' DNA nanoarray and combined probe anchor ligation sequencing method etc.
  • the second-generation gene sequencing can make it possible to analyze the transcriptome and genome of a species in detail, so it is also called deep sequencing ( deep sequencing).
  • deep sequencing deep sequencing
  • the method of the present application can also be applied to first-generation gene sequencing, second-generation gene sequencing, third-generation gene sequencing or single molecule sequencing (SMS).
  • SMS single molecule sequencing
  • sample to be tested generally refers to a sample that needs to be tested. For example, it is possible to detect whether one or more gene regions on the sample to be tested are modified.
  • complementary region generally refers to a region that is complementary to a reference nucleotide sequence.
  • a complementary nucleic acid can be a nucleic acid molecule that optionally has an opposite orientation.
  • the complementary may refer to having the following complementary associations: guanine and cytosine; adenine and thymine; adenine and uracil.
  • hybridization generally refers to a reaction in which one or more polynucleotides react to form a complex stabilized by hydrogen bonds between the bases of the nucleotide residues. Hydrogen bonding can occur through Watson-Crick base pairing, Hoogstein binding, or in any other sequence-specific manner based on base complementarity.
  • the complex may comprise two strands forming a double helix, three or more strands forming a multi-strand complex, self-hybridizing single strands, or any combination of these.
  • the hybridization reaction may constitute a step in a wider method, such as the initiation of PCR or the enzymatic cleavage of polynucleotides by endonucleases.
  • a second sequence that is completely complementary to a first sequence or that is polymerized by a polymerase using the first sequence as a template is said to be "complementary" to said first sequence.
  • hybridizable refers to the ability of a polynucleotide to form complexes that are stabilized by hydrogen bonds between the bases of the nucleotide residues in a hybridization reaction.
  • a hybridizable nucleotide sequence is at least about 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, or 100% complementary to the sequence to which it hybridizes.
  • polynucleotide represents polymeric forms of nucleotides (deoxyribonucleotides or ribonucleotides) of any length, or analogs thereof.
  • a polynucleotide can have any three-dimensional structure and can perform any function, whether known or unknown.
  • polynucleotides coding or non-coding regions of genes or gene segments, loci (loci) defined by linkage analysis, exons, introns, messenger RNA (mRNA), translocation RNA (tRNA), ribosomal RNA (rRNA), short interfering RNA (siRNA), short-hairpin RNA (shRNA), microRNA (miRNA), ribozyme, cDNA, recombinant polynucleotide, branched polynucleotide, Plasmids, vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes, primers and linkers.
  • loci defined by linkage analysis, exons, introns, messenger RNA (mRNA), translocation RNA (tRNA), ribosomal RNA (rRNA), short interfering RNA (siRNA), short-hairpin RNA (shRNA), microRNA (miRNA), ribozyme, cDNA, recombinant polynu
  • a polynucleotide may include one or more modified nucleotides, such as methylated nucleotides and nucleotide analogs. Modifications to the nucleotide structure, if present, can be imparted either before or after assembly of the polymer. Nucleotide sequences can be interrupted by non-nucleotide components. Polynucleotides may be further modified after polymerization, such as by conjugation with labeling components.
  • the term "modification state” generally refers to the modification state of the gene fragment, nucleotide or its base in the present application.
  • the modification state in the present application may refer to the modification state of cytosine.
  • a gene segment of the present application having a modified state may have altered gene expression activity.
  • the modification status of the present application may refer to the methylation modification of a base.
  • the modified state in this application may refer to the covalent bonding of a methyl group at the 5' carbon position of cytosine in the CpG region of genomic DNA, for example, it may become 5-methylcytosine (5mC).
  • a modification state can refer to the presence or absence of 5-methylcytosine ("5-mCyt") within the DNA sequence.
  • methylation generally refers to the methylation state of a gene fragment, nucleotide or its base in this application.
  • the DNA fragment where the gene in this application is located may have methylation on one strand or multiple strands.
  • the DNA fragment where the gene in this application is located may have methylation at one site or multiple sites.
  • transformation generally refers to the transformation of one or more structures into another structure.
  • the transformations of the present application can be specific.
  • cytosine without methylation modification can be converted into other structures (such as uracil), and cytosine with methylation modification can be substantially unchanged after conversion.
  • cytosine without methylation modification can be cleaved after conversion, and cytosine with methylation modification can be substantially unchanged after conversion.
  • the term “bisulfite”, or “bisulfite” generally refers to a reagent that can distinguish DNA regions with and without modification states.
  • the bisulfite may include bisulfite, or an analog thereof, or a combination thereof.
  • bisulfite can deaminate the amino group of unmodified cytosine to distinguish it from modified cytosine.
  • the term “analogue” generally refers to a substance having a similar structure and/or function.
  • analogs of bisulfite may have a similar structure to bisulfite.
  • an analog of bisulfite may refer to a reagent that can also distinguish between DNA regions that have a modified state and those that do not.
  • the term "about” generally refers to a range of 0.5%-10% above or below the specified value, such as 0.5%, 1%, 1.5%, 2%, 2.5%, above or below the specified value. 3%, 3.5%, 4%, 4.5%, 5%, 5.5%, 6%, 6.5%, 7%, 7.5%, 8%, 8.5%, 9%, 9.5%, or 10%.
  • the present application provides a combination of nucleic acid molecules, the nucleic acid molecule in the combination of nucleic acid molecules has a binding free energy for a nucleic acid sequence derived from a target region and a binding free energy for a nucleic acid sequence derived from a non-target region differs by more than one specific threshold.
  • the specified threshold is from about 12 to about 50 kcal/mol.
  • the specific threshold is about 20 to 30 kcal/mol.
  • the specific threshold is about 20 kcal/mol.
  • the combination of nucleic acid molecules of the present application is based on the screening of candidate target regions to determine suitable nucleic acid molecules.
  • the nucleic acid molecule combination designed for the candidate target region in the present application has a higher binding free energy for the nucleic acid sequence derived from the target region.
  • the combination of nucleic acid molecules designed in the present application for candidate target regions has higher binding free energy for nucleic acid sequences derived from target regions.
  • the nucleic acid molecules in the combination of nucleic acid molecules of the present application have a binding free energy for a nucleic acid sequence derived from a target region that differs from the binding free energy for a nucleic acid sequence derived from a non-target region by about 12 or more.
  • the nucleic acid molecules in the nucleic acid molecule combination of the present application have a binding free energy for the nucleic acid sequence derived from the target region, which is about 12 kcal/mol, about 13 kcal/mol higher than the binding free energy of the nucleic acid sequence derived from the non-target region.
  • the present application provides a combination of nucleic acid molecules, the combination of nucleic acid molecules comprising at least one nucleic acid probe set covering the target region of the nucleic acid to be tested, the set of nucleic acid probes at least comprising: (1) complementary to the first strand
  • the first probe of the first strand is the sequence of the target region after base substitution; (2) the second probe complementary to the second strand, the second strand is the complementary sequence of the target region
  • the sequence of the region after base substitution and may contain any one of the following two probes or may contain two of the following two probes at the same time: (3) the third probe complementary to the third strand, so The third strand is complementary to the first strand; (4) a fourth probe complementary to the fourth strand, the fourth strand being the complementary sequence of the second strand.
  • the nucleic acid molecule combination of the present application is aimed at the target region assuming that the nucleic acid to be tested is zero methylation, the first strand (target upper strand, OT strand) of the region after base replacement, and the complementary region of the region
  • the second strand after base substitution (the lower strand of the target, OB strand), and for the complementary strand of the first strand (the complementary strand of the upper strand of the target, CTOT strand), design a third probe complementary to the third strand ; Simultaneously design a fourth probe complementary to the fourth strand according to the complementary strand of the second strand (the complementary strand of the lower strand of the target, CTOB strand).
  • nucleic acid molecules of the present application are combined with capture probes for methylation detection.
  • the base-substituted site includes a site where cytosine is present.
  • the base substitution comprises a nucleic acid sequence in which cytosine is replaced by thymine or uracil through chemical and/or biological processes.
  • the base substitution includes obtaining a nucleic acid sequence in which all cytosines are replaced with thymine or uracil.
  • the base replacement may include bisulfite conversion treatment, and the unmethylated C in the original upper chain and the original lower chain is converted into uracil
  • the nucleic acid probe set further comprises: (1) a fifth probe complementary to the fifth strand, the fifth strand being the sequence of the target region without base substitution (2) a sixth probe complementary to the sixth strand, the sixth strand being a sequence in which the complementary region of the target region has not undergone base substitution; (3) a seventh probe complementary to the seventh strand, The seventh strand is complementary to the fifth strand; (4) an eighth probe complementary to the eighth strand, the eighth strand being the complementary sequence of the sixth strand.
  • four other probes are designed for the target region assumed to be fully methylated in the nucleic acid to be tested.
  • the combination of nucleic acid molecules comprises a set of nucleic acid probes covering 10,000 or more different target regions of the nucleic acid to be tested.
  • the nucleic acid molecule combination of the present application is aimed at 10000 or more, 15000 or more, 20000 or more, 25000 or more, 30000 or more, 40000 or more, or 50,000 or more different target areas for design.
  • the present application provides a combination of nucleic acid molecules, in which standards for specific methylation levels, for example, the detection results of methylation standards for 20% and/or 50% methylation levels
  • standards for specific methylation levels for example, the detection results of methylation standards for 20% and/or 50% methylation levels
  • the index selected from the following group is met: the fluctuation of the detection result of the methylation level is 25% or lower, and the repeatability is 9E-05 or lower.
  • the fluctuation is the difference between the maximum value and the minimum value of the detection result, and the repeatability is the median value of the mean square error of the methylation level among multiple wells.
  • fluctuations in methylation levels are used to assess the accuracy of nucleic acid molecule combinations.
  • the detection result fluctuation of the nucleic acid molecule combination of the present application is 22% or lower, 23% or lower, 24% or lower, 25% or lower. % or less, 26% or less, or 27% or less.
  • the mean square error of the methylation levels detected by the candidate capture probe combinations for two or more repeated measurements of the nucleic acid molecule combinations of the present application Between about 1.3E-05 and about 2.7E-04, preferably 9E-05 or lower, more preferably about 8E-05 or lower, further preferably about 7E-05 or lower.
  • the nucleic acid molecules in the combination of nucleic acid molecules are about 80 to about 120 bases in length.
  • the nucleic acid molecules in the combination of nucleic acid molecules are about 80, about 90, about 100, about 110, or about 120 bases in length.
  • the region where any two nucleic acid molecules in the combination of nucleic acid molecules overlap comprises about 10 to about 110 bases.
  • the region where any two nucleic acid molecules in the combination of nucleic acid molecules overlap comprises about 10, about 20, about 50, about 70, about 80, about 90, about 100, or about 110 bases.
  • the region to which the nucleic acid molecules in the combination of nucleic acid molecules are complementary does not contain 10 or more consecutive bases overlapping the repeat region.
  • the information of the repeating region can be described in what is known in the art, such as the repeating region (repeats) described in repeatmasker.org.
  • the present application provides a method for designing a combination of nucleic acid molecules, based on the first strand derived from the target region and subjected to base substitution and its complementary strand, and the second strand derived from the target region and subjected to base substitution and its complementary strand.
  • Complementary chains designing combinations of said nucleic acid molecules that are complementary to three or more of the above-mentioned chains.
  • the present application provides a method for designing a combination of nucleic acid molecules, which includes (1) screening target regions, and the combination of nucleic acid molecules designed for candidate target regions has a higher binding free energy for nucleic acid sequences derived from the target region (2) Design 4 probes for the candidate target region, the nucleic acid molecule combination of the present application is aimed at the target region assuming that the nucleic acid to be tested is fully methylated, and the first base substitution in the region One strand and its complementary strand, and four probes are designed for the second strand and its complementary strand after base substitution in the complementary region of the region; (3) screening nucleic acid molecule combinations, for the standard of specific methylation level As a product, a combination of nucleic acid molecules meeting the criteria selected from the following group is screened: the methylation level detection result fluctuation is 25% or lower, and the repeatability is 9E-05 or lower.
  • the present application provides a method for designing a combination of nucleic acid molecules, which includes (1) the combination of nucleic acid molecules designed for a candidate target region of the present application, the binding free energy of the nucleic acid sequence derived from the target region is higher than that of the source The binding free energy of the nucleic acid sequence in the non-target region is about 12 or higher; (2) the nucleic acid molecule combination of the present application is aimed at the target region assuming that the nucleic acid to be tested is fully methylated, and the base substitution of the region is The first strand (Top strand), the second strand (Bottom strand) after base substitution in the complementary region of this region, and the complementary strand (CTOT strand) to the first strand, the second strand complementary to the third strand is designed.
  • the fourth probe complementary to the fourth strand is designed; (3) screening nucleic acid molecule combinations, for the standards of specific methylation levels, screening meets the selection
  • the fluctuation is the difference between the maximum value and the minimum value of the detection result
  • the repeatability is the median value of the mean square error of the methylation level among multiple wells.
  • the nucleic acid molecule is combined with a capture probe for methylation detection.
  • the standard of specific methylation level used in the method of the present application the standard of specific methylation level is prepared by the method of the present application.
  • the present application provides the nucleic acid molecule combination obtained by the design method of the nucleic acid molecule combination of the present application.
  • the nucleic acid molecule is combined with a capture probe for methylation detection.
  • the present application provides a kit comprising the nucleic acid molecule combination of the present application.
  • the present application provides the application of the nucleic acid molecule combination of the present application and/or the kit of the present application in the preparation of human tumor gene detection preparations.
  • the detection preparation is used to detect the base modification level of the target region.
  • the base modification includes methylation modification.
  • the human tumors are from homogenous tumors, heterogeneous tumors, hematological cancers and/or solid tumors.
  • the human tumor is from one or more of the following group of cancers: brain cancer, lung cancer, skin cancer, nasopharyngeal cancer, throat cancer, liver cancer, bone cancer, lymphoma, pancreatic cancer, skin cancer, intestinal Cancer, rectal cancer, thyroid cancer, bladder cancer, kidney cancer, oral cancer, stomach cancer, solid tumor, ovarian cancer, esophageal cancer, gallbladder cancer, biliary tract cancer, breast cancer, cervical cancer, uterine cancer, prostate cancer, head and neck cancer, sarcoma , Thoracic malignancy (except lung), melanoma, testicular cancer.
  • group of cancers brain cancer, lung cancer, skin cancer, nasopharyngeal cancer, throat cancer, liver cancer, bone cancer, lymphoma, pancreatic cancer, skin cancer, intestinal Cancer, rectal cancer, thyroid cancer, bladder cancer, kidney cancer, oral cancer, stomach cancer, solid tumor, ovarian cancer, esophageal cancer, gallbladder cancer, biliary tract cancer
  • the present application provides a method for detecting the level of base modification, comprising providing the nucleic acid molecule combination of the present application and/or the kit of the present application.
  • the base modification includes methylation modification.
  • the present application provides a storage medium, which records a program capable of running the method of the present application.
  • the non-transitory computer readable storage medium may include a floppy disk, a flexible disk, a hard disk, a solid state storage (SSS) (such as a solid state drive (SSD)), a solid state card (SSC), a solid state module (SSM)), an enterprise high-grade flash drives, tape, or any other non-transitory magnetic media, etc.
  • SSD solid state drive
  • SSC solid state card
  • SSM solid state module
  • Non-transitory computer readable storage media may also include punched cards, paper tape, cursor sheets (or any other physical media having a pattern of holes or other optically identifiable markings), compact disc read only memory (CD-ROM) , Rewritable Disc (CD-RW), Digital Versatile Disc (DVD), Blu-ray Disc (BD) and/or any other non-transitory optical media.
  • CD-ROM compact disc read only memory
  • CD-RW Rewritable Disc
  • DVD Digital Versatile Disc
  • BD Blu-ray Disc
  • the present application provides a device, and the device includes the storage medium of the present application.
  • the device further includes a processor coupled to the storage medium, and the processor is configured to execute based on a program stored in the storage medium to implement the method of the present application.
  • the present application provides a nucleic acid molecule used as a standard for detecting the degree of base modification, the nucleic acid molecule comprising a candidate region with a degree of base modification of about 0%.
  • the total length of the candidate region of the present application is about 1 bp to about 10000 bp.
  • the total length of the candidate region of the present application is about 1 bp, about 10 bp, about 100 bp, about 1000 bp, about 10000 bp, about 50000 bp, or about 100000 bp.
  • the nucleic acid molecule can be selected from one or more of the following cell lines: GM24385, GM12878, GM12877, GM24631.
  • the present application provides a method for preparing a base modification degree detection standard, the method comprising determining a candidate region in a nucleic acid molecule where the base modification degree is about 0%.
  • the present application provides the use of a nucleic acid molecule in the preparation of a base modification degree detection standard, the nucleic acid molecule comprising a candidate region with a base modification degree of about 0%.
  • the degree of base modification includes the degree of methylation of cytosine in the candidate region.
  • the present application provides a set of candidate regions before base modification treatment as described in the nucleic acid molecule of the present application, which is used as a standard without base modification treatment.
  • the nucleic acid molecule can serve as a zero methylation standard.
  • the present application provides a method for preparing a base modification degree detection standard, the method comprising determining a candidate region in a nucleic acid molecule where the base modification degree is about 0%.
  • the present application provides the use of a nucleic acid molecule in the preparation of a base modification degree detection standard, the nucleic acid molecule comprising a candidate region with a base modification degree of about 0%.
  • the present application provides a collection of all base-modified candidate regions as described in the nucleic acid molecule of the present application, which is used as a base-modified standard.
  • the nucleic acid molecule can serve as a permethylation standard.
  • the present application provides a method for preparing a base modification degree detection standard, the method comprising determining a candidate region in a nucleic acid molecule where the base modification degree is about 0% before base modification treatment, and subjecting the nucleic acid molecule to Base modification treatment.
  • the present application provides the use of a nucleic acid molecule in the preparation of a base modification degree detection standard, the nucleic acid molecule comprises a candidate region with a base modification degree of about 0% before base modification treatment, and the nucleic acid molecule Perform base modification.
  • a methylation standard with a predetermined degree of base modification is obtained.
  • the base modification treatment comprises contacting the nucleic acid molecule with a methyltransferase. For example, if m% of the above-mentioned full methylation standard is mixed with 1-m% of the above-mentioned zero-methylation standard, the m% methylation standard can be obtained, and the m% methylation The degree of methylation of the standard in the candidate region is m%.
  • the present application provides a kit comprising the nucleic acid molecule of the present application.
  • the kit can be used as a standard for capture probes for methylation detection.
  • next-generation sequencing treated with bisulfite reduces the complexity of the library and poses a great challenge to the specificity of target capture.
  • the hybridization capture method is often suitable for long probes, thus providing Better specificity and tolerance to single-nucleotide polymorphism (SNP, single-nucleotide polymorphism).
  • SNP single-nucleotide polymorphism
  • Tm melting temperature
  • Kinetic calculation method achieves highly uniform capture and good repeatability of genomic target regions.
  • the hybridization process realizes the specific binding of the target DNA (T, target): RNA (P, probe) complementary sequence.
  • the equilibrium constant R eq of this dynamic reaction can be calculated by the standard free energy ⁇ G o , and the latter can be calculated by bisulfite treatment All conversions, or all non-transformations are assumed to be calculated.
  • Hybridization yield ( ⁇ ) can be calculated by forming DNA:RNA complementary binding or single-stranded morphology. Considering that P is far in excess in the system, in order to simplify the calculation,
  • probes in this hybridization capture system can be defined as
  • ⁇ G o ( ⁇ G o (T f P)- ⁇ G o (T f ) ⁇ G o (P))-( ⁇ G o (T n P)- ⁇ G o (T n P) ⁇ G o (P))
  • T n refers to the hybridization product against the target sequence
  • T f refers to the non-specific hybridization product
  • a suitable target region is such that the difference between the free energy of binding ( ⁇ G o ) of the candidate probe for a nucleic acid sequence derived from the target region and the nucleic acid sequence derived from the non-target region is greater than a specified threshold value of about 12 to 50kcal/mol.
  • the difference ( ⁇ G o ) between the free energy of binding of the candidate probes for the target region and the nucleic acid sequence derived from the non-target region is about 20 kcal/mol or higher, about 50 kcal/mol or higher.
  • a 120nt probe only needs to have a 60nt similar sequence with the capture sequence to capture the sequence.
  • the sliding window is performed to obtain the target subsequence set of each probe, and the length of each subsequence is 60nt, and the distance between each subsequence and the probe sequence is calculated.
  • ⁇ G o The calculation of ⁇ G o between two sequences can refer to calculation methods known in the art, such as Zhang, D et al. Nature Chemistry 4, 208-214 (2012).
  • the range of ⁇ G o that can be used to select suitable target regions is about 12 kcal/mol or higher.
  • the ⁇ G o range for selecting a suitable target region is about 20 kcal/mol or higher, about 50 kcal/mol or higher.
  • ⁇ G o can be calculated by the following example:
  • Example 1 (inappropriate probe region, that is, the probe has a similar sequence on the genome, and the ⁇ G o between the similar sequence and the probe is less than the threshold, and the probe is filtered):
  • the ⁇ G o value (full methylation status, only non-CpG C converted to T) calculated from the similar sequence and the probe sequence was 11.55. Regions of interest with a ⁇ G o range of less than about 12 were discarded.
  • Example 2 (inappropriate probe region, the probe has a similar sequence on the genome, the ⁇ G o between the similar sequence and the probe is less than the threshold, and the probe is filtered):
  • the ⁇ G o value (full non-methylation state, all C converted to T) calculated by the similar sequence 1 and the probe sequence is 2.28;
  • the calculated ⁇ G o value (full methylation status, only non-CpG C converted to T) of the similar sequence 1 and the probe sequence was 2.28. Regions of interest with a ⁇ G o range of less than about 12 were discarded.
  • the calculated ⁇ G o value (full methylation status, only non-CpG C converted to T) of the similar sequence 2 and the probe sequence was 4.67. Regions of interest with a ⁇ G o range of less than about 12 were discarded.
  • Example 3 (appropriate probe region, the probe does not have a longer similar sequence on the genome, and the probe is retained):
  • the ⁇ G o value calculated from the similar sequence and the probe sequence is 50.77;
  • the ⁇ G o value (full methylation status, only non-CpG C converted to T) calculated from the similar sequence and the probe sequence was 50.85. Target regions with a ⁇ G o range greater than about 12, the probe region is retained.
  • Figure 5 provides a reference example for illustration only, a double-stranded DNA fragment expected to be detected for methylation shown above, sorted in the direction of the arrow, including the original upper strand (CCGGCATGTTTAAACGCT) and the original lower strand (AGCGTTTAAACATGCCGG), Some of them assume that the cytosine (C) in all CpGs is methylated, marked with -mC.
  • the unmethylated (-mC) modified C in the original upper strand and the original lower strand is converted into uracil (U ), while the methylated C remains C.
  • the base paired with adenine (A) introduced in the PCR amplification of DNA is thymine (T).
  • T thymine
  • the target upper strand complementary strand (CTOT) complementary to the original upper strand with uracil (U) after bisulfite treatment is first formed, and the target upper strand complementary strand (CTOT) with the original upper strand after bisulfite treatment is formed.
  • the target lower strand complementary strand (CTOB) to which the original lower strand of uracil (U) is complementary.
  • the target upper strand (OT) transformed from the original upper strand and complementary to CTOT, and the target lower strand (OB) transformed from the original lower strand complementary to CTOB were formed.
  • the comparison shows that the unmethylated C in the original upper strand and the original lower strand is replaced by T in the target upper strand and the target lower strand, while the methylated C (underlined) remains unchanged .
  • the number and position of methylated C can be identified by measuring the C after bisulfite conversion treatment, so as to achieve the purpose of DNA methylation detection.
  • the above-mentioned process is expressed in the description herein as C being converted to T, C being replaced by T, C being replaced by T, and the like.
  • the capture probes of the present application can be designed based on target regions that are assumed to be unmethylated.
  • the sequence corresponding to Figure 5 is: T T GGTATGTTTAAA T GTT, and design a first probe complementary to the first strand; All C in the original lower strand is converted to T as the second strand, corresponding to the sequence in Figure 5: AGTGTTTAAATATGTTGG, design a second probe complementary to the second strand; at the same time, use the complementary strand of the first strand as the third strand , corresponding to the sequence in Figure 5 is: AACATTTAAACATACCAA, design the third probe complementary to the third strand; at the same time, according to the complementary strand of the second strand as the fourth strand, corresponding to the sequence in Figure 5 is: CC A ACATATTTAAAC A CT to design a fourth probe complementary to the fourth strand.
  • the probes of the present application are also designed for the complementary strands of the two target strands, achieving good coverage. Proven to improve capture performance, such as probe accuracy and reproducibility. It should be noted that the above-mentioned FIG. 5 is only an example for convenience of explanation, and the number of target chains to be selected is actually very large, and is not limited to the sequence in FIG. 5 .
  • the capture probe of the present application can also be further designed according to the target region assumed to be fully methylated.
  • the target region assumed to be fully methylated In the case of CpG islands as the main body of methylation determination, only base sequencing is considered (for example, in Figure 5 The direction of the arrow) means that the base C in "CG" will be methylated, and in other cases, it is considered that methylation will not occur.
  • the target regions of the present application are preferably about 10,000 or more.
  • the performance of the capture probe combination is detected through specific methylation standards, and the final probes used in the probe set are determined.
  • the methylation level is 20% and/or 50% standard test samples
  • the detection deviation is calculated as the difference between the methylation level detected by the candidate capture probe combination and the actual (or theoretical) methylation level/the actual (or theoretical) methylation level.
  • the collection of all probe combinations should cover more than 90% of the target region.
  • the collection of all probe combinations should cover more than 95% of the target area.
  • the collection of all probe combinations should cover more than 99% of the target area. Even more preferably, the collection of all probe combinations should cover 100% of the target area.
  • the repeatability RMSE is calculated as the mean squared error of the methylation levels detected by the candidate capture probe combination for two or more replicate measurements for a standard test sample with a specific methylation level of 20% and/or 50% .
  • the reproducibility, ie, the median squared error of the methylation level between duplicate wells, for a suitable capture probe combination is about 9E-05 or less.
  • the uniformity and bias of the combination of capture probes is tested.
  • the uniformity CV is calculated as,
  • d i represents the sequencing depth of the i-th probe, Indicates the mean of the sequencing depth of all probes.
  • the coverage uniformity CV of a suitable capture probe combination should be less than 1; preferably, the CV should be less than 0.5; more preferably, the CV should be less than 0.3; further preferably, the CV should be less than 0.2.
  • x i represents the sequencing depth of the i-th probe for the target strand (OT+OB)
  • y i represents the sequencing depth of the i-th probe for the complementary strand (CTOT+CTOB)
  • y represents the average value of the sequencing depth of all probes for the complementary strand.
  • OT represents the target upper strand of the target region
  • CTOT represents the complementary strand of the target upper strand
  • OB represents the target lower strand of the target region
  • CTOB represents the complementary strand of the target lower strand
  • the capture probes in the capture probe set are about 80 to about 120 bases in length.
  • the region of overlap between any two capture probes in the capture probe set comprises about 10 to about 110 bases.
  • the region of the capture probe set to which the capture probes are complementary does not contain 10 or more contiguous bases that overlap the repeat region. Repeated regions are described in what is known in the art, for example repeats described in repeatmasker.org.
  • the current methylation standards come from samples obtained by whole genome amplification. However, in the process of obtaining "zero methylation standards" in the amplification process, it may appear that all cytosines in the standards are actually unmethylated, making the above samples After bisulfite conversion, there is no cytosine, and it is prone to large capture deviation, so it is not suitable as a standard for evaluating the performance of capture methods.
  • the present application provides a method for the construction of a methylation standard for a capture probe pair.
  • a methyltransferase such as M.sssI
  • PC perfectly methylated standard
  • NC zero methylation standard
  • Methylation sequencing was carried out for "zero methylated standard” and “full methylated standard”; A specific region with a methylation level of 100% in the “Standards for Methylation” was used as the standard region.
  • the methylation level in the standard area is the actual methylation level of the methylation standard in this application level. For example: after mixing 20% full methylation standard with 80% zero methylation standard, in the selected specific region, it can be regarded as the actual methylation level (also called theoretical methylation level ) is 20%.
  • the reaction conditions for the methyltransferase (eg M.SssI) enzyme are: react at 37° C. for 15 minutes, and react at 65° C. for 20 minutes.
  • the left and right sides of Figure 1 and Figure 3 show the methylation measurement results of the "zero methylation standard" and "full methylation standard" of the present application.
  • the methylation level of NC is 0-0.002
  • the methylation level of PC is 0.97-1.00
  • the methylation standard of this application is suitable for the evaluation of capture probes.
  • the complementary strand group means that the capture probe is only designed for the complementary strand (CTOT+CTOB)
  • the target strand group means that the capture probe is designed for the target strand (OT+OB)
  • the double-strand group means the double-strand for the target strand and the complementary strand Design capture probes.
  • OT means the upper chain of the target
  • CTOT means the complementary chain of the upper chain of the target
  • OB means the lower chain of the target
  • CTOB means the complementary chain of the lower chain of the target.
  • the uniformity of the capture probe combination is used to evaluate the uniformity of probe coverage on different target regions, and the CV range of the coefficient of variation.
  • the horizontal axis of the figure below represents different methylation levels, and the vertical axis represents the sequencing depth.
  • Figures 2A-2C show the uniformity measurement results of the three probes designed by the present application for the double strand, the target strand, and the complementary strand. The results showed that the uniformity of the double-strand probe design was better than that of the complementary-strand probe design alone, and was close to that of the traditional target-strand probe design.
  • the horizontal axis represents different methylation levels
  • the vertical axis represents the deviation between repeated samples
  • Figure 3 shows the repeatability measurement results of the three probes designed by the present application for the double strand, the target strand, and the complementary strand. The results showed that the reproducibility of the double-strand probe design was better than that of the complementary-strand probe design alone, and was close to that of the target-strand probe design.
  • the median estimated repeatability was 1.22E-04 for the 20% methylated standard and 1.23E-04 for the 50% methylated standard; for the complementary strand, 20% methylated Median repeatability of assessments for methylated standards was 1.12E-04, median repeatability for assessments of 50% methylated standards was 9.16E-05; for double strands preferred for this application, 20% methylated standards assessed The median repeatability value of 8.05E-05 was 8.05E-05, and the repeatability median value of 50% methylation standard evaluation was 7.03E-05.
  • the capture strand preference evaluates the depth of capture of the target strand (OT+OB) and complementary strand (CTOT+CTOB) by different probes.
  • the horizontal axis of the figure indicates the coverage depth of the target strand, and the vertical axis indicates the sequencing depth of the complementary strand.
  • the results showed a lower strand preference R ⁇ 2 for capture using double-stranded probes.
  • Figures 4A-4C show the bias measurement results of the three probes designed by the present application for the double strand, the target strand, and the complementary strand. The results showed that the design bias of the double-strand probe was better than that of the complementary-strand probe design alone, and was close to that of the traditional target-strand probe design.

Landscapes

  • Chemical & Material Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Engineering & Computer Science (AREA)
  • Immunology (AREA)
  • Pathology (AREA)
  • Analytical Chemistry (AREA)
  • Zoology (AREA)
  • Genetics & Genomics (AREA)
  • Wood Science & Technology (AREA)
  • Physics & Mathematics (AREA)
  • Biotechnology (AREA)
  • Microbiology (AREA)
  • Molecular Biology (AREA)
  • Hospice & Palliative Care (AREA)
  • Biophysics (AREA)
  • Oncology (AREA)
  • Biochemistry (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

L'invention concerne des sondes de diagnostic génétique et leur utilisation. Une sonde est une combinaison de molécules d'acide nucléique, la combinaison de molécules d'acide nucléique comprenant au moins un groupe de sondes nucléiques qui recouvre une zone cible d'un acide nucléique à détecter, et le groupe de sondes nucléiques comprenant au moins des sondes nucléiques qui couvrent des brins sens et antisens de la zone cible ainsi que leurs brins complémentaires respectifs.
PCT/CN2022/100272 2022-02-28 2022-06-22 Sondes de diagnostic génétique et leur utilisation WO2023159817A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210185115.7 2022-02-28
CN202210185115.7A CN114438080A (zh) 2022-02-28 2022-02-28 一种基因诊断探针及其应用

Publications (1)

Publication Number Publication Date
WO2023159817A1 true WO2023159817A1 (fr) 2023-08-31

Family

ID=81373469

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/100272 WO2023159817A1 (fr) 2022-02-28 2022-06-22 Sondes de diagnostic génétique et leur utilisation

Country Status (2)

Country Link
CN (1) CN114438080A (fr)
WO (1) WO2023159817A1 (fr)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114438080A (zh) * 2022-02-28 2022-05-06 广州燃石医学检验所有限公司 一种基因诊断探针及其应用

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120149593A1 (en) * 2009-01-23 2012-06-14 Hicks James B Methods and arrays for profiling dna methylation
CN107447004A (zh) * 2017-08-11 2017-12-08 北京呈诺医学科技有限公司 Dna甲基化pcr检测引物或探针的特异性检测方法
CN108018336A (zh) * 2018-01-05 2018-05-11 山东师范大学 一种dna甲基化检测试剂盒及其使用方法
CN112522407A (zh) * 2020-12-14 2021-03-19 北京起源聚禾生物科技有限公司 一种用于血浆游离dna基因甲基化检测的超灵敏检测方法
CN114438080A (zh) * 2022-02-28 2022-05-06 广州燃石医学检验所有限公司 一种基因诊断探针及其应用

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
ES2534200T3 (es) * 2009-08-03 2015-04-20 Epigenomics Ag Métodos para la preservación de la complejidad de la secuencia del ADN genómico

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120149593A1 (en) * 2009-01-23 2012-06-14 Hicks James B Methods and arrays for profiling dna methylation
CN107447004A (zh) * 2017-08-11 2017-12-08 北京呈诺医学科技有限公司 Dna甲基化pcr检测引物或探针的特异性检测方法
CN108018336A (zh) * 2018-01-05 2018-05-11 山东师范大学 一种dna甲基化检测试剂盒及其使用方法
CN112522407A (zh) * 2020-12-14 2021-03-19 北京起源聚禾生物科技有限公司 一种用于血浆游离dna基因甲基化检测的超灵敏检测方法
CN114438080A (zh) * 2022-02-28 2022-05-06 广州燃石医学检验所有限公司 一种基因诊断探针及其应用

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
YUANHUA WEI, PAN SHIYANG, WANG XIAN, CHEN DAN, ZHANG LIXIA, HUANG PEIJUN, TONG MINGQIN: "Quality control of the procedure of methylation-specific PCR ", CHINESE JOURNAL OF CLINICAL LABORATORY SCIENCE, vol. 24, no. 1, 15 January 2006 (2006-01-15), pages 44 - 48, XP093087182 *

Also Published As

Publication number Publication date
CN114438080A (zh) 2022-05-06

Similar Documents

Publication Publication Date Title
Khodakov et al. Diagnostics based on nucleic acid sequence variant profiling: PCR, hybridization, and NGS approaches
KR102028375B1 (ko) 희귀 돌연변이 및 카피수 변이를 검출하기 위한 시스템 및 방법
TWI832482B (zh) 核酸鹼基修飾的測定
CN110785490A (zh) 用于检测基因组变异和dna甲基化状态的组合物和方法
CN110964814B (zh) 用于核酸序列变异检测的引物、组合物及方法
CN114574581A (zh) 检测稀有突变和拷贝数变异的系统和方法
CN110982907B (zh) 甲状腺结节相关rDNA甲基化标志物及其应用
JP2019536474A (ja) メチル化dnaの多重検出方法
US20190106735A1 (en) Method for analyzing cancer gene using multiple amplification nested signal amplification and kit
WO2012143481A2 (fr) Marqueurs du cancer de la prostate
JP2021531016A (ja) 無細胞dna損傷分析およびその臨床応用
WO2023159817A1 (fr) Sondes de diagnostic génétique et leur utilisation
CN101575639A (zh) 可二次验证碱基信息的dna测序方法
CN112210601A (zh) 基于粪便样本的结直肠癌筛查试剂盒
CN115103909A (zh) 用于靶向核酸捕获的系统和方法
CN114667355A (zh) 检测结直肠癌的方法
WO2022262831A1 (fr) Substance et procédé pour l'évaluation tumorale
US8377657B1 (en) Primers for analyzing methylated sequences and methods of use thereof
US20130309667A1 (en) Primers for analyzing methylated sequences and methods of use thereof
CN113637754B (zh) 生物标志物在诊断食管癌中的应用
CN112210602B (zh) 基于粪便样本的结直肠癌筛查方法
CN115491411A (zh) 鉴别胰腺炎和胰腺癌的甲基化标志物及其应用
KR20190116773A (ko) 분자 인덱스된 바이설파이트 시퀀싱
KR102559496B1 (ko) 핵산의 메틸화 여부 확인용 조성물 및 핵산의 메틸화 여부를 확인하는 방법
WO2022188776A1 (fr) Marqueur de méthylation génique ou combinaison associée pouvant être utilisés pour le diagnostic compagnon du carcinome gastrique her2, et son utilisation

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22928088

Country of ref document: EP

Kind code of ref document: A1