WO2022144003A1 - 一种用于高通量靶向测序的多重pcr文库构建方法 - Google Patents
一种用于高通量靶向测序的多重pcr文库构建方法 Download PDFInfo
- Publication number
- WO2022144003A1 WO2022144003A1 PCT/CN2021/143948 CN2021143948W WO2022144003A1 WO 2022144003 A1 WO2022144003 A1 WO 2022144003A1 CN 2021143948 W CN2021143948 W CN 2021143948W WO 2022144003 A1 WO2022144003 A1 WO 2022144003A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- mocode
- barcode
- sequence
- sequencing
- adapter
- Prior art date
Links
- 238000012163 sequencing technique Methods 0.000 title claims abstract description 96
- 238000000034 method Methods 0.000 title claims abstract description 56
- 238000007403 mPCR Methods 0.000 title claims abstract description 35
- 238000006243 chemical reaction Methods 0.000 claims abstract description 27
- 238000010276 construction Methods 0.000 claims abstract description 27
- 108010042407 Endonucleases Proteins 0.000 claims abstract description 20
- 102000004533 Endonucleases Human genes 0.000 claims abstract description 13
- 230000003321 amplification Effects 0.000 claims description 35
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 35
- 125000003729 nucleotide group Chemical group 0.000 claims description 32
- 102000004190 Enzymes Human genes 0.000 claims description 24
- 108090000790 Enzymes Proteins 0.000 claims description 24
- 239000011324 bead Substances 0.000 claims description 23
- 239000002773 nucleotide Substances 0.000 claims description 21
- 108090000623 proteins and genes Proteins 0.000 claims description 15
- 230000000295 complement effect Effects 0.000 claims description 14
- UFJPAQSLHAGEBL-RRKCRQDMSA-N dITP Chemical compound O1[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C[C@@H]1N1C(N=CNC2=O)=C2N=C1 UFJPAQSLHAGEBL-RRKCRQDMSA-N 0.000 claims description 10
- 238000012165 high-throughput sequencing Methods 0.000 claims description 9
- AHCYMLUZIRLXAA-SHYZEUOFSA-N Deoxyuridine 5'-triphosphate Chemical compound O1[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C[C@@H]1N1C(=O)NC(=O)C=C1 AHCYMLUZIRLXAA-SHYZEUOFSA-N 0.000 claims description 7
- 230000002457 bidirectional effect Effects 0.000 claims description 6
- 238000007385 chemical modification Methods 0.000 claims description 6
- 230000006862 enzymatic digestion Effects 0.000 claims description 5
- 108091027568 Single-stranded nucleotide Proteins 0.000 claims description 2
- 230000008569 process Effects 0.000 abstract description 9
- 239000000047 product Substances 0.000 description 84
- 239000013615 primer Substances 0.000 description 58
- 108020004414 DNA Proteins 0.000 description 37
- 150000007523 nucleic acids Chemical class 0.000 description 28
- 102000039446 nucleic acids Human genes 0.000 description 26
- 108020004707 nucleic acids Proteins 0.000 description 26
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 23
- 238000011144 upstream manufacturing Methods 0.000 description 20
- 238000010586 diagram Methods 0.000 description 17
- 239000000203 mixture Substances 0.000 description 13
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 12
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 12
- 230000000694 effects Effects 0.000 description 12
- 239000012634 fragment Substances 0.000 description 12
- 239000000523 sample Substances 0.000 description 12
- 108091034117 Oligonucleotide Proteins 0.000 description 11
- 238000003776 cleavage reaction Methods 0.000 description 11
- 230000007017 scission Effects 0.000 description 11
- 230000029087 digestion Effects 0.000 description 10
- 102000012410 DNA Ligases Human genes 0.000 description 9
- 108010061982 DNA Ligases Proteins 0.000 description 9
- 210000004369 blood Anatomy 0.000 description 9
- 239000008280 blood Substances 0.000 description 9
- 102100031780 Endonuclease Human genes 0.000 description 8
- 238000000246 agarose gel electrophoresis Methods 0.000 description 8
- 238000005516 engineering process Methods 0.000 description 7
- 108091033319 polynucleotide Proteins 0.000 description 7
- 102000040430 polynucleotide Human genes 0.000 description 7
- 238000002360 preparation method Methods 0.000 description 7
- 239000002096 quantum dot Substances 0.000 description 7
- 108091008146 restriction endonucleases Proteins 0.000 description 7
- 108060002716 Exonuclease Proteins 0.000 description 6
- 210000004027 cell Anatomy 0.000 description 6
- 239000003153 chemical reaction reagent Substances 0.000 description 6
- 102000013165 exonuclease Human genes 0.000 description 6
- 238000007481 next generation sequencing Methods 0.000 description 6
- 239000002157 polynucleotide Substances 0.000 description 6
- 239000011541 reaction mixture Substances 0.000 description 6
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 5
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 5
- 239000000539 dimer Substances 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 239000000178 monomer Substances 0.000 description 5
- 238000006467 substitution reaction Methods 0.000 description 5
- LSNNMFCWUKXFEE-UHFFFAOYSA-M Bisulfite Chemical compound OS([O-])=O LSNNMFCWUKXFEE-UHFFFAOYSA-M 0.000 description 4
- 108020004635 Complementary DNA Proteins 0.000 description 4
- 238000010804 cDNA synthesis Methods 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- 238000010790 dilution Methods 0.000 description 4
- 239000012895 dilution Substances 0.000 description 4
- 239000012530 fluid Substances 0.000 description 4
- 239000010931 gold Substances 0.000 description 4
- 229910052737 gold Inorganic materials 0.000 description 4
- -1 oligomers Proteins 0.000 description 4
- 210000002381 plasma Anatomy 0.000 description 4
- 239000012264 purified product Substances 0.000 description 4
- 102200120949 rs199517715 Human genes 0.000 description 4
- 102000053602 DNA Human genes 0.000 description 3
- 108010017826 DNA Polymerase I Proteins 0.000 description 3
- 102000004594 DNA Polymerase I Human genes 0.000 description 3
- 102000003960 Ligases Human genes 0.000 description 3
- 108090000364 Ligases Proteins 0.000 description 3
- 101710163270 Nuclease Proteins 0.000 description 3
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 3
- LSNNMFCWUKXFEE-UHFFFAOYSA-N Sulfurous acid Chemical compound OS(O)=O LSNNMFCWUKXFEE-UHFFFAOYSA-N 0.000 description 3
- 108010006785 Taq Polymerase Proteins 0.000 description 3
- 239000012472 biological sample Substances 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 238000001962 electrophoresis Methods 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- 239000002751 oligonucleotide probe Substances 0.000 description 3
- 238000003752 polymerase chain reaction Methods 0.000 description 3
- 239000002342 ribonucleoside Substances 0.000 description 3
- 210000002966 serum Anatomy 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 108010037497 3'-nucleotidase Proteins 0.000 description 2
- 108091093088 Amplicon Proteins 0.000 description 2
- 239000003155 DNA primer Substances 0.000 description 2
- 108010082610 Deoxyribonuclease (Pyrimidine Dimer) Proteins 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 2
- 108020005196 Mitochondrial DNA Proteins 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- 235000014443 Pyrus communis Nutrition 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 241000589500 Thermus aquaticus Species 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- 108020004566 Transfer RNA Proteins 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- 239000000980 acid dye Substances 0.000 description 2
- 239000011543 agarose gel Substances 0.000 description 2
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 2
- 238000001369 bisulfite sequencing Methods 0.000 description 2
- 238000011109 contamination Methods 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- 239000000975 dye Substances 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 238000003384 imaging method Methods 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 108020004999 messenger RNA Proteins 0.000 description 2
- 230000011987 methylation Effects 0.000 description 2
- 238000007069 methylation reaction Methods 0.000 description 2
- 238000012164 methylation sequencing Methods 0.000 description 2
- 239000002777 nucleoside Substances 0.000 description 2
- 239000002987 primer (paints) Substances 0.000 description 2
- 238000011002 quantification Methods 0.000 description 2
- 108020004418 ribosomal RNA Proteins 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 108010068698 spleen exonuclease Proteins 0.000 description 2
- FMKJUUQOYOHLTF-OWOJBTEDSA-N (e)-4-azaniumylbut-2-enoate Chemical compound NC\C=C\C(O)=O FMKJUUQOYOHLTF-OWOJBTEDSA-N 0.000 description 1
- YKBGVTZYEHREMT-KVQBGUIXSA-N 2'-deoxyguanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](CO)O1 YKBGVTZYEHREMT-KVQBGUIXSA-N 0.000 description 1
- CKTSBUTUHBMZGZ-ULQXZJNLSA-N 4-amino-1-[(2r,4s,5r)-4-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-5-tritiopyrimidin-2-one Chemical compound O=C1N=C(N)C([3H])=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 CKTSBUTUHBMZGZ-ULQXZJNLSA-N 0.000 description 1
- OIRDTQYFTABQOQ-KQYNXXCUSA-N Adenosine Natural products C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 108020000946 Bacterial DNA Proteins 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- 238000000018 DNA microarray Methods 0.000 description 1
- 108010008286 DNA nucleotidylexotransferase Proteins 0.000 description 1
- 102100029764 DNA-directed DNA/RNA polymerase mu Human genes 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 1
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 1
- 102100037696 Endonuclease V Human genes 0.000 description 1
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N EtOH Substances CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 1
- 108010046914 Exodeoxyribonuclease V Proteins 0.000 description 1
- 102100037091 Exonuclease V Human genes 0.000 description 1
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 241000221961 Neurospora crassa Species 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 108091093037 Peptide nucleic acid Proteins 0.000 description 1
- 102000004861 Phosphoric Diester Hydrolases Human genes 0.000 description 1
- 108090001050 Phosphoric Diester Hydrolases Proteins 0.000 description 1
- 239000004952 Polyamide Substances 0.000 description 1
- 206010036790 Productive cough Diseases 0.000 description 1
- 239000013614 RNA sample Substances 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- 108091081021 Sense strand Proteins 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 108020004459 Small interfering RNA Proteins 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 108010017842 Telomerase Proteins 0.000 description 1
- 241000204666 Thermotoga maritima Species 0.000 description 1
- 101000803944 Thermus filiformis DNA ligase Proteins 0.000 description 1
- 241000589497 Thermus sp. Species 0.000 description 1
- 241000589499 Thermus thermophilus Species 0.000 description 1
- 101000803959 Thermus thermophilus (strain ATCC 27634 / DSM 579 / HB8) DNA ligase Proteins 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-N Thiophosphoric acid Chemical class OP(O)(S)=O RYYWUUFWQRZTIU-UHFFFAOYSA-N 0.000 description 1
- 108020005202 Viral DNA Proteins 0.000 description 1
- 108020000999 Viral RNA Proteins 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- 150000001408 amides Chemical class 0.000 description 1
- 210000004381 amniotic fluid Anatomy 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 239000003146 anticoagulant agent Substances 0.000 description 1
- 229940127219 anticoagulant drug Drugs 0.000 description 1
- 210000001106 artificial yeast chromosome Anatomy 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- 210000000941 bile Anatomy 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 210000000481 breast Anatomy 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 210000001175 cerebrospinal fluid Anatomy 0.000 description 1
- YTRQFSDWAXHJCC-UHFFFAOYSA-N chloroform;phenol Chemical compound ClC(Cl)Cl.OC1=CC=CC=C1 YTRQFSDWAXHJCC-UHFFFAOYSA-N 0.000 description 1
- 210000004252 chorionic villi Anatomy 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 230000009918 complex formation Effects 0.000 description 1
- 230000000536 complexating effect Effects 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 238000012864 cross contamination Methods 0.000 description 1
- 230000001351 cycling effect Effects 0.000 description 1
- 235000013365 dairy product Nutrition 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 239000005549 deoxyribonucleoside Substances 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 239000005447 environmental material Substances 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- 150000002170 ethers Chemical class 0.000 description 1
- 210000003608 fece Anatomy 0.000 description 1
- 210000004700 fetal blood Anatomy 0.000 description 1
- 230000001605 fetal effect Effects 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 230000002496 gastric effect Effects 0.000 description 1
- 229930182470 glycoside Natural products 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 210000004251 human milk Anatomy 0.000 description 1
- 235000020256 human milk Nutrition 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 239000006210 lotion Substances 0.000 description 1
- 210000002751 lymph Anatomy 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- YACKEPLHDIMKIO-UHFFFAOYSA-N methylphosphonic acid Chemical class CP(O)(O)=O YACKEPLHDIMKIO-UHFFFAOYSA-N 0.000 description 1
- 238000009629 microbiological culture Methods 0.000 description 1
- 210000003097 mucus Anatomy 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 125000006501 nitrophenyl group Chemical group 0.000 description 1
- 238000001668 nucleic acid synthesis Methods 0.000 description 1
- 150000003833 nucleoside derivatives Chemical class 0.000 description 1
- 150000004713 phosphodiesters Chemical class 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- QVLTXCYWHPZMCA-UHFFFAOYSA-N po4-po4 Chemical class OP(O)(O)=O.OP(O)(O)=O QVLTXCYWHPZMCA-UHFFFAOYSA-N 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 229920002647 polyamide Polymers 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 239000003761 preservation solution Substances 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 210000004908 prostatic fluid Anatomy 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 210000000582 semen Anatomy 0.000 description 1
- 238000011896 sensitive detection Methods 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000004904 shortening Methods 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 210000003802 sputum Anatomy 0.000 description 1
- 208000024794 sputum Diseases 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 210000004243 sweat Anatomy 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 210000001138 tear Anatomy 0.000 description 1
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical class CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1065—Preparation or screening of tagged libraries, e.g. tagged microorganisms by STM-mutagenesis, tagged polynucleotides, gene tags
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1093—General methods of preparing gene libraries, not provided for in other subgroups
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
Definitions
- the present disclosure relates to the field of biomedicine, and more particularly, to a method for constructing a DNA library, in particular to a method for constructing a multiplex PCR library for high-throughput targeted sequencing.
- the present disclosure relates to the technical field of library construction, in particular to a method for constructing a targeted high-throughput DNA library.
- a method for constructing a targeted high-throughput DNA library In the past decade, with the continuous advancement of next-generation sequencing technology, the application of life science research has also continued to expand. Different nucleic acid preparation methods and sequencing library construction methods are also more efficient.
- High-throughput sequencing High-Throughput Sequencing
- NGS next-generation sequencing technology
- the sequencing read length is short, and the general sequencing length is 2x300bp or 2x150bp.
- the obtained short-read sequences are very difficult to align and assemble when there is no reference genome alignment and assembly, or when the genome contains highly complex structural sequences.
- the splicing and assembly of short sequences can be assisted by a large-span large fragment library (mate pair library).
- mate pair library a large-span large fragment library
- structural variation of large fragments of chromosomes such as insertions, deletions, inversions, and translocations, can be detected.
- High-throughput targeted sequencing is a very cost-effective and highly sensitive detection method, and the key link is the targeted enrichment of target genes.
- the main methods to achieve targeted enrichment include hybridization capture and PCR-based methods.
- Library Construction Methods In general, hybrid capture-based methods require the use of streptavidin-coated magnetic beads, which are expensive and complicated, and require more DNA samples.
- PCR-based targeted enrichment technology using molecular barcode (Unique Molecular Identifier, UMI) technology has made great progress, which can solve the original difficulty of removing PCR repeats. Errors in UMI are still difficult to eliminate and the operation steps are cumbersome. Therefore, it is necessary to provide an accurate, efficient and simple method for constructing a multiplex PCR-targeted enrichment library.
- molecular barcode Unique Molecular Identifier
- PCR-based targeted enrichment library construction methods mainly include AmpliSeq (thermo), SLIM Amplification, Relay PCR, etc. These methods all include a two-step PCR reaction, that is, the first step is targeted amplification of the target fragment, and the second step is PCR enrichment after adapter ligation, but these methods all use traditional TA ligation or blunt-end ligation, and the overall library construction process does not add control. In the non-specific amplification link, the non-specific amplification products cannot be well removed. This situation is particularly prominent in targeted methylation sequencing. Due to the bisulfite-treated DNA, most cytosines are changed to thymines, which makes it easier to form primer-dimers or non-specific amplification between multiple primers.
- the purpose of the present disclosure is to provide a multiplex PCR library construction method for high-throughput targeted sequencing.
- the present disclosure relates to a method for constructing a multiplex PCR library for high-throughput targeted sequencing.
- the specific amplification product is added with a polybasic MoCODE barcode, and the MoCODE barcode is used to make the amplification product and the MoCODE barcode decoding sequence.
- Sequencing adapters are efficiently connected to build a library.
- the MoCODE barcode refers to the protruding single-stranded nucleotide sequence that constitutes the two sticky ends of the obtained PCR product after digesting multiple PCR products with a specific endonuclease.
- the MoCODE barcode The barcode decoding sequence is the nucleotide sequence complementary to the MoCODE barcode.
- the generation method of the MoCODE barcode includes: one or more of modified nucleotides, nicking enzymes, endonucleases, chemical modifications, photolytic bases, etc.; preferably, the Modified nucleotides include one or more of dUTP, dITP, and RNA bases.
- the MoCODE barcodes may or may not be identical within the molecule.
- the MoCODE barcode is a non-random specific barcode.
- the length of the MoCODE barcode is 2-20nt.
- the MoCODE barcode decoding sequence and the MoCODE barcode sequence are complementary sequences with a length of 2-20 nt.
- the sequencing adapter can be artificially designed and synthesized, or match the sequence of the target segment itself.
- the sequencing adapter can be a single adapter or a bidirectional adapter.
- each specific segment enrichment can be decoded by single-linker decoding, double-linker decoding or automatic circularization decoding.
- the present disclosure also relates to a primer for multiplex PCR for high-throughput targeted sequencing, the primer comprising a MoCODE barcode generation sequence, preferably, the sequence of the primer comprises Seq ID Nos: 1-22, 27-52, Sequences shown in 53, 55, 57-104, 109, 111.
- the present disclosure also relates to a sequencing adapter for multiplex PCR for high-throughput targeted sequencing
- the sequencing adapter comprises a MoCODE barcode decoding sequence
- the sequencing adapter further comprises a sequencing adapter, an index of a sequencing platform
- the sequencing adapter includes a high-throughput sequencing universal sequence, an index tag and the MoCODE barcode decoding sequence
- the sequence of the sequencing adapter includes Seq ID Nos: 23-26, 54 , 56, 105-108, 110, 112.
- a multiplex PCR library construction method for high-throughput targeted sequencing of the present disclosure includes the following steps:
- each primer participating in the multiple PCR reactions includes a specific MoCODE barcode generation sequence, preferably, the primers also include gene-specific sequences;
- step 6) connecting the purified PCR product containing the MoCODE barcode obtained in step 5) and a sequencing adapter, the sequencing adapter containing the MoCODE barcode decoding sequence complementary to the MoCODE;
- step 6) Purify the ligation product obtained in step 6) with magnetic beads to complete the construction of a multiplex PCR library for high-throughput targeted sequencing.
- the generation method of the MoCODE barcode in step 4) includes: one or more of modified nucleotides, nicking enzymes, endonucleases, chemical modifications, photodegradable bases, etc.; preferably Typically, the modified nucleotides include one or more of dUTP, dITP, and RNA bases. More preferably, the MoCODE barcode is generated by enzymatic digestion with a specific endonuclease.
- a MoCODE barcode is generated at each of the 5' and 3' sticky ends, wherein the MoCODE barcodes of the 5' and 3' sticky ends may be the same or different.
- the sequencing adapter in step 6) can be a single adapter, a bidirectional adapter or a circularization adapter.
- the present disclosure has the following advantages:
- the library construction process is more efficient. Compared with other companies' PCR-based targeted enrichment library construction methods, the manual operation time is reduced by 40-50%, and the overall library construction is reduced by 40-50%. 30-40% reduction in time.
- Fig. 1 is the process of using different MoCODE to construct the library of the disclosed method
- Figure 2 is a schematic diagram of the upstream and downstream primer structures of the disclosed multiplex PCR
- FIG. 3 is a schematic structural diagram of the upstream and downstream joints of the present disclosure.
- 4A is a schematic diagram of the MoCODE (not identical) double-stranded structure at both ends of the PCR product in Example 3 of the disclosure;
- 4B is a schematic diagram of the double-stranded structure of the upstream linker in Example 3 of the disclosure.
- 4C is a schematic diagram of the double-stranded structure of the downstream linker in Example 3 of the disclosure.
- 5A is a schematic diagram of the MoCODE (identical) double-stranded structure at both ends of the PCR product in Example 4 of the disclosure;
- 5B is a schematic diagram of the double-stranded structure of the upstream linker in Example 4 of the disclosure.
- 5C is a schematic diagram of the double-stranded structure of the downstream linker in Example 4 of the disclosure.
- 6A is a schematic diagram of the primers used in generating MoCODE barcodes by utilizing the MoCODE generating sequence contained in the amplification target segment itself;
- 6B is a schematic diagram of the target fragment of PCR amplification that itself contains MoCODE generation sequence when generating MoCODE barcode by using the MoCODE generation sequence contained in the amplification target segment itself;
- 6C is a schematic diagram of a PCR product that generates a MoCODE barcode when the MoCODE generation sequence contained in the amplification target segment itself is used to generate a MoCODE barcode for the disclosure;
- FIG. 8 is the result of agarose gel electrophoresis of the products connected by sequencing adapters in Example 2 of the present disclosure.
- sample including a sample or culture (eg, a microbial culture) comprising nucleic acid
- the sample may include a sample of synthetic origin.
- Biological samples include whole blood, serum, plasma, umbilical cord blood, chorionic villus, amniotic fluid, cerebrospinal fluid, spinal fluid, lavage fluid (eg, bronchoalveolar, gastric, peritoneal, catheter, ear, arthroscopic lotion), biopsy samples, urine, feces, sputum, saliva, nasal mucus, prostatic fluid, semen, lymph, bile, tears, sweat, breast milk, breast fluid, embryonic and fetal cells.
- the biological sample is blood, and more preferably plasma.
- blood as used herein includes whole blood or any blood fraction, such as serum and plasma as conventionally defined.
- Blood plasma refers to the whole blood fraction produced by centrifugation of anticoagulant-treated blood.
- Blood serum refers to the watery portion of the fluid that remains after a blood sample has clotted.
- Environmental samples include environmental materials such as surface materials, soil, water, and industrial samples, as well as samples obtained from food and dairy processing units, instruments, equipment, utensils, disposable and non-disposable items. These examples should not be construed as limiting the types of samples applicable to the present invention.
- target target nucleic acid
- gene of interest refers to any molecule whose presence is to be detected or measured, or whose function, interaction or property is to be studied.
- nucleic acid and “nucleic acid molecule” are used interchangeably throughout this disclosure.
- the terms refer to oligonucleotides, oligomers, polynucleotides, deoxyribonucleotides (DNA), genomic DNA, mitochondrial DNA (mtDNA), complementary DNA (cDNA), bacterial DNA, viral DNA, viral RNA , RNA, messenger RNA (mRNA), transfer RNA (tRNA), ribosomal RNA (rRNA), siRNA, catalytic RNA, clone, plasmid, M13, P1, cosmid, bacterial artificial chromosome (BAC), yeast artificial chromosome ( YAC), amplified nucleic acids, amplicons, PCR products, and other types of amplified nucleic acids, RNA/DNA hybrids, and polyamide nucleic acids (PNA), all of which may be in single- or double-stranded form, and unless otherwise Without limitation, known analogs of natural nucleotides, and combinations and//
- nucleotide refers to naturally occurring and modified/non-naturally occurring nucleotides, including tri-, di- and monophosphate nucleosides, as well as monophosphate monophosphates present within polynucleic acids or oligonucleotides body. Nucleotides can also be ribose; 2'-deoxy; 2',3'-deoxy and numerous other nucleotide mimetics well known in the art.
- Mimics include chain terminating nucleotides, such as 3'-O-methyl, halobases, or sugar substitutions; alternative sugar structures, including non-sugar, alkyl ring structures; alternative bases, including inosine; deaza modifications chi and psi, linker-modified; mass marker-modified; phosphodiester modifications or substitutions, including phosphorothioates, methylphosphonates, boranophosphates, amides, esters, ethers; and basic Or complete internucleotide substitutions, including cleavage linkages, such as photocleavable nitrophenyl moieties.
- nucleotides such as 3'-O-methyl, halobases, or sugar substitutions
- alternative sugar structures including non-sugar, alkyl ring structures
- alternative bases including inosine
- deaza modifications chi and psi linker-modified
- mass marker-modified mass marker-modified
- phosphodiester modifications or substitutions
- amplification reaction refers to any in vitro means for amplifying copies of a target nucleic acid sequence.
- Amplification refers to the step of subjecting a solution to conditions sufficient to allow amplification.
- Components of an amplification reaction can include, but are not limited to, for example, primers, polynucleotide templates, polymerases, nucleotides, dNTPs, and the like.
- the term “amplification” generally refers to an "exponential" increase in a target nucleic acid. However, “amplification” as used herein may also refer to a linear increase in the number of selected target nucleic acid sequences, but is different from a one-time, single primer extension step.
- PCR polymerase chain reaction
- oligonucleotide refers to a linear oligomer of natural or modified nucleoside monomers linked by phosphodiester bonds or analogs thereof. Oligonucleotides include deoxyribonucleosides, ribonucleosides, anomeric forms thereof, peptide nucleic acids (PNA), and the like, which are capable of specifically binding a target nucleic acid. Typically, monomers are linked by phosphodiester bonds or analogs thereof to form oligonucleotides ranging in size from a few monomer units (eg, 3-4) to tens of monomer units (eg 40-60).
- oligonucleotide is represented by a sequence of letters (such as "ATGCCTG"), it should be understood that, unless otherwise indicated, the nucleotides are in 5'-3' order from left to right, and "A” refers to deoxyadenosine Glycosides, “C” refers to deoxycytidine, “G” refers to deoxyguanosine, “T” refers to deoxythymidine, and “U” refers to ribonucleoside, uridine.
- oligonucleotides contain the four natural deoxynucleotides; however, they may also contain ribonucleosides or non-natural nucleotide analogs.
- oligonucleotide or polynucleotide substrate requirements for activity eg single-stranded DNA, RNA/DNA duplexes, etc.
- appropriate composition for the oligonucleotide or polynucleotide substrate is entirely within the knowledge of the ordinary skilled person.
- oligonucleotide primer refers to a polynucleotide sequence that hybridizes to a sequence on a target nucleic acid template and facilitates detection by an oligonucleotide probe.
- oligonucleotide primers serve as starting points for nucleic acid synthesis.
- oligonucleotide primers can be used to create structures that can be cleaved by cleavage reagents.
- Primers can be of various lengths, and are typically less than 50 nucleotides in length. The lengths and sequences of primers used in PCR can be designed based on principles known to those skilled in the art.
- mismatched nucleotide or “mismatch” refers to a nucleotide that is not complementary to the target sequence at the one or more positions. Oligonucleotide probes can have at least one mismatch, but can also have 2, 3, 4, 5, 6, or 7 or more mismatched nucleotides.
- telomere binding refers to the recognition, contact and stable complexing between the two molecules formation of a molecule, and greatly reduced recognition, contact, or complex formation of that molecule with other molecules.
- annealing refers to the formation of a stable complex between two molecules.
- cleavage reagent refers to any tool, including but not limited to enzymes, capable of cleaving an oligonucleotide to produce fragments.
- the cleavage reagent may be used only to cleave, degrade, or otherwise isolate the second portion of the oligonucleotide probe, or a fragment thereof.
- the cleavage reagent can be an enzyme.
- Cleavage reagents can be natural, synthetic, unmodified or modified.
- the cleavage reagent is preferably an enzyme having both synthetic (or polymerization) activity and nuclease activity.
- Such enzymes are typically nucleic acid amplification enzymes.
- nucleic acid amplification enzymes are nucleic acid polymerases such as Thermus aquaticus (Taq), DNA polymerases or E. coli DNA polymerase I.
- the enzymes may be naturally occurring, unmodified or modified.
- nucleic acid polymerase refers to an enzyme that catalyzes the incorporation of nucleotides into nucleic acids.
- exemplary nucleic acid polymerases include DNA polymerases, RNA polymerases, terminal transferases, reverse transcriptases, telomerases, and the like.
- thermostable DNA polymerase refers to a DNA polymerase that is stable (ie, resistant to decomposition or denaturation) and retains sufficient catalytic activity when subjected to elevated temperatures for a selected period of time.
- thermostable DNA polymerases retain sufficient activity to effect subsequent primer extension reactions when subjected to high temperatures for the time necessary to denature double-stranded nucleic acids.
- the heating conditions necessary for nucleic acid denaturation are well known in the art and are exemplified in US Pat. Nos. 4,683,202 and 4,683,195.
- Thermostable polymerases as used herein are generally suitable for use in temperature cycling reactions such as the polymerase chain reaction ("PCR").
- thermostable nucleic acid polymerases examples include Thermus aquaticus Taq DNA polymerase, Thermus sp. Z05 polymerase, Thermus flavus polymerase, Thermotoga maritima polymerase, Such as TMA-25 and TMA-30 polymerase, Tth DNA polymerase, etc.
- modified polymerase refers to a polymerase in which at least one monomer differs from a reference sequence, such as a native or wild-type form of the polymerase or another modified form of the polymerase. Exemplary modifications include monomeric insertions, deletions and substitutions. Modified polymerases also include chimeric polymerases having identifiable component sequences (eg, structural or functional domains, etc.) derived from two or more parents. Also included in the definition of modified polymerase are those chemically modified polymerases that contain the reference sequence.
- modified polymerases include G46E E678G CS5 DNA polymerase, G46EL329A E678G CS5 DNA polymerase, G46E L329A D640G S671F CS5 DNA polymerase, G46E L329AD640G S671F E678G CS5 DNA polymerase, G46E E678G DNA polymerase, CS06 ⁇ Z05 polymerase, ⁇ Z05-Gold polymerase, ⁇ Z05R polymerase, E615G Taq DNA polymerase, E678G TMA-25 polymerase, E678G TMA-30 polymerase, etc.
- 5' to 3' nuclease activity or "5'-3' nuclease activity” refers to the activity of a nucleic acid polymerase, typically associated with nucleic acid strand synthesis, whereby nucleotides are removed from the 5' end of a nucleic acid strand , for example, E. coli DNA polymerase I has this activity, but the Klenow fragment does not.
- Some enzymes with 5' to 3' nuclease activity are 5' to 3' exonucleases. Examples of such 5' to 3' exonuclease include: exonuclease from B.
- subtilis phosphodiesterase from spleen, lambda exonuclease, exonuclease from yeast Enzyme II, exonuclease V from yeast and exonuclease from Neurospora crassa.
- MoCODE barcode “Molecular Code” and “specific molecular barcode” used in the present disclosure refer to the two sticky ends that constitute the obtained PCR product after digestion of multiple PCR products with specific endonucleases the overhanging single-stranded sequence.
- MoCODE barcode decoding sequence or “molecular barcode decoding sequence” used in the present disclosure refers to the nucleotide sequence complementary to the “MoCODE barcode", “Molecular Code” and “specific molecular barcode”.
- the MoCODE barcodes of each pair of amplification primers can be different or the same.
- MoCODE barcodes can be from 2nt-20nt in length or longer.
- the matching connection between the MoCODE barcode and the connector is a sticky end connection. Compared with the current TA connection or blunt end connection for library building, this method can improve the connection efficiency and the final detection sensitivity.
- Amplification Gene-specific and universal amplification, and the introduction of MoCODE barcodes can be implemented in the same PCR reaction, shortening the operation steps and manual operation time, avoiding cross-contamination during library construction, reducing costs, and improving clinical practicability.
- MoCODE barcodes can be used with UMI to further improve the mutation detection accuracy of targeted sequencing through error correction.
- a method for constructing a multiplex PCR library for high-throughput targeted sequencing of the present disclosure includes adding MoCODE barcodes to specific amplification products, and using matching sequencing adapters containing MoCODE barcode decoding sequences for efficient ligation and construction library.
- the sample source of the specific amplification product includes, but is not limited to, genomic DNA, cell-free DNA, cell-free, cDNA generated by reverse transcription of an RNA sample, and the like.
- the template DNA of the multiplex PCR reaction can be DNA, bisulfite-converted DNA, cDNA, and the like.
- the extraction method of the template DNA of the multiplex PCR reaction may be column extraction, magnetic bead method, phenol-chloroform extraction-ethanol or isopropanol precipitation, and the like.
- the primers participating in the multiplex PCR reaction comprise a specific MoCODE barcode generating sequence, preferably, the primers also comprise gene-specific sequences;
- the MoCODE barcode generation method includes: modified nucleotides (dUTP, dITP, RNA Base), nicking enzymes (Nicking enzymes), endonucleases, chemical modifications, photolytic bases Base et al. The purpose is to make a recognizable cleavage site at the end of the PCR product, and then cut out the sticky end containing the MoCODE barcode.
- the MoCODE barcode is generated in a manner that, in the primers of the multiplex PCR reaction, in addition to a gene-specific sequence, the 5' end may also include a specific nucleic acid that is common among primers
- the recognition site of the nuclease followed by digestion of the purified PCR product with specific endonucleases (one or two).
- the enzymatically digested PCR product will contain two sticky ends.
- the protruding single-stranded sequence of each sticky end forms a specific molecular barcode, namely the Molecular CODE (MoCODE) barcode.
- the primer sequence comprises the sequence shown in Seq ID No: 1-22, 27-52, 53, 55, 57-104, 109, 111, wherein n represents the nucleotide dITP or dUTP.
- the MoCODE barcode is generated in a manner that, in each primer of the multiplex PCR reaction, in addition to a gene-specific sequence, it also includes a dITP site, which is a site that is After specific enzyme digestion and recognition, a sticky end of 6 bases can be formed, that is, the MoCODE barcode sequence is generated.
- the MoCODE barcodes may or may not be identical within the molecule, eg, "identical” means that the MoCODE barcodes at both ends of the same PCR product molecule are recognized by an endonuclease After cleavage is formed, the “different” means that the MoCODE barcodes at both ends of the same PCR product molecule are recognized by two different endonucleases and formed after cleavage.
- a MoCODE barcode is contained within the same nucleotide molecule, eg, the same MoCODE barcode generated at the 5' and 3' sticky ends of a PCR product molecule.
- two MoCODE barcodes are contained within the same nucleotide molecule, eg, the MoCODE barcodes generated at the 5' and 3' sticky ends of a PCR product molecule are different.
- the MoCODE barcode is a non-random specific barcode.
- the MoCODE barcode is 2-20 nt in length.
- the MoCODE barcode sequence comprises the sequences shown in Seq ID Nos: 53, 59, 109, 111.
- the MoCODE barcode decoding sequence and the MoCODE barcode sequence are complementary sequences, with a length of 2-20 nt.
- the MoCODE barcode decoding sequence comprises the sequences shown in Seq ID Nos: 54, 56, 110, 112.
- the sequencing adapter comprising the MoCODE barcode decoding sequence may be artificially designed and synthesized, or may match the sequence of the target segment itself.
- the sequencing adapter comprising the MoCODE barcode decoding sequence can be matched with the sequence of the target segment itself.
- the primer at the 5' end of the PCR does not need to have the MoCODE generating sequence; if the target segment of amplification itself contains MoCODE and will be used to generate the MoCODE barcode at the 3' end, then the PCR The 3' primers do not need to carry the MoCODE generating sequence ( Figure 6A).
- the sequencing adapters comprise sequences shown in Seq ID Nos: 23-26, 105-108, wherein "nnnnnnn", [i5] or [i7] represent an index tag, such as an 8nt Illumina Index tag sequence.
- index tag such as an 8nt Illumina Index tag sequence.
- the 5' end for sticky linking can be phosphorylated as is known in the art.
- the "n” or “I” at position 5 in the primer sequence Seq ID No: 57-104 is "dITP”.
- the PCR-amplified fragments of interest may contain one or two self-MoCODE generating sequences within themselves (FIG. 6B).
- the own MoCODE generating sequence can be used to generate MoCODE barcodes on one or both ends of the DNA molecule.
- the corresponding MoDODE barcodes can be generated at one or both ends of the PCR product through endonuclease digestion corresponding to the self-generated MoCODE sequence (Fig. 6C).
- the sequencing adapters comprising MoCODE barcode decoding sequences can be single adapters, bidirectional adapters, and each specific segment enrichment can be decoded by single adapter decoding, double adapter decoding, or automatic circularization decoding.
- the use of the "single linker” occurs when the MoCODE barcodes at both ends of the PCR product are “identical”; the use of the “bidirectional linker” occurs when the barcodes at both ends of the PCR product are "different”, it is understandable that, When different adapters are used, the adapters on both sides of the non-specific product are the same, which cannot form the correct test product, which is eliminated in the sequencing process.
- the "circularization” can use a variety of different MoCODE barcodes, and the structure is MoCODE + common sequence bound by sequencing primer + gene specific sequence.
- the circularization decoding steps are: PCR, digestion, circularization (circularization), exonuclease digestion (exonuclease digestion), add-on PCR (adding a complete sequencing primer binding point + library index + sequence adapter), which can be used for Multiple amplicons are formed.
- the sequencing adaptor comprising a MoCODE barcode decoding sequence comprises an upstream sequencing adaptor and a downstream sequencing adaptor comprising a MoCODE complementary to the MoCODE barcode at the 5' end of the digested PCR product A barcode decoding sequence, the downstream sequencing adapter comprises a MoCODE barcode decoding sequence complementary to the MoCODE barcode at the 3' end of the digested PCR product.
- the upstream sequencing adapter and the downstream sequencing adapter further comprise an adapter upper strand and an adapter lower strand, respectively, the adapter upper strand is a sense strand, and the adapter lower strand is an antisense strand.
- the MoCODE barcode decoding sequence may be located at the 3' end of the upper chain of the adapter of the upstream sequencing adapter or the 5' end of the lower chain of the adapter of the upstream sequencing adapter, or may be located at the upper chain of the adapter of the downstream sequencing adapter.
- the 5' end or the 3' end of the linker lower strand located at the downstream sequencing linker ( Figure 3).
- multiplex amplification of 2-1000 target segments can be achieved, and each target segment can have its own specific barcode, or multiple target segments can share the same barcode.
- the MoCODE barcodes are non-random specific barcodes that can also be used for multi-purpose segment cancatmerization.
- the DNA polymerase used in the multiplex PCR can be Taq polymerase, PFx, KOD, Pfu, Q5, Bst, Phusion and other commercialized enzymes.
- the ligase used in the multiplex PCR can be T4 DNA ligase, 9 NTM DNA ligase, Taq DNA ligase, Tth DNA ligase, TfiDNA ligase, AmpligaseR, and the like.
- the excess removal of the sequencing adapter can be performed by magnetic bead method, column extraction method, ethanol precipitation method, agarose or polyacrylamide gel recovery method, and the like.
- the constructed library is suitable for high-throughput sequencing platforms such as Illumina, Roche, ThermoFisher, Pacific Biosciences, BGI, Oxford Nanopore Technologies, Huayinkang, and Hanhai Gene.
- the method for constructing a multiplex PCR library for high-throughput targeted sequencing includes the following steps (an exemplary library construction process is shown in Figure 1 ):
- Step 1 Prepare the sample to be tested to extract DNA, if it is a methylation sequencing library construction, then bisulfite conversion is required;
- Step 2 Using the DNA sample processed in Step 1 as a template, perform multiple PCR reactions with high-fidelity PCR enzymes and multiple pairs of primers (Figure 2). At its 5' end, it contains a specific molecular barcode generation sequence common among primers.
- Step 3 Purify the PCR product of Step 2 with magnetic beads
- Step 4 The purified product of Step 3 is digested with a specific endonuclease.
- the 3' and 5' ends of correctly amplified multiplex PCR products should contain a specific barcode generation site, which, when digested with specific endonucleases, will form sticky ends, i.e. generate MoCODE barcode sequences, which are used in the mediation step Five connections.
- barcode generation site which, when digested with specific endonucleases, will form sticky ends, i.e. generate MoCODE barcode sequences, which are used in the mediation step Five connections.
- barcode generation site There are many ways to generate barcodes, including: modified nucleotides, dUTP, dITP, RNA Base, nickase, endonuclease, chemical modification, photodegradable base, etc.;
- Step 5 Perform magnetic bead purification on the enzymatic digestion product in Step 4;
- Step 6 The purified enzymatic digestion product obtained in Step 5 is introduced into the upstream sequencing adapter and the downstream sequencing adapter by using a ligase that can catalyze the ligation between the sticky ends.
- the introduced upstream sequencing adapter contains a high-throughput sequencing universal sequence (which can include an index tag sequence) and a MoCODE barcode decoding sequence that can be complementary to the MOCODE at the 5' end of the digested PCR product obtained in step 4.
- the introduced downstream sequencing adapter contains high-throughput sequencing universal sequence (including index tag sequence) and MoCODE barcode decoding sequence that can be complementary to the MOCODE at the 3' end of the digested PCR product obtained in step 4 ( Figure 3);
- Step 7 Purify the ligated product in step 6 with magnetic beads and complete the construction of the sequencing library.
- Example 1 Targeted methylation multiplex PCR enrichment using MoCODE to eliminate non-specific PCR products
- each pair of BSP primers in the experimental group included a specific molecular (MoCODE) barcode generation sequence common between primers at its 5' end; in the control group, each pair of BSP primers only contained gene-specific sequences, It does not contain a specific molecule (MoCODE) barcode generating sequence at its 5' end.
- MoCODE specific molecular
- the two MoCODE barcode sequences were generated via digestion of the PCR product with two restriction enzymes. Then the enrichment effect of the two groups of products was observed by agarose gel electrophoresis.
- Hela cell genomic DNA (NEB Company, USA) was converted into bisulfite with EZ DNA Methylation-Gold Kit (ZYMO Company, USA).
- Step 1 94°C, 2 minutes.
- Second step 6 cycles (98°C, 10 seconds; 59°C, 5 seconds; 68°C, 5 seconds).
- Step 3 35 cycles (98°C, 10 seconds; 68°C, 10 seconds).
- Step 4 68°C, 1 minute.
- Step 5 Keep at 8°C.
- the reaction mixture was purified using HiPrep PCR magnetic beads (1.2x) and eluted in 15 ⁇ l of water.
- the universal specific molecular barcode generation sequences of upstream primers and downstream primers are Seq ID Nos: 1 and 12, respectively, the upstream primer sequences of Moko1-10 are Seq ID Nos: 2-11, and the downstream primer sequences of Moko1-10 are Seq ID No. 2-11 respectively. ID Nos: 13-22.
- sequencing adapter ligation was performed on the PCR products purified by the restriction endonuclease treatment of the experimental group in Example 1. The effect of sequencing adapter ligation was then observed via agarose gel electrophoresis.
- Annealing program 82°C, 2 minutes; 570x ⁇ 82°C, 3 seconds, -0.1°C/cycle ⁇ ; 4°C hold.
- T4 DNA Ligase Buffer 10x T4 DNA Ligase Buffer (NEB) 2 ⁇ l Purified digested PCR product 15 ⁇ l Upstream adapter (10 ⁇ M) 1 ⁇ l Downstream adapter (10 ⁇ M) 1 ⁇ l T4 DNA ligase (NEB, 200U/ ⁇ l) 1 ⁇ l total capacity 20 ⁇ l
- Hela cell genomic DNA (NEB Company, USA) was converted into bisulfite with EZ DNA Methylation-Gold Kit (ZYMO Company, USA).
- Step 1 94°C, 2 minutes.
- Second step 6 cycles (98°C, 10 seconds; 59°C, 5 seconds; 68°C, 5 seconds).
- Step 3 35 cycles (98°C, 10 seconds; 68°C, 10 seconds).
- Step 4 68°C, 1 minute.
- Step 5 Keep at 8°C.
- the reaction mixture was purified using HiPrep PCR magnetic beads (1.2x) and eluted in 15 ⁇ l of water.
- Annealing program 82°C, 2 minutes; 570x ⁇ 82°C, 3 seconds, -0.1°C/cycle ⁇ ; 4°C hold.
- T4 DNA Ligase Buffer 10x T4 DNA Ligase Buffer (NEB) 2 ⁇ l Purified digested PCR product 15 ⁇ l Upstream adapter (10 ⁇ M) 1 ⁇ l Downstream adapter (10 ⁇ M) 1 ⁇ l T4 DNA ligase (NEB, 200U/ ⁇ l) 1 ⁇ l total capacity 20 ⁇ l
- the ligation mixture was purified using HiPrep PCR magnetic beads (1x) and eluted in 10 ⁇ l of water.
- the concentration of the 1:10,000 dilution was determined with the Kapa library quantification kit.
- the concentration of the library was adjusted to 4 nM with water.
- Illumina paired-end sequencing raw .fastq files were assembled into complete tested segments by PEAR software. The sequencing result after each assembly is compared with the target segment sequence, and the sequence that meets the expected read length generated by the correct paired primers is identified as on-target, and the on-target rate is the number of on-target sequences in the total reads. The proportion of the number taken.
- the upstream primer sequences of Moko11-23 are Seq ID No: 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51
- the downstream primer sequences of Moko11-23 are Seq ID No: 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52.
- MoCODE barcode sequence (5'>3') MoCODE barcode decoding sequence (5'>3') upstream connector TGTA (Seq ID No: 53) TACA (Seq ID No: 54) downstream adapter GAT (Seq ID No: 55) ATC (Seq ID No: 56)
- TCT/LCT Thin-Cytologic Test/Liquid-based cytologic test
- the first step 94°C, 2 minutes;
- Step 2 6 cycles (98°C, 10 seconds; 59°C, 5 seconds; 68°C, 5 seconds);
- the third step 35 cycles (98°C, 10 seconds; 64°C, 5 seconds; 68°C, 5 seconds);
- the fourth step 68 °C, 1 minute
- Step 5 Keep at 8°C.
- reaction mixture was purified using AMPure XP magnetic beads (1.5x) and eluted in 13 ⁇ l water.
- Annealing program 82°C, 2 minutes; 570x ⁇ 82°C, 3 seconds, -0.1°C/cycle ⁇ ; 4°C hold.
- the ligation mixture was purified using AMPure XP magnetic beads (1.2x) and eluted in 10 ⁇ l of water.
- Illumina paired-end sequencing raw .fastq files were assembled into complete tested segments by PEAR software. The sequencing result after each assembly is compared with the target segment sequence, and the sequence with the expected read length generated by the correctly paired primers is identified as on-target, and the on-target rate is the number of on-target sequences in the total read reads. The proportion of the number taken.
- Sample 1 Sample 2 total reads 1225399 1143004 hit rate 98.0% 98.2%
- the underlined sequence fragment is the specific target gene sequence
- MoCODE barcode sequence (5'>3') MoCODE barcode decoding sequence 5'>3') upstream connector CACAT (Seq ID No: 109) ATGTG (Seq ID No: 110) downstream adapter CGGAA (Seq ID No: 111) TTCCG (Seq ID No: 112)
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Physics & Mathematics (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Crystallography & Structural Chemistry (AREA)
- Plant Pathology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Immunology (AREA)
- Analytical Chemistry (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Description
组分 | 体积 |
无核酸酶水 | 21.5μl |
2倍KOD-Multi Epi PCR预混液(TOYOBO) | 25μl |
引物混合液(10μM) | 1.5μl |
亚硫酸盐处理过的Hela细胞基因组DNA | 1μl(50ng) |
KOD-Multi&Ep(TOYOBO) | 1μl |
总体积 | 50μl |
组分 | 体积 |
10倍Cutsmart缓冲液(NEB) | 2μl |
BbvI(NEB,2U/μl) | 1μl |
EarI(NEB,20U/μl) | 0.5μl |
纯化PCR产物 | 5μl 50ng |
无核酸酶水 | 11.5μl |
总体积 | 20μl |
组分 | 容量 |
10倍T4DNA连接酶缓冲液(NEB) | 2μl |
纯化的酶切PCR产物 | 15μl |
上游接头(10μM) | 1μl |
下游接头(10μM) | 1μl |
T4DNA连接酶(NEB,200U/μl) | 1μl |
总体积 | 20μl |
组分 | 体积 |
无核酸酶水 | 21.5μl |
2倍KOD-Multi Epi PCR预混液(TOYOBO) | 25μl |
引物混合液(10μM) | 1.5μl |
亚硫酸盐处理过的Hela细胞基因组DNA | 1μl(50ng) |
KOD-Multi&Ep(TOYOBO) | 1μl |
总体积 | 50μl |
组分 | 体积 |
10倍Cutsmart缓冲液(NEB) | 2μl |
BbvI(NEB,2U/μl) | 1μl |
EarI(NEB,20U/μl) | 0.5μl |
纯化PCR产物 | 5μl 50ng |
无核酸酶水 | 11.5μl |
总体积 | 20μl |
组分 | 容量 |
10倍T4DNA连接酶缓冲液(NEB) | 2μl |
纯化的酶切PCR产物 | 15μl |
上游接头(10μM) | 1μl |
下游接头(10μM) | 1μl |
T4DNA连接酶(NEB,200U/μl) | 1μl |
总体积 | 20μl |
MoCODE条码序列(5’>3’) | MoCODE条码解码序列(5’>3’) | |
上游接头 | TGTA(Seq ID No:53) | TACA(Seq ID No:54) |
下游接头 | GAT(Seq ID No:55) | ATC(Seq ID No:56) |
组分 | 体积 |
无核酸酶水 | 17.5μl |
2倍KOD-Multi Epi PCR预混液(TOYOBO) | 25μl |
引物混合液(10μM) | 1.5μl |
亚硫酸盐处理过的基因组DNA | 5μl(50ng) |
KOD-Multi&Ep(TOYOBO) | 1μl |
总体积 | 50μl |
组分 | 体积 |
10倍缓冲液4(NEB) | 2μl |
Endonuclease V(NEB,10U/μl) | 1μl |
纯化PCR产物 | 5μl(100ng) |
无核酸酶水 | 12μl |
总体积 | 20μl |
组分 | 容量 |
10倍T4DNA连接酶缓冲液(NEB) | 2μl |
纯化的酶切PCR产物 | 13μl |
上游接头(10μM) | 2μl |
下游接头(10μM) | 2μl |
T4DNA连接酶(NEB,200U/μl) | 1μl |
总体积 | 20μl |
样品1 | 样品2 | |
总读取数 | 1225399 | 1143004 |
中靶率 | 98.0% | 98.2% |
MoCODE条码序列(5’>3’) | MoCODE条码解码序列5’>3’) | |
上游接头 | CACAT(Seq ID No:109) | ATGTG(Seq ID No:110) |
下游接头 | CGGAA(Seq ID No:111) | TTCCG(Seq ID No:112) |
Claims (10)
- 一种用于高通量靶向测序的多重PCR文库的构建方法,其特征在于,通过对特异性扩增产物加入多碱基MoCODE条码,并利用MoCODE条码使扩增产物与包含MoCODE条码解码序列的测序接头高效连接建库,所述MoCODE条码是指用特异性核酸内切酶消化多重PCR产物后,组成所获得的PCR产物的两个粘性末端的突出的单链核苷酸序列,所述MoCODE条码解码序列为与所述MoCODE条码互补的核苷酸序列。
- 如权利要求1的方法,其中,所述MoCODE条码的生成方式包括:修饰核苷酸、切口酶(Nicking enzyme)、内切酶、化学修饰、可光解碱基等中的一种或多种;优选地,所述修饰核苷酸包括dUTP,dITP,RNA碱基中的一种或多种。
- 如权利要求1或2的方法,其中,所述MoCODE条码在分子内可以是相同的或不相同的。
- 如权利要求1-3任一项的方法,其中,所述MoCODE条码为非随机特异性条码。
- 如权利要求1-4任一项的方法,其中,所述MoCODE条码的长度2-20nt,优选地,所述MoCODE条码解码序列与MoCODE条码序列为互补序列,长度2-20nt。
- 如权利要求1-5任一项的方法,所述测序接头可以是人工设计合成、或与目的区段自身片段序列匹配;优选地,所述测序接头可以为单一接头、双向接头,优选地,每一个特定区段富集可通过单一接头解码、双接头解码或自动环化解码。
- 一种用于高通量靶向测序的多重PCR的引物,其特征在于,所述引物包含MoCODE条码生成序列,优选地,所述引物的序列包含选自Seq ID No:1-22、27-52、53、55、57-104、109、111所示序列。
- 一种用于高通量靶向测序的多重PCR的测序接头,其特征在于,所述测序接头包含MoCODE条码解码序列,优选地,所述测序接头还包含测序平台的测序接头、index标签中的一种或多种,优选地,所述测序接头包含高通量测序通用序列、index标签和所述MoCODE条码解码序列,优选地,所述测序接头的序列包含选自Seq ID No:23-26、54、56、105-108、110、112所示序列。
- 一种用于高通量靶向测序的多重PCR文库构建方法,其特征在于,所述方法包括以下步骤:1)从待检样本中提取DNA;2)进行多重PCR反应,参与多重PCR反应的每条引物包含一段特异的MoCODE条码生成序列,优选地,所述引物还包含基因特异性序列;3)用磁珠法纯化步骤2)所得PCR产物;4)使步骤3)所得纯化PCR产物产生5’和3’粘性末端,并分别在5’和/或3’粘性末端生成MoCODE条码;5)用磁珠法纯化步骤4)的含有MoCODE条码的PCR产物;6)连接步骤5)所得的纯化的含有MoCODE条码的PCR产物和测序接头,所述测序接头含有与MoCODE互补的MoCODE条码解码序列;7)用磁珠纯化步骤6)所得连接产物,完成用于高通量靶向测序的多重PCR文库的构建。
- 如权利要求9的方法,其中,步骤4)中所述MoCODE条码的生成方式包括:修饰核苷酸、切口酶(Nicking enzyme)、内切酶、化学修饰、可光解碱基等中的一种或多种;优选地,所述修饰核苷酸包括dUTP,dITP,RNA碱基中的一种或多种,更优选地,所述MoCODE条码的生成方式为利用特异性核酸内切酶进行酶消化;优选地,步骤4)中所述在5’和3’粘性末端各生成一个MoCODE条码,其中所述5’和3’粘性末端的MoCODE条码可以相同也可以不同;优选地,步骤6)中所述测序接头可以为单一结头、双向接头或环化接头。
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202180088322.4A CN116888276A (zh) | 2020-12-31 | 2021-12-31 | 一种用于高通量靶向测序的多重pcr文库构建方法 |
US18/270,492 US20240076653A1 (en) | 2020-12-31 | 2021-12-31 | Method for constructing multiplex pcr library for high-throughput targeted sequencing |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011628234 | 2020-12-31 | ||
CN202011628234.2 | 2020-12-31 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022144003A1 true WO2022144003A1 (zh) | 2022-07-07 |
Family
ID=82260289
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2021/143948 WO2022144003A1 (zh) | 2020-12-31 | 2021-12-31 | 一种用于高通量靶向测序的多重pcr文库构建方法 |
Country Status (3)
Country | Link |
---|---|
US (1) | US20240076653A1 (zh) |
CN (1) | CN116888276A (zh) |
WO (1) | WO2022144003A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115992243A (zh) * | 2022-11-11 | 2023-04-21 | 深圳凯瑞思医疗科技有限公司 | 一种检测卵巢癌的引物组合、试剂盒及文库构建方法 |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5508169A (en) * | 1990-04-06 | 1996-04-16 | Queen's University At Kingston | Indexing linkers |
US6045994A (en) * | 1991-09-24 | 2000-04-04 | Keygene N.V. | Selective restriction fragment amplification: fingerprinting |
WO2001075154A2 (de) * | 2000-04-03 | 2001-10-11 | Axaron Bioscience Ag | Neue verfahren zur parallelen sequenzierung eines nukleinsäuregemisches an einer oberfläche |
US20030232348A1 (en) * | 2002-06-17 | 2003-12-18 | Affymetrix, Inc. | Complexity management of genomic DNA by locus specific amplification |
US20060024681A1 (en) * | 2003-10-31 | 2006-02-02 | Agencourt Bioscience Corporation | Methods for producing a paired tag from a nucleic acid sequence and methods of use thereof |
CN101374963A (zh) * | 2005-12-22 | 2009-02-25 | 凯津公司 | 用于基于aflp的高通量多态性检测的方法 |
US20090092967A1 (en) * | 2006-06-26 | 2009-04-09 | Epoch Biosciences, Inc. | Method for generating target nucleic acid sequences |
CN102373287A (zh) * | 2011-11-30 | 2012-03-14 | 盛司潼 | 一种检测肺癌易感基因的方法及试剂盒 |
WO2018040961A1 (zh) * | 2016-08-30 | 2018-03-08 | 广州康昕瑞基因健康科技有限公司 | 一种建库方法及snp分型方法 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110734908B (zh) * | 2019-11-15 | 2021-06-08 | 福州福瑞医学检验实验室有限公司 | 高通量测序文库的构建方法以及用于文库构建的试剂盒 |
-
2021
- 2021-12-31 WO PCT/CN2021/143948 patent/WO2022144003A1/zh active Application Filing
- 2021-12-31 US US18/270,492 patent/US20240076653A1/en active Pending
- 2021-12-31 CN CN202180088322.4A patent/CN116888276A/zh active Pending
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5508169A (en) * | 1990-04-06 | 1996-04-16 | Queen's University At Kingston | Indexing linkers |
US6045994A (en) * | 1991-09-24 | 2000-04-04 | Keygene N.V. | Selective restriction fragment amplification: fingerprinting |
WO2001075154A2 (de) * | 2000-04-03 | 2001-10-11 | Axaron Bioscience Ag | Neue verfahren zur parallelen sequenzierung eines nukleinsäuregemisches an einer oberfläche |
US20030232348A1 (en) * | 2002-06-17 | 2003-12-18 | Affymetrix, Inc. | Complexity management of genomic DNA by locus specific amplification |
US20060024681A1 (en) * | 2003-10-31 | 2006-02-02 | Agencourt Bioscience Corporation | Methods for producing a paired tag from a nucleic acid sequence and methods of use thereof |
CN101374963A (zh) * | 2005-12-22 | 2009-02-25 | 凯津公司 | 用于基于aflp的高通量多态性检测的方法 |
US20090092967A1 (en) * | 2006-06-26 | 2009-04-09 | Epoch Biosciences, Inc. | Method for generating target nucleic acid sequences |
CN102373287A (zh) * | 2011-11-30 | 2012-03-14 | 盛司潼 | 一种检测肺癌易感基因的方法及试剂盒 |
WO2018040961A1 (zh) * | 2016-08-30 | 2018-03-08 | 广州康昕瑞基因健康科技有限公司 | 一种建库方法及snp分型方法 |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115992243A (zh) * | 2022-11-11 | 2023-04-21 | 深圳凯瑞思医疗科技有限公司 | 一种检测卵巢癌的引物组合、试剂盒及文库构建方法 |
CN115992243B (zh) * | 2022-11-11 | 2024-01-26 | 深圳凯瑞思医疗科技有限公司 | 一种检测卵巢癌的引物组合、试剂盒及文库构建方法 |
Also Published As
Publication number | Publication date |
---|---|
CN116888276A (zh) | 2023-10-13 |
US20240076653A1 (en) | 2024-03-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11697843B2 (en) | Methods for creating directional bisulfite-converted nucleic acid libraries for next generation sequencing | |
US20220017893A1 (en) | Capture methodologies for circulating cell free dna | |
JP7535611B2 (ja) | ライブラリー調製方法ならびにそのための組成物および使用 | |
CN109511265B (zh) | 通过链鉴定改进测序的方法 | |
US20220364169A1 (en) | Sequencing method for genomic rearrangement detection | |
CN111801427B (zh) | 用于单分子的单链环状dna模板的产生 | |
CN110603326A (zh) | 扩增靶核酸的方法 | |
WO2022144003A1 (zh) | 一种用于高通量靶向测序的多重pcr文库构建方法 | |
EP3601611B1 (en) | Polynucleotide adapters and methods of use thereof | |
US20220348940A1 (en) | Method for introducing mutations | |
CA3223987A1 (en) | Methods, compositions, and kits for preparing sequencing library | |
CN110468179A (zh) | 选择性扩增核酸序列的方法 | |
KR20230124636A (ko) | 멀티플렉스 반응에서 표적 서열의 고 감응성 검출을위한 조성물 및 방법 | |
JP2022546485A (ja) | 腫瘍高精度アッセイのための組成物および方法 | |
US12091715B2 (en) | Methods and compositions for reducing base errors of massive parallel sequencing using triseq sequencing | |
WO2018009677A1 (en) | Fast target enrichment by multiplexed relay pcr with modified bubble primers | |
CN115279918A (zh) | 用于测序的新型核酸模板结构 | |
KR20230028450A (ko) | 앰플리콘 포괄적 풍부화 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21914731 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 202180088322.4 Country of ref document: CN |
|
WWE | Wipo information: entry into national phase |
Ref document number: 18270492 Country of ref document: US |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 10/10/2023) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21914731 Country of ref document: EP Kind code of ref document: A1 |