CN110158157B - Method for synthesizing DNA library with fixed length and specific terminal sequence based on template material - Google Patents
Method for synthesizing DNA library with fixed length and specific terminal sequence based on template material Download PDFInfo
- Publication number
- CN110158157B CN110158157B CN201810150519.6A CN201810150519A CN110158157B CN 110158157 B CN110158157 B CN 110158157B CN 201810150519 A CN201810150519 A CN 201810150519A CN 110158157 B CN110158157 B CN 110158157B
- Authority
- CN
- China
- Prior art keywords
- dna
- linker
- sequence
- ligase
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims abstract description 112
- 239000000463 material Substances 0.000 title claims abstract description 13
- 230000002194 synthesizing effect Effects 0.000 title claims abstract description 13
- 108020004414 DNA Proteins 0.000 claims abstract description 213
- 102000053602 DNA Human genes 0.000 claims abstract description 190
- 230000002441 reversible effect Effects 0.000 claims abstract description 19
- 125000002887 hydroxy group Chemical group [H]O* 0.000 claims abstract description 16
- 108010090804 Streptavidin Proteins 0.000 claims abstract description 11
- 238000007385 chemical modification Methods 0.000 claims abstract description 11
- 239000007790 solid phase Substances 0.000 claims abstract description 8
- 230000008439 repair process Effects 0.000 claims abstract description 6
- YBJHBAHKTGYVGT-ZXFLCMHBSA-N 5-[(3ar,4r,6as)-2-oxo-1,3,3a,4,6,6a-hexahydrothieno[3,4-d]imidazol-4-yl]pentanoic acid Chemical compound N1C(=O)N[C@H]2[C@@H](CCCCC(=O)O)SC[C@H]21 YBJHBAHKTGYVGT-ZXFLCMHBSA-N 0.000 claims abstract description 5
- 238000010369 molecular cloning Methods 0.000 claims abstract description 4
- 239000011324 bead Substances 0.000 claims description 41
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical group [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 claims description 33
- 102000012410 DNA Ligases Human genes 0.000 claims description 23
- 108010061982 DNA Ligases Proteins 0.000 claims description 23
- 239000000872 buffer Substances 0.000 claims description 22
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 20
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 claims description 20
- 239000002773 nucleotide Substances 0.000 claims description 19
- 125000003729 nucleotide group Chemical group 0.000 claims description 19
- 238000011084 recovery Methods 0.000 claims description 19
- 239000011534 wash buffer Substances 0.000 claims description 18
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 claims description 17
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 claims description 17
- 238000004925 denaturation Methods 0.000 claims description 13
- 230000036425 denaturation Effects 0.000 claims description 13
- 238000000746 purification Methods 0.000 claims description 13
- 102000003960 Ligases Human genes 0.000 claims description 12
- 108090000364 Ligases Proteins 0.000 claims description 12
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 claims description 12
- 238000010367 cloning Methods 0.000 claims description 12
- 239000013598 vector Substances 0.000 claims description 12
- 238000005406 washing Methods 0.000 claims description 12
- 108091033409 CRISPR Proteins 0.000 claims description 10
- 239000011780 sodium chloride Substances 0.000 claims description 10
- 101710163270 Nuclease Proteins 0.000 claims description 9
- 229920001213 Polysorbate 20 Polymers 0.000 claims description 9
- 239000012634 fragment Substances 0.000 claims description 9
- 238000009396 hybridization Methods 0.000 claims description 9
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 claims description 9
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 claims description 9
- 230000005526 G1 to G0 transition Effects 0.000 claims description 8
- 239000007983 Tris buffer Substances 0.000 claims description 8
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 claims description 8
- 239000012528 membrane Substances 0.000 claims description 7
- 238000002156 mixing Methods 0.000 claims description 7
- 239000000203 mixture Substances 0.000 claims description 7
- 108091008146 restriction endonucleases Proteins 0.000 claims description 7
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 claims description 6
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 claims description 6
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 claims description 6
- 239000007853 buffer solution Substances 0.000 claims description 6
- 239000011347 resin Substances 0.000 claims description 6
- 229920005989 resin Polymers 0.000 claims description 6
- 239000000741 silica gel Substances 0.000 claims description 6
- 229910002027 silica gel Inorganic materials 0.000 claims description 6
- -1 tri (2-carboxyacetic acid) phosphine hydrochloride Chemical compound 0.000 claims description 6
- 239000004677 Nylon Substances 0.000 claims description 5
- 108020004682 Single-Stranded DNA Proteins 0.000 claims description 5
- 238000000137 annealing Methods 0.000 claims description 5
- 238000003776 cleavage reaction Methods 0.000 claims description 5
- 229920001778 nylon Polymers 0.000 claims description 5
- 108091092236 Chimeric RNA Proteins 0.000 claims description 4
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 claims description 4
- 241000588650 Neisseria meningitidis Species 0.000 claims description 4
- XUIMIQQOPSSXEZ-UHFFFAOYSA-N Silicon Chemical compound [Si] XUIMIQQOPSSXEZ-UHFFFAOYSA-N 0.000 claims description 4
- 241000194020 Streptococcus thermophilus Species 0.000 claims description 4
- 108010006785 Taq Polymerase Proteins 0.000 claims description 4
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 claims description 4
- 239000004202 carbamide Substances 0.000 claims description 4
- 239000003398 denaturant Substances 0.000 claims description 4
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 claims description 4
- 230000007017 scission Effects 0.000 claims description 4
- 238000012360 testing method Methods 0.000 claims description 4
- 108010017826 DNA Polymerase I Proteins 0.000 claims description 3
- 102000004594 DNA Polymerase I Human genes 0.000 claims description 3
- 230000004568 DNA-binding Effects 0.000 claims description 3
- PZBFGYYEXUXCOF-UHFFFAOYSA-N TCEP Chemical compound OC(=O)CCP(CCC(O)=O)CCC(O)=O PZBFGYYEXUXCOF-UHFFFAOYSA-N 0.000 claims description 3
- 238000012411 cloning technique Methods 0.000 claims description 3
- 239000005289 controlled pore glass Substances 0.000 claims description 3
- 239000011521 glass Substances 0.000 claims description 3
- 238000001556 precipitation Methods 0.000 claims description 3
- 229910052710 silicon Inorganic materials 0.000 claims description 3
- 239000010703 silicon Substances 0.000 claims description 3
- 239000000377 silicon dioxide Substances 0.000 claims description 3
- 208000034454 F12-related hereditary angioedema with normal C1Inh Diseases 0.000 claims description 2
- 229910003177 MnII Inorganic materials 0.000 claims description 2
- 241000589886 Treponema Species 0.000 claims description 2
- 240000004922 Vigna radiata Species 0.000 claims description 2
- 235000010721 Vigna radiata var radiata Nutrition 0.000 claims description 2
- 235000011469 Vigna radiata var sublobata Nutrition 0.000 claims description 2
- 239000002253 acid Substances 0.000 claims description 2
- 208000016861 hereditary angioedema type 3 Diseases 0.000 claims description 2
- 238000002493 microarray Methods 0.000 claims description 2
- 239000005373 porous glass Substances 0.000 claims description 2
- 239000011148 porous material Substances 0.000 claims description 2
- 238000010839 reverse transcription Methods 0.000 claims description 2
- 235000010378 sodium ascorbate Nutrition 0.000 claims description 2
- PPASLZSBLFJQEF-RKJRWTFHSA-M sodium ascorbate Substances [Na+].OC[C@@H](O)[C@H]1OC(=O)C(O)=C1[O-] PPASLZSBLFJQEF-RKJRWTFHSA-M 0.000 claims description 2
- 229960005055 sodium ascorbate Drugs 0.000 claims description 2
- PPASLZSBLFJQEF-RXSVEWSESA-M sodium-L-ascorbate Chemical compound [Na+].OC[C@H](O)[C@H]1OC(=O)C(O)=C1[O-] PPASLZSBLFJQEF-RXSVEWSESA-M 0.000 claims description 2
- 108010061979 CEL I nuclease Proteins 0.000 claims 1
- 238000010276 construction Methods 0.000 abstract description 9
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical group OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 abstract description 3
- 150000002972 pentoses Chemical class 0.000 abstract 1
- 238000006243 chemical reaction Methods 0.000 description 27
- 210000001671 embryonic stem cell Anatomy 0.000 description 20
- 239000000499 gel Substances 0.000 description 17
- 230000015572 biosynthetic process Effects 0.000 description 16
- 108010051779 histone H3 trimethyl Lys4 Proteins 0.000 description 16
- 238000003786 synthesis reaction Methods 0.000 description 16
- 108020005004 Guide RNA Proteins 0.000 description 15
- 239000007858 starting material Substances 0.000 description 14
- 239000000047 product Substances 0.000 description 12
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 12
- 108091027544 Subgenomic mRNA Proteins 0.000 description 10
- 108091028043 Nucleic acid sequence Proteins 0.000 description 8
- 238000000246 agarose gel electrophoresis Methods 0.000 description 8
- 238000005516 engineering process Methods 0.000 description 7
- 102000039446 nucleic acids Human genes 0.000 description 7
- 108020004707 nucleic acids Proteins 0.000 description 7
- 150000007523 nucleic acids Chemical class 0.000 description 7
- 238000002360 preparation method Methods 0.000 description 7
- 108090000623 proteins and genes Proteins 0.000 description 7
- 239000006228 supernatant Substances 0.000 description 7
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 6
- 239000003153 chemical reaction reagent Substances 0.000 description 6
- 238000005520 cutting process Methods 0.000 description 6
- 230000004048 modification Effects 0.000 description 6
- 238000012986 modification Methods 0.000 description 6
- 238000012216 screening Methods 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- 229920000936 Agarose Polymers 0.000 description 5
- 241000894006 Bacteria Species 0.000 description 5
- 238000010354 CRISPR gene editing Methods 0.000 description 5
- 108010077544 Chromatin Proteins 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- 210000003483 chromatin Anatomy 0.000 description 5
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 5
- 229960005542 ethidium bromide Drugs 0.000 description 5
- 239000012160 loading buffer Substances 0.000 description 5
- 238000002844 melting Methods 0.000 description 5
- 230000008018 melting Effects 0.000 description 5
- 239000011259 mixed solution Substances 0.000 description 5
- 229920002401 polyacrylamide Polymers 0.000 description 5
- 238000001308 synthesis method Methods 0.000 description 5
- 238000012408 PCR amplification Methods 0.000 description 4
- 230000027455 binding Effects 0.000 description 4
- 239000013599 cloning vector Substances 0.000 description 4
- 238000000749 co-immunoprecipitation Methods 0.000 description 4
- 238000009826 distribution Methods 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 230000001105 regulatory effect Effects 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 108091062157 Cis-regulatory element Proteins 0.000 description 3
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 3
- 108020005198 Long Noncoding RNA Proteins 0.000 description 3
- 241001529936 Murinae Species 0.000 description 3
- YXFVVABEGXRONW-UHFFFAOYSA-N Toluene Chemical compound CC1=CC=CC=C1 YXFVVABEGXRONW-UHFFFAOYSA-N 0.000 description 3
- 101150063416 add gene Proteins 0.000 description 3
- 229960002685 biotin Drugs 0.000 description 3
- 235000020958 biotin Nutrition 0.000 description 3
- 239000011616 biotin Substances 0.000 description 3
- 210000004027 cell Anatomy 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 230000029087 digestion Effects 0.000 description 3
- 238000003505 heat denaturation Methods 0.000 description 3
- 230000026731 phosphorylation Effects 0.000 description 3
- 238000006366 phosphorylation reaction Methods 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- FQVLRGLGWNWPSS-BXBUPLCLSA-N (4r,7s,10s,13s,16r)-16-acetamido-13-(1h-imidazol-5-ylmethyl)-10-methyl-6,9,12,15-tetraoxo-7-propan-2-yl-1,2-dithia-5,8,11,14-tetrazacycloheptadecane-4-carboxamide Chemical compound N1C(=O)[C@@H](NC(C)=O)CSSC[C@@H](C(N)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)NC(=O)[C@@H]1CC1=CN=CN1 FQVLRGLGWNWPSS-BXBUPLCLSA-N 0.000 description 2
- 102100034035 Alcohol dehydrogenase 1A Human genes 0.000 description 2
- 101100118093 Drosophila melanogaster eEF1alpha2 gene Proteins 0.000 description 2
- 101000892220 Geobacillus thermodenitrificans (strain NG80-2) Long-chain-alcohol dehydrogenase 1 Proteins 0.000 description 2
- 101000756632 Homo sapiens Actin, cytoplasmic 1 Proteins 0.000 description 2
- 101000780443 Homo sapiens Alcohol dehydrogenase 1A Proteins 0.000 description 2
- 101000579123 Homo sapiens Phosphoglycerate kinase 1 Proteins 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- KWYHDKDOAIKMQN-UHFFFAOYSA-N N,N,N',N'-tetramethylethylenediamine Chemical compound CN(C)CCN(C)C KWYHDKDOAIKMQN-UHFFFAOYSA-N 0.000 description 2
- KJWZYMMLVHIVSU-IYCNHOCDSA-N PGK1 Chemical compound CCCCC[C@H](O)\C=C\[C@@H]1[C@@H](CCCCCCC(O)=O)C(=O)CC1=O KJWZYMMLVHIVSU-IYCNHOCDSA-N 0.000 description 2
- 102100028251 Phosphoglycerate kinase 1 Human genes 0.000 description 2
- 101710182846 Polyhedrin Proteins 0.000 description 2
- 101100010928 Saccharolobus solfataricus (strain ATCC 35092 / DSM 1617 / JCM 11322 / P2) tuf gene Proteins 0.000 description 2
- 101000916237 Schizosaccharomyces pombe (strain 972 / ATCC 24843) Cullin-3 Proteins 0.000 description 2
- 239000008051 TBE buffer Substances 0.000 description 2
- 101150001810 TEAD1 gene Proteins 0.000 description 2
- 101150074253 TEF1 gene Proteins 0.000 description 2
- 102100029898 Transcriptional enhancer factor TEF-1 Human genes 0.000 description 2
- 101150109071 UBC gene Proteins 0.000 description 2
- ROOXNKNUYICQNP-UHFFFAOYSA-N ammonium persulfate Chemical compound [NH4+].[NH4+].[O-]S(=O)(=O)OOS([O-])(=O)=O ROOXNKNUYICQNP-UHFFFAOYSA-N 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 239000012148 binding buffer Substances 0.000 description 2
- 230000007321 biological mechanism Effects 0.000 description 2
- 101150038500 cas9 gene Proteins 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000001976 enzyme digestion Methods 0.000 description 2
- 238000011049 filling Methods 0.000 description 2
- 102000009543 guanyl-nucleotide exchange factor activity proteins Human genes 0.000 description 2
- 108040001860 guanyl-nucleotide exchange factor activity proteins Proteins 0.000 description 2
- 238000012165 high-throughput sequencing Methods 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 244000052769 pathogen Species 0.000 description 2
- 230000001717 pathogenic effect Effects 0.000 description 2
- 108090000765 processed proteins & peptides Proteins 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 241001515965 unidentified phage Species 0.000 description 2
- PBVAJRFEEOIAGW-UHFFFAOYSA-N 3-[bis(2-carboxyethyl)phosphanyl]propanoic acid;hydrochloride Chemical compound Cl.OC(=O)CCP(CCC(O)=O)CCC(O)=O PBVAJRFEEOIAGW-UHFFFAOYSA-N 0.000 description 1
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- 102000016897 CCCTC-Binding Factor Human genes 0.000 description 1
- 108010014064 CCCTC-Binding Factor Proteins 0.000 description 1
- 108091028732 Concatemer Proteins 0.000 description 1
- 101100329224 Coprinopsis cinerea (strain Okayama-7 / 130 / ATCC MYA-4618 / FGSC 9003) cpf1 gene Proteins 0.000 description 1
- 238000010442 DNA editing Methods 0.000 description 1
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 1
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- 206010059866 Drug resistance Diseases 0.000 description 1
- 101001116388 Homo sapiens Melatonin-related receptor Proteins 0.000 description 1
- 101001094700 Homo sapiens POU domain, class 5, transcription factor 1 Proteins 0.000 description 1
- 108010086093 Mung Bean Nuclease Proteins 0.000 description 1
- 101100481002 Mus musculus Tff1 gene Proteins 0.000 description 1
- 108091092724 Noncoding DNA Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 102100035423 POU domain, class 5, transcription factor 1 Human genes 0.000 description 1
- 101150082761 POU5F1 gene Proteins 0.000 description 1
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 1
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 1
- 238000012228 RNA interference-mediated gene silencing Methods 0.000 description 1
- 108091027967 Small hairpin RNA Proteins 0.000 description 1
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 239000007984 Tris EDTA buffer Substances 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 229910001870 ammonium persulfate Inorganic materials 0.000 description 1
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 1
- 210000001106 artificial yeast chromosome Anatomy 0.000 description 1
- 230000004791 biological behavior Effects 0.000 description 1
- 230000031018 biological processes and functions Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 101150059443 cas12a gene Proteins 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000007248 cellular mechanism Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000015271 coagulation Effects 0.000 description 1
- 238000005345 coagulation Methods 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 238000001816 cooling Methods 0.000 description 1
- 101150113466 cul-3 gene Proteins 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 230000005782 double-strand break Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 238000012869 ethanol precipitation Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000012467 final product Substances 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 238000003198 gene knock in Methods 0.000 description 1
- 238000003209 gene knockout Methods 0.000 description 1
- 230000009368 gene silencing by RNA Effects 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000010448 genetic screening Methods 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 241000411851 herbal medicine Species 0.000 description 1
- 230000003100 immobilizing effect Effects 0.000 description 1
- 230000007124 immune defense Effects 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 230000009456 molecular mechanism Effects 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- ZIUHHBKFKCYYJD-UHFFFAOYSA-N n,n'-methylenebisacrylamide Chemical compound C=CC(=O)NCNC(=O)C=C ZIUHHBKFKCYYJD-UHFFFAOYSA-N 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 238000011027 product recovery Methods 0.000 description 1
- 230000009465 prokaryotic expression Effects 0.000 description 1
- 102000004169 proteins and genes Human genes 0.000 description 1
- 239000011535 reaction buffer Substances 0.000 description 1
- 230000008263 repair mechanism Effects 0.000 description 1
- 238000012827 research and development Methods 0.000 description 1
- 238000010187 selection method Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 239000004055 small Interfering RNA Substances 0.000 description 1
- 239000001632 sodium acetate Substances 0.000 description 1
- 235000017281 sodium acetate Nutrition 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 238000002604 ultrasonography Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1068—Template (nucleic acid) mediated chemical library synthesis, e.g. chemical and enzymatical DNA-templated organic molecule synthesis, libraries prepared by non ribosomal polypeptide synthesis [NRPS], DNA/RNA-polymerase mediated polypeptide synthesis
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B50/00—Methods of creating libraries, e.g. combinatorial synthesis
- C40B50/06—Biochemical methods, e.g. using enzymes or whole viable microorganisms
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- Biotechnology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- General Chemical & Material Sciences (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Crystallography & Structural Chemistry (AREA)
- Plant Pathology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Medicinal Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Health & Medical Sciences (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The invention relates to the field of biology, and particularly provides a method for synthesizing a DNA library with fixed length and specific terminal sequence based on template materials, which comprises the following steps: obtaining fragmented double-stranded DNA; DNA end repair and 3' end tailing; linkage of the 3' Biotin modified a1 linker; purifying a connection product; DNA is fixed on a solid phase with streptavidin; denaturing the double-stranded DNA into single strands; hybridizing the primer 1; extending bases with dNTPs having reversible chemical modification at the 3' hydroxyl of a deoxyribose group; recovering the hydroxyl at the 3' end of the pentose; a fixed length DNA extension; flattening the 5' end; selecting a terminal sequence; removing a terminal sequence; ligation of linkers for molecular cloning. The method can conveniently construct the region fixed length and the specific terminal sequence library which have no reference genome and high sequence variability, and greatly reduces the construction cost of the library.
Description
Technical Field
The invention relates to the field of synthetic biology, in particular to a preparation method of a DNA library with fixed length and specific terminal sequence based on template material synthesis.
Background
A set of molecular biological mechanisms for regulation and control are arranged behind the life process and the biological behavior, and the rule of the molecular mechanisms is the central rule, namely the rule of information transmission among DNA, RNA and protein. In the last 30 years, the means for studying biological mechanisms mainly include forward genetics and reverse genetics, wherein the forward genetics are based on phenomena to search key molecules and mechanisms, and the reverse genetics are based on molecules to disclose mechanisms and search corresponding biological processes by controlling the molecules. For forward genetic research, a large technical means is genetic screening, and initially, aiming at a cell mechanism, unbiased screening is mostly adopted, while with the advent of the RNA interference technology, crispr (clustered regulated Short Palindromic repeat) technology, the screening of a library pool also becomes one of important gene screening modes.
The CRISPR system is a part of the bacteriophage-resistant immunity of bacteria, with which about 40% of bacteria and 90% of archaea resist bacteriophage infection. When the bacteria are attacked by phage or plasmid, a small part of the bacteria can integrate the DNA of the infectious source into the DNA cluster of CRISPR, and when infected again by the same pathogen, transcription of this region can be initiated, and can form a complex with Cas9 protein, clearing the DNA of the pathogen.
In 2013 or so, Zhang Feng et al achieved genomic DNA editing in mammals based on the immune defense system of bacteria. The system mainly comprises 2 components, wherein one component is a guide RNA for identifying genomic DNA according to base complementary pairing, and the other component is an endonuclease Cas9 recruited by the guide RNA, when the components and the genomic DNA form a complex, double-strand breaks of DNA can be caused, and insertion or deletion mutation can be generated at the breaks due to non-homologous end repair mechanisms of mammals, so that the genomic DNA sequence is edited. At the beginning of the development of the technology, gene knockout or knock-in is mainly realized in a cell line or a living body as a tool for manipulating genomic DNA. Due to the simplicity of the guide RNA design, the technology is changed into a powerful genomics research tool by a plurality of groups, for example, the technology is utilized by Zhang Feng to screen out melanoma drug resistance related genes NF1, NF2 and CUL 3. Due to the ease of manipulation of the guide RNA for genes, scientists have also designed Cas9 protein as an activated or suppressed version, conveniently achieving gene activation or suppression. Mammalian cells are primarily cis-regulated compared to lower organisms such as Drosophila. Zhang Feng et al studied the function of the non-coding region DNA around the CUL3 gene using CRISPR technique, and they found that the binding site of transcription factors such as CTCF, YY1 had a regulatory effect on CUL 3. The CRISPR technology of anyone et al studied cis-acting elements around POU5F1 gene in human embryonic stem cells, and they found that cis-acting elements regulating POU5F1 exist inside other genes, while when the system of zhangfeng et al studied lncRNA, it was found that activating lncRNA transcription on genome could produce cellular resistance, but lncRNA itself did not cause resistance, suggesting that multi-layer annotation may be required for genome annotation, not just as one state. Klann TS et al found that different cis-acting elements were required to regulate gene transcription in different cells, suggesting that the genomic DNA annotation of the present invention needs to be performed in different tissue cells. In view of the above, the study of genomic DNA functions using omics is becoming an important field.
WO2015065964 discloses a method for designing sgrnas against a reference genome using bioinformatics tools, and synthesizing a sgRNA library using a pool of primers. The price of the existing primer pool synthesis method is high, and if tens of millions of sgRNAs are synthesized, the cost is very high; meanwhile, the strategy must depend on a reference genome, so that species without the reference sequence genome, such as intestinal microorganisms or unknown species, cannot be subjected to CRISPR library construction; meanwhile, sgrnas designed from reference genomes may not function in individuals without sequencing, having nucleotide sequences that differ between individuals of the same species. The cost is reduced, and meanwhile, the construction of a gRNA library by using natural nucleic acid as a material is a great technical development direction at present.
WO 2017081097A 1 discloses a method for constructing a gRNA library of natural nucleic acid by using a 20bp short fragment library obtained by using a type II S restriction endonuclease, but the method is complex in operation process, and sgRNA diversity is greatly lost due to the fact that a plurality of enzyme cutting sites need to be fixed, so that a simpler and cheaper method is needed for preparing the gRNA library.
Therefore, there is a need for further research and development of new, simpler, efficient methods for the preparation of gRNA libraries.
Disclosure of Invention
Aiming at the current technical situation, the method can convert the naturally-occurring DNA or nucleic acid into the sgRNA library, and the method does not need to design the sgRNA by using the reference sequence, but directly converts the natural DNA into DNA with fixed length, such as 19-24bp, and the fragments with fixed length are derived from DNA materials, so that the condition that the design of the reference sequence is inaccurate does not exist.
The invention provides a sequence selection method of a specific end and is used together with DNA (deoxyribonucleic acid) with a fixed length, so that a sgRNA library can be obtained.
The invention relates to a preparation method of a DNA library with fixed length and specific terminal sequence based on template material synthesis, which comprises the following steps:
1) primer hybridization: taking a single-stranded DNA with a linker A1 immobilized on a stationary phase with streptavidin,
in one embodiment, the linker a1 in step 1) is a peptide having the nucleotide sequence shown in SEQ ID No.1 and/or SEQ ID No.2, or the linker a1 is CMV, EF1a, SV40, PGK1, Ubc, human beta actin, CAG, TRE, UAS, Ac5, Polyhedrin, CaMKIIa, (Gal1,10), TEF1, GDS, ADH1, CaMV35s, Ubi, H1, U6; or a 3' terminal sequence of T7, T7lac, Sp6, araBAD, trp, lac, Ptac, or Pl;
the A1 linker is used to clone the final product into the promoter region of the cloning vector, and thus this sequence is derived from the promoter region sequence of the cloning vector, typically the sequence at the 3 'end of the promoter region, and may extend the sequence at the 5' end of the A1 linker; as one of the embodiments, cloning vector promoter sequences for eukaryotic expression include CMV, EF1a, SV40, PGK1, Ubc, human beta actin, CAG, TRE, UAS, Ac5, Polyhedrin, CaMKIIa, (Gal1,10), TEF1, GDS, ADH1, CaMV35s, Ubi, H1, or U6; as another embodiment, cloning vector promoters for prokaryotic expression include T7, T7lac, Sp6, araBAD, trp, lac, Ptac, pL.
As one embodiment, it is preferred that the linker A1 in step 1) is a peptide having the nucleotide sequence shown in SEQ ID NO.1 and/or SEQ ID NO. 2;
as one embodiment, the primer 1 in step 1) is SEQ ID NO: 11;
wherein, the primer 1and the primer 2 are respectively complementary sequences to A1-R and A1-F;
as one embodiment, the gradient annealing in the step 1) is 85 ℃ for 30s and 65 ℃ for 1min, and the temperature is slowly reduced to 40 ℃ at 0.1 ℃ per second and is 5min at 40 ℃;
as one embodiment, the DNA hybridization buffer solution in the step 1) is a5 XSSC buffer, which is a mixed solution of (0.75M sodium chloride, 0.075M sodium acetate, pH 7.0)/0.1% Tween 20.
2) Base elongation: then adding DNA polymerase and dNTPs with reversible chemical modification at the 3 'end to prolong the base, and then recovering the hydroxyl at the 3' end;
as one embodiment, the DNA polymerase in step 2) is 9 ° N (D141A/E143A/a485L) DNA polymerase;
as one embodiment, the dNTPs with reversible chemical modification in step 2) are 3' -O-azidomethyl-dNTPs, 3' -O-allyl-dNTPs or 3' -O-allyloxycarbonyn-dNTPs;
as one embodiment, the step 2) is performed to restore the 3' terminal hydroxyl group using tris (2-carboxyacetic acid) phosphine hydrochloride, TECP or DTT;
as one embodiment, the step 2) further comprises adding extension buffer solution when extending the base, incubating at 65 degrees for 2min, washing the magnetic beads with washing buffer solution 1 for 2 times, then incubating at 65 degrees with 3' hydroxyl-terminated recovery buffer solution for 1min, washing with washing buffer solution 2 for 2 times, and repeating the above processes if necessary;
as one embodiment, the extension buffer in the step 2) is a mixed solution of 1. mu.M 3' -O-azidomethyl-dNTPs mix, 0.015. mu.g/ml 9 ℃ N DNA polymerase,50mM Tris pH 9.0,50mM NaCl,6mM MgSO4,1mM EDTA and 0.05% Tween 20;
as one embodiment, the washing buffer 1 in the step 2) is a mixed solution of 50mM Tris pH 9.0,50mM NaCl,1mM EDTA and 0.05% Tween 20;
as one embodiment, the 3' -terminal hydroxyl group-recovering buffer in the step 2) is a mixed solution of 100mM TCEP (Tris- (2-Carboxyethyl) phosphine, Tris (2-Carboxyethyl) phosphine hydrochloride), 100mM Tris pH 9.0,100mM NaCl,50mM sodium ascorbate, and 0.05% Tween 20;
as one embodiment, the washing buffer 2 in the step 2) is a mixed solution of 100mM Tris pH 9.0,100mM NaCl, and 0.05% Tween 20;
3) fixed length DNA extension: repeating the cycle of the step 2) for 15-100 times, preferably 20-30 pb, and further preferably 20-23 pb;
4) flattening the tail end: the 5' -end was removed by adding a single-stranded nuclease,
as one embodiment, the single strand nuclease in step 4) is mungbean acid nuclease or S1 nuclease CEL I; as one of the further embodiments, mung bean nuclease is preferably used;
5) selection of terminal sequences: the selection of the terminal sequence has been carried out by adding DNA ligase and linker B1; or replacing to add base for extension in the circulation corresponding to the position of the sequence to be selected, and purifying DNA with complete length by polyacrylamide gel electrophoresis to complete end selection;
as one embodiment, said DNA ligase in said step 5) is T4DNA ligase or blunt-end/TA ligase;
as one embodiment, the linker B1 in step 5) comprises a sequence of AscI, Acc65I, AccB1I, ApaI, Asp718I, AspS9I, AvaII, BamHI, BanI, BbeI, Bme18, 18I, BmgT120I, BmiI, BshFI, BshNI, BsnI, Bsp120I, BspLI, BstEII, BstPI, BsuRI, BtsCI, Eco91I, EheI, FseI, HaeIII, KasI, KpnI, PhoI, sanddi, SfoI, SgsI, or ssdi cleavage sites; preferably, linker B1 is the nucleotide sequence shown in SEQ ID NO.3 and/or SEQ ID NO.4, or the sequence of AsiSI, BsePI, BssHII, NotI, HhaI, PauI, SfaAI;
as an embodiment, optionally, the step 5) further comprises selecting the terminal sequence after selecting the terminal sequence by using a B1 linker or selecting the terminal sequence again by using a B2 linker, wherein the linker B2 is the nucleotide sequence shown in SEQ ID NO.5 and/or SEQ ID NO. 6.
6) Removal of terminal sequences: adding DNA ligase and a linker B3, and removing a terminal sequence;
in the method of the present invention, as one of the embodiments, the DNA ligase is T4DNA ligase, or blunt end/TA ligase;
in the method of the present invention, as one embodiment, the linker B3 is a sequence comprising Type II S restriction endonuclease, preferably a sequence comprising BbsI, AcuI, AlwI, BaeI, BccI, bcei, BbvI, BcoDI, BfuAI, BpuEI, BsaXI, BseRI, BsgI, BsmBI, BsmFI, bsmmi, BspCNI, BspMI, BspQI, BtgZI, EarI, Ecil, FauI, HgaI, HphI, HpyAV, MmeI, MnII, SapI, or nisattype II S restriction endonuclease; further preferably, the linker B3 is the nucleotide sequence of SEQ ID NO.7 and/or SEQ ID NO. 8;
7) ligation of cloning adaptor: adding DNA ligase and a joint B4 to carry out the connection of the cloning joint; in the method of the present invention, as one embodiment, the DNA ligase is T4DNA ligase, blunt end/TA ligase;
in the method of the present invention, as one embodiment, the linker B4 comprises: 1) cas9 knockdown of the relevant chimeric RNA sequence (addgene: #52961, LentiCRISPR V2); 2) cas9 activates the relevant chimeric RNA sequence (addgene: # 73797); 3) cpf1 (addgene: #84739 and # 84740).
In the method of the present invention, as one embodiment, in the step 2), 1 base extension is performed using dNTPs having reversible chemical modification of the hydroxyl group at the 3' end of the deoxyribose group: adding dNTPs containing DNA polymerase and reversible chemical modification at the 3' end into the magnetic bead DNA compound in the step 1); the method of the present invention is preferably O-azidomethyl at the 3 'end, but the present invention can also adopt other dNTPs with 3' hydroxyl of deoxyribose group and reversible chemical modification to replace 3'-O-azidomethyl-dNTPs used by the method, and the present invention includes but is not limited to 3' -O-azidomethyl-dNTPs, 3 '-O-allyl-dNTPs or 3' -O-allyloxy-borony-dNTPs.
dNTPs with reversible chemical modification of the hydroxyl group at the 3' end have 2 characteristics, and a) can be doped into an extended chain by DNA polymerase; b) the modification can be efficiently cleaved off and 3' restored to hydroxyl; thus, reversible end modifications having both of the above-described characteristics can be used in the method of the present invention.
In the method of the present invention, as one embodiment, the step 3) is a step of extending the fixed-length DNA; the extension of base pairs is carried out by those skilled in the art according to actual needs, such as but not limited to 15-100 pb, for example, 23bp extension is needed, and the step 2) is repeated twenty times. The invention repeats step 2) to carry out multi-cycle reversible end reaction, and then obtains a sequence with fixed length through the single-stranded nuclease reaction of step 3), wherein the length of the sequence can be 15bp to 100 bp; via step 3), an extended DNA single strand may be obtained via a DNA denaturation step such as heat denaturation or other denaturing amount, and then linked to PAM using a DNA single strand ligase to select the relevant single stranded oligonucleotide SEQ ID NO:4, obtaining the DNA structure in the step 5) through PCR reaction.
In the method of the present invention, as one embodiment, the step 5) is optional, wherein, during the step 3) cycle, specific 3' -O-azidomethyl-dNTPs are added in the 1 st cycle reaction, and after the sequence extension with specific characteristics is completed, the step 5) operation is not required;
adding DNA ligase and a B1 adaptor in the step 5), combining with the tail end of the extended DNA to form an enzyme cutting site, and selecting the required fragment by using the enzyme cutting site. The B1 joint comprises 1 enzyme cutting site part, and can form 1 enzyme cutting site together with the terminal nucleic acid extended in the step 2), and a sequence with a specific terminal can be selected after being cut by a DNA endonuclease.
In the method, in the step 6), DNA ligase and a B3 adaptor are added for later PCR amplification of a ligation product; wherein the B3 adaptor contains 1 type 2S restriction enzyme site, and can cut DNA and excise needed number of bases.
In the method of the present invention, in the step 7), a DNA ligase and a B4 linker are added, and the step 7) is used for connecting sequences required for molecular cloning, and fragments can be cloned on a vector for testing by using an existing cloning technique such as a conventional cloning technique or a seamless connection technique; the sequence can be selected by those skilled in the art according to the final vector and general knowledge in the art, and the vectors of the present invention include, but are not limited to, chimeric RNA sequence, LentiCRISPR V2 or Lenti-guide sequence, among other vectors available for commercialization at present.
In the method of the present invention, as one embodiment, the single-stranded DNA having the linker a1 immobilized on the stationary phase having streptavidin in the step 1) can be prepared by a method commonly used in the art, for example, the double-stranded DNA can be first denatured into single strands by heat denaturation or a denaturing agent sodium hydroxide, and the solution can be returned to 1M sodium chloride environment for streptavidin and biological affinity reaction, thereby immobilizing the DNA having the a1 linker on the stationary phase.
As one embodiment, the present invention includes, but is not limited to, the following method:
1-1) acquisition of fragmented double-stranded DNA: double-stranded DNA can be reverse transcribed using either ultrasound or fragmented RNA to obtain fragments of 10bp to 40 Kbp;
1-2) DNA end repair and 3' end tailing, namely mixing the DNA obtained in the step (1) with DNA polymerase to carry out DNA 3' end flattening, and adding A or G to the 3' end;
in the method of the present invention, as one embodiment, the DNA polymerase includes: 3' end with A tail, using a klenow fragment (3' → 5' exo)-) (ii) a A G-tail was added to the 3' end, and Taq DNA polymerase was used.
1-3) ligation of 3' -Biotin modified A1 linker: mixing the product of the step (2) with library ligase, and adding an A1 linker; the ligase is blunt end/TA ligase;
1-4) purification of the ligation product: double-stranded DNA was purified using 1 Xvolume of Ampure XP beads, or DNA was recovered using ethanol or isopropanol precipitation or DNA silica membrane column purification kits.
1-5) binding of DNA to a solid phase: mixing the DNA obtained in the step 1-4) with an isovolumetric stationary phase with the concatemer, and incubating at room temperature;
in the method of the present invention, as one embodiment, the solid phase in step 1-5) comprises 2X B & W buffer resuspended T1 magnetic beads, Controlled porous Glass (Controlled Pore Glass), silica gel (silica gel), porous resin (Pore resin), microarray slides (Glass slides), silicon-based chips (silica chips), or nylon membrane (nylon membrane); in the method of the present invention, as one embodiment, the incubation time at room temperature includes, but is not limited to, 15 min.
1-6) denaturation of double-stranded DNA into single strands: modifying DNA on the magnetic beads with 0.1N NaOH to obtain single strands;
in the method of the present invention, as one of the embodiments, the denaturation in the step 1-6) includes thermal denaturation or denaturation using a DNA denaturant, and the denaturant is sodium hydroxide, formamide or urea; sodium hydroxide is preferred.
In the method of the present invention, as one embodiment, the double-stranded DNA in step 1-1) may be a nucleic acid sample of any origin, including genomic DNA of any species, cDNA obtained by reverse transcription of RNA, DNA library obtained by targeted capture technology, DNA obtained by chromatin co-immunoprecipitation, and the like, DNA clones generated from biological genomic DNA, such as gene cDNA clones, bacterial artificial chromosome clones, yeast artificial chromosome clones, and the fragment size thereof is in the range of 10bp to 40 Kbp.
In the method of the present invention, as one embodiment, in the step 1-2), the DNA polymerase may flatten the end of DNA3 'and add a or G to the 3' end; as one embodiment, the DNA polymerase includes: when the A tail is added to the ' end of DNA3, the Klenow fragment (3' → 5' exo) is used-) (ii) a When the 3' end of the DNA is added with a G tail, Taq DNA polymerase is used; as one embodiment, if Taq DNA polymerase is used for adding G to the 3 'end, T needs to be changed to G at the 3' end of the forward primer of the A1 adaptor;
the DNA is connected with an A1 linker to prepare for the subsequent reaction of accessing dNTPs with reversible chemical modification at the 3'; meanwhile, the linker A1 comprises a part of the sequence for molecular cloning, and if the vector is changed, the part of the sequence of the linker can be adaptively designed according to the vector, wherein the change of the A1 is mainly changed according to the target vector of cloning.
The DNA library produced by the method of the present invention can be cloned into a DNA vector, and according to the currently mainstream cloning method of the cloning library such as the seamless cloning method (seamless cloning method), if designed into a seamless cloning library, the sequence of A1 should be the same as the sequence of the assembly arm required for assembly, and one side is usually a promoter sequence, for example, A1 can be changed into the 3' end sequence of mU6 or T7 promoter; in the traditional cloning method, restriction endonuclease is used for preparing a library, A1 needs to contain a vector sequence to a restriction enzyme cutting site.
In the method of the present invention, as one embodiment, in the steps 1-4), the other PCR purification reagents include, but are not limited to, a DNA purification kit, and currently, DNA purification methods mainly used in laboratories include ethanol or isopropanol precipitation, silica gel column purification, and magnetic nucleic acid bead purification; nucleic acid magnetic bead purification methods are preferred, including, for example, but not limited to, Beckman brand Ampure XP magnetic beads.
In the method of the present invention, as one embodiment, in the step 1-5), when 2X B & W buffer is used, DNA is mixed with an equal volume of 2X B & W buffer resuspended T1 magnetic beads, and incubated at room temperature for 15 min.
As one embodiment, in the steps 1-5), other major solid phases for DNA single nucleotide chain synthesis include, but are not limited to, for example, Controlled Pore Glass, silica gel, porous resin, or Glass slides or silicon chips or nylon membrane used in a method for synthesizing DNA single nucleotide chains on a chip, wherein DNA single nucleotide chains of a specific sequence may be first immobilized on the surface thereof for hybridizing a DNA library, and before synthesizing DNA of a fixed length, the DNA may be hybridized by DNA single strand denaturation and annealing, i.e., the DNA single nucleotide chains may be first immobilized on the surface thereof; or by using DNA library with special chemical modification such as biotin modification, and directly fixing DNA to streptavidin resin or magnetic beads, and these materials are characterized by using a pair of molecules with strong interaction such as covalent binding, and other molecules with covalent bond interaction are modified, and can be used for synthesizing DNA library by using the method of the present invention.
In the method of the present invention, the immobilization of the DNA on the stationary phase enables efficient and rapid switching of the DNA reacted in the previous step into a solution, particularly in the step of DNA extension of a fixed length.
In the method of the present invention, as one embodiment, in the steps 1 to 6), the denaturation is performed by heat, or by using a DNA denaturing agent such as sodium hydroxide, formamide, or urea; sodium hydroxide is preferred.
As one embodiment, the present invention denatures DNA and then hybridizes with a primer to provide a DNA structure for reversible end reaction, which can achieve extension of the fixed length of the extended strand by incorporating dNTPs with reversible modification of the 3 'end into the 3' end of the primer using DNA polymerase, or can be formed using other methods, such as designing primer 1 to have a biotin-modified form, followed by heat denaturation annealing with the library and immobilization onto a solid phase.
In one embodiment of the method of the present invention, the fixed length is 15 to 100bp, preferably 20 to 30pb, and more preferably 20 to 23pb, which may be, for example, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 30, 35, 40, 45, 50, 60, 70, 80, 90, or 100 pb. It will be understood by those skilled in the art that the fixed length means that the resulting DNA samples all have the same length, for example, all 20, 21, 22 or 23bp in length; the fixed length sequences may be the same base or different bases; the bases are the same or different, depending primarily on the template strand, if the bases are the same as the template strand bases, if the template has only one base, then the bases in the resulting sequence are the same; if there are multiple templates, the bases of the products are not identical.
In the method of the present invention, as one embodiment, the terminal specific sequence is a DNA library or a target DNA sample selected by using the library of the present invention has an NGG, NAG or NAA sequence at the terminal; as one embodiment, SpCas 93' NGG is preferred; SpCas9VRER:3' NGCG; SpCas9EQR:3' NGAG; SpCas9VQR is 3' NGAN or NGNG; saca 9:3' NNGRRT or NNGRR (N); AsCpf1and LbCpf1:5' TTTV; AsCpf1RR:5' TYCV; LbCpf1RR 5' TYCV; AsCpf1RVR:5' TATV; neisseria Meningitidis (NM):3' NNGATT; streptococcus Thermophilus (ST):3' NNAGAAW; or Treponema Dentanola (TD):3' NAAAAC.
As one embodiment, the method of the present invention can be used not only for the construction of a fixed length DNA of a criprpr vector, but also for the construction of an shRNA library.
In a second aspect, the present invention provides a method for sequence selection, the method comprising: in the method, only specific dNTP is doped during terminal sequence synthesis, the length of a target sequence is the target length, but the length of a non-target sequence is shorter than the length of the target sequence, the target length DNA is distinguished from the non-target length DNA by gel electrophoresis, and the target length DNA is purified, namely the screening of the specific sequence is completed.
The method can conveniently construct the fixed length and the specific terminal sequence library of the region without the reference genome and with high sequence variability, and simultaneously, the method can greatly reduce the construction cost of the library. The present invention proves the method of synthesizing library with fixed length and specific terminal sequence based on template strand, the DNA library synthesized by the method has the sequence structure as shown in figure 2-D, the middle part is 20bp sequence from initial DNA material, the fixed length of the library means that the length of the 20bp is fixed, the terminal is specific sequence, that is, before obtaining 20bp DNA in figure 2-D, the sequence following 20bp is shown in figure 2-A, the result shown in figure 2-C can be obtained by 2 methods provided by the present patent for screening, according to the left-to-right direction in the figure, the sequence on the right side of 20bp is NGG, the sequence on the terminal is NGG sequence.
Drawings
FIG. 1: the invention constructs a flow chart of sgRNA library construction;
FIG. 2: the DNA sequence of the PCR product after the B1-B4 joint is connected in the embodiment 1 of the invention;
FIG. 3: the results obtained in the end sequence selection and excision steps of the present invention are shown in example 1 by DNA agarose gel electrophoresis. A. After 23 times of 1bp extension and 3' hydroxyl recovery of the template DNA, a band of 144bp appears in the 5 th lane, and the sequence is shown in FIG. 2-A. B. After AscI digestion and ligation with B2 linker, a 142bp band appeared in lane 4, and its sequence is shown in FIG. 2-B. C. The DNA in Panel B was digested with AscI, resulting in a band of about 81bp in lane 5. D. The 4 th lane generated a 141bp band by BbsI cleavage to remove the terminal base, 5' end filling up and ligation of B4 adaptor, and the DNA sequence is shown in FIG. 2-D. D. The DNA in the third lane is distributed about 59 bp-60 bp by the amplification of the short primer and the electrophoresis of the modified polyacrylamide;
FIG. 4: the invention in the embodiment 1 generates a representative diagram of the distribution of the sequences in the 20bp sequence library in the mouse genome; the green part is the distribution of the starting DNA material-mouse embryonic stem cell H3K4me3 chromatin co-immunoprecipitated DNA in the mouse genome, and the red part is the representative distribution map of the 20bp library generated by the starting material through the operation steps of the invention in the mouse genome;
FIG. 5: the method of the embodiment 1 of the invention synthesizes a comparison chart of the conditions of 20bp sgRNA library and starting DNA in the mouse embryonic stem cell H3K4me3 area according to the template strand: FIG. A shows the DNA alignment of the starting material of example 1 to a murine embryonic stem cell H3K4me3, and FIG. C shows the alignment of the library obtained in example 1 to a murine embryonic stem cell H3K4me 3; panel B is the case of example 1 where the DNA of the starting material immediately follows the NGG sequence, and panel D is the proportion of the library obtained in example 1 immediately following the NGG sequence;
FIG. 6: the comparative figure of the condition of synthesizing 20bp sgRNA library of the human embryonic stem cell H3K4me3 area and the starting DNA according to the template strand in the method of the embodiment 2 of the invention is as follows: FIG. A shows the DNA alignment of the starting material of example 2 to human embryonic stem cells H3K4me3, and FIG. C shows the alignment of the library obtained in example 2 to human embryonic stem cells H3K4me 3; panel B is the case of example 2 where the DNA of the starting material immediately follows the NGG sequence, and panel D is the proportion of the library obtained in example 2 immediately following the NGG sequence;
FIG. 7: the comparison of the synthetic library of the method of example 2 of the present invention and the synthetic library of the primer pool is shown in the following figure: panel A is a proportion of DNA of each length obtained by synthesis in example 2, panel B is a proportion of DNA of 19 and 20bp immediately followed by NGG sequence in example 2, panel C is a proportion of DNA of each length obtained by synthesis in a commercial primer pool, and panel D is a proportion of DNA of 20bp immediately followed by NGG sequence obtained by synthesis in a commercial primer pool.
Detailed Description
The present invention will be further described with reference to the following examples or test examples, but the present invention is not limited thereto.
In the following examples, 20bp libraries were synthesized using mouse embryonic stem cell (example 1) and human embryonic stem cell (example 2) H3K4me3 modified region DNAs as starting materials, and examples 1and 2 were constructed according to the following procedure, except that the starting materials were different. These libraries are all 20bp long, but are not intended to limit in any way the utility of the invention to the synthesis of libraries of other lengths. Wherein FIGS. 1and 2 are process flow diagrams for library construction of the present invention, wherein the reagents and materials used in examples 1and 2 are as follows:
firstly, a reagent:
1. example 1 starting material: mouse embryonic stem cell H3K4me3 chromatin co-immunoprecipitation DNA
2. Example 2 starting material: human embryonic stem cell H3K4me3 chromatin co-immunoprecipitation DNA
NEBNext ultrafast end repair/dA tail addition Module, NEB, E7442
NEBNext ultra-fast connection module, NEB, E7445;
5.Ampure XP beads,Beckman,A63881;
6.DynabeadsTMt1 magnetic bead, inteviety, 65602;
7.2 Xbind and wash buffer;
8. nuclease-free water, Ambion, 10977023;
9.1M sodium hydroxide, Sigma, S2770-100 ML;
10. a hybridization buffer;
11. an extension buffer;
12.3' hydroxy recovery buffer;
13. washing buffer solution 1;
14. washing buffer solution 2;
15.10X MBN buffer, NEB, M0250;
16.MBN(10U/μL),NEB,M0250;
17.10X T4DNA ligation buffer, NEB, B0202;
t4 polynucleotide kinase, NEB, M0201;
t4DNA ligase, NEB, M0202;
qiaquick gel recovery kit, Qiagen, 28704;
qiaquick PCR product recovery kit, Qiagen, 28104;
22.10 XCutsmart buffer, NEB, B7204;
23.10X NEBuffer 2.1,NEB,B7202;
t4DNA polymerase (3U/. mu.l), NEB, M0203;
25.AscI,NEB,R0558;
26.BbsI,NEB,R3559;
27.5X Q5 reaction buffer, NEB, M0491;
28.dNTPs 10mM,NEB,N0447;
q5DNA polymerase, NEB, M0491;
30.40% acrylamide: methylene bisacrylamide (29:1), Sangon, B546013;
31.10 XTBE buffer, Sangon, B540024;
32. urea, Sangon, a 510907;
33. ammonium persulfate, Sangon, a 00486;
TEMED (N, N' -tetramethylethylenediamine), Sangon, a 100761;
32.2 XUrea-TBE buffer, Sangon, C506046.
Example 1
The present invention will be further described with reference to the following examples or test examples, but the present invention is not limited thereto.
(1) Prepare all joints
1. Preparation of the joint
A1-R-3' -biotin (SEQ ID NO.1), B1-F (SEQ ID NO.3), B2-F (SEQ ID NO.5), B3-F (SEQ ID NO.7), B4-F (SEQ ID NO.9) were phosphorylated using the following systems, respectively:
adding 3 μ l of reverse primers A1-F (SEQ ID NO.2), B1-R (SEQ ID NO.4), B2-R (SEQ ID NO.6), B3-R (SEQ ID NO.8), B4-R (SEQ ID NO.10) and 7ul of H2O, respectively, cooling to 25 ℃ at 0.1 ℃ per second on a PCR instrument for 2 minutes, and storing the adaptor in a refrigerator of minus 20.
(2) DNA end repair and 3' end tailing:
DNA end repair and 3' end addition of A. reaction System configuration was performed according to the following System
Incubate at 20 ℃ for 30min and 65 ℃ for 30 min.
(3) Ligation of the 3' -Biotin modified a1 linker and purification of the ligation product:
3. connection A1 Joint
Incubate at 20 ℃ for 4 hours DNA was purified using 80. mu.l Ampure XP beads, eluting at 100. mu. l H2O.
(4) Binding of DNA to a solid phase:
4. Immobilization of DNA to streptavidin magnetic beads:
50 μ l Dynabeads were takenTMT1 magnetic beads to 1 eppendorf tube, adding 200. mu.l of 1 Xbinding and washing buffer, placing eppendorf tube on magnetic rack for 1min, discarding supernatant, washing repeatedly, resuspending Dynabeads with 100. mu.l of binding and washing bufferTMT1 magnetic bead, adding 100. mu.l DNA, mixing, incubating at room temperature for 15min, placing eppendorf tube on magnetic frame for 1min, discarding supernatant, washing magnetic bead with 200. mu.l 1X binding and washing buffer for 2 times, washing magnetic bead with 100. mu.l 0.1M NaOH for 1 time
(5) Denaturation of double-stranded DNA into single strands
5. Denaturation of double-stranded DNA into single strands:
resuspend the beads with 100. mu.l 0.1M NaOH, incubate for 10 min at room temperature, place the eppendorf tube on the magnetic frame for 1-2 min, discard the supernatant, wash 1 time with 100. mu.L 0.1M NaOH, wash 2 times with 100. mu.l hybridization buffer.
(6) Hybridization of primers
6. Primer 1(SEQ ID NO.11) hybridizes:
mu.l of 10. mu.M primer 1 to 95. mu.l of hybridization buffer were diluted 5. mu.l, resuspended in magnetic beads, incubated at 85 ℃ for 30s, at 65 ℃ for 60s, cooled to 40 ℃ at a rate of 0.1 ℃/s, incubated at 40 ℃ for 5 minutes, Eppendorf tubes were placed on a magnetic frame, the supernatant was discarded, and the magnetic beads were washed 2 times with wash 1.
(7) Base extension and fixed length DNA extension:
7.23 cycles of reversible end reactions:
adding extension buffer solution, incubating at 65 ℃ for 1min, placing Eppendorf on a magnetic frame, discarding the supernatant, washing the magnetic beads with the washing buffer 1 for 2 times; adding 3' hydroxyl recovery buffer, incubating at 65 ℃ for 30s, placing the tube on a magnetic frame, discarding the supernatant, adding washing buffer 2, incubating at 65 ℃ for 30s, placing the Eppendorf tube on the magnetic frame, discarding the supernatant, washing the washing buffer 2 times, washing the washing buffer 1 time, and repeating the steps until 23 cycles are completed.
(8) Flattening the tail end:
flattening the 5' end suspension arm of DNA
The beads were washed 2 times with TE and the reagents resuspended beads were prepared as follows:
incubate at 30 ℃ for 30min, and TE wash the beads for 3 times.
DNA 5' end phosphorylation
The reagent resuspension beads were prepared as follows:
incubating at 37 ℃ for 30min, and washing the magnetic beads for 2 times with TE
(9) Selection of terminal sequences:
B1 Joint connection
The reagent reselection beads were prepared as follows:
incubating at 16 ℃ overnight, washing the beads with TE 2 times
PCR amplification
The PCR system was configured as follows:
the PCR tube was placed on a thermal cycler and the PCR reaction was performed according to the following procedure:
DNA agarose gel electrophoresis and DNA recovery
The DNA sequence generated in step 11 is shown in FIG. 2-A. A1.8% low melting point agarose TAE gel containing 50. mu.g/ml ethidium bromide was prepared, the PCR product was mixed with loading buffer, then added to the gel spot wells, run at 70V for 35 minutes, developed under 300nm UV light, as shown in FIG. 3-A, resulting in a 144bp band, which was cut and DNA purified using the gel extraction kit.
And (3) adding biotin to the end of the DNA5 for modification.
The PCR system was prepared as follows:
the PCR tube was placed on a thermal cycler and the following procedure was followed:
DNA recovery Using PCR clean-up kit, 40. mu.l water elution, concentration about 70 ng/. mu.l
14. First round of NGG PAM selection
200ng Biotin-DNA is fixed on streptavidin magnetic beads, and an enzyme digestion system is configured according to the following system: :
the Eppendorf tube was placed in a 37 ℃ water bath and incubated overnight.
B2 Joint connection
The reaction system was prepared as follows:
after 2 hours reaction at room temperature, the beads were washed 2 times with TE.
PCR amplification
The PCR reaction was prepared as follows:
the PCR tube was placed in a thermal cycler and the PCR reaction was performed according to the following procedure:
DNA agarose gel electrophoresis and DNA recovery
DNA generated in step 16A 1.8% low melting agarose TAE gel containing 50ug/ml ethidium bromide was prepared as shown in FIG. 2-B, the PCR product was mixed with loading buffer, added to the gel spot wells, run at 70V for 35 min, developed under 300nm UV light to obtain a 142bp band as shown in FIG. 3-B, the band was excised, and the DNA was purified using Qiagen gel recovery kit.
18. Second round NGG PAM selection:
the cleavage reaction was prepared as follows:
incubate at 37 degrees overnight.
DNA agarose gel electrophoresis and DNA recovery
2% low melting point agarose TAE gel containing 50ug/ml ethidium bromide was prepared, the digestion product was mixed with loading buffer, added to the gel spot wells, run at 35V for 90 minutes, developed under 300nm UV light, cut a band of about 81bp as shown in FIG. 2-C, DNA purified using Qiagen gel recovery kit, and eluted with 44.8. mu.l water.
Smoothing of hanging arms at DNA 5' ends
The reaction system was prepared as follows:
incubate at 30 ℃ for 30min, purify the DNA using 100. mu.l Ampure XP beads, and elute with 17.5. mu.l water.
Phosphorylation of DNA 5' end
The reaction system was prepared as follows:
incubate 30 minutes at 37 ℃ and 20min at 65 ℃.
(10) Removal of terminal sequences
22. The reaction system was prepared as follows:
incubate overnight at 16 ℃. 60ul Ampure XP beads purified DNA, 34.5. mu. l H2O eluted DNA.
23.PCR
The reaction system was prepared as follows:
the PCR tube was placed in a thermal cycler, and PCR reaction was performed according to the following procedure
DNA agarose gel electrophoresis and DNA recovery:
the DNA sequence generated in step 23 is shown in FIG. 2-C, and 1.8% low melting agarose TAE gel containing 50. mu.g/ml ethidium bromide is prepared, the PCR product is mixed with loading buffer, then added to the gel spot wells, run at 70V for 35 minutes, developed under 300nm UV light, the 143bp band is excised, and the DNA is purified using Qiagen gel recovery kit.
DNA 5' end with biotin modification.
The PCR system was prepared as follows:
the PCR tube was placed on a thermal cycler and the following procedure was followed:
DNA recovery was performed using Qiagen PCR product purification kit, eluted with 40. mu.l water at a concentration of about 70 ng/. mu.l
NGG PAM removal
200ng Biotin-DNA is fixed on streptavidin magnetic beads, and an enzyme digestion system is configured according to the following system: :
the Eppendorf tube was placed in a 37 ℃ water bath and incubated overnight.
Steps 27 to 31 correspond to claims 1 to 7):
completion of DNA 3' end
The reaction system was prepared as follows:
incubate at 12 ℃ for 15min, and wash the beads 2 times with TE.
Phosphorylation of DNA 5' end
The reaction system was prepared as follows:
incubate at 37 ℃ for 30min, and TE wash the beads 2 times.
(11) Ligation of cloning adaptors
29. The reaction system was prepared as follows:
incubate at 16 ℃ overnight, and wash the beads 2 times with TE buffer.
PCR amplification:
the reaction system was prepared as follows:
the PCR tube was placed in a thermal cycler, and PCR reaction was performed according to the following procedure
DNA agarose gel electrophoresis and DNA recovery
The DNA sequence generated in step 30 is shown in FIG. 2-D. A1.8% low melting point agarose TAE gel containing 50. mu.g/ml ethidium bromide was prepared, the PCR product was mixed with loading buffer, then added to the gel spot wells, run at 70V for 35 minutes, developed under 300nm UV light to give a 141bp band below 200bp as shown in FIG. 2-D, the band was excised, and the DNA was purified using the Qiagen gel recovery kit.
(12) Short DNA Strand preparation
32. Short DNA Strand preparation
The reaction system was prepared as follows:
the PCR tube was placed in a thermal cycler, and PCR reaction was performed according to the following procedure
DNA was purified by 100% ethanol precipitation.
33. DNA purification by denaturing polyacrylamide gel electrophoresis
Preparing DNA modified polyacrylamide gel:
after coagulation, pre-run at 180V for 30min, mix DNA from step 32 with 2x urea-TBE buffer, denature at 90 ℃ for 5min, place directly on ice, start loading immediately, run at 180V for 40 hours, strip, stain with 0.1% toluene azure for 30min, decolour with water, excise more than 60nt portions, incubate at 65 ℃ overnight in TE.
(13) Construction of high throughput sequencing libraries
34. Constructing a high-throughput sequencing library, and analyzing the content of 20bp in the library;
the PCR tube was placed in a thermal cycler, and PCR reaction was performed according to the following procedure
A215 bp fragment was recovered by 1.8% DNA agarose gel electrophoresis.
The invention constructs the library of example 1and verifies 1) synthesis according to a template by example 1; 2) fixed length DNA; 3) and (4) selecting a terminal sequence.
Wherein figure 3: the results obtained in the terminal sequence selection and excision steps of the present invention were plotted by DNA agarose gel electrophoresis. A. After 23 times of 1bp extension and 3' hydroxyl recovery of the template DNA, a band of 144bp appears in the 5 th lane, and the sequence is shown in FIG. 2-A. B. After AscI digestion and ligation with B2 linker, a 142bp band appeared in lane 4, and its sequence is shown in FIG. 2-B. The DNA in Panel B was digested with AscI, resulting in a band of about 81bp in lane 5. D. The 4 th lane generated a 141bp band by BbsI cleavage to remove the terminal base, 5' end filling up and ligation of B1 adaptor, and the DNA sequence is shown in FIG. 2-D. D. After amplification with short primers and denaturing polyacrylamide gel electrophoresis, the DNA distribution in the third lane is about 59 bp-60 bp.
From FIG. 4, it can be seen that the 20bp sequence is aligned to the modified region of murine H3K4me3, and the resulting sequence from this method is derived from the starting material DNA of the present invention. As can be seen from FIG. 5A, C, the ratio of 20bp gRNA to starting DNA in the target region was substantially identical, demonstrating that the method of the present invention can construct a DNA library of a fixed length from the DNA in the target region.
From FIGS. 5-B, D, it can be seen that the gRNA of H3K4me3 region of mouse embryonic stem cells is followed by NGG sequence in about 96.65% of genome, which proves that the method can select DNA of specific terminal sequence from the library.
By combining the two points, the invention can construct a library of the mouse embryonic stem cell H3K4me3 region with fixed length and capable of being used together with spCas 9.
Example 2 preparation of a library of human H9 embryonic Stem cells
This example 2 constructs a fixed-length library using human embryonic stem cell H3K4me3 chromatin co-immunoprecipitation DNA, which was the same as in example 1 except that the template was different. The results are shown in FIGS. 6 and 7.
As shown in FIG. 6, the synthetic libraries, which are identical to the starting material, are located in the H3K4me3 region of human embryonic stem cells, and these DNAs are followed by NGG sequences, which demonstrates that the screening sequences of the present invention are also perfectly reproducible.
FIG. 6: for comparison of the case of synthesizing a 20bp sgRNA library of the H3K4me3 region of human embryonic stem cells from the template strand with the starting DNA: panel A is the case of starting material DNA alignment to human embryonic stem cells H3K4me3, and panel C is the case of the library alignment obtained to human embryonic stem cells H3K4me 3; panel B is the case where the starting material DNA follows the NGG sequence, and Panel D is the proportion of the resulting library that follows the NGG sequence; as can be seen in FIG. 6, the library is consistent with the starting DNA material in the proportion of the H3K4me3 modified region, and 20bp followed by the NGG sequence.
FIG. 7: the comparison of the synthetic library of the method of the invention and the synthetic library of the primer pool is shown in the figure: panel A is a proportion of DNA of each length obtained by synthesis, panel B is a proportion of DNA of 19 and 20bp followed by NGG sequence, panel C is a proportion of DNA of each length obtained by synthesis of a commercial primer pool, and panel D is a proportion of DNA of 20bp followed by NGG sequence obtained by synthesis of a commercial primer pool.
The current mainstream method for synthesizing a library by a gRNA is a primer pool synthesis method, and the inventor compares the method with the primer pool synthesis method, mainly shows whether a reference sequence is needed or not, whether a large-scale instrument is needed or not and whether the cost is low or not. The method can realize the synthesis of gRNAs without reference sequences which cannot be realized by a primer pool synthesis method, and simultaneously ensures that more gRNAs with functions can be realized by a final library. The invention does not need large-scale equipment, which provides convenience for the synthesis of gRNA in a common laboratory, and from the cost perspective, the invention manufactures a library containing about 13 ten thousand gRNAs, and through calculation, the cost for constructing the library is about 1000 yuan, which is far lower than the synthesis cost of a primer pool; from the attached figure 7, it can be seen that about 85% of the libraries of the present invention are 20bp and 19bp (both can be used as grnas because the sequences immediately following on the genome are NGG), about 89% of the grnas synthesized by the primer pool are 20bp, but more than 90% of 20bp and 19bp of the present invention are immediately following on the genome NGG sequences, and the proportion of the libraries synthesized by the primer pool is only 89%, so the method of the present invention has great potential to replace the current expensive primer pool synthesis method.
This patent | Primer pool synthetic sequence | |
Whether a reference genome is required | Does not need to use | Need to make sure that |
Whether or not precision instruments are required | Does not need to use | Need to make sure that |
Cost actuarial | At present, 13 ten thousand pieces of the Chinese herbal medicine are synthesized, about 1000 yuan | 1 ten thousand yuan, about 1 yuan and 1 piece |
By combining example 1and example 2, it can be seen that the method of the present invention not only can robustly realize the synthesis of libraries of fixed length and specific terminal sequence according to the starting materials, but also can greatly save the cost.
Sequence listing
<110> Zhejiang university
<120> method for synthesizing DNA library with fixed length and specific terminal sequence based on template material
<130> LAC170846P
<160> 16
<170> SIPOSequenceListing 1.0
<210> 1
<211> 18
<212> DNA/RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 1
gaaaggacga aacaccgt 18
<210> 2
<211> 41
<212> DNA/RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 2
cggtgtttcg tcctttccac aagatagatc ggaagagcgt c 41
<210> 3
<211> 58
<212> DNA/RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 3
cgcgcccaca cgtctgaact ccagtcacag tcaacaatct cgtatgccgt cttctgct 58
<210> 4
<211> 60
<212> DNA/RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 4
caagcagaag acggcatacg agattgttga ctgtgactgg agttcagacg tgtgggcgcg 60
<210> 5
<211> 56
<212> DNA/RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 5
cgcgcctcat cgcgtcgacc aatccatatg actagtagat cctctagagt cgacct 56
<210> 6
<211> 54
<212> DNA/RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 6
gaaggtcgac tctagaggat ctactagtca tatggattgg tcgacgcgat gagg 54
<210> 7
<211> 58
<212> DNA/RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 7
tcttcaagct tggcgtaatc atggtcatag ctgtttcctg tgtgaaattg ttatccgc 58
<210> 8
<211> 59
<212> DNA/RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 8
agcggataac aatttcacac aggaaacagc tatgaccatg attacgccaa gcttgaaga 59
<210> 9
<211> 28
<212> DNA/RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 9
gttttagagc tagaaatagc aagttaaa 28
<210> 10
<211> 60
<212> DNA/RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 10
actttttcaa gttgataacg gactagcctt attttaactt gctatttcta gctctaaaac 60
<210> 11
<211> 44
<212> DNA/RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 11
tggacgctct tccgatctat cttgtggaaa ggacgaaaca ccgt 44
<210> 12
<211> 60
<212> DNA/RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 12
taacttgaaa gtatttcgat ttcttggctt tatatatctt gtggaaagga cgaaacaccg 60
<210> 13
<211> 19
<212> DNA/RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 13
tggaaaggac gaaacaccg 19
<210> 14
<211> 21
<212> DNA/RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 14
tgctatttct agctctaaaa c 21
<210> 15
<211> 106
<212> DNA/RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 15
caagcagaag acggcatacg agatcgatta gtgactggag ttcagacgtg tgctcttccg 60
acgtgtgctc ttccgatccc ttattttaac ttgctatttc tagctc 106
<210> 16
<211> 82
<212> DNA/RNA
<213> Artificial Sequence (Artificial Sequence)
<400> 16
aatgatacgg cgaccaccga gatctacact ctttccctac acgacgctct tccgatcttc 60
ttgtggaaag gacgaaacac cg 82
Claims (24)
1. A method for synthesizing a fixed-length and specific-end-sequence DNA library based on a template material, the method comprising:
1) primer hybridization: taking single-stranded DNA with a joint A1 fixed on a stationary phase with streptavidin, adding the single-stranded DNA into a DNA hybridization buffer solution and a primer 1, and carrying out gradient annealing;
the joint A1 comprises A1-R-3 '-biotin and A1-F, wherein A1-R-3' -biotin is a nucleotide sequence shown in SEQ ID NO.1, and A1-F is a nucleotide sequence shown in SEQ ID NO. 2;
the primer 1 has the sequence shown in SEQ ID NO: 11;
2) base elongation: then adding DNA polymerase and dNTPs with reversible chemical modification at the 3 'end into the product obtained in the step 1) to prolong the base, and then recovering the hydroxyl at the 3' end;
3) fixed length DNA extension: repeating the step 2), and circulating for 15-100 times;
4) flattening the tail end: removing the 5' -end by adding a single-stranded nuclease, wherein the single-stranded nuclease is mung bean acid nuclease, S1 nuclease or CEL I nuclease;
5) selection of terminal sequences: adding a DNA ligase and a linker B1, said linker B1 being a sequence comprising AscI, Acc65I, AccB1I, ApaI, Asp718I, AspS9I, AvaII, BamHI, BanI, BbeI, Bme18I, BmgT120I, BmiI, BshFI, BshNI, BsnI, Bsp120I, BspLI, BstEII, BstPI, BtsuI, BtsCI, Eco91I, EheI, FseI, HaeIII, KasI, KpnI, PhoI, SanDI, SfoI, SgsI, or SspDI cleavage sites to complete end selection; or
Replacing the base used for extension in the circulation corresponding to the position of the sequence to be selected, and then purifying the DNA with the complete length by polyacrylamide gel electrophoresis so as to complete the end selection;
6) removal of terminal sequences: adding a DNA ligase and a linker B3 to remove terminal sequences, wherein the linker B3 is a sequence comprising a Type II S restriction endonuclease site comprising BbsI, AcuI, AlwI, BaeI, BccI, BceAI, BbvI, BcoDI, BfuAI, BpuEI, BsaXI, BseRI, BsgI, BsmBI, BsmFI, BsmI, BspCI, BspMI, BspQI, BtgZI, EarI, Ecil, FauI, HgaI, HphI, HpyAV, MmeI, MnII, SapI, or StaStaStaNi;
7) ligation of cloning adaptor: adding DNA ligase and a linker B4 to connect a linker B4, wherein the linker B4 comprises B4-F and B4-R, wherein B4-F is a nucleotide sequence shown as SEQ ID NO.9, and B4-R is a nucleotide sequence shown as SEQ ID NO. 10;
wherein the single-stranded DNA having the linker A1 immobilized on the stationary phase having streptavidin in the step 1) is prepared by the following method:
1-1) acquisition of fragmented double-stranded DNA: reverse transcription of double-stranded DNA using sonicated or fragmented RNA to obtain fragments of 10bp to 40 Kbp;
1-2) DNA end repair and 3' end tailing, namely mixing the DNA obtained in the step 1-1) with DNA polymerase to carry out DNA 3' end flattening, and adding A or G to the 3' end;
1-3) ligation of 3' -Biotin modified A1 linker: mixing the product obtained in the step 1-2) with ligase, and adding a 3' -Biotin modified A1 linker; the ligase is blunt-end/TA DNA ligase;
1-4) purification of the ligation product: purifying double-stranded DNA by using 1X volume Ampure XP beads, or recovering DNA by using an ethanol or isopropanol precipitation method or a DNA silicon membrane column purification kit;
1-5) binding of DNA to a solid phase: mixing the DNA obtained in the step 1-4) with an isovolumetric stationary phase with streptavidin, and incubating at room temperature;
1-6) denaturation of double-stranded DNA into single strands: and (3) denaturing the DNA on the magnetic beads into single strands to obtain the DNA.
2. The method according to claim 1, wherein the cycle in the step 3) is 20 to 30 times.
3. The method according to claim 2, wherein the cycle in the step 3) is 20 to 23 times.
4. The method as claimed in claim 1, wherein the linker B1 in step 5) comprises B1-F and B1-R, wherein B1-F is the nucleotide sequence shown in SEQ ID NO.3 and B1-R is the nucleotide sequence shown in SEQ ID NO. 4.
5. The method as claimed in claim 1, wherein the linker B3 in step 6) comprises B3-F and B3-R, wherein B3-F is the nucleotide sequence shown in SEQ ID NO.7 and B3-R is the nucleotide sequence shown in SEQ ID NO. 8.
6. The method of claim 1, wherein the DNA hybridization buffer solution in step 1) is a5 XSSC buffer.
7. The method as claimed in claim 1, wherein the gradient annealing in step 1) is 85 ℃ for 30s,65 ℃ for 1min, and the temperature is slowly reduced to 40 ℃ at 0.1 ℃/s for 5min at 40 ℃.
8. The method of claim 1, wherein the step 2) further comprises:
the DNA polymerase is 9 ℃ N (D141A/E143A/A485L) DNA polymerase;
the dNTPs with reversible chemical modification are 3' -O-azidomethyl-dNTPs, 3' -O-allyl-dNTPs or 3' -O-allyloxycarbonylnes-dNTPs; or
The recovered 3' end hydroxyl is tri (2-carboxyacetic acid) phosphine hydrochloride or DTT.
9. The method of claim 1, wherein the step 2) further comprises adding an extension buffer while extending the base, incubating, washing the magnetic beads with washing buffer 1, then incubating with 3' hydroxyl-terminated recovery buffer, and washing with washing buffer 2.
10. The method according to claim 9, wherein in step 2),
the extension buffer composition includes 1. mu.M 3' -O-azidomethyl-dNTPs mix, 0.015. mu.g/ml 9 ℃ N DNA polymerase,50mM Tris pH 9.0,50mM NaCl,6mM MgSO4,1mM EDTA and 0.05% Tween 20;
the composition of the washing buffer 1 comprises 50mM Tris pH 9.0,50mM NaCl,1mM EDTA and 0.05% Tween 20;
the 3' hydroxyl recovery buffer composition includes 100mM TCEP,100mM Tris pH 9.0,100mM NaCl,50mM sodium ascorbate and 0.05% Tween 20; or
The composition of the wash buffer 2 included 100mM Tris pH 9.0,100mM NaCl and 0.05% Tween 20.
11. The method of claim 1, wherein the DNA ligase in step 5) is T4DNA ligase or blunt-ended/TA ligase, and the ligated B1 linker is used for terminal sequence selection.
12. The method of claim 1, wherein the step 5) further comprises: after the selection of the terminal sequence is carried out by using a B1 linker, the selection of the terminal sequence is carried out again by using a B2 linker, wherein the linker B2 comprises B2-F and B2-R, B2-F is the nucleotide sequence shown in SEQ ID NO.5, and B2-R is the nucleotide sequence shown in SEQ ID NO. 6.
13. The method of claim 1, wherein the DNA ligase in step 6) is T4DNA ligase or blunt-ended/TA ligase, and the ligated B3 linker is used for terminal excision.
14. The method of claim 1, wherein the DNA ligase in step 7) is T4DNA ligase or blunt-end/TA ligase, and ligated B4 is used for cloning adaptor ligation.
15. The method of claim 1, wherein the step 7) further comprises: for ligation of sequences required for molecular cloning, the fragments were cloned by existing cloning techniques into vectors for testing, the vector sequences being chimeric RNA sequences.
16. The method according to claim 1, wherein in the step 1-2), the DNA polymerase comprises: when the DNA is tailed at the 3' end with an A, the Klenow fragment (3' → 5' exo)-) (ii) a When the 3' end of the DNA is added with a G tail, Taq DNA polymerase is used.
17. The method of claim 1, wherein the solid phase in step 1-5) comprises 2X B & W buffer resuspended T1 magnetic beads, Controlled porous Glass (Controlled Pore Glass), silica gel (silica gel), porous resin (Pore resin), microarray slides (Glass slides), silicon-based chips (silica chips), or nylon membrane (nylon membrane).
18. The method according to claim 1, wherein the denaturation in the step 1-6) is thermal denaturation or denaturation with a DNA denaturant.
19. The method of claim 18, wherein the DNA denaturant is sodium hydroxide, formamide, or urea.
20. The method of claim 1, wherein the fixed length is 15-100 bp.
21. The method of claim 20, wherein the fixed length is 20-30 bp.
22. The method of claim 21, wherein the fixed length is 20-23 bp.
23. The method of claim 1, wherein the specific end sequence is an NGG, NAG, or NAA sequence.
24. The method of claim 23, wherein the specific end sequence is SpCas 9:3' NGG; or SpCas9VRER mutant 3' NGCG; SpCas9EQR mutant 3' NGAG; SpCas9VQR mutant 3' NGAN or NGNG; 3' NNGRRT or NNGRR (N) of the SacAS9 mutant; AsCpf1and LbCpf1:5' TTTV; AsCpf1RR mutant 5' TYCV; LbCpf1RR mutant: 5' TYCV; AsCpf1RVR mutant 5' TATV; neisseria Meningitidis (NM):3' NNGATT; streptococcus Thermophilus (ST):3' NNAGAAW; or Treponema DentataA (TD):3' NAAAAC, of the desired terminal sequence of the Cas9 protein.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810150519.6A CN110158157B (en) | 2018-02-13 | 2018-02-13 | Method for synthesizing DNA library with fixed length and specific terminal sequence based on template material |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810150519.6A CN110158157B (en) | 2018-02-13 | 2018-02-13 | Method for synthesizing DNA library with fixed length and specific terminal sequence based on template material |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110158157A CN110158157A (en) | 2019-08-23 |
CN110158157B true CN110158157B (en) | 2021-02-02 |
Family
ID=67635379
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810150519.6A Expired - Fee Related CN110158157B (en) | 2018-02-13 | 2018-02-13 | Method for synthesizing DNA library with fixed length and specific terminal sequence based on template material |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110158157B (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP4065723A4 (en) * | 2019-11-29 | 2024-01-17 | MGI Holdings Co., Limited | Enzymatic synthesis of oligonucleotides |
CN111041070B (en) * | 2019-12-27 | 2022-08-19 | 北京优迅医学检验实验室有限公司 | Method for detecting DNA conversion efficiency constructed by high-throughput sequencing library |
CN114293264A (en) * | 2021-12-21 | 2022-04-08 | 翌圣生物科技(上海)股份有限公司 | Preparation method of enzyme method target sequence random sgRNA library |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1308683A (en) * | 1998-06-29 | 2001-08-15 | 菲洛斯公司 | Methods for generating highly diverse libraries |
CN103343392A (en) * | 2013-07-03 | 2013-10-09 | 中山大学 | MRNA (Messenger Ribonucleic Acid) 3' terminal library as well as construction and sequence measuring methods and application thereof |
CN103667273A (en) * | 2013-12-05 | 2014-03-26 | 北京诺禾致源生物信息科技有限公司 | Double chain adaptor, application thereof and method for constructing end-pairing DNA library |
WO2016196805A1 (en) * | 2015-06-05 | 2016-12-08 | The Regents Of The University Of California | Methods and compositions for generating crispr/cas guide rnas |
WO2017049129A2 (en) * | 2015-09-18 | 2017-03-23 | President And Fellows Of Harvard College | Methods of making guide rna |
WO2017081097A1 (en) * | 2015-11-09 | 2017-05-18 | Ifom Fondazione Istituto Firc Di Oncologia Molecolare | Crispr-cas sgrna library |
CN106987905A (en) * | 2017-04-06 | 2017-07-28 | 深圳华大基因股份有限公司 | A kind of construction method and kit in BRCA1/2 genetic tests library |
CN107365793A (en) * | 2017-06-19 | 2017-11-21 | 百格基因科技(江苏)有限公司 | A kind of method of extensive genome editor suitable for plant |
WO2018013558A1 (en) * | 2016-07-12 | 2018-01-18 | Life Technologies Corporation | Compositions and methods for detecting nucleic acid regions |
-
2018
- 2018-02-13 CN CN201810150519.6A patent/CN110158157B/en not_active Expired - Fee Related
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1308683A (en) * | 1998-06-29 | 2001-08-15 | 菲洛斯公司 | Methods for generating highly diverse libraries |
CN103343392A (en) * | 2013-07-03 | 2013-10-09 | 中山大学 | MRNA (Messenger Ribonucleic Acid) 3' terminal library as well as construction and sequence measuring methods and application thereof |
CN103667273A (en) * | 2013-12-05 | 2014-03-26 | 北京诺禾致源生物信息科技有限公司 | Double chain adaptor, application thereof and method for constructing end-pairing DNA library |
WO2016196805A1 (en) * | 2015-06-05 | 2016-12-08 | The Regents Of The University Of California | Methods and compositions for generating crispr/cas guide rnas |
WO2017049129A2 (en) * | 2015-09-18 | 2017-03-23 | President And Fellows Of Harvard College | Methods of making guide rna |
WO2017081097A1 (en) * | 2015-11-09 | 2017-05-18 | Ifom Fondazione Istituto Firc Di Oncologia Molecolare | Crispr-cas sgrna library |
WO2018013558A1 (en) * | 2016-07-12 | 2018-01-18 | Life Technologies Corporation | Compositions and methods for detecting nucleic acid regions |
CN106987905A (en) * | 2017-04-06 | 2017-07-28 | 深圳华大基因股份有限公司 | A kind of construction method and kit in BRCA1/2 genetic tests library |
CN107365793A (en) * | 2017-06-19 | 2017-11-21 | 百格基因科技(江苏)有限公司 | A kind of method of extensive genome editor suitable for plant |
Non-Patent Citations (5)
Title |
---|
A Molecular Chipper technology for CRISPR sgRNA library generation and functional mapping of noncoding regions.;Cheng J, et al.;《Nat Commun.》;20160330;第7卷;全文 * |
Dense sgRNA Library Construction Using a Molecular Chipper Approach.;Cheng J, et al.;《Bio Protoc.》;20170630;第7卷(第12期);全文 * |
Generation of an arrayed CRISPR-Cas9 library targeting epigenetic regulators: from high-content screens to in vivo assays.;Henser-Brownhill T, et al.;《Epigenetics.》;20171231;第12卷(第2期);全文 * |
Genome-Wide CRISPR/Cas9 Screening for High-Throughput Functional Genomics in Human Cells.;Zhu S, et al.;《Methods Mol Biol.》;20170815;第1656卷;全文 * |
Synthesis of an arrayed sgRNA library targeting the human genome.;Schmidt T, et al.;《Sci Rep.》;20151008;第5卷;全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN110158157A (en) | 2019-08-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2016365720B2 (en) | Methods and compositions for the making and using of guide nucleic acids | |
JP6483249B2 (en) | Isolated oligonucleotides and their use in sequencing nucleic acids | |
ES2900102T3 (en) | Compositions for Nucleic Acid Sequencing in Mixtures | |
EP2373793B1 (en) | Method for assembly of polynucleic acid sequences | |
JP2018501776A (en) | Dislocations that maintain continuity | |
CN110158157B (en) | Method for synthesizing DNA library with fixed length and specific terminal sequence based on template material | |
CN109593757B (en) | Probe and method for enriching target region by using same and applicable to high-throughput sequencing | |
JP2022516821A (en) | Transposome complex with surface-bound complex | |
US11667941B2 (en) | Compositions and methods for template-free geometric enzymatic nucleic acid synthesis | |
WO2018227025A1 (en) | Creation and use of guide nucleic acids | |
JP2019503712A (en) | Methods and compositions for nucleic acid assembly | |
US10364455B2 (en) | Methods and compositions for improving removal of ribosomal RNA from biological samples | |
US20190153528A1 (en) | Method for target specific rna transcription of dna sequences | |
US20200255823A1 (en) | Guide strand library construction and methods of use thereof | |
WO2020172199A1 (en) | Guide strand library construction and methods of use thereof | |
US20150099670A1 (en) | Method of preparing post-bisulfite conversion DNA library | |
CN116043337A (en) | DNA methylation marker screening kit and method | |
CN113462748A (en) | Preparation method and kit of DNA sequencing library | |
CN113528612A (en) | NicE-C technology for detecting chromatin interaction between chromatin open sites | |
EP3615683B1 (en) | Methods for linking polynucleotides | |
AU2017217868B2 (en) | Method for target specific RNA transcription of DNA sequence | |
CN109852668A (en) | A kind of simplified gene order-checking library and its banking process | |
CN114807084A (en) | Mutant Tn5 transposase and kit | |
CN115820824A (en) | Detection method for plant whole genome RNA-chromatin interaction | |
US20210163922A1 (en) | Assembly and error reduction of synthetic genes from oligonucleotides |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20210202 |
|
CF01 | Termination of patent right due to non-payment of annual fee |