WO2021032155A1 - Système d'édition de bases et son procédé d'utilisation - Google Patents
Système d'édition de bases et son procédé d'utilisation Download PDFInfo
- Publication number
- WO2021032155A1 WO2021032155A1 PCT/CN2020/110207 CN2020110207W WO2021032155A1 WO 2021032155 A1 WO2021032155 A1 WO 2021032155A1 CN 2020110207 W CN2020110207 W CN 2020110207W WO 2021032155 A1 WO2021032155 A1 WO 2021032155A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- base editing
- fusion protein
- deaminase
- domain
- nucleic acid
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 90
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 103
- 108020001507 fusion proteins Proteins 0.000 claims abstract description 97
- 102000037865 fusion proteins Human genes 0.000 claims abstract description 97
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 93
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 93
- 108010052875 Adenine deaminase Proteins 0.000 claims abstract description 41
- 230000008685 targeting Effects 0.000 claims abstract description 31
- 108010080611 Cytosine Deaminase Proteins 0.000 claims abstract description 15
- 102000000311 Cytosine Deaminase Human genes 0.000 claims abstract description 15
- 210000004027 cell Anatomy 0.000 claims description 118
- 108090000623 proteins and genes Proteins 0.000 claims description 96
- 241000196324 Embryophyta Species 0.000 claims description 95
- 108091033409 CRISPR Proteins 0.000 claims description 90
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 77
- 108020005004 Guide RNA Proteins 0.000 claims description 65
- 102000004169 proteins and genes Human genes 0.000 claims description 56
- 239000002773 nucleotide Substances 0.000 claims description 54
- 125000003729 nucleotide group Chemical group 0.000 claims description 54
- 238000006467 substitution reaction Methods 0.000 claims description 54
- 108020004414 DNA Proteins 0.000 claims description 47
- 240000007594 Oryza sativa Species 0.000 claims description 46
- 235000007164 Oryza sativa Nutrition 0.000 claims description 46
- 235000009566 rice Nutrition 0.000 claims description 46
- 230000014509 gene expression Effects 0.000 claims description 40
- 210000001938 protoplast Anatomy 0.000 claims description 40
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 claims description 36
- 238000010354 CRISPR gene editing Methods 0.000 claims description 32
- 230000009615 deamination Effects 0.000 claims description 31
- 238000006481 deamination reaction Methods 0.000 claims description 31
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 31
- 101710163270 Nuclease Proteins 0.000 claims description 30
- 230000001419 dependent effect Effects 0.000 claims description 30
- 101710172430 Uracil-DNA glycosylase inhibitor Proteins 0.000 claims description 26
- 229930024421 Adenine Natural products 0.000 claims description 25
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 claims description 25
- 229960000643 adenine Drugs 0.000 claims description 25
- 239000012636 effector Substances 0.000 claims description 23
- 229920001184 polypeptide Polymers 0.000 claims description 22
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 22
- 230000009466 transformation Effects 0.000 claims description 22
- 241000588724 Escherichia coli Species 0.000 claims description 19
- 230000001404 mediated effect Effects 0.000 claims description 19
- 244000062793 Sorghum vulgare Species 0.000 claims description 18
- 229940104302 cytosine Drugs 0.000 claims description 18
- 241000272517 Anseriformes Species 0.000 claims description 16
- 244000038559 crop plants Species 0.000 claims description 15
- 108091035705 tRNA adenine Proteins 0.000 claims description 15
- 240000008042 Zea mays Species 0.000 claims description 12
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims description 12
- 101000964378 Homo sapiens DNA dC->dU-editing enzyme APOBEC-3A Proteins 0.000 claims description 11
- 241000124008 Mammalia Species 0.000 claims description 11
- 235000021307 Triticum Nutrition 0.000 claims description 11
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 claims description 11
- 235000005822 corn Nutrition 0.000 claims description 11
- 108091081021 Sense strand Proteins 0.000 claims description 10
- 102000048646 human APOBEC3A Human genes 0.000 claims description 10
- 229920000742 Cotton Polymers 0.000 claims description 9
- 241000287828 Gallus gallus Species 0.000 claims description 9
- 244000068988 Glycine max Species 0.000 claims description 9
- 235000010469 Glycine max Nutrition 0.000 claims description 9
- 244000299507 Gossypium hirsutum Species 0.000 claims description 9
- 244000020551 Helianthus annuus Species 0.000 claims description 9
- 235000003222 Helianthus annuus Nutrition 0.000 claims description 9
- 241000282412 Homo Species 0.000 claims description 9
- 240000005979 Hordeum vulgare Species 0.000 claims description 9
- 235000007340 Hordeum vulgare Nutrition 0.000 claims description 9
- 235000007688 Lycopersicon esculentum Nutrition 0.000 claims description 9
- 240000003183 Manihot esculenta Species 0.000 claims description 9
- 235000016735 Manihot esculenta subsp esculenta Nutrition 0.000 claims description 9
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 claims description 9
- 235000002637 Nicotiana tabacum Nutrition 0.000 claims description 9
- 240000000111 Saccharum officinarum Species 0.000 claims description 9
- 235000007201 Saccharum officinarum Nutrition 0.000 claims description 9
- 240000003768 Solanum lycopersicum Species 0.000 claims description 9
- 244000061456 Solanum tuberosum Species 0.000 claims description 9
- 235000002595 Solanum tuberosum Nutrition 0.000 claims description 9
- 235000011684 Sorghum saccharatum Nutrition 0.000 claims description 9
- 235000013330 chicken meat Nutrition 0.000 claims description 9
- 235000019713 millet Nutrition 0.000 claims description 9
- 241000283690 Bos taurus Species 0.000 claims description 8
- 241000282472 Canis lupus familiaris Species 0.000 claims description 8
- 241000282693 Cercopithecidae Species 0.000 claims description 8
- 241000282326 Felis catus Species 0.000 claims description 8
- 241000699670 Mus sp. Species 0.000 claims description 8
- 241001494479 Pecora Species 0.000 claims description 8
- 241000700159 Rattus Species 0.000 claims description 8
- 241000282887 Suidae Species 0.000 claims description 8
- 241000700605 Viruses Species 0.000 claims description 8
- 230000000692 anti-sense effect Effects 0.000 claims description 8
- 238000004519 manufacturing process Methods 0.000 claims description 8
- 244000144977 poultry Species 0.000 claims description 8
- 235000013594 poultry meat Nutrition 0.000 claims description 8
- 241000209510 Liliopsida Species 0.000 claims description 7
- 210000004899 c-terminal region Anatomy 0.000 claims description 7
- 241001233957 eudicotyledons Species 0.000 claims description 7
- 230000004927 fusion Effects 0.000 claims description 7
- 102100022433 Single-stranded DNA cytosine deaminase Human genes 0.000 claims description 6
- 101710143275 Single-stranded DNA cytosine deaminase Proteins 0.000 claims description 6
- 108091027544 Subgenomic mRNA Proteins 0.000 claims description 6
- 238000001890 transfection Methods 0.000 claims description 6
- 241000589158 Agrobacterium Species 0.000 claims description 5
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 4
- 230000030648 nucleus localization Effects 0.000 claims description 4
- MZZYGYNZAOVRTG-UHFFFAOYSA-N 2-hydroxy-n-(1h-1,2,4-triazol-5-yl)benzamide Chemical compound OC1=CC=CC=C1C(=O)NC1=NC=NN1 MZZYGYNZAOVRTG-UHFFFAOYSA-N 0.000 claims description 3
- 102000012758 APOBEC-1 Deaminase Human genes 0.000 claims description 3
- 108010079649 APOBEC-1 Deaminase Proteins 0.000 claims description 3
- 108010004483 APOBEC-3G Deaminase Proteins 0.000 claims description 3
- 102100038076 DNA dC->dU-editing enzyme APOBEC-3G Human genes 0.000 claims description 3
- 241000702421 Dependoparvovirus Species 0.000 claims description 3
- 101000658622 Homo sapiens Testis-specific Y-encoded-like protein 2 Proteins 0.000 claims description 3
- 241000713666 Lentivirus Species 0.000 claims description 3
- 102100034917 Testis-specific Y-encoded-like protein 2 Human genes 0.000 claims description 3
- 241000700618 Vaccinia virus Species 0.000 claims description 3
- 208000036142 Viral infection Diseases 0.000 claims description 3
- 229910000389 calcium phosphate Inorganic materials 0.000 claims description 3
- 239000001506 calcium phosphate Substances 0.000 claims description 3
- 235000011010 calcium phosphates Nutrition 0.000 claims description 3
- 239000003795 chemical substances by application Substances 0.000 claims description 3
- 238000004520 electroporation Methods 0.000 claims description 3
- 238000001638 lipofection Methods 0.000 claims description 3
- 238000000520 microinjection Methods 0.000 claims description 3
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 claims description 3
- 241000701161 unidentified adenovirus Species 0.000 claims description 3
- 230000009385 viral infection Effects 0.000 claims description 3
- 102100035102 E3 ubiquitin-protein ligase MYCBP2 Human genes 0.000 claims 1
- 240000004658 Medicago sativa Species 0.000 claims 1
- 244000061176 Nicotiana tabacum Species 0.000 claims 1
- 244000098338 Triticum aestivum Species 0.000 claims 1
- 230000035772 mutation Effects 0.000 description 77
- 235000001014 amino acid Nutrition 0.000 description 58
- 235000018102 proteins Nutrition 0.000 description 53
- 150000001413 amino acids Chemical class 0.000 description 35
- 201000010099 disease Diseases 0.000 description 28
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 28
- 108091028043 Nucleic acid sequence Proteins 0.000 description 22
- 230000000694 effects Effects 0.000 description 22
- 241000894007 species Species 0.000 description 20
- 210000001519 tissue Anatomy 0.000 description 17
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 16
- 108091026890 Coding region Proteins 0.000 description 14
- 230000004048 modification Effects 0.000 description 13
- 238000012986 modification Methods 0.000 description 13
- 230000001105 regulatory effect Effects 0.000 description 13
- 239000013598 vector Substances 0.000 description 13
- 108020004705 Codon Proteins 0.000 description 12
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 12
- 230000009418 agronomic effect Effects 0.000 description 12
- 108091093088 Amplicon Proteins 0.000 description 11
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 10
- 241000209140 Triticum Species 0.000 description 10
- 230000006870 function Effects 0.000 description 10
- 239000000203 mixture Substances 0.000 description 10
- 230000008859 change Effects 0.000 description 9
- 241000219823 Medicago Species 0.000 description 8
- 241000208125 Nicotiana Species 0.000 description 8
- 238000002703 mutagenesis Methods 0.000 description 8
- 231100000350 mutagenesis Toxicity 0.000 description 8
- 238000012163 sequencing technique Methods 0.000 description 8
- 238000013518 transcription Methods 0.000 description 8
- 230000035897 transcription Effects 0.000 description 8
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 7
- 108091033380 Coding strand Proteins 0.000 description 7
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 7
- 230000000295 complement effect Effects 0.000 description 7
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 7
- 238000000338 in vitro Methods 0.000 description 7
- 102000040430 polynucleotide Human genes 0.000 description 7
- 108091033319 polynucleotide Proteins 0.000 description 7
- 239000002157 polynucleotide Substances 0.000 description 7
- 238000012216 screening Methods 0.000 description 7
- 238000013519 translation Methods 0.000 description 7
- NYHBQMYGNKIUIF-UUOKFMHZSA-N Guanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O NYHBQMYGNKIUIF-UUOKFMHZSA-N 0.000 description 6
- 238000010362 genome editing Methods 0.000 description 6
- 238000011065 in-situ storage Methods 0.000 description 6
- 239000008194 pharmaceutical composition Substances 0.000 description 6
- 102000053602 DNA Human genes 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 5
- 239000004009 herbicide Substances 0.000 description 5
- 229910052757 nitrogen Inorganic materials 0.000 description 5
- 230000001131 transforming effect Effects 0.000 description 5
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 4
- 108700010070 Codon Usage Proteins 0.000 description 4
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 4
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 4
- 229930010555 Inosine Natural products 0.000 description 4
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 4
- 108700026244 Open Reading Frames Proteins 0.000 description 4
- 108020004682 Single-Stranded DNA Proteins 0.000 description 4
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 4
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 4
- 229960005305 adenosine Drugs 0.000 description 4
- 238000003776 cleavage reaction Methods 0.000 description 4
- 230000005782 double-strand break Effects 0.000 description 4
- 238000012239 gene modification Methods 0.000 description 4
- 230000002363 herbicidal effect Effects 0.000 description 4
- 229960003786 inosine Drugs 0.000 description 4
- 108020004999 messenger RNA Proteins 0.000 description 4
- 229930182817 methionine Natural products 0.000 description 4
- 230000033607 mismatch repair Effects 0.000 description 4
- 239000013612 plasmid Substances 0.000 description 4
- 229920000642 polymer Polymers 0.000 description 4
- 230000008929 regeneration Effects 0.000 description 4
- 238000011069 regeneration method Methods 0.000 description 4
- 230000037432 silent mutation Effects 0.000 description 4
- 230000001052 transient effect Effects 0.000 description 4
- 102000000452 Acetyl-CoA carboxylase Human genes 0.000 description 3
- 108010016219 Acetyl-CoA carboxylase Proteins 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 238000010453 CRISPR/Cas method Methods 0.000 description 3
- MIKUYHXYGGJMLM-GIMIYPNGSA-N Crotonoside Natural products C1=NC2=C(N)NC(=O)N=C2N1[C@H]1O[C@@H](CO)[C@H](O)[C@@H]1O MIKUYHXYGGJMLM-GIMIYPNGSA-N 0.000 description 3
- NYHBQMYGNKIUIF-UHFFFAOYSA-N D-guanosine Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(CO)C(O)C1O NYHBQMYGNKIUIF-UHFFFAOYSA-N 0.000 description 3
- 208000035240 Disease Resistance Diseases 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 3
- 108020004485 Nonsense Codon Proteins 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- 108091081024 Start codon Proteins 0.000 description 3
- 241000193996 Streptococcus pyogenes Species 0.000 description 3
- 108091028113 Trans-activating crRNA Proteins 0.000 description 3
- 108020004566 Transfer RNA Proteins 0.000 description 3
- 102000006943 Uracil-DNA Glycosidase Human genes 0.000 description 3
- 108010072685 Uracil-DNA Glycosidase Proteins 0.000 description 3
- 230000002159 abnormal effect Effects 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 239000000539 dimer Substances 0.000 description 3
- 235000013399 edible fruits Nutrition 0.000 description 3
- 239000003623 enhancer Substances 0.000 description 3
- 239000013604 expression vector Substances 0.000 description 3
- 230000005017 genetic modification Effects 0.000 description 3
- 235000013617 genetically modified food Nutrition 0.000 description 3
- 230000012010 growth Effects 0.000 description 3
- 229940029575 guanosine Drugs 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 230000010354 integration Effects 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 230000037434 nonsense mutation Effects 0.000 description 3
- 238000005457 optimization Methods 0.000 description 3
- 210000000056 organ Anatomy 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 229920006395 saturated elastomer Polymers 0.000 description 3
- 125000006850 spacer group Chemical group 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 2
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 2
- 208000024827 Alzheimer disease Diseases 0.000 description 2
- 206010003805 Autism Diseases 0.000 description 2
- 208000020706 Autistic disease Diseases 0.000 description 2
- 102220484559 C-type lectin domain family 4 member A_H36L_mutation Human genes 0.000 description 2
- 101000909256 Caldicellulosiruptor bescii (strain ATCC BAA-1888 / DSM 6725 / Z-1320) DNA polymerase I Proteins 0.000 description 2
- 208000024172 Cardiovascular disease Diseases 0.000 description 2
- 108010031325 Cytidine deaminase Proteins 0.000 description 2
- 238000007400 DNA extraction Methods 0.000 description 2
- 101710177611 DNA polymerase II large subunit Proteins 0.000 description 2
- 101710184669 DNA polymerase II small subunit Proteins 0.000 description 2
- 230000007018 DNA scission Effects 0.000 description 2
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 2
- AEMRFAOFKBGASW-UHFFFAOYSA-N Glycolic acid Chemical compound OCC(O)=O AEMRFAOFKBGASW-UHFFFAOYSA-N 0.000 description 2
- 102000029812 HNH nuclease Human genes 0.000 description 2
- 108060003760 HNH nuclease Proteins 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- 208000026350 Inborn Genetic disease Diseases 0.000 description 2
- 206010061218 Inflammation Diseases 0.000 description 2
- 108091092195 Intron Proteins 0.000 description 2
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 108091005461 Nucleic proteins Proteins 0.000 description 2
- 102220558133 Olfactory receptor 2A1/2A42_I156F_mutation Human genes 0.000 description 2
- 241001426452 Oryza sativa alphaendornavirus Species 0.000 description 2
- 208000018737 Parkinson disease Diseases 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 101000902592 Pyrococcus furiosus (strain ATCC 43587 / DSM 3638 / JCM 8422 / Vc1) DNA polymerase Proteins 0.000 description 2
- 108020004511 Recombinant DNA Proteins 0.000 description 2
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 2
- 206010064930 age-related macular degeneration Diseases 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 230000033590 base-excision repair Effects 0.000 description 2
- 235000013339 cereals Nutrition 0.000 description 2
- 230000001276 controlling effect Effects 0.000 description 2
- 238000012350 deep sequencing Methods 0.000 description 2
- 239000003937 drug carrier Substances 0.000 description 2
- 210000005069 ears Anatomy 0.000 description 2
- 239000012634 fragment Substances 0.000 description 2
- 208000016361 genetic disease Diseases 0.000 description 2
- 235000003869 genetically modified organism Nutrition 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 239000000833 heterodimer Substances 0.000 description 2
- 230000002779 inactivation Effects 0.000 description 2
- 230000004054 inflammatory process Effects 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 150000002632 lipids Chemical class 0.000 description 2
- 208000002780 macular degeneration Diseases 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 239000000178 monomer Substances 0.000 description 2
- 210000004940 nucleus Anatomy 0.000 description 2
- 235000021049 nutrient content Nutrition 0.000 description 2
- 235000015097 nutrients Nutrition 0.000 description 2
- 238000003976 plant breeding Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000001172 regenerating effect Effects 0.000 description 2
- 102200012576 rs111033648 Human genes 0.000 description 2
- 102200001270 rs121909081 Human genes 0.000 description 2
- 102200018639 rs122458142 Human genes 0.000 description 2
- 102220182843 rs182603751 Human genes 0.000 description 2
- 102220138225 rs759718991 Human genes 0.000 description 2
- 102220089709 rs869320709 Human genes 0.000 description 2
- 201000000980 schizophrenia Diseases 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 230000009261 transgenic effect Effects 0.000 description 2
- 241000701447 unidentified baculovirus Species 0.000 description 2
- 229940035893 uracil Drugs 0.000 description 2
- DIGQNXIGRZPYDK-WKSCXVIASA-N (2R)-6-amino-2-[[2-[[(2S)-2-[[2-[[(2R)-2-[[(2S)-2-[[(2R,3S)-2-[[2-[[(2S)-2-[[2-[[(2S)-2-[[(2S)-2-[[(2R)-2-[[(2S,3S)-2-[[(2R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[(2S)-2-[[(2R)-2-[[2-[[2-[[2-[(2-amino-1-hydroxyethylidene)amino]-3-carboxy-1-hydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxypropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxybutylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-sulfanylpropylidene]amino]-1-hydroxyethylidene]amino]hexanoic acid Chemical compound C[C@@H]([C@@H](C(=N[C@@H](CS)C(=N[C@@H](C)C(=N[C@@H](CO)C(=NCC(=N[C@@H](CCC(=N)O)C(=NC(CS)C(=N[C@H]([C@H](C)O)C(=N[C@H](CS)C(=N[C@H](CO)C(=NCC(=N[C@H](CS)C(=NCC(=N[C@H](CCCCN)C(=O)O)O)O)O)O)O)O)O)O)O)O)O)O)O)N=C([C@H](CS)N=C([C@H](CO)N=C([C@H](CO)N=C([C@H](C)N=C(CN=C([C@H](CO)N=C([C@H](CS)N=C(CN=C(C(CS)N=C(C(CC(=O)O)N=C(CN)O)O)O)O)O)O)O)O)O)O)O)O DIGQNXIGRZPYDK-WKSCXVIASA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- YKBGVTZYEHREMT-KVQBGUIXSA-N 2'-deoxyguanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](CO)O1 YKBGVTZYEHREMT-KVQBGUIXSA-N 0.000 description 1
- CKTSBUTUHBMZGZ-SHYZEUOFSA-N 2'‐deoxycytidine Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 CKTSBUTUHBMZGZ-SHYZEUOFSA-N 0.000 description 1
- JTBBWRKSUYCPFY-UHFFFAOYSA-N 2,3-dihydro-1h-pyrimidin-4-one Chemical compound O=C1NCNC=C1 JTBBWRKSUYCPFY-UHFFFAOYSA-N 0.000 description 1
- CKTSBUTUHBMZGZ-ULQXZJNLSA-N 4-amino-1-[(2r,4s,5r)-4-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-5-tritiopyrimidin-2-one Chemical compound O=C1N=C(N)C([3H])=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 CKTSBUTUHBMZGZ-ULQXZJNLSA-N 0.000 description 1
- 101710169336 5'-deoxyadenosine deaminase Proteins 0.000 description 1
- 230000005730 ADP ribosylation Effects 0.000 description 1
- 241000093740 Acidaminococcus sp. Species 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 102000055025 Adenosine deaminases Human genes 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 102100026189 Beta-galactosidase Human genes 0.000 description 1
- 239000002028 Biomass Substances 0.000 description 1
- 241000743774 Brachypodium Species 0.000 description 1
- 108010075254 C-Peptide Proteins 0.000 description 1
- 101150069031 CSN2 gene Proteins 0.000 description 1
- 102000007132 Carboxyl and Carbamoyl Transferases Human genes 0.000 description 1
- 108010072957 Carboxyl and Carbamoyl Transferases Proteins 0.000 description 1
- 241000701489 Cauliflower mosaic virus Species 0.000 description 1
- 108010077544 Chromatin Proteins 0.000 description 1
- 102220584721 Coordinator of PRMT5 and differentiation stimulator_P48A_mutation Human genes 0.000 description 1
- 101150074775 Csf1 gene Proteins 0.000 description 1
- 102000005381 Cytidine Deaminase Human genes 0.000 description 1
- 241000701022 Cytomegalovirus Species 0.000 description 1
- 102100040263 DNA dC->dU-editing enzyme APOBEC-3A Human genes 0.000 description 1
- 238000012270 DNA recombination Methods 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- CKTSBUTUHBMZGZ-UHFFFAOYSA-N Deoxycytidine Natural products O=C1N=C(N)C=CN1C1OC(CO)C(O)C1 CKTSBUTUHBMZGZ-UHFFFAOYSA-N 0.000 description 1
- 206010012335 Dependence Diseases 0.000 description 1
- 206010059866 Drug resistance Diseases 0.000 description 1
- 241000214054 Equine rhinitis A virus Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 241000710198 Foot-and-mouth disease virus Species 0.000 description 1
- 241000588088 Francisella tularensis subsp. novicida U112 Species 0.000 description 1
- 108010002537 Fruit Proteins Proteins 0.000 description 1
- 206010020649 Hyperkeratosis Diseases 0.000 description 1
- 241000689670 Lachnospiraceae bacterium ND2006 Species 0.000 description 1
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 102000003792 Metallothionein Human genes 0.000 description 1
- 108090000157 Metallothionein Proteins 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 240000008467 Oryza sativa Japonica Group Species 0.000 description 1
- 108010064851 Plant Proteins Proteins 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 241000714474 Rous sarcoma virus Species 0.000 description 1
- 241000191967 Staphylococcus aureus Species 0.000 description 1
- 238000010459 TALEN Methods 0.000 description 1
- 241001420369 Thosea Species 0.000 description 1
- 108010043645 Transcription Activator-Like Effector Nucleases Proteins 0.000 description 1
- 102100037111 Uracil-DNA glycosylase Human genes 0.000 description 1
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 238000007792 addition Methods 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 230000037429 base substitution Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- 230000027455 binding Effects 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 210000003763 chloroplast Anatomy 0.000 description 1
- 210000003483 chromatin Anatomy 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 101150055601 cops2 gene Proteins 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- QDOXWKRWXJOMAK-UHFFFAOYSA-N dichromium trioxide Chemical compound O=[Cr]O[Cr]=O QDOXWKRWXJOMAK-UHFFFAOYSA-N 0.000 description 1
- 230000006806 disease prevention Effects 0.000 description 1
- 230000003828 downregulation Effects 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 206010013663 drug dependence Diseases 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000002532 enzyme inhibitor Substances 0.000 description 1
- 229940125532 enzyme inhibitor Drugs 0.000 description 1
- 231100000221 frame shift mutation induction Toxicity 0.000 description 1
- 230000037433 frameshift Effects 0.000 description 1
- 230000006251 gamma-carboxylation Effects 0.000 description 1
- 238000012214 genetic breeding Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 230000007614 genetic variation Effects 0.000 description 1
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 239000000710 homodimer Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 230000033444 hydroxylation Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 235000009973 maize Nutrition 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- 230000002438 mitochondrial effect Effects 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 230000009437 off-target effect Effects 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 238000010647 peptide synthesis reaction Methods 0.000 description 1
- 230000001863 plant nutrition Effects 0.000 description 1
- 235000021118 plant-derived protein Nutrition 0.000 description 1
- 210000002706 plastid Anatomy 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000002028 premature Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 102220273513 rs373435521 Human genes 0.000 description 1
- 102220335283 rs574731221 Human genes 0.000 description 1
- 102220082375 rs863224226 Human genes 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 208000011117 substance-related disease Diseases 0.000 description 1
- 230000019635 sulfation Effects 0.000 description 1
- 238000005670 sulfation reaction Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000012225 targeting induced local lesions in genomes Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 230000003827 upregulation Effects 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K19/00—Hybrid peptides, i.e. peptides covalently bound to nucleic acids, or non-covalently bound protein-protein complexes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/62—DNA sequences coding for fusion proteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
Definitions
- the invention belongs to the field of genetic engineering. Specifically, the present invention relates to a base editing system and its use method. More specifically, the present invention relates to a base editing system and method capable of generating de novo mutations, such as saturation mutations, in situ in an organism.
- the base editing system and method are based on the inclusion of deaminase and CRISPR.
- the base editing fusion protein of the effector protein is based on the inclusion of deaminase and CRISPR.
- CRISPR/Cas Clusters of regularly spaced short palindromic repeats and their related (CRISPR/Cas) systems can produce double strand breaks (DSB) at endogenous target sites.
- DSB double strand breaks
- CRISPR/Cas Clusters of regularly spaced short palindromic repeats and their related (CRISPR/Cas) systems can produce double strand breaks (DSB) at endogenous target sites.
- DSB double strand breaks
- a base editing fusion protein which comprises a nucleic acid targeting domain, a cytosine deamination domain, and an adenine deamination domain.
- Item 2 The base editing fusion protein of Item 1, wherein the nucleic acid targeting domain comprises at least one CRISPR effector protein polypeptide.
- the base editing fusion protein of item 2 wherein the CRISPR effector protein is Cas9 nuclease or a functional variant thereof, preferably, the CRISPR effector protein is nuclease-inactivated Cas9, more preferably, The nuclease-inactivated Cas9 includes the amino acid sequence shown in SEQ ID NO: 2, and most preferably, the nuclease-inactivated Cas9 includes the amino acid sequence shown in SEQ ID NO: 3.
- Item 4 The base editing fusion protein of any one of items 1 to 3, wherein the cytosine deaminase domain comprises at least one cytosine deaminase polypeptide.
- Item 5 The base editing fusion protein of item 4, wherein the cytosine deaminase is selected from the group consisting of APOBEC1 deaminase, activation-induced cytidine deaminase (AID), APOBEC3G, CDA1, human APOBEC3A deaminase, or Their functional variants.
- cytosine deaminase is selected from the group consisting of APOBEC1 deaminase, activation-induced cytidine deaminase (AID), APOBEC3G, CDA1, human APOBEC3A deaminase, or Their functional variants.
- Item 6 The base editing fusion protein of Item 5, wherein the cytosine deaminase is human APOBEC3A deaminase or a functional variant thereof, for example, the human APOBEC3A deaminase comprises SEQ ID NO: 4 Amino acid sequence.
- Item 7 The base editing fusion protein of any one of items 1 to 6, wherein the adenine deaminase domain comprises at least one DNA-dependent adenine deaminase polypeptide.
- Item 8 The base editing fusion protein of item 7, wherein the DNA-dependent adenine deaminase is derived from wild-type E. coli tRNA adenine deaminase TadA (ecTadA), for example, the DNA-dependent adenine deaminase Aminase includes the amino acid sequence shown in SEQ ID NO: 6.
- Item 9 The base editing fusion protein of Item 7 or 8, wherein the adenine deaminase domain comprises two DNA-dependent adenine deaminase.
- Item 10 The base editing fusion protein of item 7, wherein the adenine deaminase domain further comprises wild-type E. coli tRNA adenine deaminase TadA (ecTadA) fused to the DNA-dependent adenine deaminase
- the DNA-dependent adenine deaminase is fused to the C-terminus of the wild-type E. coli tRNA adenine deaminase TadA (ecTadA).
- Item 11 The base editing fusion protein of Item 7 or 8, wherein the adenine deamination domain comprises the amino acid sequence shown in SEQ ID NO: 7 or 8.
- Item 12 The base editing fusion protein of any one of items 1-11, wherein the nucleic acid targeting domain, the cytosine deaminization domain and the adenine deaminization domain are fused via a linker, for example,
- the linker comprises an amino acid sequence selected from SEQ ID NO: 9-11.
- Item 13 The base editing fusion protein of any one of items 1-12, wherein the base editing fusion protein comprises from N-terminal to C-terminal in the following order: cytosine deaminization domain, adenine deaminization structure A domain and a nucleic acid targeting domain, or the base editing fusion protein comprises an adenine deamination domain, a cytosine deamination domain, and a nucleic acid targeting domain in the following order from N-terminal to C-terminal.
- UFI uracil DNA glycosylase inhibitor
- the enzyme inhibitor (UGI) comprises the amino acid sequence shown in SEQ ID NO: 12.
- Item 15 The base editing fusion protein of any one of items 1-14, wherein the base editing fusion protein further comprises one or more nuclear localization sequences (NLS).
- NLS nuclear localization sequences
- Item 16 The base editing fusion protein of Item 1, wherein the base editing fusion protein comprises the amino acid sequence shown in any one of SEQ ID NOs: 13-19.
- a base editing system for modifying target nucleic acid regions in the genome which comprises:
- the at least one guide RNA is directed to at least one target sequence in the target nucleic acid region.
- Item 18 The base editing system of Item 17, wherein the guide RNA is sgRNA, for example, the sgRNA includes the scaffold sequence shown in SEQ ID NO: 27 or SEQ ID NO: 28.
- Item 19 The base editing system of Item 17 or 18, wherein the target sequence targeted by the guide RNA contains a PAM sequence at the 3'end, such as 5'-NGG-3' or 5'-NG-3'.
- Item 20 The base editing system of any one of Items 17-19, wherein the at least one guide RNA is directed to a target sequence located on the sense strand and/or antisense strand in the target nucleic acid region of the cell genome.
- Item 21 The base editing system of any one of items 17-20, wherein the nucleotide sequence encoding the base editing fusion protein is codon-optimized for the organism whose genome is to be modified.
- Item 22 The base editing system of Item 21, wherein the base editing fusion protein is encoded by any nucleotide sequence shown in SEQ ID NO: 20-26.
- Item 23 A method of producing at least one genetically modified cell, comprising introducing the base editing system of any one of items 17-22 into at least one of the cells, thereby resulting in a target nucleic acid region in the at least one cell One or more nucleotide substitutions within.
- Item 24 The method of Item 23, further comprising the step of selecting cells having the desired one or more nucleotide substitutions from the at least one cell.
- Item 25 The method of Item 23 or 24, wherein the base editing system is introduced into the cell by a method selected from the group consisting of calcium phosphate transfection, protoplast fusion, electroporation, liposome transfection, microinjection, viral infection (such as Baculovirus, vaccinia virus, adenovirus, adeno-associated virus, lentivirus and other viruses), gene bombardment, PEG-mediated transformation of protoplasts, and Agrobacterium-mediated transformation.
- viral infection such as Baculovirus, vaccinia virus, adenovirus, adeno-associated virus, lenti
- Item 26 The method of any one of items 23-25, wherein the cells are derived from mammals such as humans, mice, rats, monkeys, dogs, pigs, sheep, cattle, cats; poultry such as chickens, ducks, and geese; Plants, including monocots and dicots, preferably crop plants such as wheat, rice, corn, soybean, sunflower, sorghum, rape, alfalfa, cotton, barley, millet, sugar cane, tomato, tobacco, cassava and potato.
- mammals such as humans, mice, rats, monkeys, dogs, pigs, sheep, cattle, cats
- poultry such as chickens, ducks, and geese
- Plants including monocots and dicots, preferably crop plants such as wheat, rice, corn, soybean, sunflower, sorghum, rape, alfalfa, cotton, barley, millet, sugar cane, tomato, tobacco, cassava and potato.
- Item 27 A method for producing a genetically modified plant, comprising introducing the base editing system of any one of items 17-22 into at least one of the plants, thereby resulting in a target nucleic acid region in the genome of the at least one plant One or more nucleotide substitutions within.
- Item 28 The method of Item 27, further comprising selecting plants having the desired one or more nucleotide substitutions from the at least one plant.
- Item 29 The method of Item 27 or 28, wherein the base editing system is introduced into the plant by a method selected from the group consisting of: gene bombardment, PEG-mediated transformation of protoplasts, Agrobacterium-mediated transformation, plant virus-mediated Transformation, pollen tube passage method and ovary injection method.
- Item 30 The method of Item 28, wherein the importing is performed in the absence of selective pressure.
- Item 31 The method of any one of items 27-30, wherein the introducing comprises transforming the base editing system into an isolated plant cell or tissue, and then regenerating the transformed plant cell or tissue into a whole plant Preferably, the regeneration is performed in the absence of selective pressure.
- Item 32 The method of any one of items 27-30, wherein the introducing comprises transforming the base editing system into leaves, stem tips, pollen tubes, young ears, or hypocotyls on intact plants.
- Item 33 The method of any one of items 27-30, wherein the expression construct is an in vitro transcribed RNA molecule.
- Item 34 The method of any one of items 27 to 33, wherein the genetically modified plant does not contain an exogenous polynucleotide integrated into its genome.
- Item 35 The method of any one of items 27 to 34, wherein the modified target nucleic acid region is related to plant traits such as agronomic traits.
- Item 36 The method of any one of items 27 to 35, further comprising the step of screening plants for desired traits such as agronomic traits.
- Item 37 The method of any one of items 27 to 36, further comprising obtaining progeny of the genetically modified plant.
- a method for plant breeding comprising combining the genetically modified first plant that contains one or more nucleotide substitutions in the target nucleic acid region obtained by the method of any one of items 27-37 and does not contain the A second plant with one or more nucleotide substitutions is crossed, thereby introducing the one or more nucleotide substitutions into the second plant.
- the genetically modified first plant has desired traits such as agronomic traits .
- Item 39 A method for in-situ saturation mutation of an endogenous target nucleic acid region in a cell or organism to obtain a mutation of interest in the target nucleic acid region, comprising
- the base editing system of any one of items 17-22 is introduced into the population of said cells or organisms, resulting in one or more mutations in the endogenous target nucleic acid region of the cells or organisms of said population;
- Item 40 The method of Item 39, wherein the base editing system comprises a plurality of guide RNAs and/or a plurality of expression constructs containing nucleotide sequences encoding the plurality of guide RNAs, preferably, the plurality of The guide RNA targets different target sequences within the target nucleic acid region.
- Item 41 The method of Item 39 or 40, wherein the plurality of guide RNAs comprise 2 to 250 or more, such as 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 50, 75, 100, 150, 200, 250, 300 or more guide RNAs.
- Item 42 The method of any one of items 39-41, wherein the target sequences and/or complementary sequences targeted by at least some of the plurality of guide RNAs partially overlap each other and/or are adjacent to each other.
- Item 43 The method of any one of items 39-42, wherein the target nucleic acid region has a length of about 20bp to about 10000bp or longer, for example about 20bp, about 40bp, about 60bp, about 80bp, about 100bp, about 120bp, About 140bp, about 160bp, about 180bp, about 200bp, about 300bp, about 400bp, about 500bp, about 1000bp, about 1500bp, about 2000bp, about 3000bp, about 4000bp, about 5000bp, about 6000bp or longer; or, wherein said The target nucleic acid sequence encodes an amino acid sequence of about 5 to about 2000 amino acids in length, for example, it can encode about 5, about 10, about 15, about 20, about 25, about 30, or about 35.
- Item 44 The method of any one of items 39 to 43, wherein the target sequence targeted by the plurality of guide RNAs substantially covers the target nucleic acid region.
- Item 45 The method of any one of items 39-44, wherein at least a portion of the plurality of guide RNAs target the sense strand of the target nucleic acid region.
- Item 46 The method of any one of items 39-45, wherein at least a portion of the plurality of guide RNAs target the antisense strand of the target nucleic acid region.
- Item 47 The method of any one of items 39 to 46, wherein the plurality of guide RNAs and/or a plurality of expression constructs containing nucleotide sequences encoding the plurality of guide RNAs are each independently introduced into the cell Or a population of organisms; or the plurality of guide RNAs and/or a plurality of expression constructs containing nucleotide sequences encoding the plurality of guide RNAs are introduced into the population of cells or organisms in combination with each other.
- Item 48 The method of any one of items 39-47, wherein the mutation is a nucleotide substitution, such as a C to T substitution, A to G substitution, G to A substitution, or T to C substitution.
- a nucleotide substitution such as a C to T substitution, A to G substitution, G to A substitution, or T to C substitution.
- Item 49 The method of any one of items 39-48, wherein the target nucleic acid region is located in a coding region of a protein, for example, the target nucleic acid region encodes a functionally related motif or domain of the protein.
- Item 50 The method of Item 49, wherein the mutation in the target nucleic acid region causes an amino acid substitution in the amino acid sequence of the protein, preferably, the mutation causes a change in the function of the protein.
- Item 51 The method of any one of items 39-50, wherein the target nucleic acid region is related to a trait of the cell or organism, for example, a mutation in the target nucleic acid region causes the trait of the cell or organism change.
- Item 52 The method of any one of items 39-51, in step iii), screening cells or organisms with mutations of interest by screening cells or organisms with changes in traits of interest, for example, the induction
- the trait change of interest is selected from increased growth rate, increased yield, increased nutrient content, increased cold resistance, increased drought resistance, increased insect resistance, increased disease resistance, and increased herbicide resistance.
- Item 53 The method of any one of items 39-52, wherein the cells are derived from mammals such as humans, mice, rats, monkeys, dogs, pigs, sheep, cattle, cats; poultry such as chickens, ducks, and geese; plants , Including monocot plants and dicot plants, preferably crop plants, including but not limited to wheat, rice, corn, soybean, sunflower, sorghum, rape, alfalfa, cotton, barley, millet, sugar cane, tomato, tobacco, cassava and potato,
- the cell is a plant cell, more preferably a crop plant cell, more preferably a rice cell.
- Item 54 The method of any one of items 39-52, wherein the organism is selected from mammals such as humans, mice, rats, monkeys, dogs, pigs, sheep, cattle, cats; poultry such as chickens, ducks, and geese ; Plants, including monocots and dicots, preferably crop plants, including but not limited to wheat, rice, corn, soybean, sunflower, sorghum, rape, alfalfa, cotton, barley, millet, sugarcane, tomato, tobacco, cassava and potato,
- the organism is a plant, more preferably a crop plant, more preferably rice.
- Item 55 A method of treating a disease in a subject in need thereof, comprising delivering to the subject an effective amount of the base editing system of any one of items 17-22 to modify genes related to the disease.
- Item 56 Use of the base editing system of any one of items 17-22 in the preparation of a pharmaceutical composition for treating a disease in a subject in need, wherein the base editing system is used to modify the disease Related genes.
- Item 57 A pharmaceutical composition for treating a disease in a subject in need, comprising the base editing system of any one of items 17-22, and optionally a pharmaceutically acceptable carrier, wherein the base editing The system is used to modify genes related to the disease.
- Item 58 The method of item 55, the use of item 56 or the pharmaceutical composition of item 57, wherein the subject is a mammal, such as a human.
- Item 59 The method of item 55, the use of item 56 or the pharmaceutical composition of item 57, wherein the disease is selected from the group consisting of tumor, inflammation, Parkinson's disease, cardiovascular disease, Alzheimer's disease, autism, and drug composition Addiction, age-related macular degeneration, schizophrenia, genetic diseases.
- Item 60 A kit comprising the base editing fusion protein of any one of items 1-16 and/or an expression construct containing a nucleotide sequence encoding the base editing fusion protein, or comprising item 17 -22 base editing system.
- STEME performs base editing through the fusion of cytidine and adenine deaminase.
- FIG. 1 pOsU3-esgRNA expression vector.
- the present invention preferably uses esgRNA.
- Figure 3 Editing efficiency of STEME-1, STEME-2, STEME-3 and STEME-4 in protoplasts.
- (a) Comparison of C>T base editing frequency between A3A-PBE and four STEME constructs (n 3).
- (b) Comparison of A>G base editing frequency between PABE-7 and four STEME constructs (n 3).
- Untreated protoplast samples were used as controls. The values and error bars reflect the mean ⁇ s.e.m of three independent biological replicates.
- FIG. 1 Activity and product purity of STEME-1, STEME-2, STEME-3 and STEME-4 in rice protoplasts.
- (a)-(f) are OsAAT, OsACC, OsCDC48, OsDEP1, OsEV and OsOD target sequences, respectively.
- Figure 5 Comparison of base editing efficiency of STEME-1, STEME-5 and STEME-6 in rice protoplasts.
- (a) The structure of STEME-5 and STEME-6. ecTadA7.10: evolved E. coli TadA; aa, amino acid; XTEN: 16 amino acid linker.
- (b) Comparison of C>T base editing frequency of A3A-PBE, STEME-1, STEME-5 and STEME-6 constructs.
- STEME-NG is used for saturation de novo mutation in rice protoplasts.
- Figure 8. Design of NG PAM targets at different rice sites.
- STEME-NG can widely edit target sequences with NGA, NGT, NGC or NGG PAM.
- NGG PAM and STEME-1 of untreated protoplast samples were used as controls. The values and error bars reflect the mean ⁇ s.e.m of three independent biological replicates.
- Figure 10 The activity and product purity of STEME-NG in rice protoplasts.
- FIG. 11 The basic editing efficiency of C>T and A>G in rice protoplasts by A3A-PBE-NG and PABE7-NG, respectively. Both A3A-PBE-NG(a) and PABE7-NG(b) have extensive capabilities to edit NGA, NGT, NGC or NGG PAM targets. Untreated protoplast samples were used as controls. The values and error bars reflect the mean ⁇ s.e.m of three independent biological replicates.
- FIG. 12 De novo saturation mutation of key domains of OsACC protein with STEME-NG.
- (c) The heat map shows the conversion saturation of STEME-NG on the 56 amino acids involved in (b) in rice protoplasts. Statistics are collected for silent mutation, missense mutation and nonsense mutation.
- FIG. 13 The 168bp of the CT domain of OsACC saturation mutation by A3A-PBE-NG in rice protoplasts.
- A3A-PBE-NG (a) and untreated control (b) are shown.
- sgRNAs convert the same cytidine and guanosine
- the values and error bars reflect the mean ⁇ s.e.m of three independent biological replicates.
- the term “and/or” encompasses all combinations of items connected by the term, and should be treated as if each combination has been individually listed herein.
- “A and/or B” encompasses “A”, “A and B”, and “B”.
- “A, B, and/or C” encompasses "A”, “B”, “C”, “A and B”, “A and C”, “B and C”, and "A and B and C”.
- the protein or nucleic acid may be composed of the sequence, or may have additional amino acids or cores at one or both ends of the protein or nucleic acid. Glycolic acid, but still has the activity of the present invention.
- methionine encoded by the start codon at the N-terminus of the polypeptide will be retained under certain practical conditions (for example, when expressed in a specific expression system), but does not substantially affect the function of the polypeptide.
- Gene as used herein not only covers chromosomal DNA present in the nucleus, but also includes organelle DNA present in the subcellular components of the cell (such as mitochondria, plastids).
- Genetically modified organism or “genetically modified cell” means an organism or cell that contains exogenous polynucleotides or contains modified genes or expression control sequences in its genome.
- exogenous polynucleotides can be stably integrated into the genome of organisms or cells, and inherited for successive generations.
- the exogenous polynucleotide can be integrated into the genome alone or as part of a recombinant DNA construct.
- the modified gene or expression control sequence is that the gene or expression control sequence in the genome of an organism or cell contains one or more deoxynucleotide substitutions, deletions and additions.
- Form in terms of sequence means a sequence from a foreign species or, if from the same species, a sequence whose composition and/or locus has been significantly altered from its natural form through deliberate human intervention.
- nucleic acid sequence is used interchangeably and are single-stranded or double-stranded RNA or DNA polymers, optionally containing synthetic, non-natural Or changed nucleotide bases.
- Nucleotides are referred to by their single letter names as follows: “A” is adenosine or deoxyadenosine (respectively for RNA or DNA), “C” is cytidine or deoxycytidine, and “G” is guanosine or Deoxyguanosine, “U” means uridine, “T” means deoxythymidine, “R” means purine (A or G), “Y” means pyrimidine (C or T), “K” means G or T, “ H” means A or C or T, “D” means A, T or G, “I” means inosine, and “N” means any nucleotide.
- Polypeptide “peptide”, and “protein” are used interchangeably in the present invention and refer to a polymer of amino acid residues.
- the term applies to amino acid polymers in which one or more amino acid residues are artificial chemical analogs of the corresponding naturally occurring amino acids, as well as to naturally occurring amino acid polymers.
- the terms “polypeptide”, “peptide”, “amino acid sequence” and “protein” may also include modified forms, including but not limited to glycosylation, lipid linkage, sulfation, gamma carboxylation of glutamic acid residues, hydroxyl And ADP-ribosylation.
- expression construct refers to a vector, such as a recombinant vector, suitable for expression of a nucleotide sequence of interest in an organism.
- “Expression” refers to the production of a functional product.
- the expression of a nucleotide sequence may refer to the transcription of the nucleotide sequence (such as transcription to generate mRNA or functional RNA) and/or the translation of RNA into a precursor or mature protein.
- the "expression construct" of the present invention can be a linear nucleic acid fragment, a circular plasmid, a viral vector, or, in some embodiments, can be RNA (such as mRNA) that can be translated, for example, RNA generated by in vitro transcription.
- RNA such as mRNA
- the "expression construct" of the present invention may contain regulatory sequences and nucleotide sequences of interest from different sources, or regulatory sequences and nucleotide sequences of interest from the same source but arranged in a way different from those normally occurring in nature.
- regulatory sequence and “regulatory element” can be used interchangeably and refer to the upstream (5' non-coding sequence), middle or downstream (3' non-coding sequence) of the coding sequence, and affect the transcription, RNA processing or processing of the related coding sequence. Stability or translated nucleotide sequence. Regulatory sequences may include, but are not limited to, promoters, translation leader sequences, introns, and polyadenylation recognition sequences.
- Promoter refers to a nucleic acid fragment capable of controlling the transcription of another nucleic acid fragment.
- the promoter is a promoter capable of controlling gene transcription in a cell, regardless of whether it is derived from the cell.
- the promoter can be a constitutive promoter or a tissue-specific promoter or a developmentally regulated promoter or an inducible promoter.
- tissue-specific promoter and “tissue-preferred promoter” are used interchangeably, and refer to mainly but not necessarily exclusively expressed in a tissue or organ, and can also be expressed in a specific cell or cell type The promoter.
- tissue-preferred promoter refers to a promoter whose activity is determined by developmental events.
- inducible promoters selectively express operably linked DNA sequences in response to endogenous or exogenous stimuli (environment, hormones, chemical signals, etc.).
- promoters include, but are not limited to, polymerase (pol) I, pol II, or pol III promoters.
- pol I promoters include chicken RNA pol I promoter.
- pol II promoters include, but are not limited to, cytomegalovirus immediate early (CMV) promoter, Rous sarcoma virus long terminal repeat (RSV-LTR) promoter, and simian virus 40 (SV40) immediate early promoter.
- pol III promoters include U6 and H1 promoters.
- An inducible promoter such as a metallothionein promoter can be used.
- promoters include T7 phage promoter, T3 phage promoter, ⁇ -galactosidase promoter, and Sp6 phage promoter.
- the promoter may be cauliflower mosaic virus 35S promoter, corn Ubi-1 promoter, wheat U6 promoter, rice U3 promoter, corn U3 promoter, rice actin promoter.
- operably linked refers to the connection of regulatory elements (for example, but not limited to, promoter sequences, transcription termination sequences, etc.) to nucleic acid sequences (for example, coding sequences or open reading frames) such that the nucleotides The transcription of the sequence is controlled and regulated by the transcription control element.
- regulatory elements for example, but not limited to, promoter sequences, transcription termination sequences, etc.
- nucleic acid sequences for example, coding sequences or open reading frames
- Introducing a nucleic acid molecule (such as a plasmid, linear nucleic acid fragment, RNA, etc.) or protein into an organism refers to transforming the cell of the organism with the nucleic acid or protein so that the nucleic acid or protein can function in the cell.
- the "transformation” used in the present invention includes stable transformation and transient transformation.
- Stable transformation refers to the introduction of exogenous nucleotide sequences into the genome, resulting in stable inheritance of the exogenous gene. Once stably transformed, the exogenous nucleic acid sequence is stably integrated into the genome of the organism and any successive generations thereof.
- Transient transformation refers to the introduction of nucleic acid molecules or proteins into cells to perform functions without stable inheritance of foreign genes. In transient transformation, the foreign nucleic acid sequence is not integrated into the genome.
- Proteins refer to the physiological, morphological, biochemical or physical characteristics of cells or organisms.
- “Agronomic traits” especially refer to the measurable index parameters of crop plants, including but not limited to: leaf green, grain yield, growth rate, total biomass or accumulation rate, fresh weight at maturity, dry weight at maturity, fruit Yield, seed yield, plant total nitrogen content, fruit nitrogen content, seed nitrogen content, plant nutrient tissue nitrogen content, plant total free amino acid content, fruit free amino acid content, seed free amino acid content, plant nutrient tissue free amino acid content, plant total protein Content, fruit protein content, seed protein content, plant nutrition tissue protein content, herbicide resistance, drought resistance, nitrogen absorption, root lodging, harvest index, stem lodging, plant height, ear height, ear length, disease resistance Resistance, cold resistance, salt resistance and tiller number.
- the present invention provides a base editing fusion protein comprising a nucleic acid targeting domain, a cytosine deaminization domain, and an adenine deaminization domain.
- base editing fusion protein and “base editor” are used interchangeably, and refer to those that can mediate the substitution of one or more nucleotides of a target sequence in the genome in a sequence-specific manner. protein.
- nucleic acid targeting domain refers to a domain capable of mediating the attachment of the base editing fusion protein to a specific target sequence in the genome in a sequence-specific manner (for example, through a guide RNA).
- the nucleic acid targeting domain comprises at least one (e.g., one) CRISPR effector polypeptide.
- CRISPR effector protein generally refers to a nuclease (CRISPR nuclease) or a functional variant thereof found in a naturally occurring CRISPR system.
- the term covers any effector protein based on the CRISPR system that can achieve sequence-specific targeting in cells.
- a "functional variant" in terms of CRISPR nuclease means that it retains at least the guide RNA-mediated sequence-specific targeting ability.
- the functional variant is a nuclease-inactivated variant, that is, it lacks double-stranded nucleic acid cleavage activity.
- CRISPR nucleases lacking double-stranded nucleic acid cleavage activity also encompass nickases, which form nicks in double-stranded nucleic acid molecules, but do not completely cut double-stranded nucleic acids.
- the CRISPR effector protein of the present invention has nickase activity.
- the functional variant recognizes a different PAM (proximal region sequence adjacent motif) sequence relative to the wild-type nuclease.
- the "CRISPR effector protein” can be derived from Cas9 nuclease, including Cas9 nuclease or functional variants thereof.
- the Cas9 nuclease may be a Cas9 nuclease from different species, such as spCas9 from S. pyogenes or SaCas9 derived from S. aureus.
- Cas9 nuclease and Cas9 are used interchangeably herein, and refer to RNA comprising Cas9 protein or fragments thereof (for example, a protein containing the active DNA cleavage domain of Cas9 and/or the gRNA binding domain of Cas9) Guided nuclease.
- Cas9 is a component of CRISPR/Cas (clustered regularly spaced short palindrome repeats and related systems) genome editing system, which can target and cut DNA target sequences under the guidance of guide RNA to form DNA double-strand breaks (DSB) ).
- DSB DNA double-strand breaks
- An exemplary amino acid sequence of wild-type spCas9 is shown in SEQ ID NO:1.
- CRISPR effector protein can also be derived from Cpf1 nuclease, including Cpf1 nuclease or functional variants thereof.
- the Cpf1 nuclease may be Cpf1 nuclease from different species, for example, Cpf1 nuclease from Francisella novicida U112, Acidaminococcus sp. BV3L6 and Lachnospiraceae bacterium ND2006.
- CRISPR effector proteins can also be derived from Cas3, Cas8a, Cas5, Cas8b, Cas8c, Cas10d, Cse1, Cse2, Csy1, Csy2, Csy3, GSU0054, Cas10, Csm2, Cmr5, Cas10, Csx11, Csx10, Csf1, Csn2 , Cas4, C2c1, C2c3 or C2c2 nucleases, for example, include these nucleases or functional variants thereof.
- the CRISPR effector protein is nuclease-inactivated Cas9.
- the DNA cleavage domain of Cas9 nuclease is known to contain two subdomains: HNH nuclease subdomain and RuvC subdomain.
- the HNH subdomain cleaves the strand complementary to gRNA, while the RuvC subdomain cleaves the non-complementary strand. Mutations in these subdomains can inactivate the nuclease activity of Cas9, forming "nuclease-inactivated Cas9".
- the Cas9 inactivated by the nuclease still retains the DNA binding ability guided by gRNA.
- the nuclease-inactivated Cas9 of the present invention can be derived from Cas9 of different species, for example, derived from S. pyogenes Cas9 (SpCas9), or derived from Staphylococcus aureus (S. aureus) Cas9 (SaCas9). ). Simultaneously mutating the HNH nuclease subdomain and RuvC subdomain of Cas9 (for example, including the mutations D10A and H840A) makes the nuclease of Cas9 inactive and becomes nuclease death Cas9 (dCas9). Mutation and inactivation of one of the subdomains can make Cas9 have nickase activity, that is, obtain Cas9 nickase (nCas9), for example, nCas9 with only mutation D10A.
- SpCas9 S. pyogenes Cas9
- SaCas9 Staphyloc
- the nuclease-inactivated Cas9 variant of the present invention contains amino acid substitution D10A and/or H840A relative to wild-type Cas9, wherein the amino acid number refers to SEQ ID NO:1.
- the nuclease-inactivated Cas9 contains the amino acid substitution D10A relative to the wild-type Cas9, wherein the amino acid number refers to SEQ ID NO:1.
- the nuclease-inactivated Cas9 comprises the amino acid sequence shown in SEQ ID NO: 2 (nCas9(D10A)).
- Cas9 nuclease When Cas9 nuclease is used for gene editing, it usually requires the target sequence to have a 5'-NGG-3' PAM (proximal region sequence adjacent motif) sequence at the 3'end.
- PAM proximal region sequence adjacent motif
- CRISPR effector proteins that recognize different PAM sequences are preferably used in the present invention, for example, functional variants of Cas9 nuclease with different PAM sequences.
- the CRISPR effector protein is a Cas9 variant that recognizes the 5'-NG-3' of the PAM sequence.
- the Cas9 variant that recognizes the PAM sequence 5'-NG-3' is also referred to herein as Cas9-NG.
- Cas9-NG includes the following amino acid substitutions R1335V, L1111R, D1135V, G1218R, E1219F, A1322R, T1337R relative to wild-type Cas9, wherein the amino acid number refers to SEQ ID NO:1.
- the CRISPR effector protein is nuclease-inactivated and recognizes the Cas9 variant of the PAM sequence 5'-NG-3'.
- the nuclease-inactivated Cas9 variant that recognizes the PAM sequence 5'-NG-3' contains the following amino acid substitutions D10A, R1335V, L1111R, D1135V, G1218R, E1219F, A1322R, T1337R relative to wild-type Cas9 , Where the amino acid number refers to SEQ ID NO:1.
- the nuclease-inactivated Cas9 variant that recognizes the PAM sequence 5'-NG-3' comprises the amino acid sequence shown in SEQ ID NO: 3 (nCas9-NG(D10A)).
- cytosine deamination domain refers to a domain that can accept single-stranded DNA as a substrate and catalyze the deamination of cytidine or deoxycytidine into uracil or deoxyuracil, respectively.
- the cytosine deaminase domain comprises at least one (eg, one or two) cytosine deaminase polypeptides.
- the cytidine deaminization domain in the fusion protein can convert the cytidine deamination of the single-stranded DNA produced during the formation of the fusion protein-guide RNA-DNA complex into U, and then realize base mismatch repair C to T base substitution.
- cytosine deaminase examples include, but are not limited to, for example, APOBEC1 deaminase, activation-induced cytidine deaminase (AID), APOBEC3G, CDA1, human APOBEC3A deaminase, or their functional modifications body.
- the cytosine deaminase is human APOBEC3A deaminase or a functional variant thereof.
- the human APOBEC3A deaminase comprises the amino acid sequence shown in SEQ ID NO:4.
- adenine deamination domain refers to a domain that can accept single-stranded DNA as a substrate and catalyze the formation of inosine (I) from adenosine or deoxyadenosine (A).
- the adenine deaminase domain comprises at least one (eg, one) DNA-dependent adenine deaminase polypeptide.
- the adenine deamination domain in the fusion protein can convert the adenosine deamination of the single-stranded DNA generated during the formation of the CRISPR effector protein-guide RNA-DNA complex into inosine (I), due to DNA polymerization
- the enzyme treats inosine (I) as guanine (G), so A to G substitution can be achieved through base mismatch repair.
- the DNA-dependent adenine deaminase is a variant of E. coli tRNA adenine deaminase TadA (ecTadA).
- ecTadA E. coli tRNA adenine deaminase TadA
- An exemplary wild-type ecTadA amino acid sequence is shown in SEQ ID NO: 5.
- the wild-type ecTadA amino acid sequence may not include the N-terminal methionine in SEQ ID NO: 5.
- the DNA-dependent adenine deaminase comprises one or more sets of mutations selected from the following relative to wild-type ecTadA:
- the amino acid number refers to SEQ ID NO: 5.
- the DNA-dependent adenine deaminase contains the following mutations relative to wild-type ecTadA: W23R, H36L, R51L, S146C, K157N, A106V, D108N, P48A, L84F, H123Y, I156F, For D147Y, E155V and R152P, the amino acid number refers to SEQ ID NO: 5.
- the DNA-dependent adenine deaminase comprises the amino acid sequence shown in SEQ ID NO:6.
- E. coli tRNA adenine deaminase usually functions as a dimer, it is expected that two DNA-dependent adenine deaminase will form a dimer or DNA-dependent adenine deaminase and wild-type adenine The formation of dimers by deaminase can significantly increase the editing activity of fusion proteins A to G.
- the adenine deaminase domain comprises two of the DNA-dependent adenine deaminase.
- the adenine deaminase domain further comprises a corresponding DNA-dependent adenine deaminase (such as a DNA-dependent variant of E. coli tRNA adenine deaminase TadA) fused to Wild-type adenine deaminase (eg E. coli tRNA adenine deaminase TadA).
- the DNA-dependent adenine deaminase (such as a DNA-dependent variant of E. coli tRNA adenine deaminase TadA) is fused to a corresponding wild-type adenine deaminase (such as E. coli The C-terminus of tRNA adenine deaminase (TadA).
- DNA-dependent adenine deaminase e.g., a DNA-dependent variant of E. coli tRNA adenine deaminase TadA
- DNA-dependent adenine deaminase e.g., A DNA-dependent variant of E. coli tRNA adenine deaminase TadA
- wild-type adenine deaminase such as E. coli tRNA adenine deaminase TadA
- the adenine deamination domain comprises the amino acid sequence shown in SEQ ID NO: 7 or 8.
- the nucleic acid targeting domain, the cytosine deamination domain and the adenine deamination domain are fused via a linker.
- linkers can be 1-50 long (e.g. 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20 or 20-25, 25-50) or more amino acids, non-functional amino acid sequences without secondary or higher structure.
- the linker may be a flexible linker, such as GGGGS, GS, GAP, (GGGGS)x3, GGS and (GGS)x7, etc.
- the linker is 32 amino acids long, for example, the linker comprises the amino acid sequence shown in SEQ ID NO:9.
- the linker is 48 amino acids long, for example, the linker comprises the amino acid sequence shown in SEQ ID NO:10.
- the linker is an XTEN linker comprising the amino acid sequence shown in SEQ ID NO: 11.
- the base editing fusion protein comprises the following order from the N-terminus to the C-terminus: a cytosine deamination domain, an adenine deamination domain, and a nucleic acid targeting domain. In some embodiments, the base editing fusion protein comprises in the following order from the N-terminal to the C-terminal: an adenine deamination domain, a cytosine deamination domain, and a nucleic acid targeting domain.
- uracil DNA glycosylase catalyzes the removal of U from DNA and initiates base excision repair (BER), resulting in the repair of U:G to C:G. Therefore, without being limited by any theory, the combination of the base editing fusion protein of the present invention and Uracil DNA Glycosylase Inhibitor (UGI) will increase the efficiency of C to T base editing.
- Uracil DNA Glycosylase Inhibitor Uracil DNA Glycosylase Inhibitor
- the base editing fusion protein is co-expressed with uracil DNA glycosylase inhibitor (UGI).
- UFI uracil DNA glycosylase inhibitor
- the base editing fusion protein further comprises Uracil DNA Glycosylase Inhibitor (UGI).
- UBI Uracil DNA Glycosylase Inhibitor
- UGI is connected to other parts of the base editing fusion protein through a linker.
- UGI is connected to other parts of the base editing fusion protein through a "self-cleaving peptide”.
- self-cleaving peptide means a peptide that can achieve self-cleavage within a cell.
- the self-cleaving peptide may include a protease recognition site, so that it can be recognized and specifically cleaved by the protease in the cell.
- the self-cleaving peptide may be a 2A polypeptide.
- the 2A polypeptide is a type of short peptide derived from viruses, and its self-cleavage occurs during translation. When 2A polypeptide is used to connect two different target polypeptides and expressed in the same reading frame, the two target polypeptides are almost produced at a ratio of 1:1.
- 2A polypeptides can be P2A from porcine techovirus-1, T2A from Thosea asignis virus, E2A from equine rhinitis A virus And F2A from foot-and-mouth disease virus.
- T2A porcine techovirus-1
- E2A from equine rhinitis A virus
- F2A foot-and-mouth disease virus.
- a variety of functional variants of these 2A polypeptides are also known in the art, and these variants can also be used in the present invention.
- the self-cleavable peptide does not exist between or within the nucleic acid targeting domain, the cytosine deamination domain and the adenine deamination domain.
- UGI is located at the N-terminus or C-terminus of the base editing fusion protein, preferably the C-terminus.
- the uracil DNA glycosylase inhibitor comprises the amino acid sequence shown in SEQ ID NO: 12.
- the fusion protein of the present invention may also include a nuclear localization sequence (NLS).
- NLS nuclear localization sequence
- one or more NLS in the fusion protein should have sufficient strength to drive the accumulation of the fusion protein in an amount that can achieve its base editing function in the nucleus of the cell.
- the strength of nuclear localization activity is determined by the number and location of NLS in the fusion protein, one or more specific NLS used, or a combination of these factors.
- the NLS of the fusion protein of the present invention may be located at the N-terminal and/or C-terminal. In some embodiments of the present invention, the NLS of the fusion protein of the present invention may be located between the adenine deamination domain, cytosine deamination domain, nucleic acid targeting domain and/or UGI. In some embodiments, the fusion protein comprises about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more NLS. In some embodiments, the fusion protein comprises about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more NLS at or near the N-terminus. In some embodiments, the fusion protein comprises about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more NLS at or near the C-terminus.
- the polypeptide includes a combination of these, such as one or more NLS at the N-terminus and one or more NLS at the C-terminus. When there is more than one NLS, each one can be selected as not dependent on the other NLS.
- NLS consists of one or more short sequences of positively charged lysine or arginine exposed on the surface of the protein, but other types of NLS are also known.
- Non-limiting examples of NLS include: KKRKV, PKKKRKV, or KRPAATKKAGQAKKKK.
- the fusion protein of the present invention may also include other positioning sequences, such as cytoplasmic positioning sequences, chloroplast positioning sequences, mitochondrial positioning sequences and the like.
- the base editing fusion protein comprises the amino acid sequence shown in any one of SEQ ID NO: 13-19.
- the present invention provides a base editing system for modifying target nucleic acid regions in the genome, which comprises:
- the at least one guide RNA is directed to at least one target sequence in the target nucleic acid region.
- base editing system refers to a combination of components required for base editing of the genome of a cell or organism.
- the various components of the system such as base editing fusion protein, one or more guide RNAs, can exist independently of each other, or can exist in any combination as a composition.
- guide RNA and “gRNA” are used interchangeably, and refer to RNA that can form a complex with the CRISPR effector protein and can target the complex to the target sequence due to a certain identity with the target sequence molecular.
- the guide RNA targets the target sequence by base pairing with the complementary strand of the target sequence.
- the gRNA used by Cas9 nuclease or its functional variants is usually composed of crRNA and tracrRNA molecules that are partially complementary to form a complex, wherein the crRNA contains sufficient identity with the target sequence to hybridize with the complementary strand of the target sequence and guide
- the CRISPR complex (Cas9+crRNA+tracrRNA) is a guide sequence (also called a seed sequence) that specifically binds to the target sequence.
- sgRNA single guide RNA
- the gRNA used by Cpf1 nuclease or its functional variants is usually composed of mature crRNA molecules only, which can also be called sgRNA. Designing a suitable gRNA based on the CRISPR nuclease used and the target sequence to be edited is within the abilities of those skilled in the art.
- the sequence of sgRNA may include the following scaffold sequence:
- the target sequence targeted by the guide RNA usually contains 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27 , 28, 29, 30, preferably 20 nucleotides.
- the guide sequence in the guide RNA usually includes 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26 1, 27, 28, 29, 30, preferably 20 nucleotides.
- the target sequence targeted by the guide RNA also requires a PAM sequence at one end, such as the 3'end, to be recognized by the CRISPR effector protein (or fusion protein)-guide RNA complex.
- the location, type, and length of the PAM required for the target sequence depends on the CRISPR effector protein used.
- the PAM sequence is 5'-NGG-3' located at the 3'end of the target sequence.
- the PAM sequence is 5'-NG-3' located at the 3'end of the target sequence.
- the base editing fusion protein and the guide RNA can form a complex, and the complex specifically targets the target sequence under the guidance of the guide RNA, and causes One or more Cs are replaced by T and/or one or more A is replaced by G in the target sequence.
- the C to T base editing window of the base editing fusion protein of the present invention is located at positions 1-17 of the target sequence. That is to say, the base editing fusion protein of the present invention can have one or more Cs in the range of positions 1-17 from the 5'end of the target sequence replaced by T.
- the A to G base editing window of the base editing fusion protein of the present invention is located at positions 4-8 of the target sequence. That is to say, the base editing fusion protein of the present invention can make one or more A in the range of 4-8 positions from the 5'end of the target sequence replaced by G.
- the at least one guide RNA may be directed to a target sequence located on the sense strand (such as the protein coding strand) and/or the antisense strand within the target nucleic acid region of the genome.
- the base editing composition of the present invention can cause one or more Cs in the target sequence on the sense strand (e.g., protein coding strand) to be replaced by T and/ Or one or more A is replaced by G.
- the base editing composition of the present invention can cause one or more Gs in the target sequence on the sense strand (for example, protein coding strand) to be replaced by A and/or one or more T Replaced by C.
- the nucleotide sequence encoding the base editing fusion protein is codon optimized for the organism whose genome is to be modified.
- Codon optimization refers to replacing at least one codon (e.g., about or more than about 1, 2, 3, 4, 5, 10) of the natural sequence with a codon that is used more frequently or most frequently in the gene of the host cell. , 15, 20, 25, 50 or more codons while maintaining the natural amino acid sequence to modify the nucleic acid sequence to enhance expression in the host cell of interest.
- Codon preference (the difference in codon usage between organisms) is often related to the translation efficiency of messenger RNA (mRNA), and the translation efficiency is considered to depend on the nature and nature of the codon being translated
- tRNA transfer RNA
- genes can be tailored to be the best in a given organism based on codon optimization. Good gene expression. Codon utilization tables can be easily obtained, such as the "Codon Usage Database” available at www.kazusa.orjp/codon/ , and these tables can be adjusted in different ways Applicable. See, Nakamura Y. et al., "Codon usage tabulated from the international DNA sequence databases: status for the year 2000. Nucl. Acids Res., 28:292 (2000).
- the base editing fusion protein of the present invention is encoded by any one of the nucleotide sequences shown in SEQ ID NO: 20-26.
- Organisms whose genome can be modified by the base editing system of the present invention include any organisms suitable for base editing, preferably eukaryotes.
- organisms include, but are not limited to, mammals such as humans, mice, rats, monkeys, dogs, pigs, sheep, cattle, and cats; poultry such as chickens, ducks, and geese; plants, including monocots and dicots
- the plant is a crop plant, including but not limited to wheat, rice, corn, soybean, sunflower, sorghum, rape, alfalfa, cotton, barley, millet, sugarcane, tomato, tobacco, cassava, and potato.
- the organism is a plant. More preferably, the organism is rice.
- the present invention also provides a method for producing at least one genetically modified cell, comprising introducing the base editing system of the present invention into at least one of the cells, thereby causing a target nucleic acid region in the at least one cell One or more nucleotide substitutions within.
- the method further includes the step of selecting cells having the desired one or more nucleotide substitutions from the at least one cell.
- the methods of the invention are performed in vitro.
- the cell is an isolated cell, or a cell in an isolated tissue or organ.
- the present invention also provides a genetically modified organism, which comprises a genetically modified cell or its progeny cells produced by the method of the present invention.
- a genetically modified organism which comprises a genetically modified cell or its progeny cells produced by the method of the present invention.
- the genetically modified cell or its progeny cells have the desired one or more nucleotide substitutions.
- the target nucleic acid region to be modified can be located anywhere in the genome, for example, within a functional gene such as a protein coding gene, or, for example, can be located in a gene expression regulatory region such as a promoter region or an enhancer region, so as to achieve The modification of gene function or the modification of gene expression.
- the desired nucleotide substitution results in a desired gene function modification or gene expression modification.
- the target nucleic acid region is related to the trait of the cell or organism. In some embodiments, the mutation in the target nucleic acid region causes a change in the trait of the cell or organism. In some embodiments, the target nucleic acid region is located in the coding region of the protein. In some embodiments, the target nucleic acid region encodes a functionally related motif or domain of the protein. In some preferred embodiments, one or more nucleotide substitutions in the target nucleic acid region result in amino acid substitutions in the amino acid sequence of the protein. In some embodiments, the one or more nucleotide substitutions result in a change in the function of the protein.
- the base editing system can be introduced into cells by various methods well known to those skilled in the art.
- Methods that can be used to introduce the base editing system of the present invention into cells include, but are not limited to: calcium phosphate transfection, protoplast fusion, electroporation, liposome transfection, microinjection, viral infection (such as baculovirus, vaccinia virus, Adenovirus, adeno-associated virus, lentivirus and other viruses), gene bombardment, PEG-mediated transformation of protoplasts, and Agrobacterium-mediated transformation.
- Cells that can be base-edited by the method of the present invention can be derived from, for example, mammals such as humans, mice, rats, monkeys, dogs, pigs, sheep, cattle, and cats; poultry such as chickens, ducks, and geese; plants, including Monocotyledonous plants and dicotyledonous plants, preferably crop plants, including but not limited to wheat, rice, corn, soybean, sunflower, sorghum, rape, alfalfa, cotton, barley, millet, sugarcane, tomato, tobacco, cassava and potato.
- mammals such as humans, mice, rats, monkeys, dogs, pigs, sheep, cattle, and cats
- poultry such as chickens, ducks, and geese
- plants including Monocotyledonous plants and dicotyledonous plants, preferably crop plants, including but not limited to wheat, rice, corn, soybean, sunflower, sorghum, rape, alfalfa, cotton, barley, millet, sugarcane, tomato
- the base editing fusion protein, the base editing system and the method for producing genetically modified cells of the present invention are particularly suitable for genetic modification of plants.
- the plant is a crop plant, including but not limited to wheat, rice, corn, soybean, sunflower, sorghum, rape, alfalfa, cotton, barley, millet, sugarcane, tomato, tobacco, cassava and potato. More preferably, the plant is rice.
- the present invention provides a method for producing a genetically modified plant, comprising introducing the base editing system of the present invention into at least one of the plants, thereby causing a target nucleic acid region in the genome of the at least one plant One or more nucleotide substitutions within.
- the method further includes screening the at least one plant for plants having the desired one or more nucleotide substitutions.
- the base editing composition can be introduced into plants by various methods well known to those skilled in the art.
- Methods that can be used to introduce the base editing system of the present invention into plants include, but are not limited to: gene bombardment, PEG-mediated transformation of protoplasts, Agrobacterium-mediated transformation, plant virus-mediated transformation, pollen tube channel method, and seed Room injection.
- the base editing composition is introduced into the plant by transient transformation.
- the target sequence can be modified by introducing or producing the base editing fusion protein and guide RNA into plant cells, and the modification can be inherited stably, without the need to encode the base Exogenous polynucleotides that are components of the editing system stably transform plants. This avoids the potential off-target effects of the stable (continuously produced) base editing composition, and also avoids the integration of foreign nucleotide sequences in the plant genome, thereby having higher biological safety.
- the introduction is carried out in the absence of selective pressure, so as to avoid the integration of foreign nucleotide sequences in the plant genome.
- the introduction includes transforming the base editing system of the present invention into an isolated plant cell or tissue, and then regenerating the transformed plant cell or tissue into a whole plant.
- the regeneration is performed in the absence of selective pressure, that is, no selective agent for the selective gene carried on the expression vector is used in the tissue culture process.
- no selection agent can improve the regeneration efficiency of plants and obtain modified plants without exogenous nucleotide sequences.
- the base editing system of the present invention can be transformed to a specific part of the whole plant, such as leaves, stem tips, pollen tubes, young ears or hypocotyls. This is particularly suitable for the transformation of plants that are difficult to undergo tissue culture regeneration.
- the protein expressed in vitro and/or the RNA molecule transcribed in vitro (for example, the expression construct is an RNA molecule transcribed in vitro) is directly transformed into the plant.
- the protein and/or RNA molecule can realize base editing in plant cells and then be degraded by the cell, avoiding the integration of foreign nucleotide sequences in the plant genome.
- genetic modification and breeding of plants using the method of the present invention can obtain plants whose genomes are not integrated with foreign polynucleotides, that is, transgene-free modified plants.
- the modified target nucleic acid region is related to plant traits such as agronomic traits, whereby the one or more nucleotide substitutions result in the plant having altered characteristics relative to wild-type plants.
- plant traits such as agronomic traits
- the one or more nucleotide substitutions result in the plant having altered characteristics relative to wild-type plants.
- improved traits such as agronomic traits.
- the method further includes the step of screening for plants having desired one or more nucleotide substitutions and/or desired traits such as agronomic traits.
- the method further includes obtaining progeny of the genetically modified plant.
- the genetically modified plant or its progeny has desired one or more nucleotide substitutions and/or desired traits such as agronomic traits.
- the present invention also provides a genetically modified plant or its progeny or part thereof, wherein the plant is obtained by the above-mentioned method of the present invention.
- the genetically modified plant or progeny or part thereof is non-transgenic.
- the genetically modified plant or its progeny have desired genetic modification and/or desired traits such as agronomic traits.
- the present invention also provides a plant breeding method, which comprises combining the genetically modified first plant that contains one or more nucleotide substitutions in the target nucleic acid region obtained by the above-mentioned method of the present invention with the one that does not contain The second plant with the one or more nucleotide substitutions is crossed, thereby introducing the one or more nucleotide substitutions into the second plant.
- the genetically modified first plant has desired traits such as agronomic traits.
- the present invention provides a method for in situ saturation mutation of an endogenous target nucleic acid region in a cell or organism to obtain a mutation of interest in the target nucleic acid region, comprising
- the methods of the invention are performed in vitro.
- the cell is an isolated cell, or a cell in an isolated tissue or organ.
- the base editing system includes multiple guide RNAs and/or multiple expression constructs containing nucleotide sequences encoding the multiple guide RNAs.
- the multiple guide RNAs are directed to different target sequences in the target nucleic acid region.
- the plurality of guide RNAs may be 2 to 250 or more, for example 2, 3, 4, 5, 6, 7, 8, 9, 10. Species, 15 species, 20 species, 25 species, 50 species, 75 species, 100 species, 150 species, 200 species, 250 species, 300 species or more.
- the target sequences and/or complementary sequences targeted by at least some of the plurality of guide RNAs partially overlap each other and/or are adjacent to each other.
- the base editing system of the present invention can realize base editing of a longer target nucleic acid region.
- the target nucleic acid region may have a length of about 20bp to about 10000bp or longer, such as about 20bp, about 40bp, about 60bp, about 80bp, about 100bp, about 120bp, about 140bp, about 160bp, about 180bp, about 200bp, About 300bp, about 400bp, about 500bp, about 1000bp, about 1500bp, about 2000bp, about 3000bp, about 4000bp, about 5000bp, about 6000bp or longer.
- the target nucleic acid sequence may encode an amino acid sequence of about 5 to about 2000 amino acids in length, for example, may encode an amino acid sequence of about 5, about 10, or about 15 in length.
- the target sequence targeted by the plurality of guide RNAs substantially covers the target nucleic acid region.
- At least a portion of the plurality of guide RNAs target the sense strand of the target nucleic acid region.
- At least a portion of the plurality of guide RNAs target the antisense strand of the target nucleic acid region.
- the plurality of guide RNAs and/or a plurality of expression constructs containing nucleotide sequences encoding the plurality of guide RNAs can each be independently introduced into the population of the cell or organism. In some embodiments, the plurality of guide RNAs and/or a plurality of expression constructs containing nucleotide sequences encoding the plurality of guide RNAs can be introduced into the population of the cell or organism in combination with each other.
- each guide RNA or its expression construct is introduced into a subpopulation of the cells or organisms, and finally all the subpopulations constitute the population of cells or organisms that have been introduced into the gene editing system; or, every two guide RNAs Or a mixture of expression constructs thereof is used to introduce a subpopulation of said cells or organisms, and finally all subpopulations constitute a population of cells or organisms that have been introduced into the gene editing system; and so on.
- the mutation is a nucleotide substitution, such as a C to T substitution, A to G substitution, G to A substitution, or T to C substitution.
- the target nucleic acid region is located in the coding region of the protein. In some embodiments, the target nucleic acid region encodes a functionally related motif or domain of the protein. In some embodiments, the mutations in the target nucleic acid region may be silent mutations, missense mutations, or nonsense mutations. In some preferred embodiments, mutations in the target nucleic acid region result in amino acid substitutions in the amino acid sequence of the protein. In some embodiments, the mutation results in a change in the function of the protein.
- the "saturation” mutation does not necessarily mean that the population of cells or organisms contains all nucleotide mutations in the target nucleic acid region or all amino acid mutations in the amino acid sequence encoded by the target nucleic acid region. “Near saturation” mutations are also encompassed, such as mutations in which more than 50% of the nucleotides in the target nucleic acid region are contained in the population of cells or organisms or more than 50% of the amino acid mutations in the amino acid sequence encoded by the target nucleic acid region.
- the target nucleic acid region is related to the trait of the cell or organism.
- the mutation in the target nucleic acid region causes a change in the trait of the cell or organism. Therefore, in some embodiments, mutations of interest can be screened for changes in the traits of cells or organisms.
- mutant of interest generally refers to a mutation that causes a change in a trait of interest of a cell or organism. Therefore, in step iii), cells or organisms with mutations of interest can be screened by screening cells or organisms with changes in traits of interest.
- the cells can be derived from, for example, mammals such as humans, mice, rats, monkeys, dogs, pigs, sheep, cattle, cats; poultry such as chickens, ducks, and geese; plants, including monocots and dicots, preferably Crop plants, including but not limited to wheat, rice, corn, soybean, sunflower, sorghum, rape, alfalfa, cotton, barley, millet, sugarcane, tomato, tobacco, cassava, and potato.
- the cell is a plant cell, more preferably a crop plant cell, more preferably a rice cell.
- the organism may be, for example, mammals such as humans, mice, rats, monkeys, dogs, pigs, sheep, cattle, and cats; poultry such as chickens, ducks, and geese; plants, including monocots and dicots, Preferred crop plants include, but are not limited to, wheat, rice, corn, soybean, sunflower, sorghum, rape, alfalfa, cotton, barley, millet, sugar cane, tomato, tobacco, cassava, and potato.
- the organism is a plant, more preferably a crop plant, more preferably rice.
- the traits of interest include improved agronomic traits, including but limited to increased growth rate, increased yield, increased nutrient content, increased cold resistance, increased drought resistance, Increased insect resistance, increased disease resistance, increased herbicide resistance, etc.
- the traits of interest include but are not limited to drug resistance.
- the present invention also covers the application of the base editing system of the present invention in the treatment of diseases.
- Modification of disease-related genes by the base editing system of the present invention can realize up-regulation, down-regulation, inactivation, activation or mutation correction of disease-related genes, etc., thereby achieving disease prevention and/or treatment.
- the target nucleic acid region in the present invention can be located in the protein coding region of a disease-related gene, or, for example, can be located in a gene expression regulatory region such as a promoter region or an enhancer region, so that functional modification or modification of the disease-related gene can be achieved.
- Modification of disease-related gene expression. Therefore, modifying disease-related genes described herein includes modification of disease-related genes themselves (such as protein coding regions), as well as modification of their expression regulatory regions (such as promoters, enhancers, introns, etc.).
- a “disease-related” gene refers to any gene that produces transcription or translation products at abnormal levels or in an abnormal form in cells derived from tissues affected by the disease, compared to tissues or cells that are not disease control. In the case where the altered expression is related to the appearance and/or progression of the disease, it may be a gene expressed at an abnormally high level; it may be a gene expressed at an abnormally low level.
- Disease-related genes also refer to genes that have one or more mutations or genetic variants directly responsible for or linkage disequilibrium with one or more genes responsible for the etiology of the disease.
- the mutation or genetic variation is, for example, a single nucleotide variation (SNV).
- SNV single nucleotide variation
- the present invention also provides a method of treating a disease in a subject in need thereof, comprising delivering to the subject an effective amount of the base editing system of the present invention to modify genes related to the disease.
- the present invention also provides the use of the base editing system of the present invention in preparing a pharmaceutical composition for treating diseases in a subject in need, wherein the base editing system is used to modify genes related to the disease.
- the present invention also provides a pharmaceutical composition for treating diseases in a subject in need, which comprises the base editing system of the present invention, and optionally a pharmaceutically acceptable carrier, wherein the base editing system is used for modification and The genes related to the disease.
- the subject is a mammal, such as a human.
- diseases include, but are not limited to, tumors, inflammation, Parkinson's disease, cardiovascular diseases, Alzheimer's disease, autism, drug addiction, age-related macular degeneration, schizophrenia, genetic diseases and the like.
- the present invention also includes a kit for the method of the present invention, which includes the base editing fusion protein of the present invention and/or an expression construct containing a nucleotide sequence encoding the base editing fusion protein, or The base editing system of the present invention.
- the kit generally includes a label indicating the intended use and/or method of use of the contents of the kit.
- the term label includes any written or recorded material provided on or with the kit or otherwise provided with the kit.
- the kit of the present invention may also contain suitable materials for constructing the expression vector in the base editing system of the present invention.
- the kit of the present invention may also include reagents suitable for transforming the base editing fusion protein or base editing composition of the present invention into cells.
- the cytidine deaminase, adenosine deaminase, nCas9 (D10A) and UGI parts of STEME-1, STEME-2, STEME-3 and STEME-4 are codon-optimized for cereal plants and synthesized commercially (GENEWIZ, Suzhou , China).
- the Cas9 variant nCas9-NG (D10A) containing the R1335V/L1111R/D1135V/G1218R/E1219F/A1322R/T1337R mutation was obtained from the mutant nCas9 (Mut Express MultiS Fast Mutagenesis Kit, Vazyme, Nanjing, China) by the Gibson assembly method D10A) obtained by mutation.
- the japonica rice variety Nipponbare was used to prepare the protoplasts used in this study. Protoplast isolation and transformation were performed as described previously (reference 3). 10 ⁇ g of base editor and sgRNA plasmid DNA were introduced into protoplasts by PEG-mediated transfection, and the average transformation efficiency was measured to be 40-55%. The transfected protoplasts were incubated at 23°C. 60 hours after transfection, protoplasts were collected to extract genomic DNA for deep sequencing of amplicons.
- PCR was directly performed to amplify the protospacer of the binary vector from the transgenic callus.
- the barcode is added to both ends of the PCR product through primers.
- two rounds of PCR were performed. In the first round of PCR, site-specific primers are used to amplify the target region.
- forward and reverse barcodes were added to the ends of PCR products for library construction. The same amount of PCR products were combined and purified by gel DNA extraction, and commercial sequencing was performed on the samples using the Illumina NextSeq 500 platform (Genewiz, Suzhou, China). Check the pre-spacer sequence in the sequencing reads to analyze C>T and/or A>G substitutions and indels. Using genomic DNA extracted from three independent protoplast samples, repeat amplicon sequencing for each target sequence three times.
- CBE cytosine base editor
- ABE adenine base editor
- a new type of base editor is designed, which can simultaneously generate C:G>T:A and A:T>G:C on the same target site with only one sgRNA.
- Mutation in order to carry out endogenous sequence targeted saturation mutagenesis (saturated targeting endogenous mutagenesis, STEM) on the selected target gene.
- the inventors fused cytosine deaminase and adenine deaminase into a new deaminase, and developed a saturated targeting endogenous mutagenesis editor (STEME) for endogenous sequence targeting.
- the components of STEME also include nCas9 (D10A) and uracil DNA glycosylase inhibitor (UGI) ( Figure 1a).
- the fused deaminase can deaminate C and/or A in the deamination window, nCas9 can promote the mismatch repair mechanism (mismatch repair, MMR) in the cell, and UGI is used to inhibit uracil DNA glycosylase (uracil DNA glycosylase, UDG), so that the damaged DNA chain can be repaired according to the target chain being deamination.
- MMR mismatch repair
- UGI uracil DNA glycosylase
- UDG uracil DNA glycosylase
- PABE-7 is composed of artificially evolved ecTadA-ecTadA7.10 heterodimer and nCas9 with 3 NLS at the N-terminal.
- the inventors designed two forms of fusion deaminase: APOBEC3A-ecTadA-ecTadA7.10 and ecTadA-ecTadA7.10-APOBEC3A , And fused them to the N-terminal of nCas9 (D10A) respectively, and fused one UGI or two UGIs freely expressed (through the T2A connecting peptide) to the C-terminal of nCas9 (D10A) to construct STEME-1, STEME -2, STEME-3 and STEME-4 four carriers (Figure 1b). All STEME vectors are optimized according to the codons of the crops, and expressed by the Ubiquitin-1 (Ubi-1) promoter of maize.
- Ubiquitin-1 Ubiquitin-1
- the 20nt sgRNA spacer sequence was constructed on the esgRNA vector driven by the OsU3 promoter ( Figure 2).
- A3A-PBE and PABE-7 were used as controls for C>T and A>G, and wild-type Cas9 was used as a control for indel production.
- Perform amplicon sequencing on each sample get about 30,000-310,000 sequence reads per sample, and analyze the base editing efficiency.
- the results show that all four STEME vectors can produce high-efficiency C>T and/or A>G base transversion in rice protoplasts.
- STEME-1 has the highest C>T efficiency (0.1%-61.61%) ( Figure 3a and Figure 4).
- the base editing window of STEME to C>T is the same as that of A3A-PBE, which is C1 to C17 ( Figure 3a and Figure 4).
- the editing efficiency of STEME-1 from C5 to C14 averaged 29.59%, which was 1.3 times that of A3A-PBE ( Figure 3a and Figure 4).
- STEME-1 still has the highest A>G efficiency (0.69%-15.5%) among the four vectors ( Figure 3b and Figure 4).
- the A>G base editing window of STEME is the same as PABE-7, ranging from A4 to A8, but the four types of STEME (0.07%-15.5%) are all higher than the A>G base produced by PABE-7 (1.74%-21.54%) Editing efficiency is low ( Figure 3b and Figure 4).
- STEME According to the amplicon sequencing data of rice protoplasts, STEME has high product purity at the six target sites tested (Figure 4), and the efficiency of indels produced by it is consistent with that of the untreated group, much lower than that of Cas9 (6.3%-15.61%) (Figure 6).
- STEME especially STEME-1
- STEME-1 can simultaneously achieve C:G>T:A and/or A:T>G:C base transversion using only one sgRNA.
- STEME-1 produces higher C:G>T:A base editing efficiency from C5 to C14 than A3A-PBE, and the base transversion of A:T>G:C can increase the target of directed evolution through saturation mutation To the mutation type.
- Cas9 derived from Streptococcus pyogenes requires NGG PAM in the target sequence, which limits the number of sgRNAs available on the rice genome.
- the bio-information analysis of the rice reference genome showed that the use of Cas9-NG (VRVRFRR) extended the editable range to 79%, while the use of Cas9 could only target 19% of the rice genome ( Figure 7a). Therefore, in order to expand the editing scope of STEME, nCas9 (D10A) on STEME-1 was replaced with nCas9-NG (D10A), and STEME-NG was constructed ( Figure 7b).
- A3A-PBE-NG, PABE7-NG and pCas9-NG were also constructed (Figure 8a).
- a 20nt target sequence with PAM as NGA, NGT, NGC and NGG was designed to reduce the impact of chromatin state ( Figure 8b and c), and the target sequence Constructed on pOsU3-esgRNA vector.
- STEME-NG has editing activity on target sequences with PAM as NGA, NGT, NGC and NGG ( Figure 9 and Figure 10).
- STEME-NG can edit the cytosine in C1 to C17 and the adenine in A4 to A8, but like STEME-1, the base editing efficiency of A>G is lower than that of C>T ( Figure 9).
- the activity of STEME-NG on the NGG PAM target is lower than that of STEME-1 ( Figure 9 and Figure 10), and it has higher editing on NGA and NGT PAM Activity ( Figure 9 and Figure 10).
- STEME-NG has a higher editing activity on the OsODEV-NGC target, and the editing activity on the other three NGC targets is relatively low ( Figure 9 and Figure 10) .
- STEME-NG The activities of A3A-PBE-NG and PABE7-NG at the target sites of NGA, NGT, NGC and NGG PAM are consistent with those of STEME-NG ( Figure 11).
- STEME-NG, A3A-PBE-NG and PABE7-NG all have lower indels values compared with pCas9-NG.
- STEME-NG greatly expands the base editing range of C>T and/or A>G on the genome, and promotes mutations produced by saturation mutations and directed evolution in plants.
- This example uses OsACC as an example to illustrate the ability of STEME-mediated saturation de novo mutation to produce directed evolution in protoplasts.
- Acetyl-coenzyme A carboxylase is a key enzyme in the lipid synthesis pathway, and the carboxyltransferase domain (CT) is the herbicide-resistant active site of this enzyme ( Figure 12a).
- CT carboxyltransferase domain
- these 20 target sites can cover 90.32% of C, 40.43% of A, 77.78% of G and 38.89% of T on the coding chain, covering a total of 61.31% of the bases on the coding chain ( Figure 12a, Table 3).
- STEME-NG produced a total of 212 mutation types, which is 2.7 times that of A3A-PBE-NG.
- 212 mutation types 18.4% of the mutations were caused by the simultaneous mutations of C:G>T:A and A:T>G:C.
- pCas9-NG still has a higher indel efficiency (0.32%-39.72%) than STEME-NG and A3A-PBE-NG.
- STEME-NG can generate a variety of mutation types in the coding chain. Different from the recently reported directed evolution methods that rely on bacteria and yeasts (such as PACE, EvolvR, CREATE, CHAnGE, etc.), STEME will be able to directly generate saturated de novo mutation types in situ and can be used for directed evolution of plant proteins .
- STEME-1 APOBEC3A-48aa linker-ecTadA-32aa linker-ecTadA7.10-32aa linker-nCas9(D10A)-NLS-UGI-NLS) coding sequence
- STEME-NG (APOBEC3A-48aa linker-ecTadA-32aa linker-ecTadA7.10-32aa linker-nCas9-NG(D10A)-NLS-UGI-NLS) coding sequence
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Biomedical Technology (AREA)
- Zoology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Crystallography & Structural Chemistry (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
L'invention concerne une protéine de fusion d'édition de bases, comprenant un domaine de ciblage d'acide nucléique, un domaine cytosine désaminase et un domaine d'adénine désaminase. L'invention concerne également un système d'édition de bases et un procédé de génération d'au moins une cellule génétiquement modifiée.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202080059623.XA CN114945670A (zh) | 2019-08-20 | 2020-08-20 | 一种碱基编辑系统和其使用方法 |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910767418 | 2019-08-20 | ||
CN201910767418.8 | 2019-08-20 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021032155A1 true WO2021032155A1 (fr) | 2021-02-25 |
Family
ID=74659768
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/110207 WO2021032155A1 (fr) | 2019-08-20 | 2020-08-20 | Système d'édition de bases et son procédé d'utilisation |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN114945670A (fr) |
WO (1) | WO2021032155A1 (fr) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113201517A (zh) * | 2021-05-12 | 2021-08-03 | 广州大学 | 一种胞嘧啶单碱基编辑器工具及其应用 |
CN115704015A (zh) * | 2021-08-12 | 2023-02-17 | 清华大学 | 基于腺嘌呤和胞嘧啶双碱基编辑器的靶向诱变系统 |
WO2024051850A1 (fr) * | 2022-09-09 | 2024-03-14 | 中国科学院遗传与发育生物学研究所 | Système et procédé d'édition du génome fondé sur une adn polymérase |
CN118086285A (zh) * | 2024-04-23 | 2024-05-28 | 天津凯莱英生物科技有限公司 | 蛋白定向进化的方法 |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115820603B (zh) * | 2022-11-15 | 2024-07-05 | 吉林大学 | 一种基于dCasRx-NSUN6单基因特异性M5C修饰编辑方法 |
CN116751799B (zh) * | 2023-06-14 | 2024-01-26 | 江南大学 | 一种多位点双重碱基编辑器及其应用 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109517841A (zh) * | 2018-12-05 | 2019-03-26 | 华东师范大学 | 一种用于核苷酸序列修饰的组合物、方法与应用 |
CN109957569A (zh) * | 2017-12-22 | 2019-07-02 | 中国科学院遗传与发育生物学研究所 | 基于cpf1蛋白的碱基编辑系统和方法 |
WO2019147014A1 (fr) * | 2018-01-23 | 2019-08-01 | 기초과학연구원 | Arn guide simple étendu et utilisation associée |
CN110835634A (zh) * | 2018-08-15 | 2020-02-25 | 华东师范大学 | 一种新型碱基转换编辑系统及其应用 |
-
2020
- 2020-08-20 WO PCT/CN2020/110207 patent/WO2021032155A1/fr active Application Filing
- 2020-08-20 CN CN202080059623.XA patent/CN114945670A/zh active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109957569A (zh) * | 2017-12-22 | 2019-07-02 | 中国科学院遗传与发育生物学研究所 | 基于cpf1蛋白的碱基编辑系统和方法 |
WO2019147014A1 (fr) * | 2018-01-23 | 2019-08-01 | 기초과학연구원 | Arn guide simple étendu et utilisation associée |
CN110835634A (zh) * | 2018-08-15 | 2020-02-25 | 华东师范大学 | 一种新型碱基转换编辑系统及其应用 |
CN109517841A (zh) * | 2018-12-05 | 2019-03-26 | 华东师范大学 | 一种用于核苷酸序列修饰的组合物、方法与应用 |
Non-Patent Citations (2)
Title |
---|
LI CHAO; ZHANG RUI; MENG XIANGBING; CHEN SHA; ZONG YUAN; LU CHUNJU; QIU JIN-LONG; CHEN YU-HANG; LI JIAYANG; GAO CAIXIA: "Targeted, random mutagenesis of plant genes with dual cytosine and adenine base editors", NATURE BIOTECHNOLOGY, GALE GROUP INC., NEW YORK, US, vol. 38, no. 7, 13 January 2020 (2020-01-13), us, pages 875 - 882, XP037187539, ISSN: 1087-0156, DOI: 10.1038/s41587-019-0393-7 * |
LIU, JIAHUI ET AL.: "Research Progress of Base Editing System", WORLD SCI-TECH R&D, vol. 39, no. 6, 31 December 2017 (2017-12-31), XP055763468, ISSN: 1006-6055, DOI: 10.16507/j.issn.1006-6055.2017.09.004 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113201517A (zh) * | 2021-05-12 | 2021-08-03 | 广州大学 | 一种胞嘧啶单碱基编辑器工具及其应用 |
CN113201517B (zh) * | 2021-05-12 | 2022-11-01 | 广州大学 | 一种胞嘧啶单碱基编辑器工具及其应用 |
CN115704015A (zh) * | 2021-08-12 | 2023-02-17 | 清华大学 | 基于腺嘌呤和胞嘧啶双碱基编辑器的靶向诱变系统 |
WO2024051850A1 (fr) * | 2022-09-09 | 2024-03-14 | 中国科学院遗传与发育生物学研究所 | Système et procédé d'édition du génome fondé sur une adn polymérase |
CN118086285A (zh) * | 2024-04-23 | 2024-05-28 | 天津凯莱英生物科技有限公司 | 蛋白定向进化的方法 |
Also Published As
Publication number | Publication date |
---|---|
CN114945670A (zh) | 2022-08-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2021032155A1 (fr) | Système d'édition de bases et son procédé d'utilisation | |
US11820990B2 (en) | Method for base editing in plants | |
WO2019120310A1 (fr) | Système et procédé d'édition de bases reposant sur la protéine cpf1 | |
US11447785B2 (en) | Method for base editing in plants | |
KR20200103769A (ko) | 연장된 단일 가이드 rna 및 그 용도 | |
CN108866092A (zh) | 抗除草剂基因的产生及其用途 | |
CN107027313A (zh) | 用于多元rna引导的基因组编辑和其它rna技术的方法和组合物 | |
WO2021185358A1 (fr) | Procédé d'amélioration de la transformation génétique de plantes et de l'efficacité d'édition génomique | |
US20210403901A1 (en) | Targeted mutagenesis using base editors | |
WO2021175289A1 (fr) | Procédé et système d'édition de génome multiplex | |
WO2023169454A1 (fr) | Adénine désaminase et son utilisation dans la réécriture de base | |
WO2021082830A1 (fr) | Procédé de modification ciblée de séquence de génome de plante | |
JP2022511508A (ja) | ゲノム編集による遺伝子サイレンシング | |
WO2023169410A1 (fr) | Cytosine désaminase et son utilisation dans l'édition de bases | |
CN112805385B (zh) | 基于人apobec3a脱氨酶的碱基编辑器及其用途 | |
CN117295817A (zh) | Dna修饰酶及其活性片段和变体以及使用方法 | |
WO2021175288A1 (fr) | Système amélioré d'édition de base de cytosine | |
WO2023227050A1 (fr) | Procédé pour l'insertion spécifique d'un site d'une séquence exogène dans le génome | |
WO2022199665A1 (fr) | Procédé pour améliorer l'efficacité de la transformation génétique des plantes et de l'édition génétique | |
WO2024051850A1 (fr) | Système et procédé d'édition du génome fondé sur une adn polymérase | |
WO2022127894A1 (fr) | Plante résistante aux herbicides | |
WO2023232109A1 (fr) | Nouveau système d'édition de gène crispr | |
US20230227835A1 (en) | Method for base editing in plants | |
IL303583A (en) | Cannabis plant resistant to herbicides |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20854097 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20854097 Country of ref document: EP Kind code of ref document: A1 |