CN113913499A - Method for detecting target mutation by using Cas12j effector protein - Google Patents
Method for detecting target mutation by using Cas12j effector protein Download PDFInfo
- Publication number
- CN113913499A CN113913499A CN202011567811.1A CN202011567811A CN113913499A CN 113913499 A CN113913499 A CN 113913499A CN 202011567811 A CN202011567811 A CN 202011567811A CN 113913499 A CN113913499 A CN 113913499A
- Authority
- CN
- China
- Prior art keywords
- nucleic acid
- target
- grna
- target nucleic
- mutation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000035772 mutation Effects 0.000 title claims abstract description 165
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 137
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 104
- 238000000034 method Methods 0.000 title claims abstract description 56
- 239000012636 effector Substances 0.000 title claims abstract description 52
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 227
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 227
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 226
- 108020005004 Guide RNA Proteins 0.000 claims abstract description 154
- 238000010453 CRISPR/Cas method Methods 0.000 claims abstract description 17
- 238000010354 CRISPR gene editing Methods 0.000 claims abstract description 15
- 230000007026 protein scission Effects 0.000 claims abstract description 5
- 108091033409 CRISPR Proteins 0.000 claims abstract 14
- 238000001514 detection method Methods 0.000 claims description 82
- 230000008685 targeting Effects 0.000 claims description 22
- 239000003153 chemical reaction reagent Substances 0.000 claims description 19
- 108020004414 DNA Proteins 0.000 claims description 18
- 238000012217 deletion Methods 0.000 claims description 17
- 230000037430 deletion Effects 0.000 claims description 17
- 239000000203 mixture Substances 0.000 claims description 17
- 102000053602 DNA Human genes 0.000 claims description 13
- -1 for example Proteins 0.000 claims description 11
- 238000006467 substitution reaction Methods 0.000 claims description 10
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 8
- 108020004682 Single-Stranded DNA Proteins 0.000 claims description 7
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 6
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 claims description 6
- 238000002360 preparation method Methods 0.000 claims description 5
- 108091028664 Ribonucleotide Proteins 0.000 claims description 4
- 239000005547 deoxyribonucleotide Substances 0.000 claims description 4
- 125000002637 deoxyribonucleotide group Chemical group 0.000 claims description 4
- 239000002336 ribonucleotide Substances 0.000 claims description 4
- 125000002652 ribonucleotide group Chemical group 0.000 claims description 4
- 125000000539 amino acid group Chemical group 0.000 claims description 3
- 239000010931 gold Substances 0.000 claims description 3
- 229910052737 gold Inorganic materials 0.000 claims description 3
- 239000002105 nanoparticle Substances 0.000 claims description 3
- 239000006185 dispersion Substances 0.000 claims description 2
- 238000000835 electrochemical detection Methods 0.000 claims description 2
- 238000002875 fluorescence polarization Methods 0.000 claims description 2
- 239000004065 semiconductor Substances 0.000 claims description 2
- 230000007704 transition Effects 0.000 claims description 2
- 238000007792 addition Methods 0.000 claims 1
- 230000003321 amplification Effects 0.000 description 26
- 230000000694 effects Effects 0.000 description 26
- 238000003199 nucleic acid amplification method Methods 0.000 description 26
- 125000003729 nucleotide group Chemical group 0.000 description 19
- 238000005516 engineering process Methods 0.000 description 16
- 239000002773 nucleotide Substances 0.000 description 16
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 15
- 238000003780 insertion Methods 0.000 description 15
- 230000037431 insertion Effects 0.000 description 15
- 125000006850 spacer group Chemical group 0.000 description 15
- 238000011144 upstream manufacturing Methods 0.000 description 15
- 210000004027 cell Anatomy 0.000 description 14
- 239000012634 fragment Substances 0.000 description 13
- 208000035657 Abasia Diseases 0.000 description 12
- YLQBMQCUIZJEEH-UHFFFAOYSA-N Furan Chemical compound C=1C=COC=1 YLQBMQCUIZJEEH-UHFFFAOYSA-N 0.000 description 12
- 230000000295 complement effect Effects 0.000 description 9
- 238000012986 modification Methods 0.000 description 9
- 230000004048 modification Effects 0.000 description 9
- 239000000523 sample Substances 0.000 description 9
- 238000007397 LAMP assay Methods 0.000 description 8
- 241000700605 Viruses Species 0.000 description 8
- 238000003776 cleavage reaction Methods 0.000 description 8
- 230000007017 scission Effects 0.000 description 8
- 229960002685 biotin Drugs 0.000 description 7
- 235000020958 biotin Nutrition 0.000 description 7
- 239000011616 biotin Substances 0.000 description 7
- 210000001519 tissue Anatomy 0.000 description 7
- 108090001008 Avidin Proteins 0.000 description 6
- 108090000790 Enzymes Proteins 0.000 description 6
- 102000004190 Enzymes Human genes 0.000 description 6
- 230000027455 binding Effects 0.000 description 6
- 238000007834 ligase chain reaction Methods 0.000 description 6
- 239000013598 vector Substances 0.000 description 6
- 241000196324 Embryophyta Species 0.000 description 5
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 5
- 239000013612 plasmid Substances 0.000 description 5
- 102000040430 polynucleotide Human genes 0.000 description 5
- 108091033319 polynucleotide Proteins 0.000 description 5
- 239000002157 polynucleotide Substances 0.000 description 5
- WYURNTSHIVDZCO-UHFFFAOYSA-N Tetrahydrofuran Chemical compound C1CCOC1 WYURNTSHIVDZCO-UHFFFAOYSA-N 0.000 description 4
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 4
- 108010013835 arginine glutamate Proteins 0.000 description 4
- 108010068380 arginylarginine Proteins 0.000 description 4
- 239000012472 biological sample Substances 0.000 description 4
- 238000001574 biopsy Methods 0.000 description 4
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 4
- 201000010099 disease Diseases 0.000 description 4
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 4
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 4
- 108090000765 processed proteins & peptides Proteins 0.000 description 4
- 125000006853 reporter group Chemical group 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 4
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 3
- 229930024421 Adenine Natural products 0.000 description 3
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 3
- 241000282414 Homo sapiens Species 0.000 description 3
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- 238000011529 RT qPCR Methods 0.000 description 3
- 108010091086 Recombinases Proteins 0.000 description 3
- 102000018120 Recombinases Human genes 0.000 description 3
- 229960000643 adenine Drugs 0.000 description 3
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 3
- 150000001413 amino acids Chemical class 0.000 description 3
- 210000004369 blood Anatomy 0.000 description 3
- 239000008280 blood Substances 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 238000007796 conventional method Methods 0.000 description 3
- 238000001962 electrophoresis Methods 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- 108010050848 glycylleucine Proteins 0.000 description 3
- 208000015181 infectious disease Diseases 0.000 description 3
- 238000011901 isothermal amplification Methods 0.000 description 3
- 108010034529 leucyl-lysine Proteins 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 210000000056 organ Anatomy 0.000 description 3
- 229920001184 polypeptide Polymers 0.000 description 3
- 102000004196 processed proteins & peptides Human genes 0.000 description 3
- 238000010791 quenching Methods 0.000 description 3
- 230000000171 quenching effect Effects 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 2
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 2
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 241000711573 Coronaviridae Species 0.000 description 2
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 2
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 2
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 2
- 108060004795 Methyltransferase Proteins 0.000 description 2
- 241000204031 Mycoplasma Species 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- 108091093037 Peptide nucleic acid Proteins 0.000 description 2
- 206010036790 Productive cough Diseases 0.000 description 2
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 2
- 108091028113 Trans-activating crRNA Proteins 0.000 description 2
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 2
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 230000008827 biological function Effects 0.000 description 2
- 239000013592 cell lysate Substances 0.000 description 2
- 238000005520 cutting process Methods 0.000 description 2
- 229940104302 cytosine Drugs 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 238000003745 diagnosis Methods 0.000 description 2
- 238000006073 displacement reaction Methods 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 238000009396 hybridization Methods 0.000 description 2
- FDGQSTZJBFJUBT-UHFFFAOYSA-N hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 244000000010 microbial pathogen Species 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 244000052769 pathogen Species 0.000 description 2
- 230000001717 pathogenic effect Effects 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- 125000004437 phosphorous atom Chemical group 0.000 description 2
- 229910052698 phosphorus Inorganic materials 0.000 description 2
- 210000002381 plasma Anatomy 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 210000003296 saliva Anatomy 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 210000003802 sputum Anatomy 0.000 description 2
- 208000024794 sputum Diseases 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 229940113082 thymine Drugs 0.000 description 2
- 229940035893 uracil Drugs 0.000 description 2
- 210000002700 urine Anatomy 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- UDGUGZTYGWUUSG-UHFFFAOYSA-N 4-[4-[[2,5-dimethoxy-4-[(4-nitrophenyl)diazenyl]phenyl]diazenyl]-n-methylanilino]butanoic acid Chemical compound COC=1C=C(N=NC=2C=CC(=CC=2)N(C)CCCC(O)=O)C(OC)=CC=1N=NC1=CC=C([N+]([O-])=O)C=C1 UDGUGZTYGWUUSG-UHFFFAOYSA-N 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 1
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- PXAFZDXYEIIUTF-LKTVYLICSA-N Ala-Trp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXAFZDXYEIIUTF-LKTVYLICSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 1
- XPSGESXVBSQZPL-SRVKXCTJSA-N Arg-Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XPSGESXVBSQZPL-SRVKXCTJSA-N 0.000 description 1
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- BBYTXXRNSFUOOX-IHRRRGAJSA-N Arg-Cys-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BBYTXXRNSFUOOX-IHRRRGAJSA-N 0.000 description 1
- RYRQZJVFDVWURI-SRVKXCTJSA-N Arg-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N RYRQZJVFDVWURI-SRVKXCTJSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 1
- HPSVTWMFWCHKFN-GARJFASQSA-N Arg-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O HPSVTWMFWCHKFN-GARJFASQSA-N 0.000 description 1
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- CVKOQHYVDVYJSI-QTKMDUPCSA-N Arg-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N)O CVKOQHYVDVYJSI-QTKMDUPCSA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- CVXXSWQORBZAAA-SRVKXCTJSA-N Arg-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N CVXXSWQORBZAAA-SRVKXCTJSA-N 0.000 description 1
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 1
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 1
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 1
- MNBHKGYCLBUIBC-UFYCRDLUSA-N Arg-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCNC(N)=N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MNBHKGYCLBUIBC-UFYCRDLUSA-N 0.000 description 1
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 1
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 1
- JPAWCMXVNZPJLO-IHRRRGAJSA-N Arg-Ser-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JPAWCMXVNZPJLO-IHRRRGAJSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- QJWLLRZTJFPCHA-STECZYCISA-N Arg-Tyr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QJWLLRZTJFPCHA-STECZYCISA-N 0.000 description 1
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 1
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 1
- 206010053555 Arthritis bacterial Diseases 0.000 description 1
- 206010003445 Ascites Diseases 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- DNYRZPOWBTYFAF-IHRRRGAJSA-N Asn-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)O DNYRZPOWBTYFAF-IHRRRGAJSA-N 0.000 description 1
- SQZIAWGBBUSSPJ-ZKWXMUAHSA-N Asn-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N SQZIAWGBBUSSPJ-ZKWXMUAHSA-N 0.000 description 1
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 1
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 1
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 1
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 1
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 1
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 1
- SWTQDYFZVOJVLL-KKUMJFAQSA-N Asp-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)O SWTQDYFZVOJVLL-KKUMJFAQSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- YODBPLSWNJMZOJ-BPUTZDHNSA-N Asp-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N YODBPLSWNJMZOJ-BPUTZDHNSA-N 0.000 description 1
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 1
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 1
- 241000589941 Azospirillum Species 0.000 description 1
- 241000606125 Bacteroides Species 0.000 description 1
- 208000025721 COVID-19 Diseases 0.000 description 1
- 108010040467 CRISPR-Associated Proteins Proteins 0.000 description 1
- 241000589876 Campylobacter Species 0.000 description 1
- 108700004991 Cas12a Proteins 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- VNLYIYOYUNGURO-ZLUOBGJFSA-N Cys-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N VNLYIYOYUNGURO-ZLUOBGJFSA-N 0.000 description 1
- MRVSLWQRNWEROS-SVSWQMSJSA-N Cys-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CS)N MRVSLWQRNWEROS-SVSWQMSJSA-N 0.000 description 1
- XLLSMEFANRROJE-GUBZILKMSA-N Cys-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XLLSMEFANRROJE-GUBZILKMSA-N 0.000 description 1
- SAEVTQWAYDPXMU-KATARQTJSA-N Cys-Thr-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O SAEVTQWAYDPXMU-KATARQTJSA-N 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 241000186394 Eubacterium Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 241000589565 Flavobacterium Species 0.000 description 1
- 206010064571 Gene mutation Diseases 0.000 description 1
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 1
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 1
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 1
- QZQYITIKPAUDGN-GVXVVHGQSA-N Gln-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QZQYITIKPAUDGN-GVXVVHGQSA-N 0.000 description 1
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- CLROYXHHUZELFX-FXQIFTODSA-N Glu-Gln-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CLROYXHHUZELFX-FXQIFTODSA-N 0.000 description 1
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 1
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- 241000032681 Gluconacetobacter Species 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 1
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 1
- GWNIGUKSRJBIHX-STQMWFEESA-N Gly-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN)O GWNIGUKSRJBIHX-STQMWFEESA-N 0.000 description 1
- 108091093094 Glycol nucleic acid Proteins 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 201000005569 Gout Diseases 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- 108010014594 Heterogeneous Nuclear Ribonucleoprotein A1 Proteins 0.000 description 1
- YPLYIXGKCRQZGW-SRVKXCTJSA-N His-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YPLYIXGKCRQZGW-SRVKXCTJSA-N 0.000 description 1
- JBJNKUOMNZGQIM-PYJNHQTQSA-N His-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JBJNKUOMNZGQIM-PYJNHQTQSA-N 0.000 description 1
- TTZAWSKKNCEINZ-AVGNSLFASA-N His-Arg-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O TTZAWSKKNCEINZ-AVGNSLFASA-N 0.000 description 1
- FLYSHWAAHYNKRT-JYJNAYRXSA-N His-Gln-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FLYSHWAAHYNKRT-JYJNAYRXSA-N 0.000 description 1
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 1
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 1
- YERBCFWVWITTEJ-NAZCDGGXSA-N His-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CN=CN3)N)O YERBCFWVWITTEJ-NAZCDGGXSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 241000711467 Human coronavirus 229E Species 0.000 description 1
- 241001109669 Human coronavirus HKU1 Species 0.000 description 1
- 241000482741 Human coronavirus NL63 Species 0.000 description 1
- 241001428935 Human coronavirus OC43 Species 0.000 description 1
- 206010020649 Hyperkeratosis Diseases 0.000 description 1
- UGQMRVRMYYASKQ-UHFFFAOYSA-N Hypoxanthine nucleoside Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 UGQMRVRMYYASKQ-UHFFFAOYSA-N 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 208000004575 Infectious Arthritis Diseases 0.000 description 1
- 206010061218 Inflammation Diseases 0.000 description 1
- 108010065920 Insulin Lispro Proteins 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- 241000589248 Legionella Species 0.000 description 1
- 208000007764 Legionnaires' Disease Diseases 0.000 description 1
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- HGUUMQWGYCVPKG-DCAQKATOSA-N Leu-Pro-Cys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HGUUMQWGYCVPKG-DCAQKATOSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 1
- CNWDWAMPKVYJJB-NUTKFTJISA-N Leu-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CNWDWAMPKVYJJB-NUTKFTJISA-N 0.000 description 1
- WGAZVKFCPHXZLO-SZMVWBNQSA-N Leu-Trp-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N WGAZVKFCPHXZLO-SZMVWBNQSA-N 0.000 description 1
- 241000186781 Listeria Species 0.000 description 1
- GQUDMNDPQTXZRV-DCAQKATOSA-N Lys-Arg-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GQUDMNDPQTXZRV-DCAQKATOSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 1
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 1
- XGZDDOKIHSYHTO-SZMVWBNQSA-N Lys-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 XGZDDOKIHSYHTO-SZMVWBNQSA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 1
- FGAMAYQCWQCUNF-DCAQKATOSA-N Met-His-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FGAMAYQCWQCUNF-DCAQKATOSA-N 0.000 description 1
- NHXXGBXJTLRGJI-GUBZILKMSA-N Met-Pro-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O NHXXGBXJTLRGJI-GUBZILKMSA-N 0.000 description 1
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 1
- YNAVUWVOSKDBBP-UHFFFAOYSA-N Morpholine Natural products C1COCCN1 YNAVUWVOSKDBBP-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 241000588653 Neisseria Species 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 241001631646 Papillomaviridae Species 0.000 description 1
- 241001386753 Parvibaculum Species 0.000 description 1
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 1
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 1
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 1
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 1
- PTDAGKJHZBGDKD-OEAJRASXSA-N Phe-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O PTDAGKJHZBGDKD-OEAJRASXSA-N 0.000 description 1
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 1
- FRMKIPSIZSFTTE-HJOGWXRNSA-N Phe-Tyr-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FRMKIPSIZSFTTE-HJOGWXRNSA-N 0.000 description 1
- 208000002151 Pleural effusion Diseases 0.000 description 1
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 1
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 1
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 1
- 241000588769 Proteus <enterobacteria> Species 0.000 description 1
- 241000125945 Protoparvovirus Species 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 241001383286 Rochelia Species 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 241001256145 Satrapia Species 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- VMVNCJDKFOQOHM-GUBZILKMSA-N Ser-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N VMVNCJDKFOQOHM-GUBZILKMSA-N 0.000 description 1
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- NIOYDASGXWLHEZ-CIUDSAMLSA-N Ser-Met-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOYDASGXWLHEZ-CIUDSAMLSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- 206010040102 Seroma Diseases 0.000 description 1
- 241000949716 Sphaerochaeta Species 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 241000194017 Streptococcus Species 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 1
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 1
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 1
- QWMPARMKIDVBLV-VZFHVOOUSA-N Thr-Cys-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O QWMPARMKIDVBLV-VZFHVOOUSA-N 0.000 description 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 1
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 1
- 108091046915 Threose nucleic acid Proteins 0.000 description 1
- 241000589886 Treponema Species 0.000 description 1
- QNTBGBCOEYNAPV-CWRNSKLLSA-N Trp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O QNTBGBCOEYNAPV-CWRNSKLLSA-N 0.000 description 1
- HJTYJQVRIQXMHM-XIRDDKMYSA-N Trp-Asp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N HJTYJQVRIQXMHM-XIRDDKMYSA-N 0.000 description 1
- PZXUIGWOEWWFQM-SRVKXCTJSA-N Tyr-Asn-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O PZXUIGWOEWWFQM-SRVKXCTJSA-N 0.000 description 1
- XMNDQSYABVWZRK-BZSNNMDCSA-N Tyr-Asn-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XMNDQSYABVWZRK-BZSNNMDCSA-N 0.000 description 1
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 1
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 1
- QSFJHIRIHOJRKS-ULQDDVLXSA-N Tyr-Leu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QSFJHIRIHOJRKS-ULQDDVLXSA-N 0.000 description 1
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 1
- KGSDLCMCDFETHU-YESZJQIVSA-N Tyr-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O KGSDLCMCDFETHU-YESZJQIVSA-N 0.000 description 1
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 1
- QVYFTFIBKCDHIE-ACRUOGEOSA-N Tyr-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O QVYFTFIBKCDHIE-ACRUOGEOSA-N 0.000 description 1
- TYGHOWWWMTWVKM-HJOGWXRNSA-N Tyr-Tyr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 TYGHOWWWMTWVKM-HJOGWXRNSA-N 0.000 description 1
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 1
- WKWJJQZZZBBWKV-JYJNAYRXSA-N Val-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WKWJJQZZZBBWKV-JYJNAYRXSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- DLMNFMXSNGTSNJ-PYJNHQTQSA-N Val-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N DLMNFMXSNGTSNJ-PYJNHQTQSA-N 0.000 description 1
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 1
- ZLMFVXMJFIWIRE-FHWLQOOXSA-N Val-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N ZLMFVXMJFIWIRE-FHWLQOOXSA-N 0.000 description 1
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 1
- 229930003756 Vitamin B7 Natural products 0.000 description 1
- 206010000269 abscess Diseases 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 210000001742 aqueous humor Anatomy 0.000 description 1
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 238000011888 autopsy Methods 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 244000000007 bacterial human pathogen Species 0.000 description 1
- 244000052616 bacterial pathogen Species 0.000 description 1
- 230000037429 base substitution Effects 0.000 description 1
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 1
- 210000000941 bile Anatomy 0.000 description 1
- 239000013060 biological fluid Substances 0.000 description 1
- 210000003103 bodily secretion Anatomy 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 150000001735 carboxylic acids Chemical class 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 210000001175 cerebrospinal fluid Anatomy 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 210000004081 cilia Anatomy 0.000 description 1
- 230000001143 conditioned effect Effects 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 230000002380 cytological effect Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 210000000416 exudates and transudate Anatomy 0.000 description 1
- 210000003608 fece Anatomy 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 230000007614 genetic variation Effects 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 230000002440 hepatic effect Effects 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 230000004054 inflammatory process Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 125000005647 linker group Chemical group 0.000 description 1
- 210000002751 lymph Anatomy 0.000 description 1
- 125000001570 methylene group Chemical group [H]C([H])([*:1])[*:2] 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 210000003097 mucus Anatomy 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 150000003833 nucleoside derivatives Chemical class 0.000 description 1
- 201000008482 osteoarthritis Diseases 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 102000054765 polymorphisms of proteins Human genes 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 206010039073 rheumatoid arthritis Diseases 0.000 description 1
- 210000000582 semen Anatomy 0.000 description 1
- 201000001223 septic arthritis Diseases 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 210000001179 synovial fluid Anatomy 0.000 description 1
- MPLHNVLQVRSVEE-UHFFFAOYSA-N texas red Chemical compound [O-]S(=O)(=O)C1=CC(S(Cl)(=O)=O)=CC=C1C(C1=CC=2CCCN3CCCC(C=23)=C1O1)=C2C1=C(CCC1)C3=[N+]1CCCC3=C2 MPLHNVLQVRSVEE-UHFFFAOYSA-N 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 108010077037 tyrosyl-tyrosyl-phenylalanine Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 239000011782 vitamin Substances 0.000 description 1
- 229930003231 vitamin Natural products 0.000 description 1
- 235000013343 vitamin Nutrition 0.000 description 1
- 229940088594 vitamin Drugs 0.000 description 1
- 239000011735 vitamin B7 Substances 0.000 description 1
- 235000011912 vitamin B7 Nutrition 0.000 description 1
- 210000004127 vitreous body Anatomy 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
- C12Q1/6827—Hybridisation assays for detection of mutation or polymorphism
Abstract
The invention provides a method for detecting a target mutation by using a Cas12j effector protein. The method is a method of detecting the presence or absence of a mutation of interest in a target nucleic acid using a Cas12j effector protein, comprising contacting a sample with a type V CRISPR/Cas effector protein, a gRNA (guide RNA) comprising a region that binds to the CRISPR/Cas effector protein and a guide sequence that hybridizes to a mutant target nucleic acid, and a single-stranded nucleic acid detector; detecting a detectable signal generated by the CRISPR/CAS effector protein cleavage single-stranded nucleic acid detector, wherein the target mutation is positioned at positions 1-20, preferably positions 9-10 from the 5' end of the gRNA guide sequence.
Description
Technical Field
The invention relates to the field of nucleic acid detection, and relates to a method for detecting target mutation by using a CRISPR (clustered regularly interspaced short palindromic repeats) technology, in particular to a method for detecting target mutation by using Cas12j effector protein.
Background
The method for specifically detecting Nucleic acid molecules (Nucleic acid detection) has important application values, such as pathogen detection, genetic disease detection and the like. In the aspect of pathogen detection, each pathogenic microorganism has a unique characteristic nucleic acid molecule sequence, so that nucleic acid molecule detection for a specific species, also called Nucleic Acid Diagnostics (NADs), can be developed, and is important in the fields of food safety, detection of environmental microbial contamination, infection of human pathogenic bacteria, and the like. Another aspect is the detection of Single Nucleotide Polymorphisms (SNPs) in humans or other species. Understanding the relationship between genetic variation and biological functions at the genomic level provides a new perspective for modern molecular biology, and SNPs are closely related to biological functions, evolution, diseases and the like, so the development of detection and analysis techniques of SNPs is particularly important.
The detection of specific nucleic acid molecules established today usually requires two steps, the first step being the amplification of the nucleic acid of interest and the second step being the detection of the nucleic acid of interest. The existing detection technologies include restriction endonuclease methods, Southern, Northern, dot blot, fluorescent PCR detection technologies, LAMP loop-mediated isothermal amplification technologies, recombinase polymerase amplification technologies (RPA) and the like. After 2012, CRISPR gene editing technology arose, a new nucleic acid diagnosis technology (SHERLOCK technology) of targeted RNA with Cas13 as a core was developed by the zhanfeng team based on RPA technology, a diagnosis technology (DETECTR technology) with Cas12 enzyme as a core was developed by the Doudna team, and a new nucleic acid detection technology (HOLMES technology) based on Cas12 was also developed by the royal doctor of the institute of physiology and ecology of plants in the shanghai of the chinese academy of sciences. Nucleic acid detection techniques developed based on CRISPR technology are playing an increasingly important role.
The invention applies the CRISPR nucleic acid detection technology to the detection of whether the target nucleic acid has mutation in the target region and the detection of whether the target nucleic acid has the target mutation, and particularly provides a high-efficiency detection method.
Disclosure of Invention
The invention provides a method, a system and a kit for detecting whether a target mutation site exists in a target nucleic acid and detecting whether a mutation exists in a target region by using a Cas12j effector protein.
Method for detecting target mutation
In one aspect, the invention provides a method of detecting the presence or absence of a target mutation site in a target nucleic acid using a Cas12j effector protein, the method comprising contacting the target nucleic acid with a type V CRISPR/Cas effector protein, a gRNA (guide RNA) comprising a region that binds to the CRISPR/Cas effector protein and a guide sequence that hybridizes to a mutant target nucleic acid containing a mutation of interest, the guide sequence comprising a base that pairs with the target mutation site; detecting a detectable signal generated by the CRISPR/CAS effector protein cleavage single-stranded nucleic acid detector.
In one embodiment, the target mutation is a site where the wild-type target nucleic acid is not identical to the mutant target nucleic acid within the region targeted by the gRNA targeting sequence; since the guide sequence of the gRNA hybridizes to the mutant target nucleic acid, including the base pairing with the target mutation site, the target mutation site also refers to a site where the guide sequence of the gRNA does not coincide with the wild-type target nucleic acid sequence to which it is targeted.
In one embodiment, the guide sequence of the gRNA comprises at least 13 bases, e.g., 13-30 bases, e.g., 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, or 29 bases.
In one embodiment, the mutation of interest comprises a single base mutation, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen base mutations, or more base mutations; the target mutation may be a continuous base mutation or a discontinuous base mutation; preferably, the target mutation comprises a single base mutation or a two base mutation, and preferably, the two base mutations are mutations in two consecutive bases.
In one embodiment, the base that pairs with the mutation site of interest is located at one or more of positions 1-20, specifically 1 st, 2 nd, 3 rd, 4 th, 5 th, 6 th, 7 th, 8 th, 9 th, 10 th, 11 th, 12 th, 13 th, 14 th, 15 th, 16 th, 17 th, 18 th, 19 th, and 20 th, of the 5' end of the gRNA targeting sequence; preferably, one or more of positions 7-16 of the 5 'terminus, more preferably, positions 9 and/or 10 of the 5' terminus.
In one embodiment, the mutation of interest is a single base mutation, and the base pairing with the mutation site of interest is located at positions 1 to 20, specifically, positions 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, and 20, of the 5' end of the gRNA targeting sequence; preferably, the 7 th to 16 th positions of the 5 'end, more preferably, the 9 th or 10 th positions of the 5' end;
in the above method, the intensity of the detectable signal for detecting the mutant-type target nucleic acid is significantly different from the signal for detecting the wild-type target nucleic acid; specifically, the detectable signal for detecting the mutant-type target nucleic acid is significantly stronger than that for detecting the wild-type target nucleic acid, and in this case, the presence or absence of the target mutation site in the target nucleic acid can be determined based on the intensity of the detectable signal.
In one embodiment, the detectable signal of the wild-type target nucleic acid and the detectable signal of the mutant-type target nucleic acid may be different detectable signals, or the detectable signal of the wild-type target nucleic acid and the detectable signal of the mutant-type target nucleic acid may be the same detectable signal.
Preferably, the method further comprises the step of detecting the wild-type target nucleic acid for control, and the method further comprises the step of providing a standard wild-type target nucleic acid.
Preferably, the method further comprises the step of detecting the mutant target nucleic acid for control, and the method further comprises the step of providing a standard mutant target nucleic acid.
In one embodiment, the present invention also provides the use of the above-described type V CRISPR/CAS effector proteins, grnas (guide RNAs), and single-stranded nucleic acid detectors in the preparation of a reagent, composition, or kit for detecting the presence or absence of a mutation site of interest in a target nucleic acid.
In one embodiment, the target mutation site includes a substitution (substitution), insertion or deletion, preferably. The mutation is a point mutation.
In one embodiment, the target nucleic acid is a target nucleic acid for which the presence of a 500bp upstream (5 'end) to 500bp downstream (3' end) of the mutation site of interest, preferably a 300bp upstream (5 'end) to 300bp downstream (3' end) of the mutation site of interest, a 200bp upstream (5 'end) to 200bp downstream (3' end) of the mutation site of interest, more preferably a 100bp upstream (5 'end) to 100bp downstream (3' end) of the mutation site of interest, has been confirmed by an amplification method, the amplification method comprises common amplification methods such as PCR, NASBA, RPA, SDA, LAMP, HAD, NEAR, MDA, RCA, LCR, RAM and the like, preferably PCR method, the confirmation of the existence of the target mutation site upstream and downstream fragments is realized by a method for confirming the existence of an amplification product in a conventional mode such as electrophoresis, qPCR and the like after amplification. The method can confirm that an amplification product is obtained, but cannot confirm a specific sequence, particularly, cannot confirm the presence or absence of a mutation site of interest.
Method for detecting mutation in target region
In one aspect, the invention provides a method for detecting the presence or absence of a mutation in a target region using a Cas12j effector protein, the method comprising contacting a sample with a type V CRISPR/Cas effector protein, a gRNA (guide RNA) comprising a region that binds to the CRISPR/Cas effector protein and a guide sequence that hybridizes to a wild-type target nucleic acid, and a single-stranded nucleic acid detector; detecting a detectable signal generated by the CRISPR/CAS effector protein cleavage single-stranded nucleic acid detector.
In one embodiment, the guide sequence of the gRNA comprises at least 13 bases, e.g., 13-30 bases, e.g., 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, or 29 bases.
In one embodiment, the target mutation site includes a substitution (substitution), insertion or deletion, preferably. The mutation is a point mutation.
In one embodiment, the target region refers to the position of the target nucleic acid targeted at positions 1-20 of the 5' end of the gRNA targeting sequence, and the presence of a mutation refers to the presence of a single base mutation, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, thirteen, fourteen, fifteen, sixteen, seventeen, eighteen, nineteen, twenty base mutations within the target region; the mutation may be a continuous base mutation or a discontinuous base mutation.
In one embodiment, the target region refers to the position of the target nucleic acid targeted from position 7 to position 16 at the 5' end of the gRNA targeting sequence; the existence of the mutation refers to that the target region contains single base mutation, two, three, four, five, six, seven, eight, nine and ten base mutations, and the mutation can be continuous base mutation or discontinuous base mutation.
In one embodiment, the target region refers to the position of the target nucleic acid targeted from positions 9 to 10 at the 5' end of the gRNA targeting sequence; the existence of mutation refers to the fact that a target region contains single base mutation or two base mutation, and the existence of mutation refers to the fact that the 9 th position and/or the 10 th position of the target nucleic acid at the 5' end of the gRNA guide sequence has mutation.
In the above method, the wild-type target nucleic acid and the mutant-type target nucleic acid may generate significantly different detectable signals; specifically, the detectable signal for detecting the mutant-type target nucleic acid is significantly weaker than that for detecting the wild-type target nucleic acid, and in this case, the presence or absence of the target mutation site in the target nucleic acid can be determined based on the strength of the detectable signal.
In other embodiments, the present invention also provides the use of the above-described type V CRISPR/CAS effector proteins, grnas (guide RNAs), and single-stranded nucleic acid detectors in the preparation of a reagent, composition, or kit for detecting the presence or absence of a mutation in a target nucleic acid in a target region; preferably, the target region is the position of the target nucleic acid targeted from position 1 to position 20 at the 5 ' end of the gRNA targeting sequence, preferably, the target region is the position of the target nucleic acid targeted from position 7 to position 16 at the 5 ' end of the gRNA targeting sequence, and preferably, the target region is the position of the target nucleic acid targeted from position 9 to position 10 at the 5 ' end of the gRNA targeting sequence.
In one embodiment, the target nucleic acid is a target nucleic acid whose presence of a fragment of 500bp upstream (5 'end) to 500bp downstream (3' end) of the target region, preferably a fragment of 300bp upstream (5 'end) to 300bp downstream (3' end) of the target region, a fragment of 200bp upstream (5 'end) to 200bp downstream (3' end) of the target region, more preferably a fragment of 100bp upstream (5 'end) to 100bp downstream (3' end) of the target region is confirmed by an amplification method including a common amplification method such as PCR, NASBA, RPA, SDA, LAMP, HAD, NEAR, MDA, RCA, LCR, RAM, etc., preferably a PCR method, in which presence of a fragment upstream and downstream of the target mutation site is confirmed by a conventional method such as electrophoresis, qPCR, etc., and which can confirm the presence of an amplification product, however, the specific sequence, particularly the presence or absence of the mutation site of interest, cannot be confirmed.
Reagents, kits, compositions
In one aspect, the present invention provides a reagent, kit or composition for detecting the presence or absence of a mutation site of interest in a target nucleic acid using a Cas12j effector protein, the reagent, kit or composition comprising the above-described CRISPR/Cas effector protein type V, a gRNA (guide RNA) comprising a region binding to the CRISPR/Cas effector protein and a guide sequence hybridized to a mutant target nucleic acid containing a mutation of interest, the guide sequence comprising a base pairing with the mutation site of interest, and a single-stranded nucleic acid detector.
In one aspect, the present invention provides a reagent, kit or composition for detecting whether a target nucleic acid has a mutation in a target region using a Cas12j effector protein, the reagent, kit or composition comprising the above-described V-type CRISPR/Cas effector protein, a gRNA (guide RNA) comprising a region binding to the CRISPR/Cas effector protein and a guide sequence hybridizing to a wild-type target nucleic acid, the target region being the position of the target nucleic acid targeted at positions 1 to 13 and 20 from the 5 ' end of the gRNA guide sequence, preferably, the target region being the position of the target nucleic acid targeted at positions 7 to 16 from the 5 ' end of the gRNA guide sequence, preferably, the target region being the position of the target nucleic acid targeted at positions 9 to 10 from the 5 ' end of the gRNA guide sequence.
Single-stranded nucleic acid detector
In some embodiments, the single-stranded nucleic acid detector does not hybridize to the gRNA.
In one embodiment, the single-stranded nucleic acid detector comprises different reporter groups or marker molecules at both ends, which do not exhibit a reporter signal when in an initial state (i.e., non-cleaved state) and exhibit a detectable signal when cleaved, i.e., exhibit a detectable difference after cleavage from before cleavage.
In some embodiments, the single-stranded nucleic acid detector is provided with different reporter groups at its 5 'end and 3' end, respectively, which can exhibit a detectable reporter signal when the single-stranded nucleic acid detector is cleaved; for example, a single-stranded nucleic acid detector is provided with a fluorescent group and a quenching group at both ends thereof; or a first molecule (such as FAM or FITC) and a second molecule (such as biotin) connected to the 3' end are respectively arranged at two ends of the single-stranded nucleic acid detector.
When a fluorescent group and a quencher group are disposed at both ends of the single-stranded nucleic acid detector, respectively, a detectable fluorescent signal can be exhibited when the single-stranded nucleic acid detector is cleaved. The fluorescent group is selected from one or more of FAM, FITC, VIC, JOE, TET, CY3, CY5, ROX, Texas Red or LC RED 460. The quenching group is selected from one or more of BHQ1, BHQ2, BHQ3, Dabcy1 or Tamra.
When a first molecule (such as FAM or FITC) and a second molecule (such as biotin) are respectively arranged at two ends of the single-stranded nucleic acid detector, the reaction system containing the single-stranded nucleic acid detector is matched with the flow strip to detect the characteristic sequence (preferably, a colloidal gold detection mode). The flow strip is designed with two capture lines, with an antibody that binds to a first molecule (i.e. a first molecular antibody) at the sample contacting end (colloidal gold), an antibody that binds to the first molecular antibody at the first line (control line), and an antibody that binds to a second molecule (i.e. a second molecular antibody, such as avidin) at the second line (test line). As the reaction flows along the strip, the first molecular antibody binds to the first molecule carrying the cleaved or uncleaved oligonucleotide to the capture line, the cleaved reporter will bind to the antibody of the first molecular antibody at the first capture line, and the uncleaved reporter will bind to the second molecular antibody at the second capture line. Binding of the reporter group at each line will result in a strong readout/signal (e.g. color). As more reporters are cut, more signal will accumulate at the first capture line and less signal will appear at the second line.
In one embodiment, the single stranded nucleic acid detector comprises one or more of: 1) base modified nucleotides, 2) sugar modified nucleotides, 3) altered chemical bonds, 4) modified backbones.
In one embodiment, the nucleotide is one or more of ribonucleotide, deoxyribonucleotide, and nucleic acid analog; the base of the ribonucleotide is selected from one or more of adenine A, uracil U, cytosine C, guanine G, thymine T and hypoxanthine I; the base of the deoxyribonucleotide is selected from A, T, C, G, U, I or any of the bases.
In one embodiment, the base modification is the result of a chemical modification of the adenine, cytosine, guanine, uracil, or thymine component of a nucleotide. Other similar base modifications will be readily apparent to those skilled in the art and such other methods are intended to be within the scope of the present invention.
In one embodiment, the base-modified nucleotides further comprise an abasic spacer (single-stranded nucleic acid detectors comprising locked nucleic acids are also described in chinese application CN 2020108880363); the Spacer without base is selected from one or more of dSpacer, Spacer C3, Spacer C6, Spacer C12, Spacer9, Spacer12, Spacer18, inserted Abasic Site (dSpacer Abasic furan) and rAbasic Site (rSpacer Abasic furan).
In one embodiment, the glycosyl modified nucleotides include, 2' -fluoro modification, 2' oxymethyl modification, locked nucleic acid (single stranded nucleic acid detector comprising locked nucleic acid is also described in chinese application CN 2020105609327), bridge nucleic acid, morpholine nucleic acid, ethylene glycol nucleic acid, hexitol nucleic acid, threose nucleic acid, arabinose nucleic acid, 2' methoxyacetyl modification, 2' -amino modification, 4 ' -thio RNA, Peptide Nucleic Acid (PNA), cyclohexenyl nucleic acid (CENA), and combinations thereof; the base of the glycosyl modified nucleotide is selected from one or any more of A, U, C, G, T, I bases.
In one embodiment, the altered chemical bond comprises a modified nucleic acid backbone and a non-natural internucleoside linkage, and nucleic acids having a modified backbone comprise those that retain a phosphorus atom in the backbone and those that do not have a phosphorus atom in the backbone.
In one embodiment, the single stranded nucleic acid detector may be linear or circular.
In one embodiment, the detection method can be used for quantitative detection of the characteristic sequence to be detected. The quantitative detection index can be quantified according to the signal intensity of the reporter group, such as the luminous intensity of a fluorescent group, or the width of a color development strip.
CRISPR/CAS effector proteins
Further, the type V CRISPR/CAS effector protein is selected from the group consisting of:
(1) a protein shown as SEQ ID No. 1;
(2) derived proteins which are formed by substituting, deleting or adding one or more (such as 2, 3, 4, 5, 6, 7, 8, 9 or 10) amino acid residues of the amino acid sequence shown in SEQ ID No.1 or active fragments thereof and have basically the same functions;
(3) a protein having 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% identity to the sequence shown in SEQ ID No. 1.
In one embodiment, the Cas protein mutant comprises amino acid substitutions, deletions or substitutions, and the mutant retains at least its trans cleavage activity. Preferably, the mutant has Cis and trans cleavage activity.
Target nucleic acid
In the present invention, the target nucleic acid includes ribonucleotide or deoxyribonucleotide, and includes single-stranded nucleic acid, double-stranded nucleic acid such as single-stranded DNA, double-stranded DNA, single-stranded RNA, double-stranded RNA, DNA-RNA hybrid, or nucleic acid modification.
In one embodiment, the target nucleic acid is derived from a sample of a virus, bacterium, microorganism, soil, water source, human, animal, plant, or the like.
In one embodiment, the target nucleic acid is a product enriched or amplified by PCR, NASBA, RPA, SDA, LAMP, HAD, NEAR, MDA, RCA, LCR, RAM, or the like.
In one embodiment, the method further comprises the step of obtaining the target nucleic acid from the sample.
In one embodiment, the target nucleic acid is a viral nucleic acid, a bacterial nucleic acid, a specific nucleic acid associated with a disease, such as a specific mutation site or SNP site or a nucleic acid that is different from a control; preferably, the virus is a plant virus or an animal virus, e.g., papilloma virus, hepatic DNA virus, herpes virus, adenovirus, poxvirus, parvovirus, coronavirus; preferably, the virus is a coronavirus, preferably SARS, SARS-CoV2(COVID-19), HCoV-229E, HCoV-OC43, HCoV-NL63, HCoV-HKU1, Mers-CoV.
In some embodiments, the target nucleic acid is derived from a cell, e.g., from a cell lysate.
In one embodiment, the target nucleic acid is a target nucleic acid whose presence of a fragment of 500bp upstream (5 'end) to 500bp downstream (3' end) of the target region, preferably a fragment of 300bp upstream (5 'end) to 300bp downstream (3' end) of the target region, a fragment of 200bp upstream (5 'end) to 200bp downstream (3' end) of the target region, more preferably a fragment of 100bp upstream (5 'end) to 100bp downstream (3' end) of the target region is confirmed by an amplification method including a common amplification method such as PCR, NASBA, RPA, SDA, LAMP, HAD, NEAR, MDA, RCA, LCR, RAM, etc., preferably a PCR method, in which presence of a fragment upstream and downstream of the target mutation site is confirmed by a conventional method such as electrophoresis, qPCR, etc., and which can confirm the presence of an amplification product, however, the specific sequence, particularly the presence or absence of the mutation site of interest, cannot be confirmed. Detectable signal
In some embodiments, the methods of the invention further comprise the step of measuring a detectable signal produced by the CRISPR/CAS effector protein (CAS protein). After contacting with the gRNA and the target nucleic acid, the V-type CRISPR/CAS effector protein is excited to have trans activity, so that the single-stranded nucleic acid detector can be cut more efficiently, and a detectable signal is reflected.
In the present invention, the detectable signal may be any signal generated when the single-stranded nucleic acid detector is cleaved. For example, detection based on gold nanoparticles, fluorescence polarization, fluorescence signal, colloidal phase transition/dispersion, electrochemical detection, semiconductor-based sensing.
The detectable signal may be read by any suitable means, including but not limited to: measurement of a detectable fluorescent signal, gel electrophoresis detection (by detecting a change in a band on the gel), detection of the presence or absence of a color based on vision or a sensor, or a difference in the presence of a color (e.g., based on gold nanoparticles) and a difference in an electrical signal.
In some embodiments, the measurement of the detectable signal may be quantitative, and in other embodiments, the measurement of the detectable signal may be qualitative.
Ratio of
In one embodiment, the Cas protein and gRNA are used in a molar ratio of (0.8-1.2): 1.
in one embodiment, the Cas protein is used in a final concentration of 20-200nM, preferably, 30-100nM, more preferably, 40-80nM, more preferably, 50 nM.
In one embodiment, the gRNA is used in a final concentration of 20-200nM, preferably, 30-100nM, more preferably, 40-80nM, and more preferably, 50 nM.
In one embodiment, the target nucleic acid is used in a final concentration of 5-100nM, preferably, 10-50 nM.
In one embodiment, the single stranded nucleic acid detector is used at a final concentration of 100-.
Applications of
In another aspect, the invention also provides an application of Cas12i in preparing a composition, a reagent or a kit for detecting whether a target nucleic acid has a target mutation.
In another aspect, the invention also provides the application of Cas12i in preparing a composition, a reagent or a kit for detecting whether a target nucleic acid has a mutation in a target region.
In another aspect, the invention also provides the use of Cas12i as described above to detect the presence or absence of a mutation of interest in a target nucleic acid and to detect the presence or absence of a mutation in a region of interest in a target nucleic acid.
In another aspect, the present invention also provides the use of the above-described composition, reagent or kit for detecting the presence of a mutation of interest in a target nucleic acid and for detecting the presence of a mutation of a target nucleic acid in a region of interest.
In another aspect, the present invention also provides the use of the above-described mutant base in a non-target region to improve the efficiency of detecting the presence or absence of a mutation in a target nucleic acid in the target region.
General definition:
unless defined otherwise, technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art.
The terms "hybridize" or "complementary" or "substantially complementary" refer to a nucleic acid (e.g., RNA, DNA) that comprises a nucleotide sequence that enables it to bind non-covalently, i.e., to form base pairs and/or G/U base pairs with another nucleic acid in a sequence-specific, antiparallel manner (i.e., the nucleic acid binds specifically to the complementary nucleic acid), "anneal" or "hybridize". Hybridization requires that the two nucleic acids contain complementary sequences, although mismatches between bases are possible. Suitable conditions for hybridization between two nucleic acids depend on the length and degree of complementarity of the nucleic acids, variables well known in the art. Typically, the length of the hybridizable nucleic acid is 8 nucleotides or more (e.g., 10 nucleotides or more, 12 nucleotides or more, 15 nucleotides or more, 20 nucleotides or more, 22 nucleotides or more, 25 nucleotides or more, or 30 nucleotides or more).
It is understood that the sequence of a polynucleotide need not be 100% complementary to the sequence of its target nucleic acid to specifically hybridize. A polynucleotide may comprise 60% or more, 65% or more, 70% or more, 75% or more, 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or a target region that hybridizes thereto has 100% sequence complementarity of the target region.
The term "amino acid" refers to a carboxylic acid containing an amino group. Each protein in an organism is composed of 20 basic amino acids.
The terms "polynucleotide", "nucleotide sequence", "nucleic acid molecule" and "nucleic acid" are used interchangeably and include DNA, RNA or hybrids thereof, whether double-stranded or single-stranded.
The term "homology" or "identity" is used to refer to the match of sequences between two polypeptides or between two nucleic acids. When a position in both of the sequences being compared is occupied by the same base or amino acid monomer subunit (e.g., a position in each of two DNA molecules is occupied by adenine, or a position in each of two polypeptides is occupied by lysine), then the molecules are identical at that position. Between the two sequences. Typically, the comparison is made when the two sequences are aligned to yield maximum identity. Such an alignment can be determined by using, for example, the identity of the amino acid sequences by conventional methods, by computerized algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics package, Genetics, and Genetics Computer Group), with reference to, for example, the teachings of Smith and Waterman,1981, adv.Appl.Math.2:482Pearson Lipman,1988, Proc.Natl.Acad.Sci.USA85:2444, Thompson et al, 1994, Nucleic Acids Res 22:467380, etc. The BLAST algorithm, available from the national center for Biotechnology information (NCBI www.ncbi.nlm.nih.gov /), can also be used, determined using default parameters.
As used herein, "biotin", also known as vitamin H, is a small molecule vitamin with a molecular weight of 244 Da. "avidin", also called avidin, is a basic glycoprotein having 4 binding sites with extremely high affinity to biotin, and streptavidin is a commonly used avidin. The very strong affinity of biotin to avidin can be used to amplify or enhance the detection signal in the detection system. For example, biotin is easily bonded to a protein (such as an antibody) by a covalent bond, and an avidin molecule bonded to an enzyme reacts with a biotin molecule bonded to a specific antibody, so that not only is a multi-stage amplification effect achieved, but also color is developed due to the catalytic effect of the enzyme when the enzyme meets a corresponding substrate, and the purpose of detecting an unknown antigen (or antibody) molecule is achieved.
Spacer without base
As used herein, "Spacer-free" refers to a nucleoside that does not contain specific coding information. The abasic spacer may be associated with the oligonucleotide, including the 3 'and 5' ends, or within the nucleotide chain. Common spacers include: dSpacer (abacic furan), Spacer C3, Spacer C6, Spacer C12, Spacer9, Spacer12, Spacer18, inserted Abasic Site (dSpacer Abasic furan) and rAbasic Site (rSpacer Abasic furan).
Such abasic spacers are well known in the art and are disclosed, for example, in U.S. Pat. No. 4, 8153772, 2 to dSpacer, Spacer9, Spacer18, Spacer C3; chinese patent application CN101454451A discloses dSpacer.
Preferred herein are the abasic spacers "dspacers" also known as abasic sites, Tetrahydrofuran (THF) or apurinic/apyrimidic (ap) sites, or abasic linkers, wherein the methylene group is located at the 1-position of the 2' -deoxyribose. The dSpacer is not only very similar in structure to the native site, but is also quite stable. The structure is as follows:
the dSpacer, when in nucleotide linkage, may form the following structure:
target nucleic acid
As used herein, the "target nucleic acid" refers to a polynucleotide molecule extracted from a biological sample (sample to be tested). The biological sample is any solid or fluid sample obtained, excreted or secreted from any organism, including but not limited to single-celled organisms such as bacteria, yeasts, protozoa and amoebae and the like, multicellular organisms (e.g. plants or animals, including samples from healthy or superficially healthy human subjects or human patients affected by a condition or disease to be diagnosed or investigated, e.g. infection by a pathogenic microorganism such as a pathogenic bacterium or virus). For example, the biological sample may be a biological fluid obtained from, for example, blood, plasma, serum, urine, feces, sputum, mucus, lymph, synovial fluid, bile, ascites, pleural effusion, seroma, saliva, cerebrospinal fluid, aqueous or vitreous humor, or any bodily secretion, exudate (e.g., obtained from an abscess or any other site of infection or inflammation), or a fluid obtained from a joint (e.g., a normal joint or a joint affected by a disease, such as rheumatoid arthritis, osteoarthritis, gout, or septic arthritis), or a swab of a skin or mucosal surface. The sample may also be a sample obtained from any organ or tissue (including biopsies or autopsy specimens, e.g., tumor biopsies) or may comprise cells (primary cells or cultured cells) or culture medium conditioned by any cell, tissue or organ. Exemplary samples include, but are not limited to, cells, cell lysates, blood smears, cytocentrifuge preparations, cytological smears, bodily fluids (e.g., blood, plasma, serum, saliva, sputum, urine, bronchoalveolar lavage, semen, etc.), tissue biopsies (e.g., tumor biopsies), fine needle aspirates, and/or tissue sections (e.g., cryostat tissue sections and/or paraffin-embedded tissue sections).
In other embodiments, the biological sample may be a plant cell, callus, tissue or organ (e.g., root, stem, leaf, flower, seed, fruit), and the like.
In the present invention, the target nucleic acid also includes a DNA molecule formed by reverse transcription of RNA, and further, the target nucleic acid can be amplified by a technique known in the art, such as isothermal amplification techniques, such as nucleic acid sequencing-based amplification (NASBA), Recombinase Polymerase Amplification (RPA), loop-mediated isothermal amplification (LAMP), Strand Displacement Amplification (SDA), helicase-dependent amplification (HDA), or Nicking Enzyme Amplification (NEAR), and non-isothermal amplification techniques. In certain exemplary embodiments, non-isothermal amplification methods may be used, including, but not limited to, PCR, Multiple Displacement Amplification (MDA), Rolling Circle Amplification (RCA), Ligase Chain Reaction (LCR), or derivative amplification methods (RAM).
Further, the detection method of the present invention further comprises a step of amplifying the target nucleic acid; the detection system further comprises a reagent for amplifying the target nucleic acid. The reagents for amplification include one or more of the following: DNA polymerase, strand displacing enzyme, helicase, recombinase, single-strand binding protein, and the like.
CRISPR
As used herein, the "CRISPR" refers to Clustered, regularly interspaced short palindromic repeats (Clustered regular interspersed short palindromic repeats) derived from the immune system of a microorganism.
Cas protein
As used herein, "Cas protein" refers to a CRISPR-associated protein, preferably from type V or type VI CRISPR/Cas protein (CRISPR/Cas effector protein), which upon binding (i.e., forming a ternary complex of Cas protein-gRNA-target sequence) to a signature (target sequence) to be detected, can induce its trans activity, i.e., random cleavage of non-targeted single-stranded nucleotides (i.e., the single-stranded nucleic acid detector described herein). When the Cas protein is combined with the characteristic sequence, the protein can induce the trans activity by cutting or not cutting the characteristic sequence; preferably, it induces its trans activity by cleaving the signature sequence; more preferably, it induces its trans activity by cleaving the single-stranded signature sequence. The Cas protein recognizes the characteristic sequence by recognizing PAM (protospacer adjacenttoment motif) adjacent to the characteristic sequence.
The Cas protein is a protein at least having trans cleavage activity, and preferably, the Cas protein is a protein having Cis and trans cleavage activity. The Cis activity refers to the activity that the Cas protein can recognize a PAM site and specifically cut a target sequence under the action of the gRNA.
The Cas protein provided by the invention comprises V-type CRISPR/CAS effector proteins, including protein families such as Cas12 and Cas 14. Preferably, e.g., Cas12 proteins, e.g., Cas12a, Cas12 b, Cas12 j; preferably, the Cas protein is Cas12 j.
In embodiments, a Cas protein, as referred to herein, such as Cas12, also encompasses a functional variant of Cas or a homolog or ortholog thereof. As used herein, a "functional variant" of a protein refers to a variant of such a protein that at least partially retains the activity of the protein. Functional variants may include mutants (which may be insertion, deletion or substitution mutants), including polymorphs and the like. Also included in functional variants are fusion products of such proteins with another, usually unrelated, nucleic acid, protein, polypeptide or peptide. Functional variants may be naturally occurring or may be artificial. Advantageous embodiments may relate to engineered or non-naturally occurring V-type DNA targeting effector proteins.
In one embodiment, one or more nucleic acid molecules encoding a Cas protein, such as Cas12, or orthologs or homologs thereof, may be codon optimized for expression in a eukaryotic cell. Eukaryotes can be as described herein. One or more nucleic acid molecules may be engineered or non-naturally occurring.
In one embodiment, the Cas12 protein or ortholog or homolog thereof may comprise one or more mutations (and thus the nucleic acid molecule encoding it may have one or more mutations.
In one embodiment, the Cas protein may be from: cilium, listeria, corynebacterium, satrapia, legionella, treponema, Proteus, eubacterium, Streptococcus, Lactobacillus, Mycoplasma, Bacteroides, Flavivivola, Flavobacterium, Azospirillum, Sphaerochaeta, gluconacetobacter, Neisseria, Rochelia, Parvibaculum, Staphylococcus, Nitrarefactor, Mycoplasma, Campylobacter, and Muspirillum.
In one embodiment, the Cas protein is selected from the group consisting of proteins consisting of:
(1) a protein shown as SEQ ID No. 1;
(2) derived proteins which are formed by substituting, deleting or adding one or more (such as 2, 3, 4, 5, 6, 7, 8, 9 or 10) amino acid residues in the amino acid sequence shown in SEQ ID No.1 or active fragments thereof and have basically the same functions.
In one embodiment, the Cas protein further includes proteins having 50%, preferably 55%, preferably 60%, preferably 65%, preferably 70%, preferably 75%, preferably 80%, preferably 85%, preferably 90%, preferably 99%, sequence identity (homology) to the above sequences and having trans activity.
The Cas protein can be obtained by recombinant expression vector technology, namely, a nucleic acid molecule encoding the protein is constructed on a proper vector and then is transformed into a host cell, so that the encoding nucleic acid molecule is expressed in the cell, and the corresponding protein is obtained. The protein can be secreted by cells, or the protein can be obtained by breaking cells through a conventional extraction technology. The encoding nucleic acid molecule may or may not be integrated into the genome of the host cell for expression. The vector may further comprise regulatory elements which facilitate sequence integration, or self-replication. The vector may be, for example, of the plasmid, virus, cosmid, phage, etc. type, which are well known to those skilled in the art, and preferably, the expression vector of the present invention is a plasmid. The vector further comprises one or more regulatory elements selected from the group consisting of promoters, enhancers, ribosome binding sites for translation initiation, terminators, polyadenylation sequences, and selectable marker genes.
The host cell may be a prokaryotic cell, such as E.coli, Streptomyces, Agrobacterium: or lower eukaryotic cells, such as yeast cells; or higher eukaryotic cells, such as plant cells. It will be clear to one of ordinary skill in the art how to select an appropriate vector and host cell.
gRNA
As used herein, the "gRNA" is also referred to as guide RNA or guide RNA and has a meaning commonly understood by those skilled in the art. In general, the guide RNA may comprise, or consist essentially of, a direct repeat and a guide sequence (guide sequence). grnas may include crRNA and tracrRNA or only crRNA depending on Cas protein on which they depend in different CRISPR systems. The crRNA and tracrRNA may be artificially engineered to fuse to form single guide RNA (sgRNA). In certain cases, the guide sequence is any polynucleotide sequence that is sufficiently complementary to the target sequence (the signature sequence described in the present invention) to hybridize with and guide specific binding of the CRISPR/Cas complex to the target sequence, typically having a sequence length of 12-25nt, and in preferred embodiments, the guide sequence has a sequence length of 13-20nt, e.g., 14nt, 15nt, 16nt, 17nt, 18nt, 19nt, or 20 nt. The direct repeat sequence can fold to form a specific structure (such as a stem-loop structure) for recognition by the Cas protein to form a complex. The targeting sequence need not be 100% complementary to the signature sequence (target sequence). The targeting sequence is not complementary to the single stranded nucleic acid detector.
In certain embodiments, the degree of complementarity (degree of match) between a targeting sequence and its corresponding target sequence is at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 99%, when optimally aligned. Determining the optimal alignment is within the ability of one of ordinary skill in the art. For example, there are published and commercially available alignment algorithms and programs such as, but not limited to, ClustalW, the Smith-Waterman algorithm in matlab (Smith-Waterman), Bowtie, Geneius, Biopython, and SeqMan.
The gRNA of the invention can be natural, and can also be artificially modified or designed and synthesized.
Sequence information
SEQ ID No.1 | Details of | Type (B) |
1 | Cas12j | Protein |
Drawings
FIG. 1 shows the results of detection of each site of the gRNA-targeted region in the target nucleic acid by mutation in sequence when the target gene is OsTGW6 and the gRNA is 16bp in length.
FIG. 2 shows the results of detection of each site of the gRNA-targeted region in the target nucleic acid by mutation in sequence when the target gene is OsTGW6 and the gRNA is 18bp in length.
FIG. 3 shows the results of detection of each site of the gRNA-targeted region in the target nucleic acid by mutation in sequence when the target gene is OsTGW6 and the gRNA is 20bp in length.
FIG. 4 shows the detection results of sequential mutations at each site of the gRNA-targeted region on the target nucleic acid when the target gene is CV19 and the gRNA length is 16 bp.
FIG. 5 shows the results of detection of each site of the gRNA-targeted region in the target nucleic acid by mutation in sequence when the target gene is OsTGW6 and the gRNA is 16bp in length.
FIG. 6 shows the results of detection of each site of the gRNA-targeted region in the target nucleic acid by mutation in sequence when the target gene is OsTGW6 and the gRNA is 18bp in length.
FIG. 7 shows the results of detection of each site of the gRNA-targeted region in the target nucleic acid by mutation in sequence when the target gene is OsTGW6 and the gRNA is 20bp in length.
FIG. 8 shows the detection results of sequential mutations at each site of the gRNA-targeted region on the target nucleic acid when the target gene is CV19 and the gRNA length is 16 bp.
FIG. 9 shows the results of detection of each site in the region targeted by the gRNA in the target nucleic acid by mutation in sequence when the target gene is Ngene and the gRNA has a length of 16 bp.
FIG. 10 shows that when OsTGW6 is detected by using 14bp or 16bp gRNAs, the two lengths of gRNAs have little influence on the detection result; when 14bp or 16bp gRNA is used for detecting CV19, the detection result of 14bp gRNA is better, and the difference between a wild type and a mutant type is more obvious.
FIG. 11 shows that in the single-base substitution mutation, no matter what base the mutation is, the detection result is not affected by either.
FIG. 12 shows the results of detection of deletion mutations sequentially in the region targeted by the gRNA in the target nucleic acid when the gRNA has a length of 14bp and the target gene is Ngene.
FIG. 13 shows the results of detection of insertion mutations sequentially in the region targeted by the gRNA in the target nucleic acid when the gRNA has a length of 14bp and the target gene is Ngene.
FIG. 14 shows the results of detection of deletion mutations sequentially placed in the region targeted by the gRNA on the target nucleic acid when the target gene is OsTGW6 and the gRNA is 14bp in length.
FIG. 15 shows the results of detection of insertion mutations sequentially placed in the region targeted by the gRNA on the target nucleic acid when the target gene is OsTGW6 and the gRNA is 14bp in length.
Fig. 16 shows the detection results of deletion mutations sequentially provided in the region targeted by the gRNA on the target nucleic acid when the target gene is CV19 and the gRNA is 14bp in length.
Fig. 17 shows the detection results of insertion mutations sequentially placed in the region targeted by the gRNA on the target nucleic acid when the target gene is CV19 and the gRNA is 14bp in length.
FIG. 18 shows the detection efficiency of the mutant base at a base (position 2 or position 6) of the gRNA targeting sequence that is not paired with the target mutation site when the target gene is OsTGW6 and the gRNA is 14bp in length.
FIG. 19 shows the detection efficiency of a mutant base at a base (position 2 or position 6) of a gRNA targeting sequence that does not pair with the target mutation site when the gRNA is 14bp in length because the target gene is Ngene.
FIG. 20 shows the detection efficiency of the mutant base at a base (position 2 or position 6) of the gRNA targeting sequence that is not paired with the target mutation site when the target gene is CV19 and the gRNA length is 14 bp.
Detailed description of the preferred embodiments
The present invention will be further described with reference to the following examples, which are intended to be illustrative only and not to be limiting of the invention in any way, and any person skilled in the art can modify the present invention by applying the teachings disclosed above and applying them to equivalent embodiments with equivalent modifications. Any simple modification or equivalent changes made to the following embodiments according to the technical essence of the present invention, without departing from the technical spirit of the present invention, fall within the scope of the present invention.
The technical scheme of the invention is based on the following principle, the nucleic acid of a sample to be detected is obtained, for example, a target nucleic acid can be obtained by an amplification method, and the gRNA which can be paired with the target nucleic acid is used for guiding the Cas protein to be identified and combined on the target nucleic acid; subsequently, the Cas protein activates the cleavage activity of the single-stranded nucleic acid detector, thereby cleaving the single-stranded nucleic acid detector in the system; the two ends of the single-stranded nucleic acid detector are respectively provided with a fluorescent group and a quenching group, and if the single-stranded nucleic acid detector is cut, fluorescence can be excited; if the single-stranded nucleic acid detector cannot be cleaved, fluorescence is not excited; in other embodiments, both ends of the single-stranded nucleic acid detector may be provided with a label capable of being detected by colloidal gold.
Example 1 test of detection efficiency by setting different mutations in the region targeted by gRNA using double-stranded DNA as the target nucleic acid
Synthesizing OT-ssDNA (primers) containing different OsTGW6 gene mutation sites and complementary chains thereof, annealing, performing T-Blunt connection, selecting a single clone for correct sequencing, performing plasmid extraction, and performing experimental addition according to the plasmid concentration to enable the final concentration to reach 5 nM; CV19-Lamb-j19g1-16bp corresponding gene is Orflab-A (the gene is constructed on a vector), OT-ssDNA primers and T7 primers containing different mutation sites are respectively utilized to amplify plasmid Orflab-A, then PCR products are recovered and tested, and test addition is carried out according to the concentration of the PCR products, so that the final concentration is controlled at 10 nM. The final concentration of Cas12j19 is 50nM, the final concentration of gRNA is 50nM, and the concentration of Reporter-FB-T is 200nM, when different target sequences are verified and detected, gRNAs with different lengths are used for detecting the influence of different positions of mutation sites on a gRNA target nucleic acid region on detection results, namely the influence of detection on the in vitro trans activity of Cas12j 19.
TABLE 1 Experimental arrangement when the target nucleic acid is double-stranded DNA
The region sequence of the gRNA combined with the Cas protein is GUGCUGCUGUCUCCCAGACGGGAGGCAGAACUGCAC, and the guide sequence is positioned at the 3' end of the sequence; counting the differential sites from one end near the PAM sequence (namely the 5 'end of the guide sequence on the gRNA, namely the 5' end of the Spacer), and sequentially carrying out single base mutation; valid means that the difference is significantly different between the detection of the presence and absence of the difference.
As shown in fig. 1, when the target gene is OsTGW6 and the gRNA is 16bp in length, the 1 st (the 1 st base at the 5 ' end of the gRNA is different from the target nucleic acid, i.e., the 1 st base near the PAM end in the region targeted by the gRNA for the target nucleic acid, and the 2 nd base at the 5 ' end of the gRNA means that the 2 nd base at the 5 ' end of the gRNA is different from the target nucleic acid, i.e., the 2 nd base near the PAM end in the region targeted by the gRNA for the target nucleic acid, and so on) and the 16 th site are different from the target nucleic acid (i.e., when detecting SNP, mutation exists between the site and the wild-type gene or SNP exists at the site) both have a significant effect on the detection result (reduction of the fluorescence signal). This indicates that any mutation at the position targeted by the gRNA can be clearly observed when detecting the mutation in this sequence.
As shown in FIG. 2, when the target gene is OsTGW6 and the gRNA length is 18bp, any difference (mutation) between No.1 and No. 3, No. 7 and No. 14 or No. 16 can bring obvious influence (fluorescence signal reduction) on the detection result. Thus, when the mutation on the sequence is detected, the mutation of at least any one of No. 1-3, No. 7-14 or No. 16 of the 5' end of the gRNA target position can be obviously observed.
As shown in FIG. 3, when the target gene is CV19 and the gRNA length is 20bp, the difference in at least any one of the genes 1-5 or 7-10 significantly affects the detection result (the fluorescence signal is decreased). Thus, when detecting the mutation in the sequence, the mutation at the 1 st-3 rd, 7 th-14 th or 16 th position of the 5' end of the gRNA target position can be obviously observed.
As shown in FIG. 4, when the target gene is OsTGW6 and the gRNA length is 16bp, the difference mutation between at least any of the 5-15 positions and the target nucleic acid will have a significant effect on the detection result (decrease of fluorescence signal). Thus, when detecting the mutation on the sequence, the 5-15 position mutation of the 5' end of the gRNA target position can be obviously observed.
Example 2 testing the efficiency of detection by setting different mutations in the region targeted by gRNA using single-stranded DNA as the target nucleic acid
Synthesizing an OT-ssDNA primer as a target nucleic acid with the final concentration of 50nM, verifying and detecting the influence of gRNAs with different lengths on detecting templates (target nucleic acids) containing different mutation sites under the conditions that the final concentration of Cas12j19 is 50nM, the final concentration of gRNAs is 50nM, and the concentration of Reporter-FB-T is 200nM, namely detecting the influence on the in vitro trans activity of Cas12j 19.
TABLE 2 Experimental arrangement when the target nucleic acid is double-stranded DNA
The region sequence of the gRNA combined with the Cas protein is GUGCUGCUGUCUCCCAGACGGGAGGCAGAACUGCAC, and the guide sequence is positioned at the 3' end of the sequence; differential sites were counted from the 5' end of the gRNA; effective means that the difference is significantly different between the presence and absence of the difference.
As shown in FIG. 5, the target gene is OsTGW6, and when the gRNA length is 16bp, any one of positions 9-13 has a significant influence on the detection result (the fluorescence signal is reduced). This indicates that when mutations are detected in this sequence, mutations in at least any one of positions 9-13 of the 5' end of the gRNA target can be clearly observed.
As shown in FIG. 6, when the target gene is OsTGW6 and the gRNA length is 18bp, the difference between 9-10 or 16-17 will have a significant effect on the detection result (the fluorescence signal is reduced). Thus, when the mutation on the sequence is detected, the mutation of at least any one of positions 9-10 or 16-17 at the 5' end of the gRNA target position can be obviously observed.
As shown in FIG. 7, when the target gene is CV19 and the gRNA length is 20bp, any one of differences No. 6, No. 9-10, No. 12 or No. 14-19 will have a significant effect on the detection result (decrease in fluorescence signal). This indicates that when mutations are detected in this sequence, mutations in at least any one of positions 6, 9-10, 12, or 14-19 at the 5' end of the gRNA target can be clearly observed.
As shown in FIG. 8, when the target gene is CV19 and the gRNA length is 16bp, the difference between at least any one of the target nucleic acids and 7-15 has a significant effect on the detection result (decrease in fluorescence signal). Thus, when the mutation on the sequence is detected, the mutation at any position 7-15 of the 5' end of the gRNA target position can be obviously observed.
As shown in FIG. 9, when the target gene is Ngene and the gRNA length is 16bp, the difference between any of the 6-15 positions and the target nucleic acid will have a significant effect on the detection result (decrease in fluorescence signal). Thus, when the mutation on the sequence is detected, the mutation at any position 6-15 of the 5' end of the gRNA target position can be obviously observed.
Example 3 Using a single-stranded DNA as a target nucleic acid, different mutations were placed in the region targeted by the gRNA, and a gRNA of 14bp in length had higher detection efficiency
According to the reaction systems of examples 1 and 2, two genes were selected in the arrangement shown in Table 3 to verify the effect of the position difference at the 9 th position of the 5' end on gRNAs with different lengths on the detection effect.
TABLE 3 Experimental arrangement for different lengths of gRNA when the target nucleic acid is single-stranded DNA
The region sequence of the gRNA combined with the Cas protein is GUGCUGCUGUCUCCCAGACGGGAGGCAGAACUGCAC, and the guide sequence is positioned at the 3' end of the sequence; differential sites were counted from the 5' end of the gRNA; effective means that the difference is significantly different between the presence and absence of the difference.
As shown in FIG. 10, when OsTGW6 was detected using 14bp or 16bp gRNAs, the two lengths of gRNAs had little effect on the detection results; when 14bp or 16bp gRNA is used for detecting CV19, the detection result of 14bp gRNA is better, and the difference between a wild type and a mutant type is more obvious.
Example 4 verification of detection efficiency by mutating the 9 th position of the region targeted by gRNA to a different base using single-stranded DNA as the target nucleic acid
Grnas as shown in table 4 were synthesized, and the 9 th position set in the region targeted by the grnas was mutated into different bases in order, and whether or not the different base mutations had an influence on the detection results was examined.
TABLE 4 gRNA names, sequences and experimental results
The region sequence of the gRNA combined with the Cas protein is GUGCUGCUGUCUCCCAGACGGGAGGCAGAACUGCAC, and the guide sequence is positioned at the 3' end of the sequence; effective in the table means that mutations to such bases can be detected.
As a result, as shown in FIG. 11, in the single-base substitution mutation, the detection result was not affected regardless of the base to which the mutation was made.
Example 5 verification of detection efficiency by setting different insertion or deletion mutations in the region targeted by gRNA using single-stranded DNA as target nucleic acid
Three gRNAs shown in Table 5 are synthesized, respectively target different target nucleic acids, and mutations with insertions or deletions at different positions are sequentially designed in the region targeted by the gRNAs, so that whether the method can detect the mutations with insertions or deletions at different positions in the region targeted by the gRNAs is verified.
TABLE 5 gRNA names, sequences and experimental results
The region sequence of the gRNA combined with the Cas protein is GUGCUGCUGUCUCCCAGACGGGAGGCAGAACUGCAC, and the guide sequence is positioned at the 3' end of the sequence; sequentially setting insertion or deletion of a base at the 1 st to 14 th/1 st to 16 th positions of the 5' end of the guide sequence; insertion of a base means insertion of a base at the 5 'end of the position, for example, insertion at position 1 means insertion of a base at the 5' end of position 1; deletion of a single base means deletion of a single base at that position, for example, deletion at position 1 means deletion of a base at position 1, and counting is as before.
Example 6 setting of a mutated base at a base (position 2 or 6) of the gRNA targeting sequence that is not paired with the target mutation site to verify detection efficiency
According to the grnas of table 6, a mutation is provided at position 9 in a region targeted by the gRNA, and a deliberate mutation is designed at position other than position 9 (position 2 or position 9), and the detection efficiency of a mutation designed at a base not paired with the target mutation site is detected.
The experimental results are shown in fig. 18-20, mutations are designed at positions on the gRNA corresponding to non-target mutations, and the detection efficiency is not affected.
TABLE 6 gRNA names, sequences and experimental results
TABLE 7 target nucleic acid name, sequence
Name (R) | Sequence of |
12j19-ostgw6-3-ssdna9-insert-2a | CCCCGCCTTTTGGACCAACTCGCtATCAATACCATGTAGGCGTCGGCGATG |
12j19-ostgw6-3-ssdna9-insert-6a | CCCCGCCTTTTGGACCAACTCGCtATAAATCCCATGTAGGCGTCGGCGATG |
12j19-ostgw6-3-ssdna2a | CCCCGCCTTTTGGACCAACTCGCATCAATACCATGTAGGCGTCGGCGATG |
12j19-ostgw6-3-ssdna6a | CCCCGCCTTTTGGACCAACTCGCATAAATCCCATGTAGGCGTCGGCGATG |
n-b-12j19g1-ssdna89 10-insert2a | CCCAGCGCTTCAGCGTTCTTCGGAaATGTCGAGCATTGGCATGGAAGTCACAC |
n-b-12j19g1-ssdna89 10-insert6a | CCCAGCGCTTCAGCGTTCTTCGGAaATATCGCGCATTGGCATGGAAGTCACAC |
n-b-j19g1-ssdna2a | CCCAGCGCTTCAGCGTTCTTCGGAATGTCGAGCATTGGCATGGAAGTCACAC |
n-b-j19g1-ssdna6a | CCCAGCGCTTCAGCGTTCTTCGGAATATCGCGCATTGGCATGGAAGTCACAC |
cv-j19g1-ssdna0 | GGCACCAAATTCCAAAGGTTTACCTTGGTAATCATCTTCAGTACCATACTCATATTGAG |
cv-j19g1-ssdna789-insert | GGCACCAAATTCCAAAGGTTTACCTTGGTAATCATCtTTCAGTACCATACTCATATTGAG |
cv-j19g1-ssdna789-insert2c | GGCACCAAATTCCAAAGGTTTACCTTGGTAATCATCtTTCAGTcCCATACTCATATTGAG |
cv-j19g1-ssdna789-insert6a | GGCACCAAATTCCAAAGGTTTACCTTGGTAATCATCtTTaAGTACCATACTCATATTGAG |
cv-j19g1-ssdna2c | GGCACCAAATTCCAAAGGTTTACCTTGGTAATCATCTTCAGTCCCATACTCATATTGAG |
cv-j19g1-ssdna6a | GGCACCAAATTCCAAAGGTTTACCTTGGTAATCATCTTAAGTACCATACTCATATTGAG |
The results of the above examples demonstrate that two nucleic acid sequences having at least one different base (single base substitution, insertion or deletion) can be rapidly detected and distinguished by using the designed gRNA, and the wild type and the mutant type can be respectively identified without performing amplification and sequencing, thereby providing a faster, convenient and accurate method for rapidly classifying target nucleic acids.
Sequence listing
<110> Shunheng Biotech Co., Ltd
<120> method for detecting target mutation by using Cas12j effector protein
<130> JH-CNP202142DJ
<160> 1
<170> PatentIn version 3.5
<210> 1
<211> 908
<212> PRT
<213> Artificial Sequence
<220>
<223> Cas12 j
<400> 1
Met Pro Ser Tyr Lys Ser Ser Arg Val Leu Val Arg Asp Val Pro Glu
1 5 10 15
Glu Leu Val Asp His Tyr Glu Arg Ser His Arg Val Ala Ala Phe Phe
20 25 30
Met Arg Leu Leu Leu Ala Met Arg Arg Glu Pro Tyr Ser Leu Arg Met
35 40 45
Arg Asp Gly Thr Glu Arg Glu Val Asp Leu Asp Glu Thr Asp Asp Phe
50 55 60
Leu Arg Ser Ala Gly Cys Glu Glu Pro Asp Ala Val Ser Asp Asp Leu
65 70 75 80
Arg Ser Phe Ala Leu Ala Val Leu His Gln Asp Asn Pro Lys Lys Arg
85 90 95
Ala Phe Leu Glu Ser Glu Asn Cys Val Ser Ile Leu Cys Leu Glu Lys
100 105 110
Ser Ala Ser Gly Thr Arg Tyr Tyr Lys Arg Pro Gly Tyr Gln Leu Leu
115 120 125
Lys Lys Ala Ile Glu Glu Glu Trp Gly Trp Asp Lys Phe Glu Ala Ser
130 135 140
Leu Leu Asp Glu Arg Thr Gly Glu Val Ala Glu Lys Phe Ala Ala Leu
145 150 155 160
Ser Met Glu Asp Trp Arg Arg Phe Phe Ala Ala Arg Asp Pro Asp Asp
165 170 175
Leu Gly Arg Glu Leu Leu Lys Thr Asp Thr Arg Glu Gly Met Ala Ala
180 185 190
Ala Leu Arg Leu Arg Glu Arg Gly Val Phe Pro Val Ser Val Pro Glu
195 200 205
His Leu Asp Leu Asp Ser Leu Lys Ala Ala Met Ala Ser Ala Ala Glu
210 215 220
Arg Leu Lys Ser Trp Leu Ala Cys Asn Gln Arg Ala Val Asp Glu Lys
225 230 235 240
Ser Glu Leu Arg Lys Arg Phe Glu Glu Ala Leu Asp Gly Val Asp Pro
245 250 255
Glu Lys Tyr Ala Leu Phe Glu Lys Phe Ala Ala Glu Leu Gln Gln Ala
260 265 270
Asp Tyr Asn Val Thr Lys Lys Leu Val Leu Ala Val Ser Ala Lys Phe
275 280 285
Pro Ala Thr Glu Pro Ser Glu Phe Lys Arg Gly Val Glu Ile Leu Lys
290 295 300
Glu Asp Gly Tyr Lys Pro Leu Trp Glu Asp Phe Arg Glu Leu Gly Phe
305 310 315 320
Val Tyr Leu Ala Glu Arg Lys Trp Glu Arg Arg Arg Gly Gly Ala Ala
325 330 335
Val Thr Leu Cys Asp Ala Asp Asp Ser Pro Ile Lys Val Arg Phe Gly
340 345 350
Leu Thr Gly Arg Gly Arg Lys Phe Val Leu Ser Ala Ala Gly Ser Arg
355 360 365
Phe Leu Ile Thr Val Lys Leu Pro Cys Gly Asp Val Gly Leu Thr Ala
370 375 380
Val Pro Ser Arg Tyr Phe Trp Asn Pro Ser Val Gly Arg Thr Thr Ser
385 390 395 400
Asn Ser Phe Arg Ile Glu Phe Thr Lys Arg Thr Thr Glu Asn Arg Arg
405 410 415
Tyr Val Gly Glu Val Lys Glu Ile Gly Leu Val Arg Gln Arg Gly Arg
420 425 430
Tyr Tyr Phe Phe Ile Asp Tyr Asn Phe Asp Pro Glu Glu Val Ser Asp
435 440 445
Glu Thr Lys Val Gly Arg Ala Phe Phe Arg Ala Pro Leu Asn Glu Ser
450 455 460
Arg Pro Lys Pro Lys Asp Lys Leu Thr Val Met Gly Ile Asp Leu Gly
465 470 475 480
Ile Asn Pro Ala Phe Ala Phe Ala Val Cys Thr Leu Gly Glu Cys Gln
485 490 495
Asp Gly Ile Arg Ser Pro Val Ala Lys Met Glu Asp Val Ser Phe Asp
500 505 510
Ser Thr Gly Leu Arg Gly Gly Ile Gly Ser Gln Lys Leu His Arg Glu
515 520 525
Met His Asn Leu Ser Asp Arg Cys Phe Tyr Gly Ala Arg Tyr Ile Arg
530 535 540
Leu Ser Lys Lys Leu Arg Asp Arg Gly Ala Leu Asn Asp Ile Glu Ala
545 550 555 560
Arg Leu Leu Glu Glu Lys Tyr Ile Pro Gly Phe Arg Ile Val His Ile
565 570 575
Glu Asp Ala Asp Glu Arg Arg Arg Thr Val Gly Arg Thr Val Lys Glu
580 585 590
Ile Lys Gln Glu Tyr Lys Arg Ile Arg His Gln Phe Tyr Leu Arg Tyr
595 600 605
His Thr Ser Lys Arg Asp Arg Thr Glu Leu Ile Ser Ala Glu Tyr Phe
610 615 620
Arg Met Leu Phe Leu Val Lys Asn Leu Arg Asn Leu Leu Lys Ser Trp
625 630 635 640
Asn Arg Tyr His Trp Thr Thr Gly Asp Arg Glu Arg Arg Gly Gly Asn
645 650 655
Pro Asp Glu Leu Lys Ser Tyr Val Arg Tyr Tyr Asn Asn Leu Arg Met
660 665 670
Asp Thr Leu Lys Lys Leu Thr Cys Ala Ile Val Arg Thr Ala Lys Glu
675 680 685
His Gly Ala Thr Leu Val Ala Met Glu Asn Ile Gln Arg Val Asp Arg
690 695 700
Asp Asp Glu Val Lys Arg Arg Lys Glu Asn Ser Leu Leu Ser Leu Trp
705 710 715 720
Ala Pro Gly Met Val Leu Glu Arg Val Glu Gln Glu Leu Lys Asn Glu
725 730 735
Gly Ile Leu Ala Trp Glu Val Asp Pro Arg His Thr Ser Gln Thr Ser
740 745 750
Cys Ile Thr Asp Glu Phe Gly Tyr Arg Ser Leu Val Ala Lys Asp Thr
755 760 765
Phe Tyr Phe Glu Gln Asp Arg Lys Ile His Arg Ile Asp Ala Asp Val
770 775 780
Asn Ala Ala Ile Asn Ile Ala Arg Arg Phe Leu Thr Arg Tyr Arg Ser
785 790 795 800
Leu Thr Gln Leu Trp Ala Ser Leu Leu Asp Asp Gly Arg Tyr Leu Val
805 810 815
Asn Val Thr Arg Gln His Glu Arg Ala Tyr Leu Glu Leu Gln Thr Gly
820 825 830
Ala Pro Ala Ala Thr Leu Asn Pro Thr Ala Glu Ala Ser Tyr Glu Leu
835 840 845
Val Gly Leu Ser Pro Glu Glu Glu Glu Leu Ala Gln Thr Arg Ile Lys
850 855 860
Arg Lys Lys Arg Glu Pro Phe Tyr Arg His Glu Gly Val Trp Leu Thr
865 870 875 880
Arg Glu Lys His Arg Glu Gln Val His Glu Leu Arg Asn Gln Val Leu
885 890 895
Ala Leu Gly Asn Ala Lys Ile Pro Glu Ile Arg Thr
900 905
Claims (10)
1. A method of detecting the presence or absence of a mutation site of interest in a target nucleic acid using a Cas12j effector protein, the method comprising contacting the target nucleic acid with a type V CRISPR/Cas effector protein, a gRNA (guide RNA) comprising a region that binds to the CRISPR/Cas effector protein and a guide sequence that hybridizes to a mutant target nucleic acid containing a mutation of interest, the guide sequence comprising a base that pairs with the mutation site of interest; detecting a detectable signal generated by the CRISPR/CAS effector protein cleavage single-stranded nucleic acid detector;
the base matched with the target mutation site is arranged at one or more positions from 1 st to 20 th at the 5' end of the gRNA guide sequence;
preferably, the base pairing with the target mutation site is provided at one or more of positions 7 to 16 of the 5' end of the gRNA guide sequence;
more preferably, the base pairing with the mutation site of interest is located at one or more of positions 9 to 10 of the 5' end of the gRNA guide sequence.
2. A method of detecting the presence or absence of a mutation in a target region using a Cas12j effector protein, the method comprising contacting a sample with a type V CRISPR/Cas effector protein, a gRNA (guide RNA) comprising a region that binds to the CRISPR/Cas effector protein and a guide sequence that hybridizes to a wild-type target nucleic acid, and a single-stranded nucleic acid detector; detecting a detectable signal generated by the CRISPR/CAS effector protein cleavage single-stranded nucleic acid detector;
the target region refers to the position of the target nucleic acid targeted from 1 st to 20 th positions at the 5' end of the guide sequence of the gRNA; preferably, the target region refers to the position of the target nucleic acid targeted from position 7 to position 16 at the 5' end of the guide sequence of the gRNA; more preferably, the target region refers to the position of the target nucleic acid targeted from positions 9 to 10 at the 5' end of the gRNA targeting sequence.
3. The method of claim 1 or 2, wherein the detectable signal is detected by: vision-based detection, sensor-based detection, color detection, gold nanoparticle-based detection, fluorescence polarization, fluorescence signal-based detection, colloidal phase transition/dispersion, electrochemical detection, and semiconductor-based detection.
4. The method of claims 1-3, wherein the target nucleic acid comprises ribonucleotides or deoxyribonucleotides; preferably, it includes single-stranded nucleic acids, double-stranded nucleic acids, for example, single-stranded DNA, double-stranded DNA, single-stranded RNA.
5. The method according to claims 1 to 4, wherein the type V CRISPR/CAS effector protein is a Cas12j protein, preferably the type V CRISPR/CAS effector protein comprises the sequence of SEQ ID No.1, or the type V CRISPR/CAS effector protein is a derivative protein formed by the substitution, deletion or addition of one or more (such as 2, 3, 4, 5, 6, 7, 8, 9 or 10) amino acid residues of the amino acid sequence shown in SEQ ID No.1 and having substantially the same function; or the V-type CRISPR/CAS effector protein is a protein which has 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% and 99% of identity with the sequence shown in SEQ ID No.1, and preferably the amino acid sequence of the V-type CRISPR/CAS effector protein is shown in SEQ ID No. 1.
6. A reagent, kit or composition for detecting the presence or absence of a target mutation site in a target nucleic acid, the reagent, kit or composition comprising a CRISPR/CAS effector protein type V according to the method of claim 1, a gRNA (guide RNA) comprising a region that binds to the CRISPR/CAS effector protein and a guide sequence that hybridizes to a mutant target nucleic acid containing a target mutation, the guide sequence comprising a base that pairs with the target mutation site, and a single-stranded nucleic acid detector.
7. A reagent, kit or composition for detecting the presence or absence of a mutation in a target nucleic acid at a target region, the reagent, kit or composition comprising a type V CRISPR/CAS effector protein, a gRNA (guide RNA) comprising a region that binds to the CRISPR/CAS effector protein and a targeting sequence that hybridizes to a wild-type target nucleic acid in the method of claim 2, the target region being the position of the target nucleic acid targeted at positions 1 to 20 from the 10 th position of the 5 ' end of the gRNA targeting sequence, preferably, the target region being the position of the target nucleic acid targeted at positions 7 to 16 from the 5 ' end of the gRNA targeting sequence, more preferably, the target region being the position of the target nucleic acid targeted at positions 9 to 10 from the 5 ' end of the gRNA targeting sequence.
8. Use of the reagent, kit or composition of claim 6 or 7 for detecting the presence of a mutation site of interest in a target nucleic acid, or for detecting the presence of a mutation in a region of interest in a target nucleic acid.
9. Use of the type V CRISPR/CAS effector protein, gRNA (guide RNA), and single-stranded nucleic acid detector of any one of claim 1, claim 3, claim 4, or claim 5 in the preparation of a reagent, composition, or kit for detecting the presence or absence of a mutation site of interest in a target nucleic acid.
10. Use of the type V CRISPR/CAS effector protein, gRNA (guide RNA), and single-stranded nucleic acid detector of any one of claim 2, claim 3, claim 4, or claim 5 in the preparation of a reagent, composition, or kit for detecting the presence or absence of a mutation in a target nucleic acid at a target region.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011567811.1A CN113913499A (en) | 2020-12-25 | 2020-12-25 | Method for detecting target mutation by using Cas12j effector protein |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011567811.1A CN113913499A (en) | 2020-12-25 | 2020-12-25 | Method for detecting target mutation by using Cas12j effector protein |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113913499A true CN113913499A (en) | 2022-01-11 |
Family
ID=79232515
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011567811.1A Pending CN113913499A (en) | 2020-12-25 | 2020-12-25 | Method for detecting target mutation by using Cas12j effector protein |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113913499A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114438056A (en) * | 2022-03-03 | 2022-05-06 | 吉林省农业科学院 | CasF2 protein, CRISPR/Cas gene editing system and application thereof in plant gene editing |
WO2024040874A1 (en) * | 2022-08-22 | 2024-02-29 | 山东舜丰生物科技有限公司 | Mutated cas12j protein and use thereof |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109897852A (en) * | 2018-07-25 | 2019-06-18 | 广州普世利华科技有限公司 | The gRNA of tumour related mutation gene based on C2c2, detection method, detection kit |
CN110396543A (en) * | 2019-04-30 | 2019-11-01 | 广州普世利华科技有限公司 | A kind of tumour associated gene mutation site screening method |
CN111508558A (en) * | 2020-03-23 | 2020-08-07 | 广州赛业百沐生物科技有限公司 | Method and system for designing point mutation model based on CRISPR-Cas9 technology |
CN111690720A (en) * | 2020-06-16 | 2020-09-22 | 山东舜丰生物科技有限公司 | Method for detecting target nucleic acid using modified single-stranded nucleic acid |
CN111733216A (en) * | 2020-06-22 | 2020-10-02 | 山东舜丰生物科技有限公司 | Method for improving detection efficiency of target nucleic acid |
CN111996236A (en) * | 2020-05-29 | 2020-11-27 | 山东舜丰生物科技有限公司 | Method for detecting target nucleic acid based on CRISPR technology |
-
2020
- 2020-12-25 CN CN202011567811.1A patent/CN113913499A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109897852A (en) * | 2018-07-25 | 2019-06-18 | 广州普世利华科技有限公司 | The gRNA of tumour related mutation gene based on C2c2, detection method, detection kit |
CN110396543A (en) * | 2019-04-30 | 2019-11-01 | 广州普世利华科技有限公司 | A kind of tumour associated gene mutation site screening method |
CN111508558A (en) * | 2020-03-23 | 2020-08-07 | 广州赛业百沐生物科技有限公司 | Method and system for designing point mutation model based on CRISPR-Cas9 technology |
CN111996236A (en) * | 2020-05-29 | 2020-11-27 | 山东舜丰生物科技有限公司 | Method for detecting target nucleic acid based on CRISPR technology |
CN111690720A (en) * | 2020-06-16 | 2020-09-22 | 山东舜丰生物科技有限公司 | Method for detecting target nucleic acid using modified single-stranded nucleic acid |
CN111733216A (en) * | 2020-06-22 | 2020-10-02 | 山东舜丰生物科技有限公司 | Method for improving detection efficiency of target nucleic acid |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114438056A (en) * | 2022-03-03 | 2022-05-06 | 吉林省农业科学院 | CasF2 protein, CRISPR/Cas gene editing system and application thereof in plant gene editing |
CN114438056B (en) * | 2022-03-03 | 2023-11-21 | 吉林省农业科学院 | CasF2 protein, CRISPR/Cas gene editing system and application thereof in plant gene editing |
WO2024040874A1 (en) * | 2022-08-22 | 2024-02-29 | 山东舜丰生物科技有限公司 | Mutated cas12j protein and use thereof |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112391446B (en) | Method for detecting target nucleic acid based on CRISPR technology | |
CN111690720B (en) | Method for detecting target nucleic acid using modified single-stranded nucleic acid | |
CN112795625B (en) | Method for detecting multiple nucleic acids based on CRISPR technology | |
CN111690773B (en) | Method and system for detecting target nucleic acid by using novel Cas enzyme | |
CN112795624B (en) | Method for detecting target nucleic acid using nucleic acid detector containing abasic spacer | |
CN111733216B (en) | Method for improving detection efficiency of target nucleic acid | |
CN113667718B (en) | Method for detecting target nucleic acid by double-stranded nucleic acid detector | |
CN113913499A (en) | Method for detecting target mutation by using Cas12j effector protein | |
CN111876469B (en) | Method for detecting target nucleic acid by using nucleic acid analogue | |
CN113913498A (en) | Method for detecting target mutation based on CRISPR technology | |
CN114634972B (en) | Method for detecting nucleic acid by using Cas enzyme | |
CN113293198B (en) | Method for performing multiple detection on target nucleic acid based on CRISPR technology | |
CN113234795B (en) | Method for detecting nucleic acid by using Cas protein | |
CN114517224A (en) | Method for detecting nucleic acid by using optimized single-stranded nucleic acid detector | |
WO2021254267A1 (en) | Method for detecting target nucleic acid using nucleic acid analogue or base modification | |
CN113913497A (en) | Method for detecting target nucleic acid using base-modified single-stranded nucleic acid | |
CN114507665B (en) | Method for detecting cucumber green mottle mosaic virus based on CRISPR technology | |
CN115044649A (en) | Improved method for detecting target nucleic acid based on CRISPR technology | |
CN115838784A (en) | Method for detecting gene editing rice based on CRISPR technology | |
CN117587163A (en) | Method for detecting African swine fever by using Cas enzyme | |
CN114058735A (en) | Method for detecting hand-foot-and-mouth disease based on CRISPR technology | |
CN113913429A (en) | Method for detecting sweet potato leaf curl virus based on CRISPR technology | |
CN114480384A (en) | Method for detecting foot-and-mouth disease virus based on CRISPR technology | |
CN116732140A (en) | Nucleic acid detection system and application thereof in detecting DNA mutation | |
CN114457073A (en) | Method for detecting mycobacterium paratuberculosis based on CRISPR technology |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |