CN112301019A - 新型耐高温arCas12a蛋白在核酸检测方面的应用 - Google Patents
新型耐高温arCas12a蛋白在核酸检测方面的应用 Download PDFInfo
- Publication number
- CN112301019A CN112301019A CN202010728504.0A CN202010728504A CN112301019A CN 112301019 A CN112301019 A CN 112301019A CN 202010728504 A CN202010728504 A CN 202010728504A CN 112301019 A CN112301019 A CN 112301019A
- Authority
- CN
- China
- Prior art keywords
- nucleic acid
- arcas12a
- protein
- lys
- detection
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000001514 detection method Methods 0.000 title claims abstract description 132
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 103
- 150000007523 nucleic acids Chemical class 0.000 title claims abstract description 102
- 102000039446 nucleic acids Human genes 0.000 title claims abstract description 87
- 108020004707 nucleic acids Proteins 0.000 title claims abstract description 87
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 84
- 108020005004 Guide RNA Proteins 0.000 claims abstract description 73
- 108020004414 DNA Proteins 0.000 claims abstract description 71
- 108091033409 CRISPR Proteins 0.000 claims abstract description 62
- 238000010354 CRISPR gene editing Methods 0.000 claims abstract description 57
- 102000053602 DNA Human genes 0.000 claims abstract description 37
- 230000000694 effects Effects 0.000 claims abstract description 33
- 238000005520 cutting process Methods 0.000 claims abstract description 19
- 238000006243 chemical reaction Methods 0.000 claims description 41
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 claims description 18
- 238000003776 cleavage reaction Methods 0.000 claims description 17
- 230000007017 scission Effects 0.000 claims description 17
- 238000002360 preparation method Methods 0.000 claims description 15
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 claims description 12
- 238000004587 chromatography analysis Methods 0.000 claims description 10
- 239000002773 nucleotide Substances 0.000 claims description 9
- 125000003729 nucleotide group Chemical group 0.000 claims description 9
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 7
- 239000003153 chemical reaction reagent Substances 0.000 claims description 7
- 229960002685 biotin Drugs 0.000 claims description 6
- 235000020958 biotin Nutrition 0.000 claims description 6
- 239000011616 biotin Substances 0.000 claims description 6
- 101710163270 Nuclease Proteins 0.000 claims description 5
- 239000002245 particle Substances 0.000 claims description 5
- 230000008859 change Effects 0.000 claims description 4
- 238000000338 in vitro Methods 0.000 claims description 4
- 230000007018 DNA scission Effects 0.000 claims description 2
- SHIBSTMRCDJXLN-UHFFFAOYSA-N Digoxigenin Natural products C1CC(C2C(C3(C)CCC(O)CC3CC2)CC2O)(O)C2(C)C1C1=CC(=O)OC1 SHIBSTMRCDJXLN-UHFFFAOYSA-N 0.000 claims description 2
- QONQRTHLHBTMGP-UHFFFAOYSA-N digitoxigenin Natural products CC12CCC(C3(CCC(O)CC3CC3)C)C3C11OC1CC2C1=CC(=O)OC1 QONQRTHLHBTMGP-UHFFFAOYSA-N 0.000 claims description 2
- SHIBSTMRCDJXLN-KCZCNTNESA-N digoxigenin Chemical compound C1([C@@H]2[C@@]3([C@@](CC2)(O)[C@H]2[C@@H]([C@@]4(C)CC[C@H](O)C[C@H]4CC2)C[C@H]3O)C)=CC(=O)OC1 SHIBSTMRCDJXLN-KCZCNTNESA-N 0.000 claims description 2
- 230000001404 mediated effect Effects 0.000 claims description 2
- 238000007794 visualization technique Methods 0.000 claims 2
- 108091034117 Oligonucleotide Proteins 0.000 claims 1
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 238000001727 in vivo Methods 0.000 claims 1
- 108700004991 Cas12a Proteins 0.000 abstract description 20
- 108020004682 Single-Stranded DNA Proteins 0.000 abstract description 20
- 230000035945 sensitivity Effects 0.000 abstract description 11
- 238000011160 research Methods 0.000 abstract description 5
- 238000012986 modification Methods 0.000 abstract description 2
- 230000004048 modification Effects 0.000 abstract description 2
- 238000000034 method Methods 0.000 description 26
- 239000013615 primer Substances 0.000 description 22
- 239000000523 sample Substances 0.000 description 22
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 20
- 239000000047 product Substances 0.000 description 19
- 239000012634 fragment Substances 0.000 description 18
- 238000000746 purification Methods 0.000 description 14
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 12
- 230000003321 amplification Effects 0.000 description 12
- 238000003199 nucleic acid amplification method Methods 0.000 description 12
- 201000010099 disease Diseases 0.000 description 11
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 11
- 230000008685 targeting Effects 0.000 description 11
- 239000011780 sodium chloride Substances 0.000 description 10
- 238000012360 testing method Methods 0.000 description 10
- 239000007983 Tris buffer Substances 0.000 description 8
- 239000000872 buffer Substances 0.000 description 8
- 230000014509 gene expression Effects 0.000 description 8
- 239000013612 plasmid Substances 0.000 description 8
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 8
- 238000012408 PCR amplification Methods 0.000 description 7
- 241000700605 Viruses Species 0.000 description 7
- 238000013461 design Methods 0.000 description 7
- 238000001976 enzyme digestion Methods 0.000 description 7
- 239000007850 fluorescent dye Substances 0.000 description 7
- 238000003908 quality control method Methods 0.000 description 7
- 239000003161 ribonuclease inhibitor Substances 0.000 description 7
- 239000003298 DNA probe Substances 0.000 description 6
- 102000004190 Enzymes Human genes 0.000 description 6
- 108090000790 Enzymes Proteins 0.000 description 6
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 6
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 6
- 108010054155 lysyllysine Proteins 0.000 description 6
- 239000013598 vector Substances 0.000 description 6
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 5
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 5
- 150000001413 amino acids Chemical group 0.000 description 5
- 108010038633 aspartylglutamate Proteins 0.000 description 5
- 229960005156 digoxin Drugs 0.000 description 5
- 239000000758 substrate Substances 0.000 description 5
- 238000011144 upstream manufacturing Methods 0.000 description 5
- 241000894006 Bacteria Species 0.000 description 4
- 241000238557 Decapoda Species 0.000 description 4
- LTMHDMANZUZIPE-AMTYYWEZSA-N Digoxin Natural products O([C@H]1[C@H](C)O[C@H](O[C@@H]2C[C@@H]3[C@@](C)([C@@H]4[C@H]([C@]5(O)[C@](C)([C@H](O)C4)[C@H](C4=CC(=O)OC4)CC5)CC3)CC2)C[C@@H]1O)[C@H]1O[C@H](C)[C@@H](O[C@H]2O[C@@H](C)[C@H](O)[C@@H](O)C2)[C@@H](O)C1 LTMHDMANZUZIPE-AMTYYWEZSA-N 0.000 description 4
- 206010059866 Drug resistance Diseases 0.000 description 4
- 241000196324 Embryophyta Species 0.000 description 4
- 238000007397 LAMP assay Methods 0.000 description 4
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 4
- 238000003556 assay Methods 0.000 description 4
- LTMHDMANZUZIPE-PUGKRICDSA-N digoxin Chemical compound C1[C@H](O)[C@H](O)[C@@H](C)O[C@H]1O[C@@H]1[C@@H](C)O[C@@H](O[C@@H]2[C@H](O[C@@H](O[C@@H]3C[C@@H]4[C@]([C@@H]5[C@H]([C@]6(CC[C@@H]([C@@]6(C)[C@H](O)C5)C=5COC(=O)C=5)O)CC4)(C)CC3)C[C@@H]2O)C)C[C@@H]1O LTMHDMANZUZIPE-PUGKRICDSA-N 0.000 description 4
- LTMHDMANZUZIPE-UHFFFAOYSA-N digoxine Natural products C1C(O)C(O)C(C)OC1OC1C(C)OC(OC2C(OC(OC3CC4C(C5C(C6(CCC(C6(C)C(O)C5)C=5COC(=O)C=5)O)CC4)(C)CC3)CC2O)C)CC1O LTMHDMANZUZIPE-UHFFFAOYSA-N 0.000 description 4
- 238000011901 isothermal amplification Methods 0.000 description 4
- 239000007788 liquid Substances 0.000 description 4
- UEGPKNKPLBYCNK-UHFFFAOYSA-L magnesium acetate Chemical compound [Mg+2].CC([O-])=O.CC([O-])=O UEGPKNKPLBYCNK-UHFFFAOYSA-L 0.000 description 4
- 239000011654 magnesium acetate Substances 0.000 description 4
- 229940069446 magnesium acetate Drugs 0.000 description 4
- 235000011285 magnesium acetate Nutrition 0.000 description 4
- 238000012827 research and development Methods 0.000 description 4
- 238000012216 screening Methods 0.000 description 4
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 3
- 238000010453 CRISPR/Cas method Methods 0.000 description 3
- 241000282465 Canis Species 0.000 description 3
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 3
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 3
- 108010042407 Endonucleases Proteins 0.000 description 3
- 102000004533 Endonucleases Human genes 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- 102000018120 Recombinases Human genes 0.000 description 3
- 108010091086 Recombinases Proteins 0.000 description 3
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 3
- 241001491715 archaeon CG10_big_fil_rev_8_21_14_0_10_43_11 Species 0.000 description 3
- 108010047857 aspartylglycine Proteins 0.000 description 3
- 210000004027 cell Anatomy 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 3
- 208000015181 infectious disease Diseases 0.000 description 3
- 238000003367 kinetic assay Methods 0.000 description 3
- 108010034529 leucyl-lysine Proteins 0.000 description 3
- 239000012139 lysis buffer Substances 0.000 description 3
- 108010064235 lysylglycine Proteins 0.000 description 3
- 229910001629 magnesium chloride Inorganic materials 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 2
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 2
- 208000031295 Animal disease Diseases 0.000 description 2
- 241000203069 Archaea Species 0.000 description 2
- IIAXFBUTKIDDIP-ULQDDVLXSA-N Arg-Leu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IIAXFBUTKIDDIP-ULQDDVLXSA-N 0.000 description 2
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 2
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 2
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 2
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 208000019838 Blood disease Diseases 0.000 description 2
- 208000014644 Brain disease Diseases 0.000 description 2
- 206010006187 Breast cancer Diseases 0.000 description 2
- 208000026310 Breast neoplasm Diseases 0.000 description 2
- 101150018759 CG10 gene Proteins 0.000 description 2
- 206010009944 Colon cancer Diseases 0.000 description 2
- 208000001333 Colorectal Neoplasms Diseases 0.000 description 2
- 208000035473 Communicable disease Diseases 0.000 description 2
- 241000725619 Dengue virus Species 0.000 description 2
- 108090000204 Dipeptidase 1 Proteins 0.000 description 2
- 208000017701 Endocrine disease Diseases 0.000 description 2
- ULGZDMOVFRHVEP-RWJQBGPGSA-N Erythromycin Chemical compound O([C@@H]1[C@@H](C)C(=O)O[C@@H]([C@@]([C@H](O)[C@@H](C)C(=O)[C@H](C)C[C@@](C)(O)[C@H](O[C@H]2[C@@H]([C@H](C[C@@H](C)O2)N(C)C)O)[C@H]1C)(C)O)CC)[C@H]1C[C@@](C)(OC)[C@@H](O)[C@H](C)O1 ULGZDMOVFRHVEP-RWJQBGPGSA-N 0.000 description 2
- 241000282324 Felis Species 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- 241000287828 Gallus gallus Species 0.000 description 2
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 2
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 2
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 2
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 2
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 2
- 208000026350 Inborn Genetic disease Diseases 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 2
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 2
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 2
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 2
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 2
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 2
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 2
- 108060004795 Methyltransferase Proteins 0.000 description 2
- 101100219625 Mus musculus Casd1 gene Proteins 0.000 description 2
- 241000204031 Mycoplasma Species 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- 206010028980 Neoplasm Diseases 0.000 description 2
- 208000012902 Nervous system disease Diseases 0.000 description 2
- 206010033128 Ovarian cancer Diseases 0.000 description 2
- 206010061535 Ovarian neoplasm Diseases 0.000 description 2
- 208000002606 Paramyxoviridae Infections Diseases 0.000 description 2
- 208000005342 Porcine Reproductive and Respiratory Syndrome Diseases 0.000 description 2
- 238000011530 RNeasy Mini Kit Methods 0.000 description 2
- 229920002684 Sepharose Polymers 0.000 description 2
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 2
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 2
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 2
- 241000194017 Streptococcus Species 0.000 description 2
- 101710137500 T7 RNA polymerase Proteins 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 2
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 2
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 2
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 2
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 2
- 210000004102 animal cell Anatomy 0.000 description 2
- 241000617156 archaeon Species 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 102000006635 beta-lactamase Human genes 0.000 description 2
- 239000007853 buffer solution Substances 0.000 description 2
- RYYVLZVUVIJVGH-UHFFFAOYSA-N caffeine Chemical compound CN1C(=O)N(C)C(=O)C2=C1N=CN2C RYYVLZVUVIJVGH-UHFFFAOYSA-N 0.000 description 2
- 201000011510 cancer Diseases 0.000 description 2
- 101150055766 cat gene Proteins 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 230000009089 cytolysis Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 238000006073 displacement reaction Methods 0.000 description 2
- 239000012149 elution buffer Substances 0.000 description 2
- 208000030172 endocrine system disease Diseases 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 238000001506 fluorescence spectroscopy Methods 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 208000014951 hematologic disease Diseases 0.000 description 2
- 208000018706 hematopoietic system disease Diseases 0.000 description 2
- 208000026278 immune system disease Diseases 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 230000004060 metabolic process Effects 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 238000010369 molecular cloning Methods 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 230000009465 prokaryotic expression Effects 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 2
- 238000003259 recombinant expression Methods 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 230000003252 repetitive effect Effects 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 238000011895 specific detection Methods 0.000 description 2
- 238000006467 substitution reaction Methods 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- 238000000108 ultra-filtration Methods 0.000 description 2
- INOZZBHURUDQQR-AJNGGQMLSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(1h-imidazol-5-yl)propanoyl]amino]-3-carboxypropanoyl]amino]-4-carboxybutanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 INOZZBHURUDQQR-AJNGGQMLSA-N 0.000 description 1
- RRBGTUQJDFBWNN-MUGJNUQGSA-N (2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-6-amino-2-[[(2s)-2,6-diaminohexanoyl]amino]hexanoyl]amino]hexanoyl]amino]hexanoic acid Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O RRBGTUQJDFBWNN-MUGJNUQGSA-N 0.000 description 1
- MINDHVHHQZYEEK-UHFFFAOYSA-N (E)-(2S,3R,4R,5S)-5-[(2S,3S,4S,5S)-2,3-epoxy-5-hydroxy-4-methylhexyl]tetrahydro-3,4-dihydroxy-(beta)-methyl-2H-pyran-2-crotonic acid ester with 9-hydroxynonanoic acid Natural products CC(O)C(C)C1OC1CC1C(O)C(O)C(CC(C)=CC(=O)OCCCCCCCCC(O)=O)OC1 MINDHVHHQZYEEK-UHFFFAOYSA-N 0.000 description 1
- 108010036211 5-HT-moduline Proteins 0.000 description 1
- 108700016155 Acyl transferases Proteins 0.000 description 1
- 208000007407 African swine fever Diseases 0.000 description 1
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 1
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 1
- MBWYUTNBYSSUIQ-HERUPUMHSA-N Ala-Asn-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N MBWYUTNBYSSUIQ-HERUPUMHSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 1
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 1
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 1
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 1
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- 206010001557 Albinism Diseases 0.000 description 1
- 108700028369 Alleles Proteins 0.000 description 1
- 201000004384 Alopecia Diseases 0.000 description 1
- 208000024827 Alzheimer disease Diseases 0.000 description 1
- 241001492342 Anatid alphaherpesvirus 1 Species 0.000 description 1
- 206010002556 Ankylosing Spondylitis Diseases 0.000 description 1
- 101100519158 Arabidopsis thaliana PCR2 gene Proteins 0.000 description 1
- 241000180579 Arca Species 0.000 description 1
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 1
- MCYJBCKCAPERSE-FXQIFTODSA-N Arg-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N MCYJBCKCAPERSE-FXQIFTODSA-N 0.000 description 1
- XPSGESXVBSQZPL-SRVKXCTJSA-N Arg-Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XPSGESXVBSQZPL-SRVKXCTJSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 1
- QAXCZGMLVICQKS-SRVKXCTJSA-N Arg-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QAXCZGMLVICQKS-SRVKXCTJSA-N 0.000 description 1
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 1
- RIIVUOJDDQXHRV-SRVKXCTJSA-N Arg-Lys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O RIIVUOJDDQXHRV-SRVKXCTJSA-N 0.000 description 1
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 1
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 1
- 206010003445 Ascites Diseases 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 1
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 1
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 1
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 1
- HJRBIWRXULGMOA-ACZMJKKPSA-N Asn-Gln-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJRBIWRXULGMOA-ACZMJKKPSA-N 0.000 description 1
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 1
- JZDZLBJVYWIIQU-AVGNSLFASA-N Asn-Glu-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JZDZLBJVYWIIQU-AVGNSLFASA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 1
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 1
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 1
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 1
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- HMQDRBKQMLRCCG-GMOBBJLQSA-N Asp-Arg-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HMQDRBKQMLRCCG-GMOBBJLQSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- HOQGTAIGQSDCHR-SRVKXCTJSA-N Asp-Asn-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HOQGTAIGQSDCHR-SRVKXCTJSA-N 0.000 description 1
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 1
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 1
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 1
- YWLDTBBUHZJQHW-KKUMJFAQSA-N Asp-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N YWLDTBBUHZJQHW-KKUMJFAQSA-N 0.000 description 1
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 1
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 1
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 1
- CXEFNHOVIIDHFU-IHPCNDPISA-N Asp-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)O)N CXEFNHOVIIDHFU-IHPCNDPISA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 1
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- 241000711404 Avian avulavirus 1 Species 0.000 description 1
- 241000223836 Babesia Species 0.000 description 1
- 241000193738 Bacillus anthracis Species 0.000 description 1
- 241000124815 Barbus barbus Species 0.000 description 1
- 241000606660 Bartonella Species 0.000 description 1
- 108010006654 Bleomycin Proteins 0.000 description 1
- 241000588832 Bordetella pertussis Species 0.000 description 1
- 241000589562 Brucella Species 0.000 description 1
- 241000711506 Canine coronavirus Species 0.000 description 1
- 241000701157 Canine mastadenovirus A Species 0.000 description 1
- 241000712083 Canine morbillivirus Species 0.000 description 1
- 241000701931 Canine parvovirus Species 0.000 description 1
- 229930186147 Cephalosporin Natural products 0.000 description 1
- 241000606161 Chlamydia Species 0.000 description 1
- 241001647372 Chlamydia pneumoniae Species 0.000 description 1
- 241000606153 Chlamydia trachomatis Species 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 206010010356 Congenital anomaly Diseases 0.000 description 1
- 241000709687 Coxsackievirus Species 0.000 description 1
- QFMCHXSGIZPBKG-ZLUOBGJFSA-N Cys-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N QFMCHXSGIZPBKG-ZLUOBGJFSA-N 0.000 description 1
- UWXFFVQPAMBETM-ZLUOBGJFSA-N Cys-Asp-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UWXFFVQPAMBETM-ZLUOBGJFSA-N 0.000 description 1
- DZSICRGTVPDCRN-YUMQZZPRSA-N Cys-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N DZSICRGTVPDCRN-YUMQZZPRSA-N 0.000 description 1
- IRDBEBCCTCNXGZ-AVGNSLFASA-N Cys-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N)O IRDBEBCCTCNXGZ-AVGNSLFASA-N 0.000 description 1
- 201000003883 Cystic fibrosis Diseases 0.000 description 1
- 108010026925 Cytochrome P-450 CYP2C19 Proteins 0.000 description 1
- 108010000543 Cytochrome P-450 CYP2C9 Proteins 0.000 description 1
- 102100029363 Cytochrome P450 2C19 Human genes 0.000 description 1
- 102100029358 Cytochrome P450 2C9 Human genes 0.000 description 1
- 108020001019 DNA Primers Proteins 0.000 description 1
- 108020003215 DNA Probes Proteins 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 206010011878 Deafness Diseases 0.000 description 1
- 241000439574 Decapod penstyldensovirus 1 Species 0.000 description 1
- 206010012310 Dengue fever Diseases 0.000 description 1
- 206010012438 Dermatitis atopic Diseases 0.000 description 1
- 201000010374 Down Syndrome Diseases 0.000 description 1
- 241001115402 Ebolavirus Species 0.000 description 1
- 206010014596 Encephalitis Japanese B Diseases 0.000 description 1
- 241000709661 Enterovirus Species 0.000 description 1
- 241000186810 Erysipelothrix rhusiopathiae Species 0.000 description 1
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 1
- 241000701087 Felid alphaherpesvirus 1 Species 0.000 description 1
- 208000002613 Feline Panleukopenia Diseases 0.000 description 1
- 241000714201 Feline calicivirus Species 0.000 description 1
- 241000725579 Feline coronavirus Species 0.000 description 1
- 208000007212 Foot-and-Mouth Disease Diseases 0.000 description 1
- 241000710198 Foot-and-mouth disease virus Species 0.000 description 1
- 208000001914 Fragile X syndrome Diseases 0.000 description 1
- 208000025499 G6PD deficiency Diseases 0.000 description 1
- 241000701047 Gallid alphaherpesvirus 2 Species 0.000 description 1
- 206010051066 Gastrointestinal stromal tumour Diseases 0.000 description 1
- SXIJQMBEVYWAQT-GUBZILKMSA-N Gln-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXIJQMBEVYWAQT-GUBZILKMSA-N 0.000 description 1
- BLOXULLYFRGYKZ-GUBZILKMSA-N Gln-Glu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BLOXULLYFRGYKZ-GUBZILKMSA-N 0.000 description 1
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- ILKYYKRAULNYMS-JYJNAYRXSA-N Gln-Lys-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ILKYYKRAULNYMS-JYJNAYRXSA-N 0.000 description 1
- BZULIEARJFRINC-IHRRRGAJSA-N Gln-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BZULIEARJFRINC-IHRRRGAJSA-N 0.000 description 1
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 1
- JUUNNOLZGVYCJT-JYJNAYRXSA-N Gln-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JUUNNOLZGVYCJT-JYJNAYRXSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- VOUSELYGTNGEPB-NUMRIWBASA-N Gln-Thr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O VOUSELYGTNGEPB-NUMRIWBASA-N 0.000 description 1
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 1
- UQKVUFGUSVYJMQ-IRIUXVKKSA-N Gln-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N)O UQKVUFGUSVYJMQ-IRIUXVKKSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 1
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- BIHMNDPWRUROFZ-JYJNAYRXSA-N Glu-His-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BIHMNDPWRUROFZ-JYJNAYRXSA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- JDUKCSSHWNIQQZ-IHRRRGAJSA-N Glu-Phe-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JDUKCSSHWNIQQZ-IHRRRGAJSA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- VJVAQZYGLMJPTK-QEJZJMRPSA-N Glu-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VJVAQZYGLMJPTK-QEJZJMRPSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- UPADCCSMVOQAGF-LBPRGKRZSA-N Gly-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)CN)C(O)=O)=CNC2=C1 UPADCCSMVOQAGF-LBPRGKRZSA-N 0.000 description 1
- ZOTGXWMKUFSKEU-QXEWZRGKSA-N Gly-Ile-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O ZOTGXWMKUFSKEU-QXEWZRGKSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- TVUWMSBGMVAHSJ-KBPBESRZSA-N Gly-Leu-Phe Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TVUWMSBGMVAHSJ-KBPBESRZSA-N 0.000 description 1
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 1
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- 206010069767 H1N1 influenza Diseases 0.000 description 1
- 241000590002 Helicobacter pylori Species 0.000 description 1
- 208000031220 Hemophilia Diseases 0.000 description 1
- 208000009292 Hemophilia A Diseases 0.000 description 1
- 241000711549 Hepacivirus C Species 0.000 description 1
- 241000700721 Hepatitis B virus Species 0.000 description 1
- 241000724675 Hepatitis E virus Species 0.000 description 1
- 208000037262 Hepatitis delta Diseases 0.000 description 1
- 241000724709 Hepatitis delta virus Species 0.000 description 1
- 208000002972 Hepatolenticular Degeneration Diseases 0.000 description 1
- 241000709721 Hepatovirus A Species 0.000 description 1
- 208000028782 Hereditary disease Diseases 0.000 description 1
- VOEGKUNRHYKYSU-XVYDVKMFSA-N His-Asp-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O VOEGKUNRHYKYSU-XVYDVKMFSA-N 0.000 description 1
- ZJSMFRTVYSLKQU-DJFWLOJKSA-N His-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZJSMFRTVYSLKQU-DJFWLOJKSA-N 0.000 description 1
- SDTPKSOWFXBACN-GUBZILKMSA-N His-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O SDTPKSOWFXBACN-GUBZILKMSA-N 0.000 description 1
- LBQAHBIVXQSBIR-HVTMNAMFSA-N His-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N LBQAHBIVXQSBIR-HVTMNAMFSA-N 0.000 description 1
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 1
- BRQKGRLDDDQWQJ-MBLNEYKQSA-N His-Thr-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O BRQKGRLDDDQWQJ-MBLNEYKQSA-N 0.000 description 1
- DEMIXZCKUXVEBO-BWAGICSOSA-N His-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O DEMIXZCKUXVEBO-BWAGICSOSA-N 0.000 description 1
- 101000621945 Homo sapiens Vitamin K epoxide reductase complex subunit 1 Proteins 0.000 description 1
- 241000701024 Human betaherpesvirus 5 Species 0.000 description 1
- 241000725303 Human immunodeficiency virus Species 0.000 description 1
- 241000342334 Human metapneumovirus Species 0.000 description 1
- 241000701806 Human papillomavirus Species 0.000 description 1
- 241000484121 Human parvovirus Species 0.000 description 1
- 241000430519 Human rhinovirus sp. Species 0.000 description 1
- 208000023105 Huntington disease Diseases 0.000 description 1
- 206010020772 Hypertension Diseases 0.000 description 1
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 1
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 1
- NULSANWBUWLTKN-NAKRPEOUSA-N Ile-Arg-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N NULSANWBUWLTKN-NAKRPEOUSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 1
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 1
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 description 1
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 1
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 1
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 1
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 1
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 1
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 1
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 1
- YTRFFJUOYBMLPN-UHFFFAOYSA-N Ile-Lys-Lys-Ser Chemical compound CCC(C)C(N)C(=O)NC(CCCCN)C(=O)NC(CCCCN)C(=O)NC(CO)C(O)=O YTRFFJUOYBMLPN-UHFFFAOYSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 1
- ZSESFIFAYQEKRD-CYDGBPFRSA-N Ile-Val-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N ZSESFIFAYQEKRD-CYDGBPFRSA-N 0.000 description 1
- 241000702626 Infectious bursal disease virus Species 0.000 description 1
- 241000712431 Influenza A virus Species 0.000 description 1
- 241000713196 Influenza B virus Species 0.000 description 1
- 208000002979 Influenza in Birds Diseases 0.000 description 1
- LPHGQDQBBGAPDZ-UHFFFAOYSA-N Isocaffeine Natural products CN1C(=O)N(C)C(=O)C2=C1N(C)C=N2 LPHGQDQBBGAPDZ-UHFFFAOYSA-N 0.000 description 1
- 201000005807 Japanese encephalitis Diseases 0.000 description 1
- 241000710842 Japanese encephalitis virus Species 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 241000589242 Legionella pneumophila Species 0.000 description 1
- 241000589902 Leptospira Species 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- VKOAHIRLIUESLU-ULQDDVLXSA-N Leu-Arg-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VKOAHIRLIUESLU-ULQDDVLXSA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 1
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- DDEMUMVXNFPDKC-SRVKXCTJSA-N Leu-His-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N DDEMUMVXNFPDKC-SRVKXCTJSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- FOBUGKUBUJOWAD-IHPCNDPISA-N Leu-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FOBUGKUBUJOWAD-IHPCNDPISA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 1
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 1
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- CNWDWAMPKVYJJB-NUTKFTJISA-N Leu-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CNWDWAMPKVYJJB-NUTKFTJISA-N 0.000 description 1
- WUHBLPVELFTPQK-KKUMJFAQSA-N Leu-Tyr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O WUHBLPVELFTPQK-KKUMJFAQSA-N 0.000 description 1
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 1
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 1
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 1
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 1
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 1
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 1
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 1
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 1
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 1
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 1
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 1
- DAOSYIZXRCOKII-SRVKXCTJSA-N Lys-His-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O DAOSYIZXRCOKII-SRVKXCTJSA-N 0.000 description 1
- ZMMDPRTXLAEMOD-BZSNNMDCSA-N Lys-His-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZMMDPRTXLAEMOD-BZSNNMDCSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- PFZWARWVRNTPBR-IHPCNDPISA-N Lys-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N PFZWARWVRNTPBR-IHPCNDPISA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- XFOAWKDQMRMCDN-ULQDDVLXSA-N Lys-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)CC1=CC=CC=C1 XFOAWKDQMRMCDN-ULQDDVLXSA-N 0.000 description 1
- IPTUBUUIFRZMJK-ACRUOGEOSA-N Lys-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 IPTUBUUIFRZMJK-ACRUOGEOSA-N 0.000 description 1
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 1
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 1
- BVRNWWHJYNPJDG-XIRDDKMYSA-N Lys-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N BVRNWWHJYNPJDG-XIRDDKMYSA-N 0.000 description 1
- PELXPRPDQRFBGQ-KKUMJFAQSA-N Lys-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O PELXPRPDQRFBGQ-KKUMJFAQSA-N 0.000 description 1
- XATKLFSXFINPSB-JYJNAYRXSA-N Lys-Tyr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O XATKLFSXFINPSB-JYJNAYRXSA-N 0.000 description 1
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 1
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 1
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 1
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 1
- 208000024556 Mendelian disease Diseases 0.000 description 1
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 1
- NCVJJAJVWILAGI-SRVKXCTJSA-N Met-Gln-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NCVJJAJVWILAGI-SRVKXCTJSA-N 0.000 description 1
- JACAKCWAOHKQBV-UWVGGRQHSA-N Met-Gly-Lys Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN JACAKCWAOHKQBV-UWVGGRQHSA-N 0.000 description 1
- CHDYFPCQVUOJEB-ULQDDVLXSA-N Met-Leu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CHDYFPCQVUOJEB-ULQDDVLXSA-N 0.000 description 1
- QLESZRANMSYLCZ-CYDGBPFRSA-N Met-Pro-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QLESZRANMSYLCZ-CYDGBPFRSA-N 0.000 description 1
- 241000127282 Middle East respiratory syndrome-related coronavirus Species 0.000 description 1
- 102000007474 Multiprotein Complexes Human genes 0.000 description 1
- 241000711386 Mumps virus Species 0.000 description 1
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 1
- 241000202934 Mycoplasma pneumoniae Species 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 241000588653 Neisseria Species 0.000 description 1
- 241001263478 Norovirus Species 0.000 description 1
- 108091005461 Nucleic proteins Proteins 0.000 description 1
- 208000009620 Orthomyxoviridae Infections Diseases 0.000 description 1
- 206010061902 Pancreatic neoplasm Diseases 0.000 description 1
- 208000018737 Parkinson disease Diseases 0.000 description 1
- 241000606856 Pasteurella multocida Species 0.000 description 1
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 1
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 1
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 1
- KKYHKZCMETTXEO-AVGNSLFASA-N Phe-Cys-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKYHKZCMETTXEO-AVGNSLFASA-N 0.000 description 1
- RJYBHZVWJPUSLB-QEWYBTABSA-N Phe-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N RJYBHZVWJPUSLB-QEWYBTABSA-N 0.000 description 1
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 1
- CDQCFGOQNYOICK-IHRRRGAJSA-N Phe-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDQCFGOQNYOICK-IHRRRGAJSA-N 0.000 description 1
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 1
- PSKRILMFHNIUAO-JYJNAYRXSA-N Phe-Glu-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N PSKRILMFHNIUAO-JYJNAYRXSA-N 0.000 description 1
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 1
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 1
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 1
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 1
- HBXAOEBRGLCLIW-AVGNSLFASA-N Phe-Ser-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HBXAOEBRGLCLIW-AVGNSLFASA-N 0.000 description 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 1
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 1
- 201000011252 Phenylketonuria Diseases 0.000 description 1
- 208000002151 Pleural effusion Diseases 0.000 description 1
- 241000202347 Porcine circovirus Species 0.000 description 1
- 241001135549 Porcine epidemic diarrhea virus Species 0.000 description 1
- 241000702619 Porcine parvovirus Species 0.000 description 1
- 241000702665 Porcine rotavirus Species 0.000 description 1
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 1
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 1
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 1
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 1
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 1
- SMFQZMGHCODUPQ-ULQDDVLXSA-N Pro-Lys-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SMFQZMGHCODUPQ-ULQDDVLXSA-N 0.000 description 1
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 1
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 1
- UIUWGMRJTWHIJZ-ULQDDVLXSA-N Pro-Tyr-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O UIUWGMRJTWHIJZ-ULQDDVLXSA-N 0.000 description 1
- 206010036790 Productive cough Diseases 0.000 description 1
- 206010060862 Prostate cancer Diseases 0.000 description 1
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 1
- 206010037660 Pyrexia Diseases 0.000 description 1
- 108010003201 RGH 0205 Proteins 0.000 description 1
- 241000711798 Rabies lyssavirus Species 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 241000725643 Respiratory syncytial virus Species 0.000 description 1
- 208000006289 Rett Syndrome Diseases 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 241000711897 Rinderpest morbillivirus Species 0.000 description 1
- 241000702670 Rotavirus Species 0.000 description 1
- 241000607142 Salmonella Species 0.000 description 1
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 1
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- XPVIVVLLLOFBRH-XIRDDKMYSA-N Ser-Trp-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](N)CO)C(O)=O XPVIVVLLLOFBRH-XIRDDKMYSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- 206010040102 Seroma Diseases 0.000 description 1
- 241000607768 Shigella Species 0.000 description 1
- 241000700584 Simplexvirus Species 0.000 description 1
- 241000191967 Staphylococcus aureus Species 0.000 description 1
- 208000005718 Stomach Neoplasms Diseases 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 208000006011 Stroke Diseases 0.000 description 1
- 241000701093 Suid alphaherpesvirus 1 Species 0.000 description 1
- 239000012505 Superdex™ Substances 0.000 description 1
- 208000033809 Suppuration Diseases 0.000 description 1
- 241000282898 Sus scrofa Species 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 208000002903 Thalassemia Diseases 0.000 description 1
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 1
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 1
- IRKWVRSEQFTGGV-VEVYYDQMSA-N Thr-Asn-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IRKWVRSEQFTGGV-VEVYYDQMSA-N 0.000 description 1
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 1
- WDFPMSHYMRBLKM-NKIYYHGXSA-N Thr-Glu-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O WDFPMSHYMRBLKM-NKIYYHGXSA-N 0.000 description 1
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 1
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- HSQXHRIRJSFDOH-URLPEUOOSA-N Thr-Phe-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HSQXHRIRJSFDOH-URLPEUOOSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 1
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 1
- NJGMALCNYAMYCB-JRQIVUDYSA-N Thr-Tyr-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJGMALCNYAMYCB-JRQIVUDYSA-N 0.000 description 1
- CJEHCEOXPLASCK-MEYUZBJRSA-N Thr-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=C(O)C=C1 CJEHCEOXPLASCK-MEYUZBJRSA-N 0.000 description 1
- SJPDTIQHLBQPFO-VLCNGCBASA-N Thr-Tyr-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SJPDTIQHLBQPFO-VLCNGCBASA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- 208000024770 Thyroid neoplasm Diseases 0.000 description 1
- 241000223997 Toxoplasma gondii Species 0.000 description 1
- 108091028113 Trans-activating crRNA Proteins 0.000 description 1
- 108700019146 Transgenes Proteins 0.000 description 1
- 241000711484 Transmissible gastroenteritis virus Species 0.000 description 1
- 241000589884 Treponema pallidum Species 0.000 description 1
- 241000610628 Trichoptilium incisum Species 0.000 description 1
- 208000006284 Trisomy 13 Syndrome Diseases 0.000 description 1
- 208000007159 Trisomy 18 Syndrome Diseases 0.000 description 1
- MVHHTXAUJCIOMZ-WDSOQIARSA-N Trp-Arg-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N MVHHTXAUJCIOMZ-WDSOQIARSA-N 0.000 description 1
- HJTYJQVRIQXMHM-XIRDDKMYSA-N Trp-Asp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N HJTYJQVRIQXMHM-XIRDDKMYSA-N 0.000 description 1
- DQDXHYIEITXNJY-BPUTZDHNSA-N Trp-Gln-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N DQDXHYIEITXNJY-BPUTZDHNSA-N 0.000 description 1
- VMBBTANKMSRJSS-JSGCOSHPSA-N Trp-Glu-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VMBBTANKMSRJSS-JSGCOSHPSA-N 0.000 description 1
- SNJAPSVIPKUMCK-NWLDYVSISA-N Trp-Glu-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SNJAPSVIPKUMCK-NWLDYVSISA-N 0.000 description 1
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 1
- MEZCXKYMMQJRDE-PMVMPFDFSA-N Trp-Leu-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CC(C)C)C(O)=O)C1=CC=C(O)C=C1 MEZCXKYMMQJRDE-PMVMPFDFSA-N 0.000 description 1
- GQEXFCQNAJHJTI-IHPCNDPISA-N Trp-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GQEXFCQNAJHJTI-IHPCNDPISA-N 0.000 description 1
- WSMVEHPVOYXPAQ-XIRDDKMYSA-N Trp-Ser-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N WSMVEHPVOYXPAQ-XIRDDKMYSA-N 0.000 description 1
- FHHYVSCGOMPLLO-IHPCNDPISA-N Trp-Tyr-Asp Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 FHHYVSCGOMPLLO-IHPCNDPISA-N 0.000 description 1
- 239000004182 Tylosin Substances 0.000 description 1
- 229930194936 Tylosin Natural products 0.000 description 1
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 1
- CKKFTIQYURNSEI-IHRRRGAJSA-N Tyr-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CKKFTIQYURNSEI-IHRRRGAJSA-N 0.000 description 1
- FJKXUIJOMUWCDD-FHWLQOOXSA-N Tyr-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N)O FJKXUIJOMUWCDD-FHWLQOOXSA-N 0.000 description 1
- LMLBOGIOLHZXOT-JYJNAYRXSA-N Tyr-Glu-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O LMLBOGIOLHZXOT-JYJNAYRXSA-N 0.000 description 1
- FIRUOPRJKCBLST-KKUMJFAQSA-N Tyr-His-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O FIRUOPRJKCBLST-KKUMJFAQSA-N 0.000 description 1
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 1
- GYKDRHDMGQUZPU-MGHWNKPDSA-N Tyr-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GYKDRHDMGQUZPU-MGHWNKPDSA-N 0.000 description 1
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 1
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 1
- LRHBBGDMBLFYGL-FHWLQOOXSA-N Tyr-Phe-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=C(O)C=C1 LRHBBGDMBLFYGL-FHWLQOOXSA-N 0.000 description 1
- JXGUUJMPCRXMSO-HJOGWXRNSA-N Tyr-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 JXGUUJMPCRXMSO-HJOGWXRNSA-N 0.000 description 1
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 1
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 1
- LVILBTSHPTWDGE-PMVMPFDFSA-N Tyr-Trp-Lys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(O)=O)C1=CC=C(O)C=C1 LVILBTSHPTWDGE-PMVMPFDFSA-N 0.000 description 1
- AGDDLOQMXUQPDY-BZSNNMDCSA-N Tyr-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O AGDDLOQMXUQPDY-BZSNNMDCSA-N 0.000 description 1
- 241000202921 Ureaplasma urealyticum Species 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 1
- KNYHAWKHFQRYOX-PYJNHQTQSA-N Val-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N KNYHAWKHFQRYOX-PYJNHQTQSA-N 0.000 description 1
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 1
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 1
- 108010059993 Vancomycin Proteins 0.000 description 1
- 102100023485 Vitamin K epoxide reductase complex subunit 1 Human genes 0.000 description 1
- 208000001455 Zika Virus Infection Diseases 0.000 description 1
- 208000035332 Zika virus disease Diseases 0.000 description 1
- 208000020329 Zika virus infectious disease Diseases 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 208000026935 allergic disease Diseases 0.000 description 1
- 231100000360 alopecia Toxicity 0.000 description 1
- 229940126575 aminoglycoside Drugs 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 206010002026 amyotrophic lateral sclerosis Diseases 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 230000003064 anti-oxidating effect Effects 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 201000008937 atopic dermatitis Diseases 0.000 description 1
- 206010064097 avian influenza Diseases 0.000 description 1
- 201000008680 babesiosis Diseases 0.000 description 1
- 229940065181 bacillus anthracis Drugs 0.000 description 1
- 238000003287 bathing Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 235000013361 beverage Nutrition 0.000 description 1
- 238000003766 bioinformatics method Methods 0.000 description 1
- 229960001561 bleomycin Drugs 0.000 description 1
- OYVAGSVQBOHSSS-UAPAGMARSA-O bleomycin A2 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC=C(N=1)C=1SC=C(N=1)C(=O)NCCC[S+](C)C)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C OYVAGSVQBOHSSS-UAPAGMARSA-O 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 229960001948 caffeine Drugs 0.000 description 1
- VJEONQKOZGKCAK-UHFFFAOYSA-N caffeine Natural products CN1C(=O)N(C)C(=O)C2=C1C=CN2C VJEONQKOZGKCAK-UHFFFAOYSA-N 0.000 description 1
- 108010068385 carbapenemase Proteins 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 229940124587 cephalosporin Drugs 0.000 description 1
- 150000001780 cephalosporins Chemical class 0.000 description 1
- 210000001175 cerebrospinal fluid Anatomy 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 229940038705 chlamydia trachomatis Drugs 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 230000019771 cognition Effects 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 208000029078 coronary artery disease Diseases 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 231100000895 deafness Toxicity 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 208000025729 dengue disease Diseases 0.000 description 1
- 239000000645 desinfectant Substances 0.000 description 1
- 206010012601 diabetes mellitus Diseases 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 208000022802 disorder of glycosylation Diseases 0.000 description 1
- 239000012636 effector Substances 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 229960003276 erythromycin Drugs 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000004744 fabric Substances 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 210000003608 fece Anatomy 0.000 description 1
- 208000005098 feline infectious peritonitis Diseases 0.000 description 1
- 238000012921 fluorescence analysis Methods 0.000 description 1
- 238000001917 fluorescence detection Methods 0.000 description 1
- 229940124307 fluoroquinolone Drugs 0.000 description 1
- 238000007710 freezing Methods 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 206010017758 gastric cancer Diseases 0.000 description 1
- 201000011243 gastrointestinal stromal tumor Diseases 0.000 description 1
- 238000005227 gel permeation chromatography Methods 0.000 description 1
- 108091008053 gene clusters Proteins 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 208000008605 glucosephosphate dehydrogenase deficiency Diseases 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 208000007345 glycogen storage disease Diseases 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010050848 glycylleucine Proteins 0.000 description 1
- 208000021245 head disease Diseases 0.000 description 1
- 208000016354 hearing loss disease Diseases 0.000 description 1
- 229940037467 helicobacter pylori Drugs 0.000 description 1
- 108010041601 histidyl-aspartyl-glutamyl-leucine Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 239000005457 ice water Substances 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 206010022000 influenza Diseases 0.000 description 1
- 208000015978 inherited metabolic disease Diseases 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 229940115932 legionella pneumophila Drugs 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 210000000265 leukocyte Anatomy 0.000 description 1
- 201000005202 lung cancer Diseases 0.000 description 1
- 208000020816 lung neoplasm Diseases 0.000 description 1
- 210000004880 lymph fluid Anatomy 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010076718 lysyl-glutamyl-tryptophan Proteins 0.000 description 1
- 239000003120 macrolide antibiotic agent Substances 0.000 description 1
- 208000002780 macular degeneration Diseases 0.000 description 1
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 210000004877 mucosa Anatomy 0.000 description 1
- 210000003097 mucus Anatomy 0.000 description 1
- 230000036457 multidrug resistance Effects 0.000 description 1
- 201000006417 multiple sclerosis Diseases 0.000 description 1
- 229960003128 mupirocin Drugs 0.000 description 1
- 229930187697 mupirocin Natural products 0.000 description 1
- DDHVILIIHBIMQU-YJGQQKNPSA-L mupirocin calcium hydrate Chemical compound O.O.[Ca+2].C[C@H](O)[C@H](C)[C@@H]1O[C@H]1C[C@@H]1[C@@H](O)[C@@H](O)[C@H](C\C(C)=C\C(=O)OCCCCCCCCC([O-])=O)OC1.C[C@H](O)[C@H](C)[C@@H]1O[C@H]1C[C@@H]1[C@@H](O)[C@@H](O)[C@H](C\C(C)=C\C(=O)OCCCCCCCCC([O-])=O)OC1 DDHVILIIHBIMQU-YJGQQKNPSA-L 0.000 description 1
- 206010028320 muscle necrosis Diseases 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 201000002523 pancreas lymphoma Diseases 0.000 description 1
- 201000002528 pancreatic cancer Diseases 0.000 description 1
- 208000008443 pancreatic carcinoma Diseases 0.000 description 1
- 238000004091 panning Methods 0.000 description 1
- 229940051027 pasteurella multocida Drugs 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 210000002381 plasma Anatomy 0.000 description 1
- 208000030761 polycystic kidney disease Diseases 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000002035 prolonged effect Effects 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 230000004952 protein activity Effects 0.000 description 1
- 230000005180 public health Effects 0.000 description 1
- 229950010131 puromycin Drugs 0.000 description 1
- 238000010791 quenching Methods 0.000 description 1
- 230000000171 quenching effect Effects 0.000 description 1
- 230000035484 reaction time Effects 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- -1 saliva Substances 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 230000001932 seasonal effect Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000012772 sequence design Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 238000010008 shearing Methods 0.000 description 1
- 208000007056 sickle cell anemia Diseases 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 108010005652 splenotritin Proteins 0.000 description 1
- 210000003802 sputum Anatomy 0.000 description 1
- 208000024794 sputum Diseases 0.000 description 1
- 201000011549 stomach cancer Diseases 0.000 description 1
- 229940124530 sulfonamide Drugs 0.000 description 1
- 150000003456 sulfonamides Chemical class 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 201000010740 swine influenza Diseases 0.000 description 1
- 208000011580 syndromic disease Diseases 0.000 description 1
- 210000001179 synovial fluid Anatomy 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 201000002510 thyroid cancer Diseases 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- 108010029384 tryptophyl-histidine Proteins 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- WBPYTXDJUQJLPQ-VMXQISHHSA-N tylosin Chemical compound O([C@@H]1[C@@H](C)O[C@H]([C@@H]([C@H]1N(C)C)O)O[C@@H]1[C@@H](C)[C@H](O)CC(=O)O[C@@H]([C@H](/C=C(\C)/C=C/C(=O)[C@H](C)C[C@@H]1CC=O)CO[C@H]1[C@@H]([C@H](OC)[C@H](O)[C@@H](C)O1)OC)CC)[C@H]1C[C@@](C)(O)[C@@H](O)[C@H](C)O1 WBPYTXDJUQJLPQ-VMXQISHHSA-N 0.000 description 1
- 229960004059 tylosin Drugs 0.000 description 1
- 235000019375 tylosin Nutrition 0.000 description 1
- 201000011296 tyrosinemia Diseases 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241000712461 unidentified influenza virus Species 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- MYPYJXKWCTUITO-LYRMYLQWSA-N vancomycin Chemical compound O([C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1OC1=C2C=C3C=C1OC1=CC=C(C=C1Cl)[C@@H](O)[C@H](C(N[C@@H](CC(N)=O)C(=O)N[C@H]3C(=O)N[C@H]1C(=O)N[C@H](C(N[C@@H](C3=CC(O)=CC(O)=C3C=3C(O)=CC=C1C=3)C(O)=O)=O)[C@H](O)C1=CC=C(C(=C1)Cl)O2)=O)NC(=O)[C@@H](CC(C)C)NC)[C@H]1C[C@](C)(N)[C@H](O)[C@H](C)O1 MYPYJXKWCTUITO-LYRMYLQWSA-N 0.000 description 1
- 229960003165 vancomycin Drugs 0.000 description 1
- MYPYJXKWCTUITO-UHFFFAOYSA-N vancomycin Natural products O1C(C(=C2)Cl)=CC=C2C(O)C(C(NC(C2=CC(O)=CC(O)=C2C=2C(O)=CC=C3C=2)C(O)=O)=O)NC(=O)C3NC(=O)C2NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(CC(C)C)NC)C(O)C(C=C3Cl)=CC=C3OC3=CC2=CC1=C3OC1OC(CO)C(O)C(O)C1OC1CC(C)(N)C(O)C(C)O1 MYPYJXKWCTUITO-UHFFFAOYSA-N 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 239000002023 wood Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A50/00—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
- Y02A50/30—Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Molecular Biology (AREA)
- Genetics & Genomics (AREA)
- Microbiology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Medicinal Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Physics & Mathematics (AREA)
- Analytical Chemistry (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Immunology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
本发明公开了新型耐高温arCas12a蛋白在核酸检测方面的应用。本发明研究发现一种在gRNA介导下对靶标序列特异性识别并具有非特异单链DNA切割活性的新型Cas12a蛋白,可作为新型CRISPR/arCas12a系统应用于核酸检测,为基于Cas12a的修饰及分子检测提供了一种新的必要工具选择。同时提供了一种包括arCas12a蛋白和gRNA的新型核酸检测系统和试剂盒,可在25℃‑55℃的室温下实现高灵敏、高精度的分子检测,检测特异性好、灵敏度高,成本低廉、操作方便、快捷、应用范围广,在核酸检测方面具有很好的应用前景。
Description
技术领域
本发明属于分子生物学技术领域。更具体地,涉及一种新型arCas12a蛋白在核酸检测方面的应用。
背景技术
2015年,一种全新的第二类CRISPR-Cas系统-V型系统被发现,该系统中的效应蛋白被命名为Cpf1/Cas12a。张锋团队于2015年11月22日在《Cell》发表的一篇题为“Cpf1 isa single RNA-guided endonuclease of a Class 2 CRISPR-Cas system”的文章。该系统基本的工作流程与CRISPR/Cas9类似,还是借助CRISPR序列的“黑名单”系统对入侵者进行打击。但是gRNA形成的方式与CRISPR/Cas9系统不同:Cpf1/Cas12a蛋白会与未成熟的gRNA复合,并对gRNA进行加工,gRNA则会与PAM附近的互补区域杂交。最后,外源双链DNA(doublestrand DNA,dsDNA)会被剪切,其基因表达也会被沉默。然而,在切割靶dsDNA的同时,激活了的Cpf1/Cas12a还会降解靶dsDNA邻近的单链DNA(single strand DNA,ssDNA),称之为“附属切割”,这种活性是新开发的核酸检测平台的关键特征。2018年4月27日,Doudna团队和张锋团队同时在《Science》发表了两篇题为“Two distinct RNase activities ofCRISPR-C2c2 enable guide-RNA processing and RNA detection”和“Multiplexed andportable nucleic acid detection platform with Cas13, Cas12a, and Csm6”的论文。表明Cpf1/Cas12a在切割靶dsDNA的同时,还会降解靶dsDNA邻近的ssDNA。这两个独立的实验室分别改造了靶向dsDNA的V型CRISPR系统,使其成为了快速、便宜且高度灵敏的诊断工具。这一发现有望为科学研究以及全球公共卫生带来变革性的影响。利用这种新的CRISPR技术:CRISPR-Cpf1/Cas12a,可以高灵敏度检测包括寨卡病毒感染,登革热病毒感染等在内的疾病,其原理在于将CRISPR-Cpf1/Cas12a与等温核酸扩增结合,检测特异性的RNA或DNA。另外,该系统还包含有一个切割时发出荧光的报告ssDNA。当Cpf1/Cas12a检测到靶标dsDNA序列时,其ssDNase活性就会切割报告ssDNA,释放可检测的荧光信号。将这两种技术结合起来的新系统能够以极低浓度检测RNA和单链DNA分子,具有很好的应用前景。根据张锋团队的研究结果显示,Cpf1/Cas12a蛋白同族间具有较大的差异,某些同族蛋白无活性。目前报道使用较多的lbaCas12a蛋白,其活性温度范围为25-37℃,在高于40℃条件下,lbaCas12a蛋白活性大幅下降,这限制了其与更多其它技术/高温酶的联用,影响了恒温扩增与Cas12a检测一管法的开发及应用。
发明内容
本发明涉及一种用于核酸检测的arCas12a蛋白或其功能变体,所述arCas12a蛋白氨基酸序列如SEQ ID NO.1所示;和/或,所述arCas12a蛋白核苷酸序列如SEQ ID NO.2或SEQ ID NO.3所示。
本发明还涉及一种基于CRISPR/Cas12a的核酸检测系统,包括上述的arCas12a蛋白或其功能变体,还包括gRNA。
本发明还涉及基于CRISPR/Cas12a的核酸检测系统的核酸检测方法,其利用上述的arCas12a蛋白或其功能变体、对应靶标序列的gRNA进行CRISPR核酸检测。
本发明还涉及上述arCas12a蛋白或其功能变体的应用,包括:
在切割DNA方面的应用;在作为或制备DNA切割工具方面的应用;在核酸检测方面的应用;在基于CRISPR/Cas12a的核酸检测方面的应用;在作为或制备核酸检测工具方面的应用;在作为或制备基于CRISPR/Cas12a的核酸检测工具方面的应用。
本发明具有以下有益效果:
本发明研究发现一种在gRNA介导下对靶标序列特异性识别并具有非特异单链DNA切割活性的新型Cas12a蛋白,可作为新型CRISPR/arCas12a系统应用于核酸检测,为基于Cas12a的核酸分子检测提供了一种必要工具的新选择。
而且基于arCas12a蛋白的核酸检测系统可在25-55℃实现高灵敏、高精度的分子检测,更宽广的活性温度范围将有利于该新型CRISPR/arCas12a系统与其余多种恒温扩增一步法联用方案的开发。同时,该新型CRISPR/arCas12a系统具有很好的特异性和兼容性,检测灵敏度高,且检测成本低廉、操作方便、快捷、应用范围广,在核酸检测方面具有很好的应用前景。
附图说明
图1为arCas12a蛋白三维结构图。
图2为arCas12a片段PCR扩增结果。
图3为arCas12a蛋白表达结果。
图4为arCas12a蛋白纯化结果。
图5为arCas12a gRNA纯化结果。
图6为基于CRISPR/arCas12a的核酸检测结果。
图7为基于CRISPR/arCas12a 体系配对的gRNA框架序列筛选结果。
图8为基于CRISPR/arCas12a的核酸灵敏度检测结果。
图9为基于CRISPR/arCas12a的核酸特异性检测结果。
图10为胶体金试纸条判断方法示意图。
图11为基于CRISPR/arCas12a的胶体金试纸条方法核酸灵敏度检测结果。
图12为基于CRISPR/arCas12a的胶体金试纸条方法核酸特异性检测结果。
图13为基于CRISPR/arCas12a的检测系统在37-55℃检测结果。
具体实施方式
以下结合说明书附图和具体实施例来进一步说明本发明,但实施例并不对本发明做任何形式的限定。对于本领域技术人员来说,其他的任何未背离本发明的精神实质与原理下所作的改变、修饰、替代、组合、简化,均应为等效的置换方式,都包含在本发明的保护范围之内。
除非特别说明,本发明采用的试剂、方法和设备为本技术领域常规试剂、方法和设备。除非特别说明,以下实施例所用试剂和材料均为市购。
除非另有说明,本发明采用的免疫学、生物化学、化学、分子生物学、微生物学、细胞生物学、基因组学和重组DNA等是本领域的常规技能。参见萨姆布鲁克( Sambrook )、弗里奇( Fritsch )和马尼亚蒂斯( Maniatis ),《分子克隆:实验室手册》( MOLECULARCLONING:A LABORATORY MANUAL ),第2次编辑( 1989 );《当代分子生物学实验手册》(CURRENT PROTOCOLS IN MOLECULAR BIOLOGY )( F .M .奥苏贝尔( F .M .Ausubel )等人编辑,( 1987 ));《酶学方法》( METHODS IN ENZYMOLOGY )系列(学术出版公司):《PCR2:实用方法》( PCR 2:A PRACTICAL APPROACH )( M .J .麦克弗森( M .J .MacPherson )、B.D .黑姆斯( B .D .Hames )和G .R .泰勒( G .R .Taylor )编辑( 1995 ))、哈洛(Harlow )和拉内( Lane )编辑( 1988 )《抗体:实验室手册》( ANTIBODIES ,A LABORATORYMANUAL ),以及《动物细胞培养》( ANIMAL CELL CULTURE )(R .I .弗雷谢尼(R .I.Freshney )编辑(1987 ))。
如下述实施例中所用的异丙基硫化-D-半乳糖苷(IPTG)购买自Sigma公司。NiSepharose FF购买自GE Healthcare。蛋白纯化耗材购买自碧云天公司。Amicon 4 30kDa超滤管购买自Millipore公司。Phusion DNA聚合酶、FastDigestNotI、FastDigestAscI内切酶、T4连接酶购买自Thermo公司。PCR clean up和凝胶回收试剂盒均购买自Qiagen公司。
如本文所用,术语“gRNA”是指向导RNA,其引导Cas蛋白特异性结合靶标DNA序列的RNA。
如本文所用,术语“CRISPR”是指成簇的、规律间隔的短回文重复序列(clusteredregularly interspaced short palindromic repeats),该序列是许多原核生物的免疫系统。
如本文所用,术语“Cas12a”(旧称“Cpf1”)是指crRNA依赖的内切酶,它是CRISPR系统分类中V-A型(type V-A)的酶。
如本文所用,术语“PAM”是指靶向序列邻近的前间隔序列邻近基序(protospaceradjacent motif),是CRISPR/Cas系统特异识别靶DNA的重要组成部分。
如本文所用,术语“Cas核酸酶”是能够在gRNA的配合下特异性切割靶序列(DNA或RNA)的酶。
本发明的第一方面在于提供一种用于核酸检测的arCas12a蛋白或其功能变体,所述arCas12a蛋白氨基酸序列如SEQ ID NO.1所示;和/或,所述arCas12a蛋白核苷酸序列如SEQ ID NO.2或SEQ ID NO.3所示。
本发明为基于Cas12a的核酸分子检测提供了一种必要工具的新选择,提供一种在gRNA介导下对靶标序列特异性识别并具有非特异单链DNA切割活性的新型CRISPR/Cas12a系统的蛋白- arCas12a蛋白,可应用于特异性核酸检测。
另外,除了本发明所述的arCas12a蛋白本身,其功能变体或其同源物或直向同源物皆有可能保留部分或全部蛋白质活性,即arCas12a蛋白的功能变体或其同源物或直向同源物的应用,也应在本发明的范围之内。
所述功能变体可以包括arCas12a突变体(可以是插入,缺失或替换序列的突变体)、多晶型等。所述功能变体也包括arCas12a蛋白与另一种通常不相关的核酸、蛋白质或多肽的融合产物。功能变体可以是天然存在的,也可以是人造的。
因此,arCas12a蛋白或其功能变体的以下应用均应在本发明的范围之内:
在切割DNA方面的应用;在作为或制备DNA切割工具方面的应用;在核酸检测方面的应用;在基于CRISPR/Cas12a的核酸检测方面的应用;在作为或制备核酸检测工具方面的应用;在作为或制备基于CRISPR/Cas12a的核酸检测工具方面的应用。
另外基于上述应用,本发明的第二方面在于提供一种基于CRISPR/arCas12a的核酸检测系统,包括arCas12a蛋白或其功能变体、gRNA。
其中所述gRNA的设计原则为:在选取gRNA靶向序列时,靶向序列5’端应具有5’-TTTN-3’序列,且靶向序列本身、靶向序列和其余序列间不形成稳定二级结构。
其中,所述gRNA包括a)与Cas核酸酶相互作用的框架核酸片段,以及b)与靶核酸结合的特异性核酸片段,所述与 Cas核酸酶相互作用的框架核酸片段为SEQ ID NO.4~7中至少一种所示。
作为一种可选择的案例,当靶序列如SEQ ID NO.12所示时,gRNA的序列如SEQ IDNO.13所示。
本发明的第三方面在于提供基于CRISPR/Cas12a的核酸检测系统的核酸检测方法,其利用上述的arCas12a蛋白或其功能变体、对应靶标序列的gRNA进行CRISPR核酸检测。
具体地,所述基于CRISPR/arCas12a的核酸检测方法为:将待测核酸样品、arCas12a蛋白、gRNA、非特异单链荧光探针和反应所需缓冲液混合为反应体系,进行检测反应。具体是反应体系置于荧光分析仪(BioTek)进行荧光分析,以激发波长530nm/发射波长580nm读取反应孔荧光值。
优选地,检测体系包括:2μl RPA产物,45nM arCas12a,22.5nM gRNA,100nM非特异单链DNA荧光探针,及检测缓冲液。
所述缓冲液的各组分在检测体系的终浓度为:20 mM Tris, 60 mM NaCl, 10 mMMgCl2 , pH 7.3。
反应条件为:37℃下反应1-3小时。
另外,基于本发明的上述技术方案,应当涵盖如下内容:
(1)本发明通过arCas12a附属切割带荧光基团与淬灭基团的报告DNA链,释放荧光基团,然后将反应体系置于荧光分析仪中检测核酸产物。除本发明所述的核酸检测信号报告方法,本发明也可通过其他利用激活arCas12a产生附属切割效应后,实现信号检测的方案来检测样本中存在的一种或多种靶分子。
(2)在(1)中所述方案,包括基于arCas12a附属切割带生物素、荧光基团、地高辛或其他标记的核酸片段后,通过胶体金侧向层析方式实现核酸产物检测。本发明也可通过arCas12a附属切割聚集化的胶体金颗粒,使胶体金颗粒颜色变化,从而检测核酸产物信号。
(3)本发明在其他特异性核酸检测方案中,一种或多种gRNA可被设计为靶向一种或多种诊断疾病状态的靶分子。所述疾病,可以是人类疾病,动物疾病,植物疾病;
(4)根据(3),所述人类疾病,可以是人类感染性疾病,癌症,器官疾病,血液疾病,免疫系统疾病,脑和神经系统疾病,内分泌疾病,遗传性疾病。
(5)根据(4),所述人类感染性疾病,可以是由病毒、细菌或真菌引起。可以是呼吸道合胞病毒、甲型流感病毒、乙型流感病毒、季节性流感病毒、副流感病毒、腺病毒、人鼻病毒、人偏肺病毒、腮腺炎病毒、肺炎支原体、肺炎衣原体、结核分枝杆菌、中东呼吸综合征冠状病毒、百日咳杆菌、嗜肺军团杆菌、A链球菌;可以是人类免疫缺陷病毒、淋球菌、沙眼衣原体、解脲脲原体、人乳头瘤病毒、梅毒螺旋体、单纯疱疹病毒、人细小病毒;可以是甲型肝炎病毒、乙型肝炎病毒、丙型肝炎病毒、丁型肝炎病毒、戊型肝炎病毒;可以是人巨细胞病毒、人疱疹病毒、柯萨奇病毒、肠道病毒EV71/CA16、登革热病毒、沙门氏菌、志贺氏菌、幽门螺旋杆菌、诺如病毒、肠道腺病毒、轮状病毒、埃博拉病毒。登革热病毒。
(6)根据(4),癌症可以是肺癌、结直肠癌、胃癌、胃肠间质瘤、乳腺癌、卵巢癌、前列腺癌、甲状腺癌、胰腺癌、淋巴瘤等。
(7)根据(4),血液疾病及遗传性疾病可以是:地中海贫血、血友病、镰刀型细胞贫血、Rett综合征、囊性纤维化、亨廷顿病、脆性X综合征、13三体综合征、18三体综合征、21三体综合征、遗传性代谢疾病、遗传性耳聋、遗传多囊肾病、先天性糖基化病、G6PD缺乏症、苯丙酮尿症、酪氨酸血症、肝豆状核变性、白化病、糖原累积病、遗传性乳腺癌、遗传性卵巢癌、遗传性结直肠癌等;
(8)根据(4),所述器官疾病,免疫系统疾病,脑和神经系统疾病,内分泌疾病,可以是脑卒中、高血压、冠心病、肌萎缩侧索硬化症、帕金森病、阿尔兹海默病、过敏性疾病、类风湿、多发性硬化症、遗传性过敏性皮炎、糖尿病、黄斑变性、强直性脊柱炎等。
(9)根据(3),所述动物疾病可以是:猪流行性腹泻病毒、猪轮状病毒A群、猪传染性胃肠炎病毒、口蹄疫、蓝耳病、猪瘟、猪圆环病毒、非洲猪瘟、猪伪狂犬病毒、猪日本乙型脑炎、猪细小病毒、猪流感、蓝耳病、猪链球菌、猪丹毒杆菌、牛瘟、小反刍兽疫病毒、绵羊痘病毒、多杀性巴氏杆菌、禽流感、新城疫病毒、鸭瘟病毒、鸡马立克氏病病毒、鸡传染性法氏囊病毒、猫衣原体、猫冠状病毒、猫支原体、猫传染性腹膜炎、猫杯状病毒、猫疱疹病毒、猫泛白细胞减少症、犬支原体、犬腺病毒、犬副流感、犬甲型流感、犬细小病毒、犬瘟热病毒、犬冠状病毒、狂犬病毒、巴尔通体、弓形虫、钩端螺旋体、巴贝斯虫、布氏杆菌、 对虾传染性肌肉坏死病毒、对虾黄头病病毒、对虾淘拉综合症病毒、对虾传染性皮下及造血组织坏死病毒、炭疽杆菌等。
(10)本发明在其他特异性核酸检测方案中,一种或多种gRNA可被设计为靶向一种或多种微生物抗性基因。所述抗性基因,可以是四环素耐药、氨基糖苷类药物耐药、耐消毒剂、红霉素耐药、大环内酯外排、万古霉素耐药、多药耐药外排泵、莫匹罗星耐药、磺胺类耐药、泰洛星耐药、氟喹诺酮耐药、beta内酰胺酶类耐药、头孢菌素耐药、碳青霉烯酶耐药、金黄色葡萄球菌耐药、氯霉素酰基转移酶基因、博来霉素基因、嘌呤霉素基因、卡那霉素基因、氨苄霉素基因、超广谱 β- 内酰胺酶耐药基因等。
(11)本发明在其他特异性核酸检测方案中,一种或多种gRNA可被设计为靶向一种或多种个体基因型的靶分子。所述个体基因型,可以是人类单核苷酸多态性及基因型,动物基因型,植物基因型等。
(12)根据(11),所述人类单核苷酸多态性及基因型可以是疾病相关多态性位点,包括VKORC1、CYP2C9、CYP2C19等;可以是性状相关多态性位点,包括乳糖耐受基因、咖啡因代谢、酒精代谢、皮肤抗氧化、味觉敏感度、脱发等;可以是人类白细胞抗原(HLA);
(13)根据(11),所述动物基因型,植物基因型,可以是单核苷酸多态性、等位基因、育种鉴定、转基因鉴定等。
(14)本发明在其他特异性核酸检测方案中,一种或多种gRNA可被设计为靶向一种或多种检测环境样本状态的靶分子。所述环境样品来自食品样品,饮料样品,纸张表面,织物表面,金属表面,木材表面,塑料表面,土壤样品,水样品,大气或其他气体样品,或其组合。所述检测环境样本状态,可以是病毒,细菌,真菌等各类微生物核酸存在状态,也可以是动物,植物基因组来源核酸存在状态,也可以是转基因核酸存在状态;
(15)本发明在其他特异性核酸检测方案中,一种或多种样本类型可用于核酸检测。所述样本类型,可以是组织样本,唾液,血液,血浆,血清,粪便,尿液,痰液,粘液,淋巴液,滑液,脑脊液,腹水,胸腔积液,血清肿,脓或皮肤或粘膜表面的拭子,盥洗液等;
(16)本发明在其他特异性核酸检测方案中,该核酸检测反应可以承载于不同基质上;所述基质,可以是试管,液滴,固体腔路,微孔,特定基质(如纸基质)等。
实施例1 arCas12a基因的发现
本研发团队在研究古细菌archaeon(GenBank: PEZQ01000008.1)过程中,发现了其基因组中有CRISPR/Cas系统特征,即Cas1、Cas2/3及基因簇附近的重复序列区段在数种微生物中鉴定出其中的CRISPR/Cas系统,该系统包含一个Cas12a家族蛋白、Cas1、Cas2/3、和重复序列区段。其中标签为“COT72_04385”基因编码的蛋白(protein_id:PIN99799.1)具有Cas12蛋白结构中的关键结构结构域,RuvC内切核酸酶结构域和Nuc结构域。通过重组表达纯化后,发现此蛋白在gRNA介导下对外源靶标序列进行特异性识别且具有非特异切割单链DNA的活性。该蛋白长度为1434 个氨基酸,由于来源于archaeon CG10_big_fil_rev_8_21_14_0_10_43_11菌株,蛋白我们称之为arCas12a。源自古细菌archaeonCG10基因组中编码arCas12a蛋白的核酸序列如SEQ ID NO.2所示,本发明中为了提高arCas12a重组蛋白的表达量,我们对其蛋白表达质粒进行了密码子优化,核酸序列如SEQ ID NO.3所示。arCas12a蛋白的氨基酸序列如SEQ ID NO.1所示,三维结构图所图1所示。
然后我们利用CRISPRfinder(http://crispr.i2bc.paris-saclay.fr/),在上述重复序列区段中找出可能存在的gRNA骨架,体外转录出arCas12a的gRNA。在体外把双链DNA底物、arCas12a蛋白、gRNA、标记荧光基团的非特异单链DNA加入反应体系,发现标记荧光基团的非特异单链DNA可在gRNA介导下被arCas12a蛋白特异切割。
同时,本发明人团队研究发现,在选取gRNA靶向序列时,靶向序列5’端应具有5’-TTTN-3’序列,且靶向序列本身、靶向序列和其余序列间不形成稳定二级结构,在此gRNA设计原则下,arCas12a具有特异性识别体外DNA序列且具有非特异切割单链DNA的活性。
以下实施例给出了arCas12a蛋白的制备以及活性验证实验案例。
实施例2 arCas12a基因的克隆和蛋白表达
1、PCR扩增arCas12a序列
(1)设计引物
根据arCas12a序列设计了上下游引物,序列如下:
上游引物(SEQ ID NO.14所示):ATGAACAAAAAAGGCAAATGGGA;
下游引物(SEQ ID NO.15所示):TCATTTAATAGGCATCTCGCTAGAT。
(2)PCR扩增
利用上述的上下游引物,使用高保真DNA聚合酶(phusion DNA聚合酶)在不同退火温度下对目的片段进行PCR扩增。结果如附图2所示,PCR目的条带(约4000bp)。
2、构建重组质粒pET-28a-arCas12a
(1)PCR扩增产物纯化:PCR扩增后产物通过Qiagen公司的纯化试剂盒(Clean up试剂盒)进行纯化处理;
(2)使用Thermo公司的快速限制性内切酶NotI(FastDigestNotI)和NotI(FastDigestAscI)进行双酶切;
(3)酶切产物通过Qiagen公司的微量样品凝胶回收试剂盒(MiniElute)纯化回收;
(4)纯化回收的产物连接至同样经过NotI和AscI双酶切处理的pET28a-ccdB-CmR载体上,得到重组质粒pET-28a-arCas12a;
其中,所用pET-28-ccdB-CmR载体为本实验室保存,是以原核表达载体pET28a(购于生物通代理)基础,经改造在HindIII和XhoI酶切位点之间添加NotI-ccdB-CmR-AscI序列,制成pET-28-ccdB-CmR载体。
3、重组质粒pET-28a-arCas12a的鉴定
为鉴定pET-28a-arCas12a重组载体正确与否,我们对重组质粒pET-28a-arCas12a进行酶切鉴定和测序鉴定。
分别使用Asc I或NotI单酶切和Asc I或NotI双酶切进行酶切鉴定,实验结果表明,所有实验组的酶切产物大小均为与预期相符,因而可初步判断我们得到的载体为正确的pET-28a-arCas12a载体。
另外,测序结果也表明arCas12a序列被正确地克隆至pET28a。
4、arCas12a蛋白的原核表达
(1)将鉴定正确的重组质粒pET-28a-arCas12a转化至BL21(DE3)表达菌株(购于Transgen公司)中。经过阳性鉴定获得重组菌。
(2)挑取重组菌的单克隆至50mL LB培养基中37℃培养过夜。按照1:100的接种量,将过夜菌种接种至1L LB培养基中37℃培养至OD600=0.6,冰水浴30min,加IPTG至终浓度为0.5mM,于15℃继续培养4h。离心收集菌体于-80℃保存。
5、检测并优化arCas12a蛋白表达
将重组质粒pET-28a-arCas12a转入BL21(DE3)中,在37℃以0.2mM诱导蛋白表达,并对裂解后沉淀和上清进行电泳分析。(如附图3所示)
实施例3 arCas12a蛋白的纯化
1、arCas12a蛋白的纯化方法
诱导表达后的菌液进行离心,将菌体重悬于裂解缓冲液中,进行超声破碎(70%振幅,2sOn/4s Off,3分钟,Sonics 750w超声仪),离心分离上清液。将蛋白裂解上清上样至平衡后的Ni Sepharose FF,以大于30倍柱床体积的裂解缓冲液洗去杂蛋白,以洗脱缓冲液进行洗脱,以Superdex 200,Tricorn 10/300凝胶色谱柱进行纯化。洗脱后进行SDS-PAGE分析观察结果以及凝胶柱纯化,获得的纯化后的Cas12a蛋白。其中,裂解缓冲液包含50mM Tris-HCl、pH8.0 300mM NaCl、5%甘油、20mM咪唑。洗脱缓冲液包含50mM Tris-HCl、pH8.0 300mMNaCl、5%甘油、250mM咪唑。
得到的蛋白以50mM Tris-HCl pH8.0 300mM NaCl5%甘油稀释三倍,并以30kDa超滤管浓缩。添加甘油至终浓度50%后,分装液氮速冻保存于-80℃。
2、arCas12a蛋白纯化结果
在优化纯化步骤后再次进行大量纯化,目的条带约为130kDa。如附图4所示,可见纯化的纯度及产量较高。
本发明arCas12a纯化方案简化了TEV切割标签步骤,大幅精简了纯化流程和纯化成本,同时,该方法能确保蛋白活性。
实施例4 基于CRISPR/Cas12a体系对arCas12a的活性检测
1、制备靶标核酸片段
靶标核酸片段可经过PCR扩增、重组酶聚合酶扩增(RPA), NASBA 等温扩增或环介导等温扩增(LAMP)、链置换扩增(SDA)、解旋酶依赖性扩增(HDA)和切口酶扩增反应(NEAR)方式扩增靶标核酸片段。
重组酶聚合酶扩增RPA(Recombinase Polymerase Amplification):采用NCBIPrimer blast 设计RPA引物,扩增片段大小为80-120nt,引物的变性温度可为54-67℃,长度为30-35nt,引物中GC含量为40-60%,根据设计序列合成DNA引物,引物合成委托上海捷瑞生物工程有限公司合成。
模板序列(SEQ ID NO.12):
TTATCTTAAAAAATTACAGGATATTTATAAGAAGCTTGAGGGTCACCCCTTTCTTTTTAGTCCGTCGAAAACCAATGAAAAAGAGTTTATTACTCTGCTAAACCAAGCCTTGGCCTCGACGCAGCTTTACCGCAGCATACAACAGCTGTTTTTAACGATGTATAAGCTAGATCCCATTGGGTTTGTTAACTATATTAAAGCGAGTAAACAAGAGTATTTATGTCTGTTGATTAATCCTAAACTAGTCACTAAGTTTTTAAAAATAACGAGCTTTAAAATTTACATTAATTTCAGGCTAAAAACTTTCTATATAAGTCCTAATAA
引物序列:
上游引物(SEQ ID NO.16):TACTCTGCTAAACCAAGCCTTGGCCTCGAC
下游引物(SEQ ID NO.17):CTCTTGTTTACTCGCTTTAATATAGTTAAC
分别参考TwistAmp® Basic和 TwistAmp® BasicRT (TwistDx)试剂盒进行RPA反应,不同的是,在模板片段加入之前,先加入280mM的MgAc,即乙酸镁。在37℃下反应30分钟。
反应完成后的产物使用胶分离以及纯化(采用MinElute gel extraction kit(Qiagen) 试剂盒),纯化后的dsDNA即为靶标产物。
2、gRNA设计
gRNA引物序列设计原则:选取靶向序列时,靶向序列5’端应具有5’-TTTN-3’序列;且靶向序列本身、靶向序列和其余序列间不形成稳定二级结构。可通过http://www.rgenome.net/cas-designer/ 在线软件辅助设计。
arCas12a蛋白识别,不需要tracrRNA,只需要crRNA,其gRNA设计只需要crRNA框架和靶标序列即可。
gRNA结构为:“TAATTTCTACTAAGTGTAGAT”—“靶标序列”。
其中“TAATTTCTACTAAGTGTAGAT”序列可替换为“TAATTTCTACTATTGTAGAT”,可替换为“GTCTAATATCAATATTCAATTTCTACTTTCGTAGAT”可替换为“TCAATTTCTACTTTCGTAGAT”。
设计含T7启动子的引物进行扩增双链DNA。参照T7 RNA Polymerase(Thermo)试剂盒说明书,将带T7启动子的DNA片段、T7聚合酶混合,37℃孵育过夜;再使用RNeasy minikit(Qiagen),获得纯化的gRNAs。(如附图5所示)
3、arCas12a活性检测
检测体系包括:2μl RPA产物,45nM 纯化的arCas12a,22.5nM gRNA,100nM在arCas12a切割时可发出荧光的报告DNA链,即非特异单链DNA荧光探针(DNAseAlert QC SystemThermo Scientific),0.5μl RNase抑制剂(Promega),及检测缓冲液(20 mM Tris, 60 mMNaCl, 10 mM MgCl2 , pH 7.3)。同时设置CRISPR/LbCas12a体系作为对照组,检测体系包括2μl RPA产物,45nM 纯化的LbCas12a,22.5nM gRNA,100nM在arCas12a切割时可发出荧光的报告DNA链,即非特异单链DNA荧光探针(DNAseAlert QC System Thermo Scientific),0.5μl RNase抑制剂(Promega),及检测缓冲液(20 mM Tris, 60 mM NaCl, 10 mM MgCl2 ,pH 7.3)。
反应体系置于荧光分析仪(BioTek),在37℃(除非另有说明)下反应1-3小时,荧光动力检测5分钟一次。
检测结果如图6所示,arCas12a也具有与LbCas12a相同的在gRNA介导下对靶标序列进行特异性识别并具有非特异单链DNA的切割活性。
实施例5 CRISPR/arCas12a体系配对的gRNA框架序列筛选
在CRISPR/arCas12a体系中,与arCas12a蛋白配套的gRNA序列至关重要,是arCas12a蛋白能够特异性识别靶标序列的基础。gRNA包括与arCas12a蛋白相互作用的框架核酸片段,以及与靶核酸结合的特异性核酸片段。如何寻找和确认与arCas12a蛋白适配的gRNA的框架核酸序列,是搭建CRISPR/arCas12a技术平台的关键。
本专利中,研发团队利用生物信息学的方法筛选arCas12a蛋白gRNA的框架核酸序列,首先在archaeon CG10_big_fil_rev_8_21_14_0_10_43_11菌株的基因组序列中,分别在“COT72_04385”基因的上游和下游2000-5000bp范围内寻找短片段重复序列,根据重复序列的长度及重复次数进行片段筛选;同时本研发团队还比对了与archaeon CG10_big_fil_rev_8_21_14_0_10_43_11近源的基因组序列,从中找到与编码arCas12a蛋白“COT72_04385”基因高度同源的基因序列,以同样的方式寻找可能短片段重复序列,筛选出的这些短片段重复序列可以作为arCas12a蛋白gRNA的框架核酸序列。共筛选出8条候选的框架核酸序列,其序列信息如下:
框架序列1(SEQ ID NO. 4):TAATTTCTACTAAGTGTAGAT
框架序列2(SEQ ID NO. 5):TAATTTCTACTATTGTAGAT
框架序列3(SEQ ID NO. 6):GTCTAATATCAATATTCAATTTCTACTTTCGTAGAT
框架序列4(SEQ ID NO. 7):TCAATTTCTACTTTCGTAGAT
框架序列5(SEQ ID NO. 8):ATCTACAAAAGTAGAAATGTGCTATCTGTATTTGAG
框架序列6(SEQ ID NO. 9):GTCTAATATCAATATTCAATTTCTACTTTCGTAGAT
框架序列7(SEQ ID NO. 10):TCAATTTCTACTTTCGTAGAT
框架序列8(SEQ ID NO. 11):GAAACTGTAAGCGGAATGTCTACT
arCas12a蛋白的gRNA结构包括gRNA框架序列和靶标序列,将上述寻找到的
“COT72_04385”基因及其同源基因周围的短片段重复序列与我们要检测的靶标序列结合,就得到了gRNA序列。
研发团队通过大量生物信息学比对和序列筛选,并设计了含T7启动子的引物进行扩增含有gRNA序列的双链DNA。参照T7 RNA Polymerase(Thermo)试剂盒说明书,将带T7启动子的DNA片段、T7聚合酶混合,37℃孵育过夜;再使用RNeasy mini kit(Qiagen),获得纯化的gRNAs。然后按照实施例4中的arCas12a活性检测方法验证候选的gRNA框架序列与arCas12a蛋白是否匹配。
通过系列的实验,筛选结果如图7所示,筛选出四条与arCas12a蛋白适配的gRNA框架序列,分别为框架序列1(SEQ ID NO. 4)、框架序列2(SEQ ID NO. 5)、框架序列3(SEQ IDNO. 6)、框架序列4(SEQ ID NO. 7)。
实施例6 基于CRISPR/arCas12a的核酸检测灵敏度
1、制备靶标核酸片段
模板稀释:将含靶标片段的质粒,10倍梯度稀释,得到浓度为10^6c~10^0c的质粒。
按照实施例4的方法,分别参考TwistAmp® Basic和 TwistAmp® BasicRT(TwistDx)试剂盒进行RPA反应,不同的是,在模板片段加入之前,先加入280mM的MgAc,即乙酸镁。在37℃下反应30分钟。
2、CRISPR/arCas12a检测
检测体系包括:2μl RPA产物,45nM 纯化的arCas12a,22.5nM gRNA,100nM在arCas12a切割时可发出荧光的报告DNA链,即非特异单链DNA荧光探针(DNAseAlert QC SystemThermo Scientific),0.5μl RNase抑制剂(Promega),及检测缓冲液(20 mM Tris, 60 mMNaCl, 10 mM MgCl2 , pH 7.3)。同时设置CRISPR/LbCas12a体系作为对照组,检测体系包括2μl RPA产物,45nM 纯化的LbCas12a,22.5nM gRNA,100nM在arCas12a切割时可发出荧光的报告DNA链,即非特异单链DNA荧光探针(DNAseAlert QC System Thermo Scientific),0.5μl RNase抑制剂(Promega),及检测缓冲液(20 mM Tris, 60 mM NaCl, 10 mM MgCl2 ,pH 7.3)。
反应体系置于荧光分析仪(BioTek),在37℃(除非另有说明)下反应1-3小时,荧光动力检测5分钟一次。
结果分析:为了计算去除背景的荧光数据,方便不同条件之间的比较,样品的初始荧光被去除。背景荧光(无靶核苷酸或无gRNA的条件下)会从样品中去除,从而获得扣除背景荧光的数据。检测结果如图8所示,CRISPR/arCas12a也具有与CRISPR/LbCas12a相同的检测灵敏度。
实施例7 基于CRISPR/arCas12a的核酸检测特异性
1、制备靶标核酸片段
按照实施例4的方法,分别参考TwistAmp® Basic和 TwistAmp® BasicRT (TwistDx)试剂盒,将靶标核酸、4个非靶标核酸加入RPA体系,在37℃下反应30分钟。
2、CRISPR/arCas12a检测
检测体系包括:2μl RPA产物,45nM 纯化的arCas12a,22.5nM gRNA,100nM在arCas12a切割时可发出荧光的报告DNA链,即非特异单链DNA荧光探针(DNAseAlert QC SystemThermo Scientific),0.5μl RNase抑制剂(Promega),及检测缓冲液(20 mM Tris, 60 mMNaCl, 10 mM MgCl2 , pH 7.3)。
反应体系置于荧光分析仪(BioTek),在37℃(除非另有说明)下反应1-3小时,荧光动力检测5分钟一次。
结果分析:为了计算去除背景的荧光数据,方便不同条件之间的比较,样品的初始荧光被去除。背景荧光(无靶核苷酸或无gRNA的条件下)会从样品中去除,从而获得扣除背景荧光的数据。
检测结果如图9所示,CRISPR/arCas12a体系具有良好的反应特异性。
实施例8 基于CRISPR/arCas12a的核酸检测侧向层析方法
CRISPR/arCas12a检测体系中Cas12a蛋白具有在gRNA介导下对靶标序列特异性识别并具有非特异单链DNA切割活性,运用这一特性,把CRISPR/arCas12a检测体系中的单链DNA探针进行特殊标记,然后通过与之配套的胶体金试纸条,在不同的划线区域固定不同的抗体进行信号捕获,从而实现通过胶体金侧向层析的方法对CRISPR/arCas12a检测体系进行结果判读。
检测过程:
1、制备靶标核酸片段
本实施例按照实施例6和实施例7,基于CRISPR/arCas12a体系对10倍梯度稀释的靶标(SEQ ID NO.12)进行灵敏度测试,同时进行了特异性测试。
靶标DNA可经过重组酶聚合酶扩增技术(RPA)、PCR扩增、NASBA 等温扩增、环介导等温扩增(LAMP)、链置换扩增(SDA)、解旋酶依赖性扩增(HDA)和切口酶扩增反应(NEAR)等方式进行扩增。
按照实施例4的方法,分别参考TwistAmp® Basic和 TwistAmp® BasicRT(TwistDx)试剂盒进行RPA反应,不同的是,在模板片段加入之前,先加入280mM的MgAc,即乙酸镁。在37℃下反应30分钟。
2、CRISPR/arCas12a检测
检测体系包括:2μl RPA产物,45nM 纯化的arCas12a,22.5nM gRNA,100nM非特异单链DNA探针(探针5端标记生物素,3端标记地高辛),0.5μl RNase抑制剂(Promega),及检测缓冲液(20 mM Tris, 60 mM NaCl, 10 mM MgCl2 , pH 7.3)。反应体系置37℃反应30min。
3、侧向层析反应
取出胶体金检测试剂条,将红色印记端朝下,插入上一步骤CRISPR反应管中,层析反应2分钟,然后根据试纸条的条带变化进行结果判读。CRISPR-侧向层析法检测的结果判读标准如图10所示:
在本实施例中,单链DNA探针5 '端标记地高辛,3 '端标记生物素,且胶体金颗粒上标记有地高辛单抗。质控线C上固定包被抗地高辛抗体的二抗,检测线T上固定包被有链霉亲和素。当反应体系中有靶序列时,单链DNA探针被切割完全,生物素被切割成游离状态,检测线T不会显色,此时质控线C显色而检测线T不显色,指示待检靶标为阳性;当反应体系中没有靶序列时,单链DNA探针不被切割,此时质控线C和检测线T两条线均显色,指示待检靶标为阴性;若质控线C不显色,则提示检测失败或试纸已失效,结果无效;若质控线C显色而检测线T信号微弱,建议重复检测;延长CRISPR剪切时间再观察结果,若质控线C显色,而检测线T信号依然微弱,则判定为待检靶标为阴性;若质控线C显色,而检测线T信号无显色,则判定为阳性。
灵敏度检测结果如图11所示,所述arCas12a蛋白和所设计的gRNA可识别靶标位点并切割体系中的胶体金探针,通过胶体金检测试纸条侧向层析可得到与荧光检测法相符的检测效果,及对靶标(SEQ ID NO.12)的灵敏度达到10^1c。
特异性检测结果如图12所示,所述arCas12a蛋白和所设计的gRNA可特异性识别靶标位点并切割体系中的胶体金探针,而不识别非特异靶标,具有良好的特异性。
实施例9 CRISPR/arCas12a不同温度下的反应活性测试
arCas12a是新型CRISPR/Cas12a系统的蛋白,为探索其在不同温度下的反应活性,设计了本实施例方法进行测试,进一步加深对该蛋白反应条件的认知。
检测过程:
1、制备靶标核酸片段
按照实施例4的方法,分别参考TwistAmp® Basic和 TwistAmp® BasicRT (TwistDx)试剂盒进行RPA反应,不同的是,在模板片段加入之前,先加入280mM的MgAc,即乙酸镁。在37℃下反应30分钟。
2、CRISPR/arCas12a检测
检测体系包括:2μl RPA产物,45nM 纯化的arCas12a,22.5nM gRNA,100nM非特异单链DNA探针(探针5端标记生物素,3端标记地高辛),0.5μl RNase抑制剂(Promega),及检测缓冲液(20 mM Tris, 60 mM NaCl, 10 mM MgCl2 , pH 7.3)。
反应温度分别设置为25℃,30℃,35℃,40℃,45℃,50℃,55℃,反应30min。
3、侧向层析反应
取出胶体金检测试剂条,将红色印记端朝下,插入上一步骤CRISPR反应管中,层析反应2分钟,然后根据试纸条的条带变化进行结果判读。CRISPR-侧向层析法检测的结果判读标准如图10所示。检测结果如图13所示,CRISPR/arCas12a在温度25~50℃均观察到良好的反应活性,在温度55℃的反应活性下降。提示arCas12a蛋白适用温度范围广。
本领域技术人员可以理解的是,可以采用本领域常规的替代方法替换本发明实施例中常规的arCas12a基因的克隆、重组表达载体的构建、arCas12a蛋白的表达及纯化、靶核苷酸/目标基因片段的扩增等步骤中的一种或多种,以期获得类似或等同的效果。
序列表
<110> 广州普世君安生物科技有限公司
<120> 新型耐高温arCas12a蛋白在核酸检测方面的应用
<160> 17
<170> SIPOSequenceListing 1.0
<210> 1
<211> 1434
<212> PRT
<213> arCas12a蛋白氨基酸序列(SEQ ID NO.1)
<400> 1
Met Asn Lys Lys Gly Lys Trp Asp Lys Phe Thr Asn Leu Tyr Ser Leu
1 5 10 15
Ser Lys Thr Leu Arg Phe Glu Leu Arg Pro Ile Gly Lys Asp Gly Val
20 25 30
Val Leu Ser Pro Glu Asp Ala Thr Glu Leu Leu Thr Lys Ile Ile Glu
35 40 45
Lys Asp Arg Leu Ile Lys Ala Ala Tyr Asp Ala Leu Lys Pro Val Leu
50 55 60
Asp Lys Ile His Glu Asp Ile Ile Asn Lys Ser Leu Thr Ser Asp Glu
65 70 75 80
Ala Lys Gln Ile Asp Phe Ser Glu Tyr Phe Glu Glu Tyr Lys Lys Gly
85 90 95
Lys Glu Lys Lys Leu Asp Gly Phe Glu Lys Lys Leu Arg Glu Gln Ile
100 105 110
Gly Lys Thr Phe Glu Lys Thr Val Lys Thr Tyr Lys Ile Thr Lys Ile
115 120 125
Thr Lys Lys Lys Glu Glu Glu Glu Lys Pro Leu Phe Glu Ile Lys Asn
130 135 140
Gly Val Pro Thr Ala Lys Ala Glu Ile Ile Gly Tyr Leu Ser Glu Gln
145 150 155 160
Tyr Lys Asp Asn Val Glu Leu Trp Ala His Ile Glu Glu Phe Glu Gly
165 170 175
Phe Phe Gly Tyr Phe Ser Gly Tyr Asn Thr Asn Arg Ser Asn Tyr Tyr
180 185 190
Glu Tyr Lys Lys Glu Ala Ser Thr Ala Val Ala Thr Arg Ile Val His
195 200 205
Glu Asn Leu Pro Lys Phe Cys Asp Asn Val Ile Gln Phe Leu Ile Gly
210 215 220
Lys Val Gln Lys Lys Lys Lys Asn Asp Thr Arg Thr Glu Thr Ile Val
225 230 235 240
Ser Arg Lys Glu Glu Tyr Leu Lys Ala Tyr Gln Tyr Leu Lys Asp Asn
245 250 255
Arg Gly Thr Ile Gln Ile Lys Asp Ala Lys Thr Asn Gln Leu Ile Glu
260 265 270
Ala Gln Pro Val Ser Glu Asp Trp Phe Asn Leu Asp Lys Phe Pro Lys
275 280 285
Tyr Leu Ser Gln Glu Gly Ile Asp Glu Tyr Asn Arg Val Met Gly His
290 295 300
Tyr Asn Leu Leu Ile Asn Leu Tyr Asn Gln Glu Arg Lys Asp Glu Lys
305 310 315 320
Asp Phe Lys Lys Leu Ser Gln Phe Lys Thr Leu Phe Lys Gln Ile Gly
325 330 335
Cys Gly Lys Gln Ser Leu Phe Glu Gln Ile Lys Asp Asp Thr Glu Leu
340 345 350
Lys Glu Lys Leu Ser Lys Ile Ser Lys Ala Gly Glu Lys Tyr Phe Ala
355 360 365
Glu Gln Ile Asp Asp Thr Leu Ile Thr Ile Tyr Thr Phe Ile Glu Trp
370 375 380
Leu Arg Glu Asn Asn Asp Trp Glu Gly Thr Tyr Trp Ser Lys Ala Ala
385 390 395 400
Val Asp Lys Ile Ser Asn Lys Tyr Leu Ala Asn Trp His Asp Ile Lys
405 410 415
Asp Arg Ile Gln Thr Asp Leu Gln Gly Lys Asp Lys Gly Leu Lys Glu
420 425 430
Thr Leu Lys Ser Val Ala Thr Tyr Asn Lys Glu Arg Glu Glu Gln Leu
435 440 445
Lys Ile Asn Asp Ala Val Glu Leu Ser Gly Leu Phe Glu Ile Leu Asn
450 455 460
His Asp Ala Val Gln Gly Trp Ser Lys Asp Phe Phe Arg Glu His Ile
465 470 475 480
Leu Glu Glu Tyr Lys Asp Leu Ile Asp Glu Lys Leu Thr Pro Ser Gln
485 490 495
Asn Leu Ile Lys Leu Ile Cys Ala Asp Met Gln Lys Leu Ala Lys Glu
500 505 510
Phe Cys Glu Lys Ser Glu Asp Ala Leu Lys Ile Ile Asp Tyr Lys Asn
515 520 525
Glu Asn Asn Ile Leu Gln Ile Lys Glu Trp Leu Asp Arg Ser Lys Trp
530 535 540
Leu Leu Trp Ile Val Lys Tyr Phe Glu Val Lys Glu Ser Lys Val Lys
545 550 555 560
Gly Asn Ser Ile Asn Pro Glu Leu Thr Asn Ile Leu Ser Ala Leu Leu
565 570 575
Arg Ala Asp Asp Ser Asn Trp Phe Asp Trp Tyr Asp Leu Val Arg Asn
580 585 590
Tyr Leu Ser Lys Lys Pro Gln Glu Asp Ala Lys Lys Asn Lys Leu Lys
595 600 605
Leu Asn Phe Glu Ser Ser Ser Phe Leu Gly Gly Trp Pro Pro Asp Tyr
610 615 620
Ala Lys Lys Ala Gly Leu Leu Tyr Lys Lys Asp Gly Leu Tyr Tyr Leu
625 630 635 640
Ala Ile Asn Tyr Asn Leu Ser Lys Glu Asp Ile Lys Thr Leu Lys Gln
645 650 655
Pro Asn Gly Glu Thr Ala Thr Arg Ile Ile Leu Asp Phe Gln Lys Pro
660 665 670
Asp Asn Lys Asn Thr Pro Arg Leu Phe Ile Arg Ser Lys Gly Asp Ser
675 680 685
Phe Ala Pro Ala Val Glu Lys Tyr Asn Leu Pro Ile Asn Asp Ile Leu
690 695 700
Asp Ile Tyr Asp Thr Gly Lys Phe Arg Thr Glu His Arg Lys Lys Asn
705 710 715 720
Glu Glu Glu Tyr Lys Ile Ser Leu Gly Lys Leu Ile Asp Tyr Phe Lys
725 730 735
Lys Gly Phe Leu Lys His Asp Ser Tyr Lys His Phe Asn Phe Asp Trp
740 745 750
Lys Lys Thr Ser Glu Tyr Lys Asp Ile Ala Glu Phe Tyr His Asp Thr
755 760 765
Glu Val Ser Cys Tyr Gln Ile Lys Glu Glu Asn Thr Ser Trp Lys Lys
770 775 780
Leu Leu Glu Phe Ile Asp Glu Gly Lys Val Phe Leu Phe Gln Ile Tyr
785 790 795 800
Asn Lys Asp Phe Ser Gln Arg Lys Thr Val Arg Gly Lys Asp Asn Ile
805 810 815
His Thr Tyr Tyr Trp Lys Met Leu Phe Ser Glu Glu Asn Lys Arg Asn
820 825 830
Val Ile Tyr Lys Leu Asn Gly Glu Ser Glu Ile Phe Phe Arg Asn Leu
835 840 845
Ala Lys Gly Ile Lys Lys Ser Pro Ala His Thr Thr Lys Asp Tyr Val
850 855 860
Leu Asn Arg Arg Glu Lys Glu Thr Asn Lys Thr Ile Pro Tyr Lys Ile
865 870 875 880
His Asp Glu Leu Arg Leu Phe Ala Asn Lys Asn Lys Ser Ile Glu Ala
885 890 895
Leu Ser Asp Glu Ala Lys Ala Tyr Leu Asp Lys Asn Asn Glu Ile Asp
900 905 910
Glu Asn Arg Val Thr Ile Lys Lys Leu Lys His Asp Ile Val Lys Asp
915 920 925
Lys Arg Phe Thr Thr Asn Lys Phe Phe Leu His Cys Pro Ile Thr Leu
930 935 940
Asn Phe Lys Ala Tyr Gly Asn Arg Asn Val Thr Glu Thr Val Asn Asp
945 950 955 960
Asn Phe Thr Gln Thr Lys Asp Ile Gln Lys Asp Ile Gln Phe Leu Gly
965 970 975
Ile Asp Arg Gly Glu Lys His Leu Ile Tyr Tyr Ser Leu Val Asn Ala
980 985 990
Asn Gly Glu Ile Ile Glu Gln Asp His Phe Asp Val Ile Asn Asn Lys
995 1000 1005
Asp Tyr Leu Gln Glu Ile Asn Asn Ala Ala Asp Arg Arg Lys Lys Lys
1010 1015 1020
Gln Glu Asn Trp Gln Gln Lys Gly Asn Ile Ser Asn Leu Lys Asp Gly
1025 1030 1035 1040
Tyr Ile Ser Leu Val Ile His Glu Ile Ile Lys Lys Met Arg Asp Lys
1045 1050 1055
Asp Gly Asn Tyr Lys Ser Thr Phe Ile Val Leu Glu Asp Leu Asn Pro
1060 1065 1070
Gly Phe Lys Arg Arg Arg Gln Lys Phe Glu Gln Gln Val Tyr Gln Lys
1075 1080 1085
Phe Glu Leu Ala Leu Ala Lys Lys Leu Asn Tyr Leu Val Asp Lys Asn
1090 1095 1100
Val Thr Asp Met Gly Lys Ile Gly Ser Val Ser Lys Ala Leu Gln Leu
1105 1110 1115 1120
Val Pro Pro Val Thr Asn Tyr Arg Asp Ile Glu Asn Arg Lys Gln Val
1125 1130 1135
Gly Ile Met Leu Tyr Thr Arg Ala Asn Tyr Thr Ser Val Thr Asp Pro
1140 1145 1150
Val Thr Gly Trp Arg Lys Thr Ile Tyr Leu Lys Lys Gly Ser Glu Ala
1155 1160 1165
Asp Ile Lys Lys Gln Ile Leu Ser Ala Phe Thr Glu Ile Gly Val Tyr
1170 1175 1180
Ser Gly Asp Tyr Phe Phe Gln Tyr Thr Asp Val Asn Gly Lys Glu Trp
1185 1190 1195 1200
Lys Leu Trp Ser Gly Lys Gly Gly Lys Ala Leu Glu Arg Tyr Arg Ala
1205 1210 1215
Lys Arg Gly Lys Asp Lys Asn Glu Tyr Ile Ile Glu Pro Phe Asp Val
1220 1225 1230
Lys Glu Leu Leu Asp Lys Leu Phe Glu Asn Phe Asp Asn Ser Lys Ser
1235 1240 1245
Leu Arg Gln Gln Phe Glu Glu Arg Val Glu Leu Lys Lys Val Asn Glu
1250 1255 1260
His Thr Ala Trp Glu Thr Leu Arg Phe Val Ile Asp Leu Ile Gln Gln
1265 1270 1275 1280
Ile Arg Asn Ser Gly Asp Ile Thr Lys Lys Gln Glu Asp Lys Phe Tyr
1285 1290 1295
Gly Ala Asp Thr Asn Lys Asn Gln Asp Asp Asn Phe Leu Leu Ser Pro
1300 1305 1310
Ile Arg Asp Glu Gln Gly Glu His Phe Asp Ser Arg Lys Tyr Gln Ser
1315 1320 1325
Gln Glu Ile Pro His Leu Pro Ile Asp Ala Asp Ala Asn Gly Ala Tyr
1330 1335 1340
Asn Ile Ala Arg Lys Gly Ile Val Met Tyr Glu His Ile Lys Gln Trp
1345 1350 1355 1360
Ile Asn Asp Gly Lys Gln Lys Thr Lys Ser Lys Arg Asp Asp Ser Lys
1365 1370 1375
Glu Thr Thr Asp Leu Asp Leu Phe Ile Ser Asp Lys Glu Trp Asp Leu
1380 1385 1390
Trp Leu Tyr Asp Arg Glu Lys Trp Asn Glu Gln Leu Pro Ile Phe Ala
1395 1400 1405
Ser Arg Lys Leu Asn Gln Lys Glu Asn Thr Lys Gln Ser Asp Val Asn
1410 1415 1420
Ile Asn Glu Ser Ser Glu Met Pro Ile Lys
1425 1430
<210> 2
<211> 4305
<212> DNA
<213> arCas12a蛋白基因序列(SEQ ID NO.2)
<400> 2
atgaacaaaa aaggcaaatg ggataaattt acaaatttgt acagcttatc gaaaactttg 60
aggtttgagt taagaccaat tgggaaggat ggagtggtat tatcacctga agatgcaact 120
gaattactca cgaaaattat tgagaaggat agactgatta aagcagctta tgatgctcta 180
aaacctgtct tagacaagat acatgaagat ataattaata agagtttgac ttcagacgaa 240
gcgaaacaga ttgatttttc agaatatttt gaagaataca aaaaaggcaa agaaaaaaag 300
ttggatggtt ttgaaaagaa attacgggaa caaataggca agacttttga aaaaactgtt 360
aagacataca agataaccaa aataactaag aagaaagaag aagaagagaa gcctttattt 420
gaaataaaaa acggggttcc aacagcaaaa gcagaaatca tagggtattt gtcagagcaa 480
tataaagata atgtagaatt atgggcacat atcgaagaat ttgagggttt ctttggatat 540
ttttcagggt ataataccaa taggtctaat tattatgaat ataaaaaaga agcaagcacg 600
gcagtagcta cgagaatagt tcatgaaaac ttgcctaaat tttgtgataa cgtcattcag 660
tttttaatcg gaaaggttca gaaaaagaaa aaaaatgata cccgcacaga aacaattgtt 720
tcccgaaaag aagaatatct aaaagcttat caatacttga aagataatcg cgggactata 780
caaataaaag atgcaaaaac caatcaactg attgaggcac aacccgtaag cgaagactgg 840
tttaatcttg ataagtttcc aaaatatctt tctcaagaag gaattgacga atacaacaga 900
gtcatgggac attacaatct gctgataaat ttgtataatc aggaaagaaa agacgaaaag 960
gattttaaaa aattgtcaca gttcaaaacc ttatttaaac agattggatg tggtaaacaa 1020
agtttgtttg agcaaattaa agatgatacg gaactcaaag aaaaattgag caaaataagt 1080
aaagctggtg aaaaatattt tgccgaacaa atcgatgata ccctaatcac catttacaca 1140
ttcatagaat ggttaagaga aaacaatgat tgggagggca cctattggtc gaaagcagca 1200
gttgataaga tttctaataa gtatttagca aactggcatg atataaaaga tcgcattcaa 1260
actgatctac aaggaaaaga caaagggtta aaagaaacat taaaatcagt tgcaacctac 1320
aacaaagagc gagaagaaca actgaaaata aacgatgccg ttgagctgtc ggggttgttt 1380
gaaattttga atcatgatgc tgttcaaggt tggagtaaag actttttcag ggaacatatt 1440
ttagaagaat ataaagattt aattgatgag aagcttacac caagccaaaa tcttatcaag 1500
cttatttgcg cagatatgca aaagttggca aaggaatttt gcgagaaatc agaagatgct 1560
ttaaaaataa tcgactataa aaatgaaaac aatattctac agataaaaga atggctggat 1620
agatcaaagt ggttgctgtg gattgtgaaa tattttgaag taaaggagag caaggtaaaa 1680
ggaaacagca ttaaccctga actgaccaat atactttctg ctttgctacg tgcggatgat 1740
tccaactggt ttgattggta cgatttggtg agaaattacc taagcaagaa accacaggaa 1800
gatgcaaaga aaaacaagct gaaactgaac tttgaaagca gttcttttct tggtggatgg 1860
ccaccagatt atgccaaaaa ggcaggactt ctatataaaa aagatgggct ttattattta 1920
gcgattaatt acaatttatc aaaagaagat attaaaacac taaagcagcc aaatggtgaa 1980
accgcaacaa ggataatcct tgattttcaa aaaccagata ataagaatac accacgttta 2040
ttcatccgtt caaagggaga tagttttgca cctgcggtgg aaaaatacaa tttaccaatt 2100
aatgatatac ttgacattta tgatacgggt aaattcagaa cagaacaccg aaaaaagaac 2160
gaagaagaat ataaaatatc attgggaaaa ttaattgatt attttaaaaa aggattttta 2220
aaacacgatt catacaaaca ctttaatttt gattggaaaa aaacatctga atacaaagac 2280
attgcagagt tctatcatga cacagaggta tcgtgttatc aaataaagga ggaaaacact 2340
tcttggaaaa aattactgga atttattgat gaagggaaag tatttctgtt tcaaatttac 2400
aacaaggatt tctctcaaag aaaaactgtt aggggtaaag ataacataca cacttattat 2460
tggaaaatgt tgtttagtga agaaaacaag agaaatgtga tctataaact taatggagaa 2520
tctgaaatat ttttcagaaa tcttgccaaa ggaatcaaaa aaagtcccgc acacacaact 2580
aaagattatg tactcaacag acgagaaaaa gaaacaaaca aaacaattcc gtacaaaatt 2640
catgatgaat tacgtttgtt tgccaataag aataaatcaa ttgaagcatt aagtgatgag 2700
gcaaaggctt atttagataa aaacaacgag attgatgaaa atagagttac tataaaaaaa 2760
ttaaaacatg acattgttaa agataaaaga tttacaacta ataaattctt cttgcattgc 2820
ccaattacat tgaattttaa ggcatacggc aacaggaatg ttacagaaac tgtcaacgac 2880
aacttcactc aaactaaaga tattcaaaaa gatattcaat ttctgggcat tgatcgtggt 2940
gagaagcatt tgatttacta ttcgttggtt aatgcaaatg gagaaatcat tgaacaggac 3000
cattttgatg ttattaacaa taaagattat ttgcaggaga taaataatgc tgcagataga 3060
cgtaagaaaa agcaggagaa ctggcaacaa aaaggcaata tatctaactt gaaagacggc 3120
tatatttcgt tggtaatcca tgagattatc aaaaagatga gagataaaga cggtaattat 3180
aaatccactt tcattgtatt ggaagatttg aatccaggat ttaaacgtag gcgtcaaaaa 3240
ttcgagcaac aagtatatca aaagtttgaa ttggcgttgg ctaaaaagct gaattacctt 3300
gtagataaaa atgtaacgga tatgggcaaa ataggttctg tgtcaaaagc cctgcaactt 3360
gtgccaccag tgactaatta cagagatatt gaaaatagaa aacaagttgg cattatgttg 3420
tatactcgtg ccaattacac ttctgtaact gatcctgtaa ctggttggag aaaaactatt 3480
tatctgaaaa agggaagcga agccgatatt aaaaaacaaa tactcagtgc ttttacagag 3540
attggcgttt atagtggtga ttatttcttt caatatactg atgtgaatgg caaagaatgg 3600
aaattgtggt ctggtaaagg cggtaaagcg ttggaacgtt acagggcaaa aagggggaaa 3660
gacaaaaacg aatatattat cgaacctttt gatgtaaaag aattattaga caaattattt 3720
gagaattttg ataactcaaa gtctttgaga cagcaatttg aagaaagagt agaacttaag 3780
aaagtcaatg agcacactgc ttgggaaaca cttcgttttg taattgatct gattcaacaa 3840
atcagaaatt caggagatat cactaaaaaa caagaagata aattctatgg tgcagataca 3900
aataaaaatc aagacgacaa ctttttgctt tctcctataa gggatgaaca aggagaacat 3960
tttgattcac gtaaatatca aagccaagaa ataccacatt tacccataga tgctgatgca 4020
aacggagcat ataatattgc ccgtaaagga atagttatgt atgagcatat caaacagtgg 4080
ataaatgatg ggaagcaaaa aacaaaatcc aagagggatg atagtaaaga aacaactgat 4140
ttggatcttt ttatttctga taaagagtgg gacttgtggt tatatgatag ggagaaatgg 4200
aatgaacaat tgccaatctt cgcttcaaga aaattaaatc agaaagagaa tacgaaacag 4260
agtgatgtta acattaatga atctagcgag atgcctatta aatga 4305
<210> 3
<211> 4305
<212> DNA
<213> arCas12a蛋白基因序列(SEQ ID NO.3)
<400> 3
atgaacaaaa aaggcaaatg ggataaattt accaacctgt atagcctgag caaaaccctg 60
cgctttgaac tgcgcccgat tggcaaagat ggcgtggtgc tgagcccgga agatgcgacc 120
gaactgctga ccaaaattat tgaaaaagat cgcctgatta aagcggcgta tgatgcgctg 180
aaaccggtgc tggataaaat tcatgaagat attattaaca aaagcctgac cagcgatgaa 240
gcgaaacaga ttgattttag cgaatatttt gaagaatata aaaaaggcaa agaaaaaaaa 300
ctggatggct ttgaaaaaaa actgcgcgaa cagattggca aaacctttga aaaaaccgtg 360
aaaacctata aaattaccaa aattaccaaa aagaaagaag aagaagaaaa accgctgttt 420
gaaattaaaa acggcgtgcc gaccgcgaaa gcggaaatta ttggctatct gagcgaacag 480
tataaagata acgtggaact gtgggcgcat attgaagaat ttgaaggctt ttttggctat 540
tttagcggct ataacaccaa ccgcagcaac tattatgaat ataaaaaaga agcgagcacc 600
gcggtggcga cccgcattgt gcatgaaaac ctgccgaaat tttgcgataa cgtgattcag 660
tttctgattg gcaaagtgca gaagaaaaag aaaaacgata cccgcaccga aaccattgtg 720
agccgcaaag aagaatatct gaaagcgtat cagtatctga aagataaccg cggcaccatt 780
cagattaaag atgcgaaaac caaccagctg attgaagcgc agccggtgag cgaagattgg 840
tttaacctgg ataaatttcc gaaatatctg agccaggaag gcattgatga atataaccgc 900
gtgatgggcc attataacct gctgattaac ctgtataacc aggaacgcaa agatgaaaaa 960
gattttaaaa aactgagcca gtttaaaacc ctgtttaaac agattggctg cggcaaacag 1020
agcctgtttg aacagattaa agatgatacc gaactgaaag aaaaactgag caaaattagc 1080
aaagcgggcg aaaaatattt tgcggaacag attgatgata ccctgattac catttatacc 1140
tttattgaat ggctgcgcga aaacaacgat tgggaaggca cctattggag caaagcggcg 1200
gtggataaaa ttagcaacaa atatctggcg aactggcatg atattaaaga tcgcattcag 1260
accgatctgc agggcaaaga taaaggcctg aaagaaaccc tgaaaagcgt ggcgacctat 1320
aacaaagaac gcgaagaaca gctgaaaatt aacgatgcgg tggaactgag cggcctgttt 1380
gaaattctga accatgatgc ggtgcagggc tggagcaaag atttttttcg cgaacatatt 1440
ctggaagaat ataaagatct gattgatgaa aaactgaccc cgagccagaa cctgattaaa 1500
ctgatttgcg cggatatgca gaaactggcg aaagaatttt gcgaaaaaag cgaagatgcg 1560
ctgaaaatta ttgattataa aaacgaaaac aacattctgc agattaaaga atggctggat 1620
cgcagcaaat ggctgctgtg gattgtgaaa tattttgaag tgaaagaaag caaagtgaaa 1680
ggcaacagca ttaacccgga actgaccaac attctgagcg cgctgctgcg cgcggatgat 1740
agcaactggt ttgattggta tgatctggtg cgcaactatc tgagcaaaaa accgcaggaa 1800
gatgcgaaga aaaacaaact gaaactgaac tttgaaagca gcagctttct gggcggctgg 1860
ccgccggatt atgcgaaaaa agcgggcctg ctgtataaaa aagatggcct gtattatctg 1920
gcgattaact ataacctgag caaagaagat attaaaaccc tgaaacagcc gaacggcgaa 1980
accgcgaccc gcattattct ggattttcag aaaccggata acaaaaacac cccgcgcctg 2040
tttattcgca gcaaaggcga tagctttgcg ccggcggtgg aaaaatataa cctgccgatt 2100
aacgatattc tggatattta tgataccggc aaatttcgca ccgaacatcg caagaaaaac 2160
gaagaagaat ataaaattag cctgggcaaa ctgattgatt attttaaaaa aggctttctg 2220
aaacatgata gctataaaca ttttaacttt gattggaaaa aaaccagcga atataaagat 2280
attgcggaat tttatcatga taccgaagtg agctgctatc agattaaaga agaaaacacc 2340
agctggaaaa aactgctgga atttattgat gaaggcaaag tgtttctgtt tcagatttat 2400
aacaaagatt ttagccagcg caaaaccgtg cgcggcaaag ataacattca tacctattat 2460
tggaaaatgc tgtttagcga agaaaacaaa cgcaacgtga tttataaact gaacggcgaa 2520
agcgaaattt tttttcgcaa cctggcgaaa ggcattaaaa aaagcccggc gcataccacc 2580
aaagattatg tgctgaaccg ccgcgaaaaa gaaaccaaca aaaccattcc gtataaaatt 2640
catgatgaac tgcgcctgtt tgcgaacaaa aacaaaagca ttgaagcgct gagcgatgaa 2700
gcgaaagcgt atctggataa aaacaacgaa attgatgaaa accgcgtgac cattaaaaaa 2760
ctgaaacatg atattgtgaa agataaacgc tttaccacca acaaattttt tctgcattgc 2820
ccgattaccc tgaactttaa agcgtatggc aaccgcaacg tgaccgaaac cgtgaacgat 2880
aactttaccc agaccaaaga tattcagaaa gatattcagt ttctgggcat tgatcgcggc 2940
gaaaaacatc tgatttatta tagcctggtg aacgcgaacg gcgaaattat tgaacaggat 3000
cattttgatg tgattaacaa caaagattat ctgcaggaaa ttaacaacgc ggcggatcgc 3060
cgcaaaaaga aacaggaaaa ctggcagcag aaaggcaaca ttagcaacct gaaagatggc 3120
tatattagcc tggtgattca tgaaattatt aaaaaaatgc gcgataaaga tggcaactat 3180
aaaagcacct ttattgtgct ggaagatctg aacccgggct ttaaacgccg ccgccagaaa 3240
tttgaacagc aggtgtatca gaaatttgaa ctggcgctgg cgaaaaaact gaactatctg 3300
gtggataaaa acgtgaccga tatgggcaaa attggcagcg tgagcaaagc gctgcagctg 3360
gtgccgccgg tgaccaacta tcgcgatatt gaaaaccgca aacaggtggg cattatgctg 3420
tatacccgcg cgaactatac cagcgtgacc gatccggtga ccggctggcg caaaaccatt 3480
tatctgaaaa aaggcagcga agcggatatt aaaaaacaga ttctgagcgc gtttaccgaa 3540
attggcgtgt atagcggcga ttattttttt cagtataccg atgtgaacgg caaagaatgg 3600
aaactgtgga gcggcaaagg cggcaaagcg ctggaacgct atcgcgcgaa acgcggcaaa 3660
gataaaaacg aatatattat tgaaccgttt gatgtgaaag aactgctgga taaactgttt 3720
gaaaactttg ataacagcaa aagcctgcgc cagcagtttg aagaacgcgt ggaactgaaa 3780
aaagtgaacg aacataccgc gtgggaaacc ctgcgctttg tgattgatct gattcagcag 3840
attcgcaaca gcggcgatat taccaaaaaa caggaagata aattttatgg cgcggatacc 3900
aacaaaaacc aggatgataa ctttctgctg agcccgattc gcgatgaaca gggcgaacat 3960
tttgatagcc gcaaatatca gagccaggaa attccgcatc tgccgattga tgcggatgcg 4020
aacggcgcgt ataacattgc gcgcaaaggc attgtgatgt atgaacatat taaacagtgg 4080
attaacgatg gcaaacagaa aaccaaaagc aaacgcgatg atagcaaaga aaccaccgat 4140
ctggatctgt ttattagcga taaagaatgg gatctgtggc tgtatgatcg cgaaaaatgg 4200
aacgaacagc tgccgatttt tgcgagccgc aaactgaacc agaaagaaaa caccaaacag 4260
agcgatgtga acattaacga aagcagcgaa atgccgatta aataa 4305
<210> 4
<211> 21
<212> DNA
<213> gRNA框架序列(SEQ ID NO.4)
<400> 4
taatttctac taagtgtaga t 21
<210> 5
<211> 20
<212> DNA
<213> gRNA框架序列(SEQ ID NO.5)
<400> 5
taatttctac tattgtagat 20
<210> 6
<211> 36
<212> DNA
<213> gRNA框架序列(SEQ ID NO.6)
<400> 6
gtctaatatc aatattcaat ttctactttc gtagat 36
<210> 7
<211> 21
<212> DNA
<213> gRNA框架序列(SEQ ID NO.7)
<400> 7
tcaatttcta ctttcgtaga t 21
<210> 8
<211> 36
<212> DNA
<213> gRNA框架序列(SEQ ID NO.8)
<400> 8
atctacaaaa gtagaaatgt gctatctgta tttgag 36
<210> 9
<211> 36
<212> DNA
<213> gRNA框架序列(SEQ ID NO.9)
<400> 9
gtctaatatc aatattcaat ttctactttc gtagat 36
<210> 10
<211> 21
<212> DNA
<213> gRNA框架序列(SEQ ID NO.10)
<400> 10
tcaatttcta ctttcgtaga t 21
<210> 11
<211> 24
<212> DNA
<213> gRNA框架序列(SEQ ID NO.11)
<400> 11
gaaactgtaa gcggaatgtc tact 24
<210> 12
<211> 324
<212> DNA
<213> 靶序列(SEQ ID NO.12)
<400> 12
ttatcttaaa aaattacagg atatttataa gaagcttgag ggtcacccct ttctttttag 60
tccgtcgaaa accaatgaaa aagagtttat tactctgcta aaccaagcct tggcctcgac 120
gcagctttac cgcagcatac aacagctgtt tttaacgatg tataagctag atcccattgg 180
gtttgttaac tatattaaag cgagtaaaca agagtattta tgtctgttga ttaatcctaa 240
actagtcact aagtttttaa aaataacgag ctttaaaatt tacattaatt tcaggctaaa 300
aactttctat ataagtccta ataa 324
<210> 13
<211> 41
<212> DNA
<213> gRNA序列(SEQ ID NO.13)
<400> 13
taatttctac taagtgtaga tacgatgtat aagctagatc c 41
<210> 14
<211> 23
<212> DNA
<213> 引物序列(SEQ ID NO.14)
<400> 14
atgaacaaaa aaggcaaatg gga 23
<210> 15
<211> 25
<212> DNA
<213> 引物序列(SEQ ID NO.15)
<400> 15
tcatttaata ggcatctcgc tagat 25
<210> 16
<211> 30
<212> DNA
<213> 引物序列(SEQ ID NO.16)
<400> 16
tactctgcta aaccaagcct tggcctcgac 30
<210> 17
<211> 30
<212> DNA
<213> 引物序列(SEQ ID NO.17)
<400> 17
ctcttgttta ctcgctttaa tatagttaac 30
Claims (9)
1.一种用于核酸检测的arCas12a蛋白,其特征在于,所述arCas12a蛋白氨基酸序列如SEQ ID NO.1所示;和/或,所述arCas12a蛋白核苷酸序列如SEQ ID NO.2或SEQ ID NO.3所示。
2.一种基于CRISPR/Cas12a的核酸检测系统,其特征在于,包括权利要求1所述的arCas12a蛋白和gRNA。
3.如权利要求2所述的核酸检测系统,其特征在于,所述gRNA包括a)与Cas核酸酶相互作用的框架核酸片段,以及b)与靶核酸结合的特异性核酸片段,所述与 Cas核酸酶相互作用的框架核酸片段为SEQ ID NO.4~7中至少一种所示。
4.如权利要求2所述的核酸检测系统,其特征在于,作为一种可选择的案例,当靶序列如SEQ ID NO.12所示时,gRNA的序列如SEQ ID NO.13所示。
5.一种基于权利要求2~4任一所述系统的核酸检测方法,其特征在于,利用权利要求1所述的arCas12a蛋白和对应靶标序列的gRNA进行CRISPR核酸检测。
6.一种基于权利要求2~4任一所述系统的核酸检测方法,其特征在于,核酸检测最终结果的获取通过可视化方法实现,所述可视化方法为基于所述arCas12a蛋白附属切割活性的荧光信号检测方式、所述arCas12a蛋白附属切割活性的其他信号检测方式、基于所述arCas12a附属切割活性的胶体金侧向层析方式中的一种或多种。
7.如权利要求6所述的核酸检测方法,其特征在于,所述基于arCas12a蛋白附属切割活性的荧光信号检测方式为通过arCas12a的附属切割活性切割标记了荧光基团与淬灭基团的报告DNA链后,反应体系置于荧光分析仪中检测荧光信号;或通过所述arCas12a的附属切割活性切割标记了生物素、荧光基团、地高辛或其他基团的核酸片段后,反应体系通过胶体金侧向层析方式实现核酸产物检测;或通过所述arCas12a的附属切割活性切割聚集化的标记有寡核苷酸的胶体金颗粒,使胶体金颗粒颜色发生变化,记录颜色变化实现检测。
8.权利要求1所述的arCas12a蛋白的下述任一应用:
a)在切割DNA方面的应用;
b)在作为或制备DNA切割工具方面的应用;
c)在核酸检测方面的应用;
d)在基于CRISPR/Cas12a的核酸检测方面的应用;
e)在作为或制备核酸检测工具方面的应用;
f)在作为或制备基于CRISPR/Cas12a的核酸检测工具方面的应用。
9.如权利要求8所述的应用,其特征在于,所述arCas12a蛋白具有靶向体外DNA序列或体内基因组序列进行特异切割的活性;和/或,具有gRNA介导的DNA切割活性。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010728504.0A CN112301019B (zh) | 2020-07-27 | 2020-07-27 | 新型耐高温arCas12a蛋白在核酸检测方面的应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010728504.0A CN112301019B (zh) | 2020-07-27 | 2020-07-27 | 新型耐高温arCas12a蛋白在核酸检测方面的应用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112301019A true CN112301019A (zh) | 2021-02-02 |
CN112301019B CN112301019B (zh) | 2023-11-24 |
Family
ID=74483603
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010728504.0A Active CN112301019B (zh) | 2020-07-27 | 2020-07-27 | 新型耐高温arCas12a蛋白在核酸检测方面的应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112301019B (zh) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112980929A (zh) * | 2021-04-13 | 2021-06-18 | 广州海思医疗科技有限公司 | 一种快速检测hla-b*1502的试剂盒 |
CN113444777A (zh) * | 2021-07-20 | 2021-09-28 | 安徽医科大学第四附属医院 | 一种用于碳青霉烯酶检测的CrRNA、CRISPR-Cas12a系统及应用 |
CN113512596A (zh) * | 2021-09-15 | 2021-10-19 | 北京林业大学 | 用于检测松材线虫的crRNA及试剂盒 |
CN114214348A (zh) * | 2021-11-16 | 2022-03-22 | 河南农业大学 | 应用于核酸检测的Crispr/LbCas12a突变体蛋白及其制备方法和应用 |
CN116103378A (zh) * | 2022-05-10 | 2023-05-12 | 四川大学 | 一种检测单核苷酸多态性的crispr试剂盒 |
-
2020
- 2020-07-27 CN CN202010728504.0A patent/CN112301019B/zh active Active
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112980929A (zh) * | 2021-04-13 | 2021-06-18 | 广州海思医疗科技有限公司 | 一种快速检测hla-b*1502的试剂盒 |
CN112980929B (zh) * | 2021-04-13 | 2021-07-20 | 广州海思医疗科技有限公司 | 一种快速检测hla-b*1502的试剂盒 |
CN113444777A (zh) * | 2021-07-20 | 2021-09-28 | 安徽医科大学第四附属医院 | 一种用于碳青霉烯酶检测的CrRNA、CRISPR-Cas12a系统及应用 |
CN113512596A (zh) * | 2021-09-15 | 2021-10-19 | 北京林业大学 | 用于检测松材线虫的crRNA及试剂盒 |
CN113512596B (zh) * | 2021-09-15 | 2021-12-24 | 北京林业大学 | 用于检测松材线虫的crRNA及试剂盒 |
CN114214348A (zh) * | 2021-11-16 | 2022-03-22 | 河南农业大学 | 应用于核酸检测的Crispr/LbCas12a突变体蛋白及其制备方法和应用 |
CN116103378A (zh) * | 2022-05-10 | 2023-05-12 | 四川大学 | 一种检测单核苷酸多态性的crispr试剂盒 |
Also Published As
Publication number | Publication date |
---|---|
CN112301019B (zh) | 2023-11-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112301016B (zh) | 新型mlCas12a蛋白在核酸检测方面的应用 | |
CN112301019B (zh) | 新型耐高温arCas12a蛋白在核酸检测方面的应用 | |
JP5794572B2 (ja) | 核酸を修飾するためのトランスポゾン末端組成物及び方法 | |
EP1007654B1 (en) | In vitro peptide or protein expression library | |
JP6745599B2 (ja) | 分子の作製 | |
KR100957057B1 (ko) | 핵산과 신호 프로브의 동시 등온증폭을 이용한 핵산의검출방법 | |
US11066662B2 (en) | Methods for the production of libraries for directed evolution | |
CN109666662A (zh) | 新型ScCas12a在核酸检测方面的应用 | |
CN107532213B (zh) | 用于同时检测样品中多个核酸序列的方法 | |
CN102625838A (zh) | 用于生成rRNA除尽的样本或者用于从样本分离rRNA的方法、组合物和试剂盒 | |
KR20210114918A (ko) | 복합체 표면-결합 트랜스포좀 복합체 | |
CN113046475B (zh) | 一种快速检测突变型新型冠状病毒的引物组合物及试剂盒 | |
Glick et al. | Medical biotechnology | |
CN117551746A (zh) | 一种检测目标核酸及其邻近区域核酸序列的方法 | |
Karash et al. | Preparation of transposon library and Tn-seq amplicon library for Salmonella Typhimurium | |
CN114507903A (zh) | 质粒测序方法 | |
Luan et al. | Seamless site-directed mutagenesis in complex cloned DNA sequences using the RedEx method | |
Tavanti | Synthetic DNA libraries for protein engineering toward process improvement in drug synthesis | |
US20240117449A1 (en) | Detecting Listeria Monocytogenes with Helicase-Dependent Amplification Assay | |
CN113337489A (zh) | 一种新型嵌合体TsCas12a蛋白及制备技术 | |
Smiley et al. | Sequence-directed covalent protein-RNA linkages using engineered HUH-Tags | |
CN116829713A (zh) | 发夹寡核苷酸及其用途 | |
JP5300335B2 (ja) | ミスマッチ塩基対含有核酸断片の高効率調製方法 | |
NZ509766A (en) | Method for separating and characterising functions potentially present in a biological sample containing nucleic acids | |
Lardelli | Generation and PCR Screening of Bacteriophage λ Sublibraries Enriched for Rare Clones (the “Sublibrary Method”) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |