CN110747227B - Blue light induced and activated Cre recombination optimization system and application thereof - Google Patents
Blue light induced and activated Cre recombination optimization system and application thereof Download PDFInfo
- Publication number
- CN110747227B CN110747227B CN201910746260.6A CN201910746260A CN110747227B CN 110747227 B CN110747227 B CN 110747227B CN 201910746260 A CN201910746260 A CN 201910746260A CN 110747227 B CN110747227 B CN 110747227B
- Authority
- CN
- China
- Prior art keywords
- cre
- coding gene
- seq
- protein
- cre recombinase
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000005457 optimization Methods 0.000 title claims abstract description 19
- 230000006798 recombination Effects 0.000 title claims abstract description 12
- 238000005215 recombination Methods 0.000 title claims abstract description 12
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 126
- 108010051219 Cre recombinase Proteins 0.000 claims abstract description 67
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 48
- 230000006698 induction Effects 0.000 claims abstract description 22
- 102100026280 Cryptochrome-2 Human genes 0.000 claims abstract description 15
- 101000855613 Homo sapiens Cryptochrome-2 Proteins 0.000 claims abstract description 15
- 239000013612 plasmid Substances 0.000 claims description 31
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 24
- 239000013598 vector Substances 0.000 claims description 22
- 108010075254 C-Peptide Proteins 0.000 claims description 16
- 239000002773 nucleotide Substances 0.000 claims description 16
- 125000003729 nucleotide group Chemical group 0.000 claims description 16
- 102100032216 Calcium and integrin-binding protein 1 Human genes 0.000 claims description 15
- 101000943475 Homo sapiens Calcium and integrin-binding protein 1 Proteins 0.000 claims description 15
- 108010077850 Nuclear Localization Signals Proteins 0.000 claims description 14
- 230000009261 transgenic effect Effects 0.000 claims description 14
- 239000003446 ligand Substances 0.000 claims description 12
- 239000013603 viral vector Substances 0.000 claims description 12
- 241001465754 Metazoa Species 0.000 claims description 11
- 230000033228 biological regulation Effects 0.000 claims description 7
- 108010091086 Recombinases Proteins 0.000 claims description 5
- 238000011160 research Methods 0.000 claims description 5
- 238000013518 transcription Methods 0.000 claims description 5
- 230000035897 transcription Effects 0.000 claims description 5
- 241000124008 Mammalia Species 0.000 claims description 4
- 102000018120 Recombinases Human genes 0.000 claims description 4
- 239000013600 plasmid vector Substances 0.000 claims description 4
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 3
- 240000007019 Oxalis corniculata Species 0.000 claims description 3
- 238000010008 shearing Methods 0.000 claims description 3
- 208000009889 Herpes Simplex Diseases 0.000 claims description 2
- 101710135898 Myc proto-oncogene protein Proteins 0.000 claims description 2
- 102100038895 Myc proto-oncogene protein Human genes 0.000 claims description 2
- 101710150448 Transcriptional regulator Myc Proteins 0.000 claims description 2
- 206010046865 Vaccinia virus infection Diseases 0.000 claims description 2
- 230000001177 retroviral effect Effects 0.000 claims description 2
- 238000011820 transgenic animal model Methods 0.000 claims description 2
- 208000007089 vaccinia Diseases 0.000 claims description 2
- 108700026220 vif Genes Proteins 0.000 claims description 2
- 108091093126 WHP Posttrascriptional Response Element Proteins 0.000 claims 2
- 238000003776 cleavage reaction Methods 0.000 claims 1
- 230000008521 reorganization Effects 0.000 claims 1
- 230000000694 effects Effects 0.000 abstract description 21
- 102000037865 fusion proteins Human genes 0.000 abstract description 3
- 108020001507 fusion proteins Proteins 0.000 abstract description 3
- 230000008859 change Effects 0.000 abstract description 2
- 210000004027 cell Anatomy 0.000 description 24
- 241000699670 Mus sp. Species 0.000 description 21
- 108020004414 DNA Proteins 0.000 description 11
- 238000011830 transgenic mouse model Methods 0.000 description 10
- 108090000765 processed proteins & peptides Proteins 0.000 description 7
- 241000699666 Mus <mouse, genus> Species 0.000 description 6
- 241000699660 Mus musculus Species 0.000 description 6
- 238000000034 method Methods 0.000 description 6
- 108060001084 Luciferase Proteins 0.000 description 5
- 239000005089 Luciferase Substances 0.000 description 5
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 5
- 108010061238 threonyl-glycine Proteins 0.000 description 5
- 210000003462 vein Anatomy 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- 108010079364 N-glycylalanine Proteins 0.000 description 4
- 108010047495 alanylglycine Proteins 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 238000005286 illumination Methods 0.000 description 4
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 3
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 3
- 108010038633 aspartylglutamate Proteins 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 108010050848 glycylleucine Proteins 0.000 description 3
- 238000011503 in vivo imaging Methods 0.000 description 3
- 108010054155 lysyllysine Proteins 0.000 description 3
- 238000004445 quantitative analysis Methods 0.000 description 3
- 230000011218 segmentation Effects 0.000 description 3
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 3
- HKZAAJSTFUZYTO-LURJTMIESA-N (2s)-2-[[2-[[2-[[2-[(2-aminoacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]-3-hydroxypropanoic acid Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O HKZAAJSTFUZYTO-LURJTMIESA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 2
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 2
- 108020004684 Internal Ribosome Entry Sites Proteins 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 2
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 2
- MSSJHBAKDDIRMJ-SRVKXCTJSA-N Met-Lys-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MSSJHBAKDDIRMJ-SRVKXCTJSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- 108010020764 Transposases Proteins 0.000 description 2
- 102000008579 Transposases Human genes 0.000 description 2
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 2
- 238000012512 characterization method Methods 0.000 description 2
- 230000001276 controlling effect Effects 0.000 description 2
- 230000007547 defect Effects 0.000 description 2
- 235000013601 eggs Nutrition 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 238000002347 injection Methods 0.000 description 2
- 239000007924 injection Substances 0.000 description 2
- 210000004185 liver Anatomy 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000001124 posttranscriptional effect Effects 0.000 description 2
- 102000004196 processed proteins & peptides Human genes 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 238000010200 validation analysis Methods 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- PIDRBUDUWHBYSR-UHFFFAOYSA-N 1-[2-[[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O PIDRBUDUWHBYSR-UHFFFAOYSA-N 0.000 description 1
- QMOQBVOBWVNSNO-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QMOQBVOBWVNSNO-UHFFFAOYSA-N 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- IDLBLNBDLCTPGC-HERUPUMHSA-N Ala-Trp-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CS)C(=O)O)N IDLBLNBDLCTPGC-HERUPUMHSA-N 0.000 description 1
- TVUFMYKTYXTRPY-HERUPUMHSA-N Ala-Trp-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O TVUFMYKTYXTRPY-HERUPUMHSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 1
- YFBGNGASPGRWEM-DCAQKATOSA-N Arg-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFBGNGASPGRWEM-DCAQKATOSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 1
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 1
- NYDIVDKTULRINZ-AVGNSLFASA-N Arg-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NYDIVDKTULRINZ-AVGNSLFASA-N 0.000 description 1
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 1
- WAEWODAAWLGLMK-OYDLWJJNSA-N Arg-Trp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WAEWODAAWLGLMK-OYDLWJJNSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- BWMMKQPATDUYKB-IHRRRGAJSA-N Arg-Tyr-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=C(O)C=C1 BWMMKQPATDUYKB-IHRRRGAJSA-N 0.000 description 1
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 1
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 1
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- DQTIWTULBGLJBL-DCAQKATOSA-N Asn-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N DQTIWTULBGLJBL-DCAQKATOSA-N 0.000 description 1
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 1
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 1
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 1
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 1
- AEZCCDMZZJOGII-DCAQKATOSA-N Asn-Met-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O AEZCCDMZZJOGII-DCAQKATOSA-N 0.000 description 1
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 1
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 1
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 1
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 1
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 1
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 1
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 1
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- WJHYGGVCWREQMO-GHCJXIJMSA-N Asp-Cys-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WJHYGGVCWREQMO-GHCJXIJMSA-N 0.000 description 1
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 description 1
- BPTFNDRZKBFMTH-DCAQKATOSA-N Asp-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N BPTFNDRZKBFMTH-DCAQKATOSA-N 0.000 description 1
- APXVSPGLYXFFSD-AJNGGQMLSA-N Asp-Phe-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)N)CC1=CC=CC=C1 APXVSPGLYXFFSD-AJNGGQMLSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- NBKLEMWHDLAUEM-CIUDSAMLSA-N Asp-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N NBKLEMWHDLAUEM-CIUDSAMLSA-N 0.000 description 1
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 1
- XYPJXLLXNSAWHZ-SRVKXCTJSA-N Asp-Ser-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XYPJXLLXNSAWHZ-SRVKXCTJSA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 1
- AMRLSQGGERHDHJ-FXQIFTODSA-N Cys-Ala-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMRLSQGGERHDHJ-FXQIFTODSA-N 0.000 description 1
- XGIAHEUULGOZHH-GUBZILKMSA-N Cys-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N XGIAHEUULGOZHH-GUBZILKMSA-N 0.000 description 1
- WDQXKVCQXRNOSI-GHCJXIJMSA-N Cys-Asp-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WDQXKVCQXRNOSI-GHCJXIJMSA-N 0.000 description 1
- BLGNLNRBABWDST-CIUDSAMLSA-N Cys-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BLGNLNRBABWDST-CIUDSAMLSA-N 0.000 description 1
- WTEACWBAULENKE-SRVKXCTJSA-N Cys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N WTEACWBAULENKE-SRVKXCTJSA-N 0.000 description 1
- SRZZZTMJARUVPI-JBDRJPRFSA-N Cys-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N SRZZZTMJARUVPI-JBDRJPRFSA-N 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 1
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 1
- IWUFOVSLWADEJC-AVGNSLFASA-N Gln-His-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IWUFOVSLWADEJC-AVGNSLFASA-N 0.000 description 1
- MFHVAWMMKZBSRQ-ACZMJKKPSA-N Gln-Ser-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N MFHVAWMMKZBSRQ-ACZMJKKPSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- AKDOUBMVLRCHBD-SIUGBPQLSA-N Gln-Tyr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AKDOUBMVLRCHBD-SIUGBPQLSA-N 0.000 description 1
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 1
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 1
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- GMAGZGCAYLQBKF-NHCYSSNCSA-N Glu-Met-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GMAGZGCAYLQBKF-NHCYSSNCSA-N 0.000 description 1
- YUXIEONARHPUTK-JBACZVJFSA-N Glu-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CCC(=O)O)N YUXIEONARHPUTK-JBACZVJFSA-N 0.000 description 1
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- JVZLZVJTIXVIHK-SXNHZJKMSA-N Glu-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)O)N JVZLZVJTIXVIHK-SXNHZJKMSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- LGQZOQRDEUIZJY-YUMQZZPRSA-N Gly-Cys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CS)NC(=O)CN)C(O)=O LGQZOQRDEUIZJY-YUMQZZPRSA-N 0.000 description 1
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- NTOWAXLMQFKJPT-YUMQZZPRSA-N Gly-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN NTOWAXLMQFKJPT-YUMQZZPRSA-N 0.000 description 1
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 1
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 1
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- BBTCXWTXOXUNFX-IUCAKERBSA-N Gly-Met-Arg Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O BBTCXWTXOXUNFX-IUCAKERBSA-N 0.000 description 1
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 1
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 1
- ONSARSFSJHTMFJ-STQMWFEESA-N Gly-Trp-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ONSARSFSJHTMFJ-STQMWFEESA-N 0.000 description 1
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- PDSUIXMZYNURGI-AVGNSLFASA-N His-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 PDSUIXMZYNURGI-AVGNSLFASA-N 0.000 description 1
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 1
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 1
- JBSLJUPMTYLLFH-MELADBBJSA-N His-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O JBSLJUPMTYLLFH-MELADBBJSA-N 0.000 description 1
- SKYULSWNBYAQMG-IHRRRGAJSA-N His-Leu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SKYULSWNBYAQMG-IHRRRGAJSA-N 0.000 description 1
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 1
- KHUFDBQXGLEIHC-BZSNNMDCSA-N His-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 KHUFDBQXGLEIHC-BZSNNMDCSA-N 0.000 description 1
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 1
- GYXDQXPCPASCNR-NHCYSSNCSA-N His-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N GYXDQXPCPASCNR-NHCYSSNCSA-N 0.000 description 1
- WSXNWASHQNSMRX-GVXVVHGQSA-N His-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WSXNWASHQNSMRX-GVXVVHGQSA-N 0.000 description 1
- 241000701024 Human betaherpesvirus 5 Species 0.000 description 1
- ASCFJMSGKUIRDU-ZPFDUUQYSA-N Ile-Arg-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O ASCFJMSGKUIRDU-ZPFDUUQYSA-N 0.000 description 1
- LEDRIAHEWDJRMF-CFMVVWHZSA-N Ile-Asn-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LEDRIAHEWDJRMF-CFMVVWHZSA-N 0.000 description 1
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 1
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 1
- KOPIAUWNLKKELG-SIGLWIIPSA-N Ile-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N KOPIAUWNLKKELG-SIGLWIIPSA-N 0.000 description 1
- GLLAUPMJCGKPFY-BLMTYFJBSA-N Ile-Ile-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 GLLAUPMJCGKPFY-BLMTYFJBSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 1
- JTBFQNHKNRZJDS-SYWGBEHUSA-N Ile-Trp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C)C(=O)O)N JTBFQNHKNRZJDS-SYWGBEHUSA-N 0.000 description 1
- CZOAJJGXTGUYOJ-SPOWBLRKSA-N Ile-Trp-Cys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@@H](C)CC)C(=O)N[C@@H](CS)C(O)=O)=CNC2=C1 CZOAJJGXTGUYOJ-SPOWBLRKSA-N 0.000 description 1
- PMAOIIWHZHAPBT-HJPIBITLSA-N Ile-Tyr-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N PMAOIIWHZHAPBT-HJPIBITLSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- ZSESFIFAYQEKRD-CYDGBPFRSA-N Ile-Val-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N ZSESFIFAYQEKRD-CYDGBPFRSA-N 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 1
- XBCWOTOCBXXJDG-BZSNNMDCSA-N Leu-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XBCWOTOCBXXJDG-BZSNNMDCSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 1
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- SQXUUGUCGJSWCK-CIUDSAMLSA-N Lys-Asp-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N SQXUUGUCGJSWCK-CIUDSAMLSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 1
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 1
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 1
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 1
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- FBQMBZLJHOQAIH-GUBZILKMSA-N Met-Asp-Met Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O FBQMBZLJHOQAIH-GUBZILKMSA-N 0.000 description 1
- YLLWCSDBVGZLOW-CIUDSAMLSA-N Met-Gln-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O YLLWCSDBVGZLOW-CIUDSAMLSA-N 0.000 description 1
- FGAMAYQCWQCUNF-DCAQKATOSA-N Met-His-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FGAMAYQCWQCUNF-DCAQKATOSA-N 0.000 description 1
- GETCJHFFECHWHI-QXEWZRGKSA-N Met-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCSC)N GETCJHFFECHWHI-QXEWZRGKSA-N 0.000 description 1
- HWROAFGWPQUPTE-OSUNSFLBSA-N Met-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCSC)N HWROAFGWPQUPTE-OSUNSFLBSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- UFOWQBYMUILSRK-IHRRRGAJSA-N Met-Lys-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 UFOWQBYMUILSRK-IHRRRGAJSA-N 0.000 description 1
- CULGJGUDIJATIP-STQMWFEESA-N Met-Tyr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 CULGJGUDIJATIP-STQMWFEESA-N 0.000 description 1
- VEKRTVRZDMUOQN-AVGNSLFASA-N Met-Val-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 VEKRTVRZDMUOQN-AVGNSLFASA-N 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 1
- VHWOBXIWBDWZHK-IHRRRGAJSA-N Phe-Arg-Asp Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHWOBXIWBDWZHK-IHRRRGAJSA-N 0.000 description 1
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 1
- IQXOZIDWLZYYAW-IHRRRGAJSA-N Phe-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IQXOZIDWLZYYAW-IHRRRGAJSA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- RSPUIENXSJYZQO-JYJNAYRXSA-N Phe-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RSPUIENXSJYZQO-JYJNAYRXSA-N 0.000 description 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 1
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 1
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 1
- WKLMCMXFMQEKCX-SLFFLAALSA-N Phe-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O WKLMCMXFMQEKCX-SLFFLAALSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- CXMSESHALPOLRE-MEYUZBJRSA-N Phe-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O CXMSESHALPOLRE-MEYUZBJRSA-N 0.000 description 1
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- QVIZLAUEAMQKGS-GUBZILKMSA-N Pro-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 QVIZLAUEAMQKGS-GUBZILKMSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 1
- QNZLIVROMORQFH-BQBZGAKWSA-N Pro-Gly-Cys Chemical compound C1C[C@H](NC1)C(=O)NCC(=O)N[C@@H](CS)C(=O)O QNZLIVROMORQFH-BQBZGAKWSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- KLOQCCRTPHPIFN-DCAQKATOSA-N Pro-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 KLOQCCRTPHPIFN-DCAQKATOSA-N 0.000 description 1
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- XNJVJEHDZPDPQL-BZSNNMDCSA-N Pro-Trp-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H]1CCCN1)C(O)=O XNJVJEHDZPDPQL-BZSNNMDCSA-N 0.000 description 1
- VGFFUEVZKRNRHT-ULQDDVLXSA-N Pro-Trp-Glu Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCC(=O)O)C(=O)O VGFFUEVZKRNRHT-ULQDDVLXSA-N 0.000 description 1
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- UICKAKRRRBTILH-GUBZILKMSA-N Ser-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N UICKAKRRRBTILH-GUBZILKMSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- HMRAQFJFTOLDKW-GUBZILKMSA-N Ser-His-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMRAQFJFTOLDKW-GUBZILKMSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- XGQKSRGHEZNWIS-IHRRRGAJSA-N Ser-Pro-Tyr Chemical compound N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O XGQKSRGHEZNWIS-IHRRRGAJSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- STIAINRLUUKYKM-WFBYXXMGSA-N Ser-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 STIAINRLUUKYKM-WFBYXXMGSA-N 0.000 description 1
- SGZVZUCRAVSPKQ-FXQIFTODSA-N Ser-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N SGZVZUCRAVSPKQ-FXQIFTODSA-N 0.000 description 1
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- ZBKDBZUTTXINIX-RWRJDSDZSA-N Thr-Ile-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZBKDBZUTTXINIX-RWRJDSDZSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- PCMDGXKXVMBIFP-VEVYYDQMSA-N Thr-Met-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMDGXKXVMBIFP-VEVYYDQMSA-N 0.000 description 1
- BDENGIGFTNYZSJ-RCWTZXSCSA-N Thr-Pro-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O BDENGIGFTNYZSJ-RCWTZXSCSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- MYNYCUXMIIWUNW-IEGACIPQSA-N Thr-Trp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MYNYCUXMIIWUNW-IEGACIPQSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 108700029229 Transcriptional Regulatory Elements Proteins 0.000 description 1
- RSUXQZNWAOTBQF-XIRDDKMYSA-N Trp-Arg-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RSUXQZNWAOTBQF-XIRDDKMYSA-N 0.000 description 1
- LTLBNCDNXQCOLB-UBHSHLNASA-N Trp-Asp-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 LTLBNCDNXQCOLB-UBHSHLNASA-N 0.000 description 1
- SAKLWFSRZTZQAJ-GQGQLFGLSA-N Trp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SAKLWFSRZTZQAJ-GQGQLFGLSA-N 0.000 description 1
- UUIYFDAWNBSWPG-IHPCNDPISA-N Trp-Lys-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N UUIYFDAWNBSWPG-IHPCNDPISA-N 0.000 description 1
- LFGHEUIUSIRJAE-TUSQITKMSA-N Trp-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N LFGHEUIUSIRJAE-TUSQITKMSA-N 0.000 description 1
- CSOBBJWWODOYGW-ILWGZMRPSA-N Trp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N)C(=O)O CSOBBJWWODOYGW-ILWGZMRPSA-N 0.000 description 1
- GEGYPBOPIGNZIF-CWRNSKLLSA-N Trp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O GEGYPBOPIGNZIF-CWRNSKLLSA-N 0.000 description 1
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 1
- JJNXZIPLIXIGBX-HJPIBITLSA-N Tyr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JJNXZIPLIXIGBX-HJPIBITLSA-N 0.000 description 1
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 1
- PLXQRTXVLZUNMU-RNXOBYDBSA-N Tyr-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC4=CC=C(C=C4)O)N PLXQRTXVLZUNMU-RNXOBYDBSA-N 0.000 description 1
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 1
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 1
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 1
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 241001492404 Woodchuck hepatitis virus Species 0.000 description 1
- 108010081404 acein-2 Proteins 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000004913 activation Effects 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000006037 cell lysis Effects 0.000 description 1
- 210000003855 cell nucleus Anatomy 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- YTRQFSDWAXHJCC-UHFFFAOYSA-N chloroform;phenol Chemical compound ClC(Cl)Cl.OC1=CC=CC=C1 YTRQFSDWAXHJCC-UHFFFAOYSA-N 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000013467 fragmentation Methods 0.000 description 1
- 238000006062 fragmentation reaction Methods 0.000 description 1
- 238000010362 genome editing Methods 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 238000010859 live-cell imaging Methods 0.000 description 1
- 210000004072 lung Anatomy 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 238000010172 mouse model Methods 0.000 description 1
- 230000008212 organismal development Effects 0.000 description 1
- 230000035515 penetration Effects 0.000 description 1
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 108010015840 seryl-prolyl-lysyl-lysine Proteins 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K67/00—Rearing or breeding animals, not otherwise provided for; New or modified breeds of animals
- A01K67/027—New or modified breeds of vertebrates
- A01K67/0275—Genetically modified vertebrates, e.g. transgenic
- A01K67/0276—Knock-out vertebrates
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
- C12N15/902—Stable introduction of foreign DNA into chromosome using homologous recombination
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
- C12N15/902—Stable introduction of foreign DNA into chromosome using homologous recombination
- C12N15/907—Stable introduction of foreign DNA into chromosome using homologous recombination in mammalian cells
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2207/00—Modified animals
- A01K2207/15—Humanized animals
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2217/00—Genetically modified animals
- A01K2217/07—Animals genetically altered by homologous recombination
- A01K2217/075—Animals genetically altered by homologous recombination inducing loss of function, i.e. knock out
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2227/00—Animals characterised by species
- A01K2227/10—Mammal
- A01K2227/105—Murine
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2267/00—Animals characterised by purpose
- A01K2267/03—Animal model, e.g. for test or diseases
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/30—Vector systems comprising sequences for excision in presence of a recombinase, e.g. loxP or FRT
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Biotechnology (AREA)
- Organic Chemistry (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Chemical & Material Sciences (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Environmental Sciences (AREA)
- Mycology (AREA)
- Veterinary Medicine (AREA)
- Animal Behavior & Ethology (AREA)
- Animal Husbandry (AREA)
- Biodiversity & Conservation Biology (AREA)
- Cell Biology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The application discloses a blue light induction activated Cre recombination optimization system and application. The Cre recombinase generated by the system mainly comprises two fusion proteins CIB1-creC which are separated from each other 106‑343 And CRY2-CreN 19‑104 Forming; when the two proteins are under certain blue light conditions, CRY2-CreN 19‑104 The CRY2 in (A) undergoes conformational change with CIB1-CreC 106‑343 In (3) to allow CreC 106‑343 And CreN 19‑343 Binding forms a protein having Cre recombinase activity, thereby editing the target gene located between loxP sites. In order to improve the expression efficiency and the induction multiple of Cre recombinase in the system, the system is optimized, so that the expression efficiency of the Cre recombinase activated by blue light induction is obviously improved, and the induction multiple is improved by about 40 times.
Description
Technical Field
The invention relates to the technical field of biology, in particular to a blue light induced and activated Cre recombination optimization system and application thereof.
Background
The Cre recombinase is derived from P1 bacteriophage and belongs to a tyrosine recombinase family. It can specifically recognize a DNA sequence (loxp) with the length of 34bp, so that the gene sequence between loxp sites is deleted or recombined. The Cre recombinase has high efficiency and high sequence recognition specificity, so scientists construct a plurality of Cre recombinase transgenic mouse models which have incomparable important roles in the aspects of researching gene function and organism development. According to the literature, it is reported that the Cre recombinase can affect the development of tissues such as heart and lung of mice if being expressed in vivo, so that the effective control of the expression of the recombinase is particularly important.
The mouse models for transgenosis of inducible Cre recombinase which have been reported at present comprise Cre-ERT2 and DD-Cre. Both are induced by chemical drugs, and have certain limitations, namely long induction time and large action range, and can disturb endogenous signal pathways of cells. Although spatial specificity can be achieved to some extent with specific promoters, precise manipulation of cell populations lacking tissue specificity for systemic distribution is not possible.
Disclosure of Invention
Aiming at the defects in the prior art, the invention aims to provide a blue light induced and activated Cre recombination optimization system and application thereof.
The invention optimizes and reforms the Cre recombinase system PA-Cre which is induced and activated by blue light, improves the activity of the Cre recombinase and greatly reduces the leakage of the system. The transgenic cell line or the transgenic animal obtained by transferring the gene into a cell line or an animal not only can realize more accurate space-time specific regulation and control by controlling the time and the space of illumination, but also provides a more appropriate tool for subsequent research.
In one aspect, the invention provides a blue light induction activated Cre recombination optimization system, which comprises a blue light induction activated Cre recombinase expression cassette, wherein the expression cassette comprises genes connected in the following sequence: coding genes of a photosensitive protein ligand CIB1, a Cre recombinase C end coding gene, a photosensitive protein CRY2 and a Cre recombinase N end coding gene;
the C end of the Cre recombinase comprises an amino acid sequence shown in SEQ ID No.1 (106-343 th site of the N end of the Cre recombinase); the N end of the Cre recombinase comprises an amino acid sequence shown in SEQ ID No.3 (19 th to 104 th positions of the N end of the Cre recombinase);
preferably, the coding gene sequence of the Cre recombinase C end is shown in SEQ ID No. 2; the coding gene sequence of the Cre recombinase N end is shown in SEQ ID No. 4.
In order to facilitate the expression of Cre recombinase and the study of gene functions in cells or animals, the system further comprises two spaced loxP recognized by Cre recombinase, wherein the two spaced loxP are located on a vector different from the expression cassette or integrated into the genome of the cells or animals.
In a preferred embodiment, the expression cassette further comprises a gene encoding a first linker peptide and a gene encoding a second linker peptide,
the coding gene of the first connecting peptide is positioned between the coding gene of the photosensitive protein ligand CIB1 and the coding gene of the Cre recombinase C end,
the coding gene of the second connecting peptide is positioned between the coding gene of the photosensitive protein CRY2 and the coding gene at the N end of the Cre recombinase,
preferably, said first linker peptide and said second linker peptide comprise the amino acid sequences shown in SEQ ID nos. 5, 6, 7, or 8, more preferably, SEQ ID nos. 5;
more preferably, the genes encoding the first and second connecting peptides comprise the nucleotide sequences shown in SEQ ID No. 9.
In a preferred embodiment, the expression cassette further comprises a gene encoding a nuclear localization signal NLS,
preferably, the coding gene of the nuclear localization signal NLS is positioned before the coding gene of the light sensitive protein CRY2 or before the coding gene of the light sensitive protein ligand CIB1, more preferably, before the coding gene of the light sensitive protein ligand CIB1,
preferably, the nuclear localization signal NLS comprises an amino acid sequence shown in SEQ ID No.10, and more preferably, the coding gene of the nuclear localization signal NLS comprises a nucleotide sequence shown in SEQ ID No. 11.
In a preferred embodiment, the expression cassette further comprises a promoter which is CAG or CMV, preferably CAG;
more preferably, the nucleotide sequence of CAG is shown in SEQ ID No. 12.
In a preferred embodiment, the expression cassette further comprises a gene encoding a self-cleaving protein or an IRES (internal ribosome entry site sequence),
the coding gene or IRES of the self-shearing protein is positioned between the coding gene of the Cre recombinase C end and the coding gene of the photosensitive protein CRY2,
preferably, the nucleotide sequence of the IRES is as shown in SEQ ID No.13,
the self-cleaving protein is P2A, T2A, or F2A, more preferably, T2A,
more preferably, the amino acid sequence of the T2A is shown as SEQ ID No.14, and more preferably, the nucleotide sequence of the T2A is shown as SEQ ID No. 15.
In a preferred embodiment, the expression cassette further comprises a transcriptional regulatory element,
the transcription regulation element is positioned behind the coding gene at the N end of the Cre recombinase,
the WPRE element is a Woodchuck Hepatitis Virus post-transcriptional regulatory element (WPRE), plays an important role in the transcriptional and post-transcriptional regulatory stages, and can improve the expression level of protein to a certain extent;
preferably, the WPRE element comprises the nucleotide sequence shown as SEQ ID No. 16;
and/or, the expression cassette also comprises a coding gene of the protein tag sequence,
preferably, the coding gene of the protein tag sequence is positioned between the promoter and the coding gene of the nuclear localization signal NLS,
the protein Tag comprises any one or at least two of MyC, his, GST, HA, flag, MBP, avi Tag, SUMO and c-Myc Tag; preferably, flag, more preferably 3 × Flag.
In a preferred embodiment, the light-sensitive protein ligand CIB1 comprises an amino acid sequence shown in SEQ ID No.17, and preferably, the coding gene sequence of the light-sensitive protein ligand CIB1 comprises a nucleotide sequence shown in SEQ ID No. 18.
In a preferred embodiment, the light-sensitive protein CRY2 comprises an amino acid sequence shown in SEQ ID No.19, and preferably, the coding gene sequence of the light-sensitive protein CRY2 comprises a nucleotide sequence shown in SEQ ID No. 20.
In another aspect, the present invention provides a recombinant vector comprising any one of the above expression cassettes, wherein the recombinant vector is a plasmid vector, a lentiviral vector, a retroviral vector, an adenoviral vector, an adeno-associated viral vector, a simian viral vector, a vaccinia viral vector, a sendai viral vector, an EB viral vector, or a herpes simplex viral vector;
preferably, the plasmid vector is Cre/loxP recombinase system plasmid or sleeping beauty transposon system plasmid.
In another aspect, the invention provides the use of any one of the above expression cassettes or recombinant vectors in the preparation of a blue light-induced and activated Cre recombinase transgenic cell line or transgenic animal model,
the animal is a mammal, preferably a mouse or rat, more preferably a mouse.
In the application, in the Cre recombinase transgenic cell line activated by blue light induction, the intensity of the blue light is 3-5mw/cm 2 The irradiation mode is blue light irradiation for 20-40s and 2-5min off;
in the blue light induction activated Cre recombinase transgenic animal, the intensity of the blue light is 15-25mw/cm 2 The irradiation mode is 0.5-2min on and 3-5min off.
On the other hand, the invention protects the application of the blue light induction activated Cre recombinase transgenic cell line or transgenic animal prepared by any one of the Cre recombination optimization systems or the recombinant vectors in gene function research, lineage tracing and cell clearing.
The invention has the following beneficial effects:
1. the light-controlled Cre recombinase transgenic mouse model is established for the first time.
The existing light-controlled Cre recombinase system is subjected to various researches on cells, but the cell level cannot fully simulate the animal level. The light control system is applied to a mouse for the first time, so that the application range of the light-sensitive protein is expanded, and a reference is provided for establishing an animal model by other light-sensitive proteins.
2. The space-time specificity regulation Cre recombinase expression is realized for the first time.
The existing Cre recombinase induction system can only realize the lack of space specificity of time specificity, and the gene editing of space-time specificity can be realized by controlling the time and space of illumination of the light-controlled Cre recombinase transgenic mouse model established by people, so that the defects of the traditional research method are overcome.
3. The Cre recombinase generated by the system provided by the invention mainly comprises two fusion proteins CIB1-creC which are separated from each other 106-343 And CRY2-CreN 19-104 Forming; when the two proteins are under certain blue light conditions, CRY2-CreN 19-104 The CRY2 in (A) undergoes conformational change with CIB1-CreC 106-343 In (3) to allow CreC 106-343 And CreN 19-343 Binding forms a protein having Cre recombinase activity, thereby editing the target gene located between the loxP sites. In order to improve the expression efficiency and the induction multiple of Cre recombinase in the system, the system is optimized, so that the expression efficiency of the Cre recombinase activated by blue light induction is obviously improved, and the induction multiple is improved by about 40 times.
Drawings
FIG. 1 is a schematic diagram of the structure of a plasmid containing different PA-Cre fragmentation sites (59/60 and 104/106), wherein the following symbols from left to right are used: CMV is a strong mammalian-expressed promoter of human cytomegalovirus origin; the CIB1 is a coding gene of a ligand of a light-sensitive protein CRY 2; cre (r. Cre. R. C) 60-343 And Cre 106-343 Coding gene of Cre recombinase C end; IRES is an internal ribosome entry site sequence such as SEQ ID No. 13; CRY2 L384F The coding gene of the L384F mutant of the photosensitive protein CRY 2; cre (r. Cre. R. C) 19-59 And Cre 19-104 Is the coding gene of Cre recombinase N end.
FIG. 2 is the result of detecting the effect of different segmented sites of PA-Cre on the luciferase activity of the cell line.
FIG. 3 is a schematic diagram of PA-Cre with Linker peptides (Linker) with different amino acid sequences.
FIG. 4 shows the results of the assays for the effect of different PA-Cre on luciferase activity of the cell line shown in FIG. 3.
FIG. 5 is a schematic diagram of PA-Cre structure at different NLS positions.
FIG. 6 shows the results of the assays for the effect of different PA-Cre on luciferase activity of the cell line shown in FIG. 5.
FIG. 7 is a schematic diagram of the optimized PA-Cre structure.
FIG. 8 is a graph showing the results of the assay of the effect of PA-Cre on luciferase activity of the cell line shown in FIG. 7.
FIG. 9 shows the results of detection of protein expression level of PA-Cre after optimization, transfection of HEK293 cells, lysis of cells after 48h, and taking of protein as western blot.
Fig. 10 is a quantitative analysis of the results of fig. 9, with the GADPH protein in HEK293 cells as the internal control.
FIG. 11 shows the activity ratio of optimized PA-Cre in wild-type mice, CMV-loxp-stop-loxp-luc plasmid injected into the tail vein of fluid dynamics, one group of mice are dark, the other group of mice are irradiated by blue light, and the in vivo imaging result is 24h later.
FIG. 12 is the quantitative analysis of the results of FIG. 11, the upper panel corresponding to CMV-PACre and the lower panel corresponding to CAG-PACre.
FIG. 13 shows CAG-PACre transgenic carrier injected into CAG-loxp-stop-loxp-tdtomato mice in tail vein by fluid dynamics, one group of mice is dark, the other group of mice is irradiated by blue light, after 48h, the detection result of the frozen section of the liver of the mice is taken out, the scale at the lower right corner in the figure represents 200 μm, the length of the line segment marked in the right side of the second row in the figure represents that the irradiation depth is 400 μm, wherein Merge is the result of the superposition of the images of the first two columns.
FIG. 14 is a flow chart of the construction of PA-Cre transgenic mice using the transposase system.
FIG. 15 shows the results of protein expression characterization of PA-Cre transgenic mice.
FIG. 16 shows the result of in vivo imaging before and after the seventh day of light irradiation of the first generation PA-Cre transgenic mouse injected with AAV8CMV-loxp-stop-loxp-luc virus into the tail vein, and the reference light intensity and color bar values are the same as those in FIG. 11.
FIG. 17 is a quantitative analysis of the results of FIG. 16.
FIG. 18 shows the results of protein expression characterization of the first generation PA-Cre transgenic mice.
In fig. 2, 4, 6, 8, and 17, the values above the columns are the ratio of the light to dark results for the same treatment.
Detailed Description
The experimental procedures used in the following examples are all conventional procedures unless otherwise specified.
Materials, reagents and the like used in the following examples are commercially available unless otherwise specified.
Example 1 optimization of the PA-Cre System
In the embodiment, the framework vector for expressing the Cre recombinase is a commercial vector pT2/HB, and the insertion site of the PA-Cre recombinase gene in the framework vector is between XbaI and EcoRI.
The reporter plasmid CMV-loxp-stop-loxp-LUC used in this example was obtained by ligating the coding sequences of CMV, loxp, stop, loxp, and LUC (SEQ ID No. 21) in this order between the NcoI and EcoRI sites of the commercial vector px 330.
1. Determination of Cre recombinase segmentation sites
The common segmentation sites for Cre recombinase are 59/60 and 104/106, and two plasmids with different segmentation sites, PACre59/60 and PACre104/106 (FIG. 1) are constructed.
In FIG. 1, cre in the plasmid PACre104/106 19-104 The amino acid sequence of (A) is shown as SEQ ID No.3, the coding gene sequence is shown as SEQ ID No.4, cre 106-343 The amino acid sequence of (A) is shown as SEQ ID No.1, and the coding gene sequence is shown as SEQ ID No. 2; the amino acid sequence of CIB1 is shown as SEQ ID No.17, and the coding geneThe sequence is shown as SEQ ID No. 18; CRY2 L384F The amino acid sequence of (A) is shown as SEQ ID No.19, and the coding gene sequence is shown as SEQ ID No. 20.
Co-transforming HEK293 cells with the two plasmids PACre59/60 and PACre104/106 and the reporter plasmid CMV-loxp-stop-loxp-LUC respectively, and after 24 hours, one group is dark and light-proof, and the other group is used with the intensity of 4mw/cm 2 After 48 hours, the luciferase LUC was detected, and as a result, as shown in FIG. 2, the 59/60 segmented Cre recombinase had almost no activity, so we selected 104/106 of the segmented sites.
2. Selection of different Linker peptides (Linker)
In the construction of fusion proteins, the linker peptide is important in relation to the activity of the protein. We selected 4 different amino acid sequence connecting peptides (figure 3), the amino acid sequence of which is respectively shown in SEQ ID No.5, 6, 7, 8, and the coding gene of each connecting peptide is simultaneously arranged at the upstream of two Cre recombinase segments to obtain four different plasmids L1, L2, L3, L4;
wherein, the coding gene sequence of the connecting peptide used by the plasmid L1 is shown as SEQ ID No. 9.
The four plasmids and a reporter plasmid CMV-loxp-stop-loxp-LUC are co-transformed into HEK293 cells respectively, and then the enzyme activity of the LUC is detected according to the method in the step 1, and the result is shown in figure 4, wherein the L1 effect is optimal, and at the moment, the Cre recombinase has high activity and almost no background (namely, the Cre recombinase has almost no activity under dark conditions, namely, the system has almost no leakage).
3. Selection of nuclear localization signal sequence (NLS) positions
In order to further improve the enzymatic activity of Cre recombinase, the expression efficiency of Cre recombinase is improved by adding NLS and adjusting the relative position of NLS so that NLS is better enriched in cell nucleus. To this end we designed four plasmids N 0 、N C 、N R 、N CR Respectively is as follows: NLS is not added, NLS is added only at the N-terminal of CIB1, NLS is added only at the N-terminal of CRY2, and two nuclear localization signal sequences are added (FIG. 5).
Wherein, the amino acid sequence of the nuclear localization signal NLS is shown as SEQ ID No.10, and the coding gene sequence is shown as SEQ ID No. 11.
The four plasmids and a report plasmid CMV-loxp-stop-loxp-LUC are respectively co-transformed into HEK293 cells, then the enzyme activity of the LUC is detected according to the method in the step 1, and the result is shown in figure 6, and NLS is only added at the N end of CIB1, namely the activation multiple of Cre recombinase of the plasmid Nc is the highest; is improved by about 40 times compared with the plasmid PACre104/106 before optimization.
4. Optimization of expression cassettes/vectors
Setting the promoter as CAG (broad-spectrum strong promoter), adding 3 Xflag label in front of nuclear localization signal for convenient detection of protein expression, and setting transcription regulation element WPRE at the end to obtain plasmid containing CAG-PACre, and setting CMV as the plasmid CMV-PACre of promoter as control (FIG. 7);
the equimolar CMV-PACre-containing plasmid and the equimolar CAG-PACre-containing plasmid were co-transformed into HEK293 cells together with the reporter plasmid CMV-loxp-stop-loxp-LUC, and then the LUC enzyme activity was detected according to the method of step 1, as shown in FIG. 8, the CAG-PACre induction factor was higher.
Western blot after cell lysis, it was found that the expression level of equimolar CAG-PACre was much higher than that of CMV-PACre (FIG. 9), and that the expression of the plasmid was not affected by blue light irradiation (FIG. 10).
Wherein, the nucleotide sequence of the promoter CAG is shown as SEQ ID No. 12; the amino acid sequence of the T2A is shown as SEQ ID No.14, and the coding gene sequence is shown as SEQ ID No. 15; the nucleotide sequence of the WPRE element is shown as SEQ ID No. 16.
Example 2 validation of the function of the PA-Cre transgenic vector in mice
1. Verification on WT mice
Two plasmids containing CAG-PACre or CMV-PACre obtained in step 4 of example 1 were co-injected into mice with CMV-loxp-stop-loxp-LUC reporter plasmid, respectively, using the hydrodynamic tail vein injection method. After 8h, one group was dark and protected from light, and the other group was used at an intensity of 20mw/cm 2 Blue light (1 min on, 4min off) illumination. After 16h, the in vivo imaging results (FIG. 11) were consistent with those verified on HEK293 cells, and were found to beThe equimolar CAG-PACre not only has high induction fold, but also has enzyme activity far higher than that of CMV-PACre (figure 12).
2. Validation on CAG-loxp-stop-loxp-tdtomato mice
In order to stabilize the experimental results, we selected a reporter mouse of Cre recombinase, CAG-loxp-stop-loxp-tdtomato mouse. The plasmid containing CAG-PACre was injected into the reporter mice using the hydrodynamic tail vein injection of plasmids. After 8h, one group was dark and protected from light, and the other group was used at an intensity of 20mw/cm 2 Blue light (1 min on, 4min off) illumination. After 48h, a frozen section of mouse liver is taken out, and the CAG-PACre is detected to effectively cut off the stop sequence to express tdtomato. CAG-PACre has a stronger spatial specificity compared to the control Cre, expressed only in the part where blue light can penetrate, with a penetration thickness of about 400 μm (fig. 13).
Example 3 construction of PA-Cre transgenic mice Using sleeping beauty transposon subsystem
Phenol chloroform was used to extract CAG-PACre-containing plasmid to remove RNA enzyme, and then was microinjected into fertilized eggs together with in vitro transcribed SB transposase mRNA, and healthy and viable fertilized eggs were transplanted into pseudopregnant mice (FIG. 14). After birth, the mice with positive genotype identification result are subjected to protein identification, and the result shows that 7 mice (2 #, 7#, 24#, 25#, 33#, 43#, 44 #) successfully express PA-Cre protein and can be subjected to passage (figure 15).
Example 4 functional verification of PA-Cre transgenic mice
According to the results of protein identification, 4 mice (2 #, 24#, 25#, 44 #) successfully expressing PA-Cre protein were selected from example 3 and mated with wild-type mice for generation. After 5 weeks of F1 birth, we selected mice in which PA-Cre protein expression was positive, and injected with AAV8CMV-loxp-stop-loxp-LUC adeno-associated virus, respectively. After 7 days, mice were imaged in vivo without blue light, and no expression of LUC was detected (fig. 16, first row). Then using an intensity of 20mw/cm 2 After 12h of blue light (1 min on, 4min off) exposure, and re-live imaging, some mice detected expression of LUC (fig. 16 second row, fig. 17, fig. 18), with the highest fold induction of 175, confirming iThe PA-Cre transgenic mice are successfully established and can be stably passaged.
Those not described in detail in this specification are within the skill of the art. The above description is only an example of the present application and is not intended to limit the present application. Various modifications and changes may occur to those skilled in the art. Any modification, equivalent replacement, improvement or the like made within the spirit and principle of the present application shall be included in the scope of the claims of the present application.
SEQUENCE LISTING
<110> Shanghai Bodhisae Biotech Co., ltd, university of east China
<120> blue light induction activated Cre recombination optimization system and application thereof
<130> JH-CNP190788
<160> 21
<170> PatentIn version 3.5
<210> 1
<211> 238
<212> PRT
<213> Artificial sequence
<400> 1
Arg Pro Ser Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg Ile Arg
1 5 10 15
Lys Glu Asn Val Asp Ala Gly Glu Arg Ala Lys Gln Ala Leu Ala Phe
20 25 30
Glu Arg Thr Asp Phe Asp Gln Val Arg Ser Leu Met Glu Asn Ser Asp
35 40 45
Arg Cys Gln Asp Ile Arg Asn Leu Ala Phe Leu Gly Ile Ala Tyr Asn
50 55 60
Thr Leu Leu Arg Ile Ala Glu Ile Ala Arg Ile Arg Val Lys Asp Ile
65 70 75 80
Ser Arg Thr Asp Gly Gly Arg Met Leu Ile His Ile Gly Arg Thr Lys
85 90 95
Thr Leu Val Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val
100 105 110
Thr Lys Leu Val Glu Arg Trp Ile Ser Val Ser Gly Val Ala Asp Asp
115 120 125
Pro Asn Asn Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala
130 135 140
Pro Ser Ala Thr Ser Gln Leu Ser Thr Arg Ala Leu Glu Gly Ile Phe
145 150 155 160
Glu Ala Thr His Arg Leu Ile Tyr Gly Ala Lys Asp Asp Ser Gly Gln
165 170 175
Arg Tyr Leu Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg
180 185 190
Asp Met Ala Arg Ala Gly Val Ser Ile Pro Glu Ile Met Gln Ala Gly
195 200 205
Gly Trp Thr Asn Val Asn Ile Val Met Asn Tyr Ile Arg Asn Leu Asp
210 215 220
Ser Glu Thr Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp
225 230 235
<210> 2
<211> 714
<212> DNA
<213> Artificial sequence
<400> 2
cgaccaagtg acagcaatgc tgtttcactg gttatgcggc ggatccgaaa agaaaacgtt 60
gatgccggtg aacgtgcaaa acaggctcta gcgttcgaac gcactgattt cgaccaggtt 120
cgttcactca tggaaaatag cgatcgctgc caggatatac gtaatctggc atttctgggg 180
attgcttata acaccctgtt acgtatagcc gaaattgcca ggatcagggt taaagatatc 240
tcacgtactg acggtgggag aatgttaatc catattggca gaacgaaaac gctggttagc 300
accgcaggtg tagagaaggc acttagcctg ggggtaacta aactggtcga gcgatggatt 360
tccgtctctg gtgtagctga tgatccgaat aactacctgt tttgccgggt cagaaaaaat 420
ggtgttgccg cgccatctgc caccagccag ctatcaactc gcgccctgga agggattttt 480
gaagcaactc atcgattgat ttacggcgct aaggatgact ctggtcagag atacctggcc 540
tggtctggac acagtgcccg tgtcggagcc gcgcgagata tggcccgcgc tggagtttca 600
ataccggaga tcatgcaagc tggtggctgg accaatgtaa atattgtcat gaactatatc 660
cgtaacctgg atagtgaaac aggggcaatg gtgcgcctgc tggaagatgg ggat 714
<210> 3
<211> 86
<212> PRT
<213> Artificial sequence
<400> 3
Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp Arg
1 5 10 15
Gln Ala Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val Cys Arg
20 25 30
Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro Ala
35 40 45
Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gln Ala Arg Gly
50 55 60
Leu Ala Val Lys Thr Ile Gln Gln His Leu Gly Gln Leu Asn Met Leu
65 70 75 80
His Arg Arg Ser Gly Leu
85
<210> 4
<211> 258
<212> DNA
<213> Artificial sequence
<400> 4
acgagtgatg aggttcgcaa gaacctgatg gacatgttca gggatcgcca ggcgttttct 60
gagcatacct ggaaaatgct tctgtccgtt tgccggtcgt gggcggcatg gtgcaagttg 120
aataaccgga aatggtttcc cgcagaacct gaagatgttc gcgattatct tctatatctt 180
caggcgcgcg gtctggcagt aaaaactatc cagcaacatt tgggccagct aaacatgctt 240
catcgtcggt ccgggctg 258
<210> 5
<211> 15
<212> PRT
<213> Artificial sequence
<400> 5
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Arg
1 5 10 15
<210> 6
<211> 10
<212> PRT
<213> Artificial sequence
<400> 6
Leu Glu Ala Ser Thr Gly Gly Ser Gly Thr
1 5 10
<210> 7
<211> 16
<212> PRT
<213> Artificial sequence
<400> 7
Leu Glu Ala Ser Pro Ser Asn Pro Gly Ala Ser Asn Gly Ser Gly Thr
1 5 10 15
<210> 8
<211> 15
<212> PRT
<213> Artificial sequence
<400> 8
Glu Ala Ala Ala Lys Glu Ala Ala Ala Lys Glu Ala Ala Ala Lys
1 5 10 15
<210> 9
<211> 45
<212> DNA
<213> Artificial sequence
<400> 9
ggtggcggtg gctctggagg tggtgggtcc ggaggaggcg gccgc 45
<210> 10
<211> 12
<212> PRT
<213> Artificial sequence
<400> 10
Ala Ser Pro Lys Lys Lys Arg Lys Val Glu Ala Ser
1 5 10
<210> 11
<211> 36
<212> DNA
<213> Artificial sequence
<400> 11
gccagtccca agaagaagag aaaggtggag gccagt 36
<210> 12
<211> 1716
<212> DNA
<213> Artificial sequence
<400> 12
ctagttatta atagtaatca attacggggt cattagttca tagcccatat atggagttcc 60
gcgttacata acttacggta aatggcccgc ctggctgacc gcccaacgac ccccgcccat 120
tgacgtcaat aatgacgtat gttcccatag taacgccaat agggactttc cattgacgtc 180
aatgggtgga gtatttacgg taaactgccc acttggcagt acatcaagtg tatcatatgc 240
caagtacgcc ccctattgac gtcaatgacg gtaaatggcc cgcctggcat tatgcccagt 300
acatgacctt atgggacttt cctacttggc agtacatcta cgtattagtc atcgctatta 360
ccatggtcga ggtgagcccc acgttctgct tcactctccc catctccccc ccctccccac 420
ccccaatttt gtatttattt attttttaat tattttgtgc agcgatgggg gcgggggggg 480
ggggggggcg cgcgccaggc ggggcggggc ggggcgaggg gcggggcggg gcgaggcgga 540
gaggtgcggc ggcagccaat cagagcggcg cgctccgaaa gtttcctttt atggcgaggc 600
ggcggcggcg gcggccctat aaaaagcgaa gcgcgcggcg ggcggggagt cgctgcgacg 660
ctgccttcgc cccgtgcccc gctccgccgc cgcctcgcgc cgcccgcccc ggctctgact 720
gaccgcgtta ctcccacagg tgagcgggcg ggacggccct tctcctccgg gctgtaatta 780
gcgcttggtt taatgacggc ttgtttcttt tctgtggctg cgtgaaagcc ttgaggggct 840
ccgggagggc cctttgtgcg gggggagcgg ctcggggggt gcgtgcgtgt gtgtgtgcgt 900
ggggagcgcc gcgtgcggct ccgcgctgcc cggcggctgt gagcgctgcg ggcgcggcgc 960
ggggctttgt gcgctccgca gtgtgcgcga ggggagcgcg gccgggggcg gtgccccgcg 1020
gtgcgggggg ggctgcgagg ggaacaaagg ctgcgtgcgg ggtgtgtgcg tgggggggtg 1080
agcagggggt gtgggcgcgt cggtcgggct gcaacccccc ctgcaccccc ctccccgagt 1140
tgctgagcac ggcccggctt cgggtgcggg gctccgtacg gggcgtggcg cggggctcgc 1200
cgtgccgggc ggggggtggc ggcaggtggg ggtgccgggc ggggcggggc cgcctcgggc 1260
cggggagggc tcgggggagg ggcgcggcgg cccccggagc gccggcggct gtcgaggcgc 1320
ggcgagccgc agccattgcc ttttatggta atcgtgcgag agggcgcagg gacttccttt 1380
gtcccaaatc tgtgcggagc cgaaatctgg gaggcgccgc cgcaccccct ctagcgggcg 1440
cggggcgaag cggtgcggcg ccggcaggaa ggaaatgggc ggggagggcc ttcgtgcgtc 1500
gccgcgccgc cgtccccttc tccctctcca gcctcggggc tgtccgcggg gggacggctg 1560
ccttcggggg ggacggggca gggcggggtt cggcttctgg cgtgtgaccg gcggctctag 1620
agcctctgct aaccatgttc atgccttctt ctttttccta cagctcctgg gcaacgtgct 1680
ggttattgtg ctgtctcatc attttggcaa agaatt 1716
<210> 13
<211> 587
<212> DNA
<213> Artificial sequence
<400> 13
cccctctccc tccccccccc ctaacgttac tggccgaagc cgcttggaat aaggccggtg 60
tgcgtttgtc tatatgttat tttccaccat attgccgtct tttggcaatg tgagggcccg 120
gaaacctggc cctgtcttct tgacgagcat tcctaggggt ctttcccctc tcgccaaagg 180
aatgcaaggt ctgttgaatg tcgtgaagga agcagttcct ctggaagctt cttgaagaca 240
aacaacgtct gtagcgaccc tttgcaggca gcggaacccc ccacctggcg acaggtgcct 300
ctgcggccaa aagccacgtg tataagatac acctgcaaag gcggcacaac cccagtgcca 360
cgttgtgagt tggatagttg tggaaagagt caaatggctc tcctcaagcg tattcaacaa 420
ggggctgaag gatgcccaga aggtacccca ttgtatggga tctgatctgg ggcctcggta 480
cacatgcttt acatgtgttt agtcgaggtt aaaaaaacgt ctaggccccc cgaaccacgg 540
ggacgtggtt ttcctttgaa aaacacgatg ataatatggc cacaacc 587
<210> 14
<211> 21
<212> PRT
<213> Artificial sequence
<400> 14
Gly Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu
1 5 10 15
Glu Asn Pro Gly Pro
20
<210> 15
<211> 63
<212> DNA
<213> Artificial sequence
<400> 15
ggcagtggag agggcagagg aagtctgcta acatgcggtg acgtcgagga gaatcctggc 60
cca 63
<210> 16
<211> 589
<212> DNA
<213> Artificial sequence
<400> 16
aatcaacctc tggattacaa aatttgtgaa agattgactg gtattcttaa ctatgttgct 60
ccttttacgc tatgtggata cgctgcttta atgcctttgt atcatgctat tgcttcccgt 120
atggctttca ttttctcctc cttgtataaa tcctggttgc tgtctcttta tgaggagttg 180
tggcccgttg tcaggcaacg tggcgtggtg tgcactgtgt ttgctgacgc aacccccact 240
ggttggggca ttgccaccac ctgtcagctc ctttccggga ctttcgcttt ccccctccct 300
attgccacgg cggaactcat cgccgcctgc cttgcccgct gctggacagg ggctcggctg 360
ttgggcactg acaattccgt ggtgttgtcg gggaaatcat cgtcctttcc ttggctgctc 420
gcctgtgttg ccacctggat tctgcgcggg acgtccttct gctacgtccc ttcggccctc 480
aatccagcgg accttccttc ccgcggcctg ctgccggctc tgcggcctct tccgcgtctt 540
cgccttcgcc ctcagacgag tcggatctcc ctttgggccg cctccccgc 589
<210> 17
<211> 334
<212> PRT
<213> Artificial sequence
<400> 17
Asn Gly Ala Ile Gly Gly Asp Leu Leu Leu Asn Phe Pro Asp Met Ser
1 5 10 15
Val Leu Glu Arg Gln Arg Ala His Leu Lys Tyr Leu Asn Pro Thr Phe
20 25 30
Asp Ser Pro Leu Ala Gly Phe Phe Ala Asp Ser Ser Met Ile Thr Gly
35 40 45
Gly Glu Met Asp Ser Tyr Leu Ser Thr Ala Gly Leu Asn Leu Pro Met
50 55 60
Met Tyr Gly Glu Thr Thr Val Glu Gly Asp Ser Arg Leu Ser Ile Ser
65 70 75 80
Pro Glu Thr Thr Leu Gly Thr Gly Asn Phe Lys Lys Arg Lys Phe Asp
85 90 95
Thr Glu Thr Lys Asp Cys Asn Glu Lys Lys Lys Lys Met Thr Met Asn
100 105 110
Arg Asp Asp Leu Val Glu Glu Gly Glu Glu Glu Lys Ser Lys Ile Thr
115 120 125
Glu Gln Asn Asn Gly Ser Thr Lys Ser Ile Lys Lys Met Lys His Lys
130 135 140
Ala Lys Lys Glu Glu Asn Asn Phe Ser Asn Asp Ser Ser Lys Val Thr
145 150 155 160
Lys Glu Leu Glu Lys Thr Asp Tyr Ile His Val Arg Ala Arg Arg Gly
165 170 175
Gln Ala Thr Asp Ser His Ser Ile Ala Glu Arg Val Arg Arg Glu Lys
180 185 190
Ile Ser Glu Arg Met Lys Phe Leu Gln Asp Leu Val Pro Gly Cys Asp
195 200 205
Lys Ile Thr Gly Lys Ala Gly Met Leu Asp Glu Ile Ile Asn Tyr Val
210 215 220
Gln Ser Leu Gln Arg Gln Ile Glu Phe Leu Ser Met Lys Leu Ala Ile
225 230 235 240
Val Asn Pro Arg Pro Asp Phe Asp Met Asp Asp Ile Phe Ala Lys Glu
245 250 255
Val Ala Ser Thr Pro Met Thr Val Val Pro Ser Pro Glu Met Val Leu
260 265 270
Ser Gly Tyr Ser His Glu Met Val His Ser Gly Tyr Ser Ser Glu Met
275 280 285
Val Asn Ser Gly Tyr Leu His Val Asn Pro Met Gln Gln Val Asn Thr
290 295 300
Ser Ser Asp Pro Leu Ser Cys Phe Asn Asn Gly Glu Ala Pro Ser Met
305 310 315 320
Trp Asp Ser His Val Gln Asn Leu Tyr Gly Asn Leu Gly Val
325 330
<210> 18
<211> 1002
<212> DNA
<213> Artificial sequence
<400> 18
aatggagcca tcggcggcga cctgctgctg aacttccccg atatgagcgt gctggagaga 60
cagagggccc acctgaagta cctgaacccc acctttgaca gccctctggc tggcttcttc 120
gccgactcca gcatgatcac cggcggagag atggacagct atctgagcac cgccggcctg 180
aacctgccca tgatgtacgg cgagacaaca gtggagggcg actccagact gtccatcagc 240
cccgaaacaa ccctgggcac cggcaacttc aagaagagga agttcgacac cgagaccaaa 300
gactgcaacg aaaagaagaa aaagatgacc atgaacagag atgacctggt ggaggagggc 360
gaggaggaga agagcaagat caccgagcag aacaacggca gcaccaagtc catcaagaag 420
atgaagcata aggccaagaa ggaagagaac aatttcagca acgacagctc caaggtgacc 480
aaggagctcg agaagaccga ctacatccac gtgagggcca gaaggggcca ggccacagac 540
agccactcca ttgccgagag ggtcagaagg gagaagatct ccgagaggat gaagttcctg 600
caagacctgg tgcccggctg tgacaaaatc accggcaagg ccggcatgct ggacgagatc 660
atcaactacg tccagagcct ccagaggcag atcgagttcc tctccatgaa actggccatc 720
gtgaatccca ggcccgactt cgacatggac gacatcttcg ccaaggaggt cgcctccacc 780
cccatgacag tcgtgcccag ccccgagatg gtgctgtccg gatacagcca cgagatggtg 840
cacagcggct acagctccga gatggtgaac tccggctacc tgcacgtgaa tcccatgcag 900
caggtgaaca catcctccga tcccctgagc tgcttcaaca acggagaggc ccccagcatg 960
tgggactccc acgtgcagaa cctgtacgga aatctgggcg tg 1002
<210> 19
<211> 611
<212> PRT
<213> Artificial sequence
<400> 19
Lys Met Asp Lys Lys Thr Ile Val Trp Phe Arg Arg Asp Leu Arg Ile
1 5 10 15
Glu Asp Asn Pro Ala Leu Ala Ala Ala Ala His Glu Gly Ser Val Phe
20 25 30
Pro Val Phe Ile Trp Cys Pro Glu Glu Glu Gly Gln Phe Tyr Pro Gly
35 40 45
Arg Ala Ser Arg Trp Trp Met Lys Gln Ser Leu Ala His Leu Ser Gln
50 55 60
Ser Leu Lys Ala Leu Gly Ser Asp Leu Thr Leu Ile Lys Thr His Asn
65 70 75 80
Thr Ile Ser Ala Ile Leu Asp Cys Ile Arg Val Thr Gly Ala Thr Lys
85 90 95
Val Val Phe Asn His Leu Tyr Asp Pro Val Ser Leu Val Arg Asp His
100 105 110
Thr Val Lys Glu Lys Leu Val Glu Arg Gly Ile Ser Val Gln Ser Tyr
115 120 125
Asn Gly Asp Leu Leu Tyr Glu Pro Trp Glu Ile Tyr Cys Glu Lys Gly
130 135 140
Lys Pro Phe Thr Ser Phe Asn Ser Tyr Trp Lys Lys Cys Leu Asp Met
145 150 155 160
Ser Ile Glu Ser Val Met Leu Pro Pro Pro Trp Arg Leu Met Pro Ile
165 170 175
Thr Ala Ala Ala Glu Ala Ile Trp Ala Cys Ser Ile Glu Glu Leu Gly
180 185 190
Leu Glu Asn Glu Ala Glu Lys Pro Ser Asn Ala Leu Leu Thr Arg Ala
195 200 205
Trp Ser Pro Gly Trp Ser Asn Ala Asp Lys Leu Leu Asn Glu Phe Ile
210 215 220
Glu Lys Gln Leu Ile Asp Tyr Ala Lys Asn Ser Lys Lys Val Val Gly
225 230 235 240
Asn Ser Thr Ser Leu Leu Ser Pro Tyr Leu His Phe Gly Glu Ile Ser
245 250 255
Val Arg His Val Phe Gln Cys Ala Arg Met Lys Gln Ile Ile Trp Ala
260 265 270
Arg Asp Lys Asn Ser Glu Gly Glu Glu Ser Ala Asp Leu Phe Leu Arg
275 280 285
Gly Ile Gly Leu Arg Glu Tyr Ser Arg Tyr Ile Cys Phe Asn Phe Pro
290 295 300
Phe Thr His Glu Gln Ser Leu Leu Ser His Leu Arg Phe Phe Pro Trp
305 310 315 320
Asp Ala Asp Val Asp Lys Phe Lys Ala Trp Arg Gln Gly Arg Thr Gly
325 330 335
Tyr Pro Leu Val Asp Ala Gly Met Arg Glu Phe Trp Ala Thr Gly Trp
340 345 350
Met His Asn Arg Ile Arg Val Ile Val Ser Ser Phe Ala Val Lys Phe
355 360 365
Leu Leu Leu Pro Trp Lys Trp Gly Met Lys Tyr Phe Trp Asp Thr Leu
370 375 380
Leu Asp Ala Asp Leu Glu Cys Asp Ile Leu Gly Trp Gln Tyr Ile Ser
385 390 395 400
Gly Ser Ile Pro Asp Gly His Glu Leu Asp Arg Leu Asp Asn Pro Ala
405 410 415
Leu Gln Gly Ala Lys Tyr Asp Pro Glu Gly Glu Tyr Ile Arg Gln Trp
420 425 430
Leu Pro Glu Leu Ala Arg Leu Pro Thr Glu Trp Ile His His Pro Trp
435 440 445
Asp Ala Pro Leu Thr Val Leu Lys Ala Ser Gly Val Glu Leu Gly Thr
450 455 460
Asn Tyr Ala Lys Pro Ile Val Asp Ile Asp Thr Ala Arg Glu Leu Leu
465 470 475 480
Ala Lys Ala Ile Ser Arg Thr Arg Glu Ala Gln Ile Met Ile Gly Ala
485 490 495
Ala Pro Asp Glu Ile Val Ala Asp Ser Phe Glu Ala Leu Gly Ala Asn
500 505 510
Thr Ile Lys Glu Pro Gly Leu Cys Pro Ser Val Ser Ser Asn Asp Gln
515 520 525
Gln Val Pro Ser Ala Val Arg Tyr Asn Gly Ser Lys Arg Val Lys Pro
530 535 540
Glu Glu Glu Glu Glu Arg Asp Met Lys Lys Ser Arg Gly Phe Asp Glu
545 550 555 560
Arg Glu Leu Phe Ser Thr Ala Glu Ser Ser Ser Ser Ser Ser Val Phe
565 570 575
Phe Val Ser Gln Ser Cys Ser Leu Ala Ser Glu Gly Lys Asn Leu Glu
580 585 590
Gly Ile Gln Asp Ser Ser Asp Gln Ile Thr Thr Ser Leu Gly Lys Asn
595 600 605
Gly Cys Lys
610
<210> 20
<211> 1833
<212> DNA
<213> Artificial sequence
<400> 20
aagatggaca agaaaaccat cgtctggttc aggagggacc tgaggatcga ggataacccc 60
gctctggctg ctgctgctca cgagggttct gtcttccctg tgtttatttg gtgccctgag 120
gaggagggac agttctatcc tggcagggcc agcaggtggt ggatgaagca gtccctggct 180
cacctgtccc agagcctgaa ggctctgggc agcgatctca ccctcatcaa aacccacaac 240
accatctccg ccatcctcga ctgcatcaga gtcaccggcg ccaccaaggt ggtgttcaac 300
catctctacg accctgtgtc cctggtcaga gaccacacag tcaaggagaa gctcgtcgaa 360
agaggaatct ccgtgcagtc ctacaacggc gacctgctgt acgagccctg ggagatttac 420
tgcgagaagg gcaagccctt cacatccttc aacagctact ggaagaagtg tctggacatg 480
tccatcgaga gcgtcatgct gccccctcct tggagactga tgcccattac cgctgccgct 540
gaggctatct gggcttgttc catcgaagaa ctcggcctgg agaacgaggc cgaaaagccc 600
agcaacgccc tgctcaccag agcttggtcc cccggctgga gcaatgccga caagctgctc 660
aacgagttca tcgagaagca gctgatcgac tatgccaaga acagcaagaa agtggtgggc 720
aatagcacca gcctgctgag cccctacctg catttcggag agatttccgt gaggcacgtc 780
ttccagtgcg ccaggatgaa gcaaatcatc tgggccagag acaagaacag cgaaggagag 840
gagtccgccg atctctttct caggggaatc ggcctcagag agtatagcag gtacatttgc 900
ttcaacttcc cctttaccca tgagcagagc ctcctgagcc acctcagatt ctttccttgg 960
gacgccgatg tggacaaatt caaagcctgg aggcagggaa ggaccggata ccctctggtg 1020
gacgccggca tgagagagtt ttgggctacc ggctggatgc acaacaggat tagggtcatc 1080
gtgagcagct ttgccgtcaa attcctcctg ctgccctgga agtggggcat gaagtatttc 1140
tgggacacac tgctggatgc cgatctcgag tgcgacatcc tgggctggca gtatatcagc 1200
ggctccatcc ctgatggcca cgagctcgac agactggaca accctgccct gcagggcgct 1260
aagtacgacc ccgaaggcga gtacatcaga caatggctgc ctgaactggc cagactccct 1320
acagagtgga ttcatcaccc ctgggacgcc cctctgaccg tcctgaaggc cagcggagtg 1380
gagctgggca ccaactacgc taaacccatc gtcgacatcg acacagccag ggagctcctc 1440
gccaaggcca tctccagaac cagggaagct cagatcatga tcggcgccgc tcccgatgag 1500
atcgtggccg attccttcga agccctggga gctaacacca tcaaggagcc cggactgtgc 1560
ccctccgtga gcagcaacga ccagcaagtg ccctccgccg tgaggtataa cggctccaag 1620
agagtgaaac ccgaagagga ggaagagaga gacatgaaga agagcagggg cttcgacgaa 1680
agggagctgt tttccaccgc tgaatccagc agctcctcct ccgtcttctt cgtgagccag 1740
tcctgtagcc tggccagcga gggcaagaac ctggaaggaa tccaggacag ctccgaccag 1800
attaccacca gcctcggaaa gaacggctgc aag 1833
<210> 21
<211> 1650
<212> DNA
<213> Artificial sequence
<400> 21
gaagacgcca aaaacataaa gaaaggcccg gcgccattct atccgctgga agatggaacc 60
gctggagagc aactgcataa ggctatgaag agatacgccc tggttcctgg aacaattgct 120
tttacagatg cacatatcga ggtggacatc acttacgctg agtacttcga aatgtccgtt 180
cggttggcag aagctatgaa acgatatggg ctgaatacaa atcacagaat cgtcgtatgc 240
agtgaaaact ctcttcaatt ctttatgccg gtgttgggcg cgttatttat cggagttgca 300
gttgcgcccg cgaacgacat ttataatgaa cgtgaattgc tcaacagtat gggcatttcg 360
cagcctaccg tggtgttcgt ttccaaaaag gggttgcaaa aaattttgaa cgtgcaaaaa 420
aagctcccaa tcatccaaaa aattattatc atggattcta aaacggatta ccagggattt 480
cagtcgatgt acacgttcgt cacatctcat ctacctcccg gttttaatga atacgatttt 540
gtgccagagt ccttcgatag ggacaagaca attgcactga tcatgaactc ctctggatct 600
actggtctgc ctaaaggtgt cgctctgcct catagaactg cctgcgtgag attctcgcat 660
gccagagatc ctatttttgg caatcaaatc attccggata ctgcgatttt aagtgttgtt 720
ccattccatc acggttttgg aatgtttact acactcggat atttgatatg tggatttcga 780
gtcgtcttaa tgtatagatt tgaagaagag ctgtttctga ggagccttca ggattacaag 840
attcaaagtg cgctgctggt gccaacccta ttctccttct tcgccaaaag cactctgatt 900
gacaaatacg atttatctaa tttacacgaa attgcttctg gtggcgctcc cctctctaag 960
gaagtcgggg aagcggttgc caagaggttc catctgccag gtatcaggca aggatatggg 1020
ctcactgaga ctacatcagc tattctgatt acacccgagg gggatgataa accgggcgcg 1080
gtcggtaaag ttgttccatt ttttgaagcg aaggttgtgg atctggatac cgggaaaacg 1140
ctgggcgtta atcaaagagg cgaactgtgt gtgagaggtc ctatgattat gtccggttat 1200
gtaaacaatc cggaagcgac caacgccttg attgacaagg atggatggct acattctgga 1260
gacatagctt actgggacga agacgaacac ttcttcatcg ttgaccgcct gaagtctctg 1320
attaagtaca aaggctatca ggtggctccc gctgaattgg aatccatctt gctccaacac 1380
cccaacatct tcgacgcagg tgtcgcaggt cttcccgacg atgacgccgg tgaacttccc 1440
gccgccgttg ttgttttgga gcacggaaag acgatgacgg aaaaagagat cgtggattac 1500
gtcgccagtc aagtaacaac cgcgaaaaag ttgcgcggag gagttgtgtt tgtggacgaa 1560
gtaccgaaag gtcttaccgg aaaactcgac gcaagaaaaa tcagagagat cctcataaag 1620
gccaagaagg gcggaaagat cgccgtgtaa 1650
Claims (23)
1. A blue light induction activated Cre recombination optimization system is characterized in that: the system comprises a blue light-induced activated Cre recombinase expression cassette comprising genes linked in the following order: coding genes of a photosensitive protein ligand CIB1, a Cre recombinase C end coding gene, a photosensitive protein CRY2 and a Cre recombinase N end coding gene;
the amino acid sequence of the C end of the Cre recombinase is shown as SEQ ID No.1, the amino acid sequence of the N end of the Cre recombinase is shown as SEQ ID No.3, the coding gene sequence of the C end of the Cre recombinase is shown as SEQ ID No.2, and the coding gene sequence of the N end of the Cre recombinase is shown as SEQ ID No. 4;
the expression cassette also comprises a coding gene of a nuclear localization signal NLS, the coding gene of the nuclear localization signal NLS is positioned in front of the coding gene of the photosensitive protein ligand CIB1, and the amino acid sequence of the nuclear localization signal NLS is shown as SEQ ID No. 10;
the expression cassette also comprises a coding gene of a first connecting peptide and a coding gene of a second connecting peptide, wherein the coding gene of the first connecting peptide is positioned between the coding gene of the photosensitive protein ligand CIB1 and the coding gene of the Cre recombinase C end, and the coding gene of the second connecting peptide is positioned between the coding gene of the photosensitive protein CRY2 and the coding gene of the Cre recombinase N end;
a promoter is also included in the expression cassette; the expression cassette also comprises a coding gene or IRES of the self-shearing protein, and the coding gene or IRES of the self-shearing protein is positioned between the coding gene at the C end of the Cre recombinase and the coding gene of the light-sensitive protein CRY 2; the expression cassette also comprises a transcription regulation element, the transcription regulation element is positioned behind the coding gene at the N end of the Cre recombinase, and the transcription regulation element is a WPRE element.
2. A Cre reorganisation optimisation system as claimed in claim 1, wherein: the nucleotide sequence of the coding gene of the nuclear localization signal NLS is shown as SEQ ID No. 11.
3. A Cre reorganization optimization system according to claim 1, wherein: the amino acid sequences of the first connecting peptide and the second connecting peptide are shown in SEQ ID No.5, 6, 7 or 8.
4. A Cre reassembly optimization system according to claim 3, wherein: the amino acid sequences of the first connecting peptide and the second connecting peptide are shown in SEQ ID No. 5.
5. A Cre reassembly optimization system according to claim 4, wherein: the nucleotide sequences of the coding genes of the first connecting peptide and the second connecting peptide are shown in SEQ ID No. 9.
6. A Cre reorganisation optimisation system as claimed in claim 1, wherein: the promoter is CAG or CMV.
7. A Cre recombination optimization system as claimed in claim 6, wherein: the promoter is CAG.
8. A Cre reorganisation optimisation system as claimed in claim 7, wherein: the nucleotide sequence of the promoter CAG is shown as SEQ ID No. 12.
9. A Cre reorganisation optimisation system as claimed in claim 1, wherein: the self-cleavage protein is T2A.
10. A Cre reorganisation optimisation system as claimed in claim 1, wherein: the nucleotide sequence of the IRES is shown as SEQID No.13, and the self-cutting protein is T2A.
11. A Cre reorganisation optimisation system as claimed in claim 10, wherein: the amino acid sequence of the T2A is shown as SEQID No. 14.
12. A Cre reorganisation optimisation system as claimed in claim 11, wherein: the nucleotide sequence of the coding gene of the T2A is shown as SEQ ID No. 15.
13. A Cre reorganisation optimisation system as claimed in claim 1, wherein: the expression cassette also comprises a coding gene of a protein Tag sequence, the coding gene of the protein Tag sequence is positioned between the promoter and the coding gene of the nuclear localization signal NLS, and the protein Tag comprises any one or at least two of MyC, his, GST, HA, flag, MBP, avi Tag, SUMO and c-Myc Tag;
and/or the photosensitive protein ligand CIB1 is an amino acid sequence shown in SEQ ID No. 17;
and/or the light sensitive protein CRY2 is an amino acid sequence shown in SEQ ID No. 19.
14. A Cre reorganisation optimisation system as claimed in claim 1, wherein: the nucleotide sequence of the WPRE element is shown as SEQID No. 16;
and/or the nucleotide sequence of the coding gene sequence of the photosensitive protein ligand CIB1 is shown as SEQ ID No. 18;
and/or the nucleotide sequence of the coding gene sequence of the photosensitive protein CRY2 is shown as SEQ ID No. 20.
15. A Cre reorganisation optimisation system as claimed in claim 13, wherein: the protein tag is Flag.
16. A Cre reorganisation optimisation system as claimed in claim 15 wherein: the protein tag is 3 XFlag.
17. A recombinant vector comprising the Cre recombinant optimization system according to any one of claims 1 to 16, wherein the recombinant vector is a plasmid vector, a lentiviral vector, a retroviral vector, an adenoviral vector, an adeno-associated viral vector, a simian viral vector, a vaccinia viral vector, a sendai viral vector, an EB viral vector, or a herpes simplex viral vector.
18. The recombinant vector according to claim 17, wherein: the plasmid vector is Cre/loxP recombinase system plasmid or sleeping beauty transposon system plasmid.
19. Use of the Cre recombination optimisation system of any one of claims 1 to 16 or the recombinant vector of any one of claims 17 to 18 for the preparation of a transgenic cell line or a transgenic animal model of a blue light induced activated Cre recombinase, said animal being a mammal.
20. The use as claimed in claim 19, wherein: the mammal is a mouse or rat.
21. The use as claimed in claim 20, wherein: the mammal is a mouse.
22. Use according to any one of claims 19 to 21, wherein: in the blue light induction activated Cre recombinase transgenic cell line, the intensity of the blue light is 3-5mw/cm 2 The irradiation mode is that blue light is irradiated for 20-40s and is switched off for 2-5 min;
in the blue light induction activated Cre recombinase transgenic animal, the intensity of the blue light is 15-25mw/cm 2 The irradiation mode is 0.5-2min on and 3-5min off.
23. Use of a transgenic cell line or a transgenic animal of the Cre recombinase activated by blue light induction prepared by the Cre recombination optimization system of any one of claims 1-16 or the recombinant vector of any one of claims 17-18 in gene function research, lineage tracing and cell depletion.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910746260.6A CN110747227B (en) | 2019-08-13 | 2019-08-13 | Blue light induced and activated Cre recombination optimization system and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910746260.6A CN110747227B (en) | 2019-08-13 | 2019-08-13 | Blue light induced and activated Cre recombination optimization system and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110747227A CN110747227A (en) | 2020-02-04 |
CN110747227B true CN110747227B (en) | 2023-03-31 |
Family
ID=69275843
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910746260.6A Active CN110747227B (en) | 2019-08-13 | 2019-08-13 | Blue light induced and activated Cre recombination optimization system and application thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110747227B (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108070035A (en) * | 2017-10-12 | 2018-05-25 | 中国科学院上海生命科学研究院 | Inducibility Genetic Recombination enzyme system CrexER |
CN109517048A (en) * | 2018-12-04 | 2019-03-26 | 厦门大学 | A kind of light genetic tool that blue light down regulation chromatin interacts over long distances |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102643852B (en) * | 2011-02-28 | 2015-04-08 | 华东理工大学 | Optical controllable gene expression system |
-
2019
- 2019-08-13 CN CN201910746260.6A patent/CN110747227B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108070035A (en) * | 2017-10-12 | 2018-05-25 | 中国科学院上海生命科学研究院 | Inducibility Genetic Recombination enzyme system CrexER |
CN109517048A (en) * | 2018-12-04 | 2019-03-26 | 厦门大学 | A kind of light genetic tool that blue light down regulation chromatin interacts over long distances |
Non-Patent Citations (4)
Title |
---|
Achieving tight control of a photoactivatable Cre recombinase gene switch: new design strategies and functional characterization in mammalian cells and rodent;Meador K 等;《Nucleic Acids Res》;20190709;第47卷(第17期);e97页 * |
Optimized second-generation CRY2-CIB dimerizers and photoactivatable Cre recombinase;Taslimi Amir等;《Nat Chem Biol》;20160411;第12卷(第6期);摘要,第4页左栏第1-2段,图4d,附件表1,Online methods * |
Rapid blue-light-mediated induction of protein interactions in living cells;Matthew J Kennedy等;《Nat Methods》;20101031;第7卷(第12期);973–975页 * |
光控诱导重组系统的开发与应用;谢甜 等;《遗传》;20220713;第44卷(第8期);655-671页 * |
Also Published As
Publication number | Publication date |
---|---|
CN110747227A (en) | 2020-02-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110891420B (en) | CAS transgenic mouse embryonic stem cell, mouse and application thereof | |
DK2150617T3 (en) | Regions with increased expression and stability | |
JP2009017889A (en) | Expression of heterologous gene according to targeted expression profile | |
CN104202970A (en) | Transgenic animals with customizable traits | |
CN111182790A (en) | CRISPR reporter non-human animals and uses thereof | |
US20030106074A1 (en) | Collections of transgenic animal lines (living library) | |
US7741113B2 (en) | Cell-specific molecule and method for importing DNA into osteoblast nuclei | |
US20230323369A1 (en) | Screening model and method for hbv cccdna-targeting drug | |
EP2621268B1 (en) | Model for muscle atrophy | |
JP6871544B2 (en) | How to Produce a DNA Vector from a Molecular Component Containing a Sequence of Interest | |
CN107760707A (en) | A kind of foundation for the self-activation Gal4/UAS system expression boxes for strengthening gene expression | |
KR20060123413A (en) | EXPRESSION SYSTEM WITH THE USE OF MAMMALIAN beta;-ACTIN PROMOTER | |
CN112266931B (en) | Construction method and application of transgenic rat specifically expressing hMRGPRX4 | |
CN110747227B (en) | Blue light induced and activated Cre recombination optimization system and application thereof | |
CN114072510A (en) | Recombinant transfer vectors for protein expression in insect and mammalian cells | |
KR20100102235A (en) | A method to reduce directional bias in transcription | |
WO2022232348A1 (en) | Angiotensin-converting enzyme ii (ace2) transgenic animal and uses thereof | |
CN110484563B (en) | Mammal cell combined expression vector, expression system, preparation method and application | |
CN112921052B (en) | In vivo cell proliferation marker and tracer system and application thereof | |
CN114134183A (en) | Construction method and application of SIGLEC15 gene humanized animal model | |
CN113444722A (en) | Application of single base editing mediated splicing repair in preparation of drugs for treating spinal muscular atrophy | |
US20240093206A1 (en) | System of stable gene expression in cell lines and methods of making and using the same | |
US20230287460A1 (en) | Mammalian cells comprising integrated cas9 genes to produce stable integration sites, and mammalian cells comprising stable integration sites and other sites | |
KR102156089B1 (en) | Inflammation-forming transgenic animal model through selective over-expression of lipocalin-2 and a method for producing the same | |
RU2782356C2 (en) | Methods and compositions for assessment of mediated crispr/cas destruction or cutout and induced crispr/cas recombination with exogeneous donor nucleic acid in vivo |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |